Software Alternatives, Accelerators & Startups

Top 12 Open-Source Alternatives to Diffbot

Diffbot
ScrapeHero Webhose.io Zyte Scrapy Webtap.ai CoffeeScript AirCode Dataflow Kit Ocean Protocol StormCrawler

Summary

The top open-source alternatives to Diffbot are ScrapeHero, Webhose.io, and Zyte. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. A web scraping service to collect data from websites, without any programming or DIY tools.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial
    • $5.0 / Monthly

    #API #Web Scraping #Data Dashboard 1 social mentions

  2. Webhose.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 1 social mentions

  3. 3
    We're Zyte (formerly Scrapinghub), the central point of entry for all your web data needs.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial

    #Web Scraping #Data Extraction #Web Crawling 1 social mentions

  4. 4
    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 97 social mentions

  5. Extract data from any website using natural language queries—no coding needed.
    Pricing:
    • Open Source
    • $19.99 / Monthly (Pro Plan. Access to our AI-powered web scraper.)

    #Web Scraping #Data Extraction #AI

  6. Unfancy JavaScript
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Data Analysis 25 social mentions

  7. Serverless Node.js stack for API development
    Pricing:
    • Open Source

    #Productivity #Web Scraping #Data Extraction

  8. A cloud-based web scraping platform. Extract data from websites and automate workflows on the web.
    Pricing:
    • Open Source
    • Paid
    • Free Trial
    • $5.0 / Usage

    #Web Scraping #Website Screenshots #Data Extraction

  9. The open-source & privacy-preserving data sharing protocol
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Developer Tools 1 social mentions

  10. StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling

  11. Get News Data with API
    Pricing:
    • Open Source

    #API Tools #Data Extraction #Web Crawling 16 social mentions

  12. Apache Nutch is a highly extensible and scalable open source web crawler software project.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 2 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to Diffbot.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

Diffbot discussion

Log in or Post with
OSZAR »