Apache Nutch
Apache Nutch is a highly extensible and scalable open source web crawler software project.
- Open Source
Apache Nutch Alternatives
The best Apache Nutch alternatives based on verified products, community votes, reviews and other factors.
Latest update:
-
Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
-
StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
-
Clear. Fast. Unlimited. Residential & Mobile Proxies For Best Price .
-
Common Crawl
-
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...
-
ACHE is a web crawler for domain-specific search.
-
Turn the web into a database!
-
Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...
-
Custom Web Scraping & Powerful Web Crawling.
-
Octoparse provides easy web scraping for anyone. Our advanced web crawler, allows users to turn web pages into structured spreadsheets within clicks.
-
Apify is a web scraping and automation platform that can turn any website into an API.
-
Content Grabber is an automated web scraping tool.
-
HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.
-
Elasticsearch is an open source, distributed, RESTful search engine.