Apify is a JavaScript & Node.js based data extraction tool for websites that crawls lists of URLs and automates workflows on the web. With Apify you can manage and automatically scale a pool of headless Chrome / Puppeteer instances, maintain queues of URLs to crawl, store crawling results locally or in the cloud, rotate proxies and much more.
A startup from Prague, Czech Republic.
Ease of Use
Apify provides a user-friendly interface that makes it easy for users of all technical levels to create and manage web scraping tasks.
Scalability
Apify is built to handle tasks of various sizes, from small-scale projects to enterprise-level operations, making it a scalable solution.
Integration and API Support
It offers extensive API support, allowing for seamless integration with other tools and systems to enhance automated workflows.
Customizability
Users can customize their scraping bots (actors) with different settings and scripts to fit specific needs and requirements.
Cloud-based
Being a cloud-based platform, Apify allows users to run their scraping tasks without needing local resources, which is convenient and efficient.
Comprehensive Documentation
Apify provides thorough documentation and tutorials, which help users get started quickly and solve issues efficiently.
Community and Support
Apify has an active community and solid customer support to assist users with their needs and enhance their overall experience.
Promote Apify. You can add any of these badges on your website.
For deployment, we'll use the Apify platform. It's a simple and effective environment for cloud deployment, allowing efficient interaction with your crawler. Call it via API, schedule tasks, integrate with various services, and much more. - Source: dev.to / 1 day ago
We already have a fully functional implementation for local execution. Let us explore how to adapt it for running on the Apify Platform and transform in Apify Actor. - Source: dev.to / about 1 month ago
We've had the best success by first converting the HTML to a simpler format (i.e. markdown) before passing it to the LLM. There are a few ways to do this that we've tried, namely Extractus[0] and dom-to-semantic-markdown[1]. Internally we use Apify[2] and Firecrawl[3] for Magic Loops[4] that run in the cloud, both of which have options for simplifying pages built-in, but for our Chrome Extension we use... - Source: Hacker News / 8 months ago
Developed by Apify, it is a Python adaptation of their famous JS framework crawlee, first released on Jul 9, 2019. - Source: dev.to / 8 months ago
Hey all, This is Jan, the founder of [Apify](https://apify.com/)—a full-stack web scraping platform. After the success of [Crawlee for JavaScript](https://github.com/apify/crawlee/) today! The main features are: - A unified programming interface for both HTTP (HTTPX with BeautifulSoup) & headless browser crawling (Playwright). - Source: Hacker News / 10 months ago
In this article, I will walk you through everything, from crafting your initial scraping script (Actor) using the Apify SDK for TypeScript to deploying it to the Apify Actors Store for seamless data collection, and then, I will show you how to run your deployed Actor on the Apify platform. With Apify, you don't need to be a programming pro to harness the power of web scraping and start gaining insights. - Source: dev.to / about 1 year ago
I am surprised nobody mentioned https://apify.com/ and they even offer discount for YC startups as ex-graduate from the YC Combinator program. - Source: Hacker News / about 1 year ago
Web Scraping, Data Extraction and Automation · Apify ( https://apify.com/ ). Source: almost 2 years ago
At this point of the tutorial, I'll take the opportunity to do a bit of self-promotion. I'm the COO of Apify, a cloud platform that helps you develop, run, and maintain your web scrapers easily and efficiently. It comes with tons of features like queue storages and proxies, and it supports Puppeteer without any extra configuration. You can run the above scraper, save results and control everything with a powerful... - Source: dev.to / about 2 years ago
Apify a saas that can be helpful in this situation since you can use its api to call actors from your java code. Source: over 2 years ago
At FINN we use Checkly once again to run our scheduled E2E tests. Some teams also use a combination of Make and Apify to run scheduled E2E tests on smaller projects. - Source: dev.to / over 2 years ago
Crawlee: A new webscraping framework by Apify. - Source: dev.to / over 2 years ago
Actively maintained and developed by Apify—we use it ourselves! Source: over 2 years ago
Hi there! We dont have any benchmarks for Crawlee just yet, but we are working on those as we speak. We care deeply about bot detection, one of the features of Crawlee is generated fingerprints based on real browser data we gather - you can read more about it in the https://github.com/apify/fingerprint-suite repository, which is used under the hood in Crawlee. Crawlee is and always will be open source. It... - Source: Hacker News / over 2 years ago
Have you looked at Apify? They have a freelancer section. https://apify.com/. Source: over 2 years ago
I'm working on a personal project that involves A LOT of scraping, and through several iterations I've gotten some stuff that works quite well. Here's a quick summary of what I've explored (both paid and free): * Apify (https://apify.com/) is a great, comprehensive system if you need to get fairly low-level. Everything is hosted there, they've got their own proxy service (or you can roll your own), and their... - Source: Hacker News / over 2 years ago
I would add Apify it is a good alternative to Phantomubster for some workflow QApop.com for Quora marketing Zapier or intgromate to glue all the tools together. Source: about 3 years ago
Check out apify.com It can do everything you need. Source: about 3 years ago
To give you an example, I'm part of Apify, a web scraping and automation company. Here at Apify, we are partners with Thorn, an anti-human trafficking organization, and we use our web scraping technology to help identify and fight child traffickers online. Clearly, this is a good use of web scraping. Nevertheless, this same technology can be used for conducting illegal activities, such as collecting personal data... Source: over 3 years ago
You can host your code on Apify and then use their proxies. Residential proxies are part of their free plan, and if you need something more powerful in the future, they also offer rotating proxies for a cheap monthly subscription fee. Source: over 3 years ago
Apify is a web scraping tool specialized in Amazon’s data, and its aim is to provide what the official API of Amazon can not provide for the users. Apify’s Amazon Scraper harvest and download data, including detailed descriptions of online products, images of the items, prices, pictures, the name of the seller, condition of the article, whether they are new, refurbished, or broken, and all other information... - Source: dev.to / over 3 years ago
Do you know an article comparing Apify to other products?
Suggest a link to a post with product alternatives.
This is an informative page about Apify. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouranged and appreciated as they help everyone in the community to make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.
Managing the API is more convenient.
Is very good.