This project tests how the browser language can be changed with Puppeteer. It implements multiple options to set the language of Chrome and checks each option against BrowserLeaks to see how it affected the JavaScript proeprties and HTTP headers available by the browser. For more information, see my article The Puppeteer Language Experiment on DEV.to. - Source: dev.to / 17 days ago
In Crawlee, you can scrape JavaScript rendered websites using the built-in headless Puppeteer and Playwright browsers. It is important to note that, by default, Crawlee scrapes in headless mode. If you don't want headless, then just set headless: false. - Source: dev.to / 18 days ago
I am not in any way associated with the developers at puppeteer, but if you are looking for a way to contribute, they are open source. - Source: dev.to / about 1 month ago
Puppeteer is a Node library that provides a high-level API to control headless Chrome or Chromium. It's primarily used for browser automation, making it a powerful tool for end-to-end testing of web applications, taking screenshots, and generating pre-rendered content from web pages. - Source: dev.to / 3 months ago
While similar to Puppeteer, Cypress, and Selenium, there are some differences. Let’s find out what they are. - Source: dev.to / 8 months ago
The most widely used browser automation frameworks for scraping, end to end testing, and so on is literally called Puppeteer [1] :-) [1] https://github.com/puppeteer/puppeteer. - Source: Hacker News / 11 months ago
Puppeteer is a powerful browser automation library for web scraping and integration testing. However, the asynchronous, real-time API leaves plenty of room for gotchas and antipatterns to arise. - Source: dev.to / 12 months ago
Browserless.io hosts a REST API allowing to create remote puppeteer instances usable with puppeteer.connect(). Source: about 1 year ago
Some more interesting features: - The Lua Filter allows you to program your own functionality if something is missing from the built-in Filters. - Raw Puppeteer session control through Browserless. Source: about 1 year ago
Selenium, Puppeteer, and Playwright are the most used and known libraries. Avoiding them for performance reasons would be preferable, and they'll make scraping slower. But sometimes, there is no alternative. - Source: dev.to / about 1 year ago
Clone (or download the zip file) of puppeteer. URL: https://github.com/puppeteer/puppeteer. Source: about 1 year ago
Puppeteer is a browser automation library for JavaScript that uses the DevTools protocol to programmatically control Chromium or Chrome browsers. With more than 80K stars on GitHub, it is the de facto standard in headless browser automation. Puppeteer is written in TypeScript, which makes it easy to start with because of great code completion in IDEs. - Source: dev.to / about 1 year ago
Puppeteer is a powerful Node.js browser automation library for integration testing and web scraping. However, like any complex software, it comes with plenty of potential pitfalls. - Source: dev.to / over 1 year ago
What languages do you know? https://github.com/puppeteer/puppeteer works pretty well for this kind of thing. Source: over 1 year ago
I use this technique to build a personal dashboard. Rather than try to scrape data, then come up with a nice presentation for it, I just find a nice representation on the web for the data I want on the dashboard, then use Puppeteer[1] to automatically screenshot the specific DOM element that contains the thing I want. Works like a champ. [1]: - Source: Hacker News / over 1 year ago
That’s not how you scrape CSR Webpages, use something like puppeteer https://github.com/puppeteer/puppeteer. Source: over 1 year ago
Puppeteer by Google is a Node.js library providing an API for headless browser control via DevTools Protocol. Originally, it was an automated testing library, but it has been successfully used for web scraping. Supported browsers: Chrome / Chromium, limited – Firefox. Supported programming languages: Node.js; has an unofficial Python library, “Pyppeteer.”. - Source: dev.to / over 1 year ago
Essentially you're on your own. What I did, is create a custom browser extension that grabs my transactions as soon as I sign in to the issuer's platform (I got it working properly for Discover and Chase). If you're willing to go a little further you could probably use puppeteer to automatically log and and do some actions for you. Source: over 1 year ago
I didn't set up Puppeteer to take screenshots of the sites but decided to use a screenshot as a service API. It gives a bunch of free screenshots to render websites, so that's enough to play with our project. Because taking screenshots of the site is not a core topic of the post, and using Puppeteer would be a pain in the ass. - Source: dev.to / over 1 year ago
Sounds like something that should be automated through Puppeteer or Selenium. Source: over 1 year ago
You can start web-scraping with Node.js and puppeteer - https://github.com/puppeteer/puppeteer. There is plenty of tools for every language and platform. Source: over 1 year ago
Do you know an article comparing puppeteer to other products?
Suggest a link to a post with product alternatives.
This is an informative page about puppeteer. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouranged and appreciated as they help everyone in the community to make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.