Connect to remote browser services

apify / crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Apache License 2.0

15.48k stars 664 forks source link

Which package is the feature request for? If unsure which one to select, leave blank

@crawlee/browser (BrowserCrawler)

Feature

There are cloud browser services like Browserless. So that we can use remote browsers to run our automation tasks.

Motivation

Allows for remote programs to connect, pilot, and execute headless browser tasks

Ideal solution or implementation, and any additional constraints

Add a connectOptions similar to the launchOptions

Alternative solutions or implementations

No response

Other context

No response

apify / crawlee