apify / crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
https://crawlee.dev/python/
Apache License 2.0
4.64k stars 319 forks source link

Add a guide for deployment to Google cloud #707

Open janbuchar opened 4 days ago

janbuchar commented 4 days ago

https://crawlee.dev/docs/deployment/gcp-cheerio

francomanca93 commented 4 days ago

I would like to contribute to this guide, how can I do it?. I work with Google cloud every day.

janbuchar commented 4 days ago

Hi @francomanca93, and thank you so much for your offer! All you need to do is write the guide and put it in this folder - https://github.com/apify/crawlee-python/tree/master/docs/deployment. It should roughly match the same guide in the Javascript Crawlee docs (https://crawlee.dev/docs/deployment/gcp-cheerio). Let us know if you need any more pointers.