apify / crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
https://crawlee.dev/python/
Apache License 2.0
4.63k stars 319 forks source link

Crawlee for Python Hacktoberfest 2024 🧡 🐍 #555

Open souravjain540 opened 1 month ago

souravjain540 commented 1 month ago

Crawlee for Python Hacktoberfest 2024 [Starting Oct 1, 2024]

Hacktober 2024 Crawlee

Prizes 🏆

Valid Issues.

Rules and T&C.

Horlaitan15 commented 4 weeks ago

Hi, I'm not able to join the Discord channel and I would like to contribute to Crawlee. Came across it today. Thank you

akshay11298 commented 2 days ago

Hi @souravjain540 , how do we get the swags? Is there some form or something that needs to be filled from our side?