Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Currently, when a user runs their crawler, no log is printed, leaving users unaware of the crawler's progress and actions.
We have a log formatter already ready, but users need to configure it manually. Like this:
Can we make this setup a default? Possible solution: importing any module from Crawlee should configure the root logger automatically.