apify / crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
https://apify.github.io/crawlee-python/
Apache License 2.0
27 stars 1 forks source link

fix: handling of relative URLs in add_requests #213

Closed janbuchar closed 6 days ago

janbuchar commented 1 week ago
janbuchar commented 1 week ago

Good job! Could you please just add the change description to the changelog? Thanks.

I mean, sure. But is there any chance we could automate that? It's pretty much the reason why we bother with conventional commits after all.

vdusek commented 1 week ago

I mean, sure. But is there any chance we could automate that? It's pretty much the reason why we bother with conventional commits after all.

Completely agree; automating this would be great. One of the first issues opened addresses this #18. Hopefully, it can be done right after the public launch.