scrapy / scurl

Performance-focused replacement for Python urllib
Apache License 2.0

urljoin canonicalizes URLs by default #49

Closed malloxpb closed 6 years ago

malloxpb commented 6 years ago

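scrapy/scrapy#3332 failed because the urljoin function uses the GURL container, which canonicalizes the joined URLs by default. The standard library's urljoin returns the joined URL without canonicalizing it, so the two can produce different results for the same input. We will need to come up with a way to fix this!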
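To illustrate the kind of mismatch described here, below is a minimal sketch using only the standard library. It does not call scurl or GURL: `canonicalizing_join` is a hypothetical helper that merely imitates two typical canonicalization steps (lowercasing the host and percent-encoding the path), and the example URLs are made up. The exact rules GURL applies may differ.

```python
from urllib.parse import quote, urljoin, urlsplit, urlunsplit


def canonicalizing_join(base, ref):
    """Hypothetical stand-in for a join that canonicalizes its result.

    This is NOT scurl's or GURL's implementation; it only mimics two
    common canonicalization steps for illustration.
    """
    parts = urlsplit(urljoin(base, ref))
    return urlunsplit((
        parts.scheme,
        parts.netloc.lower(),          # assumed step: lowercase the host
        quote(parts.path, safe="/%"),  # assumed step: percent-encode the path
        parts.query,
        parts.fragment,
    ))


# Made-up inputs: an uppercase host and a reference containing a space.
base, ref = "http://EXAMPLE.com/a/", "b c"

plain = urljoin(base, ref)                  # standard library join, no canonicalization
canonical = canonicalizing_join(base, ref)  # join followed by canonicalization

print(plain)
print(canonical)
print(plain == canonical)
```

Code that compares the joined URL against the standard library's output would see a difference for inputs like these, which is one way a drop-in replacement could break a caller's tests.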

malloxpb commented 6 years ago

Resolved in #52