openzim / warc2zim

Command line tool to convert a file in the WARC format to a file in the ZIM format
https://pypi.org/project/warc2zim/
GNU General Public License v3.0
44 stars, 4 forks

Handle base href #243

Closed benoit74 closed 5 months ago

benoit74 commented 5 months ago

Fix #238

Changes:

Remark:

benoit74 commented 5 months ago

why do we need the SimpleUrlRewriter?

This is basically a mock for tests. It was previously done with a lambda that just returned the URL it was passed, but that was a bit of a hack, and it no longer works nicely now that we need to pass both url and base_href to the rewriter.
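To illustrate the idea, here is a minimal sketch of an identity rewriter used as a test double. It is not the actual `SimpleUrlRewriter` from this PR: the class name `IdentityUrlRewriter`, the `__call__(url, base_href)` signature, and the `rewrite_links` helper are all hypothetical, chosen only to show why a small class is easier to work with than a one-argument lambda once two arguments are passed.

```python
from typing import List, Optional, Tuple


class IdentityUrlRewriter:
    """Hypothetical test double: returns every URL unchanged."""

    def __init__(self) -> None:
        # Record each call so a test can check what the caller passed in.
        self.calls: List[Tuple[str, Optional[str]]] = []

    def __call__(self, url: str, base_href: Optional[str] = None) -> str:
        # Accept base_href (assumed signature) but deliberately ignore it.
        self.calls.append((url, base_href))
        return url


def rewrite_links(links, rewriter, base_href=None):
    """Hypothetical caller that hands both url and base_href to the rewriter."""
    return [rewriter(link, base_href) for link in links]


rewriter = IdentityUrlRewriter()
result = rewrite_links(["a.html", "img/b.png"], rewriter, base_href="../")
```

Compared with `lambda url: url`, the class accepts the extra argument explicitly and also records it, so a test can assert that the code under test forwarded the expected `base_href`.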