hsiehsh168168 / warc-tools

Automatically exported from code.google.com/p/warc-tools

SRS 46 — The HTTrack archive file format and link strategy may vary from version to version of HTTrack... #52

GoogleCodeExporter closed this issue 8 years ago

GoogleCodeExporter commented 8 years ago
SRS 46 — The HTTrack archive file format and link strategy may vary from
version to version of HTTrack, therefore it shall be possible to adapt the
migration scripts to deal with these changes.

Original issue reported on code.google.com by gordon.p...@gmail.com on 27 Jul 2008 at 10:13

GoogleCodeExporter commented 8 years ago
Although the readme file specifies an HTTrack version, the HTTrack converter
works with the directory tree structure, not the HTTrack cache and config
files, so it should work with different versions.

Also, the HTTrack converter is based on other tools that are part of
warc-tools (e.g. app/python/file2warc.py), so it should be straightforward to
adapt the migration scripts.
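
To illustrate the directory-tree approach, here is a minimal sketch. It is not the code in app/python/file2warc.py and it does not write WARC records. The names `iter_mirror` and `default_url_for`, the `hts-cache` skip list, and the path-to-URL rule (first directory = host) are all assumptions made for the example. The point is that the link strategy sits in one replaceable function, so a change between HTTrack versions would only touch that function.

```python
import os
from typing import Callable, Iterator, Optional, Tuple


def default_url_for(rel_path: str) -> Optional[str]:
    """Hypothetical link strategy: treat the first directory as the host.

    Files directly under the mirror root (no host directory) return None
    and are skipped by the caller.
    """
    parts = rel_path.split(os.sep)
    if len(parts) < 2:
        return None
    return "http://" + "/".join(parts)


def iter_mirror(
    root: str,
    url_for: Callable[[str], Optional[str]] = default_url_for,
    skip_dirs: Tuple[str, ...] = ("hts-cache",),  # assumed cache directory name
) -> Iterator[Tuple[str, str]]:
    """Walk a mirror directory and yield (file_path, url) pairs.

    Only the directory tree is read; cache and config files are ignored.
    """
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune skipped directories in place so os.walk does not descend into them.
        dirnames[:] = sorted(d for d in dirnames if d not in skip_dirs)
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            url = url_for(os.path.relpath(path, root))
            if url is not None:
                yield path, url
```

Each yielded pair could then be handed to whatever tool writes the record. Supporting a different layout would mean passing another `url_for` function rather than changing the walk itself.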

Original comment by gordon.p...@gmail.com on 24 Oct 2008 at 12:37