Closed amacal closed 1 year ago
@amacal Thank you for your report. There was a numerical overflow issue while decoding a large file. I will fix it and release a new version as soon as possible.
@amacal Version 0.2.11 has fixed this issue. You can update and try it out.
Cool! It works even for the biggest file in the dataset: enwiki-20230501-pages-meta-history10.xml-p5096070p5137514.7z
Reading entry: SevenZArchiveEntry { name: "", has_stream: true, is_directory: false, is_anti_item: false, has_creation_date: false, has_last_modified_date: true, has_access_date: false, creation_date: FileTime(0), last_modified_date: FileTime(133278673660200860), access_date: FileTime(0), has_windows_attributes: true, windows_attributes: 0, has_crc: true, crc: 3468506479, compressed_crc: 0, size: 482374799893, compressed_size: 0, content_methods: [] }
Ok(16)
Ok("<mediawiki xmlns")
I try to decompress just few bytes of two different files, one file works, the other one not. Both files work correctly with 7zip.
enwiki-20230501-pages-meta-history23.xml-p50555787p50564553.7z - works enwiki-20230501-pages-meta-history5.xml-p956483p958045.7z - doesn't work
I used the following code to test it:
output for enwiki-20230501-pages-meta-history5.xml-p956483p958045.7z:
output for enwiki-20230501-pages-meta-history5.xml-p956483p958045.7z: