Closed yzong12138 closed 5 months ago
The bug should comes from the line 162: It should change from
with io.BytesIO(unlzw3.unlzw(path)) as f:
to
with io.BytesIO(unlzw3.unlzw(Path(path))) as f:
Before it will pass a str
to the unlzw to decompress but now the input for the unlzw()
is a Path
so it will try to load the data from the path first.
closed with #247
nocr/trec-robust-2004 dataset, cannot ready the document from the file successfully A clear and concise description of what the bug is.
Affected dataset(s) nocr/trec-robust-2004
To Reproduce Steps to reproduce the behavior:
~/.ir_datasets/disks45/corpus/NEWS_data
Expected behavior