Closed lyy1994 closed 2 years ago
Oh, I find the reason: unzip
was interrupted, so the database is not complete. After I unzip the file again, this error disappears.
Another small question: I find build_db.py
takes much time, so I interrupt it and run it again in the backend. Meanwhile, I delete all newly generated files like feverous-wiki-docs.db
and feverous-wiki-docs.db-journal
. Is that OK?
Hi @lyy1994, build_db.py
does not modify feverous_wikiv1.db
so running build_db.py
again is fine after removing the generated files (the program will stop if it finds a file with the same name). Does that answer your question?
Thank you for your reply! It solves my question :)
When I run the command
PYTHONPATH=src/feverous python src/feverous/baseline/retriever/build_db.py --db_path data/feverous_wikiv1.db --save_path data/feverous-wiki-docs.db
in README to reproduce the baseline, I encounter the following error message:It seems that the data I downloaded is not complete. But the data is downloaded via
./scripts/download_data.sh
, which should be OK.Thanks for helping me!