neuml / paperetl

📄 ⚙️ ETL processes for medical and scientific papers
Apache License 2.0
342 stars 27 forks source link

sqlite3.OperationalError: database is locked #45

Closed Chriszhangmw closed 8 months ago

Chriszhangmw commented 1 year ago

!python -m paperetl.file paperetl/file/data paperetl/models

get the following error:

huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either:

Chriszhangmw commented 1 year ago

after I kill the process related to sqlite, run again, but get another error:

huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either:

davidmezzetti commented 1 year ago

I would delete everything from paperetl/models and try again. The database locked error would only happen if another process was accessing it. I would make sure there isn't multiple instances of the paperetl process running.

Chriszhangmw commented 1 year ago

image

Chriszhangmw commented 1 year ago

image the size is 0

davidmezzetti commented 1 year ago

Do you have data in paperetl/data? It seems like nothing was processed.

davidmezzetti commented 8 months ago

Closing this issue due to inactivity. Please re-open or open a new issue if there are further questions.