molecularsets / moses

Molecular Sets (MOSES): A Benchmarking Platform for Molecular Generation Models
https://arxiv.org/abs/1811.12823
MIT License
821 stars 241 forks source link

Error occurs when loading datasets, seems a gzip error #102

Open CandyClass opened 2 years ago

CandyClass commented 2 years ago

train = moses.get_dataset('train') the error shows: Not a gzipped file (b've') Seems a quite basic question but I just can't figure it out.

BWyueyue commented 2 years ago

Have you solved it?

CandyClass commented 2 years ago

Have you solved it?

nope, I give up on this package..

Liar-zzy commented 2 years ago

Have you checked the integrity of the file? I just solve it.

daniil-polykovskiy-insilico commented 2 years ago

Hi all, Insilico team will review this issue. We'll keep you posted

BWyueyue commented 2 years ago

Have you solved it?

It's fine now

yangzhao1230 commented 2 years ago

I met the same error. Could you please tell me how you solved it? Thank you in advance.

yangzhao1230 commented 2 years ago

Have you checked the integrity of the file? I just solve it.

Hi, I met the same error. Could you please explain the solution in more details? Thank you!

D-coban commented 1 year ago

Have you checked the integrity of the file? I just solve it.

Hi, I am also having the same issue. Would it be possible to explain how did you manage to solve the issue? Kind regards.

filipekstrm commented 11 months ago

Anyone solved this and would like to share how they did? I seem to have an issues also with moses.datasets.get_statistics and I am thinking it could be related

filipekstrm commented 11 months ago

Anyone solved this and would like to share how they did? I seem to have an issues also with moses.datasets.get_statistics and I am thinking it could be related

If anyone else stumbles upon this problem now, I had installed by cloning the repo and running python setup.py install, and by replacing the files in moses/dataset/data with files that I had on another system where the package didn't give this error and rerunning the installation script solved my problem

sei0o commented 3 months ago

On my environment this error message was shown when I forgot to fetch the files served via LFS. The dataset files (*.gz) are usually larger than code or documents so they are managed on the external storage. When LFS is enabled for a file, its content in the repository is replaced with a pointer to where the original content (dataset) is saved and it means git has to resolve those pointers and download the original content. You might want to try Installing Git Large File Storage and execute git lfs fetch .