dmis-lab / biobert-pytorch

PyTorch Implementation of BioBERT
http://doi.org/10.1093/bioinformatics/btz682
Other
299 stars 104 forks source link

The dataset download doesnt work anymore #34

Open ghost opened 1 year ago

ghost commented 1 year ago

Running download.sh returns an error. Probably google changed something:

BIOBERT_DATA not set; downloading to default path ('data'). --2022-12-13 19:19:09-- https://docs.google.com/uc?export=download&confirm=&id=1cGqvAm9IZ_86C4Mj7Zf-w9CFilYVDl8j Resolving docs.google.com (docs.google.com)... 173.194.216.139, 173.194.216.101, 173.194.216.100, ... Connecting to docs.google.com (docs.google.com)|173.194.216.139|:443... connected. HTTP request sent, awaiting response... 404 Not Found 2022-12-13 19:19:09 ERROR 404: Not Found.

gzip: stdin: unexpected end of file tar: Child returned status 1 tar: Error is not recoverable: exiting now

wonjininfo commented 1 year ago

Hi, It seems like my school has ceased to provide google drive access. I have updated the URLs in download.sh.

Also, we have made a mirror at the following URLs: http://nlp.dmis.korea.edu/projects/biobert-2020-checkpoints/biobert_v1.1_pubmed.tar.gz http://nlp.dmis.korea.edu/projects/biobert-2020-checkpoints/NERdata.zip http://nlp.dmis.korea.edu/projects/biobert-2020-checkpoints/REdata.zip http://nlp.dmis.korea.edu/projects/biobert-2020-checkpoints/QA.zip

Thank you. Best, Wonjin