Open lfoppiano opened 2 weeks ago
Hi @lfoppiano
It is likely that the download of https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_file_list.txt somehow failed, has error or was interrupted.
I think in case of failure, the downloaded file is still under data/pmc/oa_file_list.txt, if it's the case you can do a tail
on this file to see if the last line is broken. I am using apache FileUtils for the download, so maybe something more robust could help.
One solution, you can just rerun the command - it might work at some point if you have a good internet connection :)
I've ran the import, in the following order: crossref (I loaded around 96M records, before I had to stop because I ran out of space), HAL and then PMID, and I've got the following exception when running
./gradlew pmid
.Full log