pirovc / genome_updater

Bash script to download/update snapshots of files from NCBI genomes repository (refseq/genbank) with track of changes and without redundancy
MIT License
139 stars 14 forks source link

Parallel error when dowloading #50

Closed lshilab closed 2 years ago

lshilab commented 2 years ago

Hi, I tried to download the virus genome indicated in the readme file and the program runs successfully in the home folder. But when trying to download in the /mnt/d/ folder, it shows a parallel error. I need to download to the /mnt/d/ because of space limitations. Can you check if there is something wrong with my settings? Thanks!


Mode: NEW - DOWNLOAD Working directory: /mnt/d/virus/all_rna_virus/

Downloading assembly summary [2022-04-16_08-25-17]

Include this in the report:

If you get the error on smaller/fewer files, please include those instead.

pirovc commented 2 years ago

Looks like it's a parallel issue. In genome_updater, parallel is set to use temporary directory, same as the output directory. You may be able to work around the problem by changing it here. Alternatively you could try using a different parallel version, maybe the one distributed by conda?

lshilab commented 2 years ago

Thank you for pointing out this solution, I think it's quite possible that this will solve the problem. Unfortunately, I am just starting to learn and don't quite understand exactly what to do with the change you are pointing about. I tried to bypass this problem by changing the output folder. I was using the script under wsl, and when I couldn't run it, the output folder was not in wsl, but on another disk. I tried putting the output folder in the home directory and was able to download normally. I guess this is exactly the folder problem you pointed out. Thanks again for the script, it was very useful.