Open yari-iw opened 2 years ago
Thank you for reporting this issue. We ever saw the same errors but they were not easily reproducible from our servers. We ever suspected this might be due to firewall protection or loading policy of NCBI. We will test again and very likely lower the default downloading threads from 3 to 1 in order to fit their protection policy. The option will be added then. We forgot to change the version number in the code. Will fix them together. Thanks again for your helpful feedback.
Hi @ythuang0522, thank you for your quick answer.
Hi, I'm using homopolish (polish mode) in a pipeline and I noticed that some of the results I was getting were not reproducible. The logs helped me to identify that the problem comes from the sequences download :
command:
python3 homopolish.py polish -t 12 -a $assembly -s $homopolish_db -m R9.4.pkl -o .
logs:The error comes from the fact that by default homopolish download sequences by batches of 3 and this seems to overload some clients. By changing the variable max_pool_size (from the download.py script) to 1 instead of 3, all sequences are correctly downloaded.
As this can be a problem for reproducibility (and can be stay unnoticed until a proper testing is performed) would it be possible to add an option to manually set the number of requests or to lower he number of requests by default ?
I'm using the latest version of homopolish cloned from github earlier today (which I suppose is v0.4) but the
--version
option tells me I'm using :Homopolish VERSION: 0.3.4