Codecov Report

Merging #57 (ec8cb49) into master (26395b6) will increase coverage by 12.91%. The diff coverage is 70.22%.

@@             Coverage Diff             @@
##           master      #57       +/-   ##
===========================================
+ Coverage   41.17%   54.08%   +12.91%     
===========================================
  Files           5        7        +2     
  Lines        1059     1773      +714     
===========================================
+ Hits          436      959      +523     
- Misses        623      814      +191

Impacted Files	Coverage Δ
pysradb/cli.py	`0.00% <0.00%> (ø)`
pysradb/download.py	`22.22% <20.68%> (-2.78%)`	:arrow_down:
pysradb/search.py	`79.81% <79.81%> (ø)`
pysradb/exceptions.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 26395b6...ec8cb49. Read the comment docs.

saketkc commented 4 years ago

This is super awesome @bscrow! Many thanks for your contribution and for the awesome work you have done over GSoC2020! I believe this will be a huge help to a lot of researchers!

I have left some comments, most of them are minor. It would be great if they can be addressed. Of all, it is particularly important we output all the URLs rather than selecting the best one ourselves.

Great work!

cc @mvdbeek @amalthomas111

amalthomas111 commented 3 years ago

When using -g it might be a good idea to have a dynamic naming prefix/suffix for the plots. Could use time stamps. Otherwise, plots would be overwritten.

amalthomas111 commented 3 years ago

For -G, -Y, -Z options it would be great if you could create a file in GitHub or locally which users can refer to, compiling possible options for each of these tags. In the help (-h) options, you can refer to the link of this file or local path.

amalthomas111 commented 3 years ago

pysradb search  -q "single-cell RNA-seq" -g  -D  01-01-2008:01-10-2020

This command does not work. Gives the error: ValueError: bins must be positive, when an integer

amalthomas111 commented 3 years ago

pysradb search -d geo -q "single-cell RNA-seq" -m 10K -o test_1ksc

First status bar was showing me 0/100000 [00:00<?, ?it/s]. I mentioned 10K, not 100K. When I mentioned 100, it is showing 1K, a factor of 10 is more. I think this happens with -d geo, not with sra. For both -m = 100 and 10K, I got a connection error: http.client.RemoteDisconnected: Remote end closed connection without response. During handling of the above exception, another exception occurred: Is it NCBI issue?

I am getting connection/operation time out for almost all m > 100 for db=geo/sra. Need to look into this!