issues
search
CDCgov
/
datasets-sars-cov-2
Benchmark datasets for WGS analysis of SARS-CoV-2. (https://peerj.com/articles/13821/)
Apache License 2.0
54
stars
18
forks
source link
Fix truncated reads
#17
Closed
lskatz
closed
2 years ago
lskatz
commented
2 years ago
Took advice from @BioWilko to fix truncated reads by not using the
--gzip
option on
fastq-dump
in #13
Hashsums changed as a result for all datasets
Hashsums varied with gzip metadata and so we had to check hashsums on uncompressed reads
Hashsums have been recalculated accordingly
Unit testing filled up the available space for one dataset and so unit testing has been altered to chunk 20 at a time
--gzip
option onfastq-dump
in #13