CDCgov / datasets-sars-cov-2

Benchmark datasets for WGS analysis of SARS-CoV-2. (https://peerj.com/articles/13821/)
Apache License 2.0
54 stars 18 forks source link

Request for inclusion of primer protocol data in dataset TSVs #16

Closed kevinlibuit closed 2 years ago

kevinlibuit commented 2 years ago

Having information on the primers utilized to generate the reads in the datasets could be useful for laboratories trying to re-create these assemblies while performing primer trimming properly.

niemasd commented 2 years ago

Agreed; including primer BED files (and ideally FASTA as well) would be incredibly helpful for reproducibility and for enabling benchmarking of complete end-to-end pipeline execution

iqbal-lab commented 2 years ago

+1

lskatz commented 2 years ago

I believe we finally have this all done after several incremental additions. Please reopen this ticket if you find anything missing. In one dataset, it was actual metagenomics with no primer scheme and so it was designated NA.