Closed taylorreiter closed 2 years ago
Model organisms (listed from memory, and from searching things like "illumina" on the SRA and seeing what taxonomies have the highest count):
Arcadia organisms:
Other:
Done here! https://github.com/Arcadia-Science/seqqc-build-contam-db. Integrated into workflow in #8
One of the goals of this pipeline is to screen new sequencing data for contamination.
Contamination in sequencing data can come from a lot of different sources: