ctb opened 2 years ago (status: Open)
postprocessing and cleaning MAGs / checking them against all the things
regulatory evaluation: "this organism is/is not widespread"
content-based identification of SRA data sets - e.g. scaled=1m, first 100 hashes, md5sum (or something like that)
seeing if a sample is in the database / identifying if sample is there
differential privacy/dbGaP search (at the level of technical replicates)
biogeography - where might I look
discovering more examples of strains/species of an interesting species/genus
outbreak detection - plants and humans and animals / One Health
spillover idea/spillover risk
"finding gut microbes" example writ larger
notification service of new matches
content-based (re)annotation of stuff in the SRA
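
The "scaled=1m, first 100 hashes, md5sum" idea for content-based identification could be sketched roughly as below. This is only an illustration of the recipe, not an existing implementation: the function names (`canonical_kmers`, `content_id`), the k-mer size of 31, the use of `blake2b` as a stand-in for whatever 64-bit hash the sketching tool uses, and the comma-joined serialization fed to md5 are all assumptions.

```python
import hashlib

MAX_HASH = 2**64  # size of the 64-bit hash space
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA string (ACGT only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical_kmers(seq, ksize):
    """Yield each k-mer as the lexicographic min of itself and its reverse complement."""
    seq = seq.upper()
    for i in range(len(seq) - ksize + 1):
        kmer = seq[i:i + ksize]
        yield min(kmer, revcomp(kmer))


def hash_kmer(kmer):
    """64-bit hash of a k-mer. blake2b is a stand-in (assumption), not a tool's real hash."""
    digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def content_id(seqs, ksize=31, scaled=1_000_000, num=100):
    """Hypothetical content-based ID: md5 of the `num` smallest hashes kept at `scaled`."""
    threshold = MAX_HASH // scaled  # keep roughly 1 in `scaled` distinct k-mers
    kept = set()
    for seq in seqs:
        for kmer in canonical_kmers(seq, ksize):
            h = hash_kmer(kmer)
            if h < threshold:
                kept.add(h)
    first = sorted(kept)[:num]  # "first 100 hashes" = the 100 smallest
    # Serialization is an assumption: comma-joined decimal integers.
    payload = ",".join(str(h) for h in first).encode()
    return hashlib.md5(payload).hexdigest()
```

Because the kept hashes are collected into a set and sorted, the ID does not depend on read order or strand, which is what makes it "content-based". With scaled=1m a data set needs on the order of a hundred million distinct k-mers to fill all 100 slots; if no hash falls under the threshold, this sketch returns the md5 of the empty string, so small inputs would need a lower `scaled` or separate handling.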