Open erictleung opened 7 years ago
mockrobiota is a curated database of different mock community datasets: https://github.com/caporaso-lab/mockrobiota/tree/master/data/ The issue is that they are all raw datasets (fastq) and have to be processed through a pipeline to get OTU tables and taxonomy. But if we can do that, we can include these datasets in our tools with expected and measured diversity values. I would like everybody's opinion on whether this would be useful. If so, I can choose a few relevant datasets, run them through the usual mothur pipeline, and generate OTU tables and taxonomy to use.
Mock communities are commonly used, so it would be good to use their known structure as a reference and see how well these diversity measures perform on them.
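
To make the "expected vs. measured" idea concrete, here is a rough sketch of the kind of comparison this would allow. It uses the Shannon index as one example measure; the taxon names and counts are made up for illustration and are not taken from mockrobiota or from any mothur output.

```python
import math


def shannon(counts):
    """Shannon diversity index (natural log) from non-negative abundances."""
    total = sum(counts)
    # Zero-abundance taxa contribute nothing to the index, so skip them.
    props = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p) for p in props)


# Hypothetical mock community: four taxa mixed at equal abundance.
expected = {"taxon_A": 25, "taxon_B": 25, "taxon_C": 25, "taxon_D": 25}

# Hypothetical counts as they might appear in a processed OTU table,
# including one spurious low-abundance OTU.
observed = {
    "taxon_A": 40,
    "taxon_B": 30,
    "taxon_C": 20,
    "taxon_D": 9,
    "spurious_OTU": 1,
}

h_expected = shannon(list(expected.values()))
h_observed = shannon(list(observed.values()))

print(f"expected H' = {h_expected:.3f}")
print(f"observed H' = {h_observed:.3f}")
print(f"difference  = {h_observed - h_expected:+.3f}")
```

The same pattern should carry over to any other diversity measure we want to check: compute it once on the known composition and once on the processed table, then report the difference.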