Closed eparejatobes closed 7 years ago
Illumina raw datasets to run the tests are here s3://era7p/mg7-test/data/in/
and the datasets for PacBio testa are here s3://era7p/pacbio/data/in/
, files ending with 16S.fastq.gz
OK good; we just need to put them where the code says they are.
@eparejatobes where exactly?
I found this "resources.ohnosequences.com", mg7.organization)/mg7.artifact/
but don't know the exact values for mg7.organization and mg7.artifact
@marina-manrique
organization
is ohnosequences
artifact
is mg7
version
is whatever sbt version
tells youBut I think what @eparejatobes meant is to synchronize locations (code vs. wherever we want to keep this data), not just put data where the code says (because currently code refers to some strange old locations).
You can check current actual locations for input data in sbt with this snippet:
> ohnosequences.test.mg7.BeiMockPipeline.inputSamples.foreach { case (sampleID, (l, r)) => println(s"${sampleID}\n ${l.resource}\n ${r.resource}") }
Here's example output:
ERR1049996
s3://era7p/mg7-test/data/out/reads-preprocessing/ERR1049996_1_val_1.fq.gz
s3://era7p/mg7-test/data/out/reads-preprocessing/ERR1049996_2_val_2.fq.gz
Same for other pipelines.
Ok, so I'm checking this with @eparejatobes later, thanks! @laughedelic
I've put the input data here
s3://resources.ohnosequences.com/ohnosequences/mg7/mock-communities-data/illumina/
s3://resources.ohnosequences.com/ohnosequences/mg7/mock-communities-data/pacbio/
@eparejatobes should I change something in the code?
@marina-manrique update/fix these two pipeline definitions here:
@rtobes @eparejatobes I've just realised that, for Illumina, we tested the tool with preprocessed reads. So far we have only the preprocessed data of the sample ERR1049996. Do you want me to do the same preprocessing with the rest of the illumina samples so we can test MG7 with all the illumina datasets?
This needs to be updated to the current master (after major changes in #112). I'll do it later.
After #104, we need to check that everything is where it should.