benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution
http://benjjneb.github.io/dada2/
GNU Lesser General Public License v3.0
459 stars 142 forks source link

Mock Community in DADA2 fungal ITS tutorial #1064

Closed sarahLy9 closed 4 years ago

sarahLy9 commented 4 years ago

Hello, I have a question about the data used for the fungal ITS DADA2 tutorial (https://benjjneb.github.io/dada2/ITS_workflow.html)

I understand you received the mock community from Bakker et al 2018. In that paper, he describes a few versions of the mock community, Staggered A, Staggered B, and Even. Which community did you use in the tutorial?

Also, am I correct in understanding you used ITS1 primers (ITS1F/ITS2) for this dataset?

I am troubleshooting my classification step, I used the staggered version B in my dataset but I am hoping to narrow down my issue by analyzing your tutorial dataset in parallel but I want to make sure I understand how the data was generated.

Thank you, Sarah

benjjneb commented 4 years ago

@nagasridhar Do you remember which of the Bakker mocks were used in the ITS tutorial?

nagasridhar commented 4 years ago

Hi Ben and Sarah,

For the ITS tutorial, we used the entire set of Amplicon Sequencing Library

1. This library does consist of Even, Staggered A and Staggered B.

The primers are BITS primers for forward reads and B58S3 for reverse reads.

The library used in the tutorial is described in detail in Section 2.3 in the Bakker et al 2018 paper.

On Mon, Jul 6, 2020 at 5:00 PM Benjamin Callahan notifications@github.com wrote:

@nagasridhar https://github.com/nagasridhar Do you remember which of the Bakker mocks were used in the ITS tutorial?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/benjjneb/dada2/issues/1064#issuecomment-654462642, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHLJGYKQQISUDYWMK5QCCGDR2I3OZANCNFSM4ORZD46Q .

sarahLy9 commented 4 years ago

Thank you both for you quick reply.

Do you have a metadata file available to tell me which sample corresponds to which mixture in the test data? I do not see any info available in the link where I downloaded the file (https://www.ebi.ac.uk/ena/data/view/PRJNA377530).

My apologies if I am missing something, I am new to NGS analysis. I included staggered mixture B in my experiment but did not identify any of the major taxa that are supposed to be in the mixture so I was hoping it was a classification issue. Being able to test my classifier on your data would help me narrow down my issue.

nagasridhar commented 4 years ago

Hi Sarah,

The details linking the Run file/fastq file to the community is in the link below:

https://www.ncbi.nlm.nih.gov/Traces/study/?query_key=13&WebEnv=NCID_1_21797049_130.14.18.48_5555_1594093597_1137155363_0MetA0_S_HStore&o=acc_s%3Aa

On Mon, Jul 6, 2020 at 9:11 PM sarahLy9 notifications@github.com wrote:

Thank you both for you quick reply.

Do you have a metadata file available to tell me which sample corresponds to which mixture in the test data? I do not see any info available in the link where I downloaded the file ( https://www.ebi.ac.uk/ena/data/view/PRJNA377530).

My apologies if I am missing something, I am new to NGS analysis. I included staggered mixture B in my experiment but did not identify any of the major taxa that are supposed to be in the mixture so I was hoping it was a classification issue. Being able to test my classifier on your data would help me narrow down my issue.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/benjjneb/dada2/issues/1064#issuecomment-654540997, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHLJGYNPOJK5RXICZPIOB4LR2JY5FANCNFSM4ORZD46Q .

sarahLy9 commented 4 years ago

Sorry, that link does not work, it says page not found.

nagasridhar commented 4 years ago

I think NCBI creates a unique link for each individual/session.

The easiest way to do this is go to:

https://www.ncbi.nlm.nih.gov/Traces/study/?

Enter the NCBI BioProject Number: PRJNA377530.

That should give you access to the metadata.

On Tue, Jul 7, 2020 at 10:34 AM sarahLy9 notifications@github.com wrote:

Sorry, that link does not work, it says page not found.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/benjjneb/dada2/issues/1064#issuecomment-654906636, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHLJGYKD4RRXDALT4LF6LJTR2MW67ANCNFSM4ORZD46Q .