BikLab / BITMaB-workshop

Workshop tutorials for the 2017 Benthic Invertebrates, Metagenomics, and Bioinformatics Workshop in Corpus Christi, Tx (January 9-13, 2017)
6 stars 5 forks source link

already demultiplexed fastqs: mapping file + prep of fastq files #3

Open elpape opened 7 years ago

elpape commented 7 years ago

Hi Taruna or Holly!

I just wanted to make sure I understood how the mapping file for already demultiplexed fastqs should be set up.

In addition, I was wondering for step 1c in the tutorial, why you only truncate the reverse primer? should you not truncate the forward primer as well?

Thanks! Ellen

tarunaaggarwal commented 7 years ago

Hi Ellen,

  1. I'd keep the dummy string in the barcode column. Sometimes Qiime complains if a cell is empty.

  2. The LinkerPrimerSequence is a little confusing. Yes, just put the sequence down in the LinkerPrimerSequence column. If it helps, here is sample mapping file from Holly. You will notice that she has both forward and reverse primer seqs in the LinkerPrimerSequence column but has an additional column with the primer orientation info. You can create a mapping file like that as well.

  3. We truncate the reverse primer only because the forward primer should have already been removed during the demultiplexing step. You can check for the presence of your forward primer in your seqs by typing the following. The result should be 0.

grep -c "^primer-seq" file.fasta

Hope this helps. Let me know if you have more questions. Thanks! Taruna

elpape commented 7 years ago

Hi Taruna,

Thanks for your feedback and my apologies for my late reply (I will hopefully find the time to continue working on this in April).

I am confused about putting both the forward and reverse primers in the same column (under LinkerPrimerSequence). As can be seen from the example mapping file, this means that you have two diff sample IDs for the same sample (one with suffix F04 and another one with suffix R22). Does that not complicate later processing/analyses, as they are in fact the same sample? Probably you can just merge these two later on (making use of pattern recognition), I guess..

Thanks, Ellen

tarunaaggarwal commented 7 years ago

Hi Ellen,

Right! I see how that is confusing. So when I posted that reply, I guess my understanding of the dataset was a little lacking. These data of ours contain non-overlapping amplicons which makes things complicated. I agree with you that the samples IDs are two different IDs.

So here is my new answer--

LinkerPrimerSequence is the forward primer and the ReversePrimer is the reverse primer. Given your primers, I believe I have the correct sequences for them. This is what your mapping file should look like. Please double check the primer sequences.

elpape commented 7 years ago

Hi Taruna,

It seems the mapping file you are referring to has been removed. Can you put it back?

Thank you!! Ellen

From: Taruna [mailto:notifications@github.com] Sent: 28 February 2017 19:09 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)

Hi Ellen,

Right! I see how that is confusing. So when I posted that reply, I guess my understanding of the data set was a little lacking. This data of ours contains non-overlapping amplicons which makes things a little complicated. I agree with you that the samples IDs are two different IDs.

So here is my modified answer. LinkerPrimerSequence is the forward primer and the ReversePrimer is the reverse primer. Given your primers, I believe I have the correct sequences for them. Thishttps://www.dropbox.com/s/tpdyj9xzqz0l9i4/epape_mapping_file_Feb2017.txt?dl=0 is what your mapping file should look like. Please double check the primer sequences.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-283118208, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_fROce0RZBdU4HLf5X1Nd8wDdhRHks5rhGLGgaJpZM4L9Y2n.

tarunaaggarwal commented 7 years ago

It is fixed now...Sorry about that

elpape commented 7 years ago

Hi Taruna!

Sorry don’t want to be difficult but the link still does not work :/

Cheers, Ellen

From: Taruna [mailto:notifications@github.com] Sent: 20 March 2017 18:54 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)

It is fixed now...Sorry about that

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-287843502, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_T16xFfEOHMcoJowgCaw9NPvG50yks5rnr1AgaJpZM4L9Y2n.

tarunaaggarwal commented 7 years ago

Weird! Okay, here it is. Hopefully this one works!

elpape commented 7 years ago

Yes! This works! ☺

From: Taruna [mailto:notifications@github.com] Sent: 22 March 2017 15:48 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)

Weird! Okay, herehttps://www.dropbox.com/s/y1z4woakzzjvvmf/epape_mapping_file_March2017.png?dl=0 it is. Hopefully this one works!

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-288422208, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_S8aSAo3F6UvQH783MKBYfqugqNkks5roTSkgaJpZM4L9Y2n.

jianshu93 commented 6 years ago

what if forward primer is not removed in per sample based situation. How can I remove forward primer in that case. How should I change the parameter in split_libraries_fastq.py parameters to remove forward primer? add 'mapping_fps' parameter?