Open elpape opened 7 years ago
Hi Ellen,
I'd keep the dummy string in the barcode column. Sometimes Qiime complains if a cell is empty.
The LinkerPrimerSequence is a little confusing. Yes, just put the sequence down in the LinkerPrimerSequence column. If it helps, here is sample mapping file from Holly. You will notice that she has both forward and reverse primer seqs in the LinkerPrimerSequence column but has an additional column with the primer orientation info. You can create a mapping file like that as well.
We truncate the reverse primer only because the forward primer should have already been removed during the demultiplexing step. You can check for the presence of your forward primer in your seqs by typing the following. The result should be 0.
grep -c "^primer-seq" file.fasta
Hope this helps. Let me know if you have more questions. Thanks! Taruna
Hi Taruna,
Thanks for your feedback and my apologies for my late reply (I will hopefully find the time to continue working on this in April).
I am confused about putting both the forward and reverse primers in the same column (under LinkerPrimerSequence). As can be seen from the example mapping file, this means that you have two diff sample IDs for the same sample (one with suffix F04 and another one with suffix R22). Does that not complicate later processing/analyses, as they are in fact the same sample? Probably you can just merge these two later on (making use of pattern recognition), I guess..
Thanks, Ellen
Hi Ellen,
Right! I see how that is confusing. So when I posted that reply, I guess my understanding of the dataset was a little lacking. These data of ours contain non-overlapping amplicons which makes things complicated. I agree with you that the samples IDs are two different IDs.
So here is my new answer--
LinkerPrimerSequence is the forward primer and the ReversePrimer is the reverse primer. Given your primers, I believe I have the correct sequences for them. This is what your mapping file should look like. Please double check the primer sequences.
Hi Taruna,
It seems the mapping file you are referring to has been removed. Can you put it back?
Thank you!! Ellen
From: Taruna [mailto:notifications@github.com] Sent: 28 February 2017 19:09 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)
Hi Ellen,
Right! I see how that is confusing. So when I posted that reply, I guess my understanding of the data set was a little lacking. This data of ours contains non-overlapping amplicons which makes things a little complicated. I agree with you that the samples IDs are two different IDs.
So here is my modified answer. LinkerPrimerSequence is the forward primer and the ReversePrimer is the reverse primer. Given your primers, I believe I have the correct sequences for them. Thishttps://www.dropbox.com/s/tpdyj9xzqz0l9i4/epape_mapping_file_Feb2017.txt?dl=0 is what your mapping file should look like. Please double check the primer sequences.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-283118208, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_fROce0RZBdU4HLf5X1Nd8wDdhRHks5rhGLGgaJpZM4L9Y2n.
It is fixed now...Sorry about that
Hi Taruna!
Sorry don’t want to be difficult but the link still does not work :/
Cheers, Ellen
From: Taruna [mailto:notifications@github.com] Sent: 20 March 2017 18:54 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)
It is fixed now...Sorry about that
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-287843502, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_T16xFfEOHMcoJowgCaw9NPvG50yks5rnr1AgaJpZM4L9Y2n.
Weird! Okay, here it is. Hopefully this one works!
Yes! This works! ☺
From: Taruna [mailto:notifications@github.com] Sent: 22 March 2017 15:48 To: BikLab/BITMaB-workshop BITMaB-workshop@noreply.github.com Cc: Ellen Pape Ellen.Pape@UGent.be; Author author@noreply.github.com Subject: Re: [BikLab/BITMaB-workshop] already demultiplexed fastqs: mapping file + prep of fastq files (#3)
Weird! Okay, herehttps://www.dropbox.com/s/y1z4woakzzjvvmf/epape_mapping_file_March2017.png?dl=0 it is. Hopefully this one works!
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/BikLab/BITMaB-workshop/issues/3#issuecomment-288422208, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AYf2_S8aSAo3F6UvQH783MKBYfqugqNkks5roTSkgaJpZM4L9Y2n.
what if forward primer is not removed in per sample based situation. How can I remove forward primer in that case. How should I change the parameter in split_libraries_fastq.py parameters to remove forward primer? add 'mapping_fps' parameter?
Hi Taruna or Holly!
I just wanted to make sure I understood how the mapping file for already demultiplexed fastqs should be set up.
BarcodeSequence is in my case not needed (since reads are already demultiplexed) and this column can be either left empty (according to description on the qiime website) or can contain a dummy string (e.g. NNNN)
LinkerPrimerSequence: is this the sequence of the forward gene-specific primer? (if so, why is it called LinkerPrimerSequence - very confusing) the Qiime website says: "each value in that column corresponds to the primer used to amplify that sample". since in my case primers were extended in a 2-step PCR where in the first step the gene-specific primer was added with a special tag attached, do I also need to include the sequence of the tag? My forward gene-specific primer should be SSU_F04, so I should just put down the sequence of this primer in this column, right?
ReversePrimer: this is the reverse gene-specific Primer? should I include the tag? (in my case, primer should be SSU_R22)
In addition, I was wondering for step 1c in the tutorial, why you only truncate the reverse primer? should you not truncate the forward primer as well?
Thanks! Ellen