Open HitMonk opened 2 weeks ago
Some additional information would help in diagnosing your read loss:
plotQualityProfile
on a typical sample?derepFastq(..., verbose=TRUE)
on a typical sample?Hello @benjjneb Thank you for your reply. Below are all the answers the to questions:
- What sequencing instrument/chemistry generated this
The samples were used to prepare a PacBio SMRTbell library using AMPure PB bead-based purification and sequenced on a PacBio Sequel IIe system.
- What environment are you sequencing?
The samples are from a fermented plant roots, commonly used locally in daily diet.
- What is the output of plotQualityProfile on a typical sample?
PlotQualittyProfile before trimming:
PlotQualityProfile after trimming:
- What is the output to the console of derepFastq(..., verbose=TRUE) on a typical sample?
Ah! This was the other question that I had but forgot to mention in the first post. It seems like there are many unique sequences too - >95% of the data. Is it possible that the high error rate shows up as unique sequences?
drp_filts <- derepFastq(filts, verbose=TRUE)
Dereplicating sequence entries in Fastq file:
Encountered 7880 unique sequences from 8115 total sequences read.
Dereplicating sequence entries in Fastq file:
Encountered 13492 unique sequences from 14767 total sequences read.
Dereplicating sequence entries in Fastq file:
Encountered 17853 unique sequences from 22369 total sequences read.
Thank you once again for your time and support. Please let me know if you would like any additional information from my side.
Best,
Hello All! Im working with PacBio reads for the very first time. Compared to the tutorial, Im see im loosing really large number of reads. Below is the table:
Additionally, why do i see a drop in read number post primer trimming?
Below are all the commands that I have used:
Primer trimming:
Give path and trim primers
Quality filtering and trimming:
STEP 2. Dereplicate the reads
STEP 3. Learn Errors
err2 <- learnErrors(drp_filts, errorEstimationFunction=PacBioErrfun, BAND_SIZE=32, multithread=TRUE)
STEP 4. DADA2 main inference
dd2 <- dada(drp_filts, err=err2, BAND_SIZE=32, multithread=TRUE)
Is there anything that im doing wrong? Im open to all suggestions and advice from your side.
Best.