Closed yuw444 closed 2 years ago
Hi @yuw444 - I'm a bit confused by that example. The two reads shown are not the same 25834:1173
vs 31345:1261
in the fastq 'header' lines.
@TomSmithCGAT, sorry for the confusion. I have updated the question above.
The coding scheme(format) of R1 and R2 after extraction may benefit the pipeline. But cellranger will complain if I feed them into cellranger pipeline. So, I was thinking put the filtered R1, R2 back into cellranger pipeline to keep the consistency of my project, as I used cellranger pipeline for UMI counts.
Probably easiest if instead of putting the sequence back in the R1, you discard the processed R1, and rebuild it with the sequence intact from the unprocessed file.
Thanks for your suggestion.
It seems to me that the format fastq file has been changed, is there any way I can keep their original format after extraction.
Before: R1:
R2:
After extraction:
R1:
R2:
I saw
CBs
has been added in the topline after extraction inR2.filtered.fastq.gz
, I could manually put their back toR1.filtered.fastq.gz
file. However, the reading quality line has been deleted after extraction inR1.filtered.fastq.gz
.In my setting, I have to put the
filtered.fastq.gz
back to cellranger pipeline.Please help.
Thanks so much.