Open alexyfyf opened 1 year ago
Hi @alexyfyf ,
Thanks for pointing out the problems of those files.
I have corrected those two files and updated them in the S3 bucket. Please have a look.
Please let us know if issues are found for other files as well!
Thank you. Warm regards, Ying
Hi Ying,
I did spot another file from dRNA also corruputed.
SGNex_MCF7_directRNA_replicate2_run2
It has quite a few problems, and I used the following code to fix it.
zcat SGNex_MCF7_directRNA_replicate2_run2.fastq.gz | sed 's/.*@/@/g' | sed '$d' | gzip > SGNex_MCF7_directRNA_replicate2_run2_fixed.fastq.gz
You can have a look and see if there's a better way.
Cheers, Alex
Hi Alex,
Thanks for the heads-up again and sharing your code for correcting that.
I think that's good already.
I have uploaded the corrected version just now.
Thank you Regards, Ying
Hi team,
I have downloaded some cDNA fastq files from you s3 repo. I found 2 files are not correctly formatted when I run QC with NanoPlot.
The first one has additional strings before the @ character of the first read.
The second one has a read with an unmatching length of quality score.
Can you confirm this? Cheers, Alex