Closed biochristmas closed 3 months ago
Hi @biochristmas
So you are finding that the minimum length of the input reads are 500bp, while that of the processed reads is 50bp?
This is likely due to some of the reads being chimeric and split into multiple, smaller subreads.
Closing due to no response
I ran the command to identify full-length transcripts: pychopper -k PCS111 -r CK.report.pdf -u CK.unclassified.fq -w.rescued.fq CK.derRNA.fastq CK.full_length_output.fq. I conducted length statistics on the input sequences (CK.derRNA.fastq), with a minimum length of 500bp. The full-length transcripts (CK.full_length_output.fq) have a minimum length of 50bp. Why are the sequences derived from the input sequence (full-length transcript sequences) shorter instead?