bcgsc / RNA-Bloom

:hibiscus: reference-free transcriptome assembly for short and long reads
Other
85 stars 7 forks source link

Sequence header #14

Closed giantoni closed 3 years ago

giantoni commented 3 years ago

Hi,

I am so sorry for this question, but I didn't find an explanation of the sequence header of RNA-Bloom output.

All my sequences have this type of header: >E4.L.4 l=306. Please, confirm if I am right:

E4 is the gene number and L4 is the isoform number?

Many thanks in advance!

Gi

rnabloom -left sample_1_R1.fastq.gz -right sample_1_R2.fastq.gz -revcomp-right -ntcard -t 80 -outdir sample1_transcriptome

kmnip commented 3 years ago

Hi,

There is currently no gene name assigned in RNA-Bloom. E4.L.4 is the isoform name. E4 indicates that the isoform was extended from a fragment with minimum k-mer coverage >= 10,000 (1E4). L indicates that the seed fragment was longer than the minimum fragment length threshold.

Ka Ming

Akanksha2511 commented 2 years ago

Hi, can you please explain the sequence header. For example in here ">2 l=3215 c=1529.4984 s=2" what is the before and after . part of c.

Thanks, Akanksha

kmnip commented 2 years ago

l is the sequence length s is the ID of the seed sequence to used from the previous stage

Akanksha2511 commented 2 years ago

Could you please also explain what is c in the sequence header.

Thanks, Akanksha

On Thu, 10 Feb 2022 at 7:23 pm, Ka Ming Nip @.***> wrote:

l is the sequence length s is the ID of the seed sequence to used from the previous stage

— Reply to this email directly, view it on GitHub https://github.com/bcgsc/RNA-Bloom/issues/14#issuecomment-1034620274, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADBOG4SQS7DHO6OUSGCZ7PDU2NYZDANCNFSM4TL5EULQ . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

You are receiving this because you commented.Message ID: @.***>

kmnip commented 2 years ago

c is the estimated per-base read depth. It's very experimental, not very accurate.