summary file - Githubissues

junchaoshi / sports1.1

Small non-coding RNA annotation Pipeline Optimized for rRNA- and tRNA-Derived Small RNAs

GNU General Public License v3.0

45 stars 16 forks source link

summary file #25

Closed daixiaozhuan closed 2 years ago

daixiaozhuan commented 2 years ago

Hi Junchao,

I want to generate the ratio of different RNA types. Firstly, I extracted the information directly from the file "*.mapped.sorted.txt" in the "processing_report" folder. But I check the reads number is not correct. Then I checked the summary file as below, why the Match_Genome number is larger than the Clean_Reads? Do you have any advice on which file should I use the output the summary? 企业微信截图_16397197048240

Thanks, Xiaozhuan

junchaoshi commented 2 years ago

It's not a normal result. Could you check the processing log file to see if everything works fine?

daixiaozhuan commented 2 years ago

Hi,

There is no error information in the log file.

Is it because of the multi-mapping reads? I checked several samples, which all have the same problem.

Xiaozhuan

daixiaozhuan commented 2 years ago

Hi,

This is the command that I used as below:

Xiaozhuan

junchaoshi commented 2 years ago

I mean the log file in the processing_report folder since you performed the -k parameter.

daixiaozhuan commented 2 years ago

It only has one txt file as below:

junchaoshi commented 2 years ago

Since there are warnings in the processing file, please make sure your input files and/or the reference files are valid.

daixiaozhuan commented 2 years ago

I checked the input and reference. But I didn't find any error. By the way, I want to make sure that I use the result in the "processing_report" to report the ratio of different small RNA types. Is it correct?

junchaoshi commented 2 years ago

Since there are warnings in the processing file, something should be wrong with your processing step. The ratio in the processing file represents the unique species of the total species aligned rather than the reads ratio.

daixiaozhuan commented 2 years ago

If I want to know the reads ratio. Then which file should I use?

daixiaozhuan commented 2 years ago

There are warnings because I used the reads that only can mapped to genome. So the unmapped genome part is empty.

junchaoshi commented 2 years ago

The reads of each type of small RNA and each sequence are in the [sample_name]_output.txt and [sample_name]_summary.txt which are in the [sample_name]_reusult folder. You do not need to remove genome_unmapped reads before annotation. The genome mapping information is already included in the result folder. And it may cause incorrect counting, as you asked at the beginning.

junchaoshi commented 2 years ago

Please read the README file carefully, the instruction of each output file is already included.