dieterich-lab / DCC

DCC uses output from the STAR read mapper to systematically detect back-splice junctions in next-generation sequencing data. DCC applies a series of filters and integrates data across replicate sets to arrive at a precise list of circRNA candidates.
https://dieterichlab.org/software/
GNU General Public License v3.0
36 stars 20 forks source link

small issues for bowtie2, star index missing GTF ect #81

Closed xiongxxp closed 2 years ago

xiongxxp commented 4 years ago

Hi Tobias, Very nice tool for circular RNA study. I am trying this tool and met some issues. Generally, I follow the method described in your paper "Deep Computational Circular RNA Analytics from RNA-seq Data". But I met some issues.

  1. When I run this following command, the running results keep printing out on my screen. If add >nohup.log command, the write-in nohup.log will be quite big. image
  2. image looks like there is no GTF in the code of method. Is it normal or typo? I modified it and added the GTF file for building index.

  3. image image In this step, the paper recommends getting a GTF file from the UCSC Browser. But the browser only allows select either RepeatMasker or Simple Repeats, how do I choose both of them?

  4. When I try to test multiple samples, there is a problem with linearCount file. All of the samples will get the same counts. image image

Thank you for your help. Best, Xiong

tjakobi commented 4 years ago

Hi Tobias, Very nice tool for circular RNA study. I am trying this tool and met some issues. Generally, I follow the method described in your paper "Deep Computational Circular RNA Analytics from RNA-seq Data". But I met some issues.

Hi @xiongxxp,

thank you for using circtools!

  1. When I run this following command, the running results keep printing out on my screen. If add >nohup.log command, the write-in nohup.log will be quite big. image

I see. It seems the -S option of bowtie2 is missing. Please add -S /devn/null to your command line. This will pipe aligned reads to the system's null device.

image looks like there is no GTF in the code of method. Is it normal or typo? I modified it and added the GTF file for building index.

The GTF file is used by STAR for spliced alignments and by DCC for gene annotation if that is what you mean.

3. image image In this step, the paper recommends getting a GTF file from the UCSC Browser. But the browser only allows select either RepeatMasker or Simple Repeats, how do I choose both of them?

You should use Repeat Masker.In the next step you should be able to choose between BED and GTF file format for export.

  1. When I try to test multiple samples, there is a problem with linearCount file. All of the samples will get the same counts. image image

That looks like you're always using the same BAM file for all samples. How do your BAM file list, and all other input files of DCC look like? I see you are not using -B to supply the BAM files - you should do that, otherwise DCC might guess the BAM files wrong.

Thank you for your help. Best, Xiong

Cheers, Tobias