tderrien / FEELnc

FEELnc : FlExible Extraction of LncRNA
GNU General Public License v3.0
79 stars 28 forks source link

FEELnc pipeline can use of without a reference genome? #14

Closed tiramisutes closed 7 years ago

tiramisutes commented 7 years ago

Hello, In the README.md file you mean this pipeline can use of without a reference genome. But I read all the file and find it need ref_annotation.GTF and ref_genome.FASTA as mandatory arguments.

vwucher commented 7 years ago

Hello,

The FEELnc_codpot.pl (coding potential) can be used without a GTF file. The GTF is only mandatory for FEELnc_filter.pl and FEELnc_classifier.pl, because the purpose of the two modules is to check the intersection/relation of the "new" annotation and the reference annotation.

Regarding FEELnc_codpot.pl, it can be used with only a FASTA file of the mRNA sequences. The annotation in GTF is only mandatory if you want that FEELnc extract the mRNA sequences for you. Below are some examples:

With only FASTA file of mRNA and lncRNA sequences: FEELnc_codpot.pl -i candidate_new_transcripts.fa -a -l known_mRNA.fa -l known_lncRNA.fa

With the annotation of mRNA and lncRNA and the genome sequence: FEELnc_codpot.pl -i candidate_new_transcripts.fa -a known_mRNA.gtf -l known_lncRNA.gtf -g ref_genome.fa

If you don't have a set of lncRNAs, you can use the shuffling option instead of the lncRNA sequences/reference file: --mode=shuffle instead of -l known_lncRNA.fa or -l known_lncRNA.gtf

Regards, Valentin Wucher

tiramisutes commented 7 years ago

Thanks for your explanation.