Closed buhanfeng closed 2 years ago
Hello,
The pipeline does not explicitly perform the analysis without a reference GTF file.
Fortunately, there is a file created called "TAR_reads.bed.gz" in rule "rule calcHMMbed" of the Snakefile which contains all the peaks in your dataset. You can actually use the regions defined in "TAR_reads.bed.gz" file as your annotations or genome reference. You can easily convert this bedfile to a .gtf or .refFlat format (refer to convertRefFlatToGTF.R script for help) for downstream analysis.
Best, Michael
Hi Michael,
Your advice is very helpful, I use a dummy gtf file to run the pipeline, and obtain TAR_reads.bed.gz .
Thanks for your help.
Hi all,
I'm dealing with a strange species which has no gtf at all. Is there any chance running this pipeline totally without 'REFGTF'?