Closed Taoo777 closed 1 year ago
@quirze
Hello @Taoo777,
Indeed, it seems like the input format that clean_OCTFTA_output.py
expects is not necessarily the one provided in the previous step on the pipeline.
Some of the extra attributes expected in the input GTF are added by the script run_ChIPseeker_TE_annot.R
.
If you would like to avoid the error messages, I would suggest modifying the function make_TEgtf_print_line
in your clean_OCTFTA_output.py
script to ignore the attributes that cause the error.
I hope this was helpful. Quirze
@quirze OK! I know how to get extra attributes of input GTF now. I tried to modify the function
make_TEgtf_print_line
and successfully solved the error problems, thank you for your reply!
Hello! Thanks for this good workflow. I thought one input file (TE_GTF)
danRer11.TEtrans_uID.gtf
in3.make_TE_classification.sh
may be incorrect (shown below).According to the workflow, TE_GTF:
danRer11.TEtrans_uID.gtf
was generated in2.make_gene_and_TE_counts.sh
, as described in the following code in <2.make_gene_and_TE_counts.sh>:After running the above code, I got
danRer11.TEtrans_uID.gtf
, which looks like:However, it's not the format that
clean_OCTFTA_output.py
recognizes, and the correct format looks like:And I got the following errors when running
./scripts/adapt_GTF_TEtranscript.py
in3.make_TE_classification.sh
So I suspected danRer11.TEtrans_uID.gtf might not be the right input file for the following code of <3.make_TE_classification.sh>.
Can you help me check if there is anything wrong with the input file of
3.make_TE_classification.sh
? Thank you!