Closed Vijithkumar2020 closed 2 months ago
Since blastx translates the query sequence in all six reading frames and searches the translations against a protein database, it makes sense to extract only the part that is actually translated, i.e. the CDS.
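To make the six-frame point concrete, here is a minimal Python sketch of what "translating in all six reading frames" means. It is only an illustration and not how blastx is implemented; the function names (`translate`, `six_frames`) and the toy sequence are made up for this example, and it assumes the standard genetic code and an unambiguous A/C/G/T sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frames(seq: str) -> dict:
    """Translate a nucleotide sequence in frames +1..+3 and -1..-3."""
    seq = seq.upper()
    rev = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rev[offset:])
    return frames


# Hypothetical toy CDS: ATG GCC AAG TAA
frames = six_frames("ATGGCCAAGTAA")
for name, protein in frames.items():
    print(name, protein)
```

For a clean CDS, frame +1 is the real protein (`MAK*` for the toy sequence here) and the other five frames are noise. If the query still contains introns or UTRs, even the correct frame is interrupted, which is why a CDS query is the more natural input for blastx.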
Thank you so much for the timely response. So this means I should use
agat_sp_extract_sequences.pl -g infile.gff -f infile.fasta -t cds
Regards
(Sent by email in reply to Jacques Dainat's comment of Thu, Oct 3, 2024, 5:19 PM: https://github.com/NBISweden/AGAT/issues/498#issuecomment-2391219888)
I have completed AUGUSTUS de novo gene prediction, and I now want to perform homology-based gene annotation using BLASTX. AUGUSTUS produced a *.gff file as follows:
Now, I am unsure whether
agat_sp_extract_sequences.pl -g infile.gff -f infile.fasta -t gene
would be the better way to extract the transcripts.
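As a rough illustration of the difference between the two feature types, the Python sketch below contrasts extracting a whole gene span with extracting and joining its CDS segments. Everything in it is hypothetical: the toy genome, the coordinates, and the `extract` helper are invented for this example, it only handles the plus strand, and it is not AGAT's code. It assumes 1-based inclusive coordinates as used in GFF.

```python
def extract(seq: str, start: int, end: int) -> str:
    """Slice a sequence using 1-based, inclusive GFF-style coordinates."""
    return seq[start - 1:end]


# Hypothetical toy genome: 2 nt flank, exon 1, intron, exon 2, 2 nt flank.
genome = "CC" + "ATGGCC" + "GTAAGT" + "AAGTAA" + "GG"

# Hypothetical plus-strand annotation in GFF-style coordinates.
gene = (3, 20)                # whole gene span, intron included
cds_parts = [(3, 8), (15, 20)]  # the two CDS segments of one transcript

# Gene-level extraction returns the genomic span as-is.
gene_seq = extract(genome, *gene)

# CDS-level extraction joins the segments in order, dropping the intron.
cds_seq = "".join(extract(genome, s, e) for s, e in sorted(cds_parts))

print("gene:", gene_seq)
print("cds: ", cds_seq)
```

The gene-level sequence still carries the intron, so it is not a transcript and will not translate cleanly, whereas the joined CDS segments form the coding sequence that blastx is meant to translate.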