nigyta / dfast_core

DDBJ Fast Annotation and Submission Tool
76 stars 14 forks source link

Annotated genome as input #40

Closed iquasere closed 2 years ago

iquasere commented 2 years ago

Greetings!

I want to use DFAST to identify a set of proteins, instead of inputting an entire chromossome I want to skip the gene calling step and directly input protein sequences in FAA format. Is this possible with DFAST?

nigyta commented 2 years ago

Thanks for using DFAST. Since DFAST is designed to handle the genome FASTA file as input, it cannot accept protein FASTA file. However, if you already have a gene calling result in GFF format (such as an output from GeneMark-S2), you can provide it with --gff option and skip the gene calling process.

If you only have a FAA file, you cannot use DFAST, but you can manually search them against the DFAST reference database, which is stored in db directory under the directory DFAST is installed.

iquasere commented 2 years ago

Good to know, thank you ^^