Gaius-Augustus / GALBA

GALBA is a pipeline for fully automated prediction of protein coding gene structures with AUGUSTUS in novel eukaryotic genomes for the scenario where high quality proteins from one or several closely related species are available.
Other
121 stars 4 forks source link

Protein.fa files #28

Closed zhoudreames closed 1 year ago

zhoudreames commented 1 year ago

I have no-human genome(pig), and I want to do gene annotation for it. but I don't know how to choose the protein files, Should I only choose a pig protein file or can I combine multiple species. Thanks~

KatharinaHoff commented 1 year ago

I recommend to choose 4 or more protein sets (e.g. pig, human, mouse, … and maybe goat or cow or even both). Concatenate the files, simplify headers.

We have previously looked into pig. Sus scrofa has a decent annotation.

It is important to repeat mask the genome, first.

zhoudreames @.***> schrieb am Di. 11. Apr. 2023 um 02:57:

I have no-human genome(pig), and I want to do gene annotation for it. but I don't know how to choose the protein files, Should I only choose a pig protein file or can I combine multiple species. Thanks~

— Reply to this email directly, view it on GitHub https://github.com/Gaius-Augustus/GALBA/issues/28, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJMC6JC2LXMWN3LFA5BYDZLXAST55ANCNFSM6AAAAAAWZTAIJA . You are receiving this because you are subscribed to this thread.Message ID: @.***>

zhoudreames commented 1 year ago

We have previously looked into pig. Sus scrofa has a decent annotation

Thank you for your detail response, I try it.