mtisza1 / Cenote-Taker2

Cenote-Taker2: Discover and Annotate Divergent Viral Contigs (Please use Cenote-Taker 3 instead)
MIT License

Minimum protein length #48

Open DarrenObbard opened 1 year ago

DarrenObbard commented 1 year ago

This is less of an issue and more of a feature request.

I am using Cenote-Taker 2 to annotate some manually curated contigs. It seems keen to annotate proteins as short as 30 amino acids that lack any homology to relatives.

How do I control the minimum length of annotated proteins?

Thanks!

D

mtisza1 commented 1 year ago

Hey Darren, thanks for opening this issue. I added an enhancement tag to it.

There is nothing in the code right now to do this. It's a feature that could be added without too much trouble, and I have plans to make updates this summer (but I can't promise anything).

For your purposes, is it important that the output of Cenote-Taker 2 never annotates ORFs below a minimum length? Or would you like to be able to edit .gbf files and regenerate .sqn files based on your edits?

Thanks,

Mike

DarrenObbard commented 1 year ago

Editing post hoc would be fine. Does GenBank still accept .sqn files? I thought one had to submit the annotation table. Mainly, I would want an easy way of regenerating the .tbl and .gb files.
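
For anyone wanting a post hoc filter in the meantime, here is a minimal sketch of the idea discussed above. It is not part of Cenote-Taker 2. It assumes the annotation is available as a standard tab-delimited 5-column feature table (.tbl), and that ORFs without homology carry a product containing "hypothetical" (an assumption about the output, so check your own files). The names `filter_short_cds`, `min_aa`, and `only_hypothetical` are hypothetical, chosen for illustration.

```python
def _coord(text):
    """Parse a coordinate, ignoring the partial markers '<' and '>'."""
    return int(text.lstrip("<>"))


def _is_short_cds(block, min_aa, only_hypothetical):
    """Return True if this feature block is a CDS that should be dropped."""
    if block[0].split("\t")[2] != "CDS":
        return False
    nucleotides = 0
    product = ""
    for line in block:
        cols = line.split("\t")
        if cols[0]:
            # Interval line: "start<TAB>stop[<TAB>key]"; sum all intervals.
            nucleotides += abs(_coord(cols[1]) - _coord(cols[0])) + 1
        elif len(cols) >= 5 and cols[3] == "product":
            product = cols[4]
    # Length is counted in codons and includes the stop codon if present.
    if nucleotides // 3 >= min_aa:
        return False
    if only_hypothetical and "hypothetical" not in product.lower():
        return False
    return True


def filter_short_cds(tbl_text, min_aa=50, only_hypothetical=True):
    """Drop short CDS features from the text of a 5-column feature table."""
    kept = []
    block = []  # lines of the feature currently being read

    def flush():
        if block and not _is_short_cds(block, min_aa, only_hypothetical):
            kept.extend(block)
        block.clear()

    for line in tbl_text.splitlines():
        cols = line.split("\t")
        if line.startswith(">Feature"):
            flush()
            kept.append(line)
        elif len(cols) >= 3 and cols[0] and cols[2]:
            flush()  # a line with start, stop and key opens a new feature
            block.append(line)
        elif block:
            block.append(line)  # qualifier or extra interval of the feature
        else:
            kept.append(line)
    flush()
    return "\n".join(kept) + "\n"
```

Usage would be along the lines of reading the .tbl file into a string, calling `filter_short_cds(text, min_aa=50)`, and writing the result back out. The filtered table could then be passed, together with the sequence file, to an NCBI tool such as table2asn to regenerate the other files. Note that this sketch only removes CDS features; if the table also contains matching gene features, those would need the same treatment.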