davidemms / OrthoFinder

Phylogenetic orthology inference for comparative genomics
https://davidemms.github.io/
GNU General Public License v3.0
705 stars 188 forks source link

How to annotate the protein function of each OG from the result of OrthoFinder #699

Open neptuneyt opened 2 years ago

neptuneyt commented 2 years ago

Dear OrthoFinder, Thanks for your such amazing work. There are multiple protein sequences under each OG, how do I determine the final protein function of each OG. Two ideas, first, annotate all OG members and finally take the most as the final annotation; or take the longest protein annotation in OG as the final annotation, seems like the first one is more reasonable? Or is there any tool to get the best function of OG directly? Thansk a lot.

lalalagartija commented 2 years ago

I do the first one because taking the longest protein would result in over-annotation. Automatic annotation tools are usually bad. Or do you mean a tool to determine which of the ORFs can give you the function for the whole OG ? Maybe like finding the centroid ORF ? Maybe try Uclust