Open aguang opened 1 week ago
Hi aguang,
The HOGs (1st column) are the distinct orthogroups. In the example you have shown, two HOGs have the same Orthogroup ID. This is because those two HOGs were mistakenly merged into one orthogroup in the clustering step. OrthoFinder sees that there is a duplication at the root of that orthogroup's gene tree, and so correctly separates them into two HOGs in the N0 file.
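If it helps to see which orthogroups were split this way, here is a minimal sketch in Python (the same grouping is easy to do in R). It assumes the first two columns of the N0 file are headed `HOG` and `OG`; check your own header, since that is an assumption here. The sample rows and the helper names `hogs_by_orthogroup` and `split_orthogroups` are made up for illustration and are not part of OrthoFinder.

```python
import csv
import io
from collections import defaultdict

# Tiny made-up table standing in for N0.tsv. The column names "HOG" and "OG"
# are assumed to match the real header; all IDs and genes are invented.
SAMPLE = (
    "HOG\tOG\tGene Tree Parent Clade\tspeciesA\tspeciesB\n"
    "N0.HOG0000001\tOG0000001\tn0\tgeneA1\tgeneB1\n"
    "N0.HOG0000002\tOG0000001\tn5\tgeneA2\tgeneB2\n"
    "N0.HOG0000003\tOG0000002\tn0\tgeneA3\tgeneB3\n"
)


def hogs_by_orthogroup(handle, hog_col="HOG", og_col="OG"):
    """Map each orthogroup ID to the list of HOG IDs that carry it."""
    groups = defaultdict(list)
    for row in csv.DictReader(handle, delimiter="\t"):
        groups[row[og_col]].append(row[hog_col])
    return dict(groups)


def split_orthogroups(groups):
    """Keep only orthogroups that appear on more than one HOG row."""
    return {og: hogs for og, hogs in groups.items() if len(hogs) > 1}


groups = hogs_by_orthogroup(io.StringIO(SAMPLE))
split = split_orthogroups(groups)
for og, hogs in split.items():
    print(og, "->", ", ".join(hogs))
```

For a real file you would pass an open file handle (for example `open(path, newline="")`, with `path` pointing at your own N0.tsv) instead of the `io.StringIO` sample. Each orthogroup listed is one that the clustering step produced as a single group and that was later divided into several HOGs.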
There is some discussion of this in https://github.com/davidemms/OrthoFinder/issues/367
Hope this is useful!
Thanks,
Laurie
I'm just wondering, what does the Orthogroup column in N*.tsv mean? I think the distinct orthogroups should be the HOGs, but sometimes I see that different rows have the same Orthogroup ID, and I'm not sure what, if anything, that is supposed to mean. Are they Orthogroup IDs for a given gene tree that otherwise have no relation to each other?
Example (I am looking at these files in R):