valearna closed this 5 years ago
Instead of sending a request to the human gene names API for each gene, we can download a single file containing the data we need for all genes at once. This avoids one network round trip per gene and will save a lot of time.
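As a rough sketch of the bulk approach, the downloaded file can be parsed once into an in-memory lookup that replaces the per-gene requests. This assumes the file is tab-separated with a header row; the column names `gene_id` and `name`, the function `load_gene_names`, and the sample rows are all made up for illustration and would need to match the real file.

```python
import csv
import io


def load_gene_names(tsv_text, id_column="gene_id", name_column="name"):
    """Build a gene ID -> gene name lookup from bulk tab-separated text.

    The column names are assumptions; adjust them to the real file header.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return {row[id_column]: row[name_column] for row in reader}


# Made-up sample standing in for the downloaded file contents.
sample = (
    "gene_id\tsymbol\tname\n"
    "ID:1\tGENE1\texample gene one\n"
    "ID:2\tGENE2\texample gene two\n"
)

lookup = load_gene_names(sample)
# Every later name lookup is a dictionary access instead of an API call.
name = lookup["ID:2"]
```

The file is read once up front, so the cost no longer grows with the number of API calls.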
We decided to remove rule 3. We now include all human orthologs, including those predicted by only one method.
3. For orthology to human genes, use only those human genes that have been predicted by more than one orthology prediction method.

Non-elegans species:
(i) is an ortholog of \<worm gene symbol>
(ii) is an ortholog of \<worm gene1 symbol>, \<worm gene2 symbol>, and \<worm gene3 symbol>
(iii) is an ortholog of members of the C. elegans \<gene class name> gene class including \<worm gene symbol1> - up to three genes sorted by decreasing popularity (using Textpresso paper score)
(iv) is an ortholog of members of the C. elegans \<gene class1 name>,, and including \<worm gene symbol1> - up to three genes sorted by decreasing popularity; also limit number of gene classes to 3, based on member Textpresso paper popularity score.
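The phrase templates (i) to (iii) for non-elegans species could be sketched roughly as below. This is only an illustration: the function names, the `paper_scores` dictionary standing in for Textpresso paper scores, and the gene symbols `xyz-1` to `xyz-4` are all made up, and the multi-class case (iv) is left out because the way gene classes are ranked is not spelled out here.

```python
def top_by_popularity(symbols, paper_scores, limit=3):
    """Return up to `limit` symbols, highest paper score first.

    `paper_scores` is a hypothetical symbol -> score mapping; symbols
    without a score are treated as having score 0.
    """
    ranked = sorted(symbols, key=lambda s: paper_scores.get(s, 0), reverse=True)
    return ranked[:limit]


def join_names(names):
    """Join names as 'a', 'a and b', or 'a, b, and c'."""
    if len(names) <= 2:
        return " and ".join(names)
    return ", ".join(names[:-1]) + ", and " + names[-1]


def ortholog_phrase(worm_genes, paper_scores, gene_class=None):
    """Build the phrase for templates (i)/(ii), or (iii) if a class is given."""
    top = top_by_popularity(worm_genes, paper_scores)
    if gene_class is None:
        return "is an ortholog of " + join_names(top)
    return (
        "is an ortholog of members of the C. elegans "
        + gene_class
        + " gene class including "
        + join_names(top)
    )


# Made-up symbols and scores, for illustration only.
scores = {"xyz-1": 12, "xyz-2": 40, "xyz-3": 7, "xyz-4": 25}
genes = ["xyz-1", "xyz-2", "xyz-3", "xyz-4"]

plain = ortholog_phrase(genes, scores)
# -> "is an ortholog of xyz-2, xyz-4, and xyz-1"
with_class = ortholog_phrase(genes, scores, gene_class="xyz")
```

Keeping the ranking in its own helper means the same "top three by popularity" cut could be reused for gene classes once their scoring is decided.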
How to pick orthologs for non-elegans species when tied for orthology methods (tie-breaker rules): When too many C. elegans genes are listed as orthologs for a non-elegans gene, they will be pruned using popularity (number of publications) and gene class name:
How to pick human orthologs for C. elegans when tied for orthology methods (tie-breaker rules):