Planteome / noctua

Graph-based modeling environment for biology, including prototype editor and services
BSD 3-Clause "New" or "Revised" License
0 stars 2 forks source link

Define pipeline for bringing in genes and proteins of interest #1

Open cmungall opened 7 years ago

cmungall commented 7 years ago

@jaiswalp - what species should this instance cover?

And is there a reliable source for canonical gene or protein sets for that species? Would we use gramene?

jaiswalp commented 7 years ago

Via GAF in batches or incremental. Source can be Ensembl, Ensembl-Gramene (for plants), Phytozome, and other external DBs.

jaiswalp commented 7 years ago

Ideally it is a lot of management via GAFa with redundant pieces of information. I also suggest that at some point we should think out forking the GAF to two file. 1) with basic gene/object info and 2) with Ontology-based annotation. That way we can modify data in one and not disturb the other piece.