geneontology / helpdesk

The Gene Ontology Helpdesk
http://help.geneontology.org
16 stars 6 forks source link

human proteome #161

Closed ValWood closed 6 years ago

ValWood commented 6 years ago

How do I access the exact human reference proteome dataset that is currently used by GO?

Is this it? https://www.ebi.ac.uk/reference_proteomes

@Antonialock if so we should use this one, as it should align with GO

Thanks,

Val

ValWood commented 6 years ago

Except that it has fragments, and pseudogenes, and it doesn't seem straightforward to access the HGNC IDs

kltm commented 6 years ago

@ValWood That's a good question. Honestly, if it's not an upstream or derived file and not accessible via metadata in the GAFs (then available in AmiGO or SPARQL), I'm not sure how you'd get at it. Can you think of any way of filtering down to the data that you want?

Antonialock commented 6 years ago

This is the ticket I was referring to in our email exchange Val

ValWood commented 6 years ago

OK I totally forgot that I opened this ticket. Anyway @Antonialock we will proceed with the dataset you are using now. We'll keep this one open as I need to know for the GOC discussion http://wiki.geneontology.org/index.php/2018_Montreal_GOC_Meeting_Agenda representing complete proteomes

I guess the question is, is this the set of human uniprot accessions that can be submitted?

@selewis

related ticket https://github.com/geneontology/go-annotation/issues/2082

kltm commented 6 years ago

This issue was moved to geneontology/go-site#819