justaddcoffee closed this issue 4 years ago
@justaddcoffee If you have your hands full, I can work on this next
Sure thanks @deepakunni3
Let me know if you want some help: I have already converted the file to proper JSON, so it is easy to transform any way you want.
https://geneontology-public.s3.amazonaws.com/covid-19_targets.json
@lpalbou Since you are already halfway there, would you like to take a stab at this?
@lpalbou also let @deepakunni3 or @justaddcoffee know if you want to chat briefly about how ingests work in kg_covid_19 - might save you some effort
See also #19: we want to have UniProt IDs associated with each viral protein. These can be obtained from ftp://ftp.ebi.ac.uk/pub/contrib/goa/uniprot_sars-cov-2.gpi
Just sorting out a few things here first, but yep, in the ingest I will provide a mapping of virus gene name to UniProt ID (e.g. SARS-CoV-2 E -> UniProt)
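For reference, here is a minimal sketch of what such a gene-name-to-UniProt mapping could look like when built from a GPI file. It assumes the GPI 1.x tab-separated layout (column 1 = database, column 2 = object ID, column 3 = symbol, `!` for header lines). The function name `gpi_symbol_to_uniprot`, the `prefix` parameter, and the sample rows are hypothetical and only for illustration. This is not the actual kg_covid_19 ingest code.

```python
from typing import Dict, Iterable


def gpi_symbol_to_uniprot(lines: Iterable[str], prefix: str = "") -> Dict[str, str]:
    """Map gene symbols to DB:ID CURIEs from GPI-style lines.

    Assumes GPI 1.x columns: DB, DB_Object_ID, DB_Object_Symbol, ...
    """
    mapping: Dict[str, str] = {}
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and "!" header/comment lines.
        if not line or line.startswith("!"):
            continue
        cols = line.split("\t")
        if len(cols) < 3:
            continue  # malformed row, ignore
        db, obj_id, symbol = cols[0], cols[1], cols[2]
        # Keep the first ID seen for a symbol; later duplicates are ignored.
        mapping.setdefault(f"{prefix}{symbol}", f"{db}:{obj_id}")
    return mapping


# Illustrative rows only (not copied from the real file).
sample = [
    "!gpi-version: 1.2",
    "UniProtKB\tP0DTC4\tE\tEnvelope small membrane protein\t\tprotein\ttaxon:2697049",
    "UniProtKB\tP0DTC2\tS\tSpike glycoprotein\t\tprotein\ttaxon:2697049",
]
mapping = gpi_symbol_to_uniprot(sample, prefix="SARS-CoV-2 ")
print(mapping)
```

Keying on the prefixed symbol is one way to match names of the form "SARS-CoV-2 E"; the real ingest may need a different key if the source table spells the protein names differently.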
Hi @lpalbou - any update on this? I think I started an ingest for this, glad to finish this up if you are swamped right now
ETA tomorrow, would that be fine?
-- Laurent-Philippe Albou PhD Bioinformatics University of Southern California Affiliate Lawrence Berkeley National Laboratory
Sure! Thanks LP
https://twitter.com/intact_project/status/1247087922005921792?s=19
Note this is now available, along with other interactions for related viruses, from IntAct 3
They are now speaking of >1200 interactions, nearly 4x the Gordon set.
Is this data reliable? I could update the covid page at some point to show the ribbon for either dataset.
It is IntAct and Birgit, so yes it's reliable!
I'll leave it to you to decide what and how to ingest. A PSI-XML adapter will be more generically useful than an ad-hoc adapter for this one supplemental table. But it sounds like you are mostly there, so maybe just complete it, and we make another ticket for psi-xml/intact?
If you wanted, you could get just the Gordon interactions from the XML. It's the file unassigned2316.xml in the ingest. This has 27 interactions. Hmm, that sounds like too few?...
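To make the PSI-XML adapter idea a bit more concrete, here is a rough sketch of counting interactions and listing their participants. It assumes the compact PSI-MI XML layout, where each `interaction` element holds `participant` elements that point at interactors via `interactorRef`. It ignores XML namespaces so it does not depend on a particular schema version. The function names and the sample document (including its placeholder namespace) are made up for illustration and are not taken from unassigned2316.xml.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple


def local_name(tag: str) -> str:
    """Strip the '{namespace}' part that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def list_interactions(xml_text: str) -> List[Tuple[str, List[str]]]:
    """Return (interaction id, [interactorRef values]) for each interaction."""
    root = ET.fromstring(xml_text)
    found: List[Tuple[str, List[str]]] = []
    for elem in root.iter():
        if local_name(elem.tag) != "interaction":
            continue
        refs = [
            (sub.text or "").strip()
            for sub in elem.iter()
            if local_name(sub.tag) == "interactorRef"
        ]
        found.append((elem.get("id", ""), refs))
    return found


# Hypothetical, heavily trimmed document; the namespace is a placeholder.
sample_xml = """
<entrySet xmlns="urn:example:psi-mi">
  <entry>
    <interactionList>
      <interaction id="1">
        <participantList>
          <participant id="10"><interactorRef>100</interactorRef></participant>
          <participant id="11"><interactorRef>200</interactorRef></participant>
        </participantList>
      </interaction>
      <interaction id="2">
        <participantList>
          <participant id="12"><interactorRef>100</interactorRef></participant>
          <participant id="13"><interactorRef>300</interactorRef></participant>
        </participantList>
      </interaction>
    </interactionList>
  </entry>
</entrySet>
"""
interactions = list_interactions(sample_xml)
print(len(interactions), interactions)
```

Running something like this over the real file would be a quick way to sanity-check the interaction count. Files in the expanded form embed `interactor` elements directly in each participant, so they would need an extra branch.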
Morning @cmungall and @lpalbou
Yes, this week we released the first big set of coronavirus data, which includes >1200 interactions across multiple papers, among them 27 interactions from the Gordon et al. preprint. The PMID will be updated in the IntAct release following official release of the paper and its PMID assignment.
There will be more CoV data in our next release so a PSI-XML adaptor would be useful for you. We also go through the painful task of manually extracting data from tables etc during curation so you don't have to do it again!
If, in the future, you find interaction preprints that you want to ingest, please check with us to see if we are already processing them. The best way to contact the team is through the "feedback" button on our homepage, as that will generate a ticket.
I'll be on the mid-term call today (although, I guess no time for questions/discussion on that call). Happy to set up a separate call as well. My skype is birgit_cb.
Birgit
This ticket is obviated by #74 - per convo with Birgit, it'd be more efficient to ingest the IntAct data, which includes the Gordon et al. data
The Gordon et al. paper on bioRxiv: https://www.biorxiv.org/content/10.1101/2020.03.22.002386v1
Probably we want supp table 2 (SARS-CoV-2 <-> human protein PPI), but there is lots of other info to ingest here later