monarch-initiative / dipper

Data Ingestion Pipeline for Monarch
https://dipper.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
57 stars 26 forks source link

add cellosaurus #456

Open mellybelly opened 7 years ago

mellybelly commented 7 years ago

http://web.expasy.org/cellosaurus/description.html

However, consider the Creative Commons Attribution-NoDerivs License!!!! We may need to get special permission to include and/or reimplement their sources, related to prelim work we had done before Cellosaurus existed.

@mbrush can you investigate?

cmungall commented 7 years ago

How does it compare to CLO?

mellybelly commented 7 years ago

It includes cross references for CLO, see http://web.expasy.org/cellosaurus/description.html It has non-human species coverage. Its more like a registry of cell lines and their providers: http://web.expasy.org/cgi-bin/cellosaurus/faq and there is more metadata and associations to publications, eg. make some comparisons for those that have links: http://web.expasy.org/cellosaurus/CVCL_J722 http://purl.obolibrary.org/obo/CLO_0001334

To be fair we were trying to define something similar with pharma companies before this came out.

Here is a more extensive record: http://web.expasy.org/cellosaurus/CVCL_0547 but it may be that we just want to bring in a few more cell line resources and not get from cellosaurus

mellybelly commented 7 years ago

also see ftp://ftp.expasy.org/databases/cellosaurus @jmcmurry

mbrush commented 7 years ago

Nice write-up from Amos about getting cellosaurus into Wikidata, including several modeling issues and use cases: https://docs.google.com/document/d/1kEySzc0-yDEdLNPKKeAU-Mcm-c-dMHS64Ra85AV1RvM/edit

TomConlin commented 6 years ago

My initial look at the data

notes:

Their XML schema.

Their xml schema

kshefchek commented 6 years ago

TODO from @cmungall check the cellosaurus obo file

cmungall commented 6 years ago

sounds like we should just get it from wikidata

cmungall commented 5 years ago

trying to prioritize this. What would this buy us?

TomConlin commented 5 years ago

if I recall they touch on ~600 species

cmungall commented 5 years ago

what's the use case for including in monarch?

If we were to bring it in, could we not just bypass the xml and load it as an ontology?