NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0

[Source]: Mass Spectrometry Interactive Virtual Environment (MassIVE) #139

Open gtsueng opened 2 months ago

gtsueng commented 2 months ago

Source Name

Mass Spectrometry Interactive Virtual Environment (MassIVE)

Source URL

https://massive.ucsd.edu/ProteoSAFe/static/massive.jsp

API Documentation: https://ccms-ucsd.github.io/MassIVEDocumentation/#api/#massive-dataset-information

Source Description

MassIVE is a community resource developed by the NIH-funded Center for Computational Mass Spectrometry to promote the global, free exchange of mass spectrometry data. MassIVE datasets can be assigned ProteomeXchange accessions to satisfy publication requirements.

Short Description

MassIVE is an NIH-supported repository for mass spectrometry data.

Source Access

No access issue, account not needed

Sample record: https://massive.ucsd.edu/ProteoSAFe/proxi/v0.1/datasets/MSV000094651

Source Funding

NIH

Source Relevance

NIAID high priority resource

Related WBS task

For internal use only. Assignee, please select the status of this issue

Status Description

No response

Source to-do list

gtsueng commented 2 months ago

MassIVE has a list of their public datasets here: https://massive.ucsd.edu/ProteoSAFe/datasets.jsp#%7B%22query%22%3A%7B%7D%2C%22table_sort_history%22%3A%22createdMillis_dsc%22%7D

gtsueng commented 2 months ago

This repository was discussed during the biweekly meeting dated 2024.04.30. Per the discussion, we will create a parser for this resource.

gtsueng commented 1 month ago

The mapping for this resource can be found here: https://docs.google.com/spreadsheets/d/115bj6yY7jOt3w_23SIcmRer1OijoD8wQDADcVpgC_wk/edit#gid=779139043

Note that each MassIVE dataset record appears to have an assigned DOI as well as license information, both of which are viewable on the record's web page but do not seem to appear in the API JSON response.

Example record on website: https://massive.ucsd.edu/ProteoSAFe/dataset.jsp?accession=MSV000094651

Vs metadata retrievable from the API: https://massive.ucsd.edu/ProteoSAFe/proxi/v0.1/datasets/MSV000094651
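One way to check this discrepancy for a given accession is sketched below. This is only a rough sketch, not the crawler's actual code: the helper names (`api_url`, `page_url`, `fetch_record`, `find_mentions`) are hypothetical, and it assumes nothing about the response structure beyond it being JSON. It simply walks the response and reports whether the strings "doi" or "license" show up anywhere in its keys or values.

```python
import json
from urllib.request import urlopen

# Both URL patterns are taken from the links in this issue.
PROXI_BASE = "https://massive.ucsd.edu/ProteoSAFe/proxi/v0.1/datasets"
DATASET_PAGE = "https://massive.ucsd.edu/ProteoSAFe/dataset.jsp"


def api_url(accession):
    """URL of the JSON record for a MassIVE accession."""
    return f"{PROXI_BASE}/{accession}"


def page_url(accession):
    """URL of the human-readable dataset page for the same accession."""
    return f"{DATASET_PAGE}?accession={accession}"


def fetch_record(accession, timeout=30):
    """Download and parse the JSON record (makes a network request)."""
    with urlopen(api_url(accession), timeout=timeout) as resp:
        return json.load(resp)


def find_mentions(node, needles=("doi", "license")):
    """Return the needles that occur in any key or string value of the JSON.

    The search is case-insensitive and makes no assumption about the
    layout of the record; it only walks dicts, lists, and strings.
    """
    found = set()

    def walk(value):
        if isinstance(value, dict):
            for key, child in value.items():
                walk(str(key))
                walk(child)
        elif isinstance(value, list):
            for child in value:
                walk(child)
        elif isinstance(value, str):
            lowered = value.lower()
            found.update(n for n in needles if n in lowered)

    walk(node)
    return found
```

For example, `find_mentions(fetch_record("MSV000094651"))` would return an empty set if neither term appears in the API response, in which case the DOI and license would have to come from another source, such as the page at `page_url("MSV000094651")`.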

gtsueng commented 2 weeks ago

This source is now available on Staging: https://data-staging.niaid.nih.gov/search?q=&filters=%28includedInDataCatalog.name%3A%28%22MassIVE%22%29%29

The following issues have been identified and are in the process of being addressed

gtsueng commented 4 days ago

Per the discussion at the biweekly meeting dated 2024.06.25, moving forward, the NIAID team should be looped in on all communications with any repository, including technical discussions.

For the record, @DylanWelzel please forward any email chains you received regarding your inquiries to MassIVE to NIAIDDataEcosystem@mail.nih.gov