biocaddie / prototype_issues

Used to report and track bioCADDIE prototype issues
3 stars 5 forks source link

Repository Addition/Selection Process #257

Open DataMedFeedback opened 7 years ago

DataMedFeedback commented 7 years ago

Hi there,

My name is Daniella Lowenberg and I am the product manager for the Data Publication platform Dash, that is based out of the office of the UC president. We take any field of research data from the UC system and I notice that though this is geared towards biomedical data, places like Dryad and Dataverse are included. Do all of their non-biomedical data also appear in DataMed or do they curate it so only biomedical tagged datasets are indexed by DataMed?

I look forward to your reply.

Best, Daniella

California Digital Library Daniella Lowenberg | Research Data Specialist & Product Manager University of California Curation Center (UC3) | 415 20th Street 4th Floor | Oakland, CA 94612 daniella.lowenberg@ucop.edu

yul129 commented 7 years ago

For Dataverse, DataMed only includes the metadata tagged datasets; for Dryad, we do not filter the data upon ingestion.


From: aegururaj [notifications@github.com] Sent: Wednesday, March 29, 2017 11:32 AM To: biocaddie/prototype_issues Cc: Yueling Li; Mention Subject: Re: [biocaddie/prototype_issues] Repository Addition/Selection Process (#257)

Assigned #257https://github.com/biocaddie/prototype_issues/issues/257 to @yul129https://github.com/yul129.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://github.com/biocaddie/prototype_issues/issues/257#event-1020977370, or mute the threadhttps://github.com/notifications/unsubscribe-auth/ALPhs4-KGVI3VU6QF9Uu4iN5Gq8i3toGks5rqqPRgaJpZM4MtLSq.

aegururaj commented 7 years ago

@yul129 , does tagged here mean "tagged as biomedical data"?

jgrethe commented 7 years ago

Since there is no universal "biomedical data tag" we currently bring in all datasets. In the future, one could develop metrics (e.g. based on overlap of the record with biomedical terminologies) to score a dataset as being biomedically relevant. And this score could be used to either not ingest certain metadata records or to effect their score in the search results.