bmir-radx / radx-project

This repo serves as a primary location for tracking issues that don't quite fit into our other dedicated repositories
0 stars 0 forks source link

Develop methods to evaluate metadata quality #105

Open yancao77 opened 5 months ago

yancao77 commented 5 months ago

We need to develop methods to evaluate metadata quality in the RADx Data Hub.

Approach:

  1. Start by considering the possibility of using the Spreadsheet Validator that Josef developed for HubMAP.
  2. Alternatively, extend the current RADx Metadata Validator to include suggestion and repairing support.
yancao77 commented 3 months ago

Slides of Data Hub’s Metadata Quality Improvement

You can access the slides here. It defines three main steps involved in the metadata quality improvement pipeline:

Metadata Type Template Definition Evaluation Repair
Data File Metadata In Progress In Progress Not Started
Variable Metadata Not Started Not Started Not Started
Study Metadata Not Started Not Started Not Started

Evaluation Criteria

The evaluation criteria can be found here.

Draft of Study Metadata Template

You can access the draft of the study metadata template here.