HumanCellAtlas / metadata-schema

This repo is for the metadata schemas associated with the HCA
Apache License 2.0
64 stars 32 forks source link

Add sequencing_run_batch to sequence_file #1557

Closed arschat closed 4 days ago

arschat commented 2 months ago

For which schema is a change/update being suggested?

I would like to request an update to the sequence_file.json schema.

What should the change/update be?

I would like to add a new field - sequencing_run_batch - to this schema to allow data contributors to assign a sequencing run batch(s) to a file.

This update constitutes a minor change to the schema(s) it affects.

What new field(s) need to be changed/added?

Why is the change requested?

Tier 1 metadata does include a library_sequencing_run that is described as follows:

The identifier (or accession number) that indicates which samples' libraries were sequenced in the same run.

After discussions with Integration team, we clarified that it's not recording a single sequencing run but sequencing batch that might involve multiple sequencing runs.