hubmapconsortium / metadata-consistency

Missing `library_id` field in metadata for all Slide-seqs #6

Open icaoberg opened 1 year ago

icaoberg commented 1 year ago

All published Slide-seq datasets are missing the `library_id` field in the ingest metadata. Should these datasets have this field?

metadata['ingest_metadata']['metadata']['library_id']

For example, dataset HBM348.FXGT.728 has the following ingest metadata, which contains no `library_id` key:

{'acquisition_instrument_model': 'NovaSeq 6000',
 'acquisition_instrument_vendor': 'Illumina',
 'analyte_class': 'RNA',
 'assay_category': 'sequence',
 'assay_type': 'Slide-seq',
 'bead_barcode_offset': '1,27',
 'bead_barcode_read': 'R1',
 'bead_barcode_size': '8,6',
 'contributors_path': 'extras/contributors.tsv',
 'data_path': '.',
 'description': 'Slide-seq on the kidney medulla region.',
 'donor_id': 'UCSD0027',
 'execution_datetime': '2021-03-21 13:26',
 'is_targeted': 'False',
 'is_technical_replicate': 'False',
 'library_adapter_sequence': "5'- AAGCAGTGGTATCAACGCAGAGTGAATGGG -3'",
 'library_average_fragment_size': '491',
 'library_construction_protocols_io_doi': '10.17504/protocols.io.bpgzmjx6',
 'library_final_yield_unit': 'ng',
 'library_final_yield_value': '67.6',
 'library_layout': 'paired-end',
 'library_pcr_cycles': '13',
 'library_pcr_cycles_for_sample_index': '12',
 'operator': 'Evan Murray',
 'operator_email': 'emurray@broadinstitute.org',
 'pi': 'Evan Macosko',
 'pi_email': 'emacosko@broadinstitute.org',
 'protocols_io_doi': '10.17504/protocols.io.bpgzmjx6',
 'puck_id': 'Puck_210113_34',
 'rnaseq_assay_method': 'Slide-Seq',
 'sequencing_phix_percent': '0',
 'sequencing_read_format': '42/8/0/60',
 'sequencing_read_percent_q30': '93.04',
 'sequencing_reagent_kit': 'S2',
 'tissue_id': 'UCSD0027-LK-2-1-5',
 'version': '1'}
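
A minimal sketch of how the missing field could be detected, assuming each dataset record is a plain Python dict nested as in the access path above (`metadata['ingest_metadata']['metadata']`). The helper name `missing_fields` and the abbreviated `example` record are hypothetical and used only for illustration; they are not part of any HuBMAP tooling.

```python
from typing import Any, Iterable


def missing_fields(dataset: dict[str, Any], required: Iterable[str]) -> list[str]:
    """Return the required field names absent from the ingest metadata.

    Hypothetical helper: assumes the nesting
    dataset['ingest_metadata']['metadata'] described in this issue.
    """
    # Fall back to empty dicts so a record without ingest metadata
    # reports every required field as missing instead of raising KeyError.
    fields = dataset.get("ingest_metadata", {}).get("metadata", {})
    return [name for name in required if name not in fields]


# Abbreviated copy of the HBM348.FXGT.728 record (three of its fields).
example = {
    "ingest_metadata": {
        "metadata": {
            "assay_type": "Slide-seq",
            "puck_id": "Puck_210113_34",
            "tissue_id": "UCSD0027-LK-2-1-5",
        }
    }
}

print(missing_fields(example, ["library_id", "puck_id"]))  # prints ['library_id']
```

Running the same check over every published Slide-seq record would list the datasets affected, once it is decided whether `library_id` is actually required for this assay type.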