Closed malloryfreeberg closed 5 years ago
😱
@malloryfreeberg are we going to break this up by type/by module issues? I am hesitant to start commenting here on things as this will be one unbearably long Issue. I am happy to do it, btw.
are we going to break this up by type/by module issues
Dear God, yes :)
Maybe we break them up by schema + kind of update. E.g. one ticket for updating all examples in donor_organism schema. We can decide as we go, but in general we shouldn't have too many individual updates lumped into one branch/PR.
@zperova can you take care of the imaging_protocol.json
and imaging_preparation_protocol.json
updates? I haven't yet spelled out the updates for the latter, so that will need to be done.
Paris has offered suggestions for some of the imaging_protocol field examples:
immersion_medium_type
tile_size_y
tile_size_x
NB: This is an ongoing Epic that will track individual schema patch updates.
If you open an issue to address any items here, please check them off the list in the Epic and link the issue to the Epic.
Description
As a consumer of the HCA metadata schemas, I would like to be sure that the metadata standards follow a set of consistent formatting and rules. **Acceptance Criteria**Type
Type:Biomaterial
date_established
field to describe how to enter the date in the correct formatcatalog_url
supplier
karyotype
biomaterial_core
publication
to "Publication" (avoid redundant "cell line" words)genus_species
(example will live in the ontology module)cell_type
(example will live in the ontology module)cell_cycle
(example will live in the ontology module)tissue
(example will live in the ontology module)disease
(example will live in the ontology module)biomaterial_core
selected_cell_type
(example will live in the ontology module)genus_species
(example will live in the ontology module)total_estimated_cells
field. Also change to "Enter 1 for well-based assays."total_estimated_cells
fieldbiomaterial_core
genus_species
(example will live in the ontology module)organism_age
to "Age"organism_age
that shows how to enter a rangeorganism_age
to indicate that an age range can be enteredorganism_age_unit
(example will live in the ontology module)organism_age_unit
to "Age unit"development_stage
(example will live in the ontology module)diseases
(example will live in the ontology module)diseases
gestational_age
to indicate that a range can be enteredgestational_age
that shows how to enter a single valuegestational_age_unit
(example will live in the ontology module)height
that shows how to enter a rangeheight
to indicate that a range can be enteredheight_unit
(example will live in the ontology module)weight
that shows how to enter a rangeweight
to indicate that a range can be enteredweight_unit
(example will live in the ontology module)biomaterial_core
overview_image
to guidelinesoverview_image
overview_image
tooverview_images
since the field is an arrayimaged_slice_thickness
toslice_thickness
to remove redundant "imaged" wordimaged_slice_thickness
model_for_organ
tomodel_organ
to remove extra word ("for") and be consistent with related termmodel_organ_part
organoid_age
toage
to remove redundant word ("organoid")organoid_age_unit
toage_unit
to remove redundant word ("organoid")organoid_size
tosize
to remove redundant word ("organoid")organoid_size_unit
tosize_unit
to remove redundant word ("organoid")organoid_morphology
tomorphology
to remove redundant word ("organoid")organoid_growth_environment
togrowth_environment
to remove redundant word ("organoid")organoid_stored_oxygen_levels
tostored_oxygen_levels
to remove redundant word ("organoid")biomaterial_core
genus_species
(example will live in the ontology module)model_for_organ
(example will live in the ontology module)model_organ_part
(example will live in the ontology module)organoid_age_unit
(example will live in the ontology module)organoid_size_unit
(example will live in the ontology module)organoid_morphology
biomaterial_core
genus_species
(example will live in the ontology module)organ
(example will live in the ontology module)organ
organ
field to guidelinesorgan_part
(example will live in the ontology module)organ_part
field to guidelinesdiseases
fielddiseases
(example will live in the ontology module)collection_time
Type:File
reference_version
ncbi_taxon_id
ncbi_taxon_id
to describe where to find valid taxon IDsgenus_species
(example will live in the ontology module)assembly_type
description to "The assembly type of the genome reference."reference_type
description to "The type of the reference."read_index
to "The sequencing read this file represents."read_index
field to indicate that if a sequencing experiment is single-end, enter "read1".insdc_run
insdc_run
: "Accession can be from the DDBJ, NCBI, or EMBL-EBI and must start with DRR, SRR, or ERR.".technical_replicate_group
file_description
Type:Project
supplementary_links
project_core
insdc_project
: "Accession can be from the DDBJ, NCBI, or EMBL-EBI and must start with DRP, SRP, or ERP."geo_series
: "Accession must start with GSE."array_express_investigation
: "Accession must start with E-."insdc_study
: "Accession can be from the DDBJ, NCBI, or EMBL-EBI and must start with PRJE, PRJN, or PRJD."funders
to "Funding sources(s)" to remove redundancy ("Project")Type:Protocol
protocol_type
computational_method
aggregate_cell_uniformity
protocol_core
aggregate_formation_method
protocol_core
collection_method
(example will live in the ontology module)target_pathway
protocol_core
and
small_molecules`protocol_core
dissociation_method
(example will live in the ontology module)protocol_core
markers
to "A list of markers used to enrich for or against certain cells."markers
to indicate that markers should be space-delimited and include + or - depending on if the markers were enriched for or against. Should this be an array field since it can accept more than 1 value?enrichment_method
(example will live in the ontology module)protocol_core
ipsc_induction_method
pluripotency_test
input_nucleic_acid_molecule
(example will live in the ontology module)library_construction_approach
(example will live in the ontology module)end_bias
to indicate common usage (e.g. "full length" for SS2)strand
to indicate common usage (e.g. "second" for 10X v2)library_preamplification_method
(example will live in the ontology module)cdna_library_amplification_method
(example will live in the ontology module)instrument_manufacturer_model
instrument_manufacturer_model
(example will live in the ontology module)local_machine_name
paried_end
to "Whether the sequenced molecule was sequenced from both ends."Type:Process
process_core
insdc_experiment
to "INSDC experiment accession"length_of_time
that shows how to enter a rangelength_of_time
to indicate that an age range can be enteredlength_of_time_unit
(example will live in the ontology module)process_type
(example will live in the ontology module)deviation_from_protocol
Module
Module:Biomaterial
percent_cell_viability
,percent_necrosis
,cell_size
cell_size
andcell_viability_method
cell_size_unit
(example will live in the ontology module)cause_of_death
cause_of_death
,cold_perfused
,normothermic_regional_perfusion
,time_of_death
,hardy_scale
, andorgan_donation_death_type
cold_perfused
to "Whether perfusion with cold fluid was used to help preserve tissues before heart stopped."normothermic_regional_perfusion
to "Whether entire body was perfused with oxygenated blood."parent
child
sibling
growth_medium
,culture_environment
,drug_treatment
, andfeeder_layer_type
drug_treatment
andpassage_number
ethnicity
and add how to enter more than 1 ethnicityethnicity
(example will live in the ontology module)alcohol_history
,medication
, andsmoking_history
alcohol_history
andmedication
strain
(example will live in the ontology module)storage_method
storage_time_unit
(example will live in the ontology module)preservation_method
to say "Enter 'fresh' if not preserved."preservation_method
gross_description
gross_image
microscopic_description
microscopic_image
Module:Ontology
Module:Process
retail_name
manufacturer
expiry_date
kit_titer
insdc_experiment
: "Accession can be from the DDBJ, NCBI, or EMBL-EBI and must start with DRX, SRX, or ERX."barcode_offset
: "Enter 0 for beginning of read." (and remove from description)white_list_file
whilte_list_file
descriptionwhilte_list_file
plate_id
to "Plate ID"well_id
cell_quality
to "Quality of well if imaged..."cell_quality
to "Well quality"Module:Project
contact_name
that doesn't have a middle initialemail
phone
descriptionphone
institution
laboratory
address
currently in descriptioncorresponding_contributor
to be "Whether the individual is a primary point of contact for the project.project_role
Module:Protocol
Core
Core:Biomaterial
genotype
biosd_biomaterial
to "A DDBJ, NCBI, or EMBL-EBI BioSample accession."biosd_biomaterial
: "Accession must start with SAMD, SAMN, or SAME."insdc_biomaterial
to "An INSDC (International Nucleotide Sequence Database Collaboration) sample accession."insdc_biomaterial
: "Accession can be from the DDBJ, NCBI, or EMBL-EBI and must start with DRS, SRS, or ERS."Core:File
file_format
checksum
file_name
Core:Process
process_location
operator
Core:Project
project_short_name
project_short_name
that value should have no spaces and be less than xx charactersproject_title
project_description
Core:Protocol
protocol_name
descriptionprotocol_id