Closed MitchellShiell closed 7 months ago
Almost done generating the corresponding Elasticsearch mapping to match the schema created in this ticket. Noticed a couple items that needed clarification and we may want to adjust here:
samples.donor
vs donor
. is this a duplicate property?
Resolution: yes, but the samples.donor
comes from the song base schema, and we cannot remove it right now. https://www.overture.bio/documentation/song/admin/schemas/#the-song-base-schema
tumour
(British) vs tumor
(Atlantic) ?
Resolution: keeping tumour
because of base schema limitations.
Archives'fileID
vs file/object ids? (archive.ENCODE.fileID
could be a specific property name + spelling)
Resolution: @edsu7 to confirm property names for each archive.
followup
vs follow_up
: consistent spacing/punctuation for donor.primary_diagnosis.follow_up
, and its properties.
Resolution: we're option for a consistent follow_up
. **Not needed will be camel-cased followUp
update collaborator
to collaborator(s)
center
vs centre
. e.g. experiment.sequencing_center
. this property is spelled in Atlantic, which is not consistent.
Resolution: changing to centre, for consistency with British spelling, and base schema limitations.
specimen.cell_line.treatment.treatment_typed
. extra “d” in property name, at the end
Resolution: Remove the typo.
camelCasing: The base schema is in camelCase, and we should keep things consistent in the whole schema. Resolution: we're unifying to camelCase wherever possible
Overture Playground Dataset
Outline and organize a representative open-access mock data set to include with the DMS.
Metadata Tasks
File Data Tasks
General Tasks
Links
Co-ordination doc Song Metadata Fields Base Schema Updates Link to issue