IHEC / epiATLAS-metadata-harmonization

IHEC metadata merging and cleanup
Apache License 2.0
3 stars 4 forks source link

cell origin part of name in cell_type #12

Closed SchulzLab closed 2 years ago

SchulzLab commented 3 years ago

Entries such as multipotent progenitor from cord blood (3 rows) should be changed to

cell_type: multipotent progenitor cell with a new entry in the origin_sample / tissue_type column : Cord Blood It is not clear to me which of the other metadata columns is to be prefered. I could find entries for Cord Blood in tissue_type as well as origin_sample, but not necessarily both of those fields filled for one entry (such as IHECRE00003825.5)

A related problem is for terms such as aortic endothelical cell This should be changed to

cell_type: endothelial cell as it already has the term Aorta in the origin_sample column (see e.g. IHECRE00004741.1)

There are many types of endothelial cells for which this may be done (e.g. umbilical artery endothelial cell, coronary artery endothelial cell, …)