Pilot project to attempt to fill in missing ancestry metadata in the CZ CELLxGENE and HCA DCP data corpus by inferring ancestry from single-cell sequencing reads using Monopogen or other forthcoming tools.
Outcome 3: The cost and feasibility of computing and filling missing Census metadata from sequence or expression data is determined so that data consumers can perform more specific and powerful searches and analyses (Enrich metadata and annotation of cells)
Milestone 1: Define scope of pilot to determine feasibility of updating missing ethnicity metadata in CELLxGENE data corpus (Jan Q1)
Milestone 2: Identify resources necessary to land pilot and plan sequence of steps necessary to realize pilot (Feb Q1)
Milestone 3: Write up pilot findings and make GO/No GO decision (beginning of Q2)
Pilot project to attempt to fill in missing ancestry metadata in the CZ CELLxGENE and HCA DCP data corpus by inferring ancestry from single-cell sequencing reads using Monopogen or other forthcoming tools.
Outcome 3: The cost and feasibility of computing and filling missing Census metadata from sequence or expression data is determined so that data consumers can perform more specific and powerful searches and analyses (Enrich metadata and annotation of cells)