Cellular-Semantics / CL_KG

Building a Cell Ontology Knowledge-Base from data, and LLMs
Apache License 2.0

Add curated datasets and update ontology configuration in data pipeline #25

Closed (ubyndr closed 2 months ago)

ubyndr commented 2 months ago

This PR introduces the following changes to the data pipeline:

  1. Curated Dataset Sources:
    • Added three new curated dataset files to the curated_data directory:
      • GutAtlas.Org Author Category JB V1 - Colon Immune.csv
      • GutAtlas.Org Author Category JB V1 - Fetal and Pediatric Cell Atlas.csv
      • GutAtlas.Org Author Category JB V1 - Teichmann and Burclaff.csv
  2. Ontology Updates:
    • Added two lung datasets and the clm-kg ontology to the fullontologies.txt configuration file.
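
A reviewer who wants to confirm that the three curated files listed above actually landed in the checkout could run something like the following. This is a minimal sketch, not part of the PR: the helper name `missing_curated_files` is hypothetical, and the location of `curated_data` relative to the working directory is an assumption, since the PR description does not give the full path.

```python
from pathlib import Path

# File names copied from the PR description above.
EXPECTED_CURATED_FILES = [
    "GutAtlas.Org Author Category JB V1 - Colon Immune.csv",
    "GutAtlas.Org Author Category JB V1 - Fetal and Pediatric Cell Atlas.csv",
    "GutAtlas.Org Author Category JB V1 - Teichmann and Burclaff.csv",
]


def missing_curated_files(directory, expected=EXPECTED_CURATED_FILES):
    """Return the expected file names that are not present in `directory`.

    Hypothetical helper for illustration; it is not part of the repository.
    """
    root = Path(directory)
    return [name for name in expected if not (root / name).is_file()]


if __name__ == "__main__":
    # Assumption: the script is run from the directory that contains curated_data.
    missing = missing_curated_files("curated_data")
    print("missing:", missing if missing else "none")
```

An empty result means all three files are present; any names returned are the ones still missing from the directory.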