sdsc-ordes / kg-llm-interface

Langchain-powered natural language interface to knowledge-graphs.
Apache License 2.0

KG-LLM: Streamline data ingestion #5

Closed cmdoret closed 1 year ago

cmdoret commented 1 year ago

Currently, ChromaDB can be populated from an RDF file or a SPARQL endpoint using chroma_build.py. In practice, populating ChromaDB directly from RDF files is extremely slow. Ingestion should be split into two separate steps:

  1. RDF files -> SPARQL endpoint (trivial)
  2. SPARQL endpoint -> ChromaDB
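
As a rough illustration of step 2, here is a minimal sketch of paging through a SPARQL endpoint and turning the results into per-subject text documents. It is not the code in chroma_build.py. The function names (`page_query`, `fetch_bindings`, `bindings_to_documents`, `ingest`), the page size and the "one document per subject" layout are all assumptions made for the example. The `add_batch` callback stands in for whatever writes to the vector store; with ChromaDB it could wrap `collection.add(ids=..., documents=...)`.

```python
import json
import urllib.parse
import urllib.request
from collections import defaultdict
from typing import Callable


def page_query(limit: int, offset: int) -> str:
    """Build one page of a plain triple query (hypothetical query shape)."""
    return (
        "SELECT ?s ?p ?o WHERE { ?s ?p ?o } "
        f"ORDER BY ?s LIMIT {limit} OFFSET {offset}"
    )


def fetch_bindings(endpoint: str, query: str) -> list[dict]:
    """POST a query using the SPARQL 1.1 Protocol and return the JSON bindings."""
    data = urllib.parse.urlencode({"query": query}).encode()
    request = urllib.request.Request(
        endpoint,
        data=data,
        headers={"Accept": "application/sparql-results+json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["results"]["bindings"]


def bindings_to_documents(bindings: list[dict]) -> dict[str, str]:
    """Group triples by subject: one text document per subject IRI."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for row in bindings:
        grouped[row["s"]["value"]].append(
            f'{row["p"]["value"]} {row["o"]["value"]}'
        )
    return {subject: "\n".join(lines) for subject, lines in grouped.items()}


def ingest(
    endpoint: str,
    add_batch: Callable[[list[str], list[str]], None],
    fetch: Callable[[str, str], list[dict]] = fetch_bindings,
    page_size: int = 1000,
) -> int:
    """Page through the endpoint and hand each batch to add_batch.

    Returns the number of documents emitted. A subject whose triples
    straddle a page boundary is emitted once per page in this sketch.
    """
    offset = 0
    emitted = 0
    while True:
        bindings = fetch(endpoint, page_query(page_size, offset))
        if not bindings:
            break
        documents = bindings_to_documents(bindings)
        add_batch(list(documents.keys()), list(documents.values()))
        emitted += len(documents)
        offset += page_size
    return emitted
```

Passing `fetch` and `add_batch` as parameters keeps the paging logic testable without a running endpoint or database. Step 1 (loading RDF files into the endpoint) is left to the triple store's own bulk loader.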

Requirements: