magda-io / magda

A federated, open-source data catalog for all your big data and small data
https://magda.io
Apache License 2.0
509 stars 93 forks source link

LLM Indexing Strategy: Generic Data structure & Opensearch Index Schema Design #3536

Open t83714 opened 4 months ago

t83714 commented 4 months ago

Description

This ticket is about the data structure & OpenSearch index schema design.

Technical Requirements

Proposed Data structure & Indexing Structure

We will only use one index for storing all LLM indexing information.

We will define the following fields: