OpenPecha / rag_prep_tool

MIT License
0 stars 0 forks source link

RAG0003: Embed text chunks and populate vector database (2) #4

Closed teny19 closed 1 month ago

teny19 commented 1 month ago

Description

Using the "Alibaba-NLP/gte-large-en-v1.5" embedding model, the text chunks will be vectorized and inserted into the ChromaDB vector database. The stored embeddings will be indexed and enriched with the metadata information.

Expected Output

Vector database that includes all the embedded text chunks with their metadata.

Implementation Plan

Image

Implementation tasks