Closed earth2travis closed 2 months ago
Got KT from @santteegt, started on this as well!!
Waiting for output connector / pipeline to be working from @santteegt !!! then can do local testing for most compatible model
The output connector is ready for testing. Here are some instructions on how to use it with gaia:
data_explore.ipynb
notebookdocker run -p 6333:6333 -p 6334:6334 -v ./qdrant_dev:/qdrant/storage:z qdrant/qdrant:v1.10.
gaianet
folder (e.g. "boardroom_test-xxxxxx.....snapshot"){
...
"embedding_collection_name": "boardroom_test",
"embedding_ctx_size": "768",
...
"snapshot": "boardroom_test-xxxxxx.....snapshot",
....
}
gaia init
and gaia start
Data doesn't get inserted into qdrant vector db!!! Can you share the Jupyter notebook with cleanup stuff!!!
@wtfsayo https://github.com/raid-guild/gaianet-rag-api-pipeline/blob/main/experiments/data_explore_cftoc.ipynb just click run all cells
Still nothing got inserted to vector store @santteegt, is it working on your machine?
@wtfsayo yes. It's been working for me all the time. What error are you getting?
Started to work now!
The output connector is ready for testing. Here are some instructions on how to use it with gaia:
- Open the
data_explore.ipynb
notebook- Run a local qdrant instance using docker (just for embeddings + snapshot generation):
docker run -p 6333:6333 -p 6334:6334 -v ./qdrant_dev:/qdrant/storage:z qdrant/qdrant:v1.10.
- For testing purposes enable the http input connector (omit the airbyte connector cells) to extract the first page from the proposals endpoint
- Execute the rest of the pipeline using this input data
- Download the snapshot file using the URL displayed at the end of the pipeline execution and save it on the
gaianet
folder (e.g. "boardroom_test-xxxxxx.....snapshot")- Shutdown the qdrant docker instance
- On the gaia node side, the config file should have the following updates:
{ ... "embedding_collection_name": "boardroom_test", "embedding_ctx_size": "768", ... "snapshot": "boardroom_test-xxxxxx.....snapshot", .... }
- run
gaia init
andgaia start
going to try this now!! (with llama3-8b)
Select appropriate LLM embedding model for the Gaia node. The chosen model should be compatible with the data structure and use case of the project, ensuring optimal performance and accuracy. The decision-making process will include evaluating different models based on criteria such as embedding quality, processing speed, and resource requirements.