Closed abhinavkulkarni closed 9 months ago
There's a short explanation of how codebase indexing happens incrementally here, and the rest of the indexing code resides here, build_index.py
being the entrypoint.
We're working on transitioning away from the Python server, so for a short time we won't be focusing on indexing, but plan to really get these things right soon after
Thanks @sestinj,
In #2, you mentioned despite having only 5
embeddings in the chromadb
, relevant results were returned from meilisearch
. How does that work? Do you simultaneously do embedding search and keyword search to retrieve relevant chunks for RAG?
Hey @sestinj,
Thanks for the great work!
I recently loaded this project
continuedev/continue
in my VSCode and I tried the/codebase
command on it with a few sample queries. I have a few questions:I tried to find details in the documentation, but couldn't find much.
The
continue/docs/docs/walkthroughs/codebase-embeddings.md
mentions embeddings are stored in~/.continue/embeddings
, but I believe they are stored in~/.continue/index/chroma/
. The~/.continue/embeddings
folder on my machine is empty.I opened the above db using
chromadb
python package andcollection.count()
prints5
. How can this be the case?chroma_client = chromadb.PersistentClient(Path("~/.continue/index/chroma/default/chroma").expanduser().as_posix()) collection_name = "chroma-default" collection = chroma_client.get_collection(name=collection_name) collection.count()