Closed kristenbrann closed 1 year ago
This updates the database to store embeddings on files AND embeddings on chunks of text within the files. Each database entry now looks like this:
{ md5hash: string; embedding: Vector; chunks: [{ contents: string; embedding: Vector; }] }
Stores md5 hash on the entry and uses that to compare on open to see if existing files need to be reindexed.
In the indexing function, call the embeddings api with multiple texts at once. Saves greatly on time!
Embeddings now at the file AND block level
This updates the database to store embeddings on files AND embeddings on chunks of text within the files. Each database entry now looks like this:
Check if files have changed
Stores md5 hash on the entry and uses that to compare on open to see if existing files need to be reindexed.
Batch embeddings
In the indexing function, call the embeddings api with multiple texts at once. Saves greatly on time!
Screenshot ðŸ¤
Misc