Universal Sentence Encoder for embedding

It looks like that project uses a scikit-learn KNN retriever. Qdrant's cosine similarity search should be more powerful than that, so the main thing I can take from it is how it gets its embedding vectors.
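For reference, here is a minimal sketch of what a cosine similarity search over embedding vectors does. It is a brute-force NumPy version for illustration only, not Qdrant's implementation and not this project's code; the function name `cosine_top_k` and its parameters are made up for the example.

```python
import numpy as np


def cosine_top_k(query, vectors, k=3):
    """Return (indices, scores) of the k rows of `vectors` closest to `query`.

    Hypothetical helper for illustration; assumes no zero-length vectors.
    """
    query = np.asarray(query, dtype=np.float64)
    vectors = np.asarray(vectors, dtype=np.float64)
    # After L2 normalization, a dot product equals the cosine similarity.
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    scores = v @ q
    # Sort by descending similarity and keep at most k results.
    k = min(k, len(scores))
    top = np.argsort(-scores)[:k]
    return top, scores[top]
```

The ranking only depends on the angle between vectors, so the quality of the results comes mostly from the embedding model that produced them.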

I have already implemented a Transformer embedding, which uses an embedding model from Hugging Face. It requires the PyTorch library, while your link requires TensorFlow. I will add both as separate modules and add requirements.txt files for them.
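A rough sketch of how the two backends could sit side by side without forcing both frameworks to be installed. This is not the code in this repository: every name here (`get_embedder`, `BACKENDS`, the loader functions) is hypothetical, and the mean pooling is just one common way to turn token outputs into a sentence vector. The heavy imports happen inside each loader, so a PyTorch-only install never touches TensorFlow and vice versa.

```python
from typing import Callable, Dict, List

# An embedder maps a list of texts to a list of vectors.
Embedder = Callable[[List[str]], List[List[float]]]


def load_transformer_embedder(model_name: str) -> Embedder:
    # Lazy imports: only needed when the PyTorch backend is selected.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    def embed(texts: List[str]) -> List[List[float]]:
        batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**batch).last_hidden_state
        # Mean-pool token vectors, ignoring padding positions.
        mask = batch["attention_mask"].unsqueeze(-1).to(hidden.dtype)
        pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
        return pooled.tolist()

    return embed


def load_use_embedder(model_url: str) -> Embedder:
    # Lazy import: only needed when the TensorFlow backend is selected.
    import tensorflow_hub as hub

    model = hub.load(model_url)

    def embed(texts: List[str]) -> List[List[float]]:
        return model(texts).numpy().tolist()

    return embed


# Hypothetical registry; the keys are made up for this example.
BACKENDS: Dict[str, Callable[[str], Embedder]] = {
    "transformer": load_transformer_embedder,
    "use": load_use_embedder,
}


def get_embedder(backend: str, model: str) -> Embedder:
    """Pick a backend by name and load the given model with it."""
    try:
        loader = BACKENDS[backend]
    except KeyError:
        raise ValueError(f"unknown embedding backend: {backend!r}") from None
    return loader(model)
```

With something like this, `get_embedder("transformer", "intfloat/e5-large-v2")` would load the Hugging Face model, and `get_embedder("use", ...)` would take the TensorFlow Hub handle of the Universal Sentence Encoder.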

BTW, both local embeddings need a lot of computing resources, so make sure your machine is powerful enough. I tested intfloat/e5-large-v2 and it almost froze my 13600K while handling 20 pages of a PDF.

c0sogi / LLMChat

Universal Sentence Encoder for embedding #38