digitalfabrik / integreat-chat

Interface to self-hosted large language models and vector databases to provide improved Integreat Chat functionality
https://integreat-app.de
MIT License

Evaluate MiniLLM Performance #10

Closed dasgoutam closed 2 weeks ago

dasgoutam commented 8 months ago

Given a consistent retrieval mechanism, evaluate the performance of MiniLLMs.

Select a list of MiniLLMs to evaluate, and compare their performance.

At a later stage, the best-performing MiniLLM can be compared with larger models (higher parameter counts).
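
A minimal sketch of how such a comparison could be set up, assuming each candidate model is wrapped as a plain callable from prompt to answer and that every model sees the same pre-retrieved context per question. The names `Model`, `token_f1`, and `evaluate`, the prompt layout, and the token-overlap F1 metric are illustrative assumptions, not part of this repository or the Colab notebook.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple

# Hypothetical type: a model is any callable mapping a prompt to an answer.
Model = Callable[[str], str]


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    if not pred or not ref:
        return 0.0
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def evaluate(
    models: Dict[str, Model],
    cases: List[Tuple[str, str, str]],
) -> Dict[str, float]:
    """Return the mean F1 per model.

    Each case is (question, retrieved_context, reference_answer). The
    context is fixed up front so that retrieval is identical for every
    model and only the generation step varies.
    """
    scores: Dict[str, float] = {}
    for name, model in models.items():
        total = 0.0
        for question, context, reference in cases:
            prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
            total += token_f1(model(prompt), reference)
        scores[name] = total / len(cases) if cases else 0.0
    return scores
```

Calling `evaluate` with a dictionary of candidate models returns one score per model, so the best-performing one can be picked with `max(scores, key=scores.get)`. Token overlap is only a rough proxy for answer quality; a different metric can be swapped in without changing the loop.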

svenseeberg commented 6 months ago

Currently implemented in Google Colab.

svenseeberg commented 1 month ago

Maybe we can use llama3.2:3b to classify the incoming messages as "question"/"not a question".

https://github.com/digitalfabrik/integreat-chat/blob/main/integreat_chat/core/settings.py#L54-L56
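
A rough sketch of what such a classification step might look like. It assumes the model is reachable through some `generate` callable that takes a prompt and returns the raw completion; that callable, the prompt wording, and the function name `is_question` are hypothetical and not taken from `settings.py` or the rest of the code base.

```python
from typing import Callable

# Hypothetical prompt; the two labels mirror the ones suggested above.
PROMPT_TEMPLATE = (
    "Classify the following chat message. "
    "Reply with exactly one of these labels: "
    "'question' or 'not a question'.\n\n"
    "Message: {message}\n"
    "Label:"
)


def is_question(message: str, generate: Callable[[str], str]) -> bool:
    """Return True if the model labels the message as a question.

    `generate` is an assumed wrapper around whichever endpoint serves
    llama3.2:3b; it receives the prompt and returns the completion text.
    """
    reply = generate(PROMPT_TEMPLATE.format(message=message))
    # Normalise whitespace, quotes, and case before matching the label.
    # "not a question" starts with "not", so the prefix check is unambiguous.
    label = reply.strip().strip("\"'").lower()
    return label.startswith("question")
```

Passing the model call in as a parameter keeps the sketch independent of the serving stack and makes it easy to test with a stub. Any reply that does not start with `question` is treated as "not a question", which is a conservative default for unexpected model output.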