Evaluate MiniLLM Performance

digitalfabrik / integreat-chat

Interface to self-hosted large language models and vector databases to provide improved Integreat Chat functionality

https://integreat-app.de

MIT License


Closed: dasgoutam closed this issue 2 weeks ago

dasgoutam commented 8 months ago

Given a consistent retrieval mechanism, evaluate the performance of MiniLLMs.

Select a list of MiniLLMs to evaluate and compare their performance.

At a later stage, the best-performing MiniLLM can be compared with larger models (higher parameter counts).
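
A minimal sketch of how such a comparison could be set up, assuming each candidate model is wrapped in a callable that takes a prompt and returns text. The names `EvalItem`, `build_prompt` and `compare_models`, as well as the keyword-match scoring, are hypothetical and not taken from this repository. The retrieved context is fixed per item so that only the model varies.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalItem:
    question: str
    context: str           # retrieved passage, identical for every model
    expected_keyword: str  # hypothetical, crude correctness signal


def build_prompt(item: EvalItem) -> str:
    """Build the same prompt for every model from a fixed retrieval result."""
    return (
        "Answer the question using only the context.\n"
        f"Context: {item.context}\n"
        f"Question: {item.question}\n"
        "Answer:"
    )


def compare_models(
    models: Dict[str, Callable[[str], str]],
    items: List[EvalItem],
) -> Dict[str, float]:
    """Return, per model, the fraction of answers containing the expected keyword."""
    scores: Dict[str, float] = {}
    for name, generate in models.items():
        hits = 0
        for item in items:
            answer = generate(build_prompt(item))
            if item.expected_keyword.lower() in answer.lower():
                hits += 1
        # Avoid division by zero when no evaluation items are given.
        scores[name] = hits / len(items) if items else 0.0
    return scores
```

Keyword matching is only a placeholder metric here; a real evaluation would likely need human ratings or a stronger automatic score.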

svenseeberg commented 6 months ago

Currently implemented in Google Colab.

svenseeberg commented 1 month ago

Maybe we can use llama3.2:3b to classify the incoming messages as "question"/"not a question".
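
A rough sketch of what this classification could look like, assuming the model is served by a local Ollama instance on its default port (the issue does not say how the model is hosted). The prompt wording, the function names and the fallback label are hypothetical. The model call is passed in as a parameter so the parsing logic can be tried without a running server.

```python
import json
import urllib.request
from typing import Callable

# Hypothetical prompt; the wording is an assumption, not from this repository.
PROMPT_TEMPLATE = (
    "Classify the following chat message. "
    'Reply with exactly one label: "question" or "not a question".\n'
    "Message: {message}\n"
    "Label:"
)


def ollama_generate(
    prompt: str,
    model: str = "llama3.2:3b",
    url: str = "http://localhost:11434/api/generate",  # assumed local Ollama server
) -> str:
    """Send a non-streaming generate request and return the model's text."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.loads(response.read().decode("utf-8"))["response"]


def classify_message(
    message: str,
    generate: Callable[[str], str] = ollama_generate,
) -> str:
    """Map the model's free-text reply onto one of the two labels."""
    reply = generate(PROMPT_TEMPLATE.format(message=message)).strip().lower()
    # Check the negative label first, because it contains the word "question".
    if "not a question" in reply:
        return "not a question"
    if "question" in reply:
        return "question"
    # Assumption: treat unparseable replies as "not a question".
    return "not a question"
```

With a server running, `classify_message("Where can I find a doctor?")` would send the prompt to the model; passing a stub as `generate` exercises only the label parsing.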
