Josephrp / DataTonic

🌟 DataTonic: a data-capable, AGI-style agent builder of agents that creates swarms, runs commands, and securely processes and creates datasets, databases, visualisations, and analyses.
https://www.tonic-ai.com
MIT License

Priority Task : start using trulens to evaluate Gemini #1

Closed by Josephrp 11 months ago

Josephrp commented 11 months ago

🤔How To

Check our References

TruLens GitHub repository and example notebooks: https://github.com/truera/trulens/tree/main/trulens_eval/examples

Ideas for Evaluation

Work

What it takes: literally just running a notebook.

We will include the notebooks in the submission and the write-up.

Josephrp commented 11 months ago

Hey there @mie-h and @Zochory: https://github.com/Tonic-AI/DataTonic/tree/main/evaluation is the folder where we will start working on the TruLens evaluations, which are a hackathon requirement and good practice while building an app 🫡

Josephrp commented 11 months ago

Hey there @mie-h & @Zochory: I added default prompts to the baseline prompts folder; we can use those in a TruLens evaluation.
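
As a rough illustration of what "using the baseline prompts in an evaluation" could look like, here is a minimal pure-Python sketch. It does not use the TruLens API: `keyword_relevance`, `evaluate`, and `echo_model` are hypothetical names invented for this example, the scoring rule is a toy stand-in for a real feedback function, and `echo_model` stands in for an actual Gemini call. The idea is only to show the shape of the loop: run each system prompt over the same questions, score every answer, and compare the averages.

```python
from statistics import mean
from typing import Callable, Dict, List


def keyword_relevance(question: str, answer: str) -> float:
    """Toy feedback function (assumption, not a TruLens metric):
    fraction of the question's words that also appear in the answer."""
    q_words = {w.strip("?.,!").lower() for w in question.split()}
    q_words.discard("")
    a_words = {w.strip("?.,!").lower() for w in answer.split()}
    if not q_words:
        return 0.0
    return len(q_words & a_words) / len(q_words)


def evaluate(
    system_prompts: Dict[str, str],
    questions: List[str],
    model: Callable[[str, str], str],
) -> Dict[str, float]:
    """Return the mean feedback score per system prompt."""
    scores = {}
    for name, system_prompt in system_prompts.items():
        per_question = [
            keyword_relevance(q, model(system_prompt, q)) for q in questions
        ]
        scores[name] = mean(per_question)
    return scores


def echo_model(system_prompt: str, question: str) -> str:
    """Hypothetical stand-in for a real model call (e.g. Gemini)."""
    if "repeat" in system_prompt.lower():
        return f"You asked: {question}"
    return "I cannot help."
```

Calling `evaluate({"echo": "Repeat the question.", "refuse": "Refuse."}, ["What is a dataset?"], echo_model)` gives one average score per prompt, which is the kind of leaderboard a notebook would report. In a real run, the stub model and the toy score would be replaced by the actual model call and the feedback functions from the TruLens example notebooks.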

Josephrp commented 11 months ago

Consider using this to generate "system prompts" for Gemini.

Josephrp commented 11 months ago

Added an incomplete example notebook: https://github.com/Tonic-AI/DataTonic/blob/main/evaluation/results/modelcomparision.ipynb

Josephrp commented 11 months ago

Big thank you to 🏆😎 @MN-Noor for producing the first TruLens evaluation with Gemini on RAG using OpenAI!

Open tasks:

We'll all work on this together. Normally, if everyone does one evaluation, or at least contributes to a good one, we will have secured this task.

Zochory commented 11 months ago

Shouldn't we add other multimodal LLMs to the evals, like this one? https://huggingface.co/sshh12/Mistral-7B-LoRA-ImageBind-LLAVA