Arize-ai / phoenix

AI Observability & Evaluation
https://docs.arize.com/phoenix
Other
4.05k stars 299 forks source link

Benchmark existing models on hallucinations dataset #5266

Open Jgilhuly opened 3 weeks ago

Jgilhuly commented 3 weeks ago

https://github.com/Arize-ai/dataset-generation-research