openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
14.56k stars 2.56k forks source link

Idea for Evals: Emotion and sentiment analysis Evals #637

Open Pabreetzio opened 1 year ago

Pabreetzio commented 1 year ago

Understanding emotions and sentiments is an essential aspect of human communication. The system's ability to recognize these emotions enables more appropriate, context-aware, and empathetic responses. Emotions and sentiments can be complex and nuanced, requiring a system to perform fine-grained analysis and interpretation. Evaluating a system's performance in this area provides insights into its overall ability to process and understand subtleties in language.

There are several different emotions that Evals could be written for:

I suppose this could be covered in one Eval or many. If many, it might be good to come up with some sort of structure for organizing Evals for different sentiments or at least documenting what was covered already and what has yet to be covered. Sarcasm, for instance, was covered with an eval that contained news articles from the Onion. While article headlines are certainly one way of testing for sarcasm, others could be a dataset of tweets or reviews of products or examples from fiction.

Ein-Tim commented 1 year ago
gauravjaincr7 commented 1 year ago

for SEO - latent semantic indexing is already there, check out their Docs!!!!!

DOGMATIL commented 1 year ago

https://github.com/openai/evals/issues/637#issuecomment-1509863288

DOGMATIL commented 1 year ago

excellent

DOGMATIL commented 1 year ago

https://github.com/openai/evals/issues/637#issuecomment-1509863288

DOGMATIL commented 1 year ago

that's what you want for dinner tonight to get it out of