Passing in model_kwargs

EQ-bench / EQ-Bench

A benchmark for emotional intelligence in large language models

MIT License

180 stars 13 forks source link

Hi there! I'm not supporting min_p directly (this is by design so that the benchmark runs with repeatable params). But you can specify sampler config with ooba like this:

# lib/run_query.py

def run_ooba_query(...):
...
data = {
"mode": "instruct",        
"messages": messages,
"instruction_template": prompt_format,
"max_tokens": completion_tokens,
"temperature": temp,
"user_bio": "",
"min_p": 0.1
}

You may also wish to hardcode temp here if you are using min_p. EQ-Bench uses temp 0.01 by default, and the creative writing test uses temp 0.7

EQ-bench / EQ-Bench

Passing in model_kwargs #22