CoderBak closed this 5 months ago
python inference.py -m claude-2.1 -d mt_bench --evaluation_set train\[:5\] --model_type base
Result: 9.00
Result: 9.00