Closed ggbetz closed 1 month ago
Qwen models fail to generate reasoning traces. https://github.com/logikon-ai/cot-eval/blob/f9bfe8f757edbed49324df680214a24fbde37213/src/cot_eval/__main__.py#L139C1-L146C53
Might however be related to https://github.com/logikon-ai/cot-eval/issues/48, as I've been testing the smallest base model only...
Let's skip 1.5 and directly go for Qwen2...
For
{XX}
in [0.5B, 1.8B, 4B, 7B, 14B, 32B, 72B]:Check:
Parameters: