Closed zhimin-z closed 10 months ago
Hi zhimin! Thank you for the question. GSM100 uses 16 examples where 8 of these examples are token from chain-of-thought-hub and the remaining 8 examples are written by us. We verify our 8 examples on turbo-16k results show that using more examples improves turbo from 80 -> 84.