how to change y1? - Githubissues

Hello,

Thanks for your interest in our work! In our provided implementation, we set $\gamma_1 = 1$ because we observed that the performance is nearly the same for $\gamma_1 = 2$, and it decreases for larger values of $\gamma_1$. This is due to the low acceptance rate for Llama-68M. To keep things simple, our open-source code uses $\gamma_1 = 1$.

If you’d like to try using better draft models with higher acceptance rates, you can directly modify the function linked below. You only need to add an extra inner loop for $\gamma_1$:

https://github.com/Infini-AI-Lab/TriForce/blob/e865a1df7ded2b43bc309106c05371c429fc10f1/utils/decoding.py#L182-L222

If you have any further questions, feel free to ask.

Infini-AI-Lab / TriForce

how to change y1? #6