Closed zankner closed 11 months ago
We are working on V1.0. So at that time, you will expect us to show more detailed information on how the ablation works (and maybe an Arxiv report).
Understood. If I am benchmarking for a paper should I assume that the settings reported in the repository, ie (mc_sim_7b_63), are the current optimal inference settings?
If you have your own model, and you want to generate a sparse tree, please refer to the preview version. The folder contains a readme that will guide you customize your tree settings step by step.
Understood thanks! Is there a timeline for v1 will be fully released?
It will be very soon... We still have some minor issues to try to fix :) pls stay tuned!
Are there any breaking changes? Ie if I have been basing my code off the main branch are there any bugs or issues with that branch? Sorry for all the questions
The eval folder is self-contained and should be compatible with the original one. We are trying to implement other recent models w full-finetuning and the branch is not tested yet...
Ah ok thank you very much! Great work btw!
Awesome project! I was wondering if you would be able to share the mt-bench results for the different Medusa configs. Specifically from this ablation: