xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding
Apache License 2.0
3.02k stars, 311 forks

How do we get this into llama cpp? [feature request] #54

Open e-p-armstrong opened 1 month ago

e-p-armstrong commented 1 month ago

Seems like an absolutely awesome project. I do a lot of domain-expert LLM fine-tuning, so this would be amazing to have in my work. What has to be done to get this into common inference engines like llama.cpp?

onofreiciuc commented 3 weeks ago

It could even be added to bitnet.cpp.