sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.
https://sglang.readthedocs.io/en/latest/
Apache License 2.0

[Feature] DRY repetition penalty #1350

Closed: vnkc1 closed this issue 2 weeks ago

vnkc1 commented 2 weeks ago

Motivation

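DRY ("Don't Repeat Yourself") is a modern repetition penalty that targets repeated N-grams rather than individual tokens: it penalizes any token that would extend a sequence already present in the context, and the penalty ramps up with the length of the repeated sequence, which prevents looping behavior.

Practitioners commonly use it with Llama 3.1 8B to avoid repetition spirals.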
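To make the request concrete, here is a minimal sketch of the core DRY idea in plain Python. It is not SGLang code and not the implementation from the linked PR. The function name `dry_penalties` is hypothetical, the parameter names and default values (`multiplier`, `base`, `allowed_length`) are assumed from my reading of the text-generation-webui PR, and sequence breakers are left out for brevity.

```python
def dry_penalties(tokens, multiplier=0.8, base=1.75, allowed_length=2):
    """Sketch of a DRY-style penalty (hypothetical helper, not a library API).

    Returns {token_id: penalty} for every token that would extend a
    sequence already present in `tokens`. The penalty is meant to be
    subtracted from that token's logit before sampling.
    """
    penalties = {}
    n = len(tokens)
    # Treat each earlier position i as a candidate continuation: if the
    # tokens just before i match the end of the context, then emitting
    # tokens[i] next would extend a repeated sequence.
    for i in range(1, n):
        match_len = 0
        while (match_len < i
               and tokens[i - 1 - match_len] == tokens[n - 1 - match_len]):
            match_len += 1
        # Repeats shorter than allowed_length are not penalized.
        if match_len >= allowed_length:
            # The penalty grows exponentially with the repeat length.
            penalty = multiplier * base ** (match_len - allowed_length)
            candidate = tokens[i]
            penalties[candidate] = max(penalties.get(candidate, 0.0), penalty)
    return penalties


def apply_dry(logits, tokens, **kwargs):
    """Return a copy of `logits` (a list indexed by token id) with penalties applied."""
    out = list(logits)
    for token_id, penalty in dry_penalties(tokens, **kwargs).items():
        out[token_id] -= penalty
    return out
```

For example, with the context `[1, 2, 3, 4, 1, 2, 3]` the suffix `1, 2, 3` already occurred and was followed by `4`, so only token `4` is penalized. This quadratic scan is for illustration only; a serving framework would need a batched, incremental version.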

Related resources

https://github.com/oobabooga/text-generation-webui/pull/5677

https://www.reddit.com/r/SillyTavernAI/comments/1eg2pq5/good_info_on_dry_to_get_you_started_has/

zhyncs commented 2 weeks ago

@vnkc1 ref https://github.com/sgl-project/sglang/pull/1187

vnkc1 commented 2 weeks ago

Thanks

vnkc1 commented 2 weeks ago

Closing this issue