Closed vnkc1 closed 2 weeks ago
DRY is a modern repetition penalty which ramps up penalties on N-grams to avoid looping behavior.
The penalty is commonly used on Llama 3.1 8B by practitioners to avoid repetition spirals.
https://github.com/oobabooga/text-generation-webui/pull/5677
https://www.reddit.com/r/SillyTavernAI/comments/1eg2pq5/good_info_on_dry_to_get_you_started_has/
@vnkc1 ref https://github.com/sgl-project/sglang/pull/1187
Thanks
Closing this issue
Checklist
Motivation
DRY is a modern repetition penalty which ramps up penalties on N-grams to avoid looping behavior.
The penalty is commonly used on Llama 3.1 8B by practitioners to avoid repetition spirals.
Related resources
https://github.com/oobabooga/text-generation-webui/pull/5677
https://www.reddit.com/r/SillyTavernAI/comments/1eg2pq5/good_info_on_dry_to_get_you_started_has/