Closed terrykong closed 1 month ago
Fixes a crash that occurs when using llama3 tokenizers since they don't have a pad_id in their huggingface tokenizer.
The fix is to use an out of bounds id if there's no pad_id, then swap it with eos_id right after generation. Credit to @gshennvm for the idea
# Add a code snippet demonstrating how to use this
Pre checks:
max_steps=-1
validation
What does this PR do ?
Fixes a crash that occurs when using llama3 tokenizers since they don't have a pad_id in their huggingface tokenizer.
The fix is to use an out of bounds id if there's no pad_id, then swap it with eos_id right after generation. Credit to @gshennvm for the idea
Changelog
Usage
Before your PR is "Ready for review"
Pre checks:
Checklist when contributing a new algorithm
max_steps=-1
andvalidation
?Additional Information