Closed saturosfz closed 3 days ago
worked under exllamav2 dev branch
For people encountering the same issue in exllamav2 v0.2.3
, if filter_prefer_eos=False
then it will work as intended as well. Currently Formatron treats eos_token
as a normal token so filter_prefer_eos=True
will always sample eos_token
once it is allowed.
Version: formatron 0.4.7 exllamav2 0.2.3 Problem: output incomplete json Code: