Closed ccmaymay closed 2 years ago
I'm using a simple fix text = tokenizer.decode(tokenizer.encode(text))
that changes the text in _complete before feeding it to the model. This might have other unintended consequences but seems to work fine on my side for now
@nweir127 I made the fix you suggested and this issue appears to be resolved, thank you. Are there any unintended consequences of turning off clean_up_tokenization_spaces
? I haven't seen anything so far but I'm wondering if I'm just not looking in the right places.
From @nweir127:
Would be a good bug to use to kick off regression testing.