Open CosmicMac opened 8 months ago
I tried this in both VS Code and IntelliJ and found that the encoding looked as expected (though Gemma gives interesting answers).
I'm wondering if this might be the model literally outputting "Ã©" due to something it saw in its dataset. If you say something like "repeat after me: 'é'", can you get it to output the correct encoding?
gemma on acid :)
Unfortunately same problem with the repeat prompt:
A quick test in console:
To me, it looks like double UTF-8 encoding. As I'm using a French OS, maybe an automatic encoding step runs before the forced encoding (or the other way round)? That would explain why you can't reproduce the glitch on your system.
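To illustrate the double-encoding hypothesis, here is a minimal Python sketch. It assumes the UTF-8 bytes are being misread as Latin-1 somewhere along the way, which is only a guess at what happens on the affected system. The function names `double_encode` and `repair` are hypothetical and are not part of the extension or of ollama.

```python
def double_encode(text: str, wrong_codec: str = "latin-1") -> str:
    """Simulate the glitch: encode to UTF-8, then misread the bytes
    as a single-byte codec (assumed here to be Latin-1)."""
    return text.encode("utf-8").decode(wrong_codec)


def repair(text: str, wrong_codec: str = "latin-1") -> str:
    """Reverse the simulated glitch: recover the original bytes,
    then decode them as UTF-8 as originally intended."""
    return text.encode(wrong_codec).decode("utf-8")


garbled = double_encode("élégant")
print(garbled)          # Ã©lÃ©gant
print(repair(garbled))  # élégant
```

If the garbled text in the responses can be restored with a round trip like `repair`, that would support the double-encoding explanation, although the codec actually involved on a given OS may differ.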
Ah this makes sense. Is this built into the OS, or might there be a setting that I could change in order to simulate this?
Unfortunately I have no idea :(
Before submitting your bug report
Relevant environment info
Description
Extended characters in responses are badly encoded (e.g. "Ã©" instead of "é"). Encoding is OK in direct responses from ollama when prompting from the terminal.
To reproduce
1/ Select any gemma model
2/ Prompt "Translate elegant to french"
Log output
No response