Open AnneYang720 opened 2 years ago
@AnneYang720 did you find a workaround?
What about using "backslashreplace" mode instead of "ignore"?
@kamyabzad I think in this case we should get the original bytes back as the result rather than attempt any kind of Unicode decoding, since the user may need to convert them back to a NumPy array or float array.
I don't see a good solution or workaround in the current search result parsing code, though; maybe we need some ideas from the maintainers.
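To make the trade-off between the two suggestions concrete, here is a minimal sketch using only the Python standard library. It assumes the value is decoded as UTF-8 (the thread mentions the "ignore" error mode, but the exact decoding path in the client is not shown here) and uses the 4-byte value from the report.

```python
import struct

raw = b"\x80\x00\x00\x00"  # the 4-byte value from the report

# "ignore" silently drops the invalid 0x80 byte.
ignored = raw.decode("utf-8", errors="ignore")
# "backslashreplace" keeps the information, but as escape text, not as a byte.
escaped = raw.decode("utf-8", errors="backslashreplace")

print(repr(ignored))  # '\x00\x00\x00'
print(repr(escaped))  # '\\x80\x00\x00\x00'

# Re-encoding the escaped string yields 7 bytes, so it is no longer a
# valid 4-byte float32 buffer.
reencoded = escaped.encode("utf-8")
print(len(reencoded))  # 7

# Only the untouched bytes can be unpacked as a float32 directly.
(value,) = struct.unpack("<f", raw)
```

So "backslashreplace" avoids silent data loss, but the caller would still have to undo the escaping before rebuilding an array, which is why returning the raw bytes looks like the simpler contract.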
Version:
Platform: Python 3.9.2 on Debian GNU/Linux 11
Description: Bytes in the vector search results are converted to a string, and this conversion is wrong. Bytes that include b'\x80' are converted to an incorrect string.
Example Code:
The original bytes b'\x80\x00\x00\x00' are converted to the string '\x00\x00\x00'.
Reason:
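The reporter's explanation is not reproduced in this text. As a hedged illustration only, the sketch below assumes that binary field values are decoded as UTF-8 with errors="ignore" (suggested by the comments in this thread, not confirmed against the library source). `lossy_decode` is a hypothetical stand-in for that step, not a real library function. It shows that a packed float32 vector loses bytes under such decoding and can no longer be unpacked.

```python
import struct


def lossy_decode(value: bytes) -> str:
    """Hypothetical stand-in for the client's response decoding (assumption)."""
    return value.decode("utf-8", errors="ignore")


vector = [1.0, -2.0]
# Little-endian float32: 1.0 -> 00 00 80 3f, -2.0 -> 00 00 00 c0
raw = struct.pack("<2f", *vector)

decoded = lossy_decode(raw)
round_tripped = decoded.encode("utf-8")

# 0x80 and 0xc0 are not valid UTF-8 start bytes, so both are dropped.
print(len(raw), len(round_tripped))  # 8 6

# Unpacking works only on the original, undecoded bytes.
restored = list(struct.unpack("<2f", raw))
print(restored)  # [1.0, -2.0]
```

Because the dropped bytes cannot be recovered from the decoded string, any fix on the caller's side has to obtain the value before it is decoded.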