-
This is more a diffuse bunch of questions that an actual TODO:
I will put a section on encoding genre to the new upcoming ODD version, see #67 As it turns out I don't fully understand our encoding …
-
Hello, I am sorry that I can not find the code parts related to using the language model. Could you help to point out the specific part?
-
**Is your feature request related to a problem? Please describe.**
Currently the hexwidget text column isn't very helpful for non ASCII text. Even for English text, many programming environments us…
-
I'm writing in French, so lots of accented characters.
I'm encoding text in ASCII, more precisely ISO-8859-1 (in fact ISO-8859-15 to include €)
This works fine under Windows, Linux, IOS. It display…
-
Is there a specified text encoding for TWKB?
It would be nice if the spec mandated a standard encoding. This would allow more standard use of TWB for things like encoding geometry inside JSON.
…
-
Hi! I've noticed that text_tokenizer-function does not work with non-ASCII letters. I noticed it when trying to use my own pre-trained Finnish word embeddings as part of a text classifying model.
…
-
Hi, while working with your tee implementation, I noticed, that the `encoding="..."` option it was not available in the `run `method.
I later stumbled into an error: `UnicodeDecodeError: 'utf-8' c…
-
Issues:
The repo use `text-encoding` lib in the utils https://github.com/uber/xviz/blob/master/modules/parser/src/utils/text-encoding.js#L18
`text-encoding` is deprecated and no longer maintai…
-
I am encountering an issue with detecting text encoding from PDF files. While the encoding detection works correctly for .txt files, it consistently returns None for PDF files.
Steps to Reproduce:
…
-
## 概要
日本語を含むbufferを開いた際に、コードコメントを使用すると改行の入り方がおかしくなっている気がする。
`Warning: multiple different client offset_encodings detected for buffer, this is not supported yet` がmessageとして発生することがあるが、上記の改行と相関がある…