-
# [KING](https://www.youtube.com/playlist?list=PL7VdqFXO0LzfGbXAsnBqWcnIwMvAV1IWR) // [QUEEN](https://youtu.be/7KtO-gJpEBs)
# [BEST EVER... take to streets?](https://www.youtube.com/@Rootswise)
…
-
I found this error in almost all Chinese dictionaries (it might be connected this issue with UTF16):
I can paste an entry here.
-
We have a long prompt (shown at the bottom of this message). Tiktoken (with the encoding name `cl100k_base`) says this prompt has 2554 tokens, but OpenAI's API says that this prompt contains 2561 toke…
-
Hi 👋 Thank you for creating yarr. I've been using it for ages and it's wonderful! No other feed reader that I know of is so delightfully minimalistic and so easy to self-host thanks to the choice o…
-
**Describe the source**
"The ABC Cantonese-English Comprehensive Dictionary ... comprises about 15,000 lexical entries that are unique to the colloquial Cantonese language as it is spoken and written…
-
For Segmenter, seems like the first crate we could consider incorporating is https://github.com/makotokato/uax14_rs
@sffc @Manishearth @makotokato - would it make sense to consider it for ICU4X?
-
Hello,
Running on ubuntu 22.04
During installation, when I run make test, I get the following error.
`$ sudo make test
[sudo] password for alexander:
cd lua_osml10/tests/ && ./runtests.lu…
-
I fired up a fresh Debian VM to investigate in failing tests:
```
…
calling osml10n.geo_transcript("42", "thai ถนนข้าวสาร 100", { 100, 14, 101, 15 }):
[ERROR] (expected thai thanon khaosan 100, go…
-
For programming languages with non-ascii filenames, e.g. https://github.com/wenyan-lang/wenyan, https://github.com/AnonymousAAArdvark/qi/tree/master/docs, https://github.com/ProjectDimligh…
-
Today Meilisearch normalizes Chinese characters by converting traditional characters into simplified ones.
#### drawback
This normalization process doesn't seem to enhance the recall of Meilisearc…