-
# Extra pre-checklist checklist
- [x] Update Diplomat (https://github.com/rust-diplomat/diplomat/pull/323, https://github.com/unicode-org/icu4x/pull/3304)
- [x] (@Manishearth) Write a changelog …
-
Reference: https://www.youtube.com/watch?v=orfhB33Mf3M&t=2460s#:~:text=这是经常拿枪的手才会长的老茧
會長 会长 [hui4 zhang3] /president of a club, committee etc/
But it can be read as `will grow/develop` too
-
微博内容精选
-
We have converted a corpus of transcribed speech from Penn Treebank to UD, and we would like to represent contiguous sentences that occur as part of a dialogue turn in order to faithfully represent th…
-
There are many data files located here:
https://github.com/unicode-org/icu4x/tree/main/provider/datagen/data/segmenter
Is this the best place for the source of truth, or can we source them from …
-
As part of the API review with @markusicu, he pointed out something I had noticed before and we discussed on Slack, which is that LineSegmenter does not return a breakpoint at index 0.
Here is the …
-
The type for grapheme cluster segmentation is called GraphemeClusterBreakSegmenter. I think it should be called GraphemeClusterSegmenter.
In general, the names of types should have the following pa…
-
Hi! I'm working on the Unicode-based engine [citeproc-lua](https://github.com/zepinglee/citeproc-lua) which requires conversion between sentence case and title case for titles. At the moment I'm using…
-
## ❓ Questions and Help
#### What is your question?
(This is from #5283. I though it is better to separate since it is the new kind of error)
I tried to reproduce wav2vec-U 2.0 with python 3.…
-
## Description
In my hospital (CHU de Brest), ADICAP codes are written like this:
```
ADICAP :B.H.HP.A7A0
Cotations :
ZZQX217 R-AHC-100-A001 R-AHC-10-A015
```
In this case dots s…