-
I use the following code to train punkt for Chinese, but it doesn't produce desired result:
```python
input_str_cn = "台湾之所以出现这种危机,是台湾不但长年低薪,且不知远景在哪里。20世纪90年代,台湾的大学毕业生起薪不到新台币3万元(约合人民币6594元),到了今天,依然…
-
some cases like "一千一千", "一万五万", ("a thousand a thousand", "ten thousands fifty thousands") will be auto-calculated in the results (2000, 60000), but it's not make sense in Chinese.
-
Hello! I am running into a word-sense disambiguation issue where CoreNLP seems to be systematically struggling with adverbs that share a surface form with different words. For example, "number" (as in…
-
There is a growing interest in multilingual R packages, and this relates to recent [rOpenSci work](https://ropensci.org/multilingual-publishing/).
Would it be possible to add the ability to spellch…
-
I've received several reports of confusion between the `excerpt` and `description` fields on proposals. We should relabel these to better explain how they'll be used in OCW (and, optionally, at the co…
-
As pointed in https://github.com/act-rules/act-rules.github.io/pull/1864#issuecomment-1161693515,
```html
For more details about your order contact us.
For more details about how to retur…
Jym77 updated
3 months ago
-
Some homonyms have the same spelling but different pronunciations based on their context. For example it's "**a** unionized group of workers" but "**an** unionized atom". Handling these correctly woul…
-
I'm having trouble with English's non-determinism: I keep hitting expressions that have multiple valid interpretations, and the runtime doesn't always return the same result. It happens infrequently, …
-
This problem is in the latest snapshot and earlier (2023-11-22 and possibly before that date).
Sample text for Text Language English (American).:
... into the tree of life's radiate rainbow color…
-
It looks like the English and German voices fail to pronounce dates and times like this (only ones I've tested):
English: `4/23/2021, 5:02:54 PM`
German: `23.4.2021, 17:02:51`
I know this is a …