-
Code to reproduce
```
import pycantonese as pc
pc.word_segmentation.Segmenter()
Traceback (most recent ca…
-
If you're on Windows, you may hit `UnicodeDecodeError` when importing pycantonese. The problem has been resolved at the GitHub source code, and I've made a pre-release to PyPI. So for now the workarou…
-
Officially, Yale romanization should distinguish high level tone ā from high falling tone à (jyutping doesn't). Would it be possible to make pycantonese do that?
-
I've gone through the documentation but did not find how to convert Chinese sentence to jyutping. For example something like this.
```python
import pycantonese as pc
pc.parse_to_jyutping("我係香港人")…
-
Hello,
please excuse any ignorance on the topic of romanization of Cantonese, as I neither know the language nor it's pronunciation rules. My use case is merely transliterating actor and role names…
ghost updated
4 years ago
-
## 合併詞庫
經過各位嘅努力而家呢個倉庫已經基本成型了。不過我覺得仲有一個問題需要解決,就係將https://github.com/rime/rime-jyutping 入邊嘅詞庫都加埋入去。目前rime-jyutping入邊有15794個帶拼音(都冇聲調)或者冇拼音嘅詞條,而https://github.com/sgalal/rime-cantonese/issues/10 入邊提到嘅`jy…
-
Hi,
It is possible to return all of the sentences in character from the corpus with its pos tag?
-
`import pycantonese as pc
corpus = pc.hkcancor()
print(len(corpus.words()))
print(len(corpus.characters()))
aa = corpus.search(nucleus='aa')
print(len(aa))
`
I tried build-in hkcancor cor…
-
Hi there,
I am using the default corpus and found that there are 149781 words but 0 characters.
What can be causing this issue? I am using pycantonese version 2.0.
![screenshot from 2018-01-26 1…
-
I tried to look up the 門氏五虎將, or some other phases like 仆街 or "Collect skin", but none of these are available.
Should I implement them?
Thank you