-
The version of HKCanCor published on [HuggingFace](https://huggingface.co/datasets/nanyang-technological-university-singapore/hkcancor/tree/main) by NTU is different from the version offered by this l…
-
**Describe the bug**
The segmenter removes the spaces between English words in a code-mixed sentence. For example, take this sentence:
> 這是Career Centre
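
Until the segmenter handles this, one possible workaround is to pull the Latin-script words out first and pass only the remaining text to the segmenter. The sketch below is an illustration under assumptions, not PyCantonese's behavior: `segment_code_mixed` and `segment_fn` are hypothetical names, and keeping each English word as its own token is a design choice made for this example. `segment_fn` stands for any callable that takes a string and returns a list of words (for instance `pycantonese.segment`).

```python
import re
from typing import Callable, List

# A Latin-script "word": ASCII letters/digits, optionally joined by ' or -.
# This pattern is an assumption for illustration; adjust it to your data.
_LATIN_WORD = re.compile(r"[A-Za-z0-9]+(?:['\-][A-Za-z0-9]+)*")


def segment_code_mixed(
    text: str, segment_fn: Callable[[str], List[str]]
) -> List[str]:
    """Segment text while keeping each Latin-script word as one token."""
    words: List[str] = []
    pos = 0
    for match in _LATIN_WORD.finditer(text):
        # Text between Latin words goes to the real segmenter.
        chunk = text[pos:match.start()].strip()
        if chunk:
            words.extend(segment_fn(chunk))
        # The Latin word itself is kept untouched.
        words.append(match.group())
        pos = match.end()
    tail = text[pos:].strip()
    if tail:
        words.extend(segment_fn(tail))
    return words
```

With a character-by-character stand-in such as `segment_fn=list`, the sentence above yields `['這', '是', 'Career', 'Centre']`, so the two English words stay separate.
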
**To reproduce**
Here is the code:
```
import pycantonese
fr…
-
**Describe the bug**
I read this and understand that the corpora used for `characters_to_jyutping` are:
(i) the HKCanCor corpus data included in the PyCantonese library, and (ii) the rime-cantonese data
…
-
**Feature you are interested in and your specific question(s):**
I'm studying the word segmentation of PyCantonese (https://pycantonese.org/word_segmentation.html). Does the function also return the star…
-
**Describe the bug**
An error is thrown when calling `pycantonese.parse_jyutping('hng6')`.
**To reproduce**
Steps to reproduce the behavior, including …
-
I tried to convert characters to Jyutping, but I found that some characters cannot be converted.
For example:
txt='昆省急救服务中心嘅医护人员昆省警方。'
The output is:
[('昆', 'gwan1'), ('省', 'saang2'), ('急救…
-
**Feature you are interested in and your specific question(s):**
Is there any method that converts jyutping to IPA? I know there's a jyutping-to-TIPA method now; it would be great to also have jyutping t…
-
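The current `.segment()` is rather inefficient and does not seem to use an optimal algorithm. @graphemecluster @zhanruiliang A PR may follow later to look at how to optimize it, and to fix the word segmentation problem in https://github.com/jacksonllee/pycantonese/issues/32 along the way.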
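
To make the efficiency point concrete, here is a minimal sketch of greedy longest-match segmentation backed by a trie, which checks one character at a time instead of testing many candidate substrings. It is not PyCantonese's actual implementation: `build_trie`, `longest_match_segment`, and the `max_word_length` default are names and values assumed for illustration.

```python
from typing import Dict, Iterable, List


def build_trie(words: Iterable[str]) -> Dict:
    """Build a nested-dict trie; the key "" marks the end of a word."""
    root: Dict = {}
    for word in words:
        node = root
        for char in word:
            node = node.setdefault(char, {})
        node[""] = True
    return root


def longest_match_segment(
    text: str, trie: Dict, max_word_length: int = 5
) -> List[str]:
    """Greedy left-to-right segmentation, taking the longest known word."""
    result: List[str] = []
    i = 0
    while i < len(text):
        node = trie
        best_end = i + 1  # fall back to a single character
        for j in range(i, min(len(text), i + max_word_length)):
            node = node.get(text[j])
            if node is None:
                break  # no known word continues with this character
            if "" in node:
                best_end = j + 1  # text[i:j + 1] is a known word
        result.append(text[i:best_end])
        i = best_end
    return result
```

For example, with the toy vocabulary `["廣東", "廣東話", "好", "學"]`, `longest_match_segment("廣東話好難學", trie)` returns `['廣東話', '好', '難', '學']`. Each position costs at most `max_word_length` dictionary lookups, so the whole pass is linear in the length of the text.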
-
It is probably inappropriate to open an issue for this here, but I am working on a parser with Nom for parsing CHILDES for Jyutping. Good work!
-
**Feature you are interested in and your specific question(s):**
I would like to request a new output style for `pycantonese.characters_to_jyutping`, something like this:
```
>>>pycantonese.characters_to_jyutping…