Closed EdwinYam closed 2 years ago
Essentially the only way to differentiate the correct case is with part of speech information, something that is outside of the scope of the g2p model.
You can optionally train with this additional parameter but in my opinion is kind of pointless as English only has a finite defined set of heteronyms which can be manually replaced after doing part of speech tagging.
I have an implementation combining heteronym parsing with deep phonemizer for an example implementation: https://github.com/ionite34/Aquila-Resolve
Essentially the only way to differentiate the correct case is with part of speech information, something that is outside of the scope of the g2p model.
You can optionally train with this additional parameter but in my opinion is kind of pointless as English only has a finite defined set of heteronyms which can be manually replaced after doing part of speech tagging.
I have an implementation combining heteronym parsing with deep phonemizer for an example implementation: https://github.com/ionite34/Aquila-Resolve
Thanks a lot for your great work! I will take a deep dive into your repo
Hello, thanks for your great work! I found that dp.phonemizer cannot handle heteronym problems well.
For example:
turns into
while
record
should be[R], [EH], [K], [ER], [D]
Is there any suggestion? Thanks