Open fquirin opened 3 years ago
Actually I'm starting to think that hallo --> h '{ l @ U in the German Zamia lexicon is just wrong and refers to the English pronunciation :sweat_smile:, since related words are, for example, hallodri --> h a l 'o: d R i: (almost identical to German hallo) and halloween --> h '{ l @ w i n (the English hallo).
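A quick way to make that comparison concrete is to count how many leading phoneme symbols two entries share. This is only a minimal sketch using the three entries quoted above; the names `ENTRIES`, `common_prefix_len` and `shared_prefix` are made up for illustration and are not part of Zamia or gruut.

```python
# Entries as quoted in this comment (space-separated phoneme symbols).
ENTRIES = {
    "hallo": "h '{ l @ U",
    "hallodri": "h a l 'o: d R i:",
    "halloween": "h '{ l @ w i n",
}


def common_prefix_len(a, b):
    """Return how many leading symbols the two sequences have in common."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


def shared_prefix(word_a, word_b, entries=ENTRIES):
    """Compare two lexicon entries symbol by symbol (hypothetical helper)."""
    return common_prefix_len(entries[word_a].split(), entries[word_b].split())


if __name__ == "__main__":
    for other in ("hallodri", "halloween"):
        print("hallo vs", other, "->", shared_prefix("hallo", other))
```

With these entries, hallo shares its first four symbols with halloween but only the initial h with hallodri, which fits the suspicion that the hallo entry follows the English pronunciation.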
This would make the espeak IPA to SAMPA pipeline with gruut almost correct except for the apostrophe.
[EDIT]
To be honest, I'm confused about what the correct symbol is here. According to Wikipedia and this converter, ˈ (Unicode U+02C8) is " (Unicode U+0022) in X-SAMPA, but I don't see it anywhere in the Zamia Kaldi lexicon. X-SAMPA seems to have different flavors :see_no_evil: (Conlang X-SAMPA (CXS)).
[EDIT2] Guenter himself seems to use this mapping
Guenter's phonemes seem to be like X-SAMPA, but not quite. I have an English map for Zamia, but I will need to add a German map too :+1:
We had some discussions about it, maybe it helps ^^: https://github.com/gooofy/zamia-speech/issues/79
My conclusion was that I kind of need a manual check: " to ' (roughly Guenter's IPA normalization map).
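The " to ' replacement could be applied as a small post-processing step on whatever the converter returns. The following is just a sketch under the assumption that the converter emits standard X-SAMPA primary stress (") and that the Zamia lexicon expects '. `STRESS_MAP` and `xsampa_to_zamia_stress` are hypothetical names, and the input string is a made-up example rather than real gruut output.

```python
# Assumed mapping: X-SAMPA primary stress (") -> Zamia-style stress (').
# Further symbols would have to be added after checking the lexicon by hand.
STRESS_MAP = {'"': "'"}

_TRANSLATION = str.maketrans(STRESS_MAP)


def xsampa_to_zamia_stress(pronunciation: str) -> str:
    """Replace the X-SAMPA stress marker with the one seen in the Zamia lexicon."""
    return pronunciation.translate(_TRANSLATION)


if __name__ == "__main__":
    # Hypothetical converter output, not copied from a real run.
    print(xsampa_to_zamia_stress('h "{ l @ U'))
```

This only swaps the symbol. It does not move the marker or touch any other phoneme, so the result would still need to be checked against the lexicon.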
Hi Michael,
I've been experimenting with gruut-ipa and the Zamia lexicon.txt from the popular models 'kaldi-generic-de-tdnn_250-r20190328' and 'kaldi-generic-en-tdnn_250-r20190609', and I'm having trouble getting the expected results. As far as I understand, Zamia's lexicon.txt is in SAMPA format, so I selected two words from the German file:
and then used espeak-ng to generate phonemes:
Finally, I tried to convert the espeak results to SAMPA with gruut-ipa:
But none of the results match the lexicon.txt entries. Any help or hints would be appreciated! :-)