Open shahizat opened 1 year ago
I can't see anything obviously wrong. What output do you get?
Hi @synesthesiam, thanks for your reply. The Kazakh model cannot synthesize numbers; it basically skips them. However, the Russian and English models can synthesize numbers.
Hi @synesthesiam,
I've also observed that the command below, which uses your src/python_run example, also cannot read numbers with more than one digit, whereas your binary piper example can synthesize them.
echo '3 4 4 4 5 1835 1865 Шоқан Шыңғысұлы Уәлиханов шын есімі Мұхаммед Қанафия 1835 1865' | scripts/piper --model kk-issai-high.onnx --output_file welcome.wav --speaker 1
It reads the numbers 3 4 4 4 5, skips 1835 1865, and then continues with Шоқан Шыңғысұлы....
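Since single digits are read but multi-digit numbers are skipped, one possible stopgap is to split multi-digit numbers into separate digits before the text reaches Piper. This is only a minimal sketch of a workaround, not a fix for the model or the phonemizer: the function name `split_multi_digit_numbers` is hypothetical and not part of Piper, and the numbers would be read digit by digit rather than as whole numbers.

```python
import re

# Matches runs of two or more ASCII digits, i.e. the multi-digit numbers
# that are reported as skipped in this thread.
_MULTI_DIGIT = re.compile(r"[0-9]{2,}")


def split_multi_digit_numbers(text: str) -> str:
    """Replace each multi-digit number with its digits separated by spaces.

    Hypothetical helper for illustration; it is not part of Piper.
    """
    return _MULTI_DIGIT.sub(lambda m: " ".join(m.group(0)), text)


if __name__ == "__main__":
    line = "3 4 4 4 5 1835 1865 Шоқан Шыңғысұлы Уәлиханов"
    # Prints: 3 4 4 4 5 1 8 3 5 1 8 6 5 Шоқан Шыңғысұлы Уәлиханов
    print(split_multi_digit_numbers(line))
```

The printed text could then be piped into the same command in place of the original `echo` string. Spelling the numbers out as Kazakh words would give a more natural reading, but that needs a proper number-to-words step that this sketch does not attempt.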
Best regards, Shakhizat
Hi @synesthesiam,
This is not a critical issue, but perhaps you happen to know the answer.
I modified your python_run code to accept only numpy arrays for speech synthesis, but as a result I lost the ability to synthesize numbers with the Kazakh model. The English model, however, works correctly.
I suspect that I also lost the automatic punctuation handling. What do you think?
Here is the code
Best regards, Shakhizat