Closed ikm565 closed 1 year ago
Your input is too short: it is shorter than the first layer's kernel size.
We can try to pad samples that are shorter than 13 chars after phoneme conversion.
@loganhart420 can you give it a look?
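A rough sketch of the padding idea, in case it helps whoever picks this up. This is not the actual TTS code: `pad_to_min_length`, `MIN_LEN`, and `pad_id` are hypothetical names, and the minimum of 13 is simply taken from the kernel size in the error message. The real threshold may differ depending on how much padding the convolution itself applies.

```python
from typing import List

# Assumption: 13 comes from "Kernel size: (13)" in the reported error.
MIN_LEN = 13


def pad_to_min_length(token_ids: List[int], min_len: int = MIN_LEN, pad_id: int = 0) -> List[int]:
    """Right-pad a sequence of phoneme IDs so it has at least min_len items.

    pad_id is a hypothetical padding token ID; sequences that are already
    long enough are returned unchanged (as a copy).
    """
    missing = max(0, min_len - len(token_ids))
    return list(token_ids) + [pad_id] * missing


# A short input (4 phoneme IDs) gets padded up to 13 entries.
short = [5, 9, 2, 7]
padded = pad_to_min_length(short)
```

Right-padding keeps the original IDs at the start of the sequence. Whether the model tolerates trailing pad tokens without audible artifacts would need to be checked.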
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.
This got lost in my notifications. I'm checking on it now.
@loganhart420 any updates?
Describe the bug
Running tts --text "Text for TTS" --model_name "tts_models/en/ljspeech/speedy-speech" --out_path output.wav fails with the error: "RuntimeError: Calculated padded input size per channel: (11). Kernel size: (13). Kernel size can't be greater than actual input size"
To Reproduce
Run tts --text "Text for TTS" --model_name "tts_models/en/ljspeech/speedy-speech" --out_path output.wav. The command fails with: "RuntimeError: Calculated padded input size per channel: (11). Kernel size: (13). Kernel size can't be greater than actual input size"
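For anyone reading the error message, here is a small sketch of the arithmetic behind it. It is plain Python, not the library's code, and `conv1d_output_length` is a hypothetical helper. It uses the standard 1-D convolution output-length formula and assumes stride 1 and dilation 1, since the issue does not state either. The values 11 and 13 are taken from the error above.

```python
def conv1d_output_length(padded_len: int, kernel_size: int, stride: int = 1, dilation: int = 1) -> int:
    """Output length of a 1-D convolution over an already padded input.

    Raises ValueError when the kernel does not fit, which is the
    situation the RuntimeError in this issue describes.
    """
    # Span covered by the kernel once dilation is taken into account.
    effective_kernel = dilation * (kernel_size - 1) + 1
    if padded_len < effective_kernel:
        raise ValueError(
            f"padded input size ({padded_len}) is smaller than kernel size ({effective_kernel})"
        )
    return (padded_len - effective_kernel) // stride + 1


# Values from the error message: padded input of 11, kernel of 13.
try:
    conv1d_output_length(11, 13)
    fits = True
except ValueError:
    fits = False

# With at least 13 padded positions the kernel fits exactly once.
min_ok = conv1d_output_length(13, 13)
```

Under these assumptions the padded input is two positions short of the kernel size, which is why very short texts trigger the failure.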
Expected behavior
No response
Logs
Environment
Additional context
No response