jaywalnut310 / glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search
MIT License
660 stars, 151 forks

Error using mel generated from glow-tts for hifi-gan training #66

Open ankurdhuriya opened 2 years ago

ankurdhuriya commented 2 years ago

I get the following error when training HiFi-GAN on mels generated by Glow-TTS:

RuntimeError: stack expects each tensor to be equal size, but got [8192] at entry 0 and [8129] at entry 12

Does anyone have a solution?

OnceJune commented 2 years ago

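The batch contains an audio clip that is shorter than the segment length (8129 samples instead of 8192), so the tensors cannot be stacked. You should either:

  1. Drop the audio clips that are shorter than the segment length; or
  2. Pad the short clips with zeros at the end, up to the segment length.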
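
Both options can be sketched in PyTorch as follows. This is only a minimal illustration, not code from HiFi-GAN or Glow-TTS: the helper names `pad_to_segment` and `drop_short` are hypothetical, the segment length of 8192 is taken from the error message above, and each clip is assumed to be a 1-D waveform tensor.

```python
import torch
import torch.nn.functional as F

SEGMENT_SIZE = 8192  # assumption: taken from the error message in this issue


def pad_to_segment(audio, segment_size=SEGMENT_SIZE):
    """Option 2: right-pad a waveform with zeros up to segment_size samples.

    Clips that are already long enough are returned unchanged.
    """
    missing = segment_size - audio.size(-1)
    if missing > 0:
        # (0, missing) pads nothing on the left and `missing` zeros on the right
        audio = F.pad(audio, (0, missing), mode="constant", value=0.0)
    return audio


def drop_short(clips, segment_size=SEGMENT_SIZE):
    """Option 1: keep only the clips with at least segment_size samples."""
    return [c for c in clips if c.size(-1) >= segment_size]


# Two dummy clips with the sizes reported in the error message.
clips = [torch.randn(8192), torch.randn(8129)]

# Stacking the raw clips would raise the RuntimeError; after padding it works.
batch = torch.stack([pad_to_segment(c) for c in clips])

# Alternatively, discard the short clip before batching.
kept = drop_short(clips)
```

If the dataset also crops the mel to a fixed number of frames, the mel for a padded clip would presumably need matching padding so that audio and mel stay aligned.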