cvillela opened this issue 1 year ago (status: Open)
That is definitely possible and would be really great to have! We could not try this due to computational constraints.
Awesome! Will try it out. How much VRAM do you think is necessary for attempting it?
@deepanwayx Also, I see that the "Tango Prompt Bank" is all at 16,000 Hz. Would you guys have the raw, non-resampled dataset available?
Hey!
I was wondering whether it is possible to train the model on 48 kHz audio and then generate audio directly at 48 kHz. Has anyone attempted this?