[Help] How to do data processing of the tta project?

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

https://openhlt.github.io/amphion/

MIT License

4.41k stars 373 forks source link

[Help] How to do data processing of the tta project? #168

Open spiralanch opened 5 months ago

spiralanch commented 5 months ago

Problem Overview

I'd like to train a TTA model on the data. But have trouble in data processing.

Expected Outcome

A script for constructing triplet training data (instruction, input audio, output audio)?

RMSnow commented 4 months ago

Hi @spiralanch, could you attach some screenshots and more details about your trouble? @HeCheng0625 will follow up that!

spiralanch commented 4 months ago

Hi @RMSnow @HeCheng0625 It is the same question described in #176 .