SlangLab-NU / torgo_vc

0 stars 0 forks source link

Audio token completion using Vall-E for impaired audio #2

Open aanchan opened 11 months ago

aanchan commented 11 months ago

WWW As a researcher having completed a recent literature review (summarized here I would like to see if atypical speech can be synthesized as an audio completion task using audio tokens extracted from neural codecs. Recently a model called vall-e and vall-e-x were released with open source implementations not by the original authors: https://github.com/Plachtaa/VALL-E-X/tree/master

AC The goal of this ticket would be to explore audio prompt completion as a task suitable for our use case. A few unknowns here are: