b04901014 / UUVC

Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units.
MIT License
73 stars 9 forks source link

Voice Transformation on Impaired Speech #1

Closed Raha304 closed 1 year ago

Raha304 commented 1 year ago

Hi, dear Li-Wei

I am working on dysarthric voice conversion and I want to use your idea in the " Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech " paper. I saw your repo for this paper but I don't understand which part of the code is related to perceptual similarity implementation. I am so thankful if you can help me with this problem. Pleeeeeeeeeeease

b04901014 commented 1 year ago

Hi,

Sorry for the late reply. It's been years since I visit this codebase. If you are referring to the Eq. 4 in the paper, I think it's at https://github.com/b04901014/ISGAN/blob/master/src/model.py#L176 which simply calculates the L1 distance of the discriminator layers between speech segments.

b04901014 commented 1 year ago

I found the issue in the original repo... I'll close this one.