-
The samples provided in `datasets_fullband/clean_fullband/VocalSet_48kHz_mono` have a reported sampling rate of 48kHz.
The real sampling rate, however, is 16kHz, which results in a Mickey Mouse ty…
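
If the files are simply mislabeled (an assumption: the sample data is untouched and only the header rate is wrong), a possible workaround is to rewrite the header without resampling. The sketch below uses Python's standard `wave` module. The function name `relabel_sample_rate` and its `true_rate` parameter are illustrative and not part of the dataset tooling, and `wave` only reads PCM WAV files.

```python
import wave


def relabel_sample_rate(src_path, dst_path, true_rate=16000):
    """Copy a PCM WAV file, changing only the sample rate stored in the header.

    The audio frames are written back unchanged (no resampling), so this
    only helps if the header rate is wrong and the samples are fine.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames = src.readframes(params.nframes)

    with wave.open(dst_path, "wb") as dst:
        # Keep channels, sample width, and compression; swap the rate only.
        dst.setparams(params._replace(framerate=true_rate))
        dst.writeframes(frames)
```

After relabeling from 48kHz to 16kHz, the reported duration of each file becomes three times longer, since the same number of frames is played back at a third of the rate.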
-
Hello, and thank you for sharing your great work. I have some questions.
1. For song VC in Mandarin, I tried training a new starganv2vc model with the pretrained ASR and F0 models, but the result does not sound …
-
Feature request: Is singing voice conversion possible?
-
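### First-half retrospective: a few words from everyone
### News
- Conferences
- ACL 2023 (July 9-14, Toronto): NAVER is presenting 10 papers, so please stop by!
- ["We'd rather drop news than pay usage fees"... Google and Meta announce they will stop providing news in Canada](https://n.news.naver.com/mnews/article/469/000…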
-
**Is your feature request related to a problem? Please describe.**
Currently, RVC models with pitch guidance seem to have an f0 range from 50 Hz to 1.1 kHz. When I feed an audio sample outside of thi…
-
For unseen F to seen M conversion, the resulting pitch is very close to that of the source speaker, especially if the source pitch is much higher than the seen M pitch.
I've used SR-based data augmentation s…
-
### Actual behaviour
I have a txt file with 4 players defined, but Performous shows only 2 lines of text (as if it were in duet mode):
![image](https://user-images.githubusercontent.com/39…
-
While using the program, I've noticed that German umlauts (ä, ö, ü) are not pronounced correctly. Instead of the correct pronunciation, the umlauts are replaced with strange characters or sounds, maki…