-
@Vaibhavs10
Thanks for sharing our work!
We have changed the license to the MIT License. (So please update our license information in ReadMe.MD!)
You can now use it in commercial products.
Ple…
-
I tried to debug `ch01_Introduction.ipynb`: I set a breakpoint at `X_train = feature.transform(x_train)`, then pressed F11 to step into the function's source code.
![image](https://github.com/ctgk/PRML/assets…
-
Hi, great project. Just curious why you decided to update the license from MIT to CC-BY-NC-SA?
-
Please check whether this paper is about 'Voice Conversion' or not.
## article info.
- title: **HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Va…
-
Thanks for the repo. I had around 30 hours of custom Hindi data that I wanted to train and test the model on. Training only the TTV part on 4x A6000 GPUs with a batch size of 64, I tried inferencing with …
-
Hello! Thank you for a great model! I am trying to fine-tune your model. The predicted pitch values have a quite uncommon range. Could you please share which library you used for pitch detection? Thank…
-
Thanks for the very cool project.
This is the best and simplest LLM-based TTS implementation I have ever seen!
For audio quality, I highly recommend adding the MS-STFT Discriminator of Encodec, and MS-SB…
-
Hi!
Thank you for your excellent work!
I have a question about the MOS evaluation in the paper.
In the paper, the 95% confidence intervals in the MOS assessment are roughly 2, which I think is …
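
For context, here is a minimal sketch of how a 95% confidence interval for a MOS score is commonly computed. It uses a normal approximation (z = 1.96) and made-up ratings; the function name `mos_with_ci` is hypothetical, and this is not necessarily the procedure used in the paper.

```python
import math
import statistics


def mos_with_ci(ratings, z=1.96):
    """Return (mean, half_width) of a normal-approximation confidence interval.

    `ratings` is a list of listener scores (e.g. 1 to 5).
    z=1.96 corresponds to a 95% interval under the normal approximation.
    """
    n = len(ratings)
    if n < 2:
        raise ValueError("need at least two ratings to estimate a variance")
    mean = statistics.fmean(ratings)
    sample_sd = statistics.stdev(ratings)  # sample standard deviation (n - 1)
    half_width = z * sample_sd / math.sqrt(n)
    return mean, half_width


# Hypothetical ratings, for illustration only.
ratings = [4, 5, 3, 4, 4, 5, 3, 4]
mean, half_width = mos_with_ci(ratings)
print(f"MOS = {mean:.2f} +/- {half_width:.2f}")

# With the same spread of scores but four times as many ratings,
# the interval becomes narrower.
mean_more, half_width_more = mos_with_ci(ratings * 4)
print(f"MOS = {mean_more:.2f} +/- {half_width_more:.2f}")
```

Because the half-width scales with the standard deviation divided by the square root of the number of ratings, ratings on a 1 to 5 scale give a half-width far below 2 unless there are very few of them.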
-
I've set up a synthetic image and blurred it with an anisotropic Gaussian kernel. I started off using simple `ADMM`, following the example [here](https://scico.readthedocs.io/en/stable/examples/decon…
-
An article about the KL distance (Kullback-Leibler divergence), a measure of the distance between distributions.
https://www.countbayesie.com/blog/2017/5/9/kullback-leibler-divergence-explained
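
As a quick illustration of what the measure computes, here is a minimal sketch of the KL divergence between two discrete distributions, in nats. The function name `kl_divergence` and the example distributions are made up for illustration and are not taken from the linked article.

```python
import math


def kl_divergence(p, q):
    """KL(P || Q) = sum_i p_i * log(p_i / q_i), in nats.

    `p` and `q` are equal-length sequences of probabilities that each sum to 1.
    Terms with p_i == 0 contribute 0; if q_i == 0 where p_i > 0, the result is infinite.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same length")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue
        if q_i == 0:
            return math.inf
        total += p_i * math.log(p_i / q_i)
    return total


# Hypothetical distributions, for illustration only.
p = [0.5, 0.5]
q = [0.9, 0.1]
print(kl_divergence(p, q))  # divergence of Q from P
print(kl_divergence(q, p))  # a different value: KL is not symmetric
print(kl_divergence(p, p))  # zero when the distributions are identical
```

Although it is often called a distance, the KL divergence is not symmetric, so it is not a metric in the strict sense.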