-
There was a 500 spoken word file somewhere. Need to run FFT analysis and identify frequency transforms.
-
Hello
I've been going through the the instructions on gitpage however I was not able to figure out how one prepares their own dataset for finetuning.
Could anyone share this if they have been s…
-
Thanks for you great work on implementing FACodec!
I found the data file in https://github.com/Plachtaa/FAcodec/blob/master/data/val.txt has some labels, like speaker id, phonemes. How can I get thes…
-
### Describe the bug
Hi everyone. I'm new to the world of ML, so I'm not used to training AI models...
I really want to create my own TTS model using coqui's VITS trainer, so I've done a lot of re…
-
**Speech SDK log taken from a run that exhibits the reported issue.**
[AzureSpeechLogFile.txt](https://github.com/user-attachments/files/15890867/AzureSpeechLogFile.txt)
**A stripped-down, simpl…
-
1. Loop through the lexicon1000, print the phonemes from lexicon, vertically aligned with phonemes from LTS
```
about: ah b-aw1-t
ah b-ow1-t
act: ae1-k-t
...
```
2. Post a link to the re…
dhowe updated
6 years ago
-
In French sometimes two words sound like one
DEBUG:larynx:Words for 'oui, c'est un': ['oui', ',', "c'est", 'un']
DEBUG:larynx:Phonemes for 'c'est un': ['#', 's', 'e', 't', '#', 'œ̃', '#', '‖', '‖']
…
-
The original version of `panphon` deals well with affricates and co-articulated consonants:
```
panphon.FeatureTable().ipa_segs('t͡s') # ['t͡s']
panphon.FeatureTable().ipa_segs('ɡ͡b') # ['ɡ͡b']…
-
Is there direct command line to call to convert text to phonemes? i don't want the alignments to the audio, just the phonemes. The use case is after training with a TTS model, in the inference time w…
-
look, the code tells us this
```
phoneme o
vwl starttype #o endtype #o
length 100
ChangeIfNotStressed(V)
FMT(vwl_ru/o)
endphoneme
```
this works in words like косая.
but in phoneme V…