lowerquality / gentle

gentle forced aligner
https://lowerquality.com/gentle/
MIT License
1.46k stars 295 forks source link

List of phones Gentle uses and what they map to #302

Closed SuperSonicHub1 closed 2 years ago

SuperSonicHub1 commented 2 years ago

I'm trying to use Gentle to create a lip-flap animator, and while the software works for me, I don't really know how to use its outputs.

In my program, each phoneme of the English language will be assigned to a photo, and so I need to know what each phoneme uttered is. Gentle gives me this data, but the phoneme names don't seem to match any digitization of, say, the IPA. Instead, I get obscure phoneme names like dh_B and ah_E. dh is usually transcribed as th so this caught me off-guard at first, and I have no idea what E means.

If I could get a list of all of Gentle's possible phonemes and their mappings to the IPA, that would make my life a lot easier.

strob commented 2 years ago

The suffixes (B, I, E) stand for "beginning", "middle", and "end" (of the word in which they appear)

SuperSonicHub1 commented 2 years ago

@strob Thanks! Now all I need are a list of phonemes.

tscizzle commented 2 years ago

I was just checking this repo to ask the same exact thing! How timely.

Even if the phoneme list comes from somewhere else, like Kaldi, any clue as we might go about finding it, or relevant documentation?

SuperSonicHub1 commented 2 years ago

Found it! http://www.speech.cs.cmu.edu/cgi-bin/cmudict Thanks, carykh!