SlangLab-NU / torgo_inference

0 stars 2 forks source link

Look to convert phoneme level outputs to word level outputs #24

Open aanchan opened 11 months ago

aanchan commented 11 months ago

WWW While working with phonemes at the sentence level it is hard to compose word sequences out of those phonemes. This ticket would need to investigate methods/papers/online toolkits where phonemes can be mapped back to words. Traditionally ASR systems used a lexicon like the CMUDict Lexicon or use a finite state transducer (FST). Our G2P is an FST under the hood, so that mapping might be tool specific. There might be a mode within our choice of G2P to convert phoneme sequences back to words. I may be that we might need to use an FST based on OpenFST via Kaldi or some other tool.

AC A wiki page on possible alternatives and options.

jindaznb commented 11 months ago

Wiki: https://github.com/SlangLab-NU/links/wiki/Inverse-G2P

GDocs: https://docs.google.com/document/d/1OJ3gCK96HF_-DXaq7mleK_Zq2oGRmtqP8kk7ph2q1Gw/edit#heading=h.x5t42o5b0250