-
I am trying to figure out the effect of setting speaker model in kaldi recognizer:
`rec.SetSpkModel(spk_model)`
How does this help in asr?
Is it only used in normalizing cmvn, or is there mor…
-
Help is very much needed on all this, any contribution will be appreciated.
- [x] Convert C++ api to pure C api for improved interoperatbility (mingw, ios).
- [ ] Windows build (pull request #28)
…
-
I have tried
`python speakerlab/bin/infer_sv.py --model_id $model_id --wavs input.wav`
This exports the numpy array file. How can I get the inference info from trained model that this object is…
-
from 10 folders only one folder data will extracted
-
Syndromes which share the same names should be re-named with unique descriptions.
For example, interaction_biology.txt has sets of ' Biology Interaction' syndromes, which should be re-named to identi…
-
Found your wonderful software, but had minor issue when loading an Amazon Transcribe transcript that had the variant format for independent audio channels as oppose to the typical speakers format.
…
-
It would be most useful if we can train the system to differentiate who said something. Depending on the person we could then start or ignore a command. For instance:
- a guest in the house can't r…
-
I have many audio files with human speech.
I want to group it by speaker.
For the test I get one long file (about 18 minutes) and get embedings for it (about 80 vectors). It meas each vector has abo…
-
-
统计 开源数据 和 爬虫源, 不断更新中... 欢迎追加编辑