-
I am trying to reproduce the results on timit dataset.
I am having some doubt related to inference:
1. I have not seen any documentation for inferring a pre-trained model/checkpoint on sample au…
-
Hi @cvqluu , guys,
Is it possible to get output file with transcribed text for each of the speakers (dialogue with speaker label in a text file for example) ?
a) it's the case to connect sdializ…
-
### System Info
transformers@3.0.0-alpha.6
chrome 127
macos
### Environment/Platform
- [X] Website/web-app
- [ ] Browser extension
- [ ] Server-side (e.g., Node.js, Deno, Bun)
- [ ] Desktop app (…
-
WWW
As a researcher wanting to understand the impact of phonetic error correction using language models on word-level recognition from dysarthric speakers I would like to run an experiment using acous…
-
||link|
|----|---|
|paper| [CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation](https://arxiv.org/abs/2108.11626) |
|code| [paperswithcode](h…
-
Speaker Recognition not working correctly when I use different device for training and testing, Can you please help
-
Hi,
I am trying to find the d_vector for speaker diarization or speaker verification task using the AM-MobileNet1D model.
I have modified my previous inference script to compute the d_vector of …
-
Hi, this work is really interesting. I would like to ask two informations...
Is it possible to realize a speaker diarization like this in real time? Hence, for example, while many people are speaki…
-
Hello
A user on Stack Overflow (not me) has reported a problem with speech_recognition grabbing the audio from a Zoom call if you run a script whilst on Zoom:
https://stackoverflow.com/questions/6…
-
We are using it on Android devices. Initially, we used the webSocket server. If the speaker is far away from the device, about 1 meter away, it will be difficult or unrecognizable.
Later, we switched…