-
- Based on [Speaker-Diarization](https://github.com/chohj1111/Speaker-Diarization)
- Dataset : [VoxCeleb](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html)
-
When transcribing a file using spx.exe with [Azure AI Speech to text containers](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-howto)
with the commandline:
…
-
🐛 Bug
when i run the code about-- quick_start_zh.md([FunASR](https://github.com/alibaba-damo-academy/FunASR/tree/main)/[docs](https://github.com/alibaba-damo-academy/FunASR/tree/main/docs)/[modelsc…
-
### Description
Train ECAPA-TDNN on adults' and children's voices
### Tasks
- [ ] explore the impact of training data window on speaker diarization performance (in a child-adult setting) and on spe…
-
## Title & Topic
- speaker diarization이 뭔지 알아보는 n부작 (아마 2부작..) 중 두번째, DNN 기반 SD
- 주제 : DNN이 접목된 speaker diarization
- Speaker Diarization with LSTM (https://arxiv.org/abs/1710.10468) 페이퍼 리뷰
##…
-
## Title & Topic
- speaker diarization이 뭔지 알아보는 n부작 (아마 2부작..) 중 첫번째 소개편!
- 타이틀 미정
- 주제 : speaker diarization의 개요, 역사, 관련 기술
- < Speaker Diarization: A Review of Recent Research >(아래 레퍼런스 링크) 많…
-
### Tested versions
pyannote.audio = 3.3.1
### System information
ubuntu
### Issue description
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained(
"pyannote/…
-
Hi
When we run this command for speaker diarization, it generates a png file. Does it generate any segment file with time frame and speaker id? If not, how do we generate it?
pythonw audioAnalysis…
-
Is there a way to return the word timestamp of a sentence?
example:
input sentence: "Hello readers,welcome!"
output:
[{
"word": "Hello",
"start_time": 0.02,
"end_time": 0.36,
},
{
"word": …
-
### Description
When running the voice activity detection tutorial on google colab (possibly other environments as well, but I've only tested on colab so far), I get a `ValidationError` when import…