-
how we record & transcribe now:
1. record chunk of audio of 30s on each device
2. use local voice activity detection model to extract speech frames, if not enough, skip transcription
3. transcrib…
-
I've recently met the issue with kaldi diarization that it doesn't work for short files (
-
Speaker diarization is where you annotate a transcript by noting which words were spoken by which speakers.
There are tools in Python that do this. It would be great to try them out and see if any …
-
```
Hi,
I'm trying to process an audio file but I always get the same error: "IOError:
File test_voices_.i.seg empty"
The audio file is PCM16, I'm using ubuntu 12.10, and I have replaced the
sphin…
-
```
Hi,
I'm trying to process an audio file but I always get the same error: "IOError:
File test_voices_.i.seg empty"
The audio file is PCM16, I'm using ubuntu 12.10, and I have replaced the
sphin…
-
I have a 5 second audio file with two different speakers one after the other. It looks like it is able to recognize the two speakers but the time stamps must be wrong because it says that speaker 2 st…
-
Seems to be a problem with Waveform preprocessor.
Nothing related to this PR.
Merging but will need to have a look at this.
_Originally posted by @hbredin in https://github.com/pyannote/pyannote-…
-
I am the creator of [`pyannote.audio`](https://github.com/pyannote/pyannote-audio) speaker diarization toolkit.
I understand that you went with @josepatino's PyBK because of its speed but I'd love …
-
The preprocess_wav function will trim out the silences in the audio file, and return a new wav, but it did not return information about which segments were cut. If we are dealing with a video, the aud…
-
Generally, it's not a low-hanging fruit to fine-tune language model yet. Better/cheaper techniques are needed.
Creative articulator (CA) project allows synthesizing summary-to-original-text dataset…