-
Events
client
session.update
00:00.88
client
conversation.item.create
00:00.88
client
response.create
00:00.88
server
session.created
00:00.96
server
session.updated
00:00.97
server…
-
Hi, This is a very excellent work. And I use it in a speech translation work. But I find it that the _whisper_online_server.py_ runs best at the first 30-seconds. But from the 30s to more buffers, the…
-
I have encountered an issue with the voice assistant when synthesizing Chinese text. The time interval between LLM and synthesized speech outputs is noticeably longer when the output is in Chinese co…
-
https://github.com/facebookresearch/fairseq/tree/main/examples/mms
How can we use Meta's model for:
* Transcribing or playing back speech community checking comments
* Playing the audio of the commun…
-
```
from datasets import load_dataset
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
import soundfile as sf
import torch
from jiwer import wer
librispeech_eval = load_dataset("li…
-
### Checklist
- [X] I believe the idea is awesome and would benefit the framework
- [X] I have searched in the issue tracker for similar requests, including closed ones
### Description
Telegram Pre…
-
review it!
-
I successfully made your pipeline example run on my Mac. I did not expect to meet an assistant, but understand a bit more now about the intention of this project.
I would like to build a pipeline …
-
Another small issue. Sometimes the transcriptions are all shifted significantly forward in time. So that the transcription occurs seconds before the speech. This usually adjusts itself later in the tr…
-
Hi, I’m having trouble understanding your guidelines. Could you assist me in integrating a transcriber for BigBlueButton (BBB) using the Whisper-large-v3 model?
I’d like it to function similarly to…