-
10分钟的音频 依然没有时间戳
模型;whisper-large-v3
## ❓ Questions and Help
from funasr import AutoModel
model = AutoModel(
model="iic/Whisper-large-v3",
vad_model="iic/speech_fsmn_vad_zh-cn-16k-com…
-
# Speech Summary Matching
Speech summarization refers to the process of condensing spoken language into a shorter version while retaining its essential meaning and key points. Speech summarization…
-
A custom dataset was generated for the basis of the words that are read incorrectly and the v1 large model was trained on the text to voice and I received the output of the pth file and loaded the mod…
-
### Description
What Does RMSEnergyExtractor Do?
Calculates RMS Energy:
RMS energy is a measure of the power of an audio signal. It is computed as the square root of the average of the squared …
-
Dataloader name: `lsvsc/lsvsc.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?lsvsc
| Dataset| lsvsc |
|-------------|---|
| Description | A large-scale Vietnamese speec…
-
Hello,
i want to build a simple offline hotword detection and tried your example script:
```
from pocketsphinx import LiveSpeech
speech = LiveSpeech(lm=False, keyphrase='forward', kws_thre…
-
Hi, thank you for all the work you did with the models training.
Recently I discovered quite good speech corpus for Japanese language: https://github.com/laboroai/LaboroTVSpeech. Could you please b…
-
https://github.com/IBM/build-custom-stt-model-with-diarization/blob/master/README.md#2-create-watson-speech-to-text-service
Standard Plan not available anymore
➜ build-custom-stt-model-with-di…
-
### Verified issue
- [X] Someone from the team allowed me to create an issue here
### Issue Content
Hi Everyone,
Thanks for your interest in LearnHouse, this is a good first issue for anyon…
-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-5.15.0-124-generic-x86_64-with-glibc2.31
- Python version: 3.9.5
- Huggingface_hub version: 0.26.2
- Safetensors version: 0.4.…