-
# Task Name
Speech language classification
## Task Objective
The task requires the model to classify the language of the speech. The model needs to classify which the language of the speech …
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…
-
想使用sencevoice模型进行asr,并且区分说话人。加载一直报错。
加载模型代码如下
from funasr import AutoModel
from funasr.utils.postprocess_utils import rich_transcription_postprocess
model = AutoModel(model="/data/asr/SenseVoice…
-
# Task Name
Multilingual Speech to Speech Translation (s2st): converting speech from one language directly into speech in another language. This task requires the model to have strong multilingual …
-
when I use
```
echogarden align-transcript-and-translation 01.mp3 01.txt 01_translate.txt 01.json 01.srt
```
I got
```
Echogarden v1.5.0
Start stage 1: Align speech to transcript
Transcode…
-
Hello,
I successfully trained a model in Spanish using Spanish + English data! Great news.
The model unfortunately speaks with an accent when I use an English prompt with English source text and t…
-
### 🥰 需求描述
当前每次点击播放按钮都会发送一个请求获取音频,即使是完全相同的内容
可不可以增加一个缓存功能,相同的内容直接调取缓存?
### 🧐 解决方案
对朗读内容、语音速度、语音模型等进行哈希,储存在localStorage中
### 📝 补充信息
_No response_
-
# Multi-Lingual Speech Recognition
## Task Objective
Automatic Speech Recognition (ASR) is the task to transcribe the content of speech into text. In this multi-lingual ASR task, our objective i…
-
Bad documenttaion. not very long errors
Detecting toxicity in outputs generated by Large Language Models (LLMs) is crucial for ensuring that these models produce safe, respectful, and appropriate con…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues]([https://github.com/langgenius/dify/issues](https://github.com/fishaudio/fish-speech/issues)), including closed …