-
Please consider implementing Meta's MMS with speech recognition support for over 1000 languages at a drastically reduced error rate compared to Whisper:
![image](https://github.com/kaixxx/noScribe/…
-
### ML-Crate Repository (Proposing new issue)
:red_circle: **Project Title** : German Italian Speech Analysis
:red_circle: **Aim** : Analyse and visualize different aspects of the German and Italian…
-
I'm using version v0.6.0 on Windows. Running test.py on its own works without problems. Following asrserver.py, I rewrote a simple API service based on Flask, as follows:
import json
from flask import Flask, request, jsonify
from SpeechModel251 import ModelSpeech
from LanguageModel2 i…
-
# Project Description
## Project Overview
Develop a _Speech to Text conversion_ system using **Azure AI Speech Studio** and Azure Cognitive Services. Leverage Azure's speech recognition services to …
-
# Nonce word detection
Wug testing is a linguistic experiment in which a speaker (often a child) is asked to morphologically inflect a word that does not exist in the language (a nonce word) such as…
-
I am trying to align a new language (Sinhalese) using MFA, but I only have a speech corpus and no pronunciation dictionary. I was going through [https://montreal-forced-al…
-
Hello everyone, below is my code for fine-tuning XTTS for a new language. It works well in my case with over 100 hours of audio.
https://github.com/nguyenhoanganh2002/XTTSv2-Finetuning-for-New-Lang…
-
### Describe the issue
I'm using the OpenAI Whisper model with onnxruntime.
When running the medium model with the DirectML execution provider, it fails with this error:
```console
2024-08-21 00:45:47.…
-
### Describe the bug
Hi everyone. I'm new to the world of ML, so I'm not used to training AI models...
I really want to create my own TTS model using coqui's VITS trainer, so I've done a lot of re…
-
[MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research](https://arxiv.org/pdf/2406.18301)
The above paper has just open-sourced a dataset fo…