-
### First Steps Update
- [x] **Project Initialization**: Set up the repository and created the initial README file.
- [ ] **Data Organization**: Collected and organized Telugu music into four catego…
-
In UNIT4 : Pretrained models for audio classification
We’ll load an official [Audio Spectrogram Transformer](https://huggingface.co/docs/transformers/model_doc/audio-spectrogram-transformer) checkpo…
-
# Task Name
Spoken digit recognition - AudioMNIST
## Task Objective
The task's objective is to classify audio samples of spoken digits (0-9) into their corresponding Arabic number representat…
-
Hello, I just started to intercess in this, and a week later I managed to launch the project on the Android studio and get applications for installations. But I can’t figure out how to create a JSON c…
-
# (Speech Recognition) Connectionist Temporal Classification 리뷰 및 설명 | Simon's Research Center
이 포스트는 개인적으로 공부한 내용을 정리하고 필요한 분들에게 지식을 공유하기 위해 작성되었습니다. 지적하실 내용이 있다면, 언제든 댓글 또는 메일로 알려주시기를 바랍니다. 상당 부분 r…
-
### Check for previous/existing GitHub issues/module proposals
- [X] I have checked for previous/existing GitHub issues/module proposals.
### Check this module doesn't already exist in the modul…
-
Here is the result for [SpeechTokenizer](https://github.com/ZhangXInFD/SpeechTokenizer).
The bit rate is 2kbps, following are the results:
**Results in exps/results.txt**
Codec SUPERB applica…
-
Here is the result for [SemantiCodec](https://haoheliu.github.io/SemantiCodec/)
This is a 16Khz codec with three different bit rates:
1. For token rate 100 with book size 16384 the bit rate is 1.35 …
-
# 16 kHz 2kbps
## parameter size:
encoder (including quantizer) : 29MB decoder: 40MB
### exps/results.txt
Codec SUPERB application evaluation
Stage 1: Run speech emotion recognition.
Acc: 74.…
-
Hi there 👋
Let's translate the course to `pt-BR` so that the whole community can benefit from this resource 🌎!
Below are the chapters and files that need translating - let us know here if yo…
rrg92 updated
2 months ago