-
I'd like to raise a concern about how quantization is currently handled in SpeechBrain. While training my own k-means quantizer on the last layer of an ASR model, I noticed that the interface was not …
-
## Goal
Experiment with the WhisperVQ model to get better multilingual results. Hypothesis: the current codebook has only 512 entries, which is too small a space to compress the multilingual capability.
## Learning Goa…
-
The aim of this task is to add certain well-known audio modifications that can affect the readability of the watermark:
- changing sampling rate / resampling
- time and frequency domain filtering / equal…
-
## Problem Overview
Currently it takes about 5-6 seconds to generate an audio clip shorter than 10 seconds, with a prompt audio of 10+ seconds, on a single 3090 Ti. It uses about 12 GB of VRAM and 100% GPU utilization. So seems n…
-
# TensorRT Model Optimizer - Product Roadmap
[TensorRT Model Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer) (ModelOpt)’s north star is to be the best-in-class model optimization toolki…
-
I have read the [doc](https://github.com/tensorflow/tflite-micro/blob/4b5f835e603ac33312921932d760d45a7844cc97/tensorflow/lite/micro/examples/micro_speech/README.md), but I don't know how to run audio_…
-
When my script batch processes a bunch of audio files using the approach you gave me to use a list of files and their settings when processing, if a single file fails for any reason, it prevents the t…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
# Dataset
1. Refactor the self-cognition dataset to support multilingual QAs.
# Megatron PreTrain
1. Support more Megatron models
2. Support dataset split
# Fine-tuning
1. RAG LLM training …
-
### System Info
CUDA 12.2, CentOS 7
### Running Xinference with Docker?
- [X] docker
- [ ] pip install
- [ ] installation from source …