audio-quantization Search Results

speechbrain/speechbrain #2764

SpeechBrain Quantization refactoring

I'd like to raise a concern about how quantization is currently handled in SpeechBrain. While training my own k-means quantizer on the last layer of an ASR model, I noticed that the interface was not …

Adel-Moumen updated 1 day ago

janhq/ichigo #120

planning: Train our own Quantizer for multilingual speech to…

## Goal Experiment on WhisperVQ model for better result on multilingual. Hypothesis the current codebook is only 512 which is a small space to compress the multilingual capability. ## Learning Goa…

hahuyhoang411 updated 36 minutes ago

gukush/audio-watermark-242 #14

Add support for Audio modifications/distortions

The aim of this task is to add certain well known audio modficiations which can impact readability of watermark: - changing sampling rate / resampling - time and frequency domain filtering / equal…

gukush updated 2 weeks ago

open-mmlab/Amphion #344

[Help]: Is there anyway to speed up inferencing of MaskGCT

## Problem Overview Currently it takes about 5-6 seconds to generate an audio below 10 seconds with prompt audio about 10+ second on one 3090 ti. It takes about 12G VRAM and 100% GPU util. So seems n…

treya-lin updated 1 week ago

NVIDIA/TensorRT-Model-Optimizer #108

[RFC] TensorRT Model Optimizer - Product Roadmap

# TensorRT Model Optimizer - Product Roadmap [TensorRT Model Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer) (ModelOpt)’s north star is to be the best-in-class model optimization toolki…

hchings updated 2 days ago

tensorflow/tflite-micro #2926

Micro Speech:How to run audio_preprocessor.py independently?

I have read the [doc](https://github.com/tensorflow/tflite-micro/blob/4b5f835e603ac33312921932d760d45a7844cc97/tensorflow/lite/micro/examples/micro_speech/README.md),but I don't know how to run audio_…

ctwillson updated 3 days ago

shashikg/WhisperS2T #50

Handle batch processing when few files fails in the whole ba…

When my script batch processes a bunch of audio files using the approach you gave me to use a list of files and their settings when processing, if a single file fails for any reason, it prevents the t…

BBC-Esq updated 2 months ago

hashicorp/terraform-provider-aws #11190

Feature Request: MediaConvert Preset Resource

### Community Note * Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…

bflad updated 1 month ago

modelscope/ms-swift #1617

SWIFT 2.4 TO DO LIST

# Dataset 1. Refactor the self cognition dataset to support multi-lingual QAs. # Megatron PreTrain 1. Support more Megatron models 2. Support dataset split # Fine-tuning 1. RAG LLM training …

tastelikefeet updated 1 month ago

xorbitsai/inference #2554

1000+ results
for audio-quantization