-
I want to thank all the authors for the great work they have done on this paper.
I am trying to reproduce the LibriSpeech model training to get a better sense of how the model is training i…
-
# Supervised audio separation
### [U-Net on STFT](https://research.atspotify.com/publications/singing-voice-separation-with-deep-u-net-convolutional-networks/) (Jansson '17)
### [Wave UNet](https://…
-
Hi!
Did you experiment with training on different sampling-rate pairs, such as 8 kHz→16 kHz, 8 kHz→22 kHz, or 16 kHz→22 kHz?
(different from the [demo page](https://mindslab-ai.github.io/nuwave/))
and what changes shou…
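If it helps frame the question, the upsampling ratio implied by each pair can be computed directly (a sketch; "22K" is assumed here to mean the common 22.05 kHz rate, which is an assumption on my part):

```python
# Upsampling ratios for the candidate sampling-rate pairs.
# ASSUMPTION: "22K" means 22050 Hz (a common rate); adjust if not.
pairs = [(8000, 16000), (8000, 22050), (16000, 22050)]
for src, dst in pairs:
    ratio = dst / src
    print(f"{src} Hz -> {dst} Hz: ratio = {ratio}")
# 8 kHz -> 16 kHz is an exact 2x, while the 22050 Hz targets give
# non-integer ratios (2.75625 and 1.378125), which can matter for
# models that assume an integer upsampling factor.
```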
-
I would like to train the network on 16 kHz audio, with a frame length of 32 ms and a frame shift of 16 ms. How should I modify the preprocessing parameters? Thank you for y…
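For reference, at 16 kHz those frame settings translate into sample counts as follows (a sketch only; the actual parameter names in the preprocessing config depend on the codebase and are not shown in this thread):

```python
sample_rate = 16000      # Hz
frame_length_ms = 32     # desired analysis window
frame_shift_ms = 16      # desired hop between windows

# Convert millisecond settings to sample counts.
frame_length = sample_rate * frame_length_ms // 1000   # window size in samples
frame_shift = sample_rate * frame_shift_ms // 1000     # hop size in samples
print(frame_length, frame_shift)  # -> 512 256
```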
-
[Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU](https://xploreqa.ieee.org/document/9455356/)
GPUs are expensive and power-hungry.
-
Running the codec with full-band input (48 kHz sampling rate):
```sh
bazel-bin/encoder_main --input_path=path/to/fullband-audio/*.wav --output_dir=dir/to/bs_output --bitrate=6000
bazel-bin/decoder_main --encoded_…
```
-
Have trained the `update_v2` branch on:
* Semantic tokens extracted from HuBERT Large layer 16 with 1024-cluster k-means (`50 tok/sec`).
* Acoustic tokens extracted from EnCodec at 24 kHz sample rate, 240 ho…
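As a sanity check on those rates (assuming the truncated `240 ho…` refers to a 240-sample hop, and that the HuBERT features come from 16 kHz audio with the usual 320-sample stride — both assumptions on my part):

```python
# Token rates implied by sample_rate / hop_size.
# ASSUMPTION: HuBERT features come from 16 kHz audio with the standard
# 320-sample stride; the EnCodec hop of 240 is read from the truncated
# "240 ho…" above.
hubert_rate = 16000 / 320    # tokens per second of semantic stream
encodec_rate = 24000 / 240   # tokens per second of acoustic stream
print(hubert_rate, encodec_rate)  # -> 50.0 100.0
```

The 50 tok/sec figure quoted for the semantic tokens is consistent with the 16 kHz / 320-stride assumption.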
-
Hello All,
I have a question about GPU resource requirements for a training project I am doing with Piper.
Following the Training Guide and the video by Thorsten Müller.
Data: Single Speaker, 18,000 …
-
In the white paper, they mention conditioning on a particular speaker as an input that is conditioned globally, and the TTS component as an up-sampled (via deconvolution) input that is conditioned locally. For the latter, t…
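To make the global-vs-local distinction concrete, here is a minimal NumPy sketch with made-up shapes (the nearest-neighbor repetition below merely stands in for the learned deconvolution the paper describes):

```python
import numpy as np

T = 8          # audio timesteps (toy size)
C = 4          # channel width
speaker_emb = np.random.randn(C)          # global condition: one vector per utterance
linguistic = np.random.randn(T // 4, C)   # local condition: coarser-rate TTS features

x = np.zeros((T, C))

# Global conditioning: the same speaker vector is applied at every timestep.
x_global = x + speaker_emb[None, :]       # broadcast over the time axis

# Local conditioning: upsample the coarse features to the audio rate first.
# (A learned transposed convolution would normally do this; repetition is
# used here only to illustrate the shape change.)
upsampled = np.repeat(linguistic, 4, axis=0)   # (T//4, C) -> (T, C)
x_local = x + upsampled                        # now varies per timestep
print(x_global.shape, x_local.shape)           # both (8, 4)
```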
-
I'm doing some tests of CPU and GPU usage for prediction (`Predict.py`).
I'm using an audio file `Audio: mp3, 44100 Hz, stereo, fltp, 192 kb/s` of duration `00:03:15.29`
```sh
$ ffpro…
```