audio-spectrogram-transformer Search Results

251 results
for audio-spectrogram-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

as-ideas/TransformerTTS #38

RuntimeError: CUDA out of memory

Hey guys! I get the following error when trying to convert my spectrograms to audio when using melGAN: ``` RuntimeError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 4.00 GiB total capaci…

DanBigioi updated 4 years ago
5
dynamic-superb/dynamic-superb #83

[Task] Respiratory Sound Classification

# Task Name Respiratory Sound Classification ## Task Objective The objective of this task is to predict if an audio of respiratory sound indicates early-stage fatal lung diseases for better d…

leo5470 updated 3 months ago
4
OpenGVLab/InternVideo #129

[Help requested] Inference InternVideo2_clip model.

Hello InternVideo team, You guys have done a great job with this project! In your paper, you use the Stage 2 model for the task of temporal grounding on QVHighlight [Lei et al., 2021] and Charad…

gracikk-ds updated 1 month ago
35
elixir-nx/bumblebee #209

Support Text to Speech

Hello! As Speech to Text models such as Whisper are added having access to some of the impressive AI Text to Speech models would be a nice way to close the loop! My current suggestion for a model …

zolrath updated 5 months ago
12
huggingface/transformers #27453

Audio-MAE - ViTMAE for audio

### Model description This model is is a Self-supervised Vision Transformer that uses patch reconstruction as the spectrogram task. It extends MAE (which is already on HuggingFace) for audio. This mo…

justinluong updated 7 months ago
15
as-ideas/TransformerTTS #51

Issues replicating the examples

My predict.py: ``` from utils.config_manager import ConfigManager from utils.audio import Audio from scipy.io.wavfile import write config_loader = ConfigManager('ljspeech_autoregressive_trans…

Bardo-Konrad updated 4 years ago
4
bkraad47/fat_llama #18

[SUGGESTIONS] floating point, dequantize and more

Hi there, since I've some experiences in this field (audio delossify/upscale) I'd like to share what I have learned: - During a lossy audio treating, the best approach is to carefully decode and pr…

MarcoRavich updated 1 month ago
11
mlfoundations/open_clip #384

MuLaN

The new [MusicLM](https://arxiv.org/abs/2301.11325) relies on an audio CLIP named [MuLaN](https://arxiv.org/abs/2208.12415) I will build out an initial implementation [here](https://github.com/luci…

lucidrains updated 1 year ago
11
huggingface/optimum-neuron #640

Error: KeyError when exporting M2M100 model using optimum-cl…

I encountered an issue when trying to export the facebook/m2m100_418M model using the optimum-cli tool. The error message indicates that the m2m-100-encoder is not supported, despite m2m-100 being lis…

javaid-manzoor-lc updated 3 days ago
2
huggingface/transformers #31831

Add MultiStepLR with Warmup Scheduler

### Feature request I would like to propose the addition of a new learning rate scheduler that combines MultiStepLR with a warmup phase. Currently, the Transformers library does not include a sched…

penguinwang96825 updated 3 months ago
3

上一页 1...1 2 3 4 5 6 7...26 下一页

251 results for audio-spectrogram-transformer

251 results
for audio-spectrogram-transformer