-
### Feature request
The Whisper processor does not currently rescale input audio to the [-1, 1) range that it expects.
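
As a rough illustration of the kind of rescaling being requested, the sketch below maps 16-bit PCM samples to floats in [-1, 1). The helper name `rescale_int16_to_float` is hypothetical and is not part of the Whisper processor; the example assumes only NumPy and int16 input.

```python
import numpy as np


def rescale_int16_to_float(samples: np.ndarray) -> np.ndarray:
    """Hypothetical helper: map int16 PCM samples to float32 in [-1, 1)."""
    if samples.dtype != np.int16:
        raise TypeError("expected int16 PCM samples")
    # int16 spans [-32768, 32767], so dividing by 32768 yields [-1, 1).
    return samples.astype(np.float32) / 32768.0


# Assumed example input: a few int16 samples covering the full range.
pcm = np.array([-32768, 0, 16384, 32767], dtype=np.int16)
audio = rescale_int16_to_float(pcm)
```

Other input types (for example int32 or already-normalized floats) would need their own scale factor or a pass-through check.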
### Motivation
Consistency between model processor layers.
### Your contribution
-
-
I'm currently using the colab at https://colab.research.google.com/github/teticio/audio-diffusion/blob/master/notebooks/gradio_app.ipynb
Steps to reproduce:
1. Run the first cell
```
try:
#…
-
Thank you for your great work and sharing it!
Do you have any recommendation to use your models to label audio at a higher resolution, say 1 sec or lower? Or even mel frame level?
I've tried app…
-
System: Windows 10
Python: 3.8.10
requirements.txt:
absl-py==1.4.0
audioread==3.0.0
certifi==2022.12.7
cffi==1.15.1
charset-normalizer==3.0.1
colorama==0.4.6
contourpy==1.0.7
cycler==0.11.0…
-
# 🌟 New model addition
## Model description
In the past decade, convolutional neural networks (CNNs) have been widely adopted as the main building block for end-to-end audio classification model…
-
-
I'm excited to try this out!
I attempted to train, feeding in a MockTextAudioDataset similar to the example on AudioLM's page (that worked with the semantic trainer there), but encountered the foll…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
Hello everyone, I used Stable Diffusion this morning and …
-
### Feature request
Hi, it seems some audio models expect `(batch_size, feature_size, n_frames)` (e.g. whisper) while others expect `(batch_size, sequence_length)` (e.g. wavlm, wav2vec2). Could thi…
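
To make the two input layouts concrete, here is a minimal sketch using placeholder NumPy arrays. The sampling rate, clip length, number of mel bins, and hop length are assumptions chosen for illustration, not values taken from this issue.

```python
import numpy as np

batch_size = 2
sample_rate = 16_000  # assumed sampling rate in Hz
seconds = 30          # assumed clip length

# Raw-waveform layout (e.g. wavlm, wav2vec2): (batch_size, sequence_length)
waveform_batch = np.zeros((batch_size, sample_rate * seconds), dtype=np.float32)

# Spectrogram layout (e.g. whisper): (batch_size, feature_size, n_frames)
feature_size = 80  # assumed number of mel bins
hop_length = 160   # assumed hop of 10 ms at 16 kHz
n_frames = (sample_rate * seconds) // hop_length
spectrogram_batch = np.zeros((batch_size, feature_size, n_frames), dtype=np.float32)

print(waveform_batch.shape, spectrogram_batch.shape)
```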
-
discord, 04/15/2023:
gbs-c in custom preset -> passthrough, playing nintendont, went black for a second and restored
unsure if crt or gbs-c failed
no message in logs (computer was awake but locke…