-
### Feature request
Would be great if we could export `audio-spectrogram-transformer`-type models using Optimum to ONNX. Right now, I get this error:
```
(transformers-v2) victor@Victors-MBP Desk…
-
How can I obtain the pretrained_teacher model myself instead of using the ones you provided?
-
Hi there,
Thank you for the great work!
I have run into a problem.
In the Google Colab environment:
```
!git clone https://github.com/FasterDecoding/Medusa.git
%cd Medusa
!pip install -e .
!pyth…
-
Dear Mr. Gong,
Thanks a lot for your pioneering work in the field of audio processing, and for your warm-hearted replies every time.
I have a question about using the MixUp method in AST. Since I saw the cod…
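
For readers unfamiliar with MixUp, here is a minimal sketch of the generic technique: two examples and their label vectors are blended with a single weight drawn from a Beta distribution. This is only an illustration under assumptions, not necessarily how the AST code applies it. The function name `mixup`, the `alpha` value, and the toy inputs are all hypothetical.

```python
import random

def mixup(x1, y1, x2, y2, alpha=0.4, rng=random):
    """Blend two examples and their label vectors with one Beta-sampled weight.

    Generic MixUp sketch (hypothetical helper, not taken from the AST code).
    """
    lam = rng.betavariate(alpha, alpha)  # mixing weight in [0, 1]
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x1, x2)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y1, y2)]
    return x, y, lam

# Toy inputs (made-up values): a constant signal and silence, with one-hot labels.
x, y, lam = mixup([1.0, 1.0, 1.0, 1.0], [1.0, 0.0],
                  [0.0, 0.0, 0.0, 0.0], [0.0, 1.0])
```

The mixed label `y` is a soft label whose entries still sum to 1, so it is typically paired with a loss that accepts soft targets.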
-
Thank you for the code and inference script.
I understand that the PaSST model was trained on AudioSet at a sampling rate of 32 kHz.
I am trying to run inference using the pretrained model.
…
-
I want to convert the table detection model from DETR to ONNX. The models available on HF include "nielsr/detr-table-detection" and "microsoft/table-transformer-detection".
I tried both and with…
-
It seems to me that there is currently no way to define a sweep where some parameters depend on another one. Is that correct?
What I have in mind is something like this example from `hyperopt` ([sou…
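
To make the idea of dependent parameters concrete, here is a minimal plain-Python sketch of a conditional search space: the top-level choice decides which second-level parameters get sampled at all. It does not use the sweep tool's API or `hyperopt`. The function `sample_config`, the parameter names, and the ranges are all hypothetical.

```python
import random

def sample_config(rng):
    """Draw one configuration; second-level parameters depend on `optimizer`.

    Hypothetical search space, for illustration only.
    """
    optimizer = rng.choice(["sgd", "adam"])
    # Learning rate is sampled log-uniformly for every optimizer.
    config = {"optimizer": optimizer, "lr": 10 ** rng.uniform(-5, -1)}
    if optimizer == "sgd":
        # Only SGD gets a momentum value.
        config["momentum"] = rng.uniform(0.0, 0.99)
    else:
        # Only Adam gets a beta2 value.
        config["beta2"] = rng.uniform(0.9, 0.999)
    return config

rng = random.Random(0)  # fixed seed so the draws are repeatable
configs = [sample_config(rng) for _ in range(20)]
```

A flat grid cannot express this directly, because `momentum` has no meaning when `optimizer` is `adam`. That is the gap the question points at.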
-
### Model/Pipeline/Scheduler description
TorToise is a multi-voice text-to-speech system that applies recent advances in the image generative domain to speech synthesis. It would…
-
I thought we could start a discussion on what spectrogram augmentation we'd like to see in the project, and how it should work.
We already had some design discussion about this in #29. Having augmentation do…
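
As one concrete reference point for the discussion, here is a minimal sketch of frequency and time masking in the style of SpecAugment, written in plain Python on nested lists. It is an assumption about what such an augmentation could look like, not a proposal for the project's API. The function name `mask_spectrogram` and its parameters are hypothetical.

```python
import random

def mask_spectrogram(spec, max_freq_width, max_time_width, rng):
    """Return a copy of `spec` with one frequency band and one time span zeroed.

    `spec` is a list of frequency rows, each a list of time frames.
    Hypothetical helper, for illustration only.
    """
    n_freq, n_time = len(spec), len(spec[0])
    out = [row[:] for row in spec]  # copy so the input is left untouched

    # Frequency mask: zero a random band of consecutive rows.
    f_width = rng.randint(0, min(max_freq_width, n_freq))
    f_start = rng.randint(0, n_freq - f_width)
    for f in range(f_start, f_start + f_width):
        out[f] = [0.0] * n_time

    # Time mask: zero a random span of consecutive columns.
    t_width = rng.randint(0, min(max_time_width, n_time))
    t_start = rng.randint(0, n_time - t_width)
    for row in out:
        for t in range(t_start, t_start + t_width):
            row[t] = 0.0
    return out

spec = [[1.0] * 10 for _ in range(8)]  # toy 8-bin x 10-frame spectrogram
masked = mask_spectrogram(spec, max_freq_width=2, max_time_width=3,
                          rng=random.Random(0))
```

Returning a copy rather than masking in place keeps the transform easy to compose with others. Whether that matters depends on the design the discussion settles on.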
-
I want to train only audioldm1, not audioldm2. How can I do that?