-
I am a graduate student from China, and our team recently had the privilege of studying your article on the 'Audio Spectrogram Transformer'. We were truly impressed by the content and scope of your wo…
-
I apologize for asking repeatedly.
Section 5-A (Datasets) of the paper describes the data preprocessing method.
The audio uses a 16 kHz sampling rate and 4-second clips.
N=1024, H=64, window size=…
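
For reference, the arithmetic implied by these settings can be sketched as below. This is only a rough sketch, and it assumes that `N` is the FFT size and `H` is the hop length in samples, which the text does not confirm; the function name `stft_shape` is hypothetical, and the window size is left out because its value is cut off above.

```python
def stft_shape(sample_rate, seconds, n_fft, hop, center=True):
    """Return (freq_bins, frames) for a one-sided STFT.

    Assumes n_fft is even. With center=True the signal is padded by
    n_fft // 2 samples on each side, a common library convention.
    """
    n_samples = int(sample_rate * seconds)
    freq_bins = n_fft // 2 + 1  # one-sided spectrum
    if center:
        frames = 1 + n_samples // hop
    else:
        frames = 1 + (n_samples - n_fft) // hop
    return freq_bins, frames


# Assumed reading of the settings above: 16 kHz, 4 s, N=1024, H=64.
print(stft_shape(16000, 4, 1024, 64))
print(stft_shape(16000, 4, 1024, 64, center=False))
```

Under that reading, a clip holds 64,000 samples, so the frame count depends mainly on the hop length and on whether the signal is padded.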
-
Hello,
Immense thanks to everyone who worked on this project; it's really great. There's of course still room for improvement, but I think this is a step forward in terms of OSS TTS, so thanks…
-
It looks like a file is missing.
Unrecognized model in D:\LIUGEGE\ComfyUI\models\Joy_caption_alpha\text_model. Should have a `model_type` key in its config.json, or contain one of the following strings in its name: albert, a…
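
The error says that `config.json` in the model folder should contain a `model_type` key. A minimal sketch for checking that is shown below; the helper names `load_config` and `has_model_type` are hypothetical, and the folder path in the usage comment is a placeholder, not the path from the report.

```python
import json
from pathlib import Path


def load_config(model_dir):
    """Read config.json from a model folder.

    Raises FileNotFoundError if the file itself is missing.
    """
    config_path = Path(model_dir) / "config.json"
    return json.loads(config_path.read_text(encoding="utf-8"))


def has_model_type(config):
    """Return True if the config dict has a non-empty string `model_type`."""
    value = config.get("model_type")
    return isinstance(value, str) and bool(value)


# Usage (placeholder path, an assumption for illustration):
# config = load_config("path/to/text_model")
# print(has_model_type(config))
```

If `load_config` raises `FileNotFoundError`, the folder has no `config.json` at all, which would be consistent with an incomplete download.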
-
### System Info
CUDA Version: 12.2
Transformers: 4.45.1
Python: 3.10.12
Operating system: Ubuntu
vllm: 0.6.2
### Who can help?
_No response_
### Information
- [X] The official exa…
-
Hi!
First of all, amazing work! I'm trying to load the model with the pretrained weights from HF, but I receive an error when doing so.
My first attempt:
`model = AutoModelForSeq2SeqLM.from…
-
This issue contains the test results for the upstream sync, develop PR, and release testing branches. Comment 'proceed with rebase' to approve. Close when maintenance is complete or there will be prob…
-
Hello. Do you know how to turn this: https://github.com/nivibilla/build-nanogpt into TTS instead of audio-to-audio?
-
### Feature request
Support is [already present in huggingface/transformers](https://github.com/huggingface/transformers/pull/27662).
But when I try to export a LLaVA model to neuron format, i…
-
### Describe the bug
I installed text generation webui and downloaded the model (TheBloke_Yarn-Mistral-7B-128k-AWQ), but I can't run it. I chose Transformers as the model loader. I tried installing autoawq b…