-
### Question
Hello Authors,
Thanks for your amazing work and provide the trained weight in https://huggingface.co/mucai/llava-next-vicuna-7b-m3. When I download the weight and test, something wron…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits of both this extension and the webui
### What happened?
I encountered an err…
-
Support to convert model black-forest-labs/FLUX.1-schnell, receive this error:
after running:
`python -m python_coreml_stable_diffusion.torch2coreml --convert-unet --convert-text-encoder --convert…
-
Refer to https://swift.readthedocs.io/zh-cn/latest/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.html
[rank0]: File "/usr/local/lib/python3.10/site-packages/transformers/trainer.py", …
-
hi,
I basically followed: https://www.modelscope.cn/models/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
and thought the `24G` gpu memory would be enough for the model:
![image](https://github.co…
-
Just wanted to let you guys know that instead of "from pytorch_transformers import ..." it is renamed to just transformers, so "from transformers import ..."
-
Hello,
Going via the training.
Some small ideas for improvements.
#######################
Transformers, what can they do?
https://huggingface.co/learn/nlp-course/en/chapter1/3
A)
Curren…
-
**Describe the bug**
AdamW implementation (see [here](https://github.com/NVIDIA/apex/blob/a7de60e57f0534266841e1733262601ad76aaa74/csrc/multi_tensor_adam.cu#L333)) does not truly decouple the weight…
-
### System Info
Platform: M3 Max
OS: MacOS Sequoia
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officiall…
-
### 是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
- [X] 我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions
### 该问题是否在FAQ中有解答? | Is there an existing ans…