pytorch-transformer Search Results

1000+ results for pytorch-transformer

Best match


mu-cai/matryoshka-mm #5

[Question] The usage of official weight

### Question Hello Authors, Thanks for your amazing work and for providing the trained weights at https://huggingface.co/mucai/llava-next-vicuna-7b-m3. When I download the weights and test them, something wron…

YuchenLiu98 updated 3 weeks ago
1
d8ahazard/sd_dreambooth_extension #1487

`Diffusion_pytorch_model.bin` Not Found in Expected Directory…

### Is there an existing issue for this? - [X] I have searched the existing issues and checked the recent builds/commits of both this extension and the webui ### What happened? I encountered an err…

TheRealDrCarbon updated 6 hours ago
2
apple/ml-stable-diffusion #347

Support - FLUX black-forest-labs/FLUX.1-schnell

Support to convert model black-forest-labs/FLUX.1-schnell, receive this error: after running: `python -m python_coreml_stable_diffusion.torch2coreml --convert-unet --convert-text-encoder --convert…

mgierschdev updated 2 months ago
1
QwenLM/Qwen2-VL #163

AttributeError: 'AdamW' object has no attribute 'train'（solv…

Refer to https://swift.readthedocs.io/zh-cn/latest/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.html [rank0]: File "/usr/local/lib/python3.10/site-packages/transformers/trainer.py", …

fabulousfeng updated 1 month ago
3
QwenLM/Qwen2-VL #297

CUDA out of memory for Qwen2-VL-7B-Instruct-GPTQ-Int8 on RTX…

hi, I basically followed: https://www.modelscope.cn/models/Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 and thought the `24G` gpu memory would be enough for the model: ![image](https://github.co…

shaojun updated 1 week ago
4
hint-lab/bert-relation-classification #6

pytorch_transformers became transformers

Just wanted to let you know that `pytorch_transformers` has been renamed to `transformers`, so "from pytorch_transformers import ..." becomes "from transformers import ..."

Famke2604 updated 3 years ago
1
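
The rename reported in hint-lab/bert-relation-classification #6 can be absorbed by code that has to run against either package name. The following is a minimal sketch, not taken from that repository: the helper `import_first_available` is a hypothetical name introduced here for illustration, and it relies only on the standard-library `importlib`.

```python
import importlib


def import_first_available(*module_names):
    """Return the first module in module_names that can be imported.

    Hypothetical helper (an assumption for illustration, not part of
    transformers or pytorch_transformers).
    """
    for name in module_names:
        try:
            return importlib.import_module(name)
        except ImportError:
            # This name is not installed; try the next candidate.
            continue
    raise ImportError(f"none of {module_names!r} could be imported")


# Usage sketch: prefer the current package name, fall back to the old one.
# hf = import_first_available("transformers", "pytorch_transformers")
```

Listing the new name first means an environment with both packages installed picks up `transformers`; in new code it is usually simpler to change the import line directly, as the issue suggests.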
huggingface/course #718

Transformers, what can they do?

Hello, I am going through the training and have some small ideas for improvements. Transformers, what can they do? https://huggingface.co/learn/nlp-course/en/chapter1/3 A) Curren…

jzaba123 updated 1 month ago
1
NVIDIA/apex #1849

AdamW implementation does not truly decouple learning rate a…

**Describe the bug** AdamW implementation (see [here](https://github.com/NVIDIA/apex/blob/a7de60e57f0534266841e1733262601ad76aaa74/csrc/multi_tensor_adam.cu#L333)) does not truly decouple the weight…

leenachennuru updated 22 hours ago
2
Zyphra/transformers_zamba2 #3

Error when running `pip install -e`

### System Info Platform: M3 Max OS: MacOS Sequoia ### Who can help? _No response_ ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officiall…

hg0428 updated 2 days ago
2
OpenBMB/MiniCPM-V #612

[BUG] <title>vLLM inference with v2.6 on a single 3090 reports insufficient GPU memory

### Is there an existing issue / discussion for this? - [X] I have searched the existing issues / discussions ### Is there an existing ans…

weiiWill updated 30 minutes ago
1
