-
V100 cannot use flash attention, so I changed to using eager to calculate attention,
self.self_attn = IDEFICS_VISION_ATTENTION_CLASSES["eager"](config)
but the following error occurred:
…
-
I am trying to run an example code of flant5-small as follows.
System: Windows
Device: GPU
```
import torch
import intel_extension_for_pytorch as ipex
from bigdl.llm.transformers import AutoMo…
-
**Issue**
MakeHuman crashes upon startup with a segmentation fault when run on Wayland.
**Expected behavior**
MakeHuman starts up without any issue
**System information**
System: Ubuntu 22.04.3
…
-
warnings.warn("torch.distributed.reduce_op is deprecated, please use "
Traceback (most recent call last):
File "tools/train_net.py", line 180, in
main()
File "tools/train_net.py", line 1…
-
跑了web_demo.py
报错了
```bash
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/gradio/routes.py", line 437, in run_predict
output = await app.get_blocks().proces…
-
File "c:\users\administrator\appdata\local\programs\python\python38\lib\site-packages\paddlenlp\transformers\ernie_gen\modeling.py", line 388, in from_pretrained
model.set_state_dict(m)
File …
-
## ❓ Questions and Help
Traceback (most recent call last):
File "webcam.py", line 80, in
main()
File "webcam.py", line 71, in main
composite = coco_demo.run_on_opencv_image(img)
F…
-
Is there support for llama3.2 with TensorRT-LLM? I tried engine build but got a rope error? Maybe it is related to the context length? Thanks.
-
When I try to merge two Yi-34B-chat into one MoE model, in the last step expert prompts: I get the following error:
`RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling cublasCreate(…
-
Traceback (most recent call last):
File "tools/train_net.py", line 186, in
main()
File "tools/train_net.py", line 179, in main
model = train(cfg, args.local_rank, args.distributed)
…