-
### Checklist
- [ ] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused by a …
-
Flash attn 2.5.7 always complains about the input data type even when it's clearly a correct one.
I'm using the base image `nvcr.io/nvidia/pytorch:24.03-py3`
```
>>> import torch, flash_attn
>>>…
-
`The model weights are not tied. Please use the `tie_weights` method before using the `infer_auto_device` function.
╭─────────────────────────────── Traceback (most recent call last) ────────────────…
-
As discussed in #74, the current way to model an external service that a service within the mesh can route to is by modeling the external service as a VirtualNode. For example, if you had two service…
-
Traceback (most recent call last):
File "bert.py", line 651, in
main()
File "bert.py", line 541, in main
cache_dir=PYTORCH_PRETRAINED_BERT_CACHE / 'distributed_{}'.format(args.local_r…
-
**Describe**
I am using LayoutLM V2 model. I am trying to finetune the the model by using my custom dataset. I got bellow error message.
Please tell me how to resolve the error.
you can download…
-
**Describe the bug**
```
2023-05-31 11:33:20 INFO [auto_gptq.modeling._base] Quantizing mlp.dense_4h_to_h in layer 7/70...
Traceback (most recent call last):
File "quant_with_alpaca.py", line 17…
-
tokenizer: `moka-ai/m3e-base`
错误:
```shell
Traceback (most recent call last):
File "webui_xbl_stable.py", line 449, in
model_status = init_model()
File "webui_xbl_stable.py", line 166…
-
I am using trtllm 0.8.0 (added moe support following llama's implementation). we serve models with trtllm_backend (docker images triton-trtllm-24.02)
[qwen2-moe-57B-A14B](https://huggingface.co/Qwe…
-
## ❓ Questions and Help
I followed the install.md provided in git source and https://github.com/facebookresearch/maskrcnn-benchmark/issues/1042, and everything works fine until the build. The build…