-
Thank you for bringing attention to the code.
In the functions `create_global_model_copy` and `copy_params`, only the ResNet variables are copied, excluding BatchNorm layer (BN) statistical inf…
-
Plain Performer, if you are working with, say, images or other modalities!
ERROR: ModuleNotFoundError: No module named 'local_attention'
Where is the `local_attention` module?
-
root@iZ0xiaotv8ztqk9kkzy72iZ:~/MindSearch# python3 -m mindsearch.app --lang en --model_format internlm_server --search_engine DuckDuckGoSearch
INFO: Started server process [3266]
INFO: Waiti…
-
### Motivation
For current large model inference, KV cache occupies a significant portion of GPU memory, so reducing the size of KV cache is an important direction for improvement. Recently, severa…
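
To make the memory claim concrete, here is a rough back-of-the-envelope sketch of how KV cache size scales. The function name `kv_cache_bytes` and the model configuration (32 layers, 32 KV heads, head dimension 128, fp16) are illustrative assumptions, not taken from any specific model or framework in this issue.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_elem=2):
    """Bytes needed to cache keys and values for every layer and token."""
    # Factor 2: one tensor for keys and one for values per layer.
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


# Hypothetical configuration (assumption): 32 layers, 32 KV heads,
# head_dim 128, fp16 elements (2 bytes each).
per_token = kv_cache_bytes(32, 32, 128, seq_len=1)
full_context = kv_cache_bytes(32, 32, 128, seq_len=4096)

print(per_token / 2**20, "MiB per token")
print(full_context / 2**30, "GiB for one 4096-token sequence")
```

Under these assumptions the cache grows linearly with sequence length and batch size, so lowering `num_kv_heads` or `bytes_per_elem` (a smaller element type) shrinks it proportionally.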
-
I get the following error:
```bash
RuntimeError: weight should have at least three dimensions
Traceback (most recent call last):
File "/mnt/bn/arnold-ghh-test/mlx/users/guihonghao/playground/ghh_swift/swift/example…
-
I read the text and found that I need to install flash-attn 1.x for my Turing GPU, so I downloaded the source package from GitHub: https://github.com/Dao-AILab/flash-attention/releases?page=6. The…
-
I am using Python 3.5. When running `python eval.py` I get:
```
Graph loaded
name: GeForce GTX 960
major: 5 minor: 2 memoryClockRate (GHz) 1.1775
pciBusID 0000:01:00.0
Total memory: 2.00GiB
Fr…
-
Traceback (most recent call last):
  File "train_dreambooth.py", line 822, in <module>
    main(args)
  File "train_dreambooth.py", line 475, in main
    images = pipeline(example["prompt"]).images
Fil…
-
### System Info
CPU: X86
Memory size: 2TB
GPU Name: H20
TensorRT-LLM: 0.10.0
OS: Alibaba Cloud Linux release 3 (Soaring Falcon)
GPU Driver: 550.54.15
CUDA: cuda_12.4.r12.4/compiler.33961263_0
Do…
-
I'm trying to run a DeepSeek-V2.5 model.
Command used: `python -m ktransformers.local_chat --model_path ./DeepSeek-V2.5/ --gguf_path ../`
```
Chat: hi
Traceback (most recent call last):
Fi…