local-attention Search Results

1000+ results for local-attention

hfzhang31/A3FL #1

Some questions about model parameters.

Thank you for bringing attention to the code. In the functions 'create_global_model_copy' and 'copy_params,' only the variables of ResNet are copied, excluding BatchNorm layer (BN) statistical inf…

KoalaYan updated 2 weeks ago · 1 comment

lucidrains/performer-pytorch #49

Plain Performer, if you are working with say images or other…

Plain Performer, if you are working with say images or other modalities! ERROR: ModuleNotFoundError: No module named 'local_attention'. Where is the local_attention module?

haoshuai714 updated 3 years ago · 1 comment

InternLM/MindSearch #209

Just follow the README and meet /lmdeploy/src/turbomind/kern…

root@iZ0xiaotv8ztqk9kkzy72iZ:~/MindSearch# python3 -m mindsearch.app --lang en --model_format internlm_server --search_engine DuckDuckGoSearch INFO: Started server process [3266] INFO: Waiti…

bombert updated 1 month ago · 1 comment

vllm-project/vllm #5751

[RFC]: Support sparse KV cache framework

### Motivation For current large model inference, KV cache occupies a significant portion of GPU memory, so reducing the size of KV cache is an important direction for improvement. Recently, severa…

chizhang118 updated 15 hours ago · 16 comments

modelscope/ms-swift #1778

Problem when fine-tuning MiniCPM-V-2_6 with LoRA, merging, and then training with LoRA again

The following error is reported: ```bash RuntimeError: weight should have at least three dimensions Traceback (most recent call last): File "/mnt/bn/arnold-ghh-test/mlx/users/guihonghao/playground/ghh_swift/swift/example…

guihonghao updated 1 month ago · 6 comments

Dao-AILab/flash-attention #1132

How can I install a feasible Flash-Attention version on my Tu…

I have read the text and found that I have to install the flash-attn1.x to fit my Turing GPU, so I get the source package from github: https://github.com/Dao-AILab/flash-attention/releases?page=6. The…

eileen2003-w updated 2 months ago · 2 comments

Kyubyong/tacotron #74

Errors when running eval

I am using python 3.5. When running `python eval.py` I get ``` Graph loaded name: GeForce GTX 960 major: 5 minor: 2 memoryClockRate (GHz) 1.1775 pciBusID 0000:01:00.0 Total memory: 2.00GiB Fr…

ErfolgreichCharismatisch updated 6 years ago · 4 comments

TheLastBen/fast-stable-diffusion #928

Training Error

Traceback (most recent call last): File "train_dreambooth.py", line 822, in main(args) File "train_dreambooth.py", line 475, in main images = pipeline(example["prompt"]).images Fil…

Wxcct updated 1 year ago · 5 comments

NVIDIA/TensorRT-LLM #1904

Fail to build inference trt_llm image : make: *** [Makefile:6…

### System Info CPU: X86 Memory size: 2TB GPU Name: H20 TensorRT-LLM: 0.10.0 OS: Alibaba Cloud Linux release 3 (Soaring Falcon) GPU Driver: 550.54.15 CUDA: cuda_12.4.r12.4/compiler.33961263_0 Do…

dadaguai-jiangjun updated 2 months ago · 1 comment

kvcache-ai/ktransformers #96

Error Compile with `TORCH_USE_CUDA_DSA` to enable device-sid…

I'm trying to run a DeepSeek-V2.5 model. Command used: ```python -m ktransformers.local_chat --model_path ./DeepSeek-V2.5/ --gguf_path ../ ``` ``` Chat: hi Traceback (most recent call last): Fi…

drrros updated 9 hours ago · 5 comments
