linear-attention-model Search Results

1000+ results
for linear-attention-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/text-generation-inference #2388

[BUG] Running FP8 quantized model fails on NVIDIA L4 (repack…

### System Info - **Hardware**: AWS g6.12xlarge (us-east-2) / 4x NVIDIA L4 GPU - **OS**: Ubuntu 24.04 LTS (Noble Numbat) - **NVIDIA Driver**: nvidia-open 560.28.03 - **CUDA**: 12.6 - **Docker**: …

DrNochi updated 2 weeks ago
5
huggingface/diffusers #7864

MotionMaster: Training-free Camera Motion Transfer For Video…

### Model/Pipeline/Scheduler description Currently, most existing camera motion control methods for video generation with denoising diffusion models rely on training a temporal camera module, and nec…

clarencechen updated 3 weeks ago
1
vllm-project/vllm #7115

[Bug]: PaliGemma detection task is failing

### Your current environment ```text Collecting environment information... PyTorch version: 2.3.1+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A …

nph4rd updated 8 hours ago
8
abetlen/llama-cpp-python #1717

How to use this model?

llama_model_loader: loaded meta data with 32 key-value pairs and 219 tensors from /data/huggingface/hub/models--city96--t5-v1_1-xxl-encoder-gguf/snapshots/005a6ea51a7d0b84d677b3e633bb52a8c85a83d9/./t5…

dzy1128 updated 1 month ago
2
EleutherAI/lm-evaluation-harness #2335

Dynamical prompt with extremely promising results #RIPrompt

This is a little bit of a plug, so I'll keep it short! I'm trying to nail down _**exactly** what's going on here_. https://riprompt.com https://riprompt.com/riprompt.txt https://chatgpt.com/g/g-9…

anthonyrisinger updated 3 days ago
1
karpathy/minGPT #135

What is the purpose of `c_proj` here?

https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/mingpt/model.py#L42 Why do we need an additional linear transformation after the MHA and before the MLP when the dim…

brynhayder updated 5 months ago
1
wejoncy/QLLM #139

llama-2-7b-chat gptq quantize & onnx export fail: RuntimeErr…

Thanks for sharing work for LLM quantization & onnx export. I follow the script in '[Convert to onnx model](https://github.com/wejoncy/QLLM?tab=readme-ov-file#convert-to-onnx-model)' section, and g…

lifelongeeek updated 2 weeks ago
1
lucidrains/taylor-series-linear-attention #2

Replicating Results?

Thank you for the code! I've been using it as a reference for my own implementation. Have you replicated the results in the original blogpost..? Based on your update in the readme, it seems like you h…

fattorib updated 8 months ago
24
tencent-ailab/IP-Adapter #168

model loading errors when testing a trained model

Hi, I have trained a new model but meet errors when testing, I did it as: 1. train a model with: ``` accelerate launch --num_processes 2 --multi_gpu --mixed_precision "fp16" \ tutorial_train.py …

qpc1611094 updated 1 month ago
9
hailo-ai/hailo_model_zoo #108

How to parse CLIP to HAR?

I noticed that CLIP is already present in the Hailo Model Zoo, which suggests that conversion is possible. [link](https://github.com/hailo-ai/hailo_model_zoo/blob/833ae6175c06dbd6c3fc8faeb23659c9efaa2…

jayong-sv updated 1 month ago
2

上一页 1...8 9 10 11 12 13 14...100 下一页

1000+ results for linear-attention-model

1000+ results
for linear-attention-model