-
Hi there, I am loading a fine-tuned Llama 2 13B model, and I get this error.
Here's part of the error:
File /usr/local/lib/python3.10/dist-packages/unsloth/models/loader.py:172, in FastLanguag…
-
Thank you for sharing the code.
I'm confused about something; I would appreciate it if you could confirm whether my understanding is correct.
1. Are you using the all heads output for the analysis?
The paper you mentioned 'Roles …
-
@jerryzh168 I think it would be beneficial to be able to load a quantized and compiled model and proceed straight to inference.
However, I am not sure what functions to use to make this happen. …
-
I reinstalled flash-attn (`pip install flash-attn==2.6.1`) in the NGC PyTorch Docker image 24.06.
When I run the training job, I get the following error:
```
Traceback (most recent call last):
File "/data1/nfs15/nfs/bigdata/zha…
-
### 🚀 The feature, motivation and pitch
I am trying to extract hidden states from the final layer of llama3-8b (i.e., the final `(batch_size, seq_length, n_emb)` tensor _before_ computing the logits). Wo…
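A minimal sketch of one way to pull that tensor out with `transformers`, via `output_hidden_states=True`. To keep the example self-contained it builds a tiny random Llama model rather than loading the real 8B checkpoint (which is gated); the config sizes below are arbitrary assumptions for the demo.

```python
import torch
from transformers import LlamaConfig, LlamaForCausalLM

# In practice you would load the real checkpoint instead, e.g.:
#   from transformers import AutoModelForCausalLM
#   model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
# Here we build a tiny random Llama so the snippet runs anywhere.
config = LlamaConfig(hidden_size=32, intermediate_size=64,
                     num_hidden_layers=2, num_attention_heads=4,
                     num_key_value_heads=4, vocab_size=128)
model = LlamaForCausalLM(config)
model.eval()

input_ids = torch.randint(0, config.vocab_size, (2, 7))  # (batch, seq_len)
with torch.no_grad():
    out = model(input_ids=input_ids, output_hidden_states=True)

# out.hidden_states is a tuple: the embedding output plus one entry per
# layer. The last entry is the final-layer output (after the final norm),
# i.e. the (batch_size, seq_length, hidden_size) tensor fed to the lm_head.
final_hidden = out.hidden_states[-1]
print(final_hidden.shape)  # torch.Size([2, 7, 32])
```

The same `out.hidden_states[-1]` indexing applies unchanged when the model is the real llama3-8b; only the hidden size differs.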
-
Hi!
I have found your work very interesting and inspiring ever since the first VAR release. However, it would be nice for such a project to implement the widely used image-conditional generation in the m…
-
Install `diffusers` first.
And then do:
```python
from diffusers import DiffusionPipeline
from optimum.quanto import quantize, freeze, qint4
import torch
ckpt_id = "ptx0/pixart-900m-1024…
-
Building upon the deliverables outlined in [issue #19](https://github.com/ibis-project/ibisml/issues/19), the objective is to enhance the coverage of ibisml machine learning preprocessing transformati…
-
An error occurred when I tried to install Transformer Engine following the official tutorial (https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/installation.html). I have tried some …
-
**Describe the bug**
I tried to quantize Qwen1.5-MoE-A2.7B-Chat with w4a16 for the vLLM PR: https://github.com/vllm-project/vllm/pull/7766
It raises the error `TypeError: forward() got multiple values for argume…`