linear-attention-model Search Results

1000+ results
for linear-attention-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mashaan14/VisionTransformer-MNIST #1

Fundamental error in the code

The following line of code in notebook I believe is incorrect: `transformer_input_expanded = model.transformer[0].linear[0](transformer_input)[0]` This is taking the hidden state of the MLP ('li…

HugoFry updated 2 months ago
3
arXiv/html_feedback #2216

Redering errors

### Description This excerpt, as well as others in the article Mamba: Linear-Time Sequence Modeling with Selective State Spaces, have rendering errors ### (Optional:) Please add any files, screensho…

LisandraMoura updated 1 week ago
2
pytorch/pytorch #136270

Training (backward) crashes when using `torch.narrow`, neste…

### 🐛 Describe the bug Using nested tensors generated with `torch.narrow` as inputs to `torch.nn.functional.scaled_dot_product_attention` works fine in the forward pass of the model. However, both …

davidbuterez updated 2 weeks ago
1
Vitek-Lab/MSstats #131

check replicate still false

hi msstats team . I'm not sure this code is intended to confirm the existence of technical replicates of the data. But using **all** will return false for my result. https://github.com/Vitek-Lab/MS…

YoujiaMa updated 1 month ago
2
IBM/fastfit #20

The model 'FastFit' is not supported for text-classification

```from fastfit import FastFit model = FastFit.from_pretrained("fast-fit") model ``` gives ``` FastFit( (encoder): MPNetModel( (embeddings): MPNetEmbeddings( (word_embedding…

daboe01 updated 3 months ago
1
huggingface/transformers #33900

Modular converter ignores my `Config` and my `ModelOutput` c…

### System Info - `transformers` version: 4.46.0.dev0 - Platform: macOS-15.0-arm64-arm-64bit - Python version: 3.11.6 - Huggingface_hub version: 0.25.1 - Safetensors version: 0.4.5 - Accelerate …

tonywu71 updated 1 week ago
2
tenstorrent/tt-metal #13368

Llama 3.2

Bring up Llama 3.2 model family on Wormhole, T3K and TG

yieldthought updated 1 day ago
2
ucinlp/autoprompt #61

RuntimeError: CUDA error

Dear author, I am sure that all the versions of my packages are correct. I used CUDA version 10.1 to adapt to Torch version 1.4. However, I meet an error when Evaluation as follows: Traceback (most…

enhaohuang updated 1 month ago
1
wutaiqiang/MoSLoRA #3

Low-quality image output from subject_driven_generation

Following the `README.md`, I tested the `subject_driven_generation`: ```bash sh train_sdxl_lora_cat.sh python3 infer.py ``` and got low-quality images from mixer model, while the vanilla lora rem…

vitrun updated 1 day ago
8
Mintplex-Labs/anything-llm #2446

[Chore]: bump `node-llama-cpp` to 3.1.1 to support newer mod…

### How are you running AnythingLLM? Docker (local) ### What happened? Docker sees my models. I start chatting in my workspace, and then I get an error "Failed to load model" ``` anythingllm |…

PeterTucker updated 13 hours ago
2

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for linear-attention-model

1000+ results
for linear-attention-model