-
I am using
`shap==0.38.1`
`torch==1.7.1`
I have an MLP that takes a text input (preprocessed/encoded as a LongTensor), and the model trains fine for the classification problem at hand. I use the Long…
-
### Prerequisites
- [X] I have read the [documentation](https://hf.co/docs/autotrain).
- [X] I have checked other issues for similar problems.
### Backend
Colab
### Interface Used
CLI
### CLI Co…
-
# No recognition results on an arm64 Windows machine
Script:
```bash
@echo off
setlocal
:: Specify the model path
set MODEL_PATH=C:\\Users\\Admin123\\.cache\\modelscope\\hub\\lovemefan\\SenseVoiceGGUF\\gguf-fp16-sense-voice-small.bin
::…
-
### What happened?
I encountered an issue while loading a custom model in llama.cpp after converting it from PyTorch to GGUF format. Although the model was able to run inference successfully in PyTor…
-
I tried to train the matmulfreellm model following "https://github.com/ridgerchu/matmulfreellm/issues/9#issuecomment-2193970930", but I keep hitting a "CUDA device-side assert triggered" error. Could you give me m…
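In my experience this class of error is most often an out-of-range index: a token id ≥ vocab size reaching an embedding lookup, or a label ≥ num_classes reaching the loss. Re-running with `CUDA_LAUNCH_BLOCKING=1` set in the environment makes the traceback point at the actual failing op. Before that, a pure-Python sanity check on the encoded data can catch the problem early; the function and names below are hypothetical, not part of matmulfreellm:

```python
def check_ids(batch_ids, vocab_size):
    """Return the token ids that would trigger a device-side assert
    in an embedding lookup (valid range is [0, vocab_size))."""
    return [i for i in batch_ids if not (0 <= i < vocab_size)]

# Example: a vocab of 32000, with two corrupt ids in the batch
bad = check_ids([5, 31999, 32000, -1], vocab_size=32000)
```

Running the same check on labels against the number of classes rules out the loss side as well.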
-
All requests end with 'finish_reason': 'length' when the max_tokens=-1 parameter is set.
What could be the problem?
**Model**:
https://huggingface.co/IlyaGusev/saiga_mistral_7b_gguf/resolve/main/…
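For what it's worth, `finish_reason: 'length'` simply means generation stopped at the token cap, so the server is probably not interpreting `max_tokens=-1` as "unlimited" (servers differ on whether -1 is a sentinel or gets clamped). A sketch of a request payload with an explicit positive cap, assuming an OpenAI-compatible completions API; the model name and prompt are placeholders:

```python
import json

# Hypothetical payload for an OpenAI-compatible /v1/completions call.
# An explicit positive max_tokens avoids the ambiguous -1 sentinel.
payload = {
    "model": "saiga_mistral_7b",  # placeholder model name
    "prompt": "...",              # placeholder prompt
    "max_tokens": 512,            # explicit cap instead of -1
    "temperature": 0.7,
}
body = json.dumps(payload)
```

If responses then end with `finish_reason: 'stop'`, the -1 sentinel was the culprit.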
-
The model configuration does not match the paper: a softmax layer is missing at the end of the model. The paper concatenates the attention * vision features for all the glimpses…
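For reference, the missing final softmax is just a normalization of the output logits into a probability distribution; a minimal stdlib sketch, independent of the repo's code:

```python
import math

def softmax(logits):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])
```

Whether it belongs in the model itself depends on the loss: if training uses a cross-entropy loss that expects raw logits, the softmax is only needed at inference time.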
-
### What happened?
For llama.cpp, I downloaded the Q4_K_M quantized [model](https://huggingface.co/jxtngx/Meta-Llama-3.2-1B-Q4_K_M-GGUF/tree/main) and used [llama-bench](https://github.com…
-
https://github.com/codertimo/BERT-pytorch/blob/d10dc4f9d5a6f2ca74380f62039526eb7277c671/bert_pytorch/model/attention/multi_head.py#L15
It looks like **self.d_k = d_model // h** ---> embed size 768 divi…
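That division is the standard multi-head split: the embedding is not truncated, it is partitioned across heads, so each head attends over `d_model // h` dimensions and concatenating the `h` head outputs restores the full width. A quick check of the arithmetic for BERT-base:

```python
d_model, h = 768, 12   # BERT-base hidden size and number of attention heads
d_k = d_model // h     # per-head dimension

# The h heads together still cover the full embedding width,
# so no information is dropped by the integer division.
assert d_k * h == d_model
```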
-
I am trying to integrate SAM. The model loads fine, but when I try to annotate I get the error below. I would really appreciate any help with this.
(mask_decoder): MaskDecoder(
(transf…