-
### System Info
Running `infinity` via Docker (`michaelf34/infinity:latest`) and calling the model through the REST API
### Information
- [X] Docker
- [ ] The CLI directly via pip
### Tasks
- [X] An …
-
Thank you for this excellent implementation; it has been a great help to me. However, when using the `load_from_name` function I found that flash-attn is not supported, so I implemented that part of the code myself. I'm not sure whether the implementation is correct, even though it runs fine.
Here is the code snippet:
```python
###### ------- ps: add use_flash_attention keyword ------- ######
def load_fro…
```
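For context, a common way to add such a keyword without hard-depending on flash-attn is to probe the import and fall back. This is only a sketch of that pattern, not the project's actual API; `resolve_attention_backend` and the backend names here are hypothetical.

```python
# Hypothetical sketch: gate an optional flash-attn dependency behind a keyword.
# Neither this function nor the backend names come from the project's codebase.
def resolve_attention_backend(use_flash_attention: bool) -> str:
    """Pick an attention backend, degrading gracefully if flash-attn is absent."""
    if not use_flash_attention:
        return "vanilla"
    try:
        import flash_attn  # noqa: F401  # optional, GPU-only dependency
    except ImportError:
        # Fall back instead of crashing at model-load time.
        return "vanilla"
    return "flash"
```

Probing at load time keeps the keyword safe on machines without a compatible GPU, which is usually where flash-attn installs fail.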
-
### System Info
Hi guys, I have some complex models where I use only parts (sub-models) of transformers models. For example, below I used `AutoModelForCausalLM.from_pretrained()`, but normally it would be something…
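When only a sub-module of a loaded model is needed, one generic pattern is to resolve a dotted attribute path on the loaded object. A minimal stdlib sketch of that idea, using a dummy nested object in place of a real transformers model (`get_submodule_by_path` and the attribute names are made up for illustration):

```python
from functools import reduce
from types import SimpleNamespace

def get_submodule_by_path(model, path: str):
    """Resolve a dotted attribute path, e.g. 'model.decoder.embed_tokens'."""
    return reduce(getattr, path.split("."), model)

# Stand-in for a loaded causal LM: only the nesting matters here.
dummy = SimpleNamespace(
    model=SimpleNamespace(decoder=SimpleNamespace(embed_tokens="embedding-layer"))
)
print(get_submodule_by_path(dummy, "model.decoder.embed_tokens"))  # -> embedding-layer
```

For actual PyTorch models, `torch.nn.Module` already ships a built-in `get_submodule("...")` that does this for registered sub-modules, which avoids the hand-rolled helper.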
-
Dear Professor Peng Qian,
Recently I read the latest paper published by your team at IJCAI-21, "Smart Contract Vulnerability Detection: From Pure Neural Network to Interpretable Graph Feature and…
-
As you probably know, yesterday Google released Gemma2 with superior performance and robustness https://storage.googleapis.com/deepmind-media/gemma/gemma-2-report.pdf
One of the key changes was log…
-
**While running the example code in Readme.md**
```python
from local_gemma import LocalGemma2ForCausalLM
from transformers import AutoTokenizer
import os

os.environ['HUGGINGFACEHUB_API_TOKEN'] = ''
os.…
```
-
So, I'm trying to implement an attention module in the YOLOv9 head architecture after the ELAN block, but it seems I don't fully understand the underlying architecture of YOLOv9. I want to implement t…
-
```
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_…
```
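For context, this warning matters because unmasked pad positions still receive attention weight and contaminate the output. A tiny stdlib sketch of masked vs. unmasked softmax pooling (the scores and the "garbage" pad value are made up purely for illustration):

```python
import math

def softmax_pool(scores, values, mask):
    """Average `values` with softmax(`scores`), zeroing out masked positions."""
    exps = [math.exp(s) if m else 0.0 for s, m in zip(scores, mask)]
    total = sum(exps)
    return sum(w * v for w, v in zip(exps, values)) / total

scores = [2.0, 1.0, 1.0]        # last position is padding
values = [1.0, 1.0, -5.0]       # pad slot holds a garbage value
with_mask = softmax_pool(scores, values, [1, 1, 0])
without_mask = softmax_pool(scores, values, [1, 1, 1])
print(with_mask, without_mask)  # the pad token drags the unmasked result down
```

In transformers, the usual remedy is what the warning itself suggests: pass the tokenizer's `attention_mask` into `model.generate(...)` and set `pad_token_id` explicitly.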
-
Thank you for sharing the project. I am a beginner in this field, and I have run into issues while trying to save the entire PyTorch model and export it to ONNX.
While saving the …
-
Seems the model merge is giving the WARNING SHAPE MISMATCH error again. I looked into the merge_patcher file, as referenced in other error reports from back in March, and see that the typo was corrected, so it seems the…