-
I saw you used something like this:
```python
model = FastVisionModel.get_peft_model(
model,
finetune_vision_layers = True, # False if not finetuning vision part
finetune_language_lay…
```
-
Description: When running inference on the distilbert-base-uncased model using the NPU on Snapdragon® X Elite (X1E78100 - Qualcomm®) through ONNX Runtime's QNNExecutionProvider, the model fails to inf…
-
Hi OpenSora-Plan Team,
Thank you for your excellent work!
I noticed that a new attention pattern was used when training model v1.3, as described below:
> Considering both computational load a…
-
Is there a strict requirement for GPUs that support flash_attention? I tried to test on a V100, but this GPU does not support flash_attention, which results in a `RuntimeError: No available …`
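For context, Flash Attention 2 requires an Ampere-or-newer GPU (CUDA compute capability 8.0+), while the V100 is compute capability 7.0. A minimal sketch of a fallback, assuming the library accepts an `attn_implementation` string (the helper name here is hypothetical, not part of any library; on a real machine you would get the capability from `torch.cuda.get_device_capability()`):

```python
# Hypothetical helper: pick an attention backend from the GPU's CUDA
# compute capability. Flash Attention 2 needs Ampere (>= 8.0); older
# GPUs such as the V100 (7.0) must fall back to e.g. PyTorch SDPA.
def pick_attn_implementation(major: int) -> str:
    return "flash_attention_2" if major >= 8 else "sdpa"

# V100 reports compute capability (7, 0):
print(pick_attn_implementation(7))   # -> sdpa
# A100 reports (8, 0):
print(pick_attn_implementation(8))   # -> flash_attention_2
```

On a CUDA machine, `major, _ = torch.cuda.get_device_capability()` supplies the argument.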
-
When I run `./eval.sh`, it raises the error below:
```shell
export PYTHONPATH=$(pwd)
export HF_ENDPOINT=https://hf-mirror.com
export HF_TOKEN=hf_xxx
model_path="/data/tbsi/train_log/llavanext/2…
```
-
### The model to consider.
https://huggingface.co/dunzhang/stella_en_1.5B_v5
last_hidden_state = model(**input_data)[0]
In the model's `__init__`:
vector_linear = torch.nn.Linear(in_features=model.conf…
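The pattern in the snippet above can be sketched end to end: mask-aware mean pooling over `last_hidden_state`, then a linear projection to the embedding dimension. This is a minimal illustration with made-up sizes, not the model's actual configuration (the real `in_features`/`out_features` come from `model.config`):

```python
import torch

# Stand-ins for the values read from model.config in the real code.
hidden_size, vector_dim = 8, 4
vector_linear = torch.nn.Linear(in_features=hidden_size, out_features=vector_dim)

# Fake last_hidden_state and attention mask: (batch, seq, hidden) / (batch, seq).
hidden = torch.randn(2, 5, hidden_size)
mask = torch.ones(2, 5)

# Mask-aware mean pooling over the sequence dimension.
pooled = (hidden * mask.unsqueeze(-1)).sum(dim=1) / mask.sum(dim=1, keepdim=True)

# Project pooled states down to the output embedding dimension.
vectors = vector_linear(pooled)   # shape: (batch, vector_dim)
```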
-
Dear Sir/Madam,
![image](https://github.com/user-attachments/assets/281badab-35b6-47f5-a4f9-b1b7c655e2e1)
How can I use the "raw" data to draw the heatmap?
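In case it helps, a minimal sketch of rendering a heatmap, assuming "raw" is a 2D array of values (the random array here is a placeholder for your actual data):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless backend so this runs without a display
import matplotlib.pyplot as plt

raw = np.random.rand(16, 16)  # placeholder for your raw 2D data

fig, ax = plt.subplots()
im = ax.imshow(raw, cmap="viridis")  # each cell colored by its value
fig.colorbar(im, ax=ax)              # legend mapping colors to values
fig.savefig("heatmap.png", dpi=150)
```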
-
### System Info
- `transformers` version: 4.46.2
- Platform: Linux-6.1.85+-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.24.7
- Safetensors version: 0.4.5
- Accele…
-
I found these greatly improved SDXL speed, by about 25% on my 3060. Can you make a node for UNet models? Most Flux models are UNet-based, and I would like to try this on Flux.
-
### 🐛 Describe the bug
As mentioned in this [blog](https://dev-discuss.pytorch.org/t/higher-order-operators-2023-10/1565), HigherOrderOperator does not support graph break inside the input/output fun…