-
Thanks for sharing your excellent work as open source. I have a question about your code: I couldn't find the specific parameters for the temporal spans **I1** and **I2** that you mentioned in your pap…
-
# Description
Current challenges in using Neural Operators include irregular meshes, multiple inputs, multiple inputs on different meshes, and multi-scale problems [1]. The attention mechanism is promi…
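For context, here is a minimal PyTorch sketch of the kind of thing I mean (not this project's API; all class and parameter names here are hypothetical): cross-attention treats the mesh points as an unordered set of key/value tokens, so it works on irregular meshes, and the query points can live on a different mesh entirely.
```python
import torch
import torch.nn as nn

# Hypothetical sketch: cross-attention from query points to function values
# sampled on an irregular mesh. Attention treats the mesh as an unordered
# set of tokens, so no regular grid is required and Nq may differ from Nm.
class MeshCrossAttention(nn.Module):
    def __init__(self, coord_dim, value_dim, embed_dim=64, num_heads=4):
        super().__init__()
        self.query_proj = nn.Linear(coord_dim, embed_dim)  # embed query coordinates
        self.key_proj = nn.Linear(coord_dim, embed_dim)    # embed mesh coordinates
        self.value_proj = nn.Linear(value_dim, embed_dim)  # embed function values
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.out = nn.Linear(embed_dim, value_dim)

    def forward(self, query_coords, mesh_coords, mesh_values):
        # query_coords: (B, Nq, coord_dim); mesh_coords: (B, Nm, coord_dim);
        # mesh_values: (B, Nm, value_dim)
        q = self.query_proj(query_coords)
        k = self.key_proj(mesh_coords)
        v = self.value_proj(mesh_values)
        attended, _ = self.attn(q, k, v)
        return self.out(attended)  # (B, Nq, value_dim)

# 500 irregular 2D mesh points, queried at 128 new locations
model = MeshCrossAttention(coord_dim=2, value_dim=1)
u = model(torch.rand(1, 128, 2), torch.rand(1, 500, 2), torch.rand(1, 500, 1))
print(u.shape)  # torch.Size([1, 128, 1])
```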
-
I want to add an attention mechanism to the MADDPG network; could you tell me which .py file to modify? This question has been bothering me for a long time, and I would appreciate it if you could solve the…
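In case it helps to show what I am after, this is a rough sketch of where I imagine the attention layer going (purely hypothetical names, not files from this repo): inside the centralized critic, so each agent can weight the other agents' observation-action embeddings.
```python
import torch
import torch.nn as nn

# Hypothetical sketch of an attention-augmented MADDPG critic; `AttentionCritic`
# and its dimensions are illustrative, not names from this repository.
class AttentionCritic(nn.Module):
    def __init__(self, obs_act_dim, embed_dim=64, num_heads=4):
        super().__init__()
        self.embed = nn.Linear(obs_act_dim, embed_dim)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.q_head = nn.Sequential(nn.Linear(embed_dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, obs_acts):
        # obs_acts: (batch, n_agents, obs_act_dim), one concatenated
        # observation+action vector per agent
        x = self.embed(obs_acts)
        attended, _ = self.attn(x, x, x)  # each agent attends over all agents
        return self.q_head(attended)      # (batch, n_agents, 1): a Q-value per agent

critic = AttentionCritic(obs_act_dim=24)
print(critic(torch.rand(32, 3, 24)).shape)  # torch.Size([32, 3, 1])
```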
-
## 🚀 Feature
Currently, nn.Transformer and related modules return only the output tensor. I suggest returning the attention weights as well.
## Motivation
For all purposes -- demos, tutorials, and practica…
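For reference, the weights are currently only reachable one level down, via `nn.MultiheadAttention` with `need_weights=True`; the higher-level modules compute them and then discard them:
```python
import torch
import torch.nn as nn

mha = nn.MultiheadAttention(embed_dim=16, num_heads=4, batch_first=True)
x = torch.rand(2, 10, 16)  # (batch, seq_len, embed_dim)

# attn_weights: (batch, seq_len, seq_len), averaged over heads by default;
# pass average_attn_weights=False for per-head weights.
out, attn_weights = mha(x, x, x, need_weights=True)
print(out.shape, attn_weights.shape)  # [2, 10, 16] and [2, 10, 10]

# The higher-level module returns only the output tensor:
layer = nn.TransformerEncoderLayer(d_model=16, nhead=4, batch_first=True)
print(layer(x).shape)  # [2, 10, 16]; the attention weights are inaccessible
```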
-
I think there is a failure mode where you:
- use the server in Docker
- install plugins with dependencies that the server already has, so they are not installed
- pull a new server version that no long…
-
Does the author have a cfg file with an attention mechanism? Thanks.
-
Thank you very much for your great work!
I have a question from reading the source code: what is the role of `num_tokens`?
I found the `num_tokens` parameter in the source code of `IPAttnPr…
-
Hi,
I'd like to know how you visualized the 2D and 3D heatmaps in "Figure 8: Motion-word cross-attention visualization" in your paper.
The attention matrix in [CrossAttention module](https://githu…
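For context, this is the kind of 2D plot I am trying to reproduce, as a minimal matplotlib sketch with placeholder data (I am assuming the attention matrix has shape `(num_motion_tokens, num_word_tokens)` after softmax, averaged over heads):
```python
import numpy as np
import matplotlib.pyplot as plt

attn = np.random.rand(20, 8)                  # placeholder cross-attention matrix
attn /= attn.sum(axis=-1, keepdims=True)      # rows sum to 1, like softmax output
words = [f"word_{i}" for i in range(attn.shape[1])]  # hypothetical word tokens

fig, ax = plt.subplots(figsize=(4, 6))
im = ax.imshow(attn, aspect="auto", cmap="viridis")  # motion tokens x word tokens
ax.set_xticks(range(len(words)))
ax.set_xticklabels(words, rotation=45, ha="right")
ax.set_xlabel("word tokens")
ax.set_ylabel("motion tokens")
fig.colorbar(im, ax=ax, label="attention weight")
fig.tight_layout()
plt.show()
```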
-
Hi,
First of all, great work. I am a big proponent of Flan-T5 and use it in my projects. For multilingual use, the mT5 and bigscience/mt0 models provide a good baseline and are truly multilingual. Does Flash…
-
Hi, is there any update on implementing the generative attention masking?
Could you please also provide some explanation in https://github.com/bowang-lab/scGPT/blob/dev-temp/examples/pretrain.py reg…
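For reference, my current understanding of the idea is the sketch below: positions to be generated may attend to the known positions (and themselves) but not to each other. This is a generic PyTorch illustration of attention masking, not necessarily scGPT's exact scheme.
```python
import torch

def generative_attn_mask(known: torch.Tensor) -> torch.Tensor:
    # known: (seq_len,) bool, True where the token's value is already observed.
    # Returns (seq_len, seq_len) bool, True = attention NOT allowed
    # (the attn_mask convention of nn.MultiheadAttention).
    seq_len = known.shape[0]
    allowed = known.unsqueeze(0).expand(seq_len, seq_len).clone()  # see known tokens
    allowed |= torch.eye(seq_len, dtype=torch.bool)                # and yourself
    return ~allowed

mask = generative_attn_mask(torch.tensor([True, True, False, False]))
print(mask.int())
# Rows 2 and 3 (positions being generated) may attend only to the known
# tokens 0 and 1 plus themselves, not to each other.
```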