neural-attention Search Results

1000+ results for neural-attention

pytorch/pytorch #9222

[feature request] Implementing Block Sparse Operations

**TL;DR:** Implementing block-sparse operations for faster matrix-multiplication. Is this something worth adding to PyTorch? Goals: 1. Faster matrix-multiplication by taking advantage of block-…

rdspring1 updated 2 years ago
20
bluss/matrixmultiply #34

Use optimal kernel parameters (architectures, matrix layouts…

I am trying to figure out what to use as optimal kernel parameters for different architectures. For example, it looks like blis is using 8x4 for Sandy Bridge, but 8x6 for Haswell. Why? What led the…

SuperFluffy updated 2 years ago
7
amusi/ECCV2024-Papers-with-Code #36

Welcome to share ECCV 2024 papers and code…

[The format of the issue] Paper name/title: Project link: Paper link: Code link:

amusi updated 2 weeks ago
46
QcraftWeb/qcraft-article #18

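[2020/09/07] QBlog 01 | 3D object detection, behavior prediction, and motion detection: an overview of the current state of research on temporal fusion in LiDAR and…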

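![Header image 2](https://user-images.githubusercontent.com/53006892/109469329-daddde80-7aa8-11eb-86e1-0102bc216efb.jpg) In autonomous driving, LiDAR-based 3D object detection and motion behavior prediction are a common approach. At present, the vast majority of LiDAR object detection algorithms operate on a single frame. Multi-frame temporal LiDAR data provides…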

gameclub000 updated 7 months ago
1
CUNY-CL/yoyodyne #216

Embeddings not being passed to lstm module

For some reason, someone added an `embeddings` argument to `BaseModule` but didn't update the other modules to pass it. Why are we passing embeddings as an argument? Those should be an attribute of…

bonham79 updated 1 week ago
16
leela-zero/leela-zero #2259

Starting a new training run with ladder code

Ever since LeelaZero was released, people have been wanting to use it to analyze their games, and it's become incredibly easy and popular to review with LeelaZero. Now, LeelaZero has become insanely s…

featurecat updated 5 years ago
63
HyunjoonCho/CS492I-IntroToDL-project #1

Project Proposal Day

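Before reviewing each topic in detail, we first propose ideas. * Paper/Architecture figure to code * A program that converts AI papers or related descriptions directly into PyTorch code * Since implementations are built by stacking nn.Module, our guess is that this structure could be learned..? * Con…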

HyunjoonCho updated 3 years ago
7
meta-llama/llama #407

Error: 70B Model quantizing on mac: tensor 'layers.0.attenti…

Used this model: https://huggingface.co/meta-llama/Llama-2-70b Used these commands: ``` $ convert-pth-to-ggml.py models/LLaMa2-70B-meta 1 ``` ``` $ ./quantize ./models/LLaMa2-70B-meta/ggml…

NilsHellwig updated 1 year ago
34
openvinotoolkit/openvino #25393

[BUG] [GPU] Phi3 Medium int4 Runtime Error: probability tens…

### 🐛 Describe the bug Hi, Running Phi3 Medium on LocalAI with OpenVINO backend I found that while the int8 quantization is working correctly, the int4 quant gives the following error after few to…

fakezeta updated 3 months ago
9
GFNOrg/torchgfn #185

SubTB simple running example ?

The given main TorchGFN example runs fine, including the line: # 4 - We define the GFlowNet. gfn = TBGFlowNet(init_logZ=0., pf=pf_estimator, pb=pb_estimator) # We initialize logZ to 0 …

TomAvrech updated 1 month ago
4
