gemm Search Results - Githubissues

1000+ results for gemm


iree-org/iree-turbine #120

October 2024 Tkw Release Notes

This issue lists all feature requests and improvements slated for the Oct 2024 Tkw release.
- [ ] Flash Attention Implementation
- [ ] Flash Attention Performance Improvements
- [x] Implicit GEMM…

harsh-nod updated 2 weeks ago
1
onnx/models #439

Deprecated old models may need to be updated

# Bug Report ### Which model does this pertain to? A bunch of models can't be run by ONNX Runtime. Some ops are not supported by ONNX runtime for now. As this repo is strongly connected with …

YuhengHuang42 updated 3 years ago
2
pytorch/pytorch #125767

RFC: Integrate Arm Compute Library (ACL) into PyTorch as a s…

## Issue description For aarch64 linux platform, Arm Compute Library ([ACL](https://github.com/ARM-software/ComputeLibrary)) is the recommended GEMM backend for PyTorch via MKLDNN. Currently ACL is…

snadampal updated 5 months ago
4
harvard-acc/gem5-aladdin #31

For each algorithm of MachSuite, how to generate the corresp…

For each algorithm of MachSuite, how to generate the corresponding executable file and the necessary files configured in gem.5.cfg such as dynamic_trace.gz? For example, I want to simulate the test_a…

better1017 updated 3 years ago
3
iree-org/iree #9689

Fix performance of mma sync

### Request description From Nod.ai meeting 6/30, filing new issue ### What component(s) does this issue relate to? _No response_ ### Additional context _No response_

allieculp updated 1 year ago
9
microsoft/onnxruntime #10278

Gemm layer is not quantized with QGemm node but with QLinear…

Hi, I work with a simple onnx network exported from pytorch. The last fully connected layer (with bias) is exported as a Gemm node. After quantization (quantize_static) with the last onnxrt versio…

ghost updated 2 years ago
2
numcl/numcl #4

Benchmarking & optimization

The first step toward optimization is to know where you are now. + [x] Write a benchmark against numpy & julia + ideas + [ ] broadcasting -- switch to the compilation based approach, similar to …

guicho271828 updated 5 years ago
2
davidavdav/InplaceLinalg.jl #3

@inplace should not scale arguments

Consider:
```
julia> @macroexpand @inplace C -= R*2*S
:(InplaceLinalg.C_AB!(C, 1, -R, 2, S))
```
If `R` is a matrix (or vector) then `-R` is not done in place, and unnecessarily in any case, since…

bsxfan updated 5 years ago
5
clatfd/GNN-ART-LABEL #6

I have some errors... Could you help me?

I am really interested in your work. I tried to use this tool, However, I had troubles and could not solve these. [ArtLabel_errors.pdf](https://github.com/clatfd/GNN-ART-LABEL/files/7906232/ArtLabe…

JinyongChung updated 2 years ago
3
NVIDIA/TensorRT-LLM #1580

Fail to build int4_awq on Mixtral 8x7b

### System Info ubuntu 20.04 tensorrt 10.0.1 tensorrt-cu12 10.0.1 tensorrt-cu12-bindings 10.0.1 tensorrt-cu12-libs 10.0.1 tensorrt-llm 0.10.…

gloritygithub11 updated 1 week ago
17

