-
From https://github.com/google/jax/issues/20856,
`jax.nn.softmax` returns different results under `jax.jit` because the `jax.jit` version calls a oneDNN custom call.
Code:
```
import jax
import j…
```
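The snippet is cut off above; a minimal comparison along the same lines (input values here are illustrative, not from the issue) would be:
```python
import jax
import jax.numpy as jnp

x = jnp.linspace(-5.0, 5.0, 8)

eager = jax.nn.softmax(x)            # eager path
jitted = jax.jit(jax.nn.softmax)(x)  # jitted path, which may lower to the oneDNN custom call on CPU

# Any nonzero difference comes from the two paths running different kernels.
print(jnp.max(jnp.abs(eager - jitted)))
```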
-
Hi, CTranslate2 uses oneDNN. Recent oneDNN versions have [support for AMD GPUs](https://github.com/oneapi-src/oneDNN/tree/master/src/gpu/amd). It [requires Intel oneAPI DPC++](https://developer.codeplay.…
-
### 🚀 The feature, motivation and pitch
Feature request: oneDNN backend support for quantized operators, at least INT8.
Motivation: Given the acceptable accuracies from INT8 quantized inference an…
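For reference, a minimal sketch of the kind of INT8 quantized inference meant here, using PyTorch's existing dynamic quantization API (layer sizes are illustrative); the request is for such quantized ops to be executed by oneDNN kernels:
```python
import torch

# A tiny model containing the op the request targets (INT8 linear).
model = torch.nn.Sequential(torch.nn.Linear(640, 640), torch.nn.ReLU())

# Dynamically quantize the Linear layers to INT8.
qmodel = torch.ao.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)

x = torch.randn(8, 640)
print(qmodel(x).shape)  # torch.Size([8, 640])
```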
-
PR #366 with [latest commit@ebf149c](https://github.com/oneapi-src/oneDNN/commit/ebf149c48ecdeea5279003c25e20d6d8c7a657a3) and PR #330 with [commit@59e6099](https://github.com/oneapi-src/oneDNN/commit…
-
### 🚀 The feature, motivation and pitch
# Motivation
Intel GPUs could significantly improve workload performance. As described in [[RFC] Intel GPU Upstreaming](https://github.com/pytorch/pytor…
-
### 🐛 Describe the bug
## Situation
I'm reporting a "bug", though there is no explicit error message; rather, it is a change in observed behavior during inference. This began with `torch==2.4.0` but …
-
Hello oneDNN team,
Just asking whether it is really expected for a `ba` (transposed-layout) matmul to be this slow, e.g.:
```
M=63448
K=640
N=2
tag   time
ab      18
ba    1790
```
With benchdnn:
```
wtambellini@lawtambe3 onednn-…
```
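As a rough cross-check outside benchdnn, the same layout effect can be probed from PyTorch, whose CPU matmul dispatches to oneDNN; this timing sketch is illustrative, not an exact equivalent of the benchdnn problem:
```python
import time
import torch

M, K, N = 63448, 640, 2  # shapes from the report

a_ab = torch.randn(M, K)      # plain row-major ("ab") layout
a_ba = torch.randn(K, M).t()  # transposed view, i.e. a "ba"-like layout
b = torch.randn(K, N)

def bench(a, label, iters=10):
    torch.matmul(a, b)  # warm-up
    t0 = time.perf_counter()
    for _ in range(iters):
        torch.matmul(a, b)
    print(f"{label}: {(time.perf_counter() - t0) / iters * 1e3:.2f} ms")

bench(a_ab, "ab")
bench(a_ba, "ba")
```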
-
From https://github.com/pytorch/pytorch/pull/134282#issuecomment-2307157197: in the aarch64 dashboard results, benchmarking with fp16 is 2x~10x slower than bf16, often causing timeouts in cases…
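For context, the kind of fp16-vs-bf16 comparison behind those dashboard numbers can be sketched in PyTorch as follows (sizes are illustrative, not the dashboard workloads):
```python
import time
import torch

x = torch.randn(1024, 1024)
w = torch.randn(1024, 1024)

def bench(dtype, iters=50):
    a, b = x.to(dtype), w.to(dtype)
    torch.matmul(a, b)  # warm-up
    t0 = time.perf_counter()
    for _ in range(iters):
        torch.matmul(a, b)
    print(f"{dtype}: {(time.perf_counter() - t0) / iters * 1e3:.3f} ms")

bench(torch.float16)
bench(torch.bfloat16)
```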
-
Hi dear team,
Is there any other way to accelerate this conv (ic=16, oc=16, height=208, width=32, stride=1, kernel=3) on a single core?
```bash
ONEDNN_VERBOSE=1 numactl -C 1 -m 0 ./benchdnn --m…
```
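For comparison, a minimal single-core reproduction sketch in PyTorch (whose CPU convolutions run through oneDNN); the batch size and padding are assumptions, since the benchdnn command is cut off:
```python
import time
import torch

torch.set_num_threads(1)  # single core, matching the numactl -C 1 pinning

x = torch.randn(1, 16, 208, 32)  # N, ic, H, W from the question; batch size assumed
conv = torch.nn.Conv2d(16, 16, kernel_size=3, stride=1, padding=1)  # padding assumed

with torch.inference_mode():
    conv(x)  # warm-up
    t0 = time.perf_counter()
    for _ in range(100):
        conv(x)
print(f"{(time.perf_counter() - t0) / 100 * 1e3:.3f} ms per conv")
```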
-
Is there a way to trace the oneDNN API calls executed during inference with BigDL-LLM?
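One common approach, assuming BigDL-LLM's CPU inference path goes through oneDNN, is oneDNN's verbose mode, the same ONEDNN_VERBOSE=1 switch used in the benchdnn command above:
```python
import os

# Enable oneDNN's built-in tracing before the first oneDNN primitive runs.
os.environ["ONEDNN_VERBOSE"] = "1"

# ...then run BigDL-LLM inference as usual; each oneDNN primitive execution
# is printed to stdout with its kind, implementation, shapes, and time.
```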