dnnl Search Results - Githubissues

oneapi-src/oneDNN #2167

MacOS ci release mode build issue with gcc-14

The MacOS release-mode gcc-14 aarch64 CI started failing after this commit: https://github.com/oneapi-src/oneDNN/commit/55b52c67967b61d439c84df8b34cfb006bf528eb The log can be seen [here](https://g…

theComputeKid updated 3 weeks ago

tensorflow/tensorflow #76311

Build Failure on AWS Graviton3 with Custom oneDNN (oneDNN-3.…

### Issue type Build/Install ### Have you reproduced the bug with TensorFlow Nightly? No ### Source source ### TensorFlow version tf v2.17.0 ### Custom code No ### OS pla…

deepeshfujitsu updated 4 weeks ago

intel-analytics/ipex-llm #12334

ipex-llm-ollama-installer-20240918.exe安装后用另一个exe调用文件夹中的start…

1、双击安装ipex-llm-ollama-installer-20240918.exe 默认安装在C:\Users\OPS17\ipex-llm-ollama 2、拷贝ipex-llm-ollama文件夹到C:\aipc\ 3、这个时候运行ipex-llm-ollama里面的start.bat，可以正常运行，并且访问什么都正常 4、这个时候用Python编写一个程序，并打包为…

dayskk updated 5 days ago

microsoft/onnxruntime #21958

Use AppendExecutionProvider_Dnnl api to add onednn EP，No su…

### Describe the issue First i use commond '--use dnnl' to build a onnxruntime.dll enable oneDNN support. When i use this code to register a dnnl EP to ORT： `int threadNum = 4;` `bool enable_cpu_m…

yunhaolsh updated 1 month ago

intel/graph-compiler #38

Run tests in CI

- [x] gtest for integration tests - these are built by enabling `-DGC_TEST_ENABLE=on`. Currently, there's only one built now, run as `cd test/dnnl/ && ./test_dnnl_c_interface`. I think it's missing a …

kurapov-peter updated 3 weeks ago

bytedeco/javacpp-presets #1518

Couldn't load libjnidnnl.dylib

``` Caused by: java.lang.UnsatisfiedLinkError: Can't load library: /Users/booster/.javacpp/cache/dnnl-3.3.4-1.5.10-macosx-x86_64.jar/org/bytedeco/dnnl/macosx-x86_64/libjnidnnl.dylib ``` Caused by…

b005t3r updated 4 months ago

oneapi-src/oneDNN #724

DNNL MatMul with SGEMM

Hello, I found that dnnl::matmul is slow when they are not using SGEMM. For example, if I assigned src/weights/bias with 50x50 matrix, it takes 760 microseconds and 100x100 is going to be 5ms. Ho…

affranchi updated 3 months ago

intel/intel-extension-for-pytorch #312

Mismatched constructor for dnnl::convolution_forward::primit…

### Describe the bug Original issue: https://github.com/oneapi-src/oneDNN/issues/1611 I am compiling pytorch from source according to the instructions from https://intel.github.io/intel-extens…

heligan updated 1 year ago

oneapi-src/oneDNN #1898

[Proposal] Add cpu alloc/free callback to support customlize…

# Summary During our pytorch development, we found Windows system memory alloctor is worse performance, and slow down the whole pytorch performance. After add third party memory alloctor, pytorch imp…

xuhancn updated 1 week ago

ROCm/AMDMIGraphX #3519

[BF16] GPU Implementation

**Idea:** Cast FP32/FP16 to BF16. Casting will be different based on type: - FP32 to BF16: truncate last 16 bits from mantissa, exponent stays the same - FP16 to BF16: more involved process --…

richagadgil updated 1 month ago

1000+ results for dnnl

1000+ results
for dnnl