avx512 Search Results - Githubissues

1000+ results for avx512

apache/mxnet #20235

adding blis in addition to openblas, atlas and mkl

## Description This is another BLAS library that is faster than OpenBLAS and, unlike it, supports AVX512 as well. Although not as fast as MKL on x86 and x86_64 processors, it outperforms othe…

brightening-eyes updated 3 years ago
2
SWIFTSIM/SWIFT #3

AVX512F instructions

Hi, First of all - thanks for creating (and open-sourcing) this swift code! Looks great! I was looking through the SIMD wrappers for `AVX512F` in `vector.h` and I noticed a few wrappers that re…

manodeep updated 6 years ago
6
apache/arrow #42951

[C++][Parquet] Experiment to see if using SIMD shuffle opera…

Follow-up from PARQUET-1840: for current benchmarks, it seems that removing the memset somehow either has no impact or is slightly worse. We should investigate using SIMD operations to speed up sp…

asfimport updated 3 months ago
7
JuliaSIMD/CPUSummary.jl #6

CPUSummary.jl v0.1.14 breaks CI of Trixi.jl on skylake-avx51…

We observed some specific problems when going from CPUSummary.jl v0.1.8 to v0.1.14 at [Trixi.jl](https://github.com/trixi-framework/Trixi.jl). Everything is fine with the old version of CPUSummary.jl.…

ranocha updated 2 years ago
8
BlueBrain/HighFive #617

Example for adding auto serialisation support for user defi…

**Is your feature request related to a problem? Please describe.** On master branch we can: ```c++ auto v = std::vector(); /* fill v */ file.createDataSet("/path", v); ``` But we will fai…

aborzunov updated 1 year ago
4
llvm/llvm-project #34224

[X86] Should we prefer AVX VBLENDPS/VBLENDPS over VMOVSS/VMO…

| | |
| --- | --- |
| Bugzilla Link | [34876](https://llvm.org/bz34876) |
| Version | trunk |
| OS | All |
| CC | @RKSimon |

## Extended Description

There are no EVEX versions of VBLENDPS/VBLENDPD…

topperc updated 2 years ago
1
intel-analytics/ipex-llm #11442

Issue while using IPEX-LLM optimized Mixtral model in Langch…

The code below works when I am using the Mixtral model from Ollama directly. But when I use the IPEX-LLM optimized Mixtral model, the tool does not work. This is an easy tool for testing functionality, whic…

tsantra updated 3 months ago
8
intel/torch-xpu-ops #296

[Evaluated] issues report for test_dataloader_xpu.py

### 🐛 Describe the bug 1. https://github.com/intel/torch-xpu-ops/issues/261 - closed 2. Tensor isn't pinned with DataLoader(..., pin_memory=True,..) command: ```python PYTORCH_ENABLE_XPU_FALLBA…

PenghuiCheng updated 3 months ago
3
mudler/LocalAI #3367

Can't start LocalAI (with REBUILD) on Xeon X5570 - Unwanted …

**LocalAI version:** Using Docker image: `localai/localai:latest-aio-gpu-hipblas` **Environment, CPU architecture, OS, and Version:** - Ubuntu 22.04 - Xeon X5570 [Specs](https://ark.intel.c…

chris-hatton updated 1 month ago
1
llvm/llvm-project #35539

[X86][AVX512] Failure to fold broadcast into 'zmm' instructi…

| | |
| --- | --- |
| Bugzilla Link | [36191](https://llvm.org/bz36191) |
| Version | trunk |
| OS | Windows NT |
| CC | @topperc,@ZviRackover |

## Extended Description ```ll define @mul…

RKSimon updated 7 months ago
1
