intel-intrinsics Search Results

1000+ results
for intel-intrinsics


arrayfire/arrayfire #3011

Max method seems to be sequential for large data on CPU back…

The max method seems to be sequential for large data on the CPU backend, while iterating over the whole data using gfor is much faster. I wanted to use atomic intrinsic methods with gfor but they are unsupported…

mehran-kh-z updated 4 years ago
3
dotnet/runtime #13026

Add CIL instructions to detect arithmetic overflows without …

The current implementation of the CIL ISA runs counter to the design [guidelines for .NET exception handling](https://docs.microsoft.com/en-us/dotnet/standard/exceptions/best-practices-for-exceptions)…

RobertBouillon updated 4 weeks ago
12
hsivonen/simd #6

More intrinsics?

I've only started really learning about SIMD and how to use it about three days ago. I'm trying to convert some code from Sleef: https://github.com/shibatch/sleef to Rust-SIMD. However, some of their …

rennis250 updated 6 years ago
3
dotnet/runtime #256

Support for Intel SHA extensions

Intel SHA instructions assist with hardware acceleration of the SHA-1 and SHA-256 hash algorithms. Current Ryzen processors that support these instructions can reach SHA-256 speeds of around 2 GB/s…

Thealexbarney updated 2 years ago
23
Chowdhury-DSP/chowdsp_fft #1

Using chowdsp::fft::FFT_REAL may cause crashes

Hello. I happened to find your article when I searched for "PFFFT avx", and since I was using single-precision PFFFT, I was trying to switch to chowdsp_fft. However, as a result, it started crashin…

lewloiwc updated 6 days ago
4
halide/Halide #4610

Performance degrading with parallel, unroll etc scheduling m…

Hi, I am working on Windows, and my Intel processor supports AVX2 intrinsics. I tried the scheduling tutorial example (https://halide-lang.org/tutorials/tutorial_lesson_05_scheduling_1.html) and…

jsksra1 updated 4 years ago
3
llvm/llvm-project #64204

[i686] Cannot select llvm.{min,max}imum.{f32,f64}

It seems that none of these intrinsics are currently implemented for non-x86_64 (with the x86 backend): - llvm.minimum.f32 - llvm.minimum.f64 - llvm.maximum.f32 - llvm.maximum.f64 Error outpu…

Urgau updated 1 year ago
2
Weijun-H/Read-Some-Paper #31

The FastLanes Compression Layout: Decoding >100 Billion Inte…

**Abstract** The open-source FastLanes project aims to improve big data formats, such as Parquet, ORC and columnar database formats, in multiple ways. In this paper, we significantly accelerate dec…

Weijun-H updated 8 months ago
1
spack/spack #25696

Installation issue: libxsmm

### Steps to reproduce the issue ```console spack install libxsmm@1.16.1 %gcc@11.2.0 ``` ### Information on your system ```console spack debug report * **Spack:** 0.16.2-3941-79c2d55830 …

amaji updated 1 year ago
7
m-a-d-n-e-s-s/madness #168

need AVX-512 kernels

We need to support AVX-512 instructions in order to support Knights Landing (KNL) and future Xeon processors effectively. AVX-512 is not monolithic (see [Wikipedia](https://en.wikipedia.org/wiki/AVX-…

jeffhammond updated 7 years ago
5

