simd-vectorization Search Results

1000+ results
for simd-vectorization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kokkos/kokkos #2642

Add some variant for getting "ThreadVectorRange" to translat…

Sometimes the compiler heuristic with `pragma ivdep` does not get us the desired vectorization. There should be some way to opt in to more aggressive vectorization via `pragma omp simd` for the `OpenM…

AndrewGaspar updated 1 year ago
6
aesara-devs/aesara #685

Set `error_model="numpy"` in `njit` decorator

If we don't set this option, we could miss some vectorization/SIMD optimizations in our Numba conversions.

brandonwillard updated 1 year ago
2
dsharlet/LiveSPICE #99

Use SIMD to optimize some things

Row reduction on numeric types (as opposed to symbolic `Expression`s) is fairly SIMD friendly. However, when I've profiled this, `RowReduce` doesn't seem to take a lot of time. The other big row reduc…

dsharlet updated 3 years ago
2
pytorch/pytorch #122281

[inductor][cpu] basic_gnn_gcn fp32 static/dynamic shape defa…

### 🐛 Describe the bug new_perf_regression in 2024-02-11 | suite | name | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | induc…

WeizhuoZhang-intel updated 3 days ago
3
apache/skywalking #11337

[BanyanDB] Support SIMD instruction

### Search before asking - [X] I had searched in the [issues](https://github.com/apache/skywalking/issues?q=is%3Aissue) and found no similar feature requirement. ### Description Vectorization has …

sollhui updated 1 year ago
1
E3SM-Project/EKAT #38

Vectorization inconsistency between packed min/max

**Describe the bug** I cannot understand why min and max have different vectorization macros (one has `vector_disabled`, while the other has `vector_simd`). It appears to me as a potential bug. **…

bartgol updated 4 years ago
4
snapview/tungstenite-rs #36

Consider SIMD for unmasking

Now that SIMD intrinsics for x86 have been stabilized, it might be worthwhile to add explicit SIMD to accelerate unmasking. For example, autobahn-python [uses](https://github.com/crossbario/autobahn-p…

Shnatsel updated 6 years ago
8
openmc-dev/openmc #2844

Random Ray Inner Loop Optimization + Vectorization

Performance of the random ray solver in OpenMC is highly sensitive to the performance of the flux attenuation kernel that forms the inner loop of the simulation. This inner loop is responsible for per…

jtramm updated 7 months ago
1
llvm/llvm-project #100293

[InstCombine] Missed optimization after #84628

## Statement I found a C++ code pattern missed optimization after #84628 that is widely used in [Verilator](https://github.com/verilator/verilator.git) generated C++ codes which consume [CIRCT](htt…

cyyself updated 1 month ago
5
kokkos/kokkos.github.io #67

Kokkos Core: SIMD

crtrott updated 9 months ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for simd-vectorization

1000+ results
for simd-vectorization