issues
search
omlins
/
ParallelStencil.jl
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
BSD 3-Clause "New" or "Revised" License
301
stars
31
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix AD for threads
#104
omlins
closed
12 months ago
0
Add README documentation on AD and extend docstring documentation and unit tests
#103
omlins
closed
1 year ago
0
Add documentation, unit tests and small fixes for automatic differentiation
#102
omlins
closed
1 year ago
0
Add high-level support for architecture-agnostic automatic differentiation
#101
omlins
closed
1 year ago
0
AMDGPU v0.5.0 compat
#100
luraess
closed
12 months ago
1
Make generic Enzyme-based AD of ParallelStencil kernels more convenient
#99
omlins
closed
1 year ago
0
Make shared memory allocation robust for compilation throughout all CUDA/AMDGPU versions
#98
omlins
closed
1 year ago
0
Add documentation for memopt optimization, CellArrays and AMDGPU
#97
omlins
closed
1 year ago
0
Add documentation for memopt, CellArrays and AMDGPU
#96
omlins
closed
1 year ago
1
Remove need to have any packaged pre installed
#95
omlins
closed
1 year ago
0
Add error message for not supported memopt cases
#94
omlins
closed
1 year ago
0
Fix AMDGPU shared memory allocation
#93
omlins
closed
1 year ago
0
Non cartesian gather!
#92
LaurentPlagne
closed
1 year ago
2
CUDA Crash with julia 1.9.0
#91
LaurentPlagne
closed
1 year ago
8
How to implement custom finite differencing operators
#90
TakeTwiceDailey
opened
1 year ago
8
Example for init_global_grid_usage
#89
LaurentPlagne
closed
1 year ago
3
Thread (CPU) Float32/Float64 performance comparison on miniapp acoustic2D
#88
LaurentPlagne
closed
1 year ago
12
Create and update GPU unit tests
#87
omlins
closed
1 year ago
0
Generalize loopopt
#86
omlins
closed
1 year ago
0
Caching the GPU kernels?
#85
korbinian90
closed
1 year ago
1
enable dependabot for GitHub actions
#84
ranocha
opened
1 year ago
2
Make hide communication compatible with incremental compilation
#83
omlins
closed
1 year ago
0
`@hide_communication` error when inside a module and using `CUDA.jl` backend
#82
GiackAloZ
closed
1 year ago
5
Generalize memopt and create and update GPU unit tests
#81
omlins
closed
1 year ago
0
Debugging and profiling workflow
#80
smartalecH
opened
1 year ago
3
Efficient cache-aware and cache-oblivious implementations
#79
smartalecH
opened
1 year ago
1
Heterogenous stencil formulations
#78
smartalecH
closed
1 year ago
4
Improving loop kernel speed
#77
smartalecH
closed
1 year ago
2
dimensionality restrictions
#76
koehlerson
closed
1 year ago
3
Creating modules that depend on `ParallelStencil.jl`
#75
smartalecH
opened
1 year ago
3
Fix miniapp
#74
luraess
closed
10 months ago
0
Possible race condition
#73
luraess
closed
10 months ago
0
Fix kernel println
#72
smartalecH
closed
10 months ago
1
Code generation for multiple stencils
#71
smartalecH
opened
1 year ago
2
Control backend selection with Preferences
#70
jpsamaroo
closed
1 year ago
2
Add support for AMDGPU
#69
omlins
closed
1 year ago
0
Multithreaded array initialization
#68
carstenbauer
opened
1 year ago
8
Add working Julia environment(s) for the examples (toml files)
#67
carstenbauer
opened
1 year ago
2
Add initial implementation of loop based optimizations
#66
omlins
closed
1 year ago
0
Disable subnormals inside @parallel blocks
#65
smartalecH
closed
1 year ago
8
GPUCompiler error when running 3D diffusion example on the GPU
#64
muendlein
closed
1 year ago
4
Fix enum @fill without celldims
#63
omlins
closed
1 year ago
0
Enable allocation with enums using @fill and @rand
#62
omlins
closed
1 year ago
0
Question about @inn macro
#61
ttaczak
closed
1 year ago
4
Fix typo in doc
#60
luraess
closed
1 year ago
0
@fill and @alloc allocators
#59
albert-de-montserrat
closed
2 years ago
1
Remove non-used array
#58
luraess
closed
2 years ago
2
Add macro to compute harmonic averages
#57
albert-de-montserrat
closed
2 years ago
0
Update hide communication doc
#56
luraess
opened
2 years ago
0
Update README.md
#55
omlins
closed
2 years ago
0
Previous
Next