Closed njansson closed 1 month ago
TGV fails to converge.
Some infinity shows up for the start residual and pressure is also at a massive scale starting the second iteration.
Details
Do you get any other error messages? The kernel uses quite some shared memory.
Also, it might be something with the padded version. (lx 4, 8, 16)
TGV fails to converge. Some infinity shows up for the start residual and pressure is also at a massive scale starting the second iteration. Details
Do you get any other error messages? The kernel uses quite some shared memory.
Also, it might be something with the padded version. (lx 4, 8, 16)
Nope, the crash is triggered on a floating point exception not a memory issue.
TGV fails to converge. Some infinity shows up for the start residual and pressure is also at a massive scale starting the second iteration. Details
Do you get any other error messages? The kernel uses quite some shared memory. Also, it might be something with the padded version. (lx 4, 8, 16)
Nope, the crash is triggered on a floating point exception not a memory issue.
Sure, the issue is a in the padded version (indexing issues)
TGV fails to converge. Some infinity shows up for the start residual and pressure is also at a massive scale starting the second iteration. Details
Do you get any other error messages? The kernel uses quite some shared memory. Also, it might be something with the padded version. (lx 4, 8, 16)
Nope, the crash is triggered on a floating point exception not a memory issue.
Sure, the issue is a in the padded version (indexing issues)
Should be fixed now
The combined ax_helm performs ~20% faster than issuing 3x standard ax_helm kernels (on an Nvidia A100)