Closed doraemonho closed 1 year ago
Feel free to open a PR.
Do you have/find cases where memory is the bottleneck compared to efficiency?
Feel free to open a PR.
Do you have/find cases where memory is the bottleneck compared to efficiency?
Definitely 3D simulation on GPU. With a high-end gaming/data center GPU, the speed is fine but we already approaching to the resolution limit of using a single GPU. Reducing 50% memory can further make the Cubes 1.25 times larger in length.
Cool! Go for it. Open a PR and I can help if needed.
Cool! Go for it. Open a PR and I can help if needed.
Thanks!I submitted the PR. I will need your help to review it and make the merge if the code is fine.
Hi,
I modify a bit of FourierFlows for adding a new time-stepper to support low-storage RK4 scheme. (The code is in below) This is a classical 5 stages 2 registers process to trade off 25~30% performance but saving 50% of storge space for RK4 method. [1]. I wonder could we make FourierFlows support this feature in the next update.
[1] M.H. Carpenter, C.A. Kennedy, Fourth-order 2N-storage Runge–Kutta schemes, Technical Report NASA TM-109112, NASA Langley Research Center, VA, June 1994.