[Closed] Fangkang515 closed this issue 2 years ago.
The JAX code (in its current state) is not optimized for speed; it could likely be sped up substantially with further engineering (in JAX), and indeed this is one of our TODOs going forward. While developing Plenoxels, we often used this higher-level version to try out ideas, and then optimized the final version in CUDA for speed. Certainly the speed of the CUDA version is in part due to hand-engineering the implementation (and even that version could probably be accelerated further), but this hand-engineering speedup was only really possible because of the simplicity of the method. Note also that most automatic differentiation libraries are already optimized for neural networks, so while you could probably get some benefit from re-implementing something like the original (TensorFlow) NeRF in CUDA, I'm not sure how large the speedup would be.
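To illustrate the point, here is a minimal sketch (not the repository's actual code; `trilinear_lookup`, the grid shape, and the batch size are assumptions for illustration) of the kind of high-level voxel-grid interpolation a JAX version expresses. Each step materializes intermediate arrays, whereas a hand-written CUDA kernel can fuse the whole lookup into one pass over memory, which is one reason the gap can be large even though JAX itself also runs on the GPU.

```python
import jax
import jax.numpy as jnp

@jax.jit
def trilinear_lookup(grid, pts):
    """grid: (X, Y, Z, C) voxel features; pts: (N, 3) continuous coordinates
    in voxel units. Returns (N, C) trilinearly interpolated features."""
    lo = jnp.floor(pts).astype(jnp.int32)      # lower corner indices
    frac = pts - lo                            # fractional offsets in [0, 1)
    res = jnp.array(grid.shape[:3])
    lo = jnp.clip(lo, 0, res - 2)
    hi = lo + 1

    def gather(ix, iy, iz):
        # Advanced indexing gathers one corner per sample point: (N, C)
        return grid[ix, iy, iz]

    fx, fy, fz = frac[:, 0:1], frac[:, 1:2], frac[:, 2:3]
    c000 = gather(lo[:, 0], lo[:, 1], lo[:, 2])
    c100 = gather(hi[:, 0], lo[:, 1], lo[:, 2])
    c010 = gather(lo[:, 0], hi[:, 1], lo[:, 2])
    c110 = gather(hi[:, 0], hi[:, 1], lo[:, 2])
    c001 = gather(lo[:, 0], lo[:, 1], hi[:, 2])
    c101 = gather(hi[:, 0], lo[:, 1], hi[:, 2])
    c011 = gather(lo[:, 0], hi[:, 1], hi[:, 2])
    c111 = gather(hi[:, 0], hi[:, 1], hi[:, 2])

    # Interpolate along x, then y, then z; every line below is a separate
    # GPU array op unless XLA manages to fuse it.
    c00 = c000 * (1 - fx) + c100 * fx
    c10 = c010 * (1 - fx) + c110 * fx
    c01 = c001 * (1 - fx) + c101 * fx
    c11 = c011 * (1 - fx) + c111 * fx
    c0 = c00 * (1 - fy) + c10 * fy
    c1 = c01 * (1 - fy) + c11 * fy
    return c0 * (1 - fz) + c1 * fz

# Example usage with an assumed 128^3 grid of 4-channel features.
grid = jnp.zeros((128, 128, 128, 4))
pts = jax.random.uniform(jax.random.PRNGKey(0), (8192, 3)) * 126.0
feats = trilinear_lookup(grid, pts)  # (8192, 4)
```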
Thanks for your amazing work.
“but is much slower (roughly 1 hour per epoch) than the CUDA implementation https://github.com/sxyu/svox2 (roughly 1 minute per epoch)”
As mentioned above, the JAX implementation is much slower than the CUDA implementation. As far as I know, JAX can also be accelerated on the GPU (CUDA). Why is the speed gap so large? And is the fast training of Plenoxels due to the CUDA implementation rather than to not using a neural network?