Closed DSaurus closed 6 months ago
I find an interesting thing. The spatial-temporal encoding process directly handles a 4-dimensional input to obtain its encoding. However, I remember that the input for TCNN should be either 2 or 3 dimensions.
Hi,
It does support up to 4D inputs. There are more efficient ways to do this point and time encoding of course.
Did you get any error? With what tcnn version?
No, I don't get an error. But does TCNN actually implement a huge O(n^4) hash grid to represent NeRF-T? That's amazing.
Yeah, I saw some recent reconstruction methods using 4D hash grids as well, like this NeurIPS work: https://arxiv.org/pdf/2310.17527.pdf
It is computationally heavy though. It might make sense to use a better representation for this, e.g., 4D planes or deformation based. The best thing would be nerfplayer, as it can represent deformations and newness content (like the fire, water from the firehydrant, etc.), which deformations alone can not handle.
Hi @sherwinbahmani ,
I'm working on implementing 4D-fy as an extension of threestudio in this repository and have invited you as a collaborator. You're welcome to join the project if interested.