Hi, thanks for the great research.
At the end of the paper, in 'Limitations and Future Work', you write:
"Second, the input of our methods is restricted to two consecutive frames, which results in the inability to leverage information from multiple consecutive frames. In future work, we will attempt to extend our approach to multi-frame inputs without introducing excessive overhead."
Would you be able to share any general thoughts on how you would approach this?
Thank you!