qianqianwang68 / omnimotion

Apache License 2.0
2.14k stars 126 forks source link

How is it different from DeepMind's TapNet? #6

Open yhyu13 opened 1 year ago

yhyu13 commented 1 year ago

https://github.com/deepmind/tapnet#tapir-demos

Deepmind had a similar work using the same testing images in your work? How is your work different from deepmind fundamentally?

I am a hobbyist, so it's gratfull for you to spend time explaining briefly the differences, purpose, approach, and result wise.

Thanks!

qianqianwang68 commented 1 year ago

Thank you for your question! TAPIR is undoubtfully an amazing work. Our method and TAPIR are fundamentally different in the way they work and I believe they are complementary.

Lastly, in my opinion, we need both generalizable methods like TAPIR which learns very useful priors from data, and test-time optimization methods like ours that can take the noisy motion data and refine them for a particular video sequence for better quality and coherence.

dongxinyu1030 commented 1 year ago
  • To perform the optimization, our method takes the raw tracking results from existing methods as the noisy supervising signal.

For this step : "To perform the optimization, our method takes the raw tracking results from existing methods as the noisy supervising signal." Do your method need trajectories across all video frames or just frames before the current time t?

qianqianwang68 commented 1 year ago

It takes the trajectories across all video frames.