facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.
https://co-tracker.github.io/

Release CoTracker2.1 #104

Closed: lvoursl closed this 2 months ago

lvoursl commented 2 months ago

In this PR we're adding an updated version of CoTracker2 (named CoTracker2.1), which follows the same architecture but was trained with a different window size (16 instead of 8), leading to better performance on the TAP-Vid benchmarks.

The inference code via torch.hub has been updated and pushed, and new metrics in the query-first mode have also been released.

chenshihfang commented 2 months ago

@lvoursl Hi, will there be a comparison of training cost and inference speed between version 2.1 and version 2.0?

lvoursl commented 2 months ago

Hi @chenshihfang, we haven't done in-depth benchmarking, but our empirical observation is that a bigger window size leads to faster training and inference.
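
As a rough illustration of one factor that could contribute to this (not a benchmark, and not taken from the CoTracker code), the sketch below counts how many sliding windows are needed to cover a video for window sizes 8 and 16. It assumes consecutive windows overlap by half their length; the function name `count_windows` and the 48-frame clip are hypothetical choices made for the example.

```python
import math


def count_windows(num_frames: int, window_len: int) -> int:
    """Count sliding windows needed to cover `num_frames` frames.

    Assumption (for illustration only): consecutive windows overlap by
    half, so the window advances by window_len // 2 frames each step.
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    if window_len < 2:
        raise ValueError("window_len must be at least 2")

    step = window_len // 2
    if num_frames <= window_len:
        # A single window already covers the whole clip.
        return 1
    # One initial window, plus enough steps to reach the last frame.
    return math.ceil((num_frames - window_len) / step) + 1


if __name__ == "__main__":
    num_frames = 48  # hypothetical clip length
    for window_len in (8, 16):
        print(f"window={window_len}: {count_windows(num_frames, window_len)} windows")
```

Under these assumptions, a 48-frame clip needs 11 windows at size 8 and 5 windows at size 16. Fewer windows means fewer sequential passes over the clip, although each pass handles more frames, so the actual speedup would still need to be measured.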