Open MonolithFoundation opened 1 month ago
Hi, looks like VILA trained a lot of videos, how does the being sampled? And how does it dealed with S2?
For this release, we use uniform sampling. Each frame can go through S2 w/o problem.
Hi, looks like VILA trained a lot of videos, how does the being sampled? And how does it dealed with S2?