yliu-cs / PiTe

PiTe: Pixel-Temporal Alignment for Large Video-Language Model
5 stars 1 forks source link