haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
https://llava.hliu.cc
Apache License 2.0
19.54k stars 2.15k forks

Where is the video interpolation implemented in the code? #1532

Open lucasjinreal opened 4 months ago

lucasjinreal commented 4 months ago

such as the implementation of linear scaling in rotary position embeddings (RoPE), we apply a similar scaling approach in LLaVA-NeXT. B

Could anyone point to the code that implements this linear scaling for video?
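
For context, here is a minimal sketch of what linear scaling in RoPE generally means: each position index is divided by a scaling factor before the rotation angles are computed, so a longer sequence is mapped back into the position range seen during training. This is not taken from the LLaVA-NeXT code. The function names (`rope_angles`, `apply_rope`) and the parameters (`base`, `scaling_factor`) are hypothetical and only illustrate the general technique.

```python
import math


def rope_angles(position, dim, base=10000.0, scaling_factor=1.0):
    """Rotation angles for one position under linearly scaled RoPE.

    Hypothetical helper for illustration; not the LLaVA-NeXT implementation.
    """
    # Linear scaling: divide the position index by the factor so that a
    # longer sequence maps into the originally trained position range.
    scaled_position = position / scaling_factor
    # One inverse frequency per pair of channels, as in standard RoPE.
    inv_freq = [base ** (-2.0 * i / dim) for i in range(dim // 2)]
    return [scaled_position * f for f in inv_freq]


def apply_rope(vec, position, base=10000.0, scaling_factor=1.0):
    """Rotate consecutive channel pairs of vec by the scaled RoPE angles."""
    angles = rope_angles(position, len(vec), base, scaling_factor)
    out = []
    for i, theta in enumerate(angles):
        x, y = vec[2 * i], vec[2 * i + 1]
        out.append(x * math.cos(theta) - y * math.sin(theta))
        out.append(x * math.sin(theta) + y * math.cos(theta))
    return out
```

With `scaling_factor=2.0`, position 8 gets the same angles as position 4 without scaling, which is the effect I would expect to find somewhere in the video code path.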