Where is the video interpolation implemented in the code?

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

https://llava.hliu.cc

Apache License 2.0


Open lucasjinreal opened 4 months ago

lucasjinreal commented 4 months ago

"... such as the implementation of linear scaling in rotary position embeddings (RoPE), we apply a similar scaling approach in LLaVA-NeXT. ..."

Could anyone point to the code that implements this linear scaling for video?
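
For context, linear RoPE scaling in general divides each position index by a scaling factor before the rotation angles are computed, so a longer sequence maps back into the position range seen during training. The sketch below shows that idea in plain Python. It is not taken from the LLaVA-NeXT codebase: the names `rope_angles`, `apply_rope`, and `scaling_factor` are hypothetical, and the interleaved pairing of dimensions is an assumption (some implementations pair the first and second halves of the vector instead).

```python
import math


def rope_angles(position, dim, base=10000.0, scaling_factor=1.0):
    """Rotation angles for one token position under linearly scaled RoPE.

    Hypothetical helper for illustration; not LLaVA-NeXT's actual code.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    # Linear scaling: divide the position index by the factor, so that
    # position 8 with factor 2 is treated like position 4 with factor 1.
    scaled_position = position / scaling_factor
    # Standard RoPE inverse frequencies: base^(-2i/dim) for each pair i.
    return [scaled_position * base ** (-2.0 * i / dim) for i in range(dim // 2)]


def apply_rope(vec, position, base=10000.0, scaling_factor=1.0):
    """Rotate consecutive pairs (vec[2i], vec[2i+1]) by the scaled angles."""
    angles = rope_angles(position, len(vec), base, scaling_factor)
    out = []
    for i, theta in enumerate(angles):
        x, y = vec[2 * i], vec[2 * i + 1]
        out.append(x * math.cos(theta) - y * math.sin(theta))
        out.append(x * math.sin(theta) + y * math.cos(theta))
    return out
```

With `scaling_factor=2.0`, a token at position 8 gets the same rotation as an unscaled token at position 4, which is the sense in which the usable sequence length is stretched.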