mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
https://mbzuai-oryx.github.io/Video-ChatGPT
Creative Commons Attribution 4.0 International
1.05k stars 92 forks source link

linear layer #31

Closed Adeelyousaf closed 11 months ago

Adeelyousaf commented 11 months ago

Hi, can you confirm the dimensions of the linear layer that you learned?

Kind Regards

hanoonaR commented 11 months ago

Hi @Adeelyousaf ,

Thank you for your interest in our work. The dimensions of the learned linear layer is: 1024 x 4096. Please let us know if you have any further questions.