OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Apache License 2.0
1.44k stars 88 forks source link

Allow InternVideo2/multi_modality to be pip installed #195

Closed rongou closed 1 week ago

rongou commented 1 month ago

We are trying to use the InternVideo2 multi_modality model, but right now it's quite painful, we have to git clone the repo, fix the imports, and use a custom pyproject.toml to pip install it.

This change makes it a little easier. Once merged, you can run:

pip install \
"git+https://github.com/OpenGVLab/InternVideo.git@main#egg=internvideo2_multi_modality&subdirectory=InternVideo2/multi_modality/"

to install it. For the two flash_attn modules, provide more pip workers:

MAX_JOBS=24 pip install \
"git+https://github.com/OpenGVLab/InternVideo.git@main#egg=internvideo2_multi_modality[extra-git-deps]&subdirectory=InternVideo2/multi_modality/"