Experimental repo for testing various ways of aggregating information across video frame embeddings
Accuracy | |
---|---|
Top-1 | 0.31 |
Top-5 | 0.56 |
mean(Top1, Top5) | 0.44 |
Accuracy | |
---|---|
Top-1 | 0.41 |
Top-5 | 0.65 |
mean(Top1, Top5) | 0.53 |
EmbeddingDatasetReader isn't in the current pip install version of clip_video_encode. Please install with python setup.py install
from the cloned clip_video_encode repo.
Huggingface load_dataset is not currently implemented for the CLIP-Kinetics700 dataset (https://github.com/iejMac/clip-video-encode/issues/14). Please use:
git lfs install
git clone https://huggingface.co/datasets/iejMac/CLIP-Kinetics700
cd CLIP-Kinetics700
git lfs pull