How to train MMT from scratch for other databases e.g. v3c1

gabeur / mmt

Multi-Modal Transformer for Video Retrieval

Apache License 2.0

259 stars 41 forks source link

First of all, thanks a lot for sharing such a great work. It is really interesting reading your paper and work.

Furthermore, If I want to analyze the performance of MMT on other databases like V3C1, how will I extract the expert embeddings for raw videos. Do I have to extract them on my own first and your code can only work for pre-computed features? or your code also extracts the experts features from videos? As in the code, pre-computed features for databases e.g. MSRVTT, LSMDC are provided and it does not seem that there is any file for extracting embedded features from pretrained experts that you used.

Is this possible for you to share the "models (pre-trained experts) and code" you used for extracting expert embedding from videos.

gabeur / mmt

How to train MMT from scratch for other databases e.g. v3c1 #15