Closed 993917172 closed 3 years ago
Our video feature extraction code is already available here. Our text embeddings are learned from the original text rather than taken from pre-extracted text features.