jshi31 / NAFAE

Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses"
MIT License
30 stars 5 forks source link