Closed yangjiangeyjg closed 3 years ago
Hi, thanks for your interest. In fact, we follow the convention where the softmax function is preferred to sigmoid for video-level classification. This is, I conjecture because the cross-entropy with softmax is easier to optimize than the binary cross-entropy with sigmoid when considering the small dataset size (e.g., 200 videos for THUMOS'14). In addition, the softmax function is enough to get accurate video-level predictions.
Hi, thanks for your interest. In fact, we follow the convention where the softmax function is preferred to sigmoid for video-level classification. This is, I conjecture because the cross-entropy with softmax is easier to optimize than the binary cross-entropy with sigmoid when considering the small dataset size (e.g., 200 videos for THUMOS'14). In addition, the softmax function is enough to get accurate video-level predictions.
Thanks!
It is a multi-label classification problem. @Pilhyeon