MichiganCOG / A2CL-PT

Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)
MIT License

Usage of Beta #3

Closed arnavc1712 closed 2 years ago

arnavc1712 commented 4 years ago

Hi, I did not quite understand how multiplying the original TCAM by a scalar beta represents the background feature. When I debugged it, I found that it assigns approximately 1/num_clips of the attention to each clip of the video.

For example, if the softmax TCAM for a certain class over 10 time steps is [0.0839, 0.1689, 0.1689, 0.0767, 0.0798, 0.0798, 0.1025, 0.0767, 0.0798, 0.0831], then after multiplying by beta=0.01 the softmax TCAM becomes [0.0999, 0.1006, 0.1006, 0.0998, 0.0998, 0.0998, 0.1001, 0.0998, 0.0998, 0.0999].

This basically assigns an equal score to each clip.
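
For reference, a minimal sketch of this effect (the raw TCAM scores below are hypothetical and chosen only to roughly reproduce the softmax values above; the point is the flattening behavior):

```python
import torch
import torch.nn.functional as F

# Hypothetical raw (pre-softmax) TCAM scores for one class over 10 clips
s = torch.tensor([2.10, 2.80, 2.80, 2.01, 2.05, 2.05, 2.30, 2.01, 2.05, 2.09])

attn = F.softmax(s, dim=0)            # original attention, peaked on high-scoring clips
attn_bg = F.softmax(0.01 * s, dim=0)  # after scaling by beta=0.01: nearly uniform

print(attn)     # roughly [0.084, 0.169, 0.169, ...] -- concentrated on a few clips
print(attn_bg)  # ~0.1 everywhere, i.e. about 1/num_clips per clip
```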

kylemin commented 4 years ago

Hi, thank you for your interest. Let's say that beta is 0. Then the new attention (Eq. (6) of the paper) is constant over time, so it is supposed to have lower values for the activity features than the original attention (Eq. (2) of the paper), and correspondingly higher values for the background features. This statement holds for any beta lower than 1.

We found that randomly drawing beta from [0.001, 0.1] for each training sample produces good performance. Of course, a higher beta makes the triplet (Eq. (7)) harder (like hard example mining), which might provide a better training signal... but we did not confirm this. More experiments are needed to validate it!

Kyle
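
A quick sketch of this claim (the scores and the 5-clip example are hypothetical; only the ordering of the attention values matters):

```python
import torch
import torch.nn.functional as F

# Hypothetical raw TCAM scores: clip 0 is a high-scoring "activity" clip,
# the remaining clips are low-scoring "background" clips
s = torch.tensor([5.0, 1.0, 1.2, 0.9, 1.1])

attn = F.softmax(s, dim=0)  # original attention: strongly peaked on clip 0

# Randomly sample beta from [0.001, 0.1] per training sample, as described above
beta = 0.001 + (0.1 - 0.001) * torch.rand(1)
attn_new = F.softmax(beta * s, dim=0)  # scaled attention: much flatter

# With beta < 1 the activity clip gets less attention and the background
# clips get more attention than under the original attention
assert attn_new[0] < attn[0]
assert (attn_new[1:] > attn[1:]).all()
```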