The initial weights - Githubissues

Sorry for the late reply. In our implementation, we use UCF101 and CK+ to pretrain the network with their frames from samples. We consider UCF101 as the 'temporal' prior knowledge because the dataset contains more motion video information rather than facial information. We regard CK+ as 'spatial' knowledge because this dataset has a shorter temporal length but abundant facial emotion information. We will release the network weights later in our repository. Thank you for your attention to our work.

divertingPan / STA-DRN

The initial weights #3