liveseongho closed this issue 3 years ago
Hi,
Thanks for your interest in this project.
We have provided the best training config. The performance reported in the paper is from 5000 steps on 8 GPUs.
For hard negatives, we strictly followed the original TVR work on how the model is trained. Please check their repo. Thanks.
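A note for anyone reproducing this on fewer than 8 GPUs: in data-parallel training, the same number of optimizer steps covers fewer samples unless gradient accumulation makes up the difference. The sketch below is only a rough illustration of that arithmetic. It assumes the per-GPU batch size is held fixed; the function names and the per-GPU batch value of 32 are hypothetical and not taken from the HERO config.

```python
def effective_batch(per_gpu_batch: int, n_gpus: int, grad_accum: int = 1) -> int:
    """Samples consumed per optimizer step under plain data parallelism."""
    return per_gpu_batch * n_gpus * grad_accum


def accum_to_match(ref_gpus: int, my_gpus: int, ref_accum: int = 1) -> int:
    """Gradient-accumulation steps needed on `my_gpus` GPUs to match the
    effective batch of a reference run (assumes equal per-GPU batch size)."""
    total = ref_gpus * ref_accum
    if my_gpus <= 0 or total % my_gpus != 0:
        raise ValueError("reference batch cannot be split evenly across my_gpus")
    return total // my_gpus


# Hypothetical numbers: the reference run uses 8 GPUs, the local run uses 2.
PER_GPU_BATCH = 32  # assumed value for illustration only
accum = accum_to_match(ref_gpus=8, my_gpus=2)
print(accum)
print(effective_batch(PER_GPU_BATCH, 8) == effective_batch(PER_GPU_BATCH, 2, accum))
```

If the effective batch sizes do not match, a 5000-step run is not directly comparable to the 8-GPU setting described above.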
Thanks for your quick response! 😃
Hi, I'm trying to fine-tune the HERO pretrained model on the TVR dataset. But with 5000 or 10000 training steps, I failed to reach the performance reported in the paper.
Also, the paper doesn't describe anything about hard negative sampling, but it seems to be important.