I ran the code from the repository without any modification. The results are as follows:
08/28/2021 06:50:45 [Epoch 100] dev acc: 0.70696 (took 220s)
08/28/2021 06:50:45 checkpoint: tmp/dtfixup
08/28/2021 06:50:45 best dev accuracy: 0.72340
08/28/2021 06:50:45 checkpoint: tmp/dtfixup
The best dev accuracy is only 72.3%. Maybe I missed something?
For the experiment configuration, I found that the batch size in the code is 32, while the batch size in the paper is 16. Could this difference explain the lower accuracy?