Closed APPLE-XMT closed 1 year ago
It's very normal to converge within 5 epochs since the dataset is very huge.
ok thanks for your answer
Hello, I see that you mentioned in the paper that the label range of the dataset is 0-4, but I found that there is no label of 0 in the amazon dataset. Why is this?
I also found this and don't know either.
Hello, it is a great honor to read your article. When I am doing the recurrence work, I always use the learning rate in the config file to achieve the best results in the 1-2 epoch. Is this correct?