pmixer / SASRec.pytorch

PyTorch(1.6+) implementation of https://github.com/kang205/SASRec
Apache License 2.0
331 stars 90 forks source link

Something confused about the datasets #19

Open RealHanYing opened 2 years ago

RealHanYing commented 2 years ago

The codes are running well, but if I substitute the given dataset for the dataset used in Tisasrec which was also wirtten by you (the first line is 1 1193 5 ...) the result is too good to be normal (NDCG is bigger than 0.75 after 20 epochs). But I have not found the reason till now. What is the difference if I use the dataset in Tisasrec with the third column and forth column unused? Thank you. ps. If I use the dataset with the first line 1 2970 4 ..., which is a re-order version of the dataset mentioned above, the result is normal...

pmixer commented 2 years ago

@RealHanYing that sounds interesting, you are encouraged to find the root cause on your own just like how you find the problem by playing with datasets, pls make sure the dataset got parsed well by print() or other better validation methods.