pykt-team / pykt-toolkit

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models
https://pykt.org
MIT License
194 stars 53 forks

IEKT fails to converge when trained on the Bridge2006 dataset #149

Open MyGithub1234567890 opened 7 months ago

MyGithub1234567890 commented 7 months ago

[Image: f3a2747b61de6ce8701217ff30fa6ee]

sonyawong commented 6 months ago


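As I understand it, IEKT's training procedure incorporates the Policy Gradient reinforcement learning algorithm (Section 4.3, Model Learning, of the paper), so the loss oscillates. However, the validation AUC keeps rising throughout, which shows the model is still learning, right up until it hits the early stopping criterion we set.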
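
To make the point concrete, here is a minimal sketch of early stopping driven by validation AUC rather than by training loss. It is not pyKT's actual implementation: the function name `early_stop_epoch`, the `patience` value, and all the numbers are assumptions made up for illustration. It only shows that an oscillating loss does not trigger the stop as long as the validation AUC keeps improving.

```python
def early_stop_epoch(valid_aucs, patience=10):
    """Return (best_epoch, best_auc, stop_epoch) for a list of per-epoch validation AUCs.

    Hypothetical helper: training stops once `patience` epochs have passed
    without a new best validation AUC. Epochs are numbered from 1.
    """
    best_auc = float("-inf")
    best_epoch = -1
    for epoch, auc in enumerate(valid_aucs, start=1):
        if auc > best_auc:
            # New best validation AUC: remember it and keep training.
            best_auc, best_epoch = auc, epoch
        elif epoch - best_epoch >= patience:
            # No improvement for `patience` epochs: stop here.
            return best_epoch, best_auc, epoch
    # Ran out of epochs before the patience budget was exhausted.
    return best_epoch, best_auc, len(valid_aucs)


# Made-up numbers: the loss bounces around, the validation AUC climbs, then plateaus.
train_losses = [0.62, 0.71, 0.58, 0.75, 0.60, 0.73]
valid_aucs = [0.70, 0.72, 0.74, 0.75, 0.75, 0.75]

# The training loss is never consulted; only the validation AUC decides when to stop.
print(early_stop_epoch(valid_aucs, patience=2))
```

With these illustrative values the best checkpoint is epoch 4, and training stops at epoch 6 once two epochs have passed without a better validation AUC, regardless of how the loss behaves.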