andreeaiana / newsreclib

PyTorch-Lightning Library for Neural News Recommendation
https://newsreclib.readthedocs.io/en/latest/
MIT License
43 stars 8 forks source link

MIND dataset URLs update #23

Open gerardsimons opened 3 months ago

gerardsimons commented 3 months ago

It seems that the old MIND dataset URL are now defunct. I reached out to one of the authors and they pointed me to a new set of URLs which I have updated. I am testing it now withexperiment=nrms_mindsmall_pretrainedemb_celoss_bertsent but have updated other URLs including the tests, but not sure if that works as expecting

gerardsimons commented 3 months ago

NOTE that I noticed that the data was wrapped in another subdir, I fixed it but not very clean probably. It works for the train small, not sure about the larger datasets yet.