jrzaurin / pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
Apache License 2.0
1.3k stars 190 forks source link

datset #225

Closed xylovezxy closed 2 months ago

xylovezxy commented 3 months ago

Hello, the Movielens dataset in this library only has a 100k version. Can you add a 1M version. This allows for the use of larger datasets.

jrzaurin commented 3 months ago

Hey @xylovezxy

Maybe you could do it yourself?

This time is really simple, you just need the code here: https://github.com/jrzaurin/pytorch-widedeep/tree/master/pytorch_widedeep/datasets

On our side we prefer to keep the datasets relatively small to be stored in github and also to be tractable to run quick experiments