elephantscale / datasets

1 stars 4 forks source link

where is this data? #1

Open markkerzner opened 6 years ago

markkerzner commented 6 years ago

complete_ratings_raw_data = sc.textFile('/data/movielens/ml-latest/ratings.csv')

in the movie lens lab

timfox456 commented 6 years ago

There's a script called download.sh that you can use to download the data. It's not checked in. The url is at: http://files.grouplens.org/datasets/movielens/ml-latest.zip

markkerzner commented 6 years ago

do we want it as part of the dataset or as part of the lab for automatic download?

Mark Kerzner, President, Elephant Scale http://elephantscale.com/, LinkedIn https://www.linkedin.com/in/markkerzner/ Book a call with me here http://www.meetme.so/markkerzner Mobile: 713-724-2534, Skype: mark.kerzner1 Twitter: @elephantscale https://twitter.com/elephantscale

On Thu, Sep 6, 2018 at 7:50 AM Tim Fox notifications@github.com wrote:

There's a script called download.sh that you can use to download the data. It's not checked in. The url is at: http://files.grouplens.org/datasets/movielens/ml-latest.zip

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/elephantscale/datasets/issues/1#issuecomment-419080771, or mute the thread https://github.com/notifications/unsubscribe-auth/AAk2eX_PPf7vfPM4GwLrXwe2kWM3Pw1-ks5uYRn7gaJpZM4WbYhI .

sujee commented 4 years ago

@williamjfermo investigate this and please update the lab instructions if necessary

williamjfermo commented 4 years ago

On it.

williamjfermo commented 4 years ago

Hi Sujee I looked at this I don't see any specific lab using just this dataset. I see the dataset for the movies located here.

The closest I could find that might use this data set is this but it is not in a lab format and don't see anywhere that it would make use of data unless lab would need to be built.

I was looking for something like this lab

sujee commented 4 years ago

this dataset would be used in one of these labs