abishek-as / Netflix-Movie-Recommendation

This repository is for Netflix movie recommendations using various content and collaborative-based methods like Word2vec, Node2vec, Sentence Transformer, MiniBatchKMeans, Cosine Similarity, Pearson's Correlation, and Singular Value Decomposition (SVD).
4 stars 5 forks source link

Dataset unavailable #1

Open renovat10 opened 1 year ago

renovat10 commented 1 year ago

I have cloned the repo and while trying to run the notebooks, I got an error saying the dataset is not available. So I changed the notebook to read the csv file from the parent directory, and still got an error because the csv file included in the repo is not a proper csv file. Then I tried getting the dataset from the linked kaggle datasets. From there I found the titles dataset, but the prices dataset was not as described in the redme of the repo. It was a bunch of txt files actually. Where can I find the Prices data or is there a way to preprocess the txt files in the kaggle dataset to get the dataset?

renovat10 commented 1 year ago

Okay so apparently I needed to have git-lfs installed to clone the dataset. But even after getting it, I was unable to get the dataset since the maintainer isn't paying for the lfs service anymore. Is there someplace else that I can get the dataset? Or I there a way to preprocess the data in the kaggle dataset to be a usable csv file?