mboccenti / Recommender_system_project

Online courses recommender system with Machine Learning

Dataset Information #1

Closed: harshit-333 closed this issue 11 months ago

harshit-333 commented 11 months ago

Sir, can you please tell me where you got the link to this dataset, or how you accessed it?

mboccenti commented 11 months ago

The datasets are stored in the cloud as per the lab instructions.

    # Point to the datasets stored on the cloud
    course_genre_url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBM-ML321EN-SkillsNetwork/labs/datasets/course_genre.csv"
    ratings_url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBM-ML321EN-SkillsNetwork/labs/datasets/ratings.csv"

You can access them as pandas DataFrames:

    course_df = pd.read_csv(course_genre_url)
    ratings_df = pd.read_csv(ratings_url)
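
The loading step can be sketched as a small self-contained script. This is only a sketch, assuming pandas is installed and the two URLs are still reachable. The helper names `load_dataset` and `describe_dataset` are hypothetical and not part of the lab code, and nothing is assumed here about the column names in either file.

```python
import pandas as pd

# URLs as given in the lab instructions quoted in this thread.
COURSE_GENRE_URL = (
    "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/"
    "IBM-ML321EN-SkillsNetwork/labs/datasets/course_genre.csv"
)
RATINGS_URL = (
    "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/"
    "IBM-ML321EN-SkillsNetwork/labs/datasets/ratings.csv"
)


def load_dataset(source):
    """Read a CSV into a DataFrame.

    `source` may be a URL, a local file path, or a file-like object;
    pd.read_csv accepts all three.
    """
    return pd.read_csv(source)


def describe_dataset(df):
    """Return the row count and column names as a quick sanity check."""
    return {"rows": len(df), "columns": list(df.columns)}
```

Calling `load_dataset(COURSE_GENRE_URL)` and `load_dataset(RATINGS_URL)` would fetch the two files over the network. Passing the result to `describe_dataset` shows what was actually downloaded before any modelling starts.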

harshit-333 commented 11 months ago

So you just used these links directly to get the .csv files? I might be asked where I got this dataset, and I can't just show this link because it will not look authentic. What should my answer be?

It would be helpful if you could tell me more about this dataset, such as where it was collected from or how to find it manually, through Google or something similar.

mboccenti commented 11 months ago

Sure thing.

This project contains the datasets and the exercises for the IBM Machine Learning Certification labs by Yan Luo (https://www.linkedin.com/in/yan-luo-96288783): https://www.coursera.org/professional-certificates/ibm-machine-learning

Thanks.

harshit-333 commented 11 months ago

So that means the dataset is not publicly available on Kaggle or any other open source, apart from the .csv links you shared, right?

mboccenti commented 11 months ago

I can't answer that. If you would like more information about the origin and availability of the datasets, you should ask the author of the datasets and the course, whose link I provided. This project contains my notes and exercises for the course. Cheers.