Humorloos / IE683


Write explanation why all dataset requirements are satisfied - @ashishrana160796 @subashp93 --- until 14:30 --- #9

Closed subashp93 closed 3 years ago

subashp93 commented 3 years ago

(Netflix User Interactions) https://www.kaggle.com/vodclickstream/netflix-audience-behaviour-uk-movies (User - Movie Interactions): This dataset has 671736 entities and 8 attributes. 6 attributes overlap with attributes of the other tables. There are no missing values at all, so this dataset satisfies our requirements.

subashp93 commented 3 years ago

https://www.kaggle.com/ashishgup/netflix-rotten-tomatoes-metacritic-imdb (Netflix movies): 15480 entities, 29 attributes. 15 attributes overlap and 24.5% of values are missing (less than 30%), so this dataset satisfies our requirements.

subashp93 commented 3 years ago

https://www.kaggle.com/ruchi798/movies-on-netflix-prime-video-hulu-and-disney/version/2 (Streaming movies): 16744 entities, 16 attributes. 11 attributes overlap and 8.5% of values are missing (less than 30%). This dataset also satisfies our requirements.

subashp93 commented 3 years ago

https://www.kaggle.com/stefanoleone992/imdb-extensive-dataset (IMDB Movies): 85855 entities, 22 attributes. 18 attributes overlap and 15.3% of values are missing (less than 30%), so this dataset satisfies our requirements.

subashp93 commented 3 years ago

https://www.kaggle.com/stefanoleone992/imdb-extensive-dataset?select=IMDb+names.csv (IMDB Actors): 297705 entities, 17 attributes. 4 attributes overlap and 43.8% of values are missing (more than 30%). The height attribute has 85% missing values, birth_detail has 62.8%, date_of_birth has 62.8%, place_of_birth has 65.1%, death_details has 86.6%, and date_of_death, place_of_death and spouses_string each have more than 80% null values. I think it's better to exclude these attributes from our dataset.
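
For reference, the missing-value percentages quoted in this thread could be reproduced with something like the sketch below. It is only a rough, standard-library illustration and not necessarily how the numbers above were computed. Assumptions: the helper names (`missing_share_per_column`, `columns_to_exclude`, `overall_missing_share`) are made up for this example, the set of tokens treated as "missing" is a guess, the 30% limit is applied per column here although the thread only states it for the whole table, and the sample rows are toy data that only borrow three column names from the IMDB Actors table.

```python
import csv
import io

# Assumption: which raw cell values count as "missing".
MISSING_TOKENS = {"", "na", "n/a", "nan", "null", "none"}


def missing_share_per_column(rows):
    """Return {column: fraction of rows whose value is missing}."""
    rows = list(rows)
    if not rows:
        return {}
    columns = list(rows[0].keys())
    counts = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            value = row.get(col)
            # DictReader fills short rows with None, so treat that as missing too.
            if value is None or value.strip().lower() in MISSING_TOKENS:
                counts[col] += 1
    return {col: counts[col] / len(rows) for col in columns}


def columns_to_exclude(shares, threshold=0.30):
    """Columns whose missing share is above the threshold (assumed 30%)."""
    return sorted(col for col, share in shares.items() if share > threshold)


def overall_missing_share(shares, columns=None):
    """Missing cells / all cells, restricted to `columns` if given.

    Every column has the same number of rows, so this equals the mean
    of the per-column shares.
    """
    cols = list(shares) if columns is None else list(columns)
    return sum(shares[c] for c in cols) / len(cols) if cols else 0.0


# Toy data, NOT taken from the real dataset.
SAMPLE = (
    "name,height,date_of_birth\n"
    "A,,1970-01-01\n"
    "B,,\n"
    "C,180,\n"
    "D,,\n"
)

shares = missing_share_per_column(csv.DictReader(io.StringIO(SAMPLE)))
excluded = columns_to_exclude(shares)
kept = [col for col in shares if col not in excluded]

print("per-column missing share:", shares)
print("columns to exclude:", excluded)
print("overall missing share before:", overall_missing_share(shares))
print("overall missing share after:", overall_missing_share(shares, kept))
```

To run this on the real file, replace `io.StringIO(SAMPLE)` with an opened `IMDb names.csv` (for example `open(path, newline="", encoding="utf-8")`, where `path` is wherever the Kaggle download was saved). Comparing the overall share before and after exclusion shows whether dropping the sparse attributes brings the table under the 30% limit.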