It might be good to take a version of the dataset hosted on a public library like Zenodo (or other).
I am wondering if Kaggle has a license over the dataset that prevents from making it available publicly, or if they have a license that prevents such practices.
http://biostat.mc.vanderbilt.edu seems to be down, thus the link http://biostat.mc.vanderbilt.edu/wiki/pub/Main/DataSets/titanic3.xls does not work.
It might be good to take a version of the dataset hosted on a public library like Zenodo (or other).
I am wondering if Kaggle has a license over the dataset that prevents from making it available publicly, or if they have a license that prevents such practices.
I used: https://github.com/joanby/python-ml-course/raw/master/datasets/titanic/titanic3.xls instead. It seems that it is the same dataset.