justinetaylor / mids-w207-final-project

Clear Cut Solution - https://www.kaggle.com/c/forest-cover-type-prediction
1 stars 1 forks source link

mids-w207-final-project

Primary Files:

  1. exploratory_data_analysis.ipynb: Jupyter notebook with a detailed analysis of the training data
  2. feature_engineering.py: Python library containing all transformations
  3. models.py: Python library containing all models and configurations
  4. clear_cut_solution.ipynb: Jupyter notebook with descriptions, solutions and test results

Repo Map

Computing Environment

Work was conducted in the kmartcontainers/207final container (Dockerhub link). It is a custom container which adds the xgboost library to the jupyter/tensorflow-notebook docker container as put together by the jupyter development team. Details of how to set up the container to run on your machine or GCP as well as details of the container creation are in the comp_setup/ComputeSetup.md file.