Contributors: Sarah Eshafi, Hui Tang, Long Nguyen, Marek Boulerice
This repository covers a machine learning model analysis with a goal to predict angiographic coronary disease in patients. Data is pulled from patients undergoing angiography at the Cleveland Clinic in Ohio. This analysis is composed of Exploratory Data Analysis, testing of various machine models on a training data set, model optimization via hyperparameter, and final model performance analysis. The final model is shown to have promising results, though limitations apply and further testing and optimization is recommended.
To run the analysis:
note - the instructions in this section also depends on running this in a unix shell (e.g., terminal or Git Bash)
To replicate the analysis, install Docker. Then clone this GitHub repository and run the following command at the command line/terminal from the root directory of this project:
docker compose up
Copy the link from the output (the link would look like below)
and paste it to your browser and change the port number from 8888
to 9999
to launch jupyter notebook.
The software code contained within this repository is licensed under the MIT license. See the license file for more information.