UBC-MDS / DSCI_522_Breast_cancer_predictors

Decision tree analysis of breast cancer result metrics to deduce the strongest predictor of malignancy
Project workflow #7

Open nazra-inari opened 5 years ago

nazra-inari commented 5 years ago

Milos,

I went ahead and broke our project into steps. Each step could be its own script (where applicable):

1) Data import and wrangling
2) Exploratory data analysis
3) Split the data set into training and testing sets
4) Hyperparameter optimization for max_depth
5) Hyperparameter optimization for min_samples_split
6) Refit, run, and plot the final model
7) Generate the report

I'd really advise that you make notes or write a short paragraph for each section as you work on it. This will allow us to tie everything together with ease at the end.
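For reference, steps 3 to 6 could look roughly like the sketch below. It is only an outline under assumptions: it uses scikit-learn's bundled breast cancer data set as a stand-in for our wrangled data, and the grid values, the 80/20 split, and the random seed are placeholders, not decisions we have made. Plotting is left out.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.tree import DecisionTreeClassifier

# Stand-in data (assumption): scikit-learn's bundled breast cancer data set.
# The output of our own import/wrangling step would replace this.
X, y = load_breast_cancer(return_X_y=True)

# Step 3: split into training and testing sets (80/20 is a placeholder).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=522, stratify=y
)

# Step 4: cross-validated search over max_depth on the training set only.
depth_search = GridSearchCV(
    DecisionTreeClassifier(random_state=522),
    param_grid={"max_depth": list(range(1, 11))},
    cv=5,
)
depth_search.fit(X_train, y_train)
best_depth = depth_search.best_params_["max_depth"]

# Step 5: with max_depth fixed, search over min_samples_split.
split_grid = [2, 5, 10, 20, 40]
split_search = GridSearchCV(
    DecisionTreeClassifier(max_depth=best_depth, random_state=522),
    param_grid={"min_samples_split": split_grid},
    cv=5,
)
split_search.fit(X_train, y_train)
best_split = split_search.best_params_["min_samples_split"]

# Step 6: refit with the chosen values and score once on the held-out test set.
final_model = DecisionTreeClassifier(
    max_depth=best_depth, min_samples_split=best_split, random_state=522
)
final_model.fit(X_train, y_train)
test_accuracy = final_model.score(X_test, y_test)

print(f"max_depth={best_depth}, min_samples_split={best_split}")
print(f"test accuracy={test_accuracy:.3f}")
```

Tuning the two hyperparameters one after the other mirrors the step list; a single joint grid over both would be an alternative if we want to check for interactions.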

milicmil commented 5 years ago

Hey Arzan,

Sounds good.

I don't think there will be much wrangling, but it is good to double-check.

I will work on 3 and 4. Let me know when you are done with the import and the EDA. You mentioned you will work on 5, so let us meet tomorrow (the 23rd) to see what our results are and bring it all together.

nazra-inari commented 5 years ago

Hi Milos,

Made a few changes to the repo this morning.

As of now I have completed 1 and 2. Great that you've completed 3 and 4.

Here's what I am working on now:

I would like to know what you're working on, and whether you can also add your step 4 to the Rmd report in the reports folder. Please also convert your Jupyter notebook to a Python script.
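The usual way to do the conversion is `jupyter nbconvert --to script` followed by the notebook file name. If it helps to see what that amounts to, here is a minimal standard-library sketch that pulls only the code cells out of a notebook. The function name and the file names in the usage note are hypothetical, and it does not handle IPython magics such as `%matplotlib inline`, which would need to be removed by hand.

```python
import json
from pathlib import Path


def notebook_to_script(notebook_path, script_path):
    """Write the code cells of a Jupyter notebook to a plain .py file.

    Returns the number of code cells written.
    """
    notebook = json.loads(Path(notebook_path).read_text(encoding="utf-8"))

    chunks = []
    for cell in notebook.get("cells", []):
        # Skip markdown and raw cells; only code belongs in the script.
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # The notebook format stores source as a string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        if source.strip():
            chunks.append(source.rstrip())

    # Separate cells with a blank line so the script stays readable.
    Path(script_path).write_text("\n\n".join(chunks) + "\n", encoding="utf-8")
    return len(chunks)
```

For example, `notebook_to_script("step4.ipynb", "step4.py")` would write the code cells of a notebook named `step4.ipynb` (a made-up name) to `step4.py`.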