This module should take the raw data and process it reliably to create a feature data frame X, whose rows are the observations, and a label vector y holding the value we are trying to predict. Features should be left as raw as possible here so that multiple normalization schemes and other preprocessing choices can be tested downstream.
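A minimal sketch of what such a module could look like, assuming the raw data is already loaded into a pandas DataFrame. The function name `build_dataset`, the `label_col` parameter, and the example columns are hypothetical and chosen only for illustration. Dropping rows with a missing label is also an assumption about what "reliably" should mean here, not something the description above specifies.

```python
import pandas as pd


def build_dataset(raw: pd.DataFrame, label_col: str) -> tuple[pd.DataFrame, pd.Series]:
    """Split raw records into a feature frame X and a label series y.

    Hypothetical helper: the name and signature are illustrative only.
    """
    if label_col not in raw.columns:
        raise KeyError(f"label column {label_col!r} not found in raw data")

    # Assumption: rows without a label cannot be used for supervised
    # training, so they are dropped here.
    labelled = raw.dropna(subset=[label_col])

    y = labelled[label_col].copy()

    # Features are returned unscaled and unencoded so that different
    # normalization schemes can be compared in a later step.
    X = labelled.drop(columns=[label_col])
    return X, y


# Toy input, made up for the example.
raw = pd.DataFrame({
    "height_cm": [170.0, 182.5, 165.0, 158.0],
    "city": ["Oslo", "Lima", "Pune", "Oslo"],
    "label": [1, 0, None, 1],
})
X, y = build_dataset(raw, label_col="label")
```

X and y share the same index, so any later normalization step can transform X without losing the alignment between observations and labels.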