ClimbsRocks / machineJS

[UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
https://github.com/ClimbsRocks/auto_ml
408 stars, 64 forks

allow custom validation splits, like numer.ai uses #150

Closed ClimbsRocks closed 8 years ago

ClimbsRocks commented 8 years ago

we want to support a custom validation split, like numer.ai uses, but keep the rest of the process intact.

we'll have to:

  1. add in "validationSplit" as a column type in data-formatter
  2. remove that column from the rest of the data
  3. save that column as its own separate entity, like we do with the id column
  4. set a flag for "custom validation split".
  5. make sure this flag gets saved with fileNames.
  6. in the file where we create the validation splits, check this flag first.
  7. if it's set, then just create the validation split from the validationSplitColumn.npz file. don't worry about running checks against data length or any of that.

from there, we should just be able to follow the normal process.
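
a rough sketch of the steps above, in plain Python. This is illustrative only and is not the actual data-formatter or machineJS code: the function names, the column-type labels other than "validationSplit", the "1 means validation" convention, and the fallback "hold out the last rows" behaviour are all assumptions made for the example. The real pipeline would load the saved column from validationSplitColumn.npz rather than pass it around in memory.

```python
# Hypothetical sketch: every name here is illustrative, not machineJS's API.

def split_off_validation_column(header, column_types, rows):
    """Steps 1-4: pull the "validationSplit" column out of the data.

    Returns the remaining header, the remaining rows, the extracted
    column (or None), and a flag saying whether a custom split exists.
    """
    labels = [t.lower() for t in column_types]
    if "validationsplit" not in labels:
        return header, rows, None, False

    idx = labels.index("validationsplit")
    split_column = [row[idx] for row in rows]
    kept_header = header[:idx] + header[idx + 1:]
    kept_rows = [row[:idx] + row[idx + 1:] for row in rows]
    return kept_header, kept_rows, split_column, True


def make_validation_split(rows, split_column=None, has_custom_split=False,
                          holdout_fraction=0.2):
    """Steps 6-7: check the flag first, then build the split."""
    if has_custom_split:
        # Assumption: the marker "1" means the row belongs to validation.
        # No length or ratio checks are run on a user-supplied split.
        is_validation = [str(v).strip() == "1" for v in split_column]
    else:
        # Stand-in for the normal process: hold out the last rows.
        cutoff = len(rows) - int(len(rows) * holdout_fraction)
        is_validation = [i >= cutoff for i in range(len(rows))]

    train = [r for r, v in zip(rows, is_validation) if not v]
    validation = [r for r, v in zip(rows, is_validation) if v]
    return train, validation


# Usage with made-up data: the third column marks the validation rows.
header = ["id", "feature1", "isValidation", "target"]
column_types = ["id", "numerical", "validationSplit", "output"]
rows = [
    ["a", 0.1, "0", 1],
    ["b", 0.2, "1", 0],
    ["c", 0.3, "0", 1],
    ["d", 0.4, "1", 0],
]

kept_header, kept_rows, split_column, has_custom_split = (
    split_off_validation_column(header, column_types, rows)
)
train, validation = make_validation_split(
    kept_rows, split_column, has_custom_split
)
```

keeping the flag and the extracted column separate from the feature data means everything downstream of the split can stay unchanged, which is the point of the plan above.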

ClimbsRocks commented 8 years ago

finished! check out the docs at data-formatter for how to use it