CodeSpaceHQ / MENGEL

A framework that applies machine learning algorithms and automates the process of finding the right algorithm for the job.
6 stars 1 forks source link

Text Handler code, tests, and README #109

Closed ZakeryFyke closed 8 years ago

ZakeryFyke commented 8 years ago

This adds three functions to the text_handler module. They allow for the replacement of text values, which pandas calls objects, with corresponding numeric categorical values based on the ratio of unique values to total values as given by the user. It also allows for the conversion of all text values which cannot be converted to numeric values to be converted to "NaN"s, which can then be dealt with by the data_fillers. Tests for all 3 functions and an update to the README are also included.