DistrictDataLabs / cultivar

Multidimensional data explorer and visualization tool.
http://trinket.districtdatalabs.com
Apache License 2.0
52 stars 18 forks source link

Research Auto-analysis Feature #14

Open bbengfort opened 9 years ago

bbengfort commented 9 years ago

Auto analysis assigns each column/feature a data type (dtype in the parlance of NumPy and Pandas), e.g. categorical, numeric, real, integer, etc. This types must be automatically inferred from the dataset.

Questions to answer:

Interesting stuff/libraries in: Data Type Recognition/Guessing of CSV data in python

bbengfort commented 8 years ago

Deliverable: A markdown document in the docs section that describes auto-analysis, answering the above questions.

Note: our documentation generator is mkdocs, which you'll need pip install it, and learn to use it.