Skdata is a library of data sets for machine learning and statistics. This module provides standardized Python access to toy problems as well as popular computer vision and natural language processing data sets.
The project is hosted on GitHub: http://jaberg.github.com/skdata
There are several options for installation:
From scratch:
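From a fresh git checkout, run one of the following:
python setup.py develop (development mode, so edits to the checkout take effect without reinstalling)
python setup.py install (regular installation)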
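After either command, a quick way to confirm that the package is visible to the interpreter is to look it up without importing it. The snippet below is a minimal sketch using only the standard library; the helper name `is_installed` is hypothetical and not part of skdata.

```python
import importlib.util


def is_installed(package_name="skdata"):
    """Return True if a top-level package can be located on the import path.

    Hypothetical helper for illustration; it only checks that the package
    can be found and does not import or exercise it.
    """
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    if is_installed("skdata"):
        print("skdata is importable")
    else:
        print("skdata was not found on the import path")
```

If the check fails after `python setup.py develop`, one likely cause is that a different Python interpreter or virtual environment was used for installation than for the check.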
For documentation, see http://jaberg.github.com/skdata
Join the mailing list: https://groups.google.com/forum/#!forum/skdata
GitHub maintains an up-to-date list of direct contributors: https://github.com/jaberg/skdata/graphs
A special thanks goes to David Cox, who provided inspiration and design guidance, and generally got this project started.
This work was supported in part by the National Science Foundation (IIS-0963668).