mvanderkroon / cobr

Data Science toolkit
Apache License 2.0
7 stars 3 forks source link

DS-toolkit features (in no particular order)

install virtualenvwrapper and setup two virtual environments

  1. sudo apt-get install python-pip
  2. sudo pip install virtualenv
  3. mkdir ~/.virtualenvs
  4. sudo pip install virtualenvwrapper
  5. echo 'export WORKON_HOME=~/.virtualenvs'
  6. echo '. /usr/local/bin/virtualenvwrapper.sh'
  7. mkvirtualenv dev2.7 --python=`which python2.7`
  8. mkvirtualenv dev3.4 --python=`which python3.4`

install dependencies (in both virtual environments)

Showcase

The show case will detail how to profile a database, visualize the metamodel, index an Elastic Search instance for deep database search as well as expose the database via HTTP.

Let's get some test data from launchpad so that we have a predictable data set to work from.

TBD...

Profiling

First, create a config.ini file by copying the template_config.ini file and filling in the empty fields.

Start the profiler from the dev3.4 virtual environment created earlier

Implicit primary key detection

Implicit foreign key detection