using timeseries_sample.ipynb, steps to understanding the data

Below are a few outlined steps I used to begin understanding the transformation of raw data through the pipeline stages.

I would first suggest opening one of the raw data files such as: https://raw.githubusercontent.com/e-mission/e-mission-server/master/emission/tests/data/real_examples/shankari_2015-jul-22
Within the raw data file, you are able to visualize the layout of the data. Next, you may want to see how this data is processed through the pipeline, using the provided notebook, timeseries_sample.ipynb
Open this notebook, it should be set to use the July 22nd data. The notebook allows you to step through the small bits of code, teaching you have to access the database, and objects such as the confirmed and cleaned trips and sections.
I would add print statements to each variable you are unsure of, and run that line of code within the notebook. This will allow you to visualize the process and see how the pipeline and query pull from the stored data after it has run through the intake pipeline.

e-mission / e-mission-docs