Repository for docs and issues. If you need help, please file an issue here. Public conversations are better for open source projects than private email.
Within the raw data file, you can see the layout of the data. Next, you may want to see how this data is processed through the pipeline, using the provided notebook, timeseries_sample.ipynb.
Open this notebook; it should be set to use the July 22nd data. The notebook lets you step through small bits of code, teaching you how to access the database and objects such as the confirmed and cleaned trips and sections.
I would add a print statement for each variable you are unsure of, and run that line of code within the notebook. This lets you see each step and how the queries pull from the stored data after it has run through the intake pipeline.
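As a rough sketch of what I mean by adding print statements, a small helper like the one below can summarize any variable in a notebook cell. The name describe and the sample_trips data are hypothetical, made up for illustration; they are not part of the notebook or of the pipeline, and the sample values are not real output.

```python
def describe(name, value, max_items=3):
    """Return a short, printable summary of a notebook variable."""
    lines = [f"{name}: type={type(value).__name__}"]
    # Report the size of anything that has a length (lists, dicts, strings).
    if hasattr(value, "__len__"):
        lines.append(f"  length={len(value)}")
    # Show only the first few elements so large query results stay readable.
    if isinstance(value, dict):
        for key in list(value)[:max_items]:
            lines.append(f"  {key!r}: {value[key]!r}")
    elif isinstance(value, (list, tuple)):
        for item in value[:max_items]:
            lines.append(f"  {item!r}")
    return "\n".join(lines)


# Hypothetical stand-in for a list of trip entries; not real pipeline output.
sample_trips = [
    {"key": "analysis/cleaned_trip", "distance": 1200.5},
    {"key": "analysis/cleaned_trip", "distance": 830.0},
]

print(describe("sample_trips", sample_trips))
```

In the notebook, you would pass in whichever variable a cell has just assigned instead of sample_trips, then re-run the cell to see its type, its size, and its first few entries.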
Below are a few outlined steps I used to begin understanding the transformation of raw data through the pipeline stages.
Some videos to help you set up and understand how to use the Jupyter Notebook: https://www.youtube.com/watch?v=DKiI6NfSIe8 , https://www.youtube.com/watch?v=HW29067qVWk