IBM / visualize-data-with-python

A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
https://developer.ibm.com/patterns/visualize-data-with-python/
Apache License 2.0
59 stars 78 forks source link
bluemix call-for-code data-science dsx ibm ibm-developer-technology-cognitive ibmcode journey jupyter-notebook notebook pixiedust spark

WARNING: This repository is no longer maintained

This repository will not be updated. The repository will be kept available in read-only mode.

Visualize and analyze data from the 2017 flood in Houston, TX using a Jupyter Notebook on IBM Watson Studio

In this Code Pattern we will use some standard techniques for data science and data engineering running on IBM Watson Studio to analyze publicly available data for the 2017 flooding in Houston, TX. Watson Studio is an interactive, collaborative, cloud-based environment where data scientists, developers, and others interested in data science can use tools (e.g., RStudio, Jupyter Notebooks, Spark, etc.) to collaborate, share, and gather insight from their data.

When the reader has completed this Code Pattern, they will understand how to:

The intended audience for this Code Pattern is application developers and other stakeholders who wish to utilize the power of Data Science quickly and effectively.

Flow

architecture

  1. Load the Jupyter notebook onto the IBM Watson Studio platform.
  2. USGS data from the Houston flood of 2017 is loaded into the notebook.
  3. The notebook is used to clean the data, and then display it.
  4. A PixieApp dashboard is created and can be interacted with.
  5. Mapbox and Folium are used for map visualizations

Included technologies

Prerequisites

Steps

Follow these steps to setup and run this Code Pattern. The steps are described in detail below.

  1. Sign up for the Watson Studio
  2. Create the notebook
  3. Run the notebook

1. Sign up for Watson Studio

Sign up for IBM's Watson Studio. By creating a project in Watson Studio a free tier Object Storage service will be created in your IBM Cloud account. Take note of your service names as you will need to select them in the following steps.

Note: When creating your Object Storage service, select the Free storage type in order to avoid having to pay an upgrade fee.

2. Create the notebook

Create Notebook

3. Run the notebook

NOTE: There are points in the notebook where you will have to enter your Mapbox Token to render the map.

When a notebook is executed, what is actually happening is that each code cell in the notebook is executed, in order, from top to bottom.

Each code cell is selectable and is preceded by a tag in the left margin. The tag format is In [x]:. Depending on the state of the notebook, the x can be:

There are several ways to execute the code cells in your notebook:

Sample Output

Note: Some interactive map functionality, like Options and Layers will not work. To see these, you must run the notebook itself.

License

This code pattern is licensed under the Apache Software License, Version 2. Separate third party code objects invoked within this code pattern are licensed by their respective providers pursuant to their own separate licenses. Contributions are subject to the Developer Certificate of Origin, Version 1.1 (DCO) and the Apache Software License, Version 2.

Apache Software License (ASL) FAQ