data-liberation-project / phmsa-hazmat-incident-reports

Data from decades of PHMSA's "5800.1" hazardous material transportation incident reports
https://www.data-liberation-project.org/datasets/phmsa-hazmat-incident-reports/
7 stars 3 forks source link

PHMSA "5800.1" Hazmat Transportation Incident Reports

Federal law requires transportation-related spills, explosions, and other safety-endangering incidents involving hazardous materials to be reported to the Pipeline and Hazardous Materials Safety Administration (PHMSA).

Specifically, "each person in physical possession of a hazardous material at the time that any of [certain types of incidents] occurs during transportation (including loading, unloading, and temporary storage) must submit a Hazardous Materials Incident Report on DOT Form F 5800.1 [...] within 30 days of discovery of the incident".

These "5800.1" forms include details about the location of the incident, mode of transportation, parties involved, hazardous materials involved, causes of failure, fatalities, injuries, financial cost, and more.

PHSMA publishes the submitted reports through an online portal. That portal, however, is brittle and does not provide a straightforward mechanism to download the full set of submitted reports.

This repository, developed by the Data Liberation Project, aims to do the following:

Available Data

In the data/fetched directory, you can find CSVs containing the report information, with one month of data per CSV. Note that recent months' data may be incomplete (due to yet-unsubmitted reports), and will be regularly updated.

You can clone or download the repository to get the full set of files. Alternatively, you can access them individually via URL: https://raw.githubusercontent.com/data-liberation-project/phmsa-hazmat-incident-reports/main/data/fetched/YYYY-MM.csv, swapping out YYYY for the year and MM for the two-digit month.

The files are split into months to stay within GitHub's file size limits. You can combine them with your preferred toolset. For example, using xsv, you could run xsv cat rows data/fetched/*.csv > combined.csv.

A set of all years incidents filtered to just the most “serious” are available in the data/processed/filtered/ directory.

Resources

Notes

Methodology

Please see the METHODOLOGY.md document for a description of how this repository fetches and processes the data.

Contributors

Many thanks to the volunteers who have contributed to this repository:

Licensing

This repository's code is available under the MIT License terms. The raw data files (those in data/fetched) are public domain. All other data files are available under the Creative Commons CC BY-SA 4.0 license terms.

Questions?

File an issue in this repository or email Jeremy Singer-Vine at jsvine@gmail.com.