Federal law requires transportation-related spills, explosions, and other safety-endangering incidents involving hazardous materials to be reported to the Pipeline and Hazardous Materials Safety Administration (PHMSA).
Specifically, "each person in physical possession of a hazardous material at the time that any of [certain types of incidents] occurs during transportation (including loading, unloading, and temporary storage) must submit a Hazardous Materials Incident Report on DOT Form F 5800.1 [...] within 30 days of discovery of the incident".
These "5800.1" forms include details about the location of the incident, mode of transportation, parties involved, hazardous materials involved, causes of failure, fatalities, injuries, financial cost, and more.
PHSMA publishes the submitted reports through an online portal. That portal, however, is brittle and does not provide a straightforward mechanism to download the full set of submitted reports.
This repository, developed by the Data Liberation Project, aims to do the following:
In the data/fetched
directory, you can find CSVs containing the report information, with one month of data per CSV. Note that recent months' data may be incomplete (due to yet-unsubmitted reports), and will be regularly updated.
You can clone or download the repository to get the full set of files. Alternatively, you can access them individually via URL: https://raw.githubusercontent.com/data-liberation-project/phmsa-hazmat-incident-reports/main/data/fetched/YYYY-MM.csv
, swapping out YYYY
for the year and MM
for the two-digit month.
The files are split into months to stay within GitHub's file size limits. You can combine them with your preferred toolset. For example, using xsv
, you could run xsv cat rows data/fetched/*.csv > combined.csv
.
A set of all years incidents filtered to just the most “serious” are available in the data/processed/filtered/
directory.
Multiple Rows Per Incident
column.Report Type
column provides that detail, and includes values such as "Undeclared Shipment with no Release" and "A specification cargo tank 1,000 gallons or greater containing any hazardous materials that [...]".Please see the METHODOLOGY.md document for a description of how this repository fetches and processes the data.
Many thanks to the volunteers who have contributed to this repository:
This repository's code is available under the MIT License terms. The raw data files (those in data/fetched
) are public domain. All other data files are available under the Creative Commons CC BY-SA 4.0 license terms.
File an issue in this repository or email Jeremy Singer-Vine at jsvine@gmail.com
.