This tool converts HTML files containing the text of the Congressional Record into structured text data. It is particularly useful for identifying speeches by members of Congress.
From the repository root, type python -m congressionalrecord.cli -h
for instructions.
This software is released as-is under the BSD3 License, with no warranty of any kind.
In Python 3 using venv
for e.g.:
git clone https://github.com/unitedstates/congressional-record.git
cd congressional-record
python3 -m venv .venv
.venv/bin/python -m pip install -e .
then .venv/bin/python -m congressionalrecord.cli -h
to see usage instructions.
Judd, Nicholas, Dan Drinkard, Jeremy Carbaugh, and Lindsay Young. congressional-record: A parser for the Congressional Record. Chicago, IL: 2017.