edgi-govdata-archiving / ECHO_modules

ECHO_modules is a Python package for analyzing a copy of the US Environmental Protection Agency's (EPA) Enforcement and Compliance History Online (ECHO) database
GNU General Public License v3.0
3 stars 6 forks source link

Consider adding time filters to DataSetResults #63

Open ericnost opened 10 months ago

ericnost commented 10 months ago

Most programs have records back to 2001 or even earlier. We already filter out records prior to 2001. We could consider adding an optional argument to store_results() for users to specify a time frame.

For instance:

ds = make_data_sets(["SDWA Serious Violators"]) # Create a DataSet for handling the data
recent_violators = ds["SDWA Serious Violators"].store_results(region_type="Watershed", region_value=["01412422"], timeframe = [2018, 2023]) # Store results for this DataSet as a DataSetResults object.
seneca_sdwa.dataframe

timeframe could take a list where the first value is the start and the second value is the stop (inclusive or exclusive).