uchicago-dsi / climate-cabinet-campaign-finance-tracker

1 stars 2 forks source link

Use SIC6 codes instead of descriptions #93

Open trevorspreadbury opened 7 months ago

trevorspreadbury commented 7 months ago
    # TODO: Use SIC6 codes instead of descriptions

https://github.com/dsi-clinic/2024-winter-climate-cabinet-campaign-finance-tracker/blob/9db429d8f209843075b787b08dc4dded5f71a787/EDA/uchicago_business_EDA.py#L64

The SIC Codes are very useful because they are hierarchical. You can browse through definitions here. So if you click on 'Mining' and see that 13 is 'Oil and Gas Extraction' we can be sure that any SIC that starts with 13 is Oil and Gas. We can look at SIC 28 to explore 'Chemicals and Allied Products' more because there might be some involved in fossil fuels. After browsing through the 4-digit definitions, none of these seem to be fossil fuel related. Next we see that 29 is 'Petroleum Refining and Related Industries'. Since related industries is vague, click on it to see more details. 291 is Pretroleum Refining so all that start 291 are relevant. 295 is Asphalt, which is petroleum based, but not sure if Climate Cabinet would want to include them so make a note of it and continue. Go through all the categories like this.

You can also do this for a newer industry classification schema, NAICS. See definitions here.

If a business's NAICS, or any of its SIC codes match, we'll consider them oil and gas.