RMI-PACTA / r2dii.data

Datasets to Align Financial Markets with Climate Goals
https://rmi-pacta.github.io/r2dii.data
Creative Commons Zero v1.0 Universal

bug: `sic_classification` doesn't seem to be consistent with current SIC codes online #356

Closed (jdhoffa closed this issue 7 months ago)

jdhoffa commented 7 months ago

The exported `sic_classification` dataset uses 5-digit codes:

Screenshot 2024-03-19 at 09 14 16

The online SIC code database only allows codes of at most 4 digits:

Screenshot 2024-03-19 at 09 14 50

It is very likely that the sic_classification dataset is outdated, as it was first merged 4 years ago.
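
A quick way to confirm the mismatch is to count how many digits each code has. The snippet below is only a minimal sketch in Python: the function names and the sample values are hypothetical placeholders, not values taken from `sic_classification`, and it assumes the codes have been exported as strings so that leading zeros are preserved.

```python
from collections import Counter


def digit_length_counts(codes):
    """Count how many codes have each number of characters."""
    return Counter(len(str(code).strip()) for code in codes)


def codes_longer_than(codes, max_digits=4):
    """Return the codes with more characters than max_digits."""
    cleaned = (str(code).strip() for code in codes)
    return [code for code in cleaned if len(code) > max_digits]


# Hypothetical placeholder values, NOT taken from sic_classification.
sample_codes = ["11110", "0111", "21000", "1311"]

print(digit_length_counts(sample_codes))
print(codes_longer_than(sample_codes))
```

If every code in the real dataset turned out to be 5 digits long, that would suggest a different classification system rather than a few stray entries.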

Possible actions (not mutually exclusive):

- Determine if the dataset is referring to some different SIC classification system
- Deprecate the `sic_classification` dataset
- Update the `sic_classification` dataset to the latest version

AB#10292

jdhoffa commented 7 months ago

This system appears to be consistent with the South African SIC codes: https://www.statssa.gov.za/additional_services/sic/mdvdvmg4.htm#MAJOR%20DIVISIONS,DIVISIONS%20AND%20MAJOR%20GROUPS