epogrebnyak / data-rosstat-kep

Time series dataset of Rosstat Short-term Economic Indicators ("KEP") publication
http://www.gks.ru/wps/wcm/connect/rosstat_main/rosstat/ru/statistics/publications/catalog/doc_1140080765391

Varnames class: optionally mask ugly headers (eg ' / of which: export of goods – total ') with proper variable names #108

Closed epogrebnyak closed 8 years ago

epogrebnyak commented 8 years ago

dataframes.Varnames():

epogrebnyak commented 8 years ago

https://github.com/epogrebnyak/rosstat-kep-data/blob/master/kep/extract/dataframes.py#L249-L256

epogrebnyak commented 8 years ago

Some header lines in https://github.com/epogrebnyak/rosstat-kep-data/blob/master/output/varnames.md do not actually name the variable well:

| TRADE_GOODS_EXPORT_bln_usd | / of which: export of goods – total | <...> | AQM |
| TRADE_GOODS_IMPORT_bln_usd | / import of goods – total | <...> | AQM |

Need to mask these labels with extra text, possibly taken from Segment().head_dicts[ ] (possibly the third or fourth element in the list).
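A minimal sketch of how such a mask could work, not the actual `Varnames` implementation. The names `RAW_HEADERS`, `HEADER_MASKS` and `describe` are hypothetical, and the replacement texts are made up for illustration. The idea is an optional lookup that overrides the raw header when a mask exists and falls back to the cleaned raw header otherwise:

```python
# Hypothetical sketch: these names are illustrative, not the repository's API.

# Raw headers as they appear in varnames.md.
RAW_HEADERS = {
    "TRADE_GOODS_EXPORT_bln_usd": "/ of which: export of goods – total",
    "TRADE_GOODS_IMPORT_bln_usd": "/ import of goods – total",
}

# Optional masks: cleaner descriptions (assumed wording) that override
# a raw header when one is defined for the variable.
HEADER_MASKS = {
    "TRADE_GOODS_EXPORT_bln_usd": "Export of goods",
    "TRADE_GOODS_IMPORT_bln_usd": "Import of goods",
}


def describe(varname, raw_headers=RAW_HEADERS, masks=HEADER_MASKS):
    """Return the mask for varname if defined, else the cleaned raw header."""
    if varname in masks:
        return masks[varname]
    # Fallback: drop the leading slash and surrounding whitespace.
    return raw_headers[varname].lstrip("/ ").strip()
```

Keeping the masks in a separate dictionary leaves the parsed headers untouched, so a variable without a mask still gets a usable description.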

epogrebnyak commented 8 years ago

Same thing stated at #98

epogrebnyak commented 8 years ago

Add more masks here: https://github.com/epogrebnyak/rosstat-kep-data/blob/master/kep/reader/label.py#L35-L36