This repository contains Python 3 modules that retrieve, clean, subset and otherwise transform various data sets used in research. The objective is to abstract these tasks and keep them separate from research code that performs actual analysis on the data.

- `ceic`: CEIC Data's China Premium Database
- `chip`: China Household Income Project
- `chfs`: China Household Finance Survey
- `cn_nbs`: National Bureau of Statistics of China

The modules are largely independent but have a roughly similar API. Each module…

- provides a function such as `load_ceic()` that returns data in a clean, Pythonic form.
- provides a function such as `import_ceic()` that processes raw data sets into a cache in a directory named after the module (e.g. `ceic/` for `ceic.py`).
- can be run from the command line, e.g. `python3 -m ceic`. Invoking a module without any arguments gives basic usage instructions, but the code is also documented.

The variable `requirements` in the top-level module gives the dependencies for each module.
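
The import-then-load pattern described above can be illustrated with a minimal, self-contained sketch. It is not code from this repository: `import_demo()`, `load_demo()`, the `data.json` cache file and the JSON format are all hypothetical stand-ins, and the real modules may take different arguments and use a different cache layout.

```python
import json
import tempfile
from pathlib import Path


def import_demo(raw_rows, cache_dir):
    """Clean raw rows and write them to a cache directory (hypothetical)."""
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Drop rows with a missing value; strip names and convert values to float.
    cleaned = [
        {"name": row["name"].strip(), "value": float(row["value"])}
        for row in raw_rows
        if row.get("value") not in (None, "")
    ]
    (cache_dir / "data.json").write_text(json.dumps(cleaned))
    return len(cleaned)


def load_demo(cache_dir):
    """Return the cached, cleaned rows (hypothetical)."""
    path = Path(cache_dir) / "data.json"
    if not path.exists():
        raise FileNotFoundError(f"no cache at {path}; run import_demo() first")
    return json.loads(path.read_text())


# Import once into a throwaway cache directory, then load the clean data.
with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "demo"
    raw = [{"name": " gdp ", "value": "1.5"}, {"name": "cpi", "value": ""}]
    n_cached = import_demo(raw, cache)
    rows = load_demo(cache)
```

Research code would then only ever call the load function, leaving the cleaning rules in one place.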
If you use this code, please cite using the DOI above.