dsgelab / finregistry-data

FinRegistry data preprocessing scripts
3 stars 2 forks source link

Finregistry data

This repository contains data preprocessing scripts for the following datasets in FinRegistry:

In addition, scripts for generating a pedigree in FinRegistry are available here: https://github.com/dsgelab/FinRegistry_pedigree

The preprocessing steps of the datasets are summarized in the GitHub Releases. The code used for generating the processed dataset is attached to each release.

The repository also includes scripts used for profiling each dataset for the FinRegistry data dictionary. Please note that profiling list-type columns is not currently implemented.