dsgelab / finregistry-data

FinRegistry data preprocessing scripts

thl_soshilmo #23

Open demmlerj opened 1 year ago

demmlerj commented 1 year ago

Filename needs to be harmonised:

Quick fix in the findata subdirectory:

cp thl2019_1776_soshilmo.csv.finreg_IDsp findata/soshilmo_2021-08-04.csv
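The same copy could be scripted in Python if the fix needs to be repeated for other files. This is only a minimal sketch: `harmonised_name` and `copy_harmonised` are hypothetical helpers, not part of the repository, and the `<register>_<YYYY-MM-DD>.csv` pattern is an assumption inferred from the single target filename above.

```python
import shutil
from datetime import date
from pathlib import Path


def harmonised_name(register: str, file_date: date, suffix: str = ".csv") -> str:
    """Build a name such as 'soshilmo_2021-08-04.csv' (assumed convention)."""
    return f"{register}_{file_date.isoformat()}{suffix}"


def copy_harmonised(source: Path, target_dir: Path, register: str, file_date: date) -> Path:
    """Copy source into target_dir under the harmonised name, never overwriting."""
    target = target_dir / harmonised_name(register, file_date)
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    target_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(source, target)  # copy2 also preserves file timestamps
    return target
```

For the file in this issue, the call would be `copy_harmonised(Path("thl2019_1776_soshilmo.csv.finreg_IDsp"), Path("findata"), "soshilmo", date(2021, 8, 4))`. Refusing to overwrite an existing target is a design choice of the sketch; plain `cp` would replace the file silently.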

demmlerj commented 1 year ago

Add feather files:

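import os

import pandas as pd

# Work inside the directory that holds the harmonised CSV
os.chdir("/data/processed_data/thl_soshilmo/findata")

# low_memory=False parses the file in one pass instead of in chunks,
# which avoids mixed-dtype columns
df = pd.read_csv("soshilmo_2021-08-04.csv", low_memory=False)
# to_feather needs pyarrow to be installed
df.to_feather("soshilmo_2021-08-04.feather")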