EmmaRocheteau / TPC-LoS-prediction

This repository contains the code used for Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit (https://dl.acm.org/doi/10.1145/3450439.3451860).
MIT License

Preprocessing MIMIC-IV issue #11

Closed SewoongLee closed 1 year ago

SewoongLee commented 1 year ago

Running the following command in psql (PostgreSQL):

\copy D_HCPCS FROM 'd_hcpcs.csv' DELIMITER ',' CSV HEADER NULL ''

fails with:

ERROR: character with byte sequence 0xe2 0x80 in encoding "UHC" has no equivalent in encoding "UTF8"
CONTEXT: COPY d_hcpcs, line 88856


SewoongLee commented 1 year ago

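This turned out not to be a problem with MIMIC-IV itself, but with psql running in a Korean-language environment. UHC stands for "Unified Hangul Code", and Hangul is the Korean alphabet. psql was treating the CSV file as UHC-encoded, so a UTF-8 byte sequence in the file (0xe2 0x80) could not be converted to the server's UTF8 encoding.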
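For anyone hitting the same error, the mismatch can be reproduced outside PostgreSQL. The sketch below is only an illustration under stated assumptions: the sample string is hypothetical (the thread does not show what line 88856 of d_hcpcs.csv contains), and Python's `cp949` codec is used as a stand-in for the encoding PostgreSQL calls UHC. It shows that UTF-8 text containing a curly apostrophe starts with the bytes 0xe2 0x80 at that character, and that those bytes are not a valid sequence when read as UHC.

```python
# Hypothetical sample text: the actual content of line 88856 is not shown
# in the thread. U+2019 (right single quotation mark) is one of several
# punctuation characters whose UTF-8 encoding begins with 0xe2 0x80.
SAMPLE_BYTES = "Physician\u2019s services".encode("utf-8")


def decodes_cleanly(data: bytes, encoding: str) -> bool:
    """Return True if `data` is valid text in `encoding`, False otherwise."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError:
        return False
    return True


if __name__ == "__main__":
    # Show the three bytes that encode the apostrophe in UTF-8.
    print("apostrophe bytes:", SAMPLE_BYTES[9:12].hex(" "))
    # The file is UTF-8, so decoding it as UTF-8 works.
    print("valid as utf-8:", decodes_cleanly(SAMPLE_BYTES, "utf-8"))
    # Assumption: cp949 here plays the role of PostgreSQL's UHC client encoding.
    print("valid as cp949:", decodes_cleanly(SAMPLE_BYTES, "cp949"))
```

If this is what is happening, telling psql that the client side is UTF-8 before running \copy should avoid the conversion, for example with the psql meta-command \encoding UTF8 or by setting the PGCLIENTENCODING environment variable to UTF8. The thread does not say which of these the poster used.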