OHDSI / CommonDataModel

Definition and DDLs for the OMOP Common Data Model (CDM)
https://ohdsi.github.io/CommonDataModel
877 stars 448 forks source link

ERROR: character with byte sequence 0x9d in encoding "WIN1252" has no equivalent in encoding "UTF8" #440

Closed rjiang9 closed 3 years ago

rjiang9 commented 3 years ago

When I load the vocabulary by using the script of OMOP CDM vocabulary load - PostgreSQL.sql, it thow out the error:

ERROR: character with byte sequence 0x9d in encoding "WIN1252" has no equivalent in encoding "UTF8"

Please help.

cgreich commented 3 years ago

You need to set your Postgres to UTF8. We have a ton of concept_name records containing Á, Í or Ï, and they will choke when you try importing into a default Win1252 Postgres. If you cannot change the Postgres character encoding you may want to replace those with A, I and I.

rjiang9 commented 3 years ago

Thank you very much for the response @cgreich. I will test it further. The database encoding was set win1252.