JeroenDeDauw / Replicator

CLI tool for importing entities from Wikidata / Wikibase
https://wikibase.consulting
Other
23 stars 6 forks source link

Unicode error during import #13

Open rmdort opened 6 years ago

rmdort commented 6 years ago

I am running Replicator with mysql. And Importing an entity id Q76

php replicator import:api Q76 -v

Getting this error

Importing entity 1: Q76...
    * Deserializing... done.
    * Inserting into Dump store... done.
    * Inserting into Term store... donedoneFAILED!
     Error details: An exception occurred while executing 'INSERT INTO ts_labels (text, text_lowercase, language, entity_id, entity_type) VALUES (?, ?, ?, ?, ?)' with params ["\u00d8\u00a8\u00d8\u00a7\u00d8\u00b1\u00d8\u00a7\u00d9\u0083 \u00d8\u00a3\u00d9\u0088\u00d8\u00a8\u00d8\u00a7\u00d9\u0085\u00d8\u00a7", "\xe3\x98\xe2\xa8\xe3\x98\xe2\xa7\xe3\x98\xe2\xb1\xe3\x98\xe2\xa7\xe3\x99\xe2\x83\x20\xe3\x98\xe2\xa3\xe3\x99\xe2\x88\xe3\x98\xe2\xa8\xe3\x98\xe2\xa7\xe3\x99\xe2\x85\xe3\x98\xe2\xa7", "ar", "Q76", "item"]:

SQLSTATE[HY000]: General error: 1366 Incorrect string value: '\xE3\x98\xE2\xA8\xE3\x98...' for column 'text_lowercase' at row 1

I think its something to do with utf8 encoding. Any workaround?

JeroenDeDauw commented 6 years ago

Do you know what value is causing the error?

JeroenDeDauw commented 6 years ago

The error does not happen with SQLite. So you can perhaps use that instead of MySQL. I've updated the README install instructions for Vagrant. They should work now :)

image