SBentley / qvd-utils

Read Qlik Sense .qvd files
https://pypi.org/project/qvd/
Apache License 2.0

Improve support for CJK characters #4

Closed: ljvmiranda921 closed this issue 3 years ago

ljvmiranda921 commented 3 years ago

Hi there,

Thanks for this tool. It has been really helpful!

I'm having issues with Chinese/Japanese/Korean (CJK) characters. Whenever a file contains them, they come out garbled in the output. I'm not sure whether it's a UTF-8 parsing problem on the Rust side or the Python side.

Steps to reproduce

I've attached the sample input and output for this issue. Here's a minimal reproducible example:

from qvd import qvd_reader

# Read the QVD file into a DataFrame, then write it out as CSV.
df = qvd_reader.read("sample_data.qvd")
df.to_csv("sample_data.csv")

Input

Download link (expires after 24h): https://wormhole.app/wjll0#vX3tLuzrucIFtO-LaWRpgg

Output: sample_data.csv
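For context, one common way CJK text ends up garbled is that its UTF-8 bytes get decoded one byte at a time, as if each byte were a Latin-1 character. Whether that is what happens inside this library is only an assumption; the sketch below just illustrates the general effect in plain Python. The helper names `garble` and `repair` are made up for the illustration.

```python
def garble(text: str) -> str:
    """Simulate mojibake: encode as UTF-8, then wrongly decode as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(garbled: str) -> str:
    """Undo the simulated damage by reversing the wrong decode step."""
    return garbled.encode("latin-1").decode("utf-8")


original = "日本語"
broken = garble(original)

# Each of the three characters is three bytes in UTF-8,
# so the garbled string has nine one-byte characters.
print(len(original), len(broken))  # 3 9

# The damage is reversible only because Latin-1 maps every byte to a character.
assert repair(broken) == original
```

If the garbled output can be repaired this way, the problem is a wrong decode step rather than data loss.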

SBentley commented 3 years ago

Hi, thanks for raising the issue and providing a file to test. I've pushed a new version that should now support all UTF-8 strings.
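A rough way to check string values after upgrading is sketched here. It is a heuristic, not part of the library: `looks_garbled` is a hypothetical helper that flags a string if it can be re-encoded as Latin-1 and then decoded as UTF-8 into something different, which is the typical signature of UTF-8 text that was decoded byte by byte.

```python
def looks_garbled(value: str) -> bool:
    """Heuristic: True if the string looks like UTF-8 bytes decoded as Latin-1."""
    try:
        recovered = value.encode("latin-1").decode("utf-8")
    except UnicodeError:
        # Either the string has characters outside Latin-1 (e.g. real CJK text),
        # or its bytes are not valid UTF-8 (e.g. genuine Latin-1 text like "café").
        return False
    # Pure ASCII round-trips unchanged, so it is not flagged.
    return recovered != value


samples = [
    "日本語",                                   # correctly decoded CJK text
    "café",                                     # ordinary accented text
    "plain ascii",                              # ASCII only
    "日本語".encode("utf-8").decode("latin-1"),  # simulated garbled value
]

flags = [looks_garbled(s) for s in samples]
print(flags)
```

The same function could be applied to the string values returned by `qvd_reader.read(...)`, assuming they are ordinary Python `str` objects; that has not been verified against the library here.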