joweich / chat-miner

Parsers and visualizations for chats
MIT License
566 stars 57 forks source link

Corrected invalid unicode sequences in facebook export #77

Closed joweich closed 1 year ago

joweich commented 1 year ago

Apparently, the facebook exports have invalid unicodes represented as byte sequences. This has been raised in #73. More information can be found here: https://stackoverflow.com/questions/50826787/decode-or-unescape-u00f0-u009f-u0091-u008d-to This PR is meant to fix #73 by correcting invalid unicode into their respective characters.

joweich commented 1 year ago

Does not catch all cases. #73 will be fixed in in PR #78 by going with a resolution provided in https://stackoverflow.com/questions/50008296/facebook-json-badly-encoded.