In an example UTF-8 CSV which contains Chinese character and emoji (a typical Venmo statement), chardet is not correctly detect charset (Recognized as Windows-1252, Turkish). And it is difficult to set charset manually. It would be nice to allow manual setting of charset, which is normally known by the user.
Original report by Chenxing Luo (Bitbucket: chazeon, GitHub: chazeon).
In an example UTF-8 CSV which contains Chinese character and emoji (a typical Venmo statement),
chardet
is not correctly detect charset (Recognized as Windows-1252, Turkish). And it is difficult to set charset manually. It would be nice to allow manual setting of charset, which is normally known by the user.Example as attached.