Closed xrotwang closed 3 years ago
The R equivalent of utf-8-sig seems to be UTF-8-BOM. A CSV dialect spec reader should understand such common variants as well.
utf-8-sig
UTF-8-BOM
Other relevant names seem to be Latin-1 and latin1.
Latin-1
latin1
Maybe use codecs.lookup first, to be able to create useful error messages, rather than failing while trying to read the data.
The R equivalent of
utf-8-sig
seems to beUTF-8-BOM
. A CSV dialect spec reader should understand such common variants as well.Other relevant names seem to be
Latin-1
andlatin1
.Maybe use codecs.lookup first, to be able to create useful error messages, rather than failing while trying to read the data.