Open kbenoit opened 8 years ago
By my reckoning,
We could replace all of the readLines calls with stri_read_lines (although that function is labelled experimental). Presumably jsonlite and XML know how to deal with their encodings, which leaves html and doc. XML::htmlTreeParse has an encoding option, but I don't think stringi is designed to autodetect encoding of marked-up text. I'm not sure what to do with antiword, it doesn't look like you can specify an output encoding, which means it might be platform-dependent...
I should also note that we don't currently "include functions for diagnosing encodings on a file-by-file basis", because the stringi encoding detection stuff is not currently exposed.
I'm putting this on the long list for the next release.
Our README states:
But this is hardly true, since we use the base
iconv()
that happens throughfile()
inget-functions.R
, not stringi.We should go through carefully to ensure consistency, and also change our claims to be accurate.