Closed by pfistfl 3 years ago
I can agree. I guess this is then documented here. I find it a little cumbersome to always have to include this line, as treating it as a factor is what people wanna do in most cases, but this is a simple enough solution.
I don't think we should do that, for the same reason that we don't encode discrete numeric columns: Semantically, character columns are not to be encoded. If one encounters data where encoding makes sense, then the column has the wrong type and should be converted (just as we would do with discrete numerics in this case). The canonical way of converting is (possibly followed by fixfactors or other).
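To illustrate the "convert first, then encode" idea, here is a minimal sketch in base R only. It is not the pipeline-based conversion referred to above; the data frame and its column names are made up for the example, and `model.matrix` stands in for whatever encoder is actually used.

```r
# Hypothetical example data: `colour` is stored as character
# but is semantically categorical.
df <- data.frame(
  colour = c("red", "green", "red", "blue"),
  size = c(1.5, 2.0, 0.7, 3.1),
  stringsAsFactors = FALSE
)

# Convert every character column to a factor; other columns stay untouched.
is_chr <- vapply(df, is.character, logical(1))
df[is_chr] <- lapply(df[is_chr], as.factor)

# One-hot encode the factor with base R. The `- 1` drops the intercept,
# so every level gets its own indicator column.
encoded <- model.matrix(~ colour - 1, data = df)
```

The point of the sketch is only that the type conversion is an explicit, separate step: the encoder never has to guess whether a character column is categorical or free text.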
We may want to have a test that checks whether character->factor->encode works; factors in R are a difficult topic sometimes.
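A rough sketch of what such a check could look like, written in base R with `stopifnot` rather than against the actual pipeline code. The helper `encode_one_hot` and the toy vectors are hypothetical; the case it exercises is one of the usual factor pitfalls, namely new data containing a value that was not a level at training time.

```r
# Hypothetical helper: one indicator column per factor level.
# Rows whose value is NA (e.g. an unseen level) become all-zero.
encode_one_hot <- function(f) {
  m <- vapply(
    levels(f),
    function(l) as.integer(!is.na(f) & f == l),
    integer(length(f))
  )
  matrix(m, nrow = length(f), dimnames = list(NULL, levels(f)))
}

train_chr <- c("a", "b", "c", "a")
new_chr <- c("b", "d")  # "d" never appears in the training data

# character -> factor
train_f <- factor(train_chr)
# Reuse the training levels for new data; unseen values turn into NA.
new_f <- factor(new_chr, levels = levels(train_f))

# factor -> encode
enc_train <- encode_one_hot(train_f)
enc_new <- encode_one_hot(new_f)

# The checks a test might make.
stopifnot(
  identical(colnames(enc_train), c("a", "b", "c")),
  all(rowSums(enc_train) == 1),
  identical(colnames(enc_new), colnames(enc_train)),
  is.na(new_f[2])
)
```

A real test would run the same steps through the actual conversion and encoding operators and compare column names and row counts between training and prediction.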