Open nvanva opened 6 months ago
It would be nice to save the original encoding of each document. This might be useful in later pre-processing steps, e.g. for langid.
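To make the request concrete, here is a minimal sketch of what "saving the original encoding" could look like. It is not the project's code: `DecodedDocument`, `decode_with_encoding`, and the candidate list are hypothetical names chosen for illustration, and a real pipeline would probably use a proper encoding detector instead of trial decoding.

```python
from dataclasses import dataclass

# Hypothetical candidate list, tried in order. latin-1 accepts any byte
# sequence, so it acts as a catch-all and must come last.
CANDIDATE_ENCODINGS = ("utf-8", "cp1251", "latin-1")


@dataclass
class DecodedDocument:
    text: str               # document text decoded to str
    original_encoding: str  # encoding the raw bytes were decoded from


def decode_with_encoding(raw: bytes, candidates=CANDIDATE_ENCODINGS) -> DecodedDocument:
    """Decode raw bytes and remember which encoding worked (trial decoding)."""
    for enc in candidates:
        try:
            return DecodedDocument(raw.decode(enc), enc)
        except UnicodeDecodeError:
            continue
    # No candidate matched: keep the text with replacement characters
    # and mark the encoding as unknown.
    return DecodedDocument(raw.decode("utf-8", errors="replace"), "unknown")
```

The stored `original_encoding` field could then be passed along as an extra signal to later steps such as langid, since some legacy encodings are strongly associated with particular languages.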