mlcommons / training_results_v0.5

This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.
https://mlcommons.org/en/training-normal-05/
Apache License 2.0
35 stars 54 forks source link

Explicitly use utf-8 encoding #10

Open Stonesjtu opened 5 years ago

Stonesjtu commented 5 years ago

Since it's supposed to deal with non-ascii characters, it's better to ensure open the text file as 'utf-8' encoding.

It fixes my problem when post-processing the corpus.