Open MarkWuNLP opened 7 years ago
Try to read the book source from this address: http://www.gutenberg.org/files/2701/2701-0.txt
Set the link here:

    def get_data(download=not os.path.exists(DATA_FILE)):
        """Downloads and parses Moby Dick."""
        if download:
            subprocess.check_output(
                ['wget', 'http://www.gutenberg.org/files/2701/2701-0.txt',
                 '-O', DATA_FILE])
        ...
It seems that gutenberg.org serves its text files gzip-encoded, so the downloaded file may need to be decompressed before it is parsed.
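If the download really does arrive gzip-compressed, one possible workaround is to check for the gzip magic bytes and decompress before decoding. The following is only a minimal sketch under stated assumptions: it targets Python 3, it assumes the text is UTF-8 (possibly with a leading byte order mark), and the names `decode_book` and `fetch_book` are hypothetical helpers that are not part of the repository.

```python
import gzip
import urllib.request

# Every gzip stream starts with these two bytes.
GZIP_MAGIC = b"\x1f\x8b"


def decode_book(raw):
    """Return text from raw bytes that may or may not be gzip-compressed.

    Assumption: the payload is UTF-8; "utf-8-sig" also drops a leading BOM.
    """
    if raw[:2] == GZIP_MAGIC:
        raw = gzip.decompress(raw)
    return raw.decode("utf-8-sig")


def fetch_book(url, path):
    """Hypothetical helper: download url, decode it, and save it as UTF-8 text."""
    with urllib.request.urlopen(url) as response:
        text = decode_book(response.read())
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)
    return text
```

Something like `fetch_book('http://www.gutenberg.org/files/2701/2701-0.txt', DATA_FILE)` could then stand in for the `wget` call, though whether this resolves the error in your setup is untested.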
Hi, could you please document the requirements for running this source code? I use Python 2.7 on Ubuntu 14.04 and I get a UnicodeDecodeError.