The corpus token model run I am finishing exited prematurely with:
thread 'main' panicked at 'called `Result::unwrap()` on an `Err` value: Utf8Error { valid_up_to: 459, error_len: Some(1) }', libcore/result.rs:916:5
note: Run with `RUST_BACKTRACE=1` for a backtrace.
It could be time to look into the pull requests and integrate #6
This has been resolved, I believe mostly via the patches placed in rust-libxml, corpus_token_model succeeds gracefully with the 08.2017 arXMLiv dataset now
The corpus token model run I am finishing exited prematurely with:
It could be time to look into the pull requests and integrate #6