bwbaugh / twitter-corpus

Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.

UnicodeDecodeError causes writer-thread to exit #1

Closed bwbaugh closed 11 years ago

bwbaugh commented 11 years ago

Occasionally a UnicodeDecodeError is raised when trying to write a line to the corpus file. Because the exception is unhandled, it causes the writer-thread to exit. When that happens the stream-queue continues to fill, but nothing is left to drain it, so any additional incoming tweets can never be retrieved.
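
One possible way to keep the writer alive is to catch the error for each line, so a single bad line is skipped rather than ending the thread. The sketch below is only an illustration in Python 3 and is not the project's actual code: `writer_loop`, `run_demo`, `_SENTINEL`, and the `errors` list are hypothetical names, and it assumes queue items may arrive as raw bytes that need decoding as UTF-8.

```python
import io
import queue
import threading

_SENTINEL = None  # hypothetical shutdown marker, not taken from the project


def writer_loop(q, out, errors):
    """Drain q into out, skipping lines that fail to decode instead of exiting."""
    while True:
        item = q.get()
        try:
            if item is _SENTINEL:
                return
            # Assumption: items may be bytes; decoding is where the error can occur.
            line = item.decode("utf-8") if isinstance(item, bytes) else item
            out.write(line + "\n")
        except UnicodeError as exc:
            # UnicodeError is the parent of UnicodeDecodeError and UnicodeEncodeError.
            # Record the failure and keep consuming so the queue does not back up.
            errors.append(exc)
        finally:
            q.task_done()


def run_demo():
    """Feed one undecodable item between two good ones and report what happened."""
    q = queue.Queue()
    out = io.StringIO()  # stands in for the corpus file
    errors = []
    t = threading.Thread(target=writer_loop, args=(q, out, errors))
    t.start()
    for item in ["first tweet", b"\xff\xfe bad bytes", "second tweet", _SENTINEL]:
        q.put(item)
    t.join(timeout=5)
    return out.getvalue(), errors, t.is_alive()


if __name__ == "__main__":
    text, errors, alive = run_demo()
    print(repr(text), len(errors), alive)
```

Catching the exception inside the loop, rather than around it, is the important part: the bad line is dropped and recorded, and the thread goes on to write the tweets queued after it.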