As a first cut, run the classifier on some tweets that are obviously polar/non-polar, some NYT comments, some Tumblr posts, etc., and see whether it passes sanity check.
Eventually, we should do this in a principled way: hand-tag a set of Twitter/NYT/Tumblr data like we did before, and get actual accuracy numbers for comparison.
As a first cut, run the classifier on some tweets that are obviously polar/non-polar, some NYT comments, some Tumblr posts, etc., and see whether it passes sanity check. Eventually, we should do this in a principled way: hand-tag a set of Twitter/NYT/Tumblr data like we did before, and get actual accuracy numbers for comparison.