Closed simongray closed 8 years ago
Perhaps will need to develop a separate reddit comment preprocessing annotator that can add a list of "quoted sentence indexes" to the annotation as well as remove reddits markdown noise before any additional processing has been performed on it.
I can create a simple annotator (QuoteAnnotator.class) that marks sentences as quotes (QuoteAnnotation.class). Should be quite useful.
I'll make sure that the SemanticAnnotator described in #23 properly understands the opinion holder of a sentence.
Should be simple enough to implement. Since this could potentially be a general feature, will need annotator option to treat something as a quote. Also need to do preprocessing of other reddit comment syntax (to remove it before parsing).