Wordseer / wordseer

The WordSeer text analysis tool, written in Flask.
http://wordseer.berkeley.edu/

Sentences longer than 1000 characters are stored as sentence fragments #325

Open macfarlandian opened 9 years ago

macfarlandian commented 9 years ago

CoreNLP input is cut off at 1000 characters, so longer sentences are split into shorter fragments for parsing. Currently each fragment is stored as its own sentence; the fragments should instead be recombined into a single sentence after parsing.
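
A rough sketch of what split-then-recombine could look like, not WordSeer's actual code. The names `split_sentence`, `parse_long_sentence`, and `parse_fragment` are hypothetical; `parse_fragment` stands in for whatever call hands a fragment to CoreNLP and is assumed to return a list of tokens. This only concatenates per-fragment tokens under one sentence record; dependency relations that span a fragment boundary would not be recovered this way.

```python
MAX_CHARS = 1000  # cutoff mentioned in this issue


def split_sentence(text, max_chars=MAX_CHARS):
    """Split text into fragments of at most max_chars characters,
    breaking after a space when one is available in the window."""
    fragments = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        if end < len(text):
            # Look for the last space inside the window so words stay whole.
            cut = text.rfind(" ", start + 1, end)
            if cut > start:
                end = cut + 1  # keep the space with the preceding fragment
        fragments.append(text[start:end])
        start = end
    return fragments


def parse_long_sentence(text, parse_fragment, max_chars=MAX_CHARS):
    """Parse each fragment separately, then store the results as ONE sentence.

    parse_fragment is a hypothetical callable: fragment string -> list of tokens.
    """
    tokens = []
    for fragment in split_sentence(text, max_chars):
        tokens.extend(parse_fragment(fragment))
    # A single record for the whole sentence instead of one per fragment.
    return {"text": text, "tokens": tokens}
```

For a quick check without CoreNLP, `str.split` can serve as a stand-in parser: `parse_long_sentence(long_text, str.split)` returns one record whose `text` is the original sentence.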