Wordseer / wordseer

The WordSeer text analysis tool, written in Flask.
http://wordseer.berkeley.edu/

Word is not actually a word in many cases #49

Closed: abendebury closed this issue 10 years ago

abendebury commented 10 years ago

Index issues

keien commented 10 years ago
{'space_before': '', 'lemma': 'this', 'tag': 'DT', 'word': ('This', {'CharacterOffsetEnd': '4', 'Lemma': 'this', 'PartOfSpeech': 'DT', 'CharacterOffsetBegin': '0'})}
{'space_before': '', 'lemma': 'be', 'tag': 'VBZ', 'word': ('is', {'CharacterOffsetEnd': '7', 'Lemma': 'be', 'PartOfSpeech': 'VBZ', 'CharacterOffsetBegin': '5'})}
{'space_before': '', 'lemma': 'the', 'tag': 'DT', 'word': ('the', {'CharacterOffsetEnd': '11', 'Lemma': 'the', 'PartOfSpeech': 'DT', 'CharacterOffsetBegin': '8'})}
{'space_before': '', 'lemma': 'text', 'tag': 'NN', 'word': ('text', {'CharacterOffsetEnd': '16', 'Lemma': 'text', 'PartOfSpeech': 'NN', 'CharacterOffsetBegin': '12'})}
{'space_before': '', 'lemma': 'of', 'tag': 'IN', 'word': ('of', {'CharacterOffsetEnd': '19', 'Lemma': 'of', 'PartOfSpeech': 'IN', 'CharacterOffsetBegin': '17'})}
{'space_before': '', 'lemma': 'post', 'tag': 'NN', 'word': ('post', {'CharacterOffsetEnd': '24', 'Lemma': 'post', 'PartOfSpeech': 'NN', 'CharacterOffsetBegin': '20'})}
{'space_before': '', 'lemma': '3', 'tag': 'CD', 'word': ('3', {'CharacterOffsetEnd': '26', 'Lemma': '3', 'PartOfSpeech': 'CD', 'CharacterOffsetBegin': '25'})}
{'space_before': '', 'lemma': '.', 'tag': '.', 'word': ('.', {'CharacterOffsetEnd': '27', 'Lemma': '.', 'PartOfSpeech': '.', 'CharacterOffsetBegin': '26'})}
{'space_before': '', 'lemma': 'I', 'tag': 'PRP', 'word': ('I', {'CharacterOffsetEnd': '29', 'Lemma': 'I', 'PartOfSpeech': 'PRP', 'CharacterOffsetBegin': '28'})}
{'space_before': '', 'lemma': 'love', 'tag': 'VBP', 'word': ('love', {'CharacterOffsetEnd': '34', 'Lemma': 'love', 'PartOfSpeech': 'VBP', 'CharacterOffsetBegin': '30'})}
{'space_before': '', 'lemma': 'blog', 'tag': 'NNS', 'word': ('blogs', {'CharacterOffsetEnd': '40', 'Lemma': 'blog', 'PartOfSpeech': 'NNS', 'CharacterOffsetBegin': '35'})}
{'space_before': '', 'lemma': '.', 'tag': '.', 'word': ('.', {'CharacterOffsetEnd': '41', 'Lemma': '.', 'PartOfSpeech': '.', 'CharacterOffsetBegin': '40'})}
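
In the output above, every `word` value is a `(surface form, attribute dict)` tuple rather than a plain string, which seems to be what the issue title refers to. Below is a minimal sketch of one way to unwrap it, assuming the tuple shape shown in the dump. `normalize_token` is a hypothetical helper for illustration, not a function from the WordSeer codebase, and the actual fix may have been made elsewhere (for example, where the parser output is first read).

```python
def normalize_token(token):
    """Return a copy of `token` whose 'word' value is a plain string.

    Hypothetical helper: assumes 'word' is either already a string or a
    (surface_form, attributes) tuple like the ones in the dump above.
    """
    word = token["word"]
    if isinstance(word, tuple):
        # Keep only the surface form; the attribute dict duplicates
        # the lemma and tag already stored on the token.
        word = word[0]
    # Build a new dict so the caller's token is left untouched.
    return {**token, "word": word}


# One token copied from the dump above.
token = {
    "space_before": "",
    "lemma": "blog",
    "tag": "NNS",
    "word": ("blogs", {"CharacterOffsetEnd": "40", "Lemma": "blog",
                       "PartOfSpeech": "NNS", "CharacterOffsetBegin": "35"}),
}

fixed = normalize_token(token)
print(fixed["word"])  # blogs
```

Tokens whose `word` is already a string pass through unchanged, so the helper can be applied to every token without first checking which ones are affected.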