slanglab / phrasemachine

Quickly extract multi-word phrases from a corpus
http://slanglab.cs.umass.edu/phrasemachine/
MIT License

consistent name for pos tags #2

Open brendano opened 7 years ago

brendano commented 7 years ago

This is about the name for the sequence of POS tag strings.

The argument is get_phrases(postags=), but in the returned dict the key is 'pos': {'pos': [...]}

It's inconsistent because the dict key 'pos' is how we do it in the CoreNLP wrapper and the CoNLL format converters, but it would be better to be consistent within this package.
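
Until the naming is settled, a caller could paper over the mismatch on their side. The snippet below is only a sketch of that idea: `normalize_pos_key` is a hypothetical helper, not part of phrasemachine, and the `example` dict is a hand-written stand-in for a returned dict rather than real output. It assumes 'postags' would be the preferred name.

```python
def normalize_pos_key(result, old_key="pos", new_key="postags"):
    """Return a copy of a result dict with the POS tag list stored under new_key.

    Hypothetical helper for illustration; phrasemachine does not provide it.
    """
    normalized = dict(result)  # shallow copy, so the caller's dict is not mutated
    if old_key in normalized and new_key not in normalized:
        normalized[new_key] = normalized.pop(old_key)
    return normalized


# Hand-written stand-in for a returned dict (assumed shape, not real output).
example = {"tokens": ["red", "wine"], "pos": ["ADJ", "NOUN"]}
normalized = normalize_pos_key(example)
print(normalized)
```

With this, the key in the result matches the `postags=` argument name, while the original dict keeps its 'pos' key.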

brendano commented 7 years ago

By contrast, tokens (the sequence of word token strings) is consistent.

The argument is get_phrases(tokens=) and the key in the returned dict is {'tokens': [...]}