dpalmasan / TRUNAJOD2.0

An easy-to-use library to extract indices from texts.
https://trunajod20.readthedocs.io/en/latest/
MIT License
29 stars, 7 forks

Implement universal POS tags ratio #63

Open dpalmasan opened 3 years ago

dpalmasan commented 3 years ago

spaCy tokens carry both universal POS tags and detailed (fine-grained) POS tags, exposed on a spaCy Token as the pos_ and tag_ attributes respectively. Currently, the pos_ratio function is misleading: its docstring describes pos_ tags, but it computes the ratio using tag_. This ticket is to fix that mismatch and implement a similar function that uses universal POS tags.

https://github.com/dpalmasan/TRUNAJOD2.0/blob/master/src/TRUNAJOD/surface_proxies.py#L515
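
A rough sketch of what the universal-tag variant might look like is below. The name `universal_pos_ratio`, its signature, and the choice to take the tag set as a regular expression are assumptions made for illustration, not the library's existing API. It only relies on each token having a `pos_` attribute, so stand-in objects are used instead of a loaded spaCy model.

```python
import re
from types import SimpleNamespace


def universal_pos_ratio(tokens, pos_types):
    """Return the share of tokens whose universal POS tag matches pos_types.

    Hypothetical helper, not part of TRUNAJOD.

    tokens: iterable of objects with a pos_ attribute (e.g. a spaCy Doc).
    pos_types: regular expression over universal POS tags, e.g. "NOUN|PROPN".
    """
    pos_regex = re.compile(pos_types)
    total = 0
    matched = 0
    for token in tokens:
        if not token.pos_:  # skip tokens without a universal POS tag
            continue
        total += 1
        # fullmatch avoids partial hits on longer tag names
        if pos_regex.fullmatch(token.pos_):
            matched += 1
    # Guard against empty input instead of dividing by zero
    return matched / total if total else 0.0


# Stand-in tokens so the sketch runs without a spaCy model installed.
sample = [SimpleNamespace(pos_=p) for p in ["DET", "NOUN", "VERB", "PROPN"]]
print(universal_pos_ratio(sample, "NOUN|PROPN"))  # 0.5
```

With a real pipeline, the same function could be called on a spaCy Doc directly, since iterating a Doc yields Token objects that expose pos_. Reading token.tag_ instead would give the detailed-tag behaviour the issue describes for the current pos_ratio.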

Acceptance Criteria