biolab / text-semantics

A package with scripts for the semantic analyser project
MIT License

Small script fixes #51

Closed ajdapretnar closed 3 years ago

ajdapretnar commented 3 years ago
  1. Filtering on text length is now applied to the metadata directly; otherwise the tokens and the metadata no longer match.
  2. Punctuation removal is adapted so that text with no space after punctuation (non-standard texts such as suggestions to the government) is handled properly.
  3. TODO: use .iloc instead of .loc when selecting instances by positional index.
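
The three points above can be illustrated with a minimal sketch. It is not the code from this PR: the helper names (`remove_punctuation`, `filter_by_length`), the column names, and the `min_chars` threshold are hypothetical, and only ASCII punctuation from `string.punctuation` is handled. It assumes the metadata lives in a pandas DataFrame with one row per text.

```python
import re
import string

import pandas as pd

# Hypothetical helpers; the actual scripts in the repository may differ.
_PUNCT_RE = re.compile(f"[{re.escape(string.punctuation)}]+")


def remove_punctuation(text: str) -> str:
    """Replace punctuation with a space so 'point.Second' does not fuse into 'pointSecond'."""
    return " ".join(_PUNCT_RE.sub(" ", text).split())


def filter_by_length(metadata: pd.DataFrame, text_col: str, min_chars: int) -> pd.DataFrame:
    """Drop short texts on the metadata itself, before any tokens are produced."""
    keep = metadata[text_col].str.len() >= min_chars
    return metadata[keep]


metadata = pd.DataFrame(
    {
        "title": ["a", "b", "c"],
        "text": [
            "Too short.",
            "First point.Second point,third.",
            "Another long enough text!",
        ],
    }
)

# Filter first, then tokenise the filtered rows, so tokens[i] always
# belongs to the i-th row of the filtered metadata.
filtered = filter_by_length(metadata, "text", min_chars=15)
tokens = [remove_punctuation(t).lower().split() for t in filtered["text"]]

# After filtering, the index labels are 1 and 2 rather than 0 and 1, so
# label-based access (.loc[0]) would raise a KeyError here; positional
# access (.iloc[0]) returns the first remaining row.
first_by_position = filtered.iloc[0]["title"]
```

Filtering the metadata before tokenising keeps the two in step by construction. An alternative would be to call `reset_index(drop=True)` after filtering, which makes `.loc` and `.iloc` agree again but discards the original row labels.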
ajdapretnar commented 3 years ago

This was not ready to be merged.