Small script fixes - Githubissues

biolab / text-semantics

The package with scripts for semantic analyser project

MIT License

4 stars 5 forks source link

Closed ajdapretnar closed 3 years ago

ajdapretnar commented 3 years ago

Filtering on text length happens on metadata directly. Otherwise tokens and metadata don't match.
Removing punctuation from text is adapted so that text without space after punctuation (non-standard texts such as suggestions to the government) are properly handled.
TODO: use .iloc instead of .loc when selecting instances by index.

ajdapretnar commented 3 years ago

This was not ready to be merged.