Repository for the machine learning tool MeL, which assists in providing insights into open text data. This tool is part of the 10x Machine Learning as a Service project (formerly known as Qualitative Data Management).
support ZWJ sequences emoji and skin tone modifier emoji in TweetTokenizer
METEOR evaluation now requires pre-tokenized input
Code linting and type hinting
implement get_refs function for DrtLambdaExpression
Enable automated CoreNLP, Senna, Prover9/Mace4, Megam, MaltParser CI tests
specify minimum regex version that supports regex.Pattern
avoid re.Pattern and regex.Pattern which fail for Python 3.6, 3.7
Thanks to the following contributors to 3.6.5
Tom Aarsen, Saibo Geng, Mohaned Mashaly, Dimitri Papadopoulos, Danny Sepler,
Ahmet Yildirim, RnDevelover, yutanakamura
Version 3.6.4 2021-10-01
deprecate nltk.usage(obj) in favor of help(obj)
resolve ReDoS vulnerability in Corpus Reader
solidify performance tests
improve phone number recognition in tweet tokenizer
refactored CISTEM stemmer for German
identify NLTK Team as the author
replace travis badge with github actions badge
add SECURITY.md
Thanks to the following contributors to 3.6.4
Tom Aarsen, Mohaned Mashaly, Dimitri Papadopoulos Orfanos, purificant, Danny Sepler
Version 3.6.3 2021-09-19
Dropped support for Python 3.5
Run CI tests on Windows, too
Moved from Travis CI to GitHub Actions
Code and comment cleanups
Visualize WordNet relation graphs using Graphviz
Fixed large error in METEOR score
Apply isort, pyupgrade, black, added as pre-commit hooks
Prevent debug_decisions in Punkt from throwing IndexError
Resolved ZeroDivisionError in RIBES with dissimilar sentences
Initialize WordNet IC total counts with smoothing value
Fixed AttributeError for Arabic ARLSTem2 stemmer
Many fixes and improvements to lm language model package
Fix bug in nltk.metrics.aline, C_skip = -10
Improvements to TweetTokenizer
Optional show arg for FreqDist.plot, ConditionalFreqDist.plot
edit_distance now computes Damerau-Levenshtein edit-distance
Thanks to the following contributors to 3.6.3
Tom Aarsen, Abhijnan Bajpai, Michael Wayne Goodman, Michał Górny, Maarten ter Huurne,
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
You can disable automated security fix PRs for this repo from the [Security Alerts page](https://github.com/18F/10x-MLaaS/network/alerts).
Bumps nltk from 3.4.5 to 3.6.5.
Changelog
Sourced from nltk's changelog.
... (truncated)
Commits
b422364 updates for 3.6.5
03e4b4e Modernised nltk.org website (#2845)
9f468d3 Merge pull request #2851 from DimitriPapadopoulos/lgtm_errors
8ce97b2 Add a unit test, fix typos
2538164 Enhancement: Add ZWJ sequences Emoji and Skin Tone Modifier Emoji support to ...
836b98e Accept pre-tokenized references & hypothesis for METEOR calculation (#2822)
82ceb20 refactor: perfom linting for punkt.py (#2830)
c05b0e7 use latest version of pip (#2846)
6d39c90 Implement get_refs function for DrtLambdaExpression (#2847)
f554129 LGTM.com error: Wrong number of arguments in a class instantiation