Update Latin defaults to support noun chunks, update lexical/tokenizer defaults and add example sentences (#12538).
Add option to return scores separately keyed by component name with spacy evaluate --per-component, Language.evaluate(per_component=True) and Scorer.score(per_component=True) (#12540).
Support custom token/lexeme attribute for vectors (#12625).
Support spancat_singlelabel in spacy debug data CLI (#12749).
Typing updates for PhraseMatcher and SpanGroup (#12642, #12714).
đź”´ Bug fixes
#12569: Require that all SpanGroup spans come from the current doc.
📦 Trained pipelines updates
We have added new pipelines for Slovenian that use the trainable lemmatizer and floret vectors.
🙏 Special thanks to @​orglce for help with the new pipelines!
The English pipelines have been updated to improve handling of contractions with various apostrophes and to lemmatize "get" as a passive auxiliary.
The Danish pipeline da_core_news_trf has been updated to use vesteinn/DanskBERT with performance improvements across the board.
⚠️ Backwards incompatibilities
SpanGroup spans are now required to be from the same doc. When initializing a SpanGroup, there is a new check to verify that all added spans refer to the current doc. Without this check, it was possible to run into string store or other errors.
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Updates the requirements on spacy[transformers] to permit the latest version.
Release notes
Sourced from spacy[transformers]'s releases.
Commits
6fc153a
Merge pull request #12794 from adrianeboyd/chore/v3.6.0-24e19ec7
Docs for v3.6.0 (#12792)76329e1
Revert "Temporarily skip download CLI related tests in CI"a119114
Revert "Temporarily skip tests for compat table"830dcca
SpanFinder: set default max_length to 25 (#12791)eab9293
Use 'exclude' instead of 'disable' (#12783)bd23951
Fix problem with missing syntax highlighting languages causing runtime crash ...57a230c
Remove section about parallel training with Ray (#12770)fb0da3e
Support custom token/lexeme attribute for vectors (#12625)337a360
Use spans_ prefix for default span finder scores (#12753)Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)