Creates a new sub-package called spacier (ha) in which to consolidate spaCy-focused functionality.
Adds a components module to spacier for custom spaCy language pipeline components, and includes two to start: one that computes text stats via the TextStats class, and another that merges named entities into a single token using spaCy's new Doc.retokenize functionality.
Marks the spacy_pipelines module for deprecation in v0.7.0.
More functionality to come, this is just a non-feature-breaking down payment.
Motivation and Context
Feature request from Ines in #168. 🙂
How Has This Been Tested?
I added tests, and they all pass. Huzzah.
Types of changes
[ ] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to change)
Checklist:
[x] My code follows the code style of this project.
[x] My change requires a change to the documentation, and I have updated it accordingly.
Description
spacier
(ha) in which to consolidate spaCy-focused functionality.components
module tospacier
for custom spaCy language pipeline components, and includes two to start: one that computes text stats via theTextStats
class, and another that merges named entities into a single token using spaCy's newDoc.retokenize
functionality.spacy_pipelines
module for deprecation in v0.7.0.More functionality to come, this is just a non-feature-breaking down payment.
Motivation and Context
Feature request from Ines in #168. 🙂
How Has This Been Tested?
I added tests, and they all pass. Huzzah.
Types of changes
Checklist: