Closed percevalw closed 9 months ago
All modified and coverable lines are covered by tests :white_check_mark:
:exclamation: No coverage uploaded for pull request base (
core-refacto@b3274b5
). Click here to learn what that means.
:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.
Description
Added
edsnlp.data
api (json, brat, spark, pandas) and LazyCollection object to efficiently read / write data from / to different formats & sources.docs.configure(...)
Changed
to_disk
methods can now return a config to override the initial config of the pipeline (e.g., to load a transformer directly from the path storing its fine-tuned weights)eds.tokenizer
tokenizer has been added to entry points, making it accessible from the outsideedsnlp.data
APIpipe
wrapper in favor of the new processing APIFixed
Checklist