Provide options to Configure NLP Engine

As I understand the current code, the default spaCy model (en_core_web_lg) is used whenever AnalyzerEngine() is instantiated.

It would be helpful if we could pass in parameters that specify which model the analyzer should use. Folks could then switch models based on their compute and accuracy requirements. This would also make it possible to load models for other languages.
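A minimal sketch of what this could look like, assuming the validator builds its analyzer through Presidio's `NlpEngineProvider`. The helper names `build_nlp_configuration` and `create_analyzer` are hypothetical and not part of detect_pii or Presidio; only `AnalyzerEngine` and `NlpEngineProvider` are Presidio's own classes.

```python
from typing import Dict


def build_nlp_configuration(models: Dict[str, str]) -> dict:
    """Build a Presidio NLP configuration from {lang_code: spaCy model name}.

    Hypothetical helper: the name and signature are illustrative only.
    """
    return {
        "nlp_engine_name": "spacy",
        "models": [
            {"lang_code": lang, "model_name": name}
            for lang, name in models.items()
        ],
    }


def create_analyzer(models: Dict[str, str]):
    """Create an AnalyzerEngine backed by the requested spaCy models.

    Hypothetical helper. Assumes presidio-analyzer is installed and the
    named spaCy models are already downloaded.
    """
    # Imported lazily so the configuration helper works without Presidio.
    from presidio_analyzer import AnalyzerEngine
    from presidio_analyzer.nlp_engine import NlpEngineProvider

    provider = NlpEngineProvider(
        nlp_configuration=build_nlp_configuration(models)
    )
    return AnalyzerEngine(
        nlp_engine=provider.create_engine(),
        supported_languages=list(models),
    )
```

With something like this, a caller could write `create_analyzer({"en": "en_core_web_sm"})` for a lighter English model, or add further language codes to the mapping to load more models.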

Ref: https://github.com/microsoft/presidio/blob/main/docs/samples//python/customizing_presidio_analyzer.ipynb

guardrails-ai / detect_pii

Provide options to Configure NLP Engine #11