microsoft / presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text and images
https://microsoft.github.io/presidio
MIT License
3.87k stars, 578 forks

SpacyRecognizer: List of Tuples for check label groups argument #976

Open dhruvmullick opened 1 year ago

dhruvmullick commented 1 year ago

https://github.com/microsoft/presidio/blob/aa50e6bbcd8bef6c0281c7c0eb1b354bc20ae55e/presidio-analyzer/presidio_analyzer/predefined_recognizers/spacy_recognizer.py#L53

I believe the type hint for the check_label_groups argument should be Optional[List[Tuple[Set, Set]]]. That would let callers pass multiple tuples, as is already the case for the default, self.CHECK_LABEL_GROUPS.
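To make the proposed annotation concrete, here is a minimal, self-contained sketch. It does not import Presidio: `SketchRecognizer`, `LabelGroups`, `DEFAULT_LABEL_GROUPS`, and `label_matches` are hypothetical names, and the group values are illustrative rather than copied from the library. It only shows how a list of (entities, labels) tuples could be typed and checked.

```python
from typing import List, Optional, Set, Tuple

# Hypothetical alias for illustration; not a name from the Presidio codebase.
LabelGroups = List[Tuple[Set[str], Set[str]]]

# Illustrative defaults: each tuple pairs entity names with NER model labels.
DEFAULT_LABEL_GROUPS: LabelGroups = [
    ({"LOCATION"}, {"GPE", "LOC"}),
    ({"PERSON"}, {"PERSON", "PER"}),
]


class SketchRecognizer:
    """Stand-in class that only mirrors the shape of the argument discussed."""

    def __init__(self, check_label_groups: Optional[LabelGroups] = None):
        # Fall back to the defaults when no groups (or an empty list) are given.
        self.check_label_groups = (
            check_label_groups if check_label_groups else DEFAULT_LABEL_GROUPS
        )

    def label_matches(self, entity: str, label: str) -> bool:
        # True if any single group contains both the entity and the label.
        return any(
            entity in entities and label in labels
            for entities, labels in self.check_label_groups
        )


# With a list-typed argument, several tuples can be passed at once.
custom = SketchRecognizer(
    check_label_groups=[({"LOCATION"}, {"GPE"}), ({"ORGANIZATION"}, {"ORG"})]
)
```

With the narrower Optional[Tuple[Set, Set]] hint, a static type checker would be expected to flag the list passed to `custom` above, even though a list of tuples is the shape the default uses.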

omri374 commented 1 year ago

Thanks, you are correct. We'll address that soon; feel free to create a PR if you're interested.

dhruvmullick commented 1 year ago

Sure, I'll take this up. @omri374, can you give me write access for the PR?

omri374 commented 1 year ago

@dhruvmullick, please fork the repo and create a PR from your fork.