-
Recently as new [paper](https://arxiv.org/abs/2202.00443) released a new -
> Text Anonymization Benchmark (TAB), a brand-new dataset containing 1268 court cases from the European Court of Human Rig…
-
Thanks for this open source project!
I'm currently writing my thesis on text anonymization models and their performance. I would like to assess different models on different datasets and I'd also lik…
-
From the README (https://github.com/DigitalPhonetics/VoicePAT?tab=readme-ov-file#anonymization) it says that to anonymize my own data I should modify the following fields in the config file:
```
dat…
-
Hi,
The TAB dataset and evaluation approach is amazing! It would be very useful for those interested to train models on this dataset, to have it as a [HuggingFace dataset](https://huggingface.co/da…
-
Comparison with [pydicom](https://github.com/pydicom/pydicom) and other DICOM handling packages should be added at some point.
Some helpful benchmark(s):
- Anonymization (for text/other well-formatt…
-
### Feature request
When calling scrub.scrub_text, instead of just replacing with `***`, we would like to retain semantic meaning. e.g. given:
```
Alice said hi to Bob
```
We want:
```
s…
-
Follow-up on #59
In anonymization mode the whole text is alawys displayed in the graph.
-
It would be useful to be able to customize the anonymization output (`XXXX-[Line Number]`) *per search term*. The desired output should be set from the user interface when anonymizing a repo.
For e…
-
has to be refined
![grafik](https://github.com/dzhw/metadatamanagement/assets/17981185/5a861fba-ab11-4dd7-a6e2-25d63c7a7710)
-
- [ ] Default to using the [much faster `System.Text.Json`-based FHIR parser](https://github.com/miracum/fhir-pseudonymizer#usesystemtextjsonfhirserializer) and remove the option to unset it
- [ ] Fo…
chgl updated
10 months ago