openvinotoolkit / datumaro

Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
https://openvinotoolkit.github.io/datumaro/
MIT License

Ability to remove sensitive information and background from images #240

Open zhiltsov-max opened 3 years ago

zhiltsov-max commented 3 years ago

When publishing a dataset or training a model (to avoid bias), it could be useful to remove sensitive information such as car license plate numbers, names and other personal information on medical images, faces, etc. Another use case is removing the background (noise) from images for anomaly detection tasks. A third is removing reflections from images.
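To make the request concrete, here is a minimal sketch of two of these operations on a NumPy image array. It is not an existing Datumaro API: the function names `pixelate_regions` and `remove_background` are hypothetical, the bounding boxes and the foreground mask are assumed to come from existing annotations, and pixelation is used as a simple stand-in for blurring.

```python
import numpy as np


def pixelate_regions(image, boxes, block=8):
    """Return a copy of `image` with each (x, y, w, h) box coarsely pixelated.

    Hypothetical helper: boxes are assumed to be in pixel coordinates.
    """
    out = image.copy()
    height, width = out.shape[:2]
    for x, y, w, h in boxes:
        # Clip the box to the image bounds.
        x0, y0 = max(x, 0), max(y, 0)
        x1, y1 = min(x + w, width), min(y + h, height)
        for by in range(y0, y1, block):
            for bx in range(x0, x1, block):
                # `tile` is a view into `out`, so assigning to it edits `out`.
                tile = out[by:min(by + block, y1), bx:min(bx + block, x1)]
                # Replace every pixel in the tile with the tile's mean color.
                mean = tile.mean(axis=(0, 1), keepdims=True)
                tile[...] = mean.astype(out.dtype)
    return out


def remove_background(image, mask, fill=0):
    """Keep pixels where `mask` is truthy; set all other pixels to `fill`.

    Hypothetical helper: `mask` is an (H, W) foreground mask.
    """
    out = image.copy()
    out[~mask.astype(bool)] = fill
    return out
```

For example, `pixelate_regions(img, [(40, 60, 120, 30)])` would obscure a 120x30 region such as a license plate, and `remove_background(img, mask)` would zero out everything outside an annotated object mask. Both return a new array and leave the input unchanged.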

zhiltsov-max commented 3 years ago

OpenCV G-API now (in 5.0) provides an option to blur faces efficiently, which could be useful for this feature.