kubeagi / arcadia

A diverse, simple, and secure all-in-one LLMOps platform
http://www.kubeagi.com/
Apache License 2.0
80 stars, 23 forks

Data processing: protect sensitive data properly by replacing it with random values #239

Status: Open. Opened by nkwangleiGIT 10 months ago

nkwangleiGIT commented 10 months ago

Integrate Presidio into data processing so that the data pipeline can mask sensitive data. Reference sample:

https://github.com/microsoft/presidio/blob/8b9d961d1f4ddb65e124a480f3c72c63894b0b5f/docs/samples/python/GPT3_synth_data.ipynb#L33
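To make the request concrete, here is a rough sketch of the two behaviours mentioned in the title and description: masking sensitive spans with placeholders, and replacing them with random values. It uses only the Python standard library. The two regexes are hypothetical stand-ins for Presidio's recognizers, and every name below (`PATTERNS`, `mask`, `replace_with_random`) is made up for illustration; none of it is taken from `privacy_transform.py` or from the linked notebook.

```python
import random
import re
import string

# Hypothetical stand-ins for PII recognizers. A real pipeline would rely on
# Presidio's analyzer rather than these two simplistic regexes.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE_NUMBER": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def mask(text):
    """Replace each detected span with a placeholder such as <EMAIL_ADDRESS>."""
    for entity, pattern in PATTERNS.items():
        text = pattern.sub(f"<{entity}>", text)
    return text


def _random_value(entity, rng):
    """Generate a random replacement of the same kind as the detected entity."""
    if entity == "EMAIL_ADDRESS":
        user = "".join(rng.choices(string.ascii_lowercase, k=8))
        return f"{user}@example.com"
    if entity == "PHONE_NUMBER":
        return "555-{:03d}-{:04d}".format(rng.randrange(1000), rng.randrange(10000))
    raise ValueError(f"no generator for entity type {entity}")


def replace_with_random(text, rng=None):
    """Replace each detected span with a random value instead of a placeholder."""
    rng = rng or random.Random()
    for entity, pattern in PATTERNS.items():
        # Bind the entity name per iteration so the callback uses the right one.
        text = pattern.sub(lambda m, e=entity: _random_value(e, rng), text)
    return text


sample = "Contact alice@corp.com or 212-555-0187."
print(mask(sample))
print(replace_with_random(sample, random.Random(0)))
```

In a Presidio-based version, detection would presumably come from `AnalyzerEngine.analyze` and the replacement step from `AnonymizerEngine.anonymize` with custom operators; the regexes here only keep the sketch dependency-free.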

nkwangleiGIT commented 10 months ago

Try to refactor privacy_transform.py to use the approach above?

wangxinbiao commented 10 months ago

> Try to refactor privacy_transform.py to use the approach above?

ok