tripl-ai / questions

Forum for asking questions relating to Arc which are not defects.
https://arc.tripl.ai
0 stars 2 forks source link

data security #2

Closed a140262 closed 5 years ago

a140262 commented 5 years ago

How to enable data encryption at rest and in transit in ARC?

seddonm1 commented 5 years ago

A few answers: 1) encryption at rest should be handled by your storage target (for example Amazon S3 offers encryption at rest). 2) the Hadoop libraries support encyption: https://hadoop.apache.org/docs/r2.9.2/hadoop-aws/tools/hadoop-aws/index.html#Encrypting_objects_with_S3A 3) you can turn on additional options to Spark for encryption: https://spark.apache.org/docs/latest/security.html 3) if using Terraform like: https://github.com/tripl-ai/deploy it would be good to implement a Bastion server and potentially run the jobs in their own temporary subnet.