Open alanmiller opened 2 years ago
@urykhy as our resident HDFS expert do you have an opinion on how difficult it would be to add TDE support?
What is sounds like is happening is that rclone is just copying the encrypted files and not decrypting them.
For the immediate term, I'd suggest mentioning this limitation in the documentation.
This certainly sounds like a good short term measure
If the documentation gets updated to mention this limitation you could also include this workaround: Copy the data you intend to rclone, out of the TDE encryption zone, rclone the copy, delete the copy. E.g.:
- Then run rclone: rclone copy source_cluster:/un-encrypted/data dest_prod:/encrypted-zone/data
there is no mistake here ? in my experiments we can't upload to encrypted-zone:
HADOOP_CONF_DIR=/tmp/xhadoop-conf-dir rclone copy test.plain hadoop:/test/key/
2021/10/31 22:07:02 ERROR : test.plain: Failed to copy: create /test/key/test.plain: create call failed with ERROR_APPLICATION (org.apache.hadoop.hdfs.UnknownCryptoProtocolVersionException)
btw, just to note. i currently working to implement TDE for upstream.
btw, just to note. i currently working to implement TDE for upstream.
Thank you :-)
Any updates on this issue (feature) resolution ?
Any updates on this issue (feature) resolution ?
waiting on https://github.com/colinmarc/hdfs/pull/281
The associated forum post URL from
https://forum.rclone.org
Forum Post: https://forum.rclone.org/t/does-rclone-support-hdfs-tde/26996/9
What is your current rclone version (output from
rclone version
)?rclone v1.56.2
What problem are you are trying to solve?
I'm trying to copy HDFS files from a source cluster that has Hadoop TDE enabled to a destination cluster where TDE is not enabled. I've posted examples in the forum post above but the summary is that:
How do you think rclone should be changed to solve that?
For the immediate term, I'd suggest mentioning this limitation in the documentation. For the longer term, rclone should retrieve the unencrypted content of the HDFS files if they are in an encryption zone and transmit those contents to the destination cluster.
How to use GitHub