Dear all,
I'm running the azcopy sync command to back up data from Azure Data Lake to an Azure storage account.
I'm using a command like this:
azcopy sync https://mydatalake.blob.core.windows.net/my-container/ https://mystorageaccount.blob.core.windows.net/my-container/ --recursive --log-level=NONE --delete-destination=true
My Azure Data Lake contains tens of millions of small and medium-sized files. Syncing and copying newly created files takes ~10 minutes, while deleting extra objects at the destination takes many hours.
Moreover, even though I specified --log-level=NONE, I still see a message like the following for each removed file:
6142703 Files Scanned at Source, 6844507 Files Scanned at Destination, 2-sec Throughput (Mb/s): 0 INFO: Deleting extra object: DELTA/path/to/my/file.parquet
Questions:
1) Is it possible to delete files at the destination in batches?
2) How can I turn off the 'Deleting extra object' logging?
Details:
ubuntu:22.04
azcopy version 10.26.0
Environment variables:
Kind regards,