StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
https://starrocks.io
Apache License 2.0
9.01k stars 1.81k forks source link

[BugFix] The hdfs directory is not synchronized when the spark resource is deleted #45886

Open blanklin030 opened 6 months ago

blanklin030 commented 6 months ago

Steps to reproduce the behavior (Required)

2024-05-14 01:42:12,077 INFO (pending_load_task_scheduler_pool-1|498) [SparkRepository.rename():316] finished to rename file, originPath=hdfs://ClusterNmg/user/prod_xxx/sparketl/1384206915/spark_repositorydbtb_sr1019adb1d38c/archive_1.0.0/libspark-dpp-1.0.0-jar-with-dependencies.jar, destPath=hdfs://ClusterNmg/user/prod_xxx/sparketl/1384206915/spark_repositorydbtb_sr1019adb1d38c/archive_1.0.0/__lib_70688c469808112f344091125a860404_spark-dpp-1.0.0-jar-with-dependencies.jar


+ 3. drop spark resource

drop resource spark_resource


+ 4. The hdfs directory is not synchronized when the spark resource is deleted

[hadoop@bigdata-starrocks-xxx ~]$ hdfs dfs -ls hdfs://ClusterNmg/user/prod_xxx/sparketl/1384206915/spark_repository__spark_resource/archive_1.0.0/ Found 2 items -rw-r--r-- 3 prod_xxx supergroup 394653421 2024-05-20 10:54 hdfs://ClusterNmg/user/prod_xxx/sparketl/1384206915/spark_repository__spark_resource/archive_1.0.0/lib_62eff19a2751990e17b47aa258fb7623_spark-2x.zip -rw-r--r-- 3 prod_xxx supergroup 4013682 2024-05-20 10:53 hdfs://ClusterNmg/user/prod_xxx/sparketl/1384206915/spark_repositoryspark_resource/__archive_1.0.0/lib_70688c469808112f344091125a860404_spark-dpp-1.0.0-jar-with-dependencies.jar



### Expected behavior (Required)
drop spark resource and delete spark directory

### Real behavior (Required)
drop spark resource and the spark directory didn't remove

### StarRocks version (Required)
 - You can get the StarRocks version by executing SQL `select current_version()`
github-actions[bot] commented 18 hours ago

We have marked this issue as stale because it has been inactive for 6 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to StarRocks!