trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache License 2.0

Trino Hive connector Could not obtain block HDFS replication factor 3 #14942

Open mzabrar opened 2 years ago

mzabrar commented 2 years ago

We run a large-scale Hadoop cluster with an HDFS replication factor of 3. If one of the worker nodes is unreachable, the query fails. How can Trino be configured to retry against the other two available worker nodes instead of failing the query?

SQL Error [16777225]: Query failed (#20221030_084857_00226_7srw6): Error opening Hive split hdfs://hacluster/user/hive/warehouse/ps.db/table_19283/no=9/d487a5f6be727b5-d665f67b16360803_1518851677_data.0.parq (offset=0, length=5254626): Could not obtain block: BP-952478870-192.168.212.2-1451608027649:blk_40604014088_44408048387 file=/user/hive/warehouse/ps.db/table_19283/no=9/d487a5f6be727b5-d665f67b16360803_1518851677_data.0.parq

tiankx1003 commented 1 year ago

This may be useful for you: #14111

atifiu commented 1 year ago

@mzabrar Were you able to resolve this?