Closed Stone-0526 closed 1 year ago
@Hisoka-X PTAL
I also met, I was using the scene is hive (data volume: 100 million +) to doris. I find that the reason is that when the amount of data is large, the barrier message lags behind, causing checkpoint timeout, and the checkpoint timeout causes rollback tasks. I think it should be similar to flink's solution to increase the priority of barrier messages
I also met, I was using the scene is hive (data volume: 100 million +) to doris. I find that the reason is that when the amount of data is large, the barrier message lags behind, causing checkpoint timeout, and the checkpoint timeout causes rollback tasks. I think it should be similar to flink's solution to increase the priority of barrier messages
That's a great idea
I also met, I was using the scene is hive (data volume: 100 million +) to doris. I find that the reason is that when the amount of data is large, the barrier message lags behind, causing checkpoint timeout, and the checkpoint timeout causes rollback tasks. I think it should be similar to flink's solution to increase the priority of barrier messages
hi @xiaofan2022 .I have the same problem.Can you explain the solution in detail.
Search before asking
What happened
I want to send 100 million data to Doris, But task always fail because of NullPointerException. I tried three times.
Doris Version: 1.2.7.1 Doris Table:
CREATE TABLE IF NOT EXISTS olap.seatunnel_fake (
idBIGINT COMMENT "主键",
nameVARCHAR(256) COMMENT "名称",
ageINT COMMENT "年龄",
timeDATETIME COMMENT "时间" ) DUPLICATE KEY(
id,
name,
age) DISTRIBUTED BY HASH(
id) BUCKETS 8 PROPERTIES ( "replication_num" = "1");
SeaTunnel Version
SeaTunnel 2.3.3 Connector Doris 2.3.3 Connector Fake 2.3.3
SeaTunnel Config
Running Command
Error Exception
Zeta or Flink or Spark Version
No response
Java or Scala Version
No response
Screenshots
No response
Are you willing to submit PR?
Code of Conduct