Open Ariznawlll opened 1 month ago
https://grafana.ci.matrixorigin.cn/goto/cAK5e5jSg?orgId=1
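The root cause is that an RPC packet exceeded 100M. Because the dn's packet-splitting implementation has some margin of error, I suggest raising the rpc max message size to 200M and testing again.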
[0817] job url: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/10419909260
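After changing the config yesterday, searching the logs still shows this error, but the job itself ran without errors.
博哥 said the setting also needs to be added for logservice: [logservice.rpc] max-message-size "200M"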
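For reference, the setting mentioned above would look roughly like this as a TOML fragment. This is a sketch only: the section and key names are taken from the comment, the `=` assignment assumes standard TOML syntax, and which config file it belongs in is not stated in this thread.

```toml
# Sketch based on the comment above; not copied from an actual config file.
[logservice.rpc]
max-message-size = "200M"
```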
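Config added; waiting for test results.
No more problems after the config change.
The issue has not appeared in the last few runs, and Loki shows no 'context deadline exceeded' errors for main either.
Latest test result: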
https://github.com/matrixorigin/mo-nightly-regression/actions/runs/10509468758/job/29135554152
[0904] job url: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/10685465952/job/29635505459
commit: c03cbb087cafdf71c7ea076d082630cfb9cf830c
[0905] big-data-regression: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/10722568409/job/29736831271
The dn restarted at the corresponding time:
Not handled yet.
Is there an existing issue for the same bug?
Branch Name
main
Commit ID
4d2a745cd39ed0f8a680acded5904536acee65a5
Other Environment Information
Actual Behavior
job url: https://github.com/matrixorigin/mo-nightly-regression/actions/runs/10368751013/job/28728120148
Pod status:
The dn restarted at the time of the error:
Logs related to the 'context deadline exceeded' error: https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22GB3%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-big-data-20240813%5C%22%7D%20%7C%3D%20%60context%20deadline%20exceeded%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221723625980000%22,%22to%22:%221723625988000%22%7D%7D%7D&schemaVersion=1&orgId=1
Expected Behavior
No response
Steps to Reproduce
Additional information
No response