Open ming12713 opened 1 week ago
In my Case, Kafka Writes to Doris via Connector Sink Mode, When Doris is restarted, the connector continues to write data. The logs parse the coordinator BE IP. Is it possible that the connector is using the StreamLoad method to write data? This data is synchronized to the FE meta with bdb, but it has not yet been synchronized to the BE. If the BE is restarted at this moment, the FE may negotiate a BE coordinator IP that it cannot connect to, causing cluster issues. Is my understanding correct?
what's the doriscluster
spec, please share the yaml. In k8s, if the IP is not static when restarted, please set enable_fqdn_mode = true
to use fqdn communicate.
The connector sink mode uses streamload method to insert data.
what's the
doriscluster
spec, please share the yaml. In k8s, if the IP is not static when restarted, please setenable_fqdn_mode = true
to use fqdn communicate. The connector sink mode uses streamload method to insert data. @intelligentfu8 I observed the StreamLoad mechanism,The FE selects a BE (Backend) as the Coordinator node in a round-robin manner, which is responsible for scheduling the import job, and then returns an HTTP redirect to the client. The redirect uses the BE pod IP instead of svc , the reason might be related to this. https://doris.apache.org/docs/data-operate/import/import-way/stream-load-manual/
doriscluster.yaml
apiVersion: v1
items:
- apiVersion: doris.selectdb.com/v1
kind: DorisCluster
metadata:
labels:
app.kubernetes.io/instance: doriscluster
app.kubernetes.io/name: doriscluster
app.kubernetes.io/part-of: doris-operator
name: doriscluster
namespace: doris
resourceVersion: "18187746"
uid: 9b4d358b-ac8c-491c-8701-6a7ce61f4bdb
spec:
beSpec:
annotations:
selectdb/dorisclsuter.component: be
envVars:
- name: HOME
value: /opt/selectdb
image: selectdb/doris.be-ubuntu:2.1.1
limits:
cpu: 24
memory: 64Gi
nodeSelector:
kubernetes.io/hostname: loshu-kube-ds01
persistentVolumes:
- mountPath: /opt/apache-doris/be/storage
name: doris-be
replicas: 1
requests:
cpu: 2
memory: 8Gi
service:
servicePorts:
- nodePort: 32422
targetPort: 9060
- nodePort: 30652
targetPort: 8040
- nodePort: 30891
targetPort: 9050
- nodePort: 31420
targetPort: 8060
type: NodePort
systemInitialization:
command:
- /sbin/sysctl
- -w
- vm.max_map_count=2000000
feSpec:
annotations:
selectdb/dorisclsuter.component: fe
configMapInfo:
configMapName: fe-configmap
resolveKey: fe.conf
envVars:
- name: HOME
value: /opt/selectdb
image: selectdb/doris.fe-ubuntu:2.1.1
limits:
cpu: 8
memory: 32Gi
nodeSelector:
kubernetes.io/hostname: loshu-kube-ds
persistentVolumes:
- mountPath: /opt/apache-doris/fe/doris-meta
name: doris-fe
replicas: 1
requests:
cpu: 2
memory: 4Gi
service:
servicePorts:
- nodePort: 30148
targetPort: 8030
- nodePort: 30252
targetPort: 9020
- nodePort: 31341
targetPort: 9030
type: NodePort
systemInitialization:
command:
- /sbin/sysctl
- -w
- vm.max_map_count=2000000
Search before asking
Version
2.11
What's Wrong?
Doris Installation via Operator, 1 BE Node and 1 FE Node, After restarting both the Doris FE and BE nodes, the FE node fails to start normally and reports the error mentioned above. The BE IP 10.42.1.19 mentioned in the error is the previous BE pod IP, not the SVC IP. The FE configuration for service discovery is set to use SVC (Service) method, but now the BE is 10.42.1.6.
pod network cidr 10.42.1.x/16 svc network cidr 10.43.48.x
What You Expected?
fix issue
How to Reproduce?
No response
Anything Else?
No response
Are you willing to submit PR?
Code of Conduct