EMQX Cluster stuck on `CoreNodesProgressing`

onedr0p commented 7 months ago

Describe the bug

I have deployed emqx operator and tried create a EMQX cluster, and if I set the replicas to 3 the EMQX cluster stays in a CoreNodesProgressing state.

❯ k get emqx -A
NAMESPACE   NAME    STATUS                 AGE
database    emqx5   CoreNodesProgressing   50m

To Reproduce

Deploy Talos Linux (might not matter)
Deploy Cilium CNI without kube-proxy and with L2 announcements (might not matter)
Deploy emqx-operator with Flux (might not matter)
Deploy EMQX resource with Flux (might not matter)
Notice sometimes the readiness gates keep flapping between each core replica every second and that the cluster status never goes to Running

helm values

    fullnameOverride: emqx-operator
    image:
      repository: ghcr.io/emqx/emqx-operator
    development: true

cluster definition

apiVersion: apps.emqx.io/v2beta1
kind: EMQX
metadata:
  name: emqx5
  namespace: database
spec:
  image: public.ecr.aws/emqx/emqx:5.6.0
  config:
    data: |
      log.console.level = debug
  coreTemplate:
    spec:
      replicas: 3
  # FIXME: Also the below doesn't work for whatever reason
  # listenersServiceTemplate:
  #   metadata:
  #     annotations:
  #       io.cilium/lb-ipam-ips: 192.168.42.129
  #   spec:
  #     type: LoadBalancer

operator logs

2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a mutating webhook  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxPlugin", "path": "/mutate-apps-emqx-io-v1beta4-emqxplugin"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/mutate-apps-emqx-io-v1beta4-emqxplugin"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a validating webhook    {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxPlugin", "path": "/validate-apps-emqx-io-v1beta4-emqxplugin"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/validate-apps-emqx-io-v1beta4-emqxplugin"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/convert"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Conversion webhook enabled  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxPlugin"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a mutating webhook  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxBroker", "path": "/mutate-apps-emqx-io-v1beta4-emqxbroker"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/mutate-apps-emqx-io-v1beta4-emqxbroker"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a validating webhook    {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxBroker", "path": "/validate-apps-emqx-io-v1beta4-emqxbroker"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/validate-apps-emqx-io-v1beta4-emqxbroker"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Conversion webhook enabled  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxBroker"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a mutating webhook  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxEnterprise", "path": "/mutate-apps-emqx-io-v1beta4-emqxenterprise"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/mutate-apps-emqx-io-v1beta4-emqxenterprise"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a validating webhook    {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxEnterprise", "path": "/validate-apps-emqx-io-v1beta4-emqxenterprise"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/validate-apps-emqx-io-v1beta4-emqxenterprise"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Conversion webhook enabled  {"GVK": "apps.emqx.io/v1beta4, Kind=EmqxEnterprise"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  skip registering a mutating webhook, object does not implement admission.Defaulter or WithDefaulter wasn't called   {"GVK": "apps.emqx.io/v2beta1, Kind=Rebalance"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Registering a validating webhook    {"GVK": "apps.emqx.io/v2beta1, Kind=Rebalance", "path": "/validate-apps-emqx-io-v2beta1-rebalance"}
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Registering webhook {"path": "/validate-apps-emqx-io-v2beta1-rebalance"}
2024-04-09T18:00:30Z    INFO    controller-runtime.builder  Conversion webhook enabled  {"GVK": "apps.emqx.io/v2beta1, Kind=Rebalance"}
2024-04-09T18:00:30Z    INFO    setup   starting manager
2024-04-09T18:00:30Z    INFO    starting server {"kind": "health probe", "addr": "[::]:8081"}
2024-04-09T18:00:30Z    INFO    controller-runtime.metrics  Starting metrics server
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Starting webhook server
2024-04-09T18:00:30Z    INFO    controller-runtime.metrics  Serving metrics server  {"bindAddress": ":8080", "secure": false}
I0409 18:00:30.529284       1 leaderelection.go:250] attempting to acquire leader lease database/19fd6fcc.emqx.io...
2024-04-09T18:00:30Z    INFO    controller-runtime.certwatcher  Updated current TLS certificate
2024-04-09T18:00:30Z    INFO    controller-runtime.webhook  Serving webhook server  {"host": "", "port": 9443}
2024-04-09T18:00:30Z    INFO    controller-runtime.certwatcher  Starting certificate watcher
I0409 18:01:10.585794       1 leaderelection.go:260] successfully acquired lease database/19fd6fcc.emqx.io
2024-04-09T18:01:10Z    DEBUG   events  emqx-operator-controller-manager-df7dcc4dc-ww4dg_c97157c6-372c-4aba-a1ad-7014e583c8e8 became leader {"type": "Normal", "object": {"kind":"Lease","namespace":"database","name":"19fd6fcc.emqx.io","uid":"4489c0e9-d609-4907-962f-5a4e3202a963","apiVersion":"coordination.k8s.io/v1","resourceVersion":"52014238"}, "reason": "LeaderElection"}
2024-04-09T18:01:10Z    INFO    Starting EventSource    {"controller": "emqxbroker", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxBroker", "source": "kind source: *v1beta4.EmqxBroker"}
2024-04-09T18:01:10Z    INFO    Starting EventSource    {"controller": "emqxplugin", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxPlugin", "source": "kind source: *v1beta4.EmqxPlugin"}
2024-04-09T18:01:10Z    INFO    Starting Controller {"controller": "emqxplugin", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxPlugin"}
2024-04-09T18:01:10Z    INFO    Starting Controller {"controller": "emqxbroker", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxBroker"}
2024-04-09T18:01:10Z    INFO    Starting EventSource    {"controller": "emqxenterprise", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxEnterprise", "source": "kind source: *v1beta4.EmqxEnterprise"}
2024-04-09T18:01:10Z    INFO    Starting Controller {"controller": "emqxenterprise", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxEnterprise"}
2024-04-09T18:01:10Z    INFO    Starting EventSource    {"controller": "rebalance", "controllerGroup": "apps.emqx.io", "controllerKind": "Rebalance", "source": "kind source: *v2beta1.Rebalance"}
2024-04-09T18:01:10Z    INFO    Starting Controller {"controller": "rebalance", "controllerGroup": "apps.emqx.io", "controllerKind": "Rebalance"}
2024-04-09T18:01:10Z    INFO    Starting EventSource    {"controller": "emqx", "controllerGroup": "apps.emqx.io", "controllerKind": "EMQX", "source": "kind source: *v2beta1.EMQX"}
2024-04-09T18:01:10Z    INFO    Starting Controller {"controller": "emqx", "controllerGroup": "apps.emqx.io", "controllerKind": "EMQX"}
2024-04-09T18:01:10Z    INFO    Starting workers    {"controller": "emqxenterprise", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxEnterprise", "worker count": 1}
2024-04-09T18:01:10Z    INFO    Starting workers    {"controller": "emqxbroker", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxBroker", "worker count": 1}
2024-04-09T18:01:10Z    INFO    Starting workers    {"controller": "emqxplugin", "controllerGroup": "apps.emqx.io", "controllerKind": "EmqxPlugin", "worker count": 1}
2024-04-09T18:01:10Z    INFO    Starting workers    {"controller": "emqx", "controllerGroup": "apps.emqx.io", "controllerKind": "EMQX", "worker count": 1}
2024-04-09T18:01:10Z    INFO    Starting workers    {"controller": "rebalance", "controllerGroup": "apps.emqx.io", "controllerKind": "Rebalance", "worker count": 1}
2024-04-09T18:01:23Z    INFO    got different pod template for EMQX core nodes, will create new statefulSet {"controller": "emqx", "controllerGroup": "apps.emqx.io", "controllerKind": "EMQX", "EMQX": {"name":"emqx5","namespace":"database"}, "namespace": "database", "name": "emqx5", "reconcileID": "ede3c87a-9937-424c-9ffb-bc67f5797dac", "statefulSet": {"name":"emqx5-core-6796d44f","namespace":"database"}, "patch": "{should create new StatefulSet}"}
2024-04-09T18:01:23Z    INFO    KubeAPIWarningLogger    unknown field "spec.coreTemplate.metadata.creationTimestamp"

a replica logs

EMQX_API_KEY__BOOTSTRAP_FILE [api_key.bootstrap_file]: /opt/emqx/data/bootstrap_api_key
EMQX_DASHBOARD__LISTENERS__HTTP__BIND [dashboard.listeners.http.bind]: 18083
EMQX_RPC__PORT_DISCOVERY [rpc.port_discovery]: manual
EMQX_CLUSTER__DNS__RECORD_TYPE [cluster.dns.record_type]: srv
EMQX_CLUSTER__DNS__NAME [cluster.dns.name]: emqx5-headless.database.svc.cluster.local
EMQX_CLUSTER__DISCOVERY_STRATEGY [cluster.discovery_strategy]: dns
EMQX_NODE__ROLE [node.role]: core
EMQX_NODE__DATA_DIR [node.data_dir]: data
EMQX_NODE__COOKIE [node.cookie]: ******
EMQX_NODE__NAME [node.name]: emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local
2024-04-09T18:01:28.385208+00:00 [info] event=server_setup_successfully driver=tcp port=5369 socket="#Port<0.7>"
2024-04-09T18:01:28.385415+00:00 [info] msg: gen_rpc_dispatcher_start
2024-04-09T18:01:28.386486+00:00 [notice] msg: Starting mria
2024-04-09T18:01:28.386932+00:00 [notice] msg: Starting mnesia
2024-04-09T18:01:28.387048+00:00 [debug] msg: Ensure mnesia schema
2024-04-09T18:01:28.392071+00:00 [notice] msg: Creating new mnesia schema, result: ok
2024-04-09T18:01:28.401294+00:00 [notice] msg: Starting shards
2024-04-09T18:01:28.401471+00:00 [debug] msg: rlog_schema_init
2024-04-09T18:01:28.406159+00:00 [info] msg: Setting RLOG shard config, shard: '$mria_meta_shard', tables: ['$mria_rlog_sync',mria_schema]
2024-04-09T18:01:28.406322+00:00 [info] msg: Converging schema
2024-04-09T18:01:28.406539+00:00 [info] msg: Setting RLOG shard config, shard: '$mria_meta_shard', tables: ['$mria_rlog_sync',mria_schema]
2024-04-09T18:01:28.406839+00:00 [info] msg: starting_rlog_shard, shard: '$mria_meta_shard'
2024-04-09T18:01:28.407120+00:00 [info] msg: Starting ekka
2024-04-09T18:01:28.407285+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: '$mria_meta_shard'
2024-04-09T18:01:28.407364+00:00 [notice] msg: Mria is running
2024-04-09T18:01:28.407439+00:00 [info] msg: Starting ekka
2024-04-09T18:01:28.407648+00:00 [info] msg: Ekka is running
2024-04-09T18:01:28.407692+00:00 [info] msg: Ekka is running
2024-04-09T18:01:28.407771+00:00 [notice] msg: (re)starting_emqx_apps
2024-04-09T18:01:28.408972+00:00 [debug] msg: starting_app, app: emqx_conf
2024-04-09T18:01:28.409107+00:00 [debug] msg: mria_mnesia_create_table, name: cluster_rpc_mfa, options: [{type,ordered_set},{rlog_shard,emqx_cluster_rpc_shard},{storage,disc_copies},{record_name,cluster_rpc_mfa},{attributes,[tnx_id,mfa,created_at,initiator]}]
2024-04-09T18:01:28.411630+00:00 [info] msg: Adding table to a shard, shard: emqx_cluster_rpc_shard, table: cluster_rpc_mfa
2024-04-09T18:01:28.411767+00:00 [debug] msg: ensure_local_table, '$span': start, table: cluster_rpc_mfa
2024-04-09T18:01:28.411828+00:00 [info] msg: Setting RLOG shard config, shard: emqx_cluster_rpc_shard, tables: [cluster_rpc_mfa]
2024-04-09T18:01:28.411951+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: cluster_rpc_mfa
2024-04-09T18:01:28.412031+00:00 [debug] msg: mria_mnesia_create_table, name: cluster_rpc_commit, options: [{type,set},{rlog_shard,emqx_cluster_rpc_shard},{storage,disc_copies},{record_name,cluster_rpc_commit},{attributes,[node,tnx_id]}]
2024-04-09T18:01:28.414256+00:00 [info] msg: Adding table to a shard, shard: emqx_cluster_rpc_shard, table: cluster_rpc_commit
2024-04-09T18:01:28.414388+00:00 [debug] msg: ensure_local_table, '$span': start, table: cluster_rpc_commit
2024-04-09T18:01:28.414536+00:00 [info] msg: Setting RLOG shard config, shard: emqx_cluster_rpc_shard, tables: [cluster_rpc_commit,cluster_rpc_mfa]
2024-04-09T18:01:28.414664+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: cluster_rpc_commit
2024-04-09T18:01:28.414790+00:00 [info] msg: starting_rlog_shard, shard: emqx_cluster_rpc_shard
2024-04-09T18:01:28.415064+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_cluster_rpc_shard
2024-04-09T18:01:28.415512+00:00 [info] msg: wait_for_cluster_rpc_shard, result: ok
2024-04-09T18:01:28.415624+00:00 [info] msg: wait_for_cluster_rpc_tables, result: ok
2024-04-09T18:01:28.416279+00:00 [info] msg: skip_sync_cluster_conf, reason: This is a single node, or the first node in the cluster
2024-04-09T18:01:28.445587+00:00 [debug] msg: started_apps, apps: [emqx_conf]
2024-04-09T18:01:28.445739+00:00 [debug] msg: starting_app, app: esockd
2024-04-09T18:01:28.446120+00:00 [debug] msg: started_apps, apps: [esockd]
2024-04-09T18:01:28.446213+00:00 [debug] msg: starting_app, app: gproc
2024-04-09T18:01:28.446533+00:00 [debug] msg: started_apps, apps: [gproc]
2024-04-09T18:01:28.446586+00:00 [debug] msg: starting_app, app: emqx_http_lib
2024-04-09T18:01:28.446756+00:00 [debug] msg: started_apps, apps: [emqx_http_lib]
2024-04-09T18:01:28.446797+00:00 [debug] msg: starting_app, app: bcrypt
2024-04-09T18:01:28.447216+00:00 [debug] msg: started_apps, apps: [bcrypt]
2024-04-09T18:01:28.447258+00:00 [debug] msg: starting_app, app: ranch
2024-04-09T18:01:28.447474+00:00 [debug] msg: started_apps, apps: [ranch]
2024-04-09T18:01:28.447517+00:00 [debug] msg: starting_app, app: cowboy
2024-04-09T18:01:28.447716+00:00 [debug] msg: started_apps, apps: [cowboy]
2024-04-09T18:01:28.447766+00:00 [debug] msg: starting_app, app: esasl
2024-04-09T18:01:28.447904+00:00 [debug] msg: started_apps, apps: [pbkdf2,esasl]
2024-04-09T18:01:28.447940+00:00 [debug] msg: starting_app, app: emqx_utils
2024-04-09T18:01:28.448126+00:00 [debug] msg: started_apps, apps: [jiffy,emqx_utils]
2024-04-09T18:01:28.448185+00:00 [debug] msg: starting_app, app: emqx_durable_storage
2024-04-09T18:01:28.448370+00:00 [debug] msg: started_apps, apps: [emqx_durable_storage]
2024-04-09T18:01:28.448419+00:00 [debug] msg: starting_app, app: emqx
2024-04-09T18:01:28.449105+00:00 [notice] alarm_handler: {clear,lc_mem_alarm}
2024-04-09T18:01:28.450241+00:00 [debug] msg: skip_init_config_load, loader: emqx_conf, reason: Some application has set another config loader
2024-04-09T18:01:28.450465+00:00 [debug] msg: mria_mnesia_create_table, name: bpapi, options: [{type,set},{storage,ram_copies},{attributes,[key,version]},{rlog_shard,emqx_common_shard}]
2024-04-09T18:01:28.451396+00:00 [notice] alarm_handler: {clear,lc_runq_alarm}
2024-04-09T18:01:28.454812+00:00 [info] msg: Adding table to a shard, shard: emqx_common_shard, table: bpapi
2024-04-09T18:01:28.454967+00:00 [debug] msg: ensure_local_table, '$span': start, table: bpapi
2024-04-09T18:01:28.455087+00:00 [info] msg: Setting RLOG shard config, shard: emqx_common_shard, tables: [bpapi]
2024-04-09T18:01:28.455824+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: bpapi
2024-04-09T18:01:28.455968+00:00 [info] msg: starting_rlog_shard, shard: emqx_common_shard
2024-04-09T18:01:28.456239+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_common_shard
2024-04-09T18:01:28.464698+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_trie, options: [{rlog_shard,route_shard},{record_name,emqx_trie},{attributes,[key,count]},{type,ordered_set},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.467132+00:00 [info] msg: Adding table to a shard, shard: route_shard, table: emqx_trie
2024-04-09T18:01:28.467326+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_trie
2024-04-09T18:01:28.467481+00:00 [info] msg: Setting RLOG shard config, shard: route_shard, tables: [emqx_trie]
2024-04-09T18:01:28.467765+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_trie
2024-04-09T18:01:28.467864+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_route, options: [{type,bag},{rlog_shard,route_shard},{storage,ram_copies},{record_name,route},{attributes,[topic,dest]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.470379+00:00 [info] msg: Adding table to a shard, shard: route_shard, table: emqx_route
2024-04-09T18:01:28.470515+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_route
2024-04-09T18:01:28.470649+00:00 [info] msg: Setting RLOG shard config, shard: route_shard, tables: [emqx_route,emqx_trie]
2024-04-09T18:01:28.470786+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_route
2024-04-09T18:01:28.470852+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_route_filters, options: [{type,ordered_set},{rlog_shard,route_shard},{storage,ram_copies},{record_name,routeidx},{attributes,[entry,unused]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true},{decentralized_counters,true}]}]}]
2024-04-09T18:01:28.473096+00:00 [info] msg: Adding table to a shard, shard: route_shard, table: emqx_route_filters
2024-04-09T18:01:28.473243+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_route_filters
2024-04-09T18:01:28.473303+00:00 [info] msg: Setting RLOG shard config, shard: route_shard, tables: [emqx_route,emqx_route_filters,emqx_trie]
2024-04-09T18:01:28.474328+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_route_filters
2024-04-09T18:01:28.474482+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_routing_node, options: [{type,set},{rlog_shard,route_shard},{storage,ram_copies},{record_name,routing_node},{attributes,[name,const]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.476560+00:00 [info] msg: Adding table to a shard, shard: route_shard, table: emqx_routing_node
2024-04-09T18:01:28.476710+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_routing_node
2024-04-09T18:01:28.476804+00:00 [info] msg: Setting RLOG shard config, shard: route_shard, tables: [emqx_route,emqx_route_filters,emqx_routing_node,emqx_trie]
2024-04-09T18:01:28.478025+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_routing_node
2024-04-09T18:01:28.479148+00:00 [info] msg: starting_rlog_shard, shard: route_shard
2024-04-09T18:01:28.479471+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: route_shard
2024-04-09T18:01:28.480154+00:00 [info] msg: routing_schema_used, schema: v2
2024-04-09T18:01:28.481017+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_shared_subscription, options: [{type,bag},{rlog_shard,emqx_shared_sub_shard},{storage,ram_copies},{record_name,emqx_shared_subscription},{attributes,[group,topic,subpid]}]
2024-04-09T18:01:28.483394+00:00 [info] msg: Adding table to a shard, shard: emqx_shared_sub_shard, table: emqx_shared_subscription
2024-04-09T18:01:28.483568+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_shared_subscription
2024-04-09T18:01:28.483641+00:00 [info] msg: Setting RLOG shard config, shard: emqx_shared_sub_shard, tables: [emqx_shared_subscription]
2024-04-09T18:01:28.483761+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_shared_subscription
2024-04-09T18:01:28.483844+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_exclusive_subscription, options: [{rlog_shard,emqx_exclusive_shard},{type,set},{storage,ram_copies},{record_name,exclusive_subscription},{attributes,[topic,clientid]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.486496+00:00 [info] msg: Adding table to a shard, shard: emqx_exclusive_shard, table: emqx_exclusive_subscription
2024-04-09T18:01:28.486625+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_exclusive_subscription
2024-04-09T18:01:28.486752+00:00 [info] msg: Setting RLOG shard config, shard: emqx_exclusive_shard, tables: [emqx_exclusive_subscription]
2024-04-09T18:01:28.486876+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_exclusive_subscription
2024-04-09T18:01:28.487002+00:00 [info] msg: starting_rlog_shard, shard: emqx_exclusive_shard
2024-04-09T18:01:28.487115+00:00 [info] msg: starting_rlog_shard, shard: emqx_shared_sub_shard
2024-04-09T18:01:28.487236+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_exclusive_shard
2024-04-09T18:01:28.487334+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_shared_sub_shard
2024-04-09T18:01:28.489009+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_banned, options: [{type,set},{rlog_shard,emqx_common_shard},{storage,disc_copies},{record_name,banned},{attributes,[who,by,reason,at,until]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.491955+00:00 [info] msg: Adding table to a shard, shard: emqx_common_shard, table: emqx_banned
2024-04-09T18:01:28.492120+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_banned
2024-04-09T18:01:28.492234+00:00 [info] msg: Setting RLOG shard config, shard: emqx_common_shard, tables: [bpapi,emqx_banned]
2024-04-09T18:01:28.492399+00:00 [notice] msg: Shard schema change, new_table: emqx_banned, shard: emqx_common_shard
2024-04-09T18:01:28.492390+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_banned
2024-04-09T18:01:28.492473+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_banned_rules, options: [{type,set},{rlog_shard,emqx_common_shard},{storage,disc_copies},{record_name,banned},{attributes,[who,by,reason,at,until]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.497377+00:00 [info] msg: Adding table to a shard, shard: emqx_common_shard, table: emqx_banned_rules
2024-04-09T18:01:28.497509+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_banned_rules
2024-04-09T18:01:28.497620+00:00 [info] msg: Setting RLOG shard config, shard: emqx_common_shard, tables: [bpapi,emqx_banned,emqx_banned_rules]
2024-04-09T18:01:28.497750+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_banned_rules
2024-04-09T18:01:28.497759+00:00 [notice] msg: Shard schema change, new_table: emqx_banned_rules, shard: emqx_common_shard
2024-04-09T18:01:28.498762+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_channel_registry, options: [{type,bag},{rlog_shard,emqx_cm_shard},{storage,ram_copies},{record_name,channel},{attributes,[chid,pid]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.501155+00:00 [info] msg: Adding table to a shard, shard: emqx_cm_shard, table: emqx_channel_registry
2024-04-09T18:01:28.501310+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_channel_registry
2024-04-09T18:01:28.501441+00:00 [info] msg: Setting RLOG shard config, shard: emqx_cm_shard, tables: [emqx_channel_registry]
2024-04-09T18:01:28.501679+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_channel_registry
2024-04-09T18:01:28.501775+00:00 [info] msg: starting_rlog_shard, shard: emqx_cm_shard
2024-04-09T18:01:28.501999+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_cm_shard
2024-04-09T18:01:28.502101+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_activated_alarm, options: [{type,ordered_set},{storage,disc_copies},{local_content,true},{record_name,activated_alarm},{attributes,[name,details,message,activate_at]}]
2024-04-09T18:01:28.504565+00:00 [info] msg: Adding table to a shard, shard: undefined, table: emqx_activated_alarm
2024-04-09T18:01:28.504704+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_activated_alarm
2024-04-09T18:01:28.504799+00:00 [info] msg: Setting RLOG shard config, shard: undefined, tables: [emqx_activated_alarm]
2024-04-09T18:01:28.504902+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_activated_alarm
2024-04-09T18:01:28.504993+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_deactivated_alarm, options: [{type,ordered_set},{storage,disc_copies},{local_content,true},{record_name,deactivated_alarm},{attributes,[activate_at,name,details,message,deactivate_at]}]
2024-04-09T18:01:28.507646+00:00 [info] msg: Adding table to a shard, shard: undefined, table: emqx_deactivated_alarm
2024-04-09T18:01:28.507853+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_deactivated_alarm
2024-04-09T18:01:28.508195+00:00 [info] msg: Setting RLOG shard config, shard: undefined, tables: [emqx_activated_alarm,emqx_deactivated_alarm]
2024-04-09T18:01:28.508405+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_deactivated_alarm
Listener ssl:default on :8883 started.
Listener tcp:default on :1883 started.
Listener ws:default on :8083 started.
Listener wss:default on :8084 started.
2024-04-09T18:01:28.518174+00:00 [debug] msg: started_apps, apps: [os_mon,hocon,lc,emqx]
2024-04-09T18:01:28.518270+00:00 [debug] msg: starting_app, app: emqx_retainer
2024-04-09T18:01:28.518779+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_retainer_index_meta, options: [{type,set},{rlog_shard,emqx_retainer_shard},{storage,ram_copies},{record_name,retained_index_meta},{attributes,[key,read_indices,write_indices,reindexing,extra]},{storage_properties,[{ets,[compressed,{read_concurrency,true},{write_concurrency,true}]},{dets,[{auto_save,1000}]}]}]
2024-04-09T18:01:28.521119+00:00 [info] msg: Adding table to a shard, shard: emqx_retainer_shard, table: emqx_retainer_index_meta
2024-04-09T18:01:28.521234+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_retainer_index_meta
2024-04-09T18:01:28.521320+00:00 [info] msg: Setting RLOG shard config, shard: emqx_retainer_shard, tables: [emqx_retainer_index_meta]
2024-04-09T18:01:28.521421+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_retainer_index_meta
2024-04-09T18:01:28.521539+00:00 [info] msg: starting_rlog_shard, shard: emqx_retainer_shard
2024-04-09T18:01:28.521935+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_retainer_shard
2024-04-09T18:01:28.522050+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_retainer_message, options: [{type,ordered_set},{rlog_shard,emqx_retainer_shard},{storage,ram_copies},{record_name,retained_message},{attributes,[topic,msg,expiry_time]},{storage_properties,[{ets,[compressed,{read_concurrency,true},{write_concurrency,true}]},{dets,[{auto_save,1000}]}]}]
2024-04-09T18:01:28.524416+00:00 [info] msg: Adding table to a shard, shard: emqx_retainer_shard, table: emqx_retainer_message
2024-04-09T18:01:28.524592+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_retainer_message
2024-04-09T18:01:28.524695+00:00 [info] msg: Setting RLOG shard config, shard: emqx_retainer_shard, tables: [emqx_retainer_index_meta,emqx_retainer_message]
2024-04-09T18:01:28.524830+00:00 [notice] msg: Shard schema change, new_table: emqx_retainer_message, shard: emqx_retainer_shard
2024-04-09T18:01:28.524830+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_retainer_message
2024-04-09T18:01:28.524955+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_retainer_index, options: [{type,ordered_set},{rlog_shard,emqx_retainer_shard},{storage,ram_copies},{record_name,retained_index},{attributes,[key,expiry_time]},{storage_properties,[{ets,[compressed,{read_concurrency,true},{write_concurrency,true}]},{dets,[{auto_save,1000}]}]}]
2024-04-09T18:01:28.531577+00:00 [info] msg: Adding table to a shard, shard: emqx_retainer_shard, table: emqx_retainer_index
2024-04-09T18:01:28.531753+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_retainer_index
2024-04-09T18:01:28.531915+00:00 [info] msg: Setting RLOG shard config, shard: emqx_retainer_shard, tables: [emqx_retainer_index,emqx_retainer_index_meta,emqx_retainer_message]
2024-04-09T18:01:28.532049+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_retainer_index
2024-04-09T18:01:28.532056+00:00 [notice] msg: Shard schema change, new_table: emqx_retainer_index, shard: emqx_retainer_shard
2024-04-09T18:01:28.532872+00:00 [debug] msg: started_apps, apps: [emqx_retainer]
2024-04-09T18:01:28.532947+00:00 [debug] msg: starting_app, app: emqx_modules
2024-04-09T18:01:28.533116+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_delayed, options: [{type,ordered_set},{storage,disc_copies},{local_content,true},{record_name,delayed_message},{attributes,[key,delayed,msg]}]
2024-04-09T18:01:28.535676+00:00 [info] msg: Adding table to a shard, shard: undefined, table: emqx_delayed
2024-04-09T18:01:28.535807+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_delayed
2024-04-09T18:01:28.536031+00:00 [info] msg: Setting RLOG shard config, shard: undefined, tables: [emqx_activated_alarm,emqx_deactivated_alarm,emqx_delayed]
2024-04-09T18:01:28.536648+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_delayed
2024-04-09T18:01:28.537332+00:00 [debug] msg: emqx_topic_metrics_started
2024-04-09T18:01:28.537545+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_trace, options: [{type,set},{rlog_shard,emqx_common_shard},{storage,disc_copies},{record_name,emqx_trace},{attributes,[name,type,filter,enable,payload_encode,extra,start_at,end_at]}]
2024-04-09T18:01:28.540344+00:00 [info] msg: Adding table to a shard, shard: emqx_common_shard, table: emqx_trace
2024-04-09T18:01:28.540557+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_trace
2024-04-09T18:01:28.540688+00:00 [info] msg: Setting RLOG shard config, shard: emqx_common_shard, tables: [bpapi,emqx_banned,emqx_banned_rules,emqx_trace]
2024-04-09T18:01:28.541188+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_trace
2024-04-09T18:01:28.541222+00:00 [notice] msg: Shard schema change, new_table: emqx_trace, shard: emqx_common_shard
2024-04-09T18:01:28.543222+00:00 [debug] msg: started_apps, apps: [emqx_modules]
2024-04-09T18:01:28.543334+00:00 [debug] msg: starting_app, app: emqx_plugins
2024-04-09T18:01:28.543878+00:00 [debug] msg: started_apps, apps: [emqx_plugins]
2024-04-09T18:01:28.543934+00:00 [debug] msg: starting_app, app: emqx_slow_subs
2024-04-09T18:01:28.544403+00:00 [debug] msg: started_apps, apps: [emqx_slow_subs]
2024-04-09T18:01:28.544441+00:00 [debug] msg: starting_app, app: emqx_resource
2024-04-09T18:01:28.545386+00:00 [debug] msg: started_apps, apps: [jsx,ecpool,emqx_resource]
2024-04-09T18:01:28.545436+00:00 [debug] msg: starting_app, app: emqx_bridge_mqtt
2024-04-09T18:01:28.545620+00:00 [debug] msg: started_apps, apps: [emqtt,emqx_bridge_mqtt]
2024-04-09T18:01:28.545707+00:00 [debug] msg: starting_app, app: emqx_postgresql
2024-04-09T18:01:28.545850+00:00 [debug] msg: started_apps, apps: [emqx_postgresql]
2024-04-09T18:01:28.545934+00:00 [debug] msg: starting_app, app: emqx_bridge_http
2024-04-09T18:01:28.546357+00:00 [debug] msg: started_apps, apps: [ehttpc,emqx_bridge_http]
2024-04-09T18:01:28.546425+00:00 [debug] msg: starting_app, app: emqx_management
2024-04-09T18:01:28.546582+00:00 [info] msg: wait_for_cluster_rpc_shard, result: ok
2024-04-09T18:01:28.546686+00:00 [info] msg: wait_for_cluster_rpc_tables, result: ok
2024-04-09T18:01:28.546825+00:00 [notice] msg: new_node_first_catch_up_and_start_commit., node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', tnx_id: 0
2024-04-09T18:01:28.546919+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_app, options: [{type,set},{rlog_shard,emqx_common_shard},{storage,disc_copies},{record_name,emqx_app},{attributes,[name,api_key,api_secret_hash,enable,extra,expired_at,created_at]}]
2024-04-09T18:01:28.549695+00:00 [info] msg: Adding table to a shard, shard: emqx_common_shard, table: emqx_app
2024-04-09T18:01:28.549809+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_app
2024-04-09T18:01:28.549948+00:00 [info] msg: Setting RLOG shard config, shard: emqx_common_shard, tables: [bpapi,emqx_app,emqx_banned,emqx_banned_rules,emqx_trace]
2024-04-09T18:01:28.550071+00:00 [notice] msg: Shard schema change, new_table: emqx_app, shard: emqx_common_shard
2024-04-09T18:01:28.550065+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_app
2024-04-09T18:01:28.550592+00:00 [debug] msg: init_bootstrap_api_keys_from_file, file: <<"/opt/emqx/data/bootstrap_api_key">>
2024-04-09T18:01:28.551478+00:00 [debug] msg: started_apps, apps: [trails,cowboy_swagger,minirest,emqx_management]
2024-04-09T18:01:28.551557+00:00 [debug] msg: starting_app, app: emqx_opentelemetry
2024-04-09T18:01:28.553023+00:00 [debug] msg: started_apps, apps: [opentelemetry_api,grpc,opentelemetry_exporter,opentelemetry,opentelemetry_api_experimental,opentelemetry_experimental,emqx_opentelemetry]
2024-04-09T18:01:28.553096+00:00 [debug] msg: starting_app, app: emqx_redis
2024-04-09T18:01:28.553443+00:00 [debug] msg: started_apps, apps: [eredis,eredis_cluster,emqx_redis]
2024-04-09T18:01:28.553518+00:00 [debug] msg: starting_app, app: emqx_exhook
2024-04-09T18:01:28.554482+00:00 [debug] msg: started_apps, apps: [emqx_exhook]
2024-04-09T18:01:28.554552+00:00 [debug] msg: starting_app, app: emqx_connector
2024-04-09T18:01:28.558367+00:00 [debug] msg: started_apps, apps: [jose,emqx_connector]
2024-04-09T18:01:28.558451+00:00 [debug] msg: starting_app, app: emqx_ldap
2024-04-09T18:01:28.558614+00:00 [debug] msg: started_apps, apps: [eldap,emqx_ldap]
2024-04-09T18:01:28.558685+00:00 [debug] msg: starting_app, app: emqx_bridge
2024-04-09T18:01:28.560288+00:00 [debug] msg: started_apps, apps: [emqx_bridge]
2024-04-09T18:01:28.560371+00:00 [debug] msg: starting_app, app: emqx_rule_engine
2024-04-09T18:01:28.561827+00:00 [debug] msg: started_apps, apps: [rulesql,quickrand,uuid,emqx_rule_engine]
2024-04-09T18:01:28.561928+00:00 [debug] msg: starting_app, app: emqx_mongodb
2024-04-09T18:01:28.562707+00:00 [debug] msg: started_apps, apps: [bson,poolboy,mongodb,emqx_mongodb]
2024-04-09T18:01:28.562811+00:00 [debug] msg: starting_app, app: emqx_dashboard
2024-04-09T18:01:28.563039+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_admin, options: [{type,set},{rlog_shard,emqx_dashboard_shard},{storage,disc_copies},{record_name,emqx_admin},{attributes,[username,pwdhash,description,role,extra]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.565873+00:00 [info] msg: Adding table to a shard, shard: emqx_dashboard_shard, table: emqx_admin
2024-04-09T18:01:28.566120+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_admin
2024-04-09T18:01:28.566247+00:00 [info] msg: Setting RLOG shard config, shard: emqx_dashboard_shard, tables: [emqx_admin]
2024-04-09T18:01:28.566373+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_admin
2024-04-09T18:01:28.566442+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_admin_jwt, options: [{type,set},{rlog_shard,emqx_dashboard_shard},{storage,disc_copies},{record_name,emqx_admin_jwt},{attributes,[token,username,exptime,extra]},{storage_properties,[{ets,[{read_concurrency,true},{write_concurrency,true}]}]}]
2024-04-09T18:01:28.569199+00:00 [info] msg: Adding table to a shard, shard: emqx_dashboard_shard, table: emqx_admin_jwt
2024-04-09T18:01:28.569328+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_admin_jwt
2024-04-09T18:01:28.569430+00:00 [info] msg: Setting RLOG shard config, shard: emqx_dashboard_shard, tables: [emqx_admin,emqx_admin_jwt]
2024-04-09T18:01:28.569566+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_admin_jwt
2024-04-09T18:01:28.569633+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_dashboard_monitor, options: [{type,set},{local_content,true},{storage,disc_copies},{record_name,emqx_monit},{attributes,[time,data]}]
2024-04-09T18:01:28.575080+00:00 [info] msg: Adding table to a shard, shard: undefined, table: emqx_dashboard_monitor
2024-04-09T18:01:28.575237+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_dashboard_monitor
2024-04-09T18:01:28.575367+00:00 [info] msg: Setting RLOG shard config, shard: undefined, tables: [emqx_activated_alarm,emqx_dashboard_monitor,emqx_deactivated_alarm,emqx_delayed]
2024-04-09T18:01:28.575539+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_dashboard_monitor
2024-04-09T18:01:28.575704+00:00 [info] msg: starting_rlog_shard, shard: emqx_dashboard_shard
2024-04-09T18:01:28.576011+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_dashboard_shard
2024-04-09T18:01:28.576423+00:00 [info] msg: loading_desc, file: /opt/emqx/lib/emqx_dashboard-5.0.33/priv/desc.en.hocon
2024-04-09T18:01:28.663358+00:00 [info] msg: loading_desc, file: /opt/emqx/lib/emqx_dashboard-5.0.33/priv/desc.zh.hocon
Listener http:dashboard on :18083 started.
2024-04-09T18:01:28.755161+00:00 [info] msg: started_listener_ok, name: 'http:dashboard', pid: <0.2944.0>, port: 18083
2024-04-09T18:01:28.756440+00:00 [info] msg: dashboard_sso_user_added, username: admin, role: <<"administrator">>
2024-04-09T18:01:28.760697+00:00 [debug] msg: started_apps, apps: [emqx_dashboard]
2024-04-09T18:01:28.760833+00:00 [debug] msg: starting_app, app: emqx_auto_subscribe
2024-04-09T18:01:28.768326+00:00 [debug] msg: started_apps, apps: [emqx_auto_subscribe]
2024-04-09T18:01:28.768677+00:00 [debug] msg: starting_app, app: emqx_auth
2024-04-09T18:01:28.774502+00:00 [info] msg: authn_chains_initialization_done
2024-04-09T18:01:28.779310+00:00 [debug] msg: started_apps, apps: [emqx_auth]
2024-04-09T18:01:28.779421+00:00 [debug] msg: starting_app, app: emqx_auth_postgresql
2024-04-09T18:01:28.779863+00:00 [debug] msg: started_apps, apps: [emqx_auth_postgresql]
2024-04-09T18:01:28.779939+00:00 [debug] msg: starting_app, app: emqx_auth_mongodb
2024-04-09T18:01:28.780235+00:00 [debug] msg: started_apps, apps: [emqx_auth_mongodb]
2024-04-09T18:01:28.780302+00:00 [debug] msg: starting_app, app: emqx_auth_mnesia
2024-04-09T18:01:28.780504+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_acl, options: [{type,ordered_set},{rlog_shard,emqx_acl_sharded},{storage,disc_copies},{attributes,[who,rules]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.785482+00:00 [info] msg: Adding table to a shard, shard: emqx_acl_sharded, table: emqx_acl
2024-04-09T18:01:28.785626+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_acl
2024-04-09T18:01:28.785813+00:00 [info] msg: Setting RLOG shard config, shard: emqx_acl_sharded, tables: [emqx_acl]
2024-04-09T18:01:28.788528+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_acl
2024-04-09T18:01:28.788764+00:00 [info] msg: starting_rlog_shard, shard: emqx_acl_sharded
2024-04-09T18:01:28.789201+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_authn_mnesia, options: [{rlog_shard,emqx_authn_shard},{type,ordered_set},{storage,disc_copies},{record_name,user_info},{attributes,[user_id,password_hash,salt,is_superuser]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.789200+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_acl_sharded
2024-04-09T18:01:28.793880+00:00 [info] msg: Adding table to a shard, shard: emqx_authn_shard, table: emqx_authn_mnesia
2024-04-09T18:01:28.794021+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_authn_mnesia
2024-04-09T18:01:28.794135+00:00 [info] msg: Setting RLOG shard config, shard: emqx_authn_shard, tables: [emqx_authn_mnesia]
2024-04-09T18:01:28.794351+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_authn_mnesia
2024-04-09T18:01:28.794576+00:00 [info] msg: starting_rlog_shard, shard: emqx_authn_shard
2024-04-09T18:01:28.794919+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_authn_shard
2024-04-09T18:01:28.794921+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_authn_scram_mnesia, options: [{rlog_shard,emqx_authn_shard},{type,ordered_set},{storage,disc_copies},{record_name,user_info},{attributes,[user_id,stored_key,server_key,salt,is_superuser]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.797600+00:00 [info] msg: Adding table to a shard, shard: emqx_authn_shard, table: emqx_authn_scram_mnesia
2024-04-09T18:01:28.797744+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_authn_scram_mnesia
2024-04-09T18:01:28.797847+00:00 [info] msg: Setting RLOG shard config, shard: emqx_authn_shard, tables: [emqx_authn_mnesia,emqx_authn_scram_mnesia]
2024-04-09T18:01:28.798258+00:00 [notice] msg: Shard schema change, new_table: emqx_authn_scram_mnesia, shard: emqx_authn_shard
2024-04-09T18:01:28.798247+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_authn_scram_mnesia
2024-04-09T18:01:28.798459+00:00 [debug] msg: started_apps, apps: [emqx_auth_mnesia]
2024-04-09T18:01:28.798522+00:00 [debug] msg: starting_app, app: emqx_prometheus
2024-04-09T18:01:28.805482+00:00 [debug] msg: started_apps, apps: [prometheus,emqx_prometheus]
2024-04-09T18:01:28.805603+00:00 [debug] msg: starting_app, app: emqx_auth_http
2024-04-09T18:01:28.805843+00:00 [debug] msg: started_apps, apps: [emqx_auth_http]
2024-04-09T18:01:28.805903+00:00 [debug] msg: starting_app, app: emqx_gateway
2024-04-09T18:01:28.806730+00:00 [debug] msg: register_gateway_succeed, callback_module: emqx_gateway_coap
2024-04-09T18:01:28.806811+00:00 [debug] msg: register_gateway_succeed, callback_module: emqx_gateway_exproto
2024-04-09T18:01:28.806881+00:00 [debug] msg: register_gateway_succeed, callback_module: emqx_gateway_lwm2m
2024-04-09T18:01:28.806962+00:00 [debug] msg: register_gateway_succeed, callback_module: emqx_gateway_mqttsn
2024-04-09T18:01:28.807015+00:00 [debug] msg: register_gateway_succeed, callback_module: emqx_gateway_stomp
2024-04-09T18:01:28.807461+00:00 [debug] msg: started_apps, apps: [emqx_gateway]
2024-04-09T18:01:28.807549+00:00 [debug] msg: starting_app, app: emqx_gateway_exproto
2024-04-09T18:01:28.807815+00:00 [debug] msg: started_apps, apps: [emqx_gateway_exproto]
2024-04-09T18:01:28.807868+00:00 [debug] msg: starting_app, app: emqx_gateway_coap
2024-04-09T18:01:28.807997+00:00 [debug] msg: started_apps, apps: [emqx_gateway_coap]
2024-04-09T18:01:28.808052+00:00 [debug] msg: starting_app, app: emqx_gateway_lwm2m
2024-04-09T18:01:28.808304+00:00 [debug] msg: started_apps, apps: [emqx_gateway_lwm2m]
2024-04-09T18:01:28.808359+00:00 [debug] msg: starting_app, app: emqx_gateway_stomp
2024-04-09T18:01:28.808685+00:00 [debug] msg: started_apps, apps: [emqx_gateway_stomp]
2024-04-09T18:01:28.808742+00:00 [debug] msg: starting_app, app: emqx_auth_redis
2024-04-09T18:01:28.809080+00:00 [debug] msg: started_apps, apps: [emqx_auth_redis]
2024-04-09T18:01:28.809128+00:00 [debug] msg: starting_app, app: emqx_auth_jwt
2024-04-09T18:01:28.809434+00:00 [debug] msg: started_apps, apps: [emqx_auth_jwt]
2024-04-09T18:01:28.809478+00:00 [debug] msg: starting_app, app: emqx_auth_ldap
2024-04-09T18:01:28.809732+00:00 [debug] msg: started_apps, apps: [emqx_auth_ldap]
2024-04-09T18:01:28.809780+00:00 [debug] msg: starting_app, app: emqx_mysql
2024-04-09T18:01:28.810139+00:00 [debug] msg: started_apps, apps: [mysql,emqx_mysql]
2024-04-09T18:01:28.810245+00:00 [debug] msg: starting_app, app: emqx_auth_mysql
2024-04-09T18:01:28.810503+00:00 [debug] msg: started_apps, apps: [emqx_auth_mysql]
2024-04-09T18:01:28.810549+00:00 [debug] msg: starting_app, app: emqx_gateway_mqttsn
2024-04-09T18:01:28.810765+00:00 [debug] msg: started_apps, apps: [emqx_gateway_mqttsn]
2024-04-09T18:01:28.810806+00:00 [debug] msg: starting_app, app: emqx_telemetry
2024-04-09T18:01:28.811429+00:00 [debug] msg: started_apps, apps: [emqx_telemetry]
2024-04-09T18:01:28.811484+00:00 [debug] msg: starting_app, app: quicer
2024-04-09T18:01:28.811288+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_telemetry, options: [{type,set},{storage,disc_copies},{rlog_shard,emqx_telemetry_shard},{record_name,telemetry},{attributes,[id,uuid]}]
2024-04-09T18:01:28.811946+00:00 [debug] msg: started_apps, apps: [quicer]
2024-04-09T18:01:28.811998+00:00 [debug] msg: starting_app, app: emqx_psk
2024-04-09T18:01:28.812148+00:00 [debug] msg: mria_mnesia_create_table, name: emqx_psk, options: [{rlog_shard,emqx_psk_shard},{type,ordered_set},{storage,disc_copies},{record_name,psk_entry},{attributes,[psk_id,shared_secret,extra]},{storage_properties,[{ets,[{read_concurrency,true}]}]}]
2024-04-09T18:01:28.814019+00:00 [info] msg: Adding table to a shard, shard: emqx_telemetry_shard, table: emqx_telemetry
2024-04-09T18:01:28.814160+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_telemetry
2024-04-09T18:01:28.816795+00:00 [info] msg: Adding table to a shard, shard: emqx_psk_shard, table: emqx_psk
2024-04-09T18:01:28.816943+00:00 [debug] msg: ensure_local_table, '$span': start, table: emqx_psk
2024-04-09T18:01:28.817057+00:00 [info] msg: Setting RLOG shard config, shard: emqx_telemetry_shard, tables: [emqx_telemetry]
2024-04-09T18:01:28.818396+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_telemetry
2024-04-09T18:01:28.818506+00:00 [info] msg: Setting RLOG shard config, shard: emqx_psk_shard, tables: [emqx_psk]
2024-04-09T18:01:28.818599+00:00 [info] msg: starting_rlog_shard, shard: emqx_telemetry_shard
2024-04-09T18:01:28.820340+00:00 [debug] msg: ensure_local_table, '$span': {complete,true}, table: emqx_psk
2024-04-09T18:01:28.820415+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_telemetry_shard
2024-04-09T18:01:28.820463+00:00 [info] msg: starting_rlog_shard, shard: emqx_psk_shard
2024-04-09T18:01:28.820719+00:00 [info] msg: Shard fully up, node: 'emqx@emqx5-core-6796d44f-1.emqx5-headless.database.svc.cluster.local', shard: emqx_psk_shard
2024-04-09T18:01:28.821847+00:00 [info] msg: emqx_psk_disabled
EMQX 5.6.0 is running now!
2024-04-09T18:01:28.821940+00:00 [debug] msg: started_apps, apps: [emqx_psk]
2024-04-09T18:01:29.652448+00:00 [info] msg: regenerate_minirest_dispatch, elapsed: 897, i18n_lang: en, listeners: ['http:dashboard']
2024-04-09T18:01:30.557021+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:01:30.557092+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:01:37.339655+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:01:37.339852+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:01:39.242310+00:00 [debug] msg: telemetry_data_reported
2024-04-09T18:01:43.052094+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:01:43.052336+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:01:50.996718+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:01:50.996940+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:01:58.934370+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:01:58.934576+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:05.357208+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:05.357431+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:12.802955+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:12.803233+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:18.432421+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:18.432504+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:24.488638+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:24.488888+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:32.164340+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:32.164568+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:39.037639+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:39.037942+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:44.526227+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:44.526498+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:50.574662+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:50.574910+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive
2024-04-09T18:02:56.669933+00:00 [debug] Ekka(AutoCluster): join result: ignore
2024-04-09T18:02:56.670146+00:00 [info] Ekka(AutoCluster): all discovered nodes are alive

Expected behavior

For the cluster to be in a Running state

Anything else we need to know?:

If I set the core replicas to 1 everything is happy

Environment details::

Kubernetes version: Talos Linux v1.29.3
Cloud-provider/provisioner: N/A
emqx-operator version: 2.2.19
Install method: helm

onedr0p commented 7 months ago

It's also worth noting I only see these svc, nothing related to the core pods.

❯ k get svc -n database
NAME                                               TYPE           CLUSTER-IP      EXTERNAL-IP      PORT(S)             AGE
emqx-operator-controller-manager-metrics-service   ClusterIP      10.43.9.231     <none>           8080/TCP            134m
emqx-operator-webhook-service                      ClusterIP      10.43.45.107    <none>           443/TCP             134m

onedr0p commented 7 months ago

kubectl -n database get emqx emqx5 -o json

{
    "apiVersion": "apps.emqx.io/v2beta1",
    "kind": "EMQX",
    "metadata": {
        "annotations": {
            "apps.emqx.io/last-emqx-configuration": "log.console.level = debug\n",
            "kubectl.kubernetes.io/last-applied-configuration": "{\"apiVersion\":\"apps.emqx.io/v2beta1\",\"kind\":\"EMQX\",\"metadata\":{\"annotations\":{},\"name\":\"emqx5\",\"namespace\":\"database\"},\"spec\":{\"config\":{\"data\":\"log.console.level = debug\\n\"},\"coreTemplate\":{\"spec\":{\"replicas\":3}},\"image\":\"public.ecr.aws/emqx/emqx:5.6.0\"}}\n"
        },
        "creationTimestamp": "2024-04-09T18:11:01Z",
        "generation": 2,
        "name": "emqx5",
        "namespace": "database",
        "resourceVersion": "52022722",
        "uid": "62eec439-9224-4eeb-a106-434ec94b16b2"
    },
    "spec": {
        "clusterDomain": "cluster.local",
        "config": {
            "data": "log.console.level = debug\n",
            "mode": "Merge"
        },
        "coreTemplate": {
            "metadata": {},
            "spec": {
                "containerSecurityContext": {
                    "runAsGroup": 1000,
                    "runAsNonRoot": true,
                    "runAsUser": 1000
                },
                "livenessProbe": {
                    "failureThreshold": 3,
                    "httpGet": {
                        "path": "/status",
                        "port": "dashboard"
                    },
                    "initialDelaySeconds": 60,
                    "periodSeconds": 30
                },
                "podSecurityContext": {
                    "fsGroup": 1000,
                    "fsGroupChangePolicy": "Always",
                    "runAsGroup": 1000,
                    "runAsUser": 1000,
                    "supplementalGroups": [
                        1000
                    ]
                },
                "readinessProbe": {
                    "failureThreshold": 12,
                    "httpGet": {
                        "path": "/status",
                        "port": "dashboard"
                    },
                    "initialDelaySeconds": 10,
                    "periodSeconds": 5
                },
                "replicas": 3,
                "resources": {},
                "volumeClaimTemplates": {
                    "resources": {}
                }
            }
        },
        "image": "public.ecr.aws/emqx/emqx:5.6.0",
        "revisionHistoryLimit": 3,
        "updateStrategy": {
            "evacuationStrategy": {
                "connEvictRate": 1000,
                "sessEvictRate": 1000,
                "waitTakeover": 10
            },
            "initialDelaySeconds": 10,
            "type": "Recreate"
        }
    },
    "status": {
        "conditions": [
            {
                "lastTransitionTime": "2024-04-09T18:11:02Z",
                "message": "Create new statefulSet",
                "reason": "CreateNewStatefulSet",
                "status": "True",
                "type": "CoreNodesProgressing"
            }
        ],
        "coreNodes": [
            {
                "controllerUID": "c3c2f68a-8a30-4542-97d7-f356024863fe",
                "edition": "Opensource",
                "node": "emqx@emqx5-core-6796d44f-0.emqx5-headless.database.svc.cluster.local",
                "node_status": "running",
                "otp_release": "25.3.2-2/13.2.2",
                "podUID": "5376b8c9-fc07-48c3-b9a7-2591f6342027",
                "role": "core",
                "uptime": 45232,
                "version": "5.6.0"
            }
        ],
        "coreNodesStatus": {
            "currentReplicas": 1,
            "currentRevision": "6796d44f",
            "readyReplicas": 1,
            "replicas": 3,
            "updateReplicas": 1,
            "updateRevision": "6796d44f"
        },
        "replicantNodesStatus": {}
    }
}

Rory-Z commented 7 months ago

Hi @onedr0p check EMQX pod log, I found this: 2024-04-09T18:01:50.996718+00:00 [debug] Ekka(AutoCluster): join result: ignore, it is means the EMQX application can not found any nodes by DNS server. Could you please check the DNS server, you can create a ubuntu pod in the EMQX pod namespace and running this command nslookup -type=srv emqx5-headless.database.svc.cluster.local

ahinko commented 7 months ago

Hi @Rory-Z, I'm able to replicate this issue as well by using the same configuration and versions as @onedr0p.

I'm guessing that the dns issue is caused by the missing services. Just as @onedr0p mentioned above the only svc I see is for the operator and the webhook. No emqx5-headless svc.

ahinko commented 7 months ago

Follow up: I reverted back to version 2.2.14 of the operator and with that version the exact same cluster configuration works. The services are created and the cluster quickly becomes Ready.

After that I can bump the version back to 2.2.19 and since the services has already been created by the old version of the operator everything still works. But deploying a new cluster fails again.

Rory-Z commented 7 months ago

@ahinko @onedr0p Thanks for feedback, I got the same result, I think this is a bug from 2.2.19, let me fix it.

Rory-Z commented 7 months ago

@onedr0p @ahinko EMQX operator 2.2.20 has been released, please try it.

ahinko commented 7 months ago

@Rory-Z 2.2.20 fixes the issue. Thank you for the quick fix.

onedr0p commented 7 months ago

Thanks for the update @Rory-Z looks to be working here!

emqx / emqx-operator