Closed perrynzhou closed 3 years ago
Final graph:
+------------------------------------------------------------------------------+
1: volume rep-vol-client-0
2: type protocol/client
3: option opversion 70200
4: option clnt-lk-version 1
5: option volfile-checksum 0
6: option volfile-key rep-vol
7: option client-version 7.9
8: option process-name fuse
9: option process-uuid CTX_ID:7cb66914-80ea-4621-837b-9effb6d3c539-GRAPH_ID:0-PID:277349-HOST:dgdpl1926-PC_NAME:rep-vol-client-0-RECON_NO:-0
10: option fops-version 1298437
11: option ping-timeout 42
12: option remote-host 172.16.84.37
13: option remote-subvolume /rep-vol/brick
14: option transport-type socket
15: option transport.address-family inet
16: option transport.socket.ssl-enabled off
17: option transport.tcp-user-timeout 0
18: option transport.socket.keepalive-time 20
19: option transport.socket.keepalive-interval 2
20: option transport.socket.keepalive-count 9
21: option strict-locks off
22: option send-gids true
23: end-volume
24:
25: volume rep-vol-client-1
26: type protocol/client
27: option ping-timeout 42
28: option remote-host 172.16.84.41
29: option remote-subvolume /rep-vol/brick
30: option transport-type socket
31: option transport.address-family inet
32: option transport.socket.ssl-enabled off
33: option transport.tcp-user-timeout 0
34: option transport.socket.keepalive-time 20
35: option transport.socket.keepalive-interval 2
36: option transport.socket.keepalive-count 9
37: option strict-locks off
38: option send-gids true
39: end-volume
40:
41: volume rep-vol-client-2
42: type protocol/client
43: option ping-timeout 42
44: option remote-host 172.16.84.42
45: option remote-subvolume /rep-vol/brick
46: option transport-type socket
47: option transport.address-family inet
48: option transport.socket.ssl-enabled off
49: option transport.tcp-user-timeout 0
50: option transport.socket.keepalive-time 20
51: option transport.socket.keepalive-interval 2
52: option transport.socket.keepalive-count 9
53: option strict-locks off
54: option send-gids true
55: end-volume
56:
57: volume rep-vol-replicate-0
58: type cluster/replicate
59: option afr-pending-xattr rep-vol-client-0,rep-vol-client-1,rep-vol-client-2
60: option background-self-heal-count 128
61: option data-self-heal-window-size 1024
62: option shd-max-threads 64
63: option use-compound-fops off
64: subvolumes rep-vol-client-0 rep-vol-client-1 rep-vol-client-2
65: end-volume
66:
67: volume rep-vol-dht
68: type cluster/distribute
69: option lock-migration off
70: option force-migration off
71: subvolumes rep-vol-replicate-0
72: end-volume
73:
74: volume rep-vol-utime
75: type features/utime
76: option noatime on
77: subvolumes rep-vol-dht
78: end-volume
79:
80: volume rep-vol-write-behind
81: type performance/write-behind
82: subvolumes rep-vol-utime
83: end-volume
84:
85: volume rep-vol-open-behind
86: type performance/open-behind
87: subvolumes rep-vol-write-behind
88: end-volume
89:
90: volume rep-vol-quick-read
91: type performance/quick-read
92: subvolumes rep-vol-open-behind
93: end-volume
94:
95: volume rep-vol-md-cache
96: type performance/md-cache
97: subvolumes rep-vol-quick-read
98: end-volume
99:
100: volume rep-vol-io-threads
101: type performance/io-threads
102: option enable-least-priority yes
103: subvolumes rep-vol-md-cache
104: end-volume
105:
106: volume rep-vol
107: type debug/io-stats
108: option log-level INFO
109: option threads 16
110: option latency-measurement off
111: option count-fop-hits off
112: option global-threading off
113: subvolumes rep-vol-io-threads
114: end-volume
115:
116: volume meta-autoload
117: type meta
118: subvolumes rep-vol
119: end-volume
120:
+------------------------------------------------------------------------------+
[2021-01-27 12:33:28.870056] I [fuse-bridge.c:5266:fuse_init] 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24 kernel 7.23
[2021-01-27 12:33:28.870106] I [fuse-bridge.c:5894:fuse_graph_sync] 0-fuse: switched to graph 0
[2021-01-27 12:33:28.871811] W [MSGID: 108027] [afr-common.c:2256:afr_attempt_readsubvol_set] 0-rep-vol-replicate-0: no read subvols for /
[2021-01-27 12:33:28.871972] E [fuse-bridge.c:5335:fuse_first_lookup] 0-fuse: first lookup on root failed (Transport endpoint is not connected)
[2021-01-27 12:33:28.873075] W [fuse-bridge.c:1374:fuse_attr_cbk] 0-glusterfs-fuse: 10: LOOKUP() / => -1 (Transport endpoint is not connected)
[2021-01-27 12:33:28.878640] W [fuse-bridge.c:1374:fuse_attr_cbk] 0-glusterfs-fuse: 11: LOOKUP() / => -1 (Transport endpoint is not connected)
[2021-01-27 12:33:28.894137] I [fuse-bridge.c:6200:fuse_thread_proc] 0-fuse: initiating unmount of /mnt/fuse/rep
The message "W [MSGID: 108027] [afr-common.c:2256:afr_attempt_readsubvol_set] 0-rep-vol-replicate-0: no read subvols for /" repeated 2 times between [2021-01-27 12:33:28.871811] and [2021-01-27 12:33:28.878612]
[2021-01-27 12:33:28.894639] W [glusterfsd.c:1596:cleanup_and_exit] (-->/lib64/libpthread.so.0(+0x7ea5) [0x2ab193331ea5] -->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xe5) [0x562d2f520625] -->/usr/sbin/glusterfs(cleanup_and_exit+0x6b) [0x562d2f52048b] ) 0-: received signum (15), shutting down
[2021-01-27 12:33:28.894677] I [fuse-bridge.c:7017:fini] 0-fuse: Unmounting '/mnt/fuse/rep'.
[2021-01-27 12:33:28.894698] I [fuse-bridge.c:7022:fini] 0-fuse: Closing fuse connection to '/mnt/fuse/rep'.
root@172.16.84.54 /var
Mount succeeds only when lookup succeeds. Lookup fails on plain replica if quorum is not met. So it is working as expected. This behavior is decided because the only replica that is available could have stale/bad data, there is no way to tell.
@mohit84 @xhernandez @amarts @BarakSason @mohit84 @tshacked @csabahenk @pranithk after two of three node crash,mount failed on replica 3 volume .but i think if one of three gluster node is active,that should let client mount success, is right? i want to mount success ,even through two of three nodes crash,just one node is alive because we build replica 3 volume.