Open jianwang-ntu opened 2 days ago
When we test performance on the network disk, the IO drop significantly .
(base) ➜ ~ df -h /home/jovyan
Filesystem Size Used Avail Use% Mounted on
10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800G 745G 56G 94% /home/jovyan
(base) ➜ ~ cd /home/jovyan
(base) ➜ ~ bash testio.sh
Testing: Random read/write performance
randreadwrite: (g=0): rw=randrw, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=libaio, iodepth=1
...
fio-3.28
Starting 8 processes
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
randreadwrite: Laying out IO file (1 file / 512MiB)
Jobs: 8 (f=8): [m(8)][100.0%][r=5373KiB/s,w=1773KiB/s][r=1343,w=443 IOPS][eta 00m:00s]
randreadwrite: (groupid=0, jobs=8): err= 0: pid=140036: Sat Sep 28 09:12:14 2024
read: IOPS=1429, BW=5717KiB/s (5854kB/s)(1340MiB/240009msec)
slat (usec): min=8, max=2754, avg=43.54, stdev=24.12
clat (usec): min=514, max=157748, avg=4146.01, stdev=4565.81
lat (usec): min=546, max=157807, avg=4189.90, stdev=4566.06
clat percentiles (usec):
| 1.00th=[ 799], 5.00th=[ 955], 10.00th=[ 1057], 20.00th=[ 1188],
| 30.00th=[ 1270], 40.00th=[ 1369], 50.00th=[ 1516], 60.00th=[ 3097],
| 70.00th=[ 5407], 80.00th=[ 7701], 90.00th=[ 9896], 95.00th=[11863],
| 99.00th=[18744], 99.50th=[23200], 99.90th=[37487], 99.95th=[47449],
| 99.99th=[78119]
bw ( KiB/s): min= 2216, max= 8224, per=100.00%, avg=5719.83, stdev=111.66, samples=3832
iops : min= 554, max= 2056, avg=1429.95, stdev=27.91, samples=3832
write: IOPS=477, BW=1911KiB/s (1957kB/s)(448MiB/240009msec); 0 zone resets
slat (usec): min=9, max=435, avg=49.95, stdev=27.30
clat (usec): min=1255, max=237993, avg=4149.47, stdev=8256.68
lat (usec): min=1325, max=238155, avg=4199.78, stdev=8257.19
clat percentiles (usec):
| 1.00th=[ 1827], 5.00th=[ 2040], 10.00th=[ 2147], 20.00th=[ 2278],
| 30.00th=[ 2376], 40.00th=[ 2442], 50.00th=[ 2540], 60.00th=[ 2638],
| 70.00th=[ 2737], 80.00th=[ 3064], 90.00th=[ 6521], 95.00th=[ 10683],
| 99.00th=[ 30278], 99.50th=[ 67634], 99.90th=[119014], 99.95th=[128451],
| 99.99th=[160433]
bw ( KiB/s): min= 704, max= 3280, per=100.00%, avg=1911.87, stdev=52.10, samples=3832
iops : min= 176, max= 820, avg=477.95, stdev=13.03, samples=3832
lat (usec) : 750=0.44%, 1000=4.99%
lat (msec) : 2=37.79%, 4=25.88%, 10=22.32%, 20=7.57%, 50=0.81%
lat (msec) : 100=0.14%, 250=0.06%
cpu : usr=0.22%, sys=1.07%, ctx=666983, majf=0, minf=127
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=343008,114663,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=1
Run status group 0 (all jobs):
READ: bw=5717KiB/s (5854kB/s), 5717KiB/s-5717KiB/s (5854kB/s-5854kB/s), io=1340MiB (1405MB), run=240009-240009msec
WRITE: bw=1911KiB/s (1957kB/s), 1911KiB/s-1911KiB/s (1957kB/s-1957kB/s), io=448MiB (470MB), run=240009-240009msec
Testing: Random read performance
randread: (g=0): rw=randread, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=libaio, iodepth=1
...
fio-3.28
Starting 8 processes
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
randread: Laying out IO file (1 file / 512MiB)
Jobs: 8 (f=8): [r(8)][100.0%][r=10.3MiB/s][r=2645 IOPS][eta 00m:00s]
randread: (groupid=0, jobs=8): err= 0: pid=140181: Sat Sep 28 09:16:48 2024
read: IOPS=2628, BW=10.3MiB/s (10.8MB/s)(2464MiB/240005msec)
slat (usec): min=7, max=4125, avg=42.86, stdev=22.26
clat (usec): min=432, max=143128, avg=2998.37, stdev=3461.03
lat (usec): min=488, max=143158, avg=3041.56, stdev=3461.13
clat percentiles (usec):
| 1.00th=[ 717], 5.00th=[ 865], 10.00th=[ 947], 20.00th=[ 1074],
| 30.00th=[ 1172], 40.00th=[ 1237], 50.00th=[ 1303], 60.00th=[ 1401],
| 70.00th=[ 2343], 80.00th=[ 5276], 90.00th=[ 8291], 95.00th=[ 9896],
| 99.00th=[14746], 99.50th=[17695], 99.90th=[27395], 99.95th=[32375],
| 99.99th=[43254]
bw ( KiB/s): min= 5360, max=14104, per=100.00%, avg=10519.42, stdev=145.11, samples=3832
iops : min= 1340, max= 3526, avg=2629.85, stdev=36.28, samples=3832
lat (usec) : 500=0.01%, 750=1.42%, 1000=12.40%
lat (msec) : 2=55.47%, 4=6.27%, 10=19.89%, 20=4.23%, 50=0.31%
lat (msec) : 100=0.01%, 250=0.01%
cpu : usr=0.25%, sys=1.35%, ctx=978340, majf=0, minf=94
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=630793,0,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=1
Run status group 0 (all jobs):
READ: bw=10.3MiB/s (10.8MB/s), 10.3MiB/s-10.3MiB/s (10.8MB/s-10.8MB/s), io=2464MiB (2584MB), run=240005-240005msec
Testing: Random write performance
randwrite: (g=0): rw=randwrite, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=libaio, iodepth=1
...
fio-3.28
Starting 8 processes
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
randwrite: Laying out IO file (1 file / 512MiB)
Jobs: 8 (f=8): [w(8)][100.0%][w=4256KiB/s][w=1064 IOPS][eta 00m:00s]
randwrite: (groupid=0, jobs=8): err= 0: pid=140475: Sat Sep 28 09:20:48 2024
write: IOPS=1329, BW=5320KiB/s (5447kB/s)(1247MiB/240076msec); 0 zone resets
slat (usec): min=9, max=23259, avg=49.30, stdev=113.73
clat (usec): min=987, max=270938, avg=5961.55, stdev=15564.56
lat (usec): min=1003, max=270969, avg=6011.19, stdev=15564.97
clat percentiles (usec):
| 1.00th=[ 1565], 5.00th=[ 1778], 10.00th=[ 1893], 20.00th=[ 2057],
| 30.00th=[ 2180], 40.00th=[ 2278], 50.00th=[ 2376], 60.00th=[ 2507],
| 70.00th=[ 2638], 80.00th=[ 2933], 90.00th=[ 6652], 95.00th=[ 19268],
| 99.00th=[ 98042], 99.50th=[116917], 99.90th=[137364], 99.95th=[145753],
| 99.99th=[214959]
bw ( KiB/s): min= 1392, max= 9754, per=100.00%, avg=5325.46, stdev=176.57, samples=3832
iops : min= 348, max= 2438, avg=1331.23, stdev=44.13, samples=3832
lat (usec) : 1000=0.01%
lat (msec) : 2=16.19%, 4=70.71%, 10=4.70%, 20=3.53%, 50=2.07%
lat (msec) : 100=1.85%, 250=0.94%, 500=0.01%
cpu : usr=0.15%, sys=0.78%, ctx=465152, majf=0, minf=79
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=0,319282,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=1
Run status group 0 (all jobs):
WRITE: bw=5320KiB/s (5447kB/s), 5320KiB/s-5320KiB/s (5447kB/s-5447kB/s), io=1247MiB (1308MB), run=240076-240076msec
Testing: IO Latency on individual request
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=1 time=1.54 ms (warmup)
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=2 time=1.98 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=3 time=1.67 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=4 time=1.71 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=5 time=1.64 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=6 time=1.63 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=7 time=1.63 ms (fast)
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=8 time=1.66 ms
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=9 time=1.54 ms (fast)
4 KiB <<< . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB): request=10 time=1.47 ms (fast)
--- . (ceph 10.2.214.1:6789,10.2.214.3:6789,10.2.214.4:6789:/volumes/csi/csi-vol-02fc1163-a67d-4cd3-8483-7b4545dacc19/edeaae84-b42c-4090-a814-61e2e0d9c10c 800 GiB) ioping statistics ---
9 requests completed in 14.9 ms, 36 KiB read, 602 iops, 2.36 MiB/s
generated 10 requests in 9.00 s, 40 KiB, 1 iops, 4.44 KiB/s
min/avg/max/mdev = 1.47 ms / 1.66 ms / 1.98 ms / 132.2 us
The tool can be found from https://gist.github.com/jsandas/a3705a619214470b2d92b1fb2b559ad9