ideawu / ssdb

SSDB - A fast NoSQL database, an alternative to Redis
http://ssdb.io/
BSD 3-Clause "New" or "Revised" License
8.19k stars 1.41k forks source link

ssdb写一段时间后,自己退出,需要手动restart才能启动 #1302

Open AmFanyuxi opened 5 years ago

AmFanyuxi commented 5 years ago

好几次发现下班回家-d后台运行的ssdb,第二天发现测试环境大量connect refuse。

日志看不到报错信息,如图: 2019-03-18 18:57:55.706 [DEBUG] server.cpp(373): w:0.011,p:0.022, req: get [82], resp: ok [2485] 2019-03-18 18:57:55.709 [DEBUG] server.cpp(373): w:0.013,p:0.026, req: get [82], resp: ok [2132] 2019-03-18 18:57:55.710 [DEBUG] server.cpp(373): w:0.012,p:0.080, req: multi_zset [69] 10001110000000000000000000320076 3250, resp: ok 1 2019-03-18 18:57:55.710 [DEBUG] server.cpp(373): w:0.008,p:0.080, req: multi_zset [69] 10001110000000000000000000573557 414, resp: ok 1 2019-03-18 18:57:55.713 [DEBUG] server.cpp(373): w:0.012,p:0.024, req: get [82], resp: ok [2971] 2019-03-18 18:57:55.718 [DEBUG] server.cpp(373): w:0.013,p:0.027, req: get [51], resp: ok [2709] 2019-03-18 18:57:55.719 [DEBUG] server.cpp(373): w:0.011,p:0.017, req: get [51], resp: ok [2122] 2019-03-18 18:57:55.726 [DEBUG] server.cpp(373): w:0.016,p:0.028, req: get [82], resp: ok [2928] 2019-03-18 18:57:55.726 [DEBUG] server.cpp(373): w:0.010,p:0.021, req: get [82], resp: ok [2167] 2019-03-18 18:57:55.728 [DEBUG] server.cpp(373): w:0.011,p:0.025, req: get [51], resp: ok [3177] 2019-03-18 18:57:55.729 [DEBUG] server.cpp(373): w:0.011,p:0.090, req: multi_zset [69] 10001110000000000000000000430400 5413, resp: ok 1 2019-03-18 18:57:55.731 [DEBUG] server.cpp(373): w:0.009,p:0.067, req: multi_zset [69] 10001110000000000000000000792593 13, resp: ok 1 2019-03-18 18:57:55.740 [DEBUG] server.cpp(373): w:0.012,p:0.028, req: get [82], resp: ok [2122] 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(46): ssdb-server 1.9.6 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(47): conf_file : /opt/ssdb/ssdb-master/ssdb.conf 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(48): log_level : debug 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(49): log_output : /opt/ssdb/ssdb-master/log/log.txt 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(50): log_rotate_size : 1000000000 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(52): main_db : ./var/data 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(53): meta_db : ./var/meta 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(54): cache_size : 500 MB 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(55): block_size : 32 KB 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(56): write_buffer : 64 MB 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(57): max_open_files : 500 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(58): compaction_speed : 500 MB/s 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(59): compression : yes 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(60): binlog : yes 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(61): binlog_capacity : 20000000 2019-03-19 12:00:00.017 [INFO ] ssdb-server.cpp(62): sync_speed : -1 MB/s 2019-03-19 12:00:01.180 [INFO ] binlog.cpp(185): binlogs capacity: 20000000, min: 18676174, max: 38676174, 2019-03-19 12:00:01.190 [INFO ] server.cpp(159): server listen on 0.0.0.0:8888 2019-03-19 12:00:01.190 [INFO ] server.cpp(171): auth : on 2019-03-19 12:00:01.190 [INFO ] server.cpp(209): readonly: no 2019-03-19 12:00:01.190 [INFO ] server.cpp(218): slowlog_timeout: 20.000 ms 2019-03-19 12:00:01.191 [DEBUG] cluster.cpp(13): Cluster init 2019-03-19 12:00:01.279 [DEBUG] ttl.cpp(108): load 0 keys into fast_keys 2019-03-19 12:00:01.281 [INFO ] serv.cpp(339): key_range.kv: "", "" 2019-03-19 12:00:01.281 [INFO ] ssdb-server.cpp(85): pidfile: ./var/ssdb.pid, pid: 20650 2019-03-19 12:00:01.281 [INFO ] ssdb-server.cpp(86): ssdb server started.

从日志看,下午6点以后没有日志记录,直到今天12点重启,还请帮忙看下问题。怎么解决? 是我的配置有问题么?

AmFanyuxi commented 5 years ago

补充又挂了,后面就没有了日志信息。

2019-03-19 13:25:42.991 [DEBUG] server.cpp(373): w:0.001,p:0.001, req: auth TJiptvadmin1234-fanyuxi-test-ssdb-pwd, resp: ok 1 2019-03-19 13:25:42.991 [WARN ] server.cpp(368): slowlog w:2.032,p:200.375, req: get [51], resp: ok [1819] 2019-03-19 13:25:42.991 [WARN ] server.cpp(368): slowlog w:297.622,p:0.398, req: multi_zset [69] 10001110000000000000000000833063 459, resp: ok 1 2019-03-19 13:25:42.991 [WARN ] server.cpp(368): slowlog w:0.768,p:293.629, req: get [82], resp: ok [2757] 2019-03-19 13:25:42.991 [WARN ] server.cpp(368): slowlog w:2.004,p:295.643, req: get [82], resp: ok [2641] 2019-03-19 13:25:42.991 [WARN ] server.cpp(368): slowlog w:1.561,p:296.185, req: get [82], resp: ok [3110] 2019-03-19 13:25:43.004 [DEBUG] server.cpp(373): w:0.001,p:0.003, req: ping, resp: ok 2019-03-19 13:25:43.096 [DEBUG] server.cpp(275): new link from 111.30.65.66:8971, fd: 21, links: 36 2019-03-19 13:25:43.096 [DEBUG] server.cpp(373): w:0.001,p:0.002, req: auth TJiptvadmin1234-fanyuxi-test-ssdb-pwd, resp: ok 1 2019-03-19 13:25:43.109 [DEBUG] server.cpp(373): w:0.002,p:0.004, req: ping, resp: ok 2019-03-19 13:25:43.139 [DEBUG] server.cpp(275): new link from 111.30.65.66:8972, fd: 22, links: 37 2019-03-19 13:25:43.139 [DEBUG] server.cpp(373): w:0.002,p:0.003, req: auth TJiptvadmin1234-fanyuxi-test-ssdb-pwd, resp: ok 1 2019-03-19 13:25:43.154 [DEBUG] server.cpp(373): w:0.001,p:0.005, req: ping, resp: ok 2019-03-19 13:25:43.190 [WARN ] server.cpp(368): slowlog w:0.013,p:184.783, req: get [51], resp: ok [3238] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.013,p:185.485, req: get [82], resp: ok [1188] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.018,p:188.846, req: get [51], resp: ok [2731] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.013,p:187.164, req: get [82], resp: ok [1162] 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:297.965,p:200.231, req: multi_zset [69] 10001110000000000000000000846394 5399, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:498.206,p:0.107, req: multi_zset [69] 10001110000000000000000000851152 1406, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:200.081,p:0.054, req: multi_zset [69] 10001110000000000000000000655671 6000, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:200.084,p:0.065, req: multi_zset [69] 10001110000000000000000000686311 418, resp: ok 1 2019-03-19 13:25:43.202 [DEBUG] server.cpp(373): w:0.018,p:0.059, req: get [82], resp: ok [1184] 2019-03-19 13:25:43.206 [DEBUG] server.cpp(373): w:0.011,p:0.025, req: get [82], resp: ok [1176]

^C [root@ledisdb log]# tail -f log.txt 2019-03-19 13:25:43.190 [WARN ] server.cpp(368): slowlog w:0.013,p:184.783, req: get [51], resp: ok [3238] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.013,p:185.485, req: get [82], resp: ok [1188] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.018,p:188.846, req: get [51], resp: ok [2731] 2019-03-19 13:25:43.191 [WARN ] server.cpp(368): slowlog w:0.013,p:187.164, req: get [82], resp: ok [1162] 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:297.965,p:200.231, req: multi_zset [69] 10001110000000000000000000846394 5399, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:498.206,p:0.107, req: multi_zset [69] 10001110000000000000000000851152 1406, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:200.081,p:0.054, req: multi_zset [69] 10001110000000000000000000655671 6000, resp: ok 1 2019-03-19 13:25:43.192 [WARN ] server.cpp(368): slowlog w:200.084,p:0.065, req: multi_zset [69] 10001110000000000000000000686311 418, resp: ok 1 2019-03-19 13:25:43.202 [DEBUG] server.cpp(373): w:0.018,p:0.059, req: get [82], resp: ok [1184] 2019-03-19 13:25:43.206 [DEBUG] server.cpp(373): w:0.011,p:0.025, req: get [82], resp: ok [1176]

AmFanyuxi commented 5 years ago

然后需要重启:

[root@ bin]# ./ssdb-server /opt/ssdb/ssdb-master/ssdb.conf -s stop ssdb-server 1.9.6 Copyright (c) 2012-2015 ssdb.io

process: 20650 not running [root@bin]# ./ssdb-server /opt/ssdb/ssdb-master/ssdb.conf -d -s restart ssdb-server 1.9.6 Copyright (c) 2012-2015 ssdb.io

process: 20650 not running

rmdwirizki commented 3 years ago

Recently I've faced the the same issue. Suddenly ssdb won't let me connect and just throwing "connection refused". In my case, it happen when a large traffic suddenly come in. And there is not much you can see in the logs. I believe it has something to do with max_open_files in my linux server. Do you find a solution for this problem? @AmFanyuxi

ideawu commented 3 years ago

进程莫名退出, 一般是如下原因: