lealone / Lealone

比 MySQL 和 MongoDB 快10倍的 OLTP 关系数据库和文档数据库
Other
2.45k stars 516 forks source link

还是不能select #45

Closed cxwxz closed 11 years ago

cxwxz commented 11 years ago

013-08-26 20:59:17,630 INFO org.apache.zookeeper.server.persistence.FileTxnSnapLog: Snapshotting: 0x0 to /home/hadoop/zookeeper/version-2/snapshot.0 2013-08-26 20:59:17,661 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Follower sid: 0 : info : org.apache.zookeeper.server.quorum.QuorumPeer$QuorumServer@42197bb7 2013-08-26 20:59:17,661 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Follower sid: 1 : info : org.apache.zookeeper.server.quorum.QuorumPeer$QuorumServer@17094d48 2013-08-26 20:59:17,727 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Synchronizing with Follower sid: 0 maxCommittedLog=0x0 minCommittedLog=0x0 peerLastZxid=0x0 2013-08-26 20:59:17,727 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Sending DIFF 2013-08-26 20:59:17,739 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Synchronizing with Follower sid: 1 maxCommittedLog=0x0 minCommittedLog=0x0 peerLastZxid=0x0 2013-08-26 20:59:17,749 INFO org.apache.zookeeper.server.quorum.LearnerHandler: Sending DIFF 2013-08-26 20:59:17,771 WARN org.apache.zookeeper.server.quorum.Leader: Commiting zxid 0x100000000 from /192.168.2.28:2888 not first! 2013-08-26 20:59:17,772 WARN org.apache.zookeeper.server.quorum.Leader: First is 0x0 2013-08-26 20:59:17,772 INFO org.apache.zookeeper.server.quorum.Leader: Have quorum of supporters; starting up and setting last processed zxid: 0x100000000 2013-08-26 20:59:19,240 INFO org.apache.zookeeper.server.persistence.FileTxnLog: Creating new log file: log.100000001 2013-08-26 20:59:20,715 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:delete cxid:0x13 zxid:0x100000014 txntype:-1 reqpath:n/a Error Path:/hbase/backup-masters/ubuntu-26,60000,1377521958838 Error:KeeperErrorCode = NoNode for /hbase/backup-masters/ubuntu-26,60000,1377521958838 2013-08-26 20:59:20,750 INFO org.apache.zookeeper.server.NIOServerCnxnFactory: Accepted socket connection from /192.168.2.28:57106 2013-08-26 20:59:20,758 WARN org.apache.zookeeper.server.NIOServerCnxn: Exception causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not running 2013-08-26 20:59:20,758 INFO org.apache.zookeeper.server.NIOServerCnxn: Closed socket connection for client /192.168.2.28:57106 (no session established for client) 2013-08-26 20:59:22,541 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610000 type:create cxid:0x7 zxid:0x10000001c txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot Error:KeeperErrorCode = NoNode for /hbase/online-snapshot 2013-08-26 20:59:22,543 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610002 type:create cxid:0x7 zxid:0x10000001d txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot Error:KeeperErrorCode = NoNode for /hbase/online-snapshot 2013-08-26 20:59:22,572 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610000 type:create cxid:0x8 zxid:0x10000001f txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot 2013-08-26 20:59:22,584 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610000 type:create cxid:0x9 zxid:0x100000021 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/acquired Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/acquired 2013-08-26 20:59:22,590 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:create cxid:0x1e zxid:0x100000022 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/acquired Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/acquired 2013-08-26 20:59:22,597 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610000 type:create cxid:0xb zxid:0x100000024 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/reached Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/reached 2013-08-26 20:59:22,604 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:create cxid:0x20 zxid:0x100000025 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/reached Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/reached 2013-08-26 20:59:22,612 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x40bab4fb610000 type:create cxid:0xd zxid:0x100000027 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/abort Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/abort 2013-08-26 20:59:22,617 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:create cxid:0x22 zxid:0x100000028 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/abort Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/abort 2013-08-26 20:59:23,154 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820003 type:create cxid:0x7 zxid:0x100000030 txntype:-1 reqpath:n/a Error Path:/hbase/online-snapshot/acquired Error:KeeperErrorCode = NodeExists for /hbase/online-snapshot/acquired 2013-08-26 20:59:28,382 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:delete cxid:0x32 zxid:0x100000032 txntype:-1 reqpath:n/a Error Path:/hbase/root-region-server Error:KeeperErrorCode = NoNode for /hbase/root-region-server 2013-08-26 20:59:28,447 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:delete cxid:0x3a zxid:0x100000035 txntype:-1 reqpath:n/a Error Path:/hbase/table92/-ROOT- Error:KeeperErrorCode = NoNode for /hbase/table92/-ROOT- 2013-08-26 20:59:30,450 INFO org.apache.zookeeper.server.PrepRequestProcessor: Got user-level KeeperException when processing sessionid:0x140bab4fb820000 type:delete cxid:0x4e zxid:0x10000003f txntype:-1 reqpath:n/a Error Path:/hbase/table92/.META. Error:KeeperErrorCode = NoNode for /hbase/table92/.META. 2013-08-26 21:00:24,784 INFO org.apache.zookeeper.server.PrepRequestProcessor: Processed session termination for sessionid: 0x140bab4fb820007 2013-08-26 21:00:25,950 INFO org.apache.zookeeper.server.PrepRequestProcessor: Processed session termination for sessionid: 0x40bab4fb610005 2013-08-26 21:00:25,985 INFO org.apache.zookeeper.server.NIOServerCnxnFactory: Accepted socket connection from /192.168.2.27:58822 2013-08-26 21:00:25,990 INFO org.apache.zookeeper.server.ZooKeeperServer: Client attempting to establish new session at /192.168.2.27:58822

已经是用最新的版本编译过,hadoop 为安全起见已经是 format 过。还是上次相同的错误。 这是dead 中 日志

codefollower commented 11 years ago

zk是不是有问题啦,你试试用zk client能连得上么

codefollower commented 11 years ago

这个日志看不出region server为什么会dead,还有更全的日志么?

cxwxz commented 11 years ago

你好: 现提供相关的日志供参考。 用户建表 是USER 表, 建表与新增是OK 的。就用SELECT 时 存储该表的 regionserver 会自动 dead. 在日志中没找到错误明显的提示。

关于集群 hbase 环境有用过 另一个 phoenix 进行测过是正常 的。相关的 zookeeper 的监听端口 telnet 也是正常 的。

cxwxzhtc

发件人: codefollower 发送时间: 2013-08-26 21:32 收件人: codefollower/Lealone 抄送: cxwxz 主题: Re: [Lealone] 还是不能select (#45) 这个日志看不出region server为什么会dead,还有更全的日志么? — Reply to this email directly or view it on GitHub.

codefollower commented 11 years ago

运行region server的JVM退出信息都没有吗?比如aborting server 或stopping server之类的

cxwxz commented 11 years ago

能否给个 该项目的 hbase 集群 hbase-site.xml 文件 供配置参考一下。 在这几天的测试中发现, 用 where rowkey='3' 之类的查询还是可行的。 其它的查询 都会 使 RegionServer 内存溢出的 dead。 注,本人的测试环境用 test 项目中的 hbase-site.xml 也会内存溢出 而造成 dead。

codefollower commented 11 years ago

我是在我本机上面测的,没配什么的,如下:

<configuration>
    <property>
        <name>hbase.zookeeper.quorum</name>
        <value>127.0.0.1</value>
    </property>

    <property>
        <name>hbase.coprocessor.master.classes</name>
        <value>com.codefollower.lealone.hbase.engine.HBaseMasterObserver</value>
    </property>

    <property>
        <name>hbase.regionserver.impl</name>
        <value>com.codefollower.lealone.hbase.engine.HBaseRegionServer</value>
    </property>
</configuration>
cxwxz commented 11 years ago

经过这几天的测试发现,不能配置多台 hbase.zookeeper.quorum 否则一查询表所在的RegionServer 就会出现内存溢出而导致 dead 的情况出现。   目前 用的环境是  hadoop 1.1.2 一台 namenode 三台 datanode
hbase 0.94.10  一台 HMaster 三台 HRegionServer 三台 hbase.zookeeper.quorum 时会出现 dead 的情况,目前配一台  hbase.zookeeper.quorum使用查询是正常使用。 是不是 zookeeper 的调度有问题,在配多台 hbase.zookeeper.quorum 情况下。

codefollower commented 11 years ago

我看看你在使用lealone client时是怎么指定jdbc url的?

这问题有点奇怪,lealone其实都没跟zookeeper打交道。

cxwxz commented 11 years ago

jdbc:lealone:tcp://ubuntu-29:9092,ubuntu-27:9092,ubuntu-28:9092/hbasedb 用这种方式,同时配置多个 就会出现内存 溢出的情况 用以下方式则,正常使用。 jdbc:lealone:tcp://ubuntu-29:9092/hbasedb jdbc:lealone:tcp://ubuntu-28:9092/hbasedb

codefollower commented 11 years ago

谢谢报告问题,我查一下原因。

codefollower commented 11 years ago

jdbc:lealone:tcp://ubuntu-29:9092,ubuntu-27:9092,ubuntu-28:9092/hbasedb 这种方式是H2数据库的遗留方式, 写数据时,每个server都写一份,会有问题, 正确应该是随机选一个server,然后sql先转给它,最后再由它确定往哪写。 我正在修复这个bug。

另外, Issue #55 也有可能是这个原因导致。