Alexius Ludeman
2012-01-18 01:00:43 UTC
hi,
I am getting the error from all 5 zookeeperd, "Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314" every
microsecond(?).
I have a 5 node cluster on ubuntu maverick in aws. I've backported
zookeeperd and now am running 3.3.4+dfsg1-1. My apps are
using zookeeper-3.3.3.jar, but within the next 3-4 weeks I will be
upgrading to zookeeper-3.3.4.jar. This problem also existed with
zookeeperd on 3.3.3, in hopes of upgrading would solve my problem.
I have stopped the whole world, rm -rf /var/lib/zookeeper and restarted
without any change to this log message. I have several other environments
without this error showing, same versions of client and server but only 3
zookeeperd, and much fewer clients.
Our use of zookeeper is: appName is a key and the value is a server:port.
After the app reads in its required appName(s), it puts watches on these
appNames looking for changes. So we have a bunch of apps with many
watches. The client count is below.
I would be grateful if anyone has some insight on how to debug further.
thanks
lex
zookeeper-0.p2:/var/log/zookeeper/zookeeper.log:
2012-01-17 22:23:59,097 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38653
2012-01-17 22:23:59,097 - INFO [Thread-2309197:***@1435][] -
Closed socket connection for client /127.0.0.1:38652 (no session
established for client)
2012-01-17 22:23:59,097 - WARN [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@639][] - Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314
2012-01-17 22:23:59,097 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1435][] - Closed socket connection for
client /127.0.0.1:38653 (no session established for client)
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38654
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1237][] - Processing stat command from /
127.0.0.1:38654
2012-01-17 22:23:59,098 - INFO
[Thread-2309198:NIOServerCnxn$***@1153][] - Stat command output
2012-01-17 22:23:59,098 - INFO [Thread-2309198:***@1435][] -
Closed socket connection for client /127.0.0.1:38654 (no session
established for client)
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38655
2012-01-17 22:23:59,099 - WARN [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@639][] - Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1435][] - Closed socket connection for
client /127.0.0.1:38655 (no session established for client)
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38656
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1237][] - Processing stat command from /
127.0.0.1:38656
Connected to zookeeper-0.p2
envi
Environment:
zookeeper.version=3.3.4--1, built on 12/09/2011 00:03 GMT
host.name=zookeeper-0.p2.use.color.internal
java.version=1.6.0_20
java.vendor=Sun Microsystems Inc.
java.home=/usr/lib/jvm/java-6-openjdk/jre
java.class.path=/etc/zookeeper/conf:/usr/share/java/jline.jar:/usr/share/java/log4j-1.2.jar:/usr/share/java/xercesImpl.jar:/usr/share/java/xmlParserAPIs.jar:/usr/share/java/zookeeper.jar
java.library.path=/usr/lib/jvm/java-6-openjdk/jre/lib/amd64/server:/usr/lib/jvm/java-6-openjdk/jre/lib/amd64:/usr/lib/jvm/java-6-openjdk/jre/../lib/amd64:/usr/java/packages/lib/amd64:/usr/lib/jni:/lib:/usr/lib
java.io.tmpdir=/tmp
java.compiler=<NA>
os.name=Linux
os.arch=amd64
os.version=2.6.35-24-virtual
user.name=zookeeper
user.home=/var/lib/zookeeper
user.dir=/
Trying 10.118.145.79...Connected to zookeeper-0.p2
stats
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.40.50.163:45526[1](queued=0,recved=2819,sent=2820)
/10.124.143.97:46512[0](queued=0,recved=1,sent=0)
/10.40.37.34:33308[1](queued=0,recved=2795,sent=2796)
/10.205.49.18:35339[1](queued=0,recved=2671,sent=2672)
/10.205.49.18:42419[1](queued=0,recved=2718,sent=2719)
/10.80.82.184:43535[1](queued=0,recved=2819,sent=2820)
/10.84.133.213:52533[1](queued=0,recved=2817,sent=2844)
Latency min/avg/max: 0/0/200
Received: 7330810
Sent: 7331408
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.34.37.214...Connected to zookeeper-1.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.79.6.245:38198[1](queued=0,recved=2680,sent=2681)
/10.98.119.91:37301[1](queued=0,recved=2689,sent=2690)
/10.79.35.125:59744[1](queued=0,recved=2681,sent=2682)
/10.39.81.28:38047[1](queued=0,recved=2666,sent=2667)
/10.124.143.97:48969[0](queued=0,recved=1,sent=0)
Latency min/avg/max: 0/0/81
Received: 4984011
Sent: 4984267
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.38.85.114...Connected to zookeeper-2.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.84.133.213:51499[1](queued=0,recved=2655,sent=2656)
/10.124.143.97:39412[0](queued=0,recved=1,sent=0)
/10.39.81.28:56431[1](queued=0,recved=2715,sent=2716)
/10.204.183.195:40180[1](queued=0,recved=2664,sent=2665)
/10.32.51.156:59390[1](queued=0,recved=2634,sent=2635)
/10.79.6.245:42161[1](queued=0,recved=2731,sent=2732)
/10.118.14.138:40355[1](queued=0,recved=2643,sent=2644)
Latency min/avg/max: 0/0/33
Received: 6650278
Sent: 6651369
Outstanding: 0
Zxid: 0x100014e31
Mode: leader
Node count: 197
Trying 10.101.42.61...Connected to zookeeper-3.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.204.230.212:57523[1](queued=0,recved=2719,sent=2720)
/10.85.19.23:52916[1](queued=0,recved=2836,sent=2837)
/10.79.35.125:54742[1](queued=0,recved=2734,sent=2735)
/10.124.143.97:60909[0](queued=0,recved=1,sent=0)
/10.204.230.212:57918[1](queued=0,recved=2674,sent=2675)
/10.100.90.101:35977[1](queued=0,recved=2680,sent=2681)
/10.98.119.91:42147[1](queued=0,recved=2745,sent=2746)
Latency min/avg/max: 0/0/123
Received: 8430215
Sent: 8430724
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.204.59.55...Connected to zookeeper-4.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.100.90.101:34437[1](queued=0,recved=2728,sent=2729)
/10.204.183.195:49961[1](queued=0,recved=2706,sent=2707)
/10.124.143.97:34797[0](queued=0,recved=1,sent=0)
Latency min/avg/max: 0/0/19
Received: 46410
Sent: 46411
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
I am getting the error from all 5 zookeeperd, "Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314" every
microsecond(?).
I have a 5 node cluster on ubuntu maverick in aws. I've backported
zookeeperd and now am running 3.3.4+dfsg1-1. My apps are
using zookeeper-3.3.3.jar, but within the next 3-4 weeks I will be
upgrading to zookeeper-3.3.4.jar. This problem also existed with
zookeeperd on 3.3.3, in hopes of upgrading would solve my problem.
I have stopped the whole world, rm -rf /var/lib/zookeeper and restarted
without any change to this log message. I have several other environments
without this error showing, same versions of client and server but only 3
zookeeperd, and much fewer clients.
Our use of zookeeper is: appName is a key and the value is a server:port.
After the app reads in its required appName(s), it puts watches on these
appNames looking for changes. So we have a bunch of apps with many
watches. The client count is below.
I would be grateful if anyone has some insight on how to debug further.
thanks
lex
zookeeper-0.p2:/var/log/zookeeper/zookeeper.log:
2012-01-17 22:23:59,097 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38653
2012-01-17 22:23:59,097 - INFO [Thread-2309197:***@1435][] -
Closed socket connection for client /127.0.0.1:38652 (no session
established for client)
2012-01-17 22:23:59,097 - WARN [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@639][] - Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314
2012-01-17 22:23:59,097 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1435][] - Closed socket connection for
client /127.0.0.1:38653 (no session established for client)
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38654
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1237][] - Processing stat command from /
127.0.0.1:38654
2012-01-17 22:23:59,098 - INFO
[Thread-2309198:NIOServerCnxn$***@1153][] - Stat command output
2012-01-17 22:23:59,098 - INFO [Thread-2309198:***@1435][] -
Closed socket connection for client /127.0.0.1:38654 (no session
established for client)
2012-01-17 22:23:59,098 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38655
2012-01-17 22:23:59,099 - WARN [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@639][] - Exception causing close of
session 0x0 due to java.io.IOException: Len error 1835955314
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1435][] - Closed socket connection for
client /127.0.0.1:38655 (no session established for client)
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:NIOServerCnxn$***@251][] - Accepted socket
connection from /127.0.0.1:38656
2012-01-17 22:23:59,099 - INFO [NIOServerCxn.Factory:
0.0.0.0/0.0.0.0:2181:***@1237][] - Processing stat command from /
127.0.0.1:38656
Connected to zookeeper-0.p2
envi
Environment:
zookeeper.version=3.3.4--1, built on 12/09/2011 00:03 GMT
host.name=zookeeper-0.p2.use.color.internal
java.version=1.6.0_20
java.vendor=Sun Microsystems Inc.
java.home=/usr/lib/jvm/java-6-openjdk/jre
java.class.path=/etc/zookeeper/conf:/usr/share/java/jline.jar:/usr/share/java/log4j-1.2.jar:/usr/share/java/xercesImpl.jar:/usr/share/java/xmlParserAPIs.jar:/usr/share/java/zookeeper.jar
java.library.path=/usr/lib/jvm/java-6-openjdk/jre/lib/amd64/server:/usr/lib/jvm/java-6-openjdk/jre/lib/amd64:/usr/lib/jvm/java-6-openjdk/jre/../lib/amd64:/usr/java/packages/lib/amd64:/usr/lib/jni:/lib:/usr/lib
java.io.tmpdir=/tmp
java.compiler=<NA>
os.name=Linux
os.arch=amd64
os.version=2.6.35-24-virtual
user.name=zookeeper
user.home=/var/lib/zookeeper
user.dir=/
Trying 10.118.145.79...Connected to zookeeper-0.p2
stats
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.40.50.163:45526[1](queued=0,recved=2819,sent=2820)
/10.124.143.97:46512[0](queued=0,recved=1,sent=0)
/10.40.37.34:33308[1](queued=0,recved=2795,sent=2796)
/10.205.49.18:35339[1](queued=0,recved=2671,sent=2672)
/10.205.49.18:42419[1](queued=0,recved=2718,sent=2719)
/10.80.82.184:43535[1](queued=0,recved=2819,sent=2820)
/10.84.133.213:52533[1](queued=0,recved=2817,sent=2844)
Latency min/avg/max: 0/0/200
Received: 7330810
Sent: 7331408
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.34.37.214...Connected to zookeeper-1.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.79.6.245:38198[1](queued=0,recved=2680,sent=2681)
/10.98.119.91:37301[1](queued=0,recved=2689,sent=2690)
/10.79.35.125:59744[1](queued=0,recved=2681,sent=2682)
/10.39.81.28:38047[1](queued=0,recved=2666,sent=2667)
/10.124.143.97:48969[0](queued=0,recved=1,sent=0)
Latency min/avg/max: 0/0/81
Received: 4984011
Sent: 4984267
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.38.85.114...Connected to zookeeper-2.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.84.133.213:51499[1](queued=0,recved=2655,sent=2656)
/10.124.143.97:39412[0](queued=0,recved=1,sent=0)
/10.39.81.28:56431[1](queued=0,recved=2715,sent=2716)
/10.204.183.195:40180[1](queued=0,recved=2664,sent=2665)
/10.32.51.156:59390[1](queued=0,recved=2634,sent=2635)
/10.79.6.245:42161[1](queued=0,recved=2731,sent=2732)
/10.118.14.138:40355[1](queued=0,recved=2643,sent=2644)
Latency min/avg/max: 0/0/33
Received: 6650278
Sent: 6651369
Outstanding: 0
Zxid: 0x100014e31
Mode: leader
Node count: 197
Trying 10.101.42.61...Connected to zookeeper-3.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.204.230.212:57523[1](queued=0,recved=2719,sent=2720)
/10.85.19.23:52916[1](queued=0,recved=2836,sent=2837)
/10.79.35.125:54742[1](queued=0,recved=2734,sent=2735)
/10.124.143.97:60909[0](queued=0,recved=1,sent=0)
/10.204.230.212:57918[1](queued=0,recved=2674,sent=2675)
/10.100.90.101:35977[1](queued=0,recved=2680,sent=2681)
/10.98.119.91:42147[1](queued=0,recved=2745,sent=2746)
Latency min/avg/max: 0/0/123
Received: 8430215
Sent: 8430724
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197
Trying 10.204.59.55...Connected to zookeeper-4.p2
stat
Zookeeper version: 3.3.4--1, built on 12/09/2011 00:03 GMT
Clients:
/10.100.90.101:34437[1](queued=0,recved=2728,sent=2729)
/10.204.183.195:49961[1](queued=0,recved=2706,sent=2707)
/10.124.143.97:34797[0](queued=0,recved=1,sent=0)
Latency min/avg/max: 0/0/19
Received: 46410
Sent: 46411
Outstanding: 0
Zxid: 0x100014e31
Mode: follower
Node count: 197