James Hardwick
2014-09-08 18:22:02 UTC
Hi All,
Iâm experiencing an issue on multiple hosts w/ Zookeeper 4.6 where Apache Solr filled the /overseer/queue node too full and can no longer read from it, and now Iâm trying to ârmr /overseer/queueâ to get things working again. Both systems have 200k+ child nodes of the node at fault.
On both systems I set -Djute.maxbuffer=5242880 within the zkServer.sh throughout the cluster and -Djute.maxbuffer=10000000 in zkCli.sh. On one system I couldnât get this to work until I set zkCliâs setting substantially higher than the zkServerâs, but I *did* get it to work and have since cleared the queue for that given system.
However, Iâm beating my head against a wall for our other system. Iâve set all of the exact same settings and am having no luck rmrâing the node. Iâve tried bumping the maxbuffer settings to 2-4x as high and still no luck. Every attempt from zkCli results in "ConnectionLossException: KeeperErrorCode = ConnectionLoss for /overseer/queue"
Iâm at my wits end here. Iâve checked everything over and over and cannot see any reason why this should not be working. It appears as a correctly set JVM arg when I grep the zookeeper process. Any advice from anyone is appreciated!
--
James Hardwick
Iâm experiencing an issue on multiple hosts w/ Zookeeper 4.6 where Apache Solr filled the /overseer/queue node too full and can no longer read from it, and now Iâm trying to ârmr /overseer/queueâ to get things working again. Both systems have 200k+ child nodes of the node at fault.
On both systems I set -Djute.maxbuffer=5242880 within the zkServer.sh throughout the cluster and -Djute.maxbuffer=10000000 in zkCli.sh. On one system I couldnât get this to work until I set zkCliâs setting substantially higher than the zkServerâs, but I *did* get it to work and have since cleared the queue for that given system.
However, Iâm beating my head against a wall for our other system. Iâve set all of the exact same settings and am having no luck rmrâing the node. Iâve tried bumping the maxbuffer settings to 2-4x as high and still no luck. Every attempt from zkCli results in "ConnectionLossException: KeeperErrorCode = ConnectionLoss for /overseer/queue"
Iâm at my wits end here. Iâve checked everything over and over and cannot see any reason why this should not be working. It appears as a correctly set JVM arg when I grep the zookeeper process. Any advice from anyone is appreciated!
--
James Hardwick