splunk / splunk-shuttl

Splunk app for archive management, including HDFS support.
Apache License 2.0
36 stars 19 forks source link

QA shuttl control from search head #125

Closed petterik closed 11 years ago

petterik commented 11 years ago

Setup:

Run happy path tests. You should be able to control all the Shuttl's from the UI of the Search Head. Thawing, Listing and Flushing. Run sad paths, crashing Splunks, Shuttls. Be evil.

Comment on anything that you find weird.

Klevmarken commented 11 years ago

Shuttle Test 1 - Nice Test: 1 Master / Hadoop namenode 1 Search head / Hadoop datanode 2 Search peers / Hadoop datanode

Once shuttl had been enabled on all the nodes the Search head displayed the following message: "Search results may be incomplete, peer ip-10-4-150-48-ec2-user's search ended prematurely. This may be caused by a variety of reason, please consult logs on peer for details!"

What logs should be consulted?

Shuttl Test 2 - Massive Test:

1 Master / Hadoop Namenode 1 Search head / Hadoop datanode 10 Search peers / Hadoop datanodes

Error 500 Shutdown in progress java.lang.IllegalStateException: Shutdown in progress at java.lang.ApplicationShutdownHooks.add(ApplicationShutdownHooks.java:57) at java.lang.Runtime.addShutdownHook(Runtime.java:209) at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1439) at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:254) at com.splunk.shuttl.archiver.filesystem.hadoop.HadoopArchiveFileSystemFactory.doCreate(HadoopArchiveFileSystemFactory.java:53) at com.splunk.shuttl.archiver.filesystem.hadoop.HadoopArchiveFileSystemFactory.createWithPropertyFile(HadoopArchiveFileSystemFactory.java:44) at com.splunk.shuttl.archiver.filesystem.hadoop.HadoopArchiveFileSystemFactory.create(HadoopArchiveFileSystemFactory.java:37) at com.splunk.shuttl.archiver.filesystem.ArchiveFileSystemFactory.supportedArchiveFileSystem(ArchiveFileSystemFactory.java:105) at com.splunk.shuttl.archiver.filesystem.ArchiveFileSystemFactory.getByNameAndLocalFileSystemPaths(ArchiveFileSystemFactory.java:96) at com.splunk.shuttl.archiver.filesystem.ArchiveFileSystemFactory.getWithConfiguration(ArchiveFileSystemFactory.java:78) at com.splunk.shuttl.archiver.thaw.BucketThawerFactory.createWithConfigAndSplunkSettingsAndLocalFileSystemPaths(BucketThawerFactory.java:52) at com.splunk.shuttl.archiver.thaw.BucketThawerFactory.createDefaultThawer(BucketThawerFactory.java:41) at com.splunk.shuttl.server.mbeans.rest.ThawBucketsEndpoint.thawBuckets(ThawBucketsEndpoint.java:92) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:616) at com.sun.jersey.spi.container.JavaMethodInvokerFactory$1.invoke(JavaMethodInvokerFactory.java:60) at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$TypeOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:185) at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:75) at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:288) at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:108) at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:147) at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:84) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1469) at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1400) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1349) at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1339) at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:416) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537) at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:708) at javax.servlet.http.HttpServlet.service(HttpServlet.java:802) at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:594) at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:485) at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:119) at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:521) at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:233) at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1065) at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:412) at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:192) at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:999) at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:117) at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:111) at org.eclipse.jetty.server.Server.handle(Server.java:351) at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:451) at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:916) at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:634) at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230) at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:76) at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:609) at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:45) at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:599) at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:534) at java.lang.Thread.run(Thread.java:679)

{"buckets":[],"failed":[]}curl: (52) Empty reply from server curl: c(u5r2l): E(m5p2t)y Ermepptlyy rferpolmy sferrovme rs erver curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server curl: (52) Empty reply from server

:The following WARN were caught when spamming the script over and over. 

2013-02-28 18:37:39,637 WARN com.splunk.shuttl.server.distributed.RequestOnSearchPeers: warning="Executed request on distributed peer" happened="java.lang.RuntimeException: Too many open files" result="will add to exceptions, which can be retrieved with getExceptions()"

2013-02-28 18:37:39,701 WARN com.splunk.shuttl.server.distributed.RequestOnSearchPeers: warning="Executed request on distributed peer" happened="java.lang.RuntimeException: java.net.SocketException: Too many open files" result="will add to exceptions, which can be retrieved with getExceptions()"

"failed":["java.lang.RuntimeException: java.io.IOException: Unable to delete directory /mnt/KlevisTestArea/splunk/var/lib/splunk/cluster_shuttl_test/thaweddb/db_1350851422_1312494429_26."]}