Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
In Tutorials->Architecture
_static/logisland-architecture.png architecture diagram is not found
In Tutorials->Index Apache logs->2. Parse the logs records
typo : "For this tutorial we will handle some apache logs with a splitText parser and send them to Elastiscearch Connect a shell" => missing '.' between ElasticSearch and Connect
Moreover in this section we first describe how to launch the streaming job. Then there is an explanation of different stages of this job (setup/stream1/stream2). When I first read this section, in a first time, I didn't realize that it was explanations and not instructions. So perhaps we should explicit that point in th documentation
in section "3. Inject some Apache logs into the system", the second NASA http log file no longer exist and should be removed (Aug 04 to Aug 31, ASCII format, 21.8 MB gzip compressed)
in section "4. Monitor your spark jobs and Kafka topics",
we mentioned "Another tool can help you to tweak and monitor your processing http://sandbox:9000/"
In docker image (both in docker registry and the one built, the tools is not started. I guess that it is kafka-manager and it doesn't be to be shipped iin docker image)
In Tutorials->Architecture _static/logisland-architecture.png architecture diagram is not found
In Tutorials->Index Apache logs->2. Parse the logs records
typo : "For this tutorial we will handle some apache logs with a splitText parser and send them to Elastiscearch Connect a shell" => missing '.' between ElasticSearch and Connect
Moreover in this section we first describe how to launch the streaming job. Then there is an explanation of different stages of this job (setup/stream1/stream2). When I first read this section, in a first time, I didn't realize that it was explanations and not instructions. So perhaps we should explicit that point in th documentation
in section "3. Inject some Apache logs into the system", the second NASA http log file no longer exist and should be removed (Aug 04 to Aug 31, ASCII format, 21.8 MB gzip compressed)
in section "4. Monitor your spark jobs and Kafka topics", we mentioned "Another tool can help you to tweak and monitor your processing http://sandbox:9000/" In docker image (both in docker registry and the one built, the tools is not started. I guess that it is kafka-manager and it doesn't be to be shipped iin docker image)