Comparison with popular streaming engine as Flink

hi, there is a hydra doc that may help you decide in case you don't know it already http://oss-docs.addthiscode.net/hydra/latest/user-guide/index.html

my personal opinions are: comparing hydra to Uber's AthenaX+Flink, the basic functionality is data aggregation, and empower users to manage jobs without coding applications.

pros for AthenaX+Flink

more rich sql-like queries via conversion by query planner. hydra's query grammar can be found in the doc
rich task level alerts (hydra only alerts on task status but not task results)
more options in terms of input/output format. it supports kafka/cassandra/memsql/mysql/elasticsearch while hydra supports mainly kafka/output of anther job/file/S3 (WIP)
real map-reduce model, while hydra's reduce machine is a single host
more real time? hydra jobs can be set to run in intervals, but not in a "real" real-time fashion. but i am not 100% sure of how Uber's stack works in details

pros for hydra

no external dependencies (except rabbitmq and zookeeper), meaning you don't have to manage Calcite Flink Yarn, levelDB... before you can actually do things
sort of the same as 1, but it is worth emphasizing that hydra is a all-in-one system, including data storage, replication, processing, query, job management, web UI...

addthis / hydra

Comparison with popular streaming engine as Flink #284