Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!

Promise level: Intriguing


Apache Storm

Search for: Apache Storm

Storm site at Apache

Apache Storm Architecture

Search for: Apache Storm Architecture

This goes into the history of Storm by its author

Basic framework is Clojure (JVM) based

Real time processing of events in parallel



Logic can be written in any language (language-agnostic abstraction)

ZooKeeper to keep state
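
To make the architecture notes above concrete, here is a minimal sketch of the dataflow idea in plain Python. It does not use the Storm API: every function name is hypothetical, the source is a finite list rather than an unbounded stream, and the "parallel tasks" are simulated in one process. Storm's own terms for the pieces are spout (stream source), bolt (processing step), and fields grouping (routing tuples with the same key to the same task).

```python
# Toy model of a Storm-style topology: one spout feeding parallel bolt tasks.
# This is NOT the Storm API; every name here is hypothetical.
from collections import Counter
import zlib


def sentence_spout():
    """Spout: the source of the stream (finite here; unbounded in practice)."""
    for sentence in ["the cow jumped", "the moon", "the cow"]:
        yield sentence


def split_bolt(sentence):
    """Bolt: turn one input tuple into zero or more output tuples."""
    return sentence.split()


def task_for(word, num_tasks):
    """Fields-grouping stand-in: the same word always maps to the same task."""
    return zlib.crc32(word.encode("utf-8")) % num_tasks


def run_topology(num_tasks=2):
    # Each Counter stands in for the state of one parallel counting task.
    task_counts = [Counter() for _ in range(num_tasks)]
    for sentence in sentence_spout():
        for word in split_bolt(sentence):
            task_counts[task_for(word, num_tasks)][word] += 1
    return task_counts


def merged_counts(task_counts):
    """Combine per-task state for inspection."""
    total = Counter()
    for counts in task_counts:
        total.update(counts)
    return total
```

Calling merged_counts(run_topology(num_tasks=3)) gives the overall word counts. Because routing is by key, each word's count lives in exactly one task, which is what lets the counting step run in parallel without coordination between tasks.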

Spark and Apache Storm

Search for: Spark and Apache Storm

Here is a long discussion on this

Apache Storm and Apache Flink

Search for: Apache Storm and Apache Flink

Here is some clarity to the soup of names

//Streaming based

Storm - Pure real time stream processing

Storm/Trident - with some batching capabilities added


Real time event processing bolted on to Kafka


Flink - Real time processing with batch capabilities


Spark - In memory batch processing with real time processing enabled
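
The split running through the list above is between handling each event as it arrives and buffering events into small batches. A rough sketch of that difference, in plain Python with hypothetical helper names and no real framework API:

```python
# Toy contrast between per-event and micro-batch processing.
# Hypothetical helpers; neither function uses a real framework API.

def process_per_event(events, handle):
    """Pure streaming: handle each event as soon as it arrives."""
    return [handle(event) for event in events]


def process_micro_batches(events, handle_batch, batch_size):
    """Micro-batching: buffer events, then handle each full or final batch."""
    results = []
    batch = []
    for event in events:
        batch.append(event)
        if len(batch) == batch_size:
            results.append(handle_batch(batch))
            batch = []
    if batch:  # flush the last, possibly partial, batch
        results.append(handle_batch(batch))
    return results
```

For example, process_micro_batches([1, 2, 3, 4, 5], sum, 2) handles the batches [1, 2], [3, 4] and [5]. The per-event version produces a result for every event right away, while the batched version waits for a batch to fill and spreads per-call overhead across several events.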

Here is a good discussion on Storm and Flink