“Lambda Architecture” for Real-Time Hashtag Analysis
Join the DZone community and get the full member experience.
Join For FreeIn wrote an article in the Datasalt blog where I show how to use Trident, Hadoop and Splout SQL together to build a toy example "lambda architecture". There you will learn the basics of Trident, a higher-level API on top of Storm, and Splout SQL, a fast SQL read-only DB for Hadoop.
The example architecture is hosted on github . It simulates counting the number of appearances of hashtags in tweets, by date. The ultimate goal is to solve this simple problem in a fully scalable way, and provide a remote low-latency service for querying the evolution of the counts of a hashtag, including both consolidated and real-time statistics for it.
You can read the full article here.
Opinions expressed by DZone contributors are their own.
Comments