KassandraMRHelper: A New Tool for Cassandra and Hadoop

Users of Cassandra and Hadoop may be interested in KassandraMRHelper, a new tool from Knewton. Its purpose is implied in the name: it simplifies extracting data from Cassandra into Hadoop and running MapReduce jobs over that data. Its approach differs from other techniques, however. According to the overview of KassandraMRHelper on Knewton's blog:

[KassandraMRHelper] doesn’t require a live Cassandra cluster to extract the data from. This allows us to re-run map-reduce jobs multiple times without worrying about any performance degradation of our production services. This means that we don’t have to accommodate more traffic for these offline analyses, which keeps costs down.

Knewton's full blog post also includes a breakdown of how it all works, as well as a small tutorial including sample code. Anybody working with Cassandra and Hadoop should take a look.


