Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

KassandraMRHelper: A New Tool for Cassandra and Hadoop

DZone's Guide to

KassandraMRHelper: A New Tool for Cassandra and Hadoop

· Java Zone
Free Resource

Get the Edge with a Professional Java IDE. 30-day free trial.

Users of Cassandra and Hadoop may be interested in a new tool from Knewton called KassandraMRHelper. The purpose of the tool is implied in the name - simplifying the process of extracting data out of Cassandra and into Hadoop, and map-reducing the data - but it takes some different approaches from other techniques. According to the overview of KassandraMRHelper on Knewton's blog:

[KassandraMRHelper] doesn’t require a live Cassandra cluster to extract the data from. This allows us to re-run map-reduce jobs multiple times without worrying about any performance degradation of our production services. This means that we don’t have to accommodate more traffic for these offline analyses, which keeps costs down.

Knewton's full blog post also includes a breakdown of how it all works, as well as a small tutorial including sample code. Anybody working with Cassandra and Hadoop should take a look.

Get the Java IDE that understands code & makes developing enjoyable. Level up your code with IntelliJ IDEA. Download the free trial.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}