Over a million developers have joined DZone.

Hadoop, MapReduce and Hive: How to Use Non-Java Languages, Such as R

DZone's Guide to

Hadoop, MapReduce and Hive: How to Use Non-Java Languages, Such as R

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

This recent tutorial from Tom Hanlon at Hortonworks demonstrates how to use non-Java languages - R, in particular - to work with Hadoop data through MapReduce and Hive. Hanlon begins with a brief overview of Hadoop, and then divides his tutorial into two sections covering different approaches: Streaming (a MapReduce job from the command line) and Hive (passing data through a script).

Though the tutorial focuses on R, it is also meant to open doors for users working with other languages, such as Python, Ruby, and Linux commands or Shell scripts. To get started with Hadoop data using languages other than Java, take a look at Hanlon's full tutorial on Hortonworks.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.


Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}