Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

When Hadoop Gets Stuck: Debugging Hive

DZone's Guide to

When Hadoop Gets Stuck: Debugging Hive

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

This recent article from David Chaiken at Altiscale discusses how to debug Hive (Hadoop) through an anecdote regarding a customer's struggling Hive job. According to Chaiken, there are downsides to working with Hadoop - "[it] is powerful," he says, "and it’s experiencing a tremendous rate of innovation, but it also has many rough edges" - and sometimes Hadoop does not offer a lot of information in terms of what has gone wrong.

The error in Chaiken's anecdote is eventually solved, and it's a simple mistake that snowballs into a bigger problem because of some of these rough edges in Hadoop, but Chaiken's walk-through of his solution provides an interesting look at debugging a Hive job when the problem is unclear. Take a look at the full article and find out one way to approach a Hive job that has lost its way.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}