
When Hadoop Gets Stuck: Debugging Hive



A recent article from David Chaiken at Altiscale shows how to debug Hive on Hadoop, using an anecdote about a customer's struggling Hive job. Chaiken acknowledges the downsides of working with Hadoop: "[it] is powerful," he says, "and it’s experiencing a tremendous rate of innovation, but it also has many rough edges." One of those rough edges is that Hadoop sometimes gives little information about what has gone wrong.

The error in Chaiken's anecdote turns out to be a simple mistake that snowballs into a bigger problem because of Hadoop's rough edges. It is eventually solved, and Chaiken's walk-through of the solution offers an interesting look at debugging a Hive job when the cause of the problem is unclear. Take a look at the full article to see one way to approach a Hive job that has lost its way.


