Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Using Oozie 4.4.0 with Hadoop 2.2

DZone's Guide to

Using Oozie 4.4.0 with Hadoop 2.2

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

The current version of Oozie (4.0.0) doesn’t build correctly when you try and target Hadoop 2.2. The Oozie team have a fix going into release 4.0.1 (see OOZIE-1551), but until then you can hack the Maven files to get it working with 4.0.0.

First download the 4.0.0 version from https://oozie.apache.org/, and then unpackage it. Next run the following command to change the Hadoop version being targeted:

cd oozie-4.0.0/
find . -name pom.xml | xargs sed -ri 's/(2.2.0\-SNAPSHOT)/2.2.0/'

Now all you need to do is target the hadoop-2 profile in Maven and you’ll be all set:

mvn -DskipTests=true -P hadoop-2 clean package assembly:single


Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}