Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Run Apache Hive on Windows in 6 Easy Steps

DZone's Guide to

Run Apache Hive on Windows in 6 Easy Steps

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

Note: You need to have cygwin installed to run this tutorial, as Hadoop (needed by Hive) needs cygwin to run on windows. At a minimum, Basic, Net (OpenSSH,tcp_wrapper packages) and Security related Cygwin packages need to be present in the system.

Here are the 6 steps:

1. Download WSO2 BAM 2.0.0. It’s free and open source.

2. Extract it to a preferred location. Let’s call it $BAM_HOME.

3.Start the server by executing the wso2server.bat file present in $BAM_HOME/bin. The server would startup on the default port 9443 on the machine’s IP.

4. Log in to the web console at https://localhost:9443 using the default credentials, i.e. username: admin, password: admin and clicking “Sign-In”.

WSO2 BAM login screen

WSO2 BAM login screen

5. Navigate to the “Add Analytics” option by clicking the menu item on the left hand menu.

WSO2 BAM left hand menu - add analytics option

WSO2 BAM left hand menu – add Analytics option

6. Now execute your Hive script, by entering the script and clicking execute!

Note: Follow this KPI sample document to see a sample working for you in no time, with results appearing on a dashboard. Also, notice that you can schedule the Hive script as well.

Execute Apache Hive script

Execute Apache Hive script

I have to thank my colleague Buddhika Chamith, as all this was possible because of some grueling work done by him. Also, I hate the fact Hadoop and Hive makes it so hard to run stuff on Windows, especially since this is a Java application. Read about those concerns here.

 

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}