Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

How to Create Blob Storage and HDInsight Hadoop Clusters With Windows Azure

DZone's Guide to

How to Create Blob Storage and HDInsight Hadoop Clusters With Windows Azure

When you create blob storage and HDInsight Hadoop clusters with Windows Azure, you can provision Spark, Storm, Kafka, and HBase with Windows Azure HDInsight.

· Big Data Zone
Free Resource

See how the beta release of Kubernetes on DC/OS 1.10 delivers the most robust platform for building & operating data-intensive, containerized apps. Register now for tech preview.

HDInsight is a platform that provides a facility to provision Hadoop, Spark, Storm, HBase, Kafka clusters, and R Servers on Windows Azure.

Now, you will learn how to set up a Hadoop cluster on Windows Azure. First, you will require blob storage for provisioning the Hadoop cluster on Windows Azure. The specified blob container will act as a file system like HDFS.

How to Create Blob Storage

Log onto the Azure portal.

On the left pane of the portal,  select New Storage > Storage account.Image title

In the Create Storage dialog, provide the cluster name and select the kind of account and its performance.

Select Replication depending on your choice. You will see three types of replication: locally redundant storage (LRS), geo-redundant storage (GRS), and read-access geo-redundant storage (RA-GRS).

Select the Subscription and Resource group where you need create your blob storage. You can create these or use existing resource groups.

Select the Location where you need to create your blob storage and then click Create.Image title

1. Create Hadoop Cluster

On the left pane of the portal,  select New > Intelligence + analytics > HDInsight.Image title

2. Configure Basic Settings

In the basic dialog, provide Cluster name and Subscription.

Click Cluster configuration and select Cluster type to Hadoop from the drop-down. Then, change the Operating system from Windows to Version.

Provide your username and password. Provide the remote desktop username and it will require you to RDP the Hadoop cluster.

Select the existing Resource group or create a new one where you need to host your Hadoop cluster.

Select the Location where you need to create your Hadoop cluster and click Next.

Image title

3. Set Storage Settings

In the Storage dialog, you can create new storage within your subscription or select existing storage.

You will see a Default container name already filled up, but you can change this according to your requirements. Then, click Next.

Image title

4. Confirm Configurations

In the Confirmation dialog, you can customize the cluster size, applications, and settings.

Image title

Click Create. It will take up to 20 minutes to provision your Hadoop cluster on Windows Azure HDInsight.

Image title

Once the Hadoop cluster is deployed, you can click on your Hadoop cluster instance name. It will show you various featured like remote desktop, diagnose, scale cluster, etc.

Image title

Similarly, you can provision Spark, Storm, Kafka, and HBase with Windows Azure HDInsight.

Now, you know how to create blob storage and HDInsight Hadoop clusters with Windows Azure!

New Mesosphere DC/OS 1.10: Production-proven reliability, security & scalability for fast-data, modern apps. Register now for a live demo.

Topics:
hdinsight ,hadoop ,windows azure ,clusters ,big data ,tutorial

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}