This post is about the HD Insight Tools Visual Studio Add In - for accessing Azure Hadoop Hive from Visual Studio. I hacked together the same on my way back from the MVP summit, and a post is due. If you are new, last week I wrote about issuing LINQ queries against the Hive meta store – read that post for a general introduction about LINQ to Hive.
You may use HDInsight Tools Visual Studio Add In for
- Generating wrapper classes for Hive tables for easily executing LINQ to Hive queries.
- Running Hive SQL queries from Visual Studio and for viewing the results
Please note that you need to provision a Hadoop cluster in Azure, or need HDInsight installed locally (See these earlier posts related to Hadoop on Azure - Analyzing some ‘Big Data’ using C# and Extracting Top 500 MSDN Links from Stack Overflow ).
Installing The Add In
You can install the Add In from Visual Studio Tools –> Extensions and Updates.
Generating LINQ To Hive Model Classes
Once you install the Add In, Goto View-> Other Windows –> Hive Explorer to bring up the Hive Explorer tool Window. Click Add Connection to add a hive connection, by providing the server details, port, user name and password and click Save.
Once you add a connection, you can expand the connection to view the tables and fields in the Hive. Now, you can right click on the Added connection name in the tool Window, and click ‘Generate Code’ context menu to generate the model classes. Also, make sure you install the Microsoft.Hadoop.Hive nuget package by issuing the install-package Microsoft.Hadoop.Hive –pre nuget command.
You can use the generated model to query Hive using LINQ as mentioned in the previous article.
You can use the Queries window of Hive Explorer to issue Hive Queries. Choose the connection in the combo box, and shoot your query to view the results.
Happy Coding. I’m soon uploading the source code of Hive Explore to Github after some clean up once I’ve some time.