Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

HAWQ and MADlib for Hadoop Go Open-Source

DZone's Guide to

HAWQ and MADlib for Hadoop Go Open-Source

Pivotal announced that two of their offerings, HAWQ and MADlib, will become open-source under the Apache Software Foundation.

· Database Zone
Free Resource

Download the Guide to Open Source Database Selection: MySQL vs. MariaDB and see how the side-by-side comparison of must-have features will ease the journey. Brought to you in partnership with MariaDB.

Today, Pivotal announced that two of their offerings, HAWQ and MADlib, will become open-source under the Apache Software Foundation.

HAWQ is a SQL analytics engine for Hadoop built on HDFS, the distributed Java-based filesystem by Hortonworks. MADlib is a library for scalable in-database analytics/machine learning.

HAWQ is interesting because it allows you to get data in a declarative way, instead of for instance writing procedural MapReduce algorithms. From an enterprise perspective, businesses now have available to them an open-source tool that's already been proven in proprietary versions in demanding and high-scale environments.

MADlib, having been previously propriety, will surely become a valuable open-source tool for data scientists and developers interested in Big Data applications.

If you're familiar with Hadoop and want to explore MADlib and HAWQ on a Hadoop cluster, Pivotal Academy offers a course to get you up-to-speed which is currently open for registration. You can check out "MH101: MADlib HAWQ over Hadoop" here.

Pivotal will continue to support and develop HAWQ and MADlib, just as other private organizations such as Oracle support and develop open-source projects like Java.

Interestingly, some analysts have pointed out that this announcement from Pivotal is akin to an attack on Oracle and an attempt to "kill the traditional database." Whether this is actually the ultimate goal of this move by Pivotal and if they will be successful in their endeavor is obviously yet to be seen.

Interested in reducing database costs by moving from Oracle Enterprise to open source subscription?  Read the total cost of ownership (TCO) analysis. Brought to you in partnership with MariaDB.

Topics:
database ,pivotal ,hawq ,madlib ,hadoop

Opinions expressed by DZone contributors are their own.

THE DZONE NEWSLETTER

Dev Resources & Solutions Straight to Your Inbox

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

X

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}