Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Attack of the Hadoop Hybrids!

DZone's Guide to

Attack of the Hadoop Hybrids!

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.


 As if IBM and EMC crashing the Hadoop party wasn’t strange enough, now it seems that every data warehousing, database and appliance vendor has a Hadoop co-existence story and, in many cases, a hybrid architecture that they claim provides a best of both worlds solution to data warehousing / big data analytics.

The good news is that this is proof-positive that Hadoop has arrived as an enterprise solution. The bad news, particularly with some of these hybrid architectures, is that this is not a best of both worlds solution. In fact, at least in our opinion, the hybrid model is a worst of all worlds solution.

On the face of it, hybrid architecture should make sense. After all, Hadoop with its focus on unstructured data and its low-cost model should be the ideal complement to a traditional data warehousing solution with a focus on structured data and pre-built enterprise integration.

But how does this work when, in many cases, the two software architectures can’t co-exist on the same hardware or operating system platform? And how does it work when already stretched IT or DevOps teams are expected to manage two totally dissimilar hardware and software architectures and two datasets that need to be kept in sync in order to support up-to-date and accurate analytics?

Of course, the answer is that, for most companies, it doesn’t work. Maybe some large enterprises have the resources to make this work but, for the rest of us, it’s hard enough to maintain one architecture let alone two.

That’s why Treasure Data’s solution is a service. Dealing with the operational complexities of hybrid environments isn’t something that analytic users – or even DevOps – should have to focus on or worry about, as it adds no value to the business. Leave that to us. After all, that’s why we’re here.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}