Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Hadoop Alternatives: When Your Data Isn't as Big as You Thought

DZone's Guide to

Hadoop Alternatives: When Your Data Isn't as Big as You Thought

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

This post from Chris Stuccio's blog takes a critical look at the use of Hadoop and Big Data as buzzwords by asking an interesting question: What if your data isn't as big as you think it is? He offers a very concise summary of Hadoop's purpose (a straightforward MapReduce system) and  limitations (nothing but a straightforward MapReduce system). Most interesting, though, is the list of alternatives he offers for people who he believes are not really working with big data. Solutions are covered for datasets of a variety of sizes:

  • Hundreds of megabytes
  • Ten-ish gigabytes
  • A couple of terabytes
  • Five terabytes and larger

The five and up range is when Hadoop becomes the best choice. So, if you think you might have been wasting your energy using Hadoop for datasets that are actually "kind of large," or "a little significant," or just generally not big enough to justify Hadoop, check out Chris Stuccio's full post for some new ideas.

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}