Why Isn’t Big Data Called Small Data?

If there were a more accessible adjective to discuss “big” data, then maybe, just maybe, more people would be comfortable exploring it.

Sometimes, I think that Big Data has a branding problem.

You see, for data scientists to gain trust and buy-in from their colleagues, they have to explain how their analysis can add value. They take a “data ocean” of information and distil it into highly specific and actionable insights for every internal customer, refining and refreshing it along the way to ensure that it is as relevant as possible.

It's like they take the most powerful telescope imaginable and look for a speck of dust on the moon. “Here you go, this precise set of data will prove that you are right.”

The success of Big Data initiatives lies, to a large extent, in the ability to drill down from the planetary level to the sub-atomic level. It’s all about getting to those small insights that would never have appeared had you not started large and refocused, refocused, and refocused. Of course, this doesn’t mean that the bigger trends are not relevant, but we have a tendency to view anything “large” with a certain amount of mistrust.

Somehow, we naturally think that “big” things have a bigger margin for error, although the assumptions that we made on the way to the smaller insights could equally have been flawed.

So, what do we trust more — “big” things or “small” things?

Something tells me that we actually trust small things more, but I am very happy to hear the inevitably differing views of the readers.

I have a suspicion that when data science professionals talk about “Big Data” with their colleagues, the colleagues' first reaction is to glaze over and automatically expect not to understand it. “It’s ‘big,’ and it needs a team of Ph.D.s to analyze it, so how would I possibly get my head around it?” If, on the other hand, there were a more accessible adjective such as “small” or “tiny” in front of the word “data,” then maybe the end-clients would feel happier exploring it. Maybe I am overthinking things. For me, when something is described as small, it seems that bit more manageable.

Whatever the branding, Big Data is taking over our lives, both at work and at home. And the more we all seek to explore it, the more value we realize that it holds. The distillation process from large to small is key in many decision-making processes, and there are of course many potentially flawed assumptions that we can make along the way. But it is always possible to go back and correct those assumptions. Getting to “small data” from “big data” is the key for most of us in our lives, but we only do it if we are not daunted by the scope of the first few decisions.

We live in a data-rich society. If we think that the data is too big to mean anything for us, we are mistaken. It is remarkably easy to turn it into small, actionable data, despite the “Big Data” moniker that follows it around.

big data, data science, data analytics

Published at DZone with permission of Matthew Reaney. See the original article here.

Opinions expressed by DZone contributors are their own.
