Over a million developers have joined DZone.

5 Mistakes in Visualizing Different Types of Data and How to Overcome Them

Today we have lots of tools to turn our data into pictures, graphs, and charts. But a picture isn't worth 1000 words unless it makes sense. Here are some common mistakes people make with data representation with some suggestions about how to avoid them.

· Big Data Zone

Learn how you can maximize big data in the cloud with Apache Hadoop. Download this eBook now. Brought to you in partnership with Hortonworks.

The popularity and impact of data visualizations has increased dramatically over a relatively short space of time. Google Trends shows a near 100% increase in search frequency for data visualizations since 2009, and we have seen a multitude of tools and software become available, allowing almost anyone to create data visualizations with relative ease.

We are instinctively more drawn to images than text, as the brain is able to process images at a far quicker rate. However, this doesn’t mean you can just throw together a mass of images and shapes onto a dashboard and expect to wow your audience. Much like the cognitive aspects behind our attraction to images, there are other inherent – and to some extent, subconscious – behaviors that become relevant. One of those is first impressions.

We all know the saying: first impressions last a lifetime. But how much truth is there behind it? Well, as it turns out; quite a lot. Similar to the instinctive fight or flight response, humans perform an act of unconscious thinking called rapid cognition; more instinctual and quicker than the deliberate decision-making style of thinking we are accustomed to. Rapid cognition is our ability to dig deeper and gauge what is really important from a very short experience. As much as we’re told to never judge a book by its cover, this ability to rapidly parse through large amounts of information and decide what’s most important without engaging in slower, more rational ways of thinking is something we do every day.

Psychologists call the phenomenon ‘thin-slicing’: perceiving details or information within seconds that might take months or years of evaluation with the rational part of the mind. Malcolm Gladwell describes it as the following:

Thin-slicing is not an exotic gift. It is a central part of what it means to be human. We thin-slice whenever we meet a new person or have to make sense of something quickly… …we come to rely on that ability because there are lots of situations where careful attention to the details, even for no more than a second, can tell us an awful lot.

The good news is, you’re able to change and disprove any false first impressions someone may have of you as they get to know you. Online, however, this is much more difficult as our attention spans are at record lows. With it being more difficult than ever to arrest your reader’s attention, you can’t afford to let bad first impressions get in the way of your data visualizations – especially when the message that’s buried deeper is well worth exploring.

To prevent this, we’re going to discuss 5 of the most common mistakes to avoid when it comes to visualizing different types of data.

1.  Data Overload

Many data visualizations and BI dashboards fall victim to data overload – the overcrowding of content, some of which may not add anything to the understanding of the data. For example, while a 3-dimensional chart may look impressive, they can often make the interpretation of data more difficult.

In the same vein, a BI dashboard with 5 charts and numerous labels may showcase a notable amount of findings but is ultimately useless if your reader cannot distinguish what they’re looking at. Unnecessary illustrations, drop shadows, fonts, and ornamentations can distract from the data, so use them sparingly. In most instances, less is more.

2.  Accessing Axis

When dealing with quantitative data, bar or line charts are two of the best methods of visualizing your content. One common mistake is often to do with the chart axes; while it may seem efficient to start the Y-axis value above zero for larger values, this can truncate the bars and prevent an accurate representation of their values.

3.  Don’t Slice Too Thin

Dealing with whole numbers, data often comes in the form of part-to-whole relationships, better known as pie charts. Pie charts are an extremely popular method of conveying data, and yet are much maligned for being, as Walter Hickey puts it, “incredibly bad at the one thing they’re ostensibly designed to do”.

Without section labels, it’s actually very difficult to distinguish the sizes of pie ‘segments’ (could you tell the difference between 36% and 37%?) so ensure all areas of your chart are clearly labeled. Also worth considering is the number of categories used; too many different segments can make it hard to differentiate between each.

4.  Crossed Wires

Data that lies within a certain range is often used to showcase change over time. Line charts are therefore an effective way of conveying the changes or differences between the data over time. You may have started to notice a trend here, but it’s important not to use too many lines in your chart. Having a mass of interchanging lines across a chart can quickly become confusing, so we suggest not using any more than 4 series.

5.  Be Appropriate

Heat maps are one of the newest charts in the data visualization world and have quickly become popular. Using geographical space as a base is perfect for categorical data, but there are a few obstacles that can trip you up. Color and data ranges should both be used appropriately with heat maps.

Some colors stand out more than others, which can give unnecessary weight to data. Instead, use a single color with varying shades to show levels of intensity. For the data itself, select 3-6 numerical ranges that distribute the data evenly between them. +/- signs can extend the high and low ranges.

Effective storytelling through data is a required skill which will help you drive influence in your organization – download this whitepaper now to learn more!

Hortonworks DataFlow is an integrated platform that makes data ingestion fast, easy, and secure. Download the white paper now.  Brought to you in partnership with Hortonworks

Topics:
graphing ,visualization ,data presentation

Published at DZone with permission of Josh Anderson, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

SEE AN EXAMPLE
Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.
Subscribe

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}