entice all of the hype lately inside knowledge science, however I’d argue they’re each secondary to a extra essential—and often-ignored—part of the sector.
When coping with knowledge, there are two important steps:
- Processing and analyzing the information to extract significant insights.
- Conveying these insights to others.
The second level is essential and sometimes neglected. The world’s most superior algorithm or useful perception is ineffective if nobody can perceive it. As an information scientist, you need to be taught to convey your insights to others. There’s multiple purpose for this, with the obvious one being that if the correct individuals perceive the information, the world at massive will profit. Nonetheless, there’s one other equally essential purpose: It’s usually in describing our findings to others that we uncover errors, extra profound information, or additional areas for exploration.
On this article, we’ll look at a robust and efficient device which may help obtain the second step above: knowledge visualization. That is the primary in a sequence of articles that may take absolute newbies deep into the realm of knowledge visualization. This primary article is common and lightweight, meant as an introduction to the sector as an entire. In later articles, I’ll get into the extra technical features, ultimately concluding by instructing you how one can construct your personal knowledge visualizations.
With that information, you’ll be armed to sort out your knowledge in new, thrilling methods.
“The best worth of an image is when it forces us to note what we by no means anticipated to see.” –John Tukey
What Counts as a Information Visualization?
Many individuals view knowledge visualization by a restricted lens, solely classifying normal graphs, reminiscent of bar charts, line charts, and the like, as true knowledge visualizations. Considered from this angle, knowledge visualization didn’t materialize till the center of the 18th century. (We’ll see some examples under.)
Nonetheless, we might do properly to broaden our minds. Visible transformations of knowledge are in no way restricted to our conventional concepts. They’ve been round for 1000’s of years. For instance, right here is the Imago Mundi [1], the oldest recognized map on the earth, found as a relic of the traditional metropolis of Babylon:
This map locations Babylon on the heart and was doubtless an especially useful gizmo for visualizing what we now formally name geospatial knowledge. It is likely one of the world’s earliest knowledge visualizations.
There are a plethora of comparable figures and pictures from numerous historic civilizations—cave work, calendars, stone carvings, even Egyptian hieroglyphics—these are all successfully visible representations of knowledge that have been obscure of their preliminary type. Viewing these examples as knowledge visualizations leads us to an essential precept:
At its core, knowledge visualization is nothing greater than taking some knowledge—be it numerical, textual, or in any other case—and making use of a change to characterize it visually.
This foundational precept results in a number of associated subjects primarily involving the simplest strategies to conduct these transformations, the place efficient loosely interprets to “sincere, straightforward to know, and informative.”
Early Examples of Information Visualizations
Now that we now have broadened our views regarding what constitutes an information visualization, allow us to check out some trendy examples. Under is a chart from 1644 developed by Michael Florent Van Langren [2]. It is likely one of the earliest graphical representations of what we take into account to be conventional statistical knowledge, depicting estimates of the distinction in longitude between Rome and Toledo.

Let’s take into account a extra concerned instance subsequent—one which instantly highlights Tukey’s quote above.
Under is a map of London’s Soho District in 1854 [3]. It was designed by John Snow so as to decide if there have been any patterns within the cholera outbreak that was debilitating the city on the time:

Trying towards the middle of the map, we are able to see an exceptionally massive variety of deaths close to the water pump on Broad Road. An investigation decided that this pump was contaminated and was a significant reason for the unfold of the illness.
This instance highlights precisely the precept from John Tukey we famous above: Probably the greatest makes use of of knowledge visualization is to shortly see insights which can be tough to search out within the knowledge’s preliminary type.
Precision and Flexibility
Information visualization is a broad and deep matter that may be approached in some ways. That stated, there are two ideas that you need to take note regardless of the particular type of knowledge visualization you have interaction in: precision and flexibility.
A superb knowledge visualization doesn’t attempt to accomplish ill-defined duties, reminiscent of displaying the essence of or summarizing every thing essential a couple of knowledge set. Statements like these are subjective and primarily inconceivable to realize.
Fairly, knowledge visualization highlights a selected and well-defined side of the related knowledge in a means that makes it simpler to know for the consumer. You must all the time articulate precisely what you need to categorical about your knowledge earlier than you even start designing a visualization.
To internalize this precept, it’s useful to recall what the aim of an information visualization is to start with: to show insights from an information set in a transparent and helpful means. We need to make the information simpler to know. Being exact ensures we obtain this objective. A visualization that makes an attempt to do an excessive amount of would possibly find yourself complicated the viewer much more. It’s a lot better to supply a visualization which covers much less knowledge in a clearer means. High quality is extra essential than amount.
Check out the information desk under, which incorporates details about salaries from totally different cities round the US.
Identify | Metropolis | Earnings | Occupation |
---|---|---|---|
Sarah Mitchell | Denver, CO | $72,500 | Advertising and marketing Supervisor |
Jamal Rodriguez | Houston, TX | $58,300 | Electrician |
Priya Desai | Seattle, WA | $91,200 | Software program Engineer |
Thomas Nguyen | Chicago, IL | $64,800 | Nurse |
Which of the next is the higher visualization alternative for the above knowledge?
- A visualization that makes an attempt to simplify the data within the knowledge desk utilizing a bar chart that has names on one axis and salaries on the opposite axis, makes use of colour to distinguish amongst cities, and makes use of a texture on the bars (dashed traces, diagonal traces, and so on.) to tell apart amongst careers.
- The identical visualization as above, however this time excluding the majors. In different phrases, a bar chart of names and salaries which colours the bars primarily based on location.
It’s tempting to decide on the primary one, however the truth is, it tries to do an excessive amount of. Higher to show restricted, focused info than to confuse your viewers.
Along with being exact, sustaining flexibility can also be essential. There isn’t a such factor as an ideal knowledge visualization. There’s all the time room for enchancment, and knowledge visualizations usually change into higher with every revision. After all, in some unspecified time in the future, an information visualization should be shared with others and serve its goal.
This results in a quandary—how a lot revision is sufficient revision? There isn’t a definitive reply to this query. The method of revising a visualization should be undertaken with care. Asking too many individuals for recommendation will doubtless lead to a bunch of half-baked, conflicting opinions. Then again, publishing the primary draft of a visualization—i.e., not revising it in any respect—is more likely to result in a subpar consequence.
Though there isn’t a excellent resolution, there are just a few tips you possibly can comply with:
- Establish 2-3 individuals to provide you suggestions in your visualization.
- Strive to make sure your checklist of individuals encompasses the next:
- A reviewer who’s proficient in designing knowledge visualizations
- A reviewer who has a powerful understanding of the information that’s getting used to develop the visualization (e.g., a political scientist for election knowledge)
- A reviewer who’s a part of the meant viewers for the visualization
- Undergo 2-3 rounds of suggestions and revision with this identical checklist of individuals. It will be sure that enhancements to the visualization are steady and logical.
Last Ideas and Trying Ahead
In some ways, knowledge visualization is akin to writing. Even essentially the most prolific and proficient authors have editors, and their books undergo in depth revision earlier than being accepted for publishing. Why? For the easy purpose that good writing is essentially depending on the viewers, and punctiliously curated revision ensures one of the best expertise for the eventual readers of a e book. The identical thought applies to knowledge visualization.
By following these tips, you possibly can make sure you develop a strong knowledge visualization which is grounded in greatest practices, accurately shows the information at hand, and is comprehensible for the meant viewers.
They’re the important thing to efficient knowledge visualization, and the muse for superior visualization methods that might be mentioned in future articles. Till then.
References
[1] https://commons.wikimedia.org/wiki/File:The_Babylonian_map_of_the_world,_from_Sippar,_Mesopotamia..JPG
[2] The Visible Show of Quantitative Info, Edward Tufte
[3] https://picryl.com/media/snow-cholera-map-1-cbadea