0:03
This is Lecture 2 for Lesson 5. Let's talk about colors on maps.
So, colors on thematic maps should really correspond to the types of data you're
trying to show. So, do you have numerical data or do you
have categorical data? Did you take a census and count up a
bunch of things and come up with a rate? Or do you just have the overall
preference for let's say pizza toppings or somehting like that per county.
And color schemes are designed then to include three major types that correspond
to whether or not you have numerical or categorical data.
You have sequential color schemes where you show less stuff to more stuff.
You have diverging color schemes where you're showing stuff above and below an
average. They have qualitative color schemes which
use a different color to signify different category.
So, let's say apples and oranges, and pizza, curry [UNKNOWN] and tonkatsu.
And I'll show you some examples now. So, let's start with a bad example, just
so that I could pound it into your head, make sure you don't do this ever when you
make a map. This is a sequential set of data here
showing the internet users per 100 people per country.
So, it's less of that to more of that per country.
But I've applied a qualitative color scheme just to kind of throw you off
here. A lot of people do this sort of stuff
though as a general rule because they don't know any better.
But why would you have purple being more than green, which is more than orange,
which is more than yellow, and then more than red.
You could kind of learn how to read this map.
But in general, it's very hard to pick out right away which places are high,
right, and which places are low. Or what's the average?
you can't do it very easily. You have to really study this to figure
it out. Contrast that to the appropriate use of
sequential colors here. This is sequential data, so let's look at
sequential colors. This uses five different variations in
hue, lightness, and saturation, to show less internet users per 100 people, and
then more at the high end. And you could see right away where the
low places are versus where the high places are.
It's pretty simple. You can do the same kind of thing, using
a diverging color scheme, which I think are quite useful.
So, what you do with diverging colors is kind of similar but you pick a middle
range. And you say this is the average category.
And then, you make that maybe white or gray or something rather neutral because
the average is not that interesting to you.
But what you want to show here is two ends of the spectrum.
So, we've got green scheme kind of going upward from the average so that you can
see the high areas. And then a purple sort of, I don't know
what you call it, mauve color, below average, you can look at low places.
And I think divergent schemes like this are great for showing really, you know,
what's above and below normal. And then, this is the appropriate use of
a qualitative color scheme. Let's say I had every country in the
world vote on one preferred pizza topping.
So, you had to have like have a contest and pick one for each country.
In this case, you know, Australia and United States and Canada love them with
some sausage. And Brazil's really excited about
pineapple, along with Saudi Arabia and Russia, which totally makes sense, right?
So, qualitative colors make sense here. Because a named hue that's you know, red,
or purple, or green, or whatever it is. You know, corresponds to the fact that we
have very different answers here that really aren't comparable or it's not less
or more, right? Pineapple is not more or less than
pepperoni. They're just different things.
So, in that case when you have different things you're trying to show that are
categorically different, qualitative colors make a lot of sense.
3:20
Speaking of color, I want to talk about rainbows and why rainbows kill people.
So, rainbow color schemes, which are also called spectral color schemes, that's the
right way to call them, are commonly misused on maps.
Weather maps are some of the worst offenders, where you show, show
precipitation amounts from less to more by showing green going to orange, going
to red, going to yellow to purple to white sometimes at the very top.
It doesn't make any sense. It should just show less to more color,
because it's less to more rain, right? Almost every heat map you see floating
around these days also uses these spectral color schemes.
And they're bad for the reasons I've already mentioned.
They use qualitative colors to show sequential or divergent data.
So, is blue more than yellow? Is purple more than blue?
You might have a legend that tells you the answer to that question.
But it's very hard for people to learn those things.
And it's certainly difficult to pick out a visual pattern right away on a map, if
you've got that kind of thing going on. Furthermore, rainbows can actually kill
people. And that's only a little bit hyperbolic
on my part. A recent study showed that doctors
interpreting medical images of the heart that used spectral color schemes, made
worse decisions about the patients' outcomes.
Than they did using the simple sequential color schemes.
So, I think they actually can kill people.
4:30
Here's an example of a poorly designed rainbow color schemed map just so you
never make them. hopefully you'll never make them anyway.
This shows tweets about Mitt Romney and President Obama during the presidential
election in 2012, and this is just a density map.
these are geo-located tweets, so it should just show west to more of these
tweets, right? Tweets about the same topic.
And it looks really cool with this rainbow color scheme, right?
It looks very lively and, and sort of engaging and dynamic, right?
It looks like you want to understand what's going on here, but take a closer
look at it and try to make sense out of it for a second.
Look closely at the color scale. There's orange, sort of, down towards the
lowest end. And red is at the top end.
And orange and red look similar to each other.
But they're on opposite ends of the actual spectrum.
And then, let's see. Blue is more than green.
Which is, green is more than yellow, though.
So, how would you know for sure, right away, what was the low spots on the map
and what were the sort of hot spots on the map?
I, I mean, it, it looks very compelling, at first, but it's actually almost
impossible to interpret if you really try to go beyond, you know, the presence or
absence of data. Contrast that to this map, which just
shows less to more purple. This is the appropriate way to deal with
this kind of data. Here, I'm showing density, so I just
want to show less to more color, of one color.
And now, I can see right away where I don't have as many of the tweets, and
where I have lots of tweets. So, there you go.
Rainbows kill people. Stop using them.
Data classification is another aspect of thematic map design that's important to
cover in class. So, assigning observations to categories
is called data classification. And there are three major types that I
want you to know about. There's equal interval classification,
quantile classification, and natural break classification.
And there's plenty of other methods beyond that, but it's sort of outside of
the scope of the class to go into that in this, this sort of explanation.
So, the first method that is shown here at the top right is equal interval
classification. What you care about here is establishing
an interval in terms of the value range. So, I, if I have 50 observations along a
set of values like I've shown here, I just decide that I want five classes.
So, I split it up into equal categories, zero to 20, 20 to 40, and so on.
That's the value range, sort of driving the classification there.
In contrast, quantiles, I want to make categories the same size in terms of the
number of observation, not the value range.
So, I don't care what the observations sort of said about, you know, did a
particular place get 22 on a certain score or whatever.
I don't care about that, I just care about having, in this case, five
categories of equal size in terms of number of observations.
So, each category in this case has ten observations.
I count up to ten, and then I make a break, and then I make the next class.
And so on. Finally, natural breaks, in the bottom
right corner, this method is driven by some fancy mathematics.
So, you let an algorithm decide where the so-called natural breaks in the data
distribution are, and it actually works pretty well in practice.
you don't need to understand the algorithm, because it's just not germane
to this class. Let's look at some map examples of each
of these methods though, so you can see how they work.
In this case, I've got some fake data because I love making fake data.
showing the proportion of people that are admirers of beautiful Audi station wagons
like I am. And this map, in particular, shows an
equal interval classification. So, what I care about here, if you look
at the legend is neatness and, and sort of the value range for each category.
So, the first category is zero to 20 people per 100 and so on.
21 to 40, etcetera. And if you look next to that little set
of categories there in the legends, you can see I've actually printed here the
number of states that are in each class. And if you pay attention to this in the
subsequent maps you'll see that it varies quite a bit depending on the kind of
classification I use. So, in this case, in the lowest category
has only six states in it, whereas some of the higher categories have over ten.
8:22
Let's look at quantiles using the same exact data.
So here, remember with quantiles, all I want to do is put the same number of
observations in each class. So, I count up the first ten states, and
then I put them in a class. And then, I go for the next ten
observations and put them in a class. And then, the value range can vary.
And there's not an equal interval in between them.
So, the first category here is zero to 30.
The second one is 31 to 45. So, it's much smaller than the first
category. And so on.
And you could see though, that the number of states per class, which is what
quantiles do, stays the same, right? So, you end up with a nice, visually
balanced map when you do quantile classification.
8:58
Finally, you've got natural breaks. And all these differences are a little
bit subtle, but you can tell different stories using the different
classification methods. Natural breaks is using some math, behind
the scenes, to try to figure out where the so-called natural, sort of, down
points are between clumps in the data. And you could see that the class breaks
vary quite a bit, so you have zero to 14, 15 to 32 is the next one.
33 to 49, etcetera. And in the lowest class, there's only
five members in that class. There's only five states in that low
category. But in some of the higher categories,
there's 12 or 13. So, you can end up with very different
visual balances on your maps, depending on the kind of classification you use.
9:36
Finally, I feel like I'd be remiss without talking a little bit about
textile maps. You're not going to do a lot of text
design on the maps that you make in this class because of the tools we're using.
But designing textile maps is a huge aspect of cartography and frankly there
could be a whole class just on that. And just to give you a flavor for it,
think about choosing fonts and positioning labels to make a map work for
your entire city that you live in at multiple scales, that would work on a
mobile device regular computer, and in a big printed sheet.
It's really hard work. And there's still a lot of manual effort
that goes into placing labels and making sure those things are readable on a lot
of the maps that you use everyday. And one very simple thing that's kind of
a fun little thing to learn about text design of maps is this graphic shows
here, is positioning labels. And this is something that hopefully you
can carry forward and use in other things you might do.
If you have a point here, like if I have this purple point, and you have to assign
a label to it, and you want to position that label somewhere, the best spot, spot
to put it first is where position one is. And if that position doesn't work because
there's a road crossing it, or a city there, or something else that blocks it,
you can go to position two. And if that doesn't work, then you can
try position three, and four, and five, and six, and so on.
So, this just gives you a way to prioritize label placement.
That's kind of quick and dirty. And hopefully, something you can take
away from this class. So at this point, you've learned about
the geospatial revolution. With spatial thinking, and thinking like
a geographer is all about. Where spatial data comes from how spatial
analysis works. And now, a fair bit about how to design a
good map. Hopefully, this has been an engaging
enterprise for you to take this MOOC. And hopefully what I've done with these
lectures and the written content and the labs and stuff made sense.
And you're able to take some of this forward with you.
And make great maps to share with everybody else.
Appreciate your time and I had a really fun time making this class.