Color palettes

For an increasing variable like decades, it is appropriate to apply an increasing color palette. A diverging palette may also be acceptable, though sometimes this requires centering the palette on some special value depending on the data and how it is being viewed. Neither of these are fundamentally requiremed, however, and are not always the best approach. A qualitative palette may work well even for ordinal or numeric data if certain conditions are met.

Let’s consider the colors used in the Community Charts application. First of all, in the plots previously shown, color differences between bars help primarily to visually separate the narrow bars grouped together by month. Second, the decades shown are not evenly spaced in time. Third, the first bar has a color unrelated to the rest of the increasing palette in order to help separate historical data from projected climate model outputs and a 30-year period from decadal periods. This is a good choice. This is just to say that the increasing color palette doesn’t have very many colors in it.

Fourth, while temperature increases are apparent through time, a look at precipitation plots reveals that an increasing color palette has been applied to the levels of the decade variable regardless of whether precipitation increases across decades. “Of course!” you might say. After all, I just got through saying that the increasing palette applies to decades, not climate data. Then why does the temperature plot range from yellow to red and the precipitation plot use blue hues? On this point alone it would be worth considering using the same colors for both plots. How about four more shades of gray? Shades of gray are more discernible to the human eye than similar shades of some other colors.

This choice of “heat” colors for temperature and “wet” colors for precipitation brings up the fifth point about these plots, which is that in general data should not be redundantly represented by multiple plot attributes. It gets people asking too many questions, the frustrating kind, frustrating because the questions could have been avoided by making a less ambiguous or confusing plot.

In this case, temperature is represented by height, a position along the y-axis, labeled with temperature values, and similarly with precipitation. They should not also be differentiated by color palette. Even though this is across plots rather than within plots, the two are often taken together. If the temperature plot was nothing but orange and the precipitation plot nothing but blue, this would be a fair distinction between plots. In such a case, color would truly pertain to climate variable. Use of the two gradients for decades in conjunction with colors suggestive of each respective climate variable, however, is not best practice and potentially confusing. Colors undeniably are intended to represent decades, as the legend in each plot shows, but here we are trying to use that color to represent two different things, one of which is already represented by height.

Lastly, in these plots 30-year historical and decadal bars are ordered sequentially and there are very few of them per month to concern ourselves with. Nevertheless, since they are not explicitly labeled, a color key of some kind is still necessary. However, not only ought they not be colors which could be confused with temperature or precipitation values, but considering there are only four colors in the increasing palette anyhow, which refer to unequally spaced decades no less, it would not be the end of the world to offer a qualitative color palette option.

Some users may find this easier to see, and with a combination of unequally spaced periods of unequal length, separated in time by a number of years, a case can be made for treating these period categories as falling somewhere between nominal and ordinal forms of data. Either approach is appropriate.

Here are a couple examples using qualitative colors.

Anchorage temperature

Anchorage precipitation