Does the IPCC describe uncertainties in a quantitative way that is accessible to readers? I expressed my opinion in comments: the IPCC generally resorts to qualitative discussions, and in particular, they become quite qualitative when discussing individual models.
I think this vagueness is unfortunate as it results in various people arguing about whether the models are “good”, “bad”, “useful”, “useless” without ever figuring out whether they disagree about a) the quantitative measures of accuracy and precision or whether they b) agree about the level of accuracy and precision, but disagree about whether that level of accuracy is sufficient for a particular use.
For example: Are we arguing whether or not the models predict the temperature of the planets surface within ±5C? Or do we agree on they can do that, but we are arguing about whether ±5C is sufficiently accurate for some particular use? (And in that case: what use?)
So, my contention is this: The IPCC, at least in the AR4, tended to avoid describing the level of accuracy easily digestible form.
This was not always the case. Atmoz provided a link showing that a clearly labled Taylor diagram was included in Chapter 8 of the IPCC’s Third Assessment Report (TAR.) These sorts of diagrams permit readers to quickly assess how closely each model matches observed data overall.
Diagrams weren’t provided for everything in the TAR, but some where. So, clearly the IPCC did at one time provide some accessible quantitative assessments of the individual models on which they based their projections.
However, as far as I can see, the authors of the Fourth Assessment Report (AR4) tended to omit these sorts of clear synopses from Chapter 8. In the AR4, narratives discussing individual models start out seeming quantitative. Alas, like a mirage in the desert, this “seeming” vanishes as the reader takes the time to examine the figures while reading the text.
Let’s look at an example: The comparison of simulation results to observations of mean surface temperature.
Comparison of Simulation Results of IPCC Models to Observations of Mean Surface Temperature
Section 8.3 of the WG1 report to the AR4 is called “Evaluation of Contemporary Climate as Simulated by Coupled Climate”, and so is the section where one might seek comparisons. Section 8.3.1.1.1 discusses temperature and we find discussion and images of:
b) the the difference between predictions of annual average temperature averaged over all models and observations and
c) the difference between observations and predictions of annual average surface temperature by individual models.
Instead, I’ll skip the discussion of (a) and (b) , and focus on (c)
With regard to individual models, the text says, in words:
“Individual models typically have larger errors, but in most cases still less than 3C except at high latitudes (see Figure 8.2 b and Supplementary Material, Figre S8.1) Some of the larger errors occur in regions of sharp elevation changes ….
Ok.. so I read “less than 3C”. That seems quantitative. But what does the qualifier “in most cases” mean? And what of the caveats: a) outside the high latitudes, in regions without sharp elevation changes, c) not including the eastern portions of tropical basins. How much does that leave?
Unfortunately, the figures aren’t conveniently located near the text. So, we must find Figure S8.1 in the supplemental materials. Here is the first individual model shown, BCC-CM1:
Using my eyeballs finely tuned to detect colors, I would say errors are greater than ± 3C in many regions of the globe. All of Central America and the Gulf of Mexico are dark blue (-5C). There is quite a bit orange (+3C-+5C) over Asia– at the latitude of Chicago, which I think is not considered “high” latitude. What of the dark blue (-5C) over south east Asia?
So, would you say this model has errors less than less than ±3C except at high latitudes? Or that outside the high latitudes errors are less than 3C except in places with rapidly varying terrain?” (Like, for example, Florida and the Gulf of Mexico? )
Admittedly, BCC-CM1 appears to be the worst vis-a-vis this particular metric. So, what about other models? Not to pick on models by bloggers… but….
What about GISS Model E?
On this metric, GISS Model E looks better than BCC-CM1. But, I think it also displays large regions with errors noticeably greater than ±3 C. — and mean regions well away from the poles.
But maybe you’ll see something different when you look at this:
What are we to make of those large blobs of dark blue (-3C to -4C) off the east coast of the mid-atlantic states? Or the fire engine red off the coast of Equador, Peru and Chile?
I guess off the coast of Equador and Peru is in the eastern part of a tropical ocean basin. But… isn’t there some red next to Chile?
In any case, at least in my opinion, the wording of the IPCC document tends instill the idea that errors are bounded by roughly ±3C. That’s the only number mentioned. Certainly, the words never mention the magnitude of the maximum errors on the graphs: 5C. In many individual models, errors this large appear in fairly large regions, both inside and outside the high latitudes.
What about the rest of the GCMs?
Scanning over the two pages of graphics, overall, the rest of the models look as good (or as bad) as Model E. Model GFDL-CM2.0 seems to be off by more than ±3C over nearly the whole northern hemisphere. Or… maybe it just seems that way to me. Being off by 5C is certainly never called out in the narrative.
So do the words and the figures match?
Let’s read these again:
“Individual models typically have larger errors, but in most cases, less than 3C except at high latitudes (see Figure 8.2 b and Supplementary Material, Figre S8.1) Some of the larger errors occur in regions of sharp elevation changes ….
Well, clearly this is spin. The commonly occuring 5C is never mentioned. Even given the caveats, seems rather misleading. At first glance, reading the words without looking at the graphs tucked away in a separate section, I thought the text suggested
“More than 50% of models have maximum errors less than ±3C (except outside high latitudes.”
Now, I’ve looked at the graphs. I’m not going to try to count specks, the text can’t possibly mean that. Maybe it means:
Or, it could mean something else. I can’t really integrate all those color pictures by eyeballing them.
Frankly, I think it might have been better if the IPCC had quantified what they believed about individual models using descriptive statistics.
How could the IPPC have quantified the data comparison?
There are many options; here are three:
- The IPCC could have provided a Taylor Diagram, calculating the rms of the errors indicated in the full collections of these figures. The result for each model could have been indicated and labeled in the legend.
- The IPCC could have produced figures similar to Figure 8.11 which shows the rms for climatological seasonal patterns, but used it to describe this metric. (They also could have included for all models used in predictions instead of only 14. )
- The IPCC could have provided a table showing the rms error for each model.
The data were available to turn into pretty color graphics. That means the data were available to post-process and create Taylor diagrams, tables or other charts.
This would have clarified the uncertainties for readers.
Alas… no…. in the AR4, the IPPC gave readers ambiguous prose accompanied by pretty colored pictures buried in the supplementary materials. And this is why, I periodically say the IPCC resorted to vague qualitative discussions of the accuracy of the models they used to make projections.
If any one in comments or at another climate blog wishes to defend the virtue of these models by citing literature that makes up for these lapses in the AR4, I would welcome that information. I’ll happily put up with rhetorical eyerolls. 🙂
My interpreation on first reading ‘mostly, except at high lattitudes’ is ‘if we exclude high lattitudes then mostly’
Normally when you read a ‘mostly’ etc our brain says ‘exception in the minority’. However two such minority exceptions in a row can lead to a misleading conclusion. The ‘except high lattitudes’ part may be 70% of the globe. And the ‘mostly < 3 degrees’ may be 70% of the ‘non high lattitude part’. Overall we could then have 70% of 70% = 49% of the globe in the ‘<3 degrees part’ of the globe.
Michael– That’s sort of just it. Once can claim the “mostly” wording in that sentence is true in some sense. But then again, one can defend Bill Clinton’s statements about the meaning of is, or not having sexual relations with Monica Lewinski.
In reality, if you knew someone who used language that way, you would consider them either intentionaly evasive or just incomprehensible. In either case, if you needed to know something, you would not rely on that person.
I don’t know if this helps you, but the proper placement of the second comma removes some ambiguity.
Individual models typically have larger errors, but in most cases still less than 3C, except at high latitudes (see Figure 8.2 b and Supplementary Material, Figure S8.1)
Assuming the author knows how to use the comma properly, both “… but in most cases still less than 3C …” and “… except at high latitudes …” independantly describe the “… have larger errors …”
In effect, we can read it as two seperate sentences which read, “Individual models typically have larger errors (in most cases less than 3C),” and “Individual Models typically have larger errors (except at high latitudes).”
And we are going to base stringent constraints on the world economy that might spin it out of control and destroy the struggling third world,( and maybe kill billions from famine and disease) on the placement of a comma?
Before entering the fray of anthropogenic CO2 or not, I spent over a month reading the physics justification of AR4. Rather trying to read, in between stops of pulling at my hair at the impossible stuff I was reading and expected to believe. I followed individual model runs for cloud formations, for example, which are not colored scatter plots, as well as the errors. I found that individual models each fit part of the data, and the color cloud combinations of them made the eye think that the “average” fitted. There was deliberate intent fool the cursory reader in all the plots.
In the plots you show, you will see that the large errors occur in the regions where heating is also large, so it seems the recipe is similar as for the hockey stick: where the errors are large throw large temperatures. I call this deliberate sleight of hand in biasing temperatures.
We are lucky there exist satellite data.
Hadcrut for May is out. Slightly higher anomaly then April.
Raphael–
Placement of the commas may change the meaning– but the meaning is not still not right. Most individual models have regions with errors larger than 3C, even outside high latitudes. Errors are large as 4C and 5C exist even outside high latitudes, places with rapidly varying terrain and the easter part of tropical basins.
Just calculating the rms of the errors and reporting that would have given people a standard metric for comparison.
And… anyway.. you suggest this:
The previous sentences says errors are 2C. So, if they mean that the meaning is totally wrong. All individual models have errors larger than 2C at high latitudes!
So… the individual models typically have larger errors… than what?
Anna
I disagree on ‘deliberate’. But the fact that the largest rates of warming are predicted in the locations were– as far as we can determine– the errors are largest, is a potential for introducing bias in the projected trends.
Well, consider the audience. The audience for this document is policy-makers and the general public. On that point, I’ve felt the IPCC’s document was written well for that audience. For the rest of us who have gone through it scratching our heads, the document doesn’t do that well of a job.
Plus, the new “everything is now consistent-with what the IPCC and our models predict” mantra I’ve seen at Grist and other places really has me going, “wait, what? What happened to weather does not equal climate and no one weather event can be blamed on
global warmingclimate change?”it’s a head scratcher. I am hoping the next IPCC update is a little better.
Terry–
I think the omission of tables and figures to summarize the information in the color diagrams is a particularly bad omission because the audience is supposed to be policy makers.
Regulating and permitting agencies always want summary information to guide their decisions. The pretty pictures are great as supporting background information, but to make final decisions, they want to know the actual differences in concrete terms.
The IPCC is written as if exactly two groups of people read the documents:
a) Climate modelers who should be reading the literature in peer reviewed articles anyway and
b) Grade school teachers and PR folks who are going to only read the FAQ’s and the asides in those sidebar boxes.
The tend to skirt the needs of people in between. These people are those who actually want information but may want to spend most of their days on things other than pouring over the climate science literature.
Lucia,
could they mean AVERAGE GLOBAL TEMP error less than 3C as opposed to specific areas?
They do seem to think averaging everything takes care of all issues.
KuhnKat– No. They can’t mean that because they just said the average over all models has an error less than 2C (outside high latitude regions)! That discussion isn’t quite as confusing. It’s still a bit tendentious because they never spit out the fact that the error in the polar regions appears to be 5C.