The Blackboard

Where Climate Talk Gets Hot!

Skip to: Content | Sidebar | Footer

Inadequate Reasons to For Suggesting the Falsification of IPCC Projections Doesn’t Apply.

17 March, 2008 (10:22) | Uncategorized

For some reason, many in comments seem anxious to find that the IPCC projections and their uncertainty intervals must be consistent with the recent weather pattern. The objections are scatter shot, odd, and unceasingly wrong. I will characterize some, and responde:

  1. The current falsification of the 2 C/century based on 6 years and 1 month of data doesn’t count, because one can show that 5 year trends with similar slopes occurred during the previous 30 years.

    Sorry William, but statistically, your funky little graphs tell us practically nothing if we assessing whether or not the recent IPCC claims are sound. The reasons they tell us almost nothing are:
    a) the period you selected has a trend lower than 2C/century, so more zero and negative slope periods are expected during that period rather than the recent one,
    b) the period you selected includes three major volcano eruptions which cause the major excursions from the mean trend. Larger variability, in and of itself resluts in a wider range of slopes than would occur during a period with no major volcanic eruptions. (The effect of the variability that occurred is incorported into the uncertainty intervals for the slope obtained using a regression),
    c) the period you selected has a larger serial autocorrelation in the deviations from the mean trend. This is due in large part to the volcanic eruptions which create an autocorrelation in the anomalous forcing. This serial autocorrelation is a very dramatic anomalous forcing further increases the apparent variability of 5 year trends beyond what would be expected during a period with no volcanic activity and finally,
    d) though this is trivial compared to the other inadequacies in your argument, the fact that it’s a five year trend, means that we expect larger variations in your slopes than in my falsification region.

  2. The current falsification doesn’t count because the projections into the future data before projections were made.

    Several people have suggested this. In particular, in comments at Stoat, Atmoz wants to include data from before 2001, and to start by identifying a local minimum in the data. William Connelly wants to ‘prove’ that flat trends are consistent with 2C/century by showing flat 5 year trends occurred during the seventies.

    Logically, if the IPCC made projections they ought to be tested against data that arrived after the projections were made. If we do not restrict ourselves this way, we permit the IPCC to “validate” against data they used to make their projections. This is not reasonable.

    But, even more importantly, both projections into the future and hind-casts of past data are based on an underlying theory. That theory itself says that the trend in the seventies was not supposed to be 2C/century.

    It may be ok to sneak a peak at that data to get a gut feeling about whether or not a single falsification is likely to “stick”.

    However, suggesting one can save “predictions” by including data that came in before the predictions were made is cheating. In anycase, when setting up a hypothesis test, we already assumed the IPCC projections were correct at least in part on the basis of past data. So, we cannot, in some sense “re-prove” their correctness based on that same exact same data we used to assume the correctness for the purpose of a hypothesis test.

  3. The current falsification doesn’t count unless it falsifies for every possible cherry picked subset pulled out of the full set.

    If one pulls the maximum out of a set, selecting end points based on looking at the data, limited by no rule, one will often be able to get the result one wishes. This is always true for any data set.

    Boris is suggesting that we should start in 2001, and then pretend it’s 2007, ignoring data afterwards. In which case, he guesses the IPCC will over-predict the trend. Unlike others who wish to use data used to make the projections to falsify them, this one wants to use even less data. However, it turns out that if we pretend it’s 2007 is, the central tendency of the IPCC and the uncertainty bands still fall above the best fit climate trend. However, we can’t falsify the IPCC trend based on that string because the uncertainty intervals are too large.

    To apply statistics, one must select a criterion for both validation and falsification. To deny the validity of a proof, one ought to have some sort of specific criterion to pick an alternate set. Simply picking an endpoint as the worst possible one doesn’t work.

  4. This falsification doesn’t count because the trend is too short.

    This falsification says that given the length of the trend and the weather data that actually materialized after the projection was made, the IPCC projections are falsified to a 95% confidence level. That means: Given the weather we have had since the IPCC made the projections, there is a 5% chance their projections for the climate trends are correct.

    So, why can others find strings where the IPCC trends are not to falsified? Well, for short tests, the major difficulty is β error is large. In fact, with less than 10 years data, if the trend were 0C/century, we expect β=50% of all 10 year trends calculated based on annual averages would fail to falsify a hypothesis that the underlying trend is 2 C/century or greater. (After 15 years, that β~5%). I discuss this here.

  5. The current falsification doesn’t count because the IPCC climate trends don’t include weather.
    No, the IPCC climate projections don’t include weather noise. That’s what makes the appropriate basis for testing!

    If the weather is thought to decompose into a climate trend and weather noise one might suggest that for short periods of time it follows linear relationship:

    T = m t + e

    Where T is the temperature, “m” is the underlying trend– a climate variable stripped of noise, and “e” is the weather noise.

    The current falsification obtains:
    a) the best fit estimate for a climate trend. This trend is a straight line and does not include “weather noise”. That’s the point of obtaining the trend. and
    b) uncertainties in the estimate for the climate trend.

    So, we obtain mlow < m < mhigh. These apply to “m”. The IPCC graphic illustrates projections about the trend, “m” stripped of weather noise. That is: that graphic does not include weather, the analysis I did, obtains “m” which do not include noise.

    (It is also possible to get estimates of the standard devaition of the weather noise. Had the IPCC provided estimates for the magnitude of “weather noise” or made weather predictions, we could test those estimates, by examining the variance in the weather noise obtained from the snippet of weather.

    They did not provide the estimate for weather noise. I don’t need them to test their estimate on “m”.

  6. Boris suggested the IPCC trends are ok because the only include 1σ confidence intervals.

    He thinks to disprove the 95% confidence intervals for the IPCC projections, I need to show that the 95% confidence intervals for the IPCC climate projections in now wayoverlap the 95% confidence intervals based on the weather realization.
    ComparisonThis is wrong.

    When testing two means with known variances, one cannot simply draw the 95% confidence intervals for both and conclude that if they overlap, they agree to 95%. In fact, that doesn’t work at all. To the right, I have the distributions of hypothetical trends, that may have occurred somewhere, some time. Assume the sample mean obtained using 10 measurements for the first is -2.16 C/century; the mean for the other is +2.16 per century. Assume the sample standard error for both means is 1 C/century. The 95% confidence intervals are shown.

    Note that the two 95% confidence bands overlap. However, if you perform a t-test, using a traditional method and compared the trends, you would discover that the sample means of +2.16 C/century and -2.16 C/century differ from each other with α < 0.01%. You an find the formula for doing this in most statistics textbooks.

    If you think a short while, it will become obvious why the means disagree to a very high level of confidence. To find the probability that the two means are equal to any particular value, we must multiply the two probabilities that “x” falls bewteen “p(x)dx” together and then perform a double integral over all “x”. It turns out that when means are identical, the two distributions must overlap a lot more than most people would guess. (Exact analyses depend on the number of data, and many details the IPCC did not include in their projections. But, basically, if the IPCC ranges is not in the range permitted by experimental data, it’s falsified.)

I think this covers most of the criticisms.

In reality: This falsification stands as what it is. It is one test, that indicates the IPCC projections are inconsistent with weather data that arrived after the time the IPCC made their projections.

Given the wording of some of the criticisms in comments here and elsewhere do recall what this falsification does not show and what remains possible (and even probablie):

  1. It does not falsify AGW. The data are consistent with a range of positive trends and the theory of AGW.
  2. It does not falsify the IPCC projections for the trends to absolute certainty. No statistical test can do that.
  3. If we include the 95% confidence intervals, calculated including serial autocorrelation, for year trends, the recent data is consistent with those trends. It’s just on the lower end.
  4. Any trend, including the recent one, can be an outlier. Things that happen 5% of the time, do happen 5% of the time. It is possible the weather will perk up. :)

Comments

Boris (Comment#1162)

Boris is suggesting that we should start in 2001, and then pretend it’s 2007, ignoring data afterwards. In which case, he guesses the IPCC will over-predict the trend. Unlike others who wish to use data used to make the projections to falsify them, this one wants to use even less data.

It’s irritating that you didn’t even come close to understanding my point. My point was that if you go back one year, the IPCC results look fine. A year out from that and they look bad. What does that tell you? Seriously, don’t think about stats, think about the climate system. What does it mean that your hypothesis test gets wildly different results based on one year’s worth of data? It should mean a lot if you’re paying attention.

Let’s see what it doesn’t tell you. It doesn’t tell you that there is a radiative cause for the recent cooling. The cause

must be

short term variation. Why? Because it’s one year’s worth of variation, that’s why. One year is very short term when we’re talking about climate.

So I wasn’t suggesting using less data to determine anything. I think it’s disingenuous to say that I was.

My only point in regards to the 1sd error bars is that you had repeatedly called them “full error bars.” You chose a poor graphic and made a big deal out of the fact that the trend was outside even the IPCC’s confidence intervals. That was shown to be an apples and oranges comparison.

I’d love to continue this discussion, but your insistence on misrepresenting what I say makes this impossible (If I wanted that, I’d hang out at CA more often). In addition, it doesn’t appear you wish to consider anything about the physical climate system. Good luck with your falsification tests. Maybe you could get one published and prove me wrong. I’d very much like to be wrong.

Boris (Comment#1163)

My comment got eaten by your spam filter. It was my penultimate one.

lucia (Comment#1166)

Boris:
What does failure to falsify with less data tell me? Practically nothing!

The problem with short data sets isn’t false positives, that’s controled by α for the test, which we set at 5%.

The problem with short data sets is the high likelyhood of false negatives. β error is high, we expect lots of false negatives. That is to say: With short data sets, we are likely to get lots of “failed to disprove” when the hypothesis is, in fact, false. hat would be point 4 in the post.

How was I to know your point was to do something to increase β error and not to cherry pick the high point? If you nave a specific reason for suggesting something, why not just say it directly?

Anyway, the IPCC projections were still above the central tendency. They just don’t falsify.

Ed Snack (Comment#1170)

Actually Boris, it’s not that your arguments are misrepresented at CA, but rather that your errors and obfustications are clearly identified and your arguments are therefore shot down in flames rather often. If you want your arguments accepted without analysis, post in the cheering section at Realclimate. No one will disagree with your particular beliefs there.

avfuktare vind (Comment#1172)

Boris,

“Let’s see what it doesn’t tell you. It doesn’t tell you that there is a radiative cause for the recent cooling. The cause must be short term variation. ”

So can you please explain how a positive radiative imbalance (IPCC reference) can result in a loss of energy in the climate system?

And when you’ve realized you can’t, can you please tell us what value you think that lends the set of climate models used by the IPCC to justify the policy it is pushing?

Raven (Comment#1173)

Lucia,

This paper was produced last year: http://pubs.giss.nasa.gov/docs.....f_etal.pdf

I found this statement to be quite unbelievable:

“Although published in 2001, these model projections are essentially independent from the observed climate data since 1990: Climate models are physics-based models developed
over many years that are not “tuned” to reproduce the most recent temperatures, and global sea-level data were not yet available at the time.”

Even if we take the statement at face value we know the modellers had knowledge of the temperature data when they published the TAR and I am pretty that models that did not match the trend were dropped or fixed.

However, if it is true you should be able apply the same statistical tests to the to this data. I assume that the warming trend from 1990 will not be refuted by the data even if the satellite data is included. However, if AGW advocates are going to nitpick about the La Nina causing the recent downward trend they should remember that the same could be said for the 1991 volcano and 1998 El Nino which exagerrated the warming trend in the 1990s.

The paper makes this comment as well:

“Although the concentration of other greenhouse gases has risen more slowly than assumed in the IPCC scenarios, an aerosol cooling smaller than expected is a possible cause of the extra warming.”

The less than expected aerosols makes the recent cooling trend even more remarkable.

I went looking for the latest aerosol data and found this: http://data.giss.nasa.gov/modelforce/trop.aer/

I find it interesting that aerosol forcings assumed to be constant since 1990 when the recent data suggests they are actually dropping: http://www.nasa.gov/centers/go.....mming.html

steven mosher (Comment#1179)

Lucia

I think you have nailed it. But I have one issue. The term falsification. I’d argue that the term
Disconfirm is more appropriate. Theories are never verified or falsified. They are confirmed and disconfirmed.

repeated disconfirmation results in a theory being dismissed or forgotten or ignored.

lucia (Comment#1185)

Hmm.. That word may be better as a matter of common usage. I guess I used this one because initially asked what sort of data would falsify. But in hypothesis testing we falsify, or fail to falsify hypotheses! Those are the terms. Confirming is actually quite rare.

Though, it can be done, as I pointed out in the post on β error.

The way it works is this: if you falsify enough supporting hypotheses, and do so repeatedly, you can later falsify any theories that require a specific hypothesis to be true.

But hypotheses are smaller things than theories. They are also generally very precise. That’s why we can test them using standard statistical methods.

AGW is a broad theory. It really isn’t possible to test it statistically. All we can do is test supporting hypotheses. “The global temperature trend was X C/century during the seventies” is a testable hypothesis.

nick (Comment#1194)

There’s a simple way of doing the test. For every month you have actual data.

If Actual > Prediction score 1 point for AGW.

If Actual < Prediction, score 1 point for Skeptic

Since its expected that this is a 50-50 bet, you can then use the binomial distribution to put some odds on the prediction being correct.

This avoids all the question of error bounds. Here the alarmist position is that if the temperature is in the error bounds, its proof of the prediction.

The flaw is that you also need to compare this against the natural trend null hypothesis. ie. Extrapolate 1900-1950 (pre AGW) and compare against actual with an error bounds constistant with the historical record. That also fits.

ie. The test they are making isn’t that AGW is proved. It is just that AGW is consistent. Lots of other hypothesis are also consistent.

They are making an incorrect inference that because (maybe) their model is consistent with reality, that its the only model.

Nick

nick (Comment#1339)

I think I’ve spotted one problem.

IPCC makes it’s prediction (I always thought it was 3C per century, not 2C per century)

The test for AGW is twofold. First that temperatures match the prediction. Secondly, and almost always missed off, that the temperatures don’t match the null hypothesis of no anthropogenic effect.

The question is then, what’s the null hypothesis?

Attempt 1, is that natural temperatures follow this distribution

t = k + e, where k is a fixed number.

The problem here is that temperatures were rising prior to enough CO2 being present.

t = k t + e

Where k is the rate of change prior after removing the models prediction of temperature rise due to CO2.

Unless you can show why this natural rise has stopped, you have to conclude it carries on.

Nick

Write a comment





Contact Lucia