<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Blackboard</title>
	<atom:link href="http://rankexploits.com/musings/feed/" rel="self" type="application/rss+xml" />
	<link>http://rankexploits.com/musings</link>
	<description>Where Climate Talk Gets Hot!</description>
	<lastBuildDate>Fri, 19 Mar 2010 21:41:23 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>HadCRUT: Down slightly from January.</title>
		<link>http://rankexploits.com/musings/2010/hadcrut-down-slightly-from-january/</link>
		<comments>http://rankexploits.com/musings/2010/hadcrut-down-slightly-from-january/#comments</comments>
		<pubDate>Fri, 19 Mar 2010 21:41:23 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9984</guid>
		<description><![CDATA[HadCRUT NH+SH posted  their temperatures anomaly for February:


The anomaly was 0.460 C, down from 0.495 C in January. ( Note that the January temperature was revise from 0.470 to 0.495.)
According to my tally, this was the 7th warmest February 4 in the HadCRUT NH&#038;SH record.  February anomalies are circled in the graph above; [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://hadobs.metoffice.com/hadcrut3/diagnostics/global/nh+sh/monthly">HadCRUT NH+SH</a> posted  their temperatures anomaly for February:<br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/HadleyData.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/HadleyData-500x341.jpg" alt="" title="HadleyData" width="500" height="341" class="size-medium wp-image-9983" /></a></p>
<ol>
<li>The anomaly was 0.460 C, down from 0.495 C in January. ( Note that the January temperature was revise from 0.470 to 0.495.)</li>
<li>According to my tally, this was the 7th warmest February 4 in the HadCRUT NH&#038;SH record.  February anomalies are circled in the graph above; as you can see most of those warm Februaries occurred this century. </li>
<li>If computed using data since Jan 2001, both the simple trend based on time only and the trend computed using a multiple regression involving both MEI and time remain negative. The nominal trend of 0.2C/decade remains outside the ±95% confidence intervals for the MEI corrected trend; these confidence intervals computed based the assumption that the residuals for the observations are ARMA(1,1).</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/hadcrut-down-slightly-from-january/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Carrot Eater&#8217;s Challenge: Rate of False Positives when applied to simulations pt. 1.</title>
		<link>http://rankexploits.com/musings/2010/carrot-eaters-challenge-rate-of-false-positives-when-applied-to-simulations-pt-1/</link>
		<comments>http://rankexploits.com/musings/2010/carrot-eaters-challenge-rate-of-false-positives-when-applied-to-simulations-pt-1/#comments</comments>
		<pubDate>Thu, 18 Mar 2010 17:56:00 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9909</guid>
		<description><![CDATA[WARNING: While adding GISS, I noticed I am not replicating my GISS results from the other computations precisely. That means there is at least 1 bug in my spreadsheet.  I&#8217;m looking for the bug.  In the meantime, I&#8217;m blanking out while I find the error. (It&#8217;s dinner time, and I may not find [...]]]></description>
			<content:encoded><![CDATA[<p><font color="blue">WARNING</font>: While adding GISS, I noticed I am not replicating my GISS results from the other computations precisely. That means there is at least 1 bug in my spreadsheet.  I&#8217;m looking for the bug.  In the meantime, I&#8217;m blanking out while I find the error. (It&#8217;s dinner time, and I may not find it until tomorrow.</font></p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/carrot-eaters-challenge-rate-of-false-positives-when-applied-to-simulations-pt-1/feed/</wfw:commentRss>
		<slash:comments>61</slash:comments>
		</item>
		<item>
		<title>UHI in the U.S.A.</title>
		<link>http://rankexploits.com/musings/2010/uhi-in-the-u-s-a/</link>
		<comments>http://rankexploits.com/musings/2010/uhi-in-the-u-s-a/#comments</comments>
		<pubDate>Thu, 18 Mar 2010 17:44:08 +0000</pubDate>
		<dc:creator>Zeke</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9913</guid>
		<description><![CDATA[Teasing out the Urban Heat Island effect can be a fiendishly difficult task. There are enough confounding factors that it is dangerously easy to simply pick a measure that shows what you want to show (be it a negligible or huge UHI) without including the nuances necessary.

Take this graph for example. It shows the U.S. [...]]]></description>
			<content:encoded><![CDATA[<p>Teasing out the Urban Heat Island effect can be a fiendishly difficult task. There are enough confounding factors that it is dangerously easy to simply pick a measure that shows what you want to show (be it a negligible or huge UHI) without including the nuances necessary.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Pseudo-Spencer-Chart.png"><img class="aligncenter size-full wp-image-9914" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Pseudo-Spencer-Chart.png" alt="" width="532" height="307" /></a></p>
<p>Take this graph for example. It shows the U.S. CONUS (e.g. lower 48) trends from USHCN stations in 2.5 x 3.5 lat/lon grids with both less than 10 population density and greater than 100 population density stations (via 2010 GWPv3 data). It compares the raw trend from the low pop density stations to the adjusted trend of all stations in the same gridcells, and finds dramatically different results. The raw data from low pop density stations shows a trend of 0.093 C per decade, while the adjusted data from all stations shows a trend of 0.238 C per decade, some 2.5 times larger. Does this provide clear evidence of <a href="http://www.drroyspencer.com/2010/03/direct-evidence-that-most-u-s-warming-since-1973-could-be-spurious/">“a substantial spurious warming component”</a>? Well, not necessarily. For that strong a conclusion, a more detailed and thorough examination of the data is required.</p>
<p><span id="more-9913"></span></p>
<p>Before we dive into a more detailed examination, lets quickly review the methods we are using to spatially weight and anomolize station data.</p>
<ul>
<li>This analysis uses a spatial gridding model outlined in this post: <a href="http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/">http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/</a></li>
<li>As always, the latest source code can be found here: <a href="http://drop.io/0yhqyon/asset/temp-analysis-v0-10-zip">http://drop.io/0yhqyon/asset/temp-analysis-v0-10-zip</a></li>
<li>Data used to produce all the graphs in this article is available in a spreadsheet here: <a href="http://drop.io/0yhqyon/asset/us-uhi-xls">http://drop.io/0yhqyon/asset/us-uhi-xls</a></li>
<li>Click on any of the graphs to embiggen</li>
</ul>
<p>I use 2.5 x 3.5 lat/lon gridcells and a 1961-1990 baseline for anomaly calculation because these are the standards used by NCDC in analyzing USHCN data. Based on the feedback from prior articles, I decided to de-emphasize the use of spaghetti graphs because they tend to obscure small (or even moderate) differences in datasets when the data is noisy. Instead, I’ll primarily be showing comparisons of trends in datasets over two different periods (1900 – 2009 and 1960 – 2009) to show long-term and “modern warming period” trends respectively.</p>
<p>I’ll be looking at three different proxies for urbanity:</p>
<ul>
<li>Population      density, via <a href="http://sedac.ciesin.columbia.edu/gpw/global.jsp">GWPv3 2010 spatial data</a></li>
<li>Urbanity,      via <a href="http://sedac.ciesin.columbia.edu/gpw/global.jsp">GRUMP urban boundry data</a> (using both population density and remote      sensing data of settlement extent to define urban borders)</li>
<li>Satellite      nightlight brightness, using <a href="http://www.ngdc.noaa.gov/dmsp/downloadV4composites.html">DMSP-OLS v4 data</a></li>
</ul>
<p>All of these are taken from external sources and are not obtained from any station metadata provided by NCDC. They are indexed to station lat/lon coordinates through work by Ron Broberg over at <a href="http://rhinohide.wordpress.com/">The Whiteboard.</a></p>
<p>To better understand the data used in this analysis, its worth discussing USHCN for a bit and the various raw and adjusted datasets that it provides.</p>
<p><strong>The U.S. Historical Climatological Network (USHCN)</strong></p>
<p>The USHCN provides a number of substantive improvements over the GHCN data I used in the prior global UHI analysis. For one, the station metadata is considerably better (specifically, the lat/lon coordinates are generally accurate, which is essential for the approach I’m taking). USHCN stations also are much denser than GHCN stations, which makes a pair-wise comparison approach effective without resulting in the loss of the majority of gridcells. Finally, USHCN stations tend to have long, continuous records and there is little station “dropout” in recent years.</p>
<p>The map below (from <a href="http://drop.io/0yhqyon/asset/menne-etal2009-pdf">Menne et al 2009</a>) shows all USHCN stations and COOP stations not included in USHCN in the United States. The spatial coverage of the network is quite good, though there is some local clustering that we can account for by spatially gridding the stations in our analysis.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-map.png"><img class="aligncenter size-full wp-image-9915" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-map.png" alt="" width="479" height="234" /></a></p>
<p>USHCN provides three different sets of data (raw, TOB, and F52) for three different temperature readings (mean, max, and min). Raw data is the rawest of the raw; a simple monthly average mean, max, and min from each station with very basic quality control checks. TOB data is the raw data corrected for bias introduced by changes in observation times over time for a portion of the USHCN stations without hourly temperature data (e.g. primarily those in the COOP network). F52 is the final adjusted dataset that takes the TOB data as a starting point and uses an algorithm to detect step changes and other inhomogeneities in individual station data by comparing the record of each station to that of other nearby stations. The F52 data also has missing datapoints infilled from nearby stations.</p>
<p>Time of observation (TOB) adjustments occur primarily in stations without hourly temperature data available. These stations will tend to measure temperatures only a few times a day, and the time they measure the temperature will have a strong effect on the max and min temps recorded. If the time of observation changes over time, it can introduce a bias into the raw temperature data. Because stations record the time of observation associated with each reading, researchers can measure how the TOB changes over time. The Figure below (also from <a href="http://drop.io/0yhqyon/asset/menne-etal2009-pdf">Menne et al 2009</a>) shows the general TOB for each station by year. Immediately obvious is a dramatic shift from PM TOB to AM TOB over the past 60 years; a change which <a href="http://drop.io/0yhqyon/asset/menne-etal2009-pdf">Menne et al 2009</a>, Karl et al. 1986, Vose et al 2003, Pielke et al 2007, and numerous other papers all conclude introduces a spurious cooling bias into the temperature record.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-changes.png"><img class="aligncenter size-full wp-image-9916" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-changes.png" alt="" width="472" height="386" /></a></p>
<p>Adjustments to correct the TOB effects have the net impact of slightly lowering temps at the start of the century and significantly increasing temps near the end of the century. The net effects of TOB adjustments to USHCNv2 are shown in the figure below (yet again from <a href="http://drop.io/0yhqyon/asset/menne-etal2009-pdf">Menne et al 2009</a>). The effects of TOB adjustments are effectively identical in the old USHCNv1 and the new USHCNv2.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Net-TOB-adjustment.png"><img class="aligncenter size-full wp-image-9917" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Net-TOB-adjustment.png" alt="" width="462" height="402" /></a></p>
<p>USHCNv2 introduced a new method for detecting and correcting inhomogeneities in station data that replaces the numerous independent adjustments used for station moves, UHI effects, and changes in measurement instruments (e.g. the shift to MMTS) in USHCNv1. The USHCNv2 inhomogeneity detection works by looking at multiple pair-wise comparisons of stations in a region to identify spurious step-changes or unusual changes in trends and correcting them to be in line with records from nearby stations. It also uses the documented station history to identify and correct for known changes in station location and instrument type. The net effects of non-TOB adjustments (e.g. F52 minus TOB) is shown in the figure below (again from <a href="http://drop.io/0yhqyon/asset/menne-etal2009-pdf">Menne et al 2009</a>). The pronounced increase in maximum temperature due to adjustments in the last 30 years is primarily associated with the switch from old LiG (liquid-in-glass) thermometers to new MMTS systems.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Menne-inhom-adj.png"><img class="aligncenter size-full wp-image-9918" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Menne-inhom-adj.png" alt="" width="493" height="418" /></a></p>
<p>To examine all the difference approaches to detecting UHI, we can use the three urbanity proxies identified earlier (pop density, urban boundaries, and satellite nightlights) to compare the trends in temperature constructed from stations in the same gridcells with different urbanity characteristics. Specifically, we will look at three pair-wise comparisons within 2.5 x 3.5 lat/lon gridcells:</p>
<ul>
<li>Stations      with less than 10 pop density and greater than 100 pop density</li>
<li>Stations      within urban boundaries and outside urban boundaries (rural)</li>
<li>Stations      with bright nightlight readings (40 to 63 brightness index) and dark      nightlight readings (0 to 20 brightness index)</li>
</ul>
<p>In all cases, the number of gridcells available for use in temperature reconstructions will be smaller than the total number of CONUS gridcells, because not all gridcells have stations of both types for a given proxy. The figure below shows the total number of 2.5 x 3.5 CONUS gridcells, and the number available for each pairwise comparison by urbanity proxy. The population density proxy is the most restrictive, allowing us to use only 30 percent of CONUS gridcells. The urban boundary proxy allows the use of 80 percent of gridcells, while the satellite nightlight proxy allows the use of 72 percent of gridcells.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Proxy-Gridboxes.png"><img class="aligncenter size-full wp-image-9920" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Proxy-Gridboxes.png" alt="" width="505" height="311" /></a></p>
<p>Lets examine each proxy in turn, and see what they tell us about the potential effect of changes in urban form on the temperature trend. It is worth noting that all three proxies represent a snapshot in time, and cannot perfectly account for changes in urban form over time. However, if the low population density / rural / dark stations represent the measurements least susceptible to UHI effects, the difference between the temperature trends reconstructed from just those stations and that from all stations in the gridcells used can give us an estimate of the relative effect of UHI, assuming that locations rarely become less dense / more rural / darker over time.</p>
<p>I’ve also chosen to focus on trends in mean temperatures rather than max or min due to time constraints, though expanding the analysis to include max/min temps would provide additional information about potential UHI effects.</p>
<p><strong>Population Density</strong></p>
<p>The population density data used in this analysis comes from the GWPv3 2010 estimates, a database maintained by Columbia University. The dataset provides 2.5 arc-minute resolution, and is mapped to USHCN stations using the station lat/lon provided in the metadata file. 1214 stations are successfully located using the 2.5 arc-minute data, while 2 additional stations are located via 15 arc-minute data and a final 2 by 60 arc-minute data. The figure below shows the 2010 population distribution for the CONUS.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-196.png"><img class="aligncenter size-full wp-image-9921" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-196.png" alt="" width="465" height="207" /></a></p>
<p>Having assigned a population density figure to each USHCN station, we can look at the distribution of stations by population density as shown below:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-stations-GWPv3.png"><img class="aligncenter size-full wp-image-9922" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-stations-GWPv3.png" alt="" width="505" height="311" /></a></p>
<p>The majority of stations have fewer than 25 people per square kilometer, with the plurality having less than 10 per square kilometer. Relatively few USHCN stations have a population density greater than 100, and even fewer have a density over 400. The two population brackets used in this analysis, under 10 and over 100, provides a large pool of stations in the former category and a relatively small pool in the latter category.</p>
<p>If we further restrict our analysis to gridcells with both under 10 and over 100 pop density stations, we get the figure below, showing around 150 stations with under 10 density and around 80 stations with over 100 density in 36 2.5 x 3.5 gridcells.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Density-stations.png"><img class="aligncenter size-full wp-image-9923" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Density-stations.png" alt="" width="505" height="311" /></a></p>
<p>We can use these stations to find the trend for the periods 1900-2009 and 1960-2009 for the three datasets (raw, TOB, and F52), along with their 5<sup>th</sup> to 95<sup>th</sup> percentile ranges. Note that the trend confidence intervals shown here are slightly smaller than the true confidence intervals, as they do not correct for autocorrelation present in the underlying data.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-UHI-Pairwise.png"><img class="aligncenter size-full wp-image-9924" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-UHI-Pairwise.png" alt="" width="499" height="363" /></a></p>
<p>The difference between the trends of low and high-density stations is quite large in the raw data. However, TOB adjustments appear to mostly eliminate the difference in trends. This may be due to the fact that stations in areas of lower population density regions are more likely to be COOP stations without hourly temperature readings, and thus more subject to spurious cooling biases due to changes in TOB.</p>
<p>We can take a closer look at the relative effect of each adjustment on the trend:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-Adj-density.png"><img class="aligncenter size-full wp-image-9925" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-Adj-density.png" alt="" width="504" height="311" /></a></p>
<p>Here we can see TOB adjustments have a large effect of the trend on low density stations, increasing the trend by close to 100 percent over both periods. The TOB adjustments have a negligible effect on high density stations for the 1900-2009 period and a minor effect for the 1960-2009 period. Inhomogeneity adjustments have a similar effect on both classes of stations for the 1900-2009 period and a slightly larger effect on low density stations in recent years.</p>
<p>To estimate the magnitude of the UHI effect on the overall trend for the gridcells in question, we can compare the trends in low density stations to those of all stations. This comparison is shown in two formats: the percent effects, which allows for a comparison across both time periods (calculated via all station trend / low density trend minus 1), and the absolute effects on the trends (calculated via all station trend minus low density trend, in degrees C per year).</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Density-Percent.png"><img class="aligncenter size-full wp-image-9927" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Density-Percent.png" alt="" width="505" height="311" /></a><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Density-Absolute.png"><img class="aligncenter size-full wp-image-9926" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Density-Absolute.png" alt="" width="505" height="312" /></a></p>
<p>In both cases, the UHI effect is much more pronounced in the raw data, is modest in the TOB data, and is negligible or even slightly negative in the inhomogeneity-adjusted data.</p>
<p><strong>Urban Boundaries</strong></p>
<p>Urban boundary data is taken from the GRUMP dataset, also from Columbia University. As <a href="http://rhinohide.wordpress.com/2010/03/13/ghcnv2-and-grump-rural-and-urban-extents/">Ron Broberg explains</a>, “The [GRUMP] data set consists of a merging of two sources of information: population and settlement extents. Population data is derived from a variety of sources, primarily national census. Settlement extent is derived from the Defense Meteorological Satellite Program (DMSP) Operational Linescan System (OLS) for a seven month period in 1994/1995, from an ESRI Digital Chart of the World (DCW), and Tactical Pilotage Charts (TPC).” A chart of CONUS GRUMP urban boundaries is shown below:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/grump-urban-us.png"><img class="aligncenter size-large wp-image-9929" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/grump-urban-us-1024x614.png" alt="" width="502" height="301" /></a></p>
<p>There are 97 gridcells with both urban and non-urban (rural) GRUMP stations that we can use in our pair-wise comparison. The total number of stations in those gridcells, as well as the number of urban and rural stations, are shown below. Unlike the population density case examined previously, all stations in the gridcells are classified as either urban or rural.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Urbanity-stations.png"><img class="aligncenter size-full wp-image-9931" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Urbanity-stations.png" alt="" width="505" height="312" /></a></p>
<p>Lets take a look at the trends for both periods for raw, TOB, and inhomogeneity-adjusted data:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-Urbanity-Pairwise.png"><img class="aligncenter size-large wp-image-9933" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-Urbanity-Pairwise-1024x744.png" alt="" width="502" height="365" /></a></p>
<p>And the relative effects of each round of adjustments on rural and urban stations:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-Adj-urbanity.png"><img class="aligncenter size-full wp-image-9934" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TOB-Adj-urbanity.png" alt="" width="505" height="312" /></a></p>
<p>Again, we see that TOB adjustments are considerably larger (nearly 2x) for rural stations than for urban stations, though the difference here is smaller than that observed in the population density analysis. This makes sense, since the population density proxy was looking at a more extreme division. We also se that inhomogeneity adjustments are similar for both datasets over the 1900-2009 period, but considerably larger for rural stations than urban stations over the 1960-2009 period.</p>
<p>Turning to the estimate of UHI (comparing the trend in rural stations to all stations in the gridcells), we find that the effect of urban form on temperature trend is diminished after each subsequent round of adjustments, with the small exception of the absolute effect of inhomogeneity adjustments in the 1900-2009 period.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Urbanity-Percent.png"><img class="aligncenter size-full wp-image-9936" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Urbanity-Percent.png" alt="" width="505" height="312" /></a><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Urbanity-Absolute.png"><img class="aligncenter size-full wp-image-9935" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/UHI-Urbanity-Absolute.png" alt="" width="506" height="312" /></a></p>
<p>In general, the effect of urban form estimated via the urban boundaries proxy is smaller than that found via the population density proxy, except in the final inhomogeneity-adjusted data where it is notably larger.</p>
<p><strong>Satellite Nightlights</strong></p>
<p>By measuring the visible light at night from space via satellites, we can create an assessment of the relative urbanity of a specific area. This analysis uses raw DMSP-OLS v4 nightlight data from 2008 kindly processed by Ron Broberg to assign a brightness rating ranging from 0 to 62 to each station. Its worth noting that DMSP-OLS data is also one of the inputs used by the GRUMP urban boundary designations, so these proxies are not completely independent. Below is an image showing satellite nightlight brightness for the CONUS.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USA_LIGH.gif"><img class="aligncenter size-large wp-image-9928" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USA_LIGH-1024x614.gif" alt="" width="502" height="301" /></a></p>
<p>NOAA used an older version of DMSP-OLS data to classify each USHCN and GHCN station as Dark, Dim, or Bright. Unfortunately, the scale used in that classification is different than the 0 to 63 scale provided by the current satellite data. After examining the distribution of stations by brightness, I somewhat arbitrarily labeled stations with a nightlight brightness of 0 to 20 as “Dark” and 40 to 63 as “Bright”. This leaves us with slightly under a third of stations that are neither dark nor bright, and provides a proxy that is less restrictive than the population density division but more restrictive than the binary urban/rural GRUMP proxy. It also gives us 87 gridcells with both dark and bright stations for pair-wise comparisons, with the number of stations available for each class for each year shown below:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Nightlight-stations.png"><img class="aligncenter size-full wp-image-9937" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Nightlight-stations.png" alt="" width="506" height="312" /></a></p>
<p>If we calculate the trends for dark, bright, and all stations over both periods for raw, TOB, and inhomogeneity-adjusted data we see that it looks fairly similar to the figure for urban boundaries.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-Nightlight-Pairwise.png"><img class="aligncenter size-large wp-image-9932" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-Nightlight-Pairwise-1024x744.png" alt="" width="502" height="365" /></a></p>
<p>A closer look at the relative effects of each adjustment shows that the TOB adjustments for dark stations are much larger than those we observed for rural stations, especially for the 1900-2009 period. The inhomogeneity adjustments are comparable for 1900-2009, but are quite small for bright stations post-1960.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-197.png"><img class="aligncenter size-full wp-image-9976" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-197.png" alt="" width="505" height="312" /></a></p>
<p>Looking at the estimated effects of urban form on the temperature trend (e.g. all station trend vs. dark stations only), we see a very large urban form effect present in the raw data for the 1900-2009 period, much of which disappears after TOB adjustments are applied.</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-201.png"><img class="aligncenter size-full wp-image-9980" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-201.png" alt="" width="506" height="312" /></a><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-204.png"><img class="aligncenter size-full wp-image-9979" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-204.png" alt="" width="507" height="313" /></a></p>
<p><strong>Conclusions</strong></p>
<p>By examining the effects of three different urbanity proxies on trends in raw, TOBs, and inhomogeneity-adjusted USHCN data for the periods of 1900 to 2009 and 1960 to 2009 we can get a good picture of all the different ways to evaluate the effects of urban form on temperature trends in the United States. Some of the results we obtained were expected, others somewhat less so.</p>
<p>Among expected results are that UHI effects seem to be larger (as a percent change of the trend) over the full century than over the past 50 years, at least in the urbanity and nightlight proxies. This holds across all three datasets, even the adjusted one, perhaps indicating the presence of a small-to-modest UHI signal robust to potential biases introduced by both TOBs and inhomogeneity issues. We also find that TOBs adjustments have a much larger effect on low pop density / rural / dark stations than their counterparts, something we would expect if these stations were more likely to be COOP stations without hourly temperature data available. What was slightly less expected is that inhomogeneity adjustments appear to have a much greater effect on low density / rural / dark stations in recent years (1960-2009) than their high density / urban / bright counterparts. It may be that this is due to MMTS-related inhomogeneities, since rural stations are perhaps less likely to experience a switch to MMTS.</p>
<p>So where does this leave us? It turns out that the size of the UHI effect you can claim from U.S. data depends largely on whether or not you accept the need for TOB and inhomogeneity adjustments. If you reject all adjustments, UHI appears rather significant (increasing the U.S. temp trend between 25 and 90 percent depending on the period and proxy used). If you accept all adjustments, UHI turns out to be fairly small (changing the trend by between -5 and 14 percent). If you just look at TOB adjustments but ignore inhomogeneity, you end up with modest UHI effects (increasing the trend between 10 and 25 percent). However, the analysis shown in the initial graph, comparing a non-urban raw dataset to the adjusted data from all stations gives a misleading impression, since the adjustments (both TOB and inhomogeneity) have a strong correlation with urban form.</p>
<p><strong>Update</strong></p>
<p>MikeC asked about micro-site effects, which aren&#8217;t necessarily perfectly correlated with changes in urban form. Since we do have data on the CRN rankings of some stations via the Surface Stations project, we can use CRN as a proxy for micro-site effects, with CRN12 assumed to have few micro-site biases and CRN345 assumed to have large microsite biases.</p>
<p>Because there are relatively few CRN-rated stations available to me (especially in the CRN12 bin), I can&#8217;t do a pair-wise comparison using the same grid cells, so there may be spatial coverage biases in these results. YMMV.</p>
<p>The full list of CRN12 and CRN345 stations in the &lt; 10 and &gt; 100 pop density brackets, along with their reconstructed anomalies and trends, are available here: <a href="http://drop.io/0yhqyon/asset/ushcn-crn-xls">http://drop.io/0yhqyon/asset/ushcn-crn-xls</a></p>
<p>And the results, using raw data:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-CRN.png"><img class="aligncenter size-large wp-image-9962" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/USHCN-CRN-1024x744.png" alt="" width="502" height="365" /></a></p>
<p style="text-align: left">CRN12 stations seem to have a higher trend than CRN345 stations, consistent with our earlier analysis and replication of Menne et al 2010. This may be due to the fact that CRN345 stations are more likely to have undertaken a switch to MMTS sensors (and had a spurious max temp cooling bias introduced), so I&#8217;d be careful about concluding too much regarding micro-site effects from this chart. That said, it at least suggests that micro-scale effects aren&#8217;t enormous.</p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/uhi-in-the-u-s-a/feed/</wfw:commentRss>
		<slash:comments>104</slash:comments>
		</item>
		<item>
		<title>Comparison of a trend of 0.2C/decade to NOAA: Since 2001 ( For Nathan )</title>
		<link>http://rankexploits.com/musings/2010/comparison-of-a-trend-of-0-2cdecade-to-gisstemp-since-2001-for-nathan/</link>
		<comments>http://rankexploits.com/musings/2010/comparison-of-a-trend-of-0-2cdecade-to-gisstemp-since-2001-for-nathan/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 16:09:55 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Climate models]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9893</guid>
		<description><![CDATA[In comments yesterday, Nathan seemed to indicate he wanted to see the result of some sort of test with data beginning in January 2001. Here&#8217;s a graph just for Nathan:

Above is a graph showing how the trend of 0.2 C/decade fits into ±95% uncertainty intervals computed for NOAA observation. The uncertainty intervals are estimated assuming [...]]]></description>
			<content:encoded><![CDATA[<p>In comments yesterday, Nathan seemed to indicate he wanted to see the result of some sort of test with data beginning in January 2001. Here&#8217;s a graph just for Nathan:<br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISS_Since_20011.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISS_Since_20011-500x341.jpg" alt="" title="NOAA_Since_2001" width="500" height="341" class="aligncenter size-medium wp-image-9895" /></a></p>
<p>Above is a graph showing how the trend of 0.2 C/decade fits into ±95% uncertainty intervals computed for NOAA observation. The uncertainty intervals are estimated assuming the residuals to the linear fit have an ARMA(1,1) structure, and part of the &#8216;weather noise&#8217; is explained by the MEI index.  (I have documented the methodology for estimating uncertainty intervals using ARMA(1,1) only partially at the blog. The method of estimating the uncertainty intervals uses  a &#8220;number of effective data points&#8221; method that is asymptotically correct as we acquire an infinite number of samples. In my applications, the coefficients required to estimate the effective number of data points are based on the first three lagged correlations in the correlogram for the residuals and monte-carlo tests indicate my method returns uncertainty intervals that are slightly too large.)</p>
<p>The graph shown  is one of the &#8220;default&#8221; graphs in the spreadsheet I update when new monthly temperature anomalies are published.  I alternate showing new data along with trends 1980 or 2001, with no particular pattern.  Sometimes I show a graph with both trends. If I noticed that the 2001 graph flipped form &#8220;rejecting&#8221; 0.2 C/decade (as it has been for  many months) to &#8220;accepting&#8221;, I would consider that news and show highlight that graph. Similarly, if 1980 flipped from &#8220;accepting&#8221; to &#8220;rejecting&#8221;, I&#8217;d highlight that one. Otherwise, it&#8217;s a toss up; earlier this month I happened to pick the <a href="http://rankexploits.com/musings/2010/gisstemp-0-71c-slightly-higher-than-january/">1980 graph</a> when presenting GISSTemp for this month.</p>
<p>As long as I&#8217;m showing the 2001 graph, I&#8217;ll discuss what this communicates.  When the analysis using the assumptions discussed briefly above is performed,  a trend equal to exactly 0.2 C/decade falls <em>outside</em> the ±95% confidence intervals for the trend consistent with the NOAA/NCDC data observations.   This means that based on the assumptions of this analysis, if we select a confidence level of 95%, we should treat the assumption that the trend is 0.2 C/decade as <em>false</em>.   </p>
<p>In contrast, because, for the purpose of analysis,  we&#8217;d assumed the trend of 0.2 C/decade projections is true, if that trend fell inside the uncertainty intervals, we&#8217;d continue to assume it was true. </p>
<p>Note that currently, if we create equivalent graphs for GISSTemp or Hadley, the analysis based on HadCRUT returns the result similar to NOAA shown above; while the analysis using GISSTemp results in a &#8216;fail to reject&#8217; conclusion. </p>
<p>In either event, before we&#8217;d performed the analysis, we would know that we would ultimately accept a chance of making the wrong decision, with errors falling in two possible classes:  </p>
<ol>
<li>False positive error. (i.e. &#8216;Type I&#8217; or &alpha; error.) Contingent on our statistical model being correct, (i.e. the residuals are ARMA(1,1), etc.), the chance of decreeing 0.2 C/decade false when it is, in fact, true would be &alpha;=(1-95%)=5%.   The way these tests are constructed, false positive error remains constant at the chosen level of  &alpha; no matter how much data we collect.  I&#8217;ve set up my test to give false positives at a rate of 5%.</li>
<li>False negative error. (i.e. &#8216;Type II&#8217; or &beta; error.) Once gain, contingent on our statistical model being correct, we have some chance of <i>failing to reject the null hypothesis</i>. In this case, that is saying the trend of 0.2C/decade is correct when it is incorrect.  The rate of &beta; error can never truly be known because it is a function of the the magnitude of the actual trend&#8211; which cannot be known. However, in the limit of zero data, this rate is 95% and declines as we obtain more data.  So, this type of error&#8211; accepting the null hypothesis, is the error we generally worry about when we have very little data.</li>
</ol>
<p>Other sources of making the wrong diagnosis is using an incorrect statistical model. This is not directly related to using a short time period. There is, however,  an indirect effect because lack of data can make it difficult to perform fiduciary tests to determine whether or statistical model is correct. For example, in the current case,  if we have insufficient information, but had assumed the residuals are AR(1) as opposed to ARMA(1,1), white or any other statistical model, tests to show our assumptions are incorrect will have little power.   </p>
<p>In any case: What the result illustrated in the graph above indicates is that, assuming the statistical model is true, and assuming 2001 is a good start year, we should <i>reject</i> the assumption that the trend of 0.2C/decade falls inside the range consistent with data.</p>
<h3>Have I shown results of other tests starting in 2001 recently?</h3>
<p>Since I&#8217;m  not entirely sure <em>why</em> Nathan seemed to suggest I stopped doing something starting specifically in January, 2001, I think it&#8217;s worth showing that I have been recently presented results of analysis using data that begin specifically in 2001 </p>
<p>That&#8217;s easy enough to show Nathan that I am in habit of including what results of statistical tests if we begin in Jan 2001 along with results starting in other years.  For example, in January of this year,  I posted  various graphs comparing the multi-model mean projections computed from 22 models used by the IPCC to create projections published in the AR4 to observations.  That method used in that post permitted me to include the uncertainty arising from both the &#8216;weather noise&#8217; and spread in biases for that collection of 22 models. The results for Hadley and GISTemp were posted <a href="http://rankexploits.com/musings/2010/multi-run-mean-ar4-projections-statistically-significant-from-observations-from-50-6070-00-and-01/">here</a> and  <a href="http://rankexploits.com/musings/2010/multi-model-mean-projection-rejects-gisstemp-start-dates-50-60-70-80/">here</a> respectively.   </p>
<p>Both graphs contain results of analysis initiated in decadal years since 1950, <em>supplemented with the year 2001</em>. </p>
<p>Some new readers might wonder why I always include 2001 in particular. I always include  2001 because the emissions scenarios or SRES used to drive climate models was first released in Nov. 2000 (<a href="http://www.grida.no/publications/other/ipcc_sr/">a noted here</a>). So 2001 is, in some sense, the first year presenting a pure comparison of observations to projections driven by the SRES.  (On can argue whether purity matters, but I think it&#8217;s worth noting the distinction between testing hindcasts and forecasts.)</p>
<p>I&#8217;m aware this post may prompt Nathan to clarify what precisely he thinks I stopped doing. One thing I do not believe I have stopped doing is comparing observations to IPCC projections with the analysis using data beginning in January 2001 and reporting whether the statistical test guides us to deem the projections &#8216;true&#8217; or &#8216;false&#8217;.  Quite often, if we chose a confidence level of 95%,  the result of a statistical analysis guides us to conclude a trend that we can connect to information reported in the IPCC AR4 is found to be  is false.  (For example, the trend of 0.2C/century is diagnosed as false above.)  Occasionally, we the analysis does not guide us to conclude a trend is not false. By convention, if that trend was the null hypothesis, we continue to treat it as true&#8211; because, by convention, we treat a null hypothesis  as true even if when we have absolutely no data.</p>
<p>It seems to me that I show graphs and presents result of analyses using data starting in Jan. 2001 rather consistently and plan to continue to do so. I will, of course, also continue to show what happens if we apply the same analysis using longer time spans, or what happens if we change the assumptions in our statistical model and also sometimes publish blog posts that don&#8217;t happen to include a graph or a trend computed starting in 2001.  </p>
<h3>Update</h3>
<p>I goofed up and showed a NOAA graph but discussed GISSTemp. I&#8217;ll update by adding the GISSTemp, which may well show trends inside the ±95%  uncertainty intervals as that&#8217;s what we get when I look at the full model with the spread as see if you click the link discussing that more detailed analysis. </p>
<div id="attachment_9905" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISSTemp1.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISSTemp1-500x341.jpg" alt="" title="GISSTemp" width="500" height="341" class="size-medium wp-image-9905" /></a><p class="wp-caption-text">Figure 2: GISSTemp since 2001.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/comparison-of-a-trend-of-0-2cdecade-to-gisstemp-since-2001-for-nathan/feed/</wfw:commentRss>
		<slash:comments>122</slash:comments>
		</item>
		<item>
		<title>GISSTemp +0.71C: Slightly higher than January.</title>
		<link>http://rankexploits.com/musings/2010/gisstemp-0-71c-slightly-higher-than-january/</link>
		<comments>http://rankexploits.com/musings/2010/gisstemp-0-71c-slightly-higher-than-january/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 22:48:11 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[global climate change]]></category>
		<category><![CDATA[GISSTemp]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9888</guid>
		<description><![CDATA[GISSTemp released their land/ocean anomalies for February. It&#8217;s 0.71C, just edging out January.  Rebaselined anomalies and trends since 1980 are shown below, along with with a trend of 0.2C/decade shown for reference:
This is a hot start for the year.  Factoids about the GISSTemp data:

This is the second highest Feb. anomaly in the record; [...]]]></description>
			<content:encoded><![CDATA[<p>GISSTemp released their land/ocean anomalies for February. It&#8217;s 0.71C, just edging out January.  Rebaselined anomalies and trends since 1980 are shown below, along with with a trend of 0.2C/decade shown for reference:</p>
<div id="attachment_9887" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISSTemp.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/GISSTemp-500x341.jpg" alt="" title="GISSTemp" width="500" height="341" class="size-medium wp-image-9887" /></a><p class="wp-caption-text">Figure 1: GISTemp Feb. 2010</p></div>
<p>This is a hot start for the year.  Factoids about the GISSTemp data:</p>
<ol>
<li>This is the second highest Feb. anomaly in the record; Feb. 1998 hit 0.80C.  </li>
<li>At 0.705C, the year is is tied for second highest average based on both Jan. and Feb. The Jan+Feb average for 2007 was 0.745C.</li>
</ol>
<p>With El Nino still alive and kicking, this year is in the running to break some GISTemp records.  Maybe next month. </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/gisstemp-0-71c-slightly-higher-than-january/feed/</wfw:commentRss>
		<slash:comments>149</slash:comments>
		</item>
		<item>
		<title>Satellite Trends for TLT</title>
		<link>http://rankexploits.com/musings/2010/satellite-trends-for-tlt/</link>
		<comments>http://rankexploits.com/musings/2010/satellite-trends-for-tlt/#comments</comments>
		<pubDate>Wed, 10 Mar 2010 16:58:49 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[global climate change]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9845</guid>
		<description><![CDATA[RSS, UAH (v5.2) and the new revised UAH (v5.3) February temperature anomalies for the lower troposphere have been posted; anomalies during the observational record are shown below along with the  multi-model mean for the surface temperature based on 20th century simulations extended into the 21st century using A1B : 
Comparison of the RSS and [...]]]></description>
			<content:encoded><![CDATA[<p><a href="www.remss.com/data/msu/monthly_time_series/RSS_Monthly_MSU_AMSU_Channel_TLT_Anomalies_Land_and_Ocean_v03_2.txt">RSS</a>, <a href="http://vortex.nsstc.uah.edu/public/msu/t2lt/tltglhmam_5.2">UAH (v5.2)</a> and the new revised <a href="http://vortex.nsstc.uah.edu/public/msu/t2lt/tltglhmam_5.3">UAH (v5.3)</a> February temperature anomalies for the <em>lower troposphere</em> have been posted; anomalies during the observational record are shown below along with the  multi-model mean for the <em>surface</em> temperature based on 20th century simulations extended into the 21st century using A1B : </p>
<div id="attachment_9847" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/SatelliteTrends.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/SatelliteTrends-500x341.jpg" alt="Temperature anomalies for the lower troposphere." title="SatelliteTrends" width="500" height="341" class="size-medium wp-image-9847" /></a><p class="wp-caption-text">Figure 1: Temperature anomalies for the lower troposphere. Projections for surface provided for reference.</p></div>
<p>Comparison of the RSS and UAH observations indicates the observed least squares trend applied to observations during the observational record continues to lag the least squares trend based on the multi-model mean anomaly for the surface. Of course, it would be more appropriate to compare to projections for the lower troposphere, but for monthly blog posts I&#8217;m using trends from simulations that are available at &#8220;The Climate Explorer&#8221;. The most recent individual monthly observations  are warmer than the value corresponding to the multi-model mean temperature anomaly.   We must all wait to see whether the current El Nino will provide a sufficient number or warm months to close the gap in the trend. </p>
<p>The current reported anomalies are compared to those from last month and February 1998 below: </p>
<table width="90%" border="1"  >
<tr ALIGN="center">
<td >Year</td>
<td>month</td>
<td>RSS </td>
<td>UAH 5.3 </td>
<td>UAH 5.2 </td>
</tr>
<tr ALIGN="center">
<td></td>
<td> </td>
<td COLSPAN="3"> Reported temperature anomaly in C</td>
</tr>
<tr ALIGN="center">
<td>2010</td>
<td>1</td>
<td>0.640 </td>
<td>0.630 </td>
<td>0.724 </td>
</tr>
<tr ALIGN="center" >
<td>2010</td>
<td>2</td>
<td>0.588 </td>
<td>0.613 </td>
<td>0.740 </td>
</tr>
<tr ALIGN="center">
<td>1998</td>
<td>2</td>
<td>0.736</td>
<td>0.753</td>
<td>0.753</td>
</tr>
</table>
<p>February 2010 exhibited a warm lower troposphere, but records for observed February values set in  1998 remains unbroken.   Now, we await GISS and Hadley to see what they show for February!</p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/satellite-trends-for-tlt/feed/</wfw:commentRss>
		<slash:comments>90</slash:comments>
		</item>
		<item>
		<title>Not So Spherical Cows: More Toy Problems.</title>
		<link>http://rankexploits.com/musings/2010/not-so-spherical-cows-more-toy-problems/</link>
		<comments>http://rankexploits.com/musings/2010/not-so-spherical-cows-more-toy-problems/#comments</comments>
		<pubDate>Tue, 09 Mar 2010 22:12:36 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9837</guid>
		<description><![CDATA[In discussions of the application of the Hansen/Lebedeff gridding method to &#8220;toy planet&#8221; data, a few people noted that the &#8220;toy&#8221; data used did not hightlight a known way bias  can be introduced into a computed temperature trend as a result of dropping out a whole bunch of stations. This can give the false [...]]]></description>
			<content:encoded><![CDATA[<p>In discussions of the application of the Hansen/Lebedeff gridding method to &#8220;toy planet&#8221; data, a few people noted that the &#8220;toy&#8221; data used did not hightlight a known way bias  <i>can</i> be introduced into a computed temperature trend as a result of dropping out a whole bunch of stations. This can give the false impression that I&#8217;m suggesting the anomaly method takes care of any and all data problems associated with adding or dropping thermometers used to measure the trend: It does not.  In fact, even when using the anomaly method, bias can be introduced when we add or drop the number of thermometers used to measure temperatures and the thermometers used each measure temperatures at locations that exhibit <i>different trends.</i> For example: Climate scientists predict that the polar latitudes will warm at a <i>faster rate</i> than tropical latitutes. If so, then dropping all the northern thermometers could result in a bias in trend. To show how, I&#8217;ve modified the spreadsheet and created synthetic data to create graphs that highlight how trends <i>can</i> be biased when thermometer are biased in a particular way.  </p>
<h3>The Toy Thermometers</h3>
<p>As in yesterday&#8217;s post, applied the Hansen and Lebedeff (1987) method to compute the &#8216;temperature&#8217; in a sub-box.  (For details download the <a href="http://pubs.giss.nasa.gov/abstracts/1987/Hansen_Lebedeff.html">pdf</a> and read <a href="http://rankexploits.com/musings/2010/giss-anomalies-more-spherical-cow/">yesterday&#8217;s post.</a>) The sub-box will be assumed be instrumented with 5 pairs of thermometers. The pairs will consist of one thermometer that has been placed in a &#8217;slow warming&#8217; location, warming at 0.01 C/year and one placed in a &#8216;fast warming&#8217; location warming at 0.03 C/year.  The crack team of scientists will be monitoring both sets of thermometers from years 1-28; afterwards they will monitor only the &#8220;slow warming&#8221; thermometers. Meanwhile, the monks across the street will continue to monitor the &#8220;fast warming&#8221; thermometer, permitting us to compare computed &#8216;temperatures&#8217; based on the all 10 thermometers and compare them to the set examined by the crack team of scientists. </p>
<h3>Raw Data</h3>
<p>This is synthetically generated data from all thermometers during the full period.  Notice one group definitely warms fater than the other. (Notice I also set the &#8216;noise&#8217; to a low value; I did t his on purpse to highlight the things that can go wrong. )</p>
<div id="attachment_9835" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/SlowFast.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/SlowFast-500x341.jpg" alt="" title="SlowFast" width="500" height="341" class="size-medium wp-image-9835" /></a><p class="wp-caption-text">Figure 1: Raw data for 'toy' problem with fast warming and slow warming thermometers.</p></div>
<p>If you examine that graph it is clear that if the scientists only used the &#8217;slow trend&#8217; thermometers &#8212; shown in red (because scientists expect warmer regions to warm more slowly) they would report that the sub-box was warming at the slow rate. </p>
<p>If scientists only used the &#8216;fast trend&#8217; (shown in blue because cliamte scientists believe cold regions will warm more slowly) thermometers, they would report the box was warming at the faster rate. </p>
<p>Even someone with the IQ of a vegetable would realize that if you want to know the rate at which temperature <i>changes</i> in the sub box, you want these thermometers distributed evenly; failing that, if all 5 &#8220;fast trend&#8221; thermometer were in one tight region, and the 5 slow warming spread out, you want to weight the various thermometers to avoid over sampling the &#8220;fast trend.&#8221;   Climate scientists actually <i>try</i> to do that by using area weighting and gridding. But for now, we&#8217;ll just focus on the sub-box and what happens to it&#8217;s computed temperature.</p>
<h3>Result after incorporating all available thermometers</h3>
<p>Yesterday, I wanted to show people how data from each thermometer was incorporated, so I showed intermediate computed results for the first two thermometers and when the 6th thermometer was incorporated.  Today, I&#8217;ll simply show the final graph comparing the &#8217;sub-box&#8217; temperature computed using the all thermometers during all 60 months&#8211; that is the graphs the monks monitoring all thermometers would create,  and also the &#8216;temperature&#8217; computed using only those available to the crack-team of scientists. </p>
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/EffectOfDroppingFastWarming.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/EffectOfDroppingFastWarming-500x341.jpg" alt="Compare Trends for Fast and Slow warming." title="EffectOfDroppingFastWarming" width="500" height="341" class="size-medium wp-image-9836" /></a>
<p>Notice that with <i>this</i> toy data, the result with stations dropped is different from the result if we had access to temperature made available only to the monks.  After month 28, when the &#8220;fast warming&#8221; thermometer were dropped, we see:</p>
<ol>
<li>The trend computed over all 60 data points declines using all thermometers (orange) is higher than the trend computed using all thermometers prior to year 28 but only the slow wearming thermometers after year 28.</li>
<li><i>Both</i> the &#8216;temperatures&#8217;  and the trend in the temperature a based on the series with drop outs shows a step change at year 28. </li>
</ol>
<p>Some of you who don&#8217;t like anomaly methods will think this result shows that raw temperature methods are better. That conclusion is <i>incorrect</i>.  We would see the same discontinuity in trends and similar discontinuities in anomalies if we used computed the temperature in the sub-grid using &#8216;raw&#8217; temperatures.  That&#8217;s because the problem here is not related to the choice of <I>data processing method;</i> it is related to biased sampling <i>relative to the question we wish to ask</i>.  </p>
<p>So, what does this problem show? It shows that if the <i>trends</i> vary in the sub-box, we <i>do</I> need to take care to distribute thermometers such that we properly register the variation in trends.   That is to say, if mountains tops&#8211; though cold&#8211; are warming <i>more rapidly</i> than the warmer valleys below, dropping all the &#8220;cold&#8221; mountain top thermometers will cause a bias. However, those who think droppping these &#8220;cold&#8221; thermometers the bias the computed trend  <i>toward more warming</i> would be mistaken. The  bias would be be to make the computed trend <i>toward less warming</i> and the reason would be because the warm valleys were warming <i>less rapidly</i>. </p>
<p>Now, what of climate and GISTemp? Computation in cells of for GISTemp use the method described here. During the &#8220;march of the thermometers&#8221; many of the thermometers near the cold polar regions were dropped. Climate scientists expect that these regions are warming at a <i>faster</i> rate than the tropics.   This means that if climate scientists theories about polar amplification are right, dropping northerly thermometers would tend to introduce in a <I>cold</i> bias in the computed trend.   </p>
<p>That said, while my toy example shows dramatic effect, it&#8217;s unlikely we&#8217;d expect a large cold effect due to &#8216;the march of the thermometers&#8217;.  For the most part, the we expect variation in long term trends to  strong spatial correlation. That is, we don&#8217;t expect the trend in El Salvador to differ dramatically from Guatemala City; instead we expect the long term trend in Greenland to differ from both those Central American cities.  So,  Eeven though the number of thermometers in cold locations was lost, the area used to compute the temperature for the surface of the earth,  protects somewhat against the loss of thermometers in polar regions introducing an large bias in the computed trend for the earth&#8217;s surface trend.   As long the loss of thermometers doesn&#8217;t result in a method failing to account for  <i>areas</i> with  fast warming, the first order effect on the computed trend for the earth will tend to be to add noise, not bias.   </p>
<p>In the end, the proof is in the pudding.  We can ask. &#8220;Did the cold bias materialize?&#8221; It seems not&#8211; of if it did materialize, it appears small. The exact same analyses by <a href="http://rankexploits.com/musings/2010/a-detailed-look-at-ushcn-mimmax-temps/">Zeke</a>, <a href="http://tamino.wordpress.com/2010/02/23/ghcn-preliminary-results/">Tamino,</a>, <a href="http://clearclimatecode.org/the-1990s-station-dropout-does-not-have-a-warming-effect/">CCC</a>, <a href="http://rhinohide.wordpress.com/2010/03/08/gistemp-high-alt-high-lat-rural/">The Whiteboard</a> etc. all suggest that any biases that might have been introduced by  &#8220;the march of the thermometers&#8221;  were quite small.   So, while this post shows a <i>potential</i> issue that can arise when applying GISTemps method of computing surface temperatures, the situation where that potential actually creates a real bias does not appear to have occurred.  </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/not-so-spherical-cows-more-toy-problems/feed/</wfw:commentRss>
		<slash:comments>301</slash:comments>
		</item>
		<item>
		<title>In search of the UHI signal</title>
		<link>http://rankexploits.com/musings/2010/in-search-of-the-uhi-signal/</link>
		<comments>http://rankexploits.com/musings/2010/in-search-of-the-uhi-signal/#comments</comments>
		<pubDate>Tue, 09 Mar 2010 15:56:09 +0000</pubDate>
		<dc:creator>Zeke</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9804</guid>
		<description><![CDATA[The Urban Heat Island (UHI) effect is something of a charged subject on climate science blogs. Depending on who you ask, you might hear that it either accounts for the majority of modern warming or that it doesn&#8217;t exist at all. While it is undeniable (and fairly easily shown) that both the site characteristics and [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left">The Urban Heat Island (UHI) effect is something of a charged subject on climate science blogs. Depending on who you ask, you might hear that it either accounts for the majority of modern warming or that it doesn&#8217;t exist at all. While it is undeniable (and fairly easily shown) that both the site characteristics and surrounding area have a measurable impact on the absolute temperature of the station, teasing out the actual effects of changes in urban form over time on temperature is notoriously difficult. Its hampered both by the lack of good historical metadata for many stations and basic disagreements on the best methods to use to determine the marginal contribution of UHI to the trend.</p>
<p>The article will provide a novel examination of UHI impacts on global temperature trends from the Global Historical Climatological Network (GHCN) using three different proxies for urbanity: <a href="ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/v2/v2.temperature.inv">subjective classifications</a> of stations as urban or rural in the station metadata, classifications of stations as &#8220;dark&#8221; or &#8220;bright&#8221; based on <a href="http://www.ngdc.noaa.gov/dmsp/downloadV4composites.html">nighttime satellite observations</a>, and the year 2000 population density at the station location via the <a href="http://sedac.ciesin.columbia.edu/gpw">Gridded Population of the World Version 3</a> database. All of these represent a single snapshot in time, and don&#8217;t show changes in urban characteristics over time. However, if we make the reasonably conservative assumption that areas rarely de-urbanize, the vast majority of UHI effects over the past few decades should appear in those stations classified as urban/bright/high density if these classifications are reasonably accurate.</p>
<p style="text-align: left"><span id="more-9804"></span></p>
<p style="text-align: left">Before we dive into data analysis, a few bits of housecleaning:</p>
<p>- As always, the latest source code for the spatial gridding model (version 0.8) can be found here: <a href="http://drop.io/0yhqyon">http://drop.io/0yhqyon</a><br />
- The raw model outputs used to generate all these graphs can be found in an excel file here: <a href="http://drop.io/0yhqyon/asset/uhi-xls">http://drop.io/0yhqyon/asset/uhi-xls</a><br />
- The model has been updated to use the standard NCDC baseline of 1961-1990 instead of the prior 1960-1970 baseline. The net effects of this change on global temp trends should be negligible.</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-76.png"><img class="aligncenter size-full wp-image-9805" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-76.png" alt="" width="589" height="375" /></a></p>
<p><strong>Detecting the UHI signal</strong></p>
<p>Lets start by simply estimating the global temperature using a number of different sets of stations. I&#8217;ll divide these into &#8220;urban&#8221; and &#8220;rural&#8221; proxies based on their expected effect on the resulting temperature vis-a-vis a run using all stations:</p>
<p>Urban proxies:</p>
<ul>
<li>Urban designation</li>
<li>Bright nightlights</li>
<li>Airport location</li>
<li>Population density &gt; 100 people per square kilometer</li>
<li>Population density &gt; 500 people per square kilometer</li>
<li>Population density &gt; 1000 people per square kilometer</li>
</ul>
<p style="text-align: left">Rural proxies:</p>
<ul>
<li>Rural designation</li>
<li>Dark nightlights</li>
<li>Population density &lt; 1 person per square kilometer</li>
<li>Population density &lt; 2 people per square kilometer</li>
<li>Population density &lt; 10 people per square kilometer</li>
</ul>
<p style="text-align: left">Note that these are by no means mutually exclusive. We would expect, for example, most stations with a population density of &lt; 1 person to fit into all five of the rural proxies.</p>
<p>You can see the number of stations (wmo_id imods) of each type by year in the figure below:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-77.png"><img class="aligncenter size-full wp-image-9806" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-77.png" alt="" width="589" height="375" /></a></p>
<p>For population density data (which was kindly provided by Ron Broberg over at the <a href="http://rhinohide.wordpress.com/">Whiteboard</a>), its worth looking quickly at the breakdown of stations by population density. Here is the cumulative percent of stations (1 = 100%) by population density for &lt; 100 pop density (people per square km):</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/under-100.png"><img class="aligncenter size-full wp-image-9807" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/under-100.png" alt="" width="585" height="426" /></a></p>
<p>And &lt; 1000 pop density:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/under-1000.png"><img class="aligncenter size-full wp-image-9808" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/under-1000.png" alt="" width="585" height="426" /></a></p>
<p>Here we see that over 60% of stations have a population density under 100 people per square km, and almost 90% of stations have under 1000 people per square km.</p>
<p>Now, we tell the model to simply estimate a global temperature using each of these different UHI proxies, to produce this lovely spaghetti graph:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-78.png"><img class="aligncenter size-full wp-image-9809" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-78.png" alt="" width="589" height="366" /></a></p>
<p>What immediately stands out is that, no matter which set of stations we use, the result isn&#8217;t immensely different. Apologies to those convinced that UHI would account for the bulk of modern warming, but that doesn&#8217;t seem to be the case at least based on the proxies we have available here.</p>
<p>What you will notice as odd if you look carefully at the graph is that the single highest trending set of stations appears to be those with &lt; 1 pop density! This clearly is not correct, and is due to the different spatial coverage by each proxy. Because there aren&#8217;t as many grid cells with stations &lt; 1 pop density as, say, those with dark or urban stations, they end up having a different trend simply by virtue of their location. If we really want to dive deeply into the impact of each of these factors, we need to develop a more advanced model.</p>
<p style="text-align: left"><strong>Pair-wise comparisons of UHI factors</strong></p>
<p>I added a module to my spatial gridding model that would identify all grid cells with at least x stations of each of a given set of classifications. For example, it can give me all grid cells that contain at least one urban and rural station, or all grid cells that have both low pop density and high pop density stations. This lets me compare trends generated using exactly the same spatial coverage.</p>
<p>I&#8217;ll be examining a number of pair-wise comparisons of 5&#215;5 grid cells containing at least one of both sets:</p>
<ul>
<li>Urban and rural stations</li>
<li>Dark and bright nightlight stations</li>
<li>&lt; 10 and &gt; 100 pop density stations</li>
<li>&lt; 2 and &gt; 500 pop density stations</li>
</ul>
<p style="text-align: left">However, restricting the cells like this can dramatically reduce the number of grid cells available for use in generating the temperature record. Particularly in the &lt; 2 and &gt; 500 pop density stations case, we end up with very few viable grid cells to use, and this can potentially introduce bias due to the selection of odd characteristics of the cell (e.g. cells that have both very rural and very urban areas with stations might also have other exogenous factors). To mitigate the most extreme case, we also try a 10&#215;10 grid cell model for &lt; 2 and &gt; 500 pop density stations (which gives us twice as many grid cells to work with, each with 4x the area of 5&#215;5 cells):</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-79.png"><img class="aligncenter size-full wp-image-9810" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-79.png" alt="" width="589" height="375" /></a></p>
<p>So lets dive right in and take a look at temperatures from stations categorized as urban and rural in the metadata, using only grid cells containing both:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-80.png"><img class="aligncenter size-full wp-image-9811" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-80.png" alt="" width="589" height="366" /></a></p>
<p>Here we see that rural stations are noticeably cooler than urban stations. Indeed, for this set of grid cells, the trend in urban stations over the period covered (0.199 C per decade) is 22% greater than the trend in rural stations (0.164 C per decade).</p>
<p>Next comes dark and bright stations based on satellite nightlight designation:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-81.png"><img class="aligncenter size-full wp-image-9812" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-81.png" alt="" width="589" height="365" /></a><br />
The trends here are quite similar to urban/rural; 0.198 C per decade for Bright, 21% greater than the 0.164 C trend for Dark.</p>
<p>Now we move on to examining population density, starting with &lt; 10 pop density vs. &gt; 100 pop density:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-82.png"><img class="aligncenter size-full wp-image-9813" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-82.png" alt="" width="590" height="368" /></a><br />
The trend here is 0.209 C per decade for the &gt; 100 pop density stations and 0.182 C per decade for &lt; 10 pop density stations, with the former being 15% larger than the latter.</p>
<p>Finally, we look at two cases for &lt; 2 vs. &gt; 500 pop density stations; 5&#215;5 grids with very sparse coverage, and 10&#215;10 grids with about 7 times more area covered:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-83.png"><img class="aligncenter size-full wp-image-9815" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-83.png" alt="" width="591" height="368" /></a><br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-84.png"><img class="aligncenter size-full wp-image-9814" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-84.png" alt="" width="592" height="369" /></a><br />
For the 5&#215;5 grids, we get 0.253 C per decade for &gt; 500 density stations, which is 35% larger than the 0.188 C trend for &lt; 2 density stations. The difference isn&#8217;t quite as dramatic for the 10&#215;10 grids, with the 0.231 C trend for &gt; 500 density only 16% larger than the 0.198 C trend for &lt; 2 density.</p>
<p>We can compare the percent differences in the trends side-by-side:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-85.png"><img class="aligncenter size-full wp-image-9824" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-85.png" alt="" width="589" height="367" /></a><br />
Now, a word of warning here: just because urban stations appear to be warming 22% faster than rural stations does not mean that 22% of observed warming is due to UHI, because the majority of stations are not urban (indeed, rural outnumber them nearly two to one). Additionally, these graphs don&#8217;t really tell us how a rural station compares to a dark station in its effects, for example. Identifying how much each UHI factor influences the overall observed warming is a bit trickier, and that&#8217;s what we will examine next.</p>
<p style="text-align: left"><strong>Multi-factor comparisons</strong></p>
<p>One way to compare the relative effect of each UHI proxy is to look at a small sub-set of grid cells that contain all the proxies we want to examine. For example, we can find between 40 and 100 grid cells for any given year that contain rural, urban, dark, bright, &lt; 10 pop, and &gt; 100 pop stations. When we construct a temperature record from each set, we get:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-86.png"><img class="aligncenter size-full wp-image-9823" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-86.png" alt="" width="592" height="377" /></a><br />
And we can break down the relative difference in the slope of the subset compared to all stations:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-871.png"><img class="aligncenter size-full wp-image-9825" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-871.png" alt="" width="555" height="375" /></a><br />
For these grid cells, we see that the urban/rural designation has the most impact, followed by population density and nightlights.</p>
<p>However, these results can be sensitive to the grid cells used. If we look at 10&#215;10 grid cells instead of 5&#215;5 grid cells (which, incidentally, lets us add in &lt; 2 density and &gt; 500 density stations), we get a somewhat different spaghetti graph:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-88.png"><img class="aligncenter size-full wp-image-9821" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-88.png" alt="" width="591" height="376" /></a><br />
And the effect comparison:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-89.png"><img class="aligncenter size-full wp-image-9820" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-89.png" alt="" width="554" height="375" /></a><br />
We can look at similar graphs for the max and min temps. Unfortunately, only about half the stations in GHCN provide max and min readings, so we are limited to 10&#215;10 grid cells even for the easy comparisons.</p>
<p>Here are max temps:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-90.png"><img class="aligncenter size-full wp-image-9819" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-90.png" alt="" width="592" height="377" /></a><br />
and min temps:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-171.png"><img class="aligncenter size-full wp-image-9826" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-171.png" alt="" width="592" height="377" /></a><br />
Similar to what we saw in the U.S., min temps show considerably more warming than max temps. If we look at the relative breakdown of each proxy, we see that in all cases UHI factors appear to have a greater impact on the min temp than the max temp:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-92.png"><img class="aligncenter size-full wp-image-9817" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-92.png" alt="" width="555" height="375" /></a><br />
Now, this method of comparing the temperature from all stations to the temperature record constructed using each factor isn&#8217;t ideal (though it makes for pretty comparison spaghetti charge), because of the small number of grid cells available where all factors hold.</p>
<p style="text-align: left">As an alternative, we can do separate pair-wise comparisons of records generated from all grid cells containing a certain proxy (say, Dark nightlights) using both Dark stations and all stations. The difference in the trend between dark stations and all stations gives us a good idea of the relative effect of the dark proxy while allowing us to keep a large sample size of stations and grid cells. If we do this for each proxy, we get the following:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-173.png"><img class="aligncenter size-full wp-image-9832" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-173.png" alt="" width="589" height="366" /></a><br />
Oddly enough, &lt; 10 density seems to have a slightly larger effect than &lt; 2 and &lt; 1 density, though this may be due to the rather small sample size of the latter two.</p>
<p style="text-align: left">To help put the size of these UHI estimates into perspective, I included the difference between the v2.mean temp using all stations and the land temp records provided by <a href="http://data.giss.nasa.gov/gistemp/tabledata/GLB.Ts.txt">GISSTemp </a>(using their own UHI-correction method) and <a href="ftp://ftp.ncdc.noaa.gov/pub/data/anomalies/annual.land.90S.90N.df_1901-2000mean.dat">NCDC</a> (using v2.mean_adj data).</p>
<p style="text-align: left"><strong>Next Steps</strong></p>
<p>Because the GPW population density data provides information for 1990, 1995, 2000, and 2005, it should be possible to compare direct and measurable changes in population density with station readings. Ron Broberg over at the <a href="http://rhinohide.wordpress.com/">Whiteboard</a> was instrumental in helping me obtain year 2000 GPW population density estimates for each GHCN and USHCN stations, and hopefully will be interested in collaborating on a project to measure the effects of density changes on temperature trends.</p>
<p style="text-align: left">
<p style="text-align: left">As always, click on images to embiggen.</p>
<p style="text-align: left">If anyone has suggestions on other subsets of stations that might capture UHI effects, I&#8217;d be happy to take a look.</p>
<p style="text-align: left">
<p style="text-align: left"><strong>Update</strong></p>
<p style="text-align: left">Per Carrot Eater&#8217;s request, here are the 1960-present trends from land stations via GISSTemp, NCDC, and my model&#8217;s output for all GHCN v2.mean stations:</p>
<ul>
<li>GISSTemp &#8211; 0.169 C per decade</li>
<li>NCDC &#8211; 0.221 C per decade</li>
<li>v2.mean &#8211; 0.201 C per decade</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/in-search-of-the-uhi-signal/feed/</wfw:commentRss>
		<slash:comments>169</slash:comments>
		</item>
		<item>
		<title>GISS Anomalies: More Spherical Cow.</title>
		<link>http://rankexploits.com/musings/2010/giss-anomalies-more-spherical-cow/</link>
		<comments>http://rankexploits.com/musings/2010/giss-anomalies-more-spherical-cow/#comments</comments>
		<pubDate>Mon, 08 Mar 2010 18:59:11 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9755</guid>
		<description><![CDATA[The &#8220;toy planet&#8221; discussion of the &#8220;basic&#8221; anomaly method in its most idealized form was such a hit, I&#8217;ve decided to discuss how the &#8216;bias&#8217; method used in section (3) of Hansen and Lebedeff 1987 &#8212; the method actually used by GISTemp&#8211; might be affected by deal with &#8220;the march of the thermometers&#8221;.  One [...]]]></description>
			<content:encoded><![CDATA[<p>The &#8220;toy planet&#8221; discussion of the &#8220;basic&#8221; anomaly method in its most idealized form was such a hit, I&#8217;ve decided to discuss how the &#8216;bias&#8217; method used in section (3) of Hansen and Lebedeff 1987 &#8212; the method actually used by GISTemp&#8211; might be affected by deal with &#8220;the march of the thermometers&#8221;.  One reason I am discussing GISTemp specifically is that Chiefio says:</p>
<blockquote><p>&#8220;The bulk of the GHCN analysis I have done is exactly this kind of “characterize the data” process. And it finds bias “by altitude”, “by latitude”, “by airport percentage”, and a few others. Those biases do not go away from the data just because someone has a way of fixing it that is not used in GIStemp.&#8221;</p></blockquote>
<p>He is correct that <i>if</i> biases are introduced into a data set for some reason, showing those biases are handled correctly by some method <i>other</i> than GISTemp doesn&#8217;t prove GISTemp  handles them correctly. Since Chiefio&#8217;s beef is specifically with GISTemp, I&#8217;m going to examine how GISTemp implements the anomaly method. Specifically, I will be discussing steps 2 and 5 in the Clear Climate Codes version of the program. (For more information on the steps, refer to <a href="http://rhinohide.wordpress.com/2010/03/08/gistemp-high-alt-high-lat-rural/">The  Whiteboard</a>.)  Step 3 implements <i>gridding</i> and <i>setting to a baseline</i> for &#8217;sub-boxes&#8217;; step 5 repeats this to create temperature anomalies for &#8220;boxes&#8221; . Of course both steps rely on GISTemp doing steps 2 and step 4 being sound (i.e. urban adjustment and merging ocean data);  I have not checked those.</p>
<p>My major conclusion will be: In isolation, steps 3 and 5, the &#8220;gridding&#8221; and &#8220;rebaselining&#8221; parts of the code <em>appear to be  &#8220;robust&#8221; to &#8220;the march of the thermometers&#8221; problem.</em>  To show this, I am going to explain what they do using a &#8220;toy&#8221; problem.  </p>
<h3>Gridding</h3>
<p>When discussing this, I will assume that the reader is familiar with the &#8220;toy planet&#8221; idea from <a href="http://rankexploits.com/musings/2010/the-pure-anomaly-method-aka-a-spherical-cow/">Friday&#8217;s post</a>; the purpose of these &#8220;toy planet&#8221; discussions is to help people focus on the general <i>idea</i> of what GISTemp does, and understand whether an issue could possibly be a problem under any circumstances at all. I also assume the reader is somewhat familiar with Hansen and Lebedeff <a href="http://pubs.giss.nasa.gov/abstracts/1987/Hansen_Lebedeff.html">pdf</a>. In particular, they understand that Hansen and Lebedeff&#8217;s method involves dividing the earths&#8217; surface into 80 number of equal area boxes illustrated below:</p>
<div id="attachment_9756" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Hansen_Lebedeff_80_regions.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Hansen_Lebedeff_80_regions-500x365.jpg" alt="GISTemp Hansen and Lebedeff 80 regions (i.e. boxes)" title="Hansen_Lebedeff_80_regions" width="500" height="365" class="size-medium wp-image-9756" /></a><p class="wp-caption-text">Figure 1: GISTemp divides the earth's surface into 80 equal area  'boxes'.</p></div>
<p>Each of the 80 boxes show above is further subdivided into 100 equal area sub-boxes; so there are 8,000 sub-boxes.  Each sub-box contains some number of stations, which I will refer to as &#8220;thermometers&#8221;.  Taking the output from GISTemps steps 1 and 2, GISTemp does this:</p>
<ol>
<li>Step 3: Compute the &#8216;temperature&#8217; in a sub-box from all thermometers in that box. Turn this into a temperature anomaly. (Oddly, for the purpose of entering step 5, this could be skipped.)</li>
<li>Step 4: Do something with ocean data. (I&#8217;m going to ignore this today.) </li>
<li>Step 5: Compute the &#8216;temperature&#8217; in a box based on temperature anomalies in the 100 sub-boxes. </li>
</ol>
<p>I&#8217;ve used the scare-quotes around &#8216;temperature&#8217; judiciously. The reason is that the &#8216;temperature&#8217; exiting an intermediate processing stage in steps 3 and steps 5 are one a somewhat odd impossible to describe baseline. This does not prevent them from being set to an appropriate one in the final operations of steps 3 and steps 5, but it can lead to some confusion if anyone tries to compare these &#8216;temperatures&#8217; to raw data.  </p>
<h3>Step 3: Computing the temperature anomaly for a sub-box.</h3>
<p><u>Theory</u><br />
Section 3. of <a href="http://pubs.giss.nasa.gov/abstracts/1987/Hansen_Lebedeff.html" rel="nofollow"> H&#038;L (pdf)</a> discusses the method of computing the temperature series for exactly one sub-box.  The process uses equations 1-3 below, with the average </p>
<div id="attachment_9759" class="wp-caption aligncenter" style="width: 510px"><br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Equ-1-3-Hansen-and-Lebedeff_1987.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Equ-1-3-Hansen-and-Lebedeff_1987-500x163.jpg" alt="Equations 1-3 Hansen and Lebdeff" title="Equ 1-3 Hansen and Lebedeff_1987" width="500" height="163" class="aligncenter size-medium wp-image-9758" /></a></p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Eqn3_text_Hansen_Lebedeff_1987.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Eqn3_text_Hansen_Lebedeff_1987-500x396.jpg" alt="" title="Eqn3_text_Hansen_Lebedeff_1987" width="500" height="396" class="size-medium wp-image-9759" /></a><p class="wp-caption-text">Figures 2a and 2b: Equations from Hansen and Lebedeff 1987.</p></div>
<p>with the quantities &#8220;T_bar&#8221; defined as in HL figure 5 below:</p>
<div id="attachment_9764" class="wp-caption aligncenter" style="width: 471px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TbarDefinition.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TbarDefinition-461x500.jpg" alt="Definition of T_bar from Hansen and Lebedeff" title="TbarDefinition" width="461" height="500" class="size-medium wp-image-9764" /></a><p class="wp-caption-text">Figure 3: Definition of Tbar.</p></div>
<p><u>Application to Toy Model</u><br />
Recall in Friday&#8217;s &#8220;Toy Model&#8221;, I had a &#8220;planet&#8221; with 5 pairs of thermometers distributed on 5 continents. The two thermometers in a pair were &#8220;near&#8221; each other, with one in a cold place, one in a warm place on that continent&#8211; I suggested one might be on the top of a mountain; on in a valley.  Let&#8217;s tweak the idea and think of the 5 pairs thermometers being a &#8220;sub-box&#8221; with each of the pairs spread around the sub box. Once again, the warm and cold thermometers in a pair are &#8216;near&#8217; each other. For example: maybe a &#8216;warm&#8217; one is mounted over an asphalt covered parking lot at Benedictine University while the &#8220;cool&#8221; one is in a grassy covered field at St. Procopius Abby, just across the street as seen below: </p>
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/WarmColdThermometer.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/WarmColdThermometer-500x306.jpg" alt="" title="WarmColdThermometer" width="500" height="306" class="size-medium wp-image-9765" /></a>
<p>Assume that the crack team of scientists has been monitored these thermometers once each January for 28 years and then for reasons of economy got rid of all the &#8216;cold&#8217; thermometers, retaining the warm ones; they then continued to <I>monitor</i> only the &#8220;warm&#8221; thermometers. Meanwhile, the patient Monks at the monastery continued to record temperature at their &#8220;cool&#8221; thermometer, so it will be available for our analysis with toy data.  Now, while I think we can all agree that there could be quality issues with the thermometer over the asphalt parking lot&#8211; but the question we are asking today is this: </p>
<p><indent>If the trend at all 10 thermometers was similar, and we use GISTemp to compute a temperature <em>anomaly</em>, would the loss of thermometers in month 29 result in any sudden warm bias in the computed anomaly for this grid?</indent></p>
<p>Let&#8217;s step through.  Suppose I have numbered the thermometers such that &#8216;1&#8242; has the longest record and &#8216;10&#8242; has the shortest records; don&#8217;t worry about ties.  Now, suppose all the raw data looks like this:</p>
<div id="attachment_9769" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/RawDataToyModel.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/RawDataToyModel-500x341.jpg" alt="" title="RawDataToyModel" width="500" height="341" class="size-medium wp-image-9769" /></a><p class="wp-caption-text">Figure 6: Raw Data From Toy Model.</p></div>
<p>Note that I made the temperature at &#8220;warm 1&#8243; really, really warm. It&#8217;s warmer than all the other thermometers. Note in year zero, it&#8217;s temperature is near &#8220;3&#8243; in whatever units I apply in this toy world. I made this very warm for a reason. Maybe thermometer is in a really, really big, very very heat absorbing parking lot.  Also, I generated all this data synthetically and I <i>know</i> all 10 series share an underlying trend of -0.02 C/year.  </p>
<p>Given the temperatures from 10 stations above, I could now go through the process of computing the &#8216;temperature&#8217; using equations 1-3 in Hansen and Lebedeff.  The end result of the process is T<sub>1,10</sub>(t); that is the quantity on the left hand side of equation 3b but with n=10.  </p>
<p>To begin, I need to compute values for n=1. This is a unique operation. </p>
<p>If there at data for thermometer &#8220;warm 1&#8243; at time (t), I set T<sub>1,1</sub>(t) = T<sub>1</sub>(t), where T<sub>1</sub>(t) is just the temperatures for &#8220;warm thermometer 1&#8243; shown in the figure above. I compute W<sub>1</sub> using equation (2); to do this I need to know the distance between &#8220;warm thermometer 1&#8243; and the center of the sub-box, d<sub>i</sub>, and the size of my reference distance, D.  I&#8217;d know d<sub>i</sub> based on the latitude and longitude of the thermometer location; in Hansen and Lebedeff this is D=1200 km.   After computing W<sub>1</sub>,  I set W<sub>1,1</sub>= W<sub>1</sub> using equation 2.  (Basically, the &#8216;weight&#8217; for the non-existent thermometer &#8216;zero&#8217; is W<sub>0</sub>=0.)</p>
<p>If there is no data at thermometer 1 at time(t), I leave T<sub>1,1</sub>(t) blank; (I could actually shove whatever I want in this variable. It will be multiplied by zero later).  I then set W<sub>1,1</sub>= 0 as in equation (3c). </p>
<p>I know have an array of T<sub>1,1</sub>(t), which I can plot. In my toy problem, thermometer one has data for all 60 years and it&#8217;s indicated in red below:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/FirstTwoThermometerHansenLebedeff.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/FirstTwoThermometerHansenLebedeff-500x341.jpg" alt="" title="FirstTwoThermometerHansenLebedeff" width="500" height="341" class="aligncenter size-medium wp-image-9772" /></a><div id="attachment_9772" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/FirstTwoThermometerHansenLebedeff.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/FirstTwoThermometerHansenLebedeff-500x341.jpg" alt="" title="FirstTwoThermometerHansenLebedeff" width="500" height="341" class="size-medium wp-image-9772" /></a><p class="wp-caption-text">Figure 7: T1,1(t) and T1,2(t) computed using equation 3b in Hansen and Lebedeff.</p></div></p>
<p>Notice that the initial temperature for &#8216;thermometer 1&#8243; is near &#8220;3&#8243; in whatever the heck temperature scale we are using. (This is a toy model after all.)  That&#8217;s hotter than any other thermometer. </p>
<p>Next, I need to compute the T<sub>1,<font color="blue">2</font></sub>(t), which is the temperature after incorporating in the data from thermometer n=<font color="blue">2</font>; in our case, this is &#8220;warm 2&#8243;.  To do this, I</p>
<ol>
<li>1. Identify times (t) where I have readings from <i>both</i> thermometers 1 and 2. In the next step use compute averages over these specific thermometer readings. ( In this toy example, when incorporating 2nd thermometer (i.e.  n=<font color="blue">2</font> ) case, both thermometers provide readings for all 60 years. ) </li>
<li>2. Compute the average temperature all thermometer 2. That&#8217;s called T_bar<sub>2</sub> in equation (1).  Using the same time periods, I compute the average temperature over all computed values for T<sub>1,<font color="blue">1</font></sub>(t) I obtained in the previous step.  That&#8217;s called T_bar<sub>1,<font color="blue">1</font></sub> in equation (1).  I subtract to obtain &delta;T<sub>2</sub>.  </li>
<li>3. Knowing the distance of thermometer 2 from the center of the sub-box,  compute W<sub>2</sub> using equation (2).</li>
<li>4. Compute W<sub>1,<font color="blue">2</font></sub>(t) using equation 3(a) or 3(c) to for each of the 60 times. In this toy example, I have temperature readings at thermometer <font color="blue">2</font> at every single time, so I use 3(a) in every case. </li>
<li>5. Compute T<sub>1,<font color="blue">2</font></sub>(t) using equation 3(b) or 3(d) for each of the 60 times.  In this toy example, I have temperature readings at thermometer <font color="blue">2</font> at every single time, so I use 3(b) in every case.</li>
</ol>
<p>At the end of this step, I have incorporated thermometer &#8220;2&#8243;. This is illustrated with yellow symbols in the figure above.</p>
<p><u>Interlude</u><br />
Now, recall that when we looked at <i>raw</i> data, warm thermometer 1 was much warmer than thermometer 2. But, at this stage in the process, the temperatures for thermometer &#8220;2&#8243; have been shifted warm.  Weird&#8230; huh?</p>
<p>This is an <i>intermediate</i> result of the Hansen and Lebedef method. When processing a &#8220;sub-box&#8221;, as data from each new thermometer, &#8216;n&#8217; is added to T<sub>1,<font color="blue">n-1</font></sub>(t), the temperature from the <i>new</i> thermometer is shifted so that the average of T<sub><font color="blue">n</font></sub> matches the average of T<sub>1,<font color="blue">n-1</font></sub> in the overlap region.   This means we are <i>not</i> computing the average temperature of the sub-box. That&#8217;s not a problem because <I>no one claims  T<sub>1,<font color="blue">n</font></sub>(t) is the average temperature in the box.</i>  The only thing anyone claims about  T<sub>1,<font color="blue">n</font></sub>(t) is that it correctly represent <i>changes</i> in temperatures over time. </p>
<p><u>Back to computation.</u><br />
In the toy problem I have 10 thermometers, so I need to repeat the process for incorporating temperature 2 with thermometers 3-10.   For this toy problem, thermometers 3-5 have data for all 60 years. This means data from these thermometers is incorporated exactly as we incorporated data for thermometer 2. I can keep cranking away without worry about the slightest change until I get to thermometer 6. </p>
<p>Thermometer 6  is the first &#8220;cold&#8221; thermometer and doesn&#8217;t have data after year 28.  So, when incorporate data for thermometer n=6,  I need to notice this that data only exist at <i>both</i> thermometer during the first 28 years. So, when computing the T_bar<sub>6</sub> and T_bar<sub>5</sub> for step 2, I compute the average over  only those 28 years. When I get to step 4, I notice that for early years, I have data at both thermometers, so I use 3(a) and 3(b) in steps 4 and 5. In later years, I use 3(c) and 3(d).  </p>
<p>It&#8217;s worth comparing T<sub>1,<font color="blue">5</font></sub>(t) to T<sub>1,<font color="blue">6</font></sub>(t). Note that T<sub>1,<font color="blue">5</font></sub>(t) = T<sub>1,<font color="blue">6</font></sub>(t) after 28 years, but differ before.  </p>
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/5th_6th-thermometers-HansenLebedeff.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/5th_6th-thermometers-HansenLebedeff-500x341.jpg" alt="" title="5th_6th thermometers HansenLebedeff" width="500" height="341" class="size-medium wp-image-9773" /></a>
<p>Because this post is motivated by some people&#8217;s worries that dropping all the &#8220;cold&#8221; thermometers in the GCHN record might cause GISTemp temperature to suddenly &#8220;jump&#8221; when the &#8220;cold&#8221; thermometers drop out, there are a few things worth noting. </p>
<p>1. Even though thermometer 6 is &#8220;cold&#8221; relative to the earlier thermometers updating to compute T<sub>1,<font color="blue">6</font></sub> by incorporating the &#8220;cold&#8221; thermometer into the computed value of T<sub>1,<font color="blue">5</font></sub>(t) does not drag the temperature down.</p>
<p>2. There is no &#8220;jump&#8221; at year 28 in the computed value of T<sub>1,<font color="blue">6</font></sub>(t).  </p>
<p>Once thermometer 6 is added, we repeat the procedure to incorporate 7-10.  The final result is T<sub>1,<font color="blue">10</font></sub>(t), illustrated by the blue diamonds below:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Thermometer10.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Thermometer10-500x341.jpg" alt="" title="Thermometer10" width="500" height="341" class="aligncenter size-medium wp-image-9774" /></a></p>
<p>Note the following features of the trend at this point:</p>
<p>1. The temperature T<sub>1,<font color="blue">10</font></sub>(t) is near the values for &#8220;really, really warm thermometer 1&#8243; shown in the figure illustrating &#8220;raw&#8221; data T<sub>1</sub>(t); the process did not seem to produce anything anyone would call an &#8220;average&#8221; temperature over all 10 thermometers in the grid box. This is because the process isn&#8217;t even <i>trying</i> to compute a true average temperature. It just doesn&#8217;t care. It wants a temperature that will be set to a baseline in the next step, and then used to compute averages for all stations. </p>
<p>2. T<sub>1,<font color="blue">10</font></sub>(t) <I>less  noisy</i> that T<sub>1,<font color="blue">10</font></sub>(t). Blending to reduce noise <I>is</i> a goal for the process.</p>
<p>3. When computed using all thermometers, T<sub>1,<font color="blue">10</font></sub>(t) does have a trend near -0.02 C/century as it should.  That is: the method creates a product that captures the correct trend. However, the actual temperatures may be shifted by some constant amount from the real honest to goodness average in the sub-box.  We don&#8217;t know what that constant is. For the purpose of determining <i>changes</i> in temperatures from year 0 to year 28 <i>we don&#8217;t care.</i> </p>
<p>4. When computed using with all cool thermometers dropped at year 28, T<sub>1,<font color="blue">10</font></sub>(t) also has a trend near -0.02 C/century as it should.  The  &#8220;march of the thermometers&#8221; did not affect this trend.  </p>
<p>So, the temperature in the grid box is ok.</p>
<p>Now, suppose we want to turn this into an anomaly based on the average for years 1-20. What do we do? Well, we compute the average temperature of T<sub>1,<font color="blue">10</font></sub>(t) based on years 1-20 and then subtract that average from every one of the 60 values of<br />
<indent>T<sub>1,<font color="blue">baseline</font></sub>(t) = T<sub>1,<font color="blue">10</font></sub>(t)- T<sub>average</sub>. </indent></p>
<p>This is now an <i>anomaly</i>. The method differs from the one I describe Friday, but it is equally robust to &#8220;march of the thermometers.</p>
<h3>What about the boxes?</h3>
<p>So far, I only described how to compute the temperatures of one sub-box. There are 100 of those in each &#8220;box&#8221;? How do we compute the temperature for the boxes? </p>
<p>Simple: Use the same process as for computing the temperature of a sub-box, <i>except</i></p>
<ol>
<li>Treat the &#8220;sub-boxes&#8221; as &#8220;thermometers&#8221;.</li>
<li>Treat the &#8220;boxes&#8221; as &#8220;sub-boxes&#8221;.</li>
<li>Simplify eqn (2) in Hansen and Lebedeff with W<sub>n</sub>=1 for all n.</li>
</ol>
<p>Obviously, since the method for computing temperatures of sub-boxes is robust to &#8220;march of the thermometers&#8221;, the method for computing the temperature of a &#8220;box&#8221; is robust to &#8220;march of the thermometers&#8221;. This is true for arbitrary toy data and should be true for real data.  The only caveat is that the data for individual stations must be good enough data. </p>
<h3>Conclusions: Going forward</h3>
<p>I can&#8217;t claim to have throroughly tested every pesky thing in step 3 and step 5 of GISSTemp. This is a &#8220;toy data&#8221; and designed to test precisely one feature: Does a sudden drop out of all &#8216;cold&#8217; thermometers in and of itself, suddenly inject any bias in the computed temperature trends in GISTemp.  Toy data is useful for answering these sorts of questions.</p>
<p>The conclusion stepping through the equations in Hansen and Lebedeff is that if implemented correctly, that the gridding portion of the GISTemp method should be robust to &#8220;march of the thermometers&#8221;. Specifically, it correct problems that would arise if one did <i>not</i> use an anomaly method when trying to determine whether the earth&#8217;s surface is warming overall.  </p>
<p>I also examined the Clear Climate Code implementation, and the code text performed the steps I describe above.  To the extent that CCC is a faithful re-working of GISTemp, this should mean that gridding in GISTemp should minimize any trend-injection due to a while bunch of cold stations being dropped circa 1980-1990.  </p>
<p>For those of you who do believe there are problems with GISTemp, and believe this based on runs posted by The Chiefio, the look for problems in homogenization, infilling or corrections to  UHI that are not necessarily specific to any &#8220;march of the thermometers&#8221;.  Then, if possible, describe precisely what feature of data causes GISTemp to &#8220;break&#8221; and create false trends. Anything that can happen with real data can also happen with properly created synthetic data. So, if you find a feature, we should be able to test that&#8211; either by running CCC with synthetic data or by looking at the mathematical descriptions published by Hansen himself.</p>
<p>Right now, it appears that loss of &#8220;cold&#8221; thermometers <em>by itself</em> <strong>cannot</strong> inject a warming trend into the data.   This demonstration should supplement all the other demonstrations that suggest that when run using real data the mere dropping of warm thermometers <i>does not</i> inject a warming trend into data. </p>
<h3>The spreadsheet</h3>
<p><a href='http://rankexploits.com/musings/wp-content/uploads/2010/03/AnomalyHansenLedebeff.xls'>AnomalyHansenLedebeff</a></p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/giss-anomalies-more-spherical-cow/feed/</wfw:commentRss>
		<slash:comments>493</slash:comments>
		</item>
		<item>
		<title>UAH Betting Results: Based on V 5.2</title>
		<link>http://rankexploits.com/musings/2010/uah-betting-results-based-on-v-5-2/</link>
		<comments>http://rankexploits.com/musings/2010/uah-betting-results-based-on-v-5-2/#comments</comments>
		<pubDate>Sat, 06 Mar 2010 03:58:19 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Betting]]></category>
		<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9733</guid>
		<description><![CDATA[Roy Spencer posted two versions of the UAH temperature of the lower troposphere! According to version 5.3, the temperature of the lower troposphere was 0.63C; according to version 5.2, it was 0.74C.  Despite the fact that I my bet of 0.64 would have been closer to correct using the splendid new version 5.3, we [...]]]></description>
			<content:encoded><![CDATA[<p>Roy Spencer posted <I>two</I> versions of the <a href="http://www.drroyspencer.com/2010/03/february-2010-uah-global-temperature-update-version-5-3-unveiled/">UAH temperature</a> of the lower troposphere! According to version 5.3, the temperature of the lower troposphere was 0.63C; according to version 5.2, it was 0.74C.  Despite the fact that I my bet of 0.64 would have been closer to correct using the splendid new version 5.3, we will be using version 5.2 to distributed quatloos. </p>
<p>This makes PEHarvey  this month&#8217;s top winner: he was who was bang on correct! Congratulations. Spend those quatloos wisely.</p>
<p>I am still able to access the detailed values for version 5.2 <a href="http://vortex.nsstc.uah.edu/public/msu/t2lt/tltglhmam_5.2">here</a>; as usual, I tacked the reading Roy posted for version 5.2 onto the record and plotted temperatures and trends from the inception of the record, since 2000 and since 2001.  I&#8217;ve also highlighed all Januaries and ran a trace so you can see how many months had temperatures exceeding the most recent one:<div id="attachment_9753" class="wp-caption aligncenter" style="width: 510px"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/UAH-Feb.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/UAH-Feb-500x341.jpg" alt="" title="UAH-Feb" width="500" height="341" class="size-medium wp-image-9753" /></a><p class="wp-caption-text">Figure 1: UHA TLT with OLD system.</p></div><br />
<span id="more-9733"></span><br />
I think it&#8217;s fair to say that according to version 5.2, this month was a scorcher. It&#8217;s the 2nd hottest February in the record, and the 3rd hottest day in the record.  </p>
<p>I&#8217;m sure you are all wondering what plots using version 5.3 look like. I have no idea. I don&#8217;t know the url where those will appear!  I suspect Roy will announce that address pretty soon. (Or maybe Anthony Watts beat him to the punch. Anthony&#8217;s ear to the ground sometimes seems to know things before anyone else!)</p>
<p>Meanwhile, for those of you wondering if you won any quatloos, the winnings are posted  below.</p>
<p></p><p></p>
<p><!-- ComputeWinnings=0.231?Display=1?Testing=1?DisplayBets=1Testing=1?ComputeWinnings=0.231?Testing=1?Observed=0.284?ComputeWinnings=1?Testing=1? --></p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/uah-betting-results-based-on-v-5-2/feed/</wfw:commentRss>
		<slash:comments>74</slash:comments>
		</item>
		<item>
		<title>The Pure Anomaly Method: AKA A spherical cow.</title>
		<link>http://rankexploits.com/musings/2010/the-pure-anomaly-method-aka-a-spherical-cow/</link>
		<comments>http://rankexploits.com/musings/2010/the-pure-anomaly-method-aka-a-spherical-cow/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 18:38:19 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9697</guid>
		<description><![CDATA[As some of my readers know, when someone raises a criticism of a particular widely used result in climate science, I often like to try to understand the main issue and discuss it using a toy problem or &#8220;cartoon&#8221;. That is: I like to create an artificial problem that highlights the issue that someone claims [...]]]></description>
			<content:encoded><![CDATA[<p>As some of my readers know, when someone raises a criticism of a particular widely used result in climate science, I often like to try to understand the <i>main</i> issue and discuss it using a <i>toy</i> problem or &#8220;cartoon&#8221;. That is: I like to create an artificial problem that highlights the issue that someone <i>claims</i> is causing a problem, and then explore whether or not that issue could either a) never-ever-ever possibly cause a problem, b) could cause a problem in some specific circumstances and c) if (b) figure out how close to that circumstance we might be.  These sorts of abstract cartoon  problems are actually very important to engineering&#8211; but I digress.</p>
<p>During the discussion of possible bias caused by &#8220;the march of the thermometers&#8221;, there has also been a lot of discussion of &#8220;the anomaly method&#8221;, along with people advancing theories about whether or not use of &#8220;the anomaly method&#8221; guaratees there can be no bias caused by &#8220;the marchof the thermometers&#8221;.  The answer, as in many problem is: Depends on what you mean by &#8220;the anomaly method&#8221; and how it is implemented. Today, I am going to discuss the anomaly method <i> in its simplest and most ideal form</i> and and explain how, when applied in its ideal form, anomalies can be useful for detecting temperature <i>changes</i> even when we do not know <i>absolute values</i> in temperatures.  </p>
<p>To support my explanation, I am going to create a &#8220;toy planet&#8221;, one which someone has placed thermometers which provide us with time series. The &#8220;temperatures&#8221; at each thermometer will be generated synthetically.  Because the data are synthetic, we will <i>know</i> the trend that would be measured if we could repeat the experiment with an infinite number of thermometers.  Then, we will make the someone decides to be more thrifty and reduce the number of thermometers. Using that data, we will estimate the synthetic trend based using the anomaly method. This will be contrasted with using a method based on simple weighting of the thermometers.</p>
<h3>The Toy Planet</h3>
<p>The &#8220;toy&#8221; planet will be assumed to be a planet with 5 large continents called each of which consists of mountains and vallies. A crack team of scientists will decide to place two thermometers on each continent with one placed on a mountain and one placed in the valley. These will be called the &#8220;cool&#8221; and &#8220;warm&#8221; thermometers respectively.</p>
<p>The scientists will start monitoring temperatures for some unstated reason.   During the first 25 years, the scientists continue to support all 10 thermometers, fixing broken ones running QA checks etc. But after a while, someone cuts their budget. Inspecting the data, the scientists decide they don&#8217;t need to record the temperatures from all 10 thermometers. To save manpower, they can record the temperature from with 5: one from each continent. Because mountain tops stations are difficult to maintain, they stop recording the mountain top thermometers. (Meanwhile a secret band of conspirators does record those, so <i>we</I> here in toyland can look at those temperatures.)</p>
<p>Then, in year 35, someone advances the theory that the planet is cooling at a rate of 0.2C a year. For the purpose of the toy-problem, this theory will turn out to be <I>correct</i>. (Otherwise, the toy model won&#8217;t teach us anything!) Suddenly, the scientists want to figure out if they can detect this based on the data they have.</p>
<p>What to do?  We are going to see that if they use the <i>anomaly</i> method the will be able to detect a trend.    </p>
<h3>Raw temperatures.</h3>
<p>Let&#8217;s now start to see whether we can detect the temperature trend on this planet. But since this is a toy planet, and the purpose is to explain how anomalies help us do this, I&#8217;m going to let the cat out of the bag and tell you that all temperature are generated synthetically, and the trend for the &#8216;planet&#8217; is -0.2C/year and the raw temperatures from all 10 thermometers for 35 years and the simple weighted average from all 20 thermometers are shown below:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TenRawTemps.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TenRawTemps-500x341.jpg" alt="" title="TenRawTemps" width="500" height="341" class="aligncenter size-medium wp-image-9707" /></a></p>
<p>Now, lets focus on  the warm and cold thermometers in region 1, including all thermometer readings (whether or not the crack team of scientists recorded them).  The temperature in the warm valley, on the cool mountain top and the contentinental average &#8212; estimated as the sum over both &#8212; are all shown below.  Notice all three series exhibit a downward trend. If we fit a straight line, all three series are &#8220;noisy&#8221; in the sense that the temperatures seem to vary randomly about a straight line. ( Of course, I generated them to do that.)</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Region1RawTEmps.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Region1RawTEmps-500x341.jpg" alt="" title="Region1RawTEmps" width="500" height="341" class="aligncenter size-medium wp-image-9709" /></a></p>
<p>Note that the trends for each thermometer are shown&#8211; as are the trends from all three thermometers.</p>
<h3>Temperature Anomalies</h3>
<p>Let&#8217;s suppose someone suggested that it seemed to him that the trends are all more-or-less similar. He wanted to figure out a way to see if the &#8220;noise&#8221; at the mountain top and valleys was similar. So, he creates an anomaly as follows: For the &#8220;warm&#8221; series, compute the average temperature over the first 20 years. Subtract that from all temperature from every warm temperature in the series. Repeat this for the &#8220;cool&#8221; series, and the average series.</p>
<p>This is plotted below:<br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/REgion2Anoms.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/REgion2Anoms-500x341.jpg" alt="" title="REgion2Anoms" width="500" height="341" class="aligncenter size-medium wp-image-9710" /></a></p>
<p>(Note: If you compare the trend for the &#8220;average&#8221; temperature computed using both the anomaly method to the trend based on the &#8220;raw&#8221; temperature shown in the previous figure, the two don&#8217;t match exactly.  This is due to a pesky EXCEL feature, which refreshes every time I copy an image!  However, if I don&#8217;t copy and image and just click back and forth in EXCEL,  both trend are always identical because when both series are complete with no dropouts, with respect to computing trends, the anomaly method is no different from the &#8220;raw&#8221; method. This is true even if, owing to &#8220;noise&#8221;, the trend from the &#8220;warm&#8221;, &#8220;cold&#8221; and average series are not equal to the &#8220;known&#8221; underlying trend of -0.2C/year. ) </p>
<p>Despite the fact that the computed trends will be identical computed using either method, I think you can already see a strength of the anomaly method: Using the &#8220;eyeball method&#8221;, you can already see that noise is the series is highly correlated. You also begin to detect the fact that the warm and cold thermometers may share a trend. (In this toy problem, they do.)   You can also being to imagine that you could make a pretty good guess about temperature at the top of the mountain using the thermometer in the valley. Your guess would be <i>imperfect</i>, but the two are correlated.  </p>
<p>For what it&#8217;s worth, here ar the temperature anomalies for all 10 thermometers.</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/TenTempAnoms.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/TenTempAnoms-500x341.jpg" alt="" title="TenTempAnoms" width="500" height="341" class="aligncenter size-medium wp-image-9718" /></a></p>
<h3>Thermometer Drop Out</h3>
<p>Now, back to the crack team of scientists. Remember, they didn&#8217;t come up with the theory of toy-planet cooling until <i>after</i> they stopped recording temperatures from the mountain top thermometers. Nevertheless, they are interested in testing the theory.  What can they do?</p>
<h3>Trends based on Raw Temperatures</h3>
<p>First, someone could suggest that raw temperatures are the only &#8220;thermodynamically meaningful&#8221; thing, mumbling words like &#8220;enthalpy&#8221;. That person might decide that,for this reason, they must, must, must use raw temperatures to do anything and everything.  So, they could compute the temperature by simply averaging over all thermometers&#8211; never mind that some were at the top of mountains and some at the bottom, and this scientists does not know the readings from the &#8220;cool&#8221; thermometers after year 25.</p>
<p>The plum colored circle shows what this scientist would compute for the &#8220;average temperature&#8221; over all thermometers.  For the first 25 year, it is equal to the average over the 5 warm-valley thermoeters; for the final 10 years it is equal to temperatuer of the cold thermometers.  what this scientist comes up with:</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/DropAnomRaw.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/DropAnomRaw-500x341.jpg" alt="" title="DropAnomRaw" width="500" height="341" class="aligncenter size-medium wp-image-9712" /></a></p>
<p>Note that while the average over all 10 thermometers would have shown cooling, averaging over the available thermometers shows warming. This happens when you <i>don&#8217;t</i> use the anomaly method.  Of course, unless this scientists didn&#8217;t look at the mountain and valley temperature separately, this scientist would like notice the problem because he could easily see that both the mountain top and valley thermometer show cooling.  </p>
<h3>Trends based on Anomalies</h3>
<p>So, what if the scientist recognized that he is <i>not</i> doing a thermodynamics problem. He is merely trying to detect whether or not the surface of the earth cooled. He might also want to use the maximum number of measurements available (because that results in less noise). He previously noticed the fact that the anomaly method tended to make all the data collapse around a singal trend. He might decide to do this:</p>
<p>1) Create anomalies for all 10 time series, using the baseline from 1-20.<br />
2) During periods when he has measurements from both warm and cool thermometers, compute the temperature <i>anomaly</i> for each region toy planet by averaging the anomalies from <i>both</i> warm and cool thermometers.<br />
3) When only only 1 thermometer is available, compute the anomaly for the continent using the anomaly from that thermometer.<br />
4) Compute the average for the toy planet by averaging over all 5 continents. </p>
<p>Below, I&#8217;ve illustrated the anomalies obtained as if the scientist had access to all 10 thermometers during the entire 35 years, and what he gets if he computed the anomaly with data available to him. (That is: ignoring temperaures from the cold thermometers after year 25.)</p>
<p><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/DropThermometerAnom1.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/DropThermometerAnom1-500x341.jpg" alt="" title="DropThermometerAnom" width="500" height="341" class="aligncenter size-medium wp-image-9714" /></a></p>
<p>Note that by using the anomaly method, the scientists gets much closer to the right answer. The data show cooling; his anomaly shows cooling.  </p>
<h3>Advantage of Anomaly Method</h3>
<p>Examining this problem we can see the major advantage to the anomaly method: When the avearge local temperature varies&#8211;as for mountains and vallies&#8211; and a database of temperatures over a long time period includes incomplete records, the anomaly method reduces any &#8220;warming&#8221; or &#8220;cooling&#8221; bias introduced by adding or subtracting thermometers from warmer or cooler regions.  </p>
<p>Does the anomaly method fix everything? No. In this problem, I used synthetic data. When doing this, the synthetic series for every thermometer was generated to give a trend of -0.2 C/century.  If I&#8217;d made the mountains and valleys warm at different rates, dropping out thermometers would have resulted in bias.  </p>
<p>Does this result show that the anomaly method <i>as implemented by GISSTemp, CRU or NOAA</i> cannot have any problems? No. I don&#8217;t know precisely how those groups <i>implement</i> the anomaly method. The implementation in GISSTemp is is sufficiently complicated that it needs to be inspected to verify that the method properly.  I&#8217;m pretty sure Hansen <i>intended</i> for the anomaly concept to be carried forward and applied fully&#8211; but I haven&#8217;t personally checked this.  Moreover, my <I>impression</I> based on Chiefio&#8217;s posts is that he thinks something happens during the complicated series of adjustments to prevent the anomalies from being applied fully.  So, he would describe the toy problem I show as &#8220;a spherical cow&#8221;.   </p>
<p>Do the results of this analysis mean the only anomalies are &#8220;interesting&#8221;? No. It means the anomaly method is advantageous when we want to tease out the trend for the earth&#8217;s surface temperature from the suboptimal database of temperatures we earthlings are stuck with.  In other circumstances, the anomaly method can conceal rather than reveal information.  For example, when testing whether or not climate models faithfully reproduce the climate of the earth, the anomaly method can <I>hide</I> problems. For example, if a model matches trends in surface temperature pretty well, but matches the surface temperatures of the earth poorly, a modeler might hide the short comings of his model by highlighting comparisons on an anomaly basis while avoiding comparisons to absolute temperatures.  They might, for example, highlight anomaly graphs in summaries for policy makers while relegating comparisons to absolute temperatures to the appendices of longer chapters.   An entire community might go so far as to decree that anomalies are the only &#8220;interesting&#8221; feature, when, in fact, both anomalies and absolute values are &#8220;interesting&#8221; with each revealing answer to different questions. </p>
<p>Returning to the question that has been perplexing us these past few weeks: Does the march of the thermometers introduce bias into metrics like GISSTemp?  As I noted, Chiefio thinks so.  It seems unlikely &#8212; and more and more unlikely as various people look at data in various ways. That said, I think Carrot Top and Nick Barnes are looking a the clear climate code and tweaking it to run a number of experiments that change the way the code deals with anomalies. </p>
<p>I&#8217;m beginning to think a good experiment would be to modify the <i>input</i> to test GISSTemp on synthetic data.  I hate to suggest runs and assign the work to the wide world of &#8220;someone other than me&#8221;.  So,  I&#8217;m <i>almost</i> tempted to figure out how to run Python on my mac and maybe figure out how to change the input files to compute results using &#8220;synthetic&#8221; data as input to see what happens. </p>
<p>I&#8217;ve never run or written Python. It&#8217;s going to be warm out this weekend.  So, for now: Nah.  But, hypothetically, it should be easy enough for a programmer to run CCC or GISSTemp replacing the actual data with appropriately difficult synthetic input and dropping the &#8220;cold&#8221; thermometers to see if this results in any warm bias. I think that would resolve the question of whether &#8220;the march of the thermometers&#8221; could <i>even hypothetically</i> result in a warm bias. If a big snow storm hits and people are stuck inside, maybe an eager beaver will do it.  <img src='http://rankexploits.com/musings/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<h3>End Notes;</h3>
<ol>
<li><u>EXCEL Spreadsheet</u>: <a href='http://rankexploits.com/musings/wp-content/uploads/2010/03/AnomalyNutshell.xls'>AnomalyNutshell</a></li>
<li>Chiefio&#8217;s upcoming post to explain <a href="http://chiefio.wordpress.com/2010/02/26/assume-a-spherical-cow-therefor-all-steaks-are-round/">how the march of the thermometers biases GISSTemp.</a> (I think.)</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/the-pure-anomaly-method-aka-a-spherical-cow/feed/</wfw:commentRss>
		<slash:comments>256</slash:comments>
		</item>
		<item>
		<title>A detailed look at USHCN min/max temps</title>
		<link>http://rankexploits.com/musings/2010/a-detailed-look-at-ushcn-mimmax-temps/</link>
		<comments>http://rankexploits.com/musings/2010/a-detailed-look-at-ushcn-mimmax-temps/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 19:33:44 +0000</pubDate>
		<dc:creator>Zeke</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9658</guid>
		<description><![CDATA[This post builds on a simple spatial gridding model outlined here. As always, the latest source code can be found at http://drop.io/0yhqyon, and I welcome folks helping improve it.
If we are looking at the effect of adjustments and station quality, we should really be looking at maximum and minimum temperature data rather than mean data, [...]]]></description>
			<content:encoded><![CDATA[<p>This post builds on a simple spatial gridding model outlined <a href="http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/">here</a>. As always, the latest source code can be found at <a href="http://drop.io/0yhqyon">http://drop.io/0yhqyon</a>, and I welcome folks helping improve it.</p>
<p>If we are looking at the effect of adjustments and station quality, we should really be looking at maximum and minimum temperature data rather than mean data, because mean data simply averages the max/min temps and a fair bit of information is lost in the process.</p>
<p><span id="more-9658"></span></p>
<p>Before we dive in to a look at the effects of adjustments (TOB and non-TOB) for all stations and CRN12/CRN345 stations, lets step back and talk a bit about baselines, anomalies, and grid sizes.</p>
<p>One of the downsides of the anomaly approach is that it makes it difficult to compare the magnitude of the difference between anomalies for different series at any discrete point in time. Comparing the trends is easy, but because anomalies are calculated respective to each series, its hard to definitively claim that a particular series was adjusted &#8220;down&#8221; or &#8220;up&#8221; relative to another because such statements are highly dependent on the chosen baseline. For example, if a series was not adjusted at all for the first half, and adjusted quite a bit for the second half, a simple comparison of the adjusted and non-adjusted anomalies using a baseline that stretches the whole length of the series would tend to show the adjusted series having lower temps than the non-adjusted series near the beginning and higher near the end, despite the fact that all adjustments really occurred in the second half.</p>
<p>Thankfully, we know from USHCN that most adjustments to the data did occur in the latter part, e.g. via charts like:<br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/ts.ushcn_anom25_diffs_urb-raw_pg.gif"><img class="aligncenter size-medium wp-image-9659" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/ts.ushcn_anom25_diffs_urb-raw_pg-500x386.gif" alt="" width="500" height="386" /></a><br />
(which shows v1 adjustments, but v2 show a similar pattern). To better display the net effect of adjustments over time, I&#8217;ve chosen to use a 1900-1940 baseline for all charts that show multiple adjustments to the same series, as that is the period that appears to have the least adjustments in USHCN. For charts that compare two different series I&#8217;ll stick with my standard baseline, mostly because there may be secondary effects (e.g. station availability) back that far that make a difference when comparing anomalies generated from differing sets of stations.</p>
<p>When I initially analyzed USHCN data I used the same 5&#215;5 grid as I was using for GHCN. However, upon reading a bit more about the <a href="http://www.ncdc.noaa.gov/oa/climate/research/ushcn/gridbox.html">methods NCDC uses</a> in gridding USHCN data I decided emulate their preferred approach of using 2.5&#215;3.5 grid boxes for the U.S. lower 48. Because the U.S. has so many stations, we can afford to lower the spatial resolution a bit without introducing regional biases, and the 2.5&#215;3.5 resolution appears to give us a good tradeoff between resolution and stations per grid box. The net effect of this switch appears to be a slight reduction in the U.S. lower 48 temperature trend vis-a-vis the 5&#215;5 grid boxes:</p>
<p style="text-align: left">
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1381.png"><img class="aligncenter size-full wp-image-9668" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1381.png" alt="" width="519" height="305" /></a></p>
<p style="text-align: left">
<p style="text-align: left"><strong>All U.S. Stations</strong></p>
<p style="text-align: left">Lets start our analysis of USHCN max/min temp data by looking at all stations and the sum total of adjustments (TOB and non-TOB).</p>
<p style="text-align: left">
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1372.png"><img class="aligncenter size-full wp-image-9666" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1372.png" alt="" width="599" height="351" /></a></p>
<p>We see from this that the net adjustments have a much larger effect on max data than min data, have a positive effect on the trends of both, and tend to make the trends of max and min temps much more similar than we see in the raw data. Next lets break down the adjustments a bit and look at max and min one at a time.</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-139.png"><img class="aligncenter size-full wp-image-9669" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-139.png" alt="" width="599" height="352" /></a><br />
For max temp data, the non-TOB adjustments represent the bulk of total adjustments. The net effect of the adjustments is to strongly increase the trend in recent years.</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-140.png"><img class="aligncenter size-full wp-image-9670" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-140.png" alt="" width="600" height="353" /></a><br />
For min temp data, we see quite a different pattern in adjustments. Here TOB adjustments have a slightly larger effect, while non-TOB adjustments are almost negligible. The trend in min temps over the past few decades is much stronger in the min raw data than the max raw data. Lets take a look at how non-TOB adjustments affect the resulting mean temperatures:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-141.png"><img class="aligncenter size-full wp-image-9671" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-141.png" alt="" width="601" height="354" /></a><br />
From this we can see that the net effect of non-TOB adjustments appears to create a mean trend that is quite similar to the trend in min temps for the U.S.</p>
<p style="text-align: left">
<p style="text-align: left"><strong>CRN12 and CRN345 Stations</strong></p>
<p style="text-align: left">Now, lets dig in a bit more and look at the same charts for well-sited stations (CRN12) and poorly-sited stations (CRN345) using the designations from SurfaceStations.org utilized in Menne et al.</p>
<p>First we have max temp data for CRN12</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-144.png"><img class="aligncenter size-full wp-image-9672" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-144.png" alt="" width="600" height="353" /></a><br />
And max temp data for CRN345</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-143.png"><img class="aligncenter size-full wp-image-9673" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-143.png" alt="" width="601" height="354" /></a><br />
Before we look at any adjustments, we can see that the trend in raw max data varies dramatically for the two classes of stations. The raw anomaly is only ~0.4 C for the CRN345 stations in the last decade, while it reaches close to 1 C in CRN12 stations! The TOB adjustments are similar for both, though slightly higher for CRN345 than CRN12. The non-TOB adjustments actually slightly decrease the trend in CRN12 stations, while fairly dramatically increasing the trend in CRN345 stations. Taken together, the net effect of the max temp adjustments to CRN345 stations is to make the adjusted CRN345 max temp record quite similar to both the raw and adjusted CRN12 max temp record. Now we turn to min temps, which are a bit more interesting.</p>
<p>Here are min temps for CRN12</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-145.png"><img class="aligncenter size-full wp-image-9674" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-145.png" alt="" width="601" height="354" /></a><br />
And min temps for CRN345</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-147.png"><img class="aligncenter size-full wp-image-9675" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-147.png" alt="" width="602" height="355" /></a><br />
Here the roles are reversed; raw min temps have a much lower trend in CRN12 data than CRN345 data, reaching only slightly below 0.4 C in CRN12 and to almost 0.8 C in CRN345. At first I thought somehow I must have switched up the data, but upon double checking it is indeed correctly plotted. Again, the TOB adjustments are similar in both sets, though again noticeably larger in min CRN345. Its the non-TOB adjustments that reveal a more perplexing picture.</p>
<p>The non-TOB adjustments in min CRN345 reduce the trend slightly, and seem to be quite similar to the non-TOB adjustments we saw in max CRN12.</p>
<p>We see a large positive non-TOB adjustment in CRN12 min data. If we look closer, it appears to mainly be a step-change in the data in the 1940s:</p>
<p style="text-align: left">
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-146.png"><img class="aligncenter size-full wp-image-9676" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-146.png" alt="" width="526" height="309" /></a><br />
The net effect of all adjustments is, again, to make the CRN12 and CRN345 min temps much more similar, though in the opposite way that we saw for max temps.</p>
<p>Now lets try our hand at replicating the first half of Figure 2 in Menne et al 2010:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-148.png"><img class="aligncenter size-full wp-image-9677" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-148.png" alt="" width="608" height="230" /></a><br />
First we need to change the baseline to 1979-2000, which is fairly trivial.</p>
<p>Lets look at max temps first,</p>
<p>both Unadjusted Maximum</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1491.png"><img class="aligncenter size-full wp-image-9679" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1491.png" alt="" width="606" height="305" /></a><br />
and Adjusted Maximum.</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-150.png"><img class="aligncenter size-full wp-image-9680" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-150.png" alt="" width="606" height="306" /></a></p>
<p>Now min temps, Unadjusted Minimum</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-151.png"><img class="aligncenter size-full wp-image-9681" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-151.png" alt="" width="606" height="307" /></a><br />
and Adjusted Minimum</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1522.png"><img class="aligncenter size-full wp-image-9684" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1522.png" alt="" width="606" height="307" /></a></p>
<p>All of these charts are almost identical, as far as I can tell, to the results of Menne et al. (albeit with 2009 data added).</p>
<p>Finally, lets look at the trends (in degrees C per decade) over the 1980-2009 period for each:</p>
<p style="text-align: left">CRN12 max raw : 0.224<br />
CRN345 max raw: 0.082<br />
CRN12 max adj : 0.243<br />
CRN345 max adj: 0.255<br />
CRN12 min raw : 0.126<br />
CRN345 min raw: 0.177<br />
CRN12 min adj : 0.186<br />
CRN345 min adj: 0.210</p>
<p style="text-align: left">This suggests, at least for the subset of stations examined, that either poor siting or a secondary factor (e.g. MMTS vs. CRS sensors) led to spurious cooling of maximum temps in recent decades. It also suggests that adjustment methods tend to make CRN12 and CRN345 records more similar.</p>
<p style="text-align: left">
<p style="text-align: left">Note: Click on charts to embiggen</p>
<p style="text-align: left">Raw data for all of these charts can be found <a href="http://drop.io/0yhqyon/asset/model-outputs-3-3-xls">here</a>. They are contained in the USHCN minmax and Menne minmax tabs (as well as the USHCN gridsize tab for that chart).</p>
<p style="text-align: left"><strong><br />
</strong></p>
<p style="text-align: left"><strong>UPDATE (3/5/10)</strong></p>
<p style="text-align: left">Matt Menne was kind enough to chat with me this morning about some of the relationships between raw, tob, and adjusted data.</p>
<p style="text-align: left">Among other issues, he explained how MMTS sensors report systemically lower max temps than Stevenson Screens (CRS), so the switch from CRS to MMTS sensors over the past few decades introduced a spurious cooling trend in max temps. This is one of the reasons, according to him, that raw/tob temps from CRN345 stations (which are primarily MMTS) and CRN12 stations (which have a much higher percentage of CRS sensors) have differing trends in max temps in recent years.</p>
<p style="text-align: left">As far as then 1940s step change in CRN12 min temps goes, he suggested that this was due at least in part to the movement of a number of stations from urban areas/surfaces to more rural areas during the world war 2 period. He mentioned that he had observed such moves in some of the CRN12 stations that he had looked closely at, and recommended that interested observers should undertake a more systemic look at the station history for CRN12 stations at the time to check for station moves during that period.</p>
<p style="text-align: left">Its worth noting that if the 1940s step change in CRN12 min temps was indeed associated with station moves and we accept it as correct, the TOB min temp data for CRN12 stations becomes similar to that of CRN345 stations:</p>
<p style="text-align: left"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-159.png"><img class="aligncenter size-full wp-image-9699" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-159.png" alt="" width="601" height="354" /></a>Note that the step change was removed by simply adding 0.4 C to all annual temps after 1945 for CRN12 stations.</p>
<p style="text-align: left">Dr. Menne also confirmed that TOB adjustments tend to be applied similarly to max and min readings, and that non-volunteer stations (e.g. airports) tended not to require TOB adjustments because of standard reporting times, while co-op stations (which often are more rural) were more likely to have differing thermostat reading times that necessitated TOB adjustments.</p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/a-detailed-look-at-ushcn-mimmax-temps/feed/</wfw:commentRss>
		<slash:comments>268</slash:comments>
		</item>
		<item>
		<title>Guest Post Invitiation to Chiefio</title>
		<link>http://rankexploits.com/musings/2010/guest-post-invitiation-to-chiefio/</link>
		<comments>http://rankexploits.com/musings/2010/guest-post-invitiation-to-chiefio/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 18:28:23 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9654</guid>
		<description><![CDATA[Many in comments have suggested that some of the rebuttals to the claim that the march of the thermometers actually biases temperature anomalies reported by NOAA, GISSTemp and CRU may be unnecessary.  How so, it&#8217;s possible no one has really made that claim. If so, the &#8216;rebuttals&#8217; would be countering strawmen (i.e. arguments that [...]]]></description>
			<content:encoded><![CDATA[<p>Many in comments have suggested that some of the rebuttals to the claim that the march of the thermometers actually biases temperature anomalies reported by NOAA, GISSTemp and CRU may be unnecessary.  How so, it&#8217;s possible no one has really made that claim. If so, the &#8216;rebuttals&#8217; would be countering strawmen (i.e. arguments that have never been made by anyone).  </p>
<p>That said, the three <i>may</i> have mad such a claims appear to be EM Smith,  Joe D&#8217;Aleo, and (possibly) Anthony Watts.  A document by <a href="http://icecap.us/images/uploads/NOAAroleinclimategate.pdf">&#8220;D&#8217;Aleo (pdf)</a> and  an <a href="http://scienceandpublicpolicy.org/originals/policy_driven_deception.html">SPPI publication</a> by D&#8217;Aleo and Watts, suggest the analysis to support the claims about warming biases in temperature anomaly records arising from the march of the thermometers was done by Chiefio (EM Smith.)</p>
<p>Possibly, the actual argument about bias arising from  the march of the thermometers is either more limited or more nuanced than it appears to those of us reading various pdfs, watching tv or sifting through blog posts.  For this reason, I am taking Magic Java&#8217;s suggestion and inviting Chiefio to clarify his thoughts on this matter.    I have extended the invitation as follows:</p>
<blockquote><p>Hello <a href="http://chiefio.wordpress.com">Chiefio,</a></p>
<p>Magic Java has requested that I invite you to guest post your response to the analyses by Zeke, Tamino, ClearClimateCode and Roy Spencer, each of which has investigated the effect of the 90&#8217;s loss of thermometers on any bias in computed temperature anomalies.   I think this is a good idea, and would like to invite you to guest post, if you are interested.</p>
<p>If you do accept, I would actually like to invite you to do two guest posts. In one, I would ask you to answer some very specific questions of my own devising. These may be questions you do not wish to answer&#8211; as they will put you on the spot.  The other would simply invite you to post your response to the analyses of others.  My most specific interest is to see and understand why you think the loss of thermometers has resulted in an actual warm bias in products like CRU, GISSTemp and NOAA&#8217;s surface temperature products for large areas of land. That is: I am interested in the effect averaged over areas as large as the US , the full Northern Hemisphere or the Southern Hemisphere.</p>
<p>With this in mind, I extend and invitation and hope you will take it up either at my blog, or at yours.</p>
<p>Thanks,<br />
Lucia Liljegren</p></blockquote>
<p>With some luck one or two guests posts will clarify Chiefio&#8217;s position on this.  Afterwards, we may be able to query key people&#8217;s positions as well. </p>
<p><b>Update Feb. 4</b> Chiefio politely declined the invitation to guest post, citing the need to focus on his own research. He evidently continue to post at his blog, engaging those points he considers important.  </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/guest-post-invitiation-to-chiefio/feed/</wfw:commentRss>
		<slash:comments>87</slash:comments>
		</item>
		<item>
		<title>Accuweather: Because Weather Matters Too!</title>
		<link>http://rankexploits.com/musings/2010/accuweather-because-weather-matters-too/</link>
		<comments>http://rankexploits.com/musings/2010/accuweather-because-weather-matters-too/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 17:46:30 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9645</guid>
		<description><![CDATA[This is a Sponsored Post written by me on behalf of AccuWeather. All opinions are 100% mine.
As many climate-blog addicts are aware, weather is not climate.  While we are all busy arguing whether this years DC snowstorm was caused by climate change, those in DC were probably just thankful for online services like AccuWeather [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is a Sponsored Post written by me on behalf of <a href="http://socialspark.com/metrics/click/disclosure?slot_id=191262&#038;url=http%3A%2F%2Faccuweather.com%2F" rel="nofollow">AccuWeather</a>. All opinions are 100% mine.</em></p>
<p>As many climate-blog addicts are aware, weather is not climate.  While we are all busy arguing whether this years DC snowstorm was caused by climate change, those in DC were probably just thankful for online services like <a href="http://socialspark.com/metrics/click/post?slot_id=191262&amp;url=http%3A%2F%2Fbeta2010.accuweather.com%2F" rel="nofollow">AccuWeather</a> which could give them advance warming about impending snowmaggedon.  </p>
<p>Today, I was lucky enough to learn of  <a href="http://beta2010.accuweather.com/" rel="nofollow">Accuweather&#8217;s new site</a>, which is still in beta.  </p>
<p>Visiting, I entered my zip code to learn it&#8217;s a sunny 38F in Lisle; temperatures will drop to 24F tonight.  More importantly to me, the monthly forecast predicts <em>we&#8217;ll</em> hit the 50s late in March. This means I may finally be able to erect my portable green-house and start some of the less tender annuals soon. I&#8217;m planning to buy seeds and potting soil this week. I start these in the window, but I do like to have some advance indications of weather conditions plant schedule actually starting the seeds accordingly.     (Lucky for me I&#8217;m not planning to go to Florida; an Accuweather story also warns &#8220;Breaking Weather: Chilly for Spring Breakers&#8221;.) </p>
<p>The site also has some useful forecast maps for the lower 48, including forecasts for rain, snow, UV, windchill, heat index and a number of other indicators as well as slick radar and satellite weather displays of current weather equipped with both &#8220;zoom&#8221; and &#8220;play&#8221; features.  </p>
<p>But enough about the weather! Accuweather now also links to their <a href="http://www.accuweather.com/global-warming.asp" rel="nofollow">a global-warming</a> area, which provided links to Brett Anderson&#8217;s blog. Guess what? The first comment on the most recent blog was placed by Patrick AKA Cyclonebuster!   If you visit, you can have a chat with our old friend.  <img src='http://rankexploits.com/musings/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />   </p>
<p><a href="http://socialspark.com/metrics/click/disclosure?slot_id=191262&#038;url=http%3A%2F%2Faccuweather.com%2F" rel="nofollow"><img alt="Visit my sponsor: Weather for Your Life" border="0" src="http://socialspark.com/metrics/view/post?slot_id=191262&#038;url=http%3A%2F%2Fsocialspark.com%2Fimages%2Fdisclosure_badges%2Fdisclosure_badge_grey_three.png" style="border:0" /></a><br />
<span id="more-9645"></span></p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/accuweather-because-weather-matters-too/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Timeline of &#8220;The march of the thermometers&#8221; meme</title>
		<link>http://rankexploits.com/musings/2010/timeline-of-the-march-of-the-thermometers-meme/</link>
		<comments>http://rankexploits.com/musings/2010/timeline-of-the-march-of-the-thermometers-meme/#comments</comments>
		<pubDate>Tue, 02 Mar 2010 17:17:32 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>
		<category><![CDATA[Chiefio]]></category>
		<category><![CDATA[CRU]]></category>
		<category><![CDATA[D'aleo]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9615</guid>
		<description><![CDATA[I have to admit I initially missed the whole &#8220;march of the thermometers results in overwhelming bias&#8221; and now I&#8217;m trying to put together a time-line. Mind you, I knew this meme was out there, but since there has never been any convincing evidence the march of the thermometers actually caused any large  bias [...]]]></description>
			<content:encoded><![CDATA[<p>I have to admit I initially missed the whole &#8220;march of the thermometers results in overwhelming bias&#8221; and now I&#8217;m trying to put together a time-line. Mind you, I knew this meme was <i>out there</i>, but since there has never been any convincing evidence the march of the thermometers actually <I>caused</i> any large  bias in the reported surface temperatures, I never expected it to get quite as much play as it did.  I guess if subscribed to cable TV, I would have noticed this meme had  hit the big time sooner.  </p>
<p>I&#8217;m now trying to put together a time-line of the more major milestones; as much as possible, I&#8217;m going to avoid editorializing. The initial milestones will focus on the meme itself. Later events will include spin-offs as people who appear to have dispensed with the notion that dropping thermometers from the record may have introduced bias begin to explore <i>other</i> issues that might have introduced bias. (These include UHI, siting issues, adjustments for TOBS etc.)  In invite readers to suggest some items to contribute to the timeline. I&#8217;m mostly interested in: Television coverage, more formal glossy documents (SPPI etc.), blog posts containing <i>analyses</i> rather than mere commentary, coverage by players directly involved, (hese seem to be mostly EM Smith and D&#8217;Aleo.)</p>
<p>I&#8217;m going to throw out what I have, along with dates, and I hope some of my readers can fill me in with posts, dates, major actors and others in the discussion of this meme.</p>
<h3>Meme introduced?</h3>
<p><u>Chiefio</u> August 17, 2009. I don&#8217;t know if this post introduced the meme, but way back in August, <a href="http://chiefio.wordpress.com/2009/08/17/thermometer-years-by-latitude-warm-globe/">Chiefio</a> muses </p>
<blockquote><p>We are still left with the fact that we move about 1/4 of the thermometer records from the cold places to the hot places. <em>That not only increases the impact of the hot places, but reduces the impact of the cold places.</em></p></blockquote>
<p>For a time, the notion seems to be discussed in comments, forums and blog posts at various places, but it does not hit the mainstream news. </p>
<h3>The Meme Goes Formal</h3>
<p><u><a href="http://www.spaceref.com/news/viewpr.html?pid=30000">KUSI News Story</a></u> broadcast Jan. 14, 2010,  cover D&#8217;Aleo article <a href="http://icecap.us/images/uploads/NOAAroleinclimategate.pdf">&#8220;Climategate: Leaked Emails Inspired Data Analyses Show Claimed Warming Greatly Exaggerated and NOAA not CRU is Ground Zero&#8221;(pdf)</a>; this very brief document is written by D&#8217;Aleo. The videos discussing the theory that loss of thermometers results in a bias in temperatures are <a href="http://www.kusi.com/weather/colemanscorner/84174357.html">here</a>. D&#8217;Aleo and Smith (aka Chiefio) are interviewed.</p>
<p><u>SPPI</u> January 23, 2010. SPPI publishes <a href="http://scienceandpublicpolicy.org/originals/policy_driven_deception.html">Surface Temperature Records: Policy Driven Deception? </a> written by D&#8217;Aleo and Watts. The station drop out issue is discussed on pages 9-23.  EM Smith (Chiefio) is heavily cited as the source for claims and analyses arguing that the changing number of stations biases the surface temperature record.  Discussions of other adjustments focusing on Urban Heat Island and station siting issues follow; this portion of the documents heavily cites Anthony Watts <a href="http://surfacestations.org">surfacestations.org</a> project and WattsUpWithThat. </p>
<p><u><a href="http://www.yaleclimatemediaforum.org/2010/01/kusi-noaa-nasa/">Zeke Hausfather</a></u>, January 21, 2010 posts a graph comparing the simple average temperature anomaly from 1,017 thermometers with data available through 2000 to 402 that stopped providing data sometime bewteen 1970 and 2000. He finds no singificant difference between the two traces suggesting that station drop out is not an important source of biase. </p>
<p><u><a href="http://www.drroyspencer.com/2010/02/new-work-on-the-recent-warming-of-northern-hemispheric-land-areas/">Roy Spencer</a></u> Feb. 20, 2010. Roy Spencer computes trends using data drawn from the NOAA-merged International Surface Hourly (ISH) dataset, a ground based thermometer record. Using area weighting, he compared land based temperature anomalies for the northern hemisphere computed thermometers in operation from <b>1986</b>-2010 to trends published by CRU. He finds no difference in trend&#8211; though the monthly data from the ISH dataset appears noisier.  </p>
<p><u><a href="http://chiefio.wordpress.com/2010/02/22/kusi-coleman-tv-show-discussion/">Chiefio (E.M. Smith)</a></u>, Feb 22, 2010: elaborats on the KUSI – Coleman TV show discussion which covered the story that dropping thermometers from the temperature record results in a warming bias in surface temperature anomoalies reported to the public.  The theory is summarized as </p>
<blockquote><p>My major point has simply been that much of the available data is not used. It is dropped on the floor. You can call it “deleted” or “dropped” or “ignored” or whatever. It is still NOT in the GHCN data set. The pattern of these “droppings” is that high latitude and high altitude stations are dropped, while low altitude and low latitude stations are kept (with an ever increasing percentage at warm heat islands of airports). There is a clear ’survivor bias’ toward warmer stations (and with warming trends, like airports), with warmer winters, more heat islands, and lots of tarmac.</p></blockquote>
<p>Smith also responds to those who suggest that dropping thermometer will have little effect, noting there are two reasons advanced for likely non-impact of dropping stations:</p>
<blockquote><p>First, that there is no person actively pruning thermometers. While the “spin” put on my position has tended to say there is active intentional removal of thermometers for malicious effect; I have gone out of my way to point out that I can not know any person’s intent, only the result. [...], I’m more interested in the FACT of the thermometer deletions (or drops) from the record and what that says about data bias; than about whether there has been a sin of omission or of commission. It’s a sin in either case. Was it murder or involuntary thermometer slaughter? In either case “It’s dead Jim”, and it’s wrong.</p></blockquote>
<p>So, using the rather vivid allusion to murder, it appears Chiefio is suggesting the bias due to dropping thermometers could be unintentional.  Some colorful allusions are also used when countering the notion that use of the anomaly process protects against bias when thermometers are dropped from the record:</p>
<blockquote><p>Second defense, that the “anomaly” process will prevent thermometer drops from having an impact. ( This is usually followed by a theoretical example of comparing a thermometer only to itself and showing that with perfect anomaly processing and an idealized unbroken record, there is no problem.) But the reality is that we don’t compare thermometers only to themselves and the records are horridly broken and with massive “fill in” with fantasy “data”. So we have “fantasy basket A” to “fantasy basket B” that change over time.</p>
<p>Well, thermometer change / drops / deletions DO have an impact. I’ve run a benchmark through the GIStemp code and using exactly the stations GISS dropped (from the USHCN data from 5/2007 to 11/2009 -when they put them back in, after some postings pointing out how to do it and that it was an ‘issue’… – perhaps just a coincidence…) and the anomaly map shows warming from those station being dropped. We can argue about the price of this streetwalker, but what it does is not in dispute.</p>
<p>The reason it fails to stop all survivor bias impact is two fold. One fixable, one less so.</p>
<p>First, it does not do the anomaly comparison “self to self” [ I call that "selfing" after the pollination process ] but rather “Basket A in time A” to “Basket B in time B”. And once the two things being used to create an “anomaly” are different from each other, you have opened the door for a variety of very subtile biases to change the result. This, too, is not in dispute. (Well, it is by some who are a bit slow to catch on, but it is not in dispute by the folks who wrote the code. One of the NASA FOIA emails admit to this problem and bias.)</p>
<p>The second reason is rather subtile, and it is one I’m “in discussions” about publishing; so I’ll not make it public until some decision is reached on that front. ( I may tire of the whole backbiting “peer reviewed publishing” process and just go for “public reviewed self published”. That is my leaning, but folks keep telling me it’s important to be “in the literature”… We’ll see.) Lets just say that it depends on some assumptions everyone makes that are wrong, and looking at what is ignored. It will impart survivor bias into the First Differences Method, and the Reference Station Method. I believe it will impart bias into the Climatology Anomaly Method as well, but the definition of that method might allow for an approach that would dampen the bias (i.e. “perfect” selfing and lifetime), so I have a bit more homework to do before asserting it as fact for all variations. Basically, for any system where the thermometers change over time, it allows for bias to show in the product. And that’s as far as I can go on that point right now. It is this property that, IMHO, lets the benchmark change for GIStemp.</p>
<p>The bottom line is that survivor bias from thermometer change matters, and there has been a heck fo a lot of biased thermometer change.</p></blockquote>
<p>I&#8217;d snip the above for brevity, but I couldn&#8217;t figure out how to do so while still convenying Chiefio&#8217;s intended point. </p>
<p><u><a href="http://tamino.wordpress.com/2010/02/23/ghcn-preliminary-results/">Tamino</a></u>, Feb 23, 2010 presents preliminary GHCN temperature analyses comparing area weighted temperature anomalies for the Norhern Hemisphere based on &#8220;cut-off&#8221; thermometers series and data from thermometers that remained in the record to the present time.  He finds no significant difference between the two traces. </p>
<p><u><a href="http://clearclimatecode.org/the-1990s-station-dropout-does-not-have-a-warming-effect/">Clear Climate Code</a></u>, Feb. 26, 2010 compares GISSTemp type calculations of global surface temperature anomalies based on the &#8220;full&#8221; and &#8220;cut-off&#8221; thermometer set. They find no major differences between the two traces. </p>
<p><u>The Blackboard</u> March 1, 2010, <a href="http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/">Zeke</a> kindly posts his weighed average of thermometers at my blog. He requests feedback and suggestions for improving the analysis.  His post also begins to address &#8220;spin-off&#8221; issues like UHI, TOBS etc.  </p>
<p><u>Chiefio</u>, March 1, 2010.  Though many others are finding it difficult to discern any quantifiable effect of loss in stations, on March 1, 2010 <a href="http://chiefio.wordpress.com/2010/03/01/japan-poster-child-for-the-smith-effect/">Chiefio</a> has posted a discussion of what he calls &#8220;The Smith Effect&#8221;&#8211; that is the step function in temperatures associated with drop outs in thermometers.   What is &#8220;The Smith Effect&#8221; or it&#8217;s cause? Chiefio tells us:</p>
<blockquote><p>I will not be discussing what the theory is behind The Smith Effect. At least, not until I’ve found out if it’s going into a paper for publication, or not. Once that is resolved, I’ll either post the details here (if it’s not going to publication) or post a pointer to the publication (so we all can sit on pins and needles waiting <img src='http://rankexploits.com/musings/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p></blockquote>
<p>So, we have to wait sometime before we read Chiefio&#8217;s full response. </p>
<p>If I&#8217;m not mistaken, those are the main posts that <i>directly</I> adress &#8220;the march of the thermometer&#8221; meme.  But, I do ask readers,  <u>Do you have more &#8220;march of the thermometer&#8221; milestones to add to my timeline?</u> I&#8217;d love to read them.</p>
<h3>Spin off: Temperature reconstructions.</h3>
<p>I&#8217;m also seeing some interesting spin off posts. These seem to be starting to the types of issues in the <i>second</i> half of the SPPI document authored by D&#8217;Aleo and Watts, and which tended to link to Watts&#8217; blog  and web site. That is, they seem to be mentioning UHI, station citing and issues surrounding adjustments.  The posts share similarities with &#8220;the march of the thermometer&#8221; posts in so much as they are creating reconstructions&#8211; but they differ in the specific comparisons made. </p>
<p>I&#8217;m going to discuss some of the spin-off posts below. </p>
<p><u><a href="http://www.drroyspencer.com/2010/02/spurious-warming-in-the-jones-u-s-temperatures-since-1973/">Roy Spencer</a></u> , February 27th, 2010 examines US air temperatures time series extending back to <b>1973</b>&#8211; a longer record than treated in his previous post. He compares Jones CRUTemp3 temperature analysis which is computed using (Tmin+Tmax)/2 from approved GHCN stations whose  number change over time, to a new temperature anomalies series which Roy computes based on a larger groups of ISH thermometers samped at 06, 12, 18, and 00 UTC. Roy series is based on thermometers that operate through out the entire period.   When comparing the two series, Jones CRUTemp3 exhibits a brisker warming trend than Roy&#8217;s new series; the difference amounts to 20% of all warming from 1973 forward.   </p>
<p>I don&#8217;t think Roy&#8217;s analysis can reveal the cause of the differences in trends. Investigation of CRU code, lists of stations, and underlying raw data  might reveal details and permit sensitivity studies to discover precisely what aspect of adjustments or addition and subtraction of individual stations might be the cause of differences in warming trend computed both ways.  </p>
<p><u>The Air Vent</u> I&#8217;ve been searching for other discussions. <a href="http://noconsensus.wordpress.com/2010/03/02/gridded-ghcn-temps-r2-5/">Jeff Id and Roman R</a> appear to be working on creating temperature anomaly time series using  &#8220;GHCN and Roman’s seasonal offsetting method&#8221;. Perhaps when they are satisfied with their method of weighting, they will post results comparing trends with a fuller set of thermometers compared to CRU. Equally likely, they will focus on other factors like UHI, TOBS, and the sorts of factors that have interested many of us for a long time.</p>
<p>For the time being, JeffId did  post a trend from 1978-(now?) based on their gridded product. They find the trend in the global temperature anomaly to be 0.186C/decade which they indicate is a drop from some previously computed product. (I don&#8217;t know what the GHCN trend is when computed other ways.) Jeff alerts readers their current series is still affected by UHI, siting issues, TOBS and other features. </p>
<p>If any of you can fill me in on additional milestones in &#8220;the march of the thermometers&#8221; meme and/or find additional discussion of reconstructions, let me know in comments. I&#8217;m very interested in figuring out how this all played out at think-tanks (SPPI), tv and blogs! </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/timeline-of-the-march-of-the-thermometers-meme/feed/</wfw:commentRss>
		<slash:comments>237</slash:comments>
		</item>
		<item>
		<title>March First Haiku</title>
		<link>http://rankexploits.com/musings/2010/march-first-haiku/</link>
		<comments>http://rankexploits.com/musings/2010/march-first-haiku/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 19:52:54 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Haiku]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9607</guid>
		<description><![CDATA[Say hello to march
Brave daffodils show green tops,
soon yellow flowers.

Anthony discussed daffodils not yet blooming in England. Mine aren&#8217;t blooming yet either.  I don&#8217;t have any idea how early they bloomed in April.  
]]></description>
			<content:encoded><![CDATA[<p><center>Say hello to march<br />
Brave daffodils show green tops,<br />
soon yellow flowers.<br />
<a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/March1DAffodils_1.jpg"><img src="http://rankexploits.com/musings/wp-content/uploads/2010/03/March1DAffodils_1-500x333.jpg" alt="" title="March1DAffodils_1" width="500" height="333" class="aligncenter size-medium wp-image-9606" /></a></center></p>
<p><a href="http://wattsupwiththat.com/2010/03/01/spring-sprang-sprung/">Anthony</a> discussed daffodils not yet blooming in England. Mine aren&#8217;t blooming yet either.  I don&#8217;t have any idea how early they bloomed in April.  </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/march-first-haiku/feed/</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
		<item>
		<title>A simple model for spatially-weighted temp analysis</title>
		<link>http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/</link>
		<comments>http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 18:03:35 +0000</pubDate>
		<dc:creator>Zeke</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9573</guid>
		<description><![CDATA[Inspired by Tamino&#8217;s recent efforts at producing a spatially-weighted analysis of temperatures from GHCN stations, I decided to make my own and &#8220;Open Source&#8221; it. While its currently written in STATA (the statistics software that I use for work), I&#8217;d be happy to work with anyone to port it to R or any other language [...]]]></description>
			<content:encoded><![CDATA[<p>Inspired by Tamino&#8217;s recent efforts at producing a spatially-weighted analysis of temperatures from GHCN stations, I decided to make my own and &#8220;Open Source&#8221; it. While its currently written in STATA (the statistics software that I use for work), I&#8217;d be happy to work with anyone to port it to R or any other language that is a bit more accessible. I hope that folks on both sides of the debate can work together to establish a method of spatial weighting that we can use to do temperature analysis going forward, since it allows us to look at the overall effects of adjustment methods and differing data sets, rather than focusing on individual anecdotes.</p>
<p>The latest version of the source code is available in the Temp_Analysis zip file here: <a href="http://drop.io/0yhqyon">http://drop.io/0yhqyon</a></p>
<p>The model goes through a number of different steps to generate a grid-weighted global (or local) anomaly:</p>
<ol>
<li>Import the data files. Currently the model has been used to import GHCN raw, GHCN adjusted, USHCN raw, USHCNv2 TOB, and USHCNv2 adjusted data, but it should be easily adaptable to any set of temperature data.</li>
<li>Assign each station to a 5&#215;5 lat/lon grid. This is accomplished by two for-loops that go through each 5-degree increment of latitude or longitude and check to see if each individual station is contained in that increment. If it is, it assigns the midpoint of the range as the station&#8217;s lat or lon grid value. Once these loops are complete, it concatenates the lat and lon grid value for each station to a gridbox identifier.</li>
<li>Weight each grid box by its area. This uses the equation grid_weight = <span style="text-decoration: line-through">4 * _pi^2 * 6378.1^2 * cos(lat*_pi/180) * 5/360 * 5/360</span> sin((latgrid+2.5)*_pi/180) &#8211; sin((latgrid-2.5)*_pi/180) to calcuate the relative size of each grid box based on the midpoint (thanks RomanM!). <span style="text-decoration: line-through">I got this equation from Gavin for interpreting GISS Model E outputs, though he mentioned that it was an approximation and not perfect. If anyone has a suggestion for a better way to calculate 5&#215;5 lat/lon grid box areas, please let me know.</span></li>
<li>Optionally apply various filters for different analysis. Filters include restricting stations used by latitude, longitude, country code, U.S. state, urbanity (for GHCN data only right now; don&#8217;t have a good urban indicator for USHCN data developed yet), station start year, end year, and duration of station record, as well as custom filters to replicate the Menne analysis and Long document.</li>
<li>Combine multiple imod records into a single wmo_id record for GHCN data. GHCN has multiple records associated with a single station ID in a number of cases, and this step simply averages all the records available for each station_id. It occurs after the filters are applied, so if you only want to look at rural stations, it won&#8217;t combine a rural imod with an urban imod in the rare case that a single wmo_id has both rural and urban records associated with it. In general, the wmo_id / imod differentiation has to do with cases where a station was moved or stopped reporting and was replaced by a nearby station, and mostly applies to locations outside the U.S. This step does nothing at all for USHCN data.</li>
<li>Calculate the anomalies for each month at each station relative to the 1960-1970 base period, and discard any stations with &lt; 10 years of temp data per GHCN methodology. This also discards all wmo_ids without a record in the 1960-1970 period. This is the part I&#8217;d like to improve, since it reduces the stations available by about 15% in GHCN data. Note that this has no real effect on USHCN data, since virtually all stations reporting currently have data from the 1960-1970 period.</li>
<li>Average the anomaly of all stations in each grid cell for each month to obtain the grid cell anomaly.</li>
<li>Calculate a grid-weight for each month of each grid cell in each year, where the grid weight is equal to the cell&#8217;s grid area divided by the total grid area of all grid cells available for that month.</li>
<li>Multiply the each grid cell anomaly by the grid weight for each month of each year and sum up the results across all grid cells to obtain the average by month.</li>
<li>Calculate the annual average anomaly via a weighted average of each monthly anomaly by the number of days per month, taking leap years into account.</li>
</ol>
<p>We can do some initial validation (to make sure things aren&#8217;t too screwy) by comparing GHCN raw data to GISSTemp land temperatures. Note that we don&#8217;t expect this to be perfect, because GISS adjusts GHCN data (and actually lowers the global temperature vis-a-vis the raw data, as both this analysis and <a href="http://noconsensus.wordpress.com/2010/02/26/global-gridded-ghcn-trend-by-seasonal-offset-anomaly-matching/">a similar one by Jeff Id</a> show). The Rural/Urban designation here is based on GHCN station metadata; it may not be perfect, and we can try other collections of stations that we identify as rural or good if we want (indeed, we do this with CRN12 stations in the U.S. later on).</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-971.png"><img class="size-full wp-image-9576 aligncenter" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-971.png" alt="" width="579" height="342" /></a></p>
<p>We can also compare stations with records post-1992 to those without, to quickly replicate the work that <a href="http://tamino.wordpress.com/2010/02/25/false-claims-proven-false/">Tamino</a> and the <a href="http://clearclimatecode.org/the-1990s-station-dropout-does-not-have-a-warming-effect/">CCC folks </a>did on addressing E.M. Smith&#8217;s whole station dropout arguement:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-98.png"><img class="aligncenter size-full wp-image-9577" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-98.png" alt="" width="564" height="302" /></a>Since no one has looked into it yet, we can consider the case of Northern Canada and see if station dropout there has had any effect on temperatures:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-108.png"><img class="aligncenter size-full wp-image-9578" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-108.png" alt="" width="552" height="295" /></a></p>
<p>Lets take a look at just the U.S. for a bit, since its where a lot of the criticism of temperature data has been centered.</p>
<p>We can start by looking at Raw and Adjusted GHCN data stations in the U.S. Lower 48:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-109.png"><img class="aligncenter size-full wp-image-9579" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-109.png" alt="" width="557" height="310" /></a></p>
<p>And break these stations down into Rural vs. Urban (using an 11 year running mean to smooth out some of the noise):</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1101.png"><img class="aligncenter size-full wp-image-9581" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-1101.png" alt="" width="546" height="321" /></a></p>
<p>In addition to the 400 or so GHCN stations in the U.S., we also have access to 1200+ stations in USHCN. Lets incorporate those into our model, and look at raw GHCN, adjusted GHCN, raw USHCN, adjusted USHCN, and GISS for the lower 48 U.S.:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-112.png"><img class="aligncenter size-full wp-image-9582" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-112.png" alt="" width="562" height="355" /></a></p>
<p>Since GISS uses adjusted USHCN as an input (and further tweaks it), we should expect those series to be quite close. Indeed, we find:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-113.png"><img class="aligncenter size-full wp-image-9583" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-113.png" alt="" width="562" height="356" /></a></p>
<p>USHCN and GHCN use somewhat different adjustments for Lower 48 U.S. stations. Lets compare them:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-114.png"><img class="aligncenter size-full wp-image-9584" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-114.png" alt="" width="532" height="343" /></a></p>
<p>Now, USHCN doesn&#8217;t have a simple Urban/Rural designation in the Metadata. I&#8217;m working on trying to figure one out from the station landuse data, but for the time being we can use Anthony and the SurfaceStation.org effort as a first pass (essentially replicating the work of Menne et al 2010). Here are all 71 CRN12 stations compared to the CRN345 stations:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-115.png"><img class="aligncenter size-full wp-image-9585" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-115.png" alt="" width="523" height="354" /></a></p>
<p>Similar to Menne et al, we find that &#8220;good&#8221; CRN12 stations have a slightly higher anomaly than the &#8220;poor&#8221; CRN345 stations using the raw USHCN data. We can also take a look at the adjusted USHCN data. There are two primary adjustments made to USHCN raw data in USHCN v2: time of observation (TOB) adjustments, and urbanization/discontinuity adjustments. Here are CRN12 stations for all three datasets (raw, TOB, and fully adjusted).</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-116.png"><img class="aligncenter size-full wp-image-9586" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-116.png" alt="" width="524" height="355" /></a></p>
<p>The majority of the adjustments that occur are TOB-related, but there are also non-TOB positive adjustments to the &#8220;good&#8221; stations, which is interesting. Lets compare this to the adjustments made to CRN345 stations:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-117.png"><img class="aligncenter size-full wp-image-9587" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-117.png" alt="" width="525" height="356" /></a></p>
<p>Here we see that both the TOB adjustments and the additional adjustments are larger for CRN345 stations than for CRN12 stations, which is a good sign. The difference between CRN12 and CRN345 stations in each dataset bears this out:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-118.png"><img class="aligncenter size-full wp-image-9588" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-118.png" alt="" width="526" height="357" /></a></p>
<p>After each round of adjustments, CRN345 stations become closer to CRN12 stations. However, the net adjustment to both stations are still positive, for both TOB (which is expected) and non-TOB adjustments. Lets looks at the non-TOB adjustments done to CRN12 and CRN345 in more detail:</p>
<p style="text-align: center"><a href="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-119.png"><img class="aligncenter size-full wp-image-9589" src="http://rankexploits.com/musings/wp-content/uploads/2010/03/Picture-119.png" alt="" width="526" height="357" /></a></p>
<p>Anyhow, thats it for now! If you have suggestions to improve the model, or things you are interested in seeing analyzed, let me know. Also, please download the source code (if you run STATA) or work with me to port it over to R (if you don&#8217;t) so you can play around with it yourselves.</p>
<p>Also, click on images to embiggen! And thanks Lucia for giving me a chance to do I guest post on my recent work. <img src='http://rankexploits.com/musings/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/feed/</wfw:commentRss>
		<slash:comments>111</slash:comments>
		</item>
		<item>
		<title>Anti-Spam: New Plugin.</title>
		<link>http://rankexploits.com/musings/2010/anti-spam-new-plugin/</link>
		<comments>http://rankexploits.com/musings/2010/anti-spam-new-plugin/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 15:05:35 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9569</guid>
		<description><![CDATA[My &#8220;spam&#8221; box has been filling lately, and the spam plugin has also been making lots of mistakes.  To reduce spam, I&#8217;ve added a new plugin: Comment E-Mail Verification.
Supposedly, if you have previously posted comments, you will not notice anything. If you have not previously posted a comment, the plugin will send an email [...]]]></description>
			<content:encoded><![CDATA[<p>My &#8220;spam&#8221; box has been filling lately, and the spam plugin has also been making lots of mistakes.  To reduce spam, I&#8217;ve added a new plugin: <a href="http://wordpress.org/extend/plugins/comment-email-verify/">Comment E-Mail Verification</a>.</p>
<p><i>Supposedly</i>, if you have previously posted comments, you will not notice anything. If you have <i>not</i> previously posted a comment, the plugin will send an email to the email you list, and you will be required to respond.  After that, you will be whitelisted.</p>
<p>I haven&#8217;t plowed through the plugin-code to verify that it actually works as advertised when first activated. I&#8217;m worried <i>everyone</i> is going to get an email.   If a few of you could try commenting, let me know what happens&#8230;. (I&#8217;m going to put up a few tests too.)</p>
<h3>Update</h3>
<p>This comment doesn&#8217;t seem to moderate comments when people use new emails. I turned it off.  </p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/anti-spam-new-plugin/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Bump! New thread for Station Drop Out Analyses.</title>
		<link>http://rankexploits.com/musings/2010/bump-new-thread-for-station-drop-out-analyses/</link>
		<comments>http://rankexploits.com/musings/2010/bump-new-thread-for-station-drop-out-analyses/#comments</comments>
		<pubDate>Sun, 28 Feb 2010 23:31:38 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9558</guid>
		<description><![CDATA[Zeke asked for a new thread for station drop out analyses. 
In that thread, Zeke described a  simple model to spatially grid and weight temperature data, and has asked for help improving it&#8221;
The latest source code is available here: 
The folder name is Temp_Analysis v0.3.zip.
It should easily support importing either GHCN or USHCN data [...]]]></description>
			<content:encoded><![CDATA[<p>Zeke asked for a new thread for <a href="http://rankexploits.com/musings/2010/effect-of-dropping-station-data/">station drop out analyses</a>. </p>
<p>In that thread, Zeke described a  simple model to spatially grid and weight temperature data, and has asked for help improving it&#8221;</p>
<p>The latest source code is available here: <a href="http://drop.io/0yhqyon" rel="nofollow"></a><br />
The folder name is Temp_Analysis v0.3.zip.<br />
It should easily support importing either GHCN or USHCN data now (or, frankly, any temperature data you want). </p>
<p>Zeke posted output for the code  in comment Comment#35622:</p>
<blockquote><p>Oh, and for those interested, here are the raw vs. adjusted data for the U.S. for all the different series:</p>
<p><img src="http://i81.photobucket.com/albums/j237/hausfath/Picture68.png" width="600" ></p>
<p>http://i81.photobucket.com/albums/j237/hausfath/Picture68.png</p>
<p>And comparing USHCN adjusted to GISS:</p>
<p><img src="http://i81.photobucket.com/albums/j237/hausfath/Picture69-2.png" width="600" ></p>
<p>http://i81.photobucket.com/albums/j237/hausfath/Picture69-2.png</p>
</blockquote>
<p><a href="http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/">New Thread!</a></p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/bump-new-thread-for-station-drop-out-analyses/feed/</wfw:commentRss>
		<slash:comments>33</slash:comments>
		</item>
		<item>
		<title>Chile earthquake kills 76 and triggers tsunami</title>
		<link>http://rankexploits.com/musings/2010/chile-earthquake-kills-76-and-triggers-tsunami/</link>
		<comments>http://rankexploits.com/musings/2010/chile-earthquake-kills-76-and-triggers-tsunami/#comments</comments>
		<pubDate>Sat, 27 Feb 2010 12:53:40 +0000</pubDate>
		<dc:creator>lucia</dc:creator>
				<category><![CDATA[Data Comparisons]]></category>

		<guid isPermaLink="false">http://rankexploits.com/musings/?p=9553</guid>
		<description><![CDATA[The first email I read this morning announced the earthquake in Chile.  It hit 8.8 on the Richtermoment magnitude scale and triggered a tsunami.  The reports I read say at least 76 dead. Those of you who pray, pray it won&#8217;t be many more. For news: Times Online,hawaii247 and Reuters.
]]></description>
			<content:encoded><![CDATA[<p>The first email I read this morning announced the earthquake in Chile.  It hit 8.8 on the<del datetime="2010-02-27T14:58:13+00:00"> Richter</del>moment magnitude scale and triggered a tsunami.  The reports I read say at least 76 dead. Those of you who pray, pray it won&#8217;t be many more. For news: <a href="http://www.timesonline.co.uk/tol/news/world/us_and_americas/article7043637.ece">Times Online<a>,<a href="http://www.hawaii247.org/2010/02/26/earthquake-rocks-chile-tsunami-advisories-for-chile-peru-and-ecuador/">hawaii247</a> and <a href="http://www.reuters.com/article/idUSTRE61Q0S920100227">Reuters</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://rankexploits.com/musings/2010/chile-earthquake-kills-76-and-triggers-tsunami/feed/</wfw:commentRss>
		<slash:comments>37</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.928 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2010-03-20 07:21:16 -->
