The Detection and Attribution of Northern Hemisphere Land Surface Warming (1850–2018) in Terms of Human and Natural Factors: Challenges of Inadequate Data

: A statistical analysis was applied to Northern Hemisphere land surface temperatures (1850–2018) to try to identify the main drivers of the observed warming since the mid-19th century. Two different temperature estimates were considered—a rural and urban blend (that matches almost exactly with most current estimates) and a rural-only estimate. The rural and urban blend indicates a long-term warming of 0.89 ◦ C/century since 1850, while the rural-only indicates 0.55 ◦ C/century. This contradicts a common assumption that current thermometer-based global temperature indices are relatively unaffected by urban warming biases. Three main climatic drivers were considered, following the approaches adopted by the Intergovernmental Panel on Climate Change (IPCC)’s recent 6th Assessment Report (AR6): two natural forcings (solar and volcanic) and the composite “all anthropogenic forcings combined” time series recommended by IPCC AR6. The volcanic time series was that recommended by IPCC AR6. Two alternative solar forcing datasets were contrasted. One was the

AR6's attribution statement on the causes of global warming was mainly based on a comparison of observed global temperature estimates to modeled "hindcasts" (retrospective "forecasts" of past climate) from the Coupled Model Intercomparison Project Phase 6 (CMIP6) simulations [2].The model hindcasts using only two natural forcings (solar and HANDOUT: AGENDA ITEM 4.3 Climate 2023, 11, 179 3 of 36 volcanic) were unable to simulate any substantial warming, but those using human-caused ("anthropogenic") forcings matched well with observations [2].AR6 summarized the rationale of this attribution in one of the Frequently Asked Questions (FAQ 3.1, p. 515) of their report.
The main attribution analysis used by AR6 was based on Gillett et al. (2021) [2].However, that analysis largely repeated and updated an equivalent analysis in the IPCC's previous report, AR5 [3] (based on Jones et al. (2013) [4]).
Urban areas represent a small fraction of the global land area, yet the land component of the IPCC's global temperature estimates includes many urbanized weather stations.As a result, there is concern that they might be contaminated by urbanization bias, i.e., warming biases from the growth of urban heat islands around weather stations [6][7][8][9][10].2.
C2021 cautioned that both problems could significantly bias the attribution approach adopted by AR5 (and ultimately repeated by AR6 [1]) into prematurely concluding with unjustified confidence that the long-term global warming implied by the global temperature estimates was mostly human-caused [5].Indeed, they showed that by altering the choice of TSI and/or the temperature records considered, they could explain the observed long-term warming as being anything from "mostly human-caused" to "mostly natural" or a mixture of both human-caused and natural factors.

Influence of C2021's Findings on Recent Attribution Studies
Following the publication of AR6, the IPCC acknowledged that C2021 had not been considered by the AR6 authors since C2021 had missed the deadline for consideration by 10.5 weeks [16].Therefore, the concerns raised by C2021 were apparently not considered for AR6's attribution statement.Nonetheless, a number of recent global temperature attribution studies have explicitly considered different aspects of C2021 in their analysis [17][18][19][20][21][22].
Three of these studies agreed with C2021 that: (a) much of the long-term warming since the late 19th century could be explained in terms of changing solar activity and (b) the IPCC had substantially underestimated the solar contribution [17,18,22].Two of the studies disagreed with C2021 and concluded that the solar contribution was very small [20,21].The remaining study reached an intermediate conclusion, finding that TSI was the dominant climate driver up to 1960 but that afterward CO 2 appeared to dominate [19].
Each of these studies took a slightly different approach to attribution; had a different focus; and considered different aspects of C2021:

•
Stefani (2021) [17] recognized the concerns about urban data raised by C2021 and based their analysis on global SST data instead.Acknowledging C2021's point that it was unclear which (if any) of the many TSI datasets were correct, Stefani instead used the geomagnetic aa index as a proxy for solar activity citing previous work that found it to be useful as a solar proxy.Stefani then carried out a multilinear regression between SST, aa, and atmospheric CO 2 concentrations (i.e., the main component of IPCC's "anthropogenic forcings").The results suggested that solar activity explained between 30% and 70% of the observed long-term warming.• Harde (2022) [18] used a two-layer energy balance model to evaluate the relative and absolute contributions of changes in (a) solar activity and (b) atmospheric CO 2 concentrations to Northern Hemisphere land and ocean temperatures since 1881.Considering C2021's cautions about urbanization bias, Harde combined Soon et al.

•
Li et al. (2022) [19] used a statistical frequency analysis (using wavelet coherence) to evaluate the relative contributions of changes in TSI and CO 2 to global surface temperatures since 1880.Their chosen temperature record was Lenssen et al. (2019)'s global land and ocean series [25], which includes urban data.Therefore, Li et al. cautioned that their chosen temperature record was probably contaminated by nonclimatic biases and referred the readers to C2021 for more details.They also cautioned that there was considerable debate over which TSI dataset to use, but that a choice was necessary and they decided on Coddington et al. (2016)'s TSI reconstruction [26].As we will discuss later, this is very closely related to AR6's Matthes et al. (2017) [11] series and was also one of C2021's 16 TSI records.Li et al. found a very strong solar signal in the temperature changes up to about 1960, but afterward, the temperature changes shifted to being dominated by increasing CO 2 .However, if they detrended the temperature record after 1960 to account for the presumed CO 2 warming, the very strong solar signal remained for the entire 1882-2020 period.

•
Richardson and Benestad (2022) [20] reanalyzed some of the C2021 dataset using a multilinear regression in terms of TSI and anthropogenic forcings.However, they confined their analysis to C2021's Northern Hemisphere "rural and urban" land series and dropped 2 of the 16 TSI records that ended before the 21st century.They were unable to find a substantial solar contribution to the long-term warming of the "rural and urban" series with any of the 14 remaining TSI records.They argued that the main difference between their reanalysis and C2021's was that C2021 carried out a sequential regression rather than a simultaneous multilinear regression.

•
Chatzistergos (2023) [21] did not use any TSI series for his analysis.Instead, he confined his analysis to a particular solar activity proxy-the solar cycle length (SCL).He noted that this proxy was one of the five solar proxies used in Hoyt and Schatten (1993)'s multiproxy TSI reconstruction [13].This was one of the 16 TSI series identified by C2021-coincidentally the one identified by Harde (2022) as the best-fitting TSI record [18].Comparing Hoyt and Schatten (1993)'s SCL component to several global land and ocean records (that incorporated urban data), he found that this individual proxy was unable to explain much of the post-1970 warming.

•
Scafetta (2023)'s [22] attribution study used a 1-D energy balance model with a variable system response time to account for ocean buffering.The main temperature records considered were global land and ocean records.However, recognizing the concerns over urbanization bias in the land component, a secondary analysis only considered global sea surface temperatures.
The model considered three external forcing components-anthropogenic, volcanic, and solar.The first two factors were modeled using AR6's recommended forcings series.However, for TSI, he generated three composite records based on different combinations of 8 of the 16 TSI series identified by C2021.Three of these TSI series were "low multidecadal variability" records and included AR6's recommended series, i.e., Matthes et al. (2017) [11].The other five showed "high multidecadal variability", but he identified one of these records-Hoyt and Schatten (1993) [13]-as distinct from the other four.
Hence, he carried out hindcasts using four different averages of the multiple TSI records: (i) the average of all eight TSI records; (ii) the average of all TSI records except the three low variability records; (iii) the average of all TSI records except Hoyt and Schatten (1993) [13]; (iv) using only AR6's recommended TSI.The hindcasts using only AR6's HANDOUT: AGENDA ITEM 4.3 recommended forcings were able to reproduce AR6's attribution and led to the conclusion "that anthropogenic emissions account for substantially all of the observed global warming since the pre-industrial period (1850-1900) and that changes in solar activity are practically irrelevant".However, the hindcasts using any of the TSI averages that included high multidecadal variability suggested a much greater solar role and nearly halved the apparent climate sensitivity to increasing CO 2 .Furthermore, the hindcasts with the best fits to the observed temperatures were those that excluded the low multidecadal variability series from the TSI average, i.e., excluding those used for AR5 and AR6's hindcasts.

IPCC AR6's Positions on the Urbanization Bias and TSI Debates
AR6 has argued that the urbanization bias problem is small, i.e., <10% of the observed warming: "No recent literature has emerged to alter the AR5 finding that it is unlikely that any uncorrected effects from urbanization [. ..], or from changes in land use or land cover [. ..], have raised global Land Surface Air Temperature (LSAT) trends by more than 10%, although larger signals have been identified in some specific regions, especially rapidly urbanizing areas such as eastern China [27][28][29]" -AR6, Chapter 2, pp.43-44 [1].
However, that AR6 claim contradicts several studies in the post-AR5 literature finding urbanization bias comprises more than 10% [7,9,10] of the estimated land warming.Indeed, ironically, one of AR6's citations in the above quote-Shi et al. ( 2019) [29]-had specifically emphasized that recent literature had emerged questioning the AR5 finding.Surprisingly, Panmao Zhai, one of the co-chairs of AR6 had even drawn attention to the significance of Soon et al. (2015)'s [7] analysis of urbanization bias (and also their concerns about TSI) in his own work [30], yet these insights appear to have been overlooked by the authors of Chapter 2. According to a spokesperson for the IPCC (after consultation with Zhai), the reason for this particular oversight was apparently that "decisions on citations are up to the chapter team authors not the co-chairs" [16].

Aims of This Study
In this article, we carry out a statistical attribution analysis of Northern Hemisphere land surface temperature time series using various combinations of natural and anthropogenic forcings.We propose that much of the conflicting conclusions reached by the recent climate change attribution studies described above [1][2][3]5,[17][18][19][20][21][22] can be traced to differing choices on the two issues raised by C2021 [5], i.e.:

1.
How should the urbanization bias problem be accounted for? 2.
Which solar activity dataset(s) should be considered?
We note also that concerns about the reliability of both the CMIP5 and CMIP6 hindcasts in replicating observed climate changes have been expressed [31,[33][34][35][36][37][38][39][40][41].However, we suggest that most of the differences can be effectively distilled down to different views on these two specific issues.
C2021's analysis had been based on a sequential two-step linear regression, but they had recommended future work should consider multilinear regressions [5].This recommendation has since been echoed by others [20,22].We adopt such a multilinear regression for this analysis.
To investigate the significance of the urbanization bias problem, we consider two different temperature records-both taken from C2021 [5].The first record, "rural and HANDOUT: AGENDA ITEM 4.3 urban stations", uses all stations regardless of urbanization.This is equivalent to the temperature records considered by AR6.The alternative record is C2021's "rural stations only" record.
The NRLTSI2 model is a newer version of the NRLTSI1 model recommended to the CMIP5 modelers and described in Wang et al. (2005) [24] and other papers [44,45].Both NRLTSI1 and NRLTSI2 relied on Hoyt and Schatten (1998)'s "Group Sunspot Number (GSN)" record [46] as the primary solar activity proxy.The SATIRE model also originally used this GSN as its primary solar activity proxy.However, the version used by Matthes et al. (2017) [11] replaced this with version 1 of the "International Sunspot Number (ISN)" [47] record.Kopp et al. (2016) calculated that changing the original GSN to ISN effectively flattens the long-term trend for both NRLTSI2 and SATIRE by increasing the implied sunspot numbers during the 18th and 19th centuries [48].The various GSN/ISN reconstructions are the subject of considerable ongoing debate [49][50][51][52][53][54][55][56].Nonetheless, because the NRLTSI2 and SATIRE models both use sunspot areas [57] as their main solar activity proxy for the period from 1874 onwards (with NRLTSI2 also using 10.7 cm radio flux readings [58] as an additional solar proxy from 1950 onwards), Kopp et al. (2016) noted that most of these changes were confined to the pre-1874 period [48].The Matthes et al. (2017) reconstruction (and also our analysis in this article) avoids most of these issues by starting relatively close to this transition, i.e., in 1850.
The above TSI series comprised four of the "low solar variability" TSI estimates in C2021's compilation [5].However, for brevity, we only consider the Matthes et al. (2017) series since this was the one used for the CMIP6 hindcasts in AR6 [11].
We then compare and contrast the results from each combination.Finally, we offer recommendations for further research into this challenging, but important, problem.

Methods and Datasets Used
2.1.Northern Hemisphere Land Air Temperature Series Used Figure 1 compares the two different estimates of Northern Hemisphere land air temperatures considered in our analysis-a time series considering "rural and urban" stations and another considering "rural-only" stations.Both time series were downloaded from the Supplementary Materials for C2021 at https://doi.org/10.5281/zenodo.7088728(accessed on 6 July 2023).
Both series are gridded mean temperature anomaly estimates calculated using temperature records from version 3 of the National Oceanic and Atmospheric Administration (NOAA)'s National Centers for Environmental Information (NCEI)'s Global Historical Climatology Network (GHCN) dataset [64].Further details of the generation of these time series and a comparison with other estimates of Northern Hemisphere temperature time series are given by C2021 [5].See also Figure S1 in the Supplementary Materials.The "rural and urban" estimate of Figure 1a assumes that urbanization bias is a relatively minor problem.This would be consistent with AR6's claim that urbanization bias represents less than 10% of the long-term warming in most estimates of land air temperature trends [1].Therefore, all Northern Hemisphere stations in the GHCN dataset were used, regardless of their degree of urbanization.If AR6's claim is correct, then this time  [5].The homogenization steps described by Connolly et al. (2021) were applied to the station records [5].(d) The number of stations used for each year of the "rural-only" series.Note that the vertical scales are different for (b,d) since the rural-only series is limited to rural data and therefore only uses 10-15% of the total stations available.
Both series are gridded mean temperature anomaly estimates calculated using temperature records from version 3 of the National Oceanic and Atmospheric Administration (NOAA)'s National Centers for Environmental Information (NCEI)'s Global Historical Climatology Network (GHCN) dataset [64].Further details of the generation of these time series and a comparison with other estimates of Northern Hemisphere temperature time series are given by C2021 [5].See also Figure S1 in the Supplementary Materials.
The "rural and urban" estimate of Figure 1a assumes that urbanization bias is a relatively minor problem.This would be consistent with AR6's claim that urbanization bias represents less than 10% of the long-term warming in most estimates of land air temperature trends [1].Therefore, all Northern Hemisphere stations in the GHCN dataset were used, regardless of their degree of urbanization.If AR6's claim is correct, then this time series should be preferable to the "rural stations only" estimate of Figure 1c, since it is generated from a much larger sample of stations.This can be seen by comparing the station counts of both estimates in Figure 1b,d-noting that the vertical scales are different.Specifically, the "rural-only" estimate only uses 14% as many stations as the "rural and urban" estimate (a total of 586 stations vs. 4176 stations).
This philosophy of including all stations regardless of urbanization has dominated most attempts to estimate global temperature trends using weather stations.As a result, the "rural and urban" series is very similar to the equivalent time series by the Climate Therefore, to develop the rural-only time series, different approaches to correcting for non-climatic biases needed to be applied.The first step was to remove urbanization bias by excluding all non-rural stations for three of the regions (Arctic, USA, and Ireland) and applying empirically derived urbanization bias adjustments to any of the partially urbanized Chinese stations used for increasing the spatial and temporal representativeness of the region [5,7,37,76].Documented time-of-observation biases (TOB) associated with the USA network were accounted for using NOAA NCEI's empirically based TOB adjustments [80,81].Additional biases due to the documented degradation in station exposure of many USA stations [82] were corrected by means of a regionally averaged and empirically derived bias correction [5,7].Finally, biases due to documented station moves associated with the longest rural Irish station in the dataset were identified and corrected for experimentally in consultation with the station owners [7].
Doubtless, these adjustments have not yet identified all of the non-climatic biases that might be present in the temperature data.Some of us are continuing research into collating more station metadata to identify further potential non-climatic biases in the GHCN datasets, e.g., [76,77,83].Nonetheless, these manual homogenization adjustments based on station information have the advantage of avoiding the urban blending problem unlike the standard automated statistical homogenization adjustments currently used by most groups.
O'Neill et al. (2022) [83] have recently described additional concerns over the PHAadjusted GHCN dataset in a detailed assessment of more than 800 European GHCN stations.They found that less than 20% of the homogenization adjustments applied to a given day's version of the GHCN dataset are consistently applied.Also, they found that less than 20% of the adjustments could be clearly associated with known station changes documented in available station metadata.
In this study, we proceed with the two temperature series described above and shown in Figure 1.A visual comparison of the two series shows that while there are clear For this analysis, we use C2021's "rural and urban" time series as representative of AR6's position on the urbanization bias problem, i.e., that it is at worst a minor (<10%) concern-see AR6, Chapter 2, pp.43-44.However, as can be seen from Figure 2, all of these estimates are very similar, i.e., all seven rural and urban records overlap almost exactly with each other.Therefore, the results from our analysis would be very similar if we used any of the other equivalent "rural and urban" estimates, e.g., the Northern Hemisphere land components of Refs.[25, [64][65][66][67][68][69][70][71][72].
For C2021's "rural stations only" estimate, four geographical regions in the Northern Hemisphere were identified that had most of the longest and most complete rural station records in the GHCN dataset.From these regions, only those stations which are currently rural in terms of both nightlight brightness and associated population were used.An exception was made for the earliest periods and some recent years for China since there was a shortage of fully rural data for China in these periods.Therefore, for China, some data from currently urbanized stations were used for parts of the record, but urbanization bias corrections were applied whenever this occurred.
Although C2021's "rural stations only" estimate was derived from only four Northern Hemisphere regions, they noted that "the four regions alone account for more than 80% of the rural data for the early 20th century from either hemisphere".They also carried out a detailed investigation into whether the "new rural-only estimate [is] better or worse than the standard estimates that include both urban and rural stations"-see Section 3.1.1 of C2021 [5].
Most of the current global/hemispheric temperature series were generated using temperature records that have been adjusted using an automated statistical homogenization technique such as the "Pairwise Homogenization Algorithm" (PHA) used by NOAA HANDOUT: AGENDA ITEM 4.3 Climate 2023, 11, 179 9 of 36 NCEI [64,72,73], rather than the original temperature records.The admirable goal of these automated homogenization processes is to attempt to statistically identify and correct for non-climatic biases that might be present in the raw temperature records [64,72,73].However, C2021 [5] noted that such algorithms can inadvertently lead to "urban blending" whereby the urbanization biases of urban stations are partially "aliased" [74,75] into the homogenized records of rural neighbors [5,76,77].This was recently confirmed by Katata et al. (2023) [78].
If we assume that the magnitude of the urbanization bias problem is as modest as AR6 proposes, then the urban blending problem of automated homogenization would also be relatively small-indeed this has been argued by Hausfather et al. (2013) [79].Therefore, the rural and urban time series was generated from the widely used PHA-adjusted GHCN dataset.However, since the purpose of developing a "rural-only" series is to reconsider this assumption, C2021 recognized that using the PHA-adjusted GHCN dataset for evaluating rural temperature trends would be erroneous [5].Hence, the rural-only time series was generated from the original non-PHA-homogenized GHCN dataset.
Therefore, to develop the rural-only time series, different approaches to correcting for non-climatic biases needed to be applied.The first step was to remove urbanization bias by excluding all non-rural stations for three of the regions (Arctic, USA, and Ireland) and applying empirically derived urbanization bias adjustments to any of the partially urbanized Chinese stations used for increasing the spatial and temporal representativeness of the region [5,7,37,76].Documented time-of-observation biases (TOB) associated with the USA network were accounted for using NOAA NCEI's empirically based TOB adjustments [80,81].Additional biases due to the documented degradation in station exposure of many USA stations [82] were corrected by means of a regionally averaged and empirically derived bias correction [5,7].Finally, biases due to documented station moves associated with the longest rural Irish station in the dataset were identified and corrected for experimentally in consultation with the station owners [7].
Doubtless, these adjustments have not yet identified all of the non-climatic biases that might be present in the temperature data.Some of us are continuing research into collating more station metadata to identify further potential non-climatic biases in the GHCN datasets, e.g., [76,77,83].Nonetheless, these manual homogenization adjustments based on station information have the advantage of avoiding the urban blending problem unlike the standard automated statistical homogenization adjustments currently used by most groups.
O'Neill et al. (2022) [83] have recently described additional concerns over the PHAadjusted GHCN dataset in a detailed assessment of more than 800 European GHCN stations.They found that less than 20% of the homogenization adjustments applied to a given day's version of the GHCN dataset are consistently applied.Also, they found that less than 20% of the adjustments could be clearly associated with known station changes documented in available station metadata.
In this study, we proceed with the two temperature series described above and shown in Figure 1.A visual comparison of the two series shows that while there are clear similarities between the "rural and urban" and "rural-only" estimates, there are also several key differences: 1.
The "rural-only" estimate is noticeably "noisier", i.e., the magnitudes of the fluctuations from year to year are larger.This is largely a consequence of the reduced number of stations, as discussed in C2021.However, it is noteworthy that the timing of the multidecadal warming and cooling periods for both series are qualitatively similar.

2.
The long-term (1850-2018) linear warming trend of the "rural and urban" series is 62% higher than that for the "rural-only" series, i.e., +0.89 • C/century compared to +0.55 • C/century.C2021 argue that much of this extra warming in the "rural and urban" series is due to a combination of urbanization bias and "urban blending" arising from the homogenization process.

3.
While the "rural and urban" series implies an almost continuous long-term warming, the "rural-only" series suggests a much more nonlinear behavior.That is, the "ruralonly" series suggests that temperatures have alternated between multidecadal periods of cooling and periods of warming since at least the mid-19th century.
The nonlinear nature of the "rural-only" series has implications for our analysis since we will be predominantly focusing on linear trends and linear regressions.With regards to the relatively warm 1930s-1940s period suggested by the rural-only series, we note it has similarities to the early-20th-century warm period that has been noted for the Arctic region [37,84,85].We also note there seems to have been a slight mid-19th-century warm period that is not apparent from the "rural and urban" estimates.However, we caution that the number of stations available for the 19th century is especially limited for the rural-only series, as can be seen from Figure 1d.Some might argue that the fact that the following all imply a warming trend over their entire records indicates that the long-term warming trends implied by the various land station-based series shown in Figure 2 are also reliable [86]: However, we note from Figure 1c that the "rural-only" estimate also suggests warming trends over all these periods.Therefore, merely providing evidence of "global warming" over these relatively short periods is inadequate for resolving which of the two series is more reliable.
Given these concerns over the extent of urbanization biases in the land surface temperature (LST) records, one might consider bypassing the urbanization bias problem by using sea surface temperature (SST) data instead.This was the approach taken by Stefani (2021) [17] and the use of SST data was also part of the analyses of Harde (2022) [18] and Scafetta (2023) [22].Indeed, C2021 also noted that their "rural-only" estimate was qualitatively similar to Northern Hemisphere sea surface temperature (SST) estimates as well as several Northern Hemisphere temperature proxy-based estimates.
We agree that the SST data should be unaffected by urbanization biases-unless the SST data has been adjusted to better match the LST data as some groups have done, e.g., Cowtan et al. (2018) [92].However, we caution that the SST data also have their own non-climatic bias problems-especially before the 1950s.Hence, Jones (2016) has argued that the LST component of global and hemispheric temperature estimates is more reliable than the SST component [93].
For a detailed discussion of the uncertainties associated with the non-climatic biases of SST data, see Section 3.3 of C2021 [5] or Kent and Kennedy (2021) [94].However, one of the key challenges is the fact that the available SST data is very sparse before the 1950s (especially for the Southern Hemisphere) [95].Other challenges are the changes in instrumentation and measurement practices over time.It is known that different SST measurement practices can imply different multidecadal trends.For instance, Davis et al. (2019) found that SST measurements taken via "engine-room intake" implied a substantial global cooling trend from 1950 to 1975, while those taken by "bucket" measurements implied a slight warming over the same period [96].C2021 noted that the former would be consistent with the "rural-only" LST series while the latter would match better with the "rural and urban" LST series [5].
Therefore, for the current study, we will limit our analysis to the two LST records.However, we encourage further research into resolving the uncertainties associated with the SST data [94] and we recognize that studies using SST instead of urbanized LST have at least bypassed the urbanization bias problem [17,18,22].
In Figure 3, we highlight an additional challenge in assessing the extent of urbanization bias in the "rural and urban" series-the relative urban/intermediate/rural composition HANDOUT: AGENDA ITEM 4.3 of the series varies substantially over the period of the record.As can also be seen from Figure 1b, in the earlier parts of the record the fraction of available stations that remain rural today is very small-especially for the mid-to-late 19th century.For example, only 5% of the available stations for the 1850-1890 period remain rural today, while 65% are now fully urban-see Figure 3c.HANDOUT: AGENDA ITEM 4.3 Initially, one might think that this 19th-century urban composition is not so serious since the relative magnitude of the urban heat islands associated with those stations would have been much smaller than now (in the early 21st century).However, it is important to remember that the urbanization bias problem largely arises from long-term gradual warming biases.The majority of the stations used for comparing the mid-19th century to the present are now urbanized.Counterintuitively, for more recent decades, even though the rate of urbanization has accelerated, the urbanization bias is in some ways less of a problem for the modern period-since there is a much larger number of rural stations available.

Potential Climatic Drivers Used by Each Approach
The CMIP6 hindcasts that formed the main basis for AR6's attribution statement considered two natural forcings (solar and volcanic) and multiple anthropogenic forcings (chiefly, increasing greenhouse gases and aerosols, but with some other contributions).However, while the CMIP6 contributors were all strongly recommended to use the Matthes et al. ( 2017) [11] solar activity dataset, C2021 identified a wide range of plausible solar activity datasets that were also in the literature.Therefore, to assess the sensitivity of the results to changes in the solar activity series used, we consider two different solar activity datasets.We downloaded both of these from the Supplementary Materials of C2021 at https://doi.org/10.5281/zenodo.7088728(accessed 6 July 2023).
"Solar #1" is the one recommended to the CMIP6 contributors, i.e., Matthes et al. (2017) [11], although it has been updated to 2018 by C2021."Solar #2" is Scafetta et al. (2019)'s [12] update of the Hoyt and Schatten (1993) [13] time series to 2018.The time series are plotted in units of W m −2 relative to their 1901-2000 average in Figure 4a,b, respectively.We have also included a secondary y-axis on these two figures showing the equivalent values in terms of the effective radiative forcing (ERF) calculated following AR6's recommendation in Section 7.3.4.4 of AR6 [1].This involves scaling TSI by ×0.1278.
As can be confirmed visually by comparing Figure 4a,b, these two different TSI estimates offer a strikingly different history of changes in solar activity since 1850.We encourage readers to study C2021 for a detailed discussion of why such differences still exist today.However, in essence, the key differences relate to different philosophies on two issues: (a) the choices of solar proxies used for the pre-satellite era and (b) the satellite TSI composite used for calibration.
In terms of satellite TSI composites, there are several available, but the most distinct composites are: 1.
The Active Cavity Radiometer Irradiance Monitor (ACRIM) composite [12,32].This implies TSI increased between solar minima during the 1980s and 1990s but decreased during the early 21st century.

3.
The Royal Meteorological Institute of Belgium (RMIB) composite [99,100].This implies TSI has remained remarkably constant between solar minima over the entire satellite era.
The choice of satellite composite used for calibration has considerable implications for the most suitable solar proxies to use for the pre-satellite era, i.e., before 1979.If the RMIB composite is correct, then it could be argued that TSI largely follows the sunspot cycles and simply calibrating a suitable sunspot number record should give a reasonable TSI reconstruction for the pre-satellite era, e.g., Dewitte et al. (2022) [100].If the PMOD composite is correct, then a suitably rescaled sunspot number record should also give a reasonable TSI reconstruction, but the addition of extra solar proxies could potentially improve accuracy, e.g., Wang and Lean (2021) [101].The Matthes et al. (2017) [11] reconstruction used by CMIP6 for AR6 appears to have taken this second philosophy.
However, if the ACRIM composite is correct, then reconstructing TSI for the presatellite era becomes a much more challenging problem.That is because the variability in the TSI of solar minima implied by ACRIM is not captured by the sunspot number record (which reaches a minimum of exactly zero sunspots every solar minimum).ACRIM therefore implies that long-term (inter-cycle) trends in TSI need to be considered as well as the short-term (~11-year solar cycle) variability.Hoyt and Schatten (1993) [13] was a very significant TSI reconstruction specifically because it used multiple solar proxies to attempt to capture both the short-term (~11-year solar cycle) and the long-term (inter-cycle) multidecadal variability-see also Hoyt (1979) [102].Scafetta et al. (2019) [12] updated this 1993 reconstruction to 2018 using the ACRIM team's composite.
We encourage interested readers to repeat our analysis using other combinations of the 20 TSI time series in the Supplementary Materials of C2021 or indeed any other available TSI time series of interest.In principle, our analysis could also be repeated using annually resolved solar activity proxies other than TSI, e.g., geomagnetic activity indices [17].
For volcanic and anthropogenic components, we use the relevant time series from the IPCC AR6 WG1 Annex III dataset [103], which we downloaded from https://doi.org/10.5 281/zenodo.5705390(last accessed on 6 July 2023).We rescaled these time series relative to their 1901-2000 average.Both time series are plotted as effective radiative forcing in units of W m −2 relative to the 1901-2000 average in Figure 4c,d.
To simplify our analysis, we use the single "all anthropogenic forcings combined" time series provided by the IPCC Annex III dataset.We note, though, that this time series actually comprises 11 individual components.
According to the framework used by AR6, once a "forcing" time series has been converted into an effective radiative forcing (ERF) time series (in W m −2 ) by the application of suitable theoretical and/or semi-empirical calculations, they are almost directly comparable.Hence, many studies will routinely sum these time series together.Therefore, for brevity, we follow that approach for evaluating the anthropogenic contribution.However, for the statistical analysis in this paper, each component of the "net anthropogenic forcings" could in principle be evaluated separately.We have not done so here, but for reference, we have plotted all 11 components in the Supplementary Materials-see Figure S2.
Before we discuss our statistical analysis approaches, it might be useful to briefly compare and contrast our four potential fitting parameters in Figure 4. Comparing Solar #1 and Solar #2 in Figure 4a,b, we can see that both reconstructions are consistently below the 20th-century average for the 2nd half of the 19th century and both show short-term rises and falls over the ~11-year solar cycle.However, Solar #2 suggests a far more dynamic history for TSI variability with the ~11-year periodicity superimposed over multidecadal trends that are often greater in magnitude than the rises and falls associated with the ~11-year cycle.The changes from the other "natural forcing", i.e., volcanic activity, are much shorter in duration and episodic in nature.As can be seen from Figure 4c, the changes associated with a given volcanic event only last for 2-3 years.Finally, the net anthropogenic forcing series is almost flat for most of the study period but starts to steadily increase dramatically during the 1970s.Interested readers might note from Figure S2 that this shape of the "net anthropogenic forcings" is subtly different from the calculated "CO 2 forcing" that is more monotonic in nature-due to the other 10 components, chiefly the two aerosol components.

Statistical Analysis Used
We take two separate approaches to fitting our target temperature records.The first approach is to carry out an OLS linear regression between each of the four radiative forcings time series (Figure 4) and either the "rural and urban" or the "rural-only" time series (Figure 1).We refer to this approach as individual component fitting.This allows us to identify an upper bound for how much of the linear trends of each temperature series could potentially be explained in terms of each individual component, i.e., the maximum potential contribution.
For our main analysis, we will carry out the regression over the entire 1850-2018 record.However, as can be seen from Figure 1, the number of stations available for the midto-late-19th century is small-especially for the "rural-only" series.Moreover, from Figure

Statistical Analysis Used
We take two separate approaches to fitting our target temperature records.The first approach is to carry out an OLS linear regression between each of the four radiative forcings time series (Figure 4) and either the "rural and urban" or the "rural-only" time series (Figure 1).We refer to this approach as individual component fitting.This allows us to identify an upper bound for how much of the linear trends of each temperature series could potentially be explained in terms of each individual component, i.e., the maximum potential contribution.
For our main analysis, we will carry out the regression over the entire 1850-2018 record.However, as can be seen from Figure 1, the number of stations available for the mid-to-late-HANDOUT: AGENDA ITEM 4.3 19th century is small-especially for the "rural-only" series.Moreover, from Figure 3b,c, it can be seen that the "rural and urban" series is dominated by urban stations for this period.Therefore, as a secondary analysis, we will repeat our regression fittings using the shorter 1900-2018 period.The "rural-only" series uses at least 100 stations throughout this period and it has been estimated that global temperature trends can be reliably estimated with as few as 50 well-spaced locations [104,105] at the annual timescale our analysis considers.Meanwhile, the percentage of rural stations in the "rural and urban" series increases from 4% in the 1850-1890 period to ~20-24% throughout the 1900-2018 period.
The second approach is a multiple linear regression using a combination of two or three of our components.This regression is also OLS.As for our individual component fittings, our main analysis is carried out over the entire 1850-2018 period.However, again as a secondary analysis, we will repeat our regression fittings using the shorter 1900-2018 period.For each temperature series, we consider four combinations:
We emphasize that the relatively simple statistical approach adopted in this paper appears to reproduce reasonably well the findings of the more complex GCM-based attribution of AR6 [1] and Gillett et al. (2021) [2] provided it is carried out with the equivalent time series to those used by Gillett et al. (2021) [2].
Nonetheless, given that the results of our analysis only replicate AR6's attribution statement when it is applied using AR6's choices with regard to the TSI and urbanization bias debates, it is probably worth directly comparing and contrasting our approach to the attribution analysis that AR6 used, i.e., Gillett et al. (2021) [2].
Arguably, the biggest methodological difference between the two approaches is that our statistical fitting approach fits the time series for each component directly to the temperature records, whereas Gillett et al. (2021) [2] fits various climate model hindcasts to the temperature records.These hindcasts were themselves generated using the relevant natural and anthropogenic component time series as model inputs (often summarized in terms of radiative forcing units for comparison).Hence, both approaches use similar inputs, but Gillett et al. (2021)'s [2] approach uses climate models as an additional intermediate step.
Recently, Scafetta (2023) has suggested a regression methodology where the climate was allowed to respond to solar forcing differently than to the other radiative forcings.He suggested that the way changes in solar activity are modeled by current GCMs may be underestimating the climatic response to changing TSI by a factor of between 4 and 7.This is because the models do not allow for the possibility that the climate response to total solar activity changes might be greater than that predicted based on ERF calculations [22].This is consistent with Chylek et al. (2020)'s findings [111].Our statistical multilinear regression approach should also mitigate this potential problem.Harde (2022)'s two-layer model also appears to have reduced this problem by allowing for the possibility of solar-induced climate feedbacks [18].
There are several other technical differences between our approach and that of Gillett et [2] did not consider the possibility of urbanization bias contamination or study the effects of varying the choice of the TSI dataset.
Finally, we note that for the two solar activity time series, we use the original TSI values instead of the calculated ERF values, i.e., the primary y-axes of Figure 4a,b instead of the secondary y-axes, since the ERF values are derived from the TSI time series (via a linear scaling).However, the results are identical (not shown) when we use the ERF values.This is because OLS identifies the best linear relationship between the explanatory series and the target series that minimizes the squares of the residuals.Scaling the input time series by a linear (non-zero) constant only changes the values of the slope and intercept of this linear fitting, but not the goodness of the fit or the final results.

Evaluation Metrics Used
A challenge for a climate change attribution study such as this one or AR6's equivalent analysis is to decide on the most suitable metrics for comparing the fitted contributions to the observed climate change-in this case, Northern Hemisphere LSAT.
We consider four sets of metrics for evaluating the fits.We illustrate these in Figure 5, demonstrating how they are applied to the two target temperature records: 1.
For our main analysis, we will use a fairly straightforward metric-the long-term linear trend over the length of the data records, i.e., 1850-2018.As we shall see, this is usually a warming trend and reflects the long-term hemispheric warming since the start of the record (1850).Comparing Figure 5a,e, we note that the rural and urban trend is 60% higher than that for the rural-only record.It seems plausible that at least some of this extra warming is a result of urbanization bias.

2.
Meanwhile, as discussed in Section 2.3, the number of stations used for the 19th century is particularly low and mostly limited to stations that are now urbanized.For these reasons, we repeat our analysis by fitting over the shorter 1900-2018 period.Similarly, our second evaluation metric is the 1900-2018 linear trend.We can see from Figure 5a-d that these trends are higher for both the rural and urban series and the rural-only series.However, comparing Figure 5b,f, we note that the rural and urban trend is still substantially higher (67%) than that for the rural-only record.

3.
Still, using a single long-term linear trend is a somewhat crude metric since the LSAT trends are quite nonlinear-especially for the rural-only series-and neither are the trends for any of the three factors (see Figure 4).Therefore, for our third set of metrics, we consider multiple shorter-term linear trends.As can be seen from Figure 5c,g, both temperature series imply an alternation between warming and cooling periods over the course of the records.Therefore, we calculate the linear trends over three periods, corresponding to local minima and maxima common to both temperature series, i.e., warming from 1885-1938; cooling from 1938-1972; warming from 1972-2018.We note that the differences between the rural-only and rural and urban series are actually greatest for the early-20th-century warming and mid-20th-century cooling rather than the more recent warming period.This might be counterintuitive since urbanization has accelerated in recent decades.However, as discussed in Section 2.1, the urbanization bias problem is complicated by the fact that the availability of rural stations has also increased substantially over recent decades-see Figure 3. 4.
Our fourth metric completely avoids the use of linear trends and instead involves a comparison of the temperature averages over two fixed time periods.Specifically, we use the difference between the 1850-1900 average and the 1995-2014 average for comparison with several discussions in both AR5 and AR6, e.g., Section 7.3.5.3 HANDOUT: AGENDA ITEM 4.3 ("Temperature Contribution of Forcing Agents") of AR6 [1].We refer to this metric as the "AR6 comparison metric".In terms of this metric, the rural and urban series is 45% higher than the rural-only series-see Figure 5d,h.
Climate 2023, 11, x FOR PEER REVIEW 18 of 37 4. Our fourth metric completely avoids the use of linear trends and instead involves a comparison of the temperature averages over two fixed time periods.Specifically, we use the difference between the 1850-1900 average and the 1995-2014 average for comparison with several discussions in both AR5 and AR6, e.g., Section 7.3.5.3 ("Temperature Contribution of Forcing Agents") of AR6 [1].We refer to this metric as the "AR6 comparison metric".In terms of this metric, the rural and urban series is 45% higher than the rural-only series-see Figure 5d,h.We therefore will consider all four sets of metrics for our attribution.However, we note that further work could consider other metrics.For example, Lüning and Vahrenholt (2017) [114] argue that the use of 1850-1900 as a baseline for evaluating long-term warming is inappropriate from a paleoclimatic context since it incorporates the end of the Little Ice Age.Instead, they suggest 1940-1970 as a more suitable baseline.Meanwhile, Scafetta (2021a) [10] used the differences between the decadal averages of 1945-1954 and 2005-2014.We therefore will consider all four sets of metrics for our attribution.However, we note that further work could consider other metrics.For example, Lüning and Vahrenholt (2017) [114] argue that the use of 1850-1900 as a baseline for evaluating long-term warming is inappropriate from a paleoclimatic context since it incorporates the end of the Little Ice Age.Instead, they suggest 1940-1970 as a more suitable baseline.Meanwhile, Scafetta (2021a) [10] used the differences between the decadal averages of 1945-1954 and 2005-2014.

Results from the Individual Component Analyses
Figure 6 shows the OLS linear relationships between each of the components and the two temperature records using either the full 1850-2018 period or the shorter 1900-2018 period.We note that for both temperature records, the anthropogenic forcing has the HANDOUT: AGENDA ITEM 4.3 Climate 2023, 11, 179 18 of 36 highest r 2 value.However, as we saw from Figure 4d, the anthropogenic forcing series is relatively flat for most of the record until the 1970s.Therefore, most of the data points up until this period are clumped in a small region in Figure 6d,h.In contrast, the points are more evenly spread over the entire record for the Solar #2 fits.This suggests a more consistent influence from TSI than anthropogenic forcings over the entire record.The statistics associated with the Solar #1 fits are much weaker than that for Solar #2.Indeed, using the shorter 1900-2018 period for fitting, the linear relationship is not statistically significant for the rural-only record (p > 0.05) and barely statistically significant for the rural and urban record (p = 0.049).The fits for volcanic activity are quite weak, and they are not statistically significant for the rural and urban records either (p > 0.05).Nonetheless, let us now consider the results when we apply these linear relationships to each component.

HANDOUT: AGENDA ITEM 4.3
Figure 7 shows the results from the first stage of our analysis, i.e., the individual component analysis, using the full 1850-2018 record.The corresponding evaluation metrics are provided in Tables 1 and 2. The equivalent results using the shorter 1900-2018 period are provided in the Supplementary Materials as Figure S3 and Tables S1 and S2.The fitting coefficients and statistics for all fits are provided in the Supplementary Materials Excel file.HANDOUT: AGENDA ITEM 4.3 We can already understand AR6's attribution statement from this individual components analysis.If we use "Solar #1", then the maximum warming (upper bound) we can explain in terms of the two natural climatic drivers is 0.19 • C/century (21% of the observed 1850-2018 warming trend).It can only explain up to 12% of the AR6 comparison metric.
The other natural factor considered, i.e., the volcanic contribution can only contribute a slight cooling (or a slight relative warming of ~2% in terms of the AR6 comparison metric).Therefore, this already implies that the long-term warming is mostly human-caused, i.e., AR6's conclusion.
Meanwhile, the anthropogenic forcings can explain almost all the observed 1850-2018 warming trend (0.73 • C/century, i.e., 82%) and 88% of the AR6 comparison metric-Table 1.It even slightly overestimates the 1900-2018 trend with 107% of the observed warming.It also qualitatively appears a much closer match to the temperature record in Figure 7d than in Figure 7a.Therefore, using AR6's recommended solar series implies that the natural contribution to the long-term warming is at most quite modest, i.e., AR6's conclusion.
However, if AR6 had considered "Solar #2", they would probably have been less confident in their attribution.Solar #2 can explain more than 70% of the long-term warming (0.62 • C/century) and 65% of the AR6 comparison metric.It also captures quite a bit of the multidecadal variability in addition to the overall linear trend-Figure 7b.The results are even better using the fittings based on the shorter 1900-2018 period with 76% of the 1850-2018 warming and 71% of the AR6 metric-see Table S1.
Moreover, when we consider the shorter-term periods, the Solar #2 fits can explain the early-20th-century warming and the mid-20th-century cooling much better than the anthropogenic fits, i.e., the anthropogenic fits can only explain about 20% of both trends while the Solar #2 fit explain more than twice the observed warming and cooling over HANDOUT: AGENDA ITEM 4.3 these intervals.That said, the Solar #2 fit leaves nearly 30% of the long-term warming unexplained, only 39-43% of the 1900-2018 trend, and only 18-19% of the 1972-2018 warming interval-see Table 1 and Table S1.
Figure 7e-h and Table 2 consider the "rural-only" temperature series.As discussed, this suggests a lower, long-term, linear warming trend of 0.55 • C/century and a lower AR6 comparison metric of 0.98 • C than the "rural and urban" series.It also suggests more multidecadal variability, e.g., the 1885-1938 warming is nearly twice as fast as the rural and urban (1.90 • C/century vs. 1.07 • C/century) while the 1938-1972 cooling is more than three times as fast as the rural and urban (−2.80 • C/century vs. −0.77• C/century).
Having said that, even for this "rural-only" temperature series, AR6's attribution statement might still seem reasonable if it were assumed that the Matthes et al. (2017) [11] solar forcing dataset is correct.This is because Solar #1 can only explain at most 0.11 • C/century (20% of the observed warming) and 10% of the AR6 comparison metric.The Solar #1 fits using the shorter fitting interval of 1900-2018 are even weaker-see Table S2.Again, the volcanic contribution is only a slight cooling.However, if Solar #2 is correct, then the situation is very different.Solar #2 can explain up to 0.49 • C/century (89%) of the long-term warming trend and 74% of the AR6 comparison metric.
Visually comparing Figure 7f,h, it can be seen that Solar #2 captures more of the observed multidecadal variability of the "rural-only" temperatures than the anthropogenic forcings fit.Specifically, the anthropogenic fits can only explain 8% of the 1885-1938 warming and 3% of the 1938-1972 cooling, while the Solar #2 fit can explain 89% of the former and 49% of the latter-see Table 2.That said, Solar #2 is only able to explain 15% of the 1972-2018 warming, while the anthropogenic forcings can explain 80%.

Results from the Multiple Linear Regression Analyses
Now let us consider more realistic scenarios where there are multiple climatic drivers.Here, the best statistical fits are calculated in terms of either two factors ("natural only" drivers of solar and volcanic activity) or three factors ("natural and anthropogenic" drivers of solar, volcanic, and anthropogenic activity).
Figures 8 and 9 plot the fits from the multiple linear regression analyses using the period 1850-2018 and Tables 3 and 4 provide the evaluation metrics.Figure 8 and Table 3 provide the results for the rural and urban temperature record, while Figure 9 and Table 4 provide those for the rural-only record.The equivalent results fit using the shorter 1900-2018 period are provided in the Supplementary Materials as Figures S4 and S5 and Tables S3 and S4.HANDOUT: AGENDA ITEM 4.3 Furthermore, if we substitute Solar #2 for Solar #1, we find a much better match for the "rural-only" temperature estimates-compare Figure 9a,b.Moreover, using Solar #2, it is possible to explain most (86%) of the 1850-2018 warming and 77% of the AR6 comparison metric in terms of "natural forcings only", i.e., Scenario 8. Indeed, visually, from Figure 9, one could argue that Scenario 8 better captures the multidecadal oscillations between warming and cooling than Scenario 6.For example, from Table 4, we can see that Scenario 8 can explain 96% of the early-20th-century warming while Scenario 6 can only explain 59%.Similarly, while Scenario 8 can explain 64% of the mid-20th-century cooling, Scenario 6 can only explain 44%.

Comparison of Results to Other Attribution Studies Building on C2021's Findings
As described above, our analysis in Scenarios 1, 3, 5, and 7 (i.e., using Solar #1) is able to qualitatively replicate AR6's attribution statement that the long-term warming since the 19th century is mostly human-caused.However, the results from Scenarios 2 and 6 imply that the long-term warming was due to a mixture of natural and anthropogenic factors.As before, we can easily reproduce AR6's attribution statement by selectively considering a few of the results.Scenarios 1 and 3 encapsulate AR6's overall position since Scenarios 2 and 4 consider Solar #2, which was not considered by the CMIP6 contributors.Scenarios 5-8 were also not considered by AR6 since they are based on the rural-only temperature estimate.
Our Scenarios 1 and 3 imply, similar to AR6's analysis, that natural forcings can only explain a small fraction of the observed warming, i.e., at most 21% of the 1850-2018 trend and 13% of the AR6 comparison metric-see Table 3.Furthermore, if we add anthropogenic forcings to the mix, we can explain most of the warming, i.e., 87% of the 1850-2018 trend and 90% of the AR6 comparison metric-see Table 3.
However, if we consider the other scenarios, then this "obvious" conclusion becomes much less obvious.If we simply replace Solar #1 with Solar #2, then the overall percentage of the explained warming trend from all components increases to 92% of the 1850-2018 trend, 100% of the 1900-2018 trend, and 97% of the AR6 comparison metric-see Table 3.This is consistent with Solar #2 being a more representative TSI dataset than Solar #1.
The equivalent results using the shorter 1900-2018 interval fitting are still high but slightly lower for both 1850-2018 and 1900-2018 (80% and 95% respectively)-see Table S3.Furthermore, when we consider the shorter-term periods, Scenario 2 provides a much better match than Scenario 1, i.e., Scenario 1 can only explain 30% of the early-20th-century warming and 36% of the mid-20th-century cooling, while Scenario 2 can explain 57% and 77% of those trends, respectively.Both scenarios can explain 97-98% of the 1972-2018 warming.
Meanwhile, when we consider Scenario 4, we can see that the match between Solar #2 and the observed temperature series is already sufficient to dispute AR6's claim that removing anthropogenic forcings leaves most of the observed warming unexplained.We can see from Scenario 4 of Table 3 that ~70% of the 1850-2018 trend and ~66% of the AR6 comparison metric can be explained in terms of natural forcings if Solar #2 is used.Interestingly, this is intriguingly similar to Hoyt and Schatten (1993)'s [13] estimate that "On a decadal timescale the solar irradiance model [i.e., the 1993 version of Solar #2] can explain ~71% of the variance during the past 100 years" despite more than 25 years of extra data for our current analysis.
If we also replace the "rural and urban" estimate with the "rural-only" estimate, further questions arise concerning the AR6 attribution statement.This can be seen by comparing Scenarios 1 and 5.We can see the visual matches between the fits and the observed temperatures become much weaker-compare Figures 8a and 9a.Therefore, if the "rural-only" estimates are more reliable than the "rural and urban" estimates, it implies problems with the validity of Solar #1.
Furthermore, if we substitute Solar #2 for Solar #1, we find a much better match for the "rural-only" temperature estimates-compare Figure 9a,b.Moreover, using Solar #2, it is HANDOUT: AGENDA ITEM 4.3 possible to explain most (86%) of the 1850-2018 warming and 77% of the AR6 comparison metric in terms of "natural forcings only", i.e., Scenario 8. Indeed, visually, from Figure 9, one could argue that Scenario 8 better captures the multidecadal oscillations between warming and cooling than Scenario 6.For example, from Table 4, we can see that Scenario 8 can explain 96% of the early-20th-century warming while Scenario 6 can only explain 59%.Similarly, while Scenario 8 can explain 64% of the mid-20th-century cooling, Scenario 6 can only explain 44%.
However, we note that Scenario 8 can only explain 22% of the 1972-2018 warming while Scenario 6 can explain 72%.Also, Scenario 8 still leaves 13% of the long-term warming and 23% of the AR6 comparison metric unexplained.

Comparison of Results to Other Attribution Studies Building on C2021's Findings
As described above, our analysis in Scenarios 1, 3, 5, and 7 (i.e., using Solar #1) is able to qualitatively replicate AR6's attribution statement that the long-term warming since the 19th century is mostly human-caused.However, the results from Scenarios 2 and 6 imply that the long-term warming was due to a mixture of natural and anthropogenic factors.Meanwhile, the results from Scenarios 4 and 8 imply that the warming was mostly natural in origin.
Several of the studies described in Section 1.2 also investigated the influence of changing TSI on global or hemispheric temperatures since the 19th century using either Solar #1 or Solar #2 or a closely related estimate of solar activity.Therefore, let us compare our findings to the most relevant results of those studies:

•
Harde (2022)'s [18] analysis included two hindcasts that were somewhat similar to our Scenarios 5 and 6, although his temperature record was a combined "rural Northern Hemisphere land and ocean" series instead of our land-only analysis.His hindcasts using Solar #1 failed to find a substantial solar role (similar to our Scenario 5).However, using Solar #2, he found that 2/3 of the long-term warming was solar in origin and only 30% was anthropogenic.[26], which is closely related to and similar to Solar #1.Their chosen temperature record was a global land and ocean series, which includes urban data.Therefore, their analysis has some similarities to our Scenarios 1 and 3.However, a better comparison would be to our individual component fits in Figure 7. Li et al. found a very strong solar signal in the temperature changes up to about 1960, but afterward, the temperature changes shifted to being dominated by increasing CO 2 .We can obtain similar results by comparing Figure 7a,d, where we can see that Solar #1 cannot explain the post-1960 warming of the rural and urban record, but that it can be explained by the anthropogenic component.

•
Richardson and Benestad (2022) [20] confined their analysis to the rural and urban series and did not consider the "natural only" scenarios.However, they analyzed 14 of C2021's 16 TSI records including Solar #1 and Solar #2.Therefore, their analysis can be compared to Scenarios 1 and 2. Qualitatively, we confirm their finding that for these Scenarios, the long-term warming is mostly anthropogenic (see Figure 8).That said, for Scenario 2, we found that ~27% of the long-term warming could be explained in terms of TSI.According to their Figure 5a, none of Richardson and Benestad (2022)'s [20] "corrected" fits identified a "solar-caused warming fraction" greater than 10%.This suggests that their analysis method substantially underestimates the solar contribution relative to ours.One possible explanation is that their "weighted least squares" fitting method apparently prioritizes fitting the most recent portions of the temperature record over the earlier portions.In contrast, our fitting approach optimized the fits over either the entire temperature record (1850-2018) or the shorter 1900-2018 period.

•
Chatzistergos (2023) [21] did not use any TSI series for his analysis.However, he analyzed one of the five solar proxies used in Solar #2.He found that this specific proxy was unable to explain much of the post-1970 warming for several global land and ocean records (that incorporated urban data).In contrast, our Scenario 4 suggests HANDOUT: AGENDA ITEM 4.3 that Solar #2 can explain ~70% of the long-term (1850-2018) warming of the rural and urban series.This suggests that the multiproxy nature of Solar #2 is better able to explain the observed temperature changes than the isolated use of individual solar proxies.

•
Scafetta (2023) [22] found that hindcasts using only AR6's recommended forcings (including Solar #1) were able to reproduce AR6's attribution statement.However, the hindcasts with the best fits to the observed temperatures were those that included the high-multidecadal-variability TSI records (including Solar #2) and excluded the low-multidecadal-variability TSI records (i.e., Solar #1 and similar reconstructions).
In the latter case, the attribution results were similar to those here obtained for Scenario 6-see Figure 9b-in particular when the global sea surface temperature records were used.

Conclusions
The IPCC AR6 concluded that "climate models can only reproduce the observed warming [. ..] when including the effects of human activities [. ..], in particular the increasing concentration of greenhouse gases", and that "simulations that include only natural process, including internal variability related to El Niño and other similar variations, as well as variations in the activity of the sun and emissions from large volcanoes [. ..], are not able to reproduce the observed warming" (AR6, FAQ 3.1, p. 515) [1].Largely on this basis, AR6 concluded that contemporary climate change is "overwhelmingly due to human influence" (Technical Summary, p. 11).However, in this article, we argue that this confident "detection and attribution of climate change" statement was unjustified because it failed to satisfactorily assess two key ongoing scientific debates: 1.
How much of the warming since the 19th century implied by current global temperature estimates is an artifact of urbanization biases? 2.
Have we established a reliable solar forcing dataset for estimating the solar contribution to these trends?
AR6 has explicitly argued that urbanization bias represents less than 10% of the longterm warming, but several recent studies have disputed this claim [5,7,9,10,78].Meanwhile, AR6 argues that the Matthes et al. (2017) solar forcing dataset has been confirmed to be reliable, yet this claim is challenged by several studies arguing it has not yet been satisfactorily resolved which (if any) of the many solar forcing datasets are most reliable [5,7,12,15].
With that in mind, we applied a series of statistical attribution assessments to two different estimates of Northern Hemisphere land temperatures over the period 1850-2018 (with a secondary analysis over the period 1900-2018).The first temperature estimate assumes that urbanization bias is a minor problem at worst and matches almost exactly with the estimates considered by AR6 (see Figure 2).The second estimate was calculated using only stations that are currently rural or that had been explicitly corrected for urbanization bias.
In terms of the physical significance of the various statistical correlations discussed in this paper, we emphasize that regression can neither verify nor conclude causation; it can only confirm that there is a statistical correlation between the time series under investigation.Indeed, Soon et al. (2015) [7] stressed that there are at least four types of correlations: 1.
Therefore, even identifying the existence of a correlation does not in itself establish causation.
In the case of a commensal correlation where both variables are influenced by a common factor, the analysis can still potentially be informative, e.g., as noted by Soon et al. (2015) [7], "If a given solar climate correlation were commensal, then this would indicate that some (possibly unknown) factor which is influencing the Earth's climate is also influencing a particular HANDOUT: AGENDA ITEM 4.3 aspect of solar variability.However, if that factor was influencing some aspect of solar variability, it would presumably be some other form of solar variability, and therefore the correlation would still be with solar variability".However, in the case of types 3 and 4, the apparent correlation would arguably be spurious.
Nonetheless, we note that this caution equally applies to the analysis of Gillett et al. (2021) [2], i.e., the main basis for AR6's attribution statement, as well as the attribution statement of C2021 [5] and the analysis in this paper.
As can be seen from Figure 1b,d, the "rural-only" estimate includes much less data (~10-15%)-especially for the earlier periods.As a result, that temperature series is "noisier" than the "rural and urban" estimate.Therefore, if AR6 is correct that the problem of urbanization bias is relatively minor, it would seem reasonable to prefer the "rural and urban" time series.However, apart from the reduced interannual "noise" of the "rural and urban" estimates, there are also differences in the magnitudes of the multidecadal intervals of warming and cooling.In particular, the long-term 1850-2018 linear warming trend of the "rural-only" estimate (0.55 • C/century) is only 62% of that of the "rural and urban" estimate (0.89 • C/century).If even half of this difference were due to urbanization bias, it would already contradict AR6's claim.For a detailed analysis of the differences between the two time series, we refer to Section 3.1.1 of C2021 [5].Several studies have noted that resolving the urbanization bias problem remains a major challenge [5][6][7][8][9][10]76].Therefore, we suggest that a more careful investigation of this problem and other non-climatic biases should be a high research priority.
Meanwhile, we found that simply substituting an alternative solar forcing dataset to that considered by AR6's climate model hindcasts can substantially increase the amount of the 1850-2018 warming that can be explained in terms of natural forcing from 21% to 70% of the long-term warming implied by the "rural and urban" series and 87% of the "rural-only" temperature series.
This suggests that the scientific debates over which solar forcing dataset to use have yet to be satisfactorily resolved.C2021 describe several key ongoing debates over the TSI datasets.One question focuses on the timing and shapes of the various peaks and troughs.Another major issue is the choice of satellite composite used for calibrating the various solar proxies [12,32,[97][98][99][100]115].The fact that debate is still ongoing over how TSI has varied even within the satellite era points to the importance of continuing (even increasing) investment in multiple TSI-monitoring satellite missions [12,[115][116][117][118].
Our analysis was confined to the Northern Hemisphere land surface air temperatures since this was the region where we had enough data coverage to construct a "rural-only" time series from the GHCN version 3 dataset.The GHCN dataset has recently been upgraded to version 4 with a larger number of stations and in many cases has longer records [72], and some of us have begun work using this dataset instead [76,83].We also encourage projects to compile and digitize early historic temperature measurements and accompanying station history metadata [131,132].However, we caution that the current approaches of using statistical homogenization techniques to correct temperature records for non-climatic biases are prone to "aliasing effects" [74,75], including "urban blending" [5,76,77].Katata et al. (2023) have offered some potential modifications to temperature homogenization to reduce or remove this problem [78].
HANDOUT: AGENDA ITEM 4.3 Temperature proxies might potentially help extend our "rural-only" time series to the early 19th century or even earlier, i.e., the so-called "Little Ice Age" period [133][134][135].We suggest that a more satisfactory resolution of the urbanization debate over land surface temperatures might also help in the various ongoing debates over ocean temperature trends [94,136,137].
We note that even for the "rural-only" time series with the best fit of the solar forcing datasets (Solar #2), ~15% of the 1850-2018 warming was unexplained by just solar and volcanic forcing.Much of this might be explained by an additional contribution from anthropogenic forcings [1,2,17,106,107,111]. Although, if so, it probably would involve a much lower climate sensitivity to greenhouse gases than the CMIP6 models imply-as several studies have suggested, e.g., [17,18,31,33,34,[138][139][140][141][142][143][144][145][146][147][148].There may also be additional non-climatic biases remaining in the data [5,7,10,76].However, we also notice that for the "rural-only" series, none of the fits completely captured all the multidecadal temperature oscillations.That is, the anthropogenic and "natural and anthropogenic" fits failed to capture the mid-19th-century or mid-20th-century warm periods, while the solar and "natural only" fits failed to capture the mid-19th-century warm period and the most recent part of the current warm period.Therefore, if the "rural-only" series is correct, additional climatic drivers to those considered by this analysis and the IPCC's equivalent attribution analyses have yet to be included.
With that in mind, we emphasize that for simplicity, we have explicitly assumed for this paper, like AR6's climate model hindcasts, that the main "natural" drivers of global temperature change are changes in (1) TSI and (2) volcanic forcing.However, some have argued for additional, more subtle, relationships between solar activity and climate [5,17,19,22,63,109,139,.Meanwhile, some studies suggest that a better understanding of the role of volcanic eruptions on climate change is required [111,171].
AR6 correctly notes that the main Earth/Sun orbital changes "operate on very long time scales (i.e., thousands of years).As such, they have displayed very little change over the past century and had very little influence on temperature changes observed over that period" (AR6, FAQ 3.2, p. 517) [1].However, in recent years, several researchers have noted that these long-term changes also lead to subtle regional shifts in seasonality on multidecadal to centennial timescales that are not insignificant [155,[172][173][174][175] and that these shifts are also influenced by the Earth/Moon orbit [174][175][176].
Others suggest that much of the multidecadal temperature variability can indeed be explained in terms of natural "internal climate variability" that current climate models do not appear to capture fully [111,133,142,[177][178][179][180].This could comprise changes in:
Therefore, we also encourage more active investigation in the future into the possibilities of natural climate drivers other than TSI and volcanic forcings.Several of the other studies we discussed have also made this point [17,18,20,22,111].
One potentially useful form of climatic data records could be ground-based surface incident solar radiation measurements ("sunshine duration") [185][186][187][188].This form of data incorporates variability in solar activity, orbital dynamics, atmospheric transparency, and cloud cover-making it potentially a powerful climate-related dataset.However, there are debates over the contribution of urbanization effects to local sunshine measurements [187].Therefore, urban/rural challenges might also exist with those data.
In summary, to resolve the causes of the climate changes since the 19th century more satisfactorily, we encourage more research into the following: 1.
Better quantification of the contribution of urbanization bias to current global temperature estimates.

2.
Improving temperature homogenization techniques to minimize urban blending and more accurately correct for other non-climatic biases.

Figure 1 .
Figure 1.Visual comparison of the two different estimates of Northern Hemisphere land air temperature trends (1850-2018) considered in this article.Both series were generated using version 3 of NOAA NCEI's Global Historical Climatology Network (GHCN) dataset.(a) The "rural and urban" series is the gridded mean average of all Northern Hemisphere stations regardless of urbanization status.NOAA's homogenized versions of the station records were used.(b) The number of stations used for each year of the "rural and urban" series with the relative composition of urban/intermediate/rural for each year indicated via different colors.(c) The "rural-only" series uses only rural stations taken from the four regions identified by Connolly et al. (2021) [5].The homogenization steps described by Connolly et al. (2021) were applied to the station records [5].(d) The number of stations used for each year of the "rural-only" series.Note that the vertical scales are different for (b,d) since the rural-only series is limited to rural data and therefore only uses 10-15% of the total stations available.

Figure 1 .
Figure 1.Visual comparison of the two different estimates of Northern Hemisphere land air temperature trends (1850-2018) considered in this article.Both series were generated using version 3 of NOAA NCEI's Global Historical Climatology Network (GHCN) dataset.(a) The "rural and urban" series is the gridded mean average of all Northern Hemisphere stations regardless of urbanization status.NOAA's homogenized versions of the station records were used.(b) The number of stations used for each year of the "rural and urban" series with the relative composition of urban/intermediate/rural for each year indicated via different colors.(c) The "rural-only" series uses only rural stations taken from the four regions identified by Connolly et al. (2021) [5].The homogenization steps described by Connolly et al. (2021) were applied to the station records[5].(d) The number of stations used for each year of the "rural-only" series.Note that the vertical scales are different for (b,d) since the rural-only series is limited to rural data and therefore only uses 10-15% of the total stations available.

Figure 2 .
Figure 2. Visual comparison of the "rural and urban" Northern Hemisphere land surface air temperature time series (1850-2018) used in the article (thick black line) to six other widely used estimates of Northern Hemisphere land surface air temperatures.For more details on these series, see Connolly et al. (2021) [5].

Figure 2 .
Figure 2. Visual comparison of the "rural and urban" Northern Hemisphere land surface air temperature time series (1850-2018) used in the article (thick black line) to six other widely used estimates of Northern Hemisphere land surface air temperatures.For more details on these series, see Connolly et al. (2021) [5].

Figure 3 .
Figure 3.Comparison in terms of urban composition of the two Northern Hemisphere land surface air temperature estimates considered in this analysis.(a) Both time series of Figure 1 in the same panel for direct comparison.(b) The breakdown of urban, intermediate, and rural stations used for each year of the "rural and urban" series.(c) Urban composition of the "rural and urban" series averaged over three different time periods: left-all available stations; middle-average breakdown for the 1850-1890 period; right-average breakdown for the 1950-1990 period.

Figure 4 .
Figure 4. Time series of the four forcings considered in this analysis.(a,b) represent two different estimates of solar variability since the mid-19th century; (c) plots the volcanic forcing; (d) plots the combined anthropogenic forcings.For more details on the two solar activity series, see Connolly et al. (2021) [5].The volcanic activity and anthropogenic forcing time series are taken from the IPCC AR6 WG1 Annex III dataset [103].

Figure 4 .
Figure 4. Time series of the four forcings considered in this analysis.(a,b) represent two different estimates of solar variability since the mid-19th century; (c) plots the volcanic forcing; (d) plots the combined anthropogenic forcings.For more details on the two solar activity series, see Connolly et al. (2021) [5].The volcanic activity and anthropogenic forcing time series are taken from the IPCC AR6 WG1 Annex III dataset [103].

Figure 5 .
Figure 5. Illustration of the four evaluation metrics when applied to (a-d) the estimates of Northern Hemisphere land surface air temperatures derived from both rural and urban stations and (e-h) the estimates derived from only rural stations.(a,e) show the long-term linear trend over the entire period of record, i.e., 1850-2018.(b,f) show the linear trends over our secondary period analysis, i.e., 1900-2018.(c,g) show three shorter-term warming and cooling periods associated with both time series since the late-19th century, i.e., warming between 1885-1938 and 1972-2018 and a cooling period between 1938-1972.(d,h) show the "AR6" metric, i.e., a comparison between the average temperatures for 1850-1900 and 1995-2014.

Figure 5 .
Figure 5. Illustration of the four evaluation metrics when applied to (a-d) the estimates of Northern Hemisphere land surface air temperatures derived from both rural and urban stations and (e-h) the estimates derived from only rural stations.(a,e) show the long-term linear trend over the entire period of record, i.e., 1850-2018.(b,f) show the linear trends over our secondary period analysis, i.e., 1900-2018.(c,g) show three shorter-term warming and cooling periods associated with both time series since the late-19th century, i.e., warming between 1885-1938 and 1972-2018 and a cooling period between 1938-1972.(d,h) show the "AR6" metric, i.e., a comparison between the average temperatures for 1850-1900 and 1995-2014.

Climate 2023 , 37 Figure 6 .
Figure 6.The ordinary least squares linear regression fitting functions between (a-d) the "rural and urban" temperature record and (e-h) the "rural-only" temperature record.For the 1850-2018 period, the data are shown with gray diamonds; the linear fits are shown with black dashed lines.For the 1900-2018 period, the data are shown with colored circles; the linear fits are shown with colored dotted lines.The slopes and intercepts of each linear relationship along with the r 2 and p statistics of the fits are shown in the relevant panel with the 1850-2018 statistics indicated by black text and those for the 1900-2018 indicated by colored text.

Figure 6 .
Figure 6.The ordinary least squares linear regression fitting functions between (a-d) the "rural and urban" temperature record and (e-h) the "rural-only" temperature record.For the 1850-2018 period, the data are shown with gray diamonds; the linear fits are shown with black dashed lines.For the 1900-2018 period, the data are shown with colored circles; the linear fits are shown with colored dotted lines.The slopes and intercepts of each linear relationship along with the r 2 and p statistics of the fits are shown in the relevant panel with the 1850-2018 statistics indicated by black text and those for the 1900-2018 indicated by colored text.

Figure 7 .
Figure 7.The results of fitting (a-d) the "rural and urban" or (e-h) the "rural-only" temperature records (indicated by thick black lines) using only one component (using ordinary least squares linear regression) over the 1850-2018 period.The best fits for each individual component are indicated in each panel with colored circles joined by a dotted line.(a,e) show the best fits for Solar #1; (b,f) show the best fits for Solar #2; (c,g) show the best fits for volcanic; (d,h) show the best fits for the net anthropogenic forcing.

Figure 8 .
Figure 8.The results of fitting the temperature records over the entire 1850-2018 period using multiple components (using ordinary least squares multiple linear regression) for the "rural and urban" temperature record.(a-d) show the best fits for Scenarios 1-4, respectively.The temperature record is shown in each panel by a thick black line.The panels on the left-hand-side show the model fits with green colored circles joined by a dotted line.The other panels show the contribution to the model fit from each of the two or three components.

Figure 8 .
Figure 8.The results of fitting the temperature records over the entire 1850-2018 period using multiple components (using ordinary least squares multiple linear regression) for the "rural and urban" temperature record.(a-d) show the best fits for Scenarios 1-4, respectively.The temperature record is shown in each panel by a thick black line.The panels on the left-hand-side show the model fits with green colored circles joined by a dotted line.The other panels show the contribution to the model fit from each of the two or three components.

Figure 9 .
Figure 9.As for Figure 8, except for the "rural-only" temperature records.The results of fitting the temperature records over the entire 1850-2018 period using multiple components (using ordinary least squares multiple linear regression) for the "rural-only" temperature record.(a-d) show the best fits for Scenarios 1-4 respectively.The temperature record is shown in each panel by a thick black line.The panels on the left-hand-side show the model fits with green colored circles joined by a dotted line.The other panels show the contribution to the model fit from each of the two or three components.

Figure 9 .
Figure 9.As for Figure 8, except for the "rural-only" temperature records.The results of fitting the temperature records over the entire 1850-2018 period using multiple components (using ordinary least squares multiple linear regression) for the "rural-only" temperature record.(a-d) show the best fits for Scenarios 1-4 respectively.The temperature record is shown in each panel by a thick black line.The panels on the left-hand-side show the model fits with green colored circles joined by a dotted line.The other panels show the contribution to the model fit from each of the two or three components.
HANDOUT: AGENDA ITEM 4.3 [2]lett et al. (2021)his case, year) is well determined.BecauseGillett et al. (2021)[2]attempted to fit climate model hindcasts, they argue that TLS is preferable since this allows for the fact that, due to the internal variability of climate models, there can be some uncertainty in the timing of temperature changes.

Table 1 .
Results of individual component analysis fitting of the "rural and urban" temperature record over the 1850-2018 period in terms of the various evaluation metrics.

Table 1 .
Results of individual component analysis fitting of the "rural and urban" temperature record over the 1850-2018 period in terms of the various evaluation metrics.

Table 2 .
Results of individual component analysis fitting of the "rural-only" temperature record over the 1850-2018 period in terms of the various evaluation metrics.

Table 3 .
Results of multiple linear regression fitting of the "rural and urban" temperature record over the 1850-2018 period in terms of the various evaluation metrics.

Table 4 .
Results of multiple linear regression fitting of the "rural-only" temperature record over the 1850-2018 period in terms of the various evaluation metrics.