Next Article in Journal
Analysis of Cell Signal Transduction Based on Kullback–Leibler Divergence: Channel Capacity and Conservation of Its Production Rate during Cascade
Next Article in Special Issue
Diffusion on Middle-ξ Cantor Sets
Previous Article in Journal
Information Geometry of κ-Exponential Families: Dually-Flat, Hessian and Legendre Structures
Previous Article in Special Issue
Hierarchical Scaling in Systems of Natural Cities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complexity Analysis of Global Temperature Time Series

by
António M. Lopes
1,*,† and
J. A. Tenreiro Machado
2,†
1
UISPA–LAETA/INEGI, Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, 4200–465 Porto, Portugal
2
Department of Electrical Engineering, Institute of Engineering, Polytechnic of Porto, R. Dr. António Bernardino de Almeida, 431, 4249–015 Porto, Portugal
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Entropy 2018, 20(6), 437; https://doi.org/10.3390/e20060437
Submission received: 26 March 2018 / Revised: 25 May 2018 / Accepted: 2 June 2018 / Published: 5 June 2018
(This article belongs to the Special Issue Power Law Behaviour in Complex Systems)

Abstract

:
Climate has complex dynamics due to the plethora of phenomena underlying its evolution. These characteristics pose challenges to conducting solid quantitative analysis and reaching assertive conclusions. In this paper, the global temperature time series (TTS) is viewed as a manifestation of the climate evolution, and its complexity is calculated by means of four different indices, namely the Lempel–Ziv complexity, sample entropy, signal harmonics power ratio, and fractal dimension. In the first phase, the monthly mean TTS is pre-processed by means of empirical mode decomposition, and the TTS trend is calculated. In the second phase, the complexity of the detrended signals is estimated. The four indices capture distinct features of the TTS dynamics in a 4-dim space. Hierarchical clustering is adopted for dimensional reduction and visualization in the 2-dim space. The results show that TTS complexity exhibits space-time variability, suggesting the presence of distinct climate forcing processes in both dimensions. Numerical examples with real-world data demonstrate the effectiveness of the approach.

1. Introduction

Understanding climatic variability is extremely important nowadays. The Earth’s climate is changing and we are facing an increasing number of extreme weather events, like floods, droughts and anomalous temperatures. Since the beginning of the 20th century the Earth’s surface average temperature has increased by nearly 0.8 °C [1,2,3], and several projections based on climate models estimate that during this century, the average temperature may increase by between 1.1 °C and 6.4 °C [1]. It has been verified that the global surface temperature has risen at an average rate of about 0.17 °C per decade since 1970 [1]. Moreover, the annual mean Arctic sea-ice extent decreased at a rate of about 3.5% to 4.1% per decade over the period from 1979 to 2012 [4]. The Earth is warming and it seems that human activity is one of the main causes for this phenomenon [5,6]. Estimates from the Intergovernmental Panel on Climate Change (IPCC) revealed a considerable increase of anthropogenic greenhouse gas emissions since the pre-industrial era [1]. In the period from 1970–2010, the annual growth rate of emissions was above 1.3% [7]. Some impacts of global warming, such as record high temperatures, melting glaciers and severe flooding are becoming increasingly common around the world [8,9,10,11,12]. Besides the direct effect on temperature, warming leads to the modification of wind patterns, the development of humidity, and significant changes in the rates of precipitation. These phenomena are under intensive research due to their major impacts on the social, economic and health aspects of human life [13,14,15,16].
The analysis of temperature time series (TTS) is an important topic that can help us understanding better climate dynamics. Several indicators of climate change can be used, but the time evolution of the Earth’s surface temperature is probably the more straightforward measure to follow [17]. Furthermore, reliable temperature data records comprising information from meteorological stations around the world are available to support data processing [18,19].
We can find in the literature several strategies for analyzing the TTS [20]. Grieser et al. [21] studied the monthly mean temperatures of European meteorological stations. They applied statistical tools to decompose the TTS into significant components and found that the phase of the annual cycle shifts within the year, backwards and forwards in Western and Eastern Europe, respectively. The authors observed that the occurrence of extreme events increased significantly and corresponded mostly to cold peaks in winter. They also found large harmonic components with a period of 92.3 months in data from Northern and Western Europe. Additionally, this period, corresponding to about 7.7 years, was also observed in the North Atlantic Oscillation. Hughes et al. [22] pointed out that the Antarctic Peninsula region is warming faster than the rest of the world. They analyzed the minimum and maximum TTS at the Faraday/Vernadsky (Latitude 65.25 S, Longitude 64.25 W) station using a multiple regression model with non-Gaussian correlated errors. It was calculated that there was an increase in the minimum monthly temperature of 6.78 °C during the years 1951–2003. Viola et al. [23] adopted nonlinear processing tools to analyze TTS. The method of delay coordinates was applied for state space reconstruction and the delay parameters were determined by means of two methods, namely the average mutual information and the false nearest neighbors. Temperatures were extrapolated until 2028 by means of simple nonlinear prediction, which, after the state space reconstruction, uses a limited number of neighboring samples to perform predictions. Founda et al. [24] employed linear regression fitting to analyze the air TTS in Athens from 1897 until 2001 and observed a tendency towards warmer years. Ge et al. [25] studied the temperature variation in China over 20 centuries, revealing not only the cold periods from 1620–1710 and 1800–1860, but also the warming registered during the 20th century. Deser et al. [26] examined the trends of the sea surface and marine air temperatures and observed warming in all cases, with the exception of the Northwestern Atlantic Ocean. The largest warming trends were found in the middle latitudes of both the Northern and Southern hemispheres. Oñate and Pou [27] processed TTS from eleven meteorological stations on the Iberian Peninsula. Trends were identified by means of the Mann–Kendall test, and multidimensional scaling provided automatic clustering. More recently, Stephenson and Doblas-Reyes [28] used multidimensional scaling as an exploratory tool to describe ensembles of forecasts. Multidimensional scaling is a computational technique for exploring similarities between groups of data and easing the visualization of hidden patterns [29]. Lopes and Machado [5] adopted multidimensional scaling to study complex correlations between global TTS.
Earth surface temperatures exhibit slow space-time system dynamics that embed long memory effects and complex relationships. Climate data can be analyzed in a straightforward manner by calculating the means and variances for different levels of spacial and temporal aggregation, but these methods neglect long range dynamic phenomena. Furthermore, while many studies have addressed the problem of global warming, the aspect of the increasing complexity of the data has been somewhat overlooked. In fact, the complexity of the TTS is embedded into the measurements, but needs to be tackled independently of the problem of increasing temperatures. The complexity point of view may lead to a deeper insight into the predictability of future TTS evolution.
In this paper, global TTS are viewed as manifestations of a dynamic system. For each TTS, the trend is calculated by means of empirical mode decomposition (EMD), and the complexity of the detrended series is assessed by means of four indices, namely the Lempel–Ziv complexity ( L Z C ), the sample entropy ( S a m p E n ), the signal harmonics power ratio ( η ), and the fractal dimension ( F r a c D i m ). The four indices capture distinct features of the TTS in a multidimensional space that is projected in 2D through hierarchical clustering. The space-time variability of the complexity measures suggests that the inputs of climate vary in both dimensions. In particular, the increasing complexity of the TTS in recent years shows that temperatures are becoming more difficult to predict. Numerical examples using real-world data for one century as well as the four complexity indices, illustrate the effectiveness of this new point of view for tackling TTS.
With these ideas in mind, the paper is organized as follows. Section 2 presents the main fundamental issues that influence global warming. Section 3 overviews the mathematical tools used for processing data. Section 4 and Section 5 analyze the global TTS complexity by means of different indices. Finally, Section 6 outlines the conclusions.

2. Mathematical Fundamentals

This section introduces the main mathematical tools adopted for data processing, namely the L Z C , S a m p E n , Fourier series (FS), EMD, and F r a c D i m . These tools are well suited to the time series generated by many natural phenomena, such as climatic, geophysical and biological processes.

2.1. Lempel–Ziv Complexity

The L Z C is a method of symbolic sequence analysis that measures the complexity of finite time series [30,31,32]. The L Z C is based on computing the number of distinct subsequences and their recurrence rates along the series of data [33,34,35,36].
Let us consider the finite time series, x ( k ) , where k = 1 , 2 , , K represents the discrete sampling instants. In the first phase, x ( k ) is converted into a symbolic binary sequence, S = { s ( 1 ) , s ( 2 ) , , s ( K ) } , such that
s ( k ) = 0 , i f x ( k ) < T h 1 , i f x ( k ) T h ,
where, T h denotes a pre-defined threshold value. Usually the median of x ( k ) is adopted for T h , since it performs robustly against outliers [37]. In a second phase, we parse the original sequence, S, from left to right, to obtain a new (parsed) sequence, S , formed by distinct subsequences or words. To obtain S , different parsing methods are available [38,39,40]. Herein we summarize the original scheme proposed by Lempel and Ziv [30,41].
Let S ( p , q ) denote a subsequence of S composed of the symbols between positions p and q (in our case, time instants). Thus, we have S ( p , q ) = { s ( p ) , s ( p + 1 ) , , s ( q ) } , for q p , and S ( p , q ) = { } (i.e., the empty set), for q < p . The parsed sequence, S , is obtained as follows:
  • Initialize:
    (a)
    The parsed sequence, S = { s ( 1 ) } ;
    (b)
    An auxiliary sequence, Q = { } ;
    (c)
    A pointer, θ , such that, θ ( 1 ) points to the symbol, s ( 1 ) , of S;
    (d)
    p = 1 .
  • At the iteration p:
    (a)
    Advance from θ ( p ) to θ ( p + 1 ) and extend the auxiliary sequence, Q, by appending the symbol s ( p + 1 ) ;
    (b)
    Check whether the current auxiliary sequence Q matches any subsequence of S ( 1 , p ) . If no match is found, then append, as a new word, the auxiliary sequence, Q, to the parsed sequence, S , and reset to Q = { } ;
  • Increment p;
    (a)
    If p < K , then go to step 2;
    (b)
    If p = K , then append Q to S to yield the final parsed sequence;
  • Count the number of different words, c ( K ) , in S .
To obtain a complexity measure independent of the length of S, the count value c ( K ) is normalized, yielding [30]
L Z C = c ( K ) log 2 ( K ) K ,
which reflects the rate of occurrence of new patterns in S.

2.2. Sample Entropy

The S a m p E n is a modified version of the “approximate entropy” [42], used to measure complexity. It is defined as the symmetrical version of the logarithm of the conditional probability of two sequences that are similar for m points, remain identical for m + 1 points. The value of S a m p E n has little variation across the length of a time series, yielding a characteristic measure of its complexity [43]. The S a m p E n can be calculated by the five steps presented below [44,45,46,47,48]:
  • Generate a set of m-dimensional vectors, X p m ,
    X p m = [ x ( p ) , x ( p + 1 ) , , x ( p + m 1 ) ] , 1 p K m + 1 ,
    representing m consecutive values of x ( k ) , starting at point p;
  • Determine the similarity, D p , q ( r ) , between X p m and X q m (i.e., the vector representing m consecutive values of x ( k ) , starting at point q), by calculating
    D p , q ( r ) = 1 , i f d p , q < r 0 , i f d p , q r ,
    where d p , q = max n = 0 , 1 , , m 1 | x ( p + n ) x ( q + n ) | is a distance, and r is a tolerance value;
  • Compute the correlation sum, B p m ( r ) , under the constraint p q , to exclude self-matches,
    B p m ( r ) = q = 1 , q p K m D p , q ( r ) K m 1 ;
  • Find the probability, B m ( r ) , of template matching for all vectors by
    B m ( r ) = p = 1 K m B p m ( r ) K m ;
  • Calculate the S a m p E n as
    S a m p E n = ln B m + 1 ( r ) B m ( r ) .
The values of the parameters m and r are important in determining the S a m p E n , but no exact guidelines exist for optimizing their values. We know that the accuracy and confidence of the S a m p E n estimates increase when a small m (short template) and large r (wide tolerance) [42,46] are chosen. Moreover, when r is too small, poor conditional probability estimates are obtained, while when r is too large, detailed information is lost and S a m p E n tends to be 0. To avoid problems caused by noise r must be chosen carefully [42,46]. The generally-adopted parameter values are m = 1 or m = 2 , and r = ρ σ , where σ denotes the standard deviation of the original time series, x ( k ) , and ρ [ 0.1 , 0.25 ] [46].

2.3. Fourier Analysis

The classical FS was proposed by Joseph Fourier (1768–1830) when studying problems in the area of heat conduction. The FS expresses a given signal by means of an orthonormal basis of trigonometric functions.
Let us assume that x ( k ) = x ( k + T ) , k , T N , is a periodic function with period T, T = 1 f , and ω = 2 π f , where f and ω denote the frequency and angular frequency, respectively. The signal x ( k ) can be expressed by means of FS expansion [49,50]:
x ( k ) = n = 0 T 1 a n e j 2 π f n k ,
where j = 1 , and the Fourier coefficients, a n , are given by
a n = 1 T k = 0 T 1 x ( k ) e j 2 π f n k .
According to Parseval’s theorem, the average power of x ( k ) is given by
P x = 1 T k = 0 T 1 | x ( k ) | 2 = n = 0 T 1 | a n | 2 .

2.4. Empirical Mode Decomposition

The EMD is a powerful signal processing tool that is able to deal with non-stationary and non-linear time series [51,52,53]. The basic idea is that the original signal x ( k ) is decomposed into a finite set of simpler signals called intrinsic mode functions (IMF). The algorithm is as follows [51]:
  • Identify all extrema of x ( k ) ;
  • Interpolate between minima and maxima, and find the lower and upper envelopes, e m i n ( k ) and e m a x ( k ) , respectively;
  • Calculate the mean using m ( k ) = 1 2 [ e m i n ( k ) + e m a x ( k ) ] ;
  • Extract the details using d ( k ) = x ( k ) m ( k ) ;
  • Iterate on the residual, m ( k ) .
For real-world data, this procedure iterates steps 1–4 until the detail signal, d ( k ) , is an intrinsic mode function (IMF), meaning that it has a zero-mean according to some criterion. It should be noted that the complete process guarantees that the number of extrema decreases between two successive residuals and that the decomposition is achieved with a finite number of modes.
The original signal can be represented by
x ( k ) = m L ( k ) + l = 1 L d l ( k ) ,
where d l ( k ) represent the modes and m L ( k ) denotes the residual.

2.5. Fractal Dimension

A fractal is a geometrical object that displays identical patterns at different scales. The F r a c D i m is a measure of how much the fractal fills the space as it is magnified from larger to smaller scales [54,55,56]. The box-counting method is often adopted for estimating the F r a c D i m , due to its straightforward numerical implementation. For a fractal represented by a binary image, with non-zero pixels belonging to the fractal, and zero pixels forming the image background, the box counting algorithm is as follows [56]:
  • Pad the image with background pixels so that its dimensions are at a power of 2;
  • Cover the fractal object with a grid of squares with size ϵ (in the first iteration, there is just one square of equal size to the size of the image);
  • Count the number of boxes (i.e., squares), N ϵ N , needed to cover the object;
  • If ϵ > 1 , then make ϵ ϵ 2 and repeat step 2.
  • Estimate the F r a c D i m as the slope of the log-log plot, N ϵ versus ϵ , calculated by means of the least squares method.
The estimate of F r a c D i m obtained by the algorithm is a good approximation to
F r a c D i m = lim ϵ 0 + log N ϵ log ϵ .

3. Dataset

The University of Delaware “Terrestrial Air Temperature: 1900–2014 Gridded Monthly Time Series (V 4.01)” was used. The dataset is publicly available at the website http://climate.geog.udel.edu/~climate/html_pages/download.html#T2014, and was retrieved on 1 March 2018. The archive consists of monthly mean air TTS for the period from January 1900 up to December 2014. The spatial coverage corresponds to land areas, with one TTS per cell, on a 0.5 × 0.5 latitude × longitude grid of resolution. The dataset was compiled by means of interpolation techniques on data gathered from several sources, including the Global Historical Climatology Network Monthly Version 3 (GHCN3) dataset [18]; the Daily Global Historical Climatology Network (GHCN-Daily) archive [57]; the Atmospheric Environment Service/Environment Canada archive; records from the State Hydrometeorological Institute of St. Petersburg, Russia; data for Greenland provided by the GC-Net [58]; records from the Automatic Weather Station Project (Charles R. Stearns, University of Wisconsin—Madison); the Global Synoptic Climatology Network archive (Dataset 9290c, of the National Climatic Data Center); and observations contained within the Global Surface Summary of Day (GSOD) [59].
Herein, we illustrate the usability of the dataset, while highlighting some aspects of the global warming revealed by the temperature anomalies [13]. Given a reference period, P 12 = [ P 1 , P 2 ] , from year P 1 up to year P 2 , the temperature anomaly for the period P 34 = [ P 3 , P 4 ] is
A 12 34 = 1 P 4 P 3 n = P 3 P 4 y ( n ) 1 P 2 P 1 n = P 1 P 2 y ( n ) ,
where y ( n ) represents the average temperature of year n.
Temperature anomalies represent deviations from a reference value, or long-term average [13]. Positive (negative) anomalies mean that the observed temperatures are higher (lower) than the reference value. Figure 1 depicts the annual land temperature anomalies for the years P 34 = [ 1970 , 1974 ] , P 34 = [ 1990 , 1994 ] , P 34 = [ 2000 , 2004 ] and P 34 = [ 2010 , 2014 ] , relative to the period P 12 = [ 1951 , 1980 ] . The results show the Earth’s warming clearly, with emphasis on certain regions, namely in the Northern Hemisphere.

4. On the Complexity of TTS

The TTS, x ( k ) , k = 1 , , K , exhibits a trend. Here, k represents time with a 1-month resolution, and k = 1 and k = K = 1380 correspond to the initial and final instants—January 1900 and December 2014, respectively. The EMD proved to be a good method for determining the trends of data from nonstationary and nonlinear processes, without the need to know the trends’ functional forms [53]. Within this approach, the TTS trend was estimated by the residuals of the EMD.
Figure 2 depicts the original data, x ( k ) , the IMF modes, d l ( k ) , l = 1 , , 7 , and the residuals, m L ( k ) , obtained for a TTS of a station with latitude/longitude coordinates ( 71.25 , 89.75 ) , following the procedure reported in [60]. Figure 3 illustrates the original data, x ( k ) , and the trend, x ¯ ( k ) = m L ( k ) .

4.1. Lempel–Ziv Complexity of the TTS

Before determining the L Z C , we first detrended the original TTS by calculating x ˜ ( k ) = x ( k ) x ¯ ( k ) , where x ¯ ( k ) was determined by EMD. Figure 4 exemplifies the detrending pre-processing ofn the TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) .
The L Z C algorithm shown in Section 2.1 was then applied to each x ˜ ( k ) on the regular grid ( i , j ) covering land areas, totaling L = 85 , 690 TTS. Figure 5 represents the L Z C i j on a contour map. Using to the L Z C index, we verified that the monthly mean TTS values in the Northern Hemisphere and in certain regions of South America and Australia are more complex than those in other regions of the Earth. This spatial variability of complexity suggests that climate dynamics are driven by distinct regional forcing processes.

4.2. Sample Entropy of the TTS

To calculate the S a m p E n , we applied the algorithm from Section 2.2 to the detrended TTS, x ˜ ( k ) , with m = 2 and r = 0.5 , as discussed in Section 2.2. Figure 6 represents the S a m p E n i j for each TTS of the grid ( i , j ) on land.
To compare the results obtained with L Z C i j and S a m p E n i j we calculated the two-dimensional correlation, R, as
R = i j L Z C i j L Z C ) · ( S a m p E n i j S a m p E n i j L Z C i j L Z C 2 · i j S a m p E n i j L Z C 2 ,
where · denotes the mean of all values in a matrix. The correlation value obtained, R = 0.97 , quantified the similarities observed between Figure 5 and Figure 6, showing that both indices yield similar spacial patterns. The results thus suggest the existence of distinct regional climate forcing processes.

4.3. Harmonic Content of the TTS

In this sub-section, we propose an alternative index for measuring the complexity of TTS. The index, η , is given by the harmonics power ratio of the signals:
η = 1 P h P t × 100 % ,
where
P h = | a 0 | 2 + | a 1 | 2 ,
P t = 1 T k = 0 T 1 | x ˜ ( k ) | 2 ,
when T = 12 months.
Following Parseval’s theorem (10), P h and P t denote the signal power captured by the DC and first harmonic components, and the total power of x ˜ ( k ) , respectively. Therefore, η measures the signal power contained on the higher harmonics of x ˜ ( k ) .
Figure 7 represents the η i j for the TTS of the grid ( i , j ) on land. We verified that tropical and equatorial regions are characterized by larger values of η .

4.4. Fractal Dimension of the TTS

Here we calculate the F r a c D i m for the phase portrait of the TTS. A n-dimensional dynamical system can be represented by a set of first-order differential equations governing n state variables. By knowing the state variables at time k = k 0 and the system inputs for k k 0 , the system behavior for k k 0 can be determined. The state space consists of the set of all possible states, each one corresponding to a unique point. As k evolves, sequences of points describing trajectories in the state space are obtained. The set of trajectories is known as the state space portrait. The phase portrait (PP) corresponds to the two-dimensional state space representation [61,62].
We consider the detrended TTS, x ˜ ( k ) , and its numerical time derivative, x ˜ ˙ ( k ) = x ˜ ( k + 1 ) x ˜ ( k ) , to be the state variables. Figure 8 represents the PP ( x ˜ , x ˜ ˙ ) of the detrended TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) .
Figure 9 represents the F a c D i m i j for all TTS of the grid ( i , j ) on land.

5. Temporal Dynamics of Global Warming

In this section we describe the study of the temporal complexity of global warming. In the first phase, TTS measured on land (within a total of L = 85 , 690 ) was divided into W = 43 time intervals, w t , t = 1 , , W , by means of a 30-year length sliding window, with a 28-year overlap. For each time interval, w t , we calculated the L-dimensional vectors, L Z C w t , S a m p E n w t , η w t and F r a c D i m w t , representing the complexity of the detrended TTS at the time intervals w t , captured in the perspective of each of the four indices. Experiments demonstrated that the window adopted established a good compromise between time discrimination and statistical significance of the data within each time interval. In a second phase, for each indice and w t we determined the relative frequency histograms. Figure 10 depicts the histograms for the four indices versus w t on contour maps, where the years correspond to the center of the time windows. The L Z C w t and η w t maps exhibited two modes that increased with w t almost monotonically. Regarding the L Z C w t index, higher modes were mainly due to the TTS in the regions of Antarctica, South Africa, East Australia, North Africa, the Middle East and some setentrional areas of America and Asia. On the other hand, lower modes were obtained from TTS located in the tropical regions of Earth. In the η w t , the higher and lower modes were due to TTS from equatorial and tropical regions, and to the TTS from North Australia, North Africa, East Asia and Northwest America. The S a m p E n w t and F r a c D i m w t indices had single mode. For the S a m p E n w t , most TTS were from North Africa, South Asia, South America, Australia and Antarctica. For the F r a c D i m w t , the main contribution was due to TTS from tropical regions.
In the follow-up, to unveil temporal patterns, we adopted a hierarchical clustering (HC) analysis and visualization trees. The main objective of HC is to group together objects that are similar to one another in some sense [63,64]. The input of HC is a W × W symmetric matrix [ d t u ] , t , u = 1 , , W , where d t u represents the Canberra distance between the time periods, ( w t , w u ) [29,65]:
d t u = l = 1 L | L Z C w t ( l ) L Z C w u ( l ) | | L Z C w t ( l ) | + | L Z C w u | ( l ) + | S a m p E n w t ( l ) S a m p E n w u ( l ) | | S a m p E n w t ( l ) | + | S a m p E n w u | ( l ) + | η w t ( l ) η w u ( l ) | | η w t ( l ) | + | η w u | ( l ) + | F r a c D i m w t ( l ) F r a c D i m w u ( l ) | | F r a c D i m w t ( l ) | + | F r a c D i m w u | ( l ) .
In the HC algorithm, the successive (agglomerative) clustering and average-linkage methods were adopted. Figure 11 depicts the corresponding graph generated by the software, PHYLIP (http://evolution.genetics.washington.edu/phylip.html). We identified three clusters of TTS complexity patterns, closely coincident to time periods characterized by different temperature trends [66], namely the warming period, A = [ 1915 , 1947 ] , the cooling trend in the middle of the 20th century, B = [ 1947 , 1977 ] , and the new warming period, C = [ 1977 , 1999 ] . Moreover, in A , the HC unveiled two groups, namely A 1 = [ 1915 , 1927 ] and A 2 = [ 1927 , 1947 ] . In the clusters A 2 , B and C , we noted the emergence of some smaller clusters. In A 2 , we found subclusters composed of the years { 1929 , 1931 , 1933 , 1935 } and { 1937 , 1939 , 1941 , 1945 , 1947 } . In B , we verified the appearance of the groups { 1949 , 1951 , 1953 , 1955 , 1957 , 1959 } and { 1961 , 1963 , 1965 , 1967 , 1969 , 1971 , 1973 , 1975 , 1977 } . Finally, in C , there were { 1979 , 1981 , 1983 , 1985 , 1987 } and { 1991 , 1993 , 1995 , 1997 , 1999 } . These subclusters include between 4 and 6 time windows, suggesting some kind of periodicity. However, to the authors’ best knowledge, there have been no studies addressing these fluctuations with the standard perspective of the rise in temperature. Therefore, we question whether there is some hidden correlation between the increasing temperatures and their complexity that has somehow been overlooked in previous approaches. This open question will require additional studies with more detailed data analyses in time and space.
In summary, we verified that the complexity of the TTS is increasing, that a solid assessment is accomplished when a combination of several distinct measures is used, and that complexity is an important issue to consider when analyzing global warming.

6. Conclusions

The TTS embeds rich information contributed by a multitude of distinct factors at different scales. The complexity of the TTS is related to climate forcing factors. In this paper, information from worldwide land meteorological stations was processed and its complexity was calculated in space and time by means of different indices. The increasing complexity of the TTS in recent years demonstrates that there has been a change in the dominating factors that drive the climate. On one hand, we have natural forces, such as the orbital effect, solar radiation, volcanic eruptions, changes in land cover, greenhouse gas, and aerosols. On the other hand, we find anthropogenic factors, including the rise in greenhouse gas and aerosols caused by human activities, as well as sulfate air pollutants, reactive nitrogen, dust, urban heat islands, ozone change, and land-surfaces changes also due to human activities. Four complexity measures were adopted. However, no single measure captured all effects and a multi-measurement perspective led to a superior assessment of the system state. The resulting multidimensional set of measurements was synthesized by means of the HC visualization technique. The obtained clusters revealed clear patterns in the increasing complexity embedded in the TTS. More detailed quantification of these patterns by means of spatial statistics, namely spatial Fourier power spectrum, will be addressed in further research.

Author Contributions

J.A.T.M. and A.M.L. conceived, designed and performed the experiments, analyzed the data and wrote the paper.

Acknowledgments

The authors acknowledge the University of Delaware for the data used in this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Solomon, S.; Qin, D.; Manning, M.; Chen, Z.; Marquis, M.; Averyt, K.B.; Tignor, M.; Miller, H.L. Contribution of Working Group I to the Fourth Assessment Report Of the Intergovernamental Panel on Climate Change; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2007. [Google Scholar]
  2. Root, T.L.; Price, J.T.; Hall, K.R.; Schneider, S.H.; Rosenzweig, C.; Pounds, J.A. Fingerprints of global warming on wild animals and plants. Nature 2003, 421, 57–60. [Google Scholar] [CrossRef] [PubMed]
  3. Huang, J.; Yu, H.; Dai, A.; Wei, Y.; Kang, L. Drylands face potential threat under 2 C global warming target. Nat. Clim. Chang. 2017, 7, 417–422. [Google Scholar] [CrossRef]
  4. Desch, S.J.; Smith, N.; Groppi, C.; Vargas, P.; Jackson, R.; Kalyaan, A.; Nguyen, P.; Probst, L.; Rubin, M.E.; Singleton, H.; et al. Arctic ice management. Earth’s Future 2017, 5, 107–127. [Google Scholar] [CrossRef] [Green Version]
  5. Lopes, A.M.; Machado, J.A.T. Analysis of temperature time-series: Embedding dynamics into the MDS method. Commun. Nonlinear Sci. Numer. Simul. 2014, 19, 851–871. [Google Scholar] [CrossRef] [Green Version]
  6. Alexiadis, A. Global warming and human activity: A model for studying the potential instability of the carbon dioxide/temperature feedback mechanism. Ecol. Model. 2007, 203, 243–256. [Google Scholar] [CrossRef]
  7. Working Group, I. The Scientific Basis. Climate Change; Third Assessment Report of the Intergovernamental Panel on Climate Change; IPCC: Geneva, Switzerland, 2001. [Google Scholar]
  8. Dai, A. Drought under global warming: A review. Wiley Interdiscip. Rev. Clim. Chang. 2011, 2, 45–65. [Google Scholar] [CrossRef]
  9. You, Q.; Kang, S.; Pepin, N.; Flügel, W.A.; Sanchez-Lorenzo, A.; Yan, Y.; Zhang, Y. Climate warming and associated changes in atmospheric circulation in the eastern and central Tibetan Plateau from a homogenized dataset. Glob. Planet. Chang. 2010, 72, 11–24. [Google Scholar] [CrossRef]
  10. Jevrejeva, S.; Moore, J.C.; Grinsted, A. Sea level projections to AD2500 with a new generation of climate change scenarios. Glob. Planet. Chang. 2012, 80, 14–20. [Google Scholar] [CrossRef]
  11. Giannakopoulos, C.; le Sager, P.; Bindi, M.; Moriondo, M.; Kostopoulou, E.; Goodess, C. Climatic changes and associated impacts in the Mediterranean resulting from a 2 C global warming. Glob. Planet. Chang. 2009, 68, 209–224. [Google Scholar] [CrossRef]
  12. Zhu, Y.; Wang, H.; Zhou, W.; Ma, J. Recent changes in the summer precipitation pattern in East China and the background circulation. Clim. Dyn. 2011, 36, 1463–1473. [Google Scholar] [CrossRef]
  13. Hansen, J.; Ruedy, R.; Sato, M.; Lo, K. Global surface temperature change. Rev. Geophys. 2010, 48, RG4004. [Google Scholar] [CrossRef]
  14. Brohan, P.; Kennedy, J.J.; Harris, I.; Tett, S.F.; Jones, P.D. Uncertainty estimates in regional and global observed temperature changes: A new data set from 1850. J. Geophys. Res. Atmos. 2006, 111. [Google Scholar] [CrossRef] [Green Version]
  15. Menne, M.J.; Williams, C.N., Jr. Homogenization of temperature series via pairwise comparisons. J. Clim. 2009, 22, 1700–1717. [Google Scholar] [CrossRef]
  16. Wu, Z.; Huang, N.E.; Wallace, J.M.; Smoliak, B.V.; Chen, X. On the time-varying trend in global-mean surface temperature. Clim. Dyn. 2011, 37, 759–773. [Google Scholar] [CrossRef] [Green Version]
  17. Rohde, R.; Muller, R.; Jacobsen, R.; Perlmutter, S.; Rosenfeld, A.; Wurtele, J.; Curry, J.; Wickhams, C.; Mosher, S. Berkeley Earth temperature averaging process. Geoinf. Geostat. Overv. 2013, 1, 1–13. [Google Scholar] [CrossRef]
  18. Lawrimore, J.H.; Menne, M.J.; Gleason, B.E.; Williams, C.N.; Wuertz, D.B.; Vose, R.S.; Rennie, J. An overview of the Global Historical Climatology Network monthly mean temperature data set, version 3. J. Geophys. Res. Atmos. 2011, 116. [Google Scholar] [CrossRef] [Green Version]
  19. Jones, P.; Lister, D.; Osborn, T.; Harpham, C.; Salmon, M.; Morice, C. Hemispheric and large-scale land-surface air temperature variations: An extensive revision and an update to 2010. J. Geophys. Res. Atmos. 2012, 117. [Google Scholar] [CrossRef] [Green Version]
  20. Capilla, C. Time series analysis and identification of trends in a Mediterranean urban area. Glob. Planet. Chang. 2008, 63, 275–281. [Google Scholar] [CrossRef]
  21. Grieser, J.; Trömel, S.; Schönwiese, C.D. Statistical time series decomposition into significant components and application to European temperature. Theor. Appl. Climatol. 2002, 71, 171–183. [Google Scholar] [CrossRef] [Green Version]
  22. Hughes, G.L.; Rao, S.S.; Rao, T.S. Statistical analysis and time-series models for minimum/maximum temperatures in the Antarctic Peninsula. Proc. R. Soc. Lond. A Math. Phys. Eng. Sci. 2007, 463, 241–259. [Google Scholar] [CrossRef] [Green Version]
  23. Viola, F.M.; Paiva, S.L.; Savi, M.A. Analysis of the global warming dynamics from temperature time series. Ecol. Model. 2010, 221, 1964–1978. [Google Scholar] [CrossRef]
  24. Founda, D.; Papadopoulos, K.; Petrakis, M.; Giannakopoulos, C.; Good, P. Analysis of mean, maximum, and minimum temperature in Athens from 1897 to 2001 with emphasis on the last decade: Trends, warm events, and cold events. Glob. Planet. Chang. 2004, 44, 27–38. [Google Scholar] [CrossRef]
  25. Ge, Q.S.; Zheng, J.Y.; Hao, Z.X.; Shao, X.M.; Wang, W.C.; Luterbacher, J. Temperature variation through 2000 years in China: An uncertainty analysis of reconstruction and regional difference. Geophys. Res. Lett. 2010, 37. [Google Scholar] [CrossRef] [Green Version]
  26. Deser, C.; Phillips, A.S.; Alexander, M.A. Twentieth century tropical sea surface temperature trends revisited. Geophys. Res. Lett. 2010, 37. [Google Scholar] [CrossRef] [Green Version]
  27. Oñate, J.J.; Pou, A. Temperature variations in Spain since 1901: A preliminary analysis. Int. J. Climatol. 1996, 16, 805–815. [Google Scholar] [CrossRef]
  28. Stephenson, D.B.; Doblas-Reyes, F.J. Statistical methods for interpreting Monte Carlo ensemble forecasts. Tellus A 2000, 52, 300–322. [Google Scholar] [CrossRef]
  29. Machado, J.A.T.; Lopes, A.M.; Galhano, A.M. Multidimensional scaling visualization using parametric similarity indices. Entropy 2015, 17, 1775–1794. [Google Scholar] [CrossRef]
  30. Lempel, A.; Ziv, J. On the complexity of finite sequences. IEEE Trans. Inf. Theory 1976, 22, 75–81. [Google Scholar] [CrossRef]
  31. Shao, Z.G. Contrasting the complexity of the climate of the past 122,000 years and recent 2000 years. Sci. Rep. 2017, 7, 4143. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Zhang, Y.; Wei, S.; di Maria, C.; Liu, C. Using Lempel–Ziv complexity to assess ECG signal quality. J. Med. Biol. Eng. 2016, 36, 625–634. [Google Scholar] [CrossRef] [PubMed]
  33. Abásolo, D.; Simons, S.; da Silva, R.M.; Tononi, G.; Vyazovskiy, V.V. Lempel-Ziv complexity of cortical activity during sleep and waking in rats. J. Neurophysiol. 2015, 113, 2742–2752. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Wu, X.; Xu, J. Complexity and brain function. Acta Biophy. Sin. 1991, 7, 103–106. [Google Scholar]
  35. Radhakrishnan, N.; Gangadhar, B. Estimating regularity in epileptic seizure time-series data. IEEE Eng. Med. Biol. Mag. 1998, 17, 89–94. [Google Scholar] [CrossRef] [PubMed]
  36. Li, C.; Li, Z.; Zheng, X.; Ma, H.; Yu, X. A generalization of Lempel-Ziv complexity and its application to the comparison of protein sequences. J. Math. Chem. 2010, 48, 330–338. [Google Scholar] [CrossRef]
  37. Nagarajan, R. Quantifying physiological data with Lempel-Ziv complexity-certain issues. IEEE Trans. Biomed. Eng. 2002, 49, 1371–1373. [Google Scholar] [CrossRef] [PubMed]
  38. Hu, J.; Gao, J.; Principe, J.C. Analysis of biomedical signals by the Lempel-Ziv complexity: The effect of finite data size. IEEE Trans. Biomed. Eng. 2006, 53, 2606–2609. [Google Scholar] [PubMed]
  39. Zhang, X.S.; Zhu, Y.S.; Thakor, N.V.; Wang, Z.Z. Detecting ventricular tachycardia and fibrillation by complexity measure. IEEE Trans. Biomed. Eng. 1999, 46, 548–555. [Google Scholar] [CrossRef] [PubMed]
  40. Aboy, M.; Hornero, R.; Abásolo, D.; Álvarez, D. Interpretation of the Lempel-Ziv complexity measure in the context of biomedical signal analysis. IEEE Trans. Biomed. Eng. 2006, 53, 2282–2288. [Google Scholar] [CrossRef] [PubMed]
  41. Melchert, O.; Hartmann, A. Analysis of the phase transition in the two-dimensional Ising ferromagnet using a Lempel-Ziv string-parsing scheme and black-box data-compression utilities. Phys. Rev. E 2015, 91, 023306. [Google Scholar] [CrossRef] [PubMed]
  42. Pincus, S.M. Approximate entropy as a measure of system complexity. Proc. Natl. Acad. Sci. USA 1991, 88, 2297–2301. [Google Scholar] [CrossRef] [PubMed]
  43. Pincus, S. Approximate entropy (ApEn) as a complexity measure. Chaos Interdiscip. J. Nonlinear Sci. 1995, 5, 110–117. [Google Scholar] [CrossRef] [PubMed]
  44. Tang, L.; Lv, H.; Yang, F.; Yu, L. Complexity testing techniques for time series data: A comprehensive literature review. Chaos Solitons Fractals 2015, 81, 117–135. [Google Scholar] [CrossRef]
  45. Richman, J.S.; Moorman, J.R. Physiological time-series analysis using approximate entropy and sample entropy. Am. J. Physiol. Heart Circ. Physiol. 2000, 278, H2039–H2049. [Google Scholar] [CrossRef] [PubMed]
  46. Alcaraz, R.; Rieta, J. A novel application of sample entropy to the electrocardiogram of atrial fibrillation. Nonlinear Anal. Real World Appl. 2010, 11, 1026–1035. [Google Scholar] [CrossRef]
  47. Eduardo Virgilio Silva, L.; Otavio Murta, L., Jr. Evaluation of physiologic complexity in time series using generalized sample entropy and surrogate data analysis. Chaos Interdiscip. J. Nonlinear Sci. 2012, 22, 043105. [Google Scholar] [CrossRef] [PubMed]
  48. Mihailović, D.; Nikolić-Đorić, E.; Drešković, N.; Mimić, G. Complexity analysis of the turbulent environmental fluid flow time series. Phys. Stat. Mech. Its Appl. 2014, 395, 96–104. [Google Scholar] [CrossRef] [Green Version]
  49. Stein, E.M.; Shakarchi, R. Fourier Analysis: An Introduction; Princeton University Press: Princeton, NJ, USA, 2003. [Google Scholar]
  50. Dym, H.; McKean, H. Fourier Series and Integrals; Academic Press: San Diego, CA, USA, 1972. [Google Scholar]
  51. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  52. Pachori, R.B. Discrimination between ictal and seizure-free EEG signals using empirical mode decomposition. Res. Lett. Signal Process. 2008, 2008, 14. [Google Scholar] [CrossRef]
  53. Wu, Z.; Huang, N.E.; Long, S.R.; Peng, C.K. On the trend, detrending, and variability of nonlinear and nonstationary time series. Proc. Natl. Acad. Sci. USA 2007, 104, 14889–14894. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Berry, M.V. Diffractals. J. Phys. Math. Gen. 1979, 12, 781–797. [Google Scholar] [CrossRef]
  55. Fleckinger-Pelle, J.; Lapidus, M. Tambour fractal: Vers une résolution de la conjecture de Weyl-Berry pour les valeurs propres du laplacien. C. R. Acad. Sci. Sér. Math. 1988, 306, 171–175. [Google Scholar]
  56. Valério, D.; Lopes, A.M.; Tenreiro Machado, J.A.T. Entropy Analysis of a Railway Network’s Complexity. Entropy 2016, 18, 388. [Google Scholar] [CrossRef]
  57. Menne, M.J.; Durre, I.; Vose, R.S.; Gleason, B.E.; Houston, T.G. An overview of the global historical climatology network-daily database. J. Atmos. Ocean. Technol. 2012, 29, 897–910. [Google Scholar] [CrossRef]
  58. Steffen, K.; Box, J.; Abdalati, W. Greenland Climate Network: GC-Net; US Army Cold Regions Reattach and Engineering (CRREL), CRREL Special Report; CRREL: Hanover, NH, USA, 1996; pp. 98–103. [Google Scholar]
  59. Willmott, C.J.; Matsuura, K. Terrestrial Air Temperature and Precipitation: Monthly and Annual Time Series (1950–1996). 2000. Available online: http://climate.geog.udel.edu/~climate/html_pages/Global2014/README.GlobalTsT2014.html (accessed on 20 February 2018).
  60. Huang, N.E.; Wu, M.L.C.; Long, S.R.; Shen, S.S.; Qu, W.; Gloersen, P.; Fan, K.L. A confidence limit for the empirical mode decomposition and Hilbert spectral analysis. Proc. R. Soc. Math. Phys. Eng. Sci. 2003, 459, 2317–2345. [Google Scholar] [CrossRef]
  61. Polderman, J.W.; Willems, J.C. Introduction to Mathematical Systems Theory: A Behavioral Approach; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar]
  62. Lopes, A.M.; Machado, J.A.T. State space analysis of forest fires. J. Vib. Control 2016, 22, 2153–2164. [Google Scholar] [CrossRef]
  63. Machado, J.A.T.; Lopes, A.M. Analysis and visualization of seismic data using mutual information. Entropy 2013, 15, 3892–3909. [Google Scholar] [CrossRef]
  64. Murtagh, F. A survey of recent advances in hierarchical clustering algorithms. Comput. J. 1983, 26, 354–359. [Google Scholar] [CrossRef]
  65. Cha, S.H. Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 2007, 4, 300–307. [Google Scholar]
  66. Ellenburg, W.; McNider, R.; Cruise, J.; Christy, J.R. Towards an understanding of the twentieth-century cooling trend in the southeastern United States: Biogeophysical impacts of land-use change. Earth Interact. 2016, 20, 1–31. [Google Scholar] [CrossRef]
Figure 1. Worldwide land temperature anomalies relative to P 12 = [ 1951 , 1980 ] for the 5-year periods: (a) P 34 = [ 1970 , 1974 ] ; (b) P 34 = [ 1990 , 1994 ] ; (c) P 34 = [ 2000 , 2004 ] ; (d) P 34 = [ 2010 , 2014 ] .
Figure 1. Worldwide land temperature anomalies relative to P 12 = [ 1951 , 1980 ] for the 5-year periods: (a) P 34 = [ 1970 , 1974 ] ; (b) P 34 = [ 1990 , 1994 ] ; (c) P 34 = [ 2000 , 2004 ] ; (d) P 34 = [ 2010 , 2014 ] .
Entropy 20 00437 g001
Figure 2. The intrinsic mode functions (IMF), d l ( k ) , l = 1 , , 7 , and the residuals, m 7 ( k ) , obtained for a temperature time series (TTS) of a station with latitude/longitude coordinates ( 71.25 , 89.75 ) .
Figure 2. The intrinsic mode functions (IMF), d l ( k ) , l = 1 , , 7 , and the residuals, m 7 ( k ) , obtained for a temperature time series (TTS) of a station with latitude/longitude coordinates ( 71.25 , 89.75 ) .
Entropy 20 00437 g002
Figure 3. Monthly mean TTS, x ( k ) , and trend approximation, x ¯ ( k ) , for the TTS at the latitude/longitude coordinates ( 71.25 , 89.75 ) .
Figure 3. Monthly mean TTS, x ( k ) , and trend approximation, x ¯ ( k ) , for the TTS at the latitude/longitude coordinates ( 71.25 , 89.75 ) .
Entropy 20 00437 g003
Figure 4. The detrending pre-processing of the TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) by means of EMD.
Figure 4. The detrending pre-processing of the TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) by means of EMD.
Entropy 20 00437 g004
Figure 5. Contour map depicting L Z C i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Figure 5. Contour map depicting L Z C i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Entropy 20 00437 g005
Figure 6. Contour map depicting S a m p E n i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Figure 6. Contour map depicting S a m p E n i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Entropy 20 00437 g006
Figure 7. Contour map depicting η i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Figure 7. Contour map depicting η i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Entropy 20 00437 g007
Figure 8. The phase portrait (PP) ( x ˜ , x ˜ ˙ ) of the detrended TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) .
Figure 8. The phase portrait (PP) ( x ˜ , x ˜ ˙ ) of the detrended TTS with latitude/longitude coordinates ( 71.25 , 89.75 ) .
Entropy 20 00437 g008
Figure 9. Contour map depicting F r a c D i m i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Figure 9. Contour map depicting F r a c D i m i j for the total L = 85 , 690 TTS of the grid ( i , j ) on land.
Entropy 20 00437 g009
Figure 10. Contour maps of the histograms of the complexity indices versus w t , t = 1 , , W : (a) L Z C w t ; (b) S a m p E n w t ; (c) log [ η w t ] ; (d) F r a c D i m w t . Each year corresponds to the center of a time interval.
Figure 10. Contour maps of the histograms of the complexity indices versus w t , t = 1 , , W : (a) L Z C w t ; (b) S a m p E n w t ; (c) log [ η w t ] ; (d) F r a c D i m w t . Each year corresponds to the center of a time interval.
Entropy 20 00437 g010
Figure 11. The hierarchical tree generated by the hierarchical clustering (HC) algorithm based on (17).
Figure 11. The hierarchical tree generated by the hierarchical clustering (HC) algorithm based on (17).
Entropy 20 00437 g011

Share and Cite

MDPI and ACS Style

Lopes, A.M.; Tenreiro Machado, J.A. Complexity Analysis of Global Temperature Time Series. Entropy 2018, 20, 437. https://doi.org/10.3390/e20060437

AMA Style

Lopes AM, Tenreiro Machado JA. Complexity Analysis of Global Temperature Time Series. Entropy. 2018; 20(6):437. https://doi.org/10.3390/e20060437

Chicago/Turabian Style

Lopes, António M., and J. A. Tenreiro Machado. 2018. "Complexity Analysis of Global Temperature Time Series" Entropy 20, no. 6: 437. https://doi.org/10.3390/e20060437

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop