1. Introduction
Tectonic processes in the lithosphere preceding strong earthquakes are reflected in changes in the properties of seismic noise [
1,
2,
3]. The passage of atmospheric cyclones and the impact of ocean waves on the coast and shelf are the main sources of low-frequency seismic noise [
4,
5,
6,
7,
8,
9,
10]. Studying the characteristics of seismic noise allows us to obtain information about the structure of the Earth’s crust [
11].
This paper examines continuous seismic noise records from early 1997 to late 2025, recorded at a network of 229 broadband seismic stations located worldwide. The following properties of noise waveforms are analyzed: the entropy of the wavelet coefficient distribution, the Donoho–Johnstone (DJ) index (the fraction of wavelet coefficients with large absolute values), and the singularity spectrum support width. These statistics are indicators of seismic noise complexity and are calculated daily at each station. An auxiliary network of 50 reference points is considered. For each reference point, the median values of noise properties are found from the five nearest stations that are operational on the current day. The use of reference points allows for a continuous time series of noise property changes with a time step of 1 day and overcomes methodological difficulties associated with recording gaps at stations. Further analysis is based on the principal component analysis (PCA) method, calculated for the noise properties used both at each reference point and across all reference points. The principal components are estimated over a 365-day sliding time window. The relationship between seismic noise properties and the sequence of strong earthquakes (magnitude of at least 7) is analyzed using a parametric model of interacting point processes. In this model, one process is the sequence of seismic events, and the other is the sequence of local extremes of seismic noise properties.
The article continues the studies carried out in [
12,
13] to investigate the correlation and coherence properties of low-frequency seismic noise on a global scale covering the entire planet.
2. Data
The vertical components of continuous seismic noise records with 1 s sampling intervals are analyzed. Data were downloaded from the Incorporated Research Institutions for Seismology (IRIS) website [
14] from 229 broadband seismic stations of three networks:
The data were downloaded using a Python Toolbox “ObsPy”, version 1.3.1, for seismological observatories [
15]. Seismic noise records with a sampling rate of 1 Hz (LHZ records) were considered for 29 years of registration (from 1 January 1997 to 31 December 2025). These data were converted to 1 min time series by calculating averages for successive 60 s time intervals.
Consider an auxiliary network of 50 reference points, which are determined using a hierarchical cluster analysis of the positions of 229 seismic stations using the “far neighbor” method [
12]. The location of 229 seismic stations and 50 reference points is shown in
Figure 1. The numbering of reference points is carried out as the latitude of the point decreases.
3. Properties of Seismic Noise
The
minimum entropy of a time series is determined by the formula
, where
,
are the wavelet coefficients of the signal decomposition, and
is the total number of coefficients
. Seventeen orthogonal Daubechies wavelets were used: 10 ordinary minimum support bases with vanishing moments from 1 to 10 and seven so-called Daubechies symlets [
16] with vanishing moments from 4 to 10. In each time window, the wavelet for which the value
is minimal is selected. These orthogonal wavelets have the smallest support length for a given smoothness—the number of vanishing moments.
The
Donoho–Johnstone (DJ) wavelet-based index is the ratio of “large” wavelet coefficients in absolute value to their total number. By definition
, the threshold separating “large” wavelet coefficients is
, where
is a robust estimate of the standard deviation of the normal distribution,
are the wavelet coefficients at the first detail level of decomposition, and
is the number of such coefficients [
16,
17].
The
singularity spectrum support width is considered a measure of the diversity of the stochastic behavior of the signal
. It is defined as
, where
and
are the minimum and maximum values of the Hölder–Lipschitz exponent [
18], which governs the behavior of the signal in the vicinity of the time instant
:
. For a monofractal signal, the exponent
is the same for all time instants
. If this exponent differs, then the signal is multifractal [
19].
After switching to a 1 min sampling time step, the seismic records from each station were divided into 1-day-long time fragments (1440 samples), and the parameters
of daily seismic noise signals were calculated for each fragment. The multifractal DFA method [
20] was used to calculate the values of the singularity spectrum support width. The methods for estimating the parameters
,
and
for seismic noise records in a sliding time window are described in detail in [
21,
22]. Thus, time series of values with a 1-day time step were obtained for each of the seismic stations.
The entropy value
used in this paper, defined through the coefficients of the orthogonal wavelet decomposition, has features in common with multiscale entropy used, for example, in biology [
23,
24]. In [
25,
26], entropy is used within the framework of the natural time approach for seismic data analysis. In [
27], non-extensive Tsallis entropy is used for processing seismic noise data.
The use of multifractal analysis to study the behavior of various complex systems has a long history. Particular attention is paid to the effect of parameter
reduction (loss of multifractality) preceding changes in system properties. In medicine, a decrease in the
value of various parameters accompanies age-related changes [
28,
29,
30]. In [
31,
32], multifractal analysis is used to analyze geoelectric signals and wind speed. Within the framework of the natural time approach, multifractal analysis is used to study both seismicity and other time sequences [
33].
Let us denote by , , , , the daily time series of parameters at 50 reference points of the seismic noise observation network, which are calculated as the medians of the values at the five nearest operational (without data gaps for this day) seismic stations.
4. First Principal Component and Weighted Mean of Multiple Time Series
In the future, we will need to calculate the first principal components and weighted averaging of multivariate time series. The values of the parameters
introduced above are calculated in successive time windows of a certain length, resulting in a 3-dimensional time series, the properties of which are further studied jointly. The used properties of seismic noise reflect the change in its structure; in particular, we are interested in the phenomenon of noise structure simplification as a sign that precedes strong earthquakes. Attempts to determine the “best” property of noise led to the idea of using the principal component approach [
34,
35,
36] to aggregate time series
into one scalar time series. Since the purpose of the analysis is to study the variability of noise properties both in time and space, the principal component method was applied in a sliding time window.
Let us consider a multiple time series
,
of the dimensionality
. It is necessary to estimate two types of its first principal component in the moving time window of the length
samples. For this purpose, let us consider samples with time indices
under the condition
, where
is the right-most end of the moving time window. Let us perform normalization of multiple time series components within the current time window:
Let us consider two correlation matrices
and
of the size
, which are calculated by the formulae:
Let
and
be eigenvectors of the matrices
and
corresponding to their maximum eigenvalues. Let us define the first principal component
and the weighted mean
of multiple time series
within the current time window by the formulae:
and define the scalar time series
and
of the adaptive first principal component and the weighted mean in the moving time window by the formulae:
Equations (1)–(6) are applied independently within each time window. According to these formulae, within the first time window time series, and consist of values corresponding to (3–5). In all subsequent windows, and correspond to the only sample in the right-most time index. Thus, outside the first time window, and are dependent on the past values of .
The standard first component is used when it is necessary to aggregate heterogeneous components of a multidimensional time series, for example, a daily time series of seismic noise properties at a single reference point. In this case, the time series dimension is . Weighted averaging of time series components is used when the time series components are homogeneous, for example, to average the time series of principal components at all reference points. In this case, the time series dimension is . In the following, we will agree to call these two variants of using the principal component method P-averaging and W-averaging.
A moving time window with a period of 365 days will be used. The choice of this period is quite natural because of the presence of annual periodicity in almost all background processes in the Earth’s crust. We assume that the period of the sliding time window of 365 days ensures relative stationarity of the behavior of low-frequency seismic noise within this time window.
At each reference point with a number
, we calculate the first principal component of three time series
,
,
using the
P-averaging method (1–6) in a sliding time window of 365 days in length and denote it as
. It should be noticed that the values of seismic noise parameters and their first principal component are dimensionless.
Figure 2 shows graphs of the time series of seismic noise properties and their first principal components, calculated in a sliding adaptation time window of 365 days for six reference points.
Let us estimate the temporal variability of strong correlations between the first principal components of seismic noise properties at different reference points. To do this, we calculate the average absolute correlations of the principal components
between all reference points over a 365-day sliding time window.
Figure 3(a1) shows a graph of these average correlations, which shows a trend of increasing spatial correlations since 2002.5 (end of June 2002). It is easy to see near-periodic oscillations in this graph. To accurately estimate the period of these oscillations, we calculate the mean squared wavelet coefficients of the continuous Morlet wavelet transform for the curve in
Figure 3(a1), starting with the 2004 time stamp. The Morlet transform of the signal
is defined by a formula [
16]:
The Morlet spectrum is defined as the mean value of
with respect to time moments
for a set of periods
of oscillations. This spectrum is shown in
Figure 3(a2), in which the maximum oscillation energy is reached at a period of 940 days, which corresponds to 2.57 years.
We set a threshold of 0.8 for correlations and calculated the change in the number of reference point pairs for which the correlation exceeds this threshold. This dependence is shown in
Figure 3b, which shows that the number of strong spatial correlations increased essentially after 2013.
The time point 2013, corresponding to the right-hand end of the time window, was found using the method of variance analysis [
35] by comparing the mean values in two time intervals between the trial time point, separating these intervals for the fragment 1998–2025. The final fragment of the positions of the right-hand end of the time window 2025–2026 stands out, especially due to a remarkable anomaly in the behavior of the number of pairs of reference points with strong correlations: in the second half of 2025, the number of strong correlations tripled in six months. We suggest that this effect may be related to the mega-earthquake in Kamchatka on 29 July 2025, with a magnitude of 8.8 [
37,
38].
Figure 3c shows the dependence of the maximum distances between reference points for which a strong correlation greater than 0.8 emerged on the right-hand end position of a 365-day sliding time window. This dependence also reveals a step in the distances of strong spatial correlations after 2012.
In
Figure 4, strong correlations between the data points are visualized as two connectivity diagrams for time windows spanning two 2-year time intervals.
Figure 4b corresponds to the time window with the largest number of strong correlations.
5. Probability Densities of Extreme Values
In what follows, we distinguish two cases that consider the extreme values of the seismic noise statistics. We agree to denote the first case by , in which we analyze the joint spatial distribution or local extrema in the time realizations of the minimum values of the index and and the maximum values of entropy . We denote the second case by , for which we consider the joint spatial distribution or local extrema in the time realizations of the maximum values of index and and the minimum values of entropy . These two cases are antagonistic. Case 1 corresponds to a simplification of the seismic noise structure and its properties approaching white noise. The opposite, case 2, corresponds to enrichment of the noise structure and, as a rule, corresponds to the appearance of chaotic high-amplitude spikes in noise realizations.
Let us study the variability in the spatial distribution of extreme values of seismic noise properties. For this purpose, we consider a regular grid of 100 nodes by longitudes and 50 nodes by latitudes, covering all area of the Earth. Let
be any value of
,
or
. For each grid node
and for each day with number
, we find the five nearest working seismic stations, which provide five values of
. Let us denote
as the median value of these five properties at node
on a day with number
. For each daily set of
values with a discrete time index
, we find the coordinates of the nodes
at which extreme values of
are reached relative to all other nodes of the regular grid for the two cases introduced above,
and
. The two-dimensional probability distribution function of the vectors
is calculated within the time interval
for each node
of the regular grid using the Parzen–Rosenblatt estimate with the Gaussian kernel function [
36]:
Here
is the kernel averaging radius in spherical degrees;
are integer indices that number the daily maps; and
is the number of daily maps in the time interval under consideration.
is a squared spherical distance between points
and
at the Earth’s surface;
is the square of the spherical distance between points on the Earth’s surface, corresponding to the value
. A smoothing bandwidth of
was used. This value corresponds to a distance of ≈1700 km along the Earth’s surface and is defined as the maximum of the histogram of the distribution of distances between all pairs of the nearest reference points [
12].
Let us calculate the 2-dimensional distribution densities (8) of extreme values in successive time windows of 10 days in length (). To analyze the temporal variability in the distribution of extreme seismic noise density properties, it is necessary to select a time window whose length, on the one hand, ensures variability monitoring and, on the other hand, smooths high-frequency noise. This is a common dilemma when choosing data smoothing parameters. There are no formal methods for such a choice, and it is made empirically by selecting a value. In this case, a time window of 10 days proved to be quite appropriate.
Since we have three such distributions, we calculate their weighted average by taking the squares of the components of the eigenvector of the correlation matrix of probability densities as weights corresponding to the maximum eigenvalue of the matrix [
34,
35,
36]. By construction, the sum of the squares of the components is equal to 1.
Figure 5 presents the averaged distribution maps of extreme seismic noise properties for the two cases,
and
, for three consecutive time intervals. The first interval covers the time interval from the beginning of observations to the time of the Tohoku mega-earthquake in Japan on 11 March 2011. The next time interval extends from the Tohoku event to the end of 2024. The final time interval begins in early 2024 and extends until the end of 2025. The final time interval, the shortest, was chosen because it contains a sharp increase in the number of pairs of control points with correlations exceeding the 0.8 threshold (
Figure 3b).
As can be seen in
Figure 5(a1–a3), the concentration regions of the most “quiet” seismic noise behavior (variant
) are stable throughout the entire observation interval, with the exception of the middle of the Atlantic Ocean in the vicinity of the island of Saint Elena and the south of the Indian Ocean in the vicinity of the Kerguelen archipelago, where a gradual increase in probability density is observed. As for the variant
of the most “violent” seismic noise behavior, the concentration region of this behavior remains stable until 2024 and occupies northeastern Eurasia, centered on the Putarana Plateau, which formed after the largest supervolcanic eruption 250 million years ago. However, after 2024, this region shifted to the Middle East.
We currently lack a definitive explanation for this effect. The only hypothesis is that the Kamchatka mega-earthquake of 29 July 2025 had an unexpectedly strong impact on the seismic noise field, unlike previous mega-earthquakes: Sumatra (26 December 2004, M = 9.1), Maule (Chile) (27 February 2010, M = 8.8), and Tohoku (Japan) (11 March 2011, M = 9.1).
The graphs in
Figure 6 provide a more detailed representation of the spatiotemporal features of the changes in the distribution densities of the extreme values of seismic noise properties. The graphs in
Figure 6(a1,b1) represent the changes in the latitude of those nodes of a regular 100 × 50 grid for which the maximum distribution of extreme properties is realized for the cases
and
.
Figure 6(a1) shows a strong annual periodicity, with the point of maximum probability density
making a sudden transition to the Arctic region in December–January. Furthermore,
Figure 6(a1) shows a transition from the point of maximum probability density from 15 degrees north latitude to 30 degrees south latitude, occurring around 2011, which is noticeable when comparing
Figure 5(a1,a2).
As for
Figure 6(b1), it shows a gradual evolution of the region of maximum probability densities for the case
from northeastern Eurasia to the Middle East, which began around 2019.
Figure 6(a2,b2) present graphs of probability histograms of the corresponding latitude values with maximum values of the distribution densities of extreme values of noise properties.
6. Sequence of Major Earthquakes and Their Periodic Components
The following sections of the article analyze the relationships between the properties of global seismic noise and the sequence of strong earthquakes.
Figure 7 shows the time sequence of 433 seismic events with a magnitude of at least 7 for the time interval 1997–2025.
As
Figure 7 shows, the regime of the strongest earthquakes is non-stationary. In particular, we are interested in the periodic components of the intensity change in the earthquake sequence. In [
39,
40], this method was used to calculate the periodic component of the stepwise variations in the time series of the displacement of the Earth’s surface, which was measured by GPS, and periodic components of earthquake sequences.
Let us consider the sequence of time moments
on the interval
. Consider the following intensity model containing a periodic component:
where period
, amplitude
, phase angle
, and multiplier
(describing the Poisson part of the intensity) are parameters of the model. Thus, the Poisson part of the intensity is modulated by a harmonic oscillation. Let us fix some value of period
. The logarithmic likelihood function [
41], in this case for a series of observed events, is equal to:
Taking the maximum of expression (10) with respect to the parameter
, it is easy to find that
where
.
Expression
is an estimate of the intensity of the process under the condition that it is purely random. The increment of the logarithmic likelihood function with respect to a purely random sequence is equal to
For a fixed value of period
, we consider two competing models of earthquake sequences: the simplest model of a stationary Poisson process, which is described by one parameter
of average intensity, and a more complex model (9), containing three parameters
. The difference between the maximum values of the log-likelihood function for the complex data model (9) and the simple Poisson process model is always non-negative and gives a measure of how much more “profitable” the complex model is compared to the simple one. According to Wilks’s theorem [
35], this difference is asymptotically distributed as
, where the number of degrees of freedom
is equal to the difference between the number of parameters of the competing models, that is, in our case,
, and the asymptotic distribution of the difference
in the maximum values of the likelihood functions is
. Since the distribution
coincides with the exponential distribution, we obtain the following asymptotic distribution:
Expression (14) provides the possibility to find thresholds for statistics (14) that allow us to find when the hypothesis of the existing periodical component of intensity with a given period could be accepted.
For a time sequence of seismic events with a magnitude of at least 7, we calculated the increments of the logarithmic likelihood function (13) in a sliding time window of 5 years with a shift of 0.01 years for 100 period values
corresponding to the values varying from 1 to 5 years with a uniform step in a logarithmic scale. The number of events within the moving time window of the period of 5 years is presented in
Figure 8a. The average number of events within the sliding window is 75, and it varies from 59 to 97. Therefore, we believe that the conditions of the asymptotic inequality (13) are applicable. It is interesting to note in
Figure 8a that the number of events
in the 5-year time window started to increase after the Maule mega-earthquake in Chile on 27 February 2010, M = 8.8, and reached its maximum values after the Tohoku mega-earthquake in Japan on 11 March 2011, M = 9.1.
The resulting time–frequency dependence is shown in
Figure 8b.
Figure 8c shows a plot of the mean values of the log-likelihood increments for all 5-year time windows. It shows a spectral peak at a period of 2.6 years. This plot was obtained by summing all time windows, and the maximum increment value
is 1.55, which gives a probability of 0.787 for accepting the hypothesis of the presence of a periodic intensity component for a period of 2.6 years, according to Equation (14). However, if we calculate the value
using the entire sample of seismic event times, we obtain the plot shown in
Figure 8d, in which the maximum value
at a period of 2.6 years is 5.55, which gives a probability of accepting the periodicity hypothesis of 0.996.
To determine the dominant period in the earthquake sequence, one could rely solely on the graph in
Figure 8d. However, the time–frequency diagram in
Figure 8b is of independent interest, as it provides information on the evolution of the periodic intensity components. It shows that the period gradually increases until approximately 2013, after which the periodic component virtually disappears until 2018. After 2018, two branches of the periodic component’s evolution emerge—low-frequency and high-frequency—which merge at the end of the analyzed time interval.
An analysis of the periodic components of the seismic process yielded an interesting result: it turned out that the dominant period of seismicity, 2.6 years, is very close to the 2.57-year period of oscillations of the average value of pairwise absolute correlations between the properties of seismic noise at reference points—
Figure 3(a2).
Although
Figure 8b demonstrates the variability in the spectral composition of the sequence of the strongest earthquakes, it is nonetheless clear that the 2.6-year oscillation period (logarithm equals 0.4) is dominant starting from the 2008 timestamp of the right-hand end of the 5-year sliding time window. Given the 5-year window period, this means that this period began to dominate starting in 2003, that is, approximately from the time when the trend in
Figure 3(a1) changed with superposed oscillations for a period of 2.57 years.
7. Weighted Means of Seismic Noise Properties and Their Local Extrema
We apply the method for calculating weighted averages (“
W-averaging” in Equations (5) and (6)) to the analyzed seismic noise statistics. The weights for averaging are the squared absolute values of the first eigenvectors of the 50 × 50 correlation matrices calculated for daily values of seismic noise properties
,
,
at 50 reference points, corresponding to the maximum eigenvalues of these matrices. The weighted average values of these properties of seismic noise, calculated in a sliding time window of 365 days, are denoted by
,
and
. The results of such a “left-oriented” weighted “
W-average” are presented in
Figure 9(a1–a3). These graphs show that the behavior of the first principal components of the analyzed properties of seismic noise can be divided into three large time fragments, of which the middle fragments (time stamp intervals 2007.33–2014.12 (1 May 2007–13 February 2014) for
, 2002.31–2015.81 (24 April 2002–23 October 2015) for
, and 2004–2020.62 (1 January 2004–14 August 2020) for
) correspond to transient processes of change in the average values.
In the future, we will be interested in the positions of the time points of a certain number of the largest local maxima or the smallest local minima of daily seismic noise properties in comparison with the times of earthquakes with a magnitude of at least 7 (
Figure 7). To eliminate the influence of low-frequency components of changes in statistical values on the determination of the times of local extrema, the time series of noise properties were subjected to the operation of removing low frequencies using Gaussian kernel smoothing. Let
be a time series with discrete time
. Gaussian kernel averaging of a time series
with radius (scale parameter)
at time
, is calculated using the formula [
42]:
Since the Gaussian smoothing kernel used in Equation (15) is symmetrical about zero, the position of the local extremum identified after local trend removal in the vicinity of a radius of two samples is unbiased. The local trend removal operation itself, due to the rapid decay of the Gaussian kernel, is a “local” operation, the result of which is influenced by the behavior of the original signal in the vicinity of each point, the radius of which is approximately equal to two samples.
Let us denote
,
, and
as the result of removing the local trends from the signals
,
, and
by the Gaussian window of radius
, equal to 2 days. We are interested in the local extremum points of signals
,
, and
. The results of removing local trends and determining local extremum points are presented in
Figure 9(b1–b3). From the most prominent local extrema, the number 433, representing the most prominent local extrema, was chosen as equal to the number of earthquakes with magnitudes not less than 7.
Let us analyze how the time points of the most significant local extrema of the statistics
,
, and
are distributed. To do this, let us calculate the probability histograms separately for 433 of the smallest local minima
,
and 433 of the largest local maxima of entropy
(which means case
), i.e., empirical probabilities of time values falling within successive six-month intervals. Since the total observation time is 29 years, the number of such intervals is 58.
Figure 10(a1–c1) show graphs of these histograms.
Figure 10(a2–c2) correspond to probability histograms for 433 of the largest local maxima
,
and 433 of the smallest local minima of entropy
(case
). It shows that the last six-month interval has the highest empirical probability of containing local extremum points, except
Figure 10(a2).
Thus, from the graphs in
Figure 10, it is evident that the times of prominent local extremes of almost all seismic noise statistics are significantly concentrated in the second half of 2025.
8. Influence Matrices
Further analysis involves calculating a measure describing the lead or lag of two time sequences, one of which represents the times of strong earthquakes, and the other represents the times of certain characteristic features of seismic noise statistics. Such features could be, for example, the times of the most prominent local extrema.
To solve this problem, we apply the influence matrices method, which was used in [
43,
44,
45,
46] to analyze the prognostic properties of the Earth’s surface tremors, measured by GPS, to analyze the relationship between magnetic field fluctuations and strong earthquakes, and to analyze the relationship between anomalies in meteorological time series and seismicity.
Let
represent the moments of time of two sequences of events. Let us represent their intensities as follows:
where
are parameters,
is the influence function of time moment
from the sequence with number
:
The parameter is a characteristic of the decay time of the weight of the event with number for .
The strength of the mutual influence of the sequence on the flow is defined by the parameter . The self-exciting effect is determined by the parameter . The parameter reflects a purely random (Poisson) component of intensity. Let us fix the parameter and consider the problem of determining the parameters .
The log-likelihood function for the model of intensity (16) equals [
41]:
The parameters
should be found by maximizing (18). As shown in detail in [
13,
43,
44,
45,
46], the problem of finding the parameter values
from the maximum likelihood principle is reduced to solving problems for the maximum:
where
,
,
, under restrictions:
For a given relaxation time
, the problem (19) and (20) is solved numerically. The elements of the influence matrix
could be defined as:
The quantity
is a pure random share of the intensity
of the process
. The share
reflects the self-excitation, whereas
describes the external influence
. The normalization condition follows from (21):
The influence matrix is defined as follows:
The sum of the component rows of the influence matrix (23) equals 1. The matrix is estimated in a moving time window of a certain period for a given value of the relaxation time .
We investigate the combined effect of the advance or lag of the local extrema of the functions , , and relative to the earthquake moments. To do this, we calculate the first principal component of these functions in a sliding time window of 365 days according to Equations (1)–(6) for the P-averaging and denote it by .
The components of the influence matrix (23) are calculated in a sliding time window of 2.6 years with an offset of 0.01 years. This choice of window length is due to the fact that it is equal to the dominant period of the sequence of strong earthquakes (
Figure 8).
The parameter is searched by maximizing the average value of the difference , where is the share of the intensity of the earthquake sequence caused by the “influence” of the sequence of local extrema, and is the share of the intensity of the local extrema sequence caused by the leading influence of seismic events.
The graphs in
Figure 11 show that the local minima of the function
lead earthquake times significantly more than vice versa (average lead measures are 0.714 versus 0.228). Moreover, the lead measure of the local minima
relative to seismic event times is subject to cyclic variations with a period of about 9 years, as can be seen in
Figure 11(b2). The cyclic trend in
Figure 11(b2) (purple line) is determined by minimizing the variance of the residual using a trial cyclic trend with fitted parameters of the linear trend and oscillation period. As for the local maxima of the function
, they almost always occur after earthquakes, and the reverse effect is negligible (the mean lag measure is 0.802 versus 0.023).
9. The Step Component of the Entropy Probability Density and Its Relationship with the Strongest Earthquakes
The number of grid nodes covering the surface of the Earth equals
. Let us calculate the histogram of the weighted probability density functions (PDFs) for each 10-day interval
as a sequence of values:
Here
is the number of bins dividing the range of density variation within a given 10-day window into intervals of equal length,
is the number of PDF values within the interval with number
. We choose the value
according to [
36,
42]. It is evident that
. Thus, values of the histogram (24) have the properties of a discrete probability distribution. The normalized entropy of this distribution is calculated as:
The plot of entropy change (25) is indicated by the gray line in
Figure 12a.
The behavior of entropy (25) in short time intervals of 10 days is non-stationary. We are interested in points where the average entropy value changes significantly. To identify such time points, we apply the Wavelet Transform Modulus Maxima (WTMM) method, which allows us to formalize the determination of significant changes in the average value of a noisy signal using sufficiently long chains of the local maxima of the wavelet transform modulus over time, depending on the scale of temporal changes under consideration. The method for determining jumps in the average level is based on the construction of a piecewise-step approximation of the signal under study. This approximation is constructed using continuous wavelet transforms with a kernel in the form of derivatives of a Gaussian function, which allows one to formalize the selection of moments in time at which a significant, scale-dependent change in the mean value of a noisy signal occurs [
46,
47].
Let us define the smoothed functions of entropy (25):
In Equation (26), a Gaussian function is used as the smoothing kernel, with the parameter
being the smoothing radius. Due to the rapid decay of the Gaussian function and its symmetry, the first derivative of the smoothed entropy can be written in the form:
We define a WTMM point as the argument of the modulus at which a local maximum is achieved with respect to . As the smoothing parameter changes, the points on the plane form so-called WTMM chains. The set of all WTMM chains forms the WTMM skeleton of the signal. Since , the points determine the time instants for which the maximum trends (decrease or increase) of the smoothed function are observed. If we set a parameter , a sequence of WTMM points will arise, from which we can formalize a stepwise approximation of the original signal as a sequence of its average values between adjacent time instants and .
Let us consider the sequence of time instants of stepwise changes in the WTMM approximation of entropy (25) of the values of the weighted average density distribution of extreme values
.
Figure 12a shows a graph of the change in this entropy (gray line) and a graph of the stepwise WTMM approximation. Since entropy (25) is defined as a sequence of values with a time step of 10 days, then when selecting a parameter
, the dimensional length of this parameter is equal to 100 days.
Figure 12b shows the time sequence of the absolute values of steps in the WTMM approximation. We examine the relationship between steps in the WTMM approximation of entropy and the time instants of the strongest earthquakes with a magnitude of at least 7.8, which is shown in
Figure 12c.
Let us calculate the influence matrix between the two time sequences shown in
Figure 12b,c over the entire length of the available sample. The result depends on the choice of the parameter
, which we select based on the condition of maximizing the difference between the influence matrix components responsible for mutual influence. This value is
years. The calculation results are presented in
Table 1.
The results presented in the
Table 1 indicate that the abrupt changes in the entropy values of the weighted average probability densities of the extreme value distributions for the case
significantly precede the times of the strongest earthquakes (the lead measure is 0.382 versus 0.027). It is interesting to note that, when similarly considering the “antagonistic” case
, the mutual influence is negligible.
10. Estimates of Coherence
To analyze the relationships between the length of day time series reflecting the irregularity of the Earth’s rotation and the properties of seismic noise, we need to calculate the quadratic coherence between two time series in a sliding time window. For this purpose, we use a 2-dimensional autoregressive model, which provides high-resolution spectral analysis for short time samples [
48]. For a 2-dimensional time series
, the AR model is described by the formula:
Here
is the autoregression order,
are matrices of autoregression coefficients of the size
, and
is the covariance matrix of the size
of residual signal
. For fast computing of matrices
and
, the Durbin–Levinson procedure is applied [
49]. The spectral matrix estimate is defined by the formula:
where
is a unit matrix of the size
,
is a frequency (in units
, where
is the time step, 1 day in our case,
), and “
” is a symbol of Hermitian conjugation. Squared coherence spectrum is computed by the formula:
Here and are diagonal elements of the matrix (29), whereas is the cross-spectrum.
11. Connection of Seismic Noise Response to the Irregularity of Earth’s Rotation with Major Earthquakes
The connection between the irregularity of the Earth’s rotation measured by the length of day (LOD) and seismicity was analyzed in [
50]. The trigger mechanism of the effect of variations in the Earth’s rotation on the seismic process was studied in [
51]. Estimates of the effect of a strong earthquake on the length of the day are given in [
52].
Figure 13a presents the length of day time series for the interval 1997–2025 [
53].
The relationship between seismic noise properties and irregularity of the Earth’s rotation was investigated in [
13]. We calculated the maximum quadratics coherence spectrum between the LOD and the first principal component of properties
for each of 50 reference points in a moving time window of 365 days with a mutual shift of 3 days. We call these maximum quadratic coherences as the response
of seismic noise to the LOD. We used a 2-dimensional AR-model (28) for
within a moving time window of 365 days with a mutual shift of 3 days.
Examples of such response functions are presented in
Figure 13(b1–b6) for six reference points. Having response functions
from all reference points, we calculated their weighted mean value where we used squared components of the first eigenvector (corresponding to the maximum eigenvalue) of the 50 × 50 size covariance matrix. This weighted mean of all response functions we will call the integral response of three seismic noise properties all over the world to the irregularity of the Earth’s rotation.
Figure 13c of the integrated response of seismic noise properties to the Earth’s rotation irregularity shows that the strongest earthquakes
occur primarily after the response reaches its maximum. This response behavior highlights a distinctive pattern before and after the Kamchatka mega-earthquake of 29 July 2025,
[
37,
38]: a prolonged increase prior to the event and a sharp decline immediately afterward.
To analyze the relationship between local extrema of the integrated seismic noise response to the LOD and seismic events
, we remove local trends from the response function using a Gaussian window. The goal of this operation, as before, is to isolate the effect of the leading time of the most prominent local extrema relative to the earthquake time points. However, it turned out that the optimal window width, for which the leading effect in this case is maximal, is 24 days (eight time steps with a step length of 3 days).
Figure 14a shows the response function graphs after removing local trends and the positions of the 433 largest local maxima.
Figure 14(b1,b2) show graphs of the change in the influence matrix components responsible for the mutual leading of the analyzed time sequences. The assessments were performed in a sliding time window of 2.6 years with an offset of 0.01 years. The relaxation time value
was chosen to maximize the difference between the “forward” and “reverse” lead measures. This value turned out to be equal to
year.
Comparing the graphs in
Figure 14(b1,b2) shows how much the lead measure of the local maxima of the LOD response function relative to earthquake times outperforms the inverse measure (average value of 0.253 versus 0.049).
Figure 14(b2) shows that the lead measure is subject to a cyclical trend with a period of 7.5 years. This period was found by minimizing the residual variance for cyclical trends with trial periods.
12. Discussion
A method for analyzing long-term continuous seismic noise records from a global network of broadband seismic stations is proposed. This method utilizes their multifractal properties and entropy estimates based on wavelet decompositions. The goal of this research is to establish patterns in the evolution of noise properties and their relationships to the sequence of strong earthquakes and the irregularity of the Earth’s rotation.
As a result of the analysis, the following facts were established:
1. From 2002.5 (end of June 2002) to the end of 2025, spatial correlations in global seismic noise properties increased. This increase follows a linear trend, superimposed by oscillations with a period of 2.57 years (
Figure 3(a1,a2)).
2. Since 2011, following the Tohoku mega-earthquake of 11 March 2011 in Japan, there has been a sharp increase in the largest distances at which strong correlations between seismic noise properties occur in a set of 50 control points covering the surface of the globe (
Figure 3c).
3. Along with the growth of the largest distances of strong correlations, the number of pairs of control points between which strong correlations arise also increases. In the second half of 2025, following the Kamchatka mega-earthquake of 29 July 2025 (M = 8.8), the number of pairs of control points with strong correlations exploded, tripling over six months (
Figure 3b and
Figure 4).
4. On the Earth’s surface there are stable areas of concentration of the highest values of probability densities of extreme values of seismic noise properties (
Figure 5 and
Figure 6).
5. The sequence of strong earthquakes with a magnitude of at least 7 demonstrates a complex non-stationary nature of the evolution of its periodic components (an increase in frequency before 2014 and a decrease after 2018) with the identification of a dominant period of 2.6 years, which is almost equal to the period of 2.57 years of oscillations of the average value of pairwise correlations between the properties of seismic noise at reference points (
Figure 8).
6. Probability histograms of time points at which extreme values of seismic noise properties are observed have significant maxima in the second half of 2025 (
Figure 10).
7. The average lead measure for the smallest local minima of the Donoho–Johnstone index, the singularity spectrum support width, and the largest local maxima of the entropy values of the wavelet coefficients—relative to the time moments of earthquakes with a magnitude of at least 7—significantly leads the inverse lead measure (0.714 versus 0.228) (
Figure 11).
8. The full measure of the lead time of the step changes in the entropy of the values of the average weighted distribution densities of the minimum values of the Donoho–Johnstone index, the singularity spectrum support width, and the maximum values of the entropy of the wavelet coefficients—relative to the moments of time of earthquakes with a magnitude of at least 7.8—significantly leads the inverse measure of the lead (0.382 versus 0.000) (
Figure 12,
Table 1).
9. The average lead measure for the largest local maxima of the value of the generalized response of seismic noise properties to the unevenness of the Earth’s rotation relative to the moments of time of earthquakes with magnitudes of at least 7 significantly leads the inverse lead measure (0.253 versus 0.049) (
Figure 14).
13. Conclusions
Taken together, the established facts allow us to assert that three important features, which can be termed “events,” occurred in the Earth’s dynamics, as manifested by low-frequency seismic noise, between 1997 and 2025. The first “event” occurred around 2002.5 (end of June 2002), when the process of increasing global spatial correlation of seismic noise began. The second “event” occurred around 2013, when the increase in spatial correlation of noise increased essentially. Finally, the third “event” occurred in the second half of 2025, when the spatial correlation of noise increased dramatically. The final “event” can be associated with the Kamchatka mega-earthquake of 29 July 2025.
The physical nature of the observed oscillations with a period of 2.6 years in variations in the spatial correlations of seismic noise properties and, simultaneously, in fluctuations in the intensity of strong earthquakes is currently unclear. It is possible that this periodicity is associated with processes in the atmosphere and ocean [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
44,
45].
The results also indicate that the minimum values of the Donoho–Johnstone index and the multifractal singularity spectrum support width, as well as the maximum values of the entropy of the wavelet coefficient distribution, possess significant predictive power for the sequence of strong earthquakes. Furthermore, the maximum values of the response of seismic noise properties to the irregularity of the Earth’s rotation also possess predictive power.