1. Introduction
This article is devoted to the analysis of seismic noise data recorded in the F-net network of stations on the Japanese islands for 29 years, spanning 1997–2025. During this period, on 11 March 2011, a mega-earthquake with a magnitude of 9.1 occurred in Japan. Processes of preparation for strong seismic events are reflected in changes in the properties of seismic noise. The availability of the F-net seismic network with free access to data, provided by the National Research Institute for Earth Science and Disaster Resilience (NIED), makes it possible to test various hypotheses about how changes in the properties of seismic noise reflect the preparation of a strong seismic event. In works [
1,
2,
3], it was shown that the processes of earthquake preparation are preceded by changes in the statistical structure of seismic noise, consisting in simplification of noise, namely an increase in entropy and a loss of multifractality. This article is a continuation of work [
3] on the study of seismic noise in Japan. The possibility of a catastrophic earthquake in the deep-sea Nankai Trough region south of Tokyo has long been a subject of discussion among Japanese seismologists [
4,
5]. The Tohoku mega-earthquake of 11 March 2011 revived interest in assessing the likelihood of a repeat mega-earthquake in this area [
6,
7,
8]. A hypothesis was put forward in [
8] that a new mega-earthquake could even exceed the Tohoku event in energy, and its magnitude could reach 10. The emergence of new data makes it possible to assess the possibility of a new strong earthquake. This paper shows that significant changes in the spatiotemporal structure of seismic noise in Japan occurred in 2020–2025, which may indicate an acceleration of the preparation processes for a new mega-earthquake.
The sources of energy of low-frequency seismic noise energy mostly have atmospheric origin; the crust is the medium for seismic wave propagation. Thus, preparation of strong seismic events are reflected in changes in the properties of seismic noise. The parameters studied were selected after reviewing a large number of other seismic noise properties because they responded most sensitively to the emergence of areas of increased seismic hazard during retrospective analysis. This experience is reflected in publications [
1,
2,
3].
This article examines the relationship between local extremum points of three seismic noise properties and the times of seismic events. Seismic noise properties have the form of a conventional time series with a constant time step of 1 day. A sequence of earthquakes has the structure of a point process, that is, a sequence of time points in which the intervals between events have varying durations and can be either short (e.g., for aftershocks following strong events) or long (seismic lulls). To enable correlation of conventional time series with a constant time step with point processes, this article transforms the time series into point processes by taking the times of the most prominent local extremums of the seismic noise properties time series. It should be emphasized that the result of this transformation depends heavily on which local extrema are taken (maxima or minima).
3. Seismic Noise Statistics
The
minimum entropy of a time series is determined by the formula
, where
,
are the wavelet coefficients of the signal decomposition, and
is the total number of coefficients
. Seventeen orthogonal Daubechies wavelets were used: 10 ordinary minimum support bases with a number of vanishing moments from 1 to 10, and 7 symlets [
11] with a number of vanishing moments from 4 to 10. In each time window, the wavelet for which the value
is minimal is selected.
This set of basis wavelet functions represents a “vocabulary” of bases with a minimum support width and a number of vanishing moments from 1 (Haar wavelet) to 10 [
11]. Increasing the number of vanishing moments above 10 is associated with an increase in the calculation error of the basic functions and is usually not used in practice.
The
Donoho–Johnston (DJ) wavelet-based index is the share of “large” wavelet coefficients in absolute value of their total number. By definition,
. The threshold separating “large” wavelet coefficients is
, where
is a robust estimate of the standard deviation of the normal distribution,
are the wavelet coefficients at the first detail level of decomposition, and
is the number of such coefficients [
11,
12].
The
singularity spectrum support width is considered as a measure of the diversity of the stochastic behavior of the signal. Here,
and
are the minimum and maximum values of the Hölder–Lipschitz exponent [
13], which governs the behavior of the signal in the vicinity of the time instant
:
. For a monofractal signal, the exponent
is the same for all time instants
. If this exponent differs, then the signal is multifractal [
14].
After switching to a 1 min sampling time step, the seismic records from each station were divided into 1-day-long time fragments (1440 samples), and the parameters
of daily seismic noise signals were calculated for each fragment. The methods for estimating the parameters
,
and
for seismic noise records in a moving time window are presented in [
3]. The DFA method [
15] was used to calculate the values of the singularity spectrum carrier width. Detrending of seismic noise signals using an 8th-order polynomial was used before calculating the entropy and index in each daily time window. Thus, time series of values with a 1-day time step were obtained for each of the seismic stations. The graphs of these quantities are presented in
Figure 4(a1,b1,c1).
The question of choosing the best order 8 of the polynomial for trend removal is discussed in detail in [
2] and is driven by the need to remove the influence of tidal and thermal trends in low-frequency seismic noise.
The entropy value
used in this paper, defined through the coefficients of the orthogonal wavelet decomposition, has features in common with multiscale entropy used, for example, in biology [
16,
17]. In [
18,
19], entropy is used within the framework of the natural time approach for seismic data analysis. In [
20,
21], non-extensive Tsallis entropy is used for processing seismic noise data.
The use of multifractal analysis to study the behavior of various complex systems has a long history. Particular attention is paid to the effect of parameter
reduction (loss of multifractality) preceding changes in system properties. In medicine, a decrease in the
value of various parameters accompanies age-related changes [
22,
23,
24]. In [
25,
26], multifractal analysis is used to analyze geoelectric signals and wind speed. Within the framework of the natural time approach, multifractal analysis is used to study both seismicity and other time sequences [
27,
28].
The purpose of using the seismic noise statistics under study is to identify deviations from the conventional distribution law of their properties (e.g., 1/f) that are interpreted as an approximation to the properties of white noise, rather than a complete coincidence with white noise. These deviations are temporary in nature, while the fulfillment of the 1/f law is the result of an assessment over long time intervals. Due to the orthogonality of wavelet basis functions, the total oscillation energy is equal to the sum of the squares of the wavelet coefficients (Parseval equality) [
11]. Thus, the wavelet entropy used is, by its construction, the entropy of the oscillation energy distribution over discrete frequency–time “atoms,” each of which is equal to the square of the wavelet coefficients. This entropy is maximum when the oscillation energy in each “atom” is the same, which coincides with the wavelet decomposition of stationary white noise. It is for this reason that an increase in entropy is considered a sign of a simplification of the structure of chaos.
This article uses wavelet-based entropy, which is transparently related to the entropy of the distribution of oscillation energy over a discrete set of frequency–time intervals. The value of this entropy depends on the choice of basis wavelet functions. This choice is made with the goal of finding a basis for which the maximum information is concentrated in the minimum wavelet coefficients. This choice corresponds to the minimum entropy. For this reason, entropy is called minimal.
In the future, we will be interested in the positions of the time points of a certain number of the largest local maxima or the smallest local minima of daily seismic noise properties in comparison with the times of earthquakes with a magnitude of at least 6 (
Figure 3). To reduce the low-frequency components’ influence of changes in statistical values on the determination of the times of local extrema, the time series of noise properties were subjected to the operation of removing low frequencies using Gaussian kernel smoothing. Let
be a time series with discrete time
. Gaussian kernel averaging of a time series
with smoothing scale
is defined by the formula [
29]:
Next, the smoothed values for the scale
of 2 days were subtracted from the time series of statistical changes, and the most significant local extremum points were defined for the residuals. These operations are illustrated by the graphs in
Figure 4(a2,b2,c2).
Extreme values exhibit the most pronounced effects of the noise statistics under consideration. While a key factor in seismic activity is its triggering nature, the occurrence of strong earthquakes is often attributed to small-scale factors, but the most likely result of such influence is precisely the extreme values.
The distribution of the most prominent local extremes of seismic noise properties is strongly influenced by the low-frequency components of their variations. To avoid their dominant influence, it is necessary to calculate the local extremes after suppressing the low frequencies. This is achieved by preliminary local smoothing of the noise property change time series and subsequent calculation of the residual from removing the local means. The radius of the Gaussian time window, equal to 2 days, was chosen as the minimum acceptable range.
Let us consider how the time points of the most significant local extrema of the statistics
,
and
are distributed. To do this, we combine all the extremum points (352 smallest local minima
,
and 352 largest local maxima of entropy
) into a single time sequence, sort it in ascending order, and calculate the empirical probabilities of time values falling within successive six-month intervals; that is, we calculate a probability histogram of time points. Since the total length of observations is 29 years, the number of such intervals is 58. For all the statistics used, the last time interval in
Figure 5 is characterized by a spike in the empirical probability of extreme values. However, for the minimum values of the DJ index, this spike is the largest. It would have been possible to limit the analysis to the DJ index alone. However, this article uses a joint processing approach for three seismic noise statistics. Therefore,
Figure 5 shows the result for this joint analysis. In the new version of the article, a separate probability histogram for the DJ index alone has been added to
Figure 5. It shows that the last six-month interval has the highest empirical probability of containing local extremum points.
4. Lead Times Between Local Extremes of Seismic Noise Properties and Earthquakes
To quantitatively estimate the relative positions of time points of local extrema of seismic noise properties and earthquake times, a measure of the mutual advance of point processes is used. This method was used in [
30,
31,
32,
33,
34] to investigate the relationships between seismic noise properties, fluctuations of the global magnetic field, GPS tremor of the Earth’s surface, properties of the solar proton flux, and anomalies of meteorological parameters with the seismic process.
To conduct an analysis using the influence matrices method, the number of events in the two sequences being compared must be approximately equal. Choosing a minimum magnitude threshold of 6 ensures a sufficiently large number of events for the analysis to be statistically significant. On the other hand, magnitude 6 earthquakes are powerful enough for their preparation to be reflected in changes in seismic noise properties at a significant portion of the seismic network’s stations.
The lead measure is calculated using a parametric model of the intensity of two interacting point processes. Let
represent the time instants of two event streams. Let their intensities be represented as follows:
where
are the parameters, and
is the function of influence for events
of the flow with number
:
From the Formula (3) it follows that the weight of the event with number decays with relaxation time for . The parameter determines the degree of influence of the event stream on the stream . The parameter defines the strength of the stream on itself (self-excitation), and the parameter defines the stochastic component of intensity. For the fixed value of the parameter , consider the problem of finding the parameters .
The logarithmic likelihood function for a non-stationary Poisson process is equal to over the time interval
[
35]:
It is necessary to find the maximum of functions (4) with respect to
. In [
32,
33] it was shown that problem (4) is reduced to the following:
where
,
,
, under restrictions
Having solved problems (5) and (6) numerically, we can regard the elements of the influence matrix
by the following formulae:
The quantity
is the portion of the average intensity
of the process with index
, which is purely random, the portion
caused by self-excitation
, and the portion
caused by external influence
. The influence matrix is defined by the following formula:
The elements and present Poisson fractions of mean intensities. The elements and present self-excited components of mean intensity. The elements and defined the mutual excitation. For each row of the matrix (8) the sum equals 1. The matrices (8) are calculated in a moving time window of length with mutual shift .
Next, optimization is performed with respect to the parameters
,
with the aim of maximizing the average value of the difference
, where
is the portion of the intensity of the earthquake sequence (the first process) caused by the “influence” of the sequence of local extrema (the second process), and
is the portion of the intensity of the sequence of local extrema caused by the leading influence of seismic events. We will agree to call
the “direct” lead, and the
“reverse” lead. The greater the difference
, the greater the effect of the leading of the local extrema by the time instants relative to the time instants of the earthquakes. The problem of finding the maximum is solved within an a priori range of parameter variations:
,
. The values
,
,
, and
were used in the calculations; all time units are in years. The shift
of the time windows was taken equal to 0.05 years.
Table 1 presents parameters of models (2) and (3) which were obtained after solving the problem
.
Figure 6 presents graphs of the values of
and
for points of local extrema of the analyzed properties of seismic noise with the optimal choice of parameters.
Let us denote by
the result of averaging the measures
of “forward” advance for various properties of seismic noise, presented by the graphs in
Figure 6(a1,b1,c1). This averaging is realized by calculating the Gaussian means of the union of these dependencies using Formula (1) for an averaging radius
equal to 10 days and for a uniform sequence of time points with a step of 10 days. Next, we calculate the same result
of averaging the measures
of “reverse” advance, presented by the graphs in
Figure 6(a2,b2,c2).
Figure 7 shows their difference
in dependence on the position of the right end of successive time intervals of 10 days.
The median time interval between earthquakes with magnitudes
in the region, presented
Figure 1, equals 13.7 days. Therefore, a time step of 10 days was chosen as the closest “round” number.
The value
is an integrated measure of the lead time of local extrema in seismic noise properties relative to the times of earthquakes with magnitudes of at least 6. The vertical blue line in
Figure 7 represents the time of the Tohoku mega-earthquake of 11 March 2011. The graph in
Figure 7 shows that this measure is mostly positive; that is, for most times, the points of local extrema in seismic noise statistics lead the times of earthquakes. An exception is a time interval of just under 4 years, adjacent to the time of the Tohoku event on the right and extending to the 2015 timestamp, when the value
is negative. The graph in
Figure 7 also shows other time segments when
, but they are not as long as after the Tohoku earthquake. The time intervals
correspond to a situation when, on average, the attainment of extreme values of properties
,
and
is a consequence of earthquakes. If
, then this means that local extremes of noise properties foreshadow seismic events.
For two long time periods before the Tohoku event and after the 2015 timestamp, quasi-periodic fluctuations in the measure
values are noticeable. We estimate their periods by fitting harmonic oscillations (shown as green lines in
Figure 7) using the least-squares method with unknown periods, which we find by minimizing the residual variance. It turns out that these periods are quite close to each other, equal to 3.5 years before the Tohoku event and 3.2 years after 2015.
5. The First Principal Components of the Amplitudes of the EEMD Envelope Decompositions of Seismic Noise Properties
Let us consider the issue of identifying the general fluctuations of the median values of the noise properties
,
and
, the graphs of which are presented in
Figure 4(a1,b1,c1). To solve this problem, we will use the Hilbert–Huang method [
36,
37] of decomposing signals into sequences of intrinsic mode functions (IMF), also known as empirical mode decomposition (EMD). The use of EMD has a wide range of applications. In [
27], this method was used to analyze seismicity on various time scales in combination with multifractal analysis. In [
31,
33], EMD was used to analyze the prognostic properties of the earth’s surface of GPS-measured tremors and anomalies of meteorological time series.
The components of IMF decompositions can be formally associated with the frequency bands of the detail levels of wavelet decompositions or simply with the result of frequency bandpass Fourier filtering for a sequence of frequency bands with uniform frequencies on a logarithmic scale. The Hilbert–Huang decomposition differs from wavelet or Fourier decompositions in that it lacks a pre-assigned basis of orthogonal functions. This circumstance allows the Hilbert–Huang decomposition to take into account nonstationary and nonlinear effects much more flexibly [
36,
37].
The decomposition of an arbitrary signal
into empirical modes is given by the following formula:
where
is the
j-th function,
is the remainder, and
n is the number of empirical modes.
Let us describe the iterations for determining empirical mode function for each level
. Let
be the iteration index, where
is the final number of iterations. The iterations are described by the formula
Here, , where and are the upper and lower envelopes for the signal , which are constructed by 3rd-order splines over all local maxima and minima of the signal .
Iterations (10) are started with a zero step for the first level. Next, the upper and lower envelopes are found, the mean line is calculated, and is found using Formula (10). For , the upper and lower envelopes are determined, and the mean value is found, and so on, up to the last iteration index , after which the first empirical mode is considered to have been found.
Iterations are stopped according to the following condition:
where
is a small number, for instance, 0.01. After the mode
is found, the next mode
is started to be found with iteration
:
A new signal for subsequent decomposition is defined by subtraction (12) of the high-frequency component from the original signal. The construction of empirical oscillation modes continues until the number of local extrema becomes too small to be used to construct envelopes.
As the empirical mode level number increases, the signals become increasingly low-frequency and tend toward an unchanging form. The sequence is constructed so that its sum gives an approximation (9) of the original signal.
Empirical oscillation modes, known as intrinsic mode functions (IMFs), are orthogonal to each other, thus forming the empirical basis for decomposing the original signal. From here on, these decomposition levels will be referred to as IMF levels.
One of the drawbacks of the EMD method is an “intermittency” which occurs when a single empirical mode includes signals of different scales. To overcome this effect of mode mixing, in [
37] the Ensemble Empirical Mode Decomposition (EEMD) method was proposed. The EEMD modification could be presented as the following steps.
1. Adding a white noise realization to the original data.
2. EMD decomposing into empirical modes with the added white noise.
3. Repeating steps 1 and 2 a sufficiently large number of times with independent white noise realizations.
4. Obtaining the ensemble mean for the corresponding empirical modes.
The sequence of artificial observations is simulated in real time series:
where
is the
i-th realization of white noise. In our calculations, we used 1000 realizations of independent white noise with a standard deviation equal to 0.1 of the standard deviation of the original signals.
After obtaining EEMD realizations of daily time series of noise properties
,
and
for a sequence of IMF levels, we calculate the amplitudes of their envelopes using the Hilbert transform, which is efficiently implemented using the fast Fourier transform [
38]. For each IMF level, we have three time functions of the envelope amplitudes corresponding to different properties of seismic noise. To isolate their common properties at each level, we calculate their first principal component after centering and normalization to unit variance [
39,
40]. The graphs of the first principal components of the envelope amplitudes are presented in
Figure 8 for the first eight IMF levels of the EEMD. Each IMF level approximately corresponds to a frequency band with boundary periods from
to
, where
is the IMF level number and
is the time sampling step (in our case, 1 day).
Among the graphs in
Figure 8, the low-frequency IMF levels 5–8 are particularly striking. At IMF levels 5–7, peaks in the first principal component centered on the time of the Tohoku mega-earthquake of 11 March 2011, are visible, indicating a post-seismic response. However, the IMF level 8 graph (periods from 256 to 512 days) displays a significant peak for the time interval from mid-2024 to late 2025, which may indicate preparation for the next strong earthquake.
The greatest synchronization of the envelope amplitudes at the 8th IMF level (the lowest frequency) in 2025 may indicate that a very powerful earthquake is preparing in the Pacific Trench near the coast of Japan, which may reach magnitude 10, as follows from the estimates in the article [
8]. As for the bursts of the first principal component of the envelopes at the IMF06–IMF08 levels around 2002, they may foreshadow the earthquake of 25 September 2003, off the coast of Hokkaido with a magnitude of 8.3.
6. Extreme Values Probability Densities
Let us study the variability of the spatial distribution of extreme values of seismic noise properties. For this purpose, we consider a regular grid of 30 × 30 nodes, covering an area with latitude from 28° N to 46° N and longitude from 128° E to 146° E (
Figure 1). Let
be any value of
,
or
. For each grid node
and for each day with number
, we find the five nearest working seismic stations, which yields five values of
. Let us denote by
the median value of these five properties at a node
on a day with number
. The values
in the set of nodes of the regular grid can be considered as an elementary daily map. For each daily map with a discrete time index
let us find the coordinates of the nodes
at which a given number
of extreme values of
is reached relative to all other nodes of the regular grid. Minima are found for
or
, and maxima are found for
. Further,
extreme values will be used. The set of two-dimensional vectors
considered within the time interval
forms a random set. We estimate the two-dimensional probability distribution function of the vectors
for each node of the regular grid. For each node
, the probability density function of the distribution of extreme values of the property
is calculated according to the Parzen–Rosenblatt estimate with the Gaussian kernel function [
40]:
Here, is the kernel averaging scale, are integer indices that number the daily elementary maps, and is the daily maps number within the time interval under consideration. A smoothing scale was used.
Let us calculate the two-dimensional distribution densities (14) of extreme values in successive time windows of 10 days in length (
). Since we have three such distributions, we will calculate their weighted average. The weights are the squared values of the main eigenvector (corresponding to the maximum eigenvalue) components of the correlation matrix of probability densities [
39]. By construction, the sum of weights equals to 1.
Figure 9 shows graphs of the weighting coefficients depending on the position of the right end of time windows of 10 days in length.
To quantitatively describe the sequence of values of the weighted probability density function (PDF) for the distribution of extreme seismic noise properties in 10-day time windows, we consider the entropy of these densities. Let
be the number of grid nodes within the region formed by the union of circles with a radius of 250 km, centered at each F-net station (
Figure 1). We calculate the histogram of the weighted probability density functions for each 10-day interval
as a sequence of values:
where
is the number of intervals dividing the range of density variation within a given 10-day window into intervals of equal length. According to recommendations [
29,
40], we choose the value
. Obviously
, the values of the histogram (15) also have the properties of a discrete probability distribution. We define the normalized entropy of this distribution by the following formula:
In
Figure 10, the gray line represents the graph of entropy change (16) depending on the position of the right end of 10-day time windows. The red line represents the result of smoothing with a Gaussian kernel (Formula (1)) with a smoothing kernel width
of 400 days.
The blue line represents a stepwise approximation using so-called long WTMM (Wavelet Transform Modulus Maxima) chains of the signal skeleton. Scale-dependent stepwise approximation allows one to formally determine the moments in time of significant changes in the mean entropy values. This approximation is constructed using continuous wavelet transforms with a kernel in the form of derivatives of a Gaussian function, which allows one to formalize the selection of moments in time at which a significant, scale-dependent change in the mean value of a noisy signal occurs [
11,
41].
To describe the construction of the stepwise WTMM approximation, we will use continuous-time formulas, which are easily adapted to the discrete-time case but are more compact. The smoothed entropy value is calculated using the following formula:
where
is the smoothing time scale. For the
-th derivative of the smoothed signal, divided by
(the Taylor coefficient), the following expression holds:
where
. A two-dimensional WTMM point
for
is defined as a point for which
exhibits a local maximum in time
for a given time scale
. For
, WTMM points are present points of local extrema (maxima or minima) of the smoothed signal
, which can be combined into chains. The set of all chains forms the WTMM skeleton of the signal. If the kernel
is Gaussian, then a given chain of the WTMM skeleton does not terminate (does not “hang in the air”) when decreasing the scale and touches the time axis [
42]. The WTMM points for the 1st-order derivative
indicate the time points of maximum trend (positive or negative) of the smoothed signal
for a given scale
. This can be used to formally identify points of large changes in the mean. These times correspond to the roots of long WTMM chains reaching given values of the scale
. We define a stepwise WTMM approximation for a signal
as a function that is equal to a sequence of constant values
in intervals
. Here,
denotes the beginnings of WTMM chains for
, which exceeds a threshold time scale
, and stepwise approximation equals the sequence of average values
within the time intervals
.
In
Figure 10, the blue line represents the stepwise WTMM approximation for the parameter
days. This threshold time scale ensures that the overall time interval is divided into 12 fragments with different average entropy values, which are then interpreted as stages in the development of seismic noise field features in the Japanese islands. A comparison of the red and blue lines in
Figure 10 shows that both averaging methods exhibit roughly the same behavior in the most general trends. In particular, the final time interval from the end of 2020 is characterized by elevated average entropy values.
7. A Sequence of Distribution Maps of Average-Weighted Probability Densities of Extreme Values of Noise Properties
Figure 11 shows a sequence of distributions of average-weighted probability densities of extreme values of seismic noise properties for 12 time intervals allocated by the stepwise WTMM approximation of entropy (16).
Within the framework of the considered hypothesis, the areas of concentration of the weighted probability density are interpreted as “spots of increased seismic hazard.” The time intervals identified by the stepwise approximation of entropy (16) can be represented as a sequence of 12 stages of precursor development based on the properties of seismic noise. The first six distribution maps (α01–α06) correspond to the time intervals preceding the Tohoku event, the epicenter of which is shown by a black asterisk. In the first two maps (α01,α02), it is noticeable how a single area of concentration of the probability density at α01 splits into two parts, one of which at α02 closely approaches the epicenter of the Tohoku event. Stage α03 lasts for approximately six months, and during this short period of time, for which the average entropy value (16) declines, a region of probability density concentration formed in the region of 30° N ≤ Lat ≤ 34° N, 136° E ≤ Lon ≤ 140° E, which remains stable for all later stages. A clear precursor to the future Tohoku mega-earthquake of 11 March 2011 formed during stages (α04–α06). Note the significant duration of the time interval corresponding to stage α06.
Five stages (α07–α11) demonstrate a relatively calm evolution of “seismic hazard spots” with periodic moderate concentration of probability density in the Tohoku aftershock region, the bursts of which correspond to an increase in the average entropy values (16) in
Figure 10. Finally, the last stage, α12, is characterized by a significant rearrangement of the configuration of seismic hazard spots. Concentration of probability density is observed along the entire length of the Pacific coast of Japan (subduction region), while the area of the highest density values remains low-lying in the region 30° N ≤ Lat ≤ 34° N, 136° E ≤ Lon ≤ 140° E. During stage α12, the Aomori earthquake occurred on 8 December 2025, M = 7.6, the epicenter of which is shown by a black asterisk on the α12 map.
The region 30° N ≤ Lat ≤ 34° N, 136° E ≤ Lon ≤ 140° E finally formed at stage α03 and remains a stable region of elevated seismic hazard for all subsequent stages. At stages α04-α06, a region of elevated seismic hazard is visible in the region of the 11 March 2011 Tohoku mega-earthquake, which is interpreted as a precursor. This region disappears completely at stage α07 after the mega-earthquake, but at stages α08–α11, it manifests as episodic activation of moderate seismic hazard in the aftershock region of the Tohoku event. As for the final stage of α12, the seismic hazard zone encompasses the entire Pacific coast of Japan, but the region 30° N ≤ Lat ≤ 34° N, 136° E ≤ Lon ≤ 140° E remains dominant. Therefore, it is interpreted as the most likely location for the next mega-earthquake.
8. Conclusions
A method for analyzing the evolution of low-frequency seismic noise field properties in a seismically active region is proposed. This method is based on studying the temporal and spatial distribution of extreme values of noise properties, which describe the complexity of its statistical structure. An increase in seismic hazard is interpreted as a simplification of the noise structure and its approach to the properties of white noise, as well as the synchronization of temporal variations in these properties. A method for identifying stages in the development of a seismic noise field is proposed based on a stepwise WTMM approximation of the change in the entropy of the distribution of weighted average densities of the distribution of extreme values of seismic noise properties. The method is applied to the analysis of noise properties on the Japanese islands over 29 years of observations, 1997–2025.
The main conclusions are that the spatial and temporal behavior of low-frequency seismic noise properties in Japan since 2020, and particularly in the last two years, reveal a number of indicators that point to the preparation of the next strong earthquake.
Figure 5 shows a spike in the probability histogram of the distribution of time points when the values of the properties under consideration reach extreme values over the last six months.
Figure 7 shows a pattern of quasi-periodic oscillations in the average integrated measure of the lead time of seismic noise property extreme points relative to the time points of strong earthquakes with a magnitude of at least 6 since 2015, with a peak value at the end of 2025, following a four-year interval when this pattern disappeared following the Tohoku mega-earthquake of 11 March 2011.
Figure 8 shows a spike in the first principal component of the envelope amplitudes for the EEMD Hilbert–Huang decomposition of the daily median values of three seismic noise properties for the lowest-frequency IMF level number 8 (characteristic periods from 256 to 512 days) for the 2024–2025 time interval. This spike indicates synchronization of low-frequency variations in seismic noise properties.
Figure 10 shows that the average entropy values of the weighted average probability density distribution of noise property extremes reach their maximum values from the end of 2020 to the end of 2025.
Figure 11, which visualizes 12 stages of seismic noise field development on the Japanese islands, shows that the situation before the Tohoku mega-earthquake (stages (α04–α06)) is repeated at the final stage, α12, in that new “seismic hazard” spots appear, covering the entire Pacific coast of Japan. At the same time, the maximum probability density in the southern part of the studied area of the Japanese islands, which formed at stage α03, remains stable.
Taken together, these features are interpreted as indicators of increased current seismic hazard, with particular attention to the region 30° N ≤ Lat ≤ 34° N, 136° E ≤ Lon ≤ 140° E. Naturally, this is not a prediction of the timing of the next strong earthquake; the essence of this study is to detect a trend of increasing seismic hazard.