Next Article in Journal
A Successive over Relaxation Implicit Iterative Algorithm for Solving Stochastic Linear Systems with Markov Jumps
Next Article in Special Issue
Predicting Compressive Strength of High-Performance Concrete Using Hybridization of Nature-Inspired Metaheuristic and Gradient Boosting Machine
Previous Article in Journal
Evaluation of the Segmental Casting Length of Strongly Restrained Super-Long Mass Concrete Based on Crack Resistance
Previous Article in Special Issue
Analysis of a Predictive Mathematical Model of Weather Changes Based on Neural Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hybrid Model of Natural Time Series with Neural Network Component and Adaptive Nonlinear Scheme: Application for Anomaly Detection

by
Oksana Mandrikova
* and
Bogdana Mandrikova
Institute of Cosmophysical Research and Radio Wave Propagation, Far Eastern Branch of the Russian Academy of Sciences, Mirnaya St, 7, Paratunka, 684034 Kamchatskiy Kray, Russia
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(7), 1079; https://doi.org/10.3390/math12071079
Submission received: 17 February 2024 / Revised: 30 March 2024 / Accepted: 31 March 2024 / Published: 3 April 2024

Abstract

:
It is often difficult to describe natural time series due to implicit dependences and correlated noise. During anomalous natural processes, anomalous features appear in data. They have a nonstationary structure and do not allow us to apply traditional methods for time series modeling. In order to solve these problems, new models, adequately describing natural data, are required. A new hybrid model of a time series (HMTS) with a nonstationary structure is proposed in this paper. The HMTS has regular and anomalous components. The HMTS regular component is determined on the basis of an autoencoder neural network. To describe the HMTS anomalous component, an adaptive nonlinear approximating scheme (ANAS) is used on a wavelet basis. HMTS is considered in this investigation for the problem of neutron monitor data modeling and anomaly detection. Anomalies in neutron monitor data indicate negative factors in space weather. The timely detection of these factors is critically important. This investigation showed that the developed HMTS adequately describes neutron monitor data and has satisfactory results from the point of view of numeric performance. The MSE model values are close to 0 and errors are white Gaussian noise. In order to optimize the estimate of the HMTS anomalous component, the likelihood ratio test was applied. Moreover, the wavelet basis, giving the least losses during ANAS construction, was determined. Statistical modeling results showed that HMTS provides a high accuracy of anomaly detection. When the signal/noise ratio is 1.3 and anomaly durations are more than 60 counts, the probability of their detection is close to 90%. This is a high rate in the problem domain under consideration and provides solution reliability of the problem of anomaly detection in neutron monitor data. Moreover, the processing of data from several neutron monitor stations showed the high sensitivity of the HMTS. This shows the possibility to minimize the number of engaged stations, maintaining anomaly detection accuracy compared to the global survey method widely used in this field. This result is important as the continuous operation of neutron monitor stations is not always provided. Thus, the results show that the developed HMTS has the potential to address the problem of anomaly detection in neutron monitor data even when the number of operating stations is small. The proposed HMTS can help us to decrease the risks of the negative impact of space weather anomalies on human health and modern infrastructure.

1. Introduction

Information retrieval from natural data includes stages for the construction of models, methods and algorithms of analysis. The known problems in this field are nonstationary data and incomplete a priori knowledge on the information component and noise. This significantly complicates the process of model construction and methods for natural data analysis. In some critical related fields (physics and technique, biology, medicine, etc.), such a problem results in the insufficient efficiency of existing methods. This is of high significance regarding the problems of natural anomaly detection (earthquakes, space weather anomalies, tsunamis, etc.) [1].
The object of research in this paper is the data from global network neutron monitors recording intensity variations (particles per a minute) in cosmic rays (CRs). Neutron monitor data time series contain regular cyclic components (22-year, 11-year, 27-day and solar-diurnal variations) and anomalous features of nonstationary structures having the form of bursts of different amplitudes and durations, series of spikes, etc. [2]. Generation factors of such anomalies are coronal mass ejections (CMEs), high-speed solar wind streams from coronal holes and anomalous processes in the near-Earth space. Neutron monitor data anomalies indicate negative factors of space weather such as occurrences of Forbush effects (sudden decreases and/or increases in cosmic ray intensity) or GLE events (ground-level enhancement events—strong anomalous increases in cosmic ray intensity on the Earth’s surface) [3]. During such anomalies, radiation hazards for astronauts, airplane crews and passengers on polar routes increase. Space systems also undergo negative impacts due to technique losses. Thus, the near-real-time detection of anomalies in neutron monitor data is an important applied aspect of research [1].
The investigation of cosmic ray (CR) variations and the development of methods for their analysis began in the first half of the 20th century and has continued through today. One of the most successful and known methods for the investigation of CR parameters is the global survey method [4]. This method includes the method of CR variation communication functions, the method of particle trajectory calculations and spherical analysis for the detection of particle significant spherical harmonics [4]. The global survey method makes it possible to estimate CR flux characteristics with satisfactory accuracy in the case of an uninterrupted operation of a certain number of recording stations. However, due to calculation complexity, the method cannot be realized in automatic mode and it is not effective for the detection of initial proton increases.
Threshold algorithms, used by the Australian Space Weather Service [5] and GLE Alert system [6] to analyze CR variations, make short-term forecasts of radiation hazards according to the data of neutron monitors in an on-line regime. However, the efficiency of these algorithms is low due to their insensitivity to low-amplitude anomalies. The investigations in [7] showed that the GLE Alert algorithm can give unreliable results. Application of this algorithm for 3 years did not allow identification of more than 50% of solar proton events.
Thus, the applied methods for neutron monitor data analysis do not satisfy modern requirements and new approaches are needed in this field [8].
A high proportion of uncertainty in knowledge, the significant nonstationarity of neutron monitor data and noise correlation make it difficult to apply classical methods for time series analysis (ARIMA models, methods for time series decomposition, etc.) and result in low efficiency [9]. At the present time, methods of artificial intelligence and machine learning are actively applied in different applied fields to solve these problems. For example, the authors of [10,11] suggested using neural networks to forecast the state of oil pipelines. The approach in [10], based on the application of neural networks of direct propagation, made it possible to forecast the conditions of oil pipeline exploitation and to classify metal loss defects. In the research in [12], the authors proposed using neural networks with Bayesian regularization in order to solve this problem. A new approach was suggested in [12] to forecast the operation capability of a dry gas transmission system and to classify whether the amount of metal loss was effective even in the case of the absence of a priori data. For the problem of the diagnostics of rotating machine faults, a group of researchers [13] developed a model of a neural network with deep momentum transfer (SNN). The investigations in [13] showed that the developed method allows one to carry out contactless processing of visual data event fluxes, distinguish features and make diagnostics of machine faults.
In the field of this investigation, the author of [14] studied the possibility of application of graph neural networks to investigate the CR data energy spectrum. The neural network approach in [14] made it possible to decrease the time and computational efforts and showed more accurate results compared to the application of the likelihood function. However, the method results in [14] significantly depend on CR primary energy and on the configuration features of an applied detector.
The authors of [15] suggested a method for analysis of the arrival directions of super-high energy cosmic ray fluxes using a deep convolutional neural network. The results in [15] showed that the suggested method is more effective and more sensitive than the approach based on the angle power spectrum. The advantages of the method are the low generalization and efficiency decreases of the neural network when test models deviate from those used for training.
At the present time, hybrid models are actively developing to solve the problems indicated above. They combine classical statistical methods for data analysis and new developments in the fields of artificial intellect, machine learning and signal digital processing. The synthesis of mathematical techniques with adaptive modern tools allows one to extend the capabilities of applied constructs and to improve their efficiency [8,16,17,18,19,20,21,22]. For example, the paper [17] suggests a hybrid model based on the combination of the method of empirical mode decomposition (EMD) and the deep LSTM neural network. The approach proposed in [17] makes it possible to forecast the time series of climatic indexes and solar spots with long-term periodic behavior. The forecast changeability (or uncertainty) is suggested to be detected on the basis of the combination of the EMD method and the K-nearest neighbor. The paper [21] proposes an approach to the forecast of total electron content in the ionosphere (ionospheric parameter). This approach includes a combination of methods for ensemble empirical mode decomposition, K-averages and the self-service LSTM neural network. According to the estimates, the model suggested in [21] has a higher performance compared to some typical forecast methods applied to this problem. A hybrid model of a time series is suggested in [22]. It includes ARIMA components and multiple-scale wavelet analysis (MSA) components. The combination of the time series classical models with the MSA allowed one to obtain the adaptive model of an ionospheric parameters time series.
In this investigation, we propose a new hybrid model of a time series (HMTS) with nonstationary structure. It includes a neural network component and a nonlinear adaptive approximating scheme. The HMTS neural network component describes the data’s regular time variation and its parameters are estimated on the basis of an autoencoder network. The autoencoder is a nonlinear method of main components. It allows us to approximate dependences of a priori unknown structure and suppress noise. Today the autoencoder is successfully applied to solve different applied tasks. For example, in the paper [23] the authors suggested a federated semi-supervised method for the diagnostics of data transfer errors called targeted transfer learning through distribution barycenter medium (TTL-DBM). The application of the autoencoder network in [23] made it possible to aggregate key data distribution parameters and generate a distribution barycenter in an intermediate link for federated adaptation.
We applied the autoencoder network for the first time to approximate the regular time variation of neutron monitor data in the investigation in [24]. In the same paper [24] the autoencoder efficiency was shown for anomaly detection based on the search for the change points in a system. For the first time, we considered application of nonlinear approximating schemes for detection of anomalies in CR variations in [25]. The results in [25] showed the prospects of this approach for detection of multi-scale anomalies. Moreover, the possibility of application of both the autoencoder and the nonlinear approximating schemes was considered in the papers [24,25]. The investigations in [24,25] showed that the autoencoder is more effective for the detection of narrow-band anomalies, and the nonlinear approximating scheme is more effective for the detection of short-period different-scale anomalies. In the following research [26], we compared the method of singular spectral analysis (SSA) with the autoencoder and adaptive threshold filtering (AADA algorithm). The study [26] showed that the SSA can be applied to detect CR variation components during the process of dynamics analysis. However, the autoencoder and adaptive threshold wavelet filtering are more effective for anomaly detection. This paper continues these investigations. The combination of the autoencoder with the nonlinear approximating scheme in the form of the hybrid model of a time series with nonstationary structure develops that theory. This paper presents a formal form of the HMTS, and the methods to estimate its parameters have been developed. In order to optimize the calculation of the HMTS anomalous component, the likelihood ratio test was applied. The analytical expressions of threshold function parameters were obtained. They minimize data approximation errors. Moreover, the wavelet basis selection criterion, when constructing a nonlinear approximating scheme, was proposed. This allowed us to improve the HMTS efficiency for anomaly detection. Based on statistical modeling, the HMTS’s high efficiency was proved for the problem of anomaly detection in neutron monitor data.

2. Materials and Methods

2.1. Hybrid Model of a Natural Time Series of a Nonstationary Structure

A natural time series of a nonstationary structure can be represented in the following form:
f t = R t + A t + e t = r S r t , θ r + m a m f , g m g m + e t ,
where R t = r S r t , θ r is the regular component, which has a parametric form and is a linear combination of the components S r t , θ r , r is the component number, θ = θ r is the parameter set, A t = m a m f , g m g m is the nonstationary component, which includes local features of different forms and durations and is represented in the form of a nonlinear approximating scheme [27], in which a m depends on f , g m , g m is the orthonormal basis, · is the scalar product, t is the time.

2.2. Determination of the Regular Component R t of Model (1)

The regular component R t of model (1) includes implicit complex regularities. Thus, we represent it by means of the autoencoder neural network. The autoencoder is a nonlinear method of main components, and it makes it possible to determine the form of hidden regularities and estimate their parameters [28]. In terms of the autoencoder, we obtain the following estimate for the regular component R t of model (1):
R ^ t = r S r t , θ r = Z θ r 2 H θ r 1 f t ,  
where Z , H are the activation functions of the neural network for output and inner layers, respectively, θ = θ r i i = 1,2 are the parameters, θ r i are the weight matrix of the i -th layer.
From ratio (2) we estimate the parameters θ ^ = θ ^ r i i = 1,2 of model (1) based on minimizing the cost function [28].
C N E T = k = 1 l R ^ t f k t 2 ; θ ^ = a r g m i n θ Θ k = 1 l R ^ t f k t 2 ,
where f k t is the k -th example (data sampling), Θ is the parameter set.
To verify the adequacy of the estimated regular component R ^ t (ratio (2)), for example, Ljung–Box Q -statistics can be applied [29]. The Q -statistics tests the hypothesis on the absence of model residuals’ autocorrelation. In order to test the normality of the model residuals’ distribution, the Jarque–Bera test can be used [29].

2.3. Determination of the Anomalous Component A t of Model (1)

The anomalous component A t of model (1) has a nonstationary structure and is represented in the form of the nonlinear approximating scheme (1).
A t = m a m f , g m g m ,
where it is assumed that f L 2 R , L 2 R is the Lebesgue space.
For nonparametric representation (4), the problem of estimation of the component A t parameters is reduced to the filtering problem, which consists of the construction of the operator D A minimizing the risk:
R D A f , A = i n f D A O E D A f A 2 ,
where E is the mathematical expectation, · is the norm in L 2 R , O is the operator set.
In ratio (4), the nonlinear diagonal operator is considered as the operator D A [30], thus, risk (5) is determined by the following value [27]:
R D A f , A = Ε A ^ A 2 = m = 1 N Ε A , g m f , g m a m 2 ,
N is the series length.
Then, we assume that
f t = A t + e A t ,
where the noise e A t is correlated and the noise coefficient dispersion σ m 2 = E e A , g m 2 depending on g m , the mathematical expectation of noise E e A t = 0 . Then, from ratio (7), the risk R D A f , A (6) is minimum when [27]
a m = A , g m 2 A , g m 2 + σ m 2 .  
If we assume that a m 0,1 , the operator D A , minimizing the risk in (8), can be determined as [30]
a m = 1 , i f A , g m h × σ m 0 , i f A , g m < h × σ m , h = 2 l n N .
Taking into account (9), the risk [30]
R D A f , A = R σ m A = Ε D A f A 2 = m = 0 N m i n A , g m 2 , σ m 2 .
Then, applying the threshold estimate in the orthonormal wavelet basis G Ψ = { Ψ k , n } k , n Z 2 , Ψ k , n t = 2 k 2 Ψ 2 k t n , the operator D A Ψ , minimizing the risk, is [31]
A ^ t = D A Ψ f t = k n = 1 N P σ k , n f , Ψ k , n Ψ k , n t ,
where P σ k , n f , Ψ k , n = f , Ψ k , n , i f f , Ψ k , n h σ k , n 0 , i f f , Ψ k , n < h σ k , n ,   σ k , n 2 = E e A Ψ , Ψ k , n 2 .
The operator D A Ψ in ratio (11) determines the nonlinear approximating scheme in the wavelet basis.
Note. In ratio (11) the threshold dependence T k , n = h σ k , n is considered both on the space variable n (time) and on the scale k . This makes it possible to optimize the threshold both to time and scale changes of data. Moreover, as is shown below, in this case, thresholds close to optimal ones can be found based on the ratio criterion [32].
When joining the expression in (2) and (11), we obtain the hybrid model of a nonstationary time series (HMTS):
f t = R t + A t + e t = r S r t , θ r + m a m f , g m g m + e t = = Z θ r 2 H θ r 1 f t + k n = 1 N P σ k , n f , Ψ k , n Ψ k , n t + e t ,
where Z is the linear function of the network output layer activation, H is the signoidal function of the network inner layer activation, θ = θ r i i = 1,2 are the parameters of the neural network determined in ratio (3), P σ k , n · is the threshold function with the thresholds determined in ratio (11).

2.4. Optimization of the HMTS. Determination of the Wavelet Basis Giving the Least Errors

From ratio (10), the risks of the estimates A ^ t on the wavelet basis can be limited by the value
R Ψ D A Ψ f , A = Ε D A Ψ f A 2 = k , n m i n A , Ψ k , n 2 , σ k , n 2 k , n σ k , n 2 .
Ratio (13) makes it possible to control the risk of the estimate A ^ t based on noise dispersion σ k , n 2 . Moreover, having estimated the risk for different bases from the dictionary Ω and using (13), we can determine the wavelet basis giving the least losses.
Ψ o p :   R Ψ o p D A Ψ f , A = min Ψ Ω R Ψ D A Ψ f , A .

2.5. Optimization of the Estimate of the Model Anomalous Component A ^ t by the Ratio Criterion

Derivation of an optimal estimate (10) is based on the assumption that E e A t = 0 , which for the correlated noise σ m is not always realized. However, the function mapping into wavelet space gives an important advantage, which follows from the Jaffar theorem [27]: an increase in the amplitude of wavelet coefficients f , Ψ k , n indicates the appearance of a local feature of the function f t in the vicinity of the point t = n ; out of the neighborhood, containing local features, wavelet coefficient amplitudes satisfy the condition
f , Ψ k , n Q k q + 0.5 =   П k ,
Q = c o n s t > 0 ,   q is the Lipschitz index.
Thus, it follows from (15) that the values f , Ψ k , n = e A Ψ , Ψ k , n   with respect to the argument n are close to zero. Thus, the following condition is provided in the wavelet space: E e A Ψ t = 0 . The f , Ψ k , n = c k , n value’s deviation from zero is the result of noise effect in this case (Figure 1).
Then, using the ratio criterion [32], we assume that there is a local feature in the vicinity of the point t = n if
W c k , n Г 1 W c k , n Г 0 T k , n ,  
where W c k , n Г 0 and W c k , n Г 1 are the absolute value probability densities for the coefficients c k , n when a local feature is absent or is present, respectively. In its turn, following the Neyman–Pearson criterion [32], the threshold
T k , n = T α , k , n   ( α   is the probability of   I- kind error )
is determined from the condition Φ 0 W c k , n Г 0 d c k , n = α , where Φ is the critical region (Figure 1).
Thus, from (11), (16), (17) we obtain the optimized estimate of the model anomalous component in the form of the nonlinear adaptive approximating scheme (NAAS):
A ^ t = D A Ψ f t = k n = 1 N P T α , k , n c k , n Ψ k , n o p t ,
where P T α , k , n c k , n = c k , n , i f c k , n T α , k , n 0 , i f c k , n < T α , k , n , T α , k , n = t 1 α 2 ,   M σ ^ k , n ,   t 1 α 2 ,   M  is the quantile of the level 1 α 2 .
As the result, from (12), (18) we have the optimized hybrid model on a nonstationary time series (HMTS):
f t = R t + A t + e t = r S r t , θ r + m a m f , g m g m + e t = = Z θ r 2 H θ r 1 f t + k n = 1 N P T α , k , n c k , n Ψ k , n o p t + e t ,
Z , H are the activation functions, θ = θ r i i = 1,2 are the model regular component parameters, P T α , k , n c k , n are the threshold functions with the thresholds T α , k , n = t 1 α 2 ,   M σ ^ k , n , Ψ k , n o p t = 2 k 2 Ψ o п 2 k t n is the wavelet.

3. Construction of the HMTS for Neutron Monitor Data

In the investigation, data from Nain, Inuvik, Oulu, Tule and South Pole stations were used [33]. The choice of the stations was determined by the experience of the experts from the applied subject field [4]. NM data record cosmic ray intensities (particles per minute). In the HMTS terms (representation (19)), NM data time variation can be represented as
f t = R t + A t + e t = Z θ r 2 H θ r 1 f t + k n = 1 N P T α , k , n c k , n Ψ k , n t + e t ,
where the regular cyclic component R t = Z θ r 2 H θ r 1 f t includes periodicities of different amplitude and duration, shifts, spikes etc. (27-day and solar-diurnal, seasonal variations, etc.).
The nonstationary (anomalous) component A t = k n = 1 N P T α , k , n c k , n Ψ k , n t contains different-scale anomalous features in the form of bursts, spikes of different amplitude and duration, etc., which characterize the occurrences of Forbush effects and GLE events and are determined by anomalous processes on the Sun and in the near-Earth space; e t is the noise.
The regular component of the NM data in the terms of the typical autoencoder neural network has the following form (see Formula (20)).
R ^ t = Z θ r 2 H θ r 1 f t + b 1 + b 2 ,  
where the upper index i is the layer number, H is the nonlinear activation function of the encoder (sigmoid), θ r 1 is the weight matrix of the network’s first layer, Z is the linear activation function of the decoder (inner layer), θ r 2 is the weight matrix of the second layer, b i are the shift vectors.
Taking into account the local natural factor effect, neural networks for each station were trained separately. The neural networks were trained using the data from high-latitudinal and polar ground stations of neutron monitors. The network architecture for each station was typical [28] and included two layers. Taking into account data on diurnal variation, the network output vector length was 1440 counts (minute sampling). The network hidden layer length was determined empirically and was equal to 720 counts. The neural networks were trained on the basis of backpropagation, taking into account the application of a sparsity regularizer [28]. The training samples were formed from the neutron monitor data for 2013–2015 during the periods of absence of anomalous processes in the near-Earth space (calm periods). The calm periods were determined by the space weather data [34,35]. Solar and diurnal cycles were also taken into account during the selection.
Table 1 presents the results of the estimates of the model’s regular component R ^ t adequacy (Formula (21)). Data for the calm periods were used, and the mean squared error (MSE) was estimated. The results in Table 1 show that the MSE values are close to zero. We should note that the MSE during high solar activity (SA) significantly exceeded that during low solar activity. The highest MSE was obtained from the South Pole station data that is likely to be associated with its geographical location (the South Pole station is polar and the rest of the stations are high-latitudinal). The results from Table 1 confirm the adequacy of the estimated regular component R ^ t of the model.
The model’s R ^ t adequacy was verified using the Ljung–Box Q -statistics and the Jarque–Bera test [29]. The Ljung–Box Q -statistic ( Q s t a t ) verifies the hypothesis on the joint equality of all the autocorrelations of the model residuals series up to the order m inclusive. The Jarque–Bera fit test ( J B ) was applied to verify the normality of the model residuals’ distribution. The test results are illustrated in Table 2. The results in Table 1 indicate that the estimated model R ^ t adequately describes the NM data’s regular time variation, and the errors are the white Gaussian noise (column 1, Table 2). We should note that during the anomalies, the network errors significantly increase (column 2, Table 2). During the research, we also tried to train the network on the data containing anomalies (column 4, Table 2). However, due to the great diversity of anomaly forms, it is impossible to obtain an adequate model and to estimate its parameters. This fact confirms the necessity of taking into account the HMTS nonstationary (anomalous) component.
The anomalous component A t parameters of the model were estimated using the NM data during the periods containing anomalous events in cosmic rays (Forbush effects) and observed during the disturbances in the near-Earth space (magnetic storms). The data were transformed by the discrete wavelet transform and threshold functions (see Formula (20)):
f t = k = 0 K n = 1 N P T α , k , n c k , n Ψ k , n t ,
where Ψ k , n t = 2 k 2 Ψ 2 k t n are basic wavelets, n ,   k N , c k , n = f t , Ψ k , n are the coefficients of the f t function’s decomposition into a series, P T α , k , n c k , n = c k , n , i f c k , n T α , k , n 0 , i f c k , n < T α , k , n is the threshold function, N is the signal length, K is the largest scale.
The thresholds T α , k , n = t 1 α 2 ,   M σ ^ k , n (Formula (22)) where determined with the significance level α = 0.05 (Neyman–Pearson criterion). The standard deviation σ ^ k , n = 1 M n = 1 M c k , n c k , n ¯ 2 was estimated in the time window of the length M = 1440 (determined by the solar-diurnal cycle). Orthonormal wavelets of Coiflets and Daubechies families were used as the basic wavelets. Table 3 shows the results of the estimates of the NM data approximation errors using different wavelets (Formula (13)). The estimates show that the least error was obtained when using the Coiflet 2 and Daubechies 2 wavelets. When selecting the wavelet basis, the carrier size and the wavelet smoothness order were taken into consideration besides the errors. It is known [27] that the carrier size determines the vicinity dimensions containing the boundary effect. The smoothness order characterizes the capability of a wavelet to detect high-order features. Taking into account the wavelet properties indicated above, the Coiflet 2 wavelet was determined as the best one.

4. Application of the HMTS to Detect Anomalies

As illustrated above (column 2, Table 2), data time variation is disturbed during anomaly occurrences, and, as a consequence, modeling errors increase. Thus, detection of anomalies in NM data can be based on data modeling using the autoencoder (Formula (13)) and on the determination of the periods of neural network error increases. Then, to estimate anomaly parameters, the nonlinear approximating scheme on the wavelet basis was constructed (Formula (14)). The anomaly intensity was estimated from wavelet coefficient absolute values as
E n = k = 0 K P T α , k , n c k , n ,
where K = 250 is the largest scale analyzed.
The control-flow chart realizing the proposed method is illustrated in Figure 2.
As an example, Figure 3 shows the results of the NM data modeling at Oulu station during an anomaly occurrence. Analysis of the results in Figure 3 shows the neural network error increase during the anomalous period that confirms the method’s efficiency.
The method’s efficiency was estimated using the neutron monitor data for 2013–2023. The data samples were formed for the periods containing the anomalies in cosmic ray variations (anomalous periods). The statistical modeling was performed. Both initial natural data and model data, formed on the basis of the natural data, were used in the estimates. The model data were formed using the wavelet packet operations [27] as follows:
(1) the data time series with the length of 1440 counts (diurnal variations) were formed, each count was equal to the corresponding median value of the neutron monitor’s initial data during calm periods;
(2) the obtained time series were decomposed into wavelet packets up to the 7th scale level (using Coiflet 1 wavelet; the 7th scale level was determined taking into account the series length);
(3) wavelet reconstruction of the time series’ smoothed components was carried out not taking into account the details ( X t r e n d );
(4) local features of triangle-pulse form and the Gaussian-modeled pulse, which had different amplitude and duration ( X a n o m ), were added to the obtained time series;
(5) additive correlated noise ( X n o i s e ) was added to the formed time series.
The graphs of anomaly detection probabilities, depending on anomaly amplitude (signal–noise ratio) and on anomaly duration, are illustrated in Figure 4. The analysis of the results shows that the detection probability for the anomaly, having the duration of 20 counts, is more than 80% for the signal–noise ratio of 1.5 (at the false alarm rate α = 0.05 ). When the anomaly is more than 60 counts, the detection probability is close to 90% for the signal–noise ratio of 1.3. The results show the high accuracy of the method.
Figure 5 shows the results of the detection of a low-amplitude anomaly (signal–noise ratio is 1.2) in the model signal against the background of correlated noise (pink noise was added). We should note that the anomaly in the noised model signal (Figure 5b) is not observed visually. The obtained result (Figure 5d) (see Formula (23)) confirms the high accuracy of the method for detection of anomalies including low-amplitude anomalies.
Figure 6 shows the results of the method (Figure 6d,e) on the model data with correlated (pink) noise (Figure 6b). In order to make the comparison, the results of application of a continuous wavelet transform (CWT) are illustrated (Figure 6c,d). The CWT results agree well with the method results. However, due to the correlated noise effect (including the solar-diurnal cycle), it is impossible to detect an anomaly in the data based on the CWT.
Figure 7 illustrates the results of the suggested method using the NM data from Oulu station (Figure 6a) and using the model data (Figure 7e). Based on the data [34], the Forbush effect was recorded at 05:59 UT on 16 July 2017. The magnetic storm [35] was recorded at Novosibirsk station at 06:00 UT on 17 July 2017. The period, containing anomalous changes, is marked by a red dashed-line oval. The results show that application of the suggested method makes it possible to detect anomalies both in natural (Figure 7c,d) and in model data (Figure 7g,h).
Figure 8 illustrates the results of the suggested method with the NM data from different stations. An example, containing the period of Forbush decrease in cosmic ray variations (Figure 8 on the left) and the period containing an anomalous decrease in cosmic ray intensity (Figure 8 on the right), is shown. The times of the magnetic storm commencements are indicated in Figure 8 by yellow lines. The time of the Forbush decrease according to the data [34] (global survey method was applied) is indicated in Figure 8 by a red line. The results in Figure 8 illustrate that the proposed method made it possible to detect the anomalies in cosmic ray variations timely. We should also note that the Forbush decrease occurrence was determined by the suggested method much earlier (Figure 8 on the left, marked by blue) compared to the global survey method (Figure 8 on the left, marked by a red line). Moreover, the comparison of the results from different stations shows the high sensitivity of the HMTS method and the ability to detect anomalies by the data from each separate station. That indicates high efficiency of the method and the possibility to minimize the number of engaged stations when detecting anomalies in cosmic ray variations. This is an important advantage of the HMTS compared to the global survey method.

5. Conclusions

The paper suggested the hybrid model of a nonstationary time series (HMTS) including the neural network component and the nonlinear adaptive approximating scheme. The HMTS parameters were estimated using the data of neutron monitor ground stations, recording cosmic ray intensities in the near-Earth space. To estimate the HMTS efficiency, model data were also used. Their structure corresponded to neutron monitor data. The estimates showed the high accuracy of the method in the problem of anomaly detection:
  • The HMTS regular component adequately describes neutron monitor time series during the periods of anomaly absence. The MSE model values are close to 0 and errors are the Gaussian noise.
  • During the anomalous periods, the neural network model errors increase that makes it possible to detect anomalies effectively.
  • The detection probability of the anomaly, lasting for 20 counts, is more than 80% for the signal–noise ratio of 1.5 (at the false alarm rate α = 0.05 ). When the anomaly lasts for more than 60 counts, the detection probability is close to 90% for the signal–noise ratio equal to 1.3.
Comparison of the HMTS with continuous wavelet transform confirmed the HMTS efficiency in the problems of data analysis and anomaly detection. In the result of the correlated noise effect (including the solar-diurnal cycle of the neutron monitor data), it was impossible to detect an anomaly in the data based on the continuous wavelet transform. Application of the method allowed us to detect the anomaly against the background of correlated noise.
The processing results for the data from different neutron monitor stations showed the high sensitivity of the HMTS. Application of the HMTS made it possible to detect the anomalies based on the data from each separate station compared to the widely applied method of global survey. This shows the possibility to minimize the number of engaged stations when detecting anomalies in cosmic ray variations. The continuous operation of neutron monitor stations is not always provided, thus, this result is important.
The developed HMTS can be recommended for anomaly detection in neutron monitor data, and it is effective even for a small number of operating stations. Moreover, the HMTS can be recommended for the modeling and analysis of nonstationary time series including regular components of nonlinear structures and local features of different form and duration.

Author Contributions

Conceptualization, O.M.; methodology, O.M. and B.M.; software, B.M.; validation, B.M.; formal analysis, O.M. and B.M.; writing—review and editing, O.M. and B.M.; project administration, O.M. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by IKIR FEB RAS State Task (subject registration No. 124012300245-2).

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors are grateful to the institutes that support the neutron monitor stations (http://www01.nmdb.eu/, http://spaceweather.izmiran.ru/) (accessed on 1 November 2023), the data of which were used in the work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kuznetsov, V.D. Space weather and risks of space activity. Space Tech. Technol. 2014, 3, 3–13. [Google Scholar]
  2. Schlickeiser, R. Cosmic Ray Astrophysics; Springer GmbH & Co., KG.: Berlin/Heidelberg, Germany, 2002; p. 519. [Google Scholar]
  3. Poluianov, S.V.; Usoskin, I.G.; Mishev, A.L.; Shea, M.A.; Smart, D.F. GLE and Sub-GLE Redefinition in the Light of High-Altitude Polar Neutron Monitors. Sol. Phys. 2017, 292, 176. [Google Scholar] [CrossRef]
  4. Belov, A.V.; Eroshenko, E.A.; Yanke, V.G.; Oleneva, V.A.; Abunina, M.A.; Abunin, A.A. Global survey method for the world network of neutron monitors. Geomagn. Aeron. 2018, 58, 356–372. [Google Scholar] [CrossRef]
  5. Australian Space Weather. Available online: www.sws.bom.gov.au (accessed on 1 November 2023).
  6. Mavromichalaki, H.; Paschalis, P.; Gerontidou, M.; Tezari, A.; Papailiou, M.C.; Lingri, D.; Livada, M.; Stassinakis, A.; Crosby, N.; Dierckxsens, M. The updated GLE alert system by ANEMOS. In Proceedings of the Hybrid Symposium on Cosmic Ray Studies with Neutron Detectors, Athens, Greece, 26–30 September 2022; pp. 97–103. [Google Scholar]
  7. Veselovsky, I.S.; Yakovchuk, O.S. On the forecast of solar proton events according to the data of ground-based neutron monitors. Astronomical Herald. Sol. Syst. Explor. 2011, 45, 365–375. [Google Scholar]
  8. Ramos, A.A.; Cheung, M.C.M.; Chifu, I.; Gafeira, R. Machine learning in solar physics. Living Rev. Sol. Phys. 2023, 20, 4. [Google Scholar] [CrossRef]
  9. Wang, S.; Wang, H.; Perdikaris, P. On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks. Comput. Methods Appl. Mech. Eng. 2021, 384, 113938. [Google Scholar] [CrossRef]
  10. Shaik, N.B.; Pedapati, S.R.; Othman, A.R.; Bingi, K.; Dzubir, F.A.A. An intelligent model to predict the life condition of crude oil pipelines using artificial neural networks. Neural Comput. Appl. 2021, 33, 14771–14792. [Google Scholar] [CrossRef]
  11. Shaik, N.B.; Pedapati, S.R.; Taqvi, S.A.A.; Othman, A.R.; Dzubir, F.A.A. A Feed-Forward Back Propagation Neural Network Approach to Predict the Life Condition of Crude Oil Pipeline. Processes 2020, 8, 661. [Google Scholar] [CrossRef]
  12. Shaik, N.B.; Jongkittinarukorn, K.; Benjapolakul, W.; Bingi, K. A novel neural network-based framework to estimate oil and gas pipelines life with missing input parameters. Sci. Rep. 2024, 14, 4511. [Google Scholar] [CrossRef]
  13. Chen, X.; Li, X.; Yu, S.; Lei, Y.; Li, N.; Yang, B. Dynamic Vision Enabled Contactless Cross-Domain Machine Fault Diagnosis with Neuromorphic Computing. IEEE/CAA J. Autom. Sin. 2024, 11, 788–790. [Google Scholar] [CrossRef]
  14. Koundal, P. Graph neural networks and application for cosmic-ray analysis. In Proceedings of the 5th International Workshop on Deep Learning in Computational Physics, Moscow, Russia, 28–29 June 2021. [Google Scholar]
  15. Kalashev, O.; Pshirkov, M.; Zotov, M. Identifying nearby sources of ultra-high-energy cosmic rays with deep learning. J. Cosmol. Astropart. Phys. 2020, 2020, 005. [Google Scholar] [CrossRef]
  16. Bojang, P.O.; Yang, T.-C.; Pham, Q.B.; Yu, P.-S. Linking Singular Spectrum Analysis and Machine Learning for Monthly Rainfall Forecasting. Appl. Sci. 2020, 10, 3224. [Google Scholar] [CrossRef]
  17. Lee, T. EMD and LSTM Hybrid Deep Learning Model for Predicting Sunspot Number Time Series with a Cyclic Pattern. Sol. Phys. 2020, 295, 82. [Google Scholar] [CrossRef]
  18. Gocheva-Ilieva, S.; Ivanov, A.; Kulina, H.; Stoimenova-Minova, M. Multi-Step Ahead Ex-Ante Forecasting of Air Pollutants Using Machine Learning. Mathematics 2023, 11, 1566. [Google Scholar] [CrossRef]
  19. Boussaada, Z.; Curea, O.; Remaci, A.; Camblong, H.; Mrabet Bellaaj, N. A Nonlinear Autoregressive Exogenous (NARX) Neural Network Model for the Prediction of the Daily Direct Solar Radiation. Energies 2018, 11, 620. [Google Scholar] [CrossRef]
  20. Tang, R.; Zeng, F.; Chen, Z.; Wang, J.-S.; Huang, C.-M.; Wu, Z. The Comparison of Predicting Storm-Time Ionospheric TEC by Three Methods: ARIMA, LSTM, and Seq2Seq. Atmosphere 2020, 11, 316. [Google Scholar] [CrossRef]
  21. Zhao, X.; Lu, X.; Quan, W.; Li, X.; Zhao, H.; Lin, G. An Effective Ionospheric TEC Predicting Approach Using EEMD-PE-Kmeans and Self-Attention LSTM. Neural Process Lett. 2023, 55, 9225–9245. [Google Scholar] [CrossRef]
  22. Mandrikova, O.; Fetisova, N.; Polozov, Y. Hybrid Model for Time Series of Complex Structure with ARIMA Components. Mathematics 2021, 9, 1122. [Google Scholar] [CrossRef]
  23. Yang, B.; Lei, Y.; Li, X.; Li, N. Targeted transfer learning through distribution barycenter medium for intelligent fault diagnosis of machines with data decentralization. Expert Syst. Appl. 2024, 244, 122997. [Google Scholar] [CrossRef]
  24. Mandrikova, O.; Mandrikova, B. Hybrid Method for Detecting Anomalies in Cosmic ray Variations Using Neural Networks Autoencoder. Symmetry 2022, 14, 744. [Google Scholar] [CrossRef]
  25. Mandrikova, O.; Mandrikova, B.; Rodomanskay, A. Method of Constructing a Nonlinear Approximating Scheme of a Complex Signal: Application Pattern Recognition. Mathematics 2021, 9, 737. [Google Scholar] [CrossRef]
  26. Mandrikova, O.; Mandrikova, B.; Esikov, O. Detection of Anomalies in Natural Complicated Data Structures Based on a Hybrid Approach. Mathematics 2023, 11, 2464. [Google Scholar] [CrossRef]
  27. Mallat, S. A Wavelet Tour of Signal Processing; Academic Press: London, UK, 1999; p. 620. [Google Scholar]
  28. Pattanayak, S. Pro Deep Learning with TensorFlow: A Mathematical Approach to Advanced Artificial Intelligence in Python; Apress: Bangalore, India, 2023; p. 398. [Google Scholar]
  29. Witte, R.S.; Witte, J.S. Statistics, 11th ed.; Wiley: New York, NY, USA, 2017; p. 496. [Google Scholar]
  30. Donoho, D.L.; Johnstone, I.M. Ideal spatial adaptation via wavelet shrinkage. Biometrika 1994, 81, 425–455. [Google Scholar] [CrossRef]
  31. Mandrikova, O.V. Intelligent methods for natural data analysis: Application to space weather. Comput. Optics. 2024, 48, 139–148. [Google Scholar] [CrossRef]
  32. Levin, B.R. Theoretical Foundations of Statistical Radio Engineering; Radio and Communications: Moscow, Russia, 1989. [Google Scholar]
  33. Real Time Data Base for the Measurements of High-Resolution Neutron Monitor. Available online: www.nmdb.eu (accessed on 1 November 2023).
  34. IZMIRAN Space Weather Forecast Center. Catalog of Forbush Effects and Interplanetary Disturbances. Available online: http://spaceweather.izmiran.ru/rus/fds2019.html (accessed on 11 November 2023).
  35. Institute of Applied Geophysics. Available online: http://ipg.geospace.ru/ (accessed on 11 October 2023).
Figure 1. Histogram of wavelet coefficients c k , n for k = 4 , constructed on the basis of neutron monitor data, which do not contain local features.
Figure 1. Histogram of wavelet coefficients c k , n for k = 4 , constructed on the basis of neutron monitor data, which do not contain local features.
Mathematics 12 01079 g001
Figure 2. The control-flow chart for anomaly detection.
Figure 2. The control-flow chart for anomaly detection.
Mathematics 12 01079 g002
Figure 3. Processing results: (a) NM data are in blue, NN error is in orange; (b) smoothed | ε t | s m .
Figure 3. Processing results: (a) NM data are in blue, NN error is in orange; (b) smoothed | ε t | s m .
Mathematics 12 01079 g003
Figure 4. Graphs of the anomaly detection probabilities.
Figure 4. Graphs of the anomaly detection probabilities.
Mathematics 12 01079 g004
Figure 5. Processing results: (a) trend of the model signal with anomaly; (b) model signal; (c,d) results of the method with model data.
Figure 5. Processing results: (a) trend of the model signal with anomaly; (b) model signal; (c,d) results of the method with model data.
Mathematics 12 01079 g005
Figure 6. Processing results: (a) model signal; (b,c) results of the CWT application; (d,e) results of the suggested method.
Figure 6. Processing results: (a) model signal; (b,c) results of the CWT application; (d,e) results of the suggested method.
Mathematics 12 01079 g006
Figure 7. Processing results: (a) NM signal at Oulu station; (b) trend of NM signal; (c,d) results of the method with NM data; (e) the model signal; (f) trend of the model signal; (g,h) results of the method with the model data.
Figure 7. Processing results: (a) NM signal at Oulu station; (b) trend of NM signal; (c,d) results of the method with NM data; (e) the model signal; (f) trend of the model signal; (g,h) results of the method with the model data.
Mathematics 12 01079 g007
Figure 8. Processing results: (a,h) proton density; (b,i) solar wind speed; (c,j) Bz (GSM) component of the interplanetary magnetic field; (d,k) geomagnetic activity Dst-index; (e,l) NM data; (f,g,m,n) results of the method with NM data.
Figure 8. Processing results: (a,h) proton density; (b,i) solar wind speed; (c,j) Bz (GSM) component of the interplanetary magnetic field; (d,k) geomagnetic activity Dst-index; (e,l) NM data; (f,g,m,n) results of the method with NM data.
Mathematics 12 01079 g008
Table 1. Estimate of the HMTS regular component R ^ t adequacy.
Table 1. Estimate of the HMTS regular component R ^ t adequacy.
1 N t = 1 , N ¯ f t R ^ t 2 Mean Squared Error
High SA (2013–2015)Low SA (2018–2020)
Inuvik st.0.204180.17003
Nain st.0.172310.15162
Tule st.0.202120.16181
Oulu st.0.193230.13104
South Pole st.0.242450.19006
Table 2. Results of the neural network performance.
Table 2. Results of the neural network performance.
Results of a Network Trained on Data without AnomaliesResults of a Network Trained on Data without AnomaliesResults of a Network Trained on Data with Anomalies
NN inputData without anomalies
J B = 4.44 ;
Q s t a t = 54.75 ;
Data with anomalies
J B = 104.61 ;
Q s t a t = 3.85 × 10 4
Data with anomalies
J B = 104.61 ;
Q s t a t = 3.85 × 10 4
NN output J B = 2.77 ;
Q s t a t = 3.2 × 10 3 ;
J B = 209.95 ;
Q s t a t = 1.428 × 10 4
J B = 225.86 ;
Q s t a t = 2.11 × 10 4
NN errors J B = 2.33 ;
Q s t a t = 30.23
J B = 232.45 ;
Q s t a t = 8.836 × 10 3
J B = 68.64 ;
Q s t a t = 9.25 × 10 4
Table 3. Estimates of the data approximation error using different wavelets.
Table 3. Estimates of the data approximation error using different wavelets.
k , n σ k , n 2 / f Wavelets
coif_1coif_2coif_3db_1db_2
st. InuvikHigh SA0.01750.01700.01760.01780.0174
Low SA0.01770.01750.01740.01780.0177
st. OuluHigh SA0.01890.01870.01880.01900.0189
Low SA0.02020.02020.02010.02030.0200
Carrier size6121824
Smoothness order24612
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mandrikova, O.; Mandrikova, B. Hybrid Model of Natural Time Series with Neural Network Component and Adaptive Nonlinear Scheme: Application for Anomaly Detection. Mathematics 2024, 12, 1079. https://doi.org/10.3390/math12071079

AMA Style

Mandrikova O, Mandrikova B. Hybrid Model of Natural Time Series with Neural Network Component and Adaptive Nonlinear Scheme: Application for Anomaly Detection. Mathematics. 2024; 12(7):1079. https://doi.org/10.3390/math12071079

Chicago/Turabian Style

Mandrikova, Oksana, and Bogdana Mandrikova. 2024. "Hybrid Model of Natural Time Series with Neural Network Component and Adaptive Nonlinear Scheme: Application for Anomaly Detection" Mathematics 12, no. 7: 1079. https://doi.org/10.3390/math12071079

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop