Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets

Musaev, Alexander; Grigoriev, Dmitry

doi:10.3390/jrfm18060296

Open AccessArticle

Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets

by

Alexander Musaev

¹ and

Dmitry Grigoriev

^2,*

¹

St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, 199178 St. Petersburg, Russia

²

Center of Econometrics and Business Analytics (CEBA), St. Petersburg State University, 199034 St. Petersburg, Russia

^*

Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2025, 18(6), 296; https://doi.org/10.3390/jrfm18060296

Submission received: 12 April 2025 / Revised: 16 May 2025 / Accepted: 22 May 2025 / Published: 29 May 2025

(This article belongs to the Special Issue Innovative Approaches in Econometrics, Financial Market and Business Analytics)

Download

Browse Figures

Versions Notes

Abstract

Financial time series in volatile markets often exhibit non-stationary behavior and signatures of stochastic chaos, challenging traditional forecasting methods based on stationarity assumptions. In this paper, we introduce a novel multi-expert forecasting system (MES) that leverages ensemble machine learning techniques—including bagging, boosting, and stacking—to enhance prediction accuracy and support robust risk management decisions. The proposed framework integrates diverse “weak learner” models, ranging from linear extrapolation and multidimensional regression to sentiment-based text analytics, into a unified decision-making architecture. Each expert is designed to capture distinct aspects of the underlying market dynamics, while the supervisory module aggregates their outputs using adaptive weighting schemes that account for evolving error characteristics. Empirical evaluations using high-frequency currency data, notably for the EUR/USD pair, demonstrate that the ensemble approach significantly improves forecast reliability, as evidenced by higher winning probabilities and better net trading results compared to individual forecasting models. These findings contribute both to the theoretical understanding of ensemble forecasting under chaotic market conditions and to its practical application in financial risk management, offering a reproducible methodology for managing uncertainty in highly dynamic environments.

Keywords:

non-stationary time series; stochastic chaos; ensemble machine learning; forecast aggregation; multi-expert forecasting system

1. Introduction

Financial time series are characterized by high volatility, non-stationarity, structural breaks and abrupt regime shifts. Foundational work on ARCH models has demonstrated the importance of capturing time-varying volatility (Chou & Kroner, 1992), while regime-switching frameworks allow one to model sudden market transitions (Hamilton, 2010). Classical forecasting techniques—ARMA, ARIMA and GARCH—remain cornerstones of empirical econometrics: ARMA presumes covariance stationarity following mean adjustment; ARIMA absorbs low-frequency trends via differencing; and GARCH models the conditional second moments under a (weakly) stationary conditional mean (Box & Jenkins, 1970; Engle, 1982; Chou & Kroner, 1992).

In financial decision science it is crucial to distinguish risk and uncertainty. Risk refers to situations where outcome probabilities are known (allowing quantification), whereas uncertainty—in the Knightian sense—implies that both outcomes and their distributions are unknown (Knight, 1921). Traditional VaR-style metrics address only risk; residual uncertainty must be mitigated through structural model diversity. Accordingly, our MES delivers probabilistic outputs that quantify risk, while the heterogeneity of its experts hedges against model uncertainty that cannot be captured by any single specification.

A further complication is the weak-form Efficient-Markets Hypothesis (EMH) for liquid currency pairs: one-minute EUR/USD returns approximate a martingale with near-zero linear predictability (Fama, 1970). Purely linear models frequently fail to capture the rich nonlinear and chaotic dynamics observed in financial markets. Granger (2008) highlights the limitations of traditional single-model approaches and advocates the use of ensemble methods to improve forecast accuracy in economics. Zhou’s comprehensive treatment of ensemble learning algorithms (Zhou, 2025) underpins many of the strategies we employ in our multi-expert system (MES). Moreover, Avramov and Chordia (2006) show that adaptive and ensemble techniques can compensate for model misspecification and significantly enhance predictive power under conditions of structural instability.

Finally, high-frequency data (HFD) exacerbate microstructure noise and quote-bounce effects that spoil the i.i.d. error assumptions of low-frequency econometrics (Zhang et al., 2005). Our empirical design therefore aggregates 1 min ticks into minute-bars, applies variance-stabilizing transformations, and uses heteroscedastic robust error estimates throughout (see Section 2.2). These precautions, combined with the ensemble architecture, allow us to confront the dual challenge of risk and residual uncertainty in a market that is close to weak-form efficient yet demonstrably chaotic.

In this context, recent advances in deep learning–augmented ensembles have shown potential. Han et al. (2024) propose a hybrid framework that integrates GARCH-type models with LSTM architectures to model volatility clusters and temporal dependencies in non-stationary series. Similarly, Li et al. (2022) apply ensemble deep learning for metro passenger flows, yet their methodology is equally relevant for high-frequency financial data, where concept drift and seasonality pose significant barriers to generalization.

Other works (A. A. Musaev et al., 2021; A. Musaev & Grigoriev, 2022) emphasize the importance of accounting for structural breaks and abrupt changes through model adaptation and segmentation. This is aligned with the findings of (Cretarola et al., 2020), who incorporate attention-based mechanisms to track shifting patterns in Bitcoin prices, demonstrating that ignoring non-stationarity significantly deteriorates forecast quality.

Moreover, Nguyen and Thu (Nguyen & Thu, 2018) explore support vector machines (SVMs) for forex prediction and highlight the algorithm’s ability to capture nonlinearities. However, they also acknowledge limitations in adapting to fast-changing dynamics, thus making a case for multi-model aggregation rather than relying on a single ML architecture.

Contemporary reviews and meta-analyses (Ayitey et al., 2023) further reveal a shift toward multi-expert and hybrid modeling approaches, especially in volatile markets where traditional methods frequently fail. These reviews advocate for combining statistical foundations with ML-based learning and soft computing techniques (e.g., fuzzy logic, evolutionary learning), especially when operating in highly noisy and chaotic financial environments.

A prime example of a stochastic–chaotic process in action is evident in the fluctuation of the EUR/USD currency pair, for which several empirical studies—employing correlation dimension, largest Lyapunov exponent and BDS–surrogate tests—have documented low-dimensional deterministic chaos (Bildirici & Sonustun, 2019). Although the methodological framework is asset-agnostic, the empirical evaluation in this study is intentionally restricted to the EUR/USD currency pair over a fixed three-year window. High-frequency, one-minute mid-quotes for EUR/USD covering 1 January 2020–31 December 2022 were downloaded from Finam.ru (accessed 12 April 2025). Because the foreign-exchange market operates on a 24 h basis, five days per week, the dataset includes every consecutive minute-stamp across the Asian, European, and North American sessions. Consequently, any generalization to other financial instruments must be confirmed by dedicated case studies and is left for future research.

Figure 1 illustrates the changes in this quotation over a three-year period, with a one-minute discretization. The graph clearly shows that the evolution of quotations is an oscillatory, non-periodic random process, characterized by numerous local trends and distinct signs of self-similarity (Mandelbrot et al., 2004; Crilly et al., 2012). In essence, there is ample reason to posit that the process under scrutiny is an embodiment of stochastic chaos, a notion that has been investigated in the context of econometric data through the lens of market chaos theory (Peters, 1996; A. Musaev et al., 2023a; Gregory Williams & Williams, 2004).

Prices of actively traded instruments are frequently modeled as martingales or, at best, as I(1) processes whose first differences are near-white noise (Fama, 1970). From a risk-management perspective, this implies that shocks have permanent effects on the level series and that the unconditional variance of P_t grows without bound. Forecasting models must therefore (i) operate on transformed data (returns, log-returns), (ii) account for possible cointegration with related macro variables, and (iii) adapt quickly to structural breaks. The ensemble architecture proposed below addresses points (i)–(iii) by combining differenced linear predictors with nonlinear and sentiment-based experts and by dynamically re-weighting them in response to recent forecasting errors.

In light of these considerations, this study introduces a multi-expert forecasting system (MES) that leverages ensemble machine learning algorithms to address the forecasting challenges posed by non-stationary, chaotic financial processes. The EUR/USD currency pair, with its well-documented volatility and complexity (Peters, 1996; Boccaletti et al., 2000), serves as a representative example in our analysis. By integrating diverse forecasting models—each capturing different aspects of market behavior—into a unified decision framework, our proposed MES aims to enhance forecast reliability and thereby improve the quality of risk management decisions.

Despite valuable advances in ensemble forecasting—combining linear, nonlinear and sentiment-based models (e.g., Han et al., 2024; Cretarola et al., 2020)—existing work typically treats bagging, boosting and stacking in isolation and focuses almost exclusively on day-ahead horizons. Moreover, these approaches often lack a supervisory module capable of dynamically adapting expert weights in response to evolving error characteristics under chaotic non-stationarity. As a result, the relative merits of different ensemble strategies across both intraday (one-hour) and day-ahead (24 h) forecasting remain under-examined.

This study fills that gap by developing a two-level MES that unifies diverse weak learners—including polynomial extrapolators, multidimensional regressors and a sentiment expert—within an adaptive stacking framework and by empirically comparing bagging, boosting and stacking on high-frequency EUR/USD data over both one-hour and one-day horizons.

The primary objectives of this research are the following:

i: Elucidate the specific challenges associated with forecasting in chaotic financial environments;
ii: Develop a robust framework for constructing a multi-expert system using ensemble techniques;
iii: Empirically demonstrate the enhanced performance of the proposed system relative to traditional forecasting approaches with a single agent.

Ultimately, this work contributes both to the theoretical understanding of ensemble forecasting in non-stationary settings and to its practical application within financial risk management, offering actionable insights for practitioners operating in volatile markets.

2. Methods

2.1. Observation Model and Problem Statement

A central challenge in forecasting financial and economic time series is their intrinsic chaotic and non-stationary behavior. In many practical scenarios, particularly under volatile market conditions, the observed data cannot be satisfactorily modeled by classical stationary processes. To formally capture this complexity, we represent the observation series

y_{k}

as an additive model based on Wold’s decomposition (Wold, 1938),

y_{k} = x_{k} + v_{k}, k = 1, \dots, N,

(1)

where

x_{k}

denotes the unknown deterministic “system” component that encapsulates the regular underlying dynamics of the process, and

v_{k}

represents the stochastic noise component associated with measurement errors or external disturbances.

Traditionally, it is assumed that the system component

x_{k}

is sufficiently smooth—amenable to representations by polynomial or trigonometric series. However, in environments characterized by instability and rapid regime shifts, the dynamics of

x_{k}

often follow an oscillatory, non-periodic pattern typical of deterministic chaos. In these settings, the smoothness assumption is frequently violated.

In conventional approaches, the noise component

v_{k}

is modeled as Gaussian white noise. Yet, empirical evidence indicates that for many financial time series, the noise is non-stationary and exhibits heteroscedasticity. A more realistic representation involves modeling

v_{k}

as a mixture process that converges weakly to a Huber-type Gaussian model,

v_{k} \sim (1 - ε) N (0, σ_{1}^{2}) + ε N (0, σ_{2}^{2}), ε \in (0, 1), σ_{2} > > σ_{1},

where

ε

is a fouling coefficient that quantifies the degree of contamination (Huber, 1981). Moreover, for heteroscedastic processes, the noise variance is time-dependent, denoted as

σ = σ (t)

.

An illustration of the non-stationary noise component, generated by subtracting a smoothed estimate of

x_{k}

(using a transfer coefficient

α = 0.05

) from the observation series, is provided in Figure 2.

The transfer coefficient α controls the aggressiveness of the exponential filter: smaller α values yield stronger smoothing and a longer effective time constant T ≈ 1/α. At α = 0.05, the filter effectively averages over the last 20 observations, reducing noise but introducing a lag of approximately τ = (1/α − 1) steps.

The presence of heteroscedasticity and numerous abnormal observations undermines the applicability of traditional identification and forecasting schemes. Even adaptive algorithms are rendered less effective, as they require considerable time to recalibrate tracking contours, thus failing to capture rapid and chaotic changes in the process dynamics.

These challenges motivate the transition to multi-expert systems (MES) (A. Musaev & Grigoriev, 2022). Rather than relying on a singular forecasting model, MES employs a diverse group of software experts (SEs), each characterized by distinct forecasting methodologies and parameterizations. Formally, each expert is defined by a mapping,

E x p (S, P) : (Y_{(k - T, k - 1)}, y_{k}) \to {\tilde{x}}_{k + τ}, k = 1, \dots, n,

(2)

where

Y_{(k - T, k - 1)} = {y_{k - T}, y_{k - T + 1}, \dots, y_{k - 1}}

is the array of historical observations used for training,

y_{k}

is the current observation, and

{\tilde{x}}_{k + τ}

represents the forecast over a horizon

τ

. By operating M experts simultaneously or sequentially, a set of candidate forecasts is generated,

{\tilde{X}}_{k + τ} = {{\tilde{x}}_{k + τ} (i), i = 1, \dots, M}

, at each forecasting step

k = 1, \dots, n

.

The forecasting task then consists of optimally combining these candidate forecasts. In particular, the goal is to determine the forecast

{\tilde{x}}_{k}^{*}

that minimizes a chosen efficiency indicator

Q ({\tilde{x}}_{k}, x_{k})

,

{\tilde{x}}_{k}^{*} = e x t r Q ({\tilde{x}}_{k}, x_{k}), k = 1, \dots, n,

(3)

where “extr” denotes the extraction of an optimal value according to the performance criterion. For linear computational schemes, the criterion is often defined as the mean squared error (MSE),

Q ({\tilde{x}}_{k}, x_{k}) = \frac{1}{n} \sum_{k = 1}^{n} {({\tilde{x}}_{k} - x_{k})}^{2},

(4)

or alternatively, the mean absolute deviation (MAD),

Q ({\tilde{x}}_{k}, x_{k}) = \frac{1}{n} \sum_{k = 1}^{n} | {\tilde{x}}_{k} - x_{k} | .

(5)

Thus, the optimal forecasting problem is formulated as follows:

{\tilde{x}}_{k}^{*} = a r g m i n Q ({\tilde{x}}_{k}, x_{k})

(6)

It is important to note that the forecast itself is not an end but a tool to support superior management decisions. An inherent benefit of employing MES lies in its ability to adapt to the variability in individual expert performance; whereas a singular “ideal” expert is unlikely to exist, the collective intelligence derived from diverse perspectives enhances overall forecast stability and accuracy.

In summary, the observation model encapsulated by Equations (1)–(6) not only reflects the inherent challenges posed by non-stationary, chaotic dynamics but also establishes the motivation for developing a multi-expert ensemble framework. This framework is intended to bridge the gap between theoretical forecasting improvements and practical risk management applications in volatile financial environments.

Throughout the paper the symbol τ denotes the forecast horizon measured in one-minute bars. Two practical horizons are analyzed as follows: (i) τ = 1 440 (~24 h) for the day-ahead trading scenario and (ii) τ = 60 (~1 h) for intraday risk-control experiments. Unless stated otherwise, results in Section 3.3, Section 3.4 and Section 3.5 refer to the latter setting.

2.2. Problem of Smoothing Non-Stationary Observation Series

Extracting the underlying system component

x_{k}

from the additive mixture (1) is a challenging task, particularly when dealing with non-stationary and chaotic time series. Conventional dynamic smoothing methods, although necessary, tend to introduce significant bias in the estimated signal. Figure 3 illustrates the effects of filtering the observation series using an exponential smoothing filter (Gardner, 1985) defined as follows:

{\overset{⌢}{x}}_{k} = α y (t) + (1 - α) {\overset{⌢}{x}}_{k - 1} = {\overset{⌢}{x}}_{k - 1} + α (y_{k} - {\overset{⌢}{x}}_{k - 1}), k = 1, \dots, n,

where two different transfer coefficients are considered: a relatively mild smoothing with

α_{1} = 0.05

and a stronger smoothing with

α_{2} = 0.0075

.

With the milder setting (

α_{1} = 0.05

), the filtered series retains numerous stochastic local inflections. This property, while preserving much of the original dynamic detail, can lead to an increased incidence of type II errors—so-called “false alarms” in change detection—where the system erroneously signals a trend change. Conversely, a smaller coefficient (

α_{2} = 0.0075

) produces a heavily smoothed signal in which the identified breakpoints tend to correspond to significant trend shifts; however, this comes at the cost of an inherent delay due to the offset in the estimated values. Such delays can result in untimely decisions, thereby reducing overall management efficiency.

This trade-off underscores the need for the development of advanced sequential filtering algorithms that can achieve an optimal level of smoothing while minimizing bias and delay. Several alternative approaches have been proposed in the literature (e.g., A. Musaev et al., 2023b) and serve as the basis for the implementation of various software expert (SE) variants within our multi-expert system framework. Below, we briefly describe three such approaches.

2.3. A Polynomial Extrapolator

One of the simplest software expert models is based on polynomial extrapolation. In this approach, a sliding observation window,

Y_{(k - L, k)} = (y_{k - L}, y_{k - L + 1}, \dots, y_{k}),

(7)

is used to capture the local dynamics of the process

y_{k} .

The forecasting model is given by a power polynomial of the following form:

x_{p} (t_{j}) = \sum_{i = 0}^{p} a_{i} t^{j}, j = 1, \dots, L,

(8)

which acts as a deterministic extrapolator. The model parameters are determined by a vector of optional parameters

P = (L, α, p, τ)

, where L is the size of the sliding window, α is the transfer coefficient used in smoothing, p is the polynomial degree, and τ is the forecasting horizon. According to the Weierstrass approximation theorem (Stone, 1948; Khanh & Quan, 2019), any continuous function can be uniformly approximated by polynomials; however, the rapid dynamics of chaotic processes necessitate the construction of a polynomial model on a window immediately preceding the forecast interval

(k + 1, \dots, k + τ)

, with the parameters

(α_{0}, α_{1}, \dots, α_{p})

obtained via least squares minimization,

α^{*} : m i n \sum_{j = 1}^{L} {(y_{j} - x_{p} (t_{j}))}^{2},

(9)

is then produced by substituting the forecast time into the polynomial,

{\tilde{x}}_{k + c} = x_{p} (k + c), c = 1, \dots, τ .

(10)

This approach rests on the inertia hypothesis of the observed process. However, as demonstrated in (A. Musaev et al., 2023c), many financial time series lack clear inertial behavior, limiting the applicability of this method. Moreover, increasing the polynomial degree beyond 3–4 can lead to degeneracy in the system of normal equations and may require additional regularization.

2.4. Multidimensional Regression Forecast

For processes exhibiting correlations among multiple variables, a multidimensional regression predictor can offer an effective alternative (Lauritzen, 2023; A. Musaev et al., 2023a, 2023d). The observation model in this context is expressed in the following vector form:

Y_{k} = X_{k} + V_{k}

(11)

where

Y_{k} = {y_{k, j} = x_{k j} + v_{k, j}, j = 1, \dots, m}, k = 1, \dots, n

.

A training data matrix is constructed over a sliding window of length L, where the regressors are time-shifted by the forecasting interval

τ

,

Y_{(L, m)} = {y_{k - L - τ, j}, \dots, y_{k - τ, j}, k = 1, \dots, L, j = 1, \dots, m} .

(12)

In the case of forecasting a single process using m-1 correlated processes, the regression model is formulated as follows:

Y_{k} = C_{k} Y_{(k - L, k)} + V_{k},

(13)

where

C_{k} = {(c_{1}, \dots, c_{m})}_{k}^{T}

represents the coefficients defining a hyperplane in m-dimensional space, and

Y_{(k - L, k)}

is composed of shifted observations,

Y_{(k - L, k)} = [\begin{matrix} 1 & y_{1, k - L} & \dots & y_{m, k - L} \\ 1 & y_{1 . k - L + 1} & \dots & y_{m . k - L + 1} \\ \dots & \dots & \dots & \dots \\ 1 & y_{1, k} & \dots & y_{m, k} \end{matrix}], V = {(v_{1}, \dots, v_{m})}^{T},

(14)

Standard regression assumptions apply, such as zero mean of the noise components and independence among them,

E {V_{j}} = 0; E {Y_{k, j}, V_{k j}} = 0, c o v (V_{i}, V_{j}) =, {\begin{matrix} 0, i \neq j \\ σ^{2}, i = j \end{matrix}, \forall i, j = 1, \dots, m .

(15)

By minimizing the sum of squared errors

V^{T} V = {(Y_{k} - Y_{(k - L, k)} C_{k})}^{T} (Y_{k} - Y_{(k - L, k)} C_{k})

, we come to a system of normal equations, the solution of which is determined by a well-known relationship,

{\overset{⌢}{C}}_{k} = {(Y_{(k - L, k)}^{T} Y_{(k - L, k)})}^{- 1} Y_{(k - L, k)} Y_{k}

. The time shift by τ ensures that the obtained regression coefficients

{\overset{⌢}{C}}_{k}

serve as a suitable linear predictive operator for the SE.

2.5. Precedent Forecast

The precedent forecasting approach is based on the assumption that similar historical patterns yield similar future outcomes—a concept aligned with human cognitive judgment of similarity (Fukunaga, 2013). In practice, this method involves scanning the historical data using a sliding window,

Y_{(i, i + L - 1)} = (y_{i}, \dots, y_{i + L - 1}), i = 1, \dots, N_{S} - L,

(16)

where

N_{S}

denotes the size of the historical training set. During the scanning process, the algorithm identifies

M_{a}

analog windows (denoted by their indices

i_{1}^{*},, \dots, i_{M_{a}}^{*}

) that minimize a chosen similarity metric—typically the root mean square error (4) or the mean absolute deviation (5)—when compared to the current observation window,

Y_{(k, k - L + 1)} = (y_{k - L + 1}, \dots, y_{k}), k = L + 1, \dots, n .

The scanning window with the smallest metric value is considered as a precedent. As measures of similarity between windows

Y_{k + L + 1, k}

and

Y_{i, i + L - 1}, i = 1, \dots, N_{S} - L

in machine learning tasks, distances of the type of root mean square deviation (4) or mean absolute deviations (5) are usually used.

The following remarks have been made:

When computing metrics, it is recommended to use centered data values in observation windows (15). Furthermore, in instances of significant heteroscedasticity of the observed process, it is advisable to normalize the data in the state and scanning windows by estimates of their standard deviations (std).
The size of the observation window L is typically selected based on the minimization of the total square error, which encompasses the sum of the variance and the square of the bias induced by dynamic errors. However, this approach is not suitable for chaotic environments due to the non-stationarity of observation series. In such cases, the size of the state window L should be viewed as a parameter to be refined during the process of model adaptation.
For certain tasks aimed at estimating the local trend, the difference between the coefficients of linear approximation $α_{1}$ of observation series in the current state and scanning windows can serve as a measure of similarity,

$ρ_{i} [y_{C}, y_{S}] = α_{1} (y_{c} (k)) - α_{1} (y_{S} (i)), i = 1, \dots, N_{S} - L .$

As a forecast, as already noted, the smoothed aftermath is used, following the precedent window

{\tilde{x}}_{k + 1, k + τ} = x_{i^{*} + L + 1, i^{*} + L + τ}

, i.e., immediately after the scanning window with the minimum value of the chosen metric,

i * : ρ_{i^{*}} [y_{C}, y_{S}] = \min, \forall i = 1, \dots, N_{S} - L .

(17)

The transition to the forecast of the systemic component (1)

{\tilde{y}}_{k + 1, k + τ} \to {\tilde{x}}_{k + 1, k + τ}

can be made by sequentially smoothing the result of the forecast, for example, using an exponential filter.

2.6. Limits of Traditional Forecasting Under Unit Root Non-Stationarity and Chaotic Regimes

Empirical log price series for liquid FX pairs such as EUR/USD are well documented to possess a unit root: the level P_t is integrated of order one (I(1)), whereas the first difference ΔP_t (the return) is usually weakly stationary with near-zero linear autocorrelation, in line with the weak-form Efficient-Markets Hypothesis (Fama, 1970; Phillips & Perron, 1988).

The random walk representation P_t = P_t−1 + u_t immediately induces high sample autocorrelation in the level series, although the shocks u_t themselves are serially uncorrelated. Thus, non-stationarity stems from the permanent impact of shocks, not from the mere fact that P_t is a stochastic process. Ignoring this distinction leads to spurious regressions and invalidates estimators whose consistency relies on repeated samples from an identical distribution. Any forecasting scheme that does not difference, cointegrate, or otherwise neutralize the unit root implicitly assumes repeatability of mean and variance and is therefore prone to systematic bias.

Beyond unit root behavior, high-frequency currency data exhibit signatures of deterministic chaos: small perturbations in initial conditions produce widely diverging paths (Bildirici & Sonustun, 2019). Two data windows that appear almost identical under Euclidean metrics can evolve in strikingly different ways a few minutes later. This lack of local repeatability explains why purely similarity-based techniques from classical technical analysis often fail and motivates the use of an ensemble of weak, heterogenous learners whose weights adapt to local error dynamics.

The crux of this issue lies in the failure of the essential assumption of repeatability under identical conditions. In chaotic environments, even slight random fluctuations can instigate radical shifts in the dynamic behavior of the series. This volatility implies that data segments, which appear nearly identical when assessed via standard similarity metrics, may nonetheless diverge dramatically in their subsequent evolution. Such unpredictability partly explains the limited effectiveness of traditional technical analysis (Iskrich & Grigoriev, 2017; Escher, 2019; Parameswaran, 2022) in guiding trading operations in capital markets.

To quantify the performance of individual Software Expert (SE) predictors in trading tasks, their effectiveness is defined as the ratio between the number of successful forecasts

m_{i}^{+}

(i.e., those where the prediction’s direction aligns with the actual market move) and the total number of forecasts m_i for the i-th predictor,

h_{i}^{+} = m_{i}^{+} / m_{i}, i = 1, \dots, M_{e} .

(18)

A more precise evaluation of trading efficiency (Yusupov et al., 2021) is obtained by estimating the net gain realized from implementing a management strategy S. This gain is defined as follows:

R (S) = \sum_{j = 1}^{m} Δ y_{j} = \sum_{j = 1}^{m} y_{j} (k_{c l o s e}) - y_{j} (k_{o p e n}) .

(19)

Here,

Δ y_{j} = y_{j} (k_{c l o s e}) - y_{j} (k_{o p e n})

,

j = 1, \dots, m

, represents the outcome of the j-th trade, calculated as the difference between the asset value at the closing

(k_{c l o s e})

and opening (

k_{o p e n}

) of the position, with m being the total number of operations performed. The sign of the difference

Δ y_{j}, j = 1, \dots, m

is determined by the condition of coincidence or discrepancy of the directions of the forecast and the real dynamics of the process on the forecasting interval.

To benchmark the proposed experts, we also employ the classical persistence model. For price levels, this is the random walk assumption

{\overset{⌢}{P}}_{t + τ} = P_{t}

, while for log-returns it degenerates to

{\overset{⌢}{r}}_{t + 1} = 0

, i.e., the best mean-square forecast is “no change”. In the trading framework that follows, a directional signal is therefore taken as follows:

d_{t + 1}^{R W} = s i g n (P_{t} - P_{t - 1}),

which corresponds to the directional-persistence rule, “tomorrow will move in the same direction as today”. Because this rule needs no parameters and uses only the last observation, it represents the minimal-information baseline recommended by (Armstrong, 2001) and is commonly required by forecasting guidelines.

3. Experiments

From the three-year master dataset, we extracted a 100-day estimation window for parameter fitting. For graphical illustration in Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8, we further selected a 10-day subset that spans both trending and range-bound regimes. We chose this particular subset because it covers diverse market situations—like clear trends and periods where the market moves sideways—which makes it ideal for thoroughly testing our strategies.

3.1. A Software Expert Based on Linear Extrapolation

To establish a baseline for forecasting in daily Forex trading, we implement a software expert that employs a first-order linear extrapolation. In this approach, the training phase uses a sliding observation window of duration L = 1440 min (i.e., one full calendar day). Within each window, the observed time series is modeled by a first-order polynomial,

x_{j} = a_{0} + a_{1} j, j = 1, \dots, L,

where the coefficients

(a_{0}, a_{1})

are estimated using the least squares minimization (LSM) as described in (9). The forecast for the next day (that is, 1440 one-minute observations)—spanning a forecast horizon τ = L—is produced by directly substituting the corresponding time indices into the fitted polynomial, that is,

{\tilde{x}}_{k + c} = x_{p} (k + c), c = 1, \dots, τ

.

As an initial diagnostic, we applied the Augmented Dickey–Fuller test to the level series; the test failed to reject the unit root null (ADF p-value = 0.74), confirming the I(1) behavior discussed in Section 2.6. Consequently, all forecasts were generated either in the log-return domain or with model specifications that explicitly accommodate integrated processes. The length of the rolling estimation window L was re-optimized each day so that it constituted the shortest span yielding coefficient estimates that remained statistically stable at the conventional 5% significance threshold.

Figure 4 illustrates this process using centered EUR/USD quotes over a 10-day period. In the figure, the boundaries of the daily observation intervals are marked by red dots. The blue solid lines represent the linear approximations obtained on each day’s data, while the red lines depict the linear forecasts generated by naturally extending these trends into the next day. It is evident from Figure 4 that only three out of nine observation intervals resulted in forecasts whose trend directions were aligned with the actual market movement, yielding an estimated probability of favorable outcomes

F^{+} \approx 0.33

(see also metric (18)).

Recognizing that the 10-day interval was selected primarily for illustrative clarity, we extended the experiment to a 100-day observation period to enhance statistical reliability. In this case, the success rate improved only modestly to

F^{+} \approx 0.40

, while the cumulative performance metric (as defined in (19)) indicated a net loss of −210 points. These results clearly highlight that the simple linear extrapolator functions as a “weak learner” in the machine learning context—it fails to independently generate forecasts of adequate quality for effective management decisions under chaotic market conditions.

Motivated by this limitation, the remainder of this study investigates the potential of MES technology to integrate diverse forecasting approaches and thereby enhance overall prediction accuracy, as outlined in (A. Musaev & Grigoriev, 2022).

3.2. Constructing a Two-Level MES Using Ensemble Machine Learning Methods

In this study, a two-level MES is developed to enhance forecasting performance in chaotic financial environments. At the lower level, a set of

M_{e}

software experts (SEs) is employed; these experts differ either in the structure or in the parameters of their forecasting algorithms. The second level is formed by a Software Expert–Supervisor (SE-S) that processes the local decisions of the individual experts and generates the final forecast, which ultimately informs the management decision.

The simplest aggregation method implemented by SE-S is the unweighted averaging of the forecasts from the individual SEs. Mathematically, if

{\tilde{x}}_{k, k + τ}

represents the forecast from the k-th expert for future time

k + τ

, then the combined forecast is given by the following:

{\tilde{x}}_{k, k + τ} = \frac{1}{M_{e}} \sum_{i = 1}^{M_{e}} x_{t}^{i}, t = k + 1, \dots, k + τ, k = 1, \dots, n,

(20)

which provides a baseline ensemble prediction. A more refined approach relies on weighted averaging, where each expert’s forecast is weighted inversely with respect to an a priori estimation of its Bayesian risk. Specifically, after sequentially solving the forecasting task for each SE using the retrospective database

Y_{S}

, the average forecasting error

δ {\tilde{X}}_{k, τ}^{i}

is computed and the corresponding Bayesian risk

r_{k, i}^{τ}

is estimated. The final forecast is then defined as follows:

{\tilde{x}}_{t + τ} = \sum_{i = 1}^{M_{e}} w_{i} {\tilde{x}}_{t + τ}^{i} / \sum_{i = 1}^{M_{e}} w_{i}, w_{i} = {(r_{i}^{τ})}^{- 1}, i = 1, \dots, m, t = k + 1, \dots, τ .

(21)

In practice, the actual management decision is determined by this final forecast along with an experimentally selected critical threshold

δ {(x_{k})}^{*} = {({\tilde{x}}_{k} - x_{k})}^{*}

.

A fundamental challenge in constructing an MES is that, under the conditions of chaotic dynamics, each SE typically acts as a “weak learner.” To overcome this limitation, ensemble machine learning techniques—namely bagging, boosting, and stacking—are incorporated. In these frameworks, the supervisor acts as a meta-learner to enhance the overall forecast reliability relative to individual SE outputs.

3.3. Ensemble Forecast for Chaotic Process Based on Bagging Technology

Bagging (Bootstrap Aggregating) relies on generating multiple training subsets from the original dataset to train a homogeneous set of SEs. Each expert in this approach is identical in structure but is trained on different resampled or sliding subsets of data. In non-stationary environments, however, the empirical distribution of the data changes over time, and so relying on bootstrap resampling can be less effective. To address this issue, this work proposes the use of sliding window samples of different lengths, taken immediately prior to the forecast time. Increasing the sample length improves the quality of smoothing and reduces the probability of Type II errors (false alarms), albeit at the cost of increased Type I errors (delays in change detection).

For example, consider a bagging framework applied to a linear extrapolation forecast in a daily trading scenario. Three variants are used, each employing training windows of different lengths:

L = 1440

min counts (1 day),

L_{1} = 720

counts and

L_{2} = 360

counts. These intervals are positioned directly adjacent to the forecast starting point, and the forecast is carried out on a daily interval (

τ = 1440

counts). As shown in Figure 5, the learning outcomes of the three SE extrapolators are displayed as linear approximations (solid blue, blue dashed, and black dashed lines), and the final forecast for each trading day is based on the averaged least squares estimates of the coefficients,

{\tilde{x}}_{k + i} = a_{o} + a_{1} t, t = 1, \dots, τ,

(22)

where

a_{o} = (a_{o 1} + a_{o 2} + a_{o 3}) / 3, a_{1} = (a_{11} + a_{12} + a_{13}) / 3

.

Although simple bagging based on linear extrapolators may not yield highly effective forecasts under conditions of inertialess market chaos, the approach shows more promise when applied to processes with inherent inertia (see A. Musaev et al., 2023c). It is important to note that the example presented is intended for illustration only; real-world trading strategies require dynamic entry points rather than static daily divisions.

3.4. Ensemble Forecast for a Chaotic Process Based on Boosting Technology

Boosting improves forecast accuracy by sequentially correcting the errors of homogeneous models. In the context of forecasting, boosting involves updating the weights of individual forecasts based on their errors, thereby refining the overall weighted average forecast. In our approach, the boosting algorithm operates on the basis of precedent data analysis: a sliding observation window,

Y_{(k, k - L + 1)} = (y_{k - L + 1}, \dots, y_{k}), k = L + 1, \dots, n,

(23)

is defined for the current state, and similar historical observation windows are identified from a retrospective database.

A set of indices

i_{1}^{*},, \dots, i_{M_{a}}^{*}

corresponding to the smallest values of a chosen similarity metric is selected. The most similar window, having the minimum similarity value

μ_{1} = \min (Y_{k, k - L + 1}, Y_{i, i + L - 1}), \forall i = 1, \dots, N_{S} - L,

is assigned a unit weight

w_{1} = 1

. Other windows are assigned weights relative to

μ_{1}

by

w_{i_{e}} = μ_{i_{e}} / μ_{1}, \forall i_{e} = 1, \dots, M_{e}

. The final forecast is then obtained as a weighted average of the forecast consequences of the identified analog windows,

{\tilde{x}}_{(k + 1, k + τ)} = \sum_{j = 1}^{M_{e}} w_{i_{j}^{*}} y_{i_{j}^{*} + L + 1, i_{j}^{*} + L + τ} / \sum_{j = 1}^{M_{e}} w_{i_{j}^{*}} .

(24)

As a numerical example, consider precedent forecasting for the EUR/USD currency pair using a sliding window of length L = 300 counts, with a scanning window from retrospective data shifted by 150 counts. The training dataset spans 300 days of continuous monitoring with a 1 min time discretization. At each step, a similarity metric—such as the mean absolute difference—is evaluated, and Figure 6 illustrates the current state window (blue graph) alongside three analogous windows extracted from the historical database.

Figure 7 then shows the corresponding forecast outcomes and the linear trends (red lines) estimated via least squares. Although the boosting algorithm successfully predicts the trend direction across several forecast windows, its performance may vary over the entire observation period due to the inherently unpredictable nature of market chaos.

3.5. Ensemble Forecast for a Chaotic Process Based on Stacking Technology

Stacking involves the integration of heterogeneous forecasting algorithms—each considered a “weak learner”—to produce a more reliable final forecast. In our stacking approach, each SE independently generates a prediction for the same forecast interval τ, and the supervisory expert (SE-S) subsequently acts as a meta-learner to combine these predictions.

The outputs of the individual experts are formulated as fuzzy decisions drawn from the set,

D = {1, 0, - 1},

(25)

where +1 indicates an upward trend, −1 indicates a downward trend, and 0 represents a flat or sideways trend. The final decision is determined by a threshold condition, for example,

d = {\begin{matrix} 1, α > α^{*}, \\ 0, | α | < α^{*}, \\ - 1, α < - α^{*}, \end{matrix}

(26)

with

α

representing the coefficient of linear approximation on the current observation segment and

α^{*}

determined via hypothesis testing (using, for instance, t-distribution tables with L-2 degrees of freedom) (Van Der Waerden, 2013).

In our illustrative implementation, three heterogeneous SEs are used as follows:

SE-1 employs a linear forecasting algorithm (as per Equation (10)) based on the trend extrapolation from the previous one-day observation window.
SE-2 is based on regression analysis; its forecasting operator is expressed as follows:

${\overset{⌢}{x}}_{k + r} = {\overset{⌢}{C}}_{k, r} Y_{R k}, k = L, \dots, n,$

(27)

where ${\overset{⌢}{C}}_{k, r} = {({Y_{R < (k - L + 1) : (k - r - 1) >}}^{T} Y_{R < (k - L + 1) : (k - r - 1) >})}^{- 1} {Y_{R [< (k - L + 1 : (k - 1) >]}}^{T} y_{< k - L + r + 1) : (k - 1) >}$ is the least squares estimate of the transfer coefficient of the linear filter, and $Y_{R k} = {(y_{R 1}, y_{R 2}, \dots, y_{R m})}_{k}$ is a vector of regressors corresponding to financial instruments strongly correlated with the target asset. Owing to non-stationarity, this forecast is often unstable; thus, an indicative forecast is formed based on the discrepancy between the current multiregression estimate ${\overset{⌢}{x}}_{k} = {\overset{⌢}{C}}_{k} Y_{R k}, k = L, \dots, n$ and the observed price $x_{k} = α y_{k} + (1 - α) x_{k - 1}$ , $k = 2, \dots, n$ . This discrepancy, defined as follows:

$d_{k} = {\overset{⌢}{x}}_{k} - x_{k}, k = 1, \dots n,$

(28)

is interpreted in light of market mispricing.
SE-3 utilizes text mining techniques to analyze market sentiment from analytical reviews. The objective is to convert qualitative textual information into a fuzzy decision drawn from D (see Hassani et al., 2020; Liu et al., 2021).

The supervisory expert (SE-S) aggregates the fuzzy outputs from SE-1, SE-2, and SE-3. In an illustrative day-trading simulation using EUR/USD real quotes over a 10-day period, the observation interval is divided into 10 segments (each of 1440 min counts). For all bagging experiments, we adopt an intraday horizon of τ = 60 one-minute bars. At the beginning of each segment, each SE produces a fuzzy decision. These decisions are visually represented by colored arrows in Figure 8: a green arrow for d = +1 (forecasted growth), a red arrow for d = −1 (forecasted decline), and a black two-sided arrow for d = 0 (no significant trend).

The final decision is determined via simple majority voting,

d_{s} = {\begin{matrix} 1, i f \sum_{i = 1}^{3} d_{i} > 0 \\ - 1, i f \sum_{i = 1}^{3} d_{i} < 0 \\ 0, i f \sum_{i = 1}^{3} d_{i} = 0 \end{matrix},

(29)

and can alternatively be computed using weighted majority voting if historical accuracy rates

w_{i} = h_{i}^{+}, i = 1, \dots, M_{e}

for each SE are available,

d_{s} = {\begin{matrix} 1, i f \sum_{i = 1}^{M_{e}} w_{i} d_{i} > 0 \\ - 1, i f \sum_{i = 1}^{M_{e}} w_{i} d_{i} < 0 \\ 0, i f \sum_{i = 1}^{M_{e}} w_{i} d_{i} = 0 \end{matrix} .

(30)

The final decision, as aggregated by SE-S, is depicted in Figure 8 by an arrow whose direction and color correspond to the ultimate market entry signal.

4. Results and Discussion

This section presents the empirical evaluation of the proposed ensemble decision-making framework, which integrates heterogeneous software experts (SEs) via stacking techniques over a 100-day observation interval. Over the 100-day evaluation period (144,000 one-minute bars), the study produced 99 day-ahead forecasts and 142,560 intraday one-hour-ahead forecasts, resulting in 142,659 individual point forecasts that underpin the trading and accuracy statistics reported below. The results are analyzed both from a performance perspective—in terms of trading profitability and forecasting accuracy—and with regard to the probabilistic reliability of the ensemble approach under conditions of stochastic chaos.

Table 1 includes the naïve random walk (RW) benchmark alongside the three single-expert (SE) variants and the multi-expert system (MES) that combines their signals by simple majority voting as formalized in Equation (29). The RW rule opens exactly one position per day by repeating the sign of the previous day’s return; over the 100-day evaluation window, this produces 100 trades, a modest loss of −185 pips, and a 49% hit rate. All three SE models meet or modestly surpass this baseline: SE-1 still posts a loss, whereas SE-2 and SE-3 generate positive net results. MES outperforms every individual expert and the RW benchmark, delivering a profit of 1392 pips—1577 pips more than RW and 296 pips above the best SE (SE-2)—while lifting the win probability (Equation (18)) to 61%.

These outcomes demonstrate that aggregating heterogeneous experts provides an economically and statistically meaningful edge over both the minimal-information persistence model and the standalone SEs. Moreover, applying sequential evolutionary optimization procedures (Katoch et al., 2021; A. Musaev et al., 2022; Albadr et al., 2020) to adapt expert weights can boost the MES win probability by an additional 5–8%, further reinforcing the advantage of the proposed framework.

These results indicate that leveraging an ensemble of weak predictors via MES can yield superior trading performance compared to the independent use of individual SEs. The enhanced performance is attributed to the aggregation of diverse, weak forecasting signals, which collectively contribute to forming a more robust management decision.

A fundamental requirement for the practical application of MES is that each individual SE must exceed a 50 percent threshold in the probability of successful decisions. To illustrate this, we consider a binary decision scenario in which a group of

M_{e}

experts makes a collective forecast. In this context, the final decision is determined by majority voting, where the necessary vote threshold is given by,

m^{+} = ⌊ M_{e} / 2 ⌋ + 1,

with

⌊ x ⌋

denoting rounding x to the nearest smaller integer. Assuming each expert makes an erroneous decision with probability

p

, the overall probability of an erroneous decision by the ensemble is computed using the Bernoulli formula as follows:

P_{e r r} = \sum_{i = 1}^{m^{+}} C_{M_{e}}^{i} p^{i} {(1 - p)}^{M_{e} - i} .

(31)

For instance, if p = 0.4, then as the number of experts increases (using odd numbers = 3, 5, …, 21), the probability of an erroneous ensemble decision decreases. Figure 9 illustrates this trend with a bar chart where a red line represents the least squares fit of a second-degree polynomial. A similar trend is observed in Figure 10, which shows polynomial approximations for ensemble error probabilities under different assumed values of p. These figures demonstrate that maintaining an individual expert error probability below p^* = 0.5 is a critical condition for the successful application of MES.

The empirical results clearly suggest that an ensemble approach, which aggregates weak signals from multiple SEs, can significantly improve forecast reliability and trading performance in environments characterized by stochastic chaos. However, it is important to note the following limitations:

The chaotic nature of financial time series implies that even an SE that exhibits high efficiency during one observation segment may fail under altered market conditions in the subsequent period. This poses challenges in dynamically estimating the individual p-values and assigning appropriate weightings.
While statistical or adaptive methods for decision fusion remain valuable, their efficiency can be compromised in turbulent regimes where traditional assumptions of stability do not hold (as in the Wald model described by Equation (1)). Similar challenges arise when employing neural networks, which require extensive training to reliably capture chaotic dynamics.
Despite promising aggregate results, the example presented here does not constitute a guarantee of consistent advantage under all market conditions. Given the inherent uncertainty in chaotic environments, practitioners should use MES as one part of a broader risk management framework, continuously monitored and adjusted using updated market data and retrospective analyses.

5. Conclusions

This study has demonstrated that an ensemble approach, specifically the multi-expert system (MES), can substantially enhance forecasting accuracy and decision-making in unstable environments characterized by stochastic chaos. The improved performance, however, is contingent upon individual software experts (SEs) maintaining a success probability exceeding 50%. This finding underscores the critical importance of integrating multiple weak predictors to offset individual deficiencies—a strategy that ultimately leads to more reliable and robust risk management decisions.

Our empirical results illustrate that while the basic implementations of individual SEs and the MES framework serve well to explain the underlying concepts, further enhancements can be achieved by developing more robust variants of these experts (Maronna et al., 2006; Schmidbauer et al., 2014; A. Musaev et al., 2023e). It is important to note that increasing the stability of a forecasting system by reducing its sensitivity to variations in dynamic and statistical characteristics may come at the cost of overall management efficiency. Therefore, a careful balance must be struck between robustness and agility, particularly in environments where permissible decision boundaries are inherently narrow and dynamic.

The proposed MES framework also opens avenues for the incorporation of advanced aggregation technologies beyond simple majority voting. Emerging techniques based on conflict resolution and compromise (Antipova & Rashkovskiy, 2023; Vinyamata, 2010) offer promising alternatives for integrating diverse expert opinions, potentially leading to even more refined decision-making processes. As illustrated in earlier sections (see Figure 5, Figure 6, Figure 7 and Figure 8), these sophisticated mechanisms may significantly improve the operational performance of the system when applied to real-world financial scenarios.

Mansurov et al. (2023) support the idea that integrating adaptive, self-learning components into multi-expert systems not only mitigates individual model deficiencies but also aligns aggregate forecasts more closely with observed market behavior. Such insights provide a compelling rationale for employing ensemble machine learning techniques that aggregate diverse forecasting methodologies to improve risk management and decision-making in volatile financial environments.

In practical terms, the ensemble approach detailed herein provides risk managers and financial practitioners with a promising tool for navigating the challenges of volatile markets. Future research should focus on the following:

(i): Refining the robustness of individual SEs within the MES framework;
(ii): Exploring alternative decision aggregation methods that can dynamically adapt to market changes;
(iii): Validating the methodology on a broader spectrum of financial datasets to enhance its practical applicability.

Ultimately, this work contributes to the ongoing discourse on managing uncertainty and risk in financial markets by offering a viable path toward improved forecast reliability and decision-making effectiveness.

Author Contributions

Conceptualization, methodology, validation, writing, A.M.; review and editing, investigation, programming, visualization, administration, scientific discussions, supervision, funding acquisition, D.G. All authors have read and agreed to the published version of the manuscript.

Funding

Alexander Musaev’s research for this paper was supported by a grant from the Russian Science Foundation (Project No. 24-19-00823). Dmitry Grigoriev’s research for this paper was supported by Saint-Petersburg State University, project ID: 103964829.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The source of data—Finam.ru (https://www.finam.ru/, accessed on 12 April 2025).

Acknowledgments

The authors are grateful to participants at the Center for Econometrics and Business Analytics (ceba-lab.org, CEBA) seminar series for helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Albadr, M. A., Tiun, S., Ayob, M., & Al Dhief, F. (2020). Genetic algorithm based on natural selection theory for optimization problems. Symmetry, 12(11), 1758. [Google Scholar] [CrossRef]
Antipova, E., & Rashkovskiy, S. (2023). Mathematical theory of conflicts as a cognitive control theory. Information, 14(1), 1. [Google Scholar] [CrossRef]
Armstrong, J. S. (Ed.). (2001). Principles of forecasting: A handbook for researchers and practitioners (Vol. 30). Springer Science & Business Media. [Google Scholar]
Avramov, D., & Chordia, T. (2006). Asset pricing models and financial market anomalies. The Review of Financial Studies, 19(3), 1001–1040. [Google Scholar] [CrossRef]
Ayitey, M., Jr., Appiahene, P., Appiah, O., & Ninfaakang, C. (2023). Forex market forecasting using machine learning: Systematic literature review and meta analysis. J. Big Data, 10, 9. [Google Scholar] [CrossRef]
Bildirici, M. E., & Sonustun, B. (2019). Chaotic behavior in exchange rate. International Journal of Financial Research, 10(1), 17–31. [Google Scholar] [CrossRef]
Boccaletti, S., Grebogi, C., Lai, Y. C., Mancini, H., & Maza, D. (2000). The control of chaos: Theory and applications. Physics Reports, 329(3), 103–197. [Google Scholar] [CrossRef]
Box, G. E. P., & Jenkins, G. M. (1970). Time series analysis: Forecasting and control. Holden-Day. [Google Scholar]
Chou, R. Y., & Kroner, K. F. (1992). ARCH modeling in finance. Journal of Econometrics, 52(1–2), 5–54. [Google Scholar]
Cretarola, A., Figà Talamanca, G., & Patacca, M. (2020). Market attention and bitcoin price modeling: Theory, estimation and option pricing. Decisions in Economics and Finance, 43(1), 187–228. [Google Scholar] [CrossRef]
Crilly, A. J., Earnshaw, R., & Jones, H. (Eds.). (2012). Fractals and chaos. Springer Science & Business Media. [Google Scholar]
Engle, R. F. (1982). Autoregressive conditional heteroskedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 50(4), 987–1007. [Google Scholar] [CrossRef]
Escher, F. (2019). Elements of foreign exchange: A foreign exchange primer (172p). Wentworth Press. [Google Scholar]
Fama, E. F. (1970). Efficient capital markets: A review of theory and empirical work. Journal of Finance, 25(2), 383–417. [Google Scholar] [CrossRef]
Fukunaga, K. (2013). Introduction to statistical pattern recognition (2nd ed., 616p). Academic Press. [Google Scholar]
Gardner, E. S., Jr. (1985). Exponential smoothing: The state of the art. Journal of Forecasting, 4(1), 1–28. [Google Scholar] [CrossRef]
Granger, C. W. (2008). Non-linear models: Where do we go next—Time varying parameter models? Studies in Nonlinear Dynamics & Econometrics, 12(3), 1–11. [Google Scholar] [CrossRef]
Gregory Williams, J., & Williams, B. M. (2004). Trading chaos: Maximize profits with proven technical techniques (2nd ed., 251p). J. Wiley & Sons. [Google Scholar]
Hamilton, J. D. (2010). Regime switching models. In Macroeconometrics and time series analysis (pp. 202–209). Palgrave Macmillan UK. [Google Scholar]
Han, H., Liu, Z., Barrios, M., Li, J., Zeng, Z., Sarhan, N., & Awwad, E. M. (2024). Time series forecasting model for non stationary series pattern extraction using deep learning and GARCH modeling. Journal of Cloud Computing, 13, 2. [Google Scholar] [CrossRef]
Hassani, H., Beneki, C., Unger, S., Mazinani, M. T., & Yeganegi, M. R. (2020). Text mining in big data analytics. Big Data and Cognitive Computing, 4(1), 1. [Google Scholar] [CrossRef]
Huber, P. J. (1981). Robust statistics (317p). John Wiley & Sons. [Google Scholar]
Iskrich, D., & Grigoriev, D. (2017, April 3–7). Generating long-term trading system rules using a genetic algorithm based on analyzing historical data. 2017 20th Conference of Open Innovations Association (FRUCT) (pp. 91–97), St. Petersburg, Russia. [Google Scholar]
Katoch, S., Chauhan, S. S., & Kumar, V. (2021). A review on genetic algorithm: Past, present, and future. Multimedia Tools and Applications, 80, 8091–8126. [Google Scholar] [CrossRef]
Khanh, P. Q., & Quan, N. H. (2019). Versions of the weierstrass theorem for bifunctions and solution existence in optimization. SIAM Journal on Optimization, 29(2), 1502–1523. [Google Scholar] [CrossRef]
Knight, F. H. (1921). Risk, uncertainty and profit. Houghton Mifflin. [Google Scholar]
Lauritzen, S. (2023). Fundamentals of mathematical statistics (244p). Chapman and Hall/CRC. [Google Scholar]
Li, H., Jin, K., Sun, S., Jia, X., & Li, Y. (2022). Metro passenger flow forecasting though multi source time series fusion: An ensemble deep learning approach. Applied Soft Computing, 120, 108644. [Google Scholar] [CrossRef]
Liu, Z., Huang, D., Huang, K., Li, Z., & Zhao, J. (2021, January 7–15). Finbert: A pre trained financial language representation model for financial text mining. Twenty Ninth International Conference on International Joint Conferences on Artificial Intelligence (pp. 4513–4519), Yokohama, Japan. [Google Scholar]
Mandelbrot, B. B., Evertsz, C. J., & Gutzwiller, M. C. (2004). Fractals and chaos: The mandelbrot set and beyond (Vol. 3). Springer. [Google Scholar]
Mansurov, K., Semenov, A., Grigoriev, D., Radionov, A., & Ibragimov, R. (2023). Impact of self-learning based high-frequency traders on the stock market. Expert Systems with Applications, 232, 120567. [Google Scholar] [CrossRef]
Maronna, R. A., Martin, R. D., & Yohai, V. J. (2006). Robust statistics: Theory and methods (454p). John Wiley & Sons, Ltd. [Google Scholar]
Musaev, A., & Grigoriev, D. (2022). Multi expert systems: Fundamental concepts and application examples. Journal of Theoretical and Applied Information Technology, 100(2), 336–348. [Google Scholar]
Musaev, A., Makshanov, A., & Grigoriev, D. (2022). Evolutionary optimization of control strategies for non stationary immersion environments. Mathematics, 10, 1797. [Google Scholar] [CrossRef]
Musaev, A., Makshanov, A., & Grigoriev, D. (2023a). Algorithms of sequential identification of system component in chaotic processes. International Journal of Dynamics and Control, 11, 2566–2579. [Google Scholar]
Musaev, A., Makshanov, A., & Grigoriev, D. (2023b). Dynamic robustification of trading management strategies for unstable immersion environments. Montenegrin Journal of Economics, 19(1), 19–29. [Google Scholar] [CrossRef]
Musaev, A., Makshanov, A., & Grigoriev, D. (2023c). Exploring the quotation inertia in international currency markets. Computation, 11, 209. [Google Scholar] [CrossRef]
Musaev, A., Makshanov, A., & Grigoriev, D. (2023d). Multiregression forecast in stochastic chaos. Computational Economics, 64, 137–160. [Google Scholar] [CrossRef]
Musaev, A., Makshanov, A., & Grigoriev, D. (2023e). The genesis of uncertainty: Structural analysis of stochastic chaos in Finance markets. Complexity, 2023(1), 1302220. [Google Scholar] [CrossRef]
Musaev, A. A., Makshanov, A. A., & Grigoriev, D. A. (2021). Forecasting multivariate chaotic processes with precedent analysis. Computation, 9, 110. [Google Scholar] [CrossRef]
Nguyen, T., & Thu, T. (2018, May 11–12). Using support vector machine in forex predicting. 2018 IEEE International Conference on Innovative Research and Development (ICIRD) (pp. 1–5), Bangkok, Thailand. [Google Scholar]
Parameswaran, S. K. (2022). Fundamentals of financial instruments: An introduction to stocks, bonds, foreign exchange, and derivatives (2nd ed., 560p). Wiley. [Google Scholar]
Peters, E. (1996). Chaos and order in the capital markets: A new view of cycles, prices, and market volatility (2nd ed.). John Wiley & Sons. [Google Scholar]
Phillips, P. C., & Perron, P. (1988). Testing for a unit root in time series regression. Biometrika, 75(2), 335–346. [Google Scholar] [CrossRef]
Schmidbauer, H., Rösch, A., Sezer, T., & Tunalioğlu, V. S. (2014). Robust trading rule selection and forecasting accuracy. Journal of Systems Science and Complexity, 27, 169–180. [Google Scholar] [CrossRef]
Stone, M. H. (1948). The generalized weierstrass approximation theorem. Mathematics Magazine, 21(5), 237–254. [Google Scholar] [CrossRef]
Van Der Waerden, B. L. (2013). Mathematical statistics (Vol. 156). Springer Science & Business Media. [Google Scholar]
Vinyamata, E. (2010). Conflictology: A multidisciplinary vision. Journal of Conflictology, 1(1), 1–3. [Google Scholar] [CrossRef]
Wold, H. O. (1938). A study in the analysis of stationary time series (214p). Almqvist & Wiksells. [Google Scholar]
Yusupov, R. M., Musaev, A. A., & Grigoriev, D. A. (2021, September 21–23). Evaluation of statistical forecast method efficiency in the conditions of dynamic chaos. 2021 IV International Conference on Control in Technical Systems (CTS) (pp. 178–180), Saint Petersburg, Russian. [Google Scholar]
Zhang, L., Mykland, P. A., & Aït-Sahalia, Y. (2005). A tale of two time scales: Determining integrated volatility with noisy high-frequency data. Journal of the American Statistical Association, 100(472), 1394–1411. [Google Scholar] [CrossRef]
Zhou, Z. H. (2025). Ensemble methods: Foundations and algorithms. CRC Press. [Google Scholar]

Figure 1. Fluctuations in the EUR/USD currency pair over a three-year period. The x-axis reports elapsed time in minutes (tick labels use a ×10⁵ multiplier), while the y-axis shows price in pips (1 pip = 10⁻⁴ USD).

Figure 2. The noise component of the observation series, formed in the process of sequential filtering with transfer coefficient

α_{1} = 0.05

.

Figure 2. The noise component of the observation series, formed in the process of sequential filtering with transfer coefficient

α_{1} = 0.05

.

Figure 3. An example of sequential smoothing of an observation series with transfer coefficients

α_{1} = 0.1

and

α_{2} = 0.005

.

Figure 3. An example of sequential smoothing of an observation series with transfer coefficients

α_{1} = 0.1

and

α_{2} = 0.005

.

Figure 4. Daily linear-extrapolation forecasts on centered EUR/USD minute bars over a ten-day sample (blue = local fit, red = one-day projection; green = centered EUR/USD minute bars over the ten-day interval).

Figure 5. An example of implementing bagging technology based on three linear extrapolators with different training intervals (solid blue = linear extrapolator trained on a 1-day window; blue dashed = linear extrapolator trained on a 2-day window; black dashed = linear extrapolator trained on a 3-day window).

Figure 6. The current state window and three analog windows identified in the process of searching the retrospective experience database (blue line—current analysis window; black line—overlaid plots of the three most similar historical analog windows).

Figure 7. Processes and their trends corresponding to the predicted process and three forecast options, corresponding to the analog windows (blue line—actual realized process over the forecast interval; black lines—forecast trajectories drawn from the three most similar historical analog windows; red lines—linear least-squares trend estimates for each forecast option).

Figure 8. An example of asset management based on a multi-expert system using stacking technology (green arrow: expert verdict +1; red arrow: expert verdict −1; black double-headed arrow: expert verdict 0; single arrow beneath each segment adopts the same colour for the aggregate majority vote).

Figure 9. Change in the probability of making an erroneous decision with simple majority voting with a probability of error of an individual expert of 0.4 as the odd number of experts increases.

Figure 10. Approximation of the probability of making an erroneous decision with simple majority voting with different probabilities of error of an individual expert as the odd number of experts increases.

Table 1. Comparison of trading results for SE and MES. The averages are calculated over M_SE1 = 334 (SE-1), M_SE2 = 100 (SE-2), M_SE3 = 100 (SE-3) and M_MES = 100 (ensemble) realized trades generated during the 100-day evaluation period.

Decisions	Result in Pips.	Probability of Winning	Average Win	Average Loss
RW	−185	0.49	56.2	−57.5
SE-1	−218	0.49	55.6	−54.7
SE-2	1096	0.59	62.7	−63.5
SE-3	788	0.56	59.5	−57.8
MES	1392	0.61	60.1	−58.3

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Musaev, A.; Grigoriev, D. Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets. J. Risk Financial Manag. 2025, 18, 296. https://doi.org/10.3390/jrfm18060296

AMA Style

Musaev A, Grigoriev D. Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets. Journal of Risk and Financial Management. 2025; 18(6):296. https://doi.org/10.3390/jrfm18060296

Chicago/Turabian Style

Musaev, Alexander, and Dmitry Grigoriev. 2025. "Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets" Journal of Risk and Financial Management 18, no. 6: 296. https://doi.org/10.3390/jrfm18060296

APA Style

Musaev, A., & Grigoriev, D. (2025). Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets. Journal of Risk and Financial Management, 18(6), 296. https://doi.org/10.3390/jrfm18060296

Article Menu

Ensemble Multi-Expert Forecasting: Robust Decision-Making in Chaotic Financial Markets

Abstract

1. Introduction

2. Methods

2.1. Observation Model and Problem Statement

2.2. Problem of Smoothing Non-Stationary Observation Series

2.3. A Polynomial Extrapolator

2.4. Multidimensional Regression Forecast

2.5. Precedent Forecast

2.6. Limits of Traditional Forecasting Under Unit Root Non-Stationarity and Chaotic Regimes

3. Experiments

3.1. A Software Expert Based on Linear Extrapolation

3.2. Constructing a Two-Level MES Using Ensemble Machine Learning Methods

3.3. Ensemble Forecast for Chaotic Process Based on Bagging Technology

3.4. Ensemble Forecast for a Chaotic Process Based on Boosting Technology

3.5. Ensemble Forecast for a Chaotic Process Based on Stacking Technology

4. Results and Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI