Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach

Du, Jinda; Cao, Wenyi; Wang, Ziyou

doi:10.3390/math14060938

Open AccessFeature PaperArticle

Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach

by

Jinda Du

¹,

Wenyi Cao

^2,*

and

Ziyou Wang

^2,*

¹

School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China

²

Department of Mathematics, King’s College London, London WC2R 2LS, UK

^*

Authors to whom correspondence should be addressed.

Mathematics 2026, 14(6), 938; https://doi.org/10.3390/math14060938

Submission received: 6 February 2026 / Revised: 26 February 2026 / Accepted: 9 March 2026 / Published: 10 March 2026

(This article belongs to the Special Issue Advances in Artificial Intelligence, Machine Learning and Optimization, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Accurately forecasting the risk matrix and constructing a well-controlled portfolio based on these forecasts is the core objective of effective asset allocation. This paper takes the Chinese stock market as the research object, employing multiple machine learning algorithms to systematically compare the predictive performance of the Financial Stress (FS) indicator and the Economic Policy Uncertainty (EPU) index in sectoral risk management. The forecast results are subsequently applied to portfolio construction and optimization. The findings indicate that, in terms of predictive dimensions, EPU demonstrates strong performance in short-term forecasts, but its explanatory power decays rapidly as the forecasting horizon extends. In contrast, the FS factor achieves forecasting accuracy that is significantly superior to both the EPU factor and traditional price series across all time horizons, exhibiting robust long-memory characteristics and cross-period stability. At the portfolio application level, the minimum variance strategy constructed based on FS forecasts effectively reduces out-of-sample portfolio variance, achieving superior risk control performance compared to strategies based on EPU factor forecasts. This result reveals the differentiated mechanisms of the two factor types: EPU acts as a driving force for short-term risk structure reshaping, while financial stress serves as the core variable driving the evolution of long-term risk structures. Machine learning methods provide an effective technical pathway for capturing these complex nonlinear relationships. The research conclusions offer new empirical evidence for investors to optimize asset allocation decisions and for regulatory authorities to improve risk monitoring systems.

Keywords:

risk matrices; machine learning; economic policy uncertainty; financial stress

MSC:

91G70

1. Introduction

The profound impact of Economic Policy Uncertainty (EPU) and Financial Stress (FS) on financial markets has been extensively documented in the literature [1,2,3]. Existing studies suggest that EPU, originating from government decision-making and shifts in the broader macroeconomic environment, constitutes a key source of exogenous shocks driving asset pricing and risk identification [4]. In contrast, financial stress indices, which capture liquidity frictions and credit tightening, provide an intrinsically grounded tool for measuring systemic risk [5]. However, although the existing literature has extensively examined the effects of EPU and FS on individual asset returns and volatility, their impact on the risk matrix—that is, the dynamic evolution of cross-industry covariance and correlation structures—still lacks clear theoretical articulation and systematic empirical investigation. From a theoretical perspective, this issue is grounded in well-established foundations. According to the behavioral finance theories of limited attention and heterogeneous beliefs [6,7], policy ambiguity induced by EPU can intensify investors’ divergent interpretations of information, thereby altering inter-industry risk-pricing linkages and reshaping risk transmission channels. Financial contagion theory further suggests that systemic stress conditions captured by FS may trigger correlation jumps and panic-driven co-movements, causing diversification-based industry portfolios to lose their hedging effectiveness during periods of turmoil [8]. Intertemporal asset pricing theory additionally indicates that EPU and FS, as key variables influencing the state-dependent properties of the stochastic discount factor, may induce systematic shifts and dynamic revaluation in the asset covariance structure. Against this backdrop, this study aims to systematically investigate the mechanisms through which these two complementary signals—EPU and FS—affect the risk matrix. By doing so, it seeks not only to fill a critical gap in the literature concerning the prediction of cross-industry linkage structures, but also to provide targeted theoretical and empirical support for investors seeking to improve risk budgeting and asset allocation strategies, as well as for regulators aiming to enhance macroprudential frameworks.

In identifying the impact of shocks on risk structures, the multi-dimensional components of EPU and FS exhibit highly heterogeneous informational characteristics, placing rigorous demands on model feature extraction and forecasting performance. However, existing literature frequently overlooks the dynamic evolutionary processes of EPU and FS factors, as well as the inherent heterogeneity in their impacts across different sectors [9,10,11]. In reality, the transmission of these risk factors to market risk structures is neither static nor uniform; instead, it manifests through industry-specific volatility spillover effects and dynamic correlation shifts. The failure to capture the differentiated and time-varying exposures of these factors across sectors introduces identification biases that directly undermine portfolio construction effectiveness. The nature of these complex relationships dictates that the modeling process must adopt a multidimensional perspective, treating the risk matrix as a quintessential multi-output forecasting task [12,13,14]. This necessity poses a critical challenge for traditional models. Addressing this, the present study first employs a DCC-GARCH framework to extract time-varying pricing factors embedded with EPU and FS information, aimed at identifying dynamic risk spillover patterns and the evolution of pricing characteristics. Building upon this foundation, these factors are integrated into a multi-output machine learning architecture. By precisely characterizing the complex many-to-many non-linear mapping between macro shocks and sectoral risk linkages, this approach establishes a granular and technically robust forecasting framework.

In asset allocation decisions, the risk matrix serves as the core foundation for risk budget allocation. Furthermore, this paper applies the risk matrix forecasting results to portfolio construction. By injecting the predicted matrices embedded with EPU and FS specific risk logic into the portfolio optimization framework [15,16,17], it enables precise identification and hedging of inter-industry linkage risks triggered by different sources. Specifically, the EPU signal helps capture cross-industry sentiment contagion induced by policy environment uncertainty, while the FS signal effectively identifies inter-industry liquidity resonance caused by endogenous financial system stress. This application-level extension enables real-time correction of industry correlation deviations under specific risk shocks, reducing the over-reliance of asset allocation on historical volatility. Many forecasting studies remain confined to comparisons of statistical error metrics [18], without examining whether improvements in predicting the risk matrix translate into out-of-sample gains in portfolio performance or risk reduction. By incorporating the forecasted covariance matrix into a minimum variance portfolio (MVP) framework and evaluating realized variance and weight stability, this study assesses whether the predictions genuinely reduce portfolio risk, thereby advancing the contribution from statistical significance to economic significance. This not only significantly enhances the defensive resilience of portfolios under uncertain disturbances, but also provides diversified empirical support for investors to construct decision-making systems with logical foresight in a volatile macroeconomic environment.

This paper makes marginal contributions in two aspects: First, it achieves an important expansion in the information sources for risk matrix forecasting by incorporating two typical risk signals—policy uncertainty (EPU) and financial system stress (FS)—into the forecasting framework for China’s industry risk matrix for the first time. Unlike previous studies that primarily rely on historical price data [19,20], this paper is grounded in the characteristics of China’s transitional period—namely, the reality that the market is profoundly influenced by policy guidance and liquidity conditions—revealing the differentiated impacts of risk shocks on inter-industry linkage patterns. Utilizing industry-specific data unique to China, this study captures how macroeconomic uncertainty transmits to the risk matrix through specific pricing mechanisms, filling the gap in existing literature regarding the use of forward-looking macroeconomic indicators to characterize the dynamic evolution of inter-industry risk structures. This not only enriches the empirical evidence on macroeconomic risk transmission but also provides a new perspective for understanding the unique financial market mechanisms in China.

Second, this study provides a complementary framework that is both more interpretable and more predictive. Specifically, we first employ a DCC-GARCH model to extract time-varying risk exposure factors of each industry to EPU and FS from historical data, and use them as economically meaningful structured features. These features are then incorporated into a multi-output machine learning model to directly forecast the cross-industry risk matrix. Compared with approaches that rely solely on parametric correlation models, our framework better captures the nonlinearity and heterogeneity in the relationship between macro shocks and inter-industry linkages. In contrast to purely black-box machine learning methods, it preserves clear economic interpretation and transmission logic. Therefore, our approach achieves a more balanced improvement between interpretability and predictive performance, thereby offering a substantive complement and extension to existing risk forecasting methodologies.

Third, at the practical application level, this paper constructs a risk factor-driven investment decision framework and systematically explores the critical role of EPU and FS signals in optimizing portfolio allocation. The findings reveal that incorporating EPU and FS pathways effectively identifies structural changes in industry correlations, thereby correcting the excessive weight concentration problem that emerges in traditional strategies during market turbulence and significantly reducing portfolio risk. The research results provide a feasible reference framework for investors to construct forward-looking and robust hedging strategies in complex macroeconomic environments.

The remainder of this paper is structured as follows. Section 2 details the literature review. Section 3 describes the data and method. Section 4 presents the empirical findings, covering both risk matrix forecasting and its application in portfolio. Section 5 concludes and provides future research directions.

2. Literature Review

Extensive research has demonstrated that Economic Policy Uncertainty (EPU) exerts a significant predictive influence on financial markets, particularly in the dynamic evolution of asset pricing and market volatility [21,22,23,24]. As a core metric characterizing uncertainty in government decision-making and the macroeconomic policy environment, EPU impacts the real economy by influencing corporate investment, consumption, and financing decisions. Simultaneously, it transmits to the stock market through risk premium and expectation adjustment channels, leading to declined returns and elevated volatility [4,25]. Existing studies generally find that during periods of heightened uncertainty, equity markets are more prone to systemic pullbacks, volatility clustering, and “flight-to-quality” behavior. Consequently, EPU possesses robust predictability for downside return pressure and conditional volatility [26,27,28]. In emerging markets, especially those with strong policy-driven characteristics, the impact of EPU on sectoral returns, volatility spillovers, and cross-market linkages is even more pronounced, often exhibiting stronger predictive power during phases of structural reform or institutional transition [29,30]. Recent studies have further integrated EPU into frameworks for tail risk, risk spillovers, and safe-haven asset pricing, finding that EPU explains the time-varying characteristics of correlation and hedging efficiency in multi-asset portfolios, particularly during crises [31,32]. Collectively, this evidence suggests that EPU not only affects the first and second moments of returns but also contains crucial forward-looking information regarding asset dependency structures and systemic risk pricing.

In contrast to EPU, which primarily reflects exogenous policy and macroeconomic shocks, Financial Stress (FS) focuses on the internal vulnerability and instability of the financial system. Its typical measurement synthesizes multi-dimensional information, including credit spreads, term spreads, market volatility, liquidity indicators, and sharp asset price adjustments [33,34,35]. Research indicates that high levels of financial stress are strongly associated with economic recessions, financial crises, and significant asset price corrections, serving as an early warning signal for the accumulation of systemic risk [36,37]. During periods of rising stress, investor risk aversion intensifies, safe-haven demand surges, and financial institutions’ balance sheets contract. This often leads to increased correlations and a higher frequency of co-movements in asset declines, thereby exacerbating market linkages and systemic exposure [38,39]. Consequently, FS has been widely utilized within macro-prudential policy frameworks to monitor systemic risk and financial fragility, serving as a vital tool for counter-cyclical adjustment and financial stability assessment. However, existing literature focuses predominantly on macroeconomic variables, financial institution risk exposures, or tail risk indicators [40], while the holistic forecasting of high-dimensional risk matrices across sectors or asset classes remains relatively under-explored.

The accurate forecasting of the Risk Matrix—typically presented as the conditional covariance matrix of asset returns—is fundamental to modern asset pricing, risk management, and asset allocation. Traditional research relies primarily on multivariate GARCH-type models and their extensions, serving portfolio optimization and risk measurement by characterizing the dynamic evolution of conditional covariance over time [41,42,43]. Some scholars have utilized high-frequency financial data to advance the realized covariance approach [44,45,46], constructing ex-post covariance matrices through the aggregation of high-frequency returns and subsequently modeling and forecasting them using vector autoregression (VAR), factor models, or shrinkage estimation. Recently, machine learning methods have been introduced to covariance forecasting to mitigate issues such as complex non-linear relationships, high dimensionality, and rapid structural changes. Studies have employed algorithms like Random Forests, Gradient Boosting Trees, and Neural Networks to learn latent correlation structures from large-scale cross-sectional and time-series features [47,48].

Overall, however, existing research in risk matrix forecasting remains heavily dependent on historical price and return information. Macro-financial factors are frequently treated as auxiliary control variables rather than core drivers of covariance matrix dynamic reconstruction [49,50]. In particular, research that systematically embeds EPU and FS—two indicators with forward-looking and endogenous complementary characteristics—into a high-dimensional risk matrix forecasting framework is almost non-existent. The established literature pays more attention to the impact of EPU and FS on mean returns, single-asset volatility, or low-dimensional systemic risk indicators, while rarely analyzing how they jointly drive the evolution of covariance structures at the sectoral level or within multi-asset portfolios. Against this backdrop, utilizing a multi-output machine learning framework that incorporates EPU and FS as dual drivers for exogenous policy shocks and endogenous financial fragility to jointly forecast high-dimensional risk matrices can bridge the gap. This approach not only transcends the limitations of traditional price-driven covariance forecasting at the methodological level but also reveals how macroeconomic uncertainty and financial stress reshape inter-sectoral risk linkage structures at the empirical level, thereby filling a distinct void in the literature regarding “macro-financial factor-driven risk matrix forecasting.

3. Data and Methodology

3.1. Data and Preprocessing

The EPU data used in this study are China-specific and are obtained as monthly observations from the authoritative Economic Policy Uncertainty database (www.policyuncertainty.com), capturing policy uncertainty signals in China’s macroeconomic environment. The Financial Stress (FS) index is also China-based and is measured by the China Composite Indicator of Systemic Financial Stress (China CISS), jointly developed by MGF Lab and the European Central Bank (ECB). The underlying data are retrieved from the ECB Data Portal. To ensure comparability and sectoral representativeness, this study take all 11 sector indices aligned with the Global Industry Classification Standard (GICS), covering Energy, Materials, Industrials, Consumer Discretionary, Consumer Staples, Health Care, Financials, Information Technology, Communication Services, Utilities, and Real Estate. Compared to traditional classification frameworks, this system offers an extended historical observation window, enabling a more comprehensive characterization of the Chinese economy’s volatility across diverse cycles. The sample period spans from October 2006 to November 2025, totaling 230 observations. All market-related data were retrieved from the Wind database, ensuring the professional rigor and accuracy of the research benchmark.

Table 1 presents the descriptive statistics for the eleven sector index returns and the core predictive factors. In terms of return performance, all sectors recorded positive monthly average returns, with Health Care exhibiting the highest gain, while Information Technology and Real Estate displayed the greatest standard deviations, indicating higher volatility. Regarding distributional characteristics, all sectors except Real Estate exhibited significant negative skewness, with kurtosis values far exceeding the normal distribution threshold of 3, manifesting distinct “fat-tail” properties. Notably, the FS demonstrated substantially higher positive skewness and kurtosis than EPU, suggesting that FS possesses stronger jump dynamics and extreme risk-capturing capabilities compared to the relatively smoother fluctuations of EPU. This further justifies the necessity of employing machine learning frameworks to capture non-linear risk dynamics. Regarding stationarity, both ADF and PP tests rejected the null hypothesis of a unit root at the 1% significance level, confirming that all variables are stationary I(0) sequences, thereby providing a robust foundation for subsequent machine learning modeling.

In accordance with standard asset pricing theory, this study incorporates EPU/FS into the Capital Asset Pricing Model (CAPM) framework to construct a factor pricing model, aiming to examine the pricing relationship between Chinese sectoral stock returns and EPU/FS:

R_{i, t} = b_{i, t}^{Mkt} M K T_{t} + b_{i, t}^{F a c t o r} ({F a c t o r}_{t}) + e_{i, t}

(1)

where

R_{i, t} = l n (P_{i, t} / P_{i, t - 1})

denotes the return of sector

i

at time

t

;

M K T_{t}

represents the market portfolio proxy used to control for common exposure to overall market conditions, and in this study, the CSI 300 Index (HS300) is employed as the market return proxy; and

{F a c t o r}_{t}

refers to the EPU/FS factor. The coefficients

b_{i, t}^{Mkt}

and

b_{i, t}^{F a c t o r}

measure the exposure of sector

i

to the market factor and EPU/FS factor, respectively. Notably, the beta (

b_{i, t}^{F a c t o r}

) constitutes the core variable of interest in this study, as it captures the transmission intensity of EPU/FS fluctuations to sectoral pricing.

However, traditional asset pricing models are predominantly built upon linear assumptions and static parameter estimation, which imposes significant limitations when addressing the impacts of EPU/FS shocks. To characterize the time-varying linkage intensity between sectoral returns and factors (EPU or FS), this study employs the DCC-GARCH framework at the second-moment level [51]. The core philosophy of this framework is to separately model “volatility” (scale) and “correlation” (linkage structure) and subsequently synthesize them into conditional covariances. This process yields the time-varying exposures (

b_{i, t}^{F a c t o r}

) consistent with asset pricing theory.

First, let

σ_{i, t}

and

σ_{F a c t o r, t}

denote the conditional standard deviations of sector

i

and factor

F

, denote the conditional standard deviations of sector

i

and factor

F

, respectively, which are estimated via univariate GARCH processes to capture the time-varying volatility of each variable at time

t

. Second, the conditional correlation matrix

C_{t}

is obtained by standardizing the auxiliary matrix

Q_{t}

from the DCC model. At the element level, the conditional correlation coefficient is defined as:

ρ_{i F, t} = \frac{q_{i F a c t o r, t}}{\sqrt{q_{i i, t} q_{F a c t o r F a c t o r, t}}}

(2)

where

ρ_{i F, t}

characterizes the evolution of the linkage structure over time. Third, the conditional covariance matrix is expressed as

H_{t} = D_{t} C_{t} D_{t}

, where the covariance element

h_{i F, t}

satisfies:

h_{i F, t} = ρ_{i F a c t o r, t} σ_{i, t} σ_{F a c t o r, t}

(3)

Intuitively,

ρ_{i F, t}

provides the dimensionless linkage direction and intensity, while

σ_{i, t} σ_{F a c t o r, t}

provides the dimensioned scale. Their synthesis yields the conditional covariance

h_{i F a c t o r, t}

suitable for risk measurement and asset pricing.

Finally, the time-varying factor exposure

b_{i, t}^{F a c t o r}

follows the standard asset pricing definition, i.e., the ratio of conditional covariance to conditional varian

b_{i, t}^{F a c t o r} \equiv \frac{{Cov}_{t} (R_{i, t}, {F a c t o r}_{t})}{{Var}_{t} ({F a c t o r}_{t})} = \frac{h_{i F, t}}{h_{F F, t}} = \frac{ρ_{i F a c t o r, t} σ_{i, t} σ_{F a c t o r, t}}{σ_{F a c t o r, t}^{2}}

(4)

This approach circumvents several restrictive limitations inherent in traditional research, such as the static assumption that sectoral responses to EPU/FS fluctuations remain constant over time. By relaxing the predefined constraint of invariant correlation coefficients, the DCC-GARCH model allows for time-varying dependencies among variables, thereby providing a more precise characterization of the dynamic linkages between EPU/FS and various industrial sectors. Based on this pricing framework, the study develops forecasting models and sector-based portfolios that deeply integrate EPU/FS factors, offering investors more robust asset allocation strategies.

Figure 1 illustrates the evolution of time-varying EPU/FS betas (

b_{i, t}^{F a c t o r}

) for 11 sector indices, revealing two distinct characteristics. First, the results reveal significant heterogeneity in sectoral sensitivity: policy-driven sectors, such as Real Estate and Industrials, exhibit markedly higher intensity in response to EPU shocks, whereas Financials and Consumer sectors demonstrate greater sensitivity to FS. Furthermore, sectoral pricing relationships display pronounced time-varying and non-linear characteristics. Sector betas frequently undergo extreme structural mutations during major policy-related episodes, with representative periods including the 2015 A-share market turbulence, the 2018 US-China trade friction, and the initial outbreak of the COVID-19 pandemic in 2020. These findings provide compelling evidence that the transmission of EPU and FS into market pricing is fundamentally event-driven, rather than following a simplistic linear steady-state relationship.

3.2. Methods

Building upon the data described above, this study aims to examine the marginal contribution of incorporating time-varying EPU/FS betas to the predictive performance of risk matrices across 11 industrial sectors. The primary objective of this research design is to explore whether time-varying EPU/FS betas can effectively capture incremental information regarding the transmission of macro-shocks to sectoral correlations, thereby addressing the perceptual blind spots of traditional price factors in decoding complex market dynamics. To this end, within an evaluation framework encompassing multiple machine learning algorithms, this study utilizes the time-varying EPU/FS betas of 11 sectors to perform multi-output forecasting of the risk matrix. This approach allows us to systematically validate the efficacy of EPU/FS factors in enhancing the predictive accuracy of sectoral covariance matrices. The overall methodological framework of the proposed modeling and evaluation procedure is summarized in Figure 2.

3.2.1. Support Vector Regression (SVR)

Support Vector Regression (SVR) is an extension of the Support Vector Machine (SVM) framework for regression tasks [52]. Unlike traditional Ordinary Least Squares (OLS) which minimizes squared residuals, SVR introduces an

ϵ

-insensitive loss function. This mechanism defines a tolerance tube around the predicted values; losses are only incurred when the actual observations fall outside this

ϵ

-margin. This property grants SVR exceptional robustness against outliers and fat-tail noise inherent in financial return series. The objective of SVR is to find a function

f (x) = w^{T} ϕ (x) + b

by solving the following optimization problem:

\underset{w, b, ξ, ξ^{*}}{m i n} \frac{1}{2} ∥ w ∥^{2} + C \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{*})

(5)

Subject to

|y_{i} - (w^{T} ϕ (x_{i}) + b)| \leq ϵ + ξ_{i}

(6)

where

C

is the regularization parameter and

ϕ (x)

represents the kernel function used to map input features into a high-dimensional space to handle non-linearity. SVR with Radial Basis Function (RBF) kernel seeks a function that keeps prediction errors within an

ϵ

-insensitive tube while maximizing flatness. The

ϵ

-insensitive loss function is defined as:

L_{ϵ} (z) = m a x (0, |z| - ϵ),

(7)

where

z = y - f (x)

is the prediction error. The Lagrangian dual leads to

\begin{array}{l} \underset{α, α^{*}}{m a x} & - \frac{1}{2} \sum_{i, j = 1}^{n} (α_{i} - α_{i}^{*}) (α_{j} - α_{j}^{*}) K (x_{i}, x_{j}) - ϵ \sum_{i = 1}^{n} (α_{i} + α_{i}^{*}) + \sum_{i = 1}^{n} y_{i} (α_{i} - α_{i}^{*}) \end{array}

(8)

subject to

0 \leq α_{i}, α_{i}^{*} \leq C

and

\sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) = 0

.

Prediction:

\hat{y} (x) = \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) K (x_{i}, x) + b,

(9)

where

K (x_{i}, x) = e x p (- γ ∥ x_{i} - x ∥^{2})

is the RBF kernel.

3.2.2. Random Forest Regressor (RF)

Random Forest (RF) is an ensemble learning algorithm based on the Bagging (Bootstrap Aggregating) paradigm. It constructs a multitude of decision trees during training and outputs the average prediction of the individual trees [53]. The core strength of RF lies in its dual randomness: random bootstrap sampling of data and random selection of feature subsets at each node split. This mechanism significantly reduces model variance, granting it superior generalization capabilities and robustness against overfitting, especially when navigating noisy financial datasets. Given the training set

D = < (x_{1}, y_{1}), \dots, (x_{n}, y_{n}) >

, Random Forest generates predictions through the following steps:

(1): Bootstrap sampling: Randomly draw B bootstrap samples $D_{b}$ with replacement from $D$ .
(2): Tree construction: For each bootstrap sample $D_{b}$ , train a regression tree $T_{b}$ . At each node split, randomly select $m$ features ( $m \leq M$ ) from all $M$ features, and choose the optimal split point according to the principle of minimizing the mean squared error (MSE):

$\underset{j, s}{m i n} [\sum_{x_{i} \in R_{1} (j, s)} {(y_{i} - c_{1})}^{2} + \sum_{x_{i} \in R_{2} (j, s)} {(y_{i} - c_{2})}^{2}]$

(10)

where $j$ is the feature, $s$ is the split threshold, $R_{1}, R_{2}$ are the regions after splitting, and $c_{1}, c_{2}$ are the means of the corresponding regions.
(3): Ensemble prediction: The final result is the arithmetic average of the predictions from $B$ trees:

${\hat{f}}_{r f}^{B} (x) = \frac{1}{B} \sum_{b = 1}^{B} T_{b} (x)$

(11)

where ${\hat{y}}_{b} (x)$ is the prediction of the $b$ -th tree ( $B = 50$ ).

3.2.3. AdaBoost Regressor

AdaBoost (Adaptive Boosting) is a classic boosting ensemble learning algorithm [54]. Unlike Random Forest, which constructs trees in parallel, AdaBoost iteratively builds weak learners (typically shallow decision trees) in a sequential and adaptive manner. Its core idea is to focus on samples that were poorly predicted in previous iterations by dynamically adjusting sample weights, thereby transforming a series of weak learners—whose performance is only slightly better than random guessing—into a strong learner with high accuracy. When dealing with data exhibiting high-frequency transient shocks, such as EPU/FS fluctuations, AdaBoost can rapidly identify hard-to-predict extreme risk shifts through its iterative mechanism. Given the training set

D = < (x_{i}, y_{i}) >_{i = 1}^{n}

, the AdaBoost algorithm proceeds as follows:

(1)

Initialize sample weights:

w_{i} = 1 / n

for all

i = 1, \dots, n

(2)

For each iteration

t = 1, \dots, T

:

–: Train a weak learner $h_{t} (x)$ on the weighted training data.
–: Compute the loss for each sample. Common loss functions include:
–: Linear loss:

$L_{i, t} = |y_{i} - h_{t} (x_{i})| / {m a x}_{k} |y_{k} - h_{t} (x_{k})|$
–: Squared loss:

$L_{i, t} = {(y_{i} - h_{t} (x_{i}))}^{2} / {m a x}_{k} {(y_{k} - h_{t} (x_{k}))}^{2}$
–: Exponential loss:

$L_{i, t} = 1 - e x p (- |y_{i} - h_{t} (x_{i})| / {m a x}_{k} |y_{k} - h_{t} (x_{k})|)$

(The loss is normalized to $[0, 1]$ using the maximum absolute error $D_{t}$ .)
–: Compute the weighted average loss:

${\overline{L}}_{t} = \sum_{i = 1}^{n} w_{i, t} L_{i, t} .$

(12)
–: If ${\overline{L}}_{t} \geq 0.5$ , terminate early (optional).
–: Compute the learner confidence:
$β_{t} = {\overline{L}}_{t} / (1 - {\overline{L}}_{t})$ (or a variant such as $β_{t} = {\overline{L}}_{t}^{p}$ for tuned robustness).
–: Update sample weights: $w_{i, t + 1} = w_{i, t} \cdot β_{t}^{(1 - L_{i, t})}$ , then renormalize weights.

(3)

Final prediction: The ensemble output is the weighted median of all weak learner predictions, where each

h_{t}

is weighted by

α_{t} = l o g (1 / β_{t})

.

3.2.4. XGBoost Regressor

XGBoost Regressor is a scalable gradient boosting framework that minimizes a regularized objective function using second-order gradient statistics, incorporating tree pruning and handling sparsity to achieve state-of-the-art performance on structured data [55]. XGBoost minimizes a regularized second-order approximated objective:

O b j^{(t)} = \sum_{i = 1}^{n} [g_{i} f_{t} (x_{i}) + \frac{1}{2} h_{i} f_{t}^{2} (x_{i})] + Ω (f_{t}),

(13)

Ω (f_{t}) = γ T + \frac{1}{2} λ ∥ w ∥^{2} + α ∥ w ∥_{1},

(14)

where

g_{i} = \partial_{{\hat{y}}^{(t - 1)}} l (y_{i}, {\hat{y}}^{(t - 1)})

,

h_{i} = \partial_{{\hat{y}}^{(t - 1)}}^{2} l (y_{i}, {\hat{y}}^{(t - 1)})

,

T

is leaf count,

w

leaf weights.

For squared loss,

l (y, \hat{y}) = \frac{1}{2} {(y - \hat{y})}^{2}

, so

g_{i} = {\hat{y}}_{i}^{(t - 1)} - y_{i}

,

h_{i} = 1

.

Leaf score:

w_{j}^{*} = - \frac{G_{j}}{H_{j} + λ},

(15)

objective after split:

O b j = - \frac{1}{2} \sum_{j = 1}^{T} \frac{G_{j}^{2}}{H_{j} + λ} + γ T .

(16)

Split gain:

G a i n = \frac{1}{2} [\frac{G_{L}^{2}}{H_{L} + λ} + \frac{G_{R}^{2}}{H_{R} + λ} - \frac{{(G_{L} + G_{R})}^{2}}{H_{L} + H_{R} + λ}] - γ .

(17)

For each node split, the algorithm searches for the feature and corresponding split point that maximizes the Gain score, thereby ensuring the objective function is minimized at each step of the tree’s growth.

3.2.5. LightGBM Regressor

LightGBM is an advanced gradient boosting decision tree (GBDT) framework [56]. It addresses computational bottlenecks in traditional boosting algorithms on large-scale datasets by introducing Gradient-based One-Side Sampling (GOSS) and Exclusive Feature Bundling (EFB). In this study, LightGBM is employed to capture the high-dimensional non-linear mapping between EPU/FS betas and sectoral risk matrices. LightGBM follows an additive model formulation. At the

t

-th iteration, the objective is to minimize:

O b j^{(t)} = \sum_{i = 1}^{n} L (y_{i}, {\hat{y}}_{i}^{(t - 1)} + f_{t} (x_{i})) + Ω (f_{t})

(18)

where

L

is the loss function and

Ω (f_{t})

is the regularization term that penalizes tree complexity (number of leaves and the squared sum of leaf weights). To enable fast optimization, the objective is approximated using a second-order Taylor expansion around the current prediction

{\hat{y}}_{i}^{(t - 1)}

:

O b j^{(t)} \approx \sum_{i = 1}^{n} [g_{i} f_{t} (x_{i}) + \frac{1}{2} h_{i} f_{t}^{2} (x_{i})] + Ω (f_{t})

(19)

where

g_{i} = \partial_{{\hat{y}}^{(t - 1)}} L (y_{i}, {\hat{y}}_{i}^{(t - 1)})

is the first-order gradient and

h_{i} = \partial_{{\hat{y}}^{(t - 1)}}^{2} L (y_{i}, {\hat{y}}_{i}^{(t - 1)})

is the second-order gradient (Hessian). LightGBM evaluates candidate splits by maximizing the following gain

G

:

G = \frac{1}{2} [\frac{{(\sum_{i \in L} g_{i})}^{2}}{\sum_{i \in L} h_{i} + λ} + \frac{{(\sum_{i \in R} g_{i})}^{2}}{\sum_{i \in R} h_{i} + λ} - \frac{{(\sum_{i \in (L \cup R)} g_{i})}^{2}}{\sum_{i \in (L \cup R)} h_{i} + λ}]

(20)

where

L

and

R

denote the left and right child subsets after splitting, and

λ

is the

L_{2}

regularization parameter on leaf weights. Instead of using all samples to compute the gain, Gradient-based One-Side Sampling (GOSS) retains instances with large gradients and randomly samples those with small gradients. The adjusted gradients are defined as

{\tilde{g}}_{i} = \{\begin{array}{l} g_{i} & if i \in A \\ \frac{1 - a}{b} g_{i} & if i \in B \end{array}

(21)

where

A

is the top

a %

of instances ranked by absolute gradient magnitude, and

B

is a random subset of size

b %

drawn from the remaining instances. This mechanism ensures that the model prioritizes poorly fitted samples—particularly useful for capturing sudden shifts induced by EPU/FS shocks.

3.2.6. CatBoost Regressor

CatBoost (Categorical Boosting) is a high-performance gradient boosting algorithm [57]. Although the input features in this study (time-varying EPU/FS betas) are primarily numerical, CatBoost’s distinctive symmetric (oblivious) decision trees and ordered boosting mechanisms endow it with exceptional resistance to overfitting and superior predictive stability when applied to time-series data. Compared to other boosting algorithms, CatBoost more effectively handles noise in financial data and mitigates the gradient bias problem inherent in traditional gradient boosting frameworks. Unlike LightGBM’s leaf-wise growth strategy, CatBoost employs symmetric decision trees as base learners. In each level of the tree, the same splitting criterion (feature + threshold) is applied across all nodes at that depth. This structure provides exponentially faster inference and acts as a form of structural regularization, effectively preventing local overfitting when capturing extreme fluctuations in EPU/FS:

f (x) = \sum_{t = 1}^{T} h_{t} (x)

(22)

where

h_{t}

denotes the

t

-th symmetric tree. Traditional GBDT algorithms compute gradients using the same dataset for both training the current tree and evaluating its residuals, leading to prediction shift (or gradient bias). CatBoost introduces ordered boosting by applying random permutations to the training samples and estimating the gradient for each sample using models trained only on preceding samples in the permutation:

g_{i} = {\frac{\partial L (y_{i}, F)}{\partial F}|}_{F = \overline{F} (x_{i})}

(23)

where

\overline{F} (x_{i})

is the prediction obtained from models trained exclusively on samples appearing before

i

in the current random permutation. This approach yields more robust out-of-sample performance, particularly valuable for time-sensitive tasks such as sectoral covariance forecasting. The optimization objective of CatBoost combines the empirical loss with

L_{2}

regularization on leaf weights:

O b j = \sum_{i = 1}^{n} L (y_{i}, {\hat{y}}_{i}) + λ \sum_{t = 1}^{T} ∥ ω_{t} ∥^{2}

(24)

where

ω_{t}

represents the vector of leaf weights in the

t

-th tree, and

λ

controls the strength of regularization.

3.2.7. Extreme Learning Machine (ELM)

Extreme Learning Machine (ELM) is a novel learning algorithm designed for single-hidden-layer feedforward neural networks (SLFNs) [58]. Unlike traditional neural networks that rely on back-propagation (BP) with iterative gradient descent, the defining feature of ELM is that the input weights and biases of the hidden layer neurons are randomly generated and remain fixed throughout training. The model requires only an analytical solution to compute the output weights, thereby reducing the entire training process to a simple linear least-squares problem. When applied to the time-varying EPU/FS beta data in this study, ELM leverages its extremely fast training speed and excellent non-linear approximation capability to efficiently capture the instantaneous impact of macroeconomic policy shocks on financial markets. For an SLFN with

L

hidden nodes and activation function

G (\cdot)

, the prediction function is expressed as

f (x_{i}) = \sum_{j = 1}^{L} β_{j} G (w_{j} \cdot x_{i} + b_{j}) = h (x_{i}) β, i = 1, \dots, n

or in matrix form:

H β = Y

(25)

where

w_{j} \in R^{d}

and

b_{j} \in R

are the input weight vector and bias of the

j

-th hidden node, respectively, randomly generated according to any continuous probability distribution and kept fixed.

H \in R^{n \times L}

is the hidden layer output matrix, with its

(i, j)

-th element defined as

H_{i j} = G (w_{j} \cdot x_{i} + b_{j})

.

β \in R^{L \times m}

is the output weight matrix, representing the only parameters to be learned.

Y \in R^{n \times m}

is the target matrix. Training process: To minimize the approximation error

∥ H β - Y ∥

, the optimal output weights are obtained analytically via the Moore–Penrose generalized inverse

H^{†}

:

β = H^{†} Y

(26)

In practice, when

H^{T} H

is nonsingular, this simplifies to

β = {(H^{T} H)}^{- 1} H^{T} Y

; otherwise, the pseudoinverse ensures the minimum-norm least-squares solution. This closed-form solution eliminates the need for iterative gradient descent, making ELM training thousands of times faster than conventional neural networks while maintaining comparable or superior generalization performance.

3.2.8. Ridge Regression

Ridge Regression is a linear regression method that incorporates an L2 regularization term on the coefficients to prevent overfitting and handle multicollinearity, yielding a closed-form solution that shrinks coefficients toward zero [59]. Ridge minimizes L2-regularized least squares:

J (w) = ∥ y - X w ∥_{2}^{2} + α ∥ w ∥_{2}^{2},

(27)

\nabla J (w) = - 2 X^{T} (y - X w) + 2 α w = 0,

(28)

\hat{w} = {(X^{T} X + α I)}^{- 1} X^{T} y .

(29)

The essence of Ridge Regression lies in its ‘weight shrinkage’ mechanism. When processing the EPU/FS betas of ten industrial sectors, traditional regression models are prone to abnormal fluctuations in parameter estimates due to the prevalence of strong multicollinearity among sectors. By penalizing excessively large coefficients, Ridge Regression ensures that the model becomes less sensitive to minor noise in the input data. Consequently, it significantly enhances the robustness of risk matrix forecasting while retaining all sectoral features as predictive inputs.

3.2.9. Lasso Regression

Lasso Regression extends linear regression by adding an L1 regularization penalty on the absolute values of coefficients, promoting sparsity by driving some coefficients exactly to zero [60], thus performing simultaneous variable selection. Lasso uses L1 regularization:

J (w) = ∥ y - X w ∥_{2}^{2} + α ∥ w ∥_{1} .

(30)

Subgradient condition for optimality (for each coordinate):

- X_{j}^{T} (y - X w) + α \cdot sign (w_{j}) = 0 if w_{j} \neq 0,

(31)

|X_{j}^{T} (y - X_{- j} w_{- j})| \leq α if w_{j} = 0 .

(32)

Solved via coordinate descent or proximal gradient. Lasso Regression, in essence, operates as a sparse feature selection mechanism. Within the predictive framework of this study, it automatically identifies and eliminates EPU/FS signals that contribute minimally or even introduce noise to the risk matrix forecasting.

3.2.10. Bayesian Ridge Regression

Bayesian Ridge Regression is a probabilistic linear regression approach that places a spherical Gaussian prior on the weights and estimates both coefficients and regularization hyperparameters via empirical Bayes, providing a robust alternative to standard ridge regression. The core advantage of Bayesian Ridge Regression lies in its parametric adaptability and probabilistic inference capabilities. When processing time-varying EPU/FS betas, the model moves beyond the reliance on manually pre-specified penalty parameters (

λ

) typical of standard Ridge Regression; instead, it automatically learns the optimal intensity of regularization from the data by maximizing the marginal likelihood function. This mechanism enables the model to dynamically calibrate the degree of coefficient shrinkage in response to policy fluctuations across different periods. Furthermore, the Bayesian framework provides not only point estimates for the sectoral risk matrix but also a formal measure of uncertainty (confidence intervals). This provides a probabilistic foundation for assessing the reliability of market forecasts under EPU/FS shocks, enhancing the model’s robustness in extreme market environments. The Bayesian Ridge Regression model is set as follows:

Likelihood:

p (y | X, w, λ) = N (y | X w, λ^{- 1} I) .

(33)

Prior:

p (w | α) = N (w | 0, α^{- 1} I) .

(34)

Posterior:

p (w | y, X, α, λ) \sim N (μ, Σ),

(35)

Σ = {(λ X^{T} X + α I)}^{- 1},

(36)

μ = λ Σ X^{T} y .

(37)

Hyperparameters maximized via marginal likelihood:

l o g p (y | X, α, λ) = - \frac{1}{2} [n l o g (2 π) + l o g |Σ| + {(y - X μ)}^{T} {(λ^{- 1} I)}^{- 1} (y - X μ)] .

(38)

3.2.11. Extra Trees Regressor

Extremely Randomized Trees (Extra Trees) is an ensemble method that constructs multiple unpruned decision trees using extreme randomization [61]: at each node, it selects a random subset of features and chooses split thresholds completely at random from the range of values in the selected feature, without optimizing for impurity reduction. The final prediction is the average of all tree predictions, which helps reduce variance more effectively than standard Random Forests in some cases.

The overall model prediction is given by:

\hat{y} (x) = \frac{1}{B} \sum_{b = 1}^{B} T_{b} (x),

(39)

where

\hat{y} (x)

is the predicted value for input vector

x

,

B

is the number of trees in the ensemble (e.g.,

B = 50

),

T_{b} (x)

is the prediction from the

b

-th extremely randomized tree. Each tree

T_{b} (x)

is constructed recursively. At each internal node: A random subset of

K

features is selected (typically

K = \sqrt{d}

, where

d

is the total number of features). For each selected feature

f_{k}

, a split threshold

θ_{k}

is chosen uniformly at random from the empirical minimum and maximum values of

f_{k}

in the current node’s samples:

θ_{k} \sim Uniform (m i n (f_{k}), m a x (f_{k})) .

(40)

The “best” split

s

among these random candidates is selected based on a score, such as variance reduction for regression:

Score (s) = Var (y) - \frac{n_{L}}{n} Var (y_{L}) - \frac{n_{R}}{n} Var (y_{R}),

(41)

where

Var (\cdot)

is the variance of targets in the node:

Var (y) = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \overline{y})}^{2},

(42)

with

n, n_{L}, n_{R}

the sample sizes in the parent, left, and right child nodes, and

y, y_{L}, y_{R}

the corresponding target sets,

\overline{y} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}

. Leaf nodes predict the mean of their samples:

T_{b} (x) = {\overline{y}}_{l} = \frac{1}{n_{l}} \sum_{i \in l} y_{i},

(43)

where

l

is the leaf node containing

x

, and

n_{l}

is the number of samples in

l

. The ensemble’s variance is estimated as:

Var (\hat{y} (x)) = ρ σ^{2} + \frac{1 - ρ}{B} σ^{2},

(44)

where

ρ

is the average correlation between tree predictions:

ρ = \frac{1}{B (B - 1)} \sum_{b \neq b'} Corr (T_{b} (x), T_{b'} (x)),

(45)

and

σ^{2}

is the average tree variance:

σ^{2} = \frac{1}{B} \sum_{b = 1}^{B} Var (T_{b} (x)) .

(46)

Extreme randomization often lowers

ρ

compared to Random Forests. Training is parallelizable across trees, with no pruning applied to maximize bias reduction.

3.2.12. Gaussian Process Regression (GPR)

Gaussian Process Regression (GPR) is a non-parametric probabilistic model grounded in Bayesian theory [62]. Unlike conventional regression approaches that presuppose a specific functional form (e.g., linear or polynomial), GPR posits a prior distribution over the space of possible functions via a Gaussian Process. It defines similarity between data points through a kernel function. When modeling the relationship between EPU/FS betas and sectoral risk matrices, GPR not only provides point predictions of industry covariances but also yields a full posterior predictive distribution, offering rigorous quantification of predictive uncertainty. A Gaussian Process is fully specified by its mean function

m (x)

(often assumed zero for simplicity) and covariance function (kernel)

k (x, x')

:

f (x) \sim G P (m (x), k (x, x'))

(47)

Given observed data

D = < (x_{i}, y_{i}) >_{i = 1}^{n}

and a new input

x_{*}

, the predictive distribution is jointly Gaussian. The closed-form posterior mean and variance are:

Predictive mean:

{\overline{f}}_{*} = k_{*}^{T} {(K + σ_{n}^{2} I)}^{- 1} y

(48)

Predictive variance:

var (f_{*}) = k (x_{*}, x_{*}) - k_{*}^{T} {(K + σ_{n}^{2} I)}^{- 1} k_{*}

(49)

where

K_{i j} = k (x_{i}, x_{j})

is the training kernel matrix,

k_{*}

is the vector of covariances between

x_{*}

and the training inputs, and

σ_{n}^{2}

represents observation noise variance.

4. Empirical Results

4.1. Implementation Details

This section systematically evaluates the empirical effectiveness of the twelve previously established machine learning models in sectoral risk forecasting. Using 11 Chinese industry stock indices as the research sample, this study performs multi-output forecasting of the sectoral risk matrix (covariance matrix, which is computed based on the asset return vector, representing the second-order moment of the joint distribution of multiple assets) across various algorithms. More importantly, it explores the incremental value of time-varying EPU/FS factors in enhancing predictive accuracy. To evaluate model generalizability, the full sample is partitioned into a training set and a test set using an 80/20 split, with the latter reserved exclusively for out-of-sample performance evaluation.

Within the evaluation system, two categories of metrics are employed: (1) statistical measures of forecasting error, including Mean Squared Error (MSE), Root Mean Squared Error (RMSE) [63], Mean Absolute Error (MAE) [64], and Mean Absolute Percentage Error (MAPE) [65]; and (2) economic performance of the strategies. Specifically, the predicted risk matrices are used to construct a Minimum Variance Portfolio (MVP) [66]. By comparing their out-of-sample volatility performance, we validate the economic utility of the improved forecasting accuracy from a practical investment perspective. The specific definitions are as follows:

(1): MSE is the most widely used loss function in financial volatility forecasting due to its strong penalty on large errors.

$M S E = \frac{1}{N} \sum_{t = 1}^{N} {(y_{t} - {\hat{y}}_{t})}^{2}$

(50)
(2): RMSE is the square root of MSE and shares the same unit as the original data, facilitating intuitive interpretation.

$R M S E = \sqrt{\frac{1}{N} \sum_{t = 1}^{N} {(y_{t} - {\hat{y}}_{t})}^{2}}$

(51)
(3): MAE is less sensitive to outliers, providing a more robust measure of average prediction bias.

$M A E = \frac{1}{N} \sum_{t = 1}^{N} |y_{t} - {\hat{y}}_{t}|$

(52)
(4): MAPE expresses prediction errors as percentages of the true values, making it particularly suitable for comparing forecasting performance across sectors of differing magnitudes.

$M A P E = \frac{1}{N} \sum_{t = 1}^{N} |\frac{y_{t} - {\hat{y}}_{t}}{y_{t}}| \times 100 %$

(53)
(5): Minimum Variance Portfolio (MVP)

To assess the practical value of the predicted covariance matrix

{\hat{Σ}}_{t}

, we construct the global minimum variance portfolio and evaluate its realized performance in the true market environment.

Portfolio Weight Optimization: The optimal asset weights

w_{t}^{*}

are obtained by solving the following constrained minimization problem:

\underset{w_{t}}{m i n} w_{t}^{T} {\hat{Σ}}_{t} w_{t}

(54)

subject to

w_{t}^{T} 1 = 1

(55)

Analytical Solution: In the absence of short-sale constraints, the closed-form solution is:

w_{t}^{*} = \frac{{\hat{Σ}}_{t}^{- 1} 1}{1^{T} {\hat{Σ}}_{t}^{- 1} 1}

(56)

Realized Portfolio Variance: The economic usefulness of the forecasted covariance matrix is ultimately measured by the realized variance using the true covariance matrix

Σ_{real, t}

:

Realized Variance = {(w_{t}^{*})}^{T} Σ_{real, t} w_{t}^{*}

(57)

A lower realized variance indicates that the model produces covariance forecasts that are more structurally accurate and economically valuable.

4.2. Forecast Evaluation

Building upon the research design and feature selection described above, this study follows established literature by adopting a 36-period rolling input window to generate forecasts for 1-, 6-, and 12-period horizons [67], representing short-, medium-, and long-term forecasting scales, respectively. By comparing the predictive performance of models incorporating EPU/FS-specific factors, this section provides a systematic evaluation of each model’s statistical accuracy and robustness across diverse forecasting horizons. Overall, the inclusion of the FS and EPU factors consistently reduces prediction errors, indicating that these macroeconomic uncertainty signals provide substantively meaningful incremental information for modeling cross-industry covariance structures and enhance the predictability of the risk matrix.

The results reported in Table 2 clearly indicate that risk-matrix forecasts based on the Financial Stress (FS) feature specification outperform both the EPU-based specification and the traditional return-based benchmark in the vast majority of “model–horizon” combinations, as reflected by systematic reductions in MSE and RMSE. Taking Panel A (1-period forecasts) as an example, in the SVR model, the FS specification achieves an RMSE of 0.0633, which is significantly lower than the EPU specification (0.0717) and the benchmark (0.0679). Its MSE (0.004013) also outperforms EPU (0.005145) and the benchmark (0.004610). Similarly, in the GPR model, the FS specification delivers strong predictive performance, with MSE (0.004045), RMSE (0.063602), MAE (0.045371), and MAPE (1.098) all outperforming both the EPU specification (0.004077, 0.063849, 0.045399, 1.115) and the benchmark (0.004109, 0.064100, 0.045884, 1.129). These findings suggest that FS features provide stable and substantively meaningful incremental information for forecasting cross-industry covariance and correlation structures, thereby significantly enhancing the predictability of the risk matrix.

In contrast to the broad and stable advantage of the FS specification, the gains associated with the EPU specification are more selective in nature. Its improvements relative to the benchmark are primarily concentrated in short-horizon forecasts and within specific algorithmic frameworks. As shown in Panel A of Table 2, for 1-period forecasts, the Lasso model achieves an MSE of 0.004150 under the EPU specification, outperforming the benchmark value of 0.004199. The ExtraTrees model similarly records a lower MSE of 0.004233 under EPU compared to the benchmark (0.004247). In addition, the GPR model achieves a lower MAPE of 1.115 under EPU relative to the benchmark (1.129). These results indicate that policy uncertainty information can improve risk-matrix forecasts in short-term settings, even though the magnitude of error reduction is generally smaller than that achieved by the FS specification. For example, in the SVR model, the reduction in RMSE under the FS specification (from 0.0679 to 0.0633) is substantially larger than under the EPU specification (0.0717, which actually increases relative to the benchmark). Consistent with the short-horizon evidence in Table 2, this pattern aligns with an expectations-based transmission mechanism whereby policy uncertainty first influences investor expectations and pricing behavior, exerting a marginal impact on inter-sectoral correlations before shocks are fully incorporated into market prices [27].

Furthermore, the relative dominance of the FS specification in Table 2 is consistent with the distinct characteristics of the two signals. Empirical evidence shows that the FS specification exhibits more pervasive predictive advantages across horizons and algorithms. In Panel A (1-period forecasts), 10 out of 12 models achieve lower MSE values under the FS specification compared to the benchmark (with ELM and Lasso as exceptions). In Panel B (6-period forecasts), this ratio remains 10 out of 12 (with ELM and ExtraTrees as exceptions). In Panel C (12-period forecasts), 9 out of 12 models continue to exhibit lower MSE values under FS relative to the benchmark. These patterns are consistent with FS being closely linked to liquidity constraints, financing conditions, and endogenous systemic stress, mechanisms that more directly synchronize sectoral returns and induce adjustments in the covariance structure. By contrast, the advantages of the EPU specification are more context-specific and horizon-dependent. In Panel A, only a limited number of models (such as Lasso, ExtraTrees, and GPR) show improvements under EPU relative to the benchmark, whereas many others (e.g., SVR, RF, and XGBoost) exhibit higher prediction errors when EPU is included. The evidence in Table 2 therefore supports the view that EPU influences sectoral correlations through relatively gradual processes such as policy interpretation and expectation formation.

A vertical comparison from Panel A to Panel C further highlights the horizon-related patterns reflected in Table 2. As the forecasting horizon extends from 1 period to 12 periods, prediction errors increase across all specifications, consistent with the general principle of information decay over time. For example, under the FS specification in the GPR model, MSE increases from 0.004045 (1-period) to 0.004127 (6-period), and further to 0.004351 (12-period); MAPE rises from 1.098 to 1.108 and then returns to 1.098, exhibiting minor fluctuations but remaining relatively stable overall. Nevertheless, the FS specification demonstrates stronger cross-horizon resilience. In Panel C (12-period forecasts), the advantage of FS relative to both the benchmark and EPU becomes more pronounced for certain ensemble algorithms. In the Random Forest model, the FS MSE (0.004574) is substantially lower than both the benchmark (0.004910) and EPU (0.004746). In the CatBoost model, the FS MSE (0.004745) also outperforms the benchmark (0.005030) and EPU (0.005245). This cross-horizon evidence indicates that the information captured by FS is associated not only with short-term shifts in correlations but also with more persistent adjustments in inter-sectoral dependence, thereby maintaining strong explanatory and predictive power in medium- to long-term risk-matrix forecasting.

Comparing the performance of various algorithms in Table 2 reveals that the model architecture is crucial for the effective utilization of predictive factors. Ensemble learning and kernel-based models (e.g., CatBoost, GPR, and ExtraTrees) demonstrate significant improvements in predictive accuracy upon the introduction of EPU/FS factors. In contrast, while linear models such as Lasso and Ridge also show enhancements, their degree of improvement remains relatively limited, with overall forecast errors staying at a higher level. Notably, the combination of CatBoost and the FS factor achieves some of the lowest MSE records in multiple tests. From a methodological perspective, this phenomenon suggests that the relationship between macro risk factors and sectoral correlations is characterized by strong non-linearity and high-dimensional interactions. While linear models attempt to fit this relationship through global linear mapping, they tend to capture only coarse average effects, potentially failing to fully exploit the non-linear transmission paths of macro shocks under varying market regimes. Our findings align with the growing body of literature suggesting that flexible machine learning models can significantly outperform linear factor models in characterizing the complex dynamics of financial markets [68].

From a mechanism perspective, traditional return-based factors primarily capture risk compensation and revisions in investor expectations as implied by realized price movements; thus, their information content is essentially an ex-post reflection of historical shocks. Consequently, when forecasting future covariance matrices, these factors predominantly rely on extrapolating the persistence and mean-reversion patterns of volatility. In contrast, the FS and EPU factors measure deep-seated fluctuations in the macro-policy environment, institutional arrangements, and regulatory expectations. Specifically, the Financial Stress (FS) factor reflects the real-time accumulation of systemic risk by capturing constraints on financial intermediaries’ balance sheets, anomalous fluctuations in credit spreads, and the tightening of market liquidity. According to financial friction theory, when financial stress remains elevated, market risk aversion can undergo a non-linear mutation. This leads previously uncorrelated sectors to exhibit ‘forced co-movement’ driven by shared financing constraints and deleveraging pressures. In academic literature, FS and EPU are regarded as forward-looking macro risk factors capable of anticipating structural shifts in corporate earnings expectations, financing costs, and cross-industry capital flows before prices fully adjust [69,70]. Such anticipatory capacity enables the FS factor to capture phenomena such as abrupt shifts in asset correlations triggered.

4.3. Portfolio Analysis

This section further incorporates the covariance matrices predicted by different models into a minimum variance portfolio (MVP) optimization framework to derive sectoral portfolio weights, and evaluates their performance by computing out-of-sample realized portfolio variance. Table 3 reports both the realized variance and the corresponding portfolio weight allocations, thereby assessing whether improvements in statistical prediction translate into economically meaningful risk reduction. The results indicate that forecasting frameworks incorporating the FS factor significantly reduce out-of-sample realized portfolio variance, confirming their practical value for asset allocation and risk budgeting.

In Panel A (1-period forecast), the SVR-FS path achieves an out-of-sample realized variance of 0.00126, significantly lower than its benchmark counterpart (0.00186), representing an approximate 32% reduction in risk. In contrast, the realized variance of the SVR-EPU path is 0.00742, substantially higher than the benchmark specification, indicating that under this algorithmic framework, the EPU feature does not translate into effective risk mitigation. This difference is consistent with the distinct transmission mechanisms of the two signals: FS more directly captures funding-chain conditions, liquidity constraints, and deleveraging pressures, which tend to synchronously alter inter-industry co-movement intensity and trigger a structural revaluation of the covariance matrix. As a result, it is more readily reflected as stable variance reduction within an MVP framework centered on second-moment optimization. By contrast, EPU primarily influences correlation structures through expectations and policy interpretation channels; its impact is typically more lagged and context-dependent, and under relatively linear mapping frameworks, it may not be effectively transformed into systematic adjustments of the covariance matrix.

At the same time, Table 3 shows that EPU can provide usable incremental information within certain nonlinear ensemble learning models. For example, under the XGBoost framework, the EPU path yields a realized variance of 0.00180, significantly lower than the benchmark value of 0.00293 and close to the FS path (0.00178). Similarly, in LightGBM, the EPU path achieves a variance of 0.00175, again lower than the benchmark (0.00223). These findings suggest that the effectiveness of EPU depends more heavily on nonlinear modeling capacity: when the model is capable of capturing nonlinearities and threshold effects in the relationship between policy uncertainty, sectoral expectations, and correlation adjustments, EPU is more likely to be translated into substantive improvements in covariance forecasting and portfolio risk control. Therefore, based on the evidence in Table 3, the FS specification demonstrates greater stability across models, whereas the benefits of the EPU specification are more model-dependent.

A vertical comparison from Panel A to Panel C further highlights the impact of forecast horizons on factor performance. As the forecasting horizon extends, the variance-control capacity of different specifications begins to diverge. In Panel C (12-period forecast), the CatBoost-FS path achieves a realized variance of 0.00002, maintaining the lowest level under that algorithm. In contrast, the LightGBM-EPU path records a realized variance of 0.00030, which is relatively higher among the three feature specifications. This cross-horizon difference suggests that the informational “half-life” of the two signals may differ: the financing conditions, balance sheet constraints, and liquidity tightening embedded in FS tend to exhibit greater persistence and can influence sectoral co-movement structures over longer windows. By contrast, EPU shocks are more likely to be event-driven and stage-specific, with their direction and magnitude varying across phases, thereby leading to less stable performance in longer-horizon forecasts. The evidence in Table 3 thus indicates that the FS specification exhibits more robust risk-control effects across time.

Further examination of the weight allocations (W) under different feature specifications in Table 3 reveals three distinct asset allocation patterns. Under the benchmark specification, weight distributions in several models display pronounced concentration. For instance, in Panel A, SVR assigns a weight of −0.727 to the Industrials sector, while Ridge allocates 1.051 to the Energy sector, reflecting the tendency of models relying solely on historical return information to generate extreme allocation structures. A plausible mechanism is that when future covariance structures are inferred purely from past price dynamics, models may mechanically extrapolate phase-specific correlations, leading the MVP optimization to overconcentrate exposures in a limited number of sectors.

Under the EPU specification, some extreme positions are partially moderated—for example, in Panel A, SVR adjusts the Industrials weight from −0.727 to −0.620—yet substantial cross-algorithm heterogeneity remains. This indicates that while EPU signals may alter the model’s perception of the risk environment, their transmission requires expectation formation and sector-specific pricing adjustments, and different algorithms extract their structural implications with varying effectiveness.

In contrast, under the FS specification, weight distributions across most models become more compressed and balanced. For example, SVR’s Industrials weight shrinks from −0.727 to 0.022, and Ridge’s Energy weight declines from 1.051 to 0.098. At the same time, allocations to Financials and defensive sectors exhibit more economically balanced patterns. This phenomenon is consistent with the mechanism through which FS reflects systemic funding constraints: when models capture the structural feature of heightened co-movement under stress scenarios, the predicted covariance matrix tends to compress spurious diversification effects, thereby naturally suppressing extreme weights and reducing tail exposure within the MVP optimization. Combined with the realized variance results, the FS specification not only lowers portfolio variance numerically but also reduces extreme risk exposures at the structural level.

Taken together, the realized variance and weight distribution evidence in Table 3 indicate that FS features improve both risk levels and weight stability across most models and forecast horizons. By contrast, the improvements associated with EPU features are more model-dependent and concentrated within algorithms capable of capturing nonlinear mapping relationships. From a mechanism perspective, FS functions as a real-time indicator of systemic stress that can structurally reshape the covariance matrix and directly support risk budgeting, whereas EPU acts more as a forward-looking signal of policy-expectation volatility, whose effectiveness depends on the model’s ability to identify nonlinearities and regime-dependent dynamics.

From a deeper perspective of economic mechanisms, the interplay between the external policy environment and internal financial pressures not only alters the conditional volatility of individual sectors but also triggers a systematic repricing of inter-sectoral correlation structures. Specifically, during periods of acute macroeconomic fluctuations, the ensuing surge in risk aversion and expansion of risk premiums often lead to episodic decoupling or recoupling of correlations between pro-cyclical and defensive industries. Due to the significant heterogeneity across sectors—ranging from regulatory sensitivity and policy dependence to financing constraints—these external shocks manifest as directional distortions of correlation coefficients within the risk matrix. Existing literature further indicates that uncertainties significantly influence not only the absolute variance of the stock market but also the systemic risk exposure by restructuring asset interconnectedness [71,72]. The empirical findings of this study confirm that incorporating these macro-feature paths enables models to identify genuine risk linkages among sectors during stress windows, rather than relying on “spurious correlations” generated by historical price-volume data. This global calibration mechanism guides the investment portfolio toward a fundamental transition from “momentum speculation” to “structural risk hedging,” ultimately constructing a logically consistent framework capable of withstanding systemic collapse.

4.4. Robustness Check

To further ensure the reliability of the empirical findings, this study conducts multi-dimensional stress tests to verify the robustness of the model results. First, a sensitivity analysis of rolling window partitions is performed. To ensure that the empirical conclusions are not dependent on a specific sample splitting method, a sliding window scheme is employed. In this framework, each fold consists of 100 samples, with a sliding step size (steps) set to 25 (Figure 3). This design is intended to cover various economic cycles and market volatility regimes, thereby examining the predictive efficacy of FS and EPU factors across different sample densities and time horizons. Additionally, the training window length was recalibrated from the baseline of 36 months to 24 months to further evaluate the robustness of the results. The predictive performance across these windows exhibits broad consistency with the full-sample results. Second, a robustness check using multiple random seeds is conducted. Given the inherent dependence of non-linear algorithms, such as ensemble learning, on initial random states, all models were retrained and re-evaluated by varying the Random Seeds. The results indicate that core evaluation metrics remain highly stable across different time slices and random initializations. This immunity to sample selection bias and stochastic interference strongly confirms that the reshaping of the risk correlation structure is rooted in the underlying economic logic of the data, rather than being a statistical artifact resulting from the over-fitting of specific algorithmic parameters. Table 4 presents the results of the rolling window analysis for the forecast horizon H = 6, which exhibit high consistency with the primary findings reported above. Due to space constraints, additional robustness test results as well as detailed descriptions of the replicability of the machine learning technique are provided in the Appendix A.

5. Conclusions

This study constructs an integrated framework for risk matrix forecasting and portfolio optimization in the Chinese equity market by transforming Financial Stress (FS) and Economic Policy Uncertainty (EPU) signals into model-ready features. Multiple machine learning algorithms are employed to generate multi-output forecasts of cross-industry covariance structures, and the predicted covariance matrices are subsequently incorporated into a minimum variance portfolio (MVP) framework. In this way, the effectiveness of EPU and FS signals is evaluated from both a forecasting accuracy perspective and a portfolio risk perspective.

The empirical findings indicate that incorporating FS and EPU features improves the characterization of cross-industry correlation structures and enhances risk forecasting performance to varying degrees; however, the stability and applicability of the two signals differ substantially. Overall, the FS feature delivers more consistent error reductions across most models and forecasting horizons and more stably lowers out-of-sample realized portfolio variance. This suggests that the information embedded in FS—such as liquidity constraints, financing conditions, and endogenous systemic stress—can be more directly mapped into shifts in sectoral co-movement structures. In contrast, the improvements associated with EPU are more concentrated in short-term forecasts and exhibit clear model dependence. Under certain linear or weakly nonlinear frameworks, its effects on prediction error and portfolio variance are unstable; yet within some nonlinear ensemble models, EPU significantly improves realized portfolio variance. This pattern indicates that policy uncertainty signals are more likely to influence sectoral correlations through nonlinear and threshold effects, and their incremental value is more effectively extracted in algorithms with stronger nonlinear representation capacity.

At the asset allocation level, the MVP constructed from the predicted covariance matrices further reveals how macro signals reshape portfolio weights. Under benchmark specifications relying solely on historical return information, several models generate highly concentrated industry exposures, reflecting extreme weight allocations in selected sectors. After incorporating FS signals, weight distributions across most models become more compressed and balanced, with extreme positions systematically moderated and realized variance more consistently reduced. This suggests that FS signals help mitigate the concentrated exposures arising from mechanically extrapolating historical correlation structures, aligning sector weights more closely with genuine co-movement patterns under stress conditions and enhancing the portfolio’s out-of-sample risk resilience. By comparison, the impact of EPU on weight structures is more context-dependent: in some models, it moderates extreme exposures and improves risk outcomes, but its effects vary substantially across algorithms. This reflects the fact that EPU operates through expectation formation and sector-specific pricing adjustments and therefore requires suitable model structures to effectively extract its informational content.

Based on the findings above, the practical implications of this study primarily lie in the alignment of signals, modeling frameworks, and allocation strategies. First, sector allocation and risk budgeting should incorporate signals reflecting endogenous financial system stress as macro calibration inputs, especially during periods of market volatility or stress, to correct extreme positions and risk underestimation that arise from relying solely on historical extrapolation. Second, modeling strategies should differentiate according to the characteristics of the signals: for stress-related signals with stronger cross-period stability (such as FS), structural and smoothed feature extraction should be emphasized to enhance consistency in medium- to long-term allocation logic; for policy uncertainty signals with stronger nonlinearity and regime-dependent dynamics (such as EPU), modeling frameworks capable of capturing nonlinear and threshold effects are more appropriate for identifying short-term distortions in sectoral correlations and implementing targeted hedging strategies. Overall, the evidence supports embedding macro uncertainty signals into risk matrix forecasting and portfolio optimization processes, improving the identification of sectoral dependence structures, and enhancing the robustness of asset allocation under complex macroeconomic conditions. Additionally, this framework provides actionable guidance for dynamic risk calibration in portfolio management and industry linkage monitoring in systemic risk: in portfolio management, FS/EPU-driven forecasted covariance matrices can be used to regularly update risk budgets and minimum variance weights, with increased rebalancing frequency during periods of stress; in systemic risk monitoring, continuous tracking of industry correlation and covariance concentration indicators can be used as triggers for stress testing and risk warning.

This study primarily focuses on sector-level data from the equity market, which may limit the direct generalizability of our conclusions to other asset classes such as commodities, bonds, and cryptocurrencies. These assets differ substantially in pricing mechanisms, term-structure features, liquidity constraints, and risk drivers, so the transmission strength and timing of EPU/FS to their covariance structures may not be identical. Future research can extend our framework by incorporating cross-asset data and developing a multi-asset risk-matrix forecasting and portfolio optimization system across equities, bonds, commodities, and cryptocurrencies to further assess the applicability and robustness of macro-uncertainty signals in cross-market allocation.

Author Contributions

Conceptualization, J.D. and W.C.; methodology, J.D.; software, J.D.; validation, J.D., W.C. and Z.W.; formal analysis, J.D.; investigation, W.C.; resources, Z.W.; data curation, W.C. and Z.W.; writing—original draft preparation, J.D.; writing—review and editing, J.D., W.C. and Z.W.; visualization, J.D.; supervision, W.C. and Z.W.; project administration, W.C.; funding acquisition, Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Acknowledgments

We thank the funders of this project School of Cyber Science and Engineering, Wuhan University, Department of Mathematics, King’s College London and all the teams and individuals who supported this research.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

(1): Computational Environment and Software Configuration

All experiments in this study were conducted on a Windows 11 (64-bit) operating system, with Python version 3.12.7 (64-bit). Graphical and interface-related dependencies include Qt 5.15.2 and PyQt5 5.15.10. The hardware environment consists of 64 GB RAM and an NVIDIA 5080 GPU (NVIDIA Corporation, Santa Clara, CA, USA). Model implementations are based on publicly available standard libraries, including scikit-learn, XGBoost, LightGBM, and CatBoost. Data processing and computational procedures follow deterministic index-based operations (without random shuffling). The training and testing sets are divided using a fixed 80–20% split rule. Forecasting is conducted under a rolling-window framework (window = 36) with multiple forecast horizons (horizon = 1, 6, 12).

(2): Random Seed Control and Hyperparameter Specification

All models involving stochastic components explicitly fix random seeds to ensure reproducibility. The specific settings are as follows:

RandomForestRegressor (n_estimators = 50, random_state = 42, n_jobs = −1); AdaBoostRegressor (n_estimators = 50, random_state = 42); XGBRegressor (n_estimators = 50, random_state = 42, n_jobs = −1); LightGBM (n_estimators = 50, random_state = 42, verbose = −1); CatBoostRegressor (n_estimators = 50, random_state = 42, verbose = 0); ExtraTreesRegressor (n_estimators = 50, max_depth = 5, random_state = 42, n_jobs = −1); GaussianProcessRegressor (kernel = C(1.0, (1.0 × 10⁻³, 1.0 × 10³)) × RBF(1.0, (1.0 × 10⁻², 1.0 × 10²)) + WhiteKernel(noise_level = 1.0 × 10⁻⁵), n_restarts_optimizer = 10, random_state = 42); SVR (kernel = ‘rbf’, C = 10, epsilon = 0.01, gamma = ‘auto’); Lasso (alpha = 0.01); Ridge and BayesianRidge use default parameters; ELMRegressor is specified with n_hidden = 50 and generates input weights and biases using np.random.RandomState with fixed random seeds (42 or 123).

References

Phan, D.H.B.; Iyke, B.N.; Sharma, S.S.; Affandi, Y. Economic policy uncertainty and financial stability—Is there a relation? Econ. Model. 2021, 94, 1018–1029. [Google Scholar] [CrossRef]
Wang, L.; Qu, X.; Wang, Y.; Lu, J.; Wang, Q. Economic policy uncertainty, related-party transactions and corporate liquidity. Financ. Res. Lett. 2025, 76, 107010. [Google Scholar] [CrossRef]
Sergeyev, D.; Lian, C.; Gorodnichenko, Y. The economics of financial stress. Rev. Econ. Stud. 2025, 92, 3403–3437. [Google Scholar] [CrossRef]
Baker, S.R.; Bloom, N.; Davis, S.J. Measuring economic policy uncertainty. Q. J. Econ. 2016, 131, 1593–1636. [Google Scholar] [CrossRef]
Cardarelli, R.; Elekdag, S.; Lall, S. Financial stress and economic contractions. J. Financ. Stab. 2011, 7, 78–97. [Google Scholar] [CrossRef]
Anderson, E.W.; Ghysels, E.; Juergens, J.L. Do heterogeneous beliefs matter for asset pricing? Rev. Financ. Stud. 2005, 18, 875–924. [Google Scholar] [CrossRef]
Awosusi, A.A. How do economic policy uncertainty, climate policy uncertainty and us monetary policy uncertainty shape investors’ sentiment in the green market? Appl. Econ. 2025, 1–19. [Google Scholar] [CrossRef]
Allen, F.; Gale, D.M. Financial contagion. J. Political Econ. 2000, 108, 1–33. [Google Scholar] [CrossRef]
Brogaard, J.; Detzel, A. The asset-pricing implications of government economic policy uncertainty. Manag. Sci. 2015, 61, 3–18. [Google Scholar] [CrossRef]
Liang, C.; Luo, Q.; Li, Y.; Huynh, L.D.T. Global financial stress index and long-term volatility forecast for international stock markets. J. Int. Financ. Mark. Inst. Money 2023, 88, 101825. [Google Scholar] [CrossRef]
Tsai, I.-C. The source of global stock market risk: A viewpoint of economic policy uncertainty. Econ. Model. 2017, 60, 122–131. [Google Scholar] [CrossRef]
Taieb, S.B.; Sorjamaa, A.; Bontempi, G. Multiple-output modeling for multi-step-ahead time series forecasting. Neurocomputing 2010, 73, 1950–1957. [Google Scholar] [CrossRef]
Xu, D.; Shi, Y.; Tsang, I.W.; Ong, Y.-S.; Gong, C.; Shen, X. Survey on multi-output learning. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 2409–2429. [Google Scholar] [CrossRef]
Sánchez-Fernández, M.; de-Prado-Cumplido, M.; Arenas-García, J.; Pérez-Cruz, F. Svm multiregression for nonlinear channel estimation in multiple-input multiple-output systems. IEEE Trans. Signal Process. 2004, 52, 2298–2307. [Google Scholar] [CrossRef]
Hao, Q.; Li, A. Economic policy uncertainty and institutional portfolio investment. Financ. Manag. 2025, 54, 271–304. [Google Scholar] [CrossRef]
Lee, K.; Jeon, Y.; Jo, C. Chinese economic policy uncertainty and us households’ portfolio decisions. Pac.-Basin Financ. J. 2020, 64, 101452. [Google Scholar] [CrossRef]
Mezghani, T.; Boujelbène-Abbes, M. Financial stress effects on financial markets: Dynamic connectedness and portfolio hedging. Int. J. Emerg. Mark. 2023, 18, 4064–4087. [Google Scholar] [CrossRef]
Younis, I.; Gupta, H.; Shah, W.U.; Sharif, A.; Tang, X.; Amman, H. The effects of economic uncertainty and trade policy uncertainty on industry-specific stock markets equity. Comput. Econ. 2024, 64, 2909–2933. [Google Scholar] [CrossRef]
Liu, M.; Choo, W.C.; Lee, C.C.; Lee, C.C. Trading volume and realized volatility forecasting: Evidence from the China stock market. J. Forecast. 2023, 42, 76–100. [Google Scholar] [CrossRef]
Dai, Z.; Zhu, H.; Zhang, X. Dynamic spillover effects and portfolio strategies between crude oil, gold and Chinese stock markets related to new energy vehicle. Energy Econ. 2022, 109, 105959. [Google Scholar] [CrossRef]
Ma, Y.; Wang, Z.; He, F. How do economic policy uncertainties affect stock market volatility? Evidence from g7 countries. Int. J. Financ. Econ. 2022, 27, 2303–2325. [Google Scholar] [CrossRef]
Ruan, Q.; Zhang, J.; Lv, D. Forecasting exchange rate volatility: Is economic policy uncertainty better? Appl. Econ. 2024, 56, 1526–1544. [Google Scholar] [CrossRef]
Jia, L.; Xu, R.; Wu, J.; Song, M.; Chen, X. Impacts of geopolitical risk and economic policy uncertainty on metal futures price volatility: Evidence from China. Resour. Policy 2023, 87, 104328. [Google Scholar] [CrossRef]
Ye, C.; Ou, H.; Basile, V.; Bhuiyan, M.A. The effect of uncertainty index based on sparse method on volatility prediction of stock market. Expert Syst. Appl. 2025, 290, 128208. [Google Scholar] [CrossRef]
Bretscher, L.; Hsu, A.; Tamoni, A. The real response to uncertainty shocks: The risk premium channel. Manag. Sci. 2023, 69, 119–140. [Google Scholar] [CrossRef]
Shahzad, S.J.H.; Raza, N.; Balcilar, M.; Ali, S.; Shahbaz, M. Can economic policy uncertainty and investors sentiment predict commodities returns and volatility? Resour. Policy 2017, 53, 208–218. [Google Scholar] [CrossRef]
Salisu, A.A.; Demirer, R.; Gupta, R. Policy uncertainty and stock market volatility revisited: The predictive role of signal quality. J. Forecast. 2023, 42, 2307–2321. [Google Scholar] [CrossRef]
Li, X.; Wei, Y.; Chen, X.; Ma, F.; Liang, C.; Chen, W. Which uncertainty is powerful to forecast crude oil market volatility? New evidence. Int. J. Financ. Econ. 2022, 27, 4279–4297. [Google Scholar] [CrossRef]
Adjei, A.N.K.; Tweneboah, G.; Owusu Junior, P. Economic policy uncertainty and spillovers in selected emerging market economies: Time-and frequency-domain approach. J. Financ. Econ. Policy 2025, 17, 1–28. [Google Scholar] [CrossRef]
Li, A.; Zhong, B. Asymmetric spillover connectedness between clean energy markets and industrial stock markets: How uncertainties affect it. PLoS ONE 2025, 20, e0316171. [Google Scholar] [CrossRef]
Demiralay, S.; Golitsis, P.; Fassas, A.; Chen, J. Hedging effectiveness of alternative assets: Evidence from range-based dynamic correlation models. Stud. Econ. Financ. 2025, 43, 30–58. [Google Scholar] [CrossRef]
Papathanasiou, S.; Kenourgios, D.; Koutsokostas, D.; Pergeris, G. Can treasury inflation-protected securities safeguard investors from outward risk spillovers? A portfolio hedging strategy through the prism of COVID-19. J. Asset Manag. 2022, 24, 198. [Google Scholar] [CrossRef]
Gao, W.; Jin, X.; Zhang, H.; He, M. The asymmetric response of higher-order moments of precious metals to energy shocks and financial stresses: Evidence from time-frequency connectedness approach. Energy Econ. 2025, 142, 108171. [Google Scholar] [CrossRef]
Zhang, H.; Wang, P. Does bitcoin or gold react to financial stress alike? Evidence from the Us and China. Int. Rev. Econ. Financ. 2021, 71, 629–648. [Google Scholar] [CrossRef]
Wu, N.; Zhang, Z.; Lin, B. Responses of financial stress and monetary policy to global warming: Evidence from China. Int. Rev. Financ. Anal. 2024, 92, 103092. [Google Scholar] [CrossRef]
Duca, M.L.; Peltonen, T.A. Assessing systemic risks and predicting systemic events. J. Bank. Financ. 2013, 37, 2183–2195. [Google Scholar] [CrossRef]
Hubrich, K.; Tetlow, R.J. Financial stress and economic dynamics: The transmission of crises. J. Monet. Econ. 2015, 70, 100–115. [Google Scholar] [CrossRef]
Oet, M.V.; Dooley, J.M.; Ong, S.J. The financial stress index: Identification of systemic risk conditions. Risks 2015, 3, 420–444. [Google Scholar] [CrossRef]
Kaminsky, G.L.; Reinhart, C.M. Financial markets in times of stress. J. Dev. Econ. 2002, 69, 451–470. [Google Scholar] [CrossRef][Green Version]
Huang, Z.; Zhu, H.; Deng, X.; Zeng, T. Time-frequency tail risk spillover between esg climate and high-carbon assets: The role of economic policy uncertainty and financial stress. Financ. Res. Lett. 2024, 67, 105866. [Google Scholar] [CrossRef]
Ranković, V.; Drenovak, M.; Urosevic, B.; Jelic, R. Mean-univariate garch var portfolio optimization: Actual portfolio approach. Comput. Oper. Res. 2016, 72, 83–92. [Google Scholar] [CrossRef]
Huang, J.-J.; Lee, K.-J.; Liang, H.; Lin, W.-F. Estimating value at risk of portfolio by conditional copula-garch method. Insur. Math. Econ. 2009, 45, 315–324. [Google Scholar] [CrossRef]
Sahamkhadam, M.; Stephan, A.; Östermark, R. Portfolio optimization based on GARCH-EVT-Copula forecasting models. Int. J. Forecast. 2018, 34, 497–506. [Google Scholar] [CrossRef]
Jin, X.; Maheu, J.M. Modeling realized covariances and returns. J. Financ. Econom. 2013, 11, 335–369. [Google Scholar] [CrossRef]
Callot, L.A.; Kock, A.B.; Medeiros, M.C. Modeling and forecasting large realized covariance matrices and portfolio choice. J. Appl. Econom. 2017, 32, 140–158. [Google Scholar] [CrossRef]
Bollerslev, T.; Patton, A.J.; Quaedvlieg, R. Modeling and forecasting (un) reliable realized covariances for more reliable financial decisions. J. Econom. 2018, 207, 71–91. [Google Scholar] [CrossRef]
Torres, R.; Villena, M. On the empirical performance of different covariance-matrix forecasting methods. Neural Comput. Appl. 2024, 36, 9503–9524. [Google Scholar] [CrossRef]
Leccadito, A.; Staino, A.; Toscano, P. A novel robust method for estimating the covariance matrix of financial returns with applications to risk management. Financ. Innov. 2024, 10, 116. [Google Scholar] [CrossRef]
O’Brien, M.; Velasco, S. Macro-financial imbalances and cyclical systemic risk dynamics: Understanding the factors driving the financial cycle in the presence of non-linearities. Macroecon. Dyn. 2025, 29, e13. [Google Scholar] [CrossRef]
Alessandri, P.; Gazzani, A.; Vicondoa, A. Are the effects of uncertainty shocks big or small? Eur. Econ. Rev. 2023, 158, 104525. [Google Scholar] [CrossRef]
Engle, R.F. Dynamic conditional beta. J. Financ. Econom. 2016, 14, 643–667. [Google Scholar] [CrossRef]
Awad, M.; Khanna, R. Support vector regression. In Efficient Learning Machines: Theories, concepts, and Applications for Engineers and System Designers; Apress: Berkeley, CA, USA, 2015; pp. 67–80. [Google Scholar] [CrossRef]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Schapire, R.E. Explaining AdaBoost. In Empirical Inference: Festschrift in Honor of Vladimir N. Vapnik; Springer: Berlin/Heidelberg, Germany, 2013; pp. 37–52. [Google Scholar]
Chen, T.; He, T.; Benesty, M.; Khotilovich, V.; Tang, Y.; Cho, H.; Chen, K.; Mitchell, R.; Cano, I.; Zhou, T. Xgboost: Extreme Gradient Boosting; R Package, Version 04-2. Comprehensive R Archive Network (CRAN) Online. 2015. Available online: https://CRAN.R-project.org/package=xgboost (accessed on 1 February 2026).
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2017; p. 30. [Google Scholar]
Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. Catboost: Unbiased boosting with categorical features. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2018; p. 31. [Google Scholar]
Huang, G.-B.; Zhu, Q.-Y.; Siew, C.-K. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
Hoerl, A.E.; Kennard, R.W. Ridge regression: Applications to nonorthogonal problems. Technometrics 1970, 12, 69–82. [Google Scholar] [CrossRef]
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef]
Schulz, E.; Speekenbrink, M.; Krause, A. A tutorial on gaussian process regression: Modelling, exploring, and exploiting functions. J. Math. Psychol. 2018, 85, 1–16. [Google Scholar] [CrossRef]
Patton, A.J. Volatility forecast comparison using imperfect volatility proxies. J. Econom. 2011, 160, 246–256. [Google Scholar] [CrossRef]
Hodson, T.O. Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not. Geosci. Model Dev. Discuss. 2022, 2022, 1–10. [Google Scholar] [CrossRef]
De Myttenaere, A.; Golden, B.; Le Grand, B.; Rossi, F. Mean absolute percentage error for regression models. Neurocomputing 2016, 192, 38–48. [Google Scholar] [CrossRef]
Clarke, R.; De Silva, H.; Thorley, S. Minimum-variance portfolios in the us equity market. J. Portf. Manag. 2006, 33, 10. [Google Scholar] [CrossRef]
Wang, C. Stock return prediction with multiple measures using neural network models. Financ. Innov. 2024, 10, 72. [Google Scholar] [CrossRef]
Gu, S.; Kelly, B.; Xiu, D. Empirical asset pricing via machine learning. Rev. Financ. Stud. 2020, 33, 2223–2273. [Google Scholar] [CrossRef]
Chen, J.; Ma, F.; Qiu, X.; Li, T. The role of categorical epu indices in predicting stock-market returns. Int. Rev. Econ. Financ. 2023, 87, 365–378. [Google Scholar] [CrossRef]
Junttila, J.; Vataja, J. Economic policy uncertainty effects for forecasting future real economic activity. Econ. Syst. 2018, 42, 569–583. [Google Scholar] [CrossRef]
Rigotti, L.; Shannon, C. Uncertainty and risk in financial markets. Econometrica 2005, 73, 203–243. [Google Scholar] [CrossRef]
Lyu, Y.; Yi, H.; Yang, M.; Zou, Y.; Li, D.; Qin, Z. Financial uncertainty shocks and systemic risk: Revealing the risk spillover from the oil market to the stock market. Appl. Energy 2025, 382, 125311. [Google Scholar] [CrossRef]

Figure 1. Time-varying beta series. (a)

b_{i, t}^{EPU}

. (b)

b_{i, t}^{FS}

.

Figure 1. Time-varying beta series. (a)

b_{i, t}^{EPU}

. (b)

b_{i, t}^{FS}

.

Figure 2. Methodological framework.

Figure 3. Rolling verification process.

Table 1. Data descriptive.

Variable	Mean	SD	Skewness	Kurtosis	Min	Max	ADF	PP
Energy	0.0044	0.0891	−0.6328	5.7665	−0.4101	0.2500	−14.265 ***	−14.393 ***
Materials	0.0070	0.0981	−0.5430	4.5692	−0.3549	0.2730	−13.710 ***	−13.772 ***
Industrials	0.0066	0.0878	−0.4721	5.1414	−0.3564	0.2655	−13.450 ***	−13.544 ***
Consumer Discretionary	0.0084	0.0874	−0.3958	5.3501	−0.3559	0.3172	−13.011 ***	−13.091 ***
Consumer Staples	0.0095	0.0824	−0.4854	4.1433	−0.2842	0.2277	−14.427 ***	−14.444 ***
HealthCare	0.0101	0.0870	−0.2701	4.4309	−0.3175	0.3122	−14.457 ***	−14.474 ***
Financials	0.0080	0.0842	−0.0810	5.4433	−0.3249	0.3175	−13.338 ***	−13.444 ***
Information Technology	0.0098	0.1022	−0.4349	4.5332	−0.4004	0.2812	−13.806 ***	−13.779 ***
Communication Services	0.0062	0.0943	−0.3352	7.2679	−0.4205	0.3938	−14.509 ***	−14.567 ***
Utilities	0.0063	0.0780	−0.1664	5.9358	−0.3233	0.3250	−14.909 ***	−15.010 ***
RealEstate	0.0039	0.0994	0.1198	4.4751	−0.3416	0.3149	−14.031 ***	−14.172 ***
EPU	0.0126	0.3570	0.2356	3.2424	−0.9661	1.1445	−24.519 ***	−30.654 ***
FS	0.0841	0.0942	2.3509	9.8916	0.0033	0.5955	−4.323 ***	−4.591 ***

*** imply the statistical significance level of 1%.

Table 2. Evaluation of model predictions.

Model	Feature	MSE	RMSE	MAE	MAPE
Panel A: 1-period
SVR	EPU	0.005145	0.071730	0.052141	2.549732
	FS	0.004013	0.063346	0.046395	1.804270
	benchmark	0.004610	0.067900	0.051310	2.464716
RF	EPU	0.004265	0.065310	0.048273	2.030137
	FS	0.004186	0.064702	0.046482	1.599939
	benchmark	0.004324	0.065758	0.047556	2.094928
AdaBoost	EPU	0.004727	0.068755	0.051101	2.408522
	FS	0.004388	0.066241	0.048019	1.974238
	benchmark	0.004421	0.066487	0.049292	2.310125
XGBoost	EPU	0.005922	0.076955	0.059317	3.387983
	FS	0.004982	0.070580	0.052359	3.458025
	benchmark	0.005577	0.074678	0.055194	2.773075
LightGBM	EPU	0.004823	0.069449	0.051009	2.986291
	FS	0.004986	0.070613	0.051772	2.357543
	benchmark	0.004612	0.067910	0.050782	2.694265
ELM	EPU	0.006265	0.079150	0.062709	5.021667
	FS	0.008906	0.094371	0.070217	4.518472
	benchmark	0.006851	0.082770	0.063109	3.887008
Ridge	EPU	0.010665	0.103270	0.078449	5.209488
	FS	0.009898	0.099486	0.080004	6.507205
	benchmark	0.011141	0.105553	0.083336	7.330348
Lasso	EPU	0.004150	0.064418	0.046717	1.536297
	FS	0.004358	0.066016	0.047809	1.796620
	benchmark	0.004199	0.064796	0.047249	1.708908
BayesianRidge	EPU	0.013542	0.116368	0.088853	6.278818
	FS	0.010214	0.101062	0.081250	6.668407
	benchmark	0.011792	0.108592	0.085790	7.620073
GPR	EPU	0.004077	0.063849	0.045399	1.114777
	FS	0.004045	0.063602	0.045371	1.098367
	benchmark	0.004109	0.064100	0.045884	1.128547
ExtraTrees	EPU	0.004233	0.065058	0.046744	1.342984
	FS	0.004194	0.064760	0.046675	1.479477
	benchmark	0.004247	0.065166	0.047534	2.000153
CatBoost	EPU	0.004919	0.070139	0.052690	2.672598
	FS	0.004601	0.067832	0.049942	2.896800
	benchmark	0.004609	0.067890	0.051327	2.776865
Panel B: 6-period	Feature	MSE	RMSE	MAE	MAPE
SVR	EPU	0.005333	0.073027	0.054215	3.165859
	FS	0.004146	0.064389	0.047242	1.944144
	benchmark	0.004641	0.068122	0.051419	2.720944
RF	EPU	0.004442	0.066651	0.049714	2.860055
	FS	0.004299	0.065569	0.046878	1.774717
	benchmark	0.004524	0.067260	0.049042	2.121421
AdaBoost	EPU	0.004805	0.069316	0.051098	2.639021
	FS	0.004469	0.066849	0.049240	2.033002
	benchmark	0.004561	0.067538	0.049878	2.514272
XGBoost	EPU	0.006202	0.078751	0.058455	3.507845
	FS	0.004900	0.070001	0.051770	3.023800
	benchmark	0.005575	0.074669	0.056142	3.797553
LightGBM	EPU	0.005269	0.072586	0.053123	3.059856
	FS	0.004774	0.069097	0.051112	2.728685
	benchmark	0.004652	0.068206	0.050862	2.754102
ELM	EPU	0.007548	0.086882	0.066133	5.569471
	FS	0.007930	0.089049	0.067642	5.370794
	benchmark	0.007218	0.084958	0.065310	5.001222
Ridge	EPU	0.013667	0.116906	0.093119	7.720820
	FS	0.008807	0.093843	0.073823	6.451602
	benchmark	0.011113	0.105416	0.082058	6.980386
Lasso	EPU	0.004201	0.064811	0.047122	1.640506
	FS	0.004489	0.066997	0.048489	1.796050
	benchmark	0.004308	0.065639	0.047779	1.699343
BayesianRidge	EPU	0.015920	0.126174	0.100919	8.644365
	FS	0.009071	0.095240	0.075066	6.620043
	benchmark	0.011663	0.107993	0.084117	7.243136
GPR	EPU	0.004186	0.064700	0.046388	1.255501
	FS	0.004127	0.064243	0.046044	1.107867
	benchmark	0.004212	0.064903	0.046615	1.195267
ExtraTrees	EPU	0.004175	0.064616	0.046337	1.372894
	FS	0.004179	0.064643	0.046312	1.365704
	benchmark	0.004257	0.065248	0.047196	1.565862
CatBoost	EPU	0.005116	0.071527	0.053193	3.177933
	FS	0.004635	0.068081	0.049651	2.260024
	benchmark	0.004717	0.068681	0.051689	3.030385
Panel C: 12-period	Feature	MSE	RMSE	MAE	MAPE
SVR	EPU	0.005473	0.073982	0.054951	3.494224
	FS	0.004326	0.065776	0.048429	1.874586
	benchmark	0.004915	0.070109	0.053301	2.751202
RF	EPU	0.004746	0.068893	0.051821	2.778744
	FS	0.004574	0.067629	0.049048	2.128297
	benchmark	0.004910	0.070069	0.052057	2.286216
AdaBoost	EPU	0.005134	0.071650	0.052965	2.536894
	FS	0.004773	0.069084	0.051085	2.147255
	benchmark	0.004743	0.068867	0.051182	2.313904
XGBoost	EPU	0.006187	0.078659	0.058495	3.620144
	FS	0.005405	0.073519	0.054404	3.438061
	benchmark	0.005733	0.075714	0.057196	3.546748
LightGBM	EPU	0.005549	0.074491	0.054400	3.027736
	FS	0.005050	0.071066	0.052987	2.772777
	benchmark	0.004999	0.070703	0.053020	2.973662
ELM	EPU	0.008149	0.090272	0.069860	5.913705
	FS	0.008427	0.091801	0.070553	5.076785
	benchmark	0.007759	0.088086	0.067924	5.366021
Ridge	EPU	0.013188	0.114841	0.091150	8.368397
	FS	0.008980	0.094764	0.075301	6.837844
	benchmark	0.011463	0.107065	0.084222	7.152406
Lasso	EPU	0.004488	0.066994	0.049038	1.740371
	FS	0.004720	0.068703	0.049848	1.660265
	benchmark	0.004561	0.067533	0.049566	1.724404
BayesianRidge	EPU	0.014895	0.122045	0.096918	9.130487
	FS	0.009200	0.095918	0.076285	6.972025
	benchmark	0.012043	0.109740	0.086344	7.447165
GPR	EPU	0.004439	0.066625	0.048127	1.381115
	FS	0.004351	0.065961	0.047614	1.097768
	benchmark	0.004449	0.066702	0.048281	1.253879
ExtraTrees	EPU	0.004390	0.066257	0.048018	1.508622
	FS	0.004423	0.066509	0.048035	1.188682
	benchmark	0.004463	0.066809	0.048582	1.316968
CatBoost	EPU	0.005245	0.072424	0.053976	2.921081
	FS	0.004745	0.068887	0.050917	2.622902
	benchmark	0.005030	0.070919	0.053020	2.780529

Notes: Bold values indicate the best performance.

Table 3. Out-of-sample predictive performance for portfolio realized variance and portfolio weights.

Model	Feature	Realized Variance	Portfolio Weights
Model	Feature	Realized Variance	W_Energy	W_Materials	W_Industrials	W_Consumer Discretionary	W_Consumer Staples	W_Health Care	W_Financials	W_Information Technology	W_Communication Services	W_Utilities	W_Real Estate
Panel A: 1-period
SVR	EPU	0.00742	0.17470	−0.03113	−0.62069	0.48599	0.26564	0.33896	0.33053	0.07552	−0.22257	−0.30294	0.50599
	FS	0.00126	−0.02431	−0.13727	0.02232	−0.20046	0.19747	−0.03390	0.36551	0.06066	0.05023	0.75643	−0.05669
	benchmark	0.00186	0.14090	0.35613	−0.72719	−0.22218	0.36212	0.26256	0.17229	0.19408	−0.03319	0.56676	−0.07226
RF	EPU	0.00444	0.83584	−0.50812	0.32506	−0.09355	0.26249	0.55824	0.29046	−0.55109	0.14285	0.14311	−0.40529
	FS	0.00189	0.33156	−0.23352	−0.65523	0.04424	−0.01120	0.03950	0.73768	0.51949	−0.00268	0.60942	−0.37927
	benchmark	0.00735	0.47585	−0.06727	0.27654	−0.48575	−0.32460	0.50419	0.84552	−0.39138	−0.34872	1.27006	−0.75443
AdaBoost	EPU	0.00259	0.58928	−0.16087	0.03205	0.12991	0.50767	0.03611	−0.00201	−0.01273	−0.01137	−0.00355	−0.10449
	FS	0.00158	0.16524	0.08313	−0.12227	−0.13079	0.23643	−0.04493	0.05812	0.16086	0.18853	0.35501	0.05066
	benchmark	0.00255	−0.08601	−0.00566	0.00561	0.07178	0.09601	0.01712	0.72686	−0.01450	−0.06608	0.24950	0.00537
XGBoost	EPU	0.00180	0.23156	−0.09115	0.12046	0.14399	0.14385	−0.12993	0.03806	0.02806	0.16390	0.24734	0.10386
	FS	0.00178	0.10988	−0.04511	0.21618	−0.04898	0.10402	−0.00545	0.15590	0.14220	0.18129	0.16538	0.02468
	benchmark	0.00293	0.27370	0.13448	−0.17674	0.01399	0.18790	0.38650	−0.01969	0.06687	−0.23345	0.21777	0.14867
LightGBM	EPU	0.00175	0.35280	−0.12462	0.07275	−0.15723	−0.04167	0.30234	0.33440	−0.01308	−0.03254	0.23345	0.07339
	FS	0.00138	0.15558	−0.17659	0.02134	−0.04722	0.10196	0.18317	0.21634	0.02298	0.22523	0.28432	0.01287
	benchmark	0.00223	0.08096	−0.10965	0.10842	0.34802	0.20972	−0.04569	0.18460	0.03853	0.10287	0.09396	−0.01174
ELM	EPU	0.00204	0.65155	−0.11274	−0.31318	0.20989	0.18189	−0.10458	0.18490	0.31528	−0.39046	0.63394	−0.25649
	FS	0.00304	0.17642	0.16420	−0.99714	−0.22869	0.11569	0.09329	1.05877	0.55224	−0.47755	0.72060	−0.17781
	benchmark	0.00202	0.17505	−0.42747	−0.21666	0.52266	−0.17503	0.10161	0.78026	0.00512	−0.08732	0.51394	−0.19216
Ridge	EPU	0.00335	0.74928	−0.66264	−0.03455	0.54495	0.28836	0.13199	0.04988	−0.13604	0.23303	−0.19869	0.03443
	FS	0.00393	0.09866	0.22538	−0.81404	0.59068	0.10696	0.43320	−0.02758	−0.44181	−0.06448	0.63369	0.25934
	benchmark	0.00553	1.05129	−0.45215	−0.38632	0.65125	−0.07099	0.55668	0.31883	−0.01915	−0.58425	−0.27818	0.21299
Lasso	EPU	0.00413	0.18355	−0.14184	−0.27946	0.31063	0.18969	0.23108	0.55763	−0.15396	0.02027	−0.11310	0.19551
	FS	0.00223	−0.03597	0.21941	−0.05676	−0.35708	0.06886	0.11528	0.71464	0.05646	−0.07866	0.34322	0.01060
	benchmark	0.00221	−0.30899	0.00633	−0.15977	−0.30589	0.14387	0.27508	0.61348	−0.06944	0.19130	0.53563	0.07839
BayesianRidge	EPU	0.00360	0.82115	−0.81851	−0.05632	0.70764	0.27443	−0.01448	0.05385	−0.09007	0.21936	−0.12503	0.02799
	FS	0.00405	0.07553	0.23875	−0.85727	0.61031	0.07417	0.47482	−0.02442	−0.46059	−0.07269	0.66009	0.28130
	benchmark	0.00567	1.04890	−0.46290	−0.38698	0.63273	−0.06322	0.56474	0.31318	−0.02626	−0.58823	−0.27600	0.24403
GPR	EPU	0.00313	0.07208	−0.00153	−0.07333	−0.01668	0.13538	0.19394	0.19776	0.41438	0.06606	−0.00143	0.01337
	FS	0.00218	0.23760	0.01044	0.00816	−0.01580	0.03105	0.11202	0.12581	−0.02500	0.04052	0.23760	0.23759
	benchmark	0.00207	0.10797	0.20340	0.03065	0.16651	0.01112	−0.01272	0.00086	−0.01563	0.46786	0.04203	−0.00205
ExtraTrees	EPU	0.00886	0.48554	−0.86709	−0.98362	−0.03591	0.35016	1.20449	1.04068	−0.30971	−0.14165	0.49887	−0.24177
	FS	0.00888	0.60572	−1.13815	−1.00340	0.36155	1.00168	0.56989	0.66581	0.01244	−0.36378	0.52969	−0.24144
	benchmark	0.00975	0.67668	−0.72346	−0.68244	−0.41306	0.83909	0.85995	0.68560	−0.41925	−0.12029	0.36584	−0.06865
CatBoost	EPU	0.00202	0.22842	−0.09755	0.16631	0.03380	0.20245	0.08410	0.01593	0.01182	0.15811	0.09671	0.09991
	FS	0.00194	0.03993	0.17846	0.01590	0.11635	−0.01403	0.04915	0.12127	0.02069	0.21567	0.19651	0.06009
	benchmark	0.00231	0.14712	0.02351	0.06517	0.06420	0.09547	0.03528	0.25538	0.11428	0.02094	0.09163	0.08702
Panel B: 6-period
SVR	EPU	0.00031	0.51843	−0.84016	−0.11080	1.08388	−0.26849	0.88692	0.38780	−0.07992	−0.63045	0.09155	−0.03875
	FS	0.00008	0.33022	−0.23329	−0.00651	−0.10703	0.06602	0.05546	0.30953	0.30688	0.02804	0.43947	−0.18880
	benchmark	0.00012	0.40175	−0.14844	−0.75032	0.12431	−0.15115	0.83123	0.14785	0.17869	−0.18490	0.38157	0.16943
RF	EPU	0.00081	1.28703	−0.63041	−1.34473	0.12284	0.19984	0.96877	0.94259	0.45718	−0.11346	0.02762	−0.91727
	FS	0.00028	1.09313	−0.36874	−0.28756	−0.16121	0.17449	0.52385	−0.12682	0.45307	0.06350	0.02635	−0.39007
	benchmark	0.00014	0.54251	−0.37923	−1.01597	0.13515	0.18491	0.68494	0.44576	0.35863	−0.29449	0.42760	−0.08981
AdaBoost	EPU	0.00030	0.30933	0.09206	−0.21125	0.40604	0.11166	0.07924	−0.02629	0.12878	−0.12440	−0.08948	0.32433
	FS	0.00009	0.34501	0.02139	0.01268	0.03376	0.24396	0.07657	−0.14519	0.06038	0.09800	0.23737	0.01609
	benchmark	0.00007	0.34884	−0.02740	−0.05668	0.09368	0.12634	0.09838	0.04193	−0.07376	−0.04049	0.35740	0.13176
XGBoost	EPU	0.00008	0.20871	−0.19932	0.33731	0.07758	0.13055	0.17935	0.01734	−0.13596	0.13206	0.17812	0.07428
	FS	0.00017	0.07709	0.02690	0.06012	0.08176	0.18599	0.10322	0.19365	−0.00206	0.10965	0.08044	0.08325
	benchmark	0.00013	0.10197	0.06638	−0.09197	0.11392	−0.02127	0.18841	0.02815	0.04743	0.05153	0.36009	0.15536
LightGBM	EPU	0.00013	0.52093	−0.28400	−0.10691	−0.05099	0.03280	0.21297	0.35108	0.26582	0.11069	−0.09710	0.04472
	FS	0.00009	0.23851	0.11078	−0.19094	−0.07774	0.38792	0.18858	−0.08367	0.01354	0.14270	0.31086	−0.04054
	benchmark	0.00019	0.28078	−0.19638	0.08904	−0.02140	0.24603	0.11563	0.11030	0.10023	0.01094	−0.00878	0.27362
ELM	EPU	0.00015	0.85068	−0.45063	−1.03331	0.88850	−0.20195	0.15288	0.02432	0.50111	−0.19510	0.28671	0.17678
	FS	0.00024	0.85840	−0.48070	−0.43993	0.69456	−0.04804	0.05271	0.61163	0.24220	−0.29361	0.09076	−0.28799
	benchmark	0.00046	0.50192	0.97022	−2.13727	−0.09303	0.18236	0.58306	−0.05182	0.33496	−0.07712	1.01012	−0.22340
Ridge	EPU	0.00044	0.31996	−0.51088	−1.04051	0.53471	−0.43738	1.09395	0.61816	−0.24595	0.08245	0.88968	−0.30417
	FS	0.00021	0.36833	0.22148	−1.23202	−0.12141	0.45706	0.16284	−0.00090	0.32175	−0.12334	1.05142	−0.10523
	benchmark	0.00024	1.12015	−0.52793	−0.66782	0.80654	−0.49957	0.66597	0.14396	0.18985	−0.38535	0.05398	0.10022
Lasso	EPU	0.00015	0.42636	−0.13712	−0.33478	0.27125	0.02269	0.46839	0.14141	−0.18819	0.14551	−0.10011	0.28458
	FS	0.00012	0.19061	0.07996	0.08297	−0.28885	0.10379	0.19521	0.28374	−0.00927	−0.03780	0.30352	0.09613
	benchmark	0.00011	−0.00154	−0.01163	−0.01788	0.07698	0.27374	0.10360	0.43502	−0.03466	0.04503	0.22976	−0.09842
BayesianRidge	EPU	0.00035	0.36930	−0.44834	−0.99672	0.56062	−0.39173	0.91415	0.51287	−0.21853	0.09389	0.80477	−0.20029
	FS	0.00020	0.36739	0.20497	−1.21598	−0.10196	0.45261	0.16159	0.00181	0.30740	−0.11844	1.04449	−0.10388
	benchmark	0.00024	1.10159	−0.53272	−0.67257	0.80271	−0.47869	0.64523	0.14609	0.17340	−0.38167	0.06741	0.12922
GPR	EPU	0.00018	0.20058	−0.02924	−0.14921	0.10576	0.24487	0.13517	0.22720	0.11764	0.02342	0.00370	0.12009
	FS	0.00014	0.17316	−0.04154	0.01159	0.03737	0.02179	0.09912	0.15499	0.06019	0.11526	0.19937	0.16870
	benchmark	0.00012	0.20437	0.14195	0.06833	0.02702	−0.01306	0.03058	0.14160	−0.03239	0.28036	0.14806	0.00320
ExtraTrees	EPU	0.00028	0.38507	−0.07322	−0.54551	0.12563	0.79437	0.76500	0.24121	−0.09352	−0.11220	−0.33341	−0.15341
	FS	0.00037	0.50750	0.05422	−0.30696	0.07225	0.51447	0.37562	0.41007	−0.38550	−0.63780	0.17476	0.22137
	benchmark	0.00055	−0.14657	−0.08579	−0.09170	0.07713	0.25331	0.29935	0.41863	−0.09010	−0.22508	0.03610	0.55473
CatBoost	EPU	0.00018	0.25793	−0.02951	0.24674	−0.01805	0.21742	0.33454	−0.11419	0.01996	0.18187	−0.04991	−0.04680
	FS	0.00011	0.20220	0.06684	0.06204	0.12385	0.13239	0.11545	0.08763	−0.02404	0.07031	0.13286	0.03045
	benchmark	0.00022	0.10871	−0.02374	0.01968	−0.00157	0.10104	0.14151	0.16568	0.21278	0.11276	0.09551	0.06764
Panel C: 12-period
SVR	EPU	0.00015	1.07893	−0.15785	−0.74268	0.14014	−0.05322	1.33759	−0.32347	−0.11502	−0.24022	−0.33561	0.41141
	FS	0.00002	0.36578	−0.03047	−0.10271	0.23590	−0.01078	0.27513	0.19334	−0.03505	0.21354	−0.03301	−0.07166
	benchmark	0.00005	0.67290	−0.10758	−0.86626	0.03519	−0.11012	0.93399	0.15255	0.08331	0.04398	0.05519	0.10685
RF	EPU	0.00026	1.10391	−0.54476	−0.83912	0.22597	0.29843	0.79421	0.12465	0.07103	0.19438	−0.11917	−0.30953
	FS	0.00003	0.60020	0.01519	−0.23677	0.05502	−0.12668	0.50090	−0.04169	−0.06383	0.03378	0.17978	0.08410
	benchmark	0.00010	0.65895	−0.17889	−0.45619	0.16426	0.21952	0.56648	0.03539	0.17686	−0.19376	−0.41933	0.42671
AdaBoost	EPU	0.00025	0.23153	0.24365	0.16790	0.06022	0.13868	0.04306	0.13888	0.09382	−0.10279	−0.41720	0.40226
	FS	0.00002	0.33184	0.16750	0.01304	0.07695	0.06991	0.09412	−0.08266	−0.03630	0.10261	0.24994	0.01305
	benchmark	0.00007	0.07439	−0.06540	−0.08690	0.02882	0.29478	0.02072	0.17892	0.16603	−0.01939	0.23663	0.17140
XGBoost	EPU	0.00003	0.31535	−0.36687	0.42431	0.02576	0.32593	0.05786	−0.10288	−0.01736	0.02337	0.15142	0.16314
	FS	0.00003	0.24896	0.09892	0.03764	0.05058	0.15547	0.21714	0.09696	−0.10907	0.14026	0.01500	0.04814
	benchmark	0.00006	−0.01548	0.01597	0.05292	0.24036	0.00306	0.22913	0.03636	−0.00045	0.02384	0.38626	0.02804
LightGBM	EPU	0.00030	0.16791	0.04232	−0.09153	0.27398	−0.12877	0.33294	−0.00647	0.59738	−0.02697	−0.17019	0.00938
	FS	0.00004	0.42130	−0.04176	0.18344	0.08980	0.22825	0.19406	0.17116	0.02864	−0.00656	−0.17658	−0.09174
	benchmark	0.00022	−0.23836	0.00937	0.05375	0.07143	0.11254	0.20598	0.48652	0.07919	0.04731	0.05236	0.11990
ELM	EPU	0.00005	0.44781	0.21384	−1.89934	0.69080	−0.30166	0.30673	−0.14031	0.77309	−0.13845	0.61885	0.42863
	FS	0.00009	1.15525	−0.76232	−0.27871	0.14737	−0.33575	0.95047	0.23319	0.21719	−0.22330	−0.03189	−0.07151
	benchmark	0.00017	1.11807	−0.84319	−0.53064	0.81276	0.37661	0.22913	0.23562	0.08128	−0.10576	−0.16470	−0.20918
Ridge	EPU	0.00009	0.79378	−0.18255	−0.39905	−0.45936	−0.44801	1.58431	0.85841	−0.01338	−0.15721	−0.37338	−0.20357
	FS	0.00003	0.76425	−0.39251	−0.13180	0.30589	0.01584	0.56673	0.31001	0.04704	0.02997	−0.28056	−0.23485
	benchmark	0.00016	0.36918	−0.02007	−0.43981	−0.18599	−0.29144	1.10026	−0.18304	−0.70548	0.21521	0.35421	0.78697
Lasso	EPU	0.00010	0.19686	0.05829	−0.20890	0.21041	0.28611	0.57887	−0.35966	−0.13423	0.35903	−0.06409	0.07731
	FS	0.00004	0.16978	0.20374	0.00059	−0.12682	0.18091	0.04342	0.23115	−0.00054	−0.00769	0.31753	−0.01207
	benchmark	0.00005	0.03476	0.00741	−0.02400	0.00266	0.17368	0.10160	0.24478	0.04156	0.20032	0.19661	0.02063
BayesianRidge	EPU	0.00008	0.78189	−0.18752	−0.45086	−0.35161	−0.46643	1.54276	0.86163	−0.01687	−0.14073	−0.35620	−0.21606
	FS	0.00003	0.75621	−0.39462	−0.12320	0.30512	0.01480	0.57299	0.30978	0.03934	0.02667	−0.27462	−0.23248
	benchmark	0.00015	0.36465	−0.04006	−0.41711	−0.23069	−0.29323	1.12178	−0.19560	−0.67951	0.21249	0.39052	0.76677
GPR	EPU	0.00005	0.23642	0.02331	−0.04241	0.02271	0.17922	0.15987	0.14169	0.07377	0.12230	0.02007	0.06305
	FS	0.00007	0.14185	0.08614	0.06461	0.10888	0.02862	0.10325	0.10239	0.04955	0.07988	0.12159	0.11324
	benchmark	0.00003	0.24066	0.13054	0.03812	0.00940	0.01270	0.04530	0.11781	−0.01820	0.24757	0.10950	0.06659
ExtraTrees	EPU	0.00012	0.46503	−0.18211	−0.76954	0.16431	0.98178	0.69396	0.23694	−0.03763	−0.32497	−0.41613	0.18836
	FS	0.00013	0.07175	0.09137	0.01306	0.13329	0.17174	0.12143	0.17330	0.07176	−0.08562	0.05001	0.18789
	benchmark	0.00023	−0.00920	0.11054	0.05100	0.16196	0.14331	0.19755	0.05617	0.13471	−0.16692	0.01778	0.30312
CatBoost	EPU	0.00004	0.30896	0.01246	0.34106	0.11663	−0.25494	0.11702	−0.12989	0.03682	0.33798	0.27575	−0.16186
	FS	0.00002	0.23895	0.06989	−0.01057	0.13123	0.10917	0.03699	0.01741	−0.09314	0.19690	0.17692	0.12626
	benchmark	0.00006	0.08252	−0.04916	0.14921	−0.11889	0.27326	0.16610	0.01618	0.03279	0.07811	0.18620	0.18366

Table 4. Prediction performance of robustness check.

Model	Feature	MSE	RMSE	MAE	MAPE	Real_Port_Var
SVR	EPU	0.004391	0.065658	0.048661	2.436572	0.000409
	FS	0.003981	0.062478	0.047476	2.456640	0.000636
	benchmark	0.004409	0.065711	0.049611	2.972219	0.000475
RF	EPU	0.004239	0.064479	0.048570	2.463773	0.000743
	FS	0.003976	0.062371	0.046648	2.236589	0.001118
	benchmark	0.004043	0.062932	0.046955	1.993569	0.000498
AdaBoost	EPU	0.004733	0.068458	0.051747	3.078430	0.000262
	FS	0.004300	0.065001	0.049564	3.170169	0.000227
	benchmark	0.004278	0.064878	0.048485	2.400641	0.000287
XGBoost	EPU	0.007592	0.086359	0.065655	5.539737	0.000280
	FS	0.006878	0.082352	0.061373	4.678319	0.000329
	benchmark	0.005428	0.073351	0.055734	3.521434	0.000308
LightGBM	EPU	0.004737	0.068349	0.051743	3.420724	0.000204
	FS	0.004983	0.070094	0.053870	3.835933	0.000229
	benchmark	0.004777	0.068555	0.051716	3.465748	0.000258
ELM	EPU	0.016106	0.126449	0.100468	9.933915	0.000660
	FS	0.015848	0.124968	0.097776	10.073721	0.000875
	benchmark	0.015781	0.124965	0.097892	9.592483	0.000774
Ridge	EPU	0.011503	0.105494	0.081596	7.259794	0.001165
	FS	0.007400	0.085249	0.067620	5.583496	0.000675
	benchmark	0.007999	0.089166	0.070147	6.921657	0.000592
Lasso	EPU	0.004291	0.065292	0.048576	2.833624	0.000210
	FS	0.004916	0.069827	0.053490	3.290862	0.000534
	benchmark	0.004410	0.065870	0.049268	2.673334	0.000233
BayesianRidge	EPU	0.012176	0.108369	0.083920	7.520761	0.001297
	FS	0.007467	0.085633	0.067935	5.639780	0.000678
	benchmark	0.008103	0.089747	0.070625	6.993569	0.000571
GPR	EPU	0.003738	0.060491	0.044766	1.295250	0.000168
	FS	0.003738	0.060526	0.045098	1.568015	0.000253
	benchmark	0.003762	0.060679	0.045044	1.523631	0.000162
ExtraTrees	EPU	0.004304	0.065257	0.048929	2.478588	0.000584
	FS	0.003771	0.060776	0.044964	1.639218	0.000580
	benchmark	0.003799	0.061069	0.045256	1.741564	0.000607
CatBoost	EPU	0.004513	0.066773	0.050029	2.805877	0.000281
	FS	0.004246	0.064562	0.048943	2.995353	0.000190
	benchmark	0.004410	0.065816	0.049101	2.786700	0.000177

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Du, J.; Cao, W.; Wang, Z. Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach. Mathematics 2026, 14, 938. https://doi.org/10.3390/math14060938

AMA Style

Du J, Cao W, Wang Z. Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach. Mathematics. 2026; 14(6):938. https://doi.org/10.3390/math14060938

Chicago/Turabian Style

Du, Jinda, Wenyi Cao, and Ziyou Wang. 2026. "Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach" Mathematics 14, no. 6: 938. https://doi.org/10.3390/math14060938

APA Style

Du, J., Cao, W., & Wang, Z. (2026). Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach. Mathematics, 14(6), 938. https://doi.org/10.3390/math14060938

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Forecasting Risk Matrices with Economic Policy Uncertainty and Financial Stress: A Machine Learning Approach

Abstract

1. Introduction

2. Literature Review

3. Data and Methodology

3.1. Data and Preprocessing

3.2. Methods

3.2.1. Support Vector Regression (SVR)

3.2.2. Random Forest Regressor (RF)

3.2.3. AdaBoost Regressor

3.2.4. XGBoost Regressor

3.2.5. LightGBM Regressor

3.2.6. CatBoost Regressor

3.2.7. Extreme Learning Machine (ELM)

3.2.8. Ridge Regression

3.2.9. Lasso Regression

3.2.10. Bayesian Ridge Regression

3.2.11. Extra Trees Regressor

3.2.12. Gaussian Process Regression (GPR)

4. Empirical Results

4.1. Implementation Details

4.2. Forecast Evaluation

4.3. Portfolio Analysis

4.4. Robustness Check

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI