2.3.1. Factor Model Specification
The model builds on the classical CAPM, as introduced in the risk management literature (
Hull 2015), by adopting a multi-factor linear regression specification, in line with the risk allocation framework developed by
Meucci (
2009).
It expresses portfolio returns as a linear function of underlying risk factors that represent key drivers of systematic market risk.
Formally, we model the portfolio return
at time t as a linear function of k risk factors:
where
- -
is portfolio return at time ;
- -
is the vector of risk factor returns;
- -
is the vector of portfolio exposures (sensitivities) to each risk factor;
- -
is the idiosyncratic return component at time , assumed to be normally distributed with mean zero.
The coefficients are estimated using ordinary least squares (OLS) based on historical data.
To construct the risk factor model, we begin by selecting a set of market-based factors with sound economic rationale. The initial candidate set includes: the S&P 500 index (SPY) as a broad equity market factor; TLT for interest rate duration risk; XLK for technology sector exposure; MTUM to capture recent momentum; VIX as a proxy for implied market volatility; VLUE to reflect value-style equity exposure; and IWF for high-growth equity characteristics. Given the portfolio’s concentration in technology-sector equities, the inclusion of factors such as XLK and IWF may embed sector-specific sensitivities into the model, potentially influencing the estimation of systematic risk exposures.
To evaluate the adequacy of the regression model and ensure the quality of estimated factor sensitivities, we conduct a series of diagnostic checks. First, we assess the overall model fit using the R-squared statistic. A threshold of 80% is used to indicate an acceptable level of explanatory power. This threshold is consistent with both academic research and industry practice.
Fama and French (
2015) report that the average adjusted R-squared values from five-factor regressions reach 0.91 for 25 portfolios sorted on size and book-to-market ratio, demonstrating strong explanatory power across asset pricing tests. The MSCI Barra Global Equity Model is reported to achieve R-squared values above 0.8 when calibrated to historical data (
MSCI 2021). Similarly, Morningstar considers an R-squared value below 80% to indicate that the benchmark lacks relevance (
Morningstar 2021). These sources collectively support our use of the 80% threshold in assessing model adequacy.
Second, we evaluate the statistical significance of each factor using both
p-values and t-statistics from the OLS estimation. Factors with
p-values above a significance threshold (typically 0.05) are considered statistically insignificant and excluded from further simulation. While non-significant factors may sometimes be retained for theoretical reasons, we adopt a conservative approach and remove them entirely. Since the validity of
p-values in OLS relies on the normality of residuals, we additionally examine the distribution of regression errors using a Q–Q plot. The residuals align closely with the 45-degree reference line in the central range, with mild tail deviations, indicating approximate normality. This supports the appropriateness of using t-statistics for factor significance testing (see
Appendix B Figure A2).
Third, we evaluate the classical assumptions of linear regression, including linearity, homoscedasticity, independence of residuals, and the absence of multicollinearity. To assess multicollinearity, we employ the Variance Inflation Factor (VIF), a standard diagnostic statistic widely used in both academic and applied regression modeling. In this study, we adopt a VIF threshold of 10 to identify potentially problematic collinearity. This choice aligns with established conventions in the literature, where VIF values exceeding 10 are often interpreted as indicative of serious multicollinearity that could undermine coefficient stability (
Kutner et al. 2005;
Menard 2001;
Chatterjee and Hadi 2015).
Based on this criterion, all variables in our model are within acceptable bounds. Although XLK exhibits a VIF marginally above 10, we retain this factor due to its strong sectoral relevance and distinct contribution to explanatory power. This approach is consistent with empirical modeling practices, where economic interpretability may justify the retention of variables exhibiting modest multicollinearity (
O’Brien 2007). In contrast, SPY and IWF exhibit extremely high VIF values (54.17 and 49.71, respectively), far exceeding conventional thresholds. These factors are therefore excluded from the final model to avoid severe multicollinearity.
To further validate the robustness of our factor selection, we complement the OLS and VIF-based approach with a LASSO regression. LASSO, or Least Absolute Shrinkage and Selection Operator, is a penalized regression technique that encourages sparsity by shrinking weaker coefficients toward zero (
James et al. 2013). This method is particularly suited for handling multicollinearity among predictors. LASSO regression confirms the selection of XLK, MTUM, and VLUE, while effectively excluding TLT, ^VIX, SPY, and IWF. This supports the robustness of our factor screening procedure (see
Appendix B Figure A3).
Each line represents the estimated coefficient of a factor across a sequence of regularization parameters lambda in log scale. Dominant variables (e.g., XLK, IWF) emerge early and maintain strong coefficients, while redundant or weak predictors (e.g., TLT, ^VIX) are shrunk toward zero. This cross-method consistency strengthens the credibility of the selected factor set and mitigates concerns about potential p-value-driven selection bias.
The final factor set includes XLK, MTUM, and VLUE, all statistically significant at the 5% level (
p < 0.05), as shown in
Table 1. These factors are selected based on a combination of statistical significance, acceptable VIF values, and conceptual relevance to equity portfolio risk.
This specification is broadly consistent with the equity risk factor requirements outlined in the Basel framework (
Basel Committee on Banking Supervision 2019, MAR31.9), which recommend using market-wide indices, sectoral indices, and volatility-based factors when modeling equity exposures.
2.3.2. Monte Carlo Simulation of Factor Returns
To estimate Value-at-Risk (VaR) using the factor-based approach, we simulate portfolio returns by first generating random realizations of the underlying risk factors. We assume that factor returns follow a multivariate normal distribution, with parameters estimated from historical data:
where:
- -
is the simulated vector of factor returns;
- -
is the vector of historical factor return means;
- -
is the covariance matrix of factor returns.
This setup implies a Gaussian copula structure, where dependencies among risk factors are fully captured through the covariance matrix
. While this is a common assumption in risk modeling, it may underestimate joint extreme events. Future work could consider alternative copulas, such as t-copulas, which are more suitable than Gaussian copulas for capturing joint tail dependence in extreme market conditions (
Embrechts et al. 2002).
In each simulation iteration
i, a random vector
is drawn from this distribution. The simulated portfolio return
is then computed as follows:
where
is the vector of estimated factor sensitivities from the regression model in
Section 2.3.1.
We repeat this process N times (e.g., N = 10,000) to generate a distribution of simulated portfolio returns:
The 99% VaR is then calculated as the 1st percentile of the simulated return distribution.
All factor simulations are based on parameter estimates derived from the in-sample window (January 2023 to April 2024), ensuring that the resulting VaR reflects forward-looking risk estimates for the out-of-sample period.
Compared to return-based simulation, the factor-based Monte Carlo approach offers enhanced interpretability by attributing risk to specific systematic drivers, such as market-, sectoral-, or volatility-linked factors. It also incorporates the historical covariance among these factors, allowing more realistic co-movement patterns that are often neglected in return-based models.
Moreover, this structure is naturally compatible with scenario-based stress testing. By adjusting one or more factor inputs, practitioners can transparently assess the portfolio’s sensitivity to hypothetical shocks—such as a volatility spike or a sector-specific decline. This flexibility aligns with regulatory expectations under Basel III and FRTB, which emphasize risk attribution and scenario analysis.
However, the method carries limitations. It assumes a linear factor structure and relies on multivariate normal distributions, which may understate the likelihood of extreme events and thus tail risk. Such limitations could result in insufficient capital allocation during stress periods (
McNeil et al. 2015). Additionally, effective factor selection requires both sound economic rationale and expert judgment. Omitting key exposures may reduce the simulation’s robustness in dynamic markets.