1. Introduction
Blockchain technology, originally introduced with Bitcoin [
1], has rapidly evolved, finding diverse applications across various sectors such as finance, artificial intelligence, supply chain management, healthcare, and governance [
2,
3,
4,
5]. Despite significant technological advances and its transformative impact on decentralization across multiple domains [
6], cryptocurrency investment and speculation continue to dominate its primary use case, with Bitcoin remaining at the forefront of this movement.
Initially driven primarily by speculative retail investors, Bitcoin’s price behavior was characterized by high sentiment sensitivity, low liquidity, and notable market inefficiencies, resulting in substantial volatility and frequent price bubbles [
7]. However, over recent years, increasing institutional adoption, regulatory developments, and deeper integration within traditional financial systems have significantly reshaped its market structure. The entry of major financial institutions such as MicroStrategy, Goldman Sachs, and BlackRock, together with regulatory milestones including the Markets in Crypto-Assets Regulation (MiCA) implemented in the European Union and the SEC’s approval of Bitcoin spot ETFs in 2024, has solidified Bitcoin’s status as a recognized asset class. Additionally, sovereign adoption—highlighted notably by El Salvador’s decision to adopt Bitcoin as legal tender in 2021—underscores its growing significance within the global financial ecosystem. Collectively, these developments represent a shift in Bitcoin’s fundamental price dynamics, suggesting that macroeconomic conditions now play an increasingly influential role as the market matures toward efficiency [
8].
Despite this evolution, the academic debate regarding the primary drivers of Bitcoin’s price remains ongoing and inconclusive. Some studies contend that Bitcoin has increasingly integrated with traditional financial markets, exhibiting stronger correlations with conventional indicators such as stock indices and exchange rates, particularly in the post-COVID-19 environment [
9,
10]. In contrast, other research emphasizes Bitcoin’s continuing autonomy as an asset, arguing that internal ecosystem dynamics, including trading volume, miner activities, and speculative sentiment, remain central to its price formation [
9,
10,
11]. Additionally, historical event-driven analyses highlight that market manipulation, regulatory developments, and halving events have consistently contributed to Bitcoin’s volatility and market dynamics [
12,
13]. These divergent findings underscore the complexity of Bitcoin’s price behavior, suggesting that its determinants are not static but rather dynamic and may vary significantly across different market regimes.
A two-state Hidden Markov Model (HMM) framework is particularly suited for investigating the evolving determinants of Bitcoin’s price, as it accommodates latent regime shifts characteristic of financial time series. This framework enables the dynamic assessment of Bitcoin’s sensitivity to macroeconomic and internal factors across varying market conditions. Formally, the observed price series
is modeled as conditionally Gaussian, dependent on an unobserved state sequence
following a Markov process:
where
denotes the hidden market state (e.g., bullish (
) or bearish (
)) and
represents macroeconomic, financial, and Bitcoin-specific covariates. The impact of these covariates on Bitcoin’s price is governed by a state-dependent coefficient vector
, enabling distinct predictors to influence price behavior differently across market regimes.
In a homogeneous HMM, the transitions between states occur with fixed probabilities:
where
is the time-invariant probability of transitioning from state
i to state
j. This structure assumes that market regime shifts follow an internal process independent of external influences.
Conversely, the non-homogeneous HMM (NH-HMM) allows for external variables to dynamically influence state transitions, modeling these probabilities as a function of covariates:
where
is a set of covariates influencing the probability of state transitions and
are logistic regression coefficients.
By capturing these hidden regime shifts, the HMM framework offers a structured approach to analyzing Bitcoin’s evolving price dynamics. As institutional involvement increases and market conditions continue to shift, a crucial research question arises: Has Bitcoin become increasingly intertwined with macroeconomic factors as its market evolves and matures?
To investigate this question, the study employs two complementary approaches: a Bayesian Markov Chain Monte Carlo (MCMC) covariate selection method to identify relevant long-term determinants, and a bootstrapped rolling-window forecasting technique to capture short-term dynamics.
Several studies have already explored Bitcoin price dynamics using Hidden Markov Models (HMMs) and related regime-switching frameworks. Koki et al. [
14,
15] applied a NH-HMM with Pólya–Gamma augmentation and found that macroeconomic variables were insufficient for reliable out-of-sample prediction, suggesting Bitcoin’s relative autonomy from broader economic trends. In contrast, Chen [
16] combined vector error correction (VEC) and Markov-switching models, concluding that financial expectations and exchange rates had a stronger influence on Bitcoin price movements than blockchain-specific variables, although the impact varied across “bull” and “bear” market regimes. Katsiampa [
17] modeled Bitcoin volatility through Markov-switching GARCH models, revealing distinct regime changes in volatility behavior. Tiwari et al. [
18] employed a two-state Markov-switching framework to analyze Bitcoin’s speculative bubbles, identifying alternating phases of exuberance and correction. Kodama et al. [
19] applied HMMs to detect trend reversals in Bitcoin markets, while Alam and Dutta [
20] investigated cryptocurrency interconnectedness across different market regimes. Similarly, Bouri et al. [
21] examined regime-dependent spillovers between Bitcoin and traditional financial assets. While these studies highlight the utility of regime-switching models for capturing Bitcoin’s nonlinear dynamics, few explicitly incorporate a systematic covariate selection process to assess macroeconomic influences within an HMM framework.
Our study addresses this gap by integrating Bayesian MCMC covariate selection into both homogeneous and non-homogeneous HMMs to trace how Bitcoin’s price drivers evolve across hidden market states. Although previous research has examined Bitcoin’s price determinants, there is still no consensus on whether these factors remain stable or shift across different market regimes. By embedding Bayesian MCMC within a regime-switching structure, this research offers a new perspective on the dynamic interplay between macroeconomic and Bitcoin-specific variables under varying market conditions. The findings provide valuable insights for investors, regulators, and policymakers, enhancing their understanding of Bitcoin’s evolving role within the broader financial system and informing strategic and regulatory decision-making.
The key contributions of this study include:
Identification of Regime-Dependent Determinants: The analysis reveals how the influence of macroeconomic, financial, and Bitcoin-specific variables changes across market regimes, enriching the understanding of Bitcoin’s evolving price behavior.
Enhanced Bayesian MCMC Approach: The study advances existing Bayesian MCMC covariate selection techniques [
14], particularly for regime-switching contexts, to more accurately capture long-term effects.
Improved Predictive Performance: By employing a bootstrapped rolling-window forecasting approach across different subsamples, the study demonstrates variations in Bitcoin’s price predictability under diverse market conditions, thus enhancing forecasting accuracy and reliability.
The paper is structured as follows:
Section 2 introduces the Bayesian MCMC covariate selection algorithm, outlines the bootstrapped rolling-window forecasting methodology, and provides an overview of the empirical data used in the study.
Section 3 presents and discusses the empirical findings across different subsamples.
Section 4 highlights the methodological enhancements and contributions of this research and suggests avenues for future research.
Section 5 concludes the study by summarizing the key findings and implications.
2. Materials and Methods
This section outlines the methodological approach used to analyze long-term and short-term determinants of Bitcoin’s price, illustrated visually in
Figure 1.
The analysis comprises three main stages:
Data processing and specification of prior parameters;
Identification of optimal long-term covariates using the Bayesian MCMC covariate selection algorithm;
Evaluation of these selected covariates against benchmark sets through a bootstrapped rolling-window forecasting procedure, considering steps ahead.
Details of the Bayesian MCMC covariate selection method and its modifications are provided in
Section 2.1. The bootstrapped rolling-window forecasting approach is described in
Section 2.2. Finally,
Section 2.3 presents an overview of the dataset used for empirical analysis.
2.1. The Bayesian MCMC Covariate Selection Algorithm
The optimal subset
of covariates for the HMMs is determined using a MCMC algorithm inspired by Koki et al. [
14], implemented over 100,000 iterations. This procedure systematically explores the model space to identify the most relevant covariates by repeatedly proposing subsets, simulating hidden states, sampling emission coefficients (and transition coefficients in the non-homogeneous case), calculating and comparing likelihoods, and updating covariate sets accordingly. The procedure begins by fitting a full-covariate HMM using the Expectation–Maximization (EM) algorithm to establish initial parameter priors. The steps are as follows:
Simulating the Latent State Sequence: The latent states are simulated using the
Forward–Backward algorithm. Forward probabilities are calculated as
These probabilities inform the computation of backward probabilities from which the hidden state sequence is subsequently sampled.
Sampling Emission Coefficients: Emission coefficients
are sampled through a
Gibbs sampling approach from a
Normal–Inverse Gamma distribution:
Here, and represent posterior mean and covariance matrices, while denotes the state-specific variance drawn from an Inverse–Gamma distribution.
Sampling Transition Coefficients (NH-HMM): In the NH-HMM case, transition coefficients
are sampled using
Pólya–Gamma augmentation. This method simplifies coefficient sampling by introducing auxiliary Pólya–Gamma distributed latent variables
, enabling efficient Gibbs sampling of transition parameters:
Here, and indicate posterior means and covariance matrices.
Double Reversible Jump Algorithm: The algorithm employs a
Double Reversible Jump procedure that iteratively proposes adding or removing covariates. Each proposal is evaluated by calculating an acceptance ratio
, which balances model complexity and goodness of fit:
where
is a penalty term proportional to the model’s complexity. This penalty discourages overly complex models unless they significantly enhance the model’s explanatory power. Through successive iterations, the algorithm efficiently identifies the optimal covariate subset.
Metropolis–Hastings Acceptance Criterion: The acceptance of the proposed covariate set update follows the Metropolis–Hastings criterion, accepting models with automatically, or probabilistically accepting updates with probability if a random draw is less than . This approach ensures thorough exploration of model space, promoting convergence to a well-fitting, parsimonious model.
The final set of covariates selected for the emission models is determined based on posterior inclusion probabilities derived from the Bayesian MCMC algorithm. Covariates with high posterior inclusion probabilities (indicating improvement in model fit) are retained, while those with probabilities below a threshold are excluded. Specifically, covariates are selected based on their posterior inclusion probability, with a threshold adapted for emission models () and a lower threshold for transition models (due to the complexity and subtlety of non-linear effects, set at ).
The choice of 0.5 for the emission models follows standard practice in Bayesian variable selection (e.g., [
22]), where inclusion is favored once the posterior probability exceeds the point of equal odds between inclusion and exclusion. In contrast, transition models within NH-HMMs often exhibit weaker and less stable covariate effects due to the inherent challenges of estimating time-varying transition probabilities, as discussed in
Section 4. To account for these identifiability issues and prevent the transition models from becoming overly sparse, thereby excluding potentially relevant predictors, we apply a more permissive threshold of 0.3 for covariate inclusion in the transition equations.
The
Double Reversible Jump algorithm manages the exploration of a lengthy, correlated candidate covariate list. However, this complexity can cause the algorithm to become stuck in regions of high likelihood early on, making it reluctant to exclude covariates despite their limited contribution. To mitigate this issue, the procedure initializes with a reduced set (contrary to Koki et al. [
14]) of three essential covariates, enhancing the algorithm’s flexibility in exploring the covariate space. Remaining covariates stay candidates for inclusion or exclusion from the second iteration onward, thus gradually refining the covariate set without overwhelming the algorithm.
Because HMM likelihoods naturally tend to improve when introducing additional covariates—regardless of whether these covariates meaningfully contribute to predictive performance—a penalty-based approach is employed. Specifically, the Double Reversible Jump algorithm introduces a complexity penalty into the acceptance ratio calculation whenever a proposed model includes more covariates than the current model. A penalty of was selected to sufficiently discourage overfitting by excessively large covariate sets, balancing model parsimony and exploratory power. This penalty reduces the Metropolis–Hastings criterion acceptance probability for model updates that significantly increase complexity without proportionate improvement in model fit. The choice of this value is motivated by the need to avoid spurious inclusion of redundant predictors, as observed in correlated variable groups, where joint inclusion would otherwise be frequently accepted due to minor cumulative likelihood gains.
To facilitate gradual model refinement and avoid excessively restrictive moves that could trap the chain in local modes, a smaller penalty of 0.05 is imposed for incremental changes (e.g., the addition or removal of a single covariate). This softer penalty for small steps encourages local exploration while maintaining overall control over model complexity. Both penalty values are calibrated empirically, following the observation that higher penalties overly constrained the model space. Compared to previous implementations of [
14] without penalization, this refinement substantially reduces the risk of multicollinearity-driven overfitting.
Given that initial iterations may be biased due to insufficient exploration of the parameter space, a burn-in period is implemented, discarding the first
of iterations. This percentage ensures the exclusion of early iterations heavily influenced by initial priors, enhancing the stability and reliability of posterior inference. Convergence of the MCMC algorithm is evaluated using diagnostic tools provided by the
coda 0.19–4.1 package in
R 4.4.3, including trace plots, the Gelman–Rubin statistic (for comparing variance between chains), and the Effective Sample Size (ESS) to quantify the number of effectively independent samples in the chain. As substantiated later by the typical convergence patterns shown in
Section 3.1, the
burn-in proportion is sufficient for the majority of the MCMC chains, with 78 out of 96 achieving convergence within this window. Conversely, a longer burn-in period does not resolve convergence issues in the remaining cases, where the Gelman–Rubin statistic remains consistently high across all iterations.
In non-homogeneous Hidden Markov Models (NH-HMMs), sampling transition coefficients
poses computational challenges due to the nonlinear nature of logistic regression. To address these challenges, a Pólya–Gamma augmentation approach (Polson et al. [
23]) is implemented, introducing latent variables
from a Pólya–Gamma distribution. This transformation simplifies the logistic likelihood into a Gaussian form, significantly improving computational efficiency, mixing properties, and convergence speed within the MCMC sampling process.
2.2. The Forecasting Procedure
As depicted in
Figure 1, the
h-step-ahead forecasting methodology employed in this study integrates a rolling-window approach, initially proposed by Box and Jenkins [
24], with bootstrapping techniques introduced by Efron [
25]. This combined procedure involves iteratively moving forward a fixed-length window of the most recent observations. Within each window, a HMM is fitted to classify market regimes, estimate model parameters, and determine state transition probabilities. The estimated parameters are then utilized for generating forecasts
h-steps ahead, specifically the following:
For : Given the most recently observed hidden state, the next-day forecast is obtained as the mean of 1000 bootstrap simulations generated using the respective state-specific parameters.
For : At each forecast horizon beyond one step, the next possible hidden state is simulated using the estimated transition probability matrix. This simulation determines which state-specific parameters to apply for subsequent forecasting.
NH-HMM case: Transition probabilities are dynamic and depend on external covariates. A logistic function employing covariate data determines the transition probability matrix for each forecast step, thus simulating the next possible hidden state.
At each step, the estimation window is shifted forward by one observation, repeating the forecasting procedure iteratively.
For each bootstrap iteration, the forecast is initiated from the most recent hidden state estimated from the model’s posterior distribution. Subsequent state transitions are simulated based on the model’s transition probabilities. Forecasted values for each h-step-ahead period are derived using the emission parameters and , conditioned on the simulated hidden states. This bootstrapping approach generates a distribution of future values, allowing computation of mean forecasts and associated confidence intervals. In the NH-HMM scenario, the recalculation of transition probabilities at each step based on time-varying covariates enhances the model’s responsiveness to external influences on market dynamics.
Forecast accuracy and robustness are evaluated using multiple performance metrics, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and the coverage probability of the 95% forecast confidence intervals relative to the observed actual values.
2.3. Data Considered
The dataset employed in this study consists of daily closing Bitcoin prices spanning from 1 June 2016 to 1 December 2024. To better understand temporal variations in Bitcoin’s behavior, the analysis is conducted on two distinct subsamples: the early period (1 June 2016 to 31 August 2019) and the recent period (1 September 2019 to 1 December 2024).
The Bayesian MCMC analysis incorporates 16 covariates representing financial, macroeconomic, and Bitcoin-specific factors. Data sources include
Yahoo Finance (accessed via the
tidyquant package in
R) and the
Blockchain.com API. Given sensitivity to varying data magnitudes, all covariates are log-transformed and scaled. Variables unavailable on weekends or those reported with minor lags are aligned to daily frequency using linear interpolation. Additionally, all covariates are lagged by seven days, reflecting a realistic time adjustment period.
Table 1 summarizes these adjusted covariates classified into macroeconomic and Bitcoin-specific categories.
The selection of macroeconomic covariates is directly motivated by the central research question of this study: Has Bitcoin become increasingly intertwined with macroeconomic factors as its market evolves and matures?
Figure 2 visualizes transformed macroeconomic variables alongside Bitcoin’s price. Stock market indices (S&P500, Dow Jones, Nasdaq) serve as proxies for investor confidence and broader financial market performance, where positive equity returns may signal economic optimism and greater liquidity, potentially supporting speculative investment across asset classes, including cryptocurrencies. Conversely, the VIX index, a measure of implied stock market volatility, captures risk-off environments where uncertainty may influence portfolio reallocations toward or away from cryptocurrencies. Exchange rates (EUR/USD, GBP/USD, JPY/USD, CNY/USD) reflect international capital flows and monetary policy differentials, offering insight into global liquidity conditions and the strength of reserve currencies relative to Bitcoin. Their inclusion acknowledges Bitcoin’s emerging role as both a speculative vehicle and a perceived hedge against fiat currency depreciation in certain regions. Similarly, gold and crude oil futures are included to assess Bitcoin’s positioning relative to traditional inflation hedges and commodities, contributing to the broader discussion on whether Bitcoin functions as “digital gold” or aligns more closely with high-risk speculative assets.
The scaled time-series dynamics reveal a pronounced decline in gold prices and inflation expectations following the implementation of COVID-19 regulations in 2020, accompanied by a rise in the VIX uncertainty index, during which Bitcoin’s price remained relatively stable. Exchange rates, particularly EUR/USD and GBP/USD, generally exhibit directional alignment with Bitcoin, though they experienced significant declines in 2022 without recovering to their earlier peak levels observed in 2018. Financial market indices, notably the Dow Jones Industrial Index (DJI), exhibit stronger co-movement with Bitcoin’s price post 2019, although the magnitude of these movements remains subdued compared to earlier periods.
Figure 3 illustrates Bitcoin-specific variables including trading volume, average block size, total hash rate, miner revenue, and the cyclical variable of days until halving events (set for 9 July 2016, 11 May 2020, 19 April 2024, and 26 March 2028). Bitcoin’s price closely tracks trading volume and miner revenue, while days until halving events display a clear cyclical pattern, confirming prior findings by M’bakob [
13].
4. Discussion and Methodological Comparisons
The empirical findings from the Bayesian MCMC covariate selection in homogeneous HMMs indicate that both macroeconomic and Bitcoin-specific factors influence Bitcoin’s price over the examined periods, underscoring the asset’s evolving market structure. By contrast, results from the non-homogeneous HMM highlight a shift from primarily Bitcoin-specific determinants toward macroeconomic drivers, particularly in the later subsample. These observations are further supported by the forecasting procedure, aligning with [
16,
27,
36], where the highest accuracy for the early sample derives from a Bitcoin-focused Set 2, while a mixed Set 1 dominates for the later sample—yet Set 2 still outperforms in the full sample.
Rolling coefficient estimates provide deeper insight into shifting price drivers. In both the early and full-sample analyses,
Days until halving exhibits a pronounced effect, consistent with [
13], which links halving events to significant supply reductions and subsequent price surges. Conversely, in the recent subsample, the
CNY/USD exchange rate and the
Dow Jones index assume top coefficient ranges, highlighting Bitcoin’s heightened sensitivity to global financial markets, consistent with [
31,
37].
Additionally, dynamic MAPE values reveal that large forecasting errors coincide with influential external shocks or fraudulent activities [
27]. For instance, substantial error spikes appear in June 2023 alongside BlackRock’s ETF filing and in April 2023, coinciding with the EU’s MiCA adoption. Likewise, Tether issuance (2017–2018) and the
PlusToken Ponzi scheme (2019) align with error peaks observed in earlier years [
12].
Building on [
14], this study introduces several modifications designed to enhance covariate selection and forecasting performance:
Penalized Acceptance Ratios: A penalty term is added to discourage unnecessary model complexity.
Reduced Initial Covariate List: Initialization with a minimal set of covariates enhances search flexibility.
Lower Posterior Transition Benchmark: A reduced threshold is used to accommodate non-linear transition effects.
Bootstrapped Rolling-Window Forecasting: Short-term predictions benefit from iterative re-estimation on local windows.
Further innovations include extending the approach to homogeneous HMMs, enlarging the macroeconomic and Bitcoin-specific variable pool, explicitly reporting MCMC diagnostics, and incorporating dynamic, rolling-window analyses. These collective refinements mitigate multicollinearity and overfitting by discouraging redundant covariates. For instance, SPX, DJI, and IXIC—highly correlated U.S. indices—are rarely all included simultaneously, as their combined likelihood improvement triggers the penalty mechanism.
Despite these advancements, sampling transition probabilities within non-homogeneous HMMs remains challenging. Some variables display low ESS and elevated Gelman–Rubin statistics (Diagnostics Section). Among the 96 estimated MCMC chains, 4 exceeded the PSRF convergence threshold of
in the early subsample, 5 in the recent subsample, and 9 in the full sample, with poor convergence more frequently observed in the NH-HMM specifications. Restrictive priors, or attempts to inflate priors for broader exploration, can yield overflow errors in the logistic transition likelihood if
becomes excessively large. Similar complexities are noted in [
38], wherein non-linear transitions undermine stable mixing. Although Pólya–Gamma augmentation [
23] addresses non-conjugacy issues, time-varying transitions and modest covariate effects can complicate convergence. Empirically, adding the VIX uncertainty index to transition equations often does not enhance accuracy, deviating from [
30,
31] but aligning with [
10], which report limited forecasting gains from uncertainty proxies.
Another key observation is that MCMC-selected covariates—optimized for broad, long-term effects—do not consistently surpass benchmark sets under rolling-window forecasts aimed at short-term patterns. In high-correlation scenarios, substituting one macroeconomic or Bitcoin variable for another often yields minimal net likelihood gain. For instance, the non-homogeneous MCMC set (CNYUSD, DJI, EURUSD, CL_F) is closely matched by a slightly different Set 1 (CNYUSD, DJI, GC_F, VOL), emphasizing that strong pairwise correlations can make certain variables interchangeable.
Strengths. The methodological framework—Bayesian MCMC-based variable selection combined with homogeneous and non-homogeneous HMMs—offers several notable strengths. It effectively captures regime-dependent relationships and accommodates structural shifts in Bitcoin’s price dynamics. By integrating long- and short-term forecasting through a rolling-window bootstrap procedure, the approach remains flexible to evolving market conditions. The Bayesian framework in particular enhances model parsimony by using posterior inclusion probabilities, thereby reducing the risk of overfitting in high-dimensional settings.
Weaknesses. Several limitations should be acknowledged. The non-linear nature of transition probabilities in the non-homogeneous HMMs introduces considerable computational challenges, often leading to slower MCMC convergence and lower effective sample sizes. Additionally, the reliance on lagged covariates may limit the model’s real-time predictive responsiveness, and the methodological complexity could reduce transparency and interpretability for broader, non-technical audiences.
Opportunities. This framework opens up promising opportunities for extension. It can be adapted to other volatile assets or markets undergoing structural transformation. Future research could explore its integration with sentiment indicators, regime duration modeling, or hybrid machine learning-HMM systems. Moreover, real-time macro-financial surveillance tools based on this framework could aid policymakers and institutional investors.
Threats. Potential threats include the risk of misinterpretation or over-reliance on model outputs, particularly in policymaking or financial regulation. Model performance may degrade rapidly during unprecedented crises (e.g., pandemics or geopolitical shocks) if relevant covariates are omitted or lagged responses are misaligned. Moreover, high computational demands and limited accessibility for practitioners may hinder broader adoption.
Beyond statistical and econometric modeling, Bitcoin’s long-term evolution is embedded in broader macro-environmental dynamics. A PESTLE framework helps illustrate this multifaceted landscape. Politically, Bitcoin faces shifting regulatory positions—ranging from El Salvador’s adoption to the European MiCA regulation—affecting global legitimacy. Economically, its ties to inflation expectations, institutional investment flows, and interest rate regimes grow stronger. Socially, cultural attitudes, generational shifts, and public narratives drive adoption cycles. Technologically, innovations like Layer 2 scaling and proof-of-stake alternatives shape user behavior and infrastructure. Legally, classification as a commodity or security continues to shape compliance obligations and institutional involvement. Environmentally, Bitcoin mining remains under scrutiny due to energy intensity, sparking interest in sustainable practices. These contextual forces deepen Bitcoin’s integration into traditional financial systems while also introducing volatility through exogenous shocks.
The findings of this study suggest that Bitcoin is increasingly influenced by traditional macroeconomic factors, particularly during recent periods of institutional adoption and regulatory attention. Policymakers and financial regulators should consider integrating Bitcoin and other major cryptocurrencies into broader macro-financial surveillance frameworks. The heightened sensitivity of Bitcoin prices to interest rates, exchange rates, and stock market volatility implies that central bank policies and fiscal developments may have amplified spillover effects into the crypto asset space. Therefore, establishing clear, adaptive regulatory frameworks—such as MiCA in the EU or guidelines on exchange-traded products—can help stabilize investor expectations and reduce market uncertainty. Additionally, given the limitations of long-horizon predictability, policy communication strategies should emphasize the inherent risks of speculative behavior, particularly for retail investors.
4.1. Alternative Approaches for Covariate Selection
Given the impact of variable selection on forecast performance, various machine learning methods were tested to gauge their suitability.
4.1.1. Least Absolute Shrinkage and Selection Operator (LASSO)
First, LASSO was applied to the complete covariate set to isolate coefficients shrunk to zero. However, the optimal (selected via cross-validation) retained essentially all covariates, reflecting high dimensionality and multicollinearity. Adopting a larger (one standard error from the minimum) only excluded , which still left too many covariates for the HMM structure, risking overfitting.
4.1.2. Elastic Net
An Elastic Net model with (balancing L1 and L2 penalties) was then employed under cross-validation. Nevertheless, no coefficients were shrunk to zero, paralleling the outcome of LASSO. Thus, while these methods offer efficient shrinkage in conventional regression contexts, they proved unsuitable for HMM-based analyses due to entrenched multicollinearity and regime-switching dynamics.
4.1.3. Random Forest (RF)
A Random Forest algorithm was next used to rank covariates based on reductions in out-of-bag accuracy. The top four—SPX, IXIC, INFL, and DJI—strongly overlapped, capturing similar U.S. financial market trends. Incorporating all four into a rolling HMM forecast induced convergence problems stemming from high correlation among indices. These results reaffirm that solely algorithmic feature ranking must be reconciled with the requirements of HMM-based methods.
4.1.4. Support Vector Machine (SVM)
Finally, a rolling-window SVM was tested on identical covariate sets. Despite the same bootstrap logic, SVM forecasts for 1-day-ahead lagged substantially behind HMM equivalents. For example, a rolling SVM using Set 2 over the full sample produced a MAPE of 29.52%, compared to 15.78% with the homogeneous HMM. Coupled with higher computational expenses, SVM was deemed impractical for the studied regime-switching context.
Implications and Future Directions
Collectively, these alternative strategies highlight the distinctive requirements of HMM-based forecasting. Methods like LASSO or Elastic Net could not adequately reduce the correlated feature space. Random Forest offered feature importance insights but did not address multicollinearity in transition models. Rolling SVM attempts incurred reduced accuracy and heavier computation. In contrast, the Bayesian MCMC approach—tailored for regime-switching—showed robust short-term performance when combined with rolling windows and bootstrapping while also identifying high-impact covariates for the long run.
From a practical standpoint, investors may wish to monitor macroeconomic indicators (e.g., exchange rates, stock indices) alongside Bitcoin-specific metrics (e.g., trading volume, halving cycles) to anticipate major price swings. Policymakers could leverage such insights to understand how cryptocurrencies interface with traditional financial systems, potentially shaping more informed regulatory measures. Future research might extend this work by incorporating structural break detection or investor sentiment variables directly into state transitions, offering a finer-grained view of how exogenous events reshape Bitcoin’s market regimes.
5. Conclusions
This paper investigates whether Bitcoin has become increasingly sensitive to macroeconomic variables, especially amid rising institutional adoption. While some studies characterize Bitcoin as an autonomous asset, others posit growing convergence with traditional financial markets. To clarify this debate, the Bayesian MCMC covariate selection method of [
14] was extended to both homogeneous and non-homogeneous Hidden Markov Models (HMMs). This study thoughtfully integrates established techniques—Bayesian MCMC-based covariate selection, regime-switching modeling, and rolling-window forecasting—to address challenges like overfitting, convergence, and predictive instability. While each method individually is well established in the literature, their combined application to the evolving cryptocurrency market offers new empirical insights.
A dataset comprising 16 macroeconomic and Bitcoin-specific variables from 2016 to 2024 was used to compare the MCMC-selected covariate sets against two benchmark sets:
Set 1, a combination of macroeconomic and Bitcoin-specific variables;
Set 2, Bitcoin-focused covariates only.
In non -homogeneous HMMs, the VIX uncertainty index served as a transition covariate. Analyses across an early (pre-2019) and recent (post-2019) period demonstrate a clear evolution in Bitcoin’s price determinants. Whereas early Bitcoin price movements were dominated by internal metrics, including trading volume and halving cycles, the recent era shows a stronger influence from macroeconomic indicators, particularly exchange rates and U.S. market indices. Dynamic forecasting errors further reveal that sharp volatility often aligns with regulatory shifts, major institutional announcements, and potential market manipulation or sentiment-driven episodes.
By coupling Bayesian MCMC-based variable selection with rolling-window forecasts, this study contributes a robust framework for uncovering the shifting drivers behind Bitcoin’s price dynamics. Practical considerations—such as effectively sampling non-linear transitions or managing high-dimensional covariate sets—proved crucial to the methodology’s success.
Future Research
Potential extensions include (i) incorporating Bayesian change-point detection for extreme volatility episodes, (ii) adding sentiment or social-media-based indicators to capture behavioral effects, and (iii) testing structural break hypotheses tied to pivotal regulatory or macroeconomic disruptions. These directions can further refine understanding of how digital assets integrate into—and diverge from—traditional financial systems.