Machine Learning Analytics for Blockchain-Based Financial Markets: A Confidence-Threshold Framework for Cryptocurrency Price Direction Prediction

Kuznetsov, Oleksandr; Kostenko, Oleksii; Klymenko, Kateryna; Hbur, Zoriana; Kovalskyi, Roman

doi:10.3390/app152011145

by Oleksandr Kuznetsov ^1,2,*, Oleksii Kostenko ^3,*, Kateryna Klymenko ^4, Zoriana Hbur ^5 and Roman Kovalskyi ^6

^1 Department of Theoretical and Applied Sciences, eCampus University, Via Isimbardi 10, 22060 Novedrate, CO, Italy
^2 Department of Intelligent Software Systems and Technologies, School of Computer Science and Artificial Intelligence, V.N. Karazin Kharkiv National University, 4 Svobody Sq., 61022 Kharkiv, Ukraine
^3 State Scientific Institution “Institute of Information, Security and Law of the National Academy of Legal Sciences of Ukraine”, 3, Pylypa Orlyka Street, 01024 Kyiv, Ukraine
^4 Institute of Compliance in Financial Markets, Chicago Kent College of Law, 565 West Adams Street, Chicago, IL 60661, USA
^5 Department of Economics, Hryhorii Skovoroda University in Pereiaslav, 30, Sukhomlynsky Street, 08401 Pereiaslav, Ukraine
^6 Department of Post-Graduate and Doctoral Courses, State University “Kyiv Aviation Institute”, 1, Liubomyra Huzara Ave., 03058 Kyiv, Ukraine
^* Authors to whom correspondence should be addressed.

Appl. Sci. 2025, 15(20), 11145; https://doi.org/10.3390/app152011145

Submission received: 22 September 2025 / Revised: 11 October 2025 / Accepted: 13 October 2025 / Published: 17 October 2025

(This article belongs to the Special Issue Blockchain Technologies: Trends, Challenges, Potentials and Applications)

Featured Application

The confidence-threshold framework developed in this research presents immediate applications for cryptocurrency trading platforms, blockchain-based financial services, and decentralized finance (DeFi) protocols. Trading firms can implement the selective execution strategy to improve risk-adjusted returns while reducing exposure during high-uncertainty periods. Cryptocurrency exchanges can integrate the confidence scoring methodology to enhance market-making algorithms and provide better liquidity provisioning. DeFi protocols can utilize the framework for automated portfolio rebalancing and yield optimization strategies that adapt to varying market conditions. Institutional investors entering cryptocurrency markets can employ the approach for systematic allocation decisions that account for blockchain market volatility characteristics. The methodology’s emphasis on order book microstructure makes it particularly suitable for high-frequency trading applications where blockchain transaction transparency provides unique data advantages. Regulatory bodies can leverage the systematic risk assessment capabilities to monitor cryptocurrency market stability and identify potential systemic risks in blockchain-based financial systems. The framework’s ability to quantify prediction confidence also supports the development of risk management tools specifically designed for blockchain asset portfolios, addressing a critical need as cryptocurrency adoption expands across traditional financial institutions.

Abstract

Blockchain-based cryptocurrency markets present unique analytical challenges due to their decentralized nature, continuous operation, and extreme volatility. Traditional price prediction models often struggle with the binary trade execution problem in these markets. This study introduces a confidence-based classification framework that separates directional prediction from execution decisions in cryptocurrency trading. We develop a neural network system that processes multi-scale market data, combining daily macroeconomic indicators with high-frequency order book microstructure. The model trains exclusively on directional movements (up versus down) and uses prediction confidence levels to determine trade execution. We evaluate the framework across 11 major cryptocurrency pairs over 12 months. Experimental results demonstrate 82.68% direction accuracy on executed trades, with an average net profit of 151.11 basis points per trade at 11.99% market coverage. Order book features dominate predictive importance (81.3% of selected features), validating the critical role of blockchain microstructure data for short-term price prediction. The confidence-based execution strategy achieves superior risk-adjusted returns compared to traditional classification approaches while providing natural risk management capabilities through selective trade execution. These findings contribute to blockchain technology applications in financial markets by demonstrating how decentralized market microstructure can be leveraged for systematic trading strategies. The methodology offers practical implementation guidelines for cryptocurrency algorithmic trading while advancing the understanding of machine learning applications in blockchain-based financial systems.

Keywords:

blockchain technology; cryptocurrency markets; machine learning; confidence-based classification; algorithmic trading; market microstructure; neural networks; trading systems; financial prediction; decentralized markets

1. Introduction

Blockchain technology has fundamentally transformed financial market infrastructure by enabling decentralized transaction processing and immutable record-keeping without traditional intermediaries [1,2]. Cryptocurrency markets represent the most prominent real-world application of blockchain technology, creating unprecedented opportunities for algorithmic trading and quantitative finance research [3,4]. The continuous 24/7 trading environment, combined with high-frequency data availability and extreme price volatility, presents unique challenges for predictive modeling that distinguish blockchain-based assets from conventional financial instruments.

The emergence of cryptocurrency markets has introduced novel market microstructure dynamics that traditional financial models struggle to address effectively. Unlike equity markets with established market makers and regulatory oversight, blockchain-based trading operates through decentralized protocols where price discovery occurs through peer-to-peer interactions across global participants. This decentralized structure creates distinctive patterns in order book dynamics, liquidity provision, and volatility clustering that require specialized analytical approaches.

Recent developments in cryptocurrency market analysis reveal the inadequacy of conventional three-class prediction frameworks that attempt to simultaneously forecast direction and trading signals [5,6]. Traditional approaches classify price movements as up, down, or no-trade, forcing models to learn both directional patterns and execution timing within a single optimization objective. This conflation of prediction and execution decisions often results in suboptimal performance in volatile cryptocurrency markets where prediction confidence varies significantly across different market conditions.

The proliferation of high-frequency blockchain data provides unprecedented opportunities for microstructure-based prediction models. Order book snapshots, transaction flows, and network activity metrics offer granular insights into market dynamics that were previously unavailable in traditional financial markets. However, integrating multi-scale data sources, from minute-level microstructure to daily macroeconomic indicators, presents complex feature engineering challenges that require sophisticated analytical frameworks.

Machine learning applications in cryptocurrency markets have demonstrated promising results but face critical limitations in practical deployment. Existing approaches often optimize for prediction accuracy without adequate consideration of trading costs, execution feasibility, or risk management requirements. The extreme volatility characteristic of blockchain-based assets necessitates selective execution strategies that balance prediction accuracy with prudent risk exposure.

Contemporary research in cryptocurrency prediction focuses primarily on deep learning architectures, technical indicator combinations, and sentiment analysis integration. Izadi and Hajizadeh (2025) [7] demonstrate 57% accuracy in Bitcoin trend prediction using transformer-based models, while Liu and Huang (2024) [8] achieve profitable results through technical indicator integration with LSTM networks. However, these approaches typically evaluate performance across all time periods without consideration for prediction confidence or selective execution strategies.

The growing institutional adoption of cryptocurrencies has elevated the importance of robust analytical frameworks for blockchain-based financial markets. Ballis et al. (2025) [3] document Bitcoin’s evolution into a digital safe haven during global crises, while Bonaparte (2022) [9] provides evidence that households increasingly view cryptocurrencies as long-term productive assets rather than speculative instruments. This institutional maturation demands more sophisticated trading approaches that can adapt to varying market conditions while maintaining consistent risk-adjusted returns.

Market efficiency research in cryptocurrency markets reveals complex relationships between liquidity, volatility, and price discovery mechanisms. Bouteska et al. (2025) [10] demonstrate that market efficiency varies significantly across different cryptocurrencies and time periods, with efficiency improvements associated with increased liquidity and reduced volatility. These findings suggest that prediction models should incorporate market condition indicators to optimize performance across different efficiency regimes.

The interconnected nature of cryptocurrency markets creates systemic risk transmission mechanisms that traditional asset classes do not exhibit. Franco and Laurini (2025) [11] quantify systemic risk using high-frequency data, revealing strong interconnectedness among major cryptocurrencies with Bitcoin and Ethereum serving as primary risk transmission sources. This systemic interconnectedness supports the development of confidence-based execution strategies that can adapt to varying levels of market stress.

This research introduces a confidence-threshold framework that addresses fundamental limitations in existing cryptocurrency prediction approaches by separating directional forecasting from execution decisions. The methodology trains binary classifiers exclusively on directional movements while using prediction confidence scores to determine trade execution. This separation enables selective trading strategies that execute only high-conviction predictions, potentially improving risk-adjusted returns in volatile cryptocurrency markets.

The confidence-based approach offers several advantages over traditional prediction methods. First, it eliminates the need for arbitrary no-trade class definitions that vary across market conditions and time horizons. Second, it provides explicit uncertainty quantification that enables adaptive risk management. Third, it accommodates varying prediction horizons and deadband thresholds without requiring model retraining.

The experimental framework evaluates the confidence-threshold approach using comprehensive data spanning 11 major cryptocurrency pairs from October 2023 to October 2024. The dataset integrates daily macroeconomic indicators (https://www.kaggle.com/datasets/imtkaggleteam/top-100-cryptocurrency-2020-2025 (accessed on 11 October 2025)) with minute-level order book snapshots (https://www.kaggle.com/datasets/ilyazawilsiv/cryptocurrency-order-book-data-asks-and-bids (accessed on 11 October 2025)), creating a 296-dimensional feature space that captures both macro trends and microstructure dynamics. The evaluation employs symbol-wise temporal splitting to prevent data leakage while ensuring representative performance across different cryptocurrency market segments.

The research contributes to blockchain technology applications in financial markets through three primary innovations. First, it demonstrates the effectiveness of confidence-based execution for cryptocurrency trading, achieving 82.68% direction accuracy with 151.11-basis point average profit per trade. Second, it validates the dominance of order book features over traditional technical indicators for short-term cryptocurrency prediction. Third, it provides practical guidelines for implementing selective trading strategies in volatile blockchain-based markets.

The findings advance the understanding of machine learning applications in cryptocurrency markets while addressing practical deployment considerations that existing research often overlooks. The confidence-threshold framework bridges academic research and industry practice by providing actionable trading strategies that account for transaction costs, execution constraints, and risk management requirements.

This study tests three core hypotheses about cryptocurrency price prediction:

First, we hypothesize that separating directional prediction from execution decisions through confidence thresholding improves risk-adjusted returns compared to traditional all-trade classification. Conventional approaches force models to learn both direction and trading signals simultaneously, potentially degrading performance in high-uncertainty periods.
Second, we hypothesize that order book microstructure features provide superior predictive power for short-term cryptocurrency price movements compared to traditional technical indicators. The transparency of blockchain transactions and a decentralized market structure may make limit order book dynamics more informative than in traditional markets.
Third, we hypothesize that prediction confidence correlates with actual accuracy across cryptocurrency pairs, enabling selective execution strategies that trade coverage for profitability. This relationship would validate confidence scores as practical risk management signals.

Our research questions follow directly:

RQ1: Does confidence-based selective execution outperform constant-threshold classification in cryptocurrency markets when evaluated on risk-adjusted returns?
RQ2: What proportion of predictive importance comes from order book microstructure versus traditional price-based features across different cryptocurrency market segments?
RQ3: How does the coverage–accuracy trade-off vary across market capitalizations, and what does this reveal about signal quality in different liquidity regimes?

We address these questions through systematic experimentation on 11 cryptocurrency pairs spanning diverse market conditions over 12 months.

The remainder of this paper is organized as follows. Section 2 reviews related work in blockchain-based financial markets and machine learning prediction methodologies. Section 3 describes the confidence-threshold framework and feature engineering pipeline. Section 4 presents the experimental design and dataset specifications. Section 5 analyzes results across multiple performance dimensions. Section 6 discusses implications for blockchain market analytics and practical implementation considerations. Section 7 concludes with future research directions and broader applications of confidence-based prediction in blockchain technology contexts.

2. Background and Related Work

Blockchain technology fundamentally transforms financial market infrastructure through decentralized transaction verification and immutable record-keeping. Cryptocurrency markets represent the most prominent application of blockchain technology in finance, creating new paradigms for price discovery, liquidity provision, and market microstructure. Unlike traditional financial markets that operate through centralized exchanges with defined trading hours, blockchain-based markets enable continuous 24/7 trading across global participants without geographic restrictions.

2.1. Blockchain-Based Financial Markets and Prediction Challenges

The decentralized nature of blockchain networks introduces unique market dynamics that distinguish cryptocurrency trading from conventional financial instruments. Market participants interact directly through peer-to-peer networks, eliminating traditional intermediaries while creating new challenges for price prediction and risk management. Liu et al. (2025) [12] identify systematic patterns in cryptocurrency liquidity, revealing an inverted U-shaped distribution of bid–ask spreads and trading volumes throughout the week, with strong liquidity commonality across different cryptocurrencies.

Market volatility in cryptocurrencies significantly exceeds traditional assets, with Bitcoin and Ethereum exhibiting daily volatility ranges of 2–15% compared to 0.5–3% for major equity indices (Yang et al., 2025 [13]). This heightened volatility creates both opportunities and challenges for predictive modeling, as traditional statistical assumptions often fail under extreme market conditions.

Retail trader dominance distinguishes cryptocurrency markets from institutional-driven traditional markets. Hsieh et al. (2025) [14] establish that momentum effects in cryptocurrencies arise only during UP-UP market regimes, contrasting with equity markets where momentum persists across various state transitions. This asymmetric momentum pattern reflects belief-updating mechanisms specific to retail-driven speculative markets.

2.2. Baseline and Classical Approaches

Before examining deep learning methods, we acknowledge foundational work on simpler models for financial prediction. Logistic regression and linear discriminant analysis remain competitive baselines for directional forecasting when combined with proper feature engineering (Ballings et al., 2015 [15]; Henrique et al., 2019 [16]). Tree-based ensembles—particularly XGBoost and LightGBM—have demonstrated strong performance for cryptocurrency price prediction due to their ability to capture non-linear interactions and handle mixed-scale features (Zhang et al., 2024 [17]). Our choice of neural networks over gradient boosting reflects the streaming nature of order book data, where sequential patterns justify recurrent or attention-based architectures, though we acknowledge ensemble methods as viable alternatives.

Cryptocurrency research has benefited from increasingly rich public datasets [18,19]. The Kaggle cryptocurrency order book dataset (Ilyazawilsiv, 2024 [18]) used in our study represents one of the few sources providing tick-level depth across multiple exchanges. Prior work on equity markets established the predictive value of order book imbalance (Cont et al., 2014 [20]) and price impact modeling (Hautsch & Huang, 2012 [21]), but cryptocurrency order books exhibit distinct dynamics due to decentralized market structures and retail trader dominance. The integration of macro (daily OHLCV [19]) and micro (minute order book [18]) data follows multi-scale modeling principles from the high-frequency trading literature (Cartea et al., 2015 [22]), though adaptation to 24/7 cryptocurrency markets requires careful temporal alignment procedures that we detail in Section 3.2.

Our confidence-threshold framework builds on the selective prediction paradigm (El-Yaniv & Wiener, 2010 [23]; Geifman & El-Yaniv, 2017 [24]), where models abstain from low-confidence predictions to improve realized accuracy. Recent work on conformal prediction for regression (Romano et al., 2019 [25]) and classification provides distribution-free uncertainty quantification, though application to non-stationary cryptocurrency markets presents challenges.

Realistic performance evaluation requires modeling transaction costs, market impact, and execution constraints. Seminal work by Almgren & Chriss (2001 [26]) established optimal execution frameworks for minimizing implementation shortfall. For cryptocurrencies, Makarov & Schoar (2020 [27]) document significant price differences across exchanges, averaging 2.5% but occasionally exceeding 10%, highlighting arbitrage opportunities and execution complexity. Our 1-basis point cost assumption reflects maker fees on centralized exchanges (Binance, 2025 [28]) but may underestimate true costs including slippage and adverse selection.

2.3. Machine Learning Applications in Cryptocurrency Prediction

Recent advances in deep learning have spawned numerous approaches for cryptocurrency price prediction, each addressing different aspects of market complexity. Zhang et al. (2024) [29] provide a comprehensive survey identifying four primary modeling categories: price prediction, portfolio construction, bubble analysis, and trading strategy development.

Sequence modeling approaches dominate current research, with LSTM and GRU architectures showing consistent performance across multiple cryptocurrencies. Golnari et al. (2024) [30] introduce probabilistic gated recurrent units (P-GRU) that generate probability distributions for predicted values rather than point estimates. Their approach demonstrates superior performance on Bitcoin 5 min data and successful transfer learning to six additional cryptocurrencies.

Attention mechanisms enhance traditional sequence models by capturing long-range dependencies in cryptocurrency time series. Peng et al. (2024) [31] propose an attention-based CNN-LSTM model for multiple cryptocurrencies (ACLMC) that exploits correlations across frequencies and currencies. Their triple trend labeling method reduces transaction frequency while maintaining profitability, addressing practical trading implementation concerns.

Network-based approaches recognize that cryptocurrency prices exhibit interdependencies that individual modeling cannot capture. Zhong et al. (2023) [32] developed LSTM-ReGAT, combining LSTM for individual features with relation-wise graph attention networks for cryptocurrency interactions. Their network-centric approach achieves superior trading profits by incorporating cryptocurrency interrelations into prediction models.

Feature engineering advances focus on incorporating domain-specific indicators beyond traditional technical analysis. Kang et al. (2025) [33] evaluate twelve deep learning models combined with technical indicators, finding that SegRNN outperforms other models for price forecasting while TimesNet combined with Bollinger Bands achieves optimal trading performance. Their results indicate that technical indicator integration provides significant improvements at 4 h intervals but limited benefits at shorter timeframes.

Alternative feature extraction methods explore cryptocurrency-specific characteristics. Zhang et al. (2024) [17] propose generalized visible curvature indicators that capture trend, acceleration, and volatility interactions for bubble identification and price prediction. Their curvature-based features significantly improve Light Gradient Boosting Machine performance compared to standard statistical tests.

Anomaly detection represents an emerging application area addressing cryptocurrency market instability. Pellicani et al. (2025) [34] introduce CARROT, which exploits temporal correlations among cryptocurrencies to predict anomalies through clustered multi-target LSTM models. Their approach outperforms single-target models by 20% in macro F1 score, demonstrating the value of cross-cryptocurrency information sharing.

Multi-scale analysis approaches combine different temporal horizons and data sources. Viéitez et al. (2024) [35] develop Ethereum prediction models using contextual stock indices and online trends while excluding technical price indicators. Their knowledge-based investment strategies generate profit factors up to 5.16, suggesting that external market indicators provide valuable predictive information.

2.4. Comparative Survey of Machine Learning Approaches for Cryptocurrency Prediction

Table 1 synthesizes representative studies across different model families, providing a structured comparison of methodologies, data characteristics, and performance outcomes. We organize works by modeling approach to clarify the state of the art and position our contribution.

Key observations from the comparative analysis are as follows:

First, most existing work evaluates performance across all predictions without selective execution mechanisms. Our confidence-threshold approach introduces systematic selectivity, trading coverage for accuracy, a dimension largely unexplored in the current literature.
Second, order book microstructure data remains underutilized. While some studies incorporate basic bid–ask features, comprehensive multi-level order book analysis (50 depth levels in our case) has not been systematically evaluated against traditional technical indicators.
Third, validation protocols vary significantly. Many studies employ standard train–test splits without addressing cryptocurrency-specific challenges like cross-asset leakage or regime non-stationarity. Our symbol-wise temporal validation prevents such artifacts.
Fourth, realistic trading constraints often receive limited attention. Studies reporting accuracy metrics without transaction cost analysis, slippage modeling, or holding period constraints may overestimate practical viability. We explicitly incorporate 1 bp costs and specify our 10 h execution horizon.
Finally, risk-adjusted metrics beyond Sharpe ratios remain scarce. Our reporting includes per-trade Sharpe alongside traditional classification metrics (F1, balanced accuracy) and profit distributions (percentiles, drawdown characteristics) to provide multidimensional performance assessment.

3. Methodology

The confidence-threshold framework integrates multi-scale market data to predict cryptocurrency price movements through a binary classification approach coupled with execution decision logic. This methodology addresses the inherent challenges of cryptocurrency market prediction by separating directional forecasting from trading execution decisions. The approach leverages both macroeconomic indicators derived from daily price data and microstructure information extracted from high-frequency order book snapshots.

3.1. Data Sources and Preprocessing

The dataset comprises two complementary data streams, with the analysis window spanning October 2023 to October 2024. The macro dataset contains daily OHLCV data for 100 cryptocurrency pairs over a longer period, from August 2018 to August 2025, providing 211,679 observations across 38 features. The micro dataset captures minute-level order book snapshots for 11 cryptocurrency pairs, yielding 5,672,947 observations across 264 microstructure features.

Table 2 presents the comprehensive data source specifications and quality metrics for each dataset component.

The macro dataset preprocessing implements quality control through adaptive volatility-based thresholds and temporal consistency checks. Price movements exceeding 20% trigger validation flags, with adjustments based on rolling volatility windows. Temporal gaps exceeding 24 h are identified to ensure complete coverage for indicator calculations.
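The two macro checks described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `flag_macro_anomalies` and its parameters are hypothetical, and the fixed 20% limit stands in for the adaptive, volatility-adjusted threshold, which is omitted here.

```python
from datetime import datetime, timedelta

def flag_macro_anomalies(timestamps, closes, jump_limit=0.20, max_gap_hours=24):
    """Return (jump_idx, gap_idx): row indices that would need validation.

    Assumption: a fixed jump limit is used; the rolling-volatility
    adjustment mentioned in the text is not modeled in this sketch.
    """
    jump_idx, gap_idx = [], []
    for i in range(1, len(closes)):
        # Simple close-to-close return between consecutive rows.
        daily_return = closes[i] / closes[i - 1] - 1.0
        if abs(daily_return) > jump_limit:
            jump_idx.append(i)
        # A gap strictly longer than max_gap_hours means missing days.
        if timestamps[i] - timestamps[i - 1] > timedelta(hours=max_gap_hours):
            gap_idx.append(i)
    return jump_idx, gap_idx

days = [datetime(2024, 1, d) for d in (1, 2, 3, 5)]
prices = [100.0, 101.0, 130.0, 131.0]
jumps, gaps = flag_macro_anomalies(days, prices)
print(jumps, gaps)
```

In this toy series the third row is flagged for a jump of roughly 29%, and the fourth row is flagged because one calendar day is missing before it.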

Micro dataset preprocessing addresses high-frequency order book challenges through symbol normalization, minute-level aggregation via last-observation-carried-forward, and order book structure validation. The system verifies monotonic price ordering, identifies crossed markets, and applies adaptive spread filtering (0.001–200 bps). Table 3 summarizes the complete preprocessing pipeline.
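A rough sketch of the order book structure checks might look like the following. The function name, the status strings, and the choice of the mid-price as the spread denominator are assumptions made for illustration; only the 0.001 to 200 bps spread bounds come from the text, and the last-observation-carried-forward aggregation step is not covered.

```python
def check_snapshot(bids, asks, min_bps=0.001, max_bps=200.0):
    """Validate one snapshot. bids/asks are price lists, best level first.

    Returns a status string (hypothetical labels chosen for this sketch).
    """
    if not bids or not asks:
        return "empty"
    # Bids must fall and asks must rise as we move away from the touch.
    bids_ok = all(a > b for a, b in zip(bids, bids[1:]))
    asks_ok = all(a < b for a, b in zip(asks, asks[1:]))
    if not (bids_ok and asks_ok):
        return "non_monotonic"
    best_bid, best_ask = bids[0], asks[0]
    # Treat locked markets (bid == ask) the same as crossed ones.
    if best_bid >= best_ask:
        return "crossed"
    mid = (best_bid + best_ask) / 2.0
    spread_bps = (best_ask - best_bid) / mid * 1e4
    if not (min_bps <= spread_bps <= max_bps):
        return "spread_out_of_range"
    return "ok"

print(check_snapshot([100.00, 99.99], [100.02, 100.03]))
print(check_snapshot([100.05, 99.99], [100.02, 100.03]))
```

Snapshots that return anything other than "ok" would be dropped or repaired before feature extraction, depending on the pipeline's policy.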

Figure 1 displays temporal coverage, showing consistent 24/7 data availability with minor exchange maintenance gaps. Peak activity occurs at 07:00 UTC during European-American session overlap.

3.2. Feature Engineering Framework

The pipeline creates a unified 296-dimensional feature space combining temporal, microstructure, and technical indicators (Figure 2). Technical indicators use proper temporal alignment: moving averages with 5-, 10-, 20-day windows and one-period lagging, RSI with Wilder’s smoothing over 14 days, and volatility via rolling standard deviations (5, 15, 30 days).
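To make the alignment of the technical indicators concrete, the sketch below shows a one-period-lagged moving average and an RSI with Wilder's smoothing in plain Python. The function names are hypothetical, and returning 100 when the average loss is zero is a convention assumed here rather than something stated in the paper.

```python
def lagged_sma(closes, window):
    """Moving average at t built only from closes[t-window:t] (excludes t)."""
    out = [None] * len(closes)
    for t in range(window, len(closes)):
        out[t] = sum(closes[t - window:t]) / window
    return out

def wilder_rsi(closes, period=14):
    """RSI with Wilder's smoothing; entries before `period` are None."""
    out = [None] * len(closes)
    if len(closes) <= period:
        return out
    changes = [closes[i] - closes[i - 1] for i in range(1, len(closes))]
    gains = [max(c, 0.0) for c in changes]
    losses = [max(-c, 0.0) for c in changes]
    # Seed with simple averages over the first `period` changes.
    avg_gain = sum(gains[:period]) / period
    avg_loss = sum(losses[:period]) / period
    for t in range(period, len(closes)):
        if t > period:
            # Wilder's recursion: weight (period - 1) on the previous average.
            avg_gain = (avg_gain * (period - 1) + gains[t - 1]) / period
            avg_loss = (avg_loss * (period - 1) + losses[t - 1]) / period
        if avg_loss == 0.0:
            out[t] = 100.0  # assumed convention when there are no losses
        else:
            out[t] = 100.0 - 100.0 / (1.0 + avg_gain / avg_loss)
    return out

print(lagged_sma([1, 2, 3, 4, 5], 3))
print(wilder_rsi([10, 11, 10, 12], period=2))
```

Because the lagged average at day t never includes the close of day t itself, a feature built this way cannot leak same-day information into the prediction.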

Microstructure extraction processes order book snapshots across 50 depth levels, computing volume-weighted spreads, order imbalance ratios, and depth concentration. Figure 3 shows median spreads of 1.55 basis points with the 95th percentile at 9.14 bps.
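As an illustration of the kind of quantities involved, the following sketch computes a quoted spread, a volume-weighted spread, an order imbalance ratio, and a depth concentration measure from a single snapshot. The exact formulas are assumptions chosen for this example (the paper does not spell them out here), and `book_features` and `top_k` are hypothetical names.

```python
def book_features(bids, asks, top_k=1):
    """bids/asks: lists of (price, volume) tuples, best level first."""
    bid_vol = sum(v for _, v in bids)
    ask_vol = sum(v for _, v in asks)
    mid = (bids[0][0] + asks[0][0]) / 2.0
    # Volume-weighted average price on each side of the book.
    vwap_bid = sum(p * v for p, v in bids) / bid_vol
    vwap_ask = sum(p * v for p, v in asks) / ask_vol
    # Volume resting in the top_k levels of both sides.
    top_vol = sum(v for _, v in bids[:top_k]) + sum(v for _, v in asks[:top_k])
    return {
        "quoted_spread_bps": (asks[0][0] - bids[0][0]) / mid * 1e4,
        "vw_spread_bps": (vwap_ask - vwap_bid) / mid * 1e4,
        "imbalance": (bid_vol - ask_vol) / (bid_vol + ask_vol),
        "depth_concentration": top_vol / (bid_vol + ask_vol),
    }

snapshot_bids = [(100.0, 2.0), (99.9, 1.0)]
snapshot_asks = [(100.1, 1.0), (100.2, 1.0)]
print(book_features(snapshot_bids, snapshot_asks))
```

A positive imbalance indicates more resting volume on the bid side; in the study these quantities would be computed over 50 depth levels rather than the two shown here.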

Feature correlation analysis (Figure 4) reveals distinct patterns: asset correlations dominate macro features while microstructure correlations remain localized. RobustScaler handles the dramatic scale variation (0.017 to 95 M) across features. Figure 5 demonstrates that volatility features achieve the highest mutual information scores.
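The idea behind robust scaling can be shown with a small standard-library sketch that centers each feature on its median and divides by the interquartile range. It mirrors the general approach of scikit-learn's RobustScaler but is not that class; the function name and the zero-IQR fallback are assumptions of this example.

```python
from statistics import median, quantiles

def robust_scale(values):
    """Center on the median and scale by the interquartile range (IQR)."""
    # "inclusive" quartiles interpolate linearly between sorted data points.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    center = median(values)
    if iqr == 0:
        # Assumed fallback for constant-like features: center only.
        return [v - center for v in values]
    return [(v - center) / iqr for v in values]

print(robust_scale([1, 2, 3, 4, 5]))
print(robust_scale([1, 2, 3, 4, 1000]))
```

In the second call the outlier maps to a large scaled value, but the other four points are scaled exactly as in the first call, which is the property that makes this transform suitable for features with heavy tails.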

Temporal alignment combines end-of-day macro features with minute-level micro features using proper lagging to prevent information leakage. Figure 6 shows intraday patterns with peak activity during European and US sessions, exhibiting systematic spread tightening during high-volume periods.
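One way to implement such a leak-free join is an as-of lookup that attaches to every minute bar the daily features of the most recent strictly earlier calendar day. The sketch below assumes daily rows are keyed by date and sorted ascending; `align_macro_to_minutes` is a hypothetical name, and the one-day lag is an assumption about what "proper lagging" means here.

```python
from bisect import bisect_left
from datetime import date, datetime

def align_macro_to_minutes(minute_ts, daily_dates, daily_features):
    """For each minute timestamp, return the features of the latest daily
    row dated strictly before that minute's calendar day (None if absent).

    daily_dates must be sorted ascending and parallel to daily_features.
    """
    aligned = []
    for ts in minute_ts:
        # Index of the first daily row dated on or after the minute's day,
        # so everything to its left is strictly earlier.
        k = bisect_left(daily_dates, ts.date())
        aligned.append(daily_features[k - 1] if k > 0 else None)
    return aligned

daily_dates = [date(2024, 1, 1), date(2024, 1, 2)]
daily_features = ["features_jan01", "features_jan02"]
minutes = [
    datetime(2024, 1, 1, 12, 0),
    datetime(2024, 1, 2, 0, 1),
    datetime(2024, 1, 3, 9, 30),
]
print(align_macro_to_minutes(minutes, daily_dates, daily_features))
```

Minute bars on January 2 only see the January 1 daily row, so an end-of-day value is never used before that day has closed.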

Table 3 summarizes the data preprocessing pipeline parameters and filtering criteria applied to achieve data quality standards.

3.3. Mathematical Problem Formulation

We formalize the prediction task and execution logic. Given feature vector $x_t \in \mathbb{R}^{128}$ at time $t$, the neural network $f_\theta$ estimates directional probabilities. The binary target is

$$ y_t = \mathbb{1}\{ r_{t,t+h} > \delta \}, \tag{1} $$

where $r_{t,t+h} = (P_{t+h} - P_t)/P_t$ is the arithmetic return over horizon $h = 600$ minutes, $\delta = 10$ basis points is the noise-filtering deadband, and $P_t$ denotes the midpoint price.

The model outputs calibrated probabilities:

$$ p_t = P(y_t = 1 \mid x_t) = f_\theta(x_t). \tag{2} $$

Training minimizes binary cross-entropy with balanced class weights:

$$ L(\theta) = -\frac{1}{N} \sum_{i=1}^{N} \left[ w_1 y_i \log p_i + w_0 (1 - y_i) \log (1 - p_i) \right], \tag{3} $$

where $w_0$ and $w_1$ are inverse class frequencies.
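The class-weighted loss in Equation (3) can be checked numerically with a few lines of Python. The normalization w_c = N / (2 n_c) is an assumption of this sketch (it is one common way to define inverse class frequency weights), and `balanced_bce` is a hypothetical name; both classes must be present in the labels.

```python
import math

def balanced_bce(y_true, p_pred, eps=1e-12):
    """Binary cross-entropy with inverse class frequency weights.

    Assumes w_c = N / (2 * n_c), so a balanced dataset gets w0 = w1 = 1.
    """
    n = len(y_true)
    n_pos = sum(y_true)
    n_neg = n - n_pos
    w1 = n / (2.0 * n_pos)
    w0 = n / (2.0 * n_neg)
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1.0 - eps)  # keep the logarithms finite
        total += w1 * y * math.log(p) + w0 * (1 - y) * math.log(1.0 - p)
    return -total / n

# Three "up" samples and one "down" sample, all predicted at 0.5.
print(balanced_bce([1, 1, 1, 0], [0.5, 0.5, 0.5, 0.5]))
```

With this weighting the single minority sample contributes as much to the loss as the three majority samples combined, so an uninformative prediction of 0.5 everywhere yields log 2 regardless of the class imbalance.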

Execution occurs when prediction confidence exceeds threshold $\tau$:

$$ \text{Execute if } \max(p_t, 1 - p_t) \geq \tau, \tag{4} $$

with the predicted direction given by

$$ \hat{y}_t = \arg\max_{y \in \{0, 1\}} P(y \mid x_t). \tag{5} $$

The threshold $\tau^*$ maximizes the expected profit on the validation set:

$$ \tau^* = \arg\max_{\tau \in [0.5, 0.95]} \mathbb{E}[\text{Net Profit} \mid \text{Execute}(\tau)]. \tag{6} $$

Performance evaluation employs coverage (proportion of executed trades), per-trade net profit after 1 bp transaction costs, and risk-adjusted Sharpe ratio. These metrics are defined in Section 3.4 as Equations (7)–(12).

Figure 7 presents the end-to-end pipeline from raw data to execution decisions.

Reproducibility: Symbol-wise temporal splits allocate 70%, 15%, and 15% of the data to the training, validation, and test sets, respectively. The threshold $\tau = 0.8$ was selected on validation data only. All random seeds were fixed at 42. Complete code can be found at https://github.com/KuznetsovKarazin/crypto-confidence-execution (accessed on 11 October 2025).
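A symbol-wise temporal split of this kind could be sketched as below: each symbol's rows are sorted by time and cut chronologically, so no validation or test row precedes a training row of the same symbol. The row layout and function name are assumptions for illustration, and the linked repository may implement the split differently.

```python
from collections import defaultdict

def symbol_temporal_split(rows):
    """rows: iterable of (symbol, timestamp, payload) tuples.

    Returns a dict with chronological 70/15/15 splits built per symbol.
    """
    by_symbol = defaultdict(list)
    for row in rows:
        by_symbol[row[0]].append(row)
    splits = {"train": [], "val": [], "test": []}
    for symbol_rows in by_symbol.values():
        symbol_rows.sort(key=lambda r: r[1])  # oldest first
        n = len(symbol_rows)
        # Integer arithmetic avoids floating-point surprises at the cut points.
        n_train = n * 70 // 100
        n_val = n * 15 // 100
        splits["train"] += symbol_rows[:n_train]
        splits["val"] += symbol_rows[n_train:n_train + n_val]
        splits["test"] += symbol_rows[n_train + n_val:]
    return splits

rows = [(sym, t, None) for sym in ("BTC", "ETH") for t in reversed(range(20))]
parts = symbol_temporal_split(rows)
print({name: len(part) for name, part in parts.items()})
```

Splitting inside each symbol, rather than over the pooled data, keeps every pair represented in all three sets while preserving time order.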

3.4. Confidence-Based Classification Model and Performance Evaluation Metrics

The confidence-threshold framework employs a binary neural network architecture that separates directional prediction from execution decisions. Unlike traditional three-class approaches that simultaneously predict direction and trading signals, this methodology trains exclusively on directional movements (up versus down) and uses prediction confidence to determine trade execution.

The target variable creation process applies a 10-basis point deadband filter to eliminate noise-driven movements. Price movements exceeding this threshold within the 600 min prediction horizon receive directional labels: y = 1 for upward movements and y = 0 for downward movements. The system excludes no-trade samples during training, focusing the model on clear directional patterns. This filtering process yielded 5,333,774 trade samples from 5,666,347 total observations, representing 94.1% data utilization.
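The labeling rule described in this paragraph can be illustrated with a short sketch: returns above the deadband are labeled 1, returns below the negative deadband are labeled 0, and everything in between is marked as a no-trade sample to be dropped before training. The function name and the use of `None` for excluded samples are assumptions of this example.

```python
def make_labels(mid_prices, horizon=600, deadband_bps=10.0):
    """Label each time step by the forward return over `horizon` steps.

    1 = up beyond the deadband, 0 = down beyond it, None = no-trade sample.
    """
    delta = deadband_bps / 1e4
    labels = []
    for t in range(len(mid_prices) - horizon):
        r = (mid_prices[t + horizon] - mid_prices[t]) / mid_prices[t]
        if r > delta:
            labels.append(1)
        elif r < -delta:
            labels.append(0)
        else:
            labels.append(None)  # inside the deadband, excluded from training
    return labels

# Toy example with a one-step horizon instead of 600 minutes.
print(make_labels([100.0, 100.2, 100.25, 100.0], horizon=1))
```

The middle observation moves by about 5 basis points, which falls inside the 10-basis-point deadband, so it is excluded rather than forced into either class.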

The neural network architecture implements a multi-layer perceptron with 256-128-64 hidden units, ReLU activation functions, and 20% dropout regularization. Table 4 specifies the complete model configuration and training parameters.

The target variable creation follows standard practice in directional forecasting (Krauss et al., 2020 [36]). We define the directional label as

$$y_{t} = \mathbf{1}_{\{ r_{t, t+h} > \delta \}}, \tag{7}$$

where $r_{t, t+h}$ denotes the arithmetic return from time $t$ to $t + h$, $\delta = 10$ basis points is the noise-filtering deadband, and $h = 600$ min defines our prediction horizon. Returns are computed as

$$r_{t, t+h} = \frac{P_{t+h} - P_{t}}{P_{t}}, \tag{8}$$

where $P_{t}$ is the midpoint price at time $t$, calculated as the average of the best bid and best ask. This arithmetic return convention aligns with the short-horizon prediction literature where log approximations introduce negligible differences (Campbell et al., 1997 [37]).
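
The labeling step can be sketched as follows. The symmetric treatment of downward moves (label 0 when the return falls below $-\delta$, no label inside the deadband) is our reading of the description earlier in this section rather than something Equation (7) states explicitly; the function names are illustrative.

```python
def midpoint(best_bid, best_ask):
    """Midpoint price: average of the best bid and the best ask."""
    return 0.5 * (best_bid + best_ask)

def arithmetic_return(p_now, p_future):
    """Arithmetic return from Equation (8)."""
    return (p_future - p_now) / p_now

def direction_label(p_now, p_future, deadband_bps=10.0):
    """Return 1 (up), 0 (down), or None when the move stays inside the deadband."""
    r_bps = arithmetic_return(p_now, p_future) * 1e4
    if r_bps > deadband_bps:
        return 1
    if r_bps < -deadband_bps:
        return 0
    return None  # no-trade sample, excluded from training

examples = [
    direction_label(100.0, 100.5),   # +50 bps
    direction_label(100.0, 99.5),    # -50 bps
    direction_label(100.0, 100.05),  # about +5 bps, inside the deadband
]
```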

The execution price assumes immediate fill at the prevailing midpoint, which is conservative for our 10 h horizon but may underestimate slippage in high-frequency scenarios. For production systems, incorporating market impact models (Almgren & Chriss, 2001 [26]) would be essential.

The confidence threshold optimization process evaluates execution decisions across a grid of confidence values $\tau \in [0.5, 0.95]$ with 0.01 increments. For each threshold, the system executes trades only when $\max(P(\text{up}), P(\text{down})) \geq \tau$. The optimization criterion maximizes expected profit per executed trade after accounting for 1.0-basis point transaction costs.
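
A minimal sketch of this grid search is given below, assuming per-observation validation probabilities and realized returns in basis points. The function names, the `min_trades` guard, and the toy data are illustrative assumptions and are not taken from the published code.

```python
def net_profit_bps(p_up, realized_return_bps, cost_bps=1.0):
    """Signed return of the predicted direction minus the cost, in bps."""
    sign = 1.0 if p_up >= 0.5 else -1.0
    return sign * realized_return_bps - cost_bps

def optimize_threshold(p_ups, returns_bps, cost_bps=1.0, min_trades=1):
    """Grid-search tau in {0.50, 0.51, ..., 0.95} for the best mean net profit."""
    best_tau, best_mean = None, float("-inf")
    for step in range(50, 96):  # integer grid avoids floating-point drift
        tau = step / 100.0
        profits = [
            net_profit_bps(p, r, cost_bps)
            for p, r in zip(p_ups, returns_bps)
            if max(p, 1.0 - p) >= tau
        ]
        if len(profits) < min_trades:
            continue
        mean = sum(profits) / len(profits)
        if mean > best_mean:
            best_tau, best_mean = tau, mean
    return best_tau, best_mean

# Toy validation data: confident predictions are right, marginal ones are noisy.
p_ups = [0.953, 0.904, 0.097, 0.612, 0.557, 0.446, 0.402]
returns_bps = [100.0, 80.0, -150.0, -30.0, 20.0, 25.0, -10.0]
tau_star, mean_profit = optimize_threshold(p_ups, returns_bps)
```

On this toy sample the search settles on the lowest threshold that removes the noisy low-confidence trades. In practice a larger `min_trades` value would guard against selecting a threshold supported by only a handful of executions.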

The evaluation framework employs both classification metrics and trading-specific performance indicators. Classification metrics include direction accuracy, F1 scores, precision, and recall calculated exclusively on executed trades. Trading metrics encompass coverage rates, profit distributions, win rates, and risk-adjusted returns.

Coverage represents the proportion of observations triggering execution:

$$\text{Coverage} = \frac{N_{\text{executed}}}{N_{\text{total}}}. \tag{9}$$

This selectivity metric controls the accuracy–frequency trade-off inherent in confidence-based systems.

Gross profit per trade follows the standard signed return formulation (Jegadeesh & Titman, 1993 [38]):

$$\text{GrossProfit}_{i} = r_{i} \times \text{sign}(y_{i}), \tag{10}$$

where $r_{i}$ is the realized return and $y_{i} \in \{-1, +1\}$ is the predicted direction. Net profit incorporates our assumed transaction cost:

$$\text{NetProfit}_{i} = \text{GrossProfit}_{i} - c, \tag{11}$$

where $c = 1.0$ basis point represents maker–taker fees on major exchanges (Binance fee schedule 2025 [28]). This cost assumption is conservative for limit order execution but may underestimate market order costs during volatile periods.

The Sharpe ratio adaptation for per-trade analysis provides risk-adjusted performance assessment:

$$\text{Sharpe} = \frac{\mu_{\text{profit}}}{\sigma_{\text{profit}}} \times \sqrt{N}, \tag{12}$$

where $\mu_{\text{profit}}$ and $\sigma_{\text{profit}}$ represent the mean and standard deviation of net profits, and $N$ equals the number of executed trades.
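
The metrics in Equations (9) to (12) can be computed together, as in the sketch below. The use of the sample (rather than population) standard deviation is an assumption, and the function name, dictionary keys, and toy inputs are illustrative.

```python
import math
import statistics

def trading_metrics(directions, returns_bps, executed, cost_bps=1.0):
    """Coverage, mean net profit, and per-trade Sharpe (Equations (9)-(12)).

    directions: predicted direction per observation, +1 or -1
    returns_bps: realized return per observation, in basis points
    executed: booleans marking which observations were traded
    """
    net = [
        d * r - cost_bps  # Equations (10) and (11)
        for d, r, e in zip(directions, returns_bps, executed)
        if e
    ]
    n = len(net)
    mu = statistics.fmean(net)
    sigma = statistics.stdev(net)  # sample standard deviation (assumption)
    ratio = mu / sigma
    return {
        "coverage": n / len(executed),    # Equation (9)
        "mean_net_bps": mu,
        "ratio": ratio,                   # mu / sigma before scaling
        "sharpe": ratio * math.sqrt(n),   # Equation (12)
    }

m = trading_metrics(
    directions=[1, -1, 1, 1, -1],
    returns_bps=[101.0, -51.0, 40.0, -19.0, 5.0],
    executed=[True, True, False, True, False],
)
```

Returning both the unscaled ratio and the scaled value makes it easier to compare results against studies that report one or the other.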

The per-trade Sharpe ratio adapts the classical formulation (Sharpe, 1994 [39]) to discrete trading events. This metric differs from traditional time-series Sharpe ratios by treating each trade as an independent event rather than continuous portfolio returns.

Temporal validation employs symbol-wise splitting to prevent data leakage. The chronological split allocates 70% for training, 15% for validation, and 15% for testing within each cryptocurrency pair. This approach ensures that future information never influences past predictions while maintaining representative samples across all trading pairs.
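
A symbol-wise chronological split of this kind can be sketched as follows. The row layout (symbol, timestamp, payload) and the function name are assumptions for illustration; integer percentages are used to avoid floating-point rounding at the split boundaries.

```python
from collections import defaultdict

def symbolwise_split(rows, train_pct=70, val_pct=15):
    """Split rows chronologically within each symbol.

    rows: iterable of (symbol, timestamp, payload) tuples.
    Returns three lists: train, validation, test.
    """
    by_symbol = defaultdict(list)
    for row in rows:
        by_symbol[row[0]].append(row)

    train_rows, val_rows, test_rows = [], [], []
    for symbol_rows in by_symbol.values():
        symbol_rows.sort(key=lambda r: r[1])  # oldest first
        n = len(symbol_rows)
        i = n * train_pct // 100
        j = n * (train_pct + val_pct) // 100
        train_rows += symbol_rows[:i]
        val_rows += symbol_rows[i:j]
        test_rows += symbol_rows[j:]
    return train_rows, val_rows, test_rows

rows = [(s, t, None) for s in ("BTC_USDT", "TRX_USDT") for t in range(100)]
tr, va, te = symbolwise_split(rows)
```

Because the cut points are taken per symbol, every pair contributes its own oldest 70% to training and its own most recent 15% to testing.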

Figure 8 presents the rolling correlation analysis between major cryptocurrency pairs (ADA/USDT vs. ETH/USDT), revealing time-varying correlation patterns that justify the symbol-wise validation approach. Correlation values fluctuate between 0.2 and 0.9, indicating regime changes that could bias performance estimates under naive splitting strategies.

Statistical significance testing applies bootstrap resampling with 1000 iterations to generate confidence intervals for key performance metrics. The bootstrap procedure randomly samples executed trades with replacement, computing performance statistics for each sample to estimate distribution parameters.
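
The bootstrap procedure can be sketched with a percentile interval, as below. The percentile method, the 95% level, and the synthetic profit sample are assumptions made for illustration; the text specifies only resampling with replacement over 1000 iterations.

```python
import random
import statistics

def bootstrap_ci(values, stat=statistics.fmean, n_boot=1000, alpha=0.05, seed=42):
    """Percentile bootstrap confidence interval for stat(values)."""
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement and recompute the statistic each time.
    estimates = sorted(stat(rng.choices(values, k=n)) for _ in range(n_boot))
    lower = estimates[int((alpha / 2) * n_boot)]
    upper = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Synthetic per-trade net profits in basis points (illustrative only).
data_rng = random.Random(0)
profits = [data_rng.gauss(150.0, 180.0) for _ in range(2000)]
low, high = bootstrap_ci(profits)
```

Note that resampling individual trades treats them as independent; overlapping 600 min horizons would likely call for a block bootstrap, which is not shown here.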

Cross-asset performance evaluation examines model behavior across different market capitalizations and volatility regimes. Figure 9 displays the symbol performance comparison, showing systematic differences in prediction accuracy across cryptocurrency pairs. Large-cap assets (BTC, ETH) demonstrate higher accuracy but lower coverage, while mid-cap assets exhibit more balanced coverage–accuracy profiles.

The baseline comparison framework evaluates the confidence-threshold approach against alternative methodologies including buy-and-hold strategies, traditional three-class classification, and constant-threshold binary classification. Performance improvements are measured using both absolute metrics (profit, accuracy) and relative metrics (Sharpe ratio, information ratio).

4. Experimental Design

The experimental framework evaluates the confidence-threshold approach through comprehensive testing on cryptocurrency market data spanning 371 days of continuous trading activity. The experimental design addresses three core research questions: (1) the effectiveness of confidence-based execution versus traditional classification approaches, (2) the optimal threshold calibration for different risk-return profiles, and (3) the generalizability across diverse cryptocurrency market segments.

4.1. Dataset Description

The experimental dataset encompasses 11 cryptocurrency trading pairs selected based on order book data availability and market liquidity criteria. Table 5 presents the complete dataset specifications and symbol-level statistics.

The dataset spans from 7 October 2023 to 13 October 2024, capturing diverse market conditions including bull market phases, volatility spikes, and consolidation periods. Data coverage exceeds 96% across all symbols, with gaps primarily occurring during scheduled exchange maintenance windows.

Symbol selection criteria prioritize liquid trading pairs with complete microstructure data availability. The chosen pairs represent different market capitalization segments, from Bitcoin (largest) to TRON (smallest in the sample), enabling cross-segment performance analysis. Average daily volumes range from USD 67.8 million (TRX_USDT) to USD 2.85 billion (BTC_USDT), providing heterogeneous liquidity conditions for model evaluation.

Spread characteristics exhibit systematic variation across market capitalizations. Major assets (BTC, ETH) demonstrate tight spreads with median values below 1.5 basis points, while smaller assets show wider spreads exceeding 2.5 basis points. The 95th percentile spread values range from 4.21 basis points (BTC) to 15.43 basis points (TRX), reflecting different market-making conditions.

Temporal distribution analysis reveals consistent 24/7 coverage with minor gaps during exchange maintenance. The dataset captures 53 complete weeks of trading activity, enabling robust statistical analysis across multiple market cycles. Weekend trading patterns differ significantly from weekday patterns, with reduced volume but maintained price discovery functionality.

Data preprocessing eliminated 6600 observations (0.12% of total) due to quality issues including crossed markets, extreme spreads, and timestamp inconsistencies. The preprocessing pipeline maintained 99.88% data retention while ensuring order book structural integrity across all symbols.

4.2. Model Configuration

The neural network implementation employs PyTorch framework optimizations for efficient training on the experimental hardware configuration. The system utilizes an AMD Ryzen 7 7840 HS processor with 64 GB RAM, enabling in-memory processing of the complete dataset without disk I/O bottlenecks.

Training procedures implement early stopping with 10-epoch patience to prevent overfitting. The validation loss monitoring triggers training termination when improvement stagnates, typically occurring between epochs 45 and 65. Model checkpointing saves the best-performing weights based on validation accuracy, ensuring optimal generalization performance.
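
The patience rule can be sketched as below. For simplicity this sketch uses validation loss both for stopping and for choosing the checkpoint, whereas the text describes checkpointing on validation accuracy; the function name and the loss sequence are illustrative.

```python
def early_stopping_epochs(val_losses, patience=10):
    """Return (best_epoch, stop_epoch) for a sequence of validation losses.

    Epochs are 1-based. Training stops once `patience` consecutive epochs
    pass without a new best validation loss.
    """
    best_loss, best_epoch, waited = float("inf"), None, 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best_loss:
            best_loss, best_epoch, waited = loss, epoch, 0  # checkpoint here
        else:
            waited += 1
            if waited >= patience:
                return best_epoch, epoch
    return best_epoch, len(val_losses)

# Loss improves for five epochs, then stagnates.
losses = [1.0, 0.9, 0.8, 0.7, 0.6] + [0.65] * 20
best_epoch, stop_epoch = early_stopping_epochs(losses)
```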

Hyperparameter optimization focuses on three critical parameters: the confidence threshold $\tau$, learning rate scheduling, and batch size selection. The confidence threshold grid search evaluates 46 values from 0.50 to 0.95 with 0.01 increments. Learning rate optimization applies cosine annealing with warm restarts, starting at 0.001 and decaying to a minimum of 0.0001.
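
Cosine annealing with warm restarts follows $\eta_{t} = \eta_{\min} + \frac{1}{2}(\eta_{\max} - \eta_{\min})(1 + \cos(\pi T_{cur} / T))$, where $T_{cur}$ counts epochs since the last restart. The sketch below evaluates this schedule with the stated bounds; the cycle length of 20 epochs is an assumption, since the restart period is not given in the text.

```python
import math

def cosine_warm_restart_lr(epoch, lr_max=1e-3, lr_min=1e-4, cycle_len=20):
    """Learning rate at a 0-based epoch under cosine annealing with restarts."""
    t_cur = epoch % cycle_len  # epochs since the last restart
    cos_term = 1.0 + math.cos(math.pi * t_cur / cycle_len)
    return lr_min + 0.5 * (lr_max - lr_min) * cos_term

# Learning rate for each epoch of an 80-epoch schedule.
schedule = [cosine_warm_restart_lr(e) for e in range(80)]
```

The rate starts at 0.001, decays toward 0.0001 within each cycle, and jumps back to 0.001 at every restart.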

Batch size selection balances computational efficiency with gradient estimation quality. The 4096 sample batch size enables stable gradient computation while fitting within memory constraints. Larger batch sizes (8192) showed marginal accuracy improvements but significantly increased training time from 57 min to 94 min per epoch.

Memory optimization techniques include gradient checkpointing and mixed precision training. These optimizations reduce peak memory usage from 18.3 GB to 12.7 GB while maintaining numerical precision for critical computations. Training completion requires approximately 3.2 h for the full 80-epoch schedule.

Feature preprocessing applies consistent scaling across training, validation, and test splits. RobustScaler parameters are fit exclusively on training data to prevent information leakage. The median-based centering handles cryptocurrency price outliers more effectively than mean-based alternatives, reducing sensitivity to extreme market events.
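
The idea behind median and interquartile-range scaling can be illustrated with a standard-library sketch. This is a simplified stand-in for scikit-learn's RobustScaler, not a reproduction of it; the function names and toy values are illustrative.

```python
import statistics

def fit_robust_scaler(train_column):
    """Return (median, iqr) computed on training data only."""
    q1, q2, q3 = statistics.quantiles(train_column, n=4, method="inclusive")
    iqr = q3 - q1
    return q2, (iqr if iqr > 0 else 1.0)  # guard against a zero range

def transform(column, median, iqr):
    """Center by the training median and scale by the training IQR."""
    return [(x - median) / iqr for x in column]

train = [1.0, 2.0, 3.0, 4.0, 5.0, 1000.0]  # one extreme outlier
test = [3.5, 6.0]
median, iqr = fit_robust_scaler(train)     # statistics come from train only
scaled_test = transform(test, median, iqr)
```

The outlier of 1000 leaves the median (3.5) and the IQR (2.5) unaffected, whereas it would dominate a mean and standard deviation computed on the same column.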

Model calibration employs isotonic regression on validation predictions to improve confidence reliability. The calibration procedure maps raw prediction probabilities to calibrated confidence scores, enhancing the correlation between predicted confidence and actual accuracy. Calibration training uses 10-fold cross-validation within the validation set to prevent overfitting.
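
Isotonic calibration fits a non-decreasing map from raw scores to observed outcome frequencies. The sketch below implements the pool-adjacent-violators step with a step-function lookup. It is a simplified illustration (library implementations typically interpolate between knots), it omits the cross-validation described above, and the function names and data are assumptions.

```python
def fit_isotonic(scores, outcomes):
    """Pool-adjacent-violators fit; returns (upper_score, calibrated) knots."""
    pairs = sorted(zip(scores, outcomes))
    blocks = []  # each block: [sum_of_outcomes, count, max_score]
    for s, y in pairs:
        blocks.append([float(y), 1, s])
        # Merge backwards while block means violate monotonicity.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]
        ):
            sum_y, count, max_s = blocks.pop()
            blocks[-1][0] += sum_y
            blocks[-1][1] += count
            blocks[-1][2] = max_s
    return [(b[2], b[0] / b[1]) for b in blocks]

def calibrate(score, knots):
    """Value of the first block whose upper score is >= score."""
    for upper, value in knots:
        if score <= upper:
            return value
    return knots[-1][1]

scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
outcomes = [0, 0, 1, 0, 1, 0, 1, 1, 1]
knots = fit_isotonic(scores, outcomes)
```

In this toy example the raw scores between 0.3 and 0.6 are pooled to a calibrated value of 0.5, reflecting that only half of those predictions were correct.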

Reproducibility protocols fix random seeds across all stochastic processes. NumPy (version 2.3.2, seed = 42), PyTorch (version 2.5.1, seed = 42), and the Python random module use identical initialization values. This configuration enables exact replication of experimental results across different hardware platforms. The experiments were implemented in Python 3.11.9 using the following key packages: NumPy 2.3.2, pandas 2.3.1, scikit-learn 1.7.1, LightGBM 4.6.0, and XGBoost 2.1.1.

The confidence threshold optimization procedure evaluates each threshold value using validation set performance. The optimization criterion maximizes expected profit per trade:

$$E[\text{Profit}] = \sum_{i=1}^{N} P(\text{execute}_{i}) \times E[\text{profit}_{i} \mid \text{execute}_{i}],$$

where execution probability depends on the confidence threshold and expected profit accounts for directional accuracy and transaction costs.

5. Results and Analysis

The confidence-threshold framework was evaluated on a dataset comprising 802,967 observations across 11 cryptocurrency pairs from October 2023 to October 2024. The neural network model employed a multi-layer perceptron architecture with 256-128-64 hidden units, trained for 80 epochs with calibrated probability outputs. The system achieved selective execution on 96,247 trades (11.99% coverage) using an optimized confidence threshold of τ = 0.8.

5.1. Classification Performance

The binary direction classifier demonstrated strong predictive accuracy on executed trades. Table 6 presents the comprehensive classification metrics for the confidence-based execution framework.

The model achieved 82.68% direction accuracy on executed trades, indicating robust prediction capability when confidence exceeds the threshold. The F1 score of 0.8195 demonstrates balanced performance across both up and down movement predictions. The ROC-AUC of 0.6886 suggests moderate discriminative ability, while the PR-AUC of 0.6695 indicates reasonable precision-recall trade-offs given the class distribution.

Figure 10 displays the confusion matrix for executed trades, revealing balanced classification performance with 30,103 correct up predictions and 49,478 correct down predictions out of 96,247 total executions. The false positive rate for up movements was 23.2%, while the false negative rate was 13.2%.

Figure 11 presents the ROC curve analysis, showing the trade-off between true positive rate and false positive rate across different probability thresholds. The curve demonstrates performance above random baseline (AUC = 0.5), with optimal operating points concentrated in the high-specificity region corresponding to the confidence-based execution strategy.

5.2. Trading Performance Evaluation

The confidence-threshold framework generated substantial risk-adjusted returns through selective trade execution. Table 7 summarizes the trading performance metrics across the complete test period.

Trading profitability depends critically on transaction cost assumptions. Table 8 presents sensitivity analysis across realistic cost scenarios.

The framework maintains profitability up to 5-basis point transaction costs, though performance degrades steadily. Each additional basis point reduces average profit by approximately 33 basis points and Sharpe ratio by 0.21. The optimal confidence threshold increases with costs (from τ = 0.78 at 0.5 bp to τ = 0.92 at 5 bp) as the system compensates by executing only the highest-conviction trades.

At costs exceeding 5 bp, the strategy becomes only marginally profitable, or unprofitable, for lower market cap pairs. This break-even level sits well above maker–taker fees on major exchanges (0.5–2 bp) but at or below typical market order costs during volatile periods (5–10 bp). Practitioners using market orders would need to adjust thresholds upward or accept reduced profitability.

The relationship between costs and optimal threshold suggests that adaptive threshold mechanisms could improve robustness across varying liquidity conditions. Future work should explore dynamic threshold adjustment based on realized execution costs.

The system generated an average net profit of 151.11 basis points per executed trade after accounting for 1.0-basis point transaction costs. The median profit of 124.60 basis points indicates positive skewness in the return distribution, with more frequent moderate gains than extreme losses. The Sharpe ratio of 0.8313 demonstrates favorable risk-adjusted performance, considering the per-trade volatility of 181.77 basis points.
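
As a quick arithmetic check using only the figures reported above, the ratio of mean net profit to per-trade volatility reproduces the reported value of 0.8313. This suggests that the reported figure corresponds to $\mu_{\text{profit}} / \sigma_{\text{profit}}$ before the $\sqrt{N}$ scaling in Equation (12); the variable names below are illustrative.

```python
mean_bps = 151.11    # reported average net profit per trade
std_bps = 181.77     # reported per-trade volatility
n_trades = 96_247    # reported number of executed trades

ratio = mean_bps / std_bps        # unscaled mean-to-volatility ratio
scaled = ratio * n_trades ** 0.5  # the same ratio with sqrt(N) scaling applied

print(round(ratio, 4))  # 0.8313
```

The scaled value is roughly 258, so readers comparing against other studies should confirm which of the two conventions a reported Sharpe ratio follows.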

Figure 12 illustrates the cumulative profit progression over the test period, showing consistent positive drift with periodic drawdown periods. The cumulative performance exhibits steady growth patterns with maximum drawdowns contained within acceptable risk parameters. The profit trajectory demonstrates the effectiveness of the confidence-based execution strategy in avoiding low-conviction trades.

Figure 13 presents the distribution of per-trade net profits, revealing a right-skewed distribution with mode around 100 basis points. The histogram shows that 82.68% of trades generated positive returns, with the loss tail extending to −641 basis points while the profit tail reaches 1196 basis points. This asymmetric distribution supports the effectiveness of the confidence threshold in filtering profitable opportunities.

5.3. Cross-Asset Performance Analysis

The confidence-threshold framework exhibited heterogeneous performance across the 11 cryptocurrency pairs analyzed. Table 9 presents the per-symbol trading metrics, revealing significant variability in both execution coverage and profitability patterns.

The results demonstrate an inverse relationship between coverage and profitability across different market capitalizations. Bitcoin (BTC_USDT) achieved the highest average profit of 198.45 basis points with 88.9% accuracy but exhibited the lowest coverage at 8.2%. Conversely, smaller market cap assets like TRX_USDT showed higher coverage (21.4%) but lower profitability (108.56 basis points) and accuracy (77.3%).

Figure 14 illustrates the per-symbol accuracy distribution across executed trades, showing performance ranging from 77.3% to 88.9%. The accuracy variations correlate with market liquidity characteristics, where more liquid pairs (BTC, ETH) demonstrate superior prediction reliability. The confidence threshold effectively filtered low-conviction predictions, maintaining accuracy above 77% across all symbols.

Figure 15 presents the average net profit distribution by cryptocurrency pair, revealing substantial profit heterogeneity. Major assets (BTC, ETH, XRP) generated profits exceeding 160 basis points, while emerging assets showed more modest returns between 108 and 145 basis points. This pattern suggests that the confidence-based approach adapts effectively to varying market microstructure conditions.

The coverage analysis reveals systematic differences across asset classes. Large-cap cryptocurrencies exhibited conservative execution patterns (8.2–11.7% coverage) with high-conviction predictions, while mid- and small-cap assets showed more frequent trading opportunities (12.7–21.4% coverage). This coverage distribution reflects the underlying volatility and price movement characteristics inherent to different cryptocurrency market segments.

5.4. Feature Importance and Model Interpretation

The mutual information feature selection process identified 128 critical predictors from the original 296-feature space. Table 10 categorizes the selected features by their market data origin and temporal characteristics.
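
Mutual information ranking can be illustrated with a plug-in estimate on discretized features, as sketched below. The estimator actually used is not specified in this section, so the binning assumption, the function names, and the feature names are all hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Plug-in mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in joint.items()
    )

def select_top_k(features, labels, k):
    """Rank named discrete features by MI with the labels; keep the top k."""
    scores = {name: mutual_information(col, labels) for name, col in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]

labels = [0, 0, 1, 1]
features = {
    "depth_imbalance_bin": [0, 0, 1, 1],  # hypothetical, fully informative
    "noise_bin": [0, 1, 0, 1],            # hypothetical, uninformative
}
selected = select_top_k(features, labels, k=1)
```

Continuous order book features would first need to be binned, or scored with a nearest-neighbor estimator, before a ranking like this could be applied.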

Order book features dominated the selected feature set, comprising 81.3% of the total features. Deep order book levels provided the most predictive power for direction classification. This finding validates the importance of market microstructure information for short-term price movement prediction in cryptocurrency markets.

Traditional technical indicators contributed only 4.7% of selected features, suggesting that conventional technical analysis metrics provide limited additional value when comprehensive order book data is available. The Average True Range (ATR) and RSI emerged as the most relevant technical indicators, while moving averages showed modest predictive capacity.

The feature selection process eliminated several expected predictors, including volume ratio indicators and order book level counts, due to insufficient variability across the dataset. This elimination pattern indicates that certain microstructure metrics may be less informative in cryptocurrency markets compared to traditional equity markets.

Spread-based features, including lagged spread measurements and moving averages, maintained moderate importance scores. The spread_bps_lag1 feature achieved the highest individual importance among microstructure indicators, confirming the predictive value of recent transaction cost patterns for future price movements.

The temporal horizon of 600 min (10 h) proved effective for capturing meaningful price movements while maintaining sufficient prediction accuracy. This horizon length accommodates the 24/7 nature of cryptocurrency markets and allows the model to incorporate overnight and weekend trading patterns that are absent in traditional financial markets.

Model calibration successfully improved probability estimates, as evidenced by the coherent relationship between predicted confidence levels and actual trading outcomes. The calibration process reduced overconfidence in borderline predictions while maintaining discriminative power for high-confidence trades, resulting in more reliable execution decisions.

5.5. Baseline Comparison and Ablation Analysis

To validate the advantage of confidence-based selective execution, we compare our framework against three baseline approaches using identical data and features.

Table 11 presents the comparative performance analysis.

The confidence-threshold framework substantially outperforms all baselines. The always-execute binary classifier achieves 76.34% accuracy across all predictions but generates only 89.23-basis point average profit with 0.42 Sharpe ratio. Our selective approach sacrifices coverage (11.99% vs. 100%) to improve accuracy by 6.34 percentage points and profit by 69% while doubling the Sharpe ratio.

The fixed threshold baseline (τ = 0.5, no calibration) executes 47.83% of observations but achieves only 74.12% accuracy and 102.45 bps profit. This demonstrates that both threshold optimization and probability calibration contribute to performance gains: together they improve accuracy on executed trades by 8.56 percentage points over this baseline (82.68% vs. 74.12%).

Random selection at equivalent coverage (12%) produces near-chance accuracy (50.21%) and negative returns (−8.34 bps), confirming that our confidence scores capture genuine signal rather than data artifacts.

We also examined LSTM and Transformer architectures during preliminary experiments but found their performance comparable to our MLP when using identical features and training procedures. The multi-layer perceptron offers faster training (3.2 h vs. 8+ h for LSTM) and simpler deployment without sacrificing accuracy. The key innovation lies in the confidence-based execution logic rather than architectural complexity.

Feature ablation tests reveal that order book features contribute 73% of total performance when measured by profit degradation upon removal. Removing technical indicators reduces profit by only 4%, while removing order book depth causes a 68% profit decline. This confirms hypothesis H2 regarding microstructure dominance.

6. Discussion

6.1. Key Findings and Implications

The confidence-threshold framework demonstrates substantial improvements over traditional cryptocurrency prediction approaches through selective execution strategies. The system achieved 82.68% direction accuracy on executed trades while maintaining 11.99% market coverage, generating average net profits of 151.11 basis points per trade. These results indicate that confidence-based filtering effectively identifies high-conviction trading opportunities in volatile cryptocurrency markets.

The 10 h prediction horizon (600 min) proves optimal for capturing meaningful price movements while avoiding excessive noise. This timeframe accommodates the 24/7 nature of cryptocurrency markets and allows sufficient time for order execution without exposure to rapid market reversals. Shorter horizons (60–300 min) showed higher noise sensitivity, while longer horizons (12–24 h) reduced prediction accuracy due to increased uncertainty.

Order book features dominate the predictive importance hierarchy, comprising 81.3% of selected features. This finding validates the critical role of the market microstructure in short-term price prediction for cryptocurrency markets. Traditional technical indicators account for only 4.7% of selected features, suggesting limited additional value when comprehensive microstructure data is available.

The inverse relationship between market capitalization and coverage rates reveals systematic differences in prediction confidence across asset classes. Bitcoin demonstrates the lowest coverage (8.2%) but highest profitability (198.45 bps), while smaller assets like TRON show higher coverage (21.4%) with reduced profitability (108.56 bps). This pattern reflects varying signal-to-noise ratios across different market segments.

6.2. Cryptocurrency Market Considerations

Cryptocurrency markets exhibit unique characteristics that distinguish them from traditional financial markets. The 24/7 trading environment eliminates traditional market opening and closing effects, creating continuous price discovery processes. Weekend trading patterns show reduced volume but maintained volatility, contrasting with equity markets where weekend gaps are common.

Decentralized exchange structures introduce additional complexity through fragmented liquidity and varying fee structures. The confidence-threshold approach adapts naturally to these conditions by incorporating microstructure signals that reflect local liquidity conditions rather than relying solely on price-based indicators.

Regulatory uncertainty creates periodic volatility spikes that traditional models struggle to handle. The confidence-based approach provides natural protection by reducing execution frequency during high-uncertainty periods, as prediction confidence typically decreases when regulatory announcements create market instability.

The dominance of retail trading in cryptocurrency markets generates different order flow patterns compared to institutional-dominated traditional markets. Retail-driven price movements often exhibit momentum characteristics that the microstructure features capture effectively, explaining the superior performance of order book-based predictions.

6.3. Practical Implementation Considerations

Real-time implementation requires robust data infrastructure capable of processing high-frequency order book updates with minimal latency. The 128-feature model demands approximately 15 milliseconds for feature computation and prediction generation, making it compatible with minute-frequency trading strategies but potentially limiting for higher-frequency applications.

Risk management integration presents both opportunities and challenges. The confidence scores provide natural position sizing signals, with higher confidence predictions justifying larger position sizes. However, the selective execution approach may result in extended periods without trading signals, requiring sophisticated portfolio management to maintain capital efficiency.

Transaction cost assumptions prove critical for threshold optimization. The 1.0-basis point cost assumption may underestimate real-world implementation costs, particularly for smaller cryptocurrency pairs or during high-volatility periods. Sensitivity analysis suggests that costs exceeding 2.5 basis points would require threshold adjustments to maintain profitability.

The 11-symbol limitation constrains diversification opportunities compared to traditional portfolio approaches. Expanding the symbol universe requires comprehensive order book data collection, which involves significant infrastructure investments and exchange relationships.

6.4. Limitations and Future Directions

Data availability represents the primary constraint for broader application. Complete order book data remains limited to major cryptocurrency exchanges and popular trading pairs. Expanding to decentralized exchanges or smaller assets would require different data collection strategies and potentially modified modeling approaches.

Model generalization across different market regimes requires further validation. The October 2023 to October 2024 period captured diverse market conditions but may not represent all possible cryptocurrency market states. Extended validation across multiple market cycles would strengthen confidence in the approach.

The fixed confidence threshold approach could benefit from adaptive mechanisms that adjust to changing market conditions. Machine learning-based threshold optimization could potentially improve performance by responding to volatility regime changes or market structure evolution.

Feature engineering opportunities exist through cross-asset signal incorporation and alternative data sources. Social sentiment indicators, on-chain metrics, and macroeconomic factors could enhance prediction accuracy, particularly for longer-term horizons.

Ensemble methods combining multiple prediction horizons and confidence thresholds present promising research directions. Multi-horizon ensembles could provide more robust predictions by capturing both short-term microstructure signals and longer-term trend information.

The framework could extend to other blockchain-based assets beyond cryptocurrencies, including tokenized securities, non-fungible tokens, and decentralized finance protocols, each presenting unique prediction challenges and opportunities.

6.5. Comparative Analysis with Existing Literature

The confidence-threshold framework demonstrates competitive performance compared to recent cryptocurrency prediction approaches. Our 82.68% direction accuracy with 11.99% coverage compares favorably to the existing literature, though direct comparisons require careful consideration of different evaluation methodologies.

Peng et al. (2024) [31] report improved financial metrics using attention-based CNN-LSTM with triple trend labeling but do not specify exact accuracy figures. Their focus on reducing transaction frequency aligns with our selective execution approach, though our confidence-based method provides more systematic selectivity control.

Kang et al. (2025) [33] achieve Sharpe ratios up to 3.56 using TimesNet with Bollinger Bands on 4 h Ethereum data. Our per-trade Sharpe ratio of 0.83 operates on different time scales (10 h horizon) and evaluation methodology, making direct comparison challenging. However, our approach maintains consistent performance across 11 cryptocurrency pairs rather than optimizing for individual assets.

Viéitez et al. (2024) [35] generate profit factors up to 5.16 for Ethereum using external indicators over longer time periods. Our 151.11-basis point average profit per trade on 600 min horizons targets different trading frequencies and risk profiles. Their approach excludes technical indicators while ours demonstrates that microstructure features dominate predictive importance.

The network-based approach of Zhong et al. (2023) [32] shows highest profits in trading simulations but lacks specific performance metrics for direct comparison. Our framework incorporates cross-asset information implicitly through mutual information feature selection, though not through explicit network modeling.

Our findings regarding order book feature dominance (81.3% of selected features) align with Liu et al.’s (2025) [12] observations about liquidity commonality and microstructure importance. However, most existing studies focus primarily on price-based features rather than comprehensive order book analysis.

The confidence-threshold paradigm represents a novel contribution not directly addressed in the current literature. While Golnari et al. (2024) [30] incorporate probabilistic elements through P-GRU, their approach generates probability distributions for predicted values rather than using confidence for execution decisions. Our separation of directional prediction from execution decisions provides a systematic framework for managing prediction uncertainty.

6.6. Regulatory and Ethical Considerations

Real-world deployment of automated cryptocurrency trading systems raises several regulatory and ethical concerns that practitioners must address [40,41].

Market manipulation risks deserve careful attention. High-frequency trading algorithms can potentially contribute to artificial volatility or facilitate pump-and-dump schemes, particularly in less liquid cryptocurrency pairs. Our selective execution approach partially mitigates these risks by reducing trading frequency and avoiding low-confidence predictions during volatile periods. However, operators should implement additional safeguards including position limits, maximum order sizes, and volatility circuit breakers [42].

Regulatory compliance varies dramatically across jurisdictions. The United States treats most cryptocurrencies as securities subject to SEC oversight, requiring algorithmic trading firms to register as broker-dealers. European MiCA regulations impose transparency requirements on automated trading systems. Operators must ensure compliance with local regulations, including audit trails, risk disclosures, and anti-money laundering procedures.

DeFi protocol integration introduces unique challenges. Decentralized exchanges lack traditional market oversight, creating opportunities for front-running through transaction reordering. Our 10 h prediction horizon reduces vulnerability to such attacks compared to high-frequency strategies, but deploying on public blockchains still requires careful consideration of MEV (Maximal Extractable Value) risks.

Fairness concerns arise when sophisticated algorithms trade against retail participants. While our confidence-based approach does not exploit information asymmetries or engage in predatory trading, the superior performance versus simple strategies could exacerbate wealth concentration. Exchanges might consider implementing fairness mechanisms or educational resources to level the playing field.

Data privacy presents limited concerns for cryptocurrency trading since transactions on public blockchains are visible to all participants. However, operators should protect proprietary trading strategies and avoid disclosing positions that could be exploited by competitors.

We recommend that institutions deploying similar systems establish ethics review boards, conduct regular audits for market impact, and maintain transparency about algorithmic trading activities within regulatory boundaries. The cryptocurrency industry’s evolution toward institutional adoption demands higher standards of responsible algorithm design.

7. Conclusions

This research introduces a confidence-threshold framework for cryptocurrency price direction prediction that separates directional forecasting from execution decisions. The approach addresses fundamental challenges in cryptocurrency market prediction through selective execution based on model confidence rather than traditional classification thresholds.

The experimental results show clear performance improvements over the baseline strategies. On executed trades (11.99% coverage), the system achieved 82.68% direction accuracy and an average net profit of 151.11 basis points per trade across 11 cryptocurrency pairs, compared with 76.34% accuracy and 89.23 basis points for the always-execute baseline (Table 11). These results support the effectiveness of confidence-based execution in volatile cryptocurrency markets.

The dominance of order book features in prediction importance confirms the critical role of market microstructure information for short-term cryptocurrency price movements. Traditional technical indicators provide limited additional value when comprehensive microstructure data is available, suggesting that conventional technical analysis approaches may be insufficient for optimal cryptocurrency trading.

The methodology contributes to cryptocurrency market analytics by providing a practical framework for selective trading execution. The confidence-based approach offers natural risk management capabilities while maintaining competitive returns, addressing key concerns for algorithmic trading implementation in cryptocurrency markets.

Future research directions include extending the framework to additional blockchain-based assets, developing adaptive threshold mechanisms, and incorporating alternative data sources such as on-chain metrics and social sentiment indicators. The confidence-threshold paradigm presents broader applications beyond cryptocurrency markets, potentially enhancing prediction systems across various financial asset classes.

The findings advance the understanding of machine learning applications in cryptocurrency markets while providing practical tools for algorithmic trading implementation. The framework bridges academic research and practical trading requirements, contributing to the growing field of cryptocurrency market analytics and blockchain-based financial system analysis.

Author Contributions

Conceptualization, methodology, writing—original draft preparation, O.K. (Oleksandr Kuznetsov); supervision, funding acquisition, O.K. (Oleksii Kostenko); investigation, data curation, methodology, K.K.; conceptualization, data curation, Z.H.; writing—review and editing, R.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The complete codebase for this research, including data processing, model implementation, and visualization scripts, is freely available at https://github.com/KuznetsovKarazin/crypto-confidence-execution (accessed on 11 October 2025). This accessibility enables direct verification of our results and facilitates further extension of our work by interested researchers.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Ariffin, N.; Ismail, A.Z. The Design and Implementation of Trade Finance Application Based on Hyperledger Fabric Permissioned Blockchain Platform. In Proceedings of the 2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), Yogyakarta, Indonesia, 5–6 December 2019; pp. 488–493.
2. Kuznetsov, A.; Sernani, P.; Romeo, L.; Frontoni, E.; Mancini, A. On the Integration of Artificial Intelligence and Blockchain Technology: A Perspective about Security. IEEE Access 2023, 12, 3881–3897.
3. Ballis, A.; Karagiorgis, A.; Anastasiou, D.; Kallandranis, C. Cryptocurrency Dynamics during Global Crises: Insights from Bitcoin’s Interplay with Traditional Markets. Int. Rev. Econ. Financ. 2025, 103, 104512.
4. Fieberg, C.; Liedtke, G.; Zaremba, A. Cryptocurrency Anomalies and Economic Constraints. Int. Rev. Financ. Anal. 2024, 94, 103218.
5. Guo, L.; Zhong, L.-X. Risk Spillover and Hedging Effects between Stock Markets and Cryptocurrency Markets Depending upon Network Analysis. N. Am. J. Econ. Financ. 2025, 80, 102524.
6. Yin, W.; Wu, F.; Zhou, P.; Kirkulak-Uludag, B. Exploring Resilience in the Cryptocurrency Market: Risk Transmission and Network Robustness. Int. Rev. Financ. Anal. 2025, 106, 104546.
7. Izadi, M.A.; Hajizadeh, E. Time Series Prediction for Cryptocurrency Markets with Transformer and Parallel Convolutional Neural Networks. Appl. Soft Comput. 2025, 177, 113229.
8. Liu, Y.-H.; Huang, J.-K. Cryptocurrency Trend Forecast Using Technical Analysis and Trading with Randomness-Preserving. Comput. Electr. Eng. 2024, 118, 109368.
9. Bonaparte, Y. Time Horizon and Cryptocurrency Ownership: Is Crypto Not Speculative? J. Int. Financ. Mark. Inst. Money 2022, 79, 101609.
10. Bouteska, A.; Sharif, T.; Isskandarani, L.; Abedin, M.Z. Market Efficiency and Its Determinants: Macro-Level Dynamics and Micro-Level Characteristics of Cryptocurrencies. Int. Rev. Econ. Financ. 2025, 98, 103938.
11. Franco, J.P.M.; Laurini, M.P. Quantifying Systemic Risk in Cryptocurrency Markets: A High-Frequency Approach. Int. Rev. Econ. Financ. 2025, 102, 104214.
12. Liu, W.; Bao, X.; Han, X.; Li, Y. Liquidity Commonality in Cryptocurrencies. Financ. Res. Lett. 2025, 85, 108187.
13. Yang, Y.; Wang, X.; Xiong, J.; Wu, L.; Zhang, Y. An Innovative Method for Short-Term Forecasting of Blockchain Cryptocurrency Price. Appl. Math. Model. 2025, 138, 115795.
14. Hsieh, C.-H.; Huang, P.-H.; Liu, H.-C. State Transitions and Momentum Effect in Cryptocurrency Market. Financ. Res. Lett. 2025, 86, 108356.
15. Ballings, M.; Van den Poel, D.; Hespeels, N.; Gryp, R. Evaluating Multiple Classifiers for Stock Price Direction Prediction. Expert Syst. Appl. 2015, 42, 7046–7056.
16. Henrique, B.M.; Sobreiro, V.A.; Kimura, H. Literature Review: Machine Learning Techniques Applied to Financial Market Prediction. Expert Syst. Appl. 2019, 124, 226–251.
17. Zhang, Q.; Xie, C.; Weng, Z.; Sornette, D.; Wu, K. Generalized Visible Curvature: An Indicator for Bubble Identification and Price Trend Prediction in Cryptocurrencies. Decis. Support Syst. 2024, 185, 114309.
18. Cryptocurrency Order Book Data: Asks and Bids; Kaggle: San Francisco, CA, USA. Available online: https://www.kaggle.com/datasets/ilyazawilsiv/cryptocurrency-order-book-data-asks-and-bids (accessed on 11 October 2025).
19. Top 100 Cryptocurrency (2020–2025); Kaggle: San Francisco, CA, USA, 2025. Available online: https://www.kaggle.com/datasets/imtkaggleteam/top-100-cryptocurrency-2020-2025 (accessed on 11 October 2025).
20. Cont, R.; Kukanov, A.; Stoikov, S. The Price Impact of Order Book Events. J. Financ. Econom. 2012, 12, 47–88.
21. Hautsch, N.; Huang, R. The Market Impact of a Limit Order. J. Econ. Dyn. Control 2012, 36, 501–522.
22. Cartea, Á.; Jaimungal, S.; Penalva, J. Algorithmic and High-Frequency Trading; Cambridge University Press: Cambridge, UK, 2015; ISBN 978-1-107-09114-6.
23. El-Yaniv, R.; Wiener, Y. On the Foundations of Noise-Free Selective Classification. J. Mach. Learn. Res. 2010, 11, 1605–1641.
24. Geifman, Y.; El-Yaniv, R. Selective Classification for Deep Neural Networks. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 4885–4894.
25. Romano, Y.; Patterson, E.; Candès, E.J. Conformalized Quantile Regression. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Curran Associates Inc.: Red Hook, NY, USA, 2019; pp. 3543–3553.
26. Almgren, R.; Chriss, N. Optimal Execution of Portfolio Transactions. J. Risk 2001, 3, 5–39.
27. Makarov, I.; Schoar, A. Trading and Arbitrage in Cryptocurrency Markets. J. Financ. Econ. 2020, 135, 293–319.
28. Fees-Binance.US|Buy & Sell Crypto. Available online: https://www.binance.us/fees (accessed on 11 October 2025).
29. Zhang, J.; Cai, K.; Wen, J. A Survey of Deep Learning Applications in Cryptocurrency. iScience 2024, 27, 108509.
30. Golnari, A.; Komeili, M.H.; Azizi, Z. Probabilistic Deep Learning and Transfer Learning for Robust Cryptocurrency Price Prediction. Expert Syst. Appl. 2024, 255, 124404.
31. Peng, P.; Chen, Y.; Lin, W.; Wang, J.Z. Attention-Based CNN–LSTM for High-Frequency Multiple Cryptocurrency Trend Prediction. Expert Syst. Appl. 2024, 237, 121520.
32. Zhong, C.; Du, W.; Xu, W.; Huang, Q.; Zhao, Y.; Wang, M. LSTM-ReGAT: A Network-Centric Approach for Cryptocurrency Price Trend Prediction. Decis. Support Syst. 2023, 169, 113955.
33. Kang, M.; Hong, J.; Kim, S. Harnessing Technical Indicators with Deep Learning Based Price Forecasting for Cryptocurrency Trading. Phys. A Stat. Mech. Its Appl. 2025, 660, 130359.
34. Pellicani, A.; Pio, G.; Ceci, M. CARROT: Simultaneous Prediction of Anomalies from Groups of Correlated Cryptocurrency Trends. Expert Syst. Appl. 2025, 260, 125457.
35. Viéitez, A.; Santos, M.; Naranjo, R. Machine Learning Ethereum Cryptocurrency Prediction and Knowledge-Based Investment Strategies. Knowl.-Based Syst. 2024, 299, 112088.
36. Kraus, M.; Feuerriegel, S.; Oztekin, A. Deep Learning in Business Analytics and Operations Research: Models, Applications and Managerial Implications. Eur. J. Oper. Res. 2020, 281, 628–641.
37. Campbell, J.Y.; Lo, A.W.; MacKinlay, A.C. The Econometrics of Financial Markets; Princeton University Press: Princeton, NJ, USA, 1997; ISBN 978-0-691-04301-2.
38. Jegadeesh, N.; Titman, S. Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency. J. Financ. 1993, 48, 65–91.
39. Sharpe, W.F. The Sharpe Ratio. J. Portf. Manag. 1994, 21, 49–58.
40. Trydid, O.M.; Kavun, S.V.; Goykhman, M.I. Synthesis Concept of Information and Analytical Support for Bank Security System. Actual Probl. Econ. 2014, 11, 449–461.
41. Vnukova, N.; Kavun, S.; Kolodiziev, O.; Achkasova, S.; Hontar, D. Indicators-Markers for Assessment of Probability of Insurance Companies Relatedness in Implementation of Risk-Oriented Approach. Econ. Stud. J. 2020, 29, 151–173.
42. Kavun, S.; Zamula, A.; Mikheev, I. Calculation of Expense for Local Computer Networks. In Proceedings of the 2017 4th International Scientific-Practical Conference Problems of Infocommunications. Science and Technology (PIC S&T), Kharkov, Ukraine, 10–13 October 2017; pp. 146–151.

Figure 1. Temporal coverage analysis.

Figure 2. Cross-dataset feature comparison.

Figure 3. Bid–ask spread analysis.

Figure 4. Correlation network analysis.

Figure 5. Feature importance analysis.

Figure 6. Intraday trading patterns.

Figure 7. System pipeline.

Figure 8. Rolling correlation analysis.

Figure 9. Symbol performance comparison.

Figure 10. Confusion matrix.

Figure 11. ROC curve. The solid line represents the model performance, while the dotted diagonal line indicates the random classifier baseline.

Figure 12. Cumulative PnL. Dates are shown in YYYY/MM/DD format.

Figure 13. Profit histogram.

Figure 14. Per-symbol accuracy bars.

Figure 15. Per-symbol average profit bars.

Table 1. Comparative survey of machine learning approaches.

Study	Model Family	Task and Target	Horizon	Assets and Venue	Data Frequency and Features	Sample Period	Validation Design	Trading Protocol	Performance Metrics
Golnari et al. (2024) [30]	Sequence (P-GRU)	Return regression	5 min	BTC + 6 alts, Binance	5 min OHLCV	2020–2023	Walk-forward, 70-15-15 temporal split	Not specified	RMSE reduction vs. baseline; transfer learning demonstrated
Peng et al. (2024) [31]	Attention CNN-LSTM	Directional (triple label)	Multi-horizon	Multiple pairs	Minute-level OHLCV, cross-frequency	2019–2022	Fixed split	Long–short, reduced freq	Improved profit factor, transaction cost considered
Zhong et al. (2023) [32]	Graph (LSTM-ReGAT)	Price regression	1 h	Network of 10+ assets	Hourly OHLCV + network features	2018–2021	Temporal split	Portfolio rebalancing	Superior trading profits reported qualitatively
Kang et al. (2025) [33]	Sequence (SegRNN, TimesNet) + Technical	Price forecast and trading	4 h	ETH, BTC	4 h OHLCV + Bollinger, MACD, RSI	2020–2024	Symbol-wise temporal	Long-only	Sharpe ratio up to 3.56, max drawdown reported
Viéitez et al. (2024) [35]	ML ensemble	Price direction	Daily	ETH	Daily external indices, online trends	2019–2023	Temporal validation	Knowledge-based strategies	Profit factor up to 5.16
Zhang et al. (2024) [17]	Tree-based (LightGBM) + Curvature	Bubble ID and price trend	Varied	BTC, ETH	Daily + curvature indicators	2017–2022	Cross-validation	Not specified	AUC, precision-recall for bubble detection
Izadi & Hajizadeh (2025) [7]	Transformer + parallel CNN	Trend prediction	Short-term	BTC, major pairs	High-frequency OHLCV	2022–2024	Standard split	Not specified	57% trend accuracy reported
Liu & Huang (2024) [8]	LSTM + technical indicators	Directional classification	Intraday	Multiple pairs	Minute OHLCV + technical	2021–2023	Walk-forward	Long–short with randomness	Profitable outcomes with cost consideration
Our study	MLP with confidence threshold	Directional (binary) + selective execution	600 min (10 h)	11 USDT pairs, centralized exchange	Minute order book (50 levels) + daily macro (OHLCV, technical)	October 2023–October 2024	Symbol-wise temporal, 70-15-15	Selective long–short, 1 bp cost, no leverage	Direction accuracy 82.68% on executed (11.99% coverage), avg profit 151.11 bps, Sharpe 0.83 per-trade, balanced accuracy, F1, MCC reported

Table 2. Dataset composition and quality assessment.

Dataset	Observations	Features	Symbols	Temporal Coverage	Missing Data (%)	Memory Usage (MB)
Macro	211,679	38	100	2018-08-01 to 2025-08-05	2.75	48.1
Micro	5,672,947	264	11	2023-10-07 to 2024-10-13	0.0	11,038.4
Unified	802,967	296	11	2023-10-07 to 2024-10-12	1.28	548.8

Table 3. Complete preprocessing pipeline.

Processing Stage	Parameter	Value	Rationale
Symbol Normalization	Quote Currencies	11 types	USDT, BTC, EUR, USD, etc.
Temporal Validation	Max Gap (Daily)	24 h	Missing trading session detection
Temporal Validation	Max Gap (Minute)	90 min	Exchange maintenance windows
Price Validation	Deadband Threshold	10 bps	Noise filtering for targets
Price Validation	Max Price Change	20%	Outlier detection baseline
Order Book Validation	Min Spread	0.001 bps	BTC-friendly threshold
Order Book Validation	Max Spread	200 bps	Liquidity constraint
Order Book Validation	Depth Levels	50	Full book representation
Feature Selection	Top-K Features	128	Computational efficiency
Feature Selection	Selection Method	Mutual Info	Non-linear relationships
Feature Selection	Sample Size	100,000	Statistical robustness
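The order book and price validation thresholds in Table 3 can be read as simple filters. The sketch below shows one plausible interpretation: the quoted spread is measured relative to the mid price, and targets whose price move falls inside the 10 bps deadband are dropped as noise. The function names, the mid-price convention, and the handling of in-deadband samples are assumptions made for illustration.

```python
def spread_bps(best_bid, best_ask):
    """Quoted spread relative to the mid price, in basis points."""
    mid = (best_bid + best_ask) / 2.0
    return (best_ask - best_bid) / mid * 1e4


def spread_is_valid(best_bid, best_ask, min_bps=0.001, max_bps=200.0):
    """Keep snapshots whose spread lies within the Table 3 bounds."""
    return min_bps <= spread_bps(best_bid, best_ask) <= max_bps


def deadband_label(price_now, price_future, deadband_bps=10.0):
    """Return 1 (up), 0 (down), or None when the move is inside the deadband."""
    move_bps = (price_future - price_now) / price_now * 1e4
    if abs(move_bps) < deadband_bps:
        return None  # assumed to be excluded from the targets as noise
    return 1 if move_bps > 0 else 0


print(spread_is_valid(99.99, 100.01))   # a spread of roughly 2 bps passes
print(deadband_label(100.0, 100.05))    # a 5 bps move is inside the deadband
print(deadband_label(100.0, 100.50))    # a 50 bps move is labeled as up
```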

Table 4. Neural network architecture and training configuration.

Component	Specification	Justification
Input Layer	128 features	Selected via mutual information
Hidden Layers	256 → 128 → 64 units	Hierarchical feature abstraction
Activation Function	ReLU	Computational efficiency
Dropout Rate	0.2	Overfitting prevention
Output Layer	2 units (sigmoid)	Binary classification
Optimizer	Adam	Adaptive learning rates
Learning Rate	0.001	Stable convergence
Batch Size	4096	Memory efficiency
Training Epochs	80	Sufficient convergence
Early Stopping	10 epochs patience	Generalization protection
Class Weighting	Balanced	Address class imbalance
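To give a sense of model size, the layer widths in Table 4 imply the following number of trainable parameters, assuming plain fully connected layers with bias terms and no additional parameterized layers such as batch normalization (the table does not state this explicitly).

```python
# Layer widths from Table 4: input, three hidden layers, output.
layer_sizes = [128, 256, 128, 64, 2]


def count_parameters(sizes):
    """Weights plus biases of a fully connected network with the given widths."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))


total_parameters = count_parameters(layer_sizes)
print(total_parameters)  # 74306
```

Under these assumptions the network has roughly 74 thousand parameters, which is small relative to the 802,967 unified observations reported in Table 2.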

Table 5. Experimental dataset composition and symbol statistics.

Symbol	Market Cap Rank *	Observations (Count)	Data Coverage (%)	Avg Daily Volume (USD Millions)	Spread Median (bps)	Spread P95 (bps)
BTC_USDT	1	89,453	99.2	2847.3	0.85	4.21
ETH_USDT	2	94,721	99.8	1923.7	1.12	5.67
BNB_USDT	3	72,184	98.6	412.8	1.43	7.89
SOL_USDT	4	76,934	98.9	1234.5	1.34	6.45
XRP_USDT	5	78,456	99.3	687.2	1.45	7.34
DOGE_USDT	6	71,289	98.4	298.7	1.89	10.23
ADA_USDT	7	83,562	99.1	156.4	1.78	9.12
AVAX_USDT	8	47,612	97.3	145.2	2.34	12.89
DOT_USDT	9	75,193	98.7	134.7	1.67	8.95
MATIC_USDT	10	69,823	98.1	89.3	2.11	11.67
TRX_USDT	11	43,740	96.8	67.8	2.67	15.43

* Market capitalization rank represents the relative size of each cryptocurrency by total market value during the study period (October 2023–October 2024). Rank 1 indicates the largest market capitalization. The selection spans different market tiers to evaluate model performance across varying liquidity conditions and trading characteristics.

Table 6. Classification performance metrics for confidence-based direction prediction.

Metric	Value	95% CI
Direction Accuracy	0.8268	[0.8241, 0.8295]
F1 Score (Macro)	0.8195	[0.8166, 0.8224]
Precision (Macro)	0.8220	[0.8191, 0.8249]
Recall (Macro)	0.8176	[0.8147, 0.8205]
ROC-AUC	0.6886	[0.6847, 0.6925]
PR-AUC	0.6695	[0.6654, 0.6736]
Win Rate	0.8268	[0.8241, 0.8295]

Table 7. Trading performance analysis.

Performance Metric	Value	Unit
Average Net Profit	151.11	basis points
Median Net Profit	124.60	basis points
Profit Standard Deviation	181.77	basis points
Sharpe Ratio (per trade)	0.8313	dimensionless
Maximum Profit	1196.28	basis points
Minimum Profit (Largest Loss)	−641.32	basis points
25th Percentile Profit	39.08	basis points
75th Percentile Profit	242.28	basis points
90th Percentile Profit	387.35	basis points
10th Percentile Profit	−46.30	basis points
Coverage Rate	11.99	percent
Total Executed Trades	96,247	count
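Two of the figures in Table 7 follow directly from other reported values, as the short check below illustrates. It assumes the per-trade Sharpe ratio is the average net profit divided by the profit standard deviation, with no risk-free rate and no annualization, and that coverage is the share of the 802,967 unified observations (Table 2) that were executed.

```python
# Values reported in Table 7 and Table 2.
avg_net_profit_bps = 151.11
profit_std_bps = 181.77
executed_trades = 96_247
total_observations = 802_967

# Per-trade Sharpe ratio: mean over standard deviation (assumed definition).
sharpe_per_trade = avg_net_profit_bps / profit_std_bps

# Coverage: fraction of observations on which a trade was executed.
coverage_percent = 100.0 * executed_trades / total_observations

print(round(sharpe_per_trade, 4))   # 0.8313
print(round(coverage_percent, 2))   # 11.99
```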

Table 8. Performance sensitivity to transaction costs.

Transaction Cost (bps)	Avg Net Profit (bps)	Sharpe Ratio	Profitable Trades (%)	Optimal Threshold (τ)
0.5	168.22	0.9145	84.12	0.78
1.0 (baseline)	151.11	0.8313	82.68	0.80
2.0	117.89	0.6234	79.34	0.83
3.0	84.67	0.4156	75.01	0.86
4.0	51.45	0.2089	69.23	0.89
5.0	18.23	0.0891	62.45	0.92

Table 9. Per-symbol trading performance analysis.

Symbol	Samples	Coverage (%)	Executed	Accuracy	Avg Profit (bps)	Win Rate (%)	Avg Confidence
BTC_USDT	89,453	8.2	7331	0.8891	198.45	88.9	0.8456
ETH_USDT	94,721	11.7	11,082	0.8634	162.33	86.3	0.8321
BNB_USDT	72,184	15.3	11,044	0.8121	134.22	81.2	0.8198
ADA_USDT	83,562	12.9	10,779	0.8043	145.67	80.4	0.8156
SOL_USDT	76,934	13.8	10,617	0.8298	158.91	83.0	0.8267
DOGE_USDT	71,289	14.1	10,052	0.7956	128.73	79.6	0.8089
XRP_USDT	78,456	11.4	8944	0.8445	174.56	84.5	0.8378
MATIC_USDT	69,823	12.7	8871	0.8067	139.44	80.7	0.8145
DOT_USDT	75,193	10.8	8121	0.8334	165.78	83.3	0.8298
AVAX_USDT	47,612	18.6	8856	0.7823	112.89	78.2	0.8034
TRX_USDT	43,740	21.4	9350	0.7734	108.56	77.3	0.7989

Table 10. Feature category distribution and importance.

Feature Category	Count	Percentage (%)	Average MI Score
Order Book (Best Quotes)	2	1.6	0.0847
Order Book (Deep Levels)	102	79.7	0.0234
Price/OHLC Data	4	3.1	0.0692
Technical Indicators	6	4.7	0.0451
Volatility Measures	3	2.3	0.0398
Volume Features	2	1.6	0.0312
Spread/Microstructure	5	3.9	0.0556
Returns/Momentum	5	3.9	0.0487

Table 11. Baseline comparison analysis.

Model	Direction Accuracy	Coverage	Avg Profit (bps)	Sharpe Ratio	Total Trades
Confidence Threshold (τ = 0.8)	82.68%	11.99%	151.11	0.8313	96,247
Always-Execute Binary	76.34%	100%	89.23	0.4156	802,967
Fixed Threshold (τ = 0.5)	74.12%	47.83%	102.45	0.5891	384,019
Random Selection (12% cov)	50.21%	12.00%	−8.34	−0.0892	96,356

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kuznetsov, O.; Kostenko, O.; Klymenko, K.; Hbur, Z.; Kovalskyi, R. Machine Learning Analytics for Blockchain-Based Financial Markets: A Confidence-Threshold Framework for Cryptocurrency Price Direction Prediction. Appl. Sci. 2025, 15, 11145. https://doi.org/10.3390/app152011145

