Digital Asset Analytics for DeFi Protocol Valuation: An Explainable Optuna-Tuned Super Learner Ensemble Framework
Abstract
1. Introduction
2. Related Work
2.1. Disclosure Determinants of Decentralized Finance (DeFi) Valuation
2.2. Machine Learning Applications in Valuation: Bridging the DeFi Gap
3. Proposed Framework
3.1. General Context
3.2. Phase 1: Data Preparation
3.3. Phase 2: Model Development
3.3.1. Extremely Randomized Trees (ETs)
3.3.2. Categorical Gradient Boosting (CAT)
3.3.3. Support Vector Regression (SVR)
3.3.4. The K-Nearest Neighbors (KNNs)
3.3.5. Optuna Optimization
- 1.
- Define the Objective Function
- 2.
- Create a Study Object
- 3.
- Run the Optimization
- Its base learners (ET, CAT, and SVR);
- Other tree-based and boosting homogenous ensemble methods (RF, Bagging, LGBM, AdaBoost);
- Average and weighted-voting heterogeneous ensembles composed of the same constituent algorithms;
- Classical regressors (OLS, Ridge, Lasso, ElasticNet, DT, KNN).
4. Experimental Results and Discussion
4.1. Analysis of Experimental Results
4.1.1. Performance Comparison Between the Super Learner and Its Base Learners
4.1.2. Performance Comparison Between the Super Learner and Alternative Ensemble Strategies
4.1.3. Performance Comparison Between the Super Learner and Statistical and ML Models
4.1.4. SHAP Analysis
5. Discussion of Results
6. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Akiba, T., Sano, S., Yanase, T., Ohta, T., & Koyama, M. (2019, August 4–8). Optuna: A next-generation hyperparameter optimization framework. 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (pp. 2623–2631), Anchorage, AK, USA. [Google Scholar]
- Akyildirim, E., Goncu, A., & Sensoy, A. (2021). Prediction of cryptocurrency returns using machine learning. Annals of Operations Research, 297(1–2), 3–36. [Google Scholar] [CrossRef]
- Alamsyah, A., & Salsabila, N. (2024, August 7–8). Exploring the mechanisms of decentralized finance (DeFi) using blockchain technology. 2024 3rd International Conference on Creative Communication and Innovative Technology (ICCIT) (pp. 1–8), Tangerang, Indonesia. [Google Scholar]
- Ali, G., & Yahia, Z. (2025). Disclosure determinants of blockchain crowdfunding performance for sustainable smart city financing: An explainable Optuna-optimized machine learning approach. Journal of Posthumanism, 5(7), 802–826. [Google Scholar] [CrossRef]
- Almarzooq, H., & bin Waheed, U. (2024). Automating hyperparameter optimization in geophysics with Optuna: A comparative study. Geophysical Prospecting, 72(5), 1778–1788. [Google Scholar] [CrossRef]
- Almeida, D., Dionísio, A., Vieira, I., & Ferreira, P. (2022). Uncertainty and risk in the cryptocurrency market. Journal of Risk and Financial Management, 15(11), 532. [Google Scholar] [CrossRef]
- Angeris, G., Kao, H.-T., Chiang, R., Noyes, C., & Chitra, T. (2019). An analysis of Uniswap markets. Cryptoeconomic Systems, 1(1). [Google Scholar] [CrossRef]
- Angeris, G., Kao, H.-T., Chiang, R., Noyes, C., & Chitra, T. (2021). An analysis of Uniswap markets. arXiv, arXiv:1911.03380. [Google Scholar] [CrossRef]
- Ante, L. (2022). The non-fungible token (NFT) market and its relationship with Bitcoin and Ethereum. FinTech, 1(3), 216–224. [Google Scholar] [CrossRef]
- Bhambhwani, S. M., & Huang, A. H. (2024). Auditing decentralized finance. The British Accounting Review, 56(2), 101270. [Google Scholar] [CrossRef]
- Bogaert, M., & Delaere, L. (2023). Ensemble methods in customer churn prediction: A comparative analysis of the state-of-the-art. Mathematics, 11(5), 1137. [Google Scholar] [CrossRef]
- Cakici, N., Fieberg, C., Metko, D., & Zaremba, A. (2023). Machine learning goes global: Cross-sectional return predictability in international stock markets. Journal of Economic Dynamics and Control, 155, 104725. [Google Scholar] [CrossRef]
- Chhajer, P., Shah, M., & Kshirsagar, A. (2022). The applications of artificial neural networks, support vector machines, and long–short term memory for stock market prediction. Decision Analytics Journal, 2, 100015. [Google Scholar] [CrossRef]
- Cong, L. W., & He, Z. (2019). Blockchain disruption and smart contracts. The Review of Financial Studies, 32(5), 1754–1797. [Google Scholar] [CrossRef]
- Cong, L. W., Li, Y., & Wang, N. (2021). Tokenomics: Dynamic adoption and valuation. The Review of Financial Studies, 34(3), 1105–1155. [Google Scholar] [CrossRef]
- Corbet, S., Goodell, J. W., Gunay, S., & Kaskaloglu, K. (2023). Are DeFi tokens a separate asset class from conventional cryptocurrencies? Annals of Operations Research, 322(2), 609–630. [Google Scholar] [CrossRef]
- Cousaert, S., Xu, J., & Matsui, T. (2022, May 2–5). Sok: Yield aggregators in defi. 2022 IEEE International Conference on Blockchain and Cryptocurrency (ICBC) (pp. 1–14), Shanghai, China. [Google Scholar]
- Daian, P., Goldfeder, S., Kell, T., Li, Y., Zhao, X., Bentov, I., Breidenbach, L., & Juels, A. (2019). Flash boys 2.0: Frontrunning, transaction reordering, and consensus instability in decentralized exchanges. arXiv, arXiv:1904.05234. [Google Scholar] [CrossRef]
- Dash, R. K., Nguyen, T. N., Cengiz, K., & Sharma, A. (2023). Fine-tuned support vector regression model for stock predictions. Neural Computing and Applications, 35(32), 23295–23309. [Google Scholar] [CrossRef]
- Davies, M. M., & Van Der Laan, M. J. (2016). Optimal spatial prediction using ensemble machine learning. International Journal of Biostatistics, 12(1), 179–201. [Google Scholar] [CrossRef]
- Dey, R., & Mathur, R. (2023). Ensemble learning method using stacking with base learner, a comparison. In Proceedings of international conference on data analytics and insights, ICDAI 2023 (Volume 727 LNNS, pp. 159–169). Lecture Notes in Networks and Systems. Springer. [Google Scholar] [CrossRef]
- Dorogush, A. V., Ershov, V., & Gulin, A. (2018). CatBoost: Gradient boosting with categorical features support. arXiv, arXiv:1810.11363. [Google Scholar] [CrossRef]
- Erdebilli, B., & Devrim-İçtenbaş, B. (2022). Ensemble voting regression based on machine learning for predicting medical waste: A case from Turkey. Mathematics, 10(14), 2466. [Google Scholar] [CrossRef]
- Foster, K., Blakstad, S., Gazi, S., & Bos, M. (2021). BigFintechs and their impacts on macroeconomic policies. Dialogue on Global Digital Finance Governance, Technical Paper No. 1.1B. United Nations Development Programme & United Nations Capital Development Fund. Available online: https://hub.hku.hk/handle/10722/301744 (accessed on 19 November 2025).
- Geertsema, P., & Lu, H. (2019). Machine valuation. Working Paper. [Google Scholar] [CrossRef]
- Geertsema, P., & Lu, H. (2023). Relative valuation with machine learning. Journal of Accounting Research, 61(1), 329–376. [Google Scholar] [CrossRef]
- González, S., García, S., Del Ser, J., Rokach, L., & Herrera, F. (2020). A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and opportunities. Information Fusion, 64, 205–237. [Google Scholar] [CrossRef]
- Gudgeon, L., Werner, S., Perez, D., & Knottenbelt, W. J. (2020, October 21–23). Defi protocols for loanable funds: Interest rates, liquidity and market efficiency. 2nd ACM Conference on Advances in Financial Technologies (pp. 92–112), New York, NY, USA. [Google Scholar]
- Ince, H., & Trafalis, T. B. (2008). Short term forecasting with support vector machines and application to stock price prediction. International Journal of General Systems, 37(6), 677–687. [Google Scholar] [CrossRef]
- John, K., Kogan, L., & Saleh, F. (2023). Smart contracts and decentralized finance. Annual Review of Financial Economics, 15(1), 523–542. [Google Scholar] [CrossRef]
- Kaal, W. A., Evans, S. R., & Howe, H. A. (2024). Digital asset valuation. Richmond Journal of Law and Technology, 28, 657. [Google Scholar] [CrossRef]
- Kayikci, S., & Khoshgoftaar, T. M. (2024). Blockchain meets machine learning: A survey. Journal of Big Data, 11(1), 9. [Google Scholar] [CrossRef]
- Kim, H. S., & Sohn, S. Y. (2010). Support vector machines for default prediction of SMEs based on technology credit. European Journal of Operational Research, 201(3), 838–846. [Google Scholar] [CrossRef]
- Klages-Mundt, A., Harz, D., Gudgeon, L., Liu, J.-Y., & Minca, A. (2020, October 21–23). Stablecoins 2.0: Economic foundations and risk-based models. 2nd ACM Conference on Advances in Financial Technologies (pp. 59–79), New York, NY, USA. [Google Scholar]
- Knechel, W. R., Maex, S., & Park, H. J. (2023). Decentralized Finance (DeFi) and cybersecurity assurance. In Donald G. Costello College of Business at George Mason University research paper. George Mason University. [Google Scholar]
- Koklev, P. S. (2022). Business valuation with machine learning. Finance: Theory and Practice, 26(5), 132–148. [Google Scholar] [CrossRef]
- Kunapuli, G. (2023). Ensemble methods for machine learning. Simon and Schuster. [Google Scholar]
- Ladds, M. A., Thompson, A. P., Kadar, J.-P., Slip, D. J., P Hocking, D., & G Harcourt, R. (2017). Super machine learning: Improving accuracy and reducing variance of behaviour classification from accelerometry. Animal Biotelemetry, 5, 8. [Google Scholar] [CrossRef]
- Lee, S., Nguyen, N., Karamanli, A., Lee, J., & Vo, T. P. (2023). Super learner machine-learning algorithms for compressive strength prediction of high performance concrete. Structural Concrete, 24(2), 2208–2228. [Google Scholar] [CrossRef]
- Liu, Y., & Tsyvinski, A. (2021). Risks and returns of cryptocurrency. The Review of Financial Studies, 34(6), 2689–2727. [Google Scholar] [CrossRef]
- Loukil, S., Syed, A. A., Hamza, F., & Jeribi, A. (2025). Decoding the dynamic connectedness between traditional and digital assets under dynamic economic conditions. Journal of Theoretical and Applied Electronic Commerce Research, 20(2), 97. [Google Scholar] [CrossRef]
- Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., & Lee, S.-I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. [Google Scholar] [CrossRef]
- Metelski, D., & Sobieraj, J. (2022). Decentralized finance (DeFi) projects: A study of key performance indicators in terms of DeFi protocols’ valuations. International Journal of Financial Studies, 10(4), 108. [Google Scholar] [CrossRef]
- Mienye, I. D., & Sun, Y. (2022). A survey of ensemble learning: Concepts, algorithms, applications, and prospects. IEEE Access, 10, 99129–99149. [Google Scholar] [CrossRef]
- Mohammed, A., & Kora, R. (2023). A comprehensive review on ensemble deep learning: Opportunities and challenges. Journal of King Saud University—Computer and Information Sciences, 35(2), 757–774. [Google Scholar] [CrossRef]
- Naimi, A. I., & Balzer, L. B. (2018). Stacked generalization: An introduction to super learning. European Journal of Epidemiology, 33, 459–464. [Google Scholar] [CrossRef]
- Nguyen, X. C., Nguyen, T. T. H., La, D. D., Kumar, G., Rene, E. R., Nguyen, D. D., Chang, S. W., Chung, W. J., Nguyen, X. H., & Nguyen, V. K. (2021). Development of machine learning—Based models to forecast solid waste generation in residential areas: A case study from Vietnam. Resources, Conservation and Recycling, 167, 105381. [Google Scholar] [CrossRef]
- Parekh, N., Sen, A., Rajasekaran, P., Jayaseeli, J. D. D., & Robert, P. (2024, December 17–18). Network intrusion detection system using Optuna. 2024 International Conference on IoT Based Control Networks and Intelligent Systems (ICICNIS) (pp. 312–318), Online. [Google Scholar]
- Phillips, R. V., Van Der Laan, M. J., Lee, H., & Gruber, S. (2023). Practical considerations for specifying a super learner. International Journal of Epidemiology, 52(4), 1276–1285. [Google Scholar] [CrossRef]
- Porwik, P., Doroz, R., & Wrobel, K. (2019). An ensemble learning approach to lip-based biometric verification, with a dynamic selection of classifiers. Expert Systems with Applications, 115, 673–683. [Google Scholar] [CrossRef]
- Priyadarshi, P., & Kumar, P. (2024). Detecting insider trading in the indian stock market: An optimized deep learning approach. Computational Economics, 65, 3923–3943. [Google Scholar] [CrossRef]
- Schär, F. (2021). Decentralized finance: On blockchain- and smart contract-based financial markets. Federal Reserve Bank of St. Louis Review, 103(2), 153–174. [Google Scholar] [CrossRef]
- Schmid, L., Gerharz, A., Groll, A., & Pauly, M. (2023). Tree-based ensembles for multi-output regression: Comparing multivariate approaches with separate univariate ones. Computational Statistics & Data Analysis, 179, 107628. [Google Scholar] [CrossRef]
- Srinivas, P., & Katarya, R. (2022). hyOPTXg: OPTUNA hyper-parameter optimization framework for predicting cardiovascular disease using XGBoost. Biomedical Signal Processing and Control, 73, 103456. [Google Scholar] [CrossRef]
- Şoiman, F., Dumas, J.-G., & Jimenez-Garces, S. (2023). What drives DeFi market returns? Journal of International Financial Markets, Institutions and Money, 85, 101786. [Google Scholar] [CrossRef]
- Thabet, H. H., Darwish, S. M., & Ali, G. M. (2024). Measuring the efficiency of banks using high-performance ensemble technique. Neural Computing and Applications, 36, 16797–16815. [Google Scholar] [CrossRef]
- Van der Laan, M. J., Polley, E. C., & Hubbard, A. E. (2007). Super learner. Statistical Applications in Genetics and Molecular Biology, 6(1), Art25. [Google Scholar] [CrossRef]
- Werner, S., Perez, D., Gudgeon, L., Klages-Mundt, A., Harz, D., & Knottenbelt, W. (2022, September 19–21). Sok: Decentralized finance (DeFi). 4th ACM Conference on Advances in Financial Technologies (pp. 30–46), Cambridge, MA, USA. [Google Scholar]
- Wong, J., Manderson, T., Abrahamowicz, M., Buckeridge, D. L., & Tamblyn, R. (2019). Can hyperparameter tuning improve the performance of a super learner?: A case study. Epidemiology, 30(4), 521–531. [Google Scholar] [CrossRef]
- World Bank. (2023). Remittance prices worldwide. Available online: https://remittanceprices.worldbank.org (accessed on 19 November 2025).
- Xu, J., Paruch, K., Cousaert, S., & Feng, Y. (2023). Sok: Decentralized exchanges (dex) with automated market maker (amm) protocols. ACM Computing Surveys, 55(11), 1–50. [Google Scholar] [CrossRef]
- Yan, Y., Guo, W., Zhao, M., Hu, J., & Yan, W. P. (2017). Optimizing gross merchandise volume via DNN-MAB dynamic ranking paradigm. arXiv, arXiv:1708.03993. [Google Scholar] [CrossRef]
- Yousaf, I., & Yarovaya, L. (2022). Static and dynamic connectedness between NFTs, Defi and other assets: Portfolio implication. Global Finance Journal, 53, 100719. [Google Scholar] [CrossRef]
- Zetzsche, D. A., Arner, D. W., & Buckley, R. P. (2020). Decentralized finance. Journal of Financial Regulation, 6(2), 172–203. [Google Scholar] [CrossRef]
- Zhang, C., Zhang, H., & Liu, D. (2020). A contrastive study of machine learning on energy firm value prediction. IEEE Access, 8, 11635–11643. [Google Scholar] [CrossRef]
- Zhang, R., Tian, Z., McCarthy, K. J., Wang, X., & Zhang, K. (2023). Application of machine learning techniques to predict entrepreneurial firm valuation. Journal of Forecasting, 42(2), 402–417. [Google Scholar] [CrossRef]
- Zmaznev, E. (2021). Measuring decentralized finance regulatory uncertainty (pp. 1–43). Norwegian School of Economics. [Google Scholar]









| Variable | Description |
|---|---|
| The Dependent Variable | |
| Valuation (VAL) | Market value of a DeFi protocol, calculated as circulating token supply multiplied by token price. Reflects overall protocol valuation within the digital asset ecosystem. |
| The Independent Variables | |
| Total Value Locked (TVL) | Total amount of user funds deposited in a protocol’s smart contracts (e.g., lending, staking, liquidity pools). Indicates protocol scale, liquidity, and user trust. |
| Protocol Revenue (PR) | Revenue distributed directly to token holders, representing financial returns generated for protocol participants. |
| Total Revenue (TR) | Total user fees collected over a specified period. Includes both protocol revenue and supply-side revenue (e.g., liquidity providers). |
| Gross Merchandise Volume (GMV) | Total value of transactions processed by the protocol during a given period. Used to assess growth, activity levels, and competitive positioning. |
| Inflation Factor (INF) | Measure of the change in circulating token supply, reflecting dilution effects. Calculated as the percentage increase in token supply from one period to the next. |
| DeFi-class | Categorical variable indicating the protocol type. It includes three classes: Decentralized Exchanges (DEXs), which enable peer-to-peer trading; Lending Protocols, which support decentralized borrowing and lending; and Asset Management Protocols, which provide automated portfolio management and yield optimization services. |
| Model/Parameter | Definition | Selected Value |
|---|---|---|
| ET | ||
| n_estimators | Number of trees in the ensemble. | 132 |
| max_depth | Maximum depth of each tree. | 9 |
| CAT | ||
| iterations | Number of boosting iterations (trees). | 394 |
| max_depth | Maximum depth of each tree. | 8 |
| learning_rate | Step size used to update model weights. | 0.0206 |
| SVR | ||
| kernel | Type of function (e.g., linear, polynomial, rbf) used to map data into a higher-dimensional space. | rbf |
| C | Regularization strength controlling the balance between error tolerance and model complexity. | 1.0 |
| KNN | ||
| n_neighbors | Number of nearest neighbors | 18 |
| weights | How neighbor contributions are weighted | uniform |
| p | Power parameter for Minkowski distance. | 1 |
| Model | RMSE (±SD) | MAE (±SD) | R2 (±SD) | Wilcoxon Test |
|---|---|---|---|---|
| Super Learner | 0.0854 ± 0.0259 | 0.0645 ± 0.0231 | 0.9731 ± 0.0202 | — |
| ET | 0.1165 ± 0.0519 | 0.0921 ± 0.0506 | 0.9424 ± 0.0628 | ** |
| CAT | 0.1318 ± 0.0603 | 0.1087 ± 0.0590 | 0.9235 ± 0.0789 | ** |
| SVR | 0.1572 ± 0.0649 | 0.1214 ± 0.0516 | 0.8960 ± 0.1001 | ** |
| Model | RMSE (±SD) | MAE (±SD) | R2 (±SD) | Wilcoxon Test |
|---|---|---|---|---|
| Super Learner | 0.0854 ± 0.0259 | 0.0645 ± 0.0231 | 0.9731 ± 0.0202 | — |
| WV Ensemble | 0.1235 ± 0.0571 | 0.1012 ± 0.0564 | 0.9335 ± 0.0736 | ** |
| AV Ensemble | 0.1247 ± 0.0574 | 0.1021 ± 0.0565 | 0.9324 ± 0.0745 | ** |
| Bagging | 0.1536 ± 0.0682 | 0.1119 ± 0.0582 | 0.8971 ± 0.0945 | ** |
| LGBM | 0.1632 ± 0.0657 | 0.1146 ± 0.0576 | 0.8898 ± 0.0927 | ** |
| RF | 0.1674 ± 0.0753 | 0.1264 ± 0.0672 | 0.8749 ± 0.1186 | ** |
| AdaBoost | 0.1911 ± 0.0809 | 0.1562 ± 0.0695 | 0.8380 ± 0.1443 | ** |
| Model | RMSE (±SD) | MAE (±SD) | R2 (±SD) | Wilcoxon Test |
|---|---|---|---|---|
| Super Learner | 0.0854 ± 0.0259 | 0.0645 ± 0.0231 | 0.9731 ± 0.0202 | — |
| KNN | 0.1599 ± 0.0796 | 0.1268 ± 0.0735 | 0.8811 ± 0.1242 | ** |
| Ridge | 0.1757 ± 0.0901 | 0.1345 ± 0.0757 | 0.8561 ± 0.1455 | ** |
| ElasticNet | 0.1815 ± 0.0876 | 0.1394 ± 0.0674 | 0.8556 ± 0.1210 | ** |
| Lasso | 0.1883 ± 0.0864 | 0.1441 ± 0.0632 | 0.8486 ± 0.1280 | ** |
| OLS | 0.1899 ± 0.0793 | 0.1433 ± 0.0705 | 0.8482 ± 0.1299 | ** |
| DT | 0.2025 ± 0.0873 | 0.1375 ± 0.0679 | 0.8231 ± 0.1463 | ** |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ali, G.M. Digital Asset Analytics for DeFi Protocol Valuation: An Explainable Optuna-Tuned Super Learner Ensemble Framework. J. Risk Financial Manag. 2026, 19, 63. https://doi.org/10.3390/jrfm19010063
Ali GM. Digital Asset Analytics for DeFi Protocol Valuation: An Explainable Optuna-Tuned Super Learner Ensemble Framework. Journal of Risk and Financial Management. 2026; 19(1):63. https://doi.org/10.3390/jrfm19010063
Chicago/Turabian StyleAli, Gihan M. 2026. "Digital Asset Analytics for DeFi Protocol Valuation: An Explainable Optuna-Tuned Super Learner Ensemble Framework" Journal of Risk and Financial Management 19, no. 1: 63. https://doi.org/10.3390/jrfm19010063
APA StyleAli, G. M. (2026). Digital Asset Analytics for DeFi Protocol Valuation: An Explainable Optuna-Tuned Super Learner Ensemble Framework. Journal of Risk and Financial Management, 19(1), 63. https://doi.org/10.3390/jrfm19010063

