Deterministic Data Governance in Hybrid Financial Architectures
Abstract
1. Introduction
- RQ1: How can a hybrid financial architecture be designed to ensure the deterministic application of data governance mechanisms across the RDBMS, Big Data and Cloud layers?
- RQ2: What performance and computational resource costs are introduced by implementing data governance mechanisms within distributed financial workflows?
- RQ3: To what extent do governance scenarios influence system performance, execution repeatability, and cross-run consistency across the evaluated RDBMS, Big Data, and Cloud workflows?
2. Literature Review
2.1. Relational Databases Data Governance Mechanisms
2.2. Governance Mechanisms in Big Data Frameworks for Financial Analytics
2.3. Data Governance in Cloud-Based Financial Services
2.4. Data Governance Integration Strategies and Findings
3. Materials and Methods
4. System Architecture and Governance Mechanism Integration
4.1. Comparison of Technologies in Relation to Data Governance
4.2. Data Governance Integration Strategies
4.3. Data Governance in Hybrid Architectures and Practical Implementation
5. Performance Analysis
5.1. Performance Analysis and Obtained Results
- Used data and representation formats
- Testing and result aggregation methodology
- Performance Impact and System Stability Across Governance Policies
5.2. Linking the Results to Financial Applications
6. Conclusions
6.1. Practical Outcomes and Implementation Recommendations
6.2. Methodological Constraints
6.3. Future Research Directions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Rizvi, S.K.A.; Rahat, B.; Naqvi, B.; Umar, M. Revolutionizing finance: The synergy of fintech, digital adoption, and innovation. Technol. Forecast. Soc. Change 2024, 200, 123112. [Google Scholar] [CrossRef]
- Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manag. 2015, 35, 137–144. [Google Scholar] [CrossRef]
- Hoofnagle, C.J.; van der Sloot, B.; Borgesius, F.Z. The European Union general data protection regulation: What it is and what it means. Inf. Commun. Technol. Law 2019, 28, 65–98. [Google Scholar] [CrossRef]
- State of California. California Consumer Privacy Act of 2018, Civil Code, Title 1.81.5. Official Legislative Text. 2018. Available online: https://cppa.ca.gov/regulations/pdf/ccpa_statute.pdf (accessed on 1 December 2025).
- Miller, K.M.; Lukic, K.; Skiera, B. The impact of the General Data Protection Regulation (GDPR) on online tracking. Int. J. Res. Mark. 2026, 43, 48–70. [Google Scholar] [CrossRef]
- Sharma, R.K.; Bharathy, G.; Karimi, F.; Mishra, A.V.; Prasad, M. Thematic Analysis of Big Data in Financial Institutions Using NLP Techniques with a Cloud Computing Perspective: A Systematic Literature Review. Information 2023, 14, 577. [Google Scholar] [CrossRef]
- Cheng, M.; Qu, Y.; Jiang, C.; Zhao, C. Is cloud computing the digital solution to the future of banking? J. Financ. Stab. 2022, 63, 101073. [Google Scholar] [CrossRef]
- Gąsiorkiewicz, L.; Monkiewicz, J. Digital Finance and the Future of the Global Financial System; Routledge: Oxfordshire, UK, 2022. [Google Scholar] [CrossRef]
- Zhang, C.; Li, G.; Zhang, J.; Zhang, X.; Feng, J. HTAP Databases: A Survey. IEEE Trans. Knowl. Data Eng. 2024, 36, 6410–6429. [Google Scholar] [CrossRef]
- Program on International Financial Systems. Data Localization, Cloud Adoption, and the Financial Sector. 2024. Available online: https://www.pifsinternational.org/wp-content/uploads/2024/07/Report-on-Data-Localization-07.29.2024.pdf (accessed on 1 December 2025).
- Dong, H.; Zhang, C.; Li, G.; Zhang, H. Cloud-Native Databases: A Survey. IEEE Trans. Knowl. Data Eng. 2024, 36, 7772–7791. [Google Scholar] [CrossRef]
- Gray, J. The Transaction Concept: Virtues and Limitations. In Proceedings of the Seventh International Conference on Very Large Data Bases, Cannes, France, September 9–11 1981; Available online: https://jimgray.azurewebsites.net/papers/thetransactionconcept.pdf (accessed on 1 December 2025).
- Carvalho, M.; Sá, F.; Bernardino, J. Evaluation of the Impact of AES Encryption on Query Read Performance Across Oracle, MySQL, and SQL Server Databases. Cryptography 2025, 9, 77. [Google Scholar] [CrossRef]
- Sun, B.; Zhao, S.; Tian, G. SQL queries over encrypted databases: A survey. Connect. Sci. 2024, 36, 2323059. [Google Scholar] [CrossRef]
- Rao, A.; Khankhoje, D.; Namdev, U.; Bhadane, C.; Dongre, D. Insights into NoSQL databases using financial data: A comparative analysis. Procedia Comput. Sci. 2022, 215, 8–23. [Google Scholar] [CrossRef]
- Pokorný, J. Integration of Relational and NoSQL Databases. Vietnam. J. Comput. Sci. 2019, 6, 389–405. [Google Scholar] [CrossRef]
- de la Vega, A.; García-Saiz, D.; Blanco, C.; Zorrilla, M.; Sánchez, P. Mortadelo: Automatic generation of NoSQL stores from platform-independent data models. Future Gener. Comput. Syst. 2020, 105, 455–474. [Google Scholar] [CrossRef]
- Deka, G.C. NoSQL Polyglot Persistence. In Advances in Computers; Elsevier: Amsterdam, The Netherlands, 2018; pp. 357–390. [Google Scholar] [CrossRef]
- Morabito, G.; Galletta, A.; Celesti, A.; Villari, M.; Fazio, M. Enhancing Data Privacy in Federated Data Spaces: Hierarchical Secret-Share based Pseudonymization. In Proceedings of the IEEE/ACM 16th International Conference on Utility and Cloud Computing, Taormina, Italy, 4–7 December 2023; ACM: New York, NY, USA, 2023; pp. 1–2. [Google Scholar] [CrossRef]
- Sankhe, M.; Mangla, M.; Shah, P.; Doshi, P.; Shah, J.; Vyas, D. A Unified Distributed Version Control System for SQL and Graph Databases. In Proceedings of the 2025 9th International Conference on Computing, Communication, Control and Automation (ICCCBEA), Pune, India, 22–23 August 2025; IEEE: Piscataway, NJ, USA, 2025; pp. 1–7. [Google Scholar] [CrossRef]
- Rao, T.R.; Mitra, P.; Bhatt, R.; Goswami, A. The big data system, components, tools, and technologies: A survey. Knowl. Inf. Syst. 2019, 60, 1165–1245. [Google Scholar] [CrossRef]
- Hussain, K.; Prieto, E. Big Data in the Finance and Insurance Sectors. In New Horizons for a Data-Driven Economy; Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 209–223. [Google Scholar] [CrossRef]
- Zaharia, M.; Xin, R.S.; Wendell, P.; Das, T.; Armbrust, M.; Dave, A.; Meng, X.; Rosen, J.; Venkataraman, S.; Franklin, M.J.; et al. Apache Spark. Commun. ACM 2016, 59, 56–65. [Google Scholar] [CrossRef]
- Schneider, J.; Gröger, C.; Lutsch, A.; Schwarz, H.; Mitschang, B. The Lakehouse: State of the Art on Concepts and Technologies. SN Comput. Sci. 2024, 5, 449. [Google Scholar] [CrossRef]
- Harby, A.A.; Zulkernine, F. Data Lakehouse: A survey and experimental study. Inf. Syst. 2025, 127, 102460. [Google Scholar] [CrossRef]
- Armbrust, M.; Das, T.; Sun, L.; Yavuz, B.; Zhu, S.; Murthy, M.; Torres, J.; van Hovell, H.; Ionescu, A.; Luszczak, A.; et al. Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores. Proc. VLDB Endow. 2020, 13, 3411–3424. [Google Scholar] [CrossRef]
- Apache Software Foundation. Apache Hadoop Documentation: Transparent Encryption in HDFS. 2026. Available online: https://hadoop.apache.org/docs/r3.4.3/hadoop-project-dist/hadoop-hdfs/TransparentEncryption.html (accessed on 1 December 2025).
- Apache Software Foundation. Hadoop Key Management Server (KMS)—Documentation Sets. 2026. Available online: https://hadoop.apache.org/docs/stable/hadoop-kms/index.html (accessed on 1 December 2025).
- Apache Software Foundation. Apache Spark Documentation: Security. 2026. Available online: https://spark.apache.org/docs/latest/security.html (accessed on 1 December 2025).
- Apache Software Foundation. Timeline|Apache Hudi. 2026. Available online: https://hudi.apache.org/docs/next/timeline/ (accessed on 1 December 2025).
- Apache Software Foundation. Apache Iceberg Documentation: Spark Queries—Time Travel. 2026. Available online: https://iceberg.apache.org/docs/latest/spark-queries/ (accessed on 1 December 2025).
- Ogundipe, D.O. Conceptualizing cloud computing in financial services: Opportunities and challenges in Africa-US contexts. Comput. Sci. It Res. J. 2024, 5, 757–767. [Google Scholar] [CrossRef]
- European Banking Authority. Guidelines on Outsourcing Arrangements (EBA/GL/2019/02). Official Guideline. 2019. Available online: https://www.eba.europa.eu/sites/default/files/documents/10180/2551996/38c80601-f5d7-4855-8ba3-702423665479/EBA%20revised%20Guidelines%20on%20outsourcing%20arrangements.pdf (accessed on 1 December 2025).
- European Central Bank. Guide on Outsourcing Cloud Services to Cloud Service Providers. Official Supervisory Guide. 2025. Available online: https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssm.supervisory_guides202507.en.pdf (accessed on 1 December 2025).
- Gajbhiye, A.; Shrivastva, K.M.P. Cloud computing: Need, enabling technology, architecture, advantages and challenges. In Proceedings of the 2014 5th International Conference—Confluence The Next Generation Information Technology Summit (Confluence), Noida, India, 25–26 September 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–7. [Google Scholar] [CrossRef]
- Huaman, C.H.O.; Fuster, N.F.; Luyo, A.C.; Armas-Aguirre, J. Critical Data Security Model: Gap Security Identification and Risk Analysis In Financial Sector. In Proceedings of the 2022 17th Iberian Conference on Information Systems and Technologies (CISTI), Madrid, Spain, 22–25 June 2022; IEEE: Piscataway, NJ, USA, 2022. [Google Scholar] [CrossRef]
- Shaik, V.; K., N. Cloud databases: A resilient and robust framework to dissolve vendor lock-in. Softw. Impacts 2024, 21, 100680. [Google Scholar] [CrossRef]
- Bolognini, L.; Bistolfi, C. Pseudonymization and impacts of Big (personal/anonymous) Data processing in the transition from the Directive 95/46/EC to the new EU General Data Protection Regulation. Comput. Law Secur. Rev. 2017, 33, 171–181. [Google Scholar] [CrossRef]
- Lee, S.; Kim, Y.; Kwon, Y.; Cho, S. Secure privacy-preserving record linkage system from re-identification attack. PLoS ONE 2025, 20, e0314486. [Google Scholar] [CrossRef] [PubMed]
- Achanta, M.; Vuppu, D. Implementing Data Versioning and Lineage Tracking in ETL Workflows. Int. J. Sci. Res. (IJSR) 2025, 14, 1312–1315. [Google Scholar] [CrossRef]
- European Union. Regulation (EU) 2022/2554 of the European Parliament and of the Council of 14 December 2022 on Digital Operational Resilience for the Financial Sector. Official EUR-Lex Text. 2022. Available online: https://eur-lex.europa.eu/eli/reg/2022/2554/oj/eng (accessed on 1 December 2025).
- Amazon Web Services. How S3 Versioning Works. 2026. Available online: https://docs.aws.amazon.com/AmazonS3/latest/userguide/versioning-workflows.html (accessed on 1 December 2025).
- Microsoft. Blob Versioning. 2026. Available online: https://learn.microsoft.com/en-us/azure/storage/blobs/versioning-overview (accessed on 1 December 2025).
- Google Cloud. Object Versioning. 2026. Available online: https://docs.cloud.google.com/storage/docs/object-versioning (accessed on 1 December 2025).
- Armbrust, M.; Ghodsi, A.; Xin, R.; Zaharia, M. Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics. Proc. CIDR 2021, 8, 28. [Google Scholar]
- Clemente-Castello, F.J.; Nicolae, B.; Katrinis, K.; Mustafa Rafique, M.; Mayo, R.; Fernandez, J.C.; Loreti, D. Enabling Big Data Analytics in the Hybrid Cloud Using Iterative MapReduce. In Proceedings of the 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC), Limassol, Cyprus, 7–10 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 290–299. [Google Scholar] [CrossRef]
- Garad, A.; Riyadh, H.A.; Al-Ansi, A.M.; Beshr, B.A.H. Unlocking financial innovation through strategic investments in information management: A systematic review. Discov. Sustain. 2024, 5, 381. [Google Scholar] [CrossRef]
- Eichler, R.; Giebler, C.; Gröger, C.; Schwarz, H.; Mitschang, B. Modeling metadata in data lakes: A generic model. Data Knowl. Eng. 2021, 136, 101931. [Google Scholar] [CrossRef]
- Ionescu, S.A.; Radu, A.O. Assessment and Integration of Relational Databases, Big Data, and Cloud Computing in Financial Institutions: Performance Comparison. In Proceedings of the 2024 International Conference on INnovations in Intelligent SysTems and Applications (INISTA), Craiova, Romania, 4–6 September 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–7. [Google Scholar] [CrossRef]
- Apache Software Foundation. Spark SQL and DataFrames. 2025. Available online: https://spark.apache.org/docs/latest/sql-programming-guide.html (accessed on 1 December 2025).
- Zhu, J.; Cheng, K.; Liu, J.; Guo, L. Full encryption. Proc. VLDB Endow. 2021, 14, 2811–2814. [Google Scholar] [CrossRef]
- Nwatuzie, G.A.; Enyejo, L.A.; Umeaku, C. Enhancing Cloud Data Security Using a Hybrid Encryption Framework Integrating AES, DES, and RC6 with File Splitting and Steganographic Key Management. Int. J. Innov. Sci. Res. Technol. 2025, 10, 1555–1569. [Google Scholar] [CrossRef]
- Stalla-Bourdillon, S. Identifiability, as a Data Risk: Is a Uniform Approach to Anonymisation About to Emerge in the EU? Eur. J. Risk Regul. 2025, 16, 1456–1474. [Google Scholar] [CrossRef]
- Baumgartner, M.; Kreiner, K.; Wiesmüller, F.; Hayn, D.; Puelacher, C.; Schreier, G. Masketeer: An Ensemble-Based Pseudonymization Tool with Entity Recognition for German Unstructured Medical Free Text. Future Internet 2024, 16, 281. [Google Scholar] [CrossRef]
- UCI Machine Learning. German Credit Risk. Kaggle Dataset Page, Modified 2016-12-14. 2016. Available online: https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data (accessed on 1 December 2025).
- Kumar, J.A.; Ajay, K.; Kumar, P.N. Leveraging Social Media and Multitask Natural Language Processing to Detect Financial Consumer Complaints. Econ. Comput. Econ. Cybern. Stud. Res. 2025, 59, 145–162. [Google Scholar] [CrossRef]
- Ionescu, S.A.; Diaconita, V. Transforming Financial Decision-Making: The Interplay of AI, Cloud Computing and Advanced Data Management Technologies. Int. J. Comput. Commun. Control 2023, 18, 1–19. [Google Scholar] [CrossRef]
- Ionescu, S.A.; Diaconita, V.; Radu, A.O. Engineering Sustainable Data Architectures for Modern Financial Institutions. Electronics 2025, 14, 1650. [Google Scholar] [CrossRef]
- Schreiner, G.A.; Knob, R.; Duarte, D.; Vilain, P.; Mello, R.d.S. NewSQL Through the Looking Glass. In Proceedings of the 21st International Conference on Information Integration and Web-Based Applications & Services, Munich, Germany, 2–4 December 2019; ACM: New York, NY, USA, 2019; pp. 361–369. [Google Scholar] [CrossRef]
- European Parliament and the Council of the European Union. Regulation (EU) 2016/679 (General Data Protection Regulation). Official EUR-Lex Text. 2016. Available online: https://eur-lex.europa.eu/eli/reg/2016/679/oj (accessed on 1 December 2025).
- European Data Protection Board. Recommendations 01/2020 on Measures That Supplement Transfer Tools to Ensure Compliance with the EU Level of Protection of Personal Data, Version 2.0. 2021. Available online: https://www.edpb.europa.eu/system/files/2021-06/edpb_recommendations_202001vo.2.0_supplementarymeasurestransferstools_en.pdf (accessed on 1 December 2025).
- PostgreSQL Global Development Group. PostgreSQL Documentation: Transactions. 2026. Available online: https://www.postgresql.org/docs/current/tutorial-transactions.html (accessed on 1 December 2025).
- Oracle Corporation. Oracle Database Reference: Unified Audit Trail. 2026. Available online: https://www.oracle.com/a/tech/docs/dbsec/unified-audit-best-practice-guidelines.pdf (accessed on 1 December 2025).
- pgAudit Project. pgAudit. 2026. Available online: https://github.com/pgaudit/pgaudit/blob/main/README.md (accessed on 1 December 2025).
- Arnal, J. The Banking Sector Is Increasingly Looking to the Cloud. CEPS Explainer, 2023-12. 2023. Available online: https://cdn.ceps.eu/wp-content/uploads/2023/11/UqkAUXOF-2023-12_CEPS-Explainer-banking-sector-looking-to-the-cloud.pdf (accessed on 1 December 2025).
- Odebrecht, C. Research Data Governance. The Need for a System of Cross-organisational Responsibility for the Researcher’s Data Domain. Data Sci. J. 2025, 24, 12. [Google Scholar] [CrossRef]
- Tun, M.T.; Nyaung, D.E.; Phyu, M.P. Performance Evaluation of Intrusion Detection Streaming Transactions Using Apache Kafka and Spark Streaming. In Proceedings of the 2019 International Conference on Advanced Information Technologies (ICAIT), Yangon, Myanmar, 6–7 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 25–30. [Google Scholar] [CrossRef]
- Luo, C.; Cao, Q.; Li, T.; Chen, H.; Wang, S. MapReduce accelerated attribute reduction based on neighborhood entropy with Apache Spark. Expert Syst. Appl. 2023, 211, 118554. [Google Scholar] [CrossRef]
- Runsewe, O.; Samaan, N. Cloud Resource Scaling for Time-Bounded and Unbounded Big Data Streaming Applications. IEEE Trans. Cloud Comput. 2021, 9, 504–517. [Google Scholar] [CrossRef]
- Westerlund, M.; Hedlund, U.; Pulkkis, G.; Björk, K.M. A Generalized Scalable Software Architecture for Analyzing Temporally Structured Big Data in the Cloud. In Advances in Intelligent Systems and Computing; Springer International Publishing: Berlin/Heidelberg, Germany, 2014; pp. 559–569. [Google Scholar] [CrossRef]
- Rozony, F.Z.; Aktar, M.N.A.; Ashrafuzzaman, M.; Islam, A. A systematic review of big data integration challenges and solutions for heterogeneous data sources. Acad. J. Bus. Adm. Innov. Sustain. 2024, 4, 1–18. [Google Scholar] [CrossRef]
- Putrama, I.M.; Martinek, P. Heterogeneous data integration: Challenges and opportunities. Data Brief 2024, 56, 110853. [Google Scholar] [CrossRef] [PubMed]







| Governance Dimension | RDBMS (Relational Systems) | Big Data (Hadoop/Spark) | Cloud (EU vs Non-EU) |
|---|---|---|---|
| Architectural role | Initial control and governance layer, where data are validated and protected at ingestion | Parallel processing and large-scale audit layer | Elastic analytics layer with controlled geographic replication |
| Governance application stage | At the transaction level, before data leave operational systems | During preprocessing, prior to data distribution across the cluster | Only after pseudonymization and compliance validation |
| Encryption | Fine-grained column-level encryption and/or TDE using standard algorithms (e.g., AES-256) | Encryption in transit and at rest via KMS (HDFS-KMS, TLS) | Mandatory encryption prior to transfer, keys remain on-premises or within the EU |
| Pseudonymization | Deterministic hashing or tokenization directly in the RDBMS engine | Applied in ETL pipelines or Spark jobs before persistence | Accepts only pseudonymized data, especially for Non-EU zones |
| Key management | Internally managed keys, integrated with database security policies | Centralized KMS shared at the cluster level | Keys are not transferred; they remain within the EU in cross-border scenarios |
| Data versioning | Logical versioning via audit tables, triggers, and transaction logs | Native incremental versioning (Delta Lake, Hudi, Iceberg) | Object versioning (S3 Versioning, Blob Snapshots) for replication and rollback |
| Audit and traceability | Deterministic, complete, transaction-level audit | Dataset- and job-level audit and lineage | Access-, replication-, and usage-oriented audit |
| Typical use cases | Sensitive, operational data with strict compliance requirements | Large-scale analytics, batch and streaming processing | Elastic analytics, reporting, and ML on pseudonymized data |
| Main limitations | Limited scalability and high costs at large volumes | Operational complexity and orchestration overhead | Jurisdictional constraints and indirect control over encryption keys |
| Architectural Layer | Managed Data Types | Integration Strategies | Applied Governance Mechanisms | Primary Role |
|---|---|---|---|---|
| Data sources | Raw, transactional, and personal data | Native connectors, APIs, and controlled ingestion | Initial classification and semantic validation | Legal boundary and single point of entry |
| Relational governance (RDBMS) | Personal data | Controlled ETL and CDC | AES-256 encryption, deterministic pseudonymization, and logical audit | Primary legal and operational control |
| Integration layer | Batch and streaming flows | JDBC, ETL, Kafka, and CDC | Sensitive data filtering and separation of personal vs non-personal data | Control of data propagation |
| Big Data platforms | Pseudonymized datasets | Distributed batch/streaming processing | Distributed audit, lineage, and incremental versioning | Analytical traceability and consistency |
| EU Cloud | Pseudonymized data | Controlled replication | Locally managed keys and snapshots | Compliant elastic analytics |
| Non-EU Cloud | Pseudonymized data only | Selective replication | No key access and read-only access | Global scaling without legal risk |
| Reconciliation and metadata | Metadata and versions | Incremental synchronization | Consistency validation, and version correlation | End-to-end integrity |
| Status of Existing Checking Account | Duration (Months) | Credit History | Purpose | Credit Amount | Employment Since | Credit Worthiness |
|---|---|---|---|---|---|---|
| A13 | 52 | A32 | A48 | 6104 | A72 | 2 |
| A14 | 58 | A31 | A45 | 19,673 | A72 | 2 |
| A11 | 16 | A30 | A49 | 13,160 | A75 | 1 |
| A13 | 26 | A31 | A40 | 13,326 | A73 | 2 |
| A13 | 40 | A30 | A45 | 5163 | A71 | 2 |
| A14 | 45 | A32 | A41 | 12,683 | A75 | 2 |
| A11 | 56 | A33 | A44 | 10,397 | A75 | 2 |
| A11 | 44 | A34 | A45 | 1512 | A74 | 2 |
| A13 | 44 | A32 | A46 | 19,066 | A74 | 2 |
| {“Status of existing checking account”:“A13”,“Duration in month”:52,“Credit |
| history”:“A32”,“Purpose”:“A48”,“Credit amount”:6104,“Present employment |
| since”:“A72”,“Installment rate in percentage of disposable income”:1,“Personal status |
| and sex”:“A95”,“Other debtors guarantors”:“A103”,“Present residence |
| since”:1,“Property”:“A124”,“Age in years”:41,“Other installment |
| plans”:“A143”,“Housing”:“A153”,“Number of existing credits at this |
| bank”:1,“Job”:“A171”,“Number of people being liable to provide maintenance |
| for”:1,“Telephone”:“A191”,“Foreign worker”:“A202”,“Creditworthiness”:2} |
| “The customer has a checking account status of A13, with a loan duration of 52 months. The credit history is A32, and the purpose of the loan is A48. The loan amount is 6104 units. The customer has been employed for A72 and has a job categorized as A171. The customer resides in housing type A153 and owns property of type A124. The customer has 1 existing credit(s) and is classified as a foreign worker.” |
| Technology | Data Type | CPU (%) Mean ± SD | Memory (GB) Mean ± SD | Time (s) Mean ± SD | Overhead vs. Baseline |
|---|---|---|---|---|---|
| RDBMS | Structured | (baseline: s) | |||
| Big Data | Structured | (baseline: s) | |||
| Big Data | Semi-structured | (baseline: s) | |||
| Big Data | Unstructured | ≈0 | (baseline: s) | ||
| Cloud | Structured | ≈0 | (baseline: s) | ||
| Cloud | Semi-structured | ≈0 | (baseline: s) | ||
| Cloud | Unstructured | ≈0 | (baseline: s) |
| Technology | Data Type | Baseline Mean ± SD (s) | All Policies Mean ± SD (s) | % | H-Statistic | p-Value |
|---|---|---|---|---|---|---|
| RDBMS | Structured | <0.001 | ||||
| Big Data | Structured | <0.001 | ||||
| Big Data | Semi-structured | <0.001 | ||||
| Big Data | Unstructured | <0.001 | ||||
| Cloud | Structured | <0.001 | ||||
| Cloud | Semi-structured | <0.001 | ||||
| Cloud | Unstructured | <0.001 |
| Data Type | Scenario | Baseline Mean Time (s) | Scenario Mean Time (s) | Time (%) | CPU |
|---|---|---|---|---|---|
| RDBMS | |||||
| Structured | Encryption | 1.862 | 2.547 | +36.8 | +0.426 |
| Structured | Pseudonymization | 1.862 | 2.869 | +54.1 | +0.740 |
| Structured | Versioning | 1.862 | 2.510 | +34.8 | +0.296 |
| Structured | All Policies | 1.862 | 4.302 | +131.0 | +1.535 |
| Big Data | |||||
| Structured | Encryption | 1.893 | 2.006 | +6.0 | +0.117 |
| Structured | Pseudonymization | 1.893 | 2.363 | +24.8 | +0.005 |
| Structured | Versioning | 1.893 | 2.038 | +7.6 | 0.000 |
| Structured | All Policies | 1.893 | 2.536 | +34.0 | +0.123 |
| Semi-structured | Encryption | 3.619 | 5.113 | +41.3 | +1.460 |
| Semi-structured | Pseudonymization | 3.619 | 3.617 | −0.1 | 0.000 |
| Semi-structured | Versioning | 3.619 | 3.757 | +3.8 | 0.000 |
| Semi-structured | All Policies | 3.619 | 5.233 | +44.6 | +1.460 |
| Unstructured | Encryption | 0.385 | 2.228 | +478.3 | +1.152 |
| Unstructured | Pseudonymization | 0.385 | 0.609 | +57.9 | +0.001 |
| Unstructured | Versioning | 0.385 | 0.431 | +11.8 | +0.002 |
| Unstructured | All Policies | 0.385 | 1.749 | +353.9 | +1.118 |
| Cloud | |||||
| Structured | Encryption | 1.962 | 2.116 | +7.9 | +0.114 |
| Structured | Pseudonymization | 1.962 | 2.459 | +25.4 | +0.005 |
| Structured | Versioning | 1.962 | 2.148 | +9.5 | +0.001 |
| Structured | All Policies | 1.962 | 2.614 | +33.2 | +0.123 |
| Semi-structured | Encryption | 4.431 | 6.192 | +39.8 | +1.661 |
| Semi-structured | Pseudonymization | 4.431 | 4.416 | −0.3 | +0.021 |
| Semi-structured | Versioning | 4.431 | 4.581 | +3.4 | +0.027 |
| Semi-structured | All Policies | 4.431 | 6.305 | +42.3 | +1.688 |
| Unstructured | Encryption | 1.818 | 2.758 | +51.7 | +1.171 |
| Unstructured | Pseudonymization | 1.818 | 1.063 | −41.5 | −0.057 |
| Unstructured | Versioning | 1.818 | 1.685 | −7.3 | −0.003 |
| Unstructured | All Policies | 1.818 | 2.966 | +63.2 | +1.138 |
| Governance Policy | Performance Impact | Data Quality Impact | System Stability | Financial Applications |
|---|---|---|---|---|
| No governance (Baseline) | Maximum performance; minimal overhead | High quality; no protection | Very high | Exploratory analytics; internal prototyping |
| Encryption | Moderate CPU overhead; increased execution time | Full confidentiality | High | Core banking systems; regulated reporting |
| Pseudonymization | Moderate overhead; good scalability | Semantic quality preserved | High | Credit scoring; risk analysis |
| Incremental versioning | Moderate increase in execution time | Full traceability and auditability | Medium to high | Audit; compliance; historical reconstruction |
| All policies enabled | High cumulative overhead; scalable in hybrid architectures | Maximum security and data quality | High (RDBMS); medium (Cloud) | Mission-critical financial systems |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ionescu, S.-A.; Diaconita, V.; Radu, A.-O.; Dinca, L.G.; Nagit, I. Deterministic Data Governance in Hybrid Financial Architectures. Electronics 2026, 15, 1716. https://doi.org/10.3390/electronics15081716
Ionescu S-A, Diaconita V, Radu A-O, Dinca LG, Nagit I. Deterministic Data Governance in Hybrid Financial Architectures. Electronics. 2026; 15(8):1716. https://doi.org/10.3390/electronics15081716
Chicago/Turabian StyleIonescu, Sergiu-Alexandru, Vlad Diaconita, Andreea-Oana Radu, Laurentiu Gabriel Dinca, and Ioana Nagit. 2026. "Deterministic Data Governance in Hybrid Financial Architectures" Electronics 15, no. 8: 1716. https://doi.org/10.3390/electronics15081716
APA StyleIonescu, S.-A., Diaconita, V., Radu, A.-O., Dinca, L. G., & Nagit, I. (2026). Deterministic Data Governance in Hybrid Financial Architectures. Electronics, 15(8), 1716. https://doi.org/10.3390/electronics15081716

