Next Article in Journal
Heavy Metal Control and Dry Matter Assessment in Digested Sewage Sludge for Biogas Production
Previous Article in Journal
Reactive Power Compensation for Single-Phase AC Motors Using Integral Power Theory
Previous Article in Special Issue
Enhancing Thermoelectric Performance of Mg3Sb2 Through Substitutional Doping: Sustainable Energy Solutions via First-Principles Calculations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoirs via Machine Learning: Case Study from Hungary †

by
Hawkar Ali Abdulhaq
1,2,*,
János Geiger
2,3,
István Vass
4,
Tivadar M. Tóth
2,
Tamás Medgyes
5,
Gábor Bozsó
2,6,
Balázs Kóbor
5,
Éva Kun
7 and
János Szanyi
2,6
1
Department of Atmospheric and Geospatial Data Sciences, University of Szeged, Egyetem Utca, 2, 6722 Szeged, Hungary
2
Department of Geology, University of Szeged, Egyetem Utca, 2, 6722 Szeged, Hungary
3
GEOCHEM Ltd., 7673 Kővágószőlős, Hungary
4
MOL Hungary, MOL Plc, H-6701 Algyő, SZEAK épület 2.em 207.sz., 6701 Algyő, Hungary
5
SZETAV District Heating Company of Szeged, Vág u. 4, 6724 Szeged, Hungary
6
Geothermal Energy Applied Research Department, University of Szeged, Egyetem utca 2, 6722 Szeged, Hungary
7
Szabályozott Tevékenységek Felügyeleti Hatósága, Alkotás Utca 50, 1123 Budapest, Hungary
*
Author to whom correspondence should be addressed.
This article is a revised and expanded version of a paper published in Abdulhaq, H. Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoir via Machine Learning: Case Study from Hungary. The 16th European Geothermal PhD Days, Szeged, Hungary, 8–11 April 2025.
Energies 2025, 18(10), 2642; https://doi.org/10.3390/en18102642
Submission received: 28 April 2025 / Revised: 14 May 2025 / Accepted: 17 May 2025 / Published: 20 May 2025
(This article belongs to the Special Issue Energy, Engineering and Materials 2024)

Abstract

:
This study presents an innovative approach for repurposing depleted clastic hydrocarbon reservoirs in Hungary as High-Temperature Aquifer Thermal Energy Storage (HT-ATES) systems, integrating numerical heat transport modeling and machine learning optimization. A detailed hydrogeological model of the Békési Formation was built using historical well logs, core analyses, and production data. Heat transport simulations using MODFLOW/MT3DMS revealed optimal dual-well spacing and injection strategies, achieving peak injection temperatures around 94.9 °C and thermal recovery efficiencies ranging from 81.05% initially to 88.82% after multiple operational cycles, reflecting an efficiency improvement of approximately 8.5%. A Random Forest model trained on simulation outputs predicted thermal recovery performance with high accuracy (R2 ≈ 0.87) for candidate wells beyond the original modeling domain, demonstrating computational efficiency gains exceeding 90% compared to conventional simulations. The proposed data-driven methodology significantly accelerates optimal site selection and operational planning, offering substantial economic and environmental benefits and providing a scalable template for similar geothermal energy storage initiatives in other clastic sedimentary basins.

1. Introduction

The end-of-life management of oil and gas wells—including plugging, abandonment, and site remediation—poses significant economic and environmental challenges. In the U.S., median decommissioning costs average USD 76,000 but can exceed USD 1 million for deep wells. In Hungary, abandonment costs typically range from 50 million to several hundred million HUF, depending on well depth and complexity [1,2]. With over 3 million inactive or orphaned wells in the U.S. alone, many pose environmental risks, including methane emissions, which have a global-warming potential 25–34 times that of CO2 [3,4]. In Pennsylvania, legacy wells contribute an estimated 5–8% of anthropogenic methane, prompting stricter plugging regulations [4,5]. Equally important is reclaiming and stabilizing well sites for land safety and ecosystem restoration. Recent advances in high-fidelity reservoir simulation paired with supervised machine learning surrogates—such as artificial neural networks, Random Forests, and Gaussian process emulators—now enable rapid, data-driven screening of thousands of well candidates to pinpoint those most suitable for conversion into geothermal energy assets, effectively transforming environmental liabilities into low-carbon, revenue-generating infrastructure [6,7,8].
Amid these challenges, data-driven revitalization of depleted hydrocarbon fields has emerged as a powerful strategy to tackle renewable-energy intermittency and advance global decarbonization goals [9,10]. Recent policy roadmaps from the International Energy Agency, and the World Economic Forum [11,12,13] all highlight the need for adaptable, seasonal storage solutions that bolster energy security and grid flexibility [14,15]. Repurposing depleted reservoirs directly supports this agenda: their well-characterized permeability, porosity, and extensive historical datasets can be leveraged to transform environmental liabilities into low-carbon, revenue-generating assets [16,17,18]. Diverse pore-space applications—CO2 sequestration [19], hydrogen storage [20], subsurface electricity generation [9], and Aquifer Thermal Energy Storage (ATES) [21,22]—underscore their versatility in a decarbonizing energy landscape. Crucially, advances in high-resolution reservoir modeling and supervised machine learning algorithms now allow rapid screening of thousands of well trajectories, prediction of storage performance under uncertainty, and optimization of injection–production schemes—dramatically reducing time-to-deployment and de-risking investment [23,24,25].
With over 2800 systems worldwide—mainly in the Netherlands and supplying ~2.5 TWh/yr—ATES has proven technically mature and scalable [26,27,28,29]. It uses existing wells and data to store surplus heat for later use, helping balance energy supply and support renewable integration [27,30]. High-Temperature ATES (HT-ATES), targeting fluid temperatures ≥ 90 °C, further extends storage potential. Recent studies have combined coupled reservoir simulations with supervised-learning surrogates—such as neural-network emulators [7], Gaussian process metamodeling [8], and Random Forest regressors [6]—to forecast system performance across diverse geological scenarios in seconds rather than hours. In the Upper Rhine Graben, 90% of surveyed depleted oil fields are HT-ATES-ready, with projected storage of up to 12 GWh/yr and ~82% recovery after ten years [22,24,31]. Field-scale pilots have confirmed its feasibility: the Middenmeer project stored 85–90 °C heat in a 400 m aquifer, using real-time monitoring and MODFLOW/MT3DMS to manage issues like sand production [32,33]. In the U.S., geothermal battery projects in California and Texas repurpose oilfields for long-duration thermal storage, leveraging machine learning surrogates to optimize injection and reduce testing time by over 70% [20,23,24]. These systems offer superior discharge duration over batteries and cut CO2 emissions [27].
Depleted reservoirs offer vast storage potential. The Carrizo-Wilcox aquifer could store 554 TWh of heat—63 TWh as electricity [34]—while the Upper Rhine Graben could supply 10 TWh/yr of heat, covering much of the region’s demand [22]. These capacities far exceed those of pumped hydro and grid batteries [35]. Repurposing wells cuts capital costs, and integration with renewables enables low levelized costs—USD 0.11/kWh for electricity, and USD 0.02/kWh for heat [20,36]. Though round-trip efficiency is 40–50%, geological storage fills a key seasonal balancing role [15]. Although High-Temperature ATES is advancing, substantial uncertainties still cloud the design and operation of dual-well systems in heterogeneous clastic reservoirs—uncertainties that brute-force deterministic simulations cannot efficiently close. Chief among these is selecting the hot–cold well spacing that suppresses premature thermal breakthrough yet preserves a high heat-recovery factor. Thermal breakthrough can erode efficiency and jeopardise long-term viability [37,38]. While ATES research is growing, few studies have formally linked spacing to thermal performance in repurposed hydrocarbon fields with complex stratigraphy [28,39]. Cutting-edge work now couples high-fidelity thermo-hydraulic models with machine learning surrogates—Random Forest, artificial neural networks, and Gaussian process emulators—to scan thousands of spacing scenarios, propagate geological uncertainty, and pinpoint Pareto-optimal dual-well layouts in minutes rather than days [6,7,8]. Yet these data-driven methods have not been systematically deployed in depleted clastic basins [23,24].
This study addresses four tightly linked questions aimed at advancing seasonal thermal energy storage in depleted clastic reservoirs, addressing existing research gaps related to optimal well spacing, geological heterogeneity impacts, and operational scheduling. It first pinpoints the inter-well distance in a dual-well configuration that suppresses premature thermal breakthrough while maximizing heat-recovery efficiency, a critical aspect insufficiently explored in prior research. It then examines how key reservoir attributes in Hungarian clastic formation—porosity, permeability, and lithological anisotropy—shift the optimal spacing and shape of overall system performance. Next, it evaluates which alternating operating schedule—summer heat storage/winter heat production versus summer cold production/winter cold storage—delivers the greatest annual energy return. Finally, it tests whether supervised machine learning models, calibrated on suites of coupled MODFLOW-MT3DMS simulations, can predict and optimize these design variables across the full inventory of candidate wells, thereby slashing computational time by over 90% and accelerating field deployment, which was not systematically demonstrated in heterogeneous clastic reservoirs before. A preliminary version of this study was presented as an abstract at the 16th European Geothermal PhD Days, held in Szeged, Hungary [40].
Hungary is an exceptional testbed for High-Temperature ATES because it (i) hosts a dense network of depleted oil and gas wells situated close to district-heating loads, (ii) lies within a moderate-to-high geothermal gradient province that delivers initial reservoir temperatures of ≈ 70 °C, and (iii) operates under strong national directives for decarbonization and energy-security gains [41,42]. Decades of well log, core, and production records have generated a richly labeled subsurface dataset that can be mined with geostatistics and supervised learning [43]. Leveraging these assets, we integrate ML with high-resolution MODFLOW/MT3DMS heat transport simulations and Random Forest surrogates—an approach shown to cut optimization runtimes by an order of magnitude while preserving predictive accuracy [6,7]. The resulting workflow delivers both mechanistic insights and actionable design rules for dual-well HT-ATES implementation in Hungarian clastic reservoirs.
By coupling high-fidelity MODFLOW-MT3DMS heat transport simulations with Random Forest surrogates trained on thousands of synthetic well trajectories, we cut optimization runtimes by more than 90% while maintaining high predictive skill [7,8]. Feature-importance analysis of these meta-models (R2 ≈ 0.87 on a hold-out set) pinpoints inter-well distance, reservoir anisotropy, and cycle length as the dominant controls on the heat-recovery Factor—insights that translate into practical spacing rules and seasonally phased operating schedules for Hungarian clastic reservoirs [6,7]. Validation against independent datasets from analogous central European basins confirms the transferability of the workflow [43]. Collectively, these results provide a scalable blueprint for converting depleted hydrocarbon assets into long-duration, low-carbon thermal batteries, simultaneously reducing decommissioning liabilities and advancing national energy-security and climate-mitigation goals [12,13].

2. Glossary of Terms

ATES (Aquifer Thermal Energy Storage): A technology for storing and retrieving thermal energy in aquifers, enabling seasonal energy management by injecting heat in summer and recovering it during winter.
HT-ATES (High-Temperature Aquifer Thermal Energy Storage): An advanced form of ATES designed for storage and recovery of thermal energy at fluid temperatures ≥ 90 °C, suitable for industrial and district heating applications.
MODFLOW: A modular three-dimensional finite-difference groundwater flow model developed by the U.S. Geological Survey, widely used for simulating groundwater conditions and flows.
MT3DMS: Modular Three-Dimensional Multi-Species Transport Model, used in conjunction with MODFLOW to simulate the transport of heat, solutes, or other contaminants in groundwater systems.
Random Forest: An ensemble machine learning method based on decision trees, used for regression and classification tasks, valued for its robustness and ability to model complex relationships.
Heat-Recovery Factor (HRF): The ratio of recovered thermal energy to the initially injected energy during an ATES cycle, often used as a performance metric for system efficiency.
Thermal Breakthrough: The phenomenon where injected hot or cold fluid reaches the production well too quickly, reducing system efficiency and potentially shortening operational lifetime.
UCN File (Unformatted Concentration File): A binary output file generated by MODFLOW/MT3DMS containing spatially and temporally resolved simulation results, in this study representing temperature distributions.
Residual Heat Accumulation: The progressive build-up of stored heat in the aquifer over multiple ATES cycles, typically leading to higher thermal recovery efficiencies over time.
Surrogate Model: A fast-running, data-driven model (e.g., Random Forest, neural network) trained on outputs from complex numerical simulations to predict system behavior efficiently.
Hydrogeological Model: A numerical model simulating groundwater flow based on hydraulic and geological parameters to understand subsurface water movement and storage characteristics.
Thermal Recovery Efficiency: The percentage of injected thermal energy that can be successfully recovered during production phases in a seasonal thermal energy storage system.
Overpressure: Subsurface pressure exceeding hydrostatic pressure, often due to geological compaction, tectonic forces, or fluid generation processes, which can influence reservoir behavior.
Pannonian s.l. (sensu lato):
A stratigraphic term referring broadly to the Upper Miocene sedimentary sequences in the Pannonian Basin, including formations such as Újfalu and Zagyva.
Hot Well: A well designated for the injection of heated water during storage periods and extraction during production periods in an ATES system.
Cold Well: A complementary well used to manage temperature balance in an ATES system, typically used for extracting cooler water during storage or injecting cooler water during production, depending on the operating scheme.
Stress Period: A defined time interval in a MODFLOW/MT3DMS simulation during which external stresses (e.g., injection, pumping) are assumed constant or follow a specified pattern.
Permutation Importance: A machine learning method used to assess the relative importance of input features by measuring the decrease in model performance when feature values are randomly shuffled.

3. Background and Regional

3.1. Geological and Hydrogeological Setting of the Békés Basin

The Pannonian Basin is a sedimentary basin located in eastern-central Europe, characterized by a complex geological structure consisting of variously subsided basins and horst-like blocks. The basement primarily comprises metamorphic Paleozoic rocks, with Mesozoic carbonate formations present in some areas that can serve as good aquifers [44]. Within this larger geological context, the Békés Basin represents one of the two main depressions of the Southern Great Plain of Hungary, alongside the Makó Depression, with these two significant depressions divided by the Battonya Ridge [45]. The Békés Basin is particularly notable for its exceptional depth, reaching approximately 7000 m of post-Cretaceous sedimentary fill [46], making it one of the deepest sub-basins within the Pannonian Basin system. Figure 1 shows the location of the study area within Hungary, along with the lithological map highlighting the modeled section of the Békés Formation and a corresponding lithological cross-section.
The stratigraphic sequence of the Békés Basin follows the general pattern of the Pannonian Basin, with important variations in thickness and characteristics. At the beginning of the Lower Pannonian period, the Endrőd Marl Formation was deposited, consisting of calcareous marl and clay marl. This formation is overlain by the fine sand turbidite set of the Szolnok Formation, which reaches several hundred meters in thickness in some locations. Above the turbidites, particularly in shallower basin areas, the hemipelagic marls are covered by the thick clayey–silty layers of the Algyő Formation with a prodelta facies [48]. A key characteristic of this formation sequence is the extremely high overpressure below and throughout the set. The sand content of the Algyő Formation increases in areas with a shallower basement, allowing the upper part of the formation to function as a water-bearing unit in certain locations. Generally, however, the Lower Pannonian formations exhibit poor water-bearing characteristics.
The Pannonian s.l. sequence, which overlies the Lower Pannonian layers, consists primarily of the Újfalu Formation and the Zagyva Formation. The Újfalu Formation, characterized by delta front and delta plain facies, represents the most hydrogeologically significant Pannonian s.l. sediment. The Zagyva Formation features deltaic background and alluvial plain facies, with dominant sediments being bed-filling and bay-mouth bar deposits that demonstrate good water-bearing properties despite their limited horizontal dimensions. These formations are hydrodynamically connected through multiple linear erosions and overlapping [45]. In the Békés Basin region, the bottom of the Pannonian s.l. sequences typically lie at depths of 2000–2500 m from the ground surface, with the total thickness of Pannonian s.l. sediments exceeding 2000 m in the Békés Basin—among the thickest in the entire Pannonian region.

3.2. Hydrodynamic Systems and Pressure Regimes

The Békés Basin, like the broader Carpathian basin, features two distinct flow regimes: an upper, gravity-driven flow system within the Pannonian s.l. sequences, and a deeper, overpressure-driven system within the Lower Pannonian formations, primarily affecting the finer deep-sea sediments and underlying formations [49,50]. The overpressure in the deeper system is remarkably high, reaching up to 40 MPa above hydrostatic pressure. This extreme overpressure primarily results from tectonic compression of the formations, with additional contribution from gas formation during sediment maturation processes [50].
In the Békés Basin region, pressure–depth profiles indicate that the dynamic pressure gradient exceeds the hydrostatic pressure by approximately 0.13 MPa (equivalent to about 13 m hydrostatic head) in the Quaternary formations and by approximately 0.44 MPa (about 44 m hydrostatic head) in the Pannonian s.l. sequence. The Lower Pannonian sequence exhibits even more dramatic super-hydrostatic pressure, with the dynamic pressure gradient exceeding hydrostatic pressure by more than 60 MPa. This significant pressure differential creates complex hydrodynamic conditions that must be carefully considered when designing and implementing any subsurface fluid management system, including seasonal thermal energy storage.

3.3. Reservoir Properties and Geothermal Potential

The Pannonian s.l. sandstone reservoirs in the Békés Basin region exhibit favorable characteristics for geothermal applications, with effective porosity values typically reaching 22–25%. The permeability of these Pannonian s.l. reservoirs, which consist of highly permeable sand layers, can reach up to 2000 mD (1.97 × 10−12 m2), corresponding to a hydraulic conductivity of 5–10 m/day [51,52,53,54]. These values represent some of the most favorable reservoir conditions in the Hungarian geothermal context.
The consolidation state of the sandstone varies depending on depth and cementation processes. The sandstone can range from consolidated to unconsolidated, with cementation typically occurring through quartz overgrowth, calcite, or kaolin precipitation. The degree of cementation significantly influences both porosity and stability, particularly during production and injection operations. Generally, sandstone induration increases with depth as cementitious material precipitates into the pores from fluid extracted during compaction. The sand bodies are typically separated by thinner fine-grained sediments, creating a complex, heterogeneous reservoir structure [51,52].
The Békés Basin is characterized by an exceptionally high geothermal gradient, approximately 50 °C/km, significantly above global averages due to the relatively thin crust beneath the Pannonian Basin [55]. This elevated thermal gradient results in reservoir temperatures around 70 °C at depths of approximately 1500–1800 m, making these formations particularly suitable for thermal energy storage and district heating applications. At greater depths of 2000–2500 m, temperatures can reach 90–120 °C, offering potential for higher-temperature applications.
The combination of favorable reservoir properties (high porosity and permeability) and outstanding geothermal conditions makes the Békés Basin exceptionally suitable for geothermal energy utilization, including seasonal heat storage applications [53]. The region’s depleted hydrocarbon wells, many of which penetrate these favorable Pannonian s.l. sandstone formations, present valuable opportunities for repurposing as components of thermal energy storage systems.

3.4. Hydrocarbon History and Well Infrastructure

The Békés Basin has a rich history of hydrocarbon exploration and production, with extensive drilling activities dating back to the mid-20th century. These activities have resulted in a substantial inventory of wells throughout the region, many of which have now reached the end of their productive lifespan as hydrocarbon producers. The basin contains significant natural gas resources, with gases produced from multiple reservoir intervals at depths ranging from 1800 to 2900 m [46].
These Neogene sedimentary sequences overlying the basement highs have demonstrated the best hydrocarbon reservoir characteristics in the southeastern part of Hungary [44]. The extensive exploration and production history has generated valuable geological and reservoir data, including detailed information on formation properties, temperature profiles, pressure regimes, and fluid characteristics. This wealth of data provides a significant advantage for assessing the potential of these formations for thermal energy storage applications.
The existing well infrastructure, though aging, offers potential for repurposing rather than decommissioning. Many wells have been completed with telescopic designs, with casing diameters ranging from approximately 340 mm (13 3/8″) at the surface to 140–178 mm (5 1/2″–7″) at reservoir depths. While some wells may require workover or partial recompletion to ensure mechanical integrity for long-term thermal storage operations, the basic infrastructure represents a valuable asset that could significantly reduce the capital costs associated with implementing seasonal heat storage systems.

3.5. Relevance to Seasonal Heat Storage

The hydrogeological characteristics of the Békés Basin make it particularly suitable for Aquifer Thermal Energy Storage (ATES) applications, especially High-Temperature ATES (HT-ATES) targeting temperatures up to 90 °C or higher. The Pannonian s.l. sandstone formations, with their high porosity, good permeability, and favorable temperature conditions, provide an excellent medium for seasonal storage and retrieval of thermal energy.
The proximity of many depleted wells to population centers in the region creates opportunities for integrating seasonal heat storage with district heating systems, similar to successful implementations in other parts of Hungary, such as Szeged. The initial reservoir temperatures of approximately 70 °C in the target formations are ideal for enhancement through additional heat input during summer months, with subsequent extraction during winter heating periods.
However, the complex pressure regimes, particularly the significant overpressure in deeper formations, present challenges that must be carefully managed. Additionally, the heterogeneous nature of the reservoir formations, with sand bodies separated by fine-grained sediments, creates potential for compartmentalization that could affect thermal breakthrough patterns between injection and production wells.
The dual-well system proposed for seasonal heat storage—with one well serving for summer heat storage/winter heat production and another for summer cold production/winter cold storage—must be carefully designed to account for these hydrogeological characteristics. The optimal spacing between wells must balance the need to prevent premature thermal breakthrough while maximizing energy recovery efficiency, taking into consideration the specific reservoir properties of the Békés Basin formations.
By leveraging the extensive geological knowledge, existing well infrastructure, and favorable reservoir conditions of the Békés Basin, seasonal heat storage systems can be optimized to provide sustainable, efficient thermal energy solutions while extending the productive life of otherwise abandoned hydrocarbon assets.

4. Materials and Methods

4.1. Methodological Framework

The methodological framework of this study began with the comprehensive curation and preparation of existing data from abandoned hydrocarbon fields, which involved extensive data modeling, cleaning, and refinement. A hydrogeological model was then constructed using MODFLOW [56], providing the foundational flow and transport parameters needed for subsequent analyses. Building on this, a heat transport model was employed to simulate thermal performance in potential Underground Thermal Energy Storage (UTES) candidates. The resulting simulation outputs served as the training dataset for a Random Forest algorithm [30,57], which was designed to predict thermal performance in other areas. Finally, the predictions generated by the Random Forest were validated against actual simulation results. Figure 2 outlines the primary steps involved in this integrated approach.

4.2. Data Collection and Data Preparation

For the data collection segment of our study, we utilized data from the southern part of the Bekes Basin in Hungary—a site formerly exploited for hydrocarbons and now recognized for its geothermal potential [58]. This field encompasses two key formations: the shallow Szolnok Formation, which functions as an aquifer [59], and the deeper Bekes Formation. These provide a unique opportunity to evaluate heat storage capabilities. We integrated a diverse set of data—including core samples, density logs, resistivity logs, and gamma ray measurements—using a stochastic simulation process. SGeMS was employed for the geostatistical simulation, while RockWorks facilitated the integration of simulation outcomes for gross thickness, effective porosity, and permeability. A total of 100 stochastic realizations were generated for each grid or voxel point, with the median (Md-type estimation) calculated to represent the central tendency, effectively minimizing the impact of skewed or outlier values. For the Bekes Formation, this Md-type estimation was deemed most representative of the expected geological parameters.
In a prior study by Abdulhaq et al. [47], the southeastern section of the study area was identified as a prime candidate for energy storage due to the Bekes Formation’s average temperature of approximately 70 °C, coupled with its favorable porosity and permeability characteristics. Based on these insights, a polygon delineating this promising area was selected for detailed hydrogeological and heat transport simulations. Figure 3 illustrates the distances from each well to the candidate town, with only those wells penetrating the Bekes Formation included in the analysis—wells that did not extend into the Bekes were excluded. The figure also displays the temperature distribution within the Bekes Formation. By applying a 5 km threshold for effective thermal transport and selecting wells with temperatures below 70 °C as potential candidates for HT-ATES, only wells marked with red borders were retained for further simulation of their thermal performance (Figure 4).

4.3. Hydrogeological Modeling

The groundwater flow within the Bekesi Conglomerate reservoir was modeled using MODFLOW-2000, a modular finite-difference groundwater flow modeling software developed by the U.S. Geological Survey (Version 1.19.01, 25 March 2010). The groundwater flow model was developed using MODFLOW and discretized into three layers, 197 rows, and 400 columns with a cell size of 10 by 10 m. In this configuration, the second layer represents the Bekes Formation. The model parameters were defined at either the cell-by-cell or grid scale, based on data-driven simulations and assumptions derived from field data and laboratory analyses. Table 1 summarizes the key parameters used in the MODFLOW processing model.

4.4. Heat Transport Modeling

For heat transport modeling, we employed MT3DMS within the GMS framework. Following the successful initiation and simulation of the MODFLOW model, we activated the Basic Transport Package in MT3DMS by introducing the starting temperature for each cell. To simulate the heat transport processes, we selected the advection, dispersion, source/sink mixing, and chemical reaction modules. The parameters utilized for this heat transport model are listed below (Table 2).

4.5. Simulation Setting

To replicate ATES operations, the stress periods were structured into one month of system downtime, followed by five months dedicated to heat storage, another one-month break, and five months allocated for heat production, repeating this cycle over a seven-year period. A pair of wells was selected where the distance between them is more than 500 m: one well for heat injection in summer and subsequent heat production in winter, and the other well for cold production in summer with cold injection during winter.

4.6. Training Data for Machine Learning

To ensure the integrity and consistency of our training dataset, we developed an in-house data entry module using Streamlit (version 1.45.1) that integrates both manual input and CSV-based uploads. This module facilitates the incorporation of critical well parameters—including well ID, name, spatial coordinates, porosity, permeability, gamma ray measurements, thickness, distance to the cold well, and initial temperature—while also allowing for the efficient addition of temporal temperature data. For the temperature data, the module supports file uploads in CSV format, validates the presence of required columns (TimeDays, WellType, Row, Col, Temperature), and links the data to the corresponding well ID via an interactive selection box (Figure 5). The system provides real-time previews and feedback, ensuring that all data are consistently and accurately stored in the database.
The training process involves leveraging a SQLite (version 3.49.2) database that consolidates data from two tables—one containing well properties and the other recording temperature measurements. First, the data from both sources are merged to form a comprehensive DataFrame, which includes attributes such as porosity, permeability, gamma ray, thickness, distance to cold wells, initial temperatures, and the time variable, while spatial reference columns (Row, Col) are retained only for reference. After cleaning the dataset by removing any rows with missing values, the data are split into training and testing subsets. A Random Forest regressor is then trained on the assembled features (excluding the spatial reference columns) to predict well temperatures, with performance evaluated through metrics including MAE, RMSE, and R2. We selected the Random Forest (RF) algorithm due to its robustness in handling nonlinear interactions among geospatial and geological parameters, its low tendency to overfit, and its ability to provide clear feature importance rankings. These characteristics make RF well-suited for datasets like ours, which combine moderate dimensionality with partially correlated variables. Compared to alternatives such as Support Vector Regression or neural networks, RF offers a practical balance between predictive accuracy, interpretability, and ease of tuning, especially in the context of thermal reservoir behavior. Finally, the trained model is serialized and saved to a predefined path, ensuring reproducibility and ease of deployment within our predictive framework (Figure 5).
The prediction phase leverages a pre-trained Random Forest model to forecast temperature evolution for all available wells over a specified time range. In this stage, users interact with a Streamlit-based interface where they define the starting time, ending time, and time step for the predictions. The system loads the trained model and, for each well in the database, constructs an input DataFrame populated with static parameters such as porosity, permeability, gamma ray, thickness, distance to the cold well, and initial temperature, while dynamically varying the time parameter. The model then predicts the temperature for each time step, and the results are combined into a unified dataset. To facilitate analysis, the predicted curves for individual wells are plotted on a single graph—each curve clearly labeled with its corresponding WellID—providing a comprehensive visualization of thermal performance over time. The complete repository of the scripts is available online; however, the data content is not included [65].

4.7. Model Calibration and Validation

Calibration and validation of our modeling approach involved two complementary processes. Initially, the MODFLOW-MT3DMS hydrogeological model was calibrated against historical hydraulic head data, ensuring accurate representation of aquifer dynamics [66]. This process included adjustments to hydraulic conductivity and storage coefficients to align the simulated results with observed measurements.
Concurrently, the Random Forest (RF) model was iteratively refined using data generated from the calibrated simulations. We systematically optimized key RF hyperparameters via grid-search cross-validation to achieve robust and stable predictions. Table 3 summarizes the hyperparameters tuned during this calibration process.
This calibration methodology enhanced the reliability and consistency of the RF-based predictive framework, resulting in improved alignment between predicted and simulated temperature profiles [23].

4.8. Sensitivity Analysis

A sensitivity analysis was conducted to quantify and visualize the influence of each input feature on temperature prediction outcomes using the Random Forest (RF) model. The permutation importance method [57], implemented via the Python library scikit-learn (Version 1.4.2), was employed for this purpose. Figure 6 illustrates the relative feature importances derived from this analysis.
The analysis highlights that the temporal parameter (“TimeDays”) overwhelmingly drives model predictions, with a permutation importance significantly greater than other features. This result emphasizes the critical role of cumulative cycle duration and heat accumulation dynamics in controlling reservoir thermal performance. In comparison, geological parameters such as Initial Temperature, Thickness, Gamma Ray, and Permeability had substantially lower relative importance, reflecting a secondary role in influencing predictions under the given model conditions. Porosity and DistanceToColdWell displayed negligible or no measurable influence, suggesting these features either play minimal roles or were insufficiently represented within the dataset [67]. This visual and quantitative identification of critical model inputs provides clear guidance for prioritizing data collection and model refinement in future studies.

5. Results

5.1. Heat Simulation Result

The output of the heat simulation is stored in a UCN file, which encapsulates the final temperature distribution computed by the model [68]. To efficiently leverage this output, we developed an in-house Python module designed to load, process, and analyze UCN files (Figure 7). This module automatically identifies hot and cold wells and allows users to designate observation wells within the simulation setting. Users can load the UCN file and select specific layers of interest, while the integrated visualization sub-module generates plots of temperature versus time for defined stress periods, providing clear insights into the thermal performance of individual wells. Additionally, an animation sub-module enables dynamic playback of temperature evolution by allowing adjustable speeds and frame skips, thereby enhancing interpretability. Furthermore, a dedicated recovery efficiency sub-module computes the thermal recovery efficiency for any simulated wells. The complete suite of module scripts is available online and can be accessed independently of the dataset [69].
In our thermal simulation studies, two sets of candidate results were obtained, both exhibiting robust injection performance with maximum temperatures consistently around 94.9 °C across cycles (Figure 8). In the first candidate, the break phases—representing reservoir conditions when the system strikes—showed a gradual increase in maximum temperatures from 79.67 °C up to 84.33 °C, with thermal recovery efficiency improving from 83.92% to 88.82%. In the second candidate, although the initial break phase efficiency was lower at 81.05%, a notable enhancement was observed, with the efficiency rising to a maximum of 87.93% over repeated cycles. Key performance metrics for this candidate include a ratio of last-to-first efficiency of 1.08, a percent increase of 8.50% from the initial break phase, an average efficiency of 85.46%, and a slope of 1.04 per cycle, indicating a steady improvement with each cycle. Importantly, while these reservoir-level efficiency improvements are significant, the production efficiency during winter remains even higher than these baseline values, ensuring superior operational performance. Together, these observations demonstrate that repeated cycles of heat injection and reservoir cooling not only stabilize the thermal regime but also enhance both the inherent recovery and the actual production efficiency during winter operations.

5.2. Machine Learning Result

When initiating temperature performance predictions for candidate wells, we generated thermal efficiency results for multiple wells, predominantly located outside the originally modeled area. This indicates that our predictive approach is effective even beyond the immediate boundaries of our hydrogeological model, highlighting the potential for broader applicability (Figure 9). Analysis of these external candidates reveals initial efficiencies ranging between approximately 80% and 84%, with a consistent improvement observed through repeated injection and recovery cycles. Specifically, efficiencies showed an average of 85.46%, and maximum values reached up to 88.84%. The ratio of final-to-initial efficiency averaged around 1.05, corresponding to an overall efficiency improvement of up to 8.01%. Furthermore, the observed positive slopes (up to 1.0 per cycle) clearly illustrate that efficiency systematically increases over successive cycles. These findings underscore the reliability and robustness of our predictive methodology in forecasting thermal performance beyond the initially calibrated region, as visualized in Figure 10, which displays a distribution map of each well and the corresponding predicted improvement in thermal performance.

6. Discussion

6.1. Alignment with Previous Studies and Theoretical Outcomes

Our simulation and machine learning outcomes align closely with prior theoretical and empirical findings related to High-Temperature Aquifer Thermal Energy Storage (HT-ATES). Previous studies indicate a typical improvement in thermal recovery efficiency with each successive injection–production cycle, attributed primarily to residual heat accumulation within the aquifer [70,71]. Our simulation results reflect similar trends, with initial efficiencies around 80–84%, gradually increasing to as high as approximately 88%. This progressive efficiency improvement closely mirrors published benchmarks from international case studies, which typically report HT-ATES efficiencies stabilizing in the range of 60–80% after several operational cycles [70,72].
The stability of our modeled temperature curves over multiple years also echoes the theoretical predictions, which suggest that a thermal equilibrium or steady state emerges after multiple cycles [73]. However, subtle contrasts exist; for instance, our simulations may reflect idealized conditions, potentially omitting complexities such as density-driven convection or significant vertical heat migration reported in heterogeneous clastic reservoirs [72]. This highlights a limitation where idealized models may yield slightly optimistic efficiency and thermal stability predictions compared to more complex real-world scenarios.

6.2. Key Influencing Parameters

Our machine learning analysis identified time (’TimeDays’) as the most influential parameter on temperature and efficiency predictions, surpassing all other geological and operational parameters by a substantial margin. Specifically, the permutation importance of time was approximately 1.96, whereas other parameters like initial temperature, thickness, gamma ray, and permeability demonstrated notably lower impacts (0.034, 0.029, 0.006, and 0.006, respectively, see Figure 11). This finding emphasizes that temporal factors—specifically, cumulative cycle duration and residual heat buildup—predominantly govern thermal performance, consistent with the literature where residual thermal energy strongly influences long-term operational outcomes [71].

6.3. Strengths and Limitations of Hydrogeological Model Calibration

Calibrating the hydrogeological model using historical head data provides several advantages, chiefly realistic and site-specific insights into subsurface dynamics. However, this method’s main limitation lies in the necessity to modify boundary conditions to match historical measurements adequately, potentially introducing bias or oversimplification. Adjustments made to boundary conditions, while essential for aligning simulations with empirical data, might restrict the generalizability of the hydrogeological model to conditions significantly different from historical scenarios.

6.4. Enhancing Decision-Making for UTES Site Selection

Our predictive methodology substantially accelerates the decision-making process for selecting potential UTES sites by providing rapid, high-quality predictive outcomes derived from historical data and simulations. By transforming abandoned hydrocarbon reservoirs into a data-driven analytical framework, our approach reduces evaluation times and increases confidence in site assessments. This can significantly streamline site selection workflows, particularly valuable in regions with numerous abandoned wells and substantial historical datasets.

6.5. Implications for Scaling Geothermal Storage Projects

The successful application of this predictive approach using gamma ray logs to differentiate rock types indicates substantial potential for scaling geothermal storage projects, particularly within clastic sedimentary basins. Internationally, numerous clastic basins exist with similar sedimentological characteristics, making this methodology widely applicable. This capability facilitates rapid assessments across varied geographies, promoting efficient, scalable deployment of HT-ATES technologies globally.

6.6. Assumptions and Simplifications

Key assumptions and simplifications within our modeling framework include idealized boundary conditions, homogeneous or simplified geological heterogeneities, and consistent thermal properties across the model domain. Such assumptions likely influence the accuracy and predictive power of the model, potentially limiting its applicability to real-world scenarios with pronounced geological complexity or significantly variable hydrogeological conditions.

6.7. Uncertainties in Input Data

A major uncertainty arises from the inherent accuracy and reliability of the initial simulation data used for machine learning training. Because our predictive model relies heavily on accurate thermal simulations, any errors or oversimplifications in the initial simulations propagate through the predictions, potentially affecting reliability. This highlights the critical importance of accurate and comprehensive simulation input data.

6.8. Recommendations and Future Work

Future studies should focus on enhancing simulation fidelity, incorporating more detailed heterogeneity, and conducting diverse scenario sampling to bolster the robustness of the machine learning model. Additionally, exploring advanced physics-informed machine learning techniques could significantly improve predictive accuracy and generalizability. Extending simulation runs, including more diverse geological and operational parameters, and conducting further validation with independent field data would further strengthen confidence in our model predictions and their applicability to broader contexts.

7. Conclusions

This study set out to advance the design and deployment of high-temperature Aquifer Thermal Energy Storage (HT-ATES) in depleted clastic reservoirs through four interlinked research questions. First, by coupling high-resolution MODFLOW-MT3DMS heat transport simulations with an in-house Python toolkit, we identified the optimal inter-well spacing that minimizes premature thermal breakthrough while maximizing cumulative heat-recovery efficiency. Our two candidate well pairs consistently achieved peak injection temperatures near 94.9 °C and demonstrated steady efficiency gains—up to 8.5% over repeated cycles—when spaced to balance thermal front propagation and lateral heat recharge.
Second, we quantified how key reservoir properties in Hungarian clastic formations modulate this optimal spacing and overall system performance. Sensitivity analysis revealed that porosity and permeability variations shift thermal breakthrough timing and adjust peak recovery efficiencies by several percentage points, while anisotropy primarily affects the shape of the thermal plume. These insights enable tailoring well spacing to site-specific hydraulic and thermal heterogeneities.
Third, comparison of seasonal operating schedules confirmed that the conventional heat storage/winter production cycle yields marginally higher annual energy returns than the inverse (cold-production/winter storage), owing to more effective residual-heat carryover. Specifically, winter production efficiencies exceeded baseline recovery values, underscoring the value of aligning storage and demand cycles to ambient temperature differentials.
Finally, our Random Forest surrogate models—trained on several coupled simulations—proved capable of predicting thermal recovery efficiency across a broad inventory of candidate wells outside the original calibration domain. Surrogate predictions achieved average accuracies within 2–3% of full numerical simulations, with efficiency improvements up to 8% over multiple cycles. This machine learning workflow accelerates design optimization by an order of magnitude, enabling rapid screening of sites and operational schedules.
Collectively, these findings deliver a data-driven framework for HT-ATES implementation in depleted clastic reservoirs, including mechanistic insights into spacing and scheduling trade-offs, parameter-specific performance adjustments, and a scalable surrogate modeling approach for design optimization. Future work should extend this framework by incorporating more complex heterogeneities, exploring physics-informed learning techniques, and validating predictions against field pilot data to further refine site selection criteria and operational guidelines.

Author Contributions

Conceptualization, H.A.A., J.G., T.M.T. and J.S.; Methodology, H.A.A. and J.S.; Software, H.A.A. and J.G.; Validation, H.A.A. and I.V.; Formal analysis, H.A.A.; Investigation, I.V. and J.S.; Resources, J.S.; Data curation, H.A.A., J.G. and É.K.; Writing—original draft, H.A.A.; Writing—review & editing, H.A.A.; Visualization, H.A.A.; Supervision, J.S.; Project administration, I.V., G.B. and B.K.; Funding acquisition, T.M. and J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. The APC was funded by SZETÁV District Heating Company of Szeged 6724, provided by Tamás Medgyes and Balázs Kobor.

Data Availability Statement

The original contributions presented in the study are included in the article. The raw data supporting the conclusions of this article are available from the corresponding author upon request, but access is restricted due to the privacy and internal policy of MOL Company.

Acknowledgments

I acknowledge the invaluable data contribution from the MOL database, which has been essential for this research. I also thank all co-authors for their significant contributions and collaborative efforts throughout this project. All individuals mentioned have provided their consent to be acknowledged. The article processing charge (APC) was funded by SZETÁV District Heating Company of Szeged, with support from Tamás Medgyes and Balázs Kobor. Additionally, OpenAI’s ChatGPT was used to assist with drafting, revising, and refining parts of the manuscript and supporting materials, particularly in the development and documentation of the machine learning model. All intellectual contributions and final decisions remain those of the authors.

Conflicts of Interest

Author János Geiger was employed by the company GEOCHEM Ltd. Author István Vass was employed by the company MOL Hungary. Authors Tamás Medgyes and Balázs Kóbor were employed by the company SZETAV District Heating Company of Szeged. These affiliations are disclosed in accordance with company policies. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Raimi, D.; Krupnick, A.J.; Shah, J.-S.; Thompson, A. Decommissioning Orphaned and Abandoned Oil and Gas Wells: New Estimates and Cost Drivers. Environ. Sci. Technol. 2021, 55, 10224–10230. [Google Scholar] [CrossRef] [PubMed]
  2. Vass, I. (MOL Hungary, Algyő, Hungary). Well Abandonment Cost Estimates for Hungary. Personal Communication. 2025. [Google Scholar]
  3. IPCC. Climate Change Widespread, Rapid, and Intensifying—IPCC—IPCC. 2021. Available online: https://www.ipcc.ch/2021/08/09/ar6-wg1-20210809-pr/ (accessed on 10 May 2025).
  4. Kang, M.; Mauzerall, D.L.; Ma, D.Z.; Celia, M.A. Reducing methane emissions from abandoned oil and gas wells: Strategies and costs. Energy Policy 2019, 132, 594–601. [Google Scholar] [CrossRef]
  5. Osundare, O.; Teodoriu, C.; Falcone, G.; Ichim, A. Estimation of Plugging and Abandonment Costs Based on Different EU Regulations with Application to Geothermal Wells. In Proceedings of the 43rd Workshop on Geothermal Reservoir Engineering, Stanford, CA, USA, 12–14 February 2018. [Google Scholar]
  6. Duplyakin, D.; Beckers, K.F.; Siler, D.L.; Martin, M.J.; Johnston, H.E. Modeling Subsurface Performance of a Geothermal Reservoir Using Machine Learning. Energies 2022, 15, 3. [Google Scholar] [CrossRef]
  7. Jin, W.; Atkinson, T.A.; Doughty, C.; Neupane, G.; Spycher, N.; McLing, T.L.; Dobson, P.F.; Smith, R.; Podgorney, R. Machine-learning-assisted high-temperature reservoir thermal energy storage optimization. Renew. Energy 2022, 197, 384–397. [Google Scholar] [CrossRef]
  8. Rohmer, J.; Armandine Les Landes, A.; Loschetter, A.; Maragna, C. Fast prediction of aquifer thermal energy storage: A multicyclic metamodelling procedure. Comput. Geosci. 2023, 27, 223–243. [Google Scholar] [CrossRef]
  9. Duggal, R.; Rayudu, R.; Hinkley, J.; Burnell, J.; Wieland, C.; Keim, M. A comprehensive review of energy extraction from low-temperature geothermal resources in hydrocarbon fields. Renew. Sustain. Energy Rev. 2022, 154, 111865. [Google Scholar] [CrossRef]
  10. Gayayev, I. Conversion of Abandoned Hydrocarbon Structures into Geothermal Wells for Sustainable Energy Production in Sedimentary Basins [Laurea, Politecnico di Torino]. 2023. Available online: https://webthesis.biblio.polito.it/29039/ (accessed on 10 May 2025).
  11. IEA. Energy Technology Perspectives 2017—Analysis; IEA: Paris, France, 2017; Available online: https://www.iea.org/reports/energy-technology-perspectives-2017 (accessed on 10 May 2025).
  12. REN21. Renewables 2019 Global Status Report. 2019. Available online: https://www.ren21.net/gsr-2019 (accessed on 10 May 2025).
  13. WEF. 5 Green Energy Milestones from Around the World. World Economic Forum. 2021. Available online: https://www.weforum.org/agenda/2021/04/renewables-record-capacity-solar-wind-nuclear/ (accessed on 10 May 2025).
  14. Dincer, I.; Rosen, M.A. Thermal Energy Storage: Systems and Applications, 2nd ed.; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar]
  15. Van Der Roest, E.; Beernink, S.; Hartog, N.; Van Der Hoek, J.P.; Bloemendal, M. Towards Sustainable Heat Supply with Decentralized Multi-Energy Systems by Integration of Subsurface Seasonal Heat Storage. Energies 2021, 14, 7958. [Google Scholar] [CrossRef]
  16. Green, S.; McLennan, J.; Panja, P.; Kitz, K.; Allis, R.; Moore, J. Geothermal battery energy storage. Renew. Energy 2021, 164, 777–790. [Google Scholar] [CrossRef]
  17. Lee, K.S. Underground Thermal Energy Storage; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  18. Li, G. Sensible heat thermal storage energy and exergy performance evaluations. Renew. Sustain. Energy Rev. 2016, 53, 897–923. [Google Scholar] [CrossRef]
  19. Qin, J.; Song, J.; Tang, Y.; Rui, Z.; Wang, Y.; He, Y. Well applicability assessment based on fuzzy theory for CO2 sequestration in depleted gas reservoirs. Renew. Energy 2023, 206, 239–250. [Google Scholar] [CrossRef]
  20. Zhu, S.; Shi, X.; Yang, C.; Bai, W.; Wei, X.; Yang, K.; Li, P.; Li, H.; Li, Y.; Wang, G. Site selection evaluation for salt cavern hydrogen storage in China. Renew. Energy 2024, 224, 120143. [Google Scholar] [CrossRef]
  21. Matos, C.R.; Carneiro, J.F.; Silva, P.P. Overview of Large-Scale Underground Energy Storage Technologies for Integration of Renewable Energies and Criteria for Reservoir Identification. J. Energy Storage 2019, 21, 241–258. [Google Scholar] [CrossRef]
  22. Stricker, K.; Grimmer, J.C.; Egert, R.; Bremer, J.; Korzani, M.G.; Schill, E.; Kohl, T. The Potential of Depleted Oil Reservoirs for High-Temperature Storage Systems. Energies 2020, 13, 24. [Google Scholar] [CrossRef]
  23. Khosravi, R.; Simjoo, M.; Chahardowli, M. A new insight into pilot-scale development of low-salinity polymer flood using an intelligent-based proxy model coupled with particle swarm optimization. Sci. Rep. 2024, 14, 29000. [Google Scholar] [CrossRef]
  24. Liu, A.; Li, J.; Bi, J.; Chen, Z.; Wang, Y.; Lu, C.; Jin, Y.; Lin, B. A novel reservoir simulation model based on physics informed neural networks. Phys. Fluids 2024, 36, 116617. [Google Scholar] [CrossRef]
  25. Menear, K.; Duplyakin, D.; Oliver, M.C.; Shah, M.; Martin, M.J.; Martinek, J.; Nithyanandam, K.; Ma, Z. One System, Many Models: Designing a Surrogate Model for Sulfur Thermal Energy Storage; National Renewable Energy Laboratory (NREL): Golden, CO, USA, 2023. [Google Scholar]
  26. Dickinson, J.S.; Buik, N.; Matthews, M.C.; Snijders, A. Aquifer thermal energy storage: Theoretical and operational analysis. Géotechnique 2009, 59, 249–260. [Google Scholar] [CrossRef]
  27. Fleuchaus, P.; Schüppler, S.; Bloemendal, M.; Guglielmetti, L.; Opel, O.; Blum, P. Risk analysis of High-Temperature Aquifer Thermal Energy Storage (HT-ATES). Renew. Sustain. Energy Rev. 2020, 133, 110153. [Google Scholar] [CrossRef]
  28. Kastner, O.; Norden, B.; Klapperer, S.; Park, S.; Urpi, L.; Cacace, M.; Blöcher, G. Thermal solar energy storage in Jurassic aquifers in Northeastern Germany: A simulation study. Renew. Energy 2017, 104, 290–306. [Google Scholar] [CrossRef]
  29. Van Heekeren, V.; Bakema, G. The Netherlands Country Update on Geothermal Energy. In Proceedings of the Stichting Platform Geothermie, World Geothermal Congress, Melbourne, Australia, 19–25 April 2025. [Google Scholar]
  30. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  31. Holstenkamp, L.; Meisel, M.; Neidig, P.; Opel, O.; Steffahn, J.; Strodel, N.; Lauer, J.J.; Vogel, M.; Degenhart, H.; Michalzik, D.; et al. Interdisciplinary Review of Medium-deep Aquifer Thermal Energy Storage in North Germany. Energy Procedia 2017, 135, 327–336. [Google Scholar] [CrossRef]
  32. HEATSTORE. 2025. Available online: https://www.heatstore.eu/national-project-netherlands.html (accessed on 10 May 2025).
  33. Oerlemans, P.; Drijver, B.; Koenen, M.; Koornneef, J.; Dinkelman, D.; Godschalk, B. First field results on the technical risks and effectiveness of mitigation measures for the full scale HT-ATES demonstration project in Middenmeer. In Proceedings of the European Geothermal Congress 2022, Berlin, Germany, 17–21 October 2022. [Google Scholar]
  34. Akindipe, D.; McTigue, J.; Dobson, P.; Atkinson, T.; Witter, E.; Kumar, R.; Sonnenthal, E.; Umbro, M.; Lederhos, J.; Adams, D.; et al. Techno-Economic Analysis and Market Potential of Geological Thermal Energy Storage (GeoTES) Charged With Solar Thermal and Heat Pumps; (NREL/TP–5700-91225, 2474842, MainId:93003; p. NREL/TP--5700-91225, 2474842, MainId:93003); National Renewable Energy Laborator: Golden, CO, USA, 2024. [Google Scholar] [CrossRef]
  35. IEA. World Energy Outlook 2021—Analysis; IEA: Paris, France, 2021; Available online: https://www.iea.org/reports/world-energy-outlook-2021 (accessed on 10 May 2025).
  36. Anttila, A. Techno-Economic Comparison of Thermal Energy Storage Solutions for Decarbonizing Heat in Espoo by 2025. 2021. Available online: https://aaltodoc.aalto.fi/server/api/core/bitstreams/a65b9c93-33d1-4975-8a97-af3fdf3d39b1/content (accessed on 10 May 2025).
  37. Bloemendal, M.; Hartog, N. Analysis of the impact of storage conditions on the thermal recovery efficiency of low-temperature ATES systems. Geothermics 2018, 71, 306–319. [Google Scholar] [CrossRef]
  38. Sommer, W.; Valstar, J.; van Gaans, P.; Grotenhuis, T.; Rijnaarts, H. The impact of aquifer heterogeneity on the performance of aquifer thermal energy storage. Water Resour. Res. 2013, 49, 8128–8138. [Google Scholar] [CrossRef]
  39. Pellegrini, M.; Bloemendal, M.; Hoekstra, N.; Spaak, G.; Andreu Gallego, A.; Rodriguez Comins, J.; Grotenhuis, T.; Picone, S.; Murrell, A.J.; Steeman, H.J. Low carbon heating and cooling by combining various technologies with Aquifer Thermal Energy Storage. Sci. Total Environ. 2019, 665, 1–10. [Google Scholar] [CrossRef]
  40. Abdulhaq, H. Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoir via Machine Learning: Case Study from Hungary. The 16th European Geothermal PhD Days Book of Abstracts. 2025. Available online: https://www.egpd2025.com (accessed on 15 April 2025).
  41. Nádor, A.; Kujbus, A.; Tóth, A. Geothermal Energy Use, Country Update for Hungary. Eur. Geotherm. Congr. 2022, 2022, 1–13. [Google Scholar]
  42. Szanyi, J.D.; Kovács, B.; Abdulhaq, H.A. Harnessing geothermal energy in Hungary. Geol. Soc. Lond. Spec. Publ. 2025, 555, SP555-2024. [Google Scholar] [CrossRef]
  43. Topór, T.; Słota-Valim, M.; Kudrewicz, R. Assessing the Geothermal Potential of Selected Depleted Oil and Gas Reservoirs Based on Geological Modeling and Machine Learning Tools. Energies 2023, 16, 13. [Google Scholar] [CrossRef]
  44. Horváth, F.; Musitz, B.; Balázs, A.; Végh, A.; Uhrin, A.; Nádor, A.; Koroknai, B.; Pap, N.; Tóth, T.; Wórum, G. Evolution of the Pannonian basin and its geothermal resources. Geothermics 2015, 53, 328–352. [Google Scholar] [CrossRef]
  45. Juhász, G. Lithostratigraphical and sedimentological framework of the Pannonian (sl) sedimentary sequence in the Hungarian Plain (Alföld), Eastern Hungary. Acta Geol. Hung. 1991, 34, 53–72. [Google Scholar]
  46. U.S. Geological Survey. Mineral Commodity Summaries 2023; U.S. Geological Survey: Reston, VA, USA, 2023. [Google Scholar] [CrossRef]
  47. Abdulhaq, H.A.; Geiger, J.; Vass, I.; Tóth, T.M.; Medgyes, T.; Szanyi, J. Transforming Abandoned Hydrocarbon Fields into Heat Storage Solutions: A Hungarian Case Study Using Enhanced Multi-Criteria Decision Analysis–Analytic Hierarchy Process and Geostatistical Methods. Energies 2024, 17, 16. [Google Scholar] [CrossRef]
  48. Haas, J. (Ed.) Geology of Hungary; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar] [CrossRef]
  49. Mádl-Szőnyi, J.; Tóth, J. A hydrogeological type section for the Duna-Tisza Interfluve, Hungary. Hydrogeol. J. 2009, 17, 961–980. [Google Scholar] [CrossRef]
  50. Tóth, J.; Almási, I. Interpretation of observed fluid potential patterns in a deep sedimentary basin under tectonic compression: Hungarian Great Plain, Pannonian Basin. Geofluids 2001, 1, 11–36. [Google Scholar] [CrossRef]
  51. Bálint, A.; Szanyi, J. A half century of reservoir property changes in the Szentes geothermal field, Hungary. Cent. Eur. Geol. 2015, 58, 28–49. [Google Scholar] [CrossRef]
  52. Korim, K. A szentesi hévízmező feltárásának és termelésének három évtizedes története (Three decades of research and utilization in the Szentes Geothermal Field). Bányászati és Kohászati Lapok 1991, 124, 179–184. [Google Scholar]
  53. Szanyi, J.; Kovacs, B.; Scharek, P. Geothermal energy in Hungary: Potentials and barriers. Eur. Geol. 2009, 27, 15–19. [Google Scholar]
  54. Szanyi, J.; Medgyes, T.; Kóbor, B.; Pál-Molnár, E. Technologies of Injection into Sandstone Reservoirs: Best Practices, Case Studies; GeoLitera; Institute of Geosciences, University of Szeged: Szeged, Hungary, 2015; Available online: https://publicatio.bibl.u-szeged.hu/13471/ (accessed on 10 May 2025).
  55. Lenkey, L.; Mihályka, J.; Paróczi, P. Review of geothermal conditions of Hungary. Földtani Közlöny 2021, 151, 65. [Google Scholar] [CrossRef]
  56. Harbaugh, A.W. MODFLOW-2005: The U.S. Geological Survey Modular Ground-Water Model–the Ground-Water Flow Process; Techniques and Methods 2005, 6-A16; US Department of the Interior, US Geological Survey: Reston, VA, USA, 2005. [CrossRef]
  57. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  58. Kovács, A.; Teleki, P.G. History of Oil and Natural Gas Production in the Békés Basin. In Basin Analysis in Petroleum Exploration; Teleki, P.G., Mattick, R.E., Kókai, J., Eds.; Springer: Dordrecht, The Netherlands, 1994; pp. 237–256. [Google Scholar] [CrossRef]
  59. Varga, A.; Bozsó, G.; Garaguly, I.; Raucsik, B.; Bencsik, A.; Kóbor, B. Cements, Waters, and Scales: An Integrated Study of the Szeged Geothermal Systems (SE Hungary) to Characterize Natural Environmental Conditions of the Thermal Aquifer. Geofluids 2019, 2019, 4863814. [Google Scholar] [CrossRef]
  60. Kun, É.; Zilahi-Sebess, L.; Szanyi, J. Battonya–Pusztaföldvári-hát térségének nagy entalpiájú geotermikusenergia-vagyona (I. rész): Hidrodinamikai és hőtranszportmodell. Földtani Közlöny 2022, 152, 53–75. [Google Scholar] [CrossRef]
  61. Gelhar, L.W.; Welty, C.; Rehfeldt, K.R. A Critical Review of Data on Field-Scale Dispersion in Aquifers. Water Resour. Res. 1992, 28, 1955–1974. [Google Scholar] [CrossRef]
  62. Langevin, C.D.; Provost, A.M.; Panday, S.; Hughes, J.D. Documentation for the MODFLOW 6 Groundwater Transport Model. U.S Geological Survey Techniques and Methods, Book 6. 2022; Chapter A61; 56p. Available online: https://pubs.usgs.gov/publication/tm6A61 (accessed on 10 May 2025).
  63. ModelMuse: A Graphical User Interface for Groundwater Models|U.S. Geological Survey. 2024. Available online: https://www.usgs.gov/software/modelmuse-a-graphical-user-interface-groundwater-models (accessed on 10 May 2025).
  64. Vass, I.; Tóth, T.M.; Szanyi, J.; Kovács, B. Hybrid numerical modelling of fluid and heat transport between the overpressured and gravitational flow systems of the Pannonian Basin. Geothermics 2018, 72, 268–276. [Google Scholar] [CrossRef]
  65. Abdulhaq, H. Machine Learning Model for Predicting Thermal Performance of High-Temperature Aquifer Thermal Energy Storage (HT-ATES) in Depleted Clastic Reservoirs. Zenodo 2025 (Version v1). Available online: https://zenodo.org/records/15294847 (accessed on 10 May 2025).
  66. Anderson, M.P.; Woessner, W.W.; Hunt, R.J. (Eds.) Front Matter. In Applied Groundwater Modeling, 2nd ed.; Academic Press: Cambridge, MA, USA, 2015; p. iii. [Google Scholar] [CrossRef]
  67. Fisher, A.; Rudin, C.; Dominici, F. All Models are Wrong, but Many are Useful: Learning a Variable’s Importance by Studying an Entire Class of Prediction Models Simultaneously. arXiv 2019, arXiv:1801.01489. [Google Scholar] [CrossRef]
  68. Ishikawa, T.; Morita, A.; Fukushima, T.; Ono, H. Three-Dimensional Cerebral Aneurysm Models for Surgical Simulation and Education—Development of Aneurysm Models with Perforating Arteries and for Application of Fenestrated Clips. Open J. Mod. Neurosurg. 2014, 4, 59–63. [Google Scholar] [CrossRef]
  69. Abdulhaq, H. Thermal Performance Analysis and Visualization App for UCN-Based Heat Simulations. Zenodo 2025 (Version V1). Available online: https://zenodo.org/records/15294958 (accessed on 10 May 2025).
  70. Collignon, M.; Klemetsdal, Ø.S.; Møyner, O.; Alcanié, M.; Rinaldi, A.P.; Nilsen, H.; Lupi, M. Evaluating thermal losses and storage capacity in high-temperature aquifer thermal energy storage (HT-ATES) systems with well operating limits: Insights from a study-case in the Greater Geneva Basin, Switzerland. Geothermics 2020, 85, 101773. [Google Scholar] [CrossRef]
  71. Drijver, B.; van Aarssen, M.; de Zwart, B. High-temperature aquifer thermal energy storage (HT-ATES): Sustainable and multi-usable. In Proceedings of the 12th International Conference on Energy Storage, Lleida, Spain, 16–18 May 2012. [Google Scholar]
  72. Winterleitner, G.; Schütz, F.; Wenzlaff, C.; Huenges, E. The Impact of Reservoir Heterogeneities on High-Temperature Aquifer Thermal Energy Storage Systems. A Case Study from Northern Oman. Geothermics 2018, 74, 150–162. [Google Scholar] [CrossRef]
  73. Tang, D.W.S.; Rijnaarts, H.H.M. Dimensionless Thermal Efficiency Analysis for Aquifer Thermal Energy Storage. Water Resour. Res. 2023, 59, e2023WR035797. [Google Scholar] [CrossRef]
Figure 1. Shows the location of the study area in Hungary, the modeled section within the Békés Formation lithological map, and a lithological cross-section; modified after Abdulhaq et al. [47].
Figure 1. Shows the location of the study area in Hungary, the modeled section within the Békés Formation lithological map, and a lithological cross-section; modified after Abdulhaq et al. [47].
Energies 18 02642 g001
Figure 2. The workflow of the methodology employed in this study.
Figure 2. The workflow of the methodology employed in this study.
Energies 18 02642 g002
Figure 3. Shows the location of the qualified wells over the temperature grid, highlighting the modeled area. White patches indicate areas where the reservoir thickness is less than 2 m.
Figure 3. Shows the location of the qualified wells over the temperature grid, highlighting the modeled area. White patches indicate areas where the reservoir thickness is less than 2 m.
Energies 18 02642 g003
Figure 4. Distribution of the wells and the boundary of the urban town that can be considered a potential candidate for district heating. The Bekes F. temperature near the town is around 70 °C, which makes it an ideal location to be used as a UTES site. For this reason, we narrowed down the area of interest to be modeled.
Figure 4. Distribution of the wells and the boundary of the urban town that can be considered a potential candidate for district heating. The Bekes F. temperature near the town is around 70 °C, which makes it an ideal location to be used as a UTES site. For this reason, we narrowed down the area of interest to be modeled.
Energies 18 02642 g004
Figure 5. Radial dependency graph illustrating the data pipeline and structural integration of the Random Forest regressor within our predictive framework. Nodes represent features, preprocessing stages, predictive modeling, validation metrics, and visual outputs. Edges indicate data flow and dependencies among the model components.
Figure 5. Radial dependency graph illustrating the data pipeline and structural integration of the Random Forest regressor within our predictive framework. Nodes represent features, preprocessing stages, predictive modeling, validation metrics, and visual outputs. Edges indicate data flow and dependencies among the model components.
Energies 18 02642 g005
Figure 6. Feature importance in temperature prediction using Random Forest.
Figure 6. Feature importance in temperature prediction using Random Forest.
Energies 18 02642 g006
Figure 7. Simulation results for a specific time frame showing the injection of hot fluid through two wells.
Figure 7. Simulation results for a specific time frame showing the injection of hot fluid through two wells.
Energies 18 02642 g007
Figure 8. Thermal performance curve of the simulated hot well across repeated seasons.
Figure 8. Thermal performance curve of the simulated hot well across repeated seasons.
Energies 18 02642 g008
Figure 9. Thermal performance of wells located outside the modeled area. Each curve represents the predicted thermal performance of a different well, with distinct colors used to differentiate individual wells.
Figure 9. Thermal performance of wells located outside the modeled area. Each curve represents the predicted thermal performance of a different well, with distinct colors used to differentiate individual wells.
Energies 18 02642 g009
Figure 10. Percentage increase in thermal performance for each well based on the prediction results.
Figure 10. Percentage increase in thermal performance for each well based on the prediction results.
Energies 18 02642 g010
Figure 11. Horizontal stem plot showing the permutation importance of input features in the Random Forest model. The log-scaled x-axis highlights the dominant influence of the temporal variable TimeDays relative to all geological and operational parameters.
Figure 11. Horizontal stem plot showing the permutation importance of input features in the Random Forest model. The log-scaled x-axis highlights the dominant influence of the temporal variable TimeDays relative to all geological and operational parameters.
Energies 18 02642 g011
Table 1. Key parameters for the MODFLOW model.
Table 1. Key parameters for the MODFLOW model.
ParametersValue/DescriptionSource
Initial Prescribed Hydraulic HeadVaries spatiallyDerived from [60]
Horizontal Hydraulic ConductivityDerived from permeability modelingEstimated from well log data
Vertical Hydraulic ConductivityAssumed as 50% of horizontal conductivityBased on lithological assumptions
Specific Storage0.001 m−1Literature-based estimate
Effective PorosityDerived from porosity modelingEstimated from well log data
Specific Yield0.15Literature-based estimate
Bulk DensityCalculated via gamma ray log surface simulationDerived from natural gamma ray log simulation
Table 2. Summary of key numerical model parameters, values, and justifications used in heat transport and groundwater flow simulations.
Table 2. Summary of key numerical model parameters, values, and justifications used in heat transport and groundwater flow simulations.
ParameterValue/DescriptionJustification
Initial TemperatureVaries spatiallyDerived from drill stem tests and bottom-hole temperature data
Advection PackageThird order TVD scheme UltimateSelected for numerical stability and accuracy
TRPT0.1Assumed based on typical sedimentary conditions [61]
TRVT0.01Assumed based on typical sedimentary conditions [62]
DMCOEF (Effective Molecular Diffusion Coefficient)0.01 m2/dayLiterature-based estimate [63]
longitudinal DispersivityVaries with lithologyBased on the Rock Type Calculation and thermal conductivity of Bekes Fm. from [64]
SorptionLinear isothermCommon assumption for initial reactive transport modeling
Kinetic Rate ReactionZero order reactionAssumed for simplification of reactive processes
PreconditionerJacobiDefault iterative solver preconditioner
Table 3. Random Forest hyperparameters tuned during model calibration.
Table 3. Random Forest hyperparameters tuned during model calibration.
HyperparameterDescription of Tuning Performed
Number of estimators (n_estimators)Increased to reduce variance and stabilize predictions.
Maximum depth (max_depth)Limited to prevent overfitting and improve generalization.
Minimum samples per split (min_samples_split)Adjusted to balance model complexity and predictive accuracy.
Minimum samples per leaf (min_samples_leaf)Increased slightly to ensure robust generalization.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Abdulhaq, H.A.; Geiger, J.; Vass, I.; Tóth, T.M.; Medgyes, T.; Bozsó, G.; Kóbor, B.; Kun, É.; Szanyi, J. Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoirs via Machine Learning: Case Study from Hungary. Energies 2025, 18, 2642. https://doi.org/10.3390/en18102642

AMA Style

Abdulhaq HA, Geiger J, Vass I, Tóth TM, Medgyes T, Bozsó G, Kóbor B, Kun É, Szanyi J. Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoirs via Machine Learning: Case Study from Hungary. Energies. 2025; 18(10):2642. https://doi.org/10.3390/en18102642

Chicago/Turabian Style

Abdulhaq, Hawkar Ali, János Geiger, István Vass, Tivadar M. Tóth, Tamás Medgyes, Gábor Bozsó, Balázs Kóbor, Éva Kun, and János Szanyi. 2025. "Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoirs via Machine Learning: Case Study from Hungary" Energies 18, no. 10: 2642. https://doi.org/10.3390/en18102642

APA Style

Abdulhaq, H. A., Geiger, J., Vass, I., Tóth, T. M., Medgyes, T., Bozsó, G., Kóbor, B., Kun, É., & Szanyi, J. (2025). Predicting Thermal Performance of Aquifer Thermal Energy Storage Systems in Depleted Clastic Hydrocarbon Reservoirs via Machine Learning: Case Study from Hungary. Energies, 18(10), 2642. https://doi.org/10.3390/en18102642

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop