1. Introduction
Heavy and extra-heavy oils remain a substantial part of global hydrocarbon resources, yet their development is increasingly constrained by high energy demand and carbon emissions. Thermal recovery methods such as cyclic steam stimulation, steam-assisted gravity drainage, and ISC can mobilize viscous oil effectively, but they typically require large surface fuel input and generate significant CO
2 per produced barrel [
1]. At the same time, hydrogen is gaining attention as an energy carrier and industrial feedstock in decarbonization pathways. For hydrogen-related projects, the carbon intensity of production routes is becoming as important as cost, which motivates concepts that couple subsurface thermal processes with in-reservoir hydrogen generation and potential carbon retention.
Among the proposed concepts, in situ hydrogen generation treats the heated reservoir as a reactive environment in which cracking, oxidation, coke formation, and gasification occur in sequence. Compared with surface upgrading or reforming [
2], subsurface conversion could reduce surface infrastructure and avoid transporting high-viscosity crude oil, while the formation itself provides confinement that can retain a portion of generated CO
2. However, the underlying chemistry is multistage and depends strongly on temperature, pressure, water availability, local gas composition, and mineral surfaces. As a consequence, the net hydrogen outcome results from a competition between hydrogen-forming reactions and hydrogen-consuming reactions.
Recent experimental and modeling studies have shown that hydrogen can be generated during aquathermolysis and in situ combustion gasification of heavy oils under high-temperature/high-pressure conditions [
3,
4,
5]. These studies further suggest that gasification can simultaneously modify gas composition and improve oil mobility or quality, supporting the concept of combined gas production and in situ upgrading.
From a modeling perspective, the reaction set encompasses oil-phase cracking, aquathermolysis, low- and high-temperature oxidation (LTO and HTO), coke formation and gasification, and gas phase reactions including the water–gas shift (WGS) and methanation. Pseudo-component kinetic descriptions for heavy oil conversion and gas lumping have been developed in prior experimental and simulation studies [
6,
7,
8,
9,
10]. Building on these frameworks, subsequent numerical and molecular-level studies have examined the effects of temperature, pressure, oxygen enrichment, water availability, and atmosphere composition on gas yields and hydrogen fractions [
11,
12,
13].
For ISC-relevant conditions, the reservoir environment is strongly zoned. Oxygen is concentrated near the combustion front, while the upstream steam, thermal cracking, and coke gasification zones are largely oxygen-depleted (
Figure 1). Therefore, results obtained under oxygen-rich conditions may not represent the dominant chemistry in oxygen-poor regions, where the residence time at an elevated temperature can be much longer. In addition, laboratory experiments and corresponding simulations often focus on short durations (hours), whereas the reservoir can experience sustained heating over days to weeks. This longer exposure allows slower pathways and secondary consumption routes to become important, which can alter net hydrogen generation and gas composition.
In this work, reactor-scale history matching is coupled with a comparative numerical study. The contribution is not merely parameter replacement; it is the construction of atmosphere-specific effective kinetic descriptions anchored to a common HTHP reactor dataset and evaluated through direct simulator-to-measurement comparison. The pseudo-component kinetic network was recalibrated to reproduce gas compositions under air, N2, and CO2, after which the calibrated models were used to compare hydrogen volume fractions and specific hydrogen yields. This study focuses on three questions: how atmosphere-specific fitting alters the effective kinetics, how the hydrogen response can be organized into practical temperature regimes, and how the hydrogen fraction may diverge from the net hydrogen yield under identical operating conditions. All results are interpreted as reactor-scale screening trends rather than direct field-scale predictions.
2. Materials and Methods
2.1. Reaction Network and Kinetic Formulation
Heavy oil gasification chemistry involves many parallel and consecutive reactions in both condensed and gas phases. To retain the major conversion routes while keeping the model calibratable, we adopt the reaction network proposed by Kapadia et al. [
10] as the baseline (
Figure 2). The network links thermal cracking/pyrolysis, aquathermolysis, LTO and HTO, coke formation, and secondary reactions of cracking products with gas phase processes such as coke gasification, WGS, and methanation.
Because the feed and products comprise numerous molecular species, a lumped (pseudo-component) representation is used. Bitumen is divided into maltenes and asphaltenes, which show different behavior during cracking and oxidation. Asphaltenes can follow competing pathways that generate lighter oils, gases, and solid coke. In the gas phase, H
2, CH
4, CO, CO
2, and H
2S are tracked explicitly, while combustible C2+ species are grouped into HMWG [
5,
8,
9,
10]. This reduced description captures the dominant mass-transfer coupling between phases while remaining practical for reservoir simulators.
Each lumped reaction is described by an Arrhenius-type rate constant, k = Aexp(−E/RT), where A is the pre-exponential factor, E is the activation energy, R is the universal gas constant, and T is the absolute temperature. For higher-order reactions, CMG-STARS internally applies the appropriate concentration terms; accordingly, the reported Arrhenius parameters should be interpreted as effective kinetic coefficients within the simulator formulation. During recalibration, Reactions 1–14 and 22–27 were treated as adjustable, whereas Reactions 15–21 retained the baseline oxygen reaction settings and remained inactive in the N2 and CO2 cases. The calibrated parameters therefore represent effective kinetics specific to the present reactor system rather than universal intrinsic constants.
2.2. Reactor-Scale Numerical Model and Boundary Conditions
A numerical HTHP reactor model was built in CMG-STARS, version 2021 (Computer Modelling Group Ltd., Calgary, AB, Canada) to mirror the boundary conditions of the laboratory tests (
Figure 3). The computational domain used a 10 × 1 × 1 Cartesian grid with porosity set to 1.0 to represent a well-mixed reactor volume. Initial phase saturations were 0.015 oil, 0.015 water, and 0.97 gas, consistent with a gas-dominant headspace. Calibration runs were treated as isothermal at 200, 250, and 300 °C and simulated for 6 h at 4 MPa to match the experimental protocol. The main reactor model input parameters are summarized in
Table 1.
Pressure control and gas collection were implemented through pressure-constrained injection/production controls in CMG-STARS, which maintained the target system pressure while retaining produced gases in the reactor gas phase for composition accounting. The initial atmosphere was set to air, N2, or CO2, representing oxygen-containing conditions near a combustion front, oxygen-depleted inert conditions in upstream cracking zones, and CO2-rich conditions that may develop under flue-gas injection or local CO2 accumulation, respectively. Rock powder (sandstone core powder) was present in the experiments and can provide mineral surfaces that influence apparent reaction rates; in the present model, this effect is absorbed into the fitted effective kinetic parameters rather than treated through an explicit surface-reaction module.
The reactor-scale simulations are not intended to reproduce reservoir-scale transport or spatial gradients; rather, they serve to calibrate kinetics under controlled conditions and support a consistent comparison of operating-condition effects on gas composition and hydrogen behavior. All yield comparisons employ the same simulator gas-volume reporting basis and the same initial oil charge within this fixed reactor configuration; accordingly, the yield metric is used primarily for relative comparison among cases rather than for direct scale-up.
2.3. Parameter Recalibration and History Matching
The crude oil sample and experimental configuration differ from those used in the original Kapadia model development. In addition, the inclusion of rock powder introduces catalytic and buffering effects that may alter apparent activation energies and the balance between hydrogen-forming and hydrogen-consuming pathways. Therefore, directly using Arrhenius parameters from the literature can lead to a systematic mismatch in the predicted gas composition, especially for H2 and sulfur species.
History matching was performed by directly comparing simulator outputs with the measured gas compositions obtained from the HTHP reactor tests at 200, 250, and 300 °C under air, N
2, and CO
2 (4 MPa, 6 h). No secondary linear fit was applied before evaluating model performance. The least-squares objective and the species-wise evaluation metrics were defined as follows:
In Equations (1)–(4), and denote the raw simulated and measured gas composition values, respectively, for species j at data point k, and nj is the number of data points for species j. denotes the mean of the measured values for species j. All quantities were calculated directly from the raw simulated and measured values.
For each atmosphere, the evaluation dataset contained 12 gas composition points (H
2, CH
4, H
2S, and HMWG at 200, 250, and 300 °C). Because replicate information was not available for all conditions, the comparison is interpreted here as a deterministic model performance check rather than a full uncertainty-qualified validation. Adjustable parameters were restricted to Reactions 1–14 and 22–27 around the Kapadia baseline, whereas Reactions 15–21 retained the baseline oxygen reaction settings and remained inactive in the N
2 and CO
2 cases. Calibration was stopped when further manual adjustment no longer produced a meaningful reduction in
J, and no systematic improvement was observed across the reported species.
Figure 4 presents the overall parity plots, whereas
Figure 5 shows species-resolved agreement at each calibration temperature.
Table 2 summarizes the corresponding species-wise RMSE and MAE values, reported in vol%.
A comparison of the parameter sets in
Table 3 shows that the largest atmosphere-dependent shifts occur in Reactions 6, 11–14, 23, 26, and 27, indicating that hydrogen-redistribution steps and selected coke–gas reactions exert the strongest leverage on the fitted response within the present dataset. This comparison does not replace a formal global sensitivity analysis; however, it identifies the reaction subsets most responsive during calibration and therefore most relevant to future targeted experiments.
After history matching, one calibrated parameter set was obtained for each atmosphere (
Table 3). The air and N
2 cases remain broadly comparable for many shared cracking reactions, whereas the CO
2 case requires larger changes in selected effective parameters, especially those associated with hydrogen redistribution, sulfur chemistry, and coke–gas interactions. The atmosphere comparison is therefore semi-empirical: each atmosphere is represented by its own effective kinetic model calibrated to the corresponding experiments. Consequently, the findings substantiate the utility of atmosphere-dependent reactor-scale screening; however, they should not be construed to imply that gas composition alone—in the absence of recalibration—fully accounts for all observed discrepancies.
2.4. Comparative Simulation Plan and Evaluation Metrics
With the calibrated parameter sets, a comparative series of numerical runs was conducted to examine how temperature, reaction time, pressure, gas atmosphere, and oil/water ratio influence hydrogen generation. The temperature was varied from 150 to 425 °C in 25 °C increments. For each temperature, reaction times of 0.25, 0.5, 1, 2, 4, 8, and 10 days were considered to capture both short-term behavior and longer thermal exposure representative of heated reservoir zones. Pressure was compared at 4, 6, and 8 MPa under the same temperature–time combinations. In addition, a supplementary set of runs was performed at 4 MPa after 10 days to compare the oil/water ratios of 1:1, 1:1.5, and 1:2 under the three atmospheres. The three atmospheres were simulated separately using their respective calibrated kinetic parameters. Because the calibration domain is limited to 200–300 °C, 4 MPa, and 6 h, results outside these bounds are treated as controlled extrapolations within the calibrated model framework and are discussed as trend-screening results rather than standalone predictive field-scale forecasts.
Note: Reaction numbers correspond to the pathway labels in
Figure 2. Reactions 1–14 and 22–27 were adjustable during recalibration, whereas Reactions 15–21 retained baseline air case settings and were inactive in the N
2 and CO
2 simulations.
Two metrics were adopted for comparison. The first is the gas phase hydrogen volume fraction (vol%), which is relevant to produced-gas quality and downstream separation. The second is the specific hydrogen yield (m3·kg−1 oil), defined as the cumulative simulator-reported H2 volume on a consistent internal gas volume basis normalized by the initial heavy oil charge in the reactor model. Because both the gas volume basis and oil charge are fixed across all simulations, this metric is intended primarily for relative comparison among cases within the same reactor configuration. Notably, these two metrics can evolve differently: a high H2 fraction may occur when total gas generation is still limited, whereas a high specific yield requires sustained net hydrogen production over time. In the discussion below, both metrics are used to distinguish early transient hydrogen enrichment from longer-term net hydrogen generation.
3. Results and Discussion
3.1. Delineation of Hydrogen Production Stages
Figure 6 summarizes the simulated evolution of the hydrogen volume fraction and specific hydrogen yield as functions of temperature and reaction time under the three atmospheres. Temperature controls which parts of the reaction network become kinetically active, but the net hydrogen response is not monotonic because hydrogen is produced and consumed simultaneously through cracking, oxidation, gasification, and sulfur-related reactions.
Across the investigated ranges, the overall behavior can be described using three temperature regimes. The regime boundaries are not intended as universal kinetic thresholds; rather, they represent approximate transition intervals identified from changes in the simulated H2 fraction/yield trends and from the onset of stronger atmosphere-dependent divergence. This regime-based description therefore serves as an organizing framework for discussion and as a bridge to typical ISC temperature zones.
Stage I—Initiation (150–250 °C): The system is dominated by early volatilization and the onset of low-temperature cracking; a transient H2 maximum appears in all atmospheres. In the model, small amounts of CO and H2 generated from early cracking and oxygenated-group decomposition are redistributed by low-temperature WGS. The peak H2 fraction reaches 5.97% (air, 225 °C, 0.5 d), 4.63% (N2, 225 °C, 1 d), and 8.52% (CO2, 200 °C, 2 d). Although these fractions are moderate, specific hydrogen yield can be relatively favorable because overall conversion is limited and hydrogen-consuming routes remain weak.
Stage II—Pyrolysis-controlled regime (250–325 °C): With increasing temperature, cracking of resins and asphaltenes accelerates, producing a broader gas spectrum consistent with the stronger thermal response reported for heavy oil fractions in this interval [
14]. Net hydrogen accumulation nevertheless remains weak as hydrogen-consuming routes become competitive. In air, the oxidation of light intermediates provides an additional H
2 sink together with hydrocarbon-radical scavenging, yielding the steepest temporal decline in H
2. Under N
2, the absence of oxygen suppresses direct oxidative loss, yet radical recombination and sulfur-bearing pathways still restrain net H
2 build-up. Under CO
2, gas phase dilution and stronger fitted redistribution toward CO/CO
2-containing products keep the H
2 fraction low, even as conversion continues. Because direct reaction flux outputs are unavailable from the present reactor simulations, these pathway assignments are inferred from the combined trends of H
2, CH
4, and H
2S together with the calibrated reaction network.
Stage III—High-yield (325–425 °C): At higher temperatures, gasification-related steps and high-temperature WGS become increasingly active, and the three atmospheres diverge markedly. In air, oxygen-driven oxidation consumes intermediate gases and hydrogen; consequently, both the H2 fraction and specific yield decrease with extended time. Under N2, hydrogen formation begins to outpace consumption once the temperature exceeds approximately 350 °C, and both metrics increase with time. Under CO2, coke–CO2 reactions and CO formation become prominent; hydrogen yield can increase, yet the gas- phase H2 fraction remains comparatively low because CO2 dilutes the gas phase and contributes to competing pathways.
3.2. Impact of Temperature on Hydrogen Production Efficiency
Initiation regime (150–250 °C): The methane trends in
Figure 7 indicate that early CH
4 is mainly produced by volatilization and incipient cracking of larger molecules [
15]. As temperature and time increase, additional gaseous products form and dilute CH
4, causing its volume fraction to fall. H
2S shows a temperature window that shifts slightly with time, which is consistent with the gradual liberation of sulfur-bearing fragments during early cracking and aquathermolysis. Across all atmospheres, a clear H
2 maximum appears near 225 °C, suggesting that low-temperature WGS provides an efficient route for transient hydrogen enrichment in this window.
Pyrolysis-controlled regime (250–325 °C): In this interval, hydrogen is generated during decomposition of resins/asphaltenes, but the net yield remains modest. Above 250 °C, the H
2 fraction drops, which implies that hydrogen-consuming routes become important. This decline is consistent with the increasing importance of hydrogen-consuming secondary reactions, potentially involving radical recombination and sulfur-related pathways [
16,
17]. The H
2S response indicates that this regime corresponds to the main period of H
2S generation; accelerated breakdown of sulfur-containing organics increases H
2S while drawing down H
2. In air, the oxidation of light hydrocarbons becomes increasingly important with time, which further suppresses H
2 accumulation.
High-yield regime (325–425 °C): Under N
2, an inflection in hydrogen behavior appears near 325 °C (
Figure 6); the H
2 fraction initially decreases and then increases, reflecting a shift from methane-dominated gas release toward WGS-related hydrogen generation. Once the temperature exceeds approximately 350 °C, H
2 rises rapidly owing to the activation of high-temperature WGS and secondary gasification steps. Mineral surfaces in the rock powder may have contributed to the apparent enhancement of WGS-related pathways at elevated temperature; however, this possibility was not independently verified in the present study. In contrast, under air, the net hydrogen balance becomes negative at high temperature because oxidation continues to intensify with oxygen availability.
Temperature also affects the relationship between the gas composition and specific yield. A high H
2 fraction does not necessarily imply a high yield because total gas generation can vary substantially with temperature and time. Under CO
2, for instance, the hydrogen fraction can remain low even as the specific yield increases, because CO
2 both dilutes the gas phase and shifts the carbon balance toward CO/CO
2. The distinction between gas quality (H
2 fraction) and net hydrogen production (specific yield) should be kept in mind when comparing operating conditions. The preference for oxygen-lean, high-temperature conditions is qualitatively consistent with recent heavy oil hydrogen-generation modeling studies that also reported stronger hydrogen accumulation once oxidation losses were suppressed and gasification/WGS pathways became more competitive [
18,
19]. Direct quantitative comparison across studies should nonetheless be made with caution, because crude composition, mineralogy, reactor configuration, and gas volume reporting bases vary considerably.
3.3. Effect of Pressure on Hydrogen Production Efficiency
Pressure effects were assessed by varying the amount of reaction gas required to reach the target pressure while keeping the oil/water charge unchanged.
Figure 8 summarizes the H
2 volume fraction and specific H
2 yield after 10 days at 4, 6, and 8 MPa. The temperature interval exhibiting the strongest hydrogen response is broadly similar across the three pressures, yet the magnitude of hydrogen accumulation varies with pressure. In the present model, pressure can influence gas partial pressures, dissolution/partitioning behavior, and the residence of gaseous products in the reactor. Because direct phase-equilibrium diagnostics were not extracted, the pressure discussion below is limited to observable model outcomes rather than a full mechanistic decomposition.
In air, increasing pressure increases oxygen availability and intensifies oxidation pathways that consume hydrogen and light hydrocarbons. Consequently, both the H2 fraction and the specific H2 yield decrease as pressure rises. This trend aligns with the practical view that oxygen-rich conditions generally hinder net hydrogen accumulation unless oxygen access is confined to the combustion zone and rapidly depleted upstream; otherwise, any hydrogen formed can be promptly scavenged by oxidative reactions.
Under N2, the pressure dependence is non-monotonic rather than strictly decreasing or increasing. At 425 °C and 6 MPa, the H2 fraction reaches 28.6%—8.9 percentage points above the 19.5% obtained at 4 MPa. The specific yield exhibits the same optimum, peaking at 0.0247 m3·kg−1 at 6 MPa and 425 °C. Under an inert atmosphere, this intermediate-pressure optimum likely reflects the combined influence of gas retention and secondary gas- phase behavior within the reactor model, rather than a change in the calibrated reaction topology itself. At still higher pressure, the gas phase H2 fraction declines, indicating that elevated pressure does not continuously improve net hydrogen enrichment, even in the absence of oxidation.
In CO2, the initiation-stage H2 fraction increases with pressure at first and then decreases, suggesting that moderate CO2 levels may facilitate certain steps, whereas excessive CO2 becomes inhibitive. In the high-yield regime, the H2 fraction remains low (around 0.5%), yet the specific yield rises to 0.0134 m3·kg−1 at 6 MPa and 425 °C. This divergence reflects the dual role of CO2, which can participate in coke-related routes that generate CO and, through the water–gas shift reaction, contribute to net hydrogen formation, while simultaneously diluting the produced gas and competing for reactive intermediates. The combined effect is that CO2-rich conditions may still deliver measurable hydrogen per unit oil even when the produced gas does not appear hydrogen-rich on a volumetric basis.
3.4. Effect of Oil/Water Ratio on Hydrogen Production Efficiency
At a fixed pressure of 4 MPa and a reaction time of 10 days, an additional comparison was conducted to evaluate the effect of the oil/water ratio on H
2 proportion under the three atmospheres (
Figure 9). The results indicate that the influence of the oil/water ratio is clearly atmosphere-dependent and that no single ratio is optimal across all atmospheres. Compared with the oil/water ratio itself, the atmospheric condition exerts stronger control on net hydrogen accumulation.
Under air, all three curves exhibit a bell-shaped temperature dependence with a maximum near 225 °C; the 1:1.5 case yields the highest H2 proportion in the medium-temperature window, suggesting that a moderate increase in water content is beneficial under oxidizing conditions. Under N2, the H2 proportion increases gradually at low temperature, passes through a shallow trough around 250–300 °C, and then rises sharply above 325 °C. In the high-temperature range, the order becomes 1:2 > 1:1.5 > 1:1, indicating that higher water content increasingly favors hydrogen generation in the absence of oxidative loss. Under CO2, the H2 proportion remains lower than under N2 over most of the temperature range; although the 1:2 case still gives the highest values, the curves peak around 200–225 °C before declining continuously. Overall, the preferred oil/water ratio shifts from 1:1.5 under air to 1:2 under N2 and CO2, reinforcing that atmosphere-dependent chemistry dominates the hydrogen response under the present reactor-scale conditions.
3.5. Implications for Reservoir-Scale Implementation and Model Limitations
In a field ISC process, oxygen is typically confined to a narrow region around the combustion front and is rapidly consumed as it migrates upstream. From this perspective, the air cases examined here are most representative of near-front conditions, where oxidation reactions are active, and hydrogen tends to be short-lived. Upstream of the front, the steam and thermal cracking zones are generally oxygen-depleted, and the background gas is closer to N2/CO2/steam mixtures. The comparisons therefore suggest that meaningful hydrogen accumulation is more likely in oxygen-lean regions, provided that temperature is high enough for WGS-related pathways and that water is available to sustain those reactions.
The results also highlight a practical point that is sometimes overlooked when interpreting gas data: the hydrogen volume fraction and specific hydrogen yield can move in different directions. An H2-rich gas stream is attractive for separation and utilization, but a transient increase in the H2 fraction can occur at early times when total gas generation is still limited. Conversely, a CO2-rich environment can deliver a non-negligible hydrogen yield while keeping the gas phase H2 fraction low because CO2 dominates the mixture. In a field setting, the preferred operating window therefore depends on the target (produced-gas quality versus total hydrogen generated), as well as on constraints in gas handling, reinjection, and carbon management.
The present study is deliberately reactor-scale, and several limitations should be noted before extending the trends to reservoir conditions. First, the calibration domain spans only 200–300 °C, 4 MPa, and 6 h; results beyond this window are extrapolations within the calibrated model and should be treated as screening trends rather than standalone predictive forecasts. Second, the calibration dataset is limited to 12 composition points per atmosphere and does not include complete replicate uncertainty information for all conditions. Recent work on gas desorption and pore–fracture connectivity shows that gas occurrence and transport are highly sensitive to microstructural connectivity, laminae architecture, and desorption characteristics—none of which are resolved by the present well-mixed reactor model [
20,
21,
22]. Third, the isothermal, well-mixed simulations do not capture the thermal gradients, evolving oxygen supply, or transport-controlled residence times of a propagating combustion front. Fourth, the pseudo-component framework condenses complex chemistry into lumped reactions with effective Arrhenius parameters that may shift with mineralogy, surface area, brine composition, and phase partitioning. Fifth, separate parameter sets were fitted for air, N
2, and CO
2, rendering the atmosphere comparison semi-empirical rather than fully mechanistic. Finally, water saturation was not varied explicitly in the full simulation matrix, nor were direct reaction-rate contributions extracted; both gaps warrant dedicated investigation.
A natural next step is embedding the calibrated kinetics into coupled reservoir-scale thermal simulations resolving heat transfer and multiphase flow, followed by testing injection sequences reflecting field practice. Priority should be given to (i) explicit variation in water saturation and steam availability, as both govern WGS-related hydrogen generation; (ii) targeted experiments for reaction subsets most affected by recalibration; and (iii) broader calibration data enabling automated optimization, formal sensitivity analysis, and probabilistic uncertainty evaluation. Such extensions would permit a more quantitative assessment of hydrogen producibility, produced-gas composition along the wellbore, and the extent to which CO2 can be retained underground at the field scale.