Prediction of Oil Recovery Factor in Stratified Reservoirs after Immiscible Water-Alternating Gas Injection Based on PSO-, GSA-, GWO-, and GA-LSSVM

Andersen, Pål Østebø; Nygård, Jan Inge; Kengessova, Aizhan

doi:10.3390/en15020656


by Pål Østebø Andersen ¹,*, Jan Inge Nygård ² and Aizhan Kengessova ¹,³

¹ Department of Energy Resources, Faculty of Science and Technology, University of Stavanger, 4021 Stavanger, Norway

² Bouvet, 4020 Stavanger, Norway

³ Timal Consulting Group LLP, Atyrau 060011, Kazakhstan

* Author to whom correspondence should be addressed.

Energies 2022, 15(2), 656; https://doi.org/10.3390/en15020656

Submission received: 5 October 2021 / Revised: 22 December 2021 / Accepted: 11 January 2022 / Published: 17 January 2022

(This article belongs to the Special Issue Management of High Water Cut and Mature Petroleum Reservoirs)


Abstract: In this study, we solve the challenge of predicting oil recovery factor (RF) in layered heterogeneous reservoirs after 1.5 pore volumes of water-, gas- or water-alternating-gas (WAG) injection. A dataset of ~2500 reservoir simulations is analyzed based on a Black Oil 2D Model with different combinations of reservoir heterogeneity, WAG hysteresis, gravity influence, mobility ratios and WAG ratios. In the first model MOD1, RF is correlated with one input (an effective WAG mobility ratio $M^{*}$). Good correlation (Pearson coefficient −0.94), but with scatter, motivated a second model MOD2 using eight input parameters: water–oil and gas–oil mobility ratios, water–oil and gas–oil gravity numbers, a reservoir heterogeneity factor, two hysteresis parameters and water fraction. The two mobility ratios exhibited the strongest correlation with RF (Pearson coefficient −0.57 for gas-oil and −0.48 for water-oil). LSSVM was applied in MOD2 and trained using different optimizers: PSO, GA, GWO and GSA. A physics-based adaptation of the dataset was proposed to properly handle the single-phase injection. A total of 70% of the data was used for training, 15% for validation and 15% for testing. GWO and PSO optimized the model equally well (R² = 0.9965 on the validation set), slightly better than GA and GSA (R² = 0.9963). The performance metrics for MOD1 in the total dataset were: RMSE = 0.050 and R² = 0.889; MOD2: RMSE = 0.0080 and R² = 0.998. WAG outperformed single-phase injection, in some cases with 0.3 units higher RF. The benefits of WAG increased with stronger hysteresis. The LSSVM model could be trained to be less dependent on hysteresis and the non-injected phase during single-phase injection.

Keywords: water-alternating-gas (WAG); physics-informed machine learning; least square support vector machine (LSSVM); particle swarm optimization (PSO); dimensionless numbers; hysteresis; genetic algorithm (GA); gravitational search algorithm (GSA); grey wolf optimization (GWO)

1. Introduction

Oil recovery through water or gas injection often lacks efficiency due to the unfavorable mobility ratio between the oil and the displacing phase. Viscous fingering, gravity segregation and heterogeneity can also lead to poor sweep. Gas has low viscosity and density and is therefore prone to channeling and early breakthrough [1,2]. Water-alternating-gas injection (WAG) is an enhanced oil recovery (EOR) technique in which water and gas are injected in cycles to displace the oil. This type of technique mitigates the exponential rate decline seen in most fields after peak production [3]. The mobility of each injected fluid is reduced by the presence of the other, producing a more favorable mobility ratio to oil. Gravity segregation becomes less detrimental, since gas sweeps the top of the reservoir, while water sweeps the bottom [2,4]. Gas usually results in lower residual oil saturation than water, but this can be further lowered by WAG. Thus, WAG utilizes the advantages of water and gas injection and minimizes their individual downsides. In several field implementations, it has been beneficial to use WAG; the oil recovery factor in 59 fields increased when WAG was introduced, by 5% to 10% of the oil originally in place [2]. Sanchez [5] reported that 80% of US WAG projects were beneficial. Micromodel studies also demonstrate better oil recovery with WAG than with single-phase displacement [6].

WAG introduces more design and operational parameters compared to water or gas injection, such as the WAG ratio (volume water to volume gas injected), number of cycles, cycle volume, injection rates and pressures. This may further affect optimal well placements. A 1:1 WAG ratio is considered common or even optimal [4]. Variation in WAG ratio with project time (tapering) has been applied, partly due to limited gas access, to limit gas production or for optimization. In [7], ensemble-based optimization of injector and producer well controls at each WAG cycle was used to maximize the net present value for a channeled reservoir model. Whether the reservoir pressure is above the system’s minimum miscibility pressure (MMP) determines whether the gas is miscible or immiscible with the oil [8]. During miscible displacement, the oil and gas become practically the same phase and residual oil saturation can approach zero. Kulkarni [9] found that miscible gas (CO₂) core flooding (continuous or WAG) outperformed immiscible injection. The choice or modification of the injected fluids can also improve the outcome. Foam, surfactant, polymer, low salinity brine and CO₂ are some alternatives [4].

Reservoir geology or heterogeneity is important during any field development. During gas injection, heterogeneity in terms of thief zones, stratification or fractures can cause gas channeling and early breakthrough, which can be mitigated by WAG [4]. Favorable well placement relative to the dip angle can provide more stable frontal displacement of oil. In heterogeneous reservoirs, gravity and capillary forces can divert flow from highly permeable layers to less permeable layers. These effects are more important in naturally fractured reservoirs, where advective forces are unable to mobilize oil [10,11].

The simultaneous flow of oil, water and gas requires the detailed measurement, quantification and correlation of three-phase relative permeabilities [12,13,14]. Injecting water and gas alternately causes gas and water saturations to rise and fall, resulting in hysteresis [15,16,17,18,19]. During WAG, the relative permeability of gas is more affected by hysteresis than oil and water [4] and hysteresis tends to decrease gas mobility. This reduction delays gas breakthrough and reduces gravity segregation. The Land [12] and Carlson [15] models are widely used to model relative permeability hysteresis.

Machine learning (ML) has gained increased popularity in the petroleum industry in recent years. ML algorithms can be useful for understanding trends in complex datasets and provide multivariate nonlinear regression or classification. Their applications include lithology classification [20], selecting EOR methods [21], locating optimal drilling spots [22], correlating asphaltene precipitation [23] or predicting CO₂ viscosity [24]. Important steps in developing ML models include selecting appropriate input and output variables, acquiring sufficient quality data, applying a suitable ML algorithm and tuning its metaparameters to prevent over- or under-fitting, usually via optimization algorithms.

This study makes use of the least squares support vector machine (LSSVM) algorithm, based on the works [25,26,27], for nonlinear regression. This algorithm has been applied in many contexts, such as predicting drilling fluid density [28], gas solubility [29,30,31,32], water availability [33], energy consumption [34,35], shale gas adsorption [36], wind power [37,38] and even tourism flow [39]. LSSVM has been successfully combined with optimizers such as particle swarm optimization (PSO), genetic algorithm (GA) and grey wolf optimization (GWO) and has, in many cases, outperformed regression algorithms such as artificial neural networks, radial basis function, gene expression programming and adaptive neuro-fuzzy inference system [29,30,40,41].

In recent studies [42,43], CO₂ WAG injection was simulated in a reservoir model and machine learning proxy models were developed with different algorithms to predict current rates of oil, gas and water based on current time, gas and water injection rates, half cycle time and operational constraints. The recovery factor and cumulative production were calculated from the produced output. The calibrated proxy models were used to optimize the WAG process. In [41], LSSVM and other ML approaches were used to predict two-phase relative permeabilities, which were combined via correlations from previous research to estimate three-phase relative permeabilities and the performance of a WAG core flood. In [40], the oil recovery performance of EOR carbonated water injection was correlated using LSSVM. In [44], ML was used to optimize well placement during WAG injection. In [45,46], ML was used to co-optimize CO₂ injection for oil recovery and storage during WAG under different operational constraints.

In this study, our main contribution is to predict the reservoir oil recovery factor (RF) in layered reservoirs during immiscible WAG and single-phase (gas or water) injection for different fluid, reservoir, geometrical and operational conditions. This is a relatively complex task given the number of parameters involved and their coupled nature. Based on a comprehensive simulation database generated in both [47] and this work, we present two predictive approaches. In the first (MOD1), a dimensionless number $M^{*}$, derived from Nygård and Andersen’s study [47], is applied as a single input parameter. The second approach (MOD2) applies eight physics-motivated dimensionless input parameters to improve predictive power compared to the first method: two mobility ratios, two gravity numbers, injected water fraction, reservoir heterogeneity factor and two hysteresis parameters. In both models, the input variables incorporate all the system information. The latter approach, MOD2, utilizes the ML regression algorithm, LSSVM, with metaparameters optimized by either PSO (Particle Swarm Optimization), GSA (Gravitational Search Algorithm), GA (Genetic Algorithm) or GWO (Grey Wolf Optimization). LSSVM has been optimized successfully in other works using PSO [20,23,36], GSA [37,38], GA [28] and GWO [29,32,33]. We propose a methodology to ensure the physical behavior of the machine learning model MOD2. We then adapt the dataset to be independent of hysteresis and the non-injected phase when single-phase injection is performed. Some of the research questions we investigate are:

- How well do the models predict WAG performance?
- Which parameters affect RF the most?
- Do the parameters have a positive or negative effect on RF?
- Will the models properly account for WAG injection and single-phase injection?

The paper is structured as follows. The model serving as the basis for the simulation results is outlined in Section 2.1. The dimensionless number $M^{*}$, which is used in the single-input parameter model, is outlined in Section 2.2. The machine learning approach and dataset follow in Section 2.3, where the eight input parameters of the second approach are also presented. The results from analyzing the data are shown in Section 3 and the paper is concluded in Section 4.

2. Theory

2.1. Mathematical Model

We consider the same modeling approach for immiscible WAG injection as [47]: A 2D reservoir layered in a horizontal direction, with one injector and one producer, both vertical and perforated along the full reservoir height. See Figure 1 for an illustration. A black oil model is assumed with an incompressible and immiscible three-phase flow of oil, water and gas and negligible capillary pressure. WAG is applied from the start, rather than as a tertiary method. Relevant equations are presented below:

\partial_{t} (ϕ s_{o}) = \partial_{x} (f_{o} u_{T x}) + \partial_{z} (f_{o} u_{T z}) + \partial_{z} (K_{z} g λ_{w} f_{o} Δ ρ_{w o}) - \partial_{z} (K_{z} g λ_{g} f_{o} Δ ρ_{o g})

(1)

\partial_{t} (ϕ s_{w}) = \partial_{x} (f_{w} u_{T x}) + \partial_{z} (f_{w} u_{T z}) - \partial_{z} (K_{z} g λ_{o} f_{w} Δ ρ_{w o}) - \partial_{z} (K_{z} g λ_{g} f_{w} Δ ρ_{w g})

(2)

\partial_{x} u_{T x} + \partial_{z} u_{T z} = 0,

(3)

u_{T x} = - K_{x} λ_{T} \partial_{x} p, u_{T z} = - K_{z} λ_{T} \partial_{z} p + K_{z} g (λ_{o} ρ_{o} + λ_{w} ρ_{w} + λ_{g} ρ_{g})

(4)

where $ϕ$ denotes porosity, $s_{i}$ saturation of phase $i$, $f_{i}$ fractional flow, $K$ permeability, $λ_{i}$ mobility, $ρ_{i}$ density, $u_{T}$ total Darcy flux and $p$ pressure. The Corey correlation was applied for relative permeabilities, while gas relative permeability hysteresis was incorporated using Land’s trapping model [12] (which reduces the mobile gas saturation interval based on the parameter $C$) and Carlson’s hysteresis model [15] with parameter $α$ (which reduces gas relative permeability).

Nygård and Andersen [47] ran simulations systematically to investigate the role of gravity segregation, the mobility ratios between the three phases, heterogeneity, hysteresis and WAG ratio and how they affected RF after 1.5 pore volumes of fluid were injected. The simulations were scaled using a combined dimensionless mobility ratio $M^{*}$ stating how effectively the injected fluids displaced oil under the given conditions; its construction is summarized in Section 2.2. In the design of this number, the mechanisms were incorporated one at a time. We refer to Table A1, Table A2 and Table A3 in Appendix A for several important simulation input parameters or model configurations that were constant in the simulations. More details can be found in the original paper. These parameters were kept constant mainly as a matter of prioritization. However, they were incorporated in the dimensionless numbers presented in the following sections.

Figure 1. The geometrical configuration of the model (modified from [47]). $x$ is the distance from the injector, while $z$ is the distance from the top of the reservoir.

Oil recovery factor (RF) is the output parameter of interest, defined as:

RF = \frac{\text{volume oil produced}}{\text{volume oil initially in place}} = 1 - \frac{\sum_{j=1}^{N_{z}} \sum_{k=1}^{N_{x}} ϕ(z_{j}) s_{o}(x_{k}, z_{j})}{\sum_{j=1}^{N_{z}} \sum_{k=1}^{N_{x}} ϕ(z_{j}) s_{oi}}

(5)

Every grid block has the same dimensions $Δx Δz$, so the block volume cancels in the ratio. RF is reported after 1.5 pore volumes are injected.
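As a minimal sketch of Equation (5), assuming the simulator output is available as a 2D array of oil saturations on the uniform grid, RF could be evaluated as follows. The function name `recovery_factor` and the array layout (layers along the first axis) are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def recovery_factor(phi_z, s_o, s_oi):
    """Eq. (5): RF = 1 - sum(phi * s_o) / sum(phi * s_oi) on a uniform grid.

    phi_z : porosity of each layer, shape (Nz,)
    s_o   : oil saturation at the reporting time, shape (Nz, Nx)
    s_oi  : initial oil saturation (scalar)
    """
    phi_z = np.asarray(phi_z, dtype=float)
    s_o = np.asarray(s_o, dtype=float)
    phi = phi_z[:, None]                       # broadcast layer porosity along x
    oil_now = np.sum(phi * s_o)                # remaining oil (per unit block volume)
    oil_initial = np.sum(phi * s_oi * np.ones_like(s_o))
    return 1.0 - oil_now / oil_initial
```

Because all blocks share the same size, the block volume does not appear in the function.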

2.2. WAG Efficiency Characterization Using Dimensionless Number

The characteristic mobility ratio $M^{*}$ defined by Nygård and Andersen [47] features the following functional relation:

M^{*} = {(\frac{r_{w}}{M_{w/o}^{*} F_{H} F_{G}^{w/o}} + \frac{1 - r_{w}}{M_{g/o}^{*} F_{H} F_{G}^{g/o}})}^{-1}

(6)

$r_{w}$ is the volume fraction of water in each cycle. Larger $M^{*}$ is associated with lower recovery factor. Characteristic two-phase mobilities $λ_{i}^{*}$ for each phase $i$ were found by averaging their mobility over their mobile saturation interval and used to define two-phase mobility ratios $M_{w/o}^{*}$, $M_{g/o}^{*}$:

M_{w/o}^{*} = \frac{λ_{w}^{*}}{λ_{ow}^{*}}, \quad λ_{w}^{*} = \frac{k_{rw}^{\mathrm{max}}}{μ_{w}} \frac{(1 - \frac{s_{wr}}{s_{w,\mathrm{max}}})}{(n_{w} + 1)}, \quad λ_{ow}^{*} = \frac{k_{row}^{\mathrm{max}}}{μ_{o}} \frac{(1 - \frac{s_{orw}}{s_{ow,\mathrm{max}}})}{(n_{ow} + 1)}

(7)

M_{g/o}^{*} = \frac{λ_{g}^{*}}{λ_{og}^{*}}, \quad λ_{g}^{*} = \frac{k_{rg}^{\mathrm{max}}}{μ_{g}} \frac{(1 - \frac{s_{gr}}{s_{g,\mathrm{max}}})}{(n_{g} + 1)}, \quad λ_{og}^{*} = \frac{k_{rog}^{\mathrm{max}}}{μ_{o}} \frac{(1 - \frac{s_{org}}{s_{og,\mathrm{max}}})}{(n_{og} + 1)}

(8)

$s_{i,\mathrm{max}}$ denotes the saturation of phase $i$ where end-point relative permeability $k_{ri}^{\mathrm{max}}$ is obtained, $n_{i}$ is the Corey exponent and $μ_{i}$ is viscosity. A heterogeneity factor $F_{H}$ was derived from the horizontal permeability $K_{x,j}$ and layer height $h_{j}$ distribution over layers $j = 1 : N_{L}$:

F_{H} = \frac{{\bar{K}}_{x}^{\mathrm{ar}}}{{\bar{K}}_{x}^{\mathrm{ha}}}, \quad {\bar{K}}_{x}^{\mathrm{ar}} = {(\sum_{j=1}^{N_{L}} h_{j})}^{-1} \sum_{j=1}^{N_{L}} h_{j} K_{x,j}, \quad {\bar{K}}_{x}^{\mathrm{ha}} = (\sum_{j=1}^{N_{L}} h_{j}) {(\sum_{j=1}^{N_{L}} \frac{h_{j}}{K_{x,j}})}^{-1}

(9)
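To make Equation (9) concrete, the following sketch computes $F_{H}$ as the ratio of the thickness-weighted arithmetic mean to the thickness-weighted harmonic mean of the layer permeabilities. The function name `heterogeneity_factor` is an illustrative assumption; a homogeneous reservoir gives $F_{H} = 1$ and stronger permeability contrast gives larger values.

```python
import numpy as np

def heterogeneity_factor(k_x, h):
    """Eq. (9): F_H = arithmetic / harmonic thickness-weighted mean of K_x.

    k_x : horizontal permeability of each layer, shape (N_L,)
    h   : height of each layer, shape (N_L,)
    """
    k_x = np.asarray(k_x, dtype=float)
    h = np.asarray(h, dtype=float)
    k_arithmetic = np.sum(h * k_x) / np.sum(h)
    k_harmonic = np.sum(h) / np.sum(h / k_x)
    return k_arithmetic / k_harmonic
```

For example, two equally thick layers with permeabilities 1 and 100 (any consistent unit) give $F_{H} = 50.5 / 1.9802 \approx 25.5$.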

Two-phase gravity numbers were defined using the ratio of the residence time $t_{\mathrm{res}}$ of the injected phase and the two-phase segregation time $t_{\mathrm{seg}}$:

N_{G}^{w/o} = \frac{t_{\mathrm{res}}^{w}}{t_{\mathrm{seg}}^{w/o}}, \quad t_{\mathrm{res}}^{w} = \frac{L_{x} L_{y} \sum_{j=1}^{N_{L}} ϕ_{j} h_{j}}{Q_{w}}, \quad t_{\mathrm{seg}}^{w/o} = \frac{H ϕ}{K_{z}^{\mathrm{ha}} Δρ_{wo} g} (\frac{1}{λ_{w}^{*}} + \frac{1}{λ_{ow}^{*}})

(10)

N_{G}^{g/o} = \frac{t_{\mathrm{res}}^{g}}{t_{\mathrm{seg}}^{g/o}}, \quad t_{\mathrm{res}}^{g} = \frac{L_{x} L_{y} \sum_{j=1}^{N_{L}} ϕ_{j} h_{j}}{Q_{g}}, \quad t_{\mathrm{seg}}^{g/o} = \frac{H ϕ}{K_{z}^{\mathrm{ha}} Δρ_{go} g} (\frac{1}{λ_{g}^{*}} + \frac{1}{λ_{og}^{*}})

(11)

It was found that the role of gravity depended on heterogeneity, and two-phase gravity factors $F_{G}^{w/o}, F_{G}^{g/o}$ accounting for this coupling were introduced:

F_{G}^{w/o} = \frac{1 + a_{1} {(N_{G}^{w/o})}^{a_{2}}}{1 + a_{1} (F_{H} - 1) {(N_{G}^{w/o})}^{a_{2}}}, \quad F_{G}^{g/o} = \frac{1 + a_{1} {(N_{G}^{g/o})}^{a_{2}}}{1 + a_{1} (F_{H} - 1) {(N_{G}^{g/o})}^{a_{2}}}

(12)

Note the unitless tuning parameters $a_{1} = 3$ and $a_{2} = 0.5$. Finally, hysteresis was incorporated into the gas characteristic relative permeability. Land’s parameter $C$ defines a hysteresis residual gas saturation $s_{gr}^{\mathrm{hyst}}$:

s_{gr}^{\mathrm{hyst}} = s_{gr} + \frac{s_{g,\mathrm{max}} - s_{gr}}{1 + C (s_{g,\mathrm{max}} - s_{gr})}

(13)

A further modification according to $r_{w}$ was made:

s_{gr}^{\mathrm{wag}} = s_{gr} (1 - r_{w}) + r_{w} s_{gr}^{\mathrm{hyst}}

(14)

Additionally, the gas relative permeability end point $k_{rg}^{\mathrm{max}}$ in $λ_{g}^{*}$ (see Equation (8)) was reduced due to hysteresis. The reductions were performed individually for the gas–oil mobility ratio and the gas–oil gravity number, first based on the parameter $α$ and heterogeneity factor $F_{H}$ using unitless tuning parameters $b_{1} = 1, b_{2} = 0.5$ and $b_{3} = 10, b_{4} = 2$:

k_{rg,M}^{\mathrm{max,hyst}} = \frac{k_{rg}^{\mathrm{max}}}{1 + b_{1} F_{H}^{b_{2}} α}, \quad k_{rg,N_{G}}^{\mathrm{max,hyst}} = \frac{k_{rg}^{\mathrm{max}}}{1 + b_{3} F_{H}^{b_{4}} α}

(15)

Next, the fraction $r_{w}$ was incorporated according to:

k_{rg,M}^{\mathrm{wag}} = {(\frac{1 - r_{w}}{k_{rg}^{\mathrm{max}}} + \frac{r_{w}}{k_{rg,M}^{\mathrm{max,hyst}}})}^{-1}, \quad k_{rg,N_{G}}^{\mathrm{wag}} = {(\frac{1 - r_{w}}{k_{rg}^{\mathrm{max}}} + \frac{r_{w}}{k_{rg,N_{G}}^{\mathrm{max,hyst}}})}^{-1}

(16)

We then obtained the hysteresis-corrected characteristic gas mobilities $λ_{g,M}^{*}$ and $λ_{g,N_{G}}^{*}$ by replacing $s_{gr}$ with $s_{gr}^{\mathrm{wag}}$ in (8), while the end-point relative permeability $k_{rg}^{\mathrm{max}}$ in (8) was replaced by $k_{rg,M}^{\mathrm{wag}}$ in the gas–oil mobility ratio $M_{g/o}^{*}$ in (8) and by $k_{rg,N_{G}}^{\mathrm{wag}}$ in the gas–oil gravity number $N_{G}^{g/o}$ in (11):

λ_{g,M}^{*} = \frac{1}{μ_{g}} (1 - \frac{s_{gr}^{\mathrm{wag}}}{s_{g,\mathrm{max}}}) \frac{k_{rg,M}^{\mathrm{wag}}}{n_{g} + 1}, \quad λ_{g,N_{G}}^{*} = \frac{1}{μ_{g}} (1 - \frac{s_{gr}^{\mathrm{wag}}}{s_{g,\mathrm{max}}}) \frac{k_{rg,N_{G}}^{\mathrm{wag}}}{n_{g} + 1}

(17)
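The chain of Equations (13) to (17) can be sketched in a few lines, assuming scalar inputs. The function name `gas_mobility_wag` and the argument names `b_coef` and `b_exp` are illustrative assumptions; they stand for the pair $(b_{1}, b_{2}) = (1, 0.5)$ when the mobility ratio is targeted and $(b_{3}, b_{4}) = (10, 2)$ when the gravity number is targeted.

```python
def gas_mobility_wag(k_rg_max, mu_g, n_g, s_gr, s_g_max, r_w, C, alpha, F_H,
                     b_coef=1.0, b_exp=0.5):
    """Hysteresis-corrected characteristic gas mobility, Eqs. (13)-(17).

    Defaults (b_coef, b_exp) = (b1, b2) give lambda*_{g,M};
    pass (10.0, 2.0) = (b3, b4) to obtain lambda*_{g,N_G}.
    """
    # Eq. (13): residual gas saturation under Land trapping
    s_gr_hyst = s_gr + (s_g_max - s_gr) / (1.0 + C * (s_g_max - s_gr))
    # Eq. (14): weighting by the injected water fraction
    s_gr_wag = s_gr * (1.0 - r_w) + r_w * s_gr_hyst
    # Eq. (15): end point reduced by Carlson's alpha and heterogeneity
    k_hyst = k_rg_max / (1.0 + b_coef * F_H**b_exp * alpha)
    # Eq. (16): harmonic weighting by the injected water fraction
    k_wag = 1.0 / ((1.0 - r_w) / k_rg_max + r_w / k_hyst)
    # Eq. (17): characteristic mobility with corrected saturation and end point
    return (1.0 / mu_g) * (1.0 - s_gr_wag / s_g_max) * k_wag / (n_g + 1.0)
```

With $r_{w} = 0$ the function reduces to $λ_{g}^{*}$ of Equation (8), whatever the values of $C$ and $α$, which is the single-phase insensitivity stated in the text.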

Every input parameter is incorporated in the dimensionless number $M^{*}$. Note that during single-phase injection ($r_{w} = 0$ or $1$), the two-phase parameters involving the phase not injected do not affect $M^{*}$. Similarly, hysteresis does not affect $M^{*}$ during single-phase injection.
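As a sketch of how Equations (6) and (12) combine, assuming the two-phase mobility ratios, gravity numbers and $F_{H}$ have already been computed, $M^{*}$ could be evaluated as below. The function names are illustrative assumptions, and the gravity factor is coded exactly as Equation (12) is written above.

```python
def gravity_factor(N_G, F_H, a1=3.0, a2=0.5):
    """Eq. (12): gravity factor coupling the gravity number with heterogeneity."""
    term = a1 * N_G**a2
    return (1.0 + term) / (1.0 + (F_H - 1.0) * term)

def effective_mobility_ratio(r_w, M_wo, M_go, F_H, N_G_wo, N_G_go):
    """Eq. (6): volume-weighted harmonic combination of the two-phase terms."""
    F_G_wo = gravity_factor(N_G_wo, F_H)
    F_G_go = gravity_factor(N_G_go, F_H)
    inverse = (r_w / (M_wo * F_H * F_G_wo)
               + (1.0 - r_w) / (M_go * F_H * F_G_go))
    return 1.0 / inverse
```

Setting `r_w=1.0` removes the gas term entirely, so changing `M_go` or `N_G_go` leaves the result unchanged, in line with the single-phase behavior described above.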

2.3. Workflow

2.3.1. Model Input Parameters

In the first model, MOD1, we take

x_{0} = \log_{10} M^{*}

(18)

as the only input parameter to predict RF. We also consider a machine learning model (MOD2) in which the following eight dimensionless numbers are used as input parameters:

x_{1} = r_{w}, x_{2} = \log_{10} F_{H}, x_{3} = α, x_{4} = \log_{10} C, x_{5} = \log_{10} M_{g/o}^{*}, x_{6} = \log_{10} M_{w/o}^{*}, x_{7} = \log_{10} N_{G}^{w/o}, x_{8} = \log_{10} N_{G}^{g/o}

(19)

These numbers reflect injected fluid fractions ($x_{1}$), heterogeneity ($x_{2}$), hysteresis ($x_{3}, x_{4}$), relative magnitude of fluid mobilities ($x_{5}, x_{6}$) and gravity vs. advective forces ($x_{7}, x_{8}$). They incorporate all the input parameters used in the Eclipse model and the number $M^{*}$. The overall workflow is demonstrated in Figure 2, where the two modeling approaches after the data collection step are indicated. In MOD1, a polynomial regression is performed, while machine learning is used for MOD2. The data and the detailed steps for developing MOD2 are explained below.

2.3.2. Reservoir Simulation Dataset and Model Approaches

In addition to the 1648 WAG and 96 single-phase injection simulations generated by [47], 824 new WAG simulations were performed with new $C$ and $α$ values combined with existing combinations of heterogeneity, density and mobility. In the previous study, $C$ and $α$ were selected primarily to cover no or significant hysteresis. The values $α = 0$ and $C = 1000$ were assigned to points without hysteresis influence from the respective parameters. For MOD1, each simulation allows the calculation of $M^{*}$, which serves as the input paired with the corresponding output RF. The $1648 + 96 + 824 = 2568$ data points were analyzed with MOD1 and correlated using a polynomial expression between RF and $x_{0}$.

For the single-phase injection of water ($r_{w} = 1$), values for $N_{G}^{g/o}$ and $M_{g/o}^{*}$ are not well defined since gas is not injected. The case is similar regarding $N_{G}^{w/o}$ and $M_{w/o}^{*}$ at $r_{w} = 0$ (gas injection). Further, hysteresis parameters $C, α$ should not matter. The insensitivity of $M^{*}$ to the mentioned parameters under single-phase injection was ensured during its derivation. To properly define the input values under single-phase injection for MOD2, the following approach was taken:

Each single-phase data point was duplicated to 16 data points, in which all combinations of high and low values of the four missing parameters were assigned. Specifically, for gas injection (points with $r_{w} = 0$, indexed ‘g’), the following values were set for $x_{3}, x_{4}, x_{6}, x_{7}$:

x_{3,g} = {\bar{x}}_{3,WAG} \pm X σ_{3,WAG}, \quad x_{4,g} = {\bar{x}}_{4,WAG} \pm X σ_{4,WAG}, \quad x_{6,g} = {\bar{x}}_{6,WAG} \pm X σ_{6,WAG}, \quad x_{7,g} = {\bar{x}}_{7,WAG} \pm X σ_{7,WAG}

(20)

where ${\bar{x}}_{i,WAG}$ ($i = 3, 4, 6, 7$) indicate the average of the data point values applied in the WAG cases and $σ_{i,WAG}$ ($i = 3, 4, 6, 7$) the corresponding standard deviations. $X$ is a multiplier. Similarly, for water injection (indexed ‘w’) the following values were set for $x_{3}, x_{4}, x_{5}, x_{8}$:

x_{3,w} = {\bar{x}}_{3,WAG} \pm X σ_{3,WAG}, \quad x_{4,w} = {\bar{x}}_{4,WAG} \pm X σ_{4,WAG}, \quad x_{5,w} = {\bar{x}}_{5,WAG} \pm X σ_{5,WAG}, \quad x_{8,w} = {\bar{x}}_{8,WAG} \pm X σ_{8,WAG}

(21)

The 96 single-phase simulations resulted in $16 \cdot 96 = 1536$ points for the ML model. In total, $1648 + 824 + 16 \cdot 96 = 4008$ data points were then applied in MOD2.
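The duplication in Equations (20) and (21) amounts to enumerating all $2^{4} = 16$ sign combinations for the four undefined inputs. A minimal sketch follows, assuming each data point is stored as a length-8 vector in the order $x_{1}, \dots, x_{8}$ with zero-based positions; the function and constant names are illustrative assumptions.

```python
import itertools
import numpy as np

# Zero-based positions of the undefined inputs (x_i is stored at index i - 1)
GAS_MISSING = (2, 3, 5, 6)     # x3, x4, x6, x7 for gas injection (r_w = 0)
WATER_MISSING = (2, 3, 4, 7)   # x3, x4, x5, x8 for water injection (r_w = 1)

def expand_single_phase(x, missing_idx, mean_wag, std_wag, X=0.5):
    """Return the 16 copies of one single-phase point per Eqs. (20)-(21).

    mean_wag, std_wag : per-input mean and standard deviation over the WAG cases
    X                 : multiplier on the standard deviation
    """
    copies = []
    for signs in itertools.product((-1.0, 1.0), repeat=len(missing_idx)):
        xc = np.array(x, dtype=float)
        for idx, sign in zip(missing_idx, signs):
            xc[idx] = mean_wag[idx] + sign * X * std_wag[idx]
        copies.append(xc)
    return np.vstack(copies)
```

All 16 copies share the same RF, so the regression is shown explicitly that RF does not change with these four inputs under single-phase injection.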

2.3.3. Machine Learning Dataset Preparation

The 2472 WAG cases and 96 single-phase injection cases were both divided randomly between three sets: 70% in the training set, 15% in the validation set and the remaining 15% in the testing set [48]. Single-phase data points within each set were further split into 16 points, as described previously. See Table 1 for an overview of points in the models and datasets.
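A random 70/15/15 split of this kind can be sketched as below. The rounding rule and the seed are assumptions made for illustration, so the resulting set sizes need not match Table 1 exactly.

```python
import numpy as np

def split_indices(n, fractions=(0.70, 0.15, 0.15), seed=0):
    """Randomly partition range(n) into training, validation and test indices."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n)
    n_train = int(round(fractions[0] * n))
    n_val = int(round(fractions[1] * n))
    # The test set takes whatever remains, so the three sets always cover range(n)
    return perm[:n_train], perm[n_train:n_train + n_val], perm[n_train + n_val:]
```

Applying the split to the 96 single-phase simulations before expanding each into 16 points, as described above, keeps all copies of one simulation in the same set.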

2.3.4. Machine Learning Workflow

We apply LSSVM with radial basis kernel (RBK) function and either PSO, GWO, GA or GSA as optimizers. Each optimizer has its own strengths and weaknesses in finding global optima efficiently, and its performance depends on its individual tuning parameters. They are swarm-based algorithms, making use of many potential solutions simultaneously and improving these solutions according to those performing best at a given iteration. The result is a set of differently optimized LSSVM models, as indicated in Figure 2. Detailed explanations of the algorithms are provided in Appendix B and Appendix C, respectively. Each input parameter $x_{i}$ was normalized (denoted $x_{N}$) to a range between −1 and +1 based on the maximum and minimum values of the total dataset.

x_{N} = 2 \frac{x - x_{\mathrm{min}}}{x_{\mathrm{max}} - x_{\mathrm{min}}} - 1

(22)
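Equation (22) is a min-max scaling to the interval [−1, 1]. A minimal sketch, with the hypothetical helper name `normalize`, is:

```python
import numpy as np

def normalize(x, x_min, x_max):
    """Eq. (22): map x from [x_min, x_max] to [-1, 1]."""
    x = np.asarray(x, dtype=float)
    return 2.0 * (x - x_min) / (x_max - x_min) - 1.0
```

For a data matrix with one column per input, `x_min` and `x_max` can be passed as per-column arrays (for example `data.min(axis=0)` and `data.max(axis=0)`), since NumPy broadcasts them across rows.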

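Assuming predefined values of the metaparameters ($σ, γ$), LSSVM provides the function $y(x)$ and its coefficients $α_{i}, b$ that minimize the error between the model predictions and observations of a given dataset (usually the training dataset) for those parameter choices [27]:

y(x) = \sum_{i=1}^{n} α_{i} \exp(- \frac{\| x_{N,i} - x_{N} \|^{2}}{σ^{2}}) + b

(23)

Here, $x_{N,i}$ denotes the normalized input vector of data point $i$ and $n$ the number of data points used for calibration. The coefficients $α_{i}$ are distinct from Carlson’s hysteresis parameter $α$.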
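For fixed metaparameters, calibrating an LSSVM regressor reduces to solving one linear system. The sketch below uses the standard LSSVM formulation, in which $γ$ acts as the regularization weight, together with the kernel of Equation (23). It is an illustrative implementation under those assumptions, not the authors' code, and the function names are hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    """Kernel of Eq. (23): exp(-||a - b||^2 / sigma^2) for all row pairs."""
    d2 = (np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    return np.exp(-np.maximum(d2, 0.0) / sigma**2)

def lssvm_fit(X, y, sigma, gamma):
    """Solve [[0, 1^T], [1, K + I/gamma]] [b, alpha]^T = [0, y]^T."""
    n = X.shape[0]
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = rbf_kernel(X, X, sigma) + np.eye(n) / gamma
    rhs = np.concatenate(([0.0], np.asarray(y, dtype=float)))
    solution = np.linalg.solve(A, rhs)
    return solution[1:], solution[0]          # alpha, b

def lssvm_predict(X_train, alpha, b, sigma, X_new):
    """Evaluate Eq. (23) at the rows of X_new."""
    return rbf_kernel(X_new, X_train, sigma) @ alpha + b
```

A large $γ$ forces the model closer to the training data, while a large $σ$ smooths the kernel; this trade-off is what the optimizers described next are used to tune.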

For given metaparameters, the LSSVM algorithm is calibrated on the training set to provide choices of $α_{i}, b$. The optimizer algorithm is used to search for the metaparameters $σ, γ$ that minimize the model prediction error on the validation dataset (i.e., many LSSVM models are calibrated on the training set and the one giving the best prediction on the validation set is taken as the best). This systematically determines the best choice of metaparameters to avoid over- or under-fitting. The optimized LSSVM models were finally used to predict the data in the testing set. The model performing best overall was selected for further sensitivity analysis. For proper comparison, the optimizers were implemented with the same random initial solution guesses (and velocities if applicable), search space and number of iterations. The optimizer parameters can be found in Table A4.
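To illustrate the outer search loop, the following is a minimal, generic particle swarm optimizer over a box-bounded search space. In the workflow described above, `objective` would take a candidate $(\log γ, \log σ)$, calibrate LSSVM on the training set and return the validation RMSE. The inertia and acceleration coefficients, swarm size and zero initial velocities are assumptions for illustration; the values actually used are those in Table A4.

```python
import numpy as np

def pso_minimize(objective, lower, upper, n_particles=20, n_iter=30,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimal particle swarm search; returns (best position, best value)."""
    rng = np.random.default_rng(seed)
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    pos = rng.uniform(lower, upper, size=(n_particles, lower.size))
    vel = np.zeros_like(pos)
    pbest = pos.copy()                                    # personal best positions
    pbest_val = np.array([objective(p) for p in pos])
    g = int(np.argmin(pbest_val))
    gbest, gbest_val = pbest[g].copy(), pbest_val[g]      # swarm best so far
    for _ in range(n_iter):
        r1 = rng.random(pos.shape)
        r2 = rng.random(pos.shape)
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = np.clip(pos + vel, lower, upper)            # stay inside the box
        vals = np.array([objective(p) for p in pos])
        improved = vals < pbest_val
        pbest[improved] = pos[improved]
        pbest_val[improved] = vals[improved]
        g = int(np.argmin(pbest_val))
        if pbest_val[g] < gbest_val:
            gbest, gbest_val = pbest[g].copy(), pbest_val[g]
    return gbest, gbest_val
```

Fixing `seed` gives every run the same initial swarm, which mirrors the requirement above that the optimizers start from the same initial guesses.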

An advantage of LSSVM is its few (two) metaparameters and the automated optimization of its internal tuning parameters [27]. In comparison, artificial neural networks often need subjective selection of the number of nodes and layers and then comprehensive tuning of a vast number of weights and biases to train the network [49].

The correlation between input variables $x$ and output $y$ (RF) is quantified using Pearson correlation $r_{xy}^{P}$, Spearman rank $r_{xy}^{Sp}$ and distance correlation $r_{xy}^{D}$ coefficients. These indices, respectively, indicate linear correlation, nonlinear monotonic correlation and nonlinear nonmonotonic correlation. The former two range between −1 and 1, while the latter ranges from 0 to 1. For all of them, 0 indicates no correlation. The goodness-of-fit between the calculated RF from MOD1 or MOD2 and the data values of RF were quantified using the coefficient of determination $R^{2}$ and the root mean square error RMSE. The definitions of the mentioned quantities are in Appendix D.
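For orientation, the sketch below implements the commonly used textbook definitions of the Pearson coefficient, the Spearman rank coefficient, $R^{2}$ and RMSE. These are assumed to agree with Appendix D, which is not reproduced here; the Spearman helper ranks by sorting and does not average tied values, and distance correlation is omitted for brevity.

```python
import numpy as np

def pearson(x, y):
    """Linear correlation coefficient between x and y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc, yc = x - x.mean(), y - y.mean()
    return np.sum(xc * yc) / np.sqrt(np.sum(xc**2) * np.sum(yc**2))

def spearman(x, y):
    """Pearson correlation of the ranks (ties are not averaged here)."""
    def rank(v):
        return np.argsort(np.argsort(v)).astype(float)
    return pearson(rank(x), rank(y))

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def rmse(y_true, y_pred):
    """Root mean square error."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return np.sqrt(np.mean(diff**2))
```

A monotonic but nonlinear relation such as $y = x^{2}$ on positive $x$ gives a Spearman coefficient of 1 while the Pearson coefficient stays below 1, which is the distinction drawn in the paragraph above.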

3. Results and Discussion

3.1. Preliminary Dataset Analysis

The range and mean of the data points used in MOD1 and MOD2 (using $X = 0.5$) are listed in Table 2. The use of logarithms made the range of the different variables span a few units rather than orders of magnitude.

The Pearson, Spearman and distance correlation coefficients evaluated between RF and the input parameters were calculated for the two model datasets and are listed in Table 3. For MOD1, the two former coefficients were $\approx -0.94$ and the latter $0.93$. Their magnitude being close to 1 indicates a strong linear correlation between RF and $x_{0} = \log_{10} M^{*}$ and the negative sign indicates that a larger $M^{*}$ reduces RF.

For MOD2, Pearson correlation coefficients are reported in Table 3 for WAG cases only and for all cases when $X = 0.25$, $0.5$ and $1$. Spearman rank and distance correlation were calculated only for $X = 0.5$. Considering the WAG cases, several variables correlate with RF, especially $x_{5}, x_{6}$, which are the logarithms of the gas/oil and water/oil mobility ratios. They feature Pearson coefficients $r_{xy}^{P} \sim -0.5$ to $-0.6$, indicating that when they increase (less favorable mobility ratio towards the oil), RF is reduced. Heterogeneity, represented by $x_{2}$, also correlates with RF, with a lower magnitude of $r_{xy}^{P} \sim -0.27$, indicating that RF generally reduces when the heterogeneity factor increases. The hysteresis parameters $x_{3}, x_{4}$ correlate with RF in opposite ways to each other, with $r_{xy}^{P} \sim 0.15$ for $x_{3}$ and $-0.15$ for $x_{4}$. When $x_{3}$ (i.e., $α$) increases, gas relative permeability is reduced, which should improve RF. Higher $x_{4}$ (i.e., $\log C$) leads to less gas trapping and RF therefore decreases. The gravity numbers feature relatively poor Pearson correlation with RF, with $r_{xy}^{P} \sim 0.08$ for water-oil and $0.0045$ for gas-oil. Similarly, the water volume fraction correlates little with RF, with a slightly negative $r_{xy}^{P} \sim -0.05$. These results could be related to their coupled nature, as is discussed below.

When considering the MOD2 datasets with single-phase data included for different $X$, we note that the magnitude and sign of the different Pearson coefficients are similar to when only the WAG cases were considered. The main difference is that the correlation is somewhat lower, especially for the parameters with unspecified information during single-phase injection. This was expected, since we added points where RF does not vary with changes in these parameters.

When evaluating the dataset with Spearman rank correlation for $X = 0.5$, we observe similar, but slightly lower values than for the Pearson coefficient, except for $x_{8}$, where the correlation doubles, but remains very low. When considering the distance correlation coefficient, however, several input parameters correlate more strongly with recovery, indicating that their relation is nonmonotonic. In particular, the water fraction, $x_{1}$, features a higher distance correlation coefficient, of $0.19$. WAG was expected to perform better than single-phase injection, with RF not changing linearly with $x_{1}$, but peaking. Gravity can be a cause of both low and improved sweep and the gravity numbers $x_{7}, x_{8}$ feature distance correlation coefficients around 0.08, where more impact is attributed to the gas–oil gravity number in particular. Furthermore, the hysteresis parameters $x_{3}, x_{4}$ now seem to correlate more strongly. The three correlation coefficients are similar for $x_{2}$, indicating a relatively linear and monotonic relation, so if all other parameters are constant, increased heterogeneity should reduce recovery.

Note that all the variables in MOD2 feature less correlation than the variable $x_{0}$ in MOD1, since they individually do not contain all the involved system parameters. The aim is for them to provide better predictions when combined.

3.2. Development of MOD1

RF is plotted against $x_{0}$ in Figure 3 and demonstrates a clear correlation where higher $x_{0}$ gives lower RF. There is also significant scatter, meaning a given $x_{0}$ can be associated with a range of RF values. The data were fitted to a third-order polynomial function, given by the blue curve in Figure 3 and Equation (24). A higher order polynomial did not further reduce the RMSE, which means the remaining error was associated primarily with the scatter in the data.

RF = \sum_{i=1}^{4} p_{i} {(x_{0})}^{4-i}, \quad p_{1} = 0.01645, p_{2} = -0.06302, p_{3} = -0.1393, p_{4} = 0.7676 \quad (R^{2} = 0.889; RMSE = 0.0498)

(24)
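Equation (24) can be evaluated directly with the reported coefficients. The sketch below uses the hypothetical helper name `rf_mod1`; the polynomial should only be trusted within the range of $x_{0}$ covered by the dataset (Table 2).

```python
import numpy as np

# Coefficients p1..p4 of Eq. (24), highest power of x0 first
P_MOD1 = [0.01645, -0.06302, -0.1393, 0.7676]

def rf_mod1(M_star):
    """MOD1 estimate of RF from the effective mobility ratio M*."""
    x0 = np.log10(M_star)
    return np.polyval(P_MOD1, x0)
```

For $M^{*} = 1$ ($x_{0} = 0$) the estimate is $p_{4} = 0.7676$, and for $M^{*} = 10$ ($x_{0} = 1$) it is the sum of the four coefficients, 0.58173.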

The performance of the model is also shown by comparing the estimated RF and the data RF in Figure 4a together with a histogram, Figure 4b, of the residual errors (the difference between the estimated RF and the data point RF). The R² = 0.889 is relatively high. As seen in the histogram, the residuals are symmetrically distributed around zero and roughly 95% of the points estimate RF correctly within $\pm 0.1$. The RMSE, which can be considered a more typical error, is 0.050.

3.3. Development of LSSVM Model MOD2

The ML model MOD2 was developed using LSSVM and a dataset assuming

X = 0.5

. The best LSSVM model was determined using different approaches. First, an arbitrary choice of metaparameters

(σ, γ) = (1, 1)

was used. Next, LSSVM was applied with the different optimization algorithms, PSO, GA, GWO and GSA, to systematically find the best metaparameters. As previously mentioned, for any combination of the metaparameters, LSSVM models were fitted to the training set and used to forecast the validation set. The metaparameters that resulted in the best performance in the validation set, after using a given optimizer algorithm, determined the best model. The test set was then forecasted.

In Figure 5, the performance of the different algorithms is illustrated as a function of the iterations performed. R² and RMSE for the validation set are plotted for the best solution at the given iteration, together with the corresponding values of

\log γ

and

\log σ

in plots a to d, respectively. The same initial solutions and number of iterations were applied in all the algorithms (different colors). Two different initializations were applied for robustness (dashed and full lines).

In all cases, a high

R^{2} \approx 0.996

and low

RMSE \approx 0.009

were obtained after 30 iterations for all the algorithms and both starting points, although GSA deviated from initially good solutions and converged slowly or to inferior solutions. Furthermore, GA did not seem to produce results as good as those of PSO and GWO. The two algorithms, PSO and GWO, exhibited very similar values of

\log γ \approx 5.5

and

\log σ \approx 0.3

and the lowest error, indicating that they were better able to find the global optimum. Notably, the values of

\log γ

and

\log σ

varied significantly during the iterations, but mostly exhibited very good performance. This may have been due to the ability of LSSVM to tune its internal parameters

α_{i}

and

b

for any given

γ, σ

.

The best metaparameters obtained during the 30 iterations are listed in Table 4, considering all four algorithms and both initializations. The corresponding metrics (R² and RMSE) were calculated on the training, validation and testing sets. All the optimized models performed better than the algorithm with the arbitrarily preset metaparameters, although this choice also performed well, with

R^{2}

greater than 0.969 on all three sets. The optimized models exhibited very consistent performance in the three datasets, with

R^{2} \approx 0.999

in the training set,

\approx 0.996

in the validation set and

\approx 0.992

on the testing set. The difference in

R^{2}

was in the fourth digit for the first two sets and the third digit in the latter set. The RMSE was around

0.006

on the training set,

0.009

on the validation set and

0.015

on the test set, with GSA standing out with the highest RMSE. As final metaparameter values in the optimized model, we took an average of the four similar results from the PSO and GWO runs with two significant digits. Calculating the RMSE and R² metrics on the datasets confirmed that the performance with these values was still optimal (see Table 4). The LSSVM model with these parameters is referred to as MOD2 in what follows. Note especially that MOD2 is capable of predicting unseen single-phase data and thus accounts for the physics introduced during the modification of the dataset.

The RMSE and R² were calculated with MOD2 for the total dataset as 0.0080 and 0.9976, respectively. These metrics are greatly improved compared to MOD1, which featured a corresponding RMSE of 0.0498 and an R² of 0.889. The calculated (with MOD2) and observed RF data are plotted against each other for the three datasets in Figure 6. For all three datasets, there is little scatter around the perfect match line. The residual errors were calculated for each datapoint in the full dataset and the results are plotted as a histogram in Figure 7. Approximately 90% of the data feature errors in the estimated RF of less than

0.01

, and 95% of the data feature errors less than 0.02.
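The metrics quoted above can be reproduced from any pair of observed and predicted RF vectors. The following is a minimal sketch assuming the standard definitions of RMSE and the coefficient of determination; the function names and the small sample arrays are hypothetical and are not taken from the dataset.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root mean square error between observed and predicted values."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_pred - y_true) ** 2)))


def r_squared(y_true, y_pred):
    """Coefficient of determination, 1 - SS_res / SS_tot (standard definition assumed)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return float(1.0 - ss_res / ss_tot)


def fraction_within(y_true, y_pred, tol):
    """Fraction of points whose absolute residual is smaller than tol."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(y_pred - y_true) < tol))


if __name__ == "__main__":
    # Hypothetical observed and predicted RF values, for illustration only.
    rf_obs = [0.20, 0.40, 0.60, 0.80]
    rf_est = [0.21, 0.39, 0.60, 0.83]
    print(rmse(rf_obs, rf_est), r_squared(rf_obs, rf_est))
    print(fraction_within(rf_obs, rf_est, tol=0.02))
```

The `fraction_within` helper corresponds to statements of the kind "95% of the data feature errors less than 0.02", evaluated on the residuals shown in the histogram.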

Partial derivatives with respect to each normalized variable,

\frac{\partial y}{\partial x_{N i}}

, were calculated for each data point using MOD2 and histograms were created for each variable, as shown in Figure 8. The derivatives were calculated numerically and two choices of

Δ x_{N} = 5 \cdot 10^{- 2}

and

10^{- 3}

were used. The two choices produced practically identical histograms, suggesting that the optimized LSSVM function did not suffer from oscillations (a sign of over-fitting). For each variable, a large fraction of the points featured positive and negative derivatives. Hence, changing the variable can affect RF positively or negatively, indicating coupling and room for finding optimal conditions.
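The step-size check described above can be sketched as follows. The text does not state which difference scheme was used, so a central difference is assumed here; `model` stands for any callable that maps an array of normalized inputs to predictions, and the toy function used in the demonstration is hypothetical rather than the trained MOD2.

```python
import numpy as np


def partial_derivatives(model, X, dx):
    """Central-difference estimate of d(model)/d(x_j) at every row of X.

    model: callable taking an (n, p) array and returning an (n,) array.
    Returns an (n, p) array of derivatives, one column per input variable.
    """
    X = np.asarray(X, dtype=float)
    grads = np.empty_like(X)
    for j in range(X.shape[1]):
        step = np.zeros(X.shape[1])
        step[j] = dx  # perturb only variable j
        grads[:, j] = (model(X + step) - model(X - step)) / (2.0 * dx)
    return grads


if __name__ == "__main__":
    # Hypothetical smooth stand-in for the trained model (assumption).
    toy = lambda Z: np.sin(Z[:, 0]) + Z[:, 1] ** 2
    X = np.array([[0.1, 0.5], [0.7, -0.2], [1.3, 0.9]])
    g_large = partial_derivatives(toy, X, dx=5e-2)
    g_small = partial_derivatives(toy, X, dx=1e-3)
    # A small gap between the two estimates suggests a smooth, non-oscillating function.
    print(np.max(np.abs(g_large - g_small)))
```

Collecting one column of the returned array over all data points gives the values that would be binned into a histogram for that variable.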

For

\frac{\partial y}{\partial x_{N 1}}

we see positive and negative values, which is reasonable, since RF should be higher for WAG than single-phase injection.

\frac{\partial y}{\partial x_{N 5}}

and

\frac{\partial y}{\partial x_{N 6}}

are both dominated by negative values, since increasing the mobility ratio between gas and oil or between water and oil, respectively, should reduce RF. Higher water–oil gravity segregation is considered negative for RF, with

\frac{\partial y}{\partial x_{N 7}}

mainly negative. On the other hand, gas–oil gravity segregation is considered mainly positive for RF with a majority of points having

\frac{\partial y}{\partial x_{N 8}}

positive. This could be attributed to the better sweep of low-permeable layers in heterogeneous cases. Hysteresis appears to benefit recovery, as seen by a majority of positive

\frac{\partial y}{\partial x_{N 3}}

, although the effect of

\frac{\partial y}{\partial x_{N 4}}

seems to be equally negative and positive.

3.4. Sensitivity Analyses with Optimized LSSVM Model MOD2

The calibrated model, MOD2, was much better at predicting RF than MOD1 and was therefore pursued in the sensitivity analysis. Below, we present contour plots showing RF as a function of different input variables, while keeping the others constant. The parameters are kept within the total dataset range (see Table 2) in order to ensure model validity.

3.4.1. Variation of Oil Viscosity

Oil viscosity can vary greatly from one reservoir to another. It proportionally impacts mobility ratios

M_{w / o}^{*}

and

M_{g / o}^{*}

in Equations (7) and (8), represented by

x_{6}

and

x_{5}

. For low oil mobilities, the gravity numbers

N_{G}^{w / o}

and

N_{G}^{g / o}

(represented by

x_{7}

and

x_{8}

) increase proportionally with oil viscosity but are less dependent if water or gas feature mobility that is similar to or lower than that of oil (see Equations (10) and (11)). For simplicity, we assume they are proportional. We vary the oil viscosity by 2.0 orders of magnitude, which is less than the smallest range of the four dimensionless numbers (2.3 for

x_{6}

), as seen in Table 2.

Four cases are defined in Table 5 with low or high heterogeneity (low or high

x_{2}

), and a low or high degree of hysteresis (low

x_{3}

and high

x_{4}

and opposite, respectively). For each of these cases,

RF

is plotted as a function of

x_{1}

(the water fraction) and

x_{5}

(the gas–oil mobility ratio), which here represents different oil viscosities (see Figure 9). From the figure, we observe that:

-: Optimal RF values were mainly obtained at an intermediate water fraction $0 < x_{1} < 1$ (consider any line parallel with the x-axis), suggesting that WAG gives higher RF than single-phase injection. Cases with low hysteresis and favorable mobility ratios seem to give similar RF for water injection and WAG (although WAG with a low water fraction seems optimal) (see Figure 9a,b (low and high heterogeneity)).
-: The advantage of WAG over single-phase injection was most clear when hysteresis was significant (see Figure 9c,d). The best water fraction produced RF up to 0.3 units higher than the worst fraction. This strong impact was mainly at low oil viscosity (low $x_{5}$ ) with optimal water fraction around 0.5–0.6. For higher oil viscosity or lower heterogeneity cases, WAG was in many cases only marginally better (~0.05 units) than the best single-phase injection.
-: Increased oil viscosity reduced RF for a given water fraction (follow any line parallel with the y-axis). This was dominant over the WAG fraction at high viscosities, except for the highly heterogeneous cases with high hysteresis (Figure 9d). This demonstrates the benefit of WAG in heterogeneous formations and that hysteresis is an important contributor.
-: For a given heterogeneity (low or high), increased hysteresis improved RF (compare Figure 9c,d (high hyst) with Figure 9a,b (low hyst)). This was related to the improved gas–oil mobility ratio and reduced gravity segregation, which improves volumetric sweep. The optimal water fraction shifted to more central values, since both phases are needed for hysteresis.
-: For a given hysteresis state, increased heterogeneity reduced RF, especially for cases with less viscous oil (compare Figure 9b,d (high het) with Figure 9a,c (low het)). For high hysteresis cases, increased heterogeneity increased RF in cases with more viscous oil.

To better understand the relation between viscosity, heterogeneity and hysteresis, we plotted RF as a function of

x_{2} = \log F_{H}

and

x_{5} = \log (M_{g / o}^{*})

for

x_{1} = 0.5

(WAG injection with equal volume fractions of gas and water) for the two hysteresis cases in Figure 10. Each value of

x_{5}

represents a fixed oil viscosity and the curves cover the same viscosity range as before. We observed that:

-: For low hysteresis (Figure 10a), RF was very sensitive to heterogeneity for low oil viscosities and increased heterogeneity reduced RF. For high viscosity, RF changed little with heterogeneity.
-: With significant hysteresis (Figure 10b), low-viscosity cases produced reduced RF at higher heterogeneity, while high-viscosity cases produced increased RF.

3.4.2. Variation of Well Distance, Injection Rate or Density Difference

The distance between wells can vary from a dense pattern of a few hundred meters onshore to ~1000 m offshore. For fixed injection rates, a longer well distance

L_{x}

proportionally increases the residence time and, hence, the gravity numbers (see Equation (10)), represented by

x_{7}

and

x_{8}

. Similarly, increasing the injection rates of water

Q_{w}

and gas

Q_{g}

equally reduces the residence times and the gravity numbers. Increased density differences reduce the segregation time and increase the gravity numbers. If the height is varied but the injection rate is the same, we note that both segregation time and residence time change equally and there is no net change in the gravity numbers. Varying the aforementioned parameters does not affect the variables

x_{1}

to

x_{6}

; we can thus investigate cases in which they are constant and only the gravity numbers change.

We plotted RF as a function of injected water fraction

x_{1}

and log gravity number (equal values of

x_{7}

and

x_{8}

). We investigated the role of mobility ratio, heterogeneity and hysteresis one by one. The different cases are listed in Table 6. The gravity numbers varied equally by 2.5 orders of magnitude.

When low heterogeneity

x_{2} = 0

is considered (Figure 11a),

RF

stays fairly constant at low

N_{g}

(when the impact from gravity is negligible) and decreases when

N_{g}

is large due to gravity segregation and reduced vertical sweep. At high heterogeneity (

x_{2} = 1

) in Figure 11b, RF is generally lower, but increases significantly with increases in the gravity number. Gravity therefore exerts a positive effect as more of the low-permeable layers are swept by gravity drainage into the highly permeable layers [47].

A relatively heterogeneous case (

x_{2} = 0.8

) is considered where the mobility ratios are either favorable, Figure 12a, or unfavorable, Figure 12b. In both cases, an increased gravity number improves RF, but the effect is more pronounced in the favorable mobility ratio case. In the unfavorable mobility case, gravity exerts little impact until the log gravity number exceeds −3. RF is generally higher in favorable mobility ratio cases compared to corresponding unfavorable mobility ratio cases.

In a relatively uniform case (

x_{2} = 0.3

) with intermediate mobility ratios (

x_{5} = x_{6} = 1.5

), hysteresis is varied. At low hysteresis, Figure 13a, increased gravity numbers increase RF moderately towards an optimal gravity number. At high hysteresis, Figure 13b, the optimal gravity number occurs at a lower value (for a given injected fraction). The peak can be related to improved sweep in low mobility layers, which becomes dominated by gravity segregation at the highest gravity numbers.

3.4.3. Handling Single Phase Data

The model MOD2 was trained to provide the same

RF

during single-phase injection when varying input variables related to hysteresis and the phase not injected (for example, gas during water injection). This was performed by generating points with different input values for parameters that should not exert an influence, but with the same output. To check how effectively this was captured by the calibrated model, we ran cases in which the hysteresis parameters

x_{3}, x_{4}

and mobility ratio parameters

x_{5}, x_{6}

were varied individually. RF was plotted against the relevant variable and

r_{w}

ranging from gas to water injection. The input parameters are listed in Table 7 and the results are shown in Figure 14.

The variation of the hysteresis parameters

x_{3}

and

x_{4}

, in Figure 14a,b, respectively, produces relatively constant RF with gas injection

r_{w} = 0

(RF~0.45) and water injection

r_{w} = 1

(RF~0.30), although the variation of

x_{3}

during gas injection produces a wider range (RF = 0.30–0.45). The levels of RF differ, as is expected, since gas and water injection perform differently. Varying the gas-oil mobility ratio

M_{g / o}^{*}

(via

x_{5}

in Figure 14c) produces much less change in RF with water injection (RF~0.3) than gas or WAG injection. Similarly, varying the water–oil mobility ratio

M_{w / o}^{*}

(via

x_{6}

in Figure 14d) produces much less change in RF with gas injection (RF~0.45) than with water or WAG injection.

3.5. Application to a 3D Model

The dataset used to train MOD1 and MOD2 was based on a 2D layered model. By obtaining effective parameters from 3D reservoir models, we can predict RF from MOD1 and MOD2. An artificial 3D heterogeneous model was considered with curvature, faults and three layers (see Figure 15). The average permeability, average porosity and layer thickness are listed in Table 8. The vertical permeability was half of the horizontal permeability. An injector and producer were placed at a distance of 1500 m and the pore volume was based on a width of 750 m. The RF was calculated after the injection of 1.5 PV, assuming five injected fractions (

r_{w}

equal to 0, 0.33, 0.5, 0.67 and 1), two oil viscosities (30 and 110 cP) and low and high hysteresis (see values of

x_{3}

and

x_{4}

).

In Figure 16, we plotted RF as a function of

x_{1} = r_{w}

for the four cases, calculated with the 3D model, MOD1 and MOD2. In these examples, RF with water injection (0.13 and 0.35 for high and low oil viscosity) is higher than with gas injection (0.09 and 0.22) and RF is lower with more viscous oil. When hysteresis is high, WAG exhibits the best performance out of all the models (the RF peaks at

0 < r_{w} < 1

). When hysteresis is low, the 3D model features similar RF for a high WAG fraction to water injection, while MOD1 indicates water injection as optimal and MOD2 still clearly supports WAG. MOD1 predicts a level of RF and change in RF that are more similar to those seen in the 3D model than MOD2. MOD2 predicts the level of RF in low hysteresis relatively well, but appears very sensitive to adding hysteresis. This is also seen through a difference in the single-phase injection RF for the same oil viscosity: about 0.2 units for water injection and for gas injection with low amounts of viscous oil, but, more reasonably, 0.03 units for gas injection with high amounts of viscous oil. This indicates that this region of the model could be better calibrated. The water injection points in this case could be insufficiently near the single-phase points in the dataset. Furthermore, we do not expect MOD1 and MOD2 to predict the 3D model behavior identically as the geometries are not the same.

4. Conclusions

In this study, we interpreted a dataset of ~2500 points generated from single-phase and WAG injection reservoir model simulations to predict the recovery factor, RF. Two modeling approaches were selected. In MOD1, a universal dimensionless number

M^{*}

derived from Nygård and Andersen [47] was selected as the single input variable and correlated by a polynomial expression. In MOD2, eight dimensionless numbers were used as input variables. Both choices included all the relevant input parameters to run the reservoir simulations. MOD2 was developed using LSSVM and optimized based on the best results from PSO, GA, GWO and GSA. The overall conclusions to this work can be summarized as follows:

-: We demonstrated that it is possible to predict the recovery factor during single-phase and WAG injection.
-: The LSSVM model optimized by GWO or PSO performed better than when optimized by GA or GSA.
-: MOD2 with eight input variables clearly performed better than MOD1 with one input. Based on the total dataset, the RMSE and R² were 0.0080 and 0.998 for MOD2 and 0.050 and 0.889 for MOD1, respectively.
-: The physics-based training of MOD2 was applied successfully. Single-phase injection data points were duplicated using different values in the input variables that should not affect RF, while keeping the same values for the relevant input variables and the output. The model correctly displayed little response to the irrelevant variables, but not for all conditions. Improvements could be made by adding more of these points or by training the model to include such constraints via an added penalty term in the objective function. MOD1 was analytically independent of these variables during single-phase injection.
-: Plotting histograms of partial derivatives of RF showed that for most input variables, increasing them would increase RF for some conditions, but reduce RF under others, demonstrating coupling in the data.
-: The best model (MOD2) predicted that under identical conditions, an optimal injected WAG fraction existed that outperformed single-phase injection (water or gas). The benefit of WAG was much clearer when gas relative permeability hysteresis was significant.
-: The mobility ratios were important input variables. Increased values tended to reduce RF.
-: The roles of gravity numbers, heterogeneity and hysteresis were coupled. Strong gravity effects reduced RF in low-heterogeneity cases, but improved RF in heterogeneous cases.

Finally, some limitations of the study and recommendations should be mentioned. Some parameters were not varied in the dataset and their role can therefore not be predicted by the model. These include the reservoir dip angle, capillary forces, starting WAG in tertiary mode (with some period of gas or water injection first), gas miscibility and tapering (changing the WAG ratio with time). Furthermore, the heterogeneity of the model was mainly described by one parameter, although porosity and vertical permeability appear in other dimensionless numbers. It could also matter how the heterogeneity appears, e.g., whether permeability increases upwards or downwards. We note that several parameters that were not varied are included in a physically meaningful manner in dimensionless numbers that were varied. Thus, new values of the Corey parameters, the vertical-to-horizontal permeability ratio and reservoir layer configurations are accounted for. The proposed methodology can be applied to predict the performance of other EOR techniques as well, but requires a similar development of representative dimensionless numbers and parameters capturing the EOR effect.

It is recommended to explore the potential of physics-based machine learning [50,51] in combination with dimensionless numbers describing complex systems, as was considered in this study. The methodology of modifying the dataset as described offers the advantage of applying ML algorithms in their standard form. On the downside, the dataset is enlarged and the physics are added around the specific datapoints, not as an inherent part of the model.

Author Contributions

Conceptualization, P.Ø.A.; methodology, P.Ø.A.; software, P.Ø.A.; formal analysis, P.Ø.A.; investigation, P.Ø.A., J.I.N., A.K.; data curation, P.Ø.A., J.I.N., A.K.; writing—original draft preparation, P.Ø.A., A.K.; writing—review and editing, P.Ø.A.; visualization, P.Ø.A.; Supervision, P.Ø.A.; project administration, P.Ø.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data can be provided upon request from the corresponding author.

Acknowledgments

Andersen acknowledges the Research Council of Norway and the industry partners, ConocoPhillips Skandinavia AS, Aker BP ASA, Vår Energi AS, Equinor ASA, Neptune Energy Norge AS, Lundin Norway AS, Halliburton AS, Schlumberger Norge AS and Wintershall DEA, of The National IOR Centre of Norway, for support.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

Roman
$b$	LSSVM constant
$C$	Land’s trapping parameter
$F_{H}$	Heterogeneity multiplier
$F_{G}$	Gravity multiplier
$h_{i}$	Layer height, m
$k_{r i}$	Relative permeability
$k_{r i}^{m a x}$	Relative permeability endpoints
$K_{x}, K_{z}$	Horizontal and vertical absolute permeability, m²
$L_{x}$	Distance from injector to producer, m
$L_{y}$	Width of reservoir, m
$L_{z}$	Total height of reservoir, m
$M$	Mobility ratio
$M_{g / o}^{*}, M_{w / o}^{*}$	Effective mobility ratios between gas-oil and between water-oil
$M^{*}$	Effective three phase mobility ratio accounting for all mechanisms
$n_{i}$	Corey exponents, -
$N_{p}$	Number of particles
$N_{i t}$	Number of PSO iterations
$N_{G}$	Gravity number, -
$R^{2}$	Coefficient of determination, -
$r_{x y}$	Pearson correlation coefficient between vectors x and y, -
r_w	Water volume fraction in a WAG cycle, -
$R F$	Recovery factor, -
$s_{i}$	Phase saturation, -
$s_{i r}$	Residual phase saturation, -
$t$	Time, seconds
$x$	Horizontal direction towards producer, m
$x_{i}$	Input vector, -
$X$	Standard deviance multiplier, -
$y$	LSSVM output / RF, -
$z$	Vertical direction downwards, m
Greek
$α$	Carlson hysteresis parameter
$α_{i}$	LSSVM coefficients
$γ$	Regularization coefficient
$Δ ρ$	Density difference, kg/m³
$λ_{i}$	Phase mobility ${(Pa \cdot s)}^{- 1}$
μ_i	Viscosity, Pa⋅s
ρ_i	Phase density, kg/m³
$σ$	RBK width parameter
$ϕ$	Porosity
$ϕ_{1}, ϕ_{2}$	Acceleration constants
$ω$	Damping factor
Indices
*	characteristic value,
$a r i t$	arithmetic
$g$	gas
$G$	gravity
$h a r m$	harmonic
$i$	phase
$j$	layer
$o$	oil
$r e s$	residence
$i n i t$	initial reservoir conditions
$s e g$	segregation
$T$	total
$w$	water
Abbreviations
EOR	Enhanced oil recovery
LSSVM	Least squares support vector machine
PSO	Particle swarm optimization
RMSE	Root mean square error
WAG	Water alternating gas

Appendix A. Reservoir Model Parameters

Table A1. Rock/grid properties and operational parameters. $N$ denotes the number of cells in each direction, $L$ the respective lengths and $Q$ the volumetric rate.

$N_{x}$	100	$L_{x}$	1000 m	$ϕ_{j}$	0.30	$Q_{w}$	1014.6 m³/d	Half cycle duration	45 d
$N_{y}$	1	$L_{y}$	100 m	$h_{j}$	3 m	$Q_{g}$	1014.6 m³/d	Total injection volume, PVs	1.5 PVs
$N_{z}$	81	$L_{z}$	81 m

Table A2. Reservoir flow properties in terms of relative permeability end points, Corey exponents, initial and residual saturations.

$k_{r o w}^{m a x}$	0.25	$n_{o w}$	2	$S_{o i}$	0.842
$k_{r o g}^{m a x}$	0.25	$n_{o g}$	2	$S_{w i} = S_{w r}$	0.158
$k_{r w}^{m a x}$	0.05	$n_{w}$	2	$S_{g i} = S_{g r}$	0.00
$k_{r g}^{m a x}$	0.005	$n_{g}$	2	$S_{o r w}$	0.20
				$S_{o r g}$	0.10

Table A3. Specification of model heterogeneities. Each model had 9 layers with permeability distributed as specified. It was assumed that vertical and horizontal permeabilities were equal in each layer: $K_{z, j} = K_{x, j}$.

	$K_{x} [mD]$
Layer 1 (top)	300	300	500	1000
2	300	100	50	20
3	300	900	500	1000
4	300	300	50	20
5	300	100	500	1000
6	300	900	50	20
7	300	300	500	1000
8	300	100	50	20
9 (bottom)	300	900	500	1000
F_H	1.0	2.1	3.0	12.9

Appendix B. Least Squares Support Vector Machines (LSSVM)

The support vector machine (SVM) algorithm was developed by Vapnik [25] and used to solve classification problems by building hyperplanes in multidimensional spaces that separated data into classes. Its application was extended to regression. The least squares support vector machine, or LSSVM, is a modification of SVM introduced by Suykens and Vandewalle [26]. The LSSVM regression algorithm is outlined below.

Consider a finite dataset with

n

points

D = \{(x_{1}, y_{1}), \dots, (x_{n}, y_{n})\}

, where the input

x_{i} \in R^{p}

,

p

being the number of input variables (in our case 1 or 8) and the output

y_{i} \in R

. The regression function is expressed as [27]:

f (x) = w^{T} φ (x) + b,

(A1)

where

φ

is a higher dimensional function and

w

is a weight vector that combines the contributions of each element of

φ

to a scalar. Each output measurement

y_{i}

is by definition equal to the regression plus the error

e_{i}

:

y_{i} = f (x_{i}) + e_{i}, (i = 1, \dots, n)

(A2)

The LSSVM algorithm aims to minimize the objective function

J

described as follows:

\min_{w, e} J (w, e) = \frac{1}{2} w^{T} w + \frac{1}{2} γ \sum_{i = 1}^{n} e_{i}^{2}

(A3)

γ

is called the regularization coefficient and its magnitude determines which of the two terms is minimized more. The error equations are treated as equality constraints:

y_{i} = w^{T} φ (x_{i}) + b + e_{i}, (i = 1, \dots, n)

(A4)

Solving (A3) and (A4) simultaneously can be transformed to the problem of finding the saddle point of the Lagrange function

L

which incorporates

J

and the equality constraints:

L (w, b, e; α) = J (w, e) - \sum_{i = 1}^{n} α_{i} \{w^{T} φ (x_{i}) + b + e_{i} - y_{i}\}

(A5)

with Lagrange multipliers

α_{i}

. The conditions for optimality are found by setting partial derivatives equal to zero:

\frac{d L}{d w} = 0 \to w = \sum_{i = 1}^{n} α_{i} φ (x_{i})

(A6)

\frac{d L}{d b} = 0 \to \sum_{i = 1}^{n} α_{i} = 0

(A7)

\frac{d L}{d e_{i}} = 0 \to α_{i} = γ e_{i}, (i = 1, \dots, n)

(A8)

\frac{d L}{d α_{i}} = 0 \to w^{T} φ (x_{i}) + b + e_{i} - y_{i} = 0, (i = 1, \dots, n)

(A9)

We can eliminate

e_{i}

and

w

from the above set of equations to obtain the remaining linear equations for

α_{i}

and

b

:

\sum_{i = 1}^{n} α_{i} = 0

(A10)

b + \sum_{k = 1}^{n} α_{k} φ {(x_{k})}^{T} φ (x_{i}) + \frac{1}{γ} α_{i} = y_{i}

(A11)

By applying Mercer’s condition, the product

φ {(x_{i})}^{T} φ (x_{j})

is replaced by a kernel function

K (x_{i}, x_{j})

:

φ {(x_{i})}^{T} φ (x_{j}) = K (x_{i}, x_{j}), \quad i, j = 1, \dots, n

(A12)

We can then solve for

α_{i}

and

b

by solving the matrix form of (A10) and (A11):

[\begin{matrix} 0 & 1 & \dots & 1 \\ 1 & K (x_{1}, x_{1}) + \frac{1}{γ} & \dots & K (x_{1}, x_{n}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & K (x_{n}, x_{1}) & \dots & K (x_{n}, x_{n}) + \frac{1}{γ} \end{matrix}] [\begin{matrix} b \\ α_{1} \\ ⋮ \\ α_{n} \end{matrix}] = [\begin{matrix} 0 \\ y_{1} \\ ⋮ \\ y_{n} \end{matrix}]

(A13)

Combining (A4) with (A6) and (A12), the final form of the LSSVM regression function is given by:

y (x) = \sum_{i = 1}^{n} α_{i} K (x_{i}, x) + b

(A14)

Different choices of kernel function can be made. The radial basis kernel (RBK) function was selected:

K (x_{i}, x) = \exp (- \frac{| | x_{i} - x | |^{2}}{σ^{2}})

(A15)

| | x_{i} - x | |

denotes the Euclidean distance between vectors

x_{i}

and

x

, while

σ

is the width parameter.

The choice of the metaparameters

σ

and

γ

determines the LSSVM algorithm performance.

σ

controls how rapidly the function can vary around the training data points

x_{i}

. For very small

σ

, the function equals the constant

b

between the points

x_{i}

, while it matches

y_{i}

at every

x_{i}

of the training set. This results in the very poor prediction of new points. Large

σ

linearizes the function (a straight line for a scalar input variable). Intermediate

σ

are hence expected to capture non-linear trends.

γ

controls how much weight is placed on minimizing the mismatch compared to minimizing the magnitude of the nonlinear terms. A very low

γ

minimizes the coefficients of the nonlinear terms to zero and provides a constant function, equal to

b

. A very high

γ

minimizes the mismatch between the function and the training set (between

y (x_{i})

and

y_{i}

) but allows it to be more nonlinear.
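The linear system (A13), the regression function (A14) and the radial basis kernel (A15) can be put together in a few lines. The sketch below is a minimal NumPy illustration, not the implementation used in the study; the function names, the toy one-dimensional data and the chosen values of $γ$ and $σ$ are assumptions made for the example.

```python
import numpy as np


def rbf_kernel(A, B, sigma):
    """RBK of (A15): exp(-||a - b||^2 / sigma^2) for all row pairs of A and B."""
    sq = (np.sum(A ** 2, axis=1)[:, None]
          + np.sum(B ** 2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    return np.exp(-np.maximum(sq, 0.0) / sigma ** 2)  # clip tiny negative round-off


def lssvm_fit(X, y, gamma, sigma):
    """Build and solve the linear system (A13); returns (b, alpha)."""
    n = X.shape[0]
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0   # first row enforces sum(alpha) = 0, Equation (A10)
    A[1:, 0] = 1.0   # column multiplying the constant b
    A[1:, 1:] = rbf_kernel(X, X, sigma) + np.eye(n) / gamma
    rhs = np.concatenate(([0.0], y))
    sol = np.linalg.solve(A, rhs)
    return sol[0], sol[1:]


def lssvm_predict(X_train, alpha, b, sigma, X_new):
    """Evaluate (A14): y(x) = sum_i alpha_i K(x_i, x) + b."""
    return rbf_kernel(X_new, X_train, sigma) @ alpha + b


if __name__ == "__main__":
    # Toy 1D data (hypothetical, not from the paper's dataset).
    X = np.linspace(0.0, 1.0, 25)[:, None]
    y = 0.5 + 0.3 * np.sin(2.0 * np.pi * X[:, 0])
    for gamma in (1e-3, 1e5):
        b, alpha = lssvm_fit(X, y, gamma=gamma, sigma=0.3)
        y_fit = lssvm_predict(X, alpha, b, 0.3, X)
        print(f"gamma={gamma:g}: b={b:.4f}, "
              f"max|y - y_fit|={np.max(np.abs(y - y_fit)):.2e}")
```

With the small $γ$ the fitted function should stay close to the constant $b$, while the large $γ$ should follow the training points closely, in line with the description of the metaparameters above. A dense solve scales cubically with the number of training points, which is one reason this is only a sketch for small datasets.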

Appendix C. Optimization Algorithms

Optimization algorithms are applied to find the optimal combination of LSSVM metaparameters, as represented by the vector

β = (\log_{10} γ, \log_{10} σ) \in R^{2}

. The applied parameters common and specific to the algorithms are listed in Table A4.

Table A4. Optimization algorithm parameters. No algorithm specific parameters were required for GWO.

Common			PSO			GA
# particles /chromosomes/wolves	$N_{p}$	$20$	Acceleration constants	$ϕ_{1},$ $ϕ_{2}$	$1.5,$ $1.5$	Mutation rate	$μ_{r}$	$0.15$
# variables/genes	$N_{v a r}$	$2$	Damping factor	$ω$	$0.8$	Mutation factor	$μ_{f}$	$0.1$
# iterations	$N_{i t}$	$30$	GSA			# elite chromosomes	$N_{e l i t}$	$2$
Search range variable 1	$β_{1}^{m i n},$ $β_{1}^{m a x}$	$- 2,$ $+ 8$	Initial gravity	$G_{0}$	$2$	GWO
Search range variable 2	$β_{2}^{m i n},$ $β_{2}^{m a x}$	$- 3,$ $+ 3$	Gravity reduction factor	$α$	$5$	-
Initial velocity range			Small constant	$ε$	$10^{- 4}$

Appendix C.1. Particle Swarm Optimization (PSO)

PSO was developed by Kennedy and Eberhart [52] and can be described as follows [53]:

a.: Generate an initial set of $N_{p}$ ‘particles’, which are random solution vectors $β_{n}^{0} (n = 1, \dots, N_{p})$ , all in $R^{2}$ . The entire set of particles is called the swarm.

$β_{n, r}^{0} = U_{n, r} (β_{r}^{m i n}, β_{r}^{m a x}), (r = 1, 2) .$

(A16)
The indices $n$ and $r$ refer, respectively, to the particle and to the parameter (component) of the solution vector, while $U_{n, r}$ denotes a random number drawn from the uniform probability distribution over the specified range.

b.: The particles are assigned initial velocities

v_{n}^{0} \in R^{2}

v_{n, r}^{0} = \frac{β_{r}^{m a x} - β_{r}^{m i n}}{\sqrt{N_{p}}} U_{n, r} (- 1, 1), (r = 1, 2) .

(A17)

The initial velocity is set to be proportional to the search range and reduced by the number of particles, as they each can cover a shorter interval with more of them.

c.: At a given iteration, the solution estimate of particle $n$ corresponds to its current ‘position’ in the search space, termed $β_{n}^{o l d}$ . The quality of each of the $N_{p}$ solution estimates is evaluated by the coefficient of determination $R^{2} (β_{n}^{o l d})$ . The best solution position (with highest $R^{2}$ ) a particle obtains while it moves in the search space is saved and updated if it improves. These $N_{p}$ solution vectors are called $β_{n, o p t}^{p} (n = 1, \dots, N_{p})$ . Similarly, the best solution of all the particles (the swarm) is termed $β_{o p t}^{s}$ . This position updates if the particles find a better solution.
d.: New velocities $v_{n}^{n e w}$ are calculated for each particle $n$ based on the old velocity $v_{n}^{o l d}$ and how far the particle is from its historic best position $β_{n, o p t}^{p}$ and from the swarm’s historic best position $β_{o p t}^{s}$ :

$v_{n}^{new} = ω v_{n}^{old} + U (0, ϕ_{1}) (β_{n, opt}^{p} - β_{n}^{old}) + U (0, ϕ_{2}) (β_{opt}^{s} - β_{n}^{old})$

(A18)

$ϕ_{1}$ and $ϕ_{2}$ are acceleration constants, stating how quickly the particles steer towards the two currently best positions. A sum $ϕ_{1} + ϕ_{2} < 4$ avoids unbounded oscillation [53]. $ω$ is a velocity damping factor. A value $ω < 1$ refines searches at late iterations.

e.: The position of each particle at the next iteration is updated by adding the velocity:

$β_{n}^{new} = β_{n}^{old} + v_{n}^{new}$

(A19)

Any particles exceeding the search space limits $β_{r}^{min}, β_{r}^{max}$ are adjusted to travel no farther than the limit.

f.: Finally, the ‘new’ parameters are set as ‘old’ and a new iteration starts from point c. The procedure stops when a set number $N_{i t}$ of iterations is completed.
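
Steps a to f can be condensed into a short Python sketch. It is a minimal illustration, not the implementation used in the study. The objective `toy_score` is a hypothetical stand-in for $R^{2}(β)$, which in the paper requires training an LSSVM. The acceleration constants `phi1 = phi2 = 1.5` are assumed values that satisfy $ϕ_{1} + ϕ_{2} < 4$, and the random factors are drawn independently per coordinate, which the text does not specify. The swarm size, iteration count, damping factor and search ranges follow the values quoted in this appendix and in the caption of Figure 5.

```python
import random


def pso_maximize(objective, bounds, n_particles=20, n_iter=30,
                 omega=0.8, phi1=1.5, phi2=1.5, seed=0):
    """Maximize objective(beta) over a box, following steps a to f."""
    rng = random.Random(seed)
    # a. Uniform random initial positions inside the search range (A16).
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    # b. Initial velocities scaled by range / sqrt(N_p) (A17).
    vel = [[(hi - lo) / n_particles ** 0.5 * rng.uniform(-1.0, 1.0)
            for lo, hi in bounds] for _ in range(n_particles)]
    # c. Personal bests and swarm best from the initial evaluation.
    p_best = [p[:] for p in pos]
    p_val = [objective(p) for p in pos]
    top = max(range(n_particles), key=p_val.__getitem__)
    s_best, s_val = p_best[top][:], p_val[top]
    for _ in range(n_iter):
        for n in range(n_particles):
            for r, (lo, hi) in enumerate(bounds):
                # d. Velocity update (A18).
                vel[n][r] = (omega * vel[n][r]
                             + rng.uniform(0.0, phi1) * (p_best[n][r] - pos[n][r])
                             + rng.uniform(0.0, phi2) * (s_best[r] - pos[n][r]))
                # e. Position update (A19), clipped to the search range.
                pos[n][r] = min(max(pos[n][r] + vel[n][r], lo), hi)
        # c./f. Re-evaluate and update the stored best positions.
        for n in range(n_particles):
            val = objective(pos[n])
            if val > p_val[n]:
                p_best[n], p_val[n] = pos[n][:], val
            if val > s_val:
                s_best, s_val = pos[n][:], val
    return s_best, s_val


def toy_score(beta):
    """Hypothetical stand-in for R^2(beta); its maximum is 1 at (5.6, 0.3)."""
    return 1.0 - 0.01 * ((beta[0] - 5.6) ** 2 + (beta[1] - 0.3) ** 2)


SEARCH_RANGE = [(-2.0, 8.0), (-3.0, 3.0)]
best, score = pso_maximize(toy_score, SEARCH_RANGE)
print(best, score)
```

Because the swarm best is only replaced when a particle improves on it, the returned score can never be lower than the best score in the initial swarm.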

Appendix C.2. Gravitational Search Algorithm (GSA)

GSA was developed by Rashedi et al. [54] and considers each solution as a particle.

a.: Assign initial positions and velocities according to (A16) and (A17).
b.: The gravitational constant is reduced from an initial value $G_{0}$ at iteration $t = 1$ according to a reduction factor $α$ down to $G_{0} \exp (- α)$ at the last iteration $t = N_{it}$:

$G (t) = G_{0} \exp (- α \frac{t - 1}{N_{it} - 1}) .$

(A20)

Calculate the relative fitness $m_{n}$ for each particle (here using $R^{2}$) at the current state:

$m_{n} = \frac{{(R^{2})}_{n} - \min_{n} (R^{2})}{\max_{n} (R^{2}) - \min_{n} (R^{2})} .$

(A21)

The ‘mass’ $M_{n}$ of each particle is then calculated as:

$M_{n} = \frac{m_{n}}{\sum_{j = 1}^{N_{p}} m_{j}} .$

(A22)

c.: For a given particle $n$ , the force $F_{n j}$ working on it from another particle $j \neq n$ is given by:

$F_{n j} = G \frac{M_{n} M_{j}}{| β_{j} - β_{n} | + ε} (β_{j} - β_{n}) ,$

(A23)

where $| β_{j} - β_{n} |$ is the Euclidean distance between the particle positions and $ε$ is a small constant (to avoid division by zero).

d.: The acceleration of particle $n$ is then its net force divided by its mass, where random weight components are introduced:

$a_{n}^{old} = \frac{1}{M_{n}} \sum_{j = 1, j \neq n}^{N_{p}} U_{n, j} (0, 1) F_{n j} .$

(A24)

e.: The velocities and new positions are calculated as:

$v_{n}^{new} = U (0, 1) v_{n}^{old} + a_{n}^{old} ,$

(A25)

$β_{n}^{new} = β_{n}^{old} + v_{n}^{new} ,$

(A26)

with coordinates limited by $β_{r}^{min}, β_{r}^{max}$. The procedure is repeated between steps b and e.
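
The GSA loop can be sketched in Python as follows. This is an illustration under stated assumptions, not the code used in the study. `toy_score` is a hypothetical stand-in for $R^{2}$. The factor $M_{n}$ in the force cancels against the division by $M_{n}$ in the acceleration, so the sketch uses the cancelled form and avoids dividing by the zero mass of the worst particle. The random factor in the velocity update is drawn per coordinate, and the best point evaluated so far is stored and returned; neither choice is specified in the text. The default values of `g0`, `alpha` and `eps` are the tabulated ones.

```python
import math
import random


def gravity(t, n_iter, g0=2.0, alpha=5.0):
    """Gravitational constant at iteration t = 1..n_iter (A20)."""
    return g0 * math.exp(-alpha * (t - 1) / (n_iter - 1))


def masses(fitness):
    """Normalized masses from fitness values, higher is better (A21)-(A22)."""
    worst, best = min(fitness), max(fitness)
    if best == worst:                      # all equal: share the mass evenly
        return [1.0 / len(fitness)] * len(fitness)
    m = [(f - worst) / (best - worst) for f in fitness]
    total = sum(m)
    return [mi / total for mi in m]


def gsa_maximize(objective, bounds, n_particles=20, n_iter=30,
                 g0=2.0, alpha=5.0, eps=1e-4, seed=0):
    rng = random.Random(seed)
    dim = len(bounds)
    # a. Initial positions and velocities as in (A16) and (A17).
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[(hi - lo) / n_particles ** 0.5 * rng.uniform(-1.0, 1.0)
            for lo, hi in bounds] for _ in range(n_particles)]
    best_pos, best_val = None, -math.inf
    for t in range(1, n_iter + 1):
        fit = [objective(p) for p in pos]
        for p, f in zip(pos, fit):         # remember the best evaluated point
            if f > best_val:
                best_pos, best_val = p[:], f
        # b. Gravitational constant and masses for this iteration.
        g, mass = gravity(t, n_iter, g0, alpha), masses(fit)
        new_pos = []
        for n in range(n_particles):
            acc = [0.0] * dim
            for j in range(n_particles):
                if j == n:
                    continue
                dist = math.sqrt(sum((pj - pn) ** 2
                                     for pj, pn in zip(pos[j], pos[n])))
                # c./d. Randomly weighted F_nj / M_n; the factor M_n cancels.
                w = rng.random() * g * mass[j] / (dist + eps)
                for r in range(dim):
                    acc[r] += w * (pos[j][r] - pos[n][r])
            moved = []
            for r, (lo, hi) in enumerate(bounds):
                # e. Velocity (A25) and position (A26), clipped to the range.
                vel[n][r] = rng.random() * vel[n][r] + acc[r]
                moved.append(min(max(pos[n][r] + vel[n][r], lo), hi))
            new_pos.append(moved)
        pos = new_pos
    return best_pos, best_val


def toy_score(beta):
    """Hypothetical stand-in for R^2(beta); its maximum is 1 at (5.6, 0.3)."""
    return 1.0 - 0.01 * ((beta[0] - 5.6) ** 2 + (beta[1] - 0.3) ** 2)


SEARCH_RANGE = [(-2.0, 8.0), (-3.0, 3.0)]
print(gsa_maximize(toy_score, SEARCH_RANGE))
```

With $G_{0} = 2$ and $α = 5$, the gravitational constant falls from 2 at the first iteration to $2 \exp(-5) \approx 0.013$ at the last, so the attraction between particles weakens as the search proceeds.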

Appendix C.3. Genetic Algorithm (GA)

In GA, each solution $β_{n} \in R^{2}$ is called a chromosome and the individual elements $β_{n, r} (r = 1, 2)$ are called the genes of the chromosome [55].

a.: A first generation of chromosomes is initialized using (A16).
b.: In ‘Selection’, pairs of chromosomes from the previous generation, called parents, are combined to produce a new generation of chromosomes, ‘children’. The parents are selected randomly with probability $P_{n}$ proportional to their relative fitness, here measured by the inverse RMSE:

$P_{n} = \frac{{RMSE}_{n}^{- 1}}{\sum_{j = 1}^{N_{p}} {RMSE}_{j}^{- 1}} .$

(A27)

c.: ‘Crossover’ is then used to define the new generation chromosomes. In child 1 of a parent pair, the first gene is from parent 1 and the second gene from parent 2. For child 2 of that pair, the first gene is from parent 2 and the second from parent 1.

β_{c 1, 1}^{n e w} = β_{p 1, 1}^{o l d}, β_{c 1, 2}^{n e w} = β_{p 2, 2}^{o l d}, β_{c 2, 1}^{n e w} = β_{p 2, 1}^{o l d}, β_{c 2, 2}^{n e w} = β_{p 1, 2}^{o l d} .

(A28)

Generally, in problems with more than two genes, a crossover point must be defined to distinguish which genes are taken from which parent.

d.: ‘Mutation’ is the operation of randomly modifying one or both genes in a child. The probability that a given gene is mutated is the mutation rate $0 \leq μ_{r} \leq 1$ . Thus, for the fraction $μ_{r}$ of new genes we perform the following modification (while the rest $1 - μ_{r}$ are not modified):

β_{n, r}^{n e w} \to β_{n, r}^{n e w} + μ_{f} (β_{r}^{m a x} - β_{r}^{m i n}) U_{n, r} (- 1, 1)

(A29)

The factor $μ_{f}$ is set to a low fraction so the mutation is small compared to the search range of the variables. The coordinates are limited by $β_{r}^{min}, β_{r}^{max}$.

e.: ‘Elitism’ involves keeping some of the best chromosomes from the previous generation unmodified into the new generation.
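
The GA operations for two-gene chromosomes can be sketched in Python as below. This is an illustrative sketch, not the implementation used in the study. `toy_rmse` is a hypothetical positive error function standing in for the LSSVM validation RMSE, and the mutation rate `mu_r = 0.2` is an assumed value. The mutation factor `mu_f = 0.1` and the two elite chromosomes are the tabulated settings. Both parents are drawn with replacement, so a chromosome may occasionally be paired with itself; the text does not say how that case is handled.

```python
import math
import random


def ga_minimize(rmse, bounds, n_pop=20, n_gen=30,
                mu_r=0.2, mu_f=0.1, n_elite=2, seed=0):
    """Minimize rmse(beta) > 0 for two-gene chromosomes, steps a to e."""
    rng = random.Random(seed)
    # a. First generation, uniform in the search range (A16).
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_pop)]
    history = []                           # best RMSE of each generation
    for _ in range(n_gen):
        err = [rmse(c) for c in pop]
        history.append(min(err))
        # e. Elitism: the best chromosomes pass on unmodified.
        ranked = sorted(range(n_pop), key=err.__getitem__)
        children = [pop[i][:] for i in ranked[:n_elite]]
        # b. Selection probability proportional to 1 / RMSE (A27).
        weights = [1.0 / e for e in err]
        while len(children) < n_pop:
            p1, p2 = rng.choices(pop, weights=weights, k=2)
            # c. Crossover (A28): the two children swap the second gene.
            for child in ([p1[0], p2[1]], [p2[0], p1[1]]):
                # d. Mutation (A29) with rate mu_r and size factor mu_f.
                for r, (lo, hi) in enumerate(bounds):
                    if rng.random() < mu_r:
                        step = mu_f * (hi - lo) * rng.uniform(-1.0, 1.0)
                        child[r] = min(max(child[r] + step, lo), hi)
                children.append(child)
        pop = children[:n_pop]
    err = [rmse(c) for c in pop]
    history.append(min(err))
    best = min(range(n_pop), key=err.__getitem__)
    return pop[best], err[best], history


def toy_rmse(beta):
    """Hypothetical error surface with its minimum (0.005) at (5.6, 0.3)."""
    return 0.005 + math.hypot(beta[0] - 5.6, beta[1] - 0.3)


SEARCH_RANGE = [(-2.0, 8.0), (-3.0, 3.0)]
best, best_err, history = ga_minimize(toy_rmse, SEARCH_RANGE)
print(best, best_err)
```

Since the elite chromosomes are copied without crossover or mutation, the best RMSE recorded in `history` cannot increase from one generation to the next.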

Appendix C.4. Grey Wolf Optimization (GWO)

GWO was developed by Mirjalili et al. [56] and considers each solution $β_{n} \in R^{2}$ a ‘wolf’.

a.: Initialize the positions of the $N_{p}$ wolves according to (A16). In this algorithm, we call the positions $X$ instead of $β$ .
b.: At a given iteration the best, second-best and third-best solutions are called the alpha $(α)$ , beta $(β)$ and delta $(δ)$ wolves, respectively. The others are grouped as omega $(ω)$ wolves. The positions are denoted $X_{α}, X_{β}, X_{δ}$ and $X_{ω}$ , or $X$ for all the wolves.
c.: Assume the ‘prey’ is located at a position $X_{p}$ . A distance measure to the prey along coordinate $r$ is given by:

D_{r} = | C_{r} X_{p, r} (t) - X_{r} (t) |, (r = 1 : 2)

(A30)

and the position at the next iteration is given as:

X_{r} (t + 1) = X_{p, r} (t) - A_{r} D_{r}

(A31)

where the coefficients $C_{r}$ and $A_{r}$ are determined as follows:

$A_{r} = (2 U_{r} (0, 1) - 1) a, a = 2 (1 - \frac{t - 1}{N_{it} - 1}), C_{r} = 2 U_{r} (0, 1)$

(A32)

The magnitude of $A_{r}$ makes it possible to move farther from the prey at early iterations (exploration) and closer at later iterations (exploitation). As the sign of $A_{r}$ can be positive or negative, the new position can pass the prey on the given axis.

For each wolf, the position of the prey is estimated by the position of the three top wolves. The position at the next iteration is then the average of the three calculated new positions based on the top three wolves. Mathematically, this is expressed as:

D_{α, r} = | C_{1, r} X_{α, r} - X_{r} |, D_{β, r} = | C_{2, r} X_{β, r} - X_{r} |, D_{δ, r} = | C_{3, r} X_{δ, r} - X_{r} |,

(A33)

X_{1, r} = X_{α, r} - A_{1, r} D_{α, r}, X_{2, r} = X_{β, r} - A_{2, r} D_{β, r}, X_{3, r} = X_{δ, r} - A_{3, r} D_{δ, r},

(A34)

X_{r} (t + 1) = \frac{1}{3} (X_{1, r} + X_{2, r} + X_{3, r}), (r = 1, 2)

(A35)

The best position at a given iteration is described by the position of the alpha wolf. The coordinates are limited by $β_{r}^{min}, β_{r}^{max}$.
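
A compact Python sketch of the update in (A32) to (A35) is given below. It is an illustration only, not the code used in the study. `toy_score` is a hypothetical stand-in for $R^{2}$, every wolf (including the three leaders) is moved at each iteration, and the alpha wolf of the final pack is returned; the latter two points are assumptions where the text is not explicit.

```python
import random


def a_coefficient(t, n_iter):
    """Linearly decreasing factor a: 2 at t = 1, 0 at t = n_iter (A32)."""
    return 2.0 * (1.0 - (t - 1) / (n_iter - 1))


def gwo_maximize(objective, bounds, n_wolves=20, n_iter=30, seed=0):
    rng = random.Random(seed)
    # a. Initial positions, uniform in the search range (A16).
    pack = [[rng.uniform(lo, hi) for lo, hi in bounds]
            for _ in range(n_wolves)]
    for t in range(1, n_iter + 1):
        # b. Alpha, beta and delta wolves: the three best current solutions.
        ranked = sorted(pack, key=objective, reverse=True)
        leaders = [w[:] for w in ranked[:3]]
        a = a_coefficient(t, n_iter)
        new_pack = []
        for wolf in pack:
            moved = []
            for r, (lo, hi) in enumerate(bounds):
                estimate = 0.0
                for lead in leaders:
                    big_a = (2.0 * rng.random() - 1.0) * a   # A_r in (A32)
                    big_c = 2.0 * rng.random()               # C_r in (A32)
                    dist = abs(big_c * lead[r] - wolf[r])    # (A33)
                    estimate += lead[r] - big_a * dist       # (A34)
                # Average of the three estimates (A35), clipped to the range.
                moved.append(min(max(estimate / 3.0, lo), hi))
            new_pack.append(moved)
        pack = new_pack
    alpha = max(pack, key=objective)
    return alpha, objective(alpha)


def toy_score(beta):
    """Hypothetical stand-in for R^2(beta); its maximum is 1 at (5.6, 0.3)."""
    return 1.0 - 0.01 * ((beta[0] - 5.6) ** 2 + (beta[1] - 0.3) ** 2)


SEARCH_RANGE = [(-2.0, 8.0), (-3.0, 3.0)]
print(gwo_maximize(toy_score, SEARCH_RANGE))
```

At the last iteration $a = 0$, so every $A_{r}$ vanishes and each wolf moves to the average position of the three leaders.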

Appendix D. Statistical Measures

Consider a dataset with $n$ points, in which we have a model trying to predict the observed output $y_{i}^{obs}$ but actually producing the modelled value $y_{i}^{mod}$ for point $i$. The goodness-of-fit the model provides for the dataset is quantified by the coefficient of determination $R^{2}$ between forecasted and true output values, also called the Nash–Sutcliffe efficiency [57,58]:

$R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i}^{obs} - y_{i}^{mod})}^{2}}{\sum_{i = 1}^{n} {(y_{i}^{obs} - {\bar{y}}^{obs})}^{2}}, {\bar{y}}^{obs} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}^{obs}$

(A36)

where a value of 1 corresponds to a perfect match and a value of 0 to a model that predicts no better than the mean of the observations. We also use the Root Mean Square Error (RMSE):

$RMSE = {(\frac{1}{n} \sum_{i = 1}^{n} {(y_{i}^{obs} - y_{i}^{mod})}^{2})}^{0.5}$

(A37)
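
As a small worked example, the two metrics can be computed in a few lines of Python. The sketch assumes the conventional Nash–Sutcliffe form of $R^{2}$, in which the denominator is the spread of the observations about their mean. The four data points are made up for illustration and are not taken from the study.

```python
import math


def r_squared(y_obs, y_mod):
    """Coefficient of determination (Nash-Sutcliffe efficiency)."""
    mean_obs = sum(y_obs) / len(y_obs)
    ss_res = sum((o - m) ** 2 for o, m in zip(y_obs, y_mod))
    ss_tot = sum((o - mean_obs) ** 2 for o in y_obs)
    return 1.0 - ss_res / ss_tot


def rmse(y_obs, y_mod):
    """Root mean square error between observed and modelled values."""
    n = len(y_obs)
    return math.sqrt(sum((o - m) ** 2 for o, m in zip(y_obs, y_mod)) / n)


# Made-up recovery factors: every prediction is off by 0.05.
rf_obs = [0.20, 0.40, 0.60, 0.80]
rf_mod = [0.25, 0.35, 0.65, 0.75]
print(r_squared(rf_obs, rf_mod), rmse(rf_obs, rf_mod))
```

Here the residual sum of squares is $4 \times 0.05^{2} = 0.01$ and the total sum of squares is 0.2, so $R^{2} = 0.95$ and RMSE = 0.05.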

Linear correlation between two variables, $x$ and $y$, is evaluated with the Pearson correlation coefficient $r_{x y}^{P}$:

$r_{x y}^{P} = \frac{\sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sqrt{\sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}} \sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}}$

(A38)

A value close to $+ 1$ or $- 1$ indicates strong positive or negative correlation, respectively. Monotonic, possibly nonlinear, correlation is evaluated with the Spearman rank correlation $r_{x y}^{Sp}$. It is obtained by ranking the values of the input variable $x$ and the output variable $y$, where the ranks $R (x_{i}), R (y_{i})$ denote the positions the values would have when sorted from smallest to largest. The rank correlation is then based on the covariance and standard deviations of these rank sets:

$r_{x y}^{Sp} = \frac{cov (R (x), R (y))}{std (R (x)) std (R (y))}$

(A39)

If no two values share the same rank, the above equation can be stated as

$r_{x y}^{Sp} = 1 - \frac{6 \sum_{i = 1}^{n} {(R (x_{i}) - R (y_{i}))}^{2}}{n (n^{2} - 1)}$

(A40)
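
The relation between the Pearson coefficient and the two forms of the Spearman coefficient can be checked with a short Python sketch. The helper `ranks` assumes that all values are distinct (no ties), which is also the condition under which the shortcut formula holds. The data are made up for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient (A38)."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """Rank 1 for the smallest value; assumes all values are distinct."""
    order = sorted(range(len(values)), key=values.__getitem__)
    rank = [0] * len(values)
    for position, index in enumerate(order, start=1):
        rank[index] = position
    return rank


def spearman(x, y):
    """Spearman coefficient as the Pearson coefficient of the ranks (A39)."""
    return pearson(ranks(x), ranks(y))


def spearman_no_ties(x, y):
    """Shortcut form (A40), valid when no ranks are tied."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1.0 - 6.0 * d2 / (n * (n ** 2 - 1))


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y_monotonic = [v ** 2 for v in x]          # nonlinear but monotonic
y_shuffled = [2.0, 1.0, 4.0, 3.0, 5.0]     # two neighbouring pairs swapped
print(pearson(x, y_monotonic), spearman(x, y_monotonic))
print(spearman(x, y_shuffled), spearman_no_ties(x, y_shuffled))
```

For the quadratic data the Spearman coefficient is 1 while the Pearson coefficient is below 1, and for the shuffled data both Spearman forms give 0.8.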

Nonmonotonic correlation can be detected using distance correlation [59]:

$r_{x y}^{D} = \frac{{dCov}^{2} (x, y)}{\sqrt{dVar (x) dVar (y)}}$

(A41)

defined as the squared distance covariance between $x$ and $y$ divided by the geometric mean of the distance variances of $x$ and $y$. We refer to the original work for more details.
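
To illustrate why distance correlation detects nonmonotonic dependence, the sketch below computes the sample distance correlation of Székely et al. [59] from double-centered pairwise distance matrices. It is a plain $O(n^{2})$ illustration on made-up data, not the routine used in the study. The function returns the square root of the ratio of squared distance covariance to the geometric mean of the squared distance variances; whether the tabulated $r_{x y}^{D}$ values include this final square root is an assumption made here.

```python
import math


def centered_distances(v):
    """Double-centered matrix of pairwise absolute differences."""
    n = len(v)
    d = [[abs(a - b) for b in v] for a in v]
    row = [sum(r) / n for r in d]          # row means equal column means
    grand = sum(row) / n
    return [[d[i][j] - row[i] - row[j] + grand for j in range(n)]
            for i in range(n)]


def distance_correlation(x, y):
    """Sample distance correlation between two equally long sequences."""
    n = len(x)
    a, b = centered_distances(x), centered_distances(y)
    pairs = [(i, j) for i in range(n) for j in range(n)]
    dcov2 = sum(a[i][j] * b[i][j] for i, j in pairs) / n ** 2
    dvar2_x = sum(a[i][j] ** 2 for i, j in pairs) / n ** 2
    dvar2_y = sum(b[i][j] ** 2 for i, j in pairs) / n ** 2
    denom = math.sqrt(dvar2_x * dvar2_y)
    if denom == 0.0:                       # a constant input carries no signal
        return 0.0
    return math.sqrt(max(dcov2, 0.0) / denom)


# A symmetric parabola: y depends on x, yet the linear correlation is zero.
x = [-2.0, -1.0, 0.0, 1.0, 2.0]
y = [v ** 2 for v in x]
print(distance_correlation(x, y), distance_correlation(x, x))
```

For this parabola the Pearson coefficient is exactly zero because the positive and negative branches cancel, whereas the distance correlation is about 0.52.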

References

1. Stenmark, H.; Andfossen, P.O. Snorre WAG Pilot—A Case Study. In Proceedings of the IOR 1995—8th European Symposium on Improved Oil Recovery, Vienna, Austria, 15–17 May 1995.
2. Christensen, J.R.; Stenby, E.H.; Skauge, A. Review of WAG Field Experience. SPE Reserv. Eval. Eng. 2001, 4, 97–106.
3. Sadik-Zada, E.R.; Loewenstein, W. A Note on Revenue Distribution Patterns and Rent-Seeking Incentive. Int. J. Energy Econ. Policy 2018, 8, 196–204.
4. Afzali, S.; Rezaei, N.; Zendehboudi, S. A comprehensive review on Enhanced Oil Recovery by Water Alternating Gas (WAG) injection. Fuel 2018, 227, 218–246.
5. Sanchez, N.L. Management of water alternating gas (WAG) injection projects. In Proceedings of the Latin American and Caribbean Petroleum Engineering Conference, Caracas, Venezuela, 21–23 April 1999.
6. Sohrabi, M.T.D.H.; Tehrani, D.H.; Danesh, A.; Henderson, G.D. Visualization of oil recovery by water-alternating-gas injection using high pressure micromodels. SPE J. 2004, 9, 290–301.
7. Chen, B.; Reynolds, A.C. Ensemble-based optimization of the water-alternating-gas-injection process. SPE J. 2016, 21, 786–798.
8. Green, D.W.; Willhite, G.P. Enhanced Oil Recovery, 2nd ed.; Henry, L., Ed.; Society of Petroleum Engineers: Richardson, TX, USA, 2018.
9. Kulkarni, M.M.; Rao, D.N. Experimental investigation of miscible and immiscible Water Alternating Gas (WAG) process performance. J. Pet. Sci. Eng. 2005, 48, 1–20.
10. Andersen, P.Ø. A simplified modelling approach for petroleum recovery by spontaneous imbibition in naturally fractured reservoirs. J. Nat. Gas Sci. Eng. 2019, 63, 95–114.
11. Andersen, P.Ø. Early- and Late-Time Analytical Solutions for Cocurrent Spontaneous Imbibition and Generalized Scaling. SPE J. 2021, 26, 220–240.
12. Land, C.S. Calculation of Imbibition Relative Permeability for Two- and Three-Phase Flow From Rock Properties. Soc. Pet. Eng. J. 1968, 8, 149–156.
13. Stone, H. Estimation of Three-Phase Relative Permeability And Residual Oil Data. J. Can. Pet. Technol. 1973, 12.
14. Baker, L.E. Three-phase relative permeability correlations. In Proceedings of the SPE Enhanced Oil Recovery Symposium, Tulsa, OK, USA, 16–21 April 1988.
15. Carlson, F.M. Simulation of relative permeability hysteresis to the nonwetting phase. In Proceedings of the SPE Annual Technical Conference and Exhibition, San Antonio, TX, USA, 4–7 October 1981.
16. Larsen, J.A.; Skauge, A. Methodology for numerical simulation with cycle dependent relative permeabilities. SPE J. 1998, 3, 163–173.
17. Spiteri, E.J.; Juanes, R. Impact of relative permeability hysteresis on the numerical simulation of WAG injection. J. Pet. Sci. Eng. 2006, 50, 115–139.
18. Mahzari, P.; Sohrabi, M. An improved approach for estimation of flow and hysteresis parameters applicable to WAG experiments. Fuel 2017, 197, 359–372.
19. Bourgeois, M.; Joubert, T.; Dominguez, V. Analysis of 3-phase Behavior in WAG Injections for Various Wettabilities. In Proceedings of the IOR 2019—20th European Symposium on Improved Oil Recovery, Pau, France, 8–11 April 2019; pp. 1–16.
20. Cheng, G.; Guo, R.; Wu, W. Petroleum Lithology Discrimination Based on PSO-LSSVM Classification Model. In Proceedings of the 2010 Second International Conference on Computer Modeling and Simulation, Sanya, China, 22–24 January 2010; Volume 4, pp. 365–368.
21. Alvarado, V.; Ranson, A.; Hernandez, K.; Manrique, E.; Matheus, J.; Liscano, T.; Prosperi, N. Selection of EOR/IOR opportunities based on machine learning. In Proceedings of the European Petroleum Conference, Aberdeen, UK, 29 October 2002.
22. Tahmasebi, P.; Javadpour, F.; Sahimi, M. Data mining and machine learning for identifying sweet spots in shale reservoirs. Expert Syst. Appl. 2017, 88, 435–447.
23. Chamkalani, A.; Zendehboudi, S.; Bahadori, A.; Kharrat, R.; Chamkalani, R.; James, L.; Chatzis, I. Integration of LSSVM technique with PSO to determine asphaltene deposition. J. Pet. Sci. Eng. 2014, 124, 243–253.
24. Amar, M.N.; Ghriga, M.A.; Ouaer, H.; Seghier, M.E.A.B.; Pham, B.T.; Andersen, P.Ø. Modeling viscosity of CO₂ at high temperature and pressure conditions. J. Nat. Gas Sci. Eng. 2020, 77, 103271.
25. Vapnik, V. The Nature of Statistical Learning Theory; Springer Science & Business Media: New York, NY, USA, 1999.
26. Suykens, J.A.K.; Vandewalle, J. Least Squares Support Vector Machine Classifiers. Neural Process. Lett. 1999, 9, 293–300.
27. Suykens, J.A.; Van Gestel, T.; De Brabanter, J. Least Squares Support Vector Machines; World Scientific: Singapore, 2002.
28. Alizadeh, S.M.; Alruyemi, I.; Daneshfar, R.; Mohammadi-Khanaposhtani, M.; Naseri, M. An insight into the estimation of drilling fluid density at HPHT condition using PSO-, ICA-, and GA-LSSVM strategies. Sci. Rep. 2021, 11, 1–14.
29. Bian, X.-Q.; Song, Y.-L.; Mwamukonda, M.K.; Fu, Y. Prediction of the sulfur solubility in pure H2S and sour gas by intelligent models. J. Mol. Liq. 2020, 299, 112242.
30. Mokarizadeh, H.; Atashrouz, S.; Mirshekar, H.; Hemmati-Sarapardeh, A.; Pour, A.M. Comparison of LSSVM model results with artificial neural network model for determination of the solubility of SO₂ in ionic liquids. J. Mol. Liq. 2020, 304, 112771.
31. Ouaer, H.; Hosseini, A.H.; Amar, M.N.; Seghier, M.E.A.B.; Ghriga, M.A.; Nabipour, N.; Andersen, P.Ø.; Mosavi, A.; Shamshirband, S. Rigorous connectionist models to predict carbon dioxide solubility in various ionic liquids. Appl. Sci. 2020, 10, 304.
32. Zeng, B.; Guo, J.; Zhang, F.; Zhu, W.; Xiao, Z.; Huang, S.; Fan, P. Prediction model for dissolved gas concentration in transformer oil based on modified grey wolf optimizer and LSSVM with grey relational analysis and empirical mode decomposition. Energies 2020, 13, 422.
33. Guo, Y.; Xu, Y.-P.; Sun, M.; Xie, J. Multi-step-ahead forecast of reservoir water availability with improved quantum-based GWO coupled with the AI-based LSSVM model. J. Hydrol. 2021, 597, 125769.
34. Zhang, L.; Ge, R.; Chai, J. Prediction of China’s energy consumption based on robust principal component analysis and PSO-LSSVM optimized by the Tabu search algorithm. Energies 2019, 12, 196.
35. Song, Y.; Xie, X.; Wang, Y.; Yang, S.; Ma, W.; Wang, P. Energy consumption prediction method based on LSSVM-PSO model for autonomous underwater gliders. Ocean Eng. 2021, 230, 108982.
36. Bemani, A.; Baghban, A.; Mohammadi, A.H.; Andersen, P.Ø. Estimation of adsorption capacity of CO₂, CH₄, and their binary mixtures in Quidam shale using LSSVM: Application in CO₂ enhanced shale gas recovery and CO₂ storage. J. Nat. Gas Sci. Eng. 2020, 76, 103204.
37. Yuan, X.; Chen, C.; Yuan, Y.; Huang, Y.; Tan, Q. Short-term wind power prediction based on LSSVM–GSA model. Energy Convers. Manag. 2015, 101, 393–401.
38. Lu, P.; Ye, L.; Sun, B.; Zhang, C.; Zhao, Y.; Teng, J. A new hybrid prediction method of ultra-short-term wind power forecasting based on EEMD-PE and LSSVM optimized by the GSA. Energies 2018, 11, 697.
39. Li, K.; Liang, C.; Lu, W.; Li, C.; Zhao, S.; Wang, B. Forecasting of Short-Term Daily Tourist Flow Based on Seasonal Clustering Method and PSO-LSSVM. ISPRS Int. J. Geo-Inf. 2020, 9, 676.
40. Esene, C.; Zendehboudi, S.; Shiri, H.; Aborig, A. Deterministic tools to predict recovery performance of carbonated water injection. J. Mol. Liq. 2020, 301, 111911.
41. Afzali, S.; Zendehboudi, S.; Mohammadzadeh, O.; Rezaei, N. Hybrid mathematical modelling of three-phase flow in porous media: Application to water alternating gas injection. J. Nat. Gas Sci. Eng. 2021, 94, 103966.
42. Menad, N.A.; Noureddine, Z. An efficient methodology for multi objective optimization of water alternating CO₂ EOR process. J. Taiwan Inst. Chem. Eng. 2019, 99, 154–165.
43. Amar, M.N.; Zeraibi, N.; Jahanbani Ghahfarokhi, A. Applying hybrid support vector regression and genetic algorithm to water alternating CO₂ gas EOR. Greenh. Gases Sci. Technol. 2020, 10, 613–630.
44. Nwachukwu, A.; Jeong, H.; Sun, A.; Pyrcz, M.; Lake, L.W. Machine learning-based optimization of well locations and WAG parameters under geologic uncertainty. In Proceedings of the SPE Improved Oil Recovery Conference, Tulsa, OK, USA, 14–18 April 2018.
45. You, J.; Ampomah, W.; Sun, Q.; Kutsienyo, E.J.; Balch, R.S.; Dai, Z.; Zhang, X. Machine learning based co-optimization of carbon dioxide sequestration and oil recovery in CO₂-EOR project. J. Clean. Prod. 2020, 260, 120866.
46. You, J.; Ampomah, W.; Sun, Q. Co-optimizing water-alternating-carbon dioxide injection projects using a machine learning assisted computational framework. Appl. Energy 2020, 279, 115695.
47. Nygård, J.I.; Andersen, P.Ø. Simulation of Immiscible Water-Alternating-Gas Injection in a Stratified Reservoir: Performance Characterization Using a New Dimensionless Number. SPE J. 2020, 25, 1711–1728.
48. Ripley, B.D. Pattern Recognition and Neural Networks; Cambridge University Press: Cambridge, UK, 2007.
49. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Pearson Education: Upper Saddle River, NJ, USA, 2010.
50. Fuks, O.; Tchelepi, H.A. Limitations Of Physics Informed Machine Learning For Nonlinear Two-Phase Transport In Porous Media. J. Mach. Learn. Model. Comput. 2020, 1, 19–37.
51. Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440.
52. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the IC-NN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948.
53. Poli, R.; Kennedy, J.; Blackwell, T. Particle swarm optimization. Swarm Intell. 2007, 1, 33–57.
54. Rashedi, E.; Nezamabadi-Pour, H.; Saryazdi, S. GSA: A gravitational search algorithm. Inf. Sci. 2009, 179, 2232–2248.
55. Mirjalili, S. Genetic algorithm. In Evolutionary Algorithms and Neural Networks; Springer: Cham, Switzerland, 2019; pp. 43–45.
56. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61.
57. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models part I—A discussion of principles. J. Hydrol. 1970, 10, 282–290.
58. Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900.
59. Székely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and testing dependence by correlation of distances. Ann. Stat. 2007, 35, 2769–2794.

Figure 2. Workflow demonstrating the development, assessment and application of the models.

Figure 3. Datapoints plotted against corresponding values of $x_{0}$ for MOD1, defined using a third-order polynomial (blue line) of $x_{0}$.

Figure 4. Comparison of estimated $RF$ with MOD1 and actual datapoints (a) and a histogram of the residuals (b).

Figure 5. Illustration of optimizer performance in terms of the best solution’s R² (a), RMSE (b) and search parameter values $\log (γ)$ (c) and $\log (σ)$ (d), at a given iteration. In total, 20 solutions were initiated and run for 30 iterations in each case. Two identical initializations (marked 1 and 2) were run for each algorithm.

Figure 6. Comparison of estimated $RF$ and actual datapoints on (a) the training set, (b) validation set and (c) test set. Estimated points are based on MOD2 (optimized LSSVM). The orange line represents perfect match.

Figure 7. Histogram of residual errors (estimated $RF$ minus actual $RF$) for the total dataset based on MOD2 (optimized LSSVM model).

Figure 8. Histogram of partial derivatives for the total dataset based on MOD2 (the optimized LSSVM model). Each partial derivative is evaluated numerically with a small or large difference $Δ x$.

Figure 9. Contour plots of recovery factor RF plotted against $x_{1} = r_{w}$ and $x_{5} = \log (M_{go}^{*})$. The latter represents variation in oil viscosity, which affects all of $x_{5}, x_{6}, x_{7}, x_{8}$. The four cases are for low heterogeneity and hysteresis (a), high heterogeneity and low hysteresis (b), low heterogeneity and high hysteresis (c) and high heterogeneity and hysteresis (d). See all input values in Table 5.

Figure 10. Contour plots of recovery factor RF plotted against $x_{2} = \log (F_{H})$ and $x_{5} = \log (M_{go}^{*})$. The latter represents variation in oil viscosity, which affects all of $x_{5}, x_{6}, x_{7}, x_{8}$. The cases are for WAG injection with $r_{w} = 0.5$ and either low (a) or high (b) hysteresis. See all input values in Table 5.

Figure 11. Contour plot of recovery factor RF plotted against $x_{1} = r_{w}$ and log gravity number with equal values of $x_{7}$ and $x_{8}$. Low-heterogeneity (a) and high-heterogeneity (b) cases are shown (see all input values in Table 6).

Figure 12. Contour plot of recovery factor RF plotted against $x_{1} = r_{w}$ and log gravity number with equal values of $x_{7}$ and $x_{8}$. Favorable (a) and unfavorable (b) mobility ratio cases are shown (see all input values in Table 6).

Figure 13. Contour plot of recovery factor RF plotted against $x_{1} = r_{w}$ and log gravity number with equal values of $x_{7}$ and $x_{8}$. Low- (a) and high- (b) hysteresis cases are presented (see all input values in Table 6).

Figure 14. Contour plots of recovery factor RF as a function of varying water fraction (horizontal axis) and the indicated parameter ($x_{3}$ in (a), $x_{4}$ in (b), $x_{5}$ in (c) and $x_{6}$ in (d)) on the vertical axis while holding other parameters fixed. $r_{w} = 0$ indicates gas injection and $r_{w} = 1$ water injection.

Figure 15. Illustration of the 3D model, where permeability and well placements are indicated.

Figure 16. RF after 1.5 PV calculated for different injected WAG fractions (r_w), low or high oil viscosity and low or high degree of hysteresis, calculated based on a 3D Eclipse model (a), MOD1 (b) or MOD2 (c).

Table 1. Number and type of points in different datasets and models.

MOD1	Single Phase Cases	WAG Cases	Total
	96	2472	2568
MOD2	Single phase cases	WAG cases	Total
Training (70%)	68 × 16 = 1088	1730	2818
Validation (15%)	14 × 16 = 224	371	595
Testing (15%)	14 × 16 = 224	371	595
Total	96 × 16 = 1536	2472	4008

Table 2. Range of values for the total datasets used in MOD1 and MOD2 (using $X = 0.5$).

MOD1	Train			Val			Test			Tot
	Min	Mean	Max	Min	Mean	Max	Min	Mean	Max	Min	Mean	Max
$M^{*}$	−0.2	1.4	3.4	−0.1	1.5	3.2	−0.1	1.4	3.2	−0.2	1.4	3.4
$y$	0.14	0.49	0.88	0.20	0.49	0.84	0.19	0.50	0.85	0.14	0.49	0.88
MOD2	Train			Val			Test			Tot
	Min	Mean	Max	Min	Mean	Max	Min	Mean	Max	Min	Mean	Max
$x_{1}$	0	0.5	1	0	0.5	1	0	0.4	1	0	0.5	1
$x_{2}$	0	0.5	1.1	0	0.5	1.1	0	0.4	1.1	0	0.5	1.1
$x_{3}$	0	1.2	2.5	0	1.2	2.5	0	1.1	2.5	0	1.2	2.5
$x_{4}$	0	1.3	3	0	1.3	3	0	1.4	3	0	1.3	3
$x_{5}$	0.1	1.4	2.4	0.1	1.4	2.4	0.1	1.4	2.4	0.1	1.4	2.4
$x_{6}$	0.0	1.4	2.3	0.0	1.4	2.3	0.0	1.3	2.3	0.0	1.4	2.3
$x_{7}$	−4.6	−2.6	−0.9	−4.6	−2.7	−0.9	−4.6	−2.7	−0.9	−4.6	−2.6	−0.9
$x_{8}$	−7.9	−3.0	−0.8	−7.9	−3.1	−0.8	−7.9	−3.0	−0.8	−7.9	−3.0	−0.8
$y$	0.14	0.45	0.88	0.20	0.44	0.84	0.19	0.48	0.85	0.14	0.45	0.88

Table 3. Pearson, Spearman and Distance correlation coefficients $r_{x y}$ evaluated for the total dataset between $RF$ and the involved input parameters for MOD1 and MOD2.

	MOD1	$r_{x y}^{P}$	$r_{x y}^{S p}$	$r_{x y}^{D}$
	$x_{0}$	$- 0.94$	$- 0.95$	$0.93$
	MOD2	$x_{1}$	$x_{2}$	$x_{3}$	$x_{4}$	$x_{5}$	$x_{6}$	$x_{7}$	$x_{8}$
	WAG cases	$- 0.053$	$- 0.34$	$0.16$	$- 0.15$	$- 0.62$	$- 0.49$	$0.078$	$0.0045$
$r_{x y}^{P}$	$X = 0.25$	$- 0.055$	$- 0.27$	$0.11$	$- 0.10$	$- 0.58$	$- 0.49$	$0.059$	$0.0087$
	$X = 0.5$	$- 0.055$	$- 0.27$	$0.10$	$- 0.099$	$- 0.57$	$- 0.48$	$0.057$	$0.0085$
	$X = 1$	$- 0.055$	$- 0.27$	$0.087$	$- 0.083$	$- 0.53$	$- 0.45$	$0.053$	$0.0079$
$r_{x y}^{S p}$	$X = 0.5$	$- 0.045$	$- 0.25$	$0.095$	$- 0.095$	$- 0.53$	$- 0.47$	$0.056$	$0.017$
$r_{x y}^{D}$	$X = 0.5$	$0.19$	$0.25$	$0.15$	$0.15$	$0.54$	$0.47$	$0.092$	$0.077$

Table 4. Optimized LSSVM metaparameters using different optimizers and different initializations (marked 1 and 2) and corresponding performance metrics on the training, validation and testing datasets. The parameters used in the final model, MOD2, are indicated.

				RMSE			R²
	Seed	$\log (γ)$	$\log (σ)$	Train	Val	Test	Train	Val	Test
LSSVM (preset)		$0$	$0$	$0.0220$	$0.0202$	$0.0279$	$0.9821$	$0.9817$	$0.9691$
PSO-LSSVM	$1$	$5.6106$	$0.32535$	$0.0056$	$0.0088$	$0.0142$	$0.9988$	$0.9965$	$0.9920$
PSO-LSSVM	$2$	$5.4335$	$0.30305$	$0.0055$	$0.0088$	$0.0142$	$0.9989$	$0.9965$	$0.9920$
GSA-LSSVM	$1$	$6.6812$	$0.42982$	$0.0058$	$0.0089$	$0.0150$	$0.9988$	$0.9964$	$0.9911$
GSA-LSSVM	$2$	$7.0506$	$0.49871$	$0.0064$	$0.0090$	$0.0156$	$0.9985$	$0.9963$	$0.9904$
GWO-LSSVM	$1$	$5.6564$	$0.32883$	$0.0056$	$0.0088$	$0.0142$	$0.9988$	$0.9965$	$0.9919$
GWO-LSSVM	$2$	$5.6698$	$0.32230$	$0.0055$	$0.0088$	$0.0144$	$0.9989$	$0.9965$	$0.9918$
GA-LSSVM	$1$	$7.3404$	$0.49280$	$0.0059$	$0.0090$	$0.0155$	$0.9987$	$0.9963$	$0.9904$
GA-LSSVM	$2$	$4.9708$	$0.25298$	$0.0054$	$0.0089$	$0.0140$	$0.9989$	$0.9964$	$0.9922$
Range (opt.)		$~ 2.4$	$~ 0.25$	$0.0010$	$0.0002$	$0.0016$	$0.0004$	$0.0002$	$0.0018$
Final (MOD2)		$5.6$	$0.32$	$0.0056$	$0.0088$	$0.0143$	$0.9988$	$0.9965$	$0.9919$

Table 5. Parameter selections for MOD2, where oil viscosity is varied and influences mobility ratios and gravity numbers. Four cases are considered according to heterogeneity and hysteresis.

	Low Het, Low Hyst	High Het, Low Hyst	Low Het, High Hyst	High Het, High Hyst
x₁	0:1
x₂	0.25	1	0.25	1
x₃	0	0	2.5	2.5
x₄	3	3	0.5	0.5
x₅	0.2:2.2
x₆	0.2:2.2
x₇	−3:−1
x₈	−3:−1

Table 6. Parameter selections for MOD2 with cases demonstrating influence of gravity numbers according to heterogeneity, mobility ratio and hysteresis.

	Lo Het	Hi Het	Fav	Unfav	Lo Hyst	Hi Hyst
x₁	0:1
x₂	0	1	0.8		0.3
x₃	1		0		0	2.5
x₄	3		3		3	0
x₅	2		0.5	2	1.5
x₆	2		0.5	2	1.5
x₇	−4:−1.5
x₈	−4:−1.5

Table 7. Parameter selections for MOD2 cases to check response in going from multiphase to single-phase scenarios.

	Vary $x_{3} = α$	Vary $x_{4} = \log C$	Vary $x_{5} = \log M_{g / o}^{*}$	Vary $x_{6} = \log M_{w / o}^{*}$
$x_{1}$	0:1
$x_{2}$	0.8
$x_{3}$	0:2.5	1	1	1
$x_{4}$	1	0:3	1	1
$x_{5}$	1.5	1.5	0.1:2.4	1.5
$x_{6}$	1.5	1.5	1.5	0.0:2.3
$x_{7}$	−2
$x_{8}$	−2

Table 8. Input and calculated parameters for the 3D model. ‘Low/hi’ indicates degree of hysteresis in parameters x₃, x₄, while for parameters x₅ to x₈, values are calculated from two oil viscosities.

Layer	$K_{x}$ [mD]	$\phi$ [-]	$h$ [m]	$x_{1}$	0, 0.33, 0.5, 0.67, 1
1	2170	0.324	42	$x_{2}$	0.73
2	65.9	0.297	29	$x_{3}$ (low, hi)	0, 2.5
3	589	0.323	33	$x_{4}$ (low, hi)	3, 0
				$x_{5}$ (30, 110 cP)	1.83, 2.40
$\Delta \rho_{wo}$	250 kg/m³	$V_{p}$	$3.70 \times 10^{7}$ m³	$x_{6}$ (30, 110 cP)	1.50, 2.06
$\Delta \rho_{go}$	450 kg/m³	$T$	20 years	$x_{7}$ (30, 110 cP)	−1.28, −1.84
$L$	1500 m	$s_{oi}$	0.84	$x_{8}$ (30, 110 cP)	−0.96, −1.52
$W$	750 m	$Q$	7600 m³/d
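
The paired values of $x_{5}$ to $x_{8}$ in Table 8 can be cross-checked with a short calculation. The sketch below assumes that the mobility ratios scale linearly with oil viscosity and the gravity numbers scale inversely with it, so that going from 30 to 110 cP shifts the logarithmic inputs by $\pm \log_{10}(110/30)$. That scaling is an assumption made for this check, and the variable names are hypothetical; the tabulated values themselves are taken from Table 8.

```python
import math

# Assumed shift in the log10 inputs when oil viscosity goes from 30 to 110 cP.
shift = math.log10(110.0 / 30.0)

# (value at 30 cP, value at 110 cP, sign of the assumed shift), from Table 8.
table8 = {
    "x5": (1.83, 2.40, +1),
    "x6": (1.50, 2.06, +1),
    "x7": (-1.28, -1.84, -1),
    "x8": (-0.96, -1.52, -1),
}

# Difference between the shifted 30 cP value and the tabulated 110 cP value.
residuals = {
    name: (low + sign * shift) - high
    for name, (low, high, sign) in table8.items()
}

for name, res in residuals.items():
    print(f"{name}: residual = {res:+.4f}")
```

Under this assumption, all four residuals stay below 0.01 in magnitude, which is within the two-decimal rounding of the tabulated values.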

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
