Next Article in Journal
Alternative Fuel Generation from Dangerous Solid Waste in a Protected Environmental Area
Next Article in Special Issue
Characteristic Forced and Spontaneous Imbibition Behavior in Strongly Water-Wet Sandstones Based on Experiments and Simulation
Previous Article in Journal
Effects of Cofiring Coal and Biomass Fuel on the Pulverized Coal Injection Combustion Zone in Blast Furnaces
Previous Article in Special Issue
Petroleum Reservoir Control Optimization with the Use of the Auto-Adaptive Decision Trees
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of Oil Recovery Factor in Stratified Reservoirs after Immiscible Water-Alternating Gas Injection Based on PSO-, GSA-, GWO-, and GA-LSSVM

by
Pål Østebø Andersen
1,*,
Jan Inge Nygård
2 and
Aizhan Kengessova
1,3
1
Department of Energy Resources, Faculty of Science and Technology, University of Stavanger, 4021 Stavanger, Norway
2
Bouvet, 4020 Stavanger, Norway
3
Timal Consulting Group LLP, Atyrau 060011, Kazakhstan
*
Author to whom correspondence should be addressed.
Energies 2022, 15(2), 656; https://doi.org/10.3390/en15020656
Submission received: 5 October 2021 / Revised: 22 December 2021 / Accepted: 11 January 2022 / Published: 17 January 2022
(This article belongs to the Special Issue Management of High Water Cut and Mature Petroleum Reservoirs)

Abstract

:
In this study, we solve the challenge of predicting oil recovery factor ( RF ) in layered heterogeneous reservoirs after 1.5 pore volumes of water-, gas- or water-alternating-gas (WAG) injection. A dataset of ~2500 reservoir simulations is analyzed based on a Black Oil 2D Model with different combinations of reservoir heterogeneity, WAG hysteresis, gravity influence, mobility ratios and WAG ratios. In the first model MOD1, RF is correlated with one input (an effective WAG mobility ratio M * ). Good correlation (Pearson coefficient −0.94), but with scatter, motivated a second model MOD2 using eight input parameters: water–oil and gas–oil mobility ratios, water–oil and gas–oil gravity numbers, a reservoir heterogeneity factor, two hysteresis parameters and water fraction. The two mobility ratios exhibited the strongest correlation with RF (Pearson coefficient −0.57 for gas-oil and −0.48 for water-oil). LSSVM was applied in MOD2 and trained using different optimizers: PSO, GA, GWO and GSA. A physics-based adaptation of the dataset was proposed to properly handle the single-phase injection. A total of 70% of the data was used for training, 15% for validation and 15% for testing. GWO and PSO optimized the model equally well (R2 = 0.9965 on the validation set), slightly better than GA and GSA (R2 = 0.9963). The performance metrics for MOD1 in the total dataset were: RMSE = 0.050 and R2 = 0.889; MOD2: RMSE = 0.0080 and R2 = 0.998. WAG outperformed single-phase injection, in some cases with 0.3 units higher RF. The benefits of WAG increased with stronger hysteresis. The LSSVM model could be trained to be less dependent on hysteresis and the non-injected phase during single-phase injection.

1. Introduction

Oil recovery through water or gas injection often lacks efficiency due to the unfavorable mobility ratio between the oil and the displacing phase. Viscous fingering, gravity segregation and heterogeneity can also lead to poor sweep. Gas features low viscosity and density and can achieve channeling and early breakthrough [1,2]. Water-alternating gas injection (WAG) is an enhanced oil recovery (EOR) technique in which water and gas are injected in cycles to displace the oil. This type of technique mitigates the exponential rate decline seen in most fields after peak production [3]. The mobility of each injected fluid is reduced by the presence of the other, producing a more favorable mobility ratio to oil. Gravity segregation becomes less detrimental, since gas sweeps the top of the reservoir, while water sweeps the bottom [2,4]. Gas usually results in lower residual oil saturation than water, but this can be further lowered by WAG. Thus, WAG utilizes the advantages of water and gas injection and minimizes their individual downsides. In several field implementations, it has been beneficial to use WAG; the oil recovery factor in 59 fields increased when WAG was introduced, by 5% to 10 % of the oil originally in place [2]. Sanchez [5] reported that 80% of US WAG projects were beneficial. Micromodel studies also demonstrate better oil recovery with WAG than with single-phase displacement [6].
WAG introduces more design and operational parameters compared to water or gas injection, such as the WAG ratio (volume water to volume gas injected), number of cycles, cycle volume, injection rates and pressures. This may further affect optimal well placements. A 1:1 WAG ratio is considered common or even optimal [4]. Variation in WAG ratio with project time (tapering) has been conducted, partly due to limited gas access, for limiting gas production or for optimization. [7] used ensemble-based optimization of injector and producer well controls at each WAG cycle to maximize the net present value for a channeled reservoir model. Whether the reservoir pressure is above the system’s minimum miscibility pressure (MMP) determines whether the gas is miscible or immiscible with the oil [8]. During miscible displacement, the oil and gas become practically the same phase and residual oil saturation can approach zero. Kulkarni [9] found that miscible gas (CO2) core flooding (continuous or WAG) outperformed immiscible injection. The choice or modification of the injected fluids can also improve the outcome. Foam, surfactant, polymer, low salinity brine and CO2 are some alternatives [4].
Reservoir geology or heterogeneity is important during any field development. During gas injection, heterogeneity in terms of thief zones, stratification or fractures can cause gas channeling and early breakthrough, which can be mitigated by WAG [4]. Favorable well placement relative to the dip angle can provide more stable frontal displacement of oil. In heterogeneous reservoirs, gravity and capillary forces can divert flow from highly permeable layers to less permeable layers. These effects are more important in naturally fractured reservoirs, where advective forces are unable to mobilize oil [10,11].
The simultaneous flow of oil, water and gas requires the detailed measurement, quantification and correlation of three-phase relative permeabilities [12,13,14]. Injecting water and gas alternately causes gas and water saturations to rise and fall, resulting in hysteresis [15,16,17,18,19]. During WAG, the relative permeability of gas is more affected by hysteresis than oil and water [4] and hysteresis tends to decrease gas mobility. This reduction delays gas breakthrough and reduces gravity segregation. The Land [12] and Carlson [15] models are widely used to model relative permeability hysteresis.
Machine learning (ML) has gained increased popularity in the petroleum industry in recent years. ML algorithms can be useful for understanding trends in complex datasets and provide multivariate nonlinear regression or classification. Their applications include lithology classification [20], selecting EOR methods [21], locating optimal drilling spots [22], correlating asphaltene precipitation [23] or predicting CO2 viscosity [24]. Important steps in developing ML models include selecting appropriate input and output variables, acquiring sufficient quality data, applying a suitable ML algorithm and tuning its metaparameters to prevent over- or under-fitting, usually via optimization algorithms.
This study makes use of the least squares support vector machine (LSSVM) algorithm, based on the works [25,26,27], for nonlinear regression. This algorithm has been applied in many contexts, such as predicting drilling fluid density [28], gas solubility [29,30,31,32], water availability [33], energy consumption [34,35], shale gas adsorption [36], wind power [37,38] and even tourism flow [39]. LSSVM has been successfully combined with optimizers such as particle swarm optimization (PSO), genetic algorithm (GA) and grey wolf Optimization (GWO) and has, in many cases, outperformed regression algorithms such as artificial neural networks, radial basis function, gene expression programming and adaptive neuro-fuzzy interference system [29,30,40,41].
In recent studies, [42,43] simulated CO2 WAG injection in a reservoir model and developed machine learning proxy models with different algorithms to predict current rates of oil, gas and water based on current time, gas and water injection rates, half cycle time and operational constraints. The recovery factor and cumulative production were calculated from the produced output. The calibrated proxy models were used to optimize the WAG process. [41] used LSSVM and other ML approaches to predict two-phase relative permeabilities and combined them via correlations in previous research to estimate three-phase relative permeabilities and the performance of a WAG core flood. [40] correlated the oil recovery performance of EOR carbonated water injection using LSSVM. [44] used ML to optimize well placement during WAG injection. [45,46] used ML to co-optimize CO2 injection for oil recovery and storage during WAG under different operational constraints.
In this study, our main contribution is to predict the reservoir oil recovery factor (RF) in layered reservoirs during immiscible WAG and single-phase (gas or water) injection for different fluid, reservoir, geometrical and operational conditions. This is a relatively complex task given the number of parameters involved and their coupled nature. Based on a comprehensive simulation database generated in both [47] and this work, we present two predictive approaches. In the first (MOD1), a dimensionless number M * , derived from Nygård and Andersen’s study [47], is applied as a single input parameter. The second approach (MOD2) applies eight physics-motivated dimensionless input parameters to improve predictive power compared to the first method: two mobility ratios, two gravity numbers, injected water fraction, reservoir heterogeneity factor and two hysteresis parameters. In both models, the input variables incorporate all the system information. The latter approach, MOD2, utilizes the ML regression algorithm, LSSVM, with metaparameters optimized by either PSO (Particle Swarm Optimization), GSA (Gravity Search Algorithm), GA (Genetic Algorithm) or GWO (Grey Wolf Optimization). LSSVM has been optimized successfully in other works using PSO [20,23,36], GSA [37,38], GA [28] and GWO [29,32,33]. We propose a methodology to ensure the physical behavior of the machine learning model MOD2. We then adapt the dataset to be independent of hysteresis and the non-injected phase when single-phase injection is performed. Some of the research questions we investigate are:
-
How well do the models predict WAG performance?
-
Which parameters affect RF the most?
-
Do the parameters have a positive or negative effect on RF?
-
Will the models properly account for WAG injection and single phase injection?
The paper is structured as follows. The model serving as the basis for the simulation results is outlined in Section 2.1. The dimensionless number M * is outlined in Section 2.2. This number is used in the single-input parameter model. The machine learning approach and dataset follow in Section 2.3. The eight input parameters of the second approach are also presented in those latter sections. The results from analyzing the data are shown in Section 3 and the paper is concluded in Section 4.

2. Theory

2.1. Mathematical Model

We consider the same modeling approach for immiscible WAG injection as [47]: A 2D reservoir layered in a horizontal direction, with one injector and one producer, both vertical and perforated along the full reservoir height. See Figure 1 for an illustration. A black oil model is assumed with an incompressible and immiscible three-phase flow of oil, water and gas and negligible capillary pressure. WAG is applied from the start, rather than as a tertiary method. Relevant equations are presented below:
t ( ϕ s o ) = x ( f o u T x ) + z ( f o u T z ) + z ( K z g λ w f o Δ ρ w o ) z ( K z g λ g f o Δ ρ o g )
t ( ϕ s w ) = x ( f w u T x ) + z ( f w u T z ) z ( K z g λ o f w Δ ρ w o ) z ( K z g λ g f w Δ ρ w g )
x u T x + z u T z = 0 ,
u T x = K x λ T x p , u T z = K z λ T z p + K z g ( λ o ρ o + λ w ρ w + λ g ρ g )
where ϕ denotes porosity, s i saturation of phase i , f i fractional flow, K permeability, λ i mobility, ρ i density, u T total Darcy flux and p pressure. Corey correlation was applied for relative permeabilities, while gas relative permeability hysteresis was incorporated using Land’s trapping model [12] (which reduces the mobile gas saturation interval based on the parameter C ) and Carlson’s hysteresis model [15] with parameter α (which reduces gas relative permeability).
Nygård and Andersen [47] ran simulations systematically to investigate the role of gravity segregation, the mobility ratios between the three phases, heterogeneity, hysteresis and WAG ratio and how they affected RF after 1.5 pore volumes of fluid were injected. The simulations were scaled using a combined dimensionless mobility ratio M * stating how effectively the injected fluids displaced oil under the given conditions, summarized as follows. In the design of this number, the mechanisms were incorporated one at a time. We refer to Table A1, Table A2 and Table A3 in Appendix A for several important simulation input parameters or model configurations that were constant in the simulations. More details can be found in the original paper. The fact that these parameters remained constant was mainly due to prioritization. However, these input parameters were incorporated in the dimensionless numbers presented in the following sections.
Figure 1. The geometrical configuration of the model (modified from [47]). is the distance from the injector, while is the distance from the top of the reservoir.
Figure 1. The geometrical configuration of the model (modified from [47]). is the distance from the injector, while is the distance from the top of the reservoir.
Energies 15 00656 g001
Oil recovery factor ( RF ) is the output parameter of interest, defined as:
RF = v o l u m e   o i l   p r o d u c e d v o l u m e   o i l   i n i t i a l l y   i n   p l a c e = 1 j = 1 N z k = 1 N x ϕ ( z j ) s o ( x k , z j ) j = 1 N z k = 1 N x ϕ ( z j ) s o i
Every grid block features same dimension Δ x Δ z . RF is reported after 1.5 pore volumes are injected.

2.2. WAG Efficiency Characterization Using Dimensionless Number

The characteristic mobility ratio M * defined by Nygård and Andersen [47] features the following functional relation:
M * = ( r w M w / o * F H F G w / o + 1 r w M g / o * F H F G g / o ) 1
r w is the volume fraction of water in each cycle. Larger M * is associated with lower recovery factor. Characteristic two-phase mobilities λ i * for each phase i were found by averaging their mobility over their mobile saturation interval and used to define two-phase mobility ratios M w / o * , M g / o * :
M w / o * = λ w * λ o w * , λ w * = k r w m a x μ w ( 1 s w r s w , m a x ) ( n w + 1 ) , λ o w * = k r o w m a x μ o ( 1 s o r w s o w , m a x ) ( n o w + 1 )
M g / o * = λ g * λ o g * , λ g * = k r g m a x μ g ( 1 s g r s g , m a x ) ( n g + 1 ) , λ o g * = k r o g m a x μ o ( 1 s o r g s o g , m a x ) ( n o g + 1 )
s i , m a x denotes the saturation of phase i where end-point relative permeability k r i m a x is obtained, n i is the Corey exponent and μ i is viscosity. A heterogeneity factor F H was derived from the horizontal permeability K x j and layer height h j distribution over layers j = 1 : N L :
F H = K ¯ x a r K ¯ x h a , K ¯ x a r = ( j = 1 N L h j ) 1 j = 1 N L h j K x , j , K ¯ x h a = ( j = 1 N L h j ) ( j = 1 N L h j K x , j ) 1
Two-phase gravity numbers were defined using the ratio of two-phase segregation time t s e g and the residence time t r e s of the injected phase:
N G w / o = t r e s w t s e g w / o , t r e s w = L x L y j = 1 N L ϕ j h j Q w , t s e g w / o = H ϕ K z h a Δ ρ w o g ( 1 λ w * + 1 λ o w * )
N G g / o = t r e s g t s e g g / o , t r e s g = L x L y j = 1 N L ϕ j h j Q g , t s e g g / o = H ϕ K z h a Δ ρ g o g ( 1 λ g * + 1 λ o g * ) ,
It was found that the role of gravity depended on heterogeneity and two-phase gravity factors F G w / o , F G w / o accounting for this coupling were introduced:
F G w / o = 1 + a 1 ( N G w / o ) a 2 1 + a 1 ( F H 1 ) ( N G w / o ) a 2 , F G g / o = 1 + a 1 ( N G g / o ) a 2 1 + a 1 ( F H 1 ) ( N G g / o ) a 2
Note the unitless tuning parameters a 1 = 3 and a 2 = 0.5 . Finally, hysteresis was incorporated into the gas characteristic relative permeability. Land’s parameter C defines a hysteresis residual gas saturation s g r h y s t :
s g r h y s t = s g r + s g , m a x s g r 1 + C ( s g , m a x s g r )
A further modification according to r w was made:
s g r w a g = s g r ( 1 r w ) + r w s g r h y s t
Additionally, the gas relative permeability end point k r g m a x in λ g * , (see Equation (8)), was reduced due to hysteresis. The reductions were performed individually for the gas–oil mobility ratio and the gas–oil gravity number first based on the parameter α and heterogeneity factor F H using unitless tuning parameters b 1 = 1 , b 2 = 0.5 and b 3 = 10 , b 4 = 2 :
k r g , M m a x , h y s t = k r g m a x 1 + b 1 F H b 2 α , k r g , N G m a x , h y s t = k r g m a x 1 + b 3 F H b 4 α
Next, the fraction r w was incorporated according to:
k r g , M w a g = ( 1 r w k r g m a x + r w k r g , M m a x , h y s t ) 1 , k r g , N G w a g = ( 1 r w k r g m a x + r w k r g , N m a x , h y s t ) 1
We then obtained the hysteresis-corrected characteristic gas mobilities λ g , M * and λ g , N G * by replacing s g r with s g r w a g in (8), while the end-point relative permeability k r g m a x in (8) was replaced by k r g , M w a g in the gas-oil mobility ratio M g / o * in (8) and by k r g , N G w a g in the gas–oil gravity number N G g / o in (11):
λ g , M * = 1 μ g ( 1 s g r w a g s g , m a x ) k r g , M w a g n g + 1 , λ g , N G * = 1 μ g ( 1 s g r w a g s g , m a x ) k r g , N G w a g n g + 1
Every input parameter is incorporated in the dimensionless number M * . Note that during single-phase injection ( r w = 0   or   1 ) , the two-phase parameters involving the phase not injected do not affect M * . Similarly, hysteresis does not affect M * during single-phase injection.

2.3. Workflow

2.3.1. Model Input Parameters

In the first model, MOD1, we take
x 0 = log 10 M *
as the only input parameter to predict RF. We also consider a machine learning model (MOD2) in which the following eight dimensionless numbers are used as input parameters:
x 1 = r w , x 2 = log 10 F H , x 3 = α , x 4 = log 10 C , x 5 = log 10 M g / o * , x 6 = log 10 M w / o * , x 7 = log 10 N G w / o , x 8 = log 10 N G g / o
These numbers reflect injected fluid fractions x 1 , heterogeneity x 2 , hysteresis x 3 , x 4 , relative magnitude of fluid mobilities x 5 , x 6 and gravity vs. advective forces x 7 , x 8 . They incorporate all the input parameters used in the Eclipse model and the number M * . The overall workflow is demonstrated in Figure 2, where the two modeling approaches after the data collection step are indicated. In MOD1, a polynomial regression is performed, while machine learning is used for MOD2. The data and the detailed steps for developing MOD2 are explained below.

2.3.2. Reservoir Simulation Dataset and Model Approaches

In addition to the 1648 WAG and 96 single phase injection simulations generated by [47], 824 new WAG simulations were performed with new C and α values combined with existing combinations of heterogeneity, density and mobility. In the previous study, C and α were selected primarily to cover no or significant hysteresis. The values α = 0 and C = 1000 were assigned to points without hysteresis influence from the respective parameters. For MOD1, each simulation allows the calculation of M * , which is input to the corresponding output RF . The 1648 + 96 + 824 = 2568 data points were analyzed with MOD1 and correlated using a polynomial expression between RF and x 0 .
For the single-phase injection of water ( r w = 1 ) , values for N G g / o and M g / o * are not well defined since gas is not injected. The case is similar regarding N G w / o and M w / o * at r w = 0 (gas injection). Further, hysteresis parameters C , α should not matter. The insensitivity of M * to the mentioned parameters under single-phase injection was ensured during its derivation. To properly define the input values under single-phase injection for MOD2 the following approach was taken:
Each single-phase data point was duplicated to 16 data points, in which all combinations of high and low values of the four missing parameters were assigned. Specifically, for gas injection (points with r w = 0 , indexed ‘g’), the following values were set for x 3 , x 4 , x 6 , x 7 :
x 3 , g = x ¯ 3 , W A G ± X σ 3 , W A G , x 4 , g = x ¯ 4 , W A G ± X σ 4 , W A G , x 6 , g = x ¯ 6 , W A G ± X σ 6 , W A G , x 7 , g = x ¯ 7 , W A G ± X σ 7 , W A G ,  
where x ¯ i , W A G   ( i = 3 , 4 , 6 , 7 ) indicate the average of the data point values applied in the WAG cases and σ i , W A G   ( i = 3 , 4 , 6 , 7 ) the corresponding standard deviations. X is a multiplier. Similarly, for water injection (indexed ‘w’) the following values were set for x 3 , x 4 , x 5 , x 8 :
x 3 , w = x ¯ 3 , W A G ± X σ 3 , W A G , x 4 , w = x ¯ 4 , W A G ± X σ 4 , W A G , x 5 , w = x ¯ 5 , W A G ± X σ 5 , W A G , x 8 , w = x ¯ 8 , W A G ± X σ 8 , W A G ,  
The 96 single-phase simulations resulted in 16 96 = 1536 points to the ML model. In total, 1648 + 824 + 16 96 = 4008 data points were then applied in MOD2.

2.3.3. Machine Learning Dataset Preparation

The 2472 WAG cases and 96 single-phase injection cases were both divided randomly between three sets: 70% in the training set, 15% in the validation set and the remaining 15% in the testing set [48]. Single-phase data points within each set were further split into 16 points, as described previously. See Table 1 for an overview of points in the models and datasets.

2.3.4. Machine Learning Workflow

We apply LSSVM with radial basis kernel (RBK) function and either PSO, GWO, GA or GSA as optimizers. Each of the optimizers features its own strengths and disadvantages in finding global optima efficiently, as well as depending on its individual tuning parameters. They are swarm-based algorithms, making use of many potential solutions simultaneously and improving these solutions according to those performing best at a given iteration. The result is the existence of differently optimized LSSVM models, as indicated in Figure 2. Detailed explanations of the algorithms are provided in Appendix B and Appendix C, respectively. Each input parameter x i was normalized (denoted x N ) to a range between −1 and +1 based on the maximum and minimum values of the total dataset.
x N = 2 x x m i n x m a x x m i n 1
Assuming predefined values of the metaparameters ( σ , γ ), LSSVM provides the function y ( x ) and its coefficients α k , b that minimize the error between the model predictions and observations of a given dataset (usually the training dataset) for those parameter choices [27]:
y ( x ) = i = 1 n α i exp ( | | x N i x N 2 | | σ 2 ) + b
For given metaparameters, the LSSVM algorithm is calibrated on the training set to provide choices of α k , b . The optimizer algorithm is used to search for the metaparameters σ , γ that minimize the model prediction error on the validation dataset (i.e., many LSSVM models are calibrated on the training set and the one giving best prediction on the validation set is taken as the best). This systematically determines the best choice of metaparameters to avoid over- or under-fitting. The optimized LSSVM models were finally used to predict the data in the testing set. The model performing best overall was selected for further sensitivity analysis. For proper comparison, the optimizers were implemented with the same random initial solutions guesses (and velocities if applicable), search space and number of iterations. The optimizer parameters can be found in Table A4.
An advantage of LSSVM is its few (two) metaparameters and the automated optimization of its internal tuning parameters [27]. In comparison, artificial neural networks often need subjective selection of the number of nodes and layers and then comprehensive tuning of a vast number of weights and biases to train the network [49].
The correlation between input variables x and output y (RF) is quantified using Pearson correlation r x y P , Spearman rank r x y S p and distance correlation r x y D coefficients. These indices, respectively, indicate linear correlation, nonlinear monotonic correlation and nonlinear nonmonotonic correlation. The former two range between −1 and 1, while the latter ranges from 0 to 1. For all of them, 0 indicates no correlation. The goodness-of-fit between the calculated RF from MOD1 or MOD2 and the data values of RF were quantified using the coefficient of determination R 2 and the root mean square error RMSE . The definitions of the mentioned quantities are in Appendix D.

3. Results and Discussion

3.1. Preliminary Dataset Analysis

The range and mean of the data points used in MOD1 and MOD2 (using X = 0.5 ) are listed in Table 2. The use of logarithms made the range of the different variables span a few units rather than orders of magnitude.
The Pearson, Spearman and distance correlation coefficients evaluated between RF and the input parameters were calculated for the two model datasets and are listed in Table 3. For MOD1, the two former coefficients were 0.94 and the latter 0.93 . Their magnitude being close to 1 indicates a strong linear correlation between RF and x 0 = log 10 M * and the negative sign indicates that a larger M * reduces RF.
For MOD2, Pearson correlation coefficients were reported for WAG cases only and for all cases when X = 0.25 , 0.5 and 1 in Table 3. Spearman rank and distance correlation were calculated only for X = 0.5 . Considering the WAG cases, several variables correlate with RF , especially x 5 ,   x 6 which are the log of gas/oil and water/oil mobility ratios. They feature Pearson coefficients r x y P ~ 0.5 to 0.6 indicating that when they increase (less favorable mobility ratio towards the oil), RF is reduced. Heterogeneity, represented by x 2 , also correlates with RF with a lower r x y P ~ 0.27 , indicating that RF generally reduces when the heterogeneity factor increases. The hysteresis parameters x 3 , x 4 correlate with RF in opposite ways to each other, with r x y P ~ 0.15 for x 3 and 0.15 for x 4 . When x 3 (i.e., α ) increases, gas relative permeability is reduced and should improve RF. Higher x 4 (i.e., log C ) leads to less gas trapping and RF therefore decreases. The gravity numbers feature relatively poor Pearson correlation with RF , with r x y P ~ 0.08 for water-oil and 0.0045 for gas-oil. Similarly, the water volume fraction correlates little with RF , and r x y P ~ 0.05 is slightly negative. These results could be related to their coupled nature, as is discussed below.
When considering the MOD2 datasets with single-phase data included for different X , we note that the magnitude and sign of the different Pearson coefficients are similar to when only the WAG cases were considered. The main difference is that the correlation is somewhat lower, especially the parameters with unspecified information during single-phase injection. This was expected, since we added points where RF does not vary with changes in these parameters.
When evaluating the dataset with Pearson rank correlation for X = 0.5 , we observe similar, but slightly lower values as for the Pearson coefficient, except for x 8 , where the correlation doubles, but remains very low. When considering the distance correlation coefficient, however, several input parameters correlate more strongly with recovery, indicating that their relation is nonmonotonic. In particular, the water fraction, x 1 , features a higher distance correlation coefficient, of 0.19 . WAG was expected to perform better than single-phase injection, with RF not changing linearly with x 1 , but peaking. Gravity can be a cause of both low and improved sweep and the gravity numbers x 7 , x 8 feature distance correlation coefficients around 0.08, where more impact is attributed the gas–oil gravity number in particular. Furthermore, the hysteresis parameters x 3 , x 4 now seem to correlate more strongly. The three correlation coefficients are similar for x 2 , indicating a relatively linear and monotic relation, so if all other parameters are constant, increased heterogeneity should reduce recovery.
Note that all the variables in MOD2 feature less correlation than the variable x 0 in MOD1, since they individually do not contain all the involved system parameters. The aim is for them to provide better predictions when combined.

3.2. Development of MOD1

RF is plotted against x 0 in Figure 3 and demonstrates a clear correlation where higher x 0 gives lower RF. There is also significant scatter, meaning a given x 0 can be associated with a range of RF values. The data were fitted to a third-order polynomial function, given by the blue curve in Figure 3 and Equation (24). A higher order polynomial did not further reduce the RMSE, which means the remaining error was associated primarily with the scatter in the data.
RF = i = 1 : 4 p i ( x 0 ) 4 i ,     ( R 2 = 0.889 ;     RMSE = 0.0498 ) p 1 = 0.01645 , p 2 = 0.06302 , p 3 = 0.1393 , p 4 = 0.7676
The performance of the model is also shown by comparing the estimated RF and the data RF in Figure 4a together with a histogram, Figure 4b, of the residual errors (the difference between the estimated RF and the data point RF). The R2 = 0.889 is relatively high. As seen in the histogram, the residuals are symmetrically distributed around zero and roughly 95% of the points estimate RF correctly within ± 0.1 . The RMSE, which can be considered a more typical error, is 0.050.

3.3. Development of LSSVM Model MOD2

The ML model MOD2 was developed using LSSVM and a dataset assuming X = 0.5 . The best LSSVM model was determined using different approaches. First, a random choice of metaparameters ( σ , γ ) = ( 1 , 1 ) was used. Next, LSSVM was applied with the different optimization algorithms, PSO, GA, GWO and GSA, to systematically find the best metaparameters. As previously mentioned, for any combination of the metaparameters, LSSVM models were fitted to the training set and used to forecast the validation set. The metaparameters that resulted in the best performance in the validation set, after using a given optimizer algorithm, determined the best model. The test set was then forecasted.
In Figure 5, the performance of the different algorithms is illustrated as a function of the iterations performed. R2 and RMSE for the validation set are plotted for the best solution at the given iteration, together with the corresponding values of log γ and log σ in plots a to d, respectively. The same initial solutions and number of iterations were applied in all the algorithms (different colors). Two different initializations were applied for robustness (dashed and full lines).
In all cases, a high R 2 0.996 and low RMSE 0.009 were obtained after 30 iterations for all the algorithms and both starting points, although GSA deviated from initially good solutions and converged slowly or to inferior solutions. Furthermore, GA seemed to not produce as good results as PSO and GWO. The two algorithms, PSO and GWO, exhibited very similar values of log γ 5.5 and log σ 0.3 and the lowest error indicating that they were better able to find the global optima. Notably, the values of log γ and log σ varied significantly during the iterations, but mostly exhibited very good performance. This may have been due to the ability of LSSVM to tune its internal parameters α i and b for any given γ , σ .
The best metaparameters obtained during the 30 iterations are listed in Table 4, considering all four algorithms and both initializations. The corresponding metrics (R2 and RMSE) were calculated on the training, validation and testing sets. All the optimized models performed better than the algorithm with the arbitrarily preset metaparameters, although this choice also performed well, with R 2 greater than 0.969 on all three sets. The optimized models exhibited very consistent performance in the three datasets, with R 2 0.999 in the training set, 0.996 in the validation set and 0.992 on the testing set. The difference in R 2 was in the fourth digit for the first two sets and the third digit in the latter set. The RMSE was around 0.006 on the training set, 0.009 on the validation set and 0.015 on the test set, with GSA standing out with the highest RMSE. As final metaparameter values in the optimized model, we took an average of the four similar results from the PSO and GWO runs with two significant digits. Calculating the RMSE and R2 metrics on the datasets confirmed that the performance with these values was still optimal (see Table 4). The LSSVM model with these parameters is referred to as MOD2 in what follows. Note especially that MOD2 is capable of predicting unseen single-phase data and thus accounts for the physics introduced during the modification of the dataset.
The RMSE and R2 were calculated with MOD2 for the total dataset as 0.0080 and 0.9976, respectively. These metrics are greatly improved compared to MOD1, which featured a corresponding RMSE of 0.0498 and an R2 of 0.889. The calculated (with MOD2) and observed RF data are plotted against each other for the three datasets in Figure 6. For all three datasets, there is little scatter around the perfect match line. The residual errors were calculated for each datapoint in the full dataset and the results are plotted as a histogram in Figure 7. Approximately 90% of the data feature errors in the estimated RF of less than 0.01 , and 95% of the data feature errors less than 0.02.
Partial derivatives with respect to each normalized variable, y x N i , were calculated for each data point using MOD2 and histograms were created for each variable, as shown in Figure 8. The derivatives were calculated numerically and two choices of Δ x N = 5 10 2 and 10 3 were used. The two choices produced practically identical histograms, suggesting that the optimized LSSVM function did not suffer from oscillations (a sign of over-fitting). For each variable, a large fraction of the points featured positive and negative derivatives. Hence, changing the variable can affect RF positively or negatively, indicating coupling and room for finding optimal conditions.
For y x N 1 we see positive and negative values, which is reasonable, since RF should be higher for WAG than single-phase injection. y x N 5 and y x N 6 are both dominated by negative values, since increasing the mobility ratio between gas and oil or between water and oil, respectively, should reduce RF. Higher water–oil gravity segregation is considered negative for RF, with y x N 7 mainly negative. On the other hand, gas–oil gravity segregation is considered mainly positive for RF with a majority of points having y x N 8 positive. This could be attributed to the better sweep of low-permeable layers in heterogeneous cases. Hysteresis appears to benefit recovery, as seen by a majority of positive y x N 3 , although the effect of y x N 4 seems to be equally negative and positive.

3.4. Sensitivity Analyses with Optimized LSSVM Model MOD2

The calibrated model, MOD2, was much better at predicting RF than MOD1 and was therefore pursued in the sensitivity analysis. Below, we present contour plots showing RF as a function of different input variables, while keeping the others constant. The parameters are kept within the total dataset range (see Table 2) in order to ensure model validity.

3.4.1. Variation of Oil Viscosity

Oil viscosity can vary greatly from one reservoir to another. It proportionally impacts mobility ratios M w / o * and M g / o * in Equations (7) and (8), represented by x 6 and x 5 . For low oil mobilities, the gravity numbers N G w / o and N G g / o (represented by x 7 and x 8 ) increase proportionally with oil viscosity but are less dependent if water or gas feature mobility that is similar to or lower than that of oil (see Equations (10) and (11)). For simplicity, we assume they are proportional. We vary the oil viscosity by 2.0 orders of magnitude, which is less than the smallest range of the four dimensionless numbers (2.3 for x 6 ), as seen in Table 2.
Four cases are defined in Table 5 with low or high heterogeneity (low or high x 2 ), and a low or high degree of hysteresis (low x 3 and high x 4 and opposite, respectively). For each of these cases, RF is plotted as a function of x 1 (the water fraction) and x 5 (representing the gas–oil mobility ratio) representing different viscosities (see Figure 9).From the figure, we observe that:
-
Optimal RF values were mainly obtained at an intermediate water fraction 0 < x 1 (consider any line parallel with the x-axis), suggesting that WAG gives higher RF than single-phase injection. Cases with low hysteresis and favorable mobility ratios seem to give similar RF for water injection and WAG (although WAG with a low water fraction seems optimal) (see Figure 9a,b (low and high heterogeneity)).
-
The advantage of WAG over single-phase injection was most clear when hysteresis was significant (see Figure 9c,d). The best water fraction produced RF up to 0.3 units higher than the worst fraction. This strong impact was mainly at low oil viscosity (low x 5 ) with optimal water fraction around 0.5–0.6. For higher oil viscosity or lower heterogeneity cases, WAG was in many cases only marginally better (~0.05 units) than the best single-phase injection.
-
Increased oil viscosity reduced RF for a given water fraction (follow any line parallel with the y-axis). This was dominant over the WAG fraction at high viscosities, except for the highly heterogeneous cases with high hysteresis (Figure 9d). This demonstrates the benefit of WAG in heterogeneous formations and that hysteresis is an important contributor.
-
For a given heterogeneity (low or high), increased hysteresis improved RF (compare Figure 9c,d (high hyst) with Figure 9a,b (low hyst)). This was related to the improved gas–oil mobility ratio and reduced gravity segregation, which improves volumetric sweep. The optimal water fraction shifted to more central values, since both phases are needed for hysteresis.
-
For a given hysteresis state, increased heterogeneity reduced RF, especially for cases with less viscous oil (compare Figure 9b,d (high het) with Figure 9a,c (low het)). For high hysteresis cases, increased heterogeneity increased RF in cases with more viscous oil.
To better understand the relation between viscosity, heterogeneity and hysteresis, we plotted RF as a function of x 2 = l o g F H and x 5 = l o g ( M g o * ) for x 1 = 0.5 (WAG injection with equal volume fractions of gas and water) for the two hysteresis cases in Figure 10. Each value of x 5 represents fixed oil viscosity and the curves cover the same viscosity range as before. We observed that that:
-
For low hysteresis (Figure 10a), RF was very sensitive to heterogeneity for low oil viscosities and increased heterogeneity reduced RF. For high viscosity, RF changed little with heterogeneity.
-
With significant hysteresis (Figure 10b), low-viscosity cases produced reduced RF at higher heterogeneity, while high-viscosity cases produced increased RF.

3.4.2. Variation of Well Distance, Injection Rate or Density Difference

The distance between wells can vary from a dense pattern of a few hundred meters onshore to ~1000 m offshore. For fixed injection rates, a longer well distance L x proportionally increases the residence time and, hence, the gravity numbers (see (10)), represented by x 7 and x 8 . Similarly, increasing the injection rates of water Q w and gas Q g equally reduces the residence times and the gravity numbers. Increased density differences reduce the segregation time and increase the gravity numbers. If the height is varied but the injection rate is the same, we note that both segregation time and residence time change equally and there is no net change in the gravity numbers. Varying the aforementioned parameters does not affect the variables x 1 to x 6 ; we can thus investigate cases in which they are constant and only the gravity numbers change.
We plotted RF as a function of injected water fraction x 1 and log gravity number (equal values of x 7 and x 8 ). We investigated the role of mobility ratio, heterogeneity and hysteresis one by one. The different cases are listed in Table 6. The gravity numbers varied equally by 2.5 orders of magnitude.
When low heterogeneity x 2 = 0 is considered (Figure 11a), RF stays fairly constant at low N g (when the impact from gravity is negligible) and decreases when N g is large due to gravity segregation and reduced vertical sweep. At high heterogeneity ( x 2 = 1 ) in Figure 11b, RF is generally lower, but increases significantly with increases in the gravity number. Gravity therefore exerts a positive effect as more of the low-permeable layers are swept by gravity drainage into the highly permeable layers [47].
A relatively heterogeneous case ( x 2 = 0.8 ) is considered where either mobility ratio is favorable, Figure 12a, or unfavorable, Figure 12b. In both cases, increased gravity number improves RF, but the effect is more pronounced in the favorable mobility ratio case. In the unfavorable mobility case, gravity exerts little impact until the gravity number exceeds −3. RF is generally higher in favorable mobility ratio cases compared to corresponding unfavorable mobility ratio cases.
In a relatively uniform case ( x 2 = 0.3 ) with intermediate mobility ratios x 5 = x 6 = 1.5 ), hysteresis is varied. At low hysteresis, Figure 13a, increased gravity numbers increase RF moderately towards an optimal gravity number. At high hysteresis, Figure 13b, the optimal gravity number occurs at a lower value (for a given injected fraction). The peak can be related to improved sweep in low mobility layers, which becomes dominated by gravity segregation at the highest gravity numbers.

3.4.3. Handling Single Phase Data

The model MOD2 was trained to provide the same R F during single-phase injection when varying input variables related to hysteresis and the phase not injected (for example, gas during water injection). This was performed by generating points with different input values for parameters that should not exert an influence, but with the same output. To check how effectively this was captured by the calibrated model, we ran cases in which the hysteresis parameters x 3 , x 4 and mobility ratio parameters x 5 , x 6 were varied individually. RF was plotted against the relevant variable and r w ranging from gas to water injection. The input parameters are listed in Table 7 and the results are shown in Figure 14.
The variation of the hysteresis parameters x 3 and x 4 , in Figure 14a,b, respectively, produces relatively constant RF with gas injection r w = 0 (RF~0.45) and water injection r w = 1 (RF~0.30), although the variation of x 3 during gas injection produces a wider range (RF = 0.30–0.45). The levels of RF differ, as is expected, since gas and water injection perform differently. Varying the gas-oil mobility ratio M g / o * (via x 5 in Figure 14c) produces much less change in RF with water injection (RF~0.3) than gas or WAG injection. Similarly, varying the water–oil mobility ratio M w / o * (via x 6 in Figure 14d) produces much less change in RF with gas injection (RF~0.45) than with water or WAG injection.

3.5. Application to a 3D Model

The dataset used to train MOD1 and MOD2 was based on a 2D layered model. By obtaining effective parameters from 3D reservoir models, we can predict RF from MOD1 and MOD2. An artificial 3D heterogeneous model was considered with curvature, faults and three layers (see Figure 15). The average permeability, average porosity and layer thickness are listed in Table 8. The vertical permeability was half of the horizontal permeability. An injector and producer were placed at a distance of 1500 m and the pore volume was based on a width of 750 m. The RF was calculated after the injection of 1.5 PV, assuming five injected fractions ( r w equal 0, 0.33, 0.5, 0.67 and 1), two oil viscosities (30 and 110 cP) and low and high hysteresis (see values of x 3 and x 4 ).
In Figure 16, we plotted RF as a function of x 1 = r w for the four cases, calculated with the 3D model, MOD1 and MOD2. In these examples, RF with water injection (0.13 and 0.35 for high and low oil viscosity) is higher than with gas injection (0.09 and 0.22) and RF is lower with more viscous oil. When hysteresis is high, WAG exhibits the best performance out of all the models (the RF peaks at 0 < r w < 1 ). When hysteresis is low, the 3D model features similar RF for a high WAG fraction to water injection, while MOD1 indicates water injection as optimal and MOD2 still clearly supports WAG. MOD1 predicts a level of RF and change in RF that are more similar to those seen in the 3D model than MOD2. MOD2 predicts the level of RF in low hysteresis relatively well, but appears very sensitive to adding hysteresis. This is also seen through a difference in the single-phase injection RF for the same oil viscosity: about 0.2 units for water injection and for gas injection with low amounts of viscous oil, but, more reasonably, 0.03 units for gas injection with high amounts of viscous oil. This indicates that this region of the model could be better calibrated. The water injection points in this case could be insufficiently near the single-phase points in the dataset. Furthermore, we do not expect MOD1 and MOD2 to predict the 3D model behavior identically as the geometries are not the same.

4. Conclusions

In this study, we interpreted a dataset of ~2500 points generated from single-phase and WAG injection reservoir model simulations to predict the recovery factor, RF. Two modeling approaches were selected. In MOD1, a universal dimensionless number M * derived from Nygård and Andersen [47] was selected as the single input variable and correlated by a polynomial expression. In MOD2, eight dimensionless numbers were used as input variables. Both choices included all the relevant input parameters to run the reservoir simulations. MOD2 was developed using LSSVM and optimized based on the best results from PSO, GA, GWO and GSA. The overall conclusions to this work can be summarized as follows:
-
We demonstrated that it is possible to predict the recovery factor during single-phase and WAG injection.
-
The LSSVM model optimized by GWO or PSO performed better than when optimized by GA or GSA.
-
MOD2 with eight input variables clearly performed better than MOD1 with one input. Based on the total dataset, the RMSE and R2 were 0.0080 and 0.998 for MOD2 and 0.050 and 0.889 for MOD1, respectively.
-
The physics-based training of MOD2 was applied successfully. Single-phase injection data points were duplicated using different values in the input variables that should not affect RF, while keeping the same values for the relevant input variables and the output. The model correctly displayed little response to the irrelevant variables, but not for all conditions. Improvements could be made by adding more of these points or by training the model to include such constraints via an added penalty term in the objective function. MOD1 was analytically independent of these variables during single-phase injection.
-
Plotting histograms of partial derivatives of RF showed that for most input variables, increasing them would increase RF for some conditions, but reduce RF under others, demonstrating coupling in the data.
-
The best model (MOD2) predicted that under identical conditions, an optimal injected WAG fraction existed that outperformed single-phase injection (water or gas). The benefit of WAG was much clearer when gas relative permeability hysteresis was significant.
-
The mobility ratios were important input variables. Increased values tended to reduce RF.
-
The roles of gravity numbers, heterogeneity and hysteresis were coupled. Strong gravity effects reduced RF in low-heterogeneity cases, but improved RF in heterogeneous cases.
Finally, some limitations of the study and recommendations should be mentioned. Some parameters were not varied in the dataset and their role can therefore not be predicted by the model. This includes the reservoir dip angle, capillary forces, starting WAG in tertiary mode (with some period of gas or water injection first), gas miscibility and tapering (changing WAG ratio with time). Furthermore, the heterogeneity of the model was mainly described by one parameter although porosity and vertical permeability appear in other dimensionless numbers. It could also matter how the heterogeneity appears, i.e., permeability increasing up or down. We note that several parameters that were not varied are included in a physically meaningful manner into dimensionless numbers that were varied. Thus, considering new values of Corey parameters, the vertical-to-horizontal permeability ratio and reservoir layer configurations are accounted for. The proposed methodology can be applied to predict the performance of other EOR techniques as well, but requires a similar development of representative dimensionless numbers and parameters capturing the EOR effect.
It is recommended to explore the potential of physics-based machine learning [50,51] in combination with dimensionless numbers describing complex systems, as was considered in this study. The methodology of modifying the dataset as described offers the advantage of applying ML algorithms in their standard form. On the downside, the dataset is enlarged and the physics are added around the specific datapoints, not as inherent part of the model.

Author Contributions

Conceptualization, P.Ø.A.; methodology, P.Ø.A.; software, P.Ø.A.; formal analysis, P.Ø.A.; investigation, P.Ø.A., J.I.N., A.K.; data curation, P.Ø.A., J.I.N., A.K.; writing—original draft preparation, P.Ø.A., A.K.; writing—review and editing, P.Ø.A.; visualization, P.Ø.A.; Supervision, P.Ø.A.; project administration, P.Ø.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data can be provided upon request from the corresponding author.

Acknowledgments

Andersen acknowledges the Research Council of Norway and the industry partners, ConocoPhillips Skandinavia AS, Aker BP ASA, Vår Energi AS, Equinor ASA, Neptune Energy Norge AS, Lundin Norway AS, Halliburton AS, Schlumberger Norge AS and Wintershall DEA, of The National IOR Centre of Norway, for support.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

Roman
b LSSVM constant
C Land’s trapping parameter
F H Heterogeneity multiplier
F G Gravity multiplier
h i Layer height, m
k r i Relative permeability
k r i m a x Relative permeability endpoints
K x ,   K z Horizontal and vertical absolute permeability, m
L x Distance from injector to producer, m
L y Width of reservoir, m
L z Total height of reservoir, m
M Mobility ratio
M g / o * , M w / o * Effective mobility ratios between gas-oil and between water-oil
M * Effective three phase mobility ratio accounting for all mechanisms
n i Corey exponents,
N p Number of particles
N i t Number of PSO iterations
N G Gravity number, -
R 2 Coefficient of determination, -
r x y Pearson correlation coefficient between vectors x and y, -
rwWater volume fraction in a WAG cycle, -
R F Recovery factor, -
s i Phase saturation, -
s i r Residual phase saturation, -
t Time, seconds
x Horizontal direction towards producer, m
x i Input vector, -
X Standard deviance multiplier, -
y LSSVM output / RF, -
z Vertical direction downwards, m
Greek
α Carlson hysteresis parameter
α i LSSVM coefficients
γ Regularization coefficient
Δ ρ Density difference, kg/m3
λ i Phase mobility ( Pa s ) 1
μiViscosity, Pa⋅s
ρiPhase density, kg/m3
σ RBK width parameter
ϕ Porosity
ϕ 1 , ϕ 2 Acceleration constants
ω Damping factor
Indices
*characteristic value,
a r i t arithmetic
g gas
G gravity
h a r m harmonic
i phase
j layer
o oil
r e s residence
i n i t initial reservoir conditions
s e g segregation
T total
w water
Abbreviations
EOREnhanced oil recovery
LSSVMLeast squares support vector machine
PSOParticle swarm optimization
RMSERoot mean square error
WAGWater alternating gas

Appendix A. Reservoir Model Parameters

Table A1. Rock/grid properties and operational parameters. N denotes number of cells in each direction, L the respective lengths, Q the volumetric rate.
Table A1. Rock/grid properties and operational parameters. N denotes number of cells in each direction, L the respective lengths, Q the volumetric rate.
N x 100 L x 1000 m ϕ j 0.30 Q w 1014.6 m3/dHalf cycle duration45 d
N y 1 L y 100 m h j 3 m Q g 1014.6 m3/d Total injection volume, PVs1.5 PVs
N z 81 L z 81 m
Table A2. Reservoir flow properties in terms of relative permeability end points, Corey exponents, initial and residual saturations.
Table A2. Reservoir flow properties in terms of relative permeability end points, Corey exponents, initial and residual saturations.
k r o w m a x 0.25 n o w 2 S o i 0.842
k r o g m a x 0.25 n o g 2 S w i = S w r 0.158
k r w m a x 0.05 n w 2 S g i = S g r 0.00
k r g m a x 0.005 n g 2 S o r w 0.20
S o r g 0.10
Table A3. Specification of model heterogeneities. Each model had 9 layers with permeability distributed as specified. It was assumed that vertical and horizontal permeabilities were equal in each layer: K z , j = K x , j .
Table A3. Specification of model heterogeneities. Each model had 9 layers with permeability distributed as specified. It was assumed that vertical and horizontal permeabilities were equal in each layer: K z , j = K x , j .
K x   [ mD ]
Layer 1 (top)3003005001000
23001005020
33009005001000
43003005020
53001005001000
63009005020
73003005001000
83001005020
9 (bottom)3009005001000
FH1.02.13.012.9

Appendix B. Least Squares Support Vector Machines (LSSVM)

The support vector machine (SVM) algorithm was developed by Vapnik [25] and used to solve classification problems by building hyperplanes in multidimensional spaces that separated data into classes. Its application was extended to regression. The least squares support vector machine, or LSSVM, is a modification of SVM introduced by Suykens and Vandewalle [26]. The LSSVM regression algorithm is outlined below.
Consider a finite dataset with n points D = { ( x 1 , y 1 ) , , ( x n , y n ) } , where the input x i R p , p being the number of input variables (in our case 1 or 8) and the output y i R . The regression function is expressed as [27]:
f ( x ) = w T φ ( x ) + b ,
where φ is a higher dimensional function and w is a weight vector that combines the contributions of each element of φ to a scalar. Each output measurement y i is by definition equal to the regression plus the error e i :
y i = f ( x i ) + e i , ( i = 1 , , n )
The LSSVM algorithm aims to minimize the objective function J described as follows:
min w , e J ( w , e ) = 1 2 w T w + 1 2 γ i = 1 n e i 2
γ is called the regularization coefficient and its magnitude determines which of the two terms is minimized more. The error equations are treated as equality constraints:
y i = w T φ ( x i ) + b + e i , ( i = 1 , , n )
Solving (A3) and (A4) simultaneously can be transformed to the problem of finding the saddle point of the Lagrange function L which incorporates J and the equality constraints:
L ( w , b , e ; α ) = J ( w , e ) k = 1 n α i { w T φ ( x i ) + b + e i y i }
with Lagrange multipliers α i . The conditions for optimality are found by setting partial derivatives equal to zero:
d L d w = 0 w = i = 1 n α i φ ( x i )
d L d b = 0 i = 1 n α i = 0
d L d e i = 0 α i = γ e i , ( i = 1 , , n )
d L d α i = 0 w T φ ( x i ) + b + e i y i = 0 , ( i = 1 , , n )
We can eliminate e i and w from the above set of equations to obtain the remaining linear equations for α i and b :
i = 1 n α i = 0
b + k = 1 n α k φ ( x k ) T φ ( x i ) + 1 γ α i = y i
By applying Mercer’s condition, the product φ ( x i ) T φ ( x j ) is replaced by a kernel function K ( x i , x j ) :
φ ( x i ) T φ ( x j ) = K ( x i , x j ) , i , j = 1 , , N ( 3.4 )
We can then solve for α i and b by solving the matrix form of (A10) and (A11):
[ 0 1 1 1 K ( x 1 , x 1 ) + 1 γ K ( x 1 , x n ) 1 K ( x n , x 1 ) K ( x n , x n ) + 1 γ ] [ b α 1 α n ] = [ 0 y 1 y n ]
Combining (A4) with (A6) and (A12), the final form of the LSSVM regression function is given by:
y ( x ) = i = 1 N α i K ( x i , x ) + b ( 3.5 )
Different choices of kernel function can be made. The radial basis kernel (RBK) function was selected:
K ( x i , x ) = exp ( | | x i x 2 | | σ 2 )
| | x i x | | denotes the Euclidian distance between vectors x i and x , while σ is the width parameter.
The choice of the metaparameters σ and γ determines the LSSVM algorithm performance. σ controls how rapidly the function can vary around the training data points x i . For very small σ , the function equals the constant b between the points x i , while it matches y i at every x i of the training set. This results in the very poor prediction of new points. Large σ linearizes the function (a straight line for a scalar input variable). Intermediate σ are hence expected to capture non-linear trends. γ controls how much weight is placed on minimizing the mismatch compared to minimizing the magnitude of the nonlinear terms. A very low γ minimizes the coefficients of the nonlinear terms to zero and provides a constant function, equal to b . A very high γ minimizes the mismatch between the function and the training set (between y ( x i ) and y i ) but allows it to be more nonlinear.

Appendix C. Optimization Algorithms

Optimization algorithms are applied to find the optimal combination of LSSVM metaparameters, as represented by the vector β = ( log 10 γ , log 10 σ ) R 2 . The applied parameters common and specific to the algorithms are listed in Table A4.
Table A4. Optimization algorithm parameters. No algorithm specific parameters were required for GWO.
Table A4. Optimization algorithm parameters. No algorithm specific parameters were required for GWO.
Common PSO GA
# particles /chromosomes/wolves N p 20 Acceleration constants ϕ 1 ,
ϕ 2
1.5 ,
1.5
Mutation rate μ r 0.15
# variables/genes N v a r 2 Damping factor ω 0.8 Mutation factor μ f 0.1
# iterations N i t 30 GSA# elite
chromosomes
N e l i t 2
Search range variable 1 β 1 m i n ,
β 1 m a x
2 ,
+ 8
Initial gravity G 0 2 GWO
Search range variable 2 β 2 m i n , β 2 m a x 3 ,
+ 3
Gravity reduction factor α 5 -
Initial velocity range Small constant ε 10 4

Appendix C.1. Particle Swarm Optimization (PSO)

PSO was developed by Kennedy and Eberhart [52] and can be described as follows [53]:
  • Generate an initial set of N p ‘particles’, which are random solution vectors β n 0 ( n = 1 , , N p ) , all in R 2 . The entire set of particles is called the swarm.
    β n , r 0 = U n , r ( β r m i n , β r m a x ) ,     ( r = 1 , 2 ) .
  • The indices n and r refer, respectively, to the particle and the parameter in the n and r vector while U n , r refers to the uniform probability distribution over the specified range.
The particles are assigned initial velocities v n 0 R 2
v n , r 0 = β r m a x β r m i n N p U n , r ( 1 , 1 ) ,     ( r = 1 , 2 ) .
The initial velocity is set to be proportional to the search range and reduced by the number of particles, as they each can cover a shorter interval with more of them.
c.
At a given iteration, the solution estimate of particle n corresponds to its current ‘position’ in the search space, termed β n o l d . The quality of each of the N p solution estimates is evaluated by the coefficient of determination R 2 ( β n o l d ) . The best solution position (with highest R 2 ) a particle obtains while it moves in the search space is saved and updated if it improves. These N p solution vectors are called β n , o p t p   ( n = 1 , N p ) . Similarly, the best solution of all the particles (the swarm) is termed β o p t s . This position updates if the particles find a better solution.
d.
New velocities v n n e w are calculated for each particle n based on the old velocity v n o l d and how far the particle is from its historic best position β n , o p t p and from the swarm’s historic best position β o p t s :
v n n e w = ω v n o l d + U ( 0 , ϕ 1 ) ( β n , o p t p β n o l d ) + U ( 0 , ϕ 2 ) ( β o p t s β n o l d )
ϕ 1 and ϕ 2 are acceleration constants, stating how quickly the particles steer towards the two currently best positions. A sum ϕ 1 + ϕ 2 < 4 avoids unbounded oscillation [53]. ω is a velocity damping factor. A value ω < 1 refines searches at late iterations.
e.
The position of each particle at the next iteration is updated by adding the velocity:
β n n e w = β n o l d + v n n e w
Any particles exceeding the search space limits β r m i n , β r m a x are adjusted to travel no farther than the limit.
f.
Finally, the ‘new’ parameters are set as ‘old’ and a new iteration starts from point c. The procedure stops when a set number N i t of iterations is completed.

Appendix C.2. Gravitational Search Algorithm (GSA)

GSA was developed by Rashedi et al. [54] and considers each solution as a particle.
  • Assign initial positions and velocities according to (A16) and (A17).
  • The gravitational constant is reduced from an initial value G 0 at iteration t = 1 according to a reduction factor α down to G 0 exp ( α ) at the last iteration:
G ( t ) = G 0 exp ( α t 1 N t 1 ) .
Calculate the relative fitness m n for each particle (here using R 2 ) at the current state.
m n = ( R 2 ) n min n ( R 2 ) max n ( R 2 ) min n ( R 2 ) .
The ‘mass’ M n of each particle is then calculated as:
M n = m n n = 1 : N p m n .
c.
For a given particle n , the force F n j working on it from another particle j n is given by:
F n j = G M n M j | β j β n | + ε ( β j β n ) .
where | β j β n | is the Euclidian distance between the particle positions and ε is a small constant (to avoid division by zero).
d.
The acceleration of particle n is then its net force divided by the mass, where random weight components are introduced:
a n o l d = 1 M n j = 1 : n U n , j ( 0 , 1 ) F n j .
e.
The velocities and new positions are calculated as:
v n n e w = U ( 0 , 1 ) v n o l d + a n o l d .
β n n e w = β n o l d + v n n e w .
with coordinates limited by β r m i n , β r m a x . The procedure is repeated between steps b and e.

Appendix C.3. Genetic Algorithm (GA)

In GA each solution, β n R 2 is called a chromosome and the individual elements β n , r   ( r = 1 , 2 ) are called the genes of the chromosome [55].
a.
A first generation of chromosomes is initialized using (A16).
b.
In ‘Selection’, pairs of two chromosomes from the previous generation, called parents, are combined to produce a new generation of chromosomes, ‘children’. The selection of the parents is random with probability P n proportional to their relative fitness:
P n = RMSE n 1 n = 1 : N p RMSE n 1 .
c.
‘Crossover’ is then used to define the new generation chromosomes. In child 1 of a parent pair, the first gene is from parent 1 and the second gene from parent 2. For child 2 of that pair, the first gene is from parent 2 and the second from parent 1.
β c 1 , 1 n e w = β p 1 , 1 o l d ,     β c 1 , 2 n e w = β p 2 , 2 o l d ,     β c 2 , 1 n e w = β p 2 , 1 o l d ,     β c 2 , 2 n e w = β p 1 , 2 o l d .
Generally, in problems with more than two genes, a crossover point must be defined to distinguish which genes are taken from which parent.
d.
‘Mutation’ is the operation of randomly modifying one or both genes in a child. The probability that a given gene is mutated is the mutation rate 0 μ r 1 . Thus, for the fraction μ r of new genes we perform the following modification (while the rest 1 μ r are not modified):
β n , r n e w β n , r n e w + μ f ( β r m a x β r m i n ) U n , r ( 1 , 1 )
The factor μ f is set to a low fraction so the mutation is low compared to the search range of the variables. The coordinates are limited by β r m i n , β r m a x .
e.
‘Elitism’ involves keeping some of the best chromosomes from the previous generation unmodified into the new generation.

Appendix C.4. Grey Wolf Optimization (GWO)

GWO was developed by Mirjalili et al. [56] and considers each solution β n R 2 a ‘wolf’.
a.
Initialize the positions of the N p wolves according to (A16). In this algorithm, we call the positions X instead of β .
b.
At a given iteration the best, second-best and third-best solutions are called the alpha ( α ) , beta ( β ) and delta ( δ ) wolves, respectively. The others are grouped as omega ( ω ) wolves. The positions are denoted X α , X β , X δ and X ω , or X for all the wolves.
c.
Assume the ‘prey’ is located at a position X p . A distance measure to the prey along coordinate r is given by:
D r = | C r X p , r ( t ) X r ( t ) | ,     ( r = 1 : 2 )
and the position at the next iteration is given as:
X r ( t + 1 ) = X p , r ( t ) A r D r
where the coefficients C r and A r are determined as follows:
A r = ( 2 U r ( 0 , 1 ) 1 ) a ,     a = 2 ( 1 t 1 N t o t 1 ) ,     C r = 2 U r ( 0 , 1 )
The magnitude of A r makes it possible to move farther from the prey at early iterations (exploration) and closer at later iterations (exploitation). As the sign of A r can be positive or negative the new position can pass the prey on the given axis.
For each wolf, the position of the prey is estimated by the position of the three top wolves. The position at the next iteration is then the average of the three calculated new positions based on the top three wolves. Mathematically, this is expressed as:
D α , r = | C 1 , r X α , r X r | ,     D β , r = | C 2 , r X β , r X r | ,     D δ , r = | C 3 , r X δ , r X r | ,
X 1 , r = X α , r A 1 , r D α , r ,     X 2 , r = X β , r A 2 , r D β , r ,     X 3 , r = X δ , r A 3 , r D δ , r ,
X r ( t + 1 ) = 1 3 ( X 1 , r + X 2 , r + X 3 , r ) ,     ( r = 1 , 2 )
The best position at a given iteration is described by the position of the alpha wolf. The coordinates are limited by β r m i n , β r m a x .

Appendix D. Statistical Measures

Consider a dataset with n points, in which we have a model trying to predict the observed output y i o b s but actually producing the modelled value y i m o d for point i . The goodness-of-fit the model provides for the dataset is quantified by the coefficient of determination R 2 between forecasted and true output values, also called the Nash–Sutcliffe efficiency [57,58]:
R 2 = 1 i = 1 n ( y i o b s y i m o d ) 2 i = 1 n ( y ¯ i o b s y i m o d ) 2 , y ¯ i o b s = 1 n i = 1 n y i o b s
where values from 0 to 1 correspond to no and perfect correlation, respectively. We also use the Root Mean Square Error (RMSE):
RMSE = ( 1 n i = 1 n ( y i o b s y i m o d ) 2 ) 0.5
Linear correlation between two variables, x and y , is evaluated with the Pearson correlation coefficient r x y P :
r x y P = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
A value close to + 1 or 1 indicates strong positive or negative correlation, respectively. Nonlinear correlation is calculated using Spearman rank correlation r x y S p . This is calculated by calculating the ranks of each value for the input variable x and the output variable y , where the ranks R ( x i ) , R ( y i ) denote the position they would have sorted from least to largest. The rank correlation is then based on the covariance and standard deviations of these rank sets:
r x y Sp = cov ( R ( x ) , R ( y ) ) s t d ( R ( x ) ) s t d ( R ( y ) )
If none of the listed values are of equal rank, the above equation can be stated as
r x y Sp = 1 6 i = 1 : n ( R ( x i ) R ( y i ) ) 2 n ( n 2 1 )
Nonmonotonic correlation can be detected using distance correlation [59]:
r x y D = dCov 2 ( x , y ) d V a r ( x ) d V a r ( y )
defined by the fraction of squared distance covariance between x and y over the root mean of the distance variances of x and y , respectively. We refer to the original work for more details.

References

  1. Stenmark, H.; Andfossen, P.O. Snorre WAG Pilot—A Case Study. In Proceedings of the IOR 1995—8th European Symposium on Improved Oil Recovery, Vienna, Austria, 15–17 May 1995. [Google Scholar]
  2. Christensen, J.R.; Stenby, E.H.; Skauge, A. Review of WAG Field Experience. SPE Reserv. Eval. Eng. 2001, 4, 97–106. [Google Scholar] [CrossRef]
  3. Sadik-Zada, E.R.; Loewenstein, W. A Note on Revenue Distribution Patterns and Rent-Seeking Incentive. Int. J. Energy Econ. Policy 2018, 8, 196–204. [Google Scholar]
  4. Afzali, S.; Rezaei, N.; Zendehboudi, S. A comprehensive review on Enhanced Oil Recovery by Water Alternating Gas (WAG) injection. Fuel 2018, 227, 218–246. [Google Scholar] [CrossRef]
  5. Sanchez, N.L. Management of water alternating gas (WAG) injection projects. In Proceedings of the Latin American and Caribbean Petroleum Engineering Conference, Caracas, Venezuela, 21–23 April 1999. [Google Scholar]
  6. Sohrabi, M.T.D.H.; Tehrani, D.H.; Danesh, A.; Henderson, G.D. Visualization of oil recovery by water-alternating-gas injection using high pressure micromodels. SPE J. 2004, 9, 290–301. [Google Scholar] [CrossRef]
  7. Chen, B.; Reynolds, A.C. Ensemble-based optimization of the water-alternating-gas-injection process. SPE J. 2016, 21, 786–798. [Google Scholar] [CrossRef]
  8. Green, D.W.; Willhite, G.P. Enhanced Oil Recovery, 2nd ed.; Henry, L., Ed.; Society of Petroleum Engineers: Richardson, TX, USA, 2018. [Google Scholar]
  9. Kulkarni, M.M.; Rao, D.N. Experimental investigation of miscible and immiscible Water Alternating Gas (WAG) process performance. J. Pet. Sci. Eng. 2005, 48, 1–20. [Google Scholar] [CrossRef]
  10. Andersen, P.Ø. A simplified modelling approach for petroleum recovery by spontaneous imbibition in naturally fractured reservoirs. J. Nat. Gas Sci. Eng. 2019, 63, 95–114. [Google Scholar] [CrossRef]
  11. Andersen, P.Ø. Early- and Late-Time Analytical Solutions for Cocurrent Spontaneous Imbibition and Generalized Scaling. SPE J. 2021, 26, 220–240. [Google Scholar] [CrossRef]
  12. Land, C.S. Calculation of Imbibition Relative Permeability for Two- and Three-Phase Flow From Rock Properties. Soc. Pet. Eng. J. 1968, 8, 149–156. [Google Scholar] [CrossRef]
  13. Stone, H. Estimation of Three-Phase Relative Permeability And Residual Oil Data. J. Can. Pet. Technol. 1973, 12. [Google Scholar] [CrossRef]
  14. Baker, L.E. Three-phase relative permeability correlations. In Proceedings of the SPE Enhanced Oil Recovery Symposium, Tulsa, OK, USA, 16–21 April 1988. [Google Scholar]
  15. Carlson, F.M. Simulation of relative permeability hysteresis to the nonwetting phase. In Proceedings of the SPE Annual Technical Conference and Exhibition, San Antonio, TX, USA, 4–7 October 1981. [Google Scholar]
  16. Larsen, J.A.; Skauge, A. Methodology for numerical simulation with cycle dependent relative permeabilities. SPE J. 1998, 3, 163–173. [Google Scholar] [CrossRef] [Green Version]
  17. Spiteri, E.J.; Juanes, R. Impact of relative permeability hysteresis on the numerical simulation of WAG injection. J. Pet. Sci. Eng. 2006, 50, 115–139. [Google Scholar] [CrossRef]
  18. Mahzari, P.; Sohrabi, M. An improved approach for estimation of flow and hysteresis parameters applicable to WAG experiments. Fuel 2017, 197, 359–372. [Google Scholar] [CrossRef]
  19. Bourgeois, M.; Joubert, T.; Dominguez, V. Analysis of 3-phase Behavior in WAG Injections for Various Wettabilities. In Proceedings of the IOR 2019—20th European Symposium on Improved Oil Recovery, Pau, France, 8–11 April 2019; pp. 1–16. [Google Scholar] [CrossRef]
  20. Cheng, G.; Guo, R.; Wu, W. Petroleum Lithology Discrimination Based on PSO-LSSVM Classification Model. In Proceedings of the 2010 Second International Conference on Computer Modeling and Simulation, Sanya, China, 22–24 January 2010; Volume 4, pp. 365–368. [Google Scholar]
  21. Alvarado, V.; Ranson, A.; Hernandez, K.; Manrique, E.; Matheus, J.; Liscano, T.; Prosperi, N. Selection of EOR/IOR opportunities based on machine learning. In Proceedings of the European Petroleum Conference, Aberdeen, UK, 29 October 2002. [Google Scholar]
  22. Tahmasebi, P.; Javadpour, F.; Sahimi, M. Data mining and machine learning for identifying sweet spots in shale reservoirs. Expert Syst. Appl. 2017, 88, 435–447. [Google Scholar] [CrossRef]
  23. Chamkalani, A.; Zendehboudi, S.; Bahadori, A.; Kharrat, R.; Chamkalani, R.; James, L.; Chatzis, I. Integration of LSSVM technique with PSO to determine asphaltene deposition. J. Pet. Sci. Eng. 2014, 124, 243–253. [Google Scholar] [CrossRef]
  24. Amar, M.N.; Ghriga, M.A.; Ouaer, H.; Seghier, M.E.A.B.; Pham, B.T.; Andersen, P. Østebø Modeling viscosity of CO2 at high temperature and pressure conditions. J. Nat. Gas Sci. Eng. 2020, 77, 103271. [Google Scholar] [CrossRef]
  25. Vapnik, V. The nature of statistical learning theory. Springer science & business media, New York, 1999. Springer Science & Business Media: New York, NY, USA, 1999. [Google Scholar]
  26. Suykens, J.A.K.; Vandewalle, J. Least Squares Support Vector Machine Classifiers. Neural Process. Lett. 1999, 9, 293–300. [Google Scholar] [CrossRef]
  27. Suykens, J.A.; Van Gestel, T.; De Brabanter, J. Least Squares Support Vector Machines; World Scientific: Singapore, 2002. [Google Scholar]
  28. Alizadeh, S.M.; Alruyemi, I.; Daneshfar, R.; Mohammadi-Khanaposhtani, M.; Naseri, M. An insight into the estimation of drilling fluid density at HPHT condition using PSO-, ICA-, and GA-LSSVM strategies. Sci. Rep. 2021, 11, 1–14. [Google Scholar] [CrossRef] [PubMed]
  29. Bian, X.-Q.; Song, Y.-L.; Mwamukonda, M.K.; Fu, Y. Prediction of the sulfur solubility in pure H2S and sour gas by intelligent models. J. Mol. Liq. 2020, 299, 112242. [Google Scholar] [CrossRef]
  30. Mokarizadeh, H.; Atashrouz, S.; Mirshekar, H.; Hemmati-Sarapardeh, A.; Pour, A.M. Comparison of LSSVM model results with artificial neural network model for determination of the solubility of SO2 in ionic liquids. J. Mol. Liq. 2020, 304, 112771. [Google Scholar] [CrossRef]
  31. Ouaer, H.; Hosseini, A.H.; Amar, M.N.; Seghier, M.E.A.B.; Ghriga, M.A.; Nabipour, N.; Andersen, P.Ø.; Mosavi, A.; Shamshirband, S. Rigorous connectionist models to predict carbon dioxide solubility in various ionic liquids. Appl. Sci. 2020, 10, 304. [Google Scholar] [CrossRef] [Green Version]
  32. Zeng, B.; Guo, J.; Zhang, F.; Zhu, W.; Xiao, Z.; Huang, S.; Fan, P. Prediction model for dissolved gas concentration in transformer oil based on modified grey wolf optimizer and LSSVM with grey relational analysis and empirical mode decomposition. Energies 2020, 13, 422. [Google Scholar] [CrossRef] [Green Version]
  33. Guo, Y.; Xu, Y.-P.; Sun, M.; Xie, J. Multi-step-ahead forecast of reservoir water availability with improved quantum-based GWO coupled with the AI-based LSSVM model. J. Hydrol. 2021, 597, 125769. [Google Scholar] [CrossRef]
  34. Zhang, L.; Ge, R.; Chai, J. Prediction of China’s energy consumption based on robust principal component analysis and PSO-LSSVM optimized by the Tabu search algorithm. Energies 2019, 12, 196. [Google Scholar] [CrossRef] [Green Version]
  35. Song, Y.; Xie, X.; Wang, Y.; Yang, S.; Ma, W.; Wang, P. Energy consumption prediction method based on LSSVM-PSO model for autonomous underwater gliders. Ocean Eng. 2021, 230, 108982. [Google Scholar] [CrossRef]
  36. Bemani, A.; Baghban, A.; Mohammadi, A.H.; Andersen, P. Østebø Estimation of adsorption capacity of CO2, CH4, and their binary mixtures in Quidam shale using LSSVM: Application in CO2 enhanced shale gas recovery and CO2 storage. J. Nat. Gas Sci. Eng. 2020, 76, 103204. [Google Scholar] [CrossRef]
  37. Yuan, X.; Chen, C.; Yuan, Y.; Huang, Y.; Tan, Q. Short-term wind power prediction based on LSSVM–GSA model. Energy Convers. Manag. 2015, 101, 393–401. [Google Scholar] [CrossRef]
  38. Lu, P.; Ye, L.; Sun, B.; Zhang, C.; Zhao, Y.; Teng, J. A new hybrid prediction method of ultra-short-term wind power forecasting based on EEMD-PE and LSSVM optimized by the GSA. Energies 2018, 11, 697. [Google Scholar] [CrossRef] [Green Version]
  39. Li, K.; Liang, C.; Lu, W.; Li, C.; Zhao, S.; Wang, B. Forecasting of Short-Term Daily Tourist Flow Based on Seasonal Clustering Method and PSO-LSSVM. ISPRS Int. J. Geo-Inf. 2020, 9, 676. [Google Scholar] [CrossRef]
  40. Esene, C.; Zendehboudi, S.; Shiri, H.; Aborig, A. Deterministic tools to predict recovery performance of carbonated water injection. J. Mol. Liq. 2020, 301, 111911. [Google Scholar] [CrossRef]
  41. Afzali, S.; Zendehboudi, S.; Mohammadzadeh, O.; Rezaei, N. Hybrid mathematical modelling of three-phase flow in porous media: Application to water alternating gas injection. J. Nat. Gas Sci. Eng. 2021, 94, 103966. [Google Scholar] [CrossRef]
  42. Menad, N.A.; Noureddine, Z. An efficient methodology for multi objective optimization of water alternating CO2 EOR process. J. Taiwan Inst. Chem. Eng. 2019, 99, 154–165. [Google Scholar] [CrossRef]
  43. Amar, M.N.; Zeraibi, N.; Jahanbani Ghahfarokhi, A. Applying hybrid support vector regression and genetic algorithm to water alternating CO2 gas EOR. Greenh. Gases Sci. Technol. 2020, 10, 613–630. [Google Scholar] [CrossRef]
  44. Nwachukwu, A.; Jeong, H.; Sun, A.; Pyrcz, M.; Lake, L.W. Machine learning-based optimization of well locations and WAG parameters under geologic uncertainty. In Proceedings of the SPE Improved Oil Recovery Conference, Tulsa, OK, USA, 14–18 April 2018. [Google Scholar]
  45. You, J.; Ampomah, W.; Sun, Q.; Kutsienyo, E.J.; Balch, R.S.; Dai, Z.; Zhang, X. Machine learning based co-optimization of carbon dioxide sequestration and oil recovery in CO2-EOR project. J. Clean. Prod. 2020, 260, 120866. [Google Scholar] [CrossRef]
  46. You, J.; Ampomah, W.; Sun, Q. Co-optimizing water-alternating-carbon dioxide injection projects using a machine learning assisted computational framework. Appl. Energy 2020, 279, 115695. [Google Scholar] [CrossRef]
  47. Nygård, J.I.; Andersen, P.Ø. Simulation of Immiscible Water-Alternating-Gas Injection in a Stratified Reservoir: Performance Characterization Using a New Dimensionless Number. SPE J. 2020, 25, 1711–1728. [Google Scholar] [CrossRef]
  48. Ripley, B.D. Pattern Recognition and Neural Networks; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
  49. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Pearson Education: Upper Saddle River, NJ, USA, 2010. [Google Scholar]
  50. Fuks, O.; Tchelepi, H.A. Limitations Of Physics Informed Machine Learning For Nonlinear Two-Phase Transport In Porous Media. J. Mach. Learn. Model. Comput. 2020, 1, 19–37. [Google Scholar] [CrossRef]
  51. Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
  52. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the IC-NN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar]
  53. Poli, R.; Kennedy, J.; Blackwell, T. Particle swarm optimization. Swarm Intell. 2007, 1, 33–57. [Google Scholar] [CrossRef]
  54. Rashedi, E.; Nezamabadi-Pour, H.; Saryazdi, S. GSA: A gravitational search algorithm. Inf. Sci. 2009, 179, 2232–2248. [Google Scholar] [CrossRef]
  55. Mirjalili, S. Genetic algorithm. In Evolutionary Algorithms and Neural Networks; Springer: Cham, Switzerland, 2019; pp. 43–45. [Google Scholar]
  56. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
  57. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models part I—A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  58. Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
  59. Székely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and testing dependence by correlation of distances. Ann. Stat. 2007, 35, 2769–2794. [Google Scholar] [CrossRef]
Figure 2. Workflow demonstrating the development, assessment and application of the models.
Figure 2. Workflow demonstrating the development, assessment and application of the models.
Energies 15 00656 g002
Figure 3. Datapoints plotted against corresponding values of x 0 for MOD1, defined using a third-order polynomial (blue line) of x 0 .
Figure 3. Datapoints plotted against corresponding values of x 0 for MOD1, defined using a third-order polynomial (blue line) of x 0 .
Energies 15 00656 g003
Figure 4. Comparison of estimated RF with MOD1 and actual datapoints (a) and a histogram of the residuals (b).
Figure 4. Comparison of estimated RF with MOD1 and actual datapoints (a) and a histogram of the residuals (b).
Energies 15 00656 g004
Figure 5. Illustration of optimizer performance in terms of the best solution’s R2 (a), RMSE (b) and search parameter values l o g ( γ ) (c) and l o g ( σ ) (d), at a given iteration. In total, 20 solutions were initiated and run for 30 iterations in each case. Two identical initializations (marked 1 and 2) were run for each algorithm.
Figure 5. Illustration of optimizer performance in terms of the best solution’s R2 (a), RMSE (b) and search parameter values l o g ( γ ) (c) and l o g ( σ ) (d), at a given iteration. In total, 20 solutions were initiated and run for 30 iterations in each case. Two identical initializations (marked 1 and 2) were run for each algorithm.
Energies 15 00656 g005
Figure 6. Comparison of estimated RF and actual datapoints on (a) the training set, (b) validation set and (c) test set. Estimated points are based on MOD2 (optimized LSSVM). The orange line represents perfect match.
Figure 6. Comparison of estimated RF and actual datapoints on (a) the training set, (b) validation set and (c) test set. Estimated points are based on MOD2 (optimized LSSVM). The orange line represents perfect match.
Energies 15 00656 g006
Figure 7. Histogram of residual errors (estimated RF minus actual RF ) for the total dataset based on MOD2 (optimized LSSVM model).
Figure 7. Histogram of residual errors (estimated RF minus actual RF ) for the total dataset based on MOD2 (optimized LSSVM model).
Energies 15 00656 g007
Figure 8. Histogram of partial derivatives for the total dataset based on MOD2 (the optimized LSSVM model). Each partial derivative is evaluated numerically with a small or large difference Δ x .
Figure 8. Histogram of partial derivatives for the total dataset based on MOD2 (the optimized LSSVM model). Each partial derivative is evaluated numerically with a small or large difference Δ x .
Energies 15 00656 g008
Figure 9. Contour plots of recovery factor RF plotted against x 1 = r w and x 5 = l o g ( M g o * ) . The latter represents variation in oil viscosity, which affects all of x 5 , x 6 , x 7 , x 8 . The four cases are for low heterogeneity and hysteresis (a), high heterogeneity and low hysteresis (b), low heterogeneity and high hysteresis (c) and high heterogeneity and hysteresis (d). See all input values in Table 5.
Figure 9. Contour plots of recovery factor RF plotted against x 1 = r w and x 5 = l o g ( M g o * ) . The latter represents variation in oil viscosity, which affects all of x 5 , x 6 , x 7 , x 8 . The four cases are for low heterogeneity and hysteresis (a), high heterogeneity and low hysteresis (b), low heterogeneity and high hysteresis (c) and high heterogeneity and hysteresis (d). See all input values in Table 5.
Energies 15 00656 g009
Figure 10. Contour plots of recovery factor RF plotted against x 2 = l o g ( F H ) and x 5 = l o g ( M g o * ) . The latter represents variation in oil viscosity, which affects all of x 5 , x 6 , x 7 , x 8 . The cases are for WAG injection with r w = 0.5 and either low (a) or high (b) hysteresis. See all input values in Table 5.
Figure 10. Contour plots of recovery factor RF plotted against x 2 = l o g ( F H ) and x 5 = l o g ( M g o * ) . The latter represents variation in oil viscosity, which affects all of x 5 , x 6 , x 7 , x 8 . The cases are for WAG injection with r w = 0.5 and either low (a) or high (b) hysteresis. See all input values in Table 5.
Energies 15 00656 g010
Figure 11. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Low-heterogeneity (a) and high-heterogeneity (b) cases are shown (see all input values in Table 6).
Figure 11. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Low-heterogeneity (a) and high-heterogeneity (b) cases are shown (see all input values in Table 6).
Energies 15 00656 g011
Figure 12. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Favorable (a) and unfavorable (b) mobility ratio cases are shown (see all input values in Table 6).
Figure 12. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Favorable (a) and unfavorable (b) mobility ratio cases are shown (see all input values in Table 6).
Energies 15 00656 g012
Figure 13. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Low- (a) and high- (b) hysteresis cases are presented (see all input values in Table 6).
Figure 13. Contour plot of recovery factor RF plotted against x 1 = r w and log gravity number with equal values of x 7 and x 8 . Low- (a) and high- (b) hysteresis cases are presented (see all input values in Table 6).
Energies 15 00656 g013
Figure 14. Contour plots of recovery factor RF as function of varying water fraction (horizontal axis) and the indicated parameter ( x 3 in (a), x 4 in (b), x 5 in (c) and x 6 in (d)) on the vertical axis while holding other parameters fixed. r w = 0 indicates gas injection and r w = 1 water injection.
Figure 14. Contour plots of recovery factor RF as function of varying water fraction (horizontal axis) and the indicated parameter ( x 3 in (a), x 4 in (b), x 5 in (c) and x 6 in (d)) on the vertical axis while holding other parameters fixed. r w = 0 indicates gas injection and r w = 1 water injection.
Energies 15 00656 g014
Figure 15. Illustration of the 3D model, where permeability and well placements are indicated.
Figure 15. Illustration of the 3D model, where permeability and well placements are indicated.
Energies 15 00656 g015
Figure 16. RF after 1.5 PV calculated for different injected WAG fractions (rw), low or high oil viscosity and low or high degree of hysteresis, calculated based on a 3D Eclipse model (a), MOD1 (b) or MOD2 (c).
Figure 16. RF after 1.5 PV calculated for different injected WAG fractions (rw), low or high oil viscosity and low or high degree of hysteresis, calculated based on a 3D Eclipse model (a), MOD1 (b) or MOD2 (c).
Energies 15 00656 g016
Table 1. Number and type of points in different datasets and models.
Table 1. Number and type of points in different datasets and models.
MOD1Single Phase CasesWAG CasesTotal
9624722568
MOD2Single phase casesWAG casesTotal
Training (70%)68 × 16 = 108817302818
Validation (15%)14 × 16 = 224371595
Testing (15%)14 × 16 = 224371595
Total96 × 16 = 153624724008
Table 2. Range of values for the total datasets used in MOD1 and MOD2 (using X = 0.5 ).
Table 2. Range of values for the total datasets used in MOD1 and MOD2 (using X = 0.5 ).
MOD1Train Val Test Tot
MinMeanMaxMinMeanMaxMinMeanMaxMinMeanMax
M * −0.21.43.4−0.11.53.2−0.11.43.2−0.21.43.4
y 0.140.490.880.200.490.840.190.500.850.140.490.88
MOD2Train Val Test Tot
MinMeanMaxMinMeanMaxMinMeanMaxMinMeanMax
x 1 00.5100.5100.4100.51
x 2 00.51.100.51.100.41.100.51.1
x 3 01.22.501.22.501.12.501.22.5
x 4 01.3301.3301.4301.33
x 5 0.11.42.40.11.42.40.11.42.40.11.42.4
x 6 0.01.42.30.01.42.30.01.32.30.01.42.3
x 7 −4.6−2.6−0.9−4.6−2.7−0.9−4.6−2.7−0.9−4.6−2.6−0.9
x 8 −7.9−3.0−0.8−7.9−3.1−0.8−7.9−3.0−0.8−7.9−3.0−0.8
y 0.140.450.880.200.440.840.190.480.850.140.450.88
Table 3. Pearson, Spearman and Distance correlation coefficients r x y evaluated for the total dataset between RF and the involved input parameters for MOD1 and MOD2.
Table 3. Pearson, Spearman and Distance correlation coefficients r x y evaluated for the total dataset between RF and the involved input parameters for MOD1 and MOD2.
MOD1 r x y P r x y S p r x y D
x 0 0.94 0.95 0.93
MOD2 x 1 x 2 x 3 x 4 x 5 x 6 x 7 x 8
WAG
cases
0.053 0.34 0.16 0.15 0.62 0.49 0.078 0.0045
r x y P X = 0.25 0.055 0.27 0.11 0.10 0.58 0.49 0.059 0.0087
X = 0.5 0.055 0.27 0.10 0.099 0.57 0.48 0.057 0.0085
X = 1 0.055 0.27 0.087 0.083 0.53 0.45 0.053 0.0079
r x y S p X = 0.5 0.045 0.25 0.095 0.095 0.53 0.47 0.056 0.017
r x y D X = 0.5 0.19 0.25 0.15 0.15 0.54 0.47 0.092 0.077
Table 4. Optimized LSSVM metaparameters using different optimizers and different initializations (marked 1 and 2) and corresponding performance metrics on the training, validation and testing datasets. The parameters used in the final model, MOD2, are indicated.
Table 4. Optimized LSSVM metaparameters using different optimizers and different initializations (marked 1 and 2) and corresponding performance metrics on the training, validation and testing datasets. The parameters used in the final model, MOD2, are indicated.
RMSE R2
Seed log ( γ ) log ( σ ) TrainValTestTrainValTest
LSSVM (preset) 0 0 0.0220 0.0202 0.0279 0.9821 0.9817 0.9691
PSO-LSSVM 1 5.6106 0.32535 0.0056 0.0088 0.0142 0.9988 0.9965 0.9920
2 5.4335 0.30305 0.0055 0.0088 0.0142 0.9989 0.9965 0.9920
GSA-LSSVM 1 6.6812 0.42982 0.0058 0.0089 0.0150 0.9988 0.9964 0.9911
2 7.0506 0.49871 0.0064 0.0090 0.0156 0.9985 0.9963 0.9904
GWO-LSSVM 1 5.6564 0.32883 0.0056 0.0088 0.0142 0.9988 0.9965 0.9919
2 5.6698 0.32230 0.0055 0.0088 0.0144 0.9989 0.9965 0.9918
GA-LSSVM 1 7.3404 0.49280 0.0059 0.0090 0.0155 0.9987 0.9963 0.9904
2 4.9708 0.25298 0.0054 0.0089 0.0140 0.9989 0.9964 0.9922
Range (opt.) ~ 2.4 ~ 0.25 0.0010 0.0002 0.0016 0.0004 0.0002 0.0018
Final (MOD2) 5.6 0.32 0.0056 0.0088 0.0143 0.9988 0.9965 0.9919
Table 5. Parameter selections for MOD2, where oil viscosity is varied and influences mobility ratios and gravity numbers. Four cases are considered according to heterogeneity and hysteresis.
Table 5. Parameter selections for MOD2, where oil viscosity is varied and influences mobility ratios and gravity numbers. Four cases are considered according to heterogeneity and hysteresis.
Low Het,
Low Hyst
High Het,
Low Hyst
Low Het,
High Hyst
High Het,
High Hyst
x10:1
x20.2510.251
x3002.52.5
x4330.50.5
x50.2:2.2
x60.2:2.2
x7−3:−1
x8−3:−1
Table 6. Parameter selections for MOD2 with cases demonstrating influence of gravity numbers according to heterogeneity, mobility ratio and hysteresis.
Table 6. Parameter selections for MOD2 with cases demonstrating influence of gravity numbers according to heterogeneity, mobility ratio and hysteresis.
Lo HetHi HetFavUnfavLo HystHyst
x10:1
x2010.80.3
x31002.5
x43330
x520.521.5
x620.521.5
x7−4:−1.5
x8−4:−1.5
Table 7. Parameter selections for MOD2 cases to check response in going from multiphase to single-phase scenarios.
Table 7. Parameter selections for MOD2 cases to check response in going from multiphase to single-phase scenarios.
Vary   x 3 = α Vary
x 4 = log C
Vary
x 5 = log M g / o *
Vary
x 6 = log M w / o *
x 1 0:1
x 2 0.8
x 3 0:2.5111
x 4 10:311
x 5 1.51.50.1:2.41.5
x 6 1.51.51.50.0:2.3
x 7 −2
x 8 −2
Table 8. Input and calculated parameters for the 3D model. ‘Low/hi’ indicates degree of hysteresis in parameters x3, x4, while for parameters x5 to x8, values are calculated from two oil viscosities.
Table 8. Input and calculated parameters for the 3D model. ‘Low/hi’ indicates degree of hysteresis in parameters x3, x4, while for parameters x5 to x8, values are calculated from two oil viscosities.
Layer K x   [ m D ] ϕ   [ ] h   [ m ] x 1 0, 0.33, 0.5, 0.67, 1
121700.32442 x 2 0.73
265.90.29729 x 3 ( l o w , h i ) 0, 2.5
35890.32333 x 4 ( l o w , h i ) 3, 0
x 5 ( 30 , 110 c P ) 1.83, 2.40
Δ ρ w o 250 kg/m3 V p 3.70 × 10 7 m3 x 6 ( 30 , 110 c P ) 1.50, 2.06
Δ ρ g o 450 kg/m3 T 20 years x 7 ( 30 , 110 c P ) −1.28, −1.84
L 1500 m s o i 0.84 x 8 ( 30 , 110 c P ) −0.96, −1.52
W 750 m Q 7600 m3/d
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Andersen, P.Ø.; Nygård, J.I.; Kengessova, A. Prediction of Oil Recovery Factor in Stratified Reservoirs after Immiscible Water-Alternating Gas Injection Based on PSO-, GSA-, GWO-, and GA-LSSVM. Energies 2022, 15, 656. https://doi.org/10.3390/en15020656

AMA Style

Andersen PØ, Nygård JI, Kengessova A. Prediction of Oil Recovery Factor in Stratified Reservoirs after Immiscible Water-Alternating Gas Injection Based on PSO-, GSA-, GWO-, and GA-LSSVM. Energies. 2022; 15(2):656. https://doi.org/10.3390/en15020656

Chicago/Turabian Style

Andersen, Pål Østebø, Jan Inge Nygård, and Aizhan Kengessova. 2022. "Prediction of Oil Recovery Factor in Stratified Reservoirs after Immiscible Water-Alternating Gas Injection Based on PSO-, GSA-, GWO-, and GA-LSSVM" Energies 15, no. 2: 656. https://doi.org/10.3390/en15020656

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop