Identifying Different Components of Oil and Gas Shale from Low-Field NMR Two-Dimensional Spectra Based on Deep Learning

Jia, Zijian; Liang, Can; Zeng, Chunlin; Chen, Rui

doi:10.3390/magnetochemistry10100070

¹ Key Laboratory of Shale Gas Exploration, Ministry of Natural Resources, Chongqing Institute of Geology and Mineral Resources, Chongqing 401120, China

² School of Health Science and Engineering, University of Shanghai for Science and Technology, 516 Jungong Road, Shanghai 200093, China

³ School of Civil and Architecture Engineering, Changzhou Institute of Technology, 666 Liaohe Road, Changzhou 213032, China

⁴ National Key Laboratory of Petroleum Resources and Engineering, China University of Petroleum (Beijing), Beijing 102249, China

* Author to whom correspondence should be addressed.

Magnetochemistry 2024, 10(10), 70; https://doi.org/10.3390/magnetochemistry10100070

Submission received: 9 August 2024 / Revised: 22 September 2024 / Accepted: 24 September 2024 / Published: 27 September 2024

Abstract

The detection and quantitative analysis of shale components are of great significance for understanding the properties of shale, assessing its resource potential and promoting efficient development and utilization of resources. The low-field NMR T₁-T₂ two-dimensional spectrum can detect shale components non-destructively and effectively. Unfortunately, because of their complexity, low-field NMR two-dimensional spectra are mainly analyzed manually and qualitatively, so accurate compositional results cannot be obtained. Since the information in a two-dimensional map is determined by both the morphological texture of a signal and its position in the map, commonly used image analysis networks are not well suited to the task. To solve these problems, this paper proposes an improved Faster Region-based Convolutional Neural Network (Faster-RCNN). Compared with previous models, the improved Faster-RCNN has better image classification and visual key point estimation capabilities. The results show that, compared with traditional methods, the deep learning method using this model can directly obtain key information such as the kerogen and movable oil and gas content of rocks. The information provided in this study can help complement and improve analytical methods for low-field 2D NMR spectra.

Keywords:

convolutional neural network; oil shale; NMR technology; T₁-T₂ maps; deep learning; Faster-RCNN

1. Introduction

With the growth in global demand for oil and gas resources, shale gas has attracted attention due to its huge resource potential. As an unconventional resource, it has the characteristics of large gas-bearing area, long production cycle and stable output, and has become a hot spot in global exploration and development [1].

Unlike conventional sandstone reservoirs, for shale gas systems, shale is both the source rock and reservoir. Therefore, finding effective reservoirs has become the core of shale gas exploration, and precise description and characterization of shale reservoirs are even more necessary [2]. The porosity and permeability of the shale gas matrix are very low. According to their origin, the pores can be divided into nanoscale organic pores, mineral intragranular pores and intergranular pores, and micron-scale micro-fractures. Organic matter pores are generated during the hydrocarbon generation process of organic matter, and the number of pores is directly related to the maturity of organic matter [3]. The occurrence modes of shale gas include free gas, adsorbed gas and dissolved gas. Free gas mainly exists in pore spaces such as inter/intra-particle pores and micro-cracks. A large amount of shale gas adheres to the surface of kerogen and clay particles in an adsorbed state, and a very small amount is dispersed in kerogen, asphalt and water in a dissolved state [4].

The measurement and characterization of shale organic matter is usually achieved using geochemical experimental methods. Organic matter content is expressed as total organic carbon (TOC), and laboratory measurements include carbon and sulfur determination, combustion, pyrolysis gas chromatography and chloroform bitumen “A” determination. The maturity of organic matter can be characterized by parameters such as rock pyrolysis parameters, vitrinite reflectance, chemical composition characteristics of soluble extracts, kerogen free radical content and time-temperature index. Geochemical measurements are limited by discontinuous data, sample heterogeneity and a heavy analytical laboratory workload [5].

Scanning electron microscopy (SEM) [6] and other imaging techniques can reveal the pore structure, minerals and contact characteristics in shale, provide high-resolution images and support qualitative analysis, from which pore structure parameters can be derived. Currently, field emission scanning electron microscopy (FE-SEM) [7] and focused ion beam scanning electron microscopy (FIB-SEM) [8] are the most commonly used tools for studying the nanoscale pore structure of shale. Their advantage is that they visually characterize pore morphology and support analysis of pore origin, but because of limits on resolution and sample scale, they cannot meet the needs of large-area observation. On the basis of direct observation, many scholars combine digital rock technology to reconstruct shale organic matter and pore structure in three dimensions and extract pore parameters such as porosity, connectivity and pore throat radius [9]. In addition, indirect testing methods include CO₂ adsorption [10], N₂ adsorption [11] and high-pressure mercury injection (MIP) [12]. Non-wetting fluids such as gas or mercury are injected into samples at different pressures in adsorption–desorption or capillary pressure experiments to obtain the pore size distribution, specific surface area and pore volume of the rock. These methods require destroying the rock sample and injecting test fluids, and they can only detect open, connected pores. Because the testing principles differ, the observed pore size ranges also differ. Nuclear magnetic resonance (NMR) technology has become a powerful tool for the identification and evaluation of shale organic matter and organic pores due to its non-destructive and efficient characteristics, as well as its dual detection functions in the laboratory and downhole [13].

NMR can provide reservoir parameters such as porosity, permeability and irreducible water saturation, and reveal the distribution of fluids [14]. It is one of the most important means of evaluating complex oil and gas reservoirs, and its application in conventional reservoirs is mature and effective. In shale gas reservoirs, however, it is affected by factors such as complex mineral composition, special pore structure, rich organic matter, ultra-low permeability and nanoscale pores, and it faces challenges such as low detection resolution, low signal-to-noise ratio and inapplicable interpretation models [15,16]. After many years of development of NMR equipment, the minimum echo interval of NMR logging tools has reached 0.2 ms, and that of laboratory desktop NMR core analyzers (Magritek, Oxford Instruments and other companies) has reached 0.06 ms, which allows T₂ values as short as 0.01 ms to be detected in cores. Thus, the fluid signal in the nanoscale pores of shale can be obtained.

As a result, domestic and foreign scholars have carried out extensive experimental research on the NMR response characteristics of shale and, on this basis, have gained a deeper and updated understanding of the NMR relaxation mechanism of shale. In the early stages of shale research, researchers compared NMR T₂ spectra with pore sizes measured by mercury intrusion, adsorption, etc., and divided the pores into small pores and large pores based on the T₂ relaxation time. They believed that the short-relaxation components represented small pores, mainly organic pores (Kausik, Cao, Tinni, Richard, Sigal) [17,18,19,20,21], and that the organic pore signal is positively correlated with the total organic carbon content (Chen, Kausik, Habina) [22,23,24]. Converting the T₂ spectrum into a pore size distribution gives an organic pore surface relaxivity of about 40–50 μm/s. Washburn (2014) [25] calculated the surface relaxivity from shale micromineral analysis and paramagnetic substance content, and found that the surface relaxivity of shale organic pores should not be so high. Therefore, industry experts have questioned the pore size distributions converted from NMR T₂ spectra in organic-rich shale. Rylander (2013) [26], Cao (2013) [27] and others tried to explain this from the perspective of organic matter wettability, but found that even the oil-wet character of organic matter could not fully explain the high surface relaxivity of organic pores. R. Kausik (2017) [28], Korb et al. (2018) [29] and others used high-field NMR measurement methods to study shale characteristics and isolate kerogen and asphalt signals.

With the improvement of instrument measurement accuracy and the increase in research on the NMR properties of organic matter itself, researchers have found that the short-relaxation part of the T₂ spectrum contains signals generated by the shale organic matter itself, and that surface relaxation in organic pores differs from surface relaxation in inorganic pores, which is caused by paramagnetic substances. This difference is the reason why the capillary pressure curve or adsorption curve cannot be matched to the NMR T₂ spectrum. Washburn (2013) [30], Hugh Daigle (2014) [31] and others conducted theoretical research on the surface relaxation of organic pores and concluded that it depends on homonuclear dipolar coupling between the hydrogen protons of the fluid and those of the pore surface. This coupling mechanism differs from the relaxation mechanism in inorganic pores (inter-granular and intra-granular pores), so it is difficult to detect this part of the relaxation information with conventional measurement methods (CPMG, inversion recovery or saturation recovery).

In recent years, the application of two-dimensional NMR has become increasingly widespread. Two-dimensional T₁-T₂ NMR is also starting to be used in the medical field for non-invasive detection of diabetes and other conditions [32]. Washburn and Birdwell (2013) [33] tried to introduce solid-state NMR methods into shale measurement to enhance the relaxation signal of kerogen and other organic matter, analyze the relaxation mechanisms of the organic pore surface and of the organic matter itself, and confirm the existence of homonuclear dipolar coupling in shale organic matter and organic pores. These results updated the understanding of the NMR relaxation mechanism of shale. Since 2014, Xiao Lizhi, Jia Zijian, et al. [34,35] of China University of Petroleum (Beijing) have carried out basic theoretical and experimental research on shale NMR theory and technology, confirming the existence of a special surface relaxation mechanism of shale organic matter. Song Yiqiao of Harvard University (2019) [36] systematically introduced the advantages and future development trends of NMR technology in shale oil and gas. Fleury (2016) [37], R. Kausik (2019) [38], Tan Maojin (2015~2020) [39,40,41,42,43] and other experts and scholars have used a large number of shale relaxation experiments to try to distinguish different components of shale, including organic matter, gas and water, with NMR T₁-T₂ distributions, thereby obtaining typical NMR response values of the different components. However, the different components of shale are affected by other factors such as fluid saturation state, organic matter maturity and porosity, so their exact positions and shapes on the T₁-T₂ map vary considerably, and such response characteristic maps cannot be applied directly to shale exploration [44]. Taking the maturity of kerogen as an example, as kerogen matures it loses aliphatic chains (the part with high hydrogen content), aromatic rings form clusters, the solid organic matter becomes harder and its T₁ relaxation time becomes longer, while its T₂ relaxation time remains almost constant. Therefore, as the maturity of kerogen increases, the T₁/T₂ ratio gradually increases, and NMR can indirectly reveal the maturity stage of organic matter [45].

In recent years, with the rise of artificial intelligence, and especially the advent of big data and deep learning, machine learning has become a main route to practical artificial intelligence: algorithms are designed so that computers learn patterns from large amounts of data. It has shown outstanding performance in data mining and image recognition [46,47,48,49,50,51]. Tamoto [52] discusses the use of supervised machine learning models to predict nuclear magnetic resonance porosity well logs in a carbonate reservoir. The two-dimensional NMR image contains information on many components such as organic matter, organic pores, oil, gas and water, each with a different NMR response mechanism and a different response. As the organic matter maturity, organic matter content and other component information of the shale change, the NMR spectrum may change dramatically [53]. Manual identification is time-consuming, computationally heavy and of low accuracy. Machine learning methods can improve the efficiency of NMR spectrum identification and data processing. By using geochemical results and NMR results to establish a convolutional neural network model, with appropriate expert intervention, the accuracy of interpretation and analysis of shale components can be improved. Therefore, applying machine learning to two-dimensional NMR spectra should enable quantitative identification of specific shale components. To our knowledge, this is the first time that AI has been applied to the analysis of two-dimensional NMR results of rocks.

2. Theory

2.1. NMR Theory

The T₂ relaxation is widely used to assess porosity, pore size distributions, fluid content and wettability in both core analysis and borehole evaluation. The T₂ distribution is acquired using the magnetization decay curves and Laplace inversion algorithm.

However, for shale, due to the complexity of its composition, using a surface relaxation model based on paramagnetic impurities is problematic. Current low-field NMR technology can detect hydrogen signals in organic components within the core. Therefore, the NMR response in this case is influenced not only by pore fluids but also by the core matrix. Organic matter directly affects the NMR signal in two ways: firstly, it contains hydrogen atoms that can be directly measured, and secondly, it indirectly affects the NMR signal by influencing the relaxation time of the fluid in contact with the pore surface in organic pores. In both cases, the impact of organic matter depends on its maturity. Considering the direct interaction between atoms, maturity is proportional to the mobility of hydrogen atoms. Therefore, organic matter characteristics from gas, oil, or immature organic matter samples will be different. Unlike conventional pores, clay content and distribution in shale reservoirs add another aspect of difficulty to fluid identification. Similarly, clay minerals contain hydrogen atoms, and their NMR response affects the response of attached fluids in many different ways.

The theoretical model established by Bloembergen in 1961 on the relationship between NMR relaxation time and molecular Brownian motion provides a functional relationship between the relaxation rate and the rotational correlation time τ_c of the molecule [54].

\begin{array}{l}
\frac{1}{T_{1}} = \frac{3}{10} \frac{\gamma^{4} \hbar^{2}}{b^{6}} \left[ \frac{\tau_{c}}{1 + \omega_{0}^{2} \tau_{c}^{2}} + \frac{4 \tau_{c}}{1 + 4 \omega_{0}^{2} \tau_{c}^{2}} \right] \\
\frac{1}{T_{2}} = \frac{3}{20} \frac{\gamma^{4} \hbar^{2}}{b^{6}} \left[ 3 \tau_{c} + \frac{5 \tau_{c}}{1 + \omega_{0}^{2} \tau_{c}^{2}} + \frac{2 \tau_{c}}{1 + 4 \omega_{0}^{2} \tau_{c}^{2}} \right]
\end{array}

(1)

where γ is the gyromagnetic ratio, ℏ is the reduced Planck’s constant, ω₀ = 2πf, f is the Larmor frequency of ¹H and b is the distance between two adjacent ¹H atoms in the same molecule. The relationship between the relaxation times and τ_c, as calculated using Equation (1), is illustrated in Figure 1. According to the figure, as τ_c increases, T₁ initially decreases and then increases, while T₂ decreases continuously. When τ_cω₀ << 1, T₁/T₂ ≈ 1, which corresponds to the characteristics of most light fluids. When τ_cω₀ is close to 1 (about 0.62 for Equation (1)), T₁ reaches its minimum value. When τ_cω₀ ≥ 1, T₁/T₂ > 1, and the ratio increases with increasing τ_c. This generally corresponds to the region where solid and semi-solid organic materials are located.
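The limiting behavior described for Equation (1) can be checked numerically. The sketch below is a minimal illustration and not the authors' code: the prefactor γ⁴ℏ²/b⁶ is replaced by a hypothetical constant `k` (it cancels in T₁/T₂ and does not shift the T₁ minimum), and the 23 MHz Larmor frequency is the instrument frequency reported in Section 2.2.

```python
import math

OMEGA0 = 2 * math.pi * 23e6  # rad/s, from the 23 MHz instrument frequency


def bpp_rates(tau_c, omega0, k=1.0):
    """Return (1/T1, 1/T2) from Equation (1).

    k is a placeholder for gamma^4 * hbar^2 / b^6 (assumed constant here).
    """
    x2 = (omega0 * tau_c) ** 2
    r1 = 0.3 * k * (tau_c / (1 + x2) + 4 * tau_c / (1 + 4 * x2))
    r2 = 0.15 * k * (3 * tau_c + 5 * tau_c / (1 + x2) + 2 * tau_c / (1 + 4 * x2))
    return r1, r2


def t1_t2_ratio(tau_c, omega0):
    """T1/T2 equals (1/T2) / (1/T1); the prefactor k cancels."""
    r1, r2 = bpp_rates(tau_c, omega0)
    return r2 / r1


def t1_minimum_x(omega0=OMEGA0, n=4000):
    """Scan omega0 * tau_c on a log grid (1e-2 to 1e2) and return the
    value at which 1/T1 is largest, i.e. where T1 is smallest."""
    best_x, best_r1 = None, -1.0
    for i in range(n + 1):
        x = 10 ** (-2 + 4 * i / n)
        r1, _ = bpp_rates(x / omega0, omega0)
        if r1 > best_r1:
            best_x, best_r1 = x, r1
    return best_x


if __name__ == "__main__":
    for x in (0.01, 1.0, 10.0):
        print(f"omega0*tau_c = {x:g}: T1/T2 = {t1_t2_ratio(x / OMEGA0, OMEGA0):.3f}")
    print(f"T1 minimum near omega0*tau_c = {t1_minimum_x():.3f}")
```

Because only dimensionless quantities are reported, the sketch avoids committing to a unit system or to a specific inter-proton distance b.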

Washburn [25,30], Fleury [37], R. Kausik [38], and other experts and scholars have conducted numerous shale relaxation experiments, attempting to use NMR T₁-T₂ distribution to differentiate shale components such as organic matter, gas and water (Figure 2). Their goal is to obtain typical NMR response values for different components. However, due to the influence of factors such as fluid saturation state, organic matter maturity and porosity on the different shale components, the exact positions and shapes on the T₁-T₂ map vary significantly. Therefore, these response characteristic maps cannot be directly and simply applied for shale interpretation.

The different protons in the T₁-T₂ map can be associated with the following sources:

Hydroxyl: These are typically considered to be part of the OH groups in the clay structure or the edges of clay platelets. This signal is always at the resolution limit, below 0.1 ms, and can only be detected with high-precision NMR instruments.

Kerogen: Depending on the maturity, these can overlap with hydroxyl. They are best detected in dry samples as their hydrogen index is relatively low compared to water.

Water: This signal is typically located on or near the T₁ = T₂ line, even for very small pore sizes, such as the interlayer spaces in clay.

2.2. Image Acquisition

The shale material in this experiment comes from Fuling and Liaohe, China. ¹H-NMR spectroscopy was performed using instruments produced by China Numax Technology. The frequency was 23 MHz and the magnet temperature was 32 °C. The NMR T₁-T₂ spectrum was obtained using inversion recovery Carr–Purcell–Meiboom–Gill (IR–CPMG) sequence. Echo time TE was 0.06 ms. The repetition time was 1 s. The number of echoes was 1000.

Shale NMR T₁-T₂ two-dimensional planar spectroscopy is a technique used to analyze the characteristics of shale oil and natural gas reservoirs. Such 2D spectrograms combine information from NMR T₁ relaxation times and T₂ relaxation times to provide detailed insights into reservoir pore structure and fluid type. In the T₁-T₂ two-dimensional plane spectrum, the horizontal axis represents the T₂ relaxation time, and the vertical axis represents the T₁ relaxation time. Each data point represents a measurement sample and its location corresponds to the T₁ and T₂ values of that sample. Representing signal strength or frequency through changes in color or grayscale can reveal relationships between different rock components and fluid types. The two-dimensional spectrum of 3 samples is shown in Figure 3 and Figure 4. During the training process, we added more than one hundred samples of data. By conducting T₁-T₂ experiments and complementary geochemical analyses on core samples in different states (dried, centrifuged, oil-saturated, water-saturated, etc.), the signal properties and their occurrence states can be determined. By comparing the signal changes under different states, we determined what components of the shale the signal represents.

We collected 62 shale samples from Fuling and Liaohe as our dataset, which was divided into training, validation and test sets. The division was performed using a random allocation method. For one hundred data points, we randomly selected 50% of the 2D spectra as the training set, 20% as the validation set and 30% as the test set.
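A random 50/20/30 allocation of this kind can be sketched as follows. This is an illustrative example only: the spectrum identifiers, the fixed seed and the function name `split_dataset` are assumptions, not details reported in the paper.

```python
import random


def split_dataset(items, fractions=(0.5, 0.2, 0.3), seed=0):
    """Shuffle items and split them into training, validation and test sets."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # fixed seed for a repeatable split
    n_train = round(fractions[0] * len(shuffled))
    n_val = round(fractions[1] * len(shuffled))
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # the remainder goes to the test set
    return train, val, test


# Hypothetical identifiers for one hundred 2D spectra.
spectra = [f"spectrum_{i:03d}" for i in range(100)]
train, val, test = split_dataset(spectra)
print(len(train), len(val), len(test))
```

Fixing the seed keeps the three subsets disjoint and reproducible between runs, which matters when results from different training rounds are compared.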

2.3. Network Structure

Deep learning is a subfield within machine learning that focuses on simulating the workings of the human brain using multi-layered neural networks. This approach allows computers to learn based on large amounts of data, automatically discovering useful features and patterns in the data.

Convolutional neural networks are a common form of deep learning. They consist of multiple layers; each layer contains several neurons. Neurons are connected through weights and undergo nonlinear transformation through activation functions. The features learned in each layer are abstract representations of the features in the previous layer. By stacking multiple layers, neural networks can learn increasingly complex features to solve complex problems. Deep learning models have been widely used in many fields, and models in the field of target detection and recognition have also become very mature. Among many models, we chose to use the Faster-RCNN model as a method for identifying NMR T₁-T₂ two-dimensional spectra. There are several reasons for choosing this model.

For data recognition of NMR two-dimensional spectra, both the shape and position of the signals need to be considered. Conventional image recognition methods are not suitable for this. However, Faster-RCNN can address these issues. The principle of this model for object detection is as follows: an image is input and processed through a backbone feature extraction network (DeepConvNet in the diagram), which is a convolutional model, to obtain convolutional feature maps. These feature maps are then processed by the region proposal network (RPN) to generate proposal box regions (Figure 5). These proposal boxes are scaled to a fixed size and then fed into two fully connected layers.

Faster-RCNN, as a representative of the two-stage network structure model, offers higher detection accuracy than classic detection algorithms. Its technology is relatively mature and stable, and it surpasses previous models in image classification and visual key point estimation capabilities.

The network structure of Faster-RCNN includes the following key parts. (1) Convolutional layers: These are used to extract features of the input image. Pretrained convolutional neural networks (such as Visual Geometry Group, Residual Network, etc.) are usually used as feature extractors. (2) Region proposal network (RPN): This is responsible for generating candidate target region proposals. RPN slides a small window at each position and outputs multiple bounding boxes and their corresponding probabilities for each window as candidate target areas. (3) ROI pooling layer: This maps candidate areas of different sizes to fixed-size feature maps for subsequent classification and regression operations. (4) Classification network: This classifies the extracted candidate areas and determines the target category to which they belong. (5) Bounding box regression network: This fine-tunes the bounding box of the candidate area to more accurately frame the location of the target. As shown in Figure 5, the overall workflow is as follows: first, extract the features of the input image through the convolution layer; then, RPN generates a candidate target area frame; next, the ROI pooling layer maps the candidate area to a fixed-size feature map; then, the classification network classifies each candidate area into target categories; finally, the bounding box regression network fine-tunes the position of each candidate area to obtain the final target detection result.
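To make step (3) concrete, the following pure-Python sketch shows how ROI max pooling can map candidate regions of different sizes onto a fixed-size grid. It is a simplified illustration under stated assumptions (integer region coordinates in feature-map cells, a single channel, a hypothetical function name `roi_max_pool`), not the implementation used in the paper.

```python
def roi_max_pool(feature_map, roi, out_h=2, out_w=2):
    """Max-pool the region roi = (x0, y0, x1, y1) into an out_h x out_w grid.

    Coordinates are in feature-map cells; x1 and y1 are exclusive.
    The region must lie inside the map and be at least one cell wide and high.
    """
    x0, y0, x1, y1 = roi
    h, w = y1 - y0, x1 - x0
    pooled = []
    for i in range(out_h):
        # Row bin: floor of the lower edge, ceiling of the upper edge.
        r_start = y0 + (i * h) // out_h
        r_end = y0 + ((i + 1) * h + out_h - 1) // out_h
        row = []
        for j in range(out_w):
            c_start = x0 + (j * w) // out_w
            c_end = x0 + ((j + 1) * w + out_w - 1) // out_w
            row.append(max(feature_map[r][c]
                           for r in range(r_start, r_end)
                           for c in range(c_start, c_end)))
        pooled.append(row)
    return pooled


# Toy single-channel feature map (4 rows x 6 columns).
fm = [
    [1, 2, 3, 4, 5, 6],
    [7, 8, 9, 10, 11, 12],
    [13, 14, 15, 16, 17, 18],
    [19, 20, 21, 22, 23, 24],
]
small = roi_max_pool(fm, (0, 0, 4, 4))  # 4 x 4 region
large = roi_max_pool(fm, (1, 1, 6, 4))  # 5 x 3 region
print(small, large)
```

Both regions produce a 2 × 2 output despite their different sizes, which is what allows the subsequent fully connected layers to accept any proposal.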

2.4. Algorithm Process

The deep learning method of convolutional neural networks is used to analyze and model organic matter-weighted and organic pore-weighted T₁-T₂ maps. This approach enables the rapid acquisition of information such as organic matter maturity, organic porosity, fluid components and shale content, facilitating identification and quantitative calculations. Initially, the shale T₁-T₂ distribution maps are obtained as inputs for the training data. The data are preprocessed according to the characteristics of the network and the input data. Manually labeled organic pore-weighted T₁-T₂ distributions, organic matter-weighted T₁-T₂ distributions and signal areas of kerogen, oil, water, hydroxyl, etc., are used as samples and data labels. The Faster-RCNN model is then trained using the prepared dataset. The accuracy is assessed to determine if it meets the requirements; if not, the error is calculated and the weights in the Faster-RCNN model are updated, with training continuing until the accuracy meets the requirements. This results in a network model capable of predicting components and content based on different T₁-T₂ distribution maps. Finally, the trained convolutional network is utilized to apply the T₁-T₂ maps for identifying shale components. The technical route of this technology module is illustrated in Figure 6.

3. Experimental Results and Analysis

In this section, we will analyze the parameters of the training results from the 1st, 5th, 10th and 15th iterations. The training graphs are labeled with sections such as kerogen, adsorbed oil, free water, adsorbed water, hydroxyl substances and other objects. The following sections will provide explanations for some of the resulting parameters and demonstrate the impact of the model on these outputs.

3.1. Curve

3.1.1. P_Curve

A P_curve is a graph of the relationship between precision and confidence, which shows the precision of each category when the confidence threshold is set to a certain value. In general, the higher the confidence threshold, the higher the precision. The confidence values at which the precision reaches 1 for the 1st, 5th, 8th, 10th and 15th training are shown in Table 1. From Table 1, the confidence value for the 8th training is the highest.

3.1.2. R_Curve

An R_curve is a graph of the relationship between recall and confidence, which shows the recall of each category when the confidence threshold is set to a certain value. In general, the lower the confidence threshold, the more complete the category detection. The recall values at a confidence of 0 for the 1st, 5th, 8th, 10th and 15th training are shown in Table 2. The recall of the 10th training is the highest, so its performance in detecting all targets is the best.

3.1.3. F1_Curve

The F1_curve shows how the F1_score changes with confidence. The F1_score is an indicator for evaluating classification performance: it is the harmonic mean of precision and recall, ranges from 0 to 1 and thus takes into account both the precision and the recall of a specific category. The closer the F1_score is to 1, the better the classification. The confidence values corresponding to the best F1 scores for the 1st, 5th, 8th, 10th and 15th training are shown in Table 3. It can be seen from Table 3 that the F1 score obtained after the 10th training is optimal, at a confidence of 0.816.
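The three curves above can be reproduced from a list of scored detections. The sketch below is a minimal illustration: the detection list is made up for the example, and the function names are hypothetical rather than taken from the authors' code.

```python
def prf_at_threshold(detections, n_ground_truth, threshold):
    """Return (precision, recall, F1) when only detections with
    confidence >= threshold are kept.

    detections: list of (confidence, is_correct) pairs.
    """
    kept = [ok for conf, ok in detections if conf >= threshold]
    tp = sum(kept)              # correct detections
    fp = len(kept) - tp         # wrong detections
    fn = n_ground_truth - tp    # missed targets
    precision = tp / (tp + fp) if kept else 1.0  # convention: no detections
    recall = tp / (tp + fn) if n_ground_truth else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def best_f1_threshold(detections, n_ground_truth, thresholds):
    """Return the confidence threshold with the highest F1 score."""
    return max(thresholds,
               key=lambda t: prf_at_threshold(detections, n_ground_truth, t)[2])


# Made-up detections: (confidence, matches a labeled component?)
dets = [(0.95, True), (0.90, True), (0.80, False), (0.70, True), (0.40, False)]
grid = [0.0, 0.5, 0.75, 0.85, 0.92]
for t in grid:
    p, r, f1 = prf_at_threshold(dets, 4, t)
    print(f"conf >= {t:.2f}: P = {p:.2f}, R = {r:.2f}, F1 = {f1:.2f}")
print("best threshold:", best_f1_threshold(dets, 4, grid))
```

Raising the threshold trades recall for precision, so the F1 optimum usually lies at an intermediate confidence rather than at either end of the curve.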

3.2. Training Result Indicators

Faster-RCNN is a two-stage classification and regression prediction model, and its loss function consists of two parts: the region proposal network (RPN) loss and the bounding box regression loss. The total loss is the sum of these two. During training, both losses are back-propagated to update the model parameters, so that the model learns to generate accurate region proposals and improve its bounding box predictions, giving more precise detection results.

In addition, during the model training process, several metrics for object detection are calculated to evaluate the effectiveness of the model. A smaller mean value of the object detection loss function indicates a stronger object detection capability. A smaller mean value of the classification loss function indicates more accurate classification ability. Precision represents the proportion of true positive predictions among the predicted positive samples, while recall represents the probability of the model predicting a positive sample correctly among the actual positive samples. During the model training process, it is also important to monitor the performance of the validation set, including the validation set’s bounding box loss (Val box), mean object detection loss function (Val objectness) and mean classification loss function (Val classification).

The mAP@0.5 is the mean average precision (mAP) at an Intersection over Union (IoU) threshold of 0.5, where the average precision of a category is the area under its precision-recall curve. The m denotes the mean over categories, and the number after @ is the IoU threshold used to decide whether a prediction counts as a positive or a negative sample. mAP@0.5:0.95 (mAP@[0.5:0.95]) is the average mAP over IoU thresholds from 0.5 to 0.95 in steps of 0.05. The larger the values of mAP@0.5 and mAP@0.5:0.95, the better. The values of the various result indicators of Faster-RCNN for the 1st, 5th, 8th, 10th and 15th training are shown in Table 4. It can be seen from Table 4 that the indicators for shale detection are generally best at the 10th training, except for mAP@0.5:0.95, for which the 15th training gives the best value. In Table 4, TN represents the number of training times; Box represents the bounding box regression loss during training; O represents the mean value of the target detection loss function; C represents the mean value of the classification loss function; P represents precision; R represents recall; VB represents the bounding box loss of the validation set; VO represents the mean target detection loss function of the validation set and VC represents the mean classification loss function of the validation set.
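The IoU and average-precision definitions can be illustrated with a short sketch. It assumes axis-aligned boxes given as (x0, y0, x1, y1), a single category in a single image and greedy matching of predictions to labels; a full mAP would additionally average over categories. The boxes and function names are hypothetical.

```python
def iou(a, b):
    """Intersection over Union of two boxes (x0, y0, x1, y1)."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(preds, truths, iou_thr):
    """Area under the precision-recall curve for one category.

    preds: list of (confidence, box); truths: list of labeled boxes.
    """
    if not truths:
        return 0.0
    matched, flags = set(), []
    for _, box in sorted(preds, key=lambda p: -p[0]):
        # Greedy matching: best still-unmatched label with IoU >= threshold.
        best_j, best_v = None, iou_thr
        for j, t in enumerate(truths):
            v = iou(box, t)
            if j not in matched and v >= best_v:
                best_j, best_v = j, v
        if best_j is not None:
            matched.add(best_j)
        flags.append(best_j is not None)
    precisions, recalls, tp = [], [], 0
    for k, flag in enumerate(flags, start=1):
        tp += flag
        precisions.append(tp / k)
        recalls.append(tp / len(truths))
    # Make precision non-increasing in recall before integrating.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])
    ap, prev_r = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_r) * p
        prev_r = r
    return ap


IOU_THRESHOLDS = [(50 + 5 * i) / 100 for i in range(10)]  # 0.50 ... 0.95


def ap_50_95(preds, truths):
    """Average of the AP values over the ten IoU thresholds."""
    return sum(average_precision(preds, truths, t) for t in IOU_THRESHOLDS) / 10


truth_boxes = [(0.0, 0.0, 10.0, 10.0)]           # hypothetical labeled region
pred_boxes = [(0.9, (0.0, 0.0, 10.0, 7.2))]      # hypothetical prediction, IoU 0.72
print(average_precision(pred_boxes, truth_boxes, 0.5), ap_50_95(pred_boxes, truth_boxes))
```

In this toy case the prediction counts as correct for thresholds up to 0.70 and as wrong above, which is why the stricter 0.5:0.95 average penalizes loosely fitting boxes that still pass at 0.5.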

3.3. Confusion Matrix

The confusion matrix is a summary of the prediction results of a classification problem. It makes it easy to see whether the model confuses two categories and shows the types of errors the classification model makes, which helps to overcome the limitations of relying solely on classification accuracy. Each column of the confusion matrix represents a predicted class, and each row represents an actual class.

In this work, the confusion matrix summarizes the predicted results of the shale component classification problem: each column represents the model’s prediction for a particular shale component class, and each row represents the actual class. The confusion matrix for the first training result is shown in Figure 7. In the figure, “Background FN” indicates that the model misclassified a category as background, and “Background FP” indicates that the model misclassified background as a category. To show the recognition and misrecognition rates of each category more intuitively, the matrix is normalized by row: the values in each row are divided by the total number of samples of the corresponding category and expressed as percentages. The blue gradient bar on the right side indicates the magnitude of these values, with darker colors representing higher values. Therefore, the more the dark blue cells concentrate on the diagonal from the top left to the bottom right, the better the model’s recognition performance. As can be seen in Figure 7, the model recognizes each shale component class very accurately; the only shortcoming is some confusion between background and other materials. Thus, after the first training, the classification and recognition ability of the model is already very good. The numbers in the upper right corner of the plot are an axis scale factor and offset: the tick values are multiplied by the factor and added to the offset, e.g., 1 × 10⁻⁷ + 0.999999.
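The row normalization described for the confusion matrix amounts to dividing each row of counts by its row total. A minimal sketch follows, using made-up counts for three classes (kerogen, free water, background) that are not taken from the paper.

```python
def normalize_rows(matrix):
    """Convert each row of counts into percentages of that row's total."""
    normalized = []
    for row in matrix:
        total = sum(row)
        normalized.append([100.0 * v / total if total else 0.0 for v in row])
    return normalized


# Hypothetical counts: rows are actual classes, columns are predicted classes.
labels = ["kerogen", "free water", "background"]
counts = [
    [18, 0, 2],
    [0, 9, 1],
    [1, 0, 0],
]
percent = normalize_rows(counts)
for name, row in zip(labels, percent):
    print(name, row)
```

After normalization the diagonal entries are the per-class recognition rates, and the off-diagonal entries show which class absorbs each error.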

The confusion matrix of the fifth training result is shown in Figure 8. Compared with the first training result, the recognition of the various substances is at its best in this round, as is the overall classification. This demonstrates the strong material classification and recognition performance of Faster-RCNN.

The confusion matrix of the 10th training result is shown in Figure 9. Figure 9 shows that the model’s ability to distinguish between background and free water in the T₁-T₂ spectrum is weakened: compared with the 5th training result, free water has a higher probability of being identified as background.

The confusion matrix of the 15th training result is shown in Figure 10. Compared with the 10th training, the model has returned to the recognition performance of the 5th training. Except for the confusion matrices from the 1st and 10th trainings, the confusion matrices of the remaining rounds are good. Taken together, these confusion matrices show that the model’s recognition and classification of shale components does not improve steadily with more training; instead, the best performance is reached at a particular round.
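
Because performance does not increase monotonically with training, the round to keep has to be chosen explicitly. The sketch below selects a round by a single scalar metric using the values reported in Table 4. The choice of metric and the key names (`mAP50`, `mAP50_95`) are assumptions made for illustration; the confusion matrices above remain an additional qualitative check.

```python
# Precision, recall and mAP values copied from Table 4,
# keyed by the number of trainings (TN).
metrics = {
    1:  {"P": 0.33, "R": 0.33, "mAP50": 0.18, "mAP50_95": 0.06},
    5:  {"P": 0.90, "R": 0.90, "mAP50": 0.86, "mAP50_95": 0.37},
    8:  {"P": 0.85, "R": 0.86, "mAP50": 0.81, "mAP50_95": 0.33},
    10: {"P": 0.91, "R": 0.92, "mAP50": 0.90, "mAP50_95": 0.34},
    15: {"P": 0.89, "R": 0.90, "mAP50": 0.86, "mAP50_95": 0.38},
}

def best_round(table, key):
    """Return the training round with the highest value of `key`."""
    return max(table, key=lambda tn: table[tn][key])

best_by_map50 = best_round(metrics, "mAP50")
best_by_map50_95 = best_round(metrics, "mAP50_95")

print("best round by mAP@0.5:", best_by_map50)
print("best round by mAP@0.5:0.95:", best_by_map50_95)
```

With these values the two metrics select different rounds, which is why a selection criterion should be fixed before the weights are compared.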

3.4. Test Result Indicators

We validated the model using data from different sample types and instruments. First, we used two-dimensional NMR data of shales collected with the same instrument as the training images for prediction (Figure 11). The predictions were all correct, and most had confidence scores above 0.95. Then, we used two-dimensional NMR spectra of shales obtained with different instruments and software for prediction (Figure 12). The predictions were still correct, and most had confidence scores above 0.8.

In Figure 11, the model successfully segmented and identified different components of the shale. The names of the components are labeled above the identification boxes, and the numbers indicate the confidence level of each identification. The test results show that the optimal weights can separate the high-peak regions from the background and that the identification of different components is effective. These components include hydroxyl groups, adsorbed water, free water and adsorbed oil. According to NMR theory, there should be no cases where T₁ < T₂; signals in the T₁ < T₂ region are likely artifacts caused by the NMR inversion. To verify the reliability and adaptability of the model, we also tested shale T₁-T₂ maps obtained from different instruments and inversion software (Figure 12). The results were favorable.
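
The T₁ < T₂ criterion and the confidence scores can be combined into a simple post-processing step. The following is a minimal sketch under stated assumptions, not part of the model described here: the `Detection` record, the use of box-center coordinates, the 0.8 threshold and all numerical values are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Detection:
    label: str
    confidence: float
    t2_ms: float  # T2 at the box center (hypothetical units: ms)
    t1_ms: float  # T1 at the box center (hypothetical units: ms)

def split_detections(detections, min_conf=0.8):
    """Separate detections into kept, likely artifacts and low confidence.

    A detection whose center lies in the T1 < T2 region is flagged as a
    likely inversion artifact regardless of its confidence score.
    """
    kept, artifacts, low_conf = [], [], []
    for det in detections:
        if det.t1_ms < det.t2_ms:
            artifacts.append(det)
        elif det.confidence < min_conf:
            low_conf.append(det)
        else:
            kept.append(det)
    return kept, artifacts, low_conf

# Hypothetical detections (illustrative values only).
detections = [
    Detection("adsorbed water", 0.97, t2_ms=0.3, t1_ms=1.0),
    Detection("free water", 0.85, t2_ms=10.0, t1_ms=20.0),
    Detection("adsorbed oil", 0.62, t2_ms=1.0, t1_ms=30.0),
    Detection("free water", 0.91, t2_ms=5.0, t1_ms=2.0),  # T1 < T2
]
kept, artifacts, low_conf = split_detections(detections)
print([d.label for d in kept])
```

Checking the T₁ < T₂ condition before the confidence threshold means a confidently detected box in the non-physical region is still reported as an artifact rather than as a component.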

However, the results show that when recognizing 2D spectra produced by different software, the identification rate of the components in the spectra did not exceed 90% because of slight differences in color labeling. In addition, because rock structure and composition vary among shales from different regions and formations, a large number of samples from various areas is needed for rock physics and geochemical experiments, whose results can then be used as a training dataset.

4. Discussion and Conclusions

We propose a deep learning method for predicting shale composition from NMR spectra. Compared with traditional techniques, deep learning methods offer high throughput and accurate predictions, and they do not require operators to have accumulated expert knowledge.

In the test with the optimal weights, the prediction accuracy for kerogen, structured water, free water, free oil, adsorbed water and adsorbed oil, based on Faster-RCNN identification of T₁-T₂ map information, exceeds 0.9. These results show that the Faster-RCNN model can assist in the detection and quantitative analysis of shale components.

Moreover, experiments have found that the T₁/T₂ ratio and the signal shape are related to the maturity of kerogen. In the future, a large number of two-dimensional NMR spectra of samples with known kerogen maturity levels will be used for training, and machine learning algorithms are expected to identify shale organic matter faster and more accurately.

Author Contributions

Conceptualization, Z.J.; methodology, C.L.; software, R.C.; validation, Z.J.; formal analysis, C.L.; investigation, C.Z.; resources, C.Z.; data curation, R.C.; funding acquisition, C.Z. Writing—original draft preparation, Z.J.; writing—review and editing, Z.J. and C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 42004105), the National Key Laboratory of Petroleum Resources and Engineering, China University of Petroleum (Beijing) (PRE/open-2304), and the Key Laboratory of Shale Gas Exploration, Ministry of Natural Resources, Chongqing Institute of Geology and Mineral Resources (KLSGE-202101). The APC was funded by the Key Laboratory of Shale Gas Exploration (KLSGE-202101).

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gong, B. Shale Energy Revolution; Springer Publishing: Singapore, 2020. [Google Scholar]
Evans, M.J. Unconventional Hydrocarbons and the US Technology Revolution. In Risks, Rewards and Regulation of Unconventional Gas: A Global Perspective; Grafton, R.Q., Cronshaw, I.G., Moore, M.C., Eds.; Cambridge University Press: Cambridge, UK, 2016; pp. 59–91. [Google Scholar]
Loucks, R.G.; Reed, R.M.; Ruppel, S.C.; Jarvie, D.M. Morphology, genesis, and distribution of nanometer-scale pores in siliceous mudstones of the Mississippian Barnett Shale. J. Sediment. Res. 2009, 79, 848–861. [Google Scholar] [CrossRef]
Zou, C.; Yang, Z.; Cui, J.; Zhu, R.; Hou, L.; Tao, S.; Yuan, X.; Wu, S.; Lin, S.; Wang, L.; et al. Shale gas formation mechanism, geological characteristics and resource potential in China. Pet. Explor. Dev. 2010, 37, 641–653. [Google Scholar] [CrossRef]
Yang, J.; Hatcherian, J.; Hackley, P.C.; Pomerantz, A.E. Nanoscale geochemical and geomechanical characterization of organic matter in shale. Nat. Commun. 2017, 8, 2179. [Google Scholar] [CrossRef] [PubMed]
Loucks, R.G.; Reed, R.M.; Ruppel, S.C.; Hammes, U. Spectrum of pore types and networks in mudrocks and a descriptive classification for matrix-related mudrock pores. AAPG Bull. 2012, 96, 1071–1098. [Google Scholar] [CrossRef]
Wang, P.; Jiang, Z.; Ji, W.; Zhang, C.; Yuan, Y.; Chen, L.; Yin, L. Heterogeneity of intergranular, intraparticle and organic pores in Longmaxi shale in Sichuan Basin, South China: Evidence from SEM digital images and fractal and multifractal geometries. Mar. Pet. Geol. 2016, 72, 122–138. [Google Scholar] [CrossRef]
Zhou, S.; Yan, G.; Xue, H.; Guo, W.; Li, X. 2D and 3D nanopore characterization of gas shale in Longmaxi formation based on FIB-SEM. Mar. Pet. Geol. 2016, 73, 174–180. [Google Scholar] [CrossRef]
Xiao, L.; Zhang, X.; Xie, Q. Study on Shale Gas Petrophysics and Well Logging Evaluation and Microscopic Seepage Characteristics; Science Press: Beijing, China, 2015. [Google Scholar]
Shi, M.; Yu, B.; Xue, Z.; Wu, J.; Yuan, Y. Pore characteristics of organic-rich shales with high thermal maturity: A case study of the Longmaxi gas shale reservoirs from well Yuye-1 in southeastern Chongqing, China. J. Nat. Gas Sci. Eng. 2015, 26, 948–959. [Google Scholar] [CrossRef]
Zargari, S.; Canter, K.L.; Prasad, M. Porosity evolution in oil-prone source rocks. Fuel 2015, 153, 110–117. [Google Scholar] [CrossRef]
Rezaee, R.; Saeedi, A.; Clennell, B. Tight gas sands permeability estimation from mercury injection capillary pressure and nuclear magnetic resonance data. J. Pet. Sci. Technol. 2012, 88, 92–99. [Google Scholar]
Coates, G.; Xiao, L.; Prammer, M. NMR Logging Principles and Applications; Gulf Professional Publishing: Oxford, UK, 1999. [Google Scholar]
Elsayed, M.; Isah, A.; Hiba, M.; Hassan, A.; Al-Garadi, K.; Mahmoud, M.; El-Husseiny, A.; Radwan, A.E. A review on the applications of nuclear magnetic resonance (NMR) in the oil and gas industry: Laboratory and field-scale measurements. J. Pet. Explor. Prod. Technol. 2022, 12, 2747–2784. [Google Scholar] [CrossRef]
Singer, P.M.; Chen, Z.; Hirasaki, G.J. Fluid Typing and Pore Size in Organic Shale Using 2D NMR in Saturated Kerogen Isolates. Petrophysics 2016, 57, 604–619. [Google Scholar]
Liang, C.; Jia, Z.; Xiao, L.; Guo, A. Application limits and influencing factors in characterization of rock cores using phase-encoded T2-y method in low-field NMR. Energy Fuels 2024, 38, 12612–12624. [Google Scholar] [CrossRef]
Kausik, R.; Fellah, K.; Rylander, E.; Singer, P.M.; Lewis, R.E.; Sinclair, S.M. NMR relaxometry in shale and implications for logging. Petrophysics 2016, 57, 339–350. [Google Scholar]
Cao Minh, C.; Crary, S.; Zielinski, L.; Liu, C.B.; Jones, S.; Jacobsen, S. 2D-NMR Applications in Unconventional Reservoirs. In Proceedings of the SPE Canadian Unconventional Resources Conference, Calgary, AB, Canada, 30 October–November 1 2012. SPE-161578-MS. [Google Scholar]
Tinni, A.; Odusina, E.; Sulucarnain, I.; Sondergeld, C.; Rai, C.S. Nuclear-Magnetic-Resonance Response of Brine, Oil, and Methane in Organic-Rich Shales. SPE Reserv. Eval. Eng. 2015, 18, 400–406. [Google Scholar] [CrossRef]
Sigal, R.F. Pore-Size Distributions for Organic-Shale-Reservoir Rocks From Nuclear-Magnetic-Resonance Spectra Combined With Adsorption Measurements. SPE J. 2015, 20, 824–830. [Google Scholar] [CrossRef]
Song, Y.-Q. Focus on the physics of magnetic resonance on porous media. New J. Phys. 2012, 14, 55017. [Google Scholar] [CrossRef]
Chen, J.H.; Zhang, J.; Jin, G.; Quinn, T.; Frost, E.; Chen, J. Capillary condensation and NMR relaxation times in unconventional shale hydrocarbon resources. In Proceedings of the SPWLA Annual Proc SWPLA-2012-186, SPWLA 53rd Annual Symposium, Cartagena, Colombia, 16–20 June 2012. [Google Scholar]
Kausik, R.; Kleinberg, R.L.; Rylander, E.; Sibbit, A.; Westacott, A. A novel determination of total gas-in-place (TGIP) for gas shale from magnetic resonance logs. Petrophysics 2017, 58, 232–241. [Google Scholar]
Habina, I.; Radzik, N.; Topór, T.; Krzyżak, A.T. Insight into oil and gas-shales compounds signatures in low field 1H NMR and its application in porosity evaluation. Microporous Mesoporous Mater. 2017, 252, 37–49. [Google Scholar] [CrossRef]
Washburn, K.E. Relaxation mechanisms and shales. Concepts Magn. Reson. Part A 2014, 43, 57–78. [Google Scholar] [CrossRef]
Rylander, E.; Singer, P.M.; Jiang, T.; Lewis, R.; McLin, R.; Sinclair, S.M. NMR T2 distributions in the Eagle Ford shale: Reflections on pore size. In Proceedings of the SPE 164554, SPE Unconventional Resources Conference, Woodlands, TX, USA, 10–12 April 2013. [Google Scholar]
Cao, X.; Birdwell, J.E.; Chappell, M.A.; Li, Y.; Pignatello, J.J.; Mao, J. Characterization of oil shale, isolated kerogen, and postpyrolysis residues using advanced 13C solid-state nuclear magnetic resonance. AAPG Bull 2013, 97, 421–436. [Google Scholar] [CrossRef]
Kausik, R.; Fellah, K.; Feng, L.; Freed, D.; Simpson, G. High- and low-field NMR relaxometry and diffusometry of the Bakken Petroleum System. Petrophysics 2017, 58, 341–351. [Google Scholar]
Korb, J.-P.; Nicot, B.; Jolivet, I. Dynamics and wettability of petroleum fluids in shale oil probed by 2D T1–T2 and fast field cycling NMR relaxation. Microporous Mesoporous Mater. 2018, 269, 7–11. [Google Scholar] [CrossRef]
Washburn, K.E.; Birdwell, J.E.; Seymour, J.D.; Kirkland, C.; Vogt, S.J. Low-Field Nuclear Magnetic Resonance Characterisation of Organic Content in Shales; Society of Core Analysts Symposium: Napa Valley, CA, USA, 2013. [Google Scholar]
Daigle, H.; Johnson, A.; Gips, J.P.; Sharma, M. Porosity Evaluation of Shales Using NMR Secular Relaxation. In Proceedings of the SPE/AAPG/SEG Unconventional Resources Technology Conference, Denver, CO, USA, 25–27 August 2014. [Google Scholar]
Peng, W.K.; Chen, L.; Boehm, B.O.; Han, J.; Loh, T.P. Molecular phenotyping of oxidative stress in diabetes mellitus with point-of-care NMR system. NPJ Aging Mech. Dis. 2020, 6, 11. [Google Scholar] [CrossRef] [PubMed]
Washburn, K.E.; Birdwell, J.E. Updated methodology for nuclear magnetic resonance characterization of shales. J. Magn. Reason. 2013, 223, 17–24. [Google Scholar] [CrossRef] [PubMed]
Jia, Z.; Xiao, L.; Wang, Z.; Liao, G.; Zhang, Y.; Liang, C. Magic echo for nuclear magnetic resonance characterization of shales. Energy Fuels 2017, 31, 7824–7830. [Google Scholar] [CrossRef]
Jia, Z.; Xiao, L.; Chen, Z.; Wang, Z.; Liao, G.; Zhang, Y.; Liang, C.; Guo, L. Determining shale organic porosity and total organic carbon by combining spin echo, solid echo and magic echo. Microporous Mesoporous Mater. 2018, 269, 12–16. [Google Scholar] [CrossRef]
Song, Y.Q.; Kausik, R. NMR application in unconventional shale reservoirs—A new porous media research frontier. Prog. Nucl. Magn. Reson. Spectrosc. 2019, 112–113, 17–33. [Google Scholar] [CrossRef]
Fleury, M.; Romero-Sarmiento, M. Characterization of shales using T₁–T₂ NMR maps. J. Pet. Sci. Eng. 2016, 137, 55–62. [Google Scholar] [CrossRef]
Kausik, R.; Freed, D.; Fellah, K.; Feng, L.; Ling, Y.; Simpson, G. Frequency and temperature dependence of 2D NMR T₁–T₂ maps of shale. Petrophysics 2019, 60, 37–49. [Google Scholar] [CrossRef]
Tan, M.; Mao, K.; Song, X.; Yang, X.; Xu, J. NMR petrophysical interpretation method of gas shale based on core NMR experiment. J. Pet. Sci. Eng. 2015, 136, 100–111. [Google Scholar] [CrossRef]
Li, J.; Wu, Q.; Lu, J.; Jin, W. Determining the content of free methane gas and adsorbed methane gas in shale using nuclear magnetic resonance technology. Logging Technol. 2018, 42, 71–76. [Google Scholar]
Xiao, W.; Zhang, J.; Du, Y.; Zhao, J.; Zhao, Z. Experimental study on NMR response characteristics of shale imbibition under pressure. J. Southwest Pet. Univ. (Nat. Sci. Ed.) 2019, 41, 13–18. [Google Scholar]
Liu, Z.; Yang, D.; Shao, J.; Hu, Y. Study on the pore connectivity evolution of Fushun oil shale based on low-field nuclear magnetic resonance. J. Spectrosc. 2019, 36., 309–318. [Google Scholar]
Ma, X.; Wang, H.; Zhou, S.; Feng, Z.; Guo, W. Insights into nmr response characteristics of shales and its application in shale gas reservoir evaluation. J. Nat. Gas Sci. Eng. 2020, 84, 103674. [Google Scholar] [CrossRef]
Liu, B.; Jiang, X.; Bai, L.; Lu, R. Investigation of Oil and Water Migrations in Lacustrine Oil Shales Using 20 MHz 2D NMR Relaxometry Techniques. Pet. Sci. 2021; in press. [Google Scholar] [CrossRef]
Zhou, G.; Gu, Z.; Hu, Z.; Chang, J.; Zhan, H. Characterization and interpretation of organic matter, clay minerals, and gas shale rocks with low-field nmr. J. Pet. Sci. Eng. 2020, 195, 107926. [Google Scholar] [CrossRef]
Zhang, D.; Chen, Y.; Meng, J. Well logging curve generation method based on recurrent neural network. Pet. Explor. Dev. 2018, 45, 598–607. [Google Scholar] [CrossRef]
Wang, H.; Zhang, Y. Current status and prospects of artificial intelligence processing and interpretation of logging data. Logging Technol. 2021, 45, 345–356. [Google Scholar]
Kang, D.; Wang, X.; Zheng, X.; Zhao, Y.-P. Predicting the components and types of kerogen in shale by combining machine learning with NMR spectra. Fuel 2021, 290, 120006. [Google Scholar] [CrossRef]
Safaei-Farouji, M.; Kadkhodaie, A. Application of ensemble machine learning methods for kerogen type estimation from petrophysical well logs. J. Pet. Sci. Eng. 2022, 208 Pt B, 109455. [Google Scholar] [CrossRef]
Meng, M.; Zhong, R.; Wei, Z. Prediction of methane adsorption in shale: Classical models and machine learning based models. Fuel 2020, 278, 118358. [Google Scholar] [CrossRef]
Wu, Y.; Misra, S.; Sondergeld, C.; Curtis, M.; Jernigen, J. Machine learning for locating organic matter and pores in scanning electron microscopy images of organic-rich shales. Fuel 2019, 253, 662–676. [Google Scholar] [CrossRef]
Tamoto, H.; dos Santos Gioria, R.; de Carvalho Carneiro, C. Prediction of nuclear magnetic resonance porosity well-logs in a carbonate reservoir using supervised machine learning models. J. Pet. Sci. Eng. 2023, 220, 111169. [Google Scholar] [CrossRef]
Zhuoke, L.; Lin, T.; Liu, X.; Ma, S.; Li, X.; Yang, F.; He, B.; Liu, J.; Zhang, Y.; Xie, L. High-temperature-induced pore system evolution of immature shale with different total organic carbon contents. ACS Omega 2023, 8, 12773–12786. [Google Scholar] [CrossRef] [PubMed]
Bloembergen, N. Nuclear Magnetic Relaxation; Benjamin: New York, NY, USA, 1961. [Google Scholar]

Figure 1. The relationship between relaxation time and τ_c at ω₀ = 4.6 MHz.

Figure 2. T₁-T₂ map for shale components.

Figure 3. T₁-T₂ spectra of three shale samples under different conditions.

Figure 4. T₁-T₂ maps of kerogen extracted from three shale samples.

Figure 5. Faster-RCNN structure.

Figure 6. Technical route for evaluating shale organic matter and pore identification using convolutional neural network technology.

Figure 7. Confusion matrix for the first model training.

Figure 8. Confusion matrix of the fifth model training.

Figure 9. Confusion matrix of the 10th model training.

Figure 10. Confusion matrix of the 15th model training.

Figure 11. Recognition results of two-dimensional NMR maps obtained with the same instrument and software as the training images.

Figure 12. Identification results of 2D NMR spectra obtained by different instruments and software.

Table 1. Precision and confidence at different training iterations.

Number of Trainings	Confidence	Precision
1	0.936	1
5	0.939	1
8	0.962	1
10	0.943	1
15	0.950	1

Table 2. The recall rate and confidence at different training iterations.

Number of Trainings	Confidence	Recall
1	0	0.33
5	0	0.90
8	0	0.86
10	0	0.92
15	0	0.90

Table 3. The F1 score and confidence at different training iterations.

Number of Trainings	Confidence	F1_Score
1	0.786	0.33
5	0.841	0.90
8	0.843	0.85
10	0.816	0.92
15	0.820	0.89

Table 4. The various performance metrics of Faster-RCNN at different training iterations.

TN	Box	O	C	P	R	VB	VO	VC	mAP@0.5	mAP@0.5:0.95
1	0.029	0.029	0.005	0.33	0.33	0.02	0.01	0.0020	0.18	0.06
5	0.017	0.022	0.002	0.90	0.90	0.01	0.01	0.0008	0.86	0.37
8	0.017	0.022	0.002	0.85	0.86	0.01	0.01	0.0008	0.81	0.33
10	0.017	0.021	0.002	0.91	0.92	0.01	0.01	0.0008	0.90	0.34
15	0.017	0.021	0.002	0.89	0.90	0.01	0.01	0.0008	0.86	0.38
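
As a consistency check on the tables above, the F1 score can be recomputed from precision and recall as F1 = 2PR/(P + R). The sketch below does this with the P and R columns of Table 4 and compares the result with Table 3. It assumes that both tables refer to comparable operating points, which the tables themselves do not state.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0 if both are 0)."""
    total = precision + recall
    return 0.0 if total == 0 else 2.0 * precision * recall / total

# (P, R) from Table 4 and F1 from Table 3, keyed by number of trainings.
table4_pr = {1: (0.33, 0.33), 5: (0.90, 0.90), 8: (0.85, 0.86),
             10: (0.91, 0.92), 15: (0.89, 0.90)}
table3_f1 = {1: 0.33, 5: 0.90, 8: 0.85, 10: 0.92, 15: 0.89}

# Largest absolute difference between recomputed and tabulated F1.
max_gap = max(abs(f1_score(*table4_pr[tn]) - table3_f1[tn])
              for tn in table4_pr)

for tn, (p, r) in table4_pr.items():
    print(tn, round(f1_score(p, r), 3), table3_f1[tn])
```

For these values the recomputed and tabulated F1 scores differ by less than 0.01 in every round, which is within the rounding of the two-decimal entries.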

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
