Research on Image Identiﬁcation Method of Rock Thin Slices in Tight Oil Reservoirs Based on Mask R-CNN

: Terrestrial tight oil has extremely strong diagenesis heterogeneity, so a large number of rock thin slices are needed to reveal the real microscopic pore-throat structure characteristics. In addition, difﬁcult identiﬁcation, high cost, long time, strong subjectivity and other problems exist in the identiﬁcation of tight oil rock thin slices, and it is difﬁcult to meet the needs of ﬁne description and quantitative characterization of the reservoir. In this paper, a method for identifying the characteristics of rock thin slices in tight oil reservoirs based on the deep learning technique was proposed. The present work has the following steps: ﬁrst, the image preprocessing technique was studied. The original image noise was removed by ﬁltering, and the image pixel size was uniﬁed by a normalization technique to ensure the quality of samples; second, the self-labeling image data augmentation technique was constructed to solve the problem of sparse samples; third, the Mask R-CNN algorithm was introduced and improved to synchronize the segmentation and recognition of rock thin slice components in tight oil reservoirs; Finally, it was demonstrated through experiments that the SMR method has signiﬁcant advantages in accuracy, execution speed and migration.


Introduction
Tight oil reservoirs [1][2][3] in unconventional oil and gas resources have a complex pore structure and great resource potential, and the identification of rock thin slices is significant for the analysis of the microscopic pore structure and reservoir-sweet spot distribution [4]. Tight oil reservoirs are generally characterized by large heterogeneity and strong diagenesis, resulting in blurred component boundaries in rock thin slices and difficult image identification. It is urgent to establish an image identification method for rock thin slices in tight oil reservoirs based on the cross fusion of geological big data and artificial intelligence (AI), to achieve a multidimensional quantitative segmentation of rock thin slice images and fast intelligent recognition of its components [5][6][7][8]. The current identification methods include artificial intelligence-based image segmentation methods for rock thin slices in tight oil reservoirs and intelligent identification technology.
The artificial intelligence-based image segmentation methods for rock thin slices in tight oil reservoirs mainly consist of superpixel segmentation [9] and semantic segmentation [10]. Superpixel segmentation can be divided into graph theory and pixel clustering. The graph theory converts the segmentation and recognition of slice components into the division of image pixel units. In the Citations [11,12], the composition and structure of rock thin slices were segmented and analyzed by graph theory. However, due to the inaccurate division boundaries, there were segmentation cavities and over-segmentation problems. Pixel clustering clusters the pixels with similar spatial distance and similar features such as color, brightness and texture into the same superpixel to obtain the image segmentation result [10,13]. A K-means clustering algorithm based on probability selection was proposed in Citation [14], which solved the problems of the traditional method of being time-consuming and prone to discontinuity; However, the K center point is still hard to choose, and the clustering process is affected by noise and outliers. Semantic segmentation first assigns a semantic category label to each pixel, then divides and predicts slice components through the establishment of region proposals and classification evaluation. In the Citations [15,16], applying semantic segmentation can mark the overall boundary of different components, but cannot delineate independent individuals accurately.
Discriminant classifiers and neural networks are the two main intelligent identification techniques of tight oil reservoir composition. The key to the discriminant classifier is learning the geometric features of each component. In Citation [17], a semiautomatic identification method for the pore images of rock thin slices was proposed, but dissolution pores could not be well identified. The neural network algorithm identifies the components of rock thin slice images by establishing a neural network model. In Citation [18], 12 image features such as component hue and saturation were captured with artificial neural network (ANN) to identify component categories, but ANN is not suitable for high-density slice images with similar optical characteristics.
To sum up, the existing methods have low identification accuracy because of algorithm design flaws, susceptibility to noise, complex slice image structure, etc. In addition, due to the complex process and high cost of making tight oil rock thin slices [9,19], there are rarely sufficient samples for the existing methods. To this end, given the outstanding performance of Mask region-based convolutional neural network (Mask R-CNN) algorithm [20] in semantic segmentation, object detection and instance segmentation of natural images [21], a self-labeling augmented Mask R-CNN method for component identification of tight oil slices based on transfer learning (SMR) was proposed in this paper. First, the image preprocessing technique was studied. The original image noise was removed by filtering, and the image pixel size was unified by a normalization technique to ensure the quality of the samples; second, the self-labeling image data augmentation technique was constructed to solve the problem of sparse samples; third, the Mask R-CNN algorithm was introduced and improved to synchronize the segmentation and recognition of rock thin slice components in tight oil reservoirs; finally, the accuracy, execution speed and migration of the SMR method were demonstrated by experiments. At the same time, the SMR method has high segmentation accuracy and good recognition effect, which greatly saves the working time and labor cost of geologists. This method has a good application value for the microscopic study of tight oil reservoirs.
The rest of this paper is organized as below: the Section 2 elaborates the techniques; the Section 3 describes the experimental scheme; the Section 4 analyzes and discusses the experimental results; the Section 5 summarizes the study and prospects for future work.

SMR Methods
The workflow and key techniques of the SMR method are mainly elaborated in this section. The workflow of SMR mainly includes three stages: establishment of slice image data sets, augmentation of slice images, and identification of slice image components ( Figure 1). The key techniques at each stage are, respectively, an image preprocessing technique, a self-labeling image augmentation mechanism and an improved Mask R-CNN algorithm. This will be discussed in detail in the stages below.
The augmentation of slice images realizes the augmentation of a small number of labeled images based on the self-labeling mechanism [22]. The identification of slice image components completes the fine segmentation and accurate identification of the slice image components by the Mask R-CNN algorithm.

Establishment of Slice Image Data Sets
There are two steps (image preprocessing and image labeling) at this stage. The original data sets used in this paper came from two tight reservoirs in China: the Fuyu

Establishment of Slice Image Data Sets
There are two steps (image preprocessing and image labeling) at this stage. The original data sets used in this paper came from two tight reservoirs in China: the Fuyu reservoir in Sanzhao Sag of Songliao Basin (SZS data set) and the Upper Paleozoic in Linxing Block of Ordos Basin (OB data set), with a total of 100 bitmaps. The SZS data set was a self-made training data set, the original image size was 616 × 416 pixels, and the number was 50; the OB data set was a migration test data set, the original image size was 2560 × 1920 pixels, and the number of samples was 45.
To avoid the influence of noise and image size on experimental results, in this paper image preprocessing was firstly performed. Image preprocessing includes image denoising and image normalization, as marked by ① in Figure 1. Since the non-local means algorithm [23] can retain the texture features of images while denoising, image denoising was realized by applying the non-local means method with Gaussian core [24]. Meanwhile, in order to meet the image quality requirements of Mask R-CNN algorithm, the image size was unified to 515 × 512 pixels by the resize image processing method, to ensure that the SZS image quality was closest to the original data.
Image labeling guides the training process. LabelMe [25] was used in this paper for semantic labeling to ensure the accuracy of labels. The labels consist of Quartz, Feldspar, Lithic, Primary Pore (PP), Casting Pore (CP), Cemented Dissolution Pore (CDP) and Microcrack, as marked by ② in Figure 1. The labeled image samples would be formed into JSON files and be converted to the desired format. To avoid the influence of noise and image size on experimental results, in this paper image preprocessing was firstly performed. Image preprocessing includes image denoising and image normalization, as marked by 1 in Figure 1. Since the non-local means algorithm [23] can retain the texture features of images while denoising, image denoising was realized by applying the non-local means method with Gaussian core [24]. Meanwhile, in order to meet the image quality requirements of Mask R-CNN algorithm, the image size was unified to 515 × 512 pixels by the resize image processing method, to ensure that the SZS image quality was closest to the original data.
Image labeling guides the training process. LabelMe [25] was used in this paper for semantic labeling to ensure the accuracy of labels. The labels consist of Quartz, Feldspar, Lithic, Primary Pore (PP), Casting Pore (CP), Cemented Dissolution Pore (CDP) and Microcrack, as marked by 2 in Figure 1. The labeled image samples would be formed into JSON files and be converted to the desired format.

Augmentation of Slice Images
The self-labeling image augmentation mechanism [26] was introduced to augment the number of data sets and improve network robustness. Currently, image augmentation mainly depends on deep learning and image processing [27,28]. Deep learning requires a large number of training samples, and the images generated still rely on large-scale manual verification and labeling, so it is not suitable for the augmentation of small sample slice images. Image processing realizes image augmentation based on small sample images, including flipping, color transformation, cropping, rotation, translation, noise injection [29], and mixed images [30]. Based on image processing, the self-labeling mechanism was introduced to complete the augmentation of labeled image samples and corresponding label files, as shown in (2) in Figure 1. The self-labeling mechanism determines the position change of each component labeling point through spatial geometry principle, and completes the labeling of augmented images.
To ensure the clarity of the augmented images and the accuracy of labeling points, five image augmentation forms were selected and two constraints were set.
(3) Gaussian blur: 3 × 3, 5 × 5 and 7 × 7 Gaussian kernels were applied to blur images globally, horizontally, vertically and diagonally. The set constraints were: (1) Two augmentation forms can be used, at most, each time, and flipping and rotation can be selected simultaneously; (2) Three augmentation forms of Gaussian blur, change exposure and noise injection cannot be selected at the same time.
After augmentation, the total number F num of samples was: where, n denotes the type of initial image change, k ∈ [1, n], a k represents the number of operations, and p a k is the number of images generated by operation change. The number of valid samples after augmentation reached 15,350, which were divided into training set and test set at the ratio of 7:3.

Identification of Slice Image Components
Mask-RCNN is the core algorithm for identification of slice image components, which mainly includes three parts: backbone, region of interest area (ROI), segmentation and recognition.
(1) The backbone consists of ResNet101 [31] and Feature Pyramid Network (FPN), as marked by 3 in Figure 1. ResNet101 extracts image features through residual network to obtain the feature layer; FPN extracts the features and semantic values of each component in the feature layer by undersampling, and then generates the effective feature layer (P2, P3, P4, P5, P6) [32] by upsampling and fusion of feature layers, to complete feature extraction, as shown in Figure 2.  Figure 3. If the region proposal detected by the binary classification network does not contain components, the edge box is invalid. Each ROI area is finally adjusted to ROI align of the same size.  (2) The ROI area is the area where the region proposal network (RPN) slides on the effective feature layer to obtain and output the possible slice components [33], as marked by 4 in Figure 1. The RPN consists of a binary classification network and a regression network. The former detects the slice components of the candidate area by judging the intersection over union value (IOU) in region proposals, and the latter outputs the edge boxes of each component region proposal. The RPN structure is shown in Figure 3. If the region proposal detected by the binary classification network does not contain components, the edge box is invalid. Each ROI area is finally adjusted to ROI align of the same size.  (3) The segmentation and recognition part is composed of fully connected layers and mask branch, as marked by 5 in Figure 1. Fully connected layers identify each component category with ROI align as input, and regress and refine the edge box of each component; mask branch applies a small fully connected network to generate pixel-level object masks for each component in ROI align and complete the instance segmentation of components. The total loss function L T of Mask R-CNN can be defined by Equation (2) [20], where L cls is the recognition process loss; L box is the box regression loss; L mask is the segmentation loss.

Mask R-CNN Algorithm Training
Transfer learning was used for algorithm pretraining, in order to improve the learning efficiency and training speed of Mask R-CNN algorithm. Transfer learning is a machine learning technique that applies the knowledge of a task to related scenarios to complete a new task [33]. The specific performance of this technique in this paper was to apply the training weights on the COCO [34] data set to the Mask R-CNN algorithm, which is mainly divided into three steps: first, the pretraining network structure is modified according to the component type; second, a small learning rate is applied to optimize the pretraining network; finally, the algorithm training is completed to realize the segmentation and recognition of each component. In algorithm training, the stochastic gradient descent method is adopted to update the training parameter θ j through cost function J(θ) to complete iterative training. The update process of training parameter θ j can be defined as Equation (3) [31], where, α is a hyper-parameter representing the learning rate. The learning rate was finally set as α = 10 −5 through fine-tuning test, and the algorithm training was completed with 100 learning epochs. In addition, to optimize the network performance, the IOU size was set to 60% and the batch size to 8 steps. Training time was 34 h. Each loss Energies 2022, 15, 5818 6 of 16 in the iterative process is shown in Figure 4. It can be seen from the figure that the 60th epoch model tends to be stable.
scent method is adopted to update the training parameter j θ through cost function J( ) θ to complete iterative training. The update process of training parameter j θ can be defined as Equation (3) [31], where, α is a hyper-parameter representing the learning rate. The learning rate was finally set as 5 10 − α = through fine-tuning test, and the algorithm training was completed with 100 learning epochs. In addition, to optimize the network performance, the IOU size was set to 60% and the batch size to 8 steps. Training time was 34 h. Each loss in the iterative process is shown in Figure 4. It can be seen from the figure that the 60th epoch model tends to be stable.

Experimental Scheme
An accuracy experiment, execution speed experiment and migration experiment were conducted in this paper. The specific parameters of experimental devices are shown below: CPU: Intel Xeon Silver 4210R, Memory: 64G, GPU: RTX 6000/8000; Operating system: Ubuntu 20.04.3; Experimental framework: Tensorflow-GPU 2.1.0. The comparison algorithm was YOLACT [35] (the self-labeling image augmentation mechanism ("SYL") was added).

Accuracy Experiment
The accuracy experiment included segmentation accuracy and recognition accuracy.

Experimental Scheme
An accuracy experiment, execution speed experiment and migration experiment were conducted in this paper. The specific parameters of experimental devices are shown below: CPU: Intel Xeon Silver 4210R, Memory: 64G, GPU: RTX 6000/8000; Operating system: Ubuntu 20.04.3; Experimental framework: Tensorflow-GPU 2.1.0. The comparison algorithm was YOLACT [35] (the self-labeling image augmentation mechanism ("SYL") was added).

Accuracy Experiment
The accuracy experiment included segmentation accuracy and recognition accuracy.

Segmentation Accuracy Experiment
The evaluation method of the segmentation accuracy experiment was the roughness analysis method for shape particles based on improved Fourier particle profile proposed by Su et al. [36]. In this method, first, the particle profile is reconstructed with Fourier seriesbased method; second, the elongation (EI), angle (AI) and arithmetic average roughness (Ra) of the segmented profile are calculated based on the reconstruction; then, the results are compared with the manual calculation results to obtain the error; finally, the segmentation accuracy of the algorithm is judged by the error. The experimental data set was the test set of SZS original images, and the number was 15; The segmentation experiment was carried out by randomly selecting images from experimental data sets, and the segmentation accuracy was evaluated with the evaluation method.
Reconstruction of particle profiles with Fourier series-based method. This process consists of circular parameterization and function representation. Circular parameterization refers to mapping the points on the edge of segmented component to the circle with the same perimeter, as shown in Figure 5, the points A, B, and C on the original edge are mapped to A , B and C , and the distance between each point remains constant during the mapping process. Function representation refers to applying Fourier series to where x and y represent coordinates of the boundary points; n is the serial number of harmonics; N is the total number of harmonics; a x0 , a y0 , a xn , a yn , b xn , b yn are Fourier coefficients that are calculated by coordinates of the sampled points on the aggregate particle boundary by Equations (4) and (5). For a given N, the total number of coefficients is 4N + 2 and for a given number of sampled points M, the total number of coordinates is 2M.  AI computing. Since components in the tight oil sandstone slice image were noncircular particles, the original equation was discretized in this paper and expressed by Equation (9): where, ' ( ) x φ and ( ) ' y φ represent the derivative of ( ) x φ and ( ) y φ , respectively, and θ is the angle starting along the X-axis, w is the increment element, Δϕ is 2 / w π , and denotes the increment of polar coordinate angle.
Ra computing. Ra computing was obtained by comparing the real profile and reconstructed profile of each component, as shown in Equations (10) and (11).
where, r represents the vertical deviation between real profile and reference profile of each component, l is the line segment length of reference profile, and L is the total length of all line segments.

Accuracy Experiment
The evaluation method of the accuracy experiment calculates the precision, recall, F1 score and accuracy of the recognition result of each component [38,39]. The experimental data set is the same as the segmentation accuracy experiment. The SZS experimental data set was used for recognition experiment, and the accuracy was evaluated by the evaluation method.
Precision. Precision refers to the ratio of the number of components correctly identi- EI computing [37]. The EI value reflects the composition profile of tight oil slice image, and the computing method can be expressed as: where, D min and D max represent the lengths of short diameter and long diameter, respectively, which are judged by the normal vector second-order tensor matrix.
where, L p denotes the profile perimeter of each component; l ϕ is the corresponding arc length of the polar angle ϕ; T ϕ i denotes the unit normal vector component in the direction of micro-arc i corresponding to the polar angle ϕ; T ϕ j represents the unit normal vector component in the direction of micro-arc j corresponding to the polar angle ϕ; λ a and λ b refer to the eigenvalues of matrix C, i.e., the long and short diameter directions of Ω ij ; η represents the long diameter directions of each component. After determining the long and short diameters, D min and D max can be calculated and the EI value can be determined. AI computing. Since components in the tight oil sandstone slice image were noncircular particles, the original equation was discretized in this paper and expressed by Equation (9): where, x (ϕ) and y (ϕ) represent the derivative of x(ϕ) and y(ϕ), respectively, and θ is the angle starting along the X-axis, w is the increment element, ∆ϕ is 2π/w, and denotes the increment of polar coordinate angle. Ra computing. Ra computing was obtained by comparing the real profile and reconstructed profile of each component, as shown in Equations (10) and (11).
where, r represents the vertical deviation between real profile and reference profile of each component, l is the line segment length of reference profile, and L is the total length of all line segments.

Accuracy Experiment
The evaluation method of the accuracy experiment calculates the precision, recall, F1 score and accuracy of the recognition result of each component [38,39]. The experimental data set is the same as the segmentation accuracy experiment. The SZS experimental data set was used for recognition experiment, and the accuracy was evaluated by the evaluation method.
Precision. Precision refers to the ratio of the number of components correctly identified to the number of components identified, as shown in Equation (12), where TP is the true positive test; FP is the false positive test. Precision = TP/(TP + FP) (12) Recall. Recall refers to the ratio of components correctly identified in the true component category, as shown in Equation (13), where FN is the false negative test.
The F1 score was defined according to precision and recall, as shown in Equation (14).

Execution Speed Experiment
The evaluation method of execution speed experiment tests the mean response time (MRT) value of two algorithms to judge the execution speed that the algorithm completes the identification task. The experimental data set was the images of SZS test set, and the number was 1000. The selection of data set is shown in Table 1. The number of experimental data sets was set as N, N ∈ {100, 200, 400, 600, 800, 1000}. MRT values of component identification completed by the algorithm were tested 5 times, and the average value of 5 experiments was finally calculated.

Migration Experiment
The evaluation method of migration experiment calculates the precision and accuracy of two methods in different experimental region proposals. SZS and OB original test sets were experimental data sets, and the number of SZS was 15; The OB test data set was divided into three groups with 15 in each group and 45 in total. Different experimental data sets were selected, and the number of data sets increased from 3 to 15 (an increase of 3) to calculate the precision and accuracy.

Segmentation Accuracy Experiment Results
The calculation results of some EI, AI and Ra values in the segmentation accuracy experiment are shown in Figure 6. The values on the left outside the brackets are the SMR calculation results, the values on the right outside the brackets are the calculation results of SYL, and the values in the brackets are the results manually calculated by geologists. The error distribution results are shown in Figure 7.
The following conclusions can be drawn by analyzing Figures 6 and 7: Conclusion 1: The SMR method had relatively stable error calculation results compared with the SYL method, and the error of SMR and manual calculation results was within 10%, indicating that SMR method has higher segmentation accuracy for each component.

Conclusion 2:
The SMR method had the lowest segmentation accuracy error for cuttings, probably due to the largest number of cuttings.

Accuracy Experiment Results
To verify the accuracy of the recognition results, the confusion matrix of the recognition results of the two algorithms was established, as shown in Figure 8, where the X-axis represents the predicted value and the Y-axis represents the true value. The precision and recall of two algorithms for each component are shown in Table 2 (The green highlights indicate better experimental results); the F1 score and the accuracy under different types of components are shown in Figure 9.

Accuracy Experiment Results
To verify the accuracy of the recognition results, the confusion matrix of the recognition results of the two algorithms was established, as shown in Figure 8, where the X-axis represents the predicted value and the Y-axis represents the true value. The precision and recall of two algorithms for each component are shown in Table 2 (The green highlights indicate better experimental results); the F1 score and the accuracy under different types of components are shown in Figure 9.
The following conclusions can be drawn by analyzing the experimental results: Conclusion 1: Figure 8 shows that some quartz and lithic samples were identified as feldspar, and some PPs and CDPs were identified as CP. This is mainly induced by three causes: (1) the number of quartz and lithic is more than that of feldspar; the number of PP and CDP is more than that of CP; (2) Under single polarized light, quartz, lithic and feldspar have similar optical characteristics; PP, CDP and CP have similar optical characteristics; (3) There may be mislabeling in data labeling. Besides, a certain number of lithic samples were identified as quartz, and some CDPs were identified as PP. However, this had little impact on the recognition effect of quartz and PP due to the large number of quartz and PP.
Conclusion 2: According to Table 2, the two algorithms had the same precision for microcrack, while the SMR algorithm had higher precision and recall for other components than the SYL algorithm, indicating that the SMR algorithm had better accuracy. However, both algorithms had low precision for feldspar and CP, because the number of such components was relatively small and they had no obvious features.
Conclusion 3: According to Figure 9a, quartz, lithic, PP and CDP can be easily recognized with the two algorithms, but their recognition effect on feldspar, CP and microcrack was relatively poor, due to the influence of the number of components. There were few microcracks, but they have obvious characteristics, so they were not easily affected by other components, and can be better recognized than feldspar and CP. In addition, the F1 score of the SMR algorithm for each component was higher than that of the SYL algorithm.
Conclusion 4: Figure 9b indicates that the recognition accuracy of the two algorithms was above 75% and relatively stable, but the overall SMR algorithm was above 88%, suggesting that the SMR algorithm had a higher recognition accuracy.

Execution Speed Experiment Results
The execution speed experiment results of the two algorithms are shown in Table 3 (The green highlights indicate better experimental results). The growth trend of the MRT value with experimental data size is shown in Figure 10. The X-axis represents the number of flakes and the Y-axis represents the time required.
The following conclusions can be drawn by analyzing the experimental results: Conclusion 1: Table 3 shows that the MRT of the SMR algorithm was smaller than that of the SYL algorithm, indicating that the SMR algorithm has a higher execution speed. In addition, when the experimental data set was 1000, the MRT of the SMR algorithm was 95.34 seconds, showing that the operating efficiency of the SMR algorithm can meet the actual needs of image segmentation and recognition of tight oil sandstone slices.
Conclusion 2: According to Figure 10, the SYL algorithm was relatively stable under a small number of data sets, but the MRT value increased significantly when the experimental data set was large (>600 in the experiment), suggesting that the algorithm is easily affected by data sets. The SMR algorithm tended to be linearly correlated with the number of experimental data sets, indicating that the execution speed of the algorithm is relatively less affected by the number of data sets.  The following conclusions can be drawn by analyzing the experimental results: Conclusion 1: Figure 8 shows that some quartz and lithic samples were identified as feldspar, and some PPs and CDPs were identified as CP. This is mainly induced by three causes: (1) the number of quartz and lithic is more than that of feldspar; the number of PP and CDP is more than that of CP; (2) Under single polarized light, quartz, lithic and feldspar have similar optical characteristics; PP, CDP and CP have similar optical characteristics; (3) There may be mislabeling in data labeling. Besides, a certain number of lithic samples were identified as quartz, and some CDPs were identified as PP. However, this had little impact on the recognition effect of quartz and PP due to the large number of quartz and PP.
Conclusion 2: According to Table 2, the two algorithms had the same precision for microcrack, while the SMR algorithm had higher precision and recall for other components than the SYL algorithm, indicating that the SMR algorithm had better accuracy. However, both algorithms had low precision for feldspar and CP, because the number of such components was relatively small and they had no obvious features.
Conclusion 3: According to Figure 9a, quartz, lithic, PP and CDP can be easily recognized with the two algorithms, but their recognition effect on feldspar, CP and microcrack was relatively poor, due to the influence of the number of components. There were few microcracks, but they have obvious characteristics, so they were not easily affected by other components, and can be better recognized than feldspar and CP. In addition, the F1 score of the SMR algorithm for each component was higher than that of the SYL algorithm.
Conclusion 4: Figure 9b indicates that the recognition accuracy of the two algorithms was above 75% and relatively stable, but the overall SMR algorithm was above 88%, suggesting that the SMR algorithm had a higher recognition accuracy.

Execution Speed Experiment Results
The execution speed experiment results of the two algorithms are shown in Table 3 (The green highlights indicate better experimental results). The growth trend of the MRT value with experimental data size is shown in Figure 10. The X-axis represents the number of flakes and the Y-axis represents the time required.   The following conclusions can be drawn by analyzing the experimental results: Conclusion 1: Table 3 shows that the MRT of the SMR algorithm was smaller than that of the SYL algorithm, indicating that the SMR algorithm has a higher execution speed. In addition, when the experimental data set was 1000, the MRT of the SMR algorithm was 95.34 seconds, showing that the operating efficiency of the SMR algorithm can meet the actual needs of image segmentation and recognition of tight oil sandstone slices.

Migration Experiment Results
Conclusion 2: According to Figure 10, the SYL algorithm was relatively stable under a small number of data sets, but the MRT value increased significantly when the experimental data set was large (>600 in the experiment), suggesting that the algorithm is easily affected by data sets. The SMR algorithm tended to be linearly correlated with the number of experimental data sets, indicating that the execution speed of the algorithm is relatively less affected by the number of data sets.

Migration Experiment Results
Figure 11a-h shows the migration experimental results of the two algorithms, and the effects of the control experimental data set are shown in Figure 11a,b. EI, AI and Ra are indexed on the left Y-axis, while precision, recall, F1 and accuracy on the right Y-axis. Error Range (%) The following conclusions can be drawn by analyzing the migration experiment results:

Number Of Test Sets
Conclusion 1: Figure 11 shows that since different sedimentary environments led to different types and contents of slice components in the OB region proposal, the  The following conclusions can be drawn by analyzing the migration experiment results: Conclusion 1: Figure 11 shows that since different sedimentary environments led to different types and contents of slice components in the OB region proposal, the segmentation accuracy and recognition accuracy of the two algorithms were lower than that in the SZS region proposal. The segmentation accuracy and recognition accuracy of the SMR algorithm were higher than those of SYL algorithm, indicating that the SMR algorithm has better migration.
Conclusion 2: According to Figure 11c-h, the accuracy error of the SMR algorithm was within 10, with precision of 0.5-0.7, recall of 0.7-0.85, F1 of 0.6-0.75, accuracy of 0.75-0.9, higher than those of the SYL algorithm. The lower precision and F1 results may be caused by the large number of lithic samples, and misjudgment directly affects the calculation results of quartz and feldspar. Better overall results can be obtained with the SMR algorithm, demonstrating that the SMR algorithm has good migration and applies to the segmentation and recognition of other slice components.
Conclusion 3: From Figure 11c,e,g, the results of various indicators of the SMR algorithm were relatively stable under different experimental data sets, indicating that the SMR method has better migration.

Conclusions
Characteristic identification of rock thin slices of tight oil reservoirs is the core of analyzing the characteristics of microscopic pore-throat structures, and is quite important in reservoir-sweet spot prediction and microscopic evaluation. In order to solve the problem of low accuracy caused by traditional methods due to algorithm design flaws, susceptibility to noise interference, complex slice image structure and sparse sample sizes, the SMR method was proposed in this paper. Through theoretical elaboration and experimental demonstration, the research conclusions of the SMR method are summarized below: (1) Image preprocessing can improve image quality and avoid noise interference; (2) The self-labeling image data augmentation mechanism can increase the number of samples and ensure the availability of samples; (3) Image segmentation and recognition can be simultaneously realized with the improved Mask R-CNN algorithm. The error of segmentation accuracy and manual calculation results was within 10%, and the overall recognition accuracy was 93.18%, so it can be applied to characteristic identification of rock thin slices of tight oil reservoirs.
The key work of this paper includes the augmentation and feature identification of dense rock images. Therefore, it can be applied to the amplification and identification of dense images in related fields such as medical cells and physical molecules. In addition, the SMR method can be improved and applied to the fields of fault identification, thin-section pore-throat feature analysis, and reservoir simulation evaluation.
In the subsequent study, we will optimize the image augmentation process of rocks in tight oil reservoirs using GAN network, so as to further ensure the availability of incremental samples. At the same time, we consider the AlexNet proposed in the paper [40] as the backbone network, combined with the adaptive idea of paper [41] to improve the Mask R-CNN algorithm, optimize the network structure design, improve the identification effect and apply it to the pore-throat feature analysis to evaluate the potential of the method.