Article

EA-UNet Based Segmentation Method for OCT Image of Uterine Cavity

1 College of Mechanical Engineering, University of South China, Hengyang 421001, China
2 Institute of Medical Imaging, Hengyang Medical School, University of South China, Hengyang 421001, China
3 The First Affiliated Hospital, Medical Imaging Centre, Hengyang Medical School, University of South China, Hengyang 421001, China
4 Research Center for Biomedical Optics and Molecular Imaging, Shenzhen Key Laboratory for Molecular Imaging, Guangdong Provincial Key Laboratory of Biomedical Optical Imaging Technology, CAS Key Laboratory of Health Informatics, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
5 The Seventh Affiliated Hospital, Hunan Veterans Administration Hospital, Hengyang Medical School, University of South China, Changsha 410000, China
* Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Photonics 2023, 10(1), 73; https://doi.org/10.3390/photonics10010073
Submission received: 21 November 2022 / Revised: 21 December 2022 / Accepted: 6 January 2023 / Published: 9 January 2023
(This article belongs to the Section Biophotonics and Biomedical Optics)

Abstract

Optical coherence tomography (OCT) image processing can provide information about the uterine cavity structure, such as endometrial surface roughness, which is important for the diagnosis of uterine cavity lesions. The accurate segmentation of uterine cavity OCT images is a key step of OCT image processing. We proposed an EA-UNet-based image segmentation model that uses a U-Net network structure with a multi-scale attention mechanism to improve the segmentation accuracy of uterine cavity OCT images. The E(ECA-C) module introduces a convolutional layer combined with the ECA attention mechanism instead of max pool, reduces the loss of feature information, enables the model to focus on features in the region to be segmented, and suppresses irrelevant features to enhance the network’s feature-extraction capability and learning potential. We also introduce the A (Attention Gates) module to improve the model’s segmentation accuracy by using global contextual information. Our experimental results show that the proposed EA-UNet can enhance the model’s feature-extraction ability; furthermore, its MIoU, Sensitivity, and Specificity indexes are 0.9379, 0.9457, and 0.9908, respectively, indicating that the model can effectively improve uterine cavity OCT image segmentation and has better segmentation performance.

1. Introduction

The uterine cavity’s anatomical information, including intrauterine area, endometrial thickness, and the endometrial surface’s fine structure, has important applications in the diagnosis of obstetric and gynecological diseases and the preoperative evaluation of assisted reproductive technologies. However, suitable imaging modalities are lacking in clinical practice. The commonly used transvaginal ultrasound provides only coarse information, while hysteroscopy is limited in resolving fine structures. Optical coherence tomography (OCT) is a well-developed biomedical imaging technique [1] with significant advantages, such as real-time, non-invasive, and high-resolution imaging [2,3]. Given its suitability for intraluminal imaging, OCT has good prospects for the clinical evaluation of intraluminal lesions, and it already has mature applications in the cardiovascular system [4,5,6,7], esophagus [8,9,10], gastrointestinal tract [11,12,13,14,15], and reproductive tract [16,17,18,19,20,21].
OCT images of the uterine cavity can provide clear information regarding its structure, such as endometrial surface roughness, which can be analyzed by OCT image processing. Recently, a series of studies reported on the application of OCT intracavitary imaging for the female reproductive tract, particularly intrauterine imaging [22]. These reports confirm that OCT can accurately reflect the uterine cavity’s structural information at the tissue level of the endometrial surface, which is important for the diagnosis of uterine cavity lesions. However, the accurate segmentation of uterine cavity OCT images is key. The use of computer-aided technology (CAD) [23,24,25] is an effective way to solve this problem. Early CAD techniques used traditional machine learning segmentation methods based on manual features, and despite some achievements in the field of OCT image segmentation, persistent problems remain, such as a heavy reliance on manually designed features, low feature levels, high computational cost, and a complex processing flow. In recent years, deep learning approaches [26] have achieved remarkable results in numerous computer vision fields [27,28,29] and medical image analysis applications [30,31,32]. Moradi et al. [33] proposed an attention-based UNet model for the automatic image analysis, pattern recognition, and segmentation of kidney OCT images. Liu et al. [34] proposed an enhanced nested UNet architecture (MDAN-UNet) that takes advantage of multi-scale input, multi-scale side output, and dual-attention mechanisms; it is a new, powerful, fully convolutional network for automatic end-to-end OCT image segmentation. Fang et al. [35] proposed a new segmentation framework combining CNN and graph search methods (CNN-GS) to segment the nine retinal layer boundaries in OCT images of patients with non-exudative AMD. It consists of two main parts: (1) CNN layer boundary classification; (2) CNN probability map-based graph search layer segmentation. Wang et al. [36] used a CNN to segment CNV in OCT angiography. Shah et al. [37] proposed a convolutional neural network (CNN)-based framework to segment multiple surfaces simultaneously; a single CNN was trained to segment three retinal surfaces in two types of OCT images, namely the normal retina and the retina affected by intermediate age-related macular degeneration (AMD). Chen et al. [38] proposed MSDA-UNet, a multiscale dual-attention network for OCT lesion region segmentation; it can extract lesion region information from OCT images at different scales and perform end-to-end segmentation of OCT retinal lesion regions. Santos et al. [39] proposed CorneaNet, a deep fully convolutional neural network for segmenting corneal OCT images with high accuracy. Guo et al. [40] proposed a lightweight network model for segmenting retinal vessels by introducing spatial attention into U-Net, naming it Spatial Attention U-Net (SA-UNet). Xu et al. [41] proposed an optimized squeeze-excitation connection (SEC) module integrated with UNet, called SEC-UNet, which not only focuses on the target but also steps out of local optima to obtain accurate and complete choroidal layer segmentation in OCT images. Singh et al. [42] presented a new benchmark for segmentation of the retinal external limiting membrane (ELM) using a spectral-domain OCT image dataset from a population of patients with idiopathic full-thickness macular holes. Gao et al. [43] proposed a new privileged modality distillation (PMD) framework for vessel border detection in intracoronary imaging (VBDI); PMD transforms the single-input single-task (SIST) learning problem in single-modality VBDI into a multi-input multi-task (MIMT) problem to help the learning model in the target modality. Medical image segmentation via deep learning has shown several advantages over traditional machine learning algorithms, including higher accuracy and reliability, more efficient GPU-based computing power, and lower power consumption [44,45,46,47]. Convolutional neural networks (CNNs) [48] significantly improve the performance of segmentation tasks by utilizing fast and reliable training. Lee et al. [49] applied a CNN to achieve the automatic segmentation of macular edema in OCT images. Fully convolutional networks (FCN) [27] have achieved remarkable results in the field of image segmentation. Ronneberger et al., inspired by the FCN, proposed UNet [50], which combines deep semantic and spatial information through encoder and decoder blocks and skip connections. The UNet architecture has achieved excellent results in many medical image segmentation tasks and is widely used in OCT image segmentation, such as optic nerve head tissue segmentation [51], drusen segmentation [52], intraretinal cystic fluid (IRC) segmentation [53], fluid region segmentation [54], and retinal layer segmentation [55]. Kepp et al. [56] proposed a deep learning approach using a U-Net-based network to segment different tissue regions in OCT images of mouse skin, and the segmentation results were in agreement with expert manual segmentation.
At present, various OCT image segmentation methods are used clinically; however, an effective OCT image segmentation method for the uterine cavity has not yet been reported, and to our knowledge our paper is the first to propose one. Secondly, deep-learning-based OCT image processing allows the uterine cavity to be segmented accurately, which helps to analyze the uterine cavity structure’s information, including thickness changes, which is important for the diagnosis of uterine cavity lesions. Finally, our study aims to explore a real-time and accurate uterine cavity OCT image segmentation method. These are the main motivations of this study. However, OCT images contain a large amount of redundancy, and salient features are easily overlooked during feature extraction, resulting in the loss of useful information. Additionally, because the uterine cavity varies in size and thickness, the model must have high-level feature-processing capabilities to keep the segmentation effective. These are the challenges faced in our study.
U-Net is a common architecture applied in medical image segmentation tasks, and it can obtain high-quality results on a limited training set of medical images. In our paper, we combine a multi-scale attention mechanism and propose an encoder–decoder architecture, which we call EA-UNet, for uterine cavity OCT image segmentation. The main contributions of our study are:
First, we adopted the E (ECA-C) module, which replaces max pooling with a convolutional layer combined with the ECA attention mechanism [57]; this reduces the loss of feature information, enables the model to focus on the features in the region to be segmented, and suppresses irrelevant features to enhance the model’s feature-extraction ability.
Second, we improved the U-Net network structure to optimize the upsampled feature-layer channels and retain more detailed features. Then, inspired by the structure of Attention U-Net [47], we introduced the Attention Gates module to extract features containing more detailed information using global contextual information to enhance the model’s detailed segmentation effect and improve its segmentation accuracy.

2. Materials and Methods

2.1. Experimental Environment and Data

We used Windows 10 Professional, 32 GB RAM, and an NVIDIA RTX 3080 with 10 GB video memory as our experimental environment platform. We adopted the deep learning framework PyTorch 1.8.1 with CUDA 11.1 and Python 3.8.5. Our experimental batch size was 2, with 90 rounds of training iterations. Our optimizer was RMSprop, and we set the initial learning rate and weight decay factor to 0.001 and 1 × 10⁻⁸, respectively.
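For illustration, a minimal PyTorch sketch of this training configuration is given below; the placeholder model and variable names are assumptions, and only the optimizer and hyperparameter values reflect the settings stated above.

```python
import torch
import torch.nn as nn

# Placeholder network standing in for EA-UNet; only the optimizer and
# hyperparameter values below reflect the configuration reported in the text
# (RMSprop, initial learning rate 0.001, weight decay 1e-8, batch size 2, 90 epochs).
model = nn.Conv2d(1, 1, kernel_size=3, padding=1)

optimizer = torch.optim.RMSprop(model.parameters(), lr=1e-3, weight_decay=1e-8)
batch_size = 2
num_epochs = 90
```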
We carried out all procedures involving experimental animals in accordance with the protocols approved by the animal study committee of the Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences. We purchased four healthy female New Zealand rabbits (4–4.5 kg) from Kangda Biological Co., Ltd. (Qingdao, China). We maintained all animals under a 12/12 light/dark cycle at 21–24 °C with a relative humidity of 40–60% and fed them rabbit chow and water ad libitum [22]. According to studies on ethanol-induced apoptosis [58,59], an injection of 95% ethanol into the uterine horn during general anesthesia causes endometrial damage, and the extent of the damage depends on the residence time of ethanol in the uterine horn. To establish a rabbit model of endometrial injury, we injected 95% ethanol into the rabbits’ left uterine horns for 5 or 10 min, then withdrew the ethanol and slowly rinsed the uterine horn with saline to remove residual ethanol before injecting an equal amount of saline into the contralateral uterine horn. The dataset we used consists of clinical data from endometrial images of rabbits collected by the Shenzhen Institute of Advanced Technology of the Chinese Academy of Sciences using medical OCT equipment [22]. Each image in the dataset has a corresponding label image; each label image was manually labeled by two experts with clinical experience to serve as the Ground Truth segmentation standard and was then reviewed and checked by senior experts in the field to correct missing or incorrect labels where appropriate.

2.2. Data Augmentation

First, we performed an initial image screening and quality assessment of the dataset images; then, we preprocessed each OCT image in the dataset before the model training and testing experiments. We cropped irrelevant areas and resized the images to an appropriate size while preserving the main feature information of the OCT images; all training-set images were then shuffled and subjected to random flipping and other augmentation operations before being imported into the training model. After processing, the OCT dataset consisted of a total of 1347 images, including 1007 images of a normal uterine cavity and 340 images of a damaged uterine cavity; we then divided the dataset into training and test sets at a ratio of 8:2.
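The sketch below illustrates one possible way to implement this preprocessing and the 8:2 split with torchvision; the crop size, the specific flip transform, and the helper function are assumptions (in practice the same geometric transform must be applied jointly to each image and its label).

```python
import random
from torchvision import transforms

# Illustrative preprocessing pipeline; the text only specifies cropping,
# shuffling, random flipping, and an 8:2 train/test division of the 1347 images.
train_transform = transforms.Compose([
    transforms.CenterCrop(512),              # crop away irrelevant border regions (size assumed)
    transforms.RandomHorizontalFlip(p=0.5),  # random flip augmentation
    transforms.ToTensor(),
])

def split_dataset(image_paths, train_ratio=0.8, seed=0):
    """Shuffle the image list and divide it into training and test subsets (8:2)."""
    paths = list(image_paths)
    random.Random(seed).shuffle(paths)
    n_train = int(len(paths) * train_ratio)
    return paths[:n_train], paths[n_train:]
```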

2.3. Evaluation Metrics

In order to evaluate the proposed model’s segmentation effect, we used MIoU, Sensitivity, and Specificity as OCT image segmentation evaluation metrics; the calculation formulas are:
$$\mathrm{MIoU}=\frac{1}{k+1}\sum_{i=0}^{k}\frac{TP}{TP+FP+FN}$$
$$\mathrm{Sensitivity}=\frac{TP}{TP+FN}$$
$$\mathrm{Specificity}=\frac{TN}{TN+FP}$$
where k is the total number of categories of segmentation. TP denotes true positive, which is the number of pixels inside the predicted region and inside the true-labeled region; TN denotes true negative, which is the number of pixels outside the predicted region and the true-labeled region; FP denotes false positive, which is the number of pixels inside the predicted region and outside the true-labeled region; FN denotes false negative, which is the number of pixels outside the predicted region and inside the true-labeled region.
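A minimal implementation of these three metrics for binary masks (k + 1 = 2 classes) might look as follows; the function name and the small epsilon added for numerical safety are our own choices.

```python
import numpy as np

def segmentation_metrics(pred, target, eps=1e-8):
    """Compute MIoU (foreground/background average), Sensitivity, and Specificity
    from binary prediction and ground-truth masks (values in {0, 1})."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    tp = np.logical_and(pred, target).sum()       # inside prediction and inside label
    tn = np.logical_and(~pred, ~target).sum()     # outside prediction and outside label
    fp = np.logical_and(pred, ~target).sum()      # inside prediction, outside label
    fn = np.logical_and(~pred, target).sum()      # outside prediction, inside label

    iou_fg = tp / (tp + fp + fn + eps)            # IoU of the segmented (foreground) class
    iou_bg = tn / (tn + fp + fn + eps)            # IoU of the background class (roles of TP/TN swap)
    miou = (iou_fg + iou_bg) / 2.0                # mean over the k + 1 = 2 classes

    sensitivity = tp / (tp + fn + eps)
    specificity = tn / (tn + fp + eps)
    return miou, sensitivity, specificity

if __name__ == "__main__":
    pred = np.random.randint(0, 2, (256, 256))
    gt = np.random.randint(0, 2, (256, 256))
    print(segmentation_metrics(pred, gt))
```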

2.4. EA-UNet Network Model

Our proposed multi-scale attention EA-UNet network is built by introducing the proposed ECA-C and Attention Gates modules into the traditional U-Net network, as shown in Figure 1. The EA-UNet is an end-to-end segmentation model consisting of encoding and decoding regions. First, adding the ECA-C module to the encoding region makes the model focus more on the region of interest; second, we combined a convolutional layer (Conv) with the ECA attention mechanism in place of max pooling to extract more image feature information. Then, we introduced the Attention Gates module into the skip connections that link the corresponding encoding and decoding areas to prevent the loss of image feature information, improving the model’s feature-extraction ability and enriching the images’ feature information. Finally, we changed the number of channels in each layer of the decoding area to recover as many image features as possible and output the segmented image.
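The wiring described above can be sketched for a single encoder/decoder level as follows; the ECA-C and Attention Gates blocks are passed in as placeholders here (concrete sketches follow in Sections 2.5 and 2.6), and all channel counts are illustrative assumptions rather than the exact EA-UNet configuration.

```python
import torch
import torch.nn as nn

def conv_block(cin, cout):
    """Two 3x3 conv + BN + ReLU layers, as in a standard U-Net stage."""
    return nn.Sequential(
        nn.Conv2d(cin, cout, 3, padding=1), nn.BatchNorm2d(cout), nn.ReLU(inplace=True),
        nn.Conv2d(cout, cout, 3, padding=1), nn.BatchNorm2d(cout), nn.ReLU(inplace=True),
    )

class MiniEAUNet(nn.Module):
    """One-level wiring sketch: ECA-C downsamples the encoder feature in place of
    max pooling, and an Attention Gate re-weights the skip feature before fusion."""
    def __init__(self, eca_c, attention_gate, in_ch=1, base=32):
        super().__init__()
        self.enc1 = conv_block(in_ch, base)
        self.down = eca_c                      # ECA-C block: channel attention + strided conv
        self.enc2 = conv_block(base, base * 2)
        self.up = nn.ConvTranspose2d(base * 2, base, 2, stride=2)
        self.gate = attention_gate             # Attention Gate on the skip connection
        self.dec = conv_block(base * 2, base)
        self.head = nn.Conv2d(base, 1, 1)      # per-pixel logits

    def forward(self, x):
        s = self.enc1(x)                       # shallow skip feature
        d = self.enc2(self.down(s))            # deep feature after ECA-C downsampling
        g = self.up(d)                         # upsampled gating feature
        s = self.gate(g, s)                    # attention-gated skip feature
        return self.head(self.dec(torch.cat([g, s], dim=1)))

# Stubs only to make the wiring runnable; the real modules are sketched below.
down_stub = nn.Conv2d(32, 32, 3, stride=2, padding=1)
gate_stub = lambda g, s: s
net = MiniEAUNet(down_stub, gate_stub)
print(net(torch.randn(1, 1, 64, 64)).shape)    # -> torch.Size([1, 1, 64, 64])
```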

2.5. ECA-C Module

The ECA-C module is mainly an improvement of the SE-Net [60] module: it changes only a small number of parameters but achieves considerable performance gains. It adopts a local cross-channel interaction strategy without dimensionality reduction and adaptively selects the size of the one-dimensional convolution kernel, thereby improving performance. Avoiding dimensionality reduction is important for learning channel attention, and by properly capturing local cross-channel interactions, one can considerably reduce model complexity while maintaining performance. Therefore, the module can be efficiently implemented by a one-dimensional convolution with a local cross-channel interaction strategy and without dimensionality reduction. According to our experiments, the ECA-C module, shown in Figure 2, is efficient and feasible.
We chose the ECA module to balance computational performance and model complexity. Here, Wk denotes the learned channel attention; it involves k × C parameters and avoids complete independence between different channel groups. For the weight wi of channel yi, we only consider the information interaction between yi and its k neighbors, i.e.,
$$w_i=\sigma\left(\sum_{j=1}^{k} w_i^{j}\, y_i^{j}\right),\quad y_i^{j}\in\Omega_i^{k}$$
The equation captures local cross-channel interactions, and this locality constraint avoids interactions across all channels, thus allowing for higher model efficiency. To further reduce model complexity and improve efficiency, it is also possible to have all channels share weight information, i.e.,
$$w_i=\sigma\left(\sum_{j=1}^{k} w^{j}\, y_i^{j}\right),\quad y_i^{j}\in\Omega_i^{k}$$
In addition, the ECA module can implement information interactions between channels through a one-dimensional convolution with a convolution kernel size of K.
$$w=\sigma\left(\mathrm{C1D}_k(y)\right)$$
Here, C1D stands for one-dimensional convolution, which involves only k parameters. This way of capturing local cross-channel interactions ensures both performance and model efficiency.
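A possible PyTorch sketch of the ECA-C block is given below; the adaptive kernel-size rule follows the ECA-Net paper [57], while the strided 3 × 3 convolution used for downsampling in place of max pooling is an assumption based on the description above.

```python
import math
import torch
import torch.nn as nn

class ECAC(nn.Module):
    """Sketch of an ECA-C block: efficient channel attention (1D convolution over
    channel descriptors) followed by a strided convolution replacing max pooling.
    Kernel-size heuristic and the downsampling convolution are assumptions."""

    def __init__(self, channels, gamma=2, b=1):
        super().__init__()
        # Adaptive 1D kernel size k derived from the channel count (ECA-Net heuristic).
        t = int(abs((math.log2(channels) + b) / gamma))
        k = t if t % 2 else t + 1
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.conv1d = nn.Conv1d(1, 1, kernel_size=k, padding=k // 2, bias=False)
        self.sigmoid = nn.Sigmoid()
        # Strided 3x3 convolution used for downsampling instead of max pooling.
        self.down = nn.Conv2d(channels, channels, kernel_size=3, stride=2, padding=1)

    def forward(self, x):
        # Channel descriptor y: (B, C, 1, 1) -> (B, 1, C) for the 1D convolution.
        y = self.avg_pool(x)
        y = self.conv1d(y.squeeze(-1).transpose(-1, -2)).transpose(-1, -2).unsqueeze(-1)
        w = self.sigmoid(y)          # channel attention weights w = sigma(C1D_k(y))
        x = x * w                    # re-weight channels
        return self.down(x)          # downsample with a learned convolution

if __name__ == "__main__":
    x = torch.randn(2, 64, 128, 128)
    print(ECAC(64)(x).shape)         # -> torch.Size([2, 64, 64, 64])
```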

2.6. Attention Gates Module

The AG module can automatically respond to feature regions without explicitly localizing them while modeling the positional relationships of global feature regions; it suppresses irrelevant feature responses, enhances the features of similar regions, and thus extracts image features containing more detailed information to improve the model’s segmentation accuracy. As shown in Figure 3, the input deep feature map Xg and shallow feature map Xi are summed after one-dimensional convolutions to enhance the feature regions, and the feature map Tl is then obtained through the ReLU nonlinear activation function. We apply a further one-dimensional convolution to Tl to reduce the computational effort and obtain the weight map α (whose element values lie in [0, 1]) by resampling after processing with the Sigmoid activation function. α is then multiplied with the feature map Xi to obtain xl, which enhances the image feature representation and attenuates the non-feature responses.
The module uses the semantic information in the deep feature map to enhance the feature weights in the shallow feature map, thus adding more details to the shallow feature map to enhance the model’s learning ability for the segmentation region, further improving the model’s segmentation accuracy. The calculation method is as follows:
$$T_l=\sigma_1\left(W_x^{T} x_i + W_g^{T} m(g_i) + b_1\right)$$
$$\alpha_i=\sigma_2\left(W^{T} T_l + b_2\right)$$
$$x_l = x_i \cdot \alpha_i$$
where Xi and Xg are the shallow and deep feature maps, respectively; Wx, Wg, and WT are linear transformation parameters; b1 and b2 are bias terms; and σ1 and σ2 are the ReLU and Sigmoid activation functions, respectively.
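The computation above can be sketched as a PyTorch module as follows; the use of 1 × 1 convolutions for the linear transformations Wx, Wg, and WT, and the channel sizes, are assumptions consistent with Attention U-Net [47].

```python
import torch
import torch.nn as nn

class AttentionGate(nn.Module):
    """Sketch of the Attention Gates module: the deep (gating) feature g and the
    shallow skip feature x are linearly mapped by 1x1 convolutions (W_g, W_x),
    summed, passed through ReLU (sigma_1), then through a 1x1 convolution (W^T)
    and Sigmoid (sigma_2) to produce the attention map alpha, which re-weights x."""

    def __init__(self, f_g, f_l, f_int):
        super().__init__()
        self.w_g = nn.Conv2d(f_g, f_int, kernel_size=1, bias=True)   # W_g (bias absorbs b_1)
        self.w_x = nn.Conv2d(f_l, f_int, kernel_size=1, bias=True)   # W_x
        self.psi = nn.Conv2d(f_int, 1, kernel_size=1, bias=True)     # W^T (bias absorbs b_2)
        self.relu = nn.ReLU(inplace=True)       # sigma_1
        self.sigmoid = nn.Sigmoid()             # sigma_2

    def forward(self, g, x):
        t = self.relu(self.w_g(g) + self.w_x(x))   # T_l
        alpha = self.sigmoid(self.psi(t))           # alpha in [0, 1]
        return x * alpha                            # gated skip feature x_l

if __name__ == "__main__":
    g = torch.randn(1, 64, 64, 64)   # upsampled deep (gating) feature
    x = torch.randn(1, 64, 64, 64)   # shallow skip feature
    print(AttentionGate(64, 64, 32)(g, x).shape)    # -> torch.Size([1, 64, 64, 64])
```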

2.7. Loss Function

The binary cross-entropy loss function can be applied to most pixel-level segmentation tasks; however, it can mislead the model when the number of target pixels is far smaller than the number of background pixels [61]. We therefore use the BCEWithLogitsLoss loss function, which integrates the Sigmoid layer into the binary cross-entropy loss; this is numerically more stable than using a separate Sigmoid layer followed by BCELoss. The formula is as follows:
$$L_{\mathrm{BCE}}=-\sum_{i=1}^{N}\left[y_i\log p(y_i)+(1-y_i)\log\left(1-p(y_i)\right)\right]$$
where yi is the true class label of pixel i and p(yi) is the predicted probability for that pixel.
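In PyTorch, this corresponds to the following minimal usage sketch; the tensor shapes are illustrative.

```python
import torch
import torch.nn as nn

# BCEWithLogitsLoss fuses the Sigmoid with the binary cross-entropy loss above,
# which is numerically more stable than applying Sigmoid and BCELoss separately.
criterion = nn.BCEWithLogitsLoss()

logits = torch.randn(2, 1, 256, 256)                      # raw network outputs (no Sigmoid applied)
labels = torch.randint(0, 2, (2, 1, 256, 256)).float()    # binary ground-truth masks
loss = criterion(logits, labels)
print(loss.item())
```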

3. Results

3.1. Ablation Experiment

The ECA-C module and AG module performances were verified by comparing the segmentation effects of U-Net, Attention + UNet, ECA-C + UNet, and EA-UNet on OCT images (Table 1; bolded indicates the best results).
The experimental results in Table 1 show that adding the Attention and ECA-C modules to U-Net improves the original U-Net’s segmentation performance for uterine cavity OCT images in two metrics, MIoU and Sensitivity; however, the Specificity metric decreases. Our proposed EA-UNet model, which combines the ECA-C and Attention modules, achieves the best segmentation performance on uterine cavity OCT images.
We visualized the segmentation results of each module in Table 1 to more clearly show the advantages of using the EA module to segment image details. Figure 4 shows the segmentation results of the U-Net model with different Attention modules from Table 1. Four randomly selected OCT images from the test set are presented, where column 1 is the original OCT image, Ground Truth denotes the true label, and columns 2 to 5 denote the segmentation results of each U-Net, Attention + UNet, ECA-C + UNet, and EA-UNet model, respectively.
The visualization of each model’s segmentation results in Figure 4 shows that the U-Net model has the weakest segmentation ability, with obvious problems of blurred boundaries and incomplete image segmentation. The ECA-C + UNet and Attention + UNet models produce incomplete segmentation results with rough boundaries, and some detailed regions are ignored. The EA-UNet model, on the other hand, achieves the best segmentation performance, segments the regions effectively, and produces results closest to the Ground Truth.

3.2. Experiments Comparing EA-UNet Model and Other Attention Methods

To verify the proposed EA-UNet model’s effectiveness in uterine cavity OCT image segmentation, we compared it experimentally with other attention modules (Table 2; bolded indicates the best results).
Table 2 shows that the proposed EA-UNet model achieves optimal results in uterine cavity OCT image segmentation compared with other attention modules. The SCSE attention module not only fails to improve the segmentation performance of U-Net but actually degrades it. Although the SE and CBAM attention mechanisms improve the MIoU metric of U-Net, their Specificity is inferior to that of the original U-Net model. Therefore, the proposed EA-UNet model can achieve effective segmentation of uterine cavity OCT images.
To demonstrate the proposed EA-UNet model’s experimental effects compared with other attention models, we randomly selected four images from the test set to visualize the segmentation results. Column 1 shows the four original OCT images from the test set, Ground Truth indicates the true label, and columns 2 to 6 show the U-Net, SCSE + UNet, SE + UNet, CBAM + UNet, and EA-UNet models’ segmentation results, respectively, as shown in Figure 5.
The visualization of the segmentation results of each model in Figure 5 shows that the SCSE + UNet model has the worst segmentation performance, with blurred results. The SE + UNet and U-Net models produce incomplete segmentation results with rough boundaries, and some detailed regions are ignored. The CBAM + UNet model’s segmentation results are second only to those of the EA-UNet model. Compared with the other attention modules, the EA-UNet model has better feature-extraction ability and learning potential because of the E (ECA-C) module, which replaces max pooling with a convolutional layer combined with the ECA attention mechanism; this reduces the loss of feature information, makes the model focus on the features of the region to be segmented, and suppresses irrelevant features, ensuring high-quality detail segmentation. As a result, EA-UNet segments the regions effectively and its results are closest to the Ground Truth. In addition, we introduced the A (Attention Gates) module to improve the model’s use of global contextual information, thereby improving its segmentation accuracy. Meanwhile, the loss-function fitting curves shown in Figure 6 are derived from our experimental results. Compared with the other four models trained under the same experimental conditions, our proposed method decreases the loss value faster with little fluctuation during training, and the final loss is smaller, meaning the model converges faster during training, which indicates that the method has good robustness.

3.3. Experiments Comparing EA-UNet Model and Other Methods

To further validate the proposed EA-UNet model’s ability to segment features in the uterine cavity OCT image region, we experimentally compared it with U-Net, SegNet, DeepLabv3+, AGNet, and UNet++ (Table 3; bolded indicates the best results).
The experimental results in Table 3 show that the proposed EA-UNet model achieves higher MIoU, Sensitivity, and Specificity than the comparison models. In uterine cavity OCT image segmentation, the SegNet model performs the worst, followed by DeepLabv3+ and UNet++.
To more clearly demonstrate that our proposed EA-UNet model has a more detailed segmentation effect compared with other methods, we randomly selected four images from the test set to visualize the segmentation results. Column 1 is the four original OCT images from the test set, Ground Truth indicates the true label, and columns 2 to 7 are the segmentation results of U-Net, SegNet, DeepLabv3+, AGNet, UNet++, and EA-UNet, respectively, as shown in Figure 7.
Figure 7 shows that the EA-UNet model’s detail segmentation is better than that of all the other models. The model enhances the processing of feature information, segments the regions effectively, and produces results closest to the Ground Truth. The SegNet model has the worst segmentation performance, with a serious loss of feature information. The DeepLabv3+, AGNet, and U-Net models produce incomplete and blurred segmentation results and ignore some boundary-detail regions. The UNet++ model’s segmentation results are second only to those of our proposed EA-UNet model. In summary, our analysis shows that our proposed EA-UNet model is the most effective in uterine cavity OCT image segmentation. Furthermore, we derived the loss-function fitting curves shown in Figure 8 from our experimental results. Comparing the training processes of the six models, we observed that the EA-UNet model decreases the loss value faster and reaches a smaller loss during training. We conclude that the improved EA-UNet model is more robust than the other five network models, and the experimental results also prove that the EA-UNet model’s segmentation performance is better than that of the other five network models.

4. Discussion

Our proposed EA-UNet model includes the E and A modules. The E module enables the model to focus on the features in the region to be segmented and to suppress irrelevant features and extract multi-scale feature information. The A module uses global contextual information as a way to extract the detailed features of uterine cavity OCT images and improve the model’s segmentation accuracy.
The aim of our study was to explore a real-time and accurate OCT image segmentation method for the uterine cavity. We used our proposed model for experiments on the OCT image segmentation of normal and damaged uterine cavities. We verified the performance of the ECA-C and Attention Gates modules in the EA-UNet model through ablation experiments. We verified the effectiveness of our proposed EA-UNet model in segmenting the uterine cavity OCT image region by comparing it experimentally with other attention models. Meanwhile, we analyzed the loss-function fitting curve and found that the EA-UNet model decreases the loss value faster, and reaches a smaller loss, during the training process compared with the other models. This indicates that the EA-UNet model has high robustness. Additionally, the EA-UNet model’s segmentation time for a single image is 50.91 ms, which also shows the model’s real-time performance.
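For reference, per-image inference time can be measured with a simple sketch such as the one below; the placeholder model, the input size, and the timing procedure are assumptions, not the exact setup used to obtain the reported 50.91 ms.

```python
import time
import torch
import torch.nn as nn

# Placeholder model standing in for the trained EA-UNet; image size is assumed.
model = nn.Conv2d(1, 1, kernel_size=3, padding=1).eval()
image = torch.randn(1, 1, 256, 256)

with torch.no_grad():
    start = time.perf_counter()
    _ = model(image)
    if torch.cuda.is_available():
        torch.cuda.synchronize()   # wait for any GPU kernels before stopping the timer
    elapsed_ms = (time.perf_counter() - start) * 1000.0

print(f"single-image segmentation time: {elapsed_ms:.2f} ms")
```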
Nevertheless, the EA-UNet model’s segmentation performance still has much room for improvement, and we will continue to optimize the network structure to improve our model’s performance. Further exploration is needed in terms of model size and loss-function construction to improve this model’s accuracy for the segmentation of uterine cavity OCT images, as well as its real-time performance. In OCT images, the damaged uterus’s surface appears discontinuous, whereas the normal uterus’s surface is continuous and smooth. Based on deep learning OCT image processing, the uterine cavity can be segmented accurately, which helps to analyze uterine cavity structure information, including thickness changes, which is important for the diagnosis of uterine cavity lesions. In future research, we will extend the proposed segmentation model to the task of grading uterine cavity OCT images by damage level. This will help physicians make fast and accurate diagnoses.

5. Conclusions

In our study, we proposed a new EA-UNet model for the automatic segmentation of uterine cavity OCT images. First, the model uses the E (ECA-C) module, which introduces a convolutional layer on the attention mechanism ECA instead of max pool to enhance the model’s feature-extraction capability, meaning the model focuses on the features in the region to be segmented and suppresses irrelevant features. In addition, we introduced the A (Attention Gates) module to utilize global contextual information as a way to extract the detailed features of the uterine cavity OCT images and improve the model’s segmentation accuracy. Finally, the experimental results showed that the proposed EA-UNet model’s MIoU, Sensitivity, and Specificity indexes are 0.9379, 0.9457, and 0.9908, respectively. We experimentally tested the performance of the ECA-C and AG modules and verified that the proposed EA-UNet model results in a better segmentation of uterine cavity OCT images; therefore, it can achieve the end-to-end automatic and accurate segmentation of uterine cavity OCT images.

Author Contributions

Conceptualization, Z.X., M.D. and J.L.; methodology, Z.X., M.D. and J.L.; software, Z.X.; validation, Z.X., M.D. and J.L.; formal analysis, M.D., X.G. and Z.C.; investigation, E.S. and J.Z.; resources, X.G.; data curation, Z.C.; writing—original draft preparation, Z.X. and J.L.; writing—review and editing, Z.X. and M.D.; visualization, Z.X.; supervision, J.Z., X.G. and Z.C.; project administration, X.G. and Z.C.; funding acquisition, X.G. and Z.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key R&D Program of China (2019YFE0110400), National Natural Science Foundation of China (81971621, 82102087, 82102054), Key R&D Program of Hunan Province (2021SK2035), Natural Science Foundation of Hunan (2022JJ30039, 2022JJ40392), and Clinical Research 4310 Program of the First Affiliated Hospital of The University of South China (4310-2021-K06).

Institutional Review Board Statement

The animal study protocol was approved by the Experimental Animal Management and Use Committee of the Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences (protocol code SIAT-IACUC-190221-YGS-ZJK-A0613, approved on 11 March 2021).

Informed Consent Statement

Not applicable.

Data Availability Statement

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Huang, D.; Swanson, E.A.; Lin, C.P.; Schuman, J.S.; Stinson, W.G.; Chang, W.; Hee, M.R.; Flotte, T.; Gregory, K.; Puliafito, C.A. Optical coherence tomography. Science 1991, 254, 1178–1181. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Drexler, W.; Fujimoto, J.G. State-of-the-art retinal optical coherence tomography. Prog. Retin. Eye Res. 2008, 27, 45–88. [Google Scholar] [CrossRef] [PubMed]
  3. Beaurepaire, E.; Boccara, A.C.; Lebec, M.; Blanchot, L.; Saint-Jalmes, H. Full-field optical coherence microscopy. Opt. Lett. 1998, 23, 244–246. [Google Scholar] [CrossRef] [PubMed]
  4. Brezinski, M.E.; Tearney, G.J.; Weissman, N.; Boppart, S.; Bouma, B.; Hee, M.; Weyman, A.; Swanson, E.; Southern, J.; Fujimoto, J. Assessing atherosclerotic plaque morphology: Comparison of optical coherence tomography and high frequency intravascular ultrasound. Heart 1997, 77, 397–403. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Fujimoto, J.; Boppart, S.A.; Tearney, G.; Bouma, B.E.; Pitris, C.; Brezinski, M.E. High resolution in vivo intra-arterial imaging with optical coherence tomography. Heart 1999, 82, 128–133. [Google Scholar] [CrossRef] [Green Version]
  6. Jang, I.-K.; Bouma, B.E.; Kang, D.-H.; Park, S.-J.; Park, S.-W.; Seung, K.-B.; Choi, K.-B.; Shishkov, M.; Schlendorf, K.; Pomerantsev, E. Visualization of coronary atherosclerotic plaques in patients using optical coherence tomography: Comparison with intravascular ultrasound. J. Am. Coll. Cardiol. 2002, 39, 604–609. [Google Scholar] [CrossRef] [Green Version]
  7. Liu, R.; Zhang, Y.; Zheng, Y.; Liu, Y.; Zhao, Y.; Yi, L. Automated detection of vulnerable plaque for intravascular optical coherence tomography images. Cardiovasc. Eng. Technol. 2019, 10, 590–603. [Google Scholar] [CrossRef]
  8. Li, X.; Boppart, S.; Van Dam, J.; Mashimo, H.; Mutinga, M.; Drexler, W.; Klein, M.; Pitris, C.; Krinsky, M.; Brezinski, M.E. Optical coherence tomography: Advanced technology for the endoscopic imaging of Barrett’s esophagus. Endoscopy 2000, 32, 921–930. [Google Scholar] [CrossRef]
  9. Qi, X.; Pan, Y.; Sivak, M.V.; Willis, J.E.; Isenberg, G.; Rollins, A.M. Image analysis for classification of dysplasia in Barrett’s esophagus using endoscopic optical coherence tomography. Biomed. Opt. Express 2010, 1, 825–847. [Google Scholar] [CrossRef] [Green Version]
  10. Tsai, T.-H.; Zhou, C.; Tao, Y.K.; Lee, H.-C.; Ahsen, O.O.; Figueiredo, M.; Kirtane, T.; Adler, D.C.; Schmitt, J.M.; Huang, Q. Structural markers observed with endoscopic 3-dimensional optical coherence tomography correlating with Barrett’s esophagus radiofrequency ablation treatment response (with videos). Gastrointest. Endosc. 2012, 76, 1104–1112. [Google Scholar] [CrossRef]
  11. Sergeev, A.M.; Gelikonov, V.; Gelikonov, G.; Feldchtein, F.I.; Kuranov, R.; Gladkova, N.; Shakhova, N.; Snopova, L.; Shakhov, A.; Kuznetzova, I. In vivo endoscopic OCT imaging of precancer and cancer states of human mucosa. Opt. Express 1997, 1, 432–440. [Google Scholar] [CrossRef] [PubMed]
  12. Tearney, G.J.; Brezinski, M.E.; Bouma, B.E.; Boppart, S.A.; Pitris, C.; Southern, J.F.; Fujimoto, J.G. In vivo endoscopic optical biopsy with optical coherence tomography. Science 1997, 276, 2037–2039. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Shen, B.; Zuccaro, G., Jr.; Gramlich, T.L.; Gladkova, N.; Trolli, P.; Kareta, M.; Delaney, C.P.; Connor, J.T.; Lashner, B.A.; Bevins, C.L. In vivo colonoscopic optical coherence tomography for transmural inflammation in inflammatory bowel disease. Clin. Gastroenterol. Hepatol. 2004, 2, 1080–1087. [Google Scholar] [CrossRef]
  14. Testoni, P.A.; Mangiavillano, B. Optical coherence tomography in detection of dysplasia and cancer of the gastrointestinal tract and bilio-pancreatic ductal system. World J. Gastroenterol. WJG 2008, 14, 6444. [Google Scholar] [CrossRef] [PubMed]
  15. Matsuoka, Y.; Takahashi, A.; Kumamoto, E.; Morita, Y.; Kutsumi, H.; Azuma, T.; Kuroda, K. High-resolution MR imaging of gastrointestinal tissue by intracavitary RF coil with remote tuning and matching technique for integrated MR-endoscope system. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Osaka, Japan, 3–7 July 2013; pp. 5706–5710. [Google Scholar]
  16. Feldchtein, F.I.; Gelikonov, G.; Gelikonov, V.; Kuranov, R.; Sergeev, A.M.; Gladkova, N.; Shakhov, A.; Shakhova, N.; Snopova, L.; Terent’eva, A. Endoscopic applications of optical coherence tomography. Opt. Express 1998, 3, 257–270. [Google Scholar] [CrossRef]
  17. Boppart, S.; Goodman, A.; Libus, J.; Pitris, C.; Jesser, C.; Brezinski, M.E.; Fujimoto, J. High resolution imaging of endometriosis and ovarian carcinoma with optical coherence tomography: Feasibility for laparoscopic-based imaging. BJOG Int. J. Obstet. Gynaecol. 1999, 106, 1071–1077. [Google Scholar] [CrossRef] [Green Version]
  18. Jesser, C.; Boppart, S.; Pitris, C.; Stamper, D.L.; Nielsen, G.P.; Brezinski, M.E.; Fujimoto, J. High resolution imaging of transitional cell carcinoma with optical coherence tomography: Feasibility for the evaluation of bladder pathology. Br. J. Radiol. 1999, 72, 1170–1176. [Google Scholar] [CrossRef]
  19. Zagaynova, E.V.; Streltsova, O.S.; Gladkova, N.D.; Snopova, L.B.; Gelikonov, G.V.; Feldchtein, F.I.; Morozov, A.N. In vivo optical coherence tomography feasibility for bladder disease. J. Urol. 2002, 167, 1492–1496. [Google Scholar] [CrossRef] [PubMed]
  20. Manyak, M.J.; Gladkova, N.D.; Makari, J.H.; Schwartz, A.M.; Zagaynova, E.V.; Zolfaghari, L.; Zara, J.M.; Iksanov, R.; Feldchtein, F.I. Evaluation of superficial bladder transitional-cell carcinoma by optical coherence tomography. J. Endourol. 2005, 19, 570–574. [Google Scholar] [CrossRef] [PubMed]
  21. Hariri, L.P.; Bonnema, G.T.; Schmidt, K.; Winkler, A.M.; Korde, V.; Hatch, K.D.; Davis, J.R.; Brewer, M.A.; Barton, J.K. Laparoscopic optical coherence tomography imaging of human ovarian cancer. Gynecol. Oncol. 2009, 114, 188–194. [Google Scholar] [CrossRef]
  22. Zhang, J.; Du, M.; Fang, J.; Lv, S.; Lou, W.; Xie, Z.; Chen, Z.; Gong, X. In vivo evaluation of endometrium through dual-modality intrauterine endoscopy. Biomed. Opt. Express 2022, 13, 2554–2565. [Google Scholar] [CrossRef] [PubMed]
  23. Doi, K. Computer-aided diagnosis in medical imaging: Historical review, current status and future potential. Comput. Med. Imaging Graph. 2007, 31, 198–211. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Asiri, N.; Hussain, M.; Al Adel, F.; Alzaidi, N. Deep learning based computer-aided diagnosis systems for diabetic retinopathy: A survey. Artif. Intell. Med. 2019, 99, 101701. [Google Scholar] [CrossRef] [Green Version]
  25. Koprowski, R.; Teper, S.; Wróbel, Z.; Wylegala, E. Automatic analysis of selected choroidal diseases in OCT images of the eye fundus. Biomed. Eng. Online 2013, 12, 117. [Google Scholar] [CrossRef] [Green Version]
  26. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
  27. Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3431–3440. [Google Scholar]
  28. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  29. Greenspan, H.; Van Ginneken, B.; Summers, R.M. Guest editorial deep learning in medical imaging: Overview and future promise of an exciting new technique. IEEE Trans. Med. Imaging 2016, 35, 1153–1159. [Google Scholar] [CrossRef]
  30. Havaei, M.; Davy, A.; Warde-Farley, D.; Biard, A.; Courville, A.; Bengio, Y.; Pal, C.; Jodoin, P.-M.; Larochelle, H. Brain tumor segmentation with Deep Neural Networks. Med. Image Anal. 2017, 35, 18–31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Shu, L.; Yaozong, G.; Aytekin, O.; Dinggang, S. Representation learning: A unified deep learning framework for automatic prostate MR segmentation. In Medical Image Computing and Computer-Assisted Intervention: MICCAI, Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention 2013, Nagoya, Japan, 22–26 September 2013; Springer: Berlin/Heidelberg, Germany, 2013; pp. 254–261. [Google Scholar]
  32. Liu, C.; Jiao, D.; Liu, Z. Artificial intelligence (AI)-aided disease prediction. Bio Integr. 2020, 1, 130–136. [Google Scholar] [CrossRef]
  33. Mousa, M.; Xian, D.; Tianxiao, H.; Yu, C. Feasibility of the soft attention-based models for automatic segmentation of OCT kidney images. Biomed. Opt. Express 2022, 13, 2728–2738. [Google Scholar]
  34. Liu, W.; Sun, Y.; Ji, Q. MDAN-UNet: Multi-Scale and Dual Attention Enhanced Nested U-Net Architecture for Segmentation of Optical Coherence Tomography Images. Algorithms 2020, 13, 60. [Google Scholar] [CrossRef] [Green Version]
  35. Leyuan, F.; David, C.; Chong, W.; Guymer, R.H.; Shutao, L.; Sina, F. Automatic segmentation of nine retinal layer boundaries in OCT images of non-exudative AMD patients using deep learning and graph search. Biomed. Opt. Express 2017, 8, 2732–2744. [Google Scholar]
  36. Jie, W.; Hormel, T.T.; Liqin, G.; Pengxiao, Z.; Yukun, G.; Xiaogang, W.; Bailey, S.T.; Yali, J. Automated diagnosis and segmentation of choroidal neovascularization in OCT angiography using deep learning. Biomed. Opt. Express 2020, 11, 927–944. [Google Scholar]
  37. Abhay, S.; Leixin, Z.; Abrámoff, M.D.; Xiaodong, W. Multiple surface segmentation using convolution neural nets: Application to retinal layer segmentation in OCT images. Biomed. Opt. Express 2018, 9, 4509–4526. [Google Scholar]
  38. Minghui, C.; Wenfei, M.; Linfang, S.; Manqi, L.; Cheng, W.; Gang, Z. Multiscale dual attention mechanism for fluid segmentation of optical coherence tomography images. Appl. Opt. 2021, 60, 6761–6768. [Google Scholar]
  39. Aranha, D.S.V.; Leopold, S.; Hannes, S.; Martin, P.; Alina, M.; Gerald, S.; Gerhard, G.; Werkmeister, R.M. CorneaNet: Fast segmentation of cornea OCT scans of healthy and keratoconic eyes using deep learning. Biomed. Opt. Express 2019, 10, 622–641. [Google Scholar]
  40. Guo, C.; Szemenyei, M.; Yi, Y.; Wang, W.; Chen, B.; Fan, C. SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation. In Proceedings of the International Conference on Pattern Recognition, Milan, Italy, 10–15 January 2021. [Google Scholar]
  41. Xiang-Cong, X.; Jun-Yan, C.; Xue-Hua, W.; Rui, L.; Hong-Lian, X.; Wang, M.-Y.; Jun-Ping, Z.; Hai-Shu, T.; Yi-Xu, Z.; Xiong, K.; et al. Precise segmentation of choroid layer in diabetic retinopathy fundus OCT images by using SECUNet. Prog. Biochem. Biophys. 2022, 49, 1–10. [Google Scholar] [CrossRef]
  42. Singh, V.K.; Kucukgoz, B.; Murphy, D.; Xiong, X.; Steel, D.; Obara, B. Benchmarking automated detection of the retinal external limiting membrane in a 3D spectral domain optical coherence tomography image dataset of full thickness macular holes. Comput. Biol. Med. 2022, 140, 105070. [Google Scholar] [CrossRef]
  43. Gao, Z.; Chung, J.; Abdelrazek, M.; Leung, S.; Hau, W.K.; Xian, Z.; Zhang, H.; Li, S. Privileged Modality Distillation for Vessel Border Detection in Intracoronary Imaging. IEEE Trans. Med. Imaging 2020, 39, 1524–1534. [Google Scholar] [CrossRef]
  44. Hesamian, M.H.; Jia, W.; He, X.; Kennedy, P. Deep learning techniques for medical image segmentation: Achievements and challenges. J. Digit. Imaging 2019, 32, 582–596. [Google Scholar] [CrossRef] [Green Version]
  45. Brehar, R.; Mitrea, D.-A.; Vancea, F.; Marita, T.; Nedevschi, S.; Lupsor-Platon, M.; Rotaru, M.; Badea, R.I. Comparison of deep-learning and conventional machine-learning methods for the automatic recognition of the hepatocellular carcinoma areas from ultrasound images. Sensors 2020, 20, 3085. [Google Scholar] [CrossRef]
  46. Devunooru, S.; Alsadoon, A.; Chandana, P.; Beg, A. Deep learning neural networks for medical image segmentation of brain tumours for diagnosis: A recent review and taxonomy. J. Ambient. Intell. Humaniz. Comput. 2021, 12, 455–483. [Google Scholar] [CrossRef]
  47. Oktay, O.; Schlemper, J.; Folgoc, L.L.; Lee, M.; Heinrich, M.; Misawa, K.; Mori, K.; McDonagh, S.; Hammerla, N.Y.; Kainz, B. Attention u-net: Learning where to look for the pancreas. arXiv 2018, arXiv:1804.03999. [Google Scholar]
  48. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef] [Green Version]
  49. Lee, C.S.; Tyring, A.J.; Deruyter, N.P.; Wu, Y.; Rokem, A.; Lee, A.Y. Deep-learning based, automated segmentation of macular edema in optical coherence tomography. Biomed. Opt. Express 2017, 8, 3440–3448. [Google Scholar] [CrossRef] [Green Version]
  50. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
  51. Devalla, S.K.; Renukanand, P.K.; Sreedhar, B.K.; Subramanian, G.; Zhang, L.; Perera, S.; Mari, J.-M.; Chin, K.S.; Tun, T.A.; Strouthidis, N.G. DRUNET: A dilated-residual U-Net deep learning network to segment optic nerve head tissues in optical coherence tomography images. Biomed. Opt. Express 2018, 9, 3244–3265. [Google Scholar] [CrossRef] [Green Version]
  52. Gorgi Zadeh, S.; Wintergerst, M.W.; Wiens, V.; Thiele, S.; Holz, F.G.; Finger, R.P.; Schultz, T. CNNs enable accurate and fast segmentation of drusen in optical coherence tomography. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support; Springer: Cham, Switzerland, 2017; pp. 65–73. [Google Scholar]
  53. Venhuizen, F.G.; van Ginneken, B.; Liefers, B.; van Asten, F.; Schreur, V.; Fauser, S.; Hoyng, C.; Theelen, T.; Sánchez, C.I. Deep learning approach for the detection and quantification of intraretinal cystoid fluid in multivendor optical coherence tomography. Biomed. Opt. Express 2018, 9, 1545–1569. [Google Scholar] [CrossRef] [Green Version]
  54. Chen, Z.; Li, D.; Shen, H.; Mo, H.; Zeng, Z.; Wei, H. Automated segmentation of fluid regions in optical coherence tomography B-scan images of age-related macular degeneration. Opt. Laser Technol. 2020, 122, 105830. [Google Scholar] [CrossRef]
  55. Ben-Cohen, A.; Mark, D.; Kovler, I.; Zur, D.; Barak, A.; Iglicki, M.; Soferman, R. Retinal layers segmentation using fully convolutional network in OCT images. RSIP Vis. 2017, 1–8. Available online: https://www.rsipvision.com/wpcontent/uploads//06/Retinal-Layers-Segmentation.pdf (accessed on 10 November 2022).
  56. Kepp, T.; Droigk, C.; Casper, M.; Evers, M.; Hüttmann, G.; Salma, N.; Manstein, D.; Heinrich, M.P.; Handels, H. Segmentation of mouse skin layers in optical coherence tomography image data using deep convolutional neural networks. Biomed. Opt. Express 2019, 10, 3484–3496. [Google Scholar] [CrossRef]
  57. Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient channel attention for deep convolutional neural networks. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020. [Google Scholar]
  58. Naghdi, S.; Slovinsky, W.S.; Madesh, M.; Rubin, E.; Hajnóczky, G. Mitochondrial fusion and Bid-mediated mitochondrial apoptosis are perturbed by alcohol with distinct dependence on its metabolism. Cell Death Dis. 2018, 9, 1028. [Google Scholar] [CrossRef] [Green Version]
  59. Zhang, S.; Sun, Y.; Jiang, D.; Chen, T.; Liu, R.; Li, X.; Lu, Y.; Qiao, L.; Pan, Y.; Liu, Y. Construction and optimization of an endometrial injury model in mice by transcervical ethanol perfusion. Reprod. Sci. 2021, 28, 693–702. [Google Scholar] [CrossRef] [PubMed]
  60. Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar]
  61. Yang, M.; Yuan, Y.; Liu, G. SDUNet: Road extraction via spatial enhanced and densely connected UNet. Pattern Recognit. 2022, 126, 108549. [Google Scholar] [CrossRef]
  62. Roy, A.G.; Navab, N.; Wachinger, C. Concurrent spatial and channel ‘squeeze & excitation’in fully convolutional networks. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Granada, Spain, 16–20 September 2018; pp. 421–429. [Google Scholar]
  63. Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
  64. Badrinarayanan, V.; Kendall, A.; Cipolla, R. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef] [PubMed]
  65. Chen, L.-C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 801–818. [Google Scholar]
  66. Zhang, S.; Fu, H.; Yan, Y.; Zhang, Y.; Wu, Q.; Yang, M.; Tan, M.; Xu, Y. Attention guided network for retinal image segmentation. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Shenzhen, China, 13–17 October 2019; pp. 797–805. [Google Scholar]
  67. Zhou, Z.; Rahman Siddiquee, M.M.; Tajbakhsh, N.; Liang, J. Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support; Springer: Cham, Switzerland, 2018; pp. 3–11. [Google Scholar]
Figure 1. EA-UNet network structure.
Figure 2. ECA-C module structure diagram.
Figure 3. Attention Gates module structure diagram.
Figure 4. Comparison of ablation experiments of each module of EA-UNet model.
Figure 5. Comparison of segmentation effects of different attention modules.
Figure 6. Comparison of changes in loss function for different attention modules.
Figure 7. Comparison of segmentation effects of different methods.
Figure 8. Comparison of change of loss function for different methods.
Table 1. Ablation experiments of effect of different modules on performance of U-Net model.

Model              MIoU     Sensitivity   Specificity
U-Net              0.8787   0.8886        0.9865
ECA-C + UNet       0.9096   0.9226        0.9831
Attention + UNet   0.9187   0.9343        0.9803
EA-UNet            0.9379   0.9457        0.9908

Table 2. Performance comparison between EA-UNet and other attention models.

Model              MIoU     Sensitivity   Specificity
U-Net [50]         0.8787   0.8886        0.9865
SCSE + UNet [62]   0.8391   0.8502        0.9830
SE + UNet [60]     0.9051   0.9222        0.9774
CBAM + UNet [63]   0.9219   0.9399        0.9776
EA-UNet            0.9379   0.9457        0.9908

Table 3. Performance comparison between EA-UNet and other methods.

Model              MIoU     Sensitivity   Specificity   Time
U-Net [50]         0.8787   0.8886        0.9865        45.85 ms
SegNet [64]        0.7998   0.8175        0.9674        40.05 ms
DeepLabv3+ [65]    0.8688   0.8823        0.9804        42.79 ms
AGNet [66]         0.8862   0.9029        0.9765        64.14 ms
UNet++ [67]        0.8727   0.8833        0.9852        51.71 ms
EA-UNet            0.9379   0.9457        0.9908        50.91 ms

