Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (709)

Search Parameters:
Keywords = mask R-CNN

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
20 pages, 29170 KB  
Article
Hyperspectral Mapping of Pasture Nitrogen Content and Metabolizable Energy in New Zealand Hill Country Grasslands
by Nitin Bhatia and Maxence Plouviez
AgriEngineering 2026, 8(5), 170; https://doi.org/10.3390/agriengineering8050170 - 30 Apr 2026
Viewed by 62
Abstract
Hyperspectral airborne data combined with machine learning has proven effective for characterizing plant nutritional quality. However, terrain, viewing geometry, and illumination can distort spectral signatures, leading to biased models with limited generalizability for large-scale mapping across farms with a heterogeneous landscape. In this [...] Read more.
Hyperspectral airborne data combined with machine learning has proven effective for characterizing plant nutritional quality. However, terrain, viewing geometry, and illumination can distort spectral signatures, leading to biased models with limited generalizability for large-scale mapping across farms with a heterogeneous landscape. In this study, we developed a framework for mapping pasture quality using airborne hyperspectral imaging while explicitly accounting for in-field acquisition and environmental effects. Nitrogen content (N%) and metabolizable energy (ME) were used as reference indicators across four hill country farms in New Zealand with contrasting environmental and management conditions. Ground truth was obtained using standard laboratory wet chemistry methods and paired with AisaFENIX airborne hyperspectral data, resulting in 1610 spectral samples derived from 161 spatially independent ground plots. Gaussian Process Regression (GPR) and a one-dimensional convolutional neural network (1D-CNN) were trained and evaluated on an independent test dataset. Both models achieved strong predictive performance (R2 > 0.8); however, GPR provided more reliable estimates through predictive uncertainty. Using a 95% confidence interval threshold to mask uncertain predictions increased overall performance (R2 > 0.9) and consequently improved the reliability of the mapped outputs. This approach enables spatially explicit pasture nutrient assessment to support precision land management for carbon and nitrogen. Full article
Show Figures

Figure 1

28 pages, 3354 KB  
Article
Loop Closure with 3D Gaussian Splatting for Dynamic SLAM
by Zhanwu Ma, Wansheng Cheng and Song Fan
Sensors 2026, 26(9), 2669; https://doi.org/10.3390/s26092669 - 25 Apr 2026
Viewed by 639
Abstract
Robust pose estimation and high-fidelity scene reconstruction in dynamic environments represent core challenges in the field of Visual Simultaneous Localization and Mapping (SLAM). Although 3D Gaussian Splatting (3DGS)-based techniques have demonstrated significant potential, existing methods typically assume static scenes and struggle to address [...] Read more.
Robust pose estimation and high-fidelity scene reconstruction in dynamic environments represent core challenges in the field of Visual Simultaneous Localization and Mapping (SLAM). Although 3D Gaussian Splatting (3DGS)-based techniques have demonstrated significant potential, existing methods typically assume static scenes and struggle to address the inconsistency between photometric and geometric observations in dynamic settings, leading to a notable degradation in pose estimation and map accuracy. To address these issues, this paper presents a novel dynamic SLAM method: Loop Closure with 3D Gaussian Splatting for Dynamic SLAM (LCD-Splat). Taking RGB-D images as input, LCD-Splat integrates Mask R-CNN with an improved multi-view geometry approach to detect dynamic objects, generating static scene maps and filling in occluded backgrounds. By leveraging 3DGS submaps and a frame to model tracking strategy, LCD-Splat achieves dense map construction. The method initiates online loop closure detection and employs a novel coarse to fine 3DGS registration algorithm to compute loop closure constraints between submaps. Global consistency is ultimately ensured through robust pose graph optimization. Experimental results on real-world datasets such as TUM RGB-D and Bonn demonstrate that LCD-Splat outperforms existing state-of-the-art SLAM methods in terms of tracking, scene reconstruction, and rendering performance. This approach provides novel insights for high-precision SLAM in dynamic environments and holds significant implications for scene understanding in complex settings. Full article
20 pages, 4072 KB  
Article
Potato Late Blight Disease Detection on UAV Multispectral Imagery
by Mohadeseh Kaviani, Brigitte Leblon, Thangarajah Akilan, Dzhamal Amishev, Armand LaRocque and Ata Haddadi
Remote Sens. 2026, 18(9), 1292; https://doi.org/10.3390/rs18091292 - 24 Apr 2026
Viewed by 251
Abstract
In this study, Mask R-CNN was applied to 5-band raw reflectance images to detect potato plants in UAV images. The highest model performance across all metrics was achieved with a ResNeXt-101 backbone and transfer learning from the same model trained on apple orchard [...] Read more.
In this study, Mask R-CNN was applied to 5-band raw reflectance images to detect potato plants in UAV images. The highest model performance across all metrics was achieved with a ResNeXt-101 backbone and transfer learning from the same model trained on apple orchard data. An F-1 score of 84.2% was achieved. To determine whether the plant was infected with PLB, two methods were used. In the first method, a Mask R-CNN with a DINOv3 small variant backbone was applied to 5-band raw reflectance images. The highest achieved F1-score was 69.05%. In the second method, classical ML classifiers were applied to the 5-band raw reflectance images and 16 associated vegetation index images. The highest F1-score (66.71%) was obtained with a decision tree classifier applied to the 16 vegetation index images. Feature importance analysis indicated that chlorophyll- and red-edge-related indices, such as CIgreen, TCARI, OSAVI2, and Red-edge NDVI, were the most discriminative features for distinguishing healthy and unhealthy potato plants. These results show the effectiveness of combining deep learning and machine learning approaches for potato late blight detection using UAV multispectral imagery. Full article
(This article belongs to the Section AI Remote Sensing)
Show Figures

Figure 1

19 pages, 2456 KB  
Article
Adapting Mask-RCNN for Instance Segmentation of Underwater Dunes in Digital Bathymetric Models
by Nada Bouferdous, Eric Guilbert and Sylvie Daniel
Geosciences 2026, 16(5), 168; https://doi.org/10.3390/geosciences16050168 - 22 Apr 2026
Viewed by 322
Abstract
The introduction of multibeam echosounders has marked a turning point in bathymetric data acquisition, providing precise and detailed digital bathymetric models. These instruments not only enhance our understanding of underwater terrain dynamics but also reveal the presence of complex sedimentary structures, such as [...] Read more.
The introduction of multibeam echosounders has marked a turning point in bathymetric data acquisition, providing precise and detailed digital bathymetric models. These instruments not only enhance our understanding of underwater terrain dynamics but also reveal the presence of complex sedimentary structures, such as submarine dunes. Dunes play an important role in the preservation of the environment but can also be obstacles to safe navigation, requiring dragging operations. Hence, it is important to detect them from bathymetric models. Although information about these dunes has numerous applications, their identification methods remain poorly automated. This paper aims to leverage deep learning to develop a segmentation method for submarine dunes. Several challenges must be overcome. Dunes are complex objects with irregular, highly variable shapes, while bathymetric data are noisy and lack detailed information. Furthermore, in the fluvio-marine context, no labeled datasets exist for training purposes. Starting from a small pre-labeled dataset, this paper proposes a systematic approach to train a Mask R-CNN network. First, data augmentation techniques are applied to expand the dataset significantly and introduce meaningful variations. By relying on transfer learning with a carefully selected pre-trained backbone, feature extraction is optimized, reducing training time while enhancing model performance. The adaptation of the Mask R-CNN model to our submarine dune segmentation task has led to a significant improvement in detection performance, with a pixel-level F1-score reaching 89%. Additionally, the mean Average Precision has exceeded 50%, demonstrating the model’s effectiveness in identifying and delineating dunes despite their varied shapes and blurred contours. These results confirm the relevance of our approach for achieving more reliable dune segmentation in a complex fluvio-marine environment. Full article
Show Figures

Figure 1

26 pages, 964 KB  
Article
Environment-Guided Multimodal Pest Detection and Risk Assessment in Fruit and Vegetable Production Systems
by Jiapeng Sun, Yucheng Peng, Zhimeng Zhang, Wenrui Xu, Boyuan Xi, Yuanying Zhang and Yihong Song
Horticulturae 2026, 12(4), 486; https://doi.org/10.3390/horticulturae12040486 - 16 Apr 2026
Viewed by 644
Abstract
Aimed at the practical challenge that pest occurrence in fruit and vegetable horticultural production exhibits strong environmental dependency, pronounced stage characteristics, and high sensitivity to control decision-making, a multimodal pest recognition and occurrence risk joint modeling method is proposed to address the limitation [...] Read more.
Aimed at the practical challenge that pest occurrence in fruit and vegetable horticultural production exhibits strong environmental dependency, pronounced stage characteristics, and high sensitivity to control decision-making, a multimodal pest recognition and occurrence risk joint modeling method is proposed to address the limitation that conventional intelligent plant protection systems focus primarily on pest identification while lacking risk discrimination capability. Within a unified network framework, pest visual information and environmental temporal data are integrated through the construction of an environment-guided representation learning mechanism, a recognition–risk joint optimization strategy, and a risk-aware decision representation modeling structure. In this manner, pest category recognition and occurrence risk evaluation are conducted simultaneously, thereby providing direct decision support for precision prevention and control in fruit and vegetable production. Systematic experimental evaluation is conducted based on multi-crop and multi-year field data collected from Wuyuan County, Bayannur City, Inner Mongolia. Overall comparative results demonstrate that an identification accuracy of 0.947, a precision of 0.936, and a recall of 0.924 are achieved on the test set, all of which significantly outperform mainstream visual detection models such as YOLOv8, DETR, and Mask R-CNN. In terms of detection performance, mAP@50 and mAP@75 reach 0.962 and 0.821, respectively, indicating stable localization and discrimination capability under complex backgrounds and dense small-target conditions. For the occurrence risk discrimination task, a risk accuracy of 0.887 is obtained, representing an improvement of approximately 4.5 percentage points compared with the simple multimodal feature concatenation method. Cross-crop, cross-site, and cross-year generalization experiments further show that risk accuracy remains above 0.84 with stable recognition performance under significant distribution shifts. Ablation studies verify the synergistic contributions of the proposed core modules to overall performance improvement. The results indicate that the proposed framework enables the transition from single recognition to risk-driven plant protection decision-making, providing a technically viable pathway for pest diagnosis and control strategy optimization in fruit and vegetable horticulture. Full article
Show Figures

Figure 1

30 pages, 14814 KB  
Article
The Intelligent Row-Following Method and System for Corn Harvesters Driven by “Visual-Gateway” Collaboration
by Shengjie Zhou, Songling Du, Xinping Zhang, Cheng Yang, Guoying Li, Qingyang Wang and Liqing Zhao
Agriculture 2026, 16(8), 832; https://doi.org/10.3390/agriculture16080832 - 9 Apr 2026
Viewed by 383
Abstract
To address the issues of corn harvester field operations relying on driver visual guidance for row alignment, high labor intensity, and unstable operation accuracy, this study innovatively proposes a “vision-dominant, gateway-enhanced” dual-mode collaborative row-alignment assistance architecture, and independently develops the R2DC-Mask [...] Read more.
To address the issues of corn harvester field operations relying on driver visual guidance for row alignment, high labor intensity, and unstable operation accuracy, this study innovatively proposes a “vision-dominant, gateway-enhanced” dual-mode collaborative row-alignment assistance architecture, and independently develops the R2DC-Mask R-CNN instance segmentation network and MCC-KF robust filtering algorithm to form a deeply coupled hardware–software-assisted driving system. The R2DC-Mask R-CNN network is autonomously designed for corn row-detection scenarios, achieving accurate perception in complex field environments; the MCC-KF algorithm innovatively solves the state estimation divergence problem during transient vision failures through a multi-criteria constraint mechanism, ensuring continuous navigation capability; the intelligent gateway and vision system form a confidence-driven master–slave switching mechanism that adaptively enhances system robustness when vision is restricted. Field experiments demonstrate that within the speed range of 0.5–5.0 km/h, the average lateral deviation in the row alignment assisted by the system is 3.82–5.30 cm, the proportion of deviations less than 10 cm exceeds 96%, and all sample deviations remain within 20 cm; at a speed of 3.5 km/h, the system reduces the average grain loss rate from 3.76% under manual operation to 2.65%, a decrease of 29.5%. This system effectively improves row alignment accuracy and harvest quality, providing a practical human–machine collaborative solution for intelligent harvester operations. Full article
(This article belongs to the Section Agricultural Technology)
Show Figures

Figure 1

11 pages, 968 KB  
Article
Deep Learning-Assisted Localization of Cystic Lesions and Benign Tumors in the Maxillofacial Region Using Panoramic Radiographs: A Preliminary Feasibility Study
by Kai-Hua Lien, Sih-Yi Wu, Yun-Ya Yang, Jia-Yu Liu, Yi-Cheng Chen, Ten-Yi Huang, Yu-Wen Tang, Yen-Chu Hsiao, Chung-Bin Wu and Cheng-Chia Yu
J. Clin. Med. 2026, 15(7), 2784; https://doi.org/10.3390/jcm15072784 - 7 Apr 2026
Viewed by 330
Abstract
Background/Objectives: Automated localization of cystic lesions and benign tumors on panoramic radiographs may support lesion recognition in the maxillofacial region. This preliminary feasibility study aimed to develop and evaluate a deep learning model based on Mask R-CNN for the localization of dentigerous cysts [...] Read more.
Background/Objectives: Automated localization of cystic lesions and benign tumors on panoramic radiographs may support lesion recognition in the maxillofacial region. This preliminary feasibility study aimed to develop and evaluate a deep learning model based on Mask R-CNN for the localization of dentigerous cysts (DCs), radicular cysts (RCs), odontogenic keratocysts (OKCs), and ameloblastomas using panoramic radiographs. Methods: A total of 215 panoramic radiographs were retrospectively collected from Taichung Veterans General Hospital (2018–2023). After excluding postoperative, recurrent, or low-quality images, 184 lesions were allocated to the training set and 47 lesions to the testing set. Lesions were annotated based on pathology-confirmed diagnoses. The Mask R-CNN model was trained to localize and classify four lesion types. Model performance was evaluated using precision, sensitivity (recall), and F1 score at an Intersection over Union (IoU) threshold of 0.1. Results: In the testing set (n = 47), 26 lesions were correctly localized, yielding an overall sensitivity of 55.3% and a precision of 83.9%. The corresponding F1 score was 66.7%. Lesion-specific sensitivities were 40.0% for ameloblastomas, 37.5% for OKCs, 36.8% for RCs, and 93.3% for DCs. Conclusions: This study suggests the preliminary feasibility of a deep learning-assisted approach for lesion localization on panoramic radiographs. However, the absence of lesion-free control images and the limited dataset size restrict the generalizability and clinical applicability of the findings. Further validation using larger and more balanced datasets is required. Full article
(This article belongs to the Special Issue Artificial Intelligence (AI) in Dental Clinical Practice)
Show Figures

Figure 1

23 pages, 49319 KB  
Article
iLog 2.2: Volume and Nutrition Estimation for Mixed Foods via Mask R-CNN and Federated Learning
by Indira Devi Siripurapu, Laavanya Rachakonda, Saraju P. Mohanty and Elias Kougianos
Electronics 2026, 15(7), 1460; https://doi.org/10.3390/electronics15071460 - 1 Apr 2026
Viewed by 406
Abstract
Accurately estimating calorie intake and nutrient composition from what we eat remains one of the most practical challenges in maintaining a healthy lifestyle. Manual food logging and database-based estimations are often inaccurate because ingredient proportions and preparation styles vary widely. This paper presents [...] Read more.
Accurately estimating calorie intake and nutrient composition from what we eat remains one of the most practical challenges in maintaining a healthy lifestyle. Manual food logging and database-based estimations are often inaccurate because ingredient proportions and preparation styles vary widely. This paper presents a lightweight, privacy-preserving framework that estimates calories and detailed nutrient values from a single image. The model uses a Mask R-CNN-based segmentation network to identify visible food components, measure their area, estimate their volume using preset height values, and map them to nutritional information obtained from reliable datasets such as USDA and Food-a-pedia. The system integrates federated learning (FL) to ensure privacy by allowing the model to improve collaboratively without sharing raw user data. The proposed architecture achieved a mean Average Precision (mAP) of 96% for detection and 92% for segmentation, confirming its precision and efficiency. The model is trained and evaluated on a curated pizza dataset consisting of 1107 images across 50 topping categories, using a standard train-validation-test split (666/219/222) to ensure reliable performance assessment. The proposed system also achieves low nutrition estimation error, with calorie and nutrient deviations remaining within approximately 3.8% to 11.1% across evaluated metrics. A lightweight mobile interface is demonstrated through a Figma-based prototype mockup to illustrate potential real-world deployment and user interaction. Full article
Show Figures

Figure 1

15 pages, 2097 KB  
Article
A Comparative Study on Ocean Front Detection in the Northwestern Pacific Using U-Net and Mask R-CNN
by Caixia Shao, Dianjun Zhang and Xuefeng Zhang
Oceans 2026, 7(2), 29; https://doi.org/10.3390/oceans7020029 - 31 Mar 2026
Viewed by 360
Abstract
Ocean fronts play a vital role in modulating climate variability, driving material transport, and maintaining the stability of marine ecosystems. Therefore, accurate identification of ocean fronts is of great significance for marine environmental monitoring and resource management. This study focuses on the Northwestern [...] Read more.
Ocean fronts play a vital role in modulating climate variability, driving material transport, and maintaining the stability of marine ecosystems. Therefore, accurate identification of ocean fronts is of great significance for marine environmental monitoring and resource management. This study focuses on the Northwestern Pacific region and conducts a systematic comparison between two representative deep learning models—U-Net and Mask R-CNN—for automated ocean front detection. The objective is to evaluate the adaptability and strengths of different network architectures in handling multi-scale features, complex background conditions, and boundary delineation, thereby providing a theoretical basis for model selection and application-specific deployment. Experimental results show that U-Net achieves superior spatial consistency in large-scale frontal segmentation, with an IoU of 0.81 and a Dice coefficient of 0.76, while maintaining relatively high computational efficiency. In contrast, Mask R-CNN demonstrates stronger boundary modeling capabilities in detecting small-scale fronts and handling heterogeneous backgrounds, achieving an IoU of 0.78 and a Dice score of 0.73, though at the cost of increased computational demand. Overall, U-Net is more suitable for broad-scale automatic detection of ocean fronts, whereas Mask R-CNN exhibits greater potential in complex scene recognition. Integrating the structural advantages of both models holds promise for further enhancing the stability and accuracy of frontal detection, thereby offering robust technical support for ocean remote sensing analysis and environmental forecasting. Full article
(This article belongs to the Special Issue Recent Progress in Ocean Fronts)
Show Figures

Figure 1

14 pages, 3588 KB  
Article
Calculation of Morphological Characteristic Parameters of Sand Particles Based on Deep Learning
by Fei Li, Zhifeng Liang, Jinkai Wu, Jinan Wang and Pengda Cheng
Appl. Sci. 2026, 16(7), 3231; https://doi.org/10.3390/app16073231 - 27 Mar 2026
Viewed by 312
Abstract
For projects such as tailings ponds, slopes, and foundations, loose materials such as rock, slag, and sand, which are composed of particles, often have low cohesion and rely mainly on friction to maintain stability. The shear strength parameters, namely, the internal friction angle [...] Read more.
For projects such as tailings ponds, slopes, and foundations, loose materials such as rock, slag, and sand, which are composed of particles, often have low cohesion and rely mainly on friction to maintain stability. The shear strength parameters, namely, the internal friction angle and cohesion, are the core parameters that describe the mechanical properties of materials and are directly related to the engineering stability of the above projects. The shear strength properties of loose media are related to the geometric morphological characteristics of particles. Particles with high irregularity will increase the bite and friction of the contact interface between particles, thereby affecting the overall peak shear strength of the material. This study takes sand as the research object. Based on the Mask R-CNN algorithm in deep learning, a sand particle image dataset consisting of single, contact, and sand surface particles is established. An image segmentation model that can identify particles on the surface of the sand layer and obtain the corresponding particle mask is trained; a Python 3.11.4 program is written to automatically calculate seven characteristic parameters of particle morphological characteristics parameters, including the Feret major diameter, the particle Feret minor diameter, the particle aspect ratio, the particle roundness, the comprehensive shape coefficient, the roughness, and the convexity through the particle mask. This method can obtain the overall morphological characteristics of sand particles in real time and is a particle processing method that is a prerequisite for the subsequent rapid prediction of the strength properties of granular materials. Full article
Show Figures

Figure 1

31 pages, 9451 KB  
Article
Quantitative Microstructure Characterization in Additively Manufactured Nickel Alloy 625 Using Image Segmentation and Deep Learning
by Tuğrul Özel, Sijie Ding, Amit Ramasubramanian, Franco Pieri and Doruk Eskicorapci
Machines 2026, 14(4), 366; https://doi.org/10.3390/machines14040366 - 26 Mar 2026
Viewed by 428
Abstract
Laser Powder Bed Fusion for metals (PBF-LB/M) is a complex additive manufacturing process in which metal powder is selectively melted layer-by-layer to fabricate 3D parts. Process parameters critically influence the resulting microstructure in nickel alloys, with features such as melt pool marks, grain [...] Read more.
Laser Powder Bed Fusion for metals (PBF-LB/M) is a complex additive manufacturing process in which metal powder is selectively melted layer-by-layer to fabricate 3D parts. Process parameters critically influence the resulting microstructure in nickel alloys, with features such as melt pool marks, grain size and orientation, porosity, and cracks serving as key process signatures. These features are typically analyzed post-process to identify suboptimal conditions. This research aims to develop automated post-process measurement and analysis techniques using image processing, pattern recognition, and statistical learning to correlate process parameters with part quality. Optical microscopy images of build surfaces are analyzed using machine learning algorithms to evaluate porosity, grain size, and relative density in fabricated test coupons. Effect plots are generated to identify trends related to increasing energy density. A novel deep learning approach based on Mask R-CNN is used to detect and segment melt pool regions in optical microscopy images. From the segmented regions, melt pool dimensions—such as width, depth, and area—are extracted using bounding geometry coordinates. Manually labeled images (Type I and Type II) are used to train the model. A comparison between ResNet-50 and ResNet-101 backbones shows that the ResNet-50-based model (Model 2) achieves superior performance, with lower training loss (0.1781 vs. 0.1907) and validation loss (8.6140 vs. 9.4228). Quantitative evaluation using the Jaccard index, precision, and recall metrics shows that the ResNet-101 backbone outperforms ResNet-50, achieving about 4% higher mean Intersection-over-Union, with values of 0.85 for Type I and 0.82 for Type II melt pools, where Type I is detected more accurately due to its more regular morphology and clearer boundaries. By extending Faster R-CNNs with a mask prediction branch, the method allows for precise melt pool measurements, providing valuable insights into process quality and dimensional accuracy, and aiding in the detection of defects in PBF-LB-fabricated parts. Full article
(This article belongs to the Special Issue Artificial Intelligence in Mechanical Engineering Applications)
Show Figures

Figure 1

20 pages, 7211 KB  
Article
An Enhanced YOLO Framework for Multi-Scale Landslide Identification Under Complex Backgrounds
by Taowen Nie, Jianxing Wu, Shibin Xu and Yong Liu
Sustainability 2026, 18(7), 3205; https://doi.org/10.3390/su18073205 - 25 Mar 2026
Viewed by 318
Abstract
Deep learning has significantly improved landslide identification from remote sensing imagery, but accurately detecting multi-scale landslides under complex backgrounds remains challenging. This study proposes a lightweight YOLOv8-based model, namely YOLO-BEG, incorporating three improvements: a bidirectional feature pyramid network (BiFPN) for enhanced multi-scale feature [...] Read more.
Deep learning has significantly improved landslide identification from remote sensing imagery, but accurately detecting multi-scale landslides under complex backgrounds remains challenging. This study proposes a lightweight YOLOv8-based model, namely YOLO-BEG, incorporating three improvements: a bidirectional feature pyramid network (BiFPN) for enhanced multi-scale feature fusion, an embedded Gaussian attention system (EGS) to improve discrimination under complex backgrounds, and a generalized intersection over union (GIoU) loss to optimize boundary localization. The model was evaluated on two datasets: a vegetation-covered Southwest landslide database and the Sichuan–Tibet Highway database. On the Southwest database, YOLO-BEG improved Precision, Recall, and F1-score by 16%, 13%, and 15% compared with YOLOv8, while using only one tenth of the parameters of Mask R-CNN. In the Sichuan–Tibet Highway database, which has more diverse background conditions, YOLO-BEG outperformed Mask R-CNN and Faster R-CNN by 32% and 13% in F1-score, respectively. These results demonstrate that YOLO-BEG is able to operate with fewer parameters and yield high-precision identification of landslides with different scales under complex backgrounds, making it a rapid and accurate tool for landslide identification. Full article
(This article belongs to the Special Issue Sustainable Assessment and Risk Analysis on Landslide Hazards)
Show Figures

Figure 1

17 pages, 5002 KB  
Article
Comparison of Automatic Recognition Models for Building Hollow Based on Infrared Thermography
by Haohan Yao, Chengyu Liu, Dong Hu, Changyu Wu and Quansheng Lyu
Appl. Sci. 2026, 16(6), 3075; https://doi.org/10.3390/app16063075 - 23 Mar 2026
Viewed by 355
Abstract
To achieve efficient recognition of hollows in building external walls, this study adopts infrared thermography to construct a dedicated dataset and focuses on comparing the performance of three instance segmentation models: Mask R-CNN, YOLACT, and YOLOv8. Experimental results indicate that YOLACT is suitable [...] Read more.
To achieve efficient recognition of hollows in building external walls, this study adopts infrared thermography to construct a dedicated dataset and focuses on comparing the performance of three instance segmentation models: Mask R-CNN, YOLACT, and YOLOv8. Experimental results indicate that YOLACT is suitable for on-site rapid screening balancing accuracy and speed, YOLOv8 is more applicable to detection tasks requiring strict control over missed detections and accurate restoration of complex boundaries, while Mask R-CNN is better suited for non-real-time static image analysis. To further improve model performance, this paper introduces the position-sensitive attention (PSA) mechanism to YOLOv8 and trains the modified model. The improved model has achieved significant enhancements in various performance metrics. This study provides a reference scheme for the automatic detection of hollow defects and offers a basis for model selection under different application scenarios. Full article
(This article belongs to the Special Issue New Advances in Non-Destructive Testing and Evaluation)
Show Figures

Figure 1

20 pages, 20579 KB  
Article
A Deep Learning Approach for High-Throughput Multi-Tissue Cell Segmentation and Phenotypic Analysis in Chinese Cabbage Leaf Cross-Sections
by Zhiming Zhang, Jun Zhang, Tianyi Ren, Minggeng Liu and Lei Sun
Agronomy 2026, 16(6), 612; https://doi.org/10.3390/agronomy16060612 - 13 Mar 2026
Viewed by 423
Abstract
Quantitative analysis of leaf cell microstructure is crucial for deciphering agronomic traits in Chinese cabbage, including photosynthetic efficiency, stress tolerance, and yield potential. Traditional manual observation methods are inefficient and highly subjective, failing to meet the demands of large-scale breeding for high-throughput, reproducible [...] Read more.
Quantitative analysis of leaf cell microstructure is crucial for deciphering agronomic traits in Chinese cabbage, including photosynthetic efficiency, stress tolerance, and yield potential. Traditional manual observation methods are inefficient and highly subjective, failing to meet the demands of large-scale breeding for high-throughput, reproducible microscopic phenotyping. To transition breeding practices from experience-driven to data-driven, there is an urgent need to establish automated, standardized systems for acquiring cell-scale phenotypes. Therefore, this study proposes an automated instance segmentation and phenotyping analysis framework for multi-tissue cells in Chinese cabbage leaf cross-sections. This framework systematically optimizes Mask R-CNN by introducing an attention mechanism to enhance cellular feature responses in complex backgrounds. It employs weighted multi-scale feature fusion to process densely distributed small-scale cells and integrates a refined boundary optimization module to improve recognition accuracy in adherent and blurred regions. On a microscopic image dataset spanning multiple varieties, this method achieves high-precision predictions in instance segmentation tasks. Based on the predicted cell masks, an interactive phenotyping analysis tool was further developed to automatically extract standardized single-cell morphological parameters, including area, perimeter, and Feret’s diameter. The measured parameters exhibit high consistency with manual annotations (correlation coefficients (r) all exceed 0.97). This framework enables high-throughput, standardized phenotypic analysis at the cellular level of leaf cross-sections, providing a reliable method for the digital and automated interpretation of crop microscopic traits. This technical solution not only supports the systematic integration of microscopic phenotypes in Chinese cabbage breeding but also offers a scalable solution for cellular-scale phenotypic research in other crops. Full article
(This article belongs to the Special Issue AI, Sensors and Robotics for Smart Agriculture)
Show Figures

Figure 1

22 pages, 17254 KB  
Article
Landslide Susceptibility Assessment Based on a Deep Learning-Derived Landslide Inventory in Moxi Town, Sichuan, China
by Yitong Yao, Yixiang Du, Wenjun Zhang, Xianwen Liu, Jialun Cai, Hui Feng, Hongyao Xiang, Rong Hu, Yuhao Yang and Tongben Fu
Remote Sens. 2026, 18(6), 849; https://doi.org/10.3390/rs18060849 - 10 Mar 2026
Viewed by 673
Abstract
Landslides are characterized by strong suddenness and a wide range of damage; accurate prediction of their susceptibility is an important prerequisite for regional risk prevention and control. To address the difficulties in acquiring landslide inventories in complex terrain areas and the insufficient interpretability [...] Read more.
Landslides are characterized by strong suddenness and a wide range of damage; accurate prediction of their susceptibility is an important prerequisite for regional risk prevention and control. To address the difficulties in acquiring landslide inventories in complex terrain areas and the insufficient interpretability of existing prediction models, this study proposes a landslide susceptibility assessment (LSA) framework that integrates automated sample detection and interpretability analysis. The proposed framework is applied to Moxi Town, a typical alpine valley area in Sichuan Province, China. A Mask R-CNN instance segmentation model was introduced to achieve automated detection of landslide samples, resulting in a high-quality inventory containing 923 landslides. Based on the spatial relationships between the landslide inventory and influencing factors, a convolutional neural network (CNN) landslide susceptibility assessment model incorporating Shapley Additive exPlanations (SHAP) interpretability analysis was constructed. The CNN model was further compared with random forest (RF) and extreme gradient boosting (XGBoost) machine learning models. The results show that the AUC value of the CNN model has increased by 4.3% and 3.2% compared with the RF and XGBoost models, respectively, and it significantly reduces the pretzel effect of landslide susceptibility mapping (LSM). The results validate the reliability of the proposed framework, which can provide technical support for landslide disaster prevention and monitoring. Full article
(This article belongs to the Special Issue Landslide Detection Using Machine and Deep Learning)
Show Figures

Figure 1

Back to TopTop