MDPI - Publisher of Open Access Journals

20 pages, 29170 KB

Open AccessArticle

Hyperspectral Mapping of Pasture Nitrogen Content and Metabolizable Energy in New Zealand Hill Country Grasslands

by Nitin Bhatia and Maxence Plouviez

AgriEngineering 2026, 8(5), 170; https://doi.org/10.3390/agriengineering8050170 - 30 Apr 2026

Viewed by 62

Hyperspectral airborne data combined with machine learning has proven effective for characterizing plant nutritional quality. However, terrain, viewing geometry, and illumination can distort spectral signatures, leading to biased models with limited generalizability for large-scale mapping across farms with a heterogeneous landscape. In this [...] Read more.

Hyperspectral airborne data combined with machine learning has proven effective for characterizing plant nutritional quality. However, terrain, viewing geometry, and illumination can distort spectral signatures, leading to biased models with limited generalizability for large-scale mapping across farms with a heterogeneous landscape. In this study, we developed a framework for mapping pasture quality using airborne hyperspectral imaging while explicitly accounting for in-field acquisition and environmental effects. Nitrogen content (N%) and metabolizable energy (ME) were used as reference indicators across four hill country farms in New Zealand with contrasting environmental and management conditions. Ground truth was obtained using standard laboratory wet chemistry methods and paired with AisaFENIX airborne hyperspectral data, resulting in 1610 spectral samples derived from 161 spatially independent ground plots. Gaussian Process Regression (GPR) and a one-dimensional convolutional neural network (1D-CNN) were trained and evaluated on an independent test dataset. Both models achieved strong predictive performance (R² > 0.8); however, GPR provided more reliable estimates through predictive uncertainty. Using a 95% confidence interval threshold to mask uncertain predictions increased overall performance (R² > 0.9) and consequently improved the reliability of the mapped outputs. This approach enables spatially explicit pasture nutrient assessment to support precision land management for carbon and nitrogen. Full article

(This article belongs to the Special Issue Advancements in Remote Sensing and AI-Driven Analytics for Sustainable Agriculture)

► Show Figures

Figure 1

28 pages, 3354 KB

Open AccessArticle

Loop Closure with 3D Gaussian Splatting for Dynamic SLAM

by Zhanwu Ma, Wansheng Cheng and Song Fan

Sensors 2026, 26(9), 2669; https://doi.org/10.3390/s26092669 - 25 Apr 2026

Viewed by 639

Abstract

Robust pose estimation and high-fidelity scene reconstruction in dynamic environments represent core challenges in the field of Visual Simultaneous Localization and Mapping (SLAM). Although 3D Gaussian Splatting (3DGS)-based techniques have demonstrated significant potential, existing methods typically assume static scenes and struggle to address [...] Read more.

Robust pose estimation and high-fidelity scene reconstruction in dynamic environments represent core challenges in the field of Visual Simultaneous Localization and Mapping (SLAM). Although 3D Gaussian Splatting (3DGS)-based techniques have demonstrated significant potential, existing methods typically assume static scenes and struggle to address the inconsistency between photometric and geometric observations in dynamic settings, leading to a notable degradation in pose estimation and map accuracy. To address these issues, this paper presents a novel dynamic SLAM method: Loop Closure with 3D Gaussian Splatting for Dynamic SLAM (LCD-Splat). Taking RGB-D images as input, LCD-Splat integrates Mask R-CNN with an improved multi-view geometry approach to detect dynamic objects, generating static scene maps and filling in occluded backgrounds. By leveraging 3DGS submaps and a frame to model tracking strategy, LCD-Splat achieves dense map construction. The method initiates online loop closure detection and employs a novel coarse to fine 3DGS registration algorithm to compute loop closure constraints between submaps. Global consistency is ultimately ensured through robust pose graph optimization. Experimental results on real-world datasets such as TUM RGB-D and Bonn demonstrate that LCD-Splat outperforms existing state-of-the-art SLAM methods in terms of tracking, scene reconstruction, and rendering performance. This approach provides novel insights for high-precision SLAM in dynamic environments and holds significant implications for scene understanding in complex settings. Full article

(This article belongs to the Special Issue Perception and Control Technology for Intelligent Autonomous Unmanned Systems)

20 pages, 4072 KB

Open AccessArticle

Potato Late Blight Disease Detection on UAV Multispectral Imagery

by Mohadeseh Kaviani, Brigitte Leblon, Thangarajah Akilan, Dzhamal Amishev, Armand LaRocque and Ata Haddadi

Remote Sens. 2026, 18(9), 1292; https://doi.org/10.3390/rs18091292 - 24 Apr 2026

Viewed by 251

Abstract

In this study, Mask R-CNN was applied to 5-band raw reflectance images to detect potato plants in UAV images. The highest model performance across all metrics was achieved with a ResNeXt-101 backbone and transfer learning from the same model trained on apple orchard [...] Read more.

In this study, Mask R-CNN was applied to 5-band raw reflectance images to detect potato plants in UAV images. The highest model performance across all metrics was achieved with a ResNeXt-101 backbone and transfer learning from the same model trained on apple orchard data. An F-1 score of 84.2% was achieved. To determine whether the plant was infected with PLB, two methods were used. In the first method, a Mask R-CNN with a DINOv3 small variant backbone was applied to 5-band raw reflectance images. The highest achieved F1-score was 69.05%. In the second method, classical ML classifiers were applied to the 5-band raw reflectance images and 16 associated vegetation index images. The highest F1-score (66.71%) was obtained with a decision tree classifier applied to the 16 vegetation index images. Feature importance analysis indicated that chlorophyll- and red-edge-related indices, such as CIgreen, TCARI, OSAVI2, and Red-edge NDVI, were the most discriminative features for distinguishing healthy and unhealthy potato plants. These results show the effectiveness of combining deep learning and machine learning approaches for potato late blight detection using UAV multispectral imagery. Full article

(This article belongs to the Section AI Remote Sensing)

► Show Figures

Figure 1

19 pages, 2456 KB

Open AccessArticle

Adapting Mask-RCNN for Instance Segmentation of Underwater Dunes in Digital Bathymetric Models

by Nada Bouferdous, Eric Guilbert and Sylvie Daniel

Geosciences 2026, 16(5), 168; https://doi.org/10.3390/geosciences16050168 - 22 Apr 2026

Viewed by 322

Abstract

The introduction of multibeam echosounders has marked a turning point in bathymetric data acquisition, providing precise and detailed digital bathymetric models. These instruments not only enhance our understanding of underwater terrain dynamics but also reveal the presence of complex sedimentary structures, such as [...] Read more.

The introduction of multibeam echosounders has marked a turning point in bathymetric data acquisition, providing precise and detailed digital bathymetric models. These instruments not only enhance our understanding of underwater terrain dynamics but also reveal the presence of complex sedimentary structures, such as submarine dunes. Dunes play an important role in the preservation of the environment but can also be obstacles to safe navigation, requiring dragging operations. Hence, it is important to detect them from bathymetric models. Although information about these dunes has numerous applications, their identification methods remain poorly automated. This paper aims to leverage deep learning to develop a segmentation method for submarine dunes. Several challenges must be overcome. Dunes are complex objects with irregular, highly variable shapes, while bathymetric data are noisy and lack detailed information. Furthermore, in the fluvio-marine context, no labeled datasets exist for training purposes. Starting from a small pre-labeled dataset, this paper proposes a systematic approach to train a Mask R-CNN network. First, data augmentation techniques are applied to expand the dataset significantly and introduce meaningful variations. By relying on transfer learning with a carefully selected pre-trained backbone, feature extraction is optimized, reducing training time while enhancing model performance. The adaptation of the Mask R-CNN model to our submarine dune segmentation task has led to a significant improvement in detection performance, with a pixel-level F1-score reaching 89%. Additionally, the mean Average Precision has exceeded 50%, demonstrating the model’s effectiveness in identifying and delineating dunes despite their varied shapes and blurred contours. These results confirm the relevance of our approach for achieving more reliable dune segmentation in a complex fluvio-marine environment. Full article

► Show Figures

Figure 1

26 pages, 964 KB

Open AccessArticle

Environment-Guided Multimodal Pest Detection and Risk Assessment in Fruit and Vegetable Production Systems

by Jiapeng Sun, Yucheng Peng, Zhimeng Zhang, Wenrui Xu, Boyuan Xi, Yuanying Zhang and Yihong Song

Horticulturae 2026, 12(4), 486; https://doi.org/10.3390/horticulturae12040486 - 16 Apr 2026

Viewed by 644

Abstract

Aimed at the practical challenge that pest occurrence in fruit and vegetable horticultural production exhibits strong environmental dependency, pronounced stage characteristics, and high sensitivity to control decision-making, a multimodal pest recognition and occurrence risk joint modeling method is proposed to address the limitation [...] Read more.

Aimed at the practical challenge that pest occurrence in fruit and vegetable horticultural production exhibits strong environmental dependency, pronounced stage characteristics, and high sensitivity to control decision-making, a multimodal pest recognition and occurrence risk joint modeling method is proposed to address the limitation that conventional intelligent plant protection systems focus primarily on pest identification while lacking risk discrimination capability. Within a unified network framework, pest visual information and environmental temporal data are integrated through the construction of an environment-guided representation learning mechanism, a recognition–risk joint optimization strategy, and a risk-aware decision representation modeling structure. In this manner, pest category recognition and occurrence risk evaluation are conducted simultaneously, thereby providing direct decision support for precision prevention and control in fruit and vegetable production. Systematic experimental evaluation is conducted based on multi-crop and multi-year field data collected from Wuyuan County, Bayannur City, Inner Mongolia. Overall comparative results demonstrate that an identification accuracy of

0.947

, a precision of

0.936

, and a recall of

0.924

are achieved on the test set, all of which significantly outperform mainstream visual detection models such as YOLOv8, DETR, and Mask R-CNN. In terms of detection performance, mAP@50 and mAP@75 reach

0.962

and

0.821

, respectively, indicating stable localization and discrimination capability under complex backgrounds and dense small-target conditions. For the occurrence risk discrimination task, a risk accuracy of

0.887

is obtained, representing an improvement of approximately

4.5

percentage points compared with the simple multimodal feature concatenation method. Cross-crop, cross-site, and cross-year generalization experiments further show that risk accuracy remains above

0.84

with stable recognition performance under significant distribution shifts. Ablation studies verify the synergistic contributions of the proposed core modules to overall performance improvement. The results indicate that the proposed framework enables the transition from single recognition to risk-driven plant protection decision-making, providing a technically viable pathway for pest diagnosis and control strategy optimization in fruit and vegetable horticulture. Full article

(This article belongs to the Special Issue Pest Diagnosis and Control Strategies for Fruit and Vegetable Plants—Second Edition)

► Show Figures

Figure 1

30 pages, 14814 KB

Open AccessArticle

The Intelligent Row-Following Method and System for Corn Harvesters Driven by “Visual-Gateway” Collaboration

by Shengjie Zhou, Songling Du, Xinping Zhang, Cheng Yang, Guoying Li, Qingyang Wang and Liqing Zhao

Agriculture 2026, 16(8), 832; https://doi.org/10.3390/agriculture16080832 - 9 Apr 2026

Viewed by 383

Abstract

To address the issues of corn harvester field operations relying on driver visual guidance for row alignment, high labor intensity, and unstable operation accuracy, this study innovatively proposes a “vision-dominant, gateway-enhanced” dual-mode collaborative row-alignment assistance architecture, and independently develops the R²DC-Mask [...] Read more.

To address the issues of corn harvester field operations relying on driver visual guidance for row alignment, high labor intensity, and unstable operation accuracy, this study innovatively proposes a “vision-dominant, gateway-enhanced” dual-mode collaborative row-alignment assistance architecture, and independently develops the R²DC-Mask R-CNN instance segmentation network and MCC-KF robust filtering algorithm to form a deeply coupled hardware–software-assisted driving system. The R²DC-Mask R-CNN network is autonomously designed for corn row-detection scenarios, achieving accurate perception in complex field environments; the MCC-KF algorithm innovatively solves the state estimation divergence problem during transient vision failures through a multi-criteria constraint mechanism, ensuring continuous navigation capability; the intelligent gateway and vision system form a confidence-driven master–slave switching mechanism that adaptively enhances system robustness when vision is restricted. Field experiments demonstrate that within the speed range of 0.5–5.0 km/h, the average lateral deviation in the row alignment assisted by the system is 3.82–5.30 cm, the proportion of deviations less than 10 cm exceeds 96%, and all sample deviations remain within 20 cm; at a speed of 3.5 km/h, the system reduces the average grain loss rate from 3.76% under manual operation to 2.65%, a decrease of 29.5%. This system effectively improves row alignment accuracy and harvest quality, providing a practical human–machine collaborative solution for intelligent harvester operations. Full article

(This article belongs to the Section Agricultural Technology)

► Show Figures

Figure 1

11 pages, 968 KB

Open AccessArticle

Deep Learning-Assisted Localization of Cystic Lesions and Benign Tumors in the Maxillofacial Region Using Panoramic Radiographs: A Preliminary Feasibility Study

by Kai-Hua Lien, Sih-Yi Wu, Yun-Ya Yang, Jia-Yu Liu, Yi-Cheng Chen, Ten-Yi Huang, Yu-Wen Tang, Yen-Chu Hsiao, Chung-Bin Wu and Cheng-Chia Yu

J. Clin. Med. 2026, 15(7), 2784; https://doi.org/10.3390/jcm15072784 - 7 Apr 2026

Viewed by 330

Abstract

Background/Objectives: Automated localization of cystic lesions and benign tumors on panoramic radiographs may support lesion recognition in the maxillofacial region. This preliminary feasibility study aimed to develop and evaluate a deep learning model based on Mask R-CNN for the localization of dentigerous cysts [...] Read more.

Background/Objectives: Automated localization of cystic lesions and benign tumors on panoramic radiographs may support lesion recognition in the maxillofacial region. This preliminary feasibility study aimed to develop and evaluate a deep learning model based on Mask R-CNN for the localization of dentigerous cysts (DCs), radicular cysts (RCs), odontogenic keratocysts (OKCs), and ameloblastomas using panoramic radiographs. Methods: A total of 215 panoramic radiographs were retrospectively collected from Taichung Veterans General Hospital (2018–2023). After excluding postoperative, recurrent, or low-quality images, 184 lesions were allocated to the training set and 47 lesions to the testing set. Lesions were annotated based on pathology-confirmed diagnoses. The Mask R-CNN model was trained to localize and classify four lesion types. Model performance was evaluated using precision, sensitivity (recall), and F1 score at an Intersection over Union (IoU) threshold of 0.1. Results: In the testing set (n = 47), 26 lesions were correctly localized, yielding an overall sensitivity of 55.3% and a precision of 83.9%. The corresponding F1 score was 66.7%. Lesion-specific sensitivities were 40.0% for ameloblastomas, 37.5% for OKCs, 36.8% for RCs, and 93.3% for DCs. Conclusions: This study suggests the preliminary feasibility of a deep learning-assisted approach for lesion localization on panoramic radiographs. However, the absence of lesion-free control images and the limited dataset size restrict the generalizability and clinical applicability of the findings. Further validation using larger and more balanced datasets is required. Full article

(This article belongs to the Special Issue Artificial Intelligence (AI) in Dental Clinical Practice)

► Show Figures

Figure 1

23 pages, 49319 KB

Open AccessArticle

iLog 2.2: Volume and Nutrition Estimation for Mixed Foods via Mask R-CNN and Federated Learning

by Indira Devi Siripurapu, Laavanya Rachakonda, Saraju P. Mohanty and Elias Kougianos

Electronics 2026, 15(7), 1460; https://doi.org/10.3390/electronics15071460 - 1 Apr 2026

Viewed by 406

Abstract

Accurately estimating calorie intake and nutrient composition from what we eat remains one of the most practical challenges in maintaining a healthy lifestyle. Manual food logging and database-based estimations are often inaccurate because ingredient proportions and preparation styles vary widely. This paper presents [...] Read more.

Accurately estimating calorie intake and nutrient composition from what we eat remains one of the most practical challenges in maintaining a healthy lifestyle. Manual food logging and database-based estimations are often inaccurate because ingredient proportions and preparation styles vary widely. This paper presents a lightweight, privacy-preserving framework that estimates calories and detailed nutrient values from a single image. The model uses a Mask R-CNN-based segmentation network to identify visible food components, measure their area, estimate their volume using preset height values, and map them to nutritional information obtained from reliable datasets such as USDA and Food-a-pedia. The system integrates federated learning (FL) to ensure privacy by allowing the model to improve collaboratively without sharing raw user data. The proposed architecture achieved a mean Average Precision (mAP) of 96% for detection and 92% for segmentation, confirming its precision and efficiency. The model is trained and evaluated on a curated pizza dataset consisting of 1107 images across 50 topping categories, using a standard train-validation-test split (666/219/222) to ensure reliable performance assessment. The proposed system also achieves low nutrition estimation error, with calorie and nutrient deviations remaining within approximately 3.8% to 11.1% across evaluated metrics. A lightweight mobile interface is demonstrated through a Figma-based prototype mockup to illustrate potential real-world deployment and user interaction. Full article

(This article belongs to the Special Issue Secure Edge AI and IoMT: From Personalized Healthcare to Smart Agriculture)

► Show Figures

Figure 1

15 pages, 2097 KB

Open AccessArticle

A Comparative Study on Ocean Front Detection in the Northwestern Pacific Using U-Net and Mask R-CNN

by Caixia Shao, Dianjun Zhang and Xuefeng Zhang

Oceans 2026, 7(2), 29; https://doi.org/10.3390/oceans7020029 - 31 Mar 2026

Viewed by 360

Abstract

Ocean fronts play a vital role in modulating climate variability, driving material transport, and maintaining the stability of marine ecosystems. Therefore, accurate identification of ocean fronts is of great significance for marine environmental monitoring and resource management. This study focuses on the Northwestern [...] Read more.

Ocean fronts play a vital role in modulating climate variability, driving material transport, and maintaining the stability of marine ecosystems. Therefore, accurate identification of ocean fronts is of great significance for marine environmental monitoring and resource management. This study focuses on the Northwestern Pacific region and conducts a systematic comparison between two representative deep learning models—U-Net and Mask R-CNN—for automated ocean front detection. The objective is to evaluate the adaptability and strengths of different network architectures in handling multi-scale features, complex background conditions, and boundary delineation, thereby providing a theoretical basis for model selection and application-specific deployment. Experimental results show that U-Net achieves superior spatial consistency in large-scale frontal segmentation, with an IoU of 0.81 and a Dice coefficient of 0.76, while maintaining relatively high computational efficiency. In contrast, Mask R-CNN demonstrates stronger boundary modeling capabilities in detecting small-scale fronts and handling heterogeneous backgrounds, achieving an IoU of 0.78 and a Dice score of 0.73, though at the cost of increased computational demand. Overall, U-Net is more suitable for broad-scale automatic detection of ocean fronts, whereas Mask R-CNN exhibits greater potential in complex scene recognition. Integrating the structural advantages of both models holds promise for further enhancing the stability and accuracy of frontal detection, thereby offering robust technical support for ocean remote sensing analysis and environmental forecasting. Full article

(This article belongs to the Special Issue Recent Progress in Ocean Fronts)

► Show Figures

Figure 1

14 pages, 3588 KB

Open AccessArticle

Calculation of Morphological Characteristic Parameters of Sand Particles Based on Deep Learning

by Fei Li, Zhifeng Liang, Jinkai Wu, Jinan Wang and Pengda Cheng

Appl. Sci. 2026, 16(7), 3231; https://doi.org/10.3390/app16073231 - 27 Mar 2026

Viewed by 312

Abstract

For projects such as tailings ponds, slopes, and foundations, loose materials such as rock, slag, and sand, which are composed of particles, often have low cohesion and rely mainly on friction to maintain stability. The shear strength parameters, namely, the internal friction angle [...] Read more.

For projects such as tailings ponds, slopes, and foundations, loose materials such as rock, slag, and sand, which are composed of particles, often have low cohesion and rely mainly on friction to maintain stability. The shear strength parameters, namely, the internal friction angle and cohesion, are the core parameters that describe the mechanical properties of materials and are directly related to the engineering stability of the above projects. The shear strength properties of loose media are related to the geometric morphological characteristics of particles. Particles with high irregularity will increase the bite and friction of the contact interface between particles, thereby affecting the overall peak shear strength of the material. This study takes sand as the research object. Based on the Mask R-CNN algorithm in deep learning, a sand particle image dataset consisting of single, contact, and sand surface particles is established. An image segmentation model that can identify particles on the surface of the sand layer and obtain the corresponding particle mask is trained; a Python 3.11.4 program is written to automatically calculate seven characteristic parameters of particle morphological characteristics parameters, including the Feret major diameter, the particle Feret minor diameter, the particle aspect ratio, the particle roundness, the comprehensive shape coefficient, the roughness, and the convexity through the particle mask. This method can obtain the overall morphological characteristics of sand particles in real time and is a particle processing method that is a prerequisite for the subsequent rapid prediction of the strength properties of granular materials. Full article

► Show Figures

Figure 1

31 pages, 9451 KB

Open AccessArticle

Quantitative Microstructure Characterization in Additively Manufactured Nickel Alloy 625 Using Image Segmentation and Deep Learning

by Tuğrul Özel, Sijie Ding, Amit Ramasubramanian, Franco Pieri and Doruk Eskicorapci

Machines 2026, 14(4), 366; https://doi.org/10.3390/machines14040366 - 26 Mar 2026

Viewed by 428

Abstract

Laser Powder Bed Fusion for metals (PBF-LB/M) is a complex additive manufacturing process in which metal powder is selectively melted layer-by-layer to fabricate 3D parts. Process parameters critically influence the resulting microstructure in nickel alloys, with features such as melt pool marks, grain [...] Read more.

Laser Powder Bed Fusion for metals (PBF-LB/M) is a complex additive manufacturing process in which metal powder is selectively melted layer-by-layer to fabricate 3D parts. Process parameters critically influence the resulting microstructure in nickel alloys, with features such as melt pool marks, grain size and orientation, porosity, and cracks serving as key process signatures. These features are typically analyzed post-process to identify suboptimal conditions. This research aims to develop automated post-process measurement and analysis techniques using image processing, pattern recognition, and statistical learning to correlate process parameters with part quality. Optical microscopy images of build surfaces are analyzed using machine learning algorithms to evaluate porosity, grain size, and relative density in fabricated test coupons. Effect plots are generated to identify trends related to increasing energy density. A novel deep learning approach based on Mask R-CNN is used to detect and segment melt pool regions in optical microscopy images. From the segmented regions, melt pool dimensions—such as width, depth, and area—are extracted using bounding geometry coordinates. Manually labeled images (Type I and Type II) are used to train the model. A comparison between ResNet-50 and ResNet-101 backbones shows that the ResNet-50-based model (Model 2) achieves superior performance, with lower training loss (0.1781 vs. 0.1907) and validation loss (8.6140 vs. 9.4228). Quantitative evaluation using the Jaccard index, precision, and recall metrics shows that the ResNet-101 backbone outperforms ResNet-50, achieving about 4% higher mean Intersection-over-Union, with values of 0.85 for Type I and 0.82 for Type II melt pools, where Type I is detected more accurately due to its more regular morphology and clearer boundaries. By extending Faster R-CNNs with a mask prediction branch, the method allows for precise melt pool measurements, providing valuable insights into process quality and dimensional accuracy, and aiding in the detection of defects in PBF-LB-fabricated parts. Full article

(This article belongs to the Special Issue Artificial Intelligence in Mechanical Engineering Applications)

► Show Figures

Figure 1

20 pages, 7211 KB

Open AccessArticle

An Enhanced YOLO Framework for Multi-Scale Landslide Identification Under Complex Backgrounds

by Taowen Nie, Jianxing Wu, Shibin Xu and Yong Liu

Sustainability 2026, 18(7), 3205; https://doi.org/10.3390/su18073205 - 25 Mar 2026

Viewed by 318

Abstract

Deep learning has significantly improved landslide identification from remote sensing imagery, but accurately detecting multi-scale landslides under complex backgrounds remains challenging. This study proposes a lightweight YOLOv8-based model, namely YOLO-BEG, incorporating three improvements: a bidirectional feature pyramid network (BiFPN) for enhanced multi-scale feature [...] Read more.

Deep learning has significantly improved landslide identification from remote sensing imagery, but accurately detecting multi-scale landslides under complex backgrounds remains challenging. This study proposes a lightweight YOLOv8-based model, namely YOLO-BEG, incorporating three improvements: a bidirectional feature pyramid network (BiFPN) for enhanced multi-scale feature fusion, an embedded Gaussian attention system (EGS) to improve discrimination under complex backgrounds, and a generalized intersection over union (GIoU) loss to optimize boundary localization. The model was evaluated on two datasets: a vegetation-covered Southwest landslide database and the Sichuan–Tibet Highway database. On the Southwest database, YOLO-BEG improved Precision, Recall, and F1-score by 16%, 13%, and 15% compared with YOLOv8, while using only one tenth of the parameters of Mask R-CNN. In the Sichuan–Tibet Highway database, which has more diverse background conditions, YOLO-BEG outperformed Mask R-CNN and Faster R-CNN by 32% and 13% in F1-score, respectively. These results demonstrate that YOLO-BEG is able to operate with fewer parameters and yield high-precision identification of landslides with different scales under complex backgrounds, making it a rapid and accurate tool for landslide identification. Full article

(This article belongs to the Special Issue Sustainable Assessment and Risk Analysis on Landslide Hazards)

► Show Figures

Figure 1

17 pages, 5002 KB

Open AccessArticle

Comparison of Automatic Recognition Models for Building Hollow Based on Infrared Thermography

by Haohan Yao, Chengyu Liu, Dong Hu, Changyu Wu and Quansheng Lyu

Appl. Sci. 2026, 16(6), 3075; https://doi.org/10.3390/app16063075 - 23 Mar 2026

Viewed by 355

Abstract

To achieve efficient recognition of hollows in building external walls, this study adopts infrared thermography to construct a dedicated dataset and focuses on comparing the performance of three instance segmentation models: Mask R-CNN, YOLACT, and YOLOv8. Experimental results indicate that YOLACT is suitable [...] Read more.

To achieve efficient recognition of hollows in building external walls, this study adopts infrared thermography to construct a dedicated dataset and focuses on comparing the performance of three instance segmentation models: Mask R-CNN, YOLACT, and YOLOv8. Experimental results indicate that YOLACT is suitable for on-site rapid screening balancing accuracy and speed, YOLOv8 is more applicable to detection tasks requiring strict control over missed detections and accurate restoration of complex boundaries, while Mask R-CNN is better suited for non-real-time static image analysis. To further improve model performance, this paper introduces the position-sensitive attention (PSA) mechanism to YOLOv8 and trains the modified model. The improved model has achieved significant enhancements in various performance metrics. This study provides a reference scheme for the automatic detection of hollow defects and offers a basis for model selection under different application scenarios. Full article

(This article belongs to the Special Issue New Advances in Non-Destructive Testing and Evaluation)

► Show Figures

Figure 1

20 pages, 20579 KB

Open AccessArticle

A Deep Learning Approach for High-Throughput Multi-Tissue Cell Segmentation and Phenotypic Analysis in Chinese Cabbage Leaf Cross-Sections

by Zhiming Zhang, Jun Zhang, Tianyi Ren, Minggeng Liu and Lei Sun

Agronomy 2026, 16(6), 612; https://doi.org/10.3390/agronomy16060612 - 13 Mar 2026

Viewed by 423

Abstract

Quantitative analysis of leaf cell microstructure is crucial for deciphering agronomic traits in Chinese cabbage, including photosynthetic efficiency, stress tolerance, and yield potential. Traditional manual observation methods are inefficient and highly subjective, failing to meet the demands of large-scale breeding for high-throughput, reproducible [...] Read more.

Quantitative analysis of leaf cell microstructure is crucial for deciphering agronomic traits in Chinese cabbage, including photosynthetic efficiency, stress tolerance, and yield potential. Traditional manual observation methods are inefficient and highly subjective, failing to meet the demands of large-scale breeding for high-throughput, reproducible microscopic phenotyping. To transition breeding practices from experience-driven to data-driven, there is an urgent need to establish automated, standardized systems for acquiring cell-scale phenotypes. Therefore, this study proposes an automated instance segmentation and phenotyping analysis framework for multi-tissue cells in Chinese cabbage leaf cross-sections. This framework systematically optimizes Mask R-CNN by introducing an attention mechanism to enhance cellular feature responses in complex backgrounds. It employs weighted multi-scale feature fusion to process densely distributed small-scale cells and integrates a refined boundary optimization module to improve recognition accuracy in adherent and blurred regions. On a microscopic image dataset spanning multiple varieties, this method achieves high-precision predictions in instance segmentation tasks. Based on the predicted cell masks, an interactive phenotyping analysis tool was further developed to automatically extract standardized single-cell morphological parameters, including area, perimeter, and Feret’s diameter. The measured parameters exhibit high consistency with manual annotations (correlation coefficients (r) all exceed 0.97). This framework enables high-throughput, standardized phenotypic analysis at the cellular level of leaf cross-sections, providing a reliable method for the digital and automated interpretation of crop microscopic traits. This technical solution not only supports the systematic integration of microscopic phenotypes in Chinese cabbage breeding but also offers a scalable solution for cellular-scale phenotypic research in other crops. Full article

(This article belongs to the Special Issue AI, Sensors and Robotics for Smart Agriculture)

► Show Figures

Figure 1

22 pages, 17254 KB

Open AccessArticle

Landslide Susceptibility Assessment Based on a Deep Learning-Derived Landslide Inventory in Moxi Town, Sichuan, China

by Yitong Yao, Yixiang Du, Wenjun Zhang, Xianwen Liu, Jialun Cai, Hui Feng, Hongyao Xiang, Rong Hu, Yuhao Yang and Tongben Fu

Remote Sens. 2026, 18(6), 849; https://doi.org/10.3390/rs18060849 - 10 Mar 2026

Viewed by 673

Abstract

Landslides are characterized by strong suddenness and a wide range of damage; accurate prediction of their susceptibility is an important prerequisite for regional risk prevention and control. To address the difficulties in acquiring landslide inventories in complex terrain areas and the insufficient interpretability [...] Read more.

Landslides are characterized by strong suddenness and a wide range of damage; accurate prediction of their susceptibility is an important prerequisite for regional risk prevention and control. To address the difficulties in acquiring landslide inventories in complex terrain areas and the insufficient interpretability of existing prediction models, this study proposes a landslide susceptibility assessment (LSA) framework that integrates automated sample detection and interpretability analysis. The proposed framework is applied to Moxi Town, a typical alpine valley area in Sichuan Province, China. A Mask R-CNN instance segmentation model was introduced to achieve automated detection of landslide samples, resulting in a high-quality inventory containing 923 landslides. Based on the spatial relationships between the landslide inventory and influencing factors, a convolutional neural network (CNN) landslide susceptibility assessment model incorporating Shapley Additive exPlanations (SHAP) interpretability analysis was constructed. The CNN model was further compared with random forest (RF) and extreme gradient boosting (XGBoost) machine learning models. The results show that the AUC value of the CNN model has increased by 4.3% and 3.2% compared with the RF and XGBoost models, respectively, and it significantly reduces the pretzel effect of landslide susceptibility mapping (LSM). The results validate the reliability of the proposed framework, which can provide technical support for landslide disaster prevention and monitoring. Full article

(This article belongs to the Special Issue Landslide Detection Using Machine and Deep Learning)

► Show Figures

Figure 1

Search Results (709)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (709)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI