Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (125)

Search Parameters:
Keywords = VGG-UNet

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
30 pages, 47665 KB  
Article
Identification of Landslides in the Hilly Areas of Eastern China Using High-Resolution GF-2 Images and Deep Learning Models
by Xiangyu Cui, Shuo Zheng, Yanfei An, Weijia Cai and Jinlong Xu
Sustainability 2026, 18(12), 5803; https://doi.org/10.3390/su18125803 - 6 Jun 2026
Viewed by 383
Abstract
Small, dispersed, and vegetated creeping landslides in hilly areas of eastern China hinder traditional remote sensing and detection. To address this, this study takes Yixian County (Anhui Province) as a representative area. Based on high-resolution GF-2 satellite images, three deep learning models embedded [...] Read more.
Small, dispersed, and vegetated creeping landslides in hilly areas of eastern China hinder traditional remote sensing and detection. To address this, this study takes Yixian County (Anhui Province) as a representative area. Based on high-resolution GF-2 satellite images, three deep learning models embedded with the Squeeze-and-Excitation (SE) attention mechanism (ResNet18-SE, VGG13-SE, UNet-SE) were developed and compared with the original UNet model. Combined with field investigation, landslide mapping and accuracy assessment were conducted to evaluate the feature extraction capabilities of four models. The results indicate that the UNet-SE model achieved optimal performance (Precision: 0.911, Recall: 0.685, F1-score: 0.782, Kappa: 0.730, IoU: 0.643). Its F1-score exceeds ResNet18-SE, VGG13-SE, and the original UNet by 8%, 3%, and 5%, respectively, proving superior regional adaptability and generalization performance. Additional verification on creeping landslides in Kecun Town (Yixian County) and post-earthquake landslides in Lushan County (Sichuan Province) further confirms the reliability of the UNet-SE model. Furthermore, Frequency Ratio (FR), Random Forest (RF), and SHapley Additive exPlanations (SHAP) were adopted to reveal the correlation between landslide occurrence and seven geological-environmental factors. Landslides are most susceptible to develop at elevations of 400–500 m, NDVI values of 0.4–0.5, slopes below 10°, east and northeast aspects, 300–500 m away from rivers, 500–1000 m away from faults, and areas covered by soft sedimentary lithology. Distance from faults, distance from rivers, and elevation are identified as the three favorable conditional factors. In conclusion, the proposed landslide detection framework can provide reliable spatial data and technical references for regional geological hazard prevention, ecological conservation and sustainable development in hilly areas. Full article
Show Figures

Figure 1

32 pages, 75104 KB  
Article
A Feature-Optimized Deep Learning Framework for Mapping and Spatial Characterization of Tea Plantations in Complex Mountain Landscapes
by Ruyi Wang, Jixian Zhang, Xiaoping Lu, Qi Kang, Bowen Chi, Junfeng Li, Yahang Li and Zhengfang Lou
Remote Sens. 2026, 18(9), 1281; https://doi.org/10.3390/rs18091281 - 23 Apr 2026
Viewed by 275
Abstract
The unchecked expansion of tea plantations onto steep, forest-adjacent slopes in subtropical mountains engenders a conflict between agricultural productivity and ecosystem integrity, particularly by exacerbating habitat fragmentation and soil erosion. While precise monitoring is essential to navigate this trade-off for sustainable management, accurate [...] Read more.
The unchecked expansion of tea plantations onto steep, forest-adjacent slopes in subtropical mountains engenders a conflict between agricultural productivity and ecosystem integrity, particularly by exacerbating habitat fragmentation and soil erosion. While precise monitoring is essential to navigate this trade-off for sustainable management, accurate inventorying remains a challenge due to the plantations’ strong phenological variability, heterogeneous canopy structures, and high spectral confusion with surrounding vegetation. This study proposes a feature-optimized deep learning framework for mapping and characterizing tea plantations in complex landscapes, using Xinyang City, China, as a study area. The framework integrates multi-temporal Sentinel-1/2 observations with a sequential Jeffries-Matusita (JM)-Pearson feature filtering strategy. This approach effectively condenses a 132-variable high-dimensional pool (including optical spectra, vegetation indices, textures, and SAR polarimetry) into a compact 28-feature subset (a 78.8% reduction), preserving critical phenological and structural cues while minimizing redundancy. These optimized predictors drive a hybrid VGG16–UNet++ segmentation network, which couples transfer-learning-based semantic encoding with detail-preserving dense skip fusion. Extensive experiments across 18 model–feature configurations demonstrate that the optimal setting achieves an Overall Accuracy of 97.82%, an F1-score of 0.9093, and a mean IoU of 0.7968. Notably, the method significantly reduces misclassification in rugged, cloud-prone terrain, yielding a User’s Accuracy of 91.14% for tea. Based on the generated wall-to-wall map, we derived two decision-support indicators: multi-threshold steep-slope exposure and a normalized tea–forest interface density. This framework provides actionable, high-precision spatial products to support slope-based zoning, ecological restoration, and sustainable management in fragile mountain agroforestry systems. Full article
Show Figures

Figure 1

35 pages, 5739 KB  
Article
Multi-Scale Atrous Feature Fusion Based on a VGG19-UNet Encoder for Brain Tumor Segmentation
by Shoffan Saifullah and Rafał Dreżewski
Appl. Sci. 2026, 16(8), 3971; https://doi.org/10.3390/app16083971 - 19 Apr 2026
Viewed by 399
Abstract
Accurate brain tumor segmentation from magnetic resonance imaging (MRI) remains challenging due to heterogeneous tumor morphology, intensity variability, and multi-scale structural complexity. This study proposes a DeepLabV3+-based segmentation framework integrating a VGG19-UNet encoder, Atrous Spatial Pyramid Pooling (ASPP), and low-level feature refinement to [...] Read more.
Accurate brain tumor segmentation from magnetic resonance imaging (MRI) remains challenging due to heterogeneous tumor morphology, intensity variability, and multi-scale structural complexity. This study proposes a DeepLabV3+-based segmentation framework integrating a VGG19-UNet encoder, Atrous Spatial Pyramid Pooling (ASPP), and low-level feature refinement to simultaneously capture hierarchical semantics and boundary-sensitive spatial details. The architecture enhances receptive field coverage without additional downsampling while preserving fine-grained contour information during reconstruction. Extensive evaluation was conducted on the Figshare Brain Tumor Segmentation (FBTS) dataset and the BraTS 2021 and BraTS 2018 benchmarks, focusing on Whole Tumor segmentation across multiple MRI modalities and tumor grades. Under five-fold cross-validation, the proposed model achieved a mean Dice Similarity Coefficient of 0.9717 and Jaccard Index of 0.9456 on FBTS, with stable and competitive performance across FLAIR, T1, T2, and T1CE modalities in both HGG and LGG cases. Boundary-level analysis further confirmed controlled Hausdorff Distance and low Average Symmetric Surface Distance. Statistical validation and ablation analysis demonstrate consistent improvements over baseline U-Net configurations. The proposed framework provides a robust and computationally efficient solution for automated brain tumor segmentation across heterogeneous datasets. Full article
(This article belongs to the Special Issue Research on Artificial Intelligence in Healthcare)
Show Figures

Figure 1

26 pages, 4917 KB  
Article
A Comprehensive Clinical Decision Support System for the Early Diagnosis of Axial Spondyloarthritis: Multi-Sequence MRI, Clinical Risk Integration, and Explainable Segmentation
by Fatih Tarakci, Ilker Ali Ozkan, Musa Dogan, Halil Ozer, Dilek Tezcan and Sema Yilmaz
Diagnostics 2026, 16(7), 1037; https://doi.org/10.3390/diagnostics16071037 - 30 Mar 2026
Viewed by 739
Abstract
Background/Objectives: This study aims to develop a comprehensive Clinical Decision Support System (CDSS) that integrates multi-sequence sacroiliac joint (SIJ) MRIs with rheumatological, clinical, and laboratory findings into the decision-making process for the early diagnosis of axial spondyloarthritis (axSpA), incorporating segmentation-supported explainability. Methods: Multi-sequence [...] Read more.
Background/Objectives: This study aims to develop a comprehensive Clinical Decision Support System (CDSS) that integrates multi-sequence sacroiliac joint (SIJ) MRIs with rheumatological, clinical, and laboratory findings into the decision-making process for the early diagnosis of axial spondyloarthritis (axSpA), incorporating segmentation-supported explainability. Methods: Multi-sequence SIJ MRI data (T1-WI, T2-WI, STIR, and PD-WI) were analysed from 367 participants (n = 193 axSpA; n = 174 non-axSpA controls). Sequence-based classification was performed using VGG16, ResNet50, DenseNet121, and InceptionV3 models; additionally, a lightweight and parameter-efficient SacroNet architecture was developed. Slice-level probability scores were converted to patient-level scores using the Dynamic Top-K Averaging method. Image-based scores were combined with a logistic regression-based clinical risk score using weighted linear integration (0.60 image/0.40 clinical) and a conservative threshold (τ = 0.70). Grad-CAM was applied for visual interpretability. Furthermore, to support the diagnostic outcomes with precise spatial data, active inflammation in STIR and T2-WI sequences was segmented. For this purpose, the MDC-UNet model was employed and compared with baseline U-Net derivatives. Results: Sequence-specific analysis showed VGG16 performing best on T1-WI (AUC = 0.920; Accuracy = 0.878) and DenseNet121 on STIR (AUC = 0.793; Accuracy = 0.771). The SacroNet architecture provided competitive classification performance at the patient level despite its low number of parameters (~110 K). Furthermore, MDC-UNet successfully segmented active inflammation, yielding Dice scores of 0.752 (HD95: 19.25) for STIR and 0.682 (HD95: 26.21) for T2-WI. Conclusions: The findings demonstrate that patient-level decision integration based on multi-sequence MRI, when used in conjunction with clinical risk scoring and segmentation-assisted interpretability, can provide a feasible and interpretable DSS framework for the early diagnosis of axSpA. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

22 pages, 8609 KB  
Article
Integrating SimAM Attention and S-DRU Feature Reconstruction for Sentinel-2 Imagery-Based Soybean Planting Area Extraction
by Haotong Wu, Xinwen Wan, Rong Qian, Chao Ruan, Jinling Zhao and Chuanjian Wang
Agriculture 2026, 16(6), 693; https://doi.org/10.3390/agriculture16060693 - 19 Mar 2026
Cited by 1 | Viewed by 464
Abstract
Accurate and stable acquisition of the spatial distribution of soybean planting areas is essential for supporting precision agricultural monitoring and ensuring food security. However, crop remote-sensing mapping for specific regions still faces critical data bottlenecks: high-precision, large-scale pixel-level annotation is costly, resulting in [...] Read more.
Accurate and stable acquisition of the spatial distribution of soybean planting areas is essential for supporting precision agricultural monitoring and ensuring food security. However, crop remote-sensing mapping for specific regions still faces critical data bottlenecks: high-precision, large-scale pixel-level annotation is costly, resulting in scarce available labeled samples that make it difficult to construct large-scale training datasets. Although parameter-intensive models such as FCN and SegNet can achieve sufficient end-to-end training on large-scale public remote sensing datasets like LoveDA, when directly applied to the data-limited dataset in this study area, the models are prone to overfitting, leading to a significant decline in generalization ability. To address these issues, this study proposes a lightweight U-shaped semantic segmentation model, SimSDRU-Net. The model utilizes a pre-trained VGG-16 backbone to extract shallow texture and deep semantic features. The pre-trained weights mitigate the impact of overfitting in data-limited settings. In the decoding stage, a parameter-free lightweight SimAM attention module enhances effective soybean features and suppresses soil background redundancy, while an embedded S-DRU unit fuses multi-scale features for deep complementary reconstruction to improve edge detail capture. A label dataset was constructed using Sentinel-2 images as the data source and Menard County (USA) as the study area. The USDA CDL was used as a foundation for the dataset, with Google high-resolution images serving as visual interpretation aids. In the context of the experiment, Deeplabv3+ and U-Net++ were compared with U-Net under identical conditions. The results demonstrated that SimSDRU-Net exhibited optimal performance, with MIoU of 89.03%, MPA of 93.81%, and OA of 95.96%. Specifically, SimSDRU-Net uses the SimAM attention module to generate spatial attention weights by analyzing feature statistical differences through an energy function, so as to adaptively enhance soybean texture features. Meanwhile, the S-DRU unit groups, dynamically weights, and cross-branch reconstructs multi-scale convolutional features to preserve fine boundary details and achieve accurate segmentation of soybean plots. The present study demonstrates that SimSDRU-Net integrates lightweight design and high precision in data-limited scenarios, thereby providing effective technical support for the rapid extraction of soybean planting areas in North America. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

15 pages, 2914 KB  
Article
Global-Token U-Net with Hybrid Loss for Trustworthy Medical Image Super-Resolution
by Jiaqi Shang, Zhiyuan Xu and Dongdong Wang
Sensors 2026, 26(5), 1454; https://doi.org/10.3390/s26051454 - 26 Feb 2026
Viewed by 541
Abstract
Super-resolution technology significantly enhances the visual quality of low-resolution medical images, resulting in ultra-high-resolution clear images. Super-resolution technology based on artificial intelligence has achieved great success in reconstruction quality. However, like the image restoration task, super-resolution is also an ill-posed problem, and current [...] Read more.
Super-resolution technology significantly enhances the visual quality of low-resolution medical images, resulting in ultra-high-resolution clear images. Super-resolution technology based on artificial intelligence has achieved great success in reconstruction quality. However, like the image restoration task, super-resolution is also an ill-posed problem, and current work lacks consideration of trustworthiness. Medical image super-resolution needs to ensure clarity and, more importantly, to ensure that the output image is reliable and does not produce false details and mislead the diagnosis. To address the trustworthy issue of medical image super-resolution, we design a novel hybrid loss that combines a hinge-based adversarial term with a PSNR-based regularization. In the designed loss function, the adversarial term makes the reconstructed result close to the distribution of the true high-resolution image, thus generating more refined high-frequency textures, while the PSNR-based regularization term explicitly reduces the deviation from the ground truth. We apply this loss in the global-token U-Net backbone network and add a lightweight VGG as the discriminator for adversarial terms. We empirically verify that integrating the proposed methods can enhance the trustworthiness of medical image super-resolution technology while maintaining high reconstruction quality. Full article
(This article belongs to the Special Issue Sensing and Processing for Medical Imaging: Methods and Applications)
Show Figures

Figure 1

32 pages, 14017 KB  
Article
Optimized Image Segmentation Model for Pellet Microstructure Incorporating KL Divergence Constraints
by Yuwen Ai, Xia Li, Aimin Yang, Yunjie Bai and Xuezhi Wu
Mathematics 2026, 14(3), 574; https://doi.org/10.3390/math14030574 - 5 Feb 2026
Viewed by 499
Abstract
Accurate segmentation of pellet microstructure images is crucial for evaluating their metallurgical performance and optimizing production processes. To address the challenges posed by complex structures, blurred boundaries, and fine-grained textures of hematite and magnetite in pellet micrographs, this study proposes a hybrid intelligently [...] Read more.
Accurate segmentation of pellet microstructure images is crucial for evaluating their metallurgical performance and optimizing production processes. To address the challenges posed by complex structures, blurred boundaries, and fine-grained textures of hematite and magnetite in pellet micrographs, this study proposes a hybrid intelligently optimized VGG16-U-Net semantic segmentation model. The model incorporates an improved SPC-SA channel self-attention mechanism in the encoder to enhance deep feature representation, while a simplified SAN and SAW module is integrated into the decoder to strengthen its response to key mineral regions. Additionally, a hybrid loss strategy is employed with KL regularization for training optimization. Experimental results show that the model achieves an mIoU of 85.58%, an mPA of 91.54%, and an overall accuracy of 93.58%. Compared with the baseline models, the proposed method achieves improved performance to some extent. Full article
(This article belongs to the Special Issue Mathematical Methods for Image Processing and Computer Vision)
Show Figures

Figure 1

18 pages, 14590 KB  
Article
VTC-Net: A Semantic Segmentation Network for Ore Particles Integrating Transformer and Convolutional Block Attention Module (CBAM)
by Yijing Wu, Weinong Liang, Jiandong Fang, Chunxia Zhou and Xiaolu Sun
Sensors 2026, 26(3), 787; https://doi.org/10.3390/s26030787 - 24 Jan 2026
Cited by 2 | Viewed by 558
Abstract
In mineral processing, visual-based online particle size analysis systems depend on high-precision image segmentation to accurately quantify ore particle size distribution, thereby optimizing crushing and sorting operations. However, due to multi-scale variations, severe adhesion, and occlusion within ore particle clusters, existing segmentation models [...] Read more.
In mineral processing, visual-based online particle size analysis systems depend on high-precision image segmentation to accurately quantify ore particle size distribution, thereby optimizing crushing and sorting operations. However, due to multi-scale variations, severe adhesion, and occlusion within ore particle clusters, existing segmentation models often exhibit undersegmentation and misclassification, leading to blurred boundaries and limited generalization. To address these challenges, this paper proposes a novel semantic segmentation model named VTC-Net. The model employs VGG16 as the backbone encoder, integrates Transformer modules in deeper layers to capture global contextual dependencies, and incorporates a Convolutional Block Attention Module (CBAM) at the fourth stage to enhance focus on critical regions such as adhesion edges. BatchNorm layers are used to stabilize training. Experiments on ore image datasets show that VTC-Net outperforms mainstream models such as UNet and DeepLabV3 in key metrics, including MIoU (89.90%) and pixel accuracy (96.80%). Ablation studies confirm the effectiveness and complementary role of each module. Visual analysis further demonstrates that the model identifies ore contours and adhesion areas more accurately, significantly improving segmentation robustness and precision under complex operational conditions. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

19 pages, 6416 KB  
Article
A Seismic Horizon Identification Method Based on scSE-VGG16-UNet++
by Qin Wang, Cai Liu, Yang Liu, Jiaqi Fan, Dian Wang, Qi Lu and Peng Li
Appl. Sci. 2026, 16(1), 394; https://doi.org/10.3390/app16010394 - 30 Dec 2025
Viewed by 726
Abstract
The accuracy of seismic horizon identification results significantly impacts the precision of subsequent structural interpretation and reservoir analysis, making it a critical step in seismic data interpretation. While deep learning achieves horizon identification by establishing a mapping relationship between seismic data and training [...] Read more.
The accuracy of seismic horizon identification results significantly impacts the precision of subsequent structural interpretation and reservoir analysis, making it a critical step in seismic data interpretation. While deep learning achieves horizon identification by establishing a mapping relationship between seismic data and training labels, networks often struggle to learn effectively when faced with highly similar profile features within the same survey area, leading to issues such as discontinuities and horizon mis-ties. To fully leverage the spatial relationships between seismic horizons and enhance network feature extraction capabilities, this paper proposes a progressive methodology based on the VGG16-UNet architecture. This paper first employs data augmentation to enrich seismic profile features and then introduces the Spatial and Channel Squeeze and Excitation (scSE) attention mechanism to improve the representation of salient features, designing the scSE-VGG16-UNet architecture. Building on this foundation, the deep supervision mechanism from UNet++ is integrated to finally propose the data-augmented scSE-VGG16-UNet++ method for seismic horizon identification. Quantitatively, the baseline VGG16-UNet method achieves a Pixel Accuracy (PA) and mean Intersection over Union (mIoU) of only 93.01% and 46.96%, respectively. Through the synergistic effect of data augmentation, the scSE attention mechanism, and the UNet++ architecture, the final data-augmented scSE-VGG16-UNet++ method elevates these metrics to 97.70% and 79.23%. The test results on both model data and field data demonstrate that the proposed method not only achieves high-continuity horizon identification and effectively mitigates the problem of horizon mis-ties, but also exhibits robust performance in the presence of noise, indicating superior generalization ability. Full article
Show Figures

Figure 1

34 pages, 2749 KB  
Review
Exploring Structural Health Monitoring of Buildings: State of the Art on Techniques and Future Directions
by M. Kalai Selvi, R. Manjula Devi, K. S. Elango, S. Anandaraj, G. Sindhu Priya, S. Shaniya and P. Manoj Kumar
Buildings 2026, 16(1), 154; https://doi.org/10.3390/buildings16010154 - 29 Dec 2025
Cited by 3 | Viewed by 2666
Abstract
Structural deterioration inevitably leads to defects in buildings. It is primarily caused by environmental exposure, material ageing, and long-term service conditions, whereas defects such as poor soil compaction arise from improper construction practices rather than deterioration mechanisms. Major concrete defects include missing portions [...] Read more.
Structural deterioration inevitably leads to defects in buildings. It is primarily caused by environmental exposure, material ageing, and long-term service conditions, whereas defects such as poor soil compaction arise from improper construction practices rather than deterioration mechanisms. Major concrete defects include missing portions such as cracking, corrosion, dents, blemishes, and spalling. Failure to identify minor issues can lead to serious problems, which become more expensive and difficult to repair, as well as poorer overall building performance. Traditional structural assessment methods, such as visual inspections and non-destructive testing are typically used for periodic condition evaluation, whereas SHM involves continuous or long-term monitoring using sensor-based systems. However, such approaches can be manual, costly, dangerous, and biased. In order to overcome these limitations, contemporary SHM systems combine traditional approaches with building information modelling (BIM) and artificial intelligence (AI). Different AI algorithms are used, including SVM, random forest, regression, and KNN for machine learning and decision trees; random forest, K-means clustering, CNN, U-Net, ResNet, FCN, VGG16, and DeepLabv3+ for deep learning. This review will survey both the traditional and novel approaches in the field of SHM and the recent advancements. Full article
(This article belongs to the Section Building Structures)
Show Figures

Figure 1

22 pages, 5552 KB  
Article
MSA-UNet: Multiscale Feature Aggregation with Attentive Skip Connections for Precise Building Extraction
by Guobiao Yao, Yan Chen, Wenxiao Sun, Zeyu Zhang, Yifei Tang and Jingxue Bi
ISPRS Int. J. Geo-Inf. 2025, 14(12), 497; https://doi.org/10.3390/ijgi14120497 - 17 Dec 2025
Viewed by 909
Abstract
An accurate and reliable extraction of building structures from high-resolution (HR) remote sensing images is an important research topic in 3D cartography and smart city construction. However, despite the strong overall performance of recent deep learning models, limitations remain in handling significant variations [...] Read more.
An accurate and reliable extraction of building structures from high-resolution (HR) remote sensing images is an important research topic in 3D cartography and smart city construction. However, despite the strong overall performance of recent deep learning models, limitations remain in handling significant variations in building scales and complex architectural forms, which may lead to inaccurate boundaries or difficulties in extracting small or irregular structures. Therefore, the present study proposes MSA-UNet, a reliable semantic segmentation framework that leverages multiscale feature aggregation and attentive skip connections for an accurate extraction of building footprints. This framework is constructed based on the U-Net architecture, incorporating VGG16 as a replacement for the original encoder structure, which enhances its ability to capture low-discriminative features. To further improve the representation of image buildings with different scales and shapes, a serial coarse-to-fine feature aggregation mechanism was used. Additionally, a novel skip connection was built between the encoder and decoder layers to enable adaptive weights. Furthermore, a dual-attention mechanism, implemented through the convolutional block attention module, was integrated to enhance the focus of the network on building regions. Extensive experiments conducted on the WHU and Inria building datasets validated the effectiveness of MSA-UNet. On the WHU dataset, the model demonstrated a state-of-the-art performance with a mean Intersection over Union (mIoU) of 94.26%, accuracy of 98.32%, F1-score of 96.57%, and mean Pixel accuracy (mPA) of 96.85%, corresponding to gains of 1.41% in mIoU over the baseline U-Net. On the more challenging Inria dataset, MSA-UNet achieved an mIoU of 85.92%, indicating a consistent improvement of up to 1.9% over the baseline U-Net. These results confirmed that MSA-UNet markedly improved the accuracy and boundary integrity of building extraction from HR data, outperforming existing classic models in terms of segmentation quality and robustness. Full article
(This article belongs to the Special Issue Spatial Data Science and Knowledge Discovery)
Show Figures

Figure 1

14 pages, 835 KB  
Article
Prediction of Lymphovascular Invasion in Early–Stage Lung Adenocarcinoma Using Artificial Intelligence–Based Radiomics
by Yoshihisa Shimada, Kazuharu Harada, Yujin Kudo, Jinho Park, Jun Matsubayashi, Masataka Taguri and Norihiko Ikeda
Cancers 2025, 17(24), 3998; https://doi.org/10.3390/cancers17243998 - 15 Dec 2025
Viewed by 765
Abstract
Objectives: This study utilized artificial intelligence (AI)–based radiomics analysis of computed tomography (CT) images using a modified U–Net for lung nodule segmentation and convolutional neural network based on VGG–16 to predict lymphovascular invasion (LVI) in stage 0–I lung adenocarcinoma. Additionally, the study investigated [...] Read more.
Objectives: This study utilized artificial intelligence (AI)–based radiomics analysis of computed tomography (CT) images using a modified U–Net for lung nodule segmentation and convolutional neural network based on VGG–16 to predict lymphovascular invasion (LVI) in stage 0–I lung adenocarcinoma. Additionally, the study investigated whether combining radiomics data with serum microRNA (miR)–30d level as a potential biomarker could enhance predictive performance. Methods: A total of 1265 patients who underwent complete resection between 2008 and 2018 were included. AI–based CT analysis was performed, and logistic regression was applied to predict LVI using 35 imaging features. A risk score (RS) generated from 840 patients in the derivation cohort was used to identify a high–risk group, with validation performed using 425 patients. Additionally, 47 cases with extracellular vesicle (EV)–derived miR–30d level data were analyzed to evaluate the value of the integrated approach. Results: Among all the patients, 467 patients (36.9%) were LVI–positive, and LVI was independently associated with poorer overall survival. The receiver operating characteristic curve for LVI based on the RS yielded an area under the curve of 0.899. For LVI prediction, the sensitivity, specificity, and accuracy were 84.8%, 83.7%, and 83.9%, respectively, in the derivation group, and 82.3%, 79.4%, and 80.5%, respectively, in the validation group. The integrated approach with miR–30d enhanced the predictability of LVI, achieving a sensitivity of 93.3%, specificity of 70.5%, and accuracy of 85.1%. Conclusions: AI–based radiomics demonstrated high effectiveness for predicting LVI, with RSs showing broad clinical applications. The addition of EV–derived miR–30d modestly improved predictability. Full article
Show Figures

Figure 1

20 pages, 4879 KB  
Article
A Multi-Phenotype Acquisition System for Pleurotus eryngii Based on RGB and Depth Imaging
by Yueyue Cai, Zhijun Wang, Ziqin Liao, Yujie Li, Weijie Shi, Peijie Huang, Bingzhi Chen, Jie Pang, Xiangzeng Kong and Xuan Wei
Agriculture 2025, 15(24), 2566; https://doi.org/10.3390/agriculture15242566 - 11 Dec 2025
Viewed by 698
Abstract
High-throughput phenotypic acquisition and analysis allow us to accurately quantify trait expressions, which is essential for developing intelligent breeding strategies. However, there is still much potential to explore in the field of high-throughput phenotyping for edible fungi. In this study, we developed a [...] Read more.
High-throughput phenotypic acquisition and analysis allow us to accurately quantify trait expressions, which is essential for developing intelligent breeding strategies. However, there is still much potential to explore in the field of high-throughput phenotyping for edible fungi. In this study, we developed a portable multi-phenotypic acquisition system for Pleurotus eryngii using RGB and RGB-D cameras. We developed an innovative Unet-based semantic segmentation model by integrating the ASPP structure with the VGG16 architecture. This allows for precise segmentation of the cap, gills and stem of the fruiting body. By leveraging depth images from RGB-D cameras, we can effectively collect phenotypic information about Pleurotus eryngii. By combining K-means clustering with Lab color space thresholds, we are able to achieve more precise automatic classification of Pleurotus eryngii cap colors. Moreover, AlexNet is utilized to classify the shapes of the fruiting bodies. The Aspp-VGGUnet network demonstrates remarkable performance with a mean Intersection over Union (mIoU) of 96.47% and a mean pixel accuracy (mPA) of 98.53%. These results reflect respective improvements of 3.03% and 2.23% compared to the standard Unet model, respectively. The average error in size phenotype measurement is just 0.15 ± 0.03 cm. The accuracy for cap color classification reaches 91.04%, while fruiting body shape classification achieves 97.90%. The proposed multi-phenotype acquisition system reduces the measurement time per sample from an average of 76 s (manual method) to about 2 s, substantially increasing data acquisition throughput and providing robust support for scalable phenotyping workflows in breeding research. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Graphical abstract

22 pages, 2396 KB  
Article
CHROM-Y: Illumination-Adaptive Robust Remote Photoplethysmography Through 2D Chrominance–Luminance Fusion and Convolutional Neural Networks
by Mohammed Javidh, Ruchi Shah, Mohan Uma, Sethuramalingam Prabhu and Rajendran Beaulah Jeyavathana
Signals 2025, 6(4), 72; https://doi.org/10.3390/signals6040072 - 9 Dec 2025
Viewed by 1840
Abstract
Remote photoplethysmography (rPPG) enables non-contact heart rate estimation but remains highly sensitive to illumination variation and dataset-dependent factors. This study proposes CHROM-Y, a robust 2D feature representation that combines chrominance (Ω, Φ) with luminance (Y) to improve physiological signal extraction under varying lighting [...] Read more.
Remote photoplethysmography (rPPG) enables non-contact heart rate estimation but remains highly sensitive to illumination variation and dataset-dependent factors. This study proposes CHROM-Y, a robust 2D feature representation that combines chrominance (Ω, Φ) with luminance (Y) to improve physiological signal extraction under varying lighting conditions. The proposed features were evaluated using U-Net, ResNet-18, and VGG16 for heart rate estimation and waveform reconstruction on the UBFC-rPPG and BhRPPG datasets. On UBFC-rPPG, U-Net with CHROM-Y achieved the best performance with a Peak MAE of 3.62 bpm and RMSE of 6.67 bpm, while ablation experiments confirmed the importance of the Y-channel, showing degradation of up to 41.14% in MAE when removed. Although waveform reconstruction demonstrated low Pearson correlation, dominant frequency preservation enabled reliable frequency-based HR estimation. Cross-dataset evaluation revealed reduced generalization (MAE up to 13.33 bpm and RMSE up to 22.80 bpm), highlighting sensitivity to domain shifts. However, fine-tuning U-Net on BhRPPG produced consistent improvements across low, medium, and high illumination levels, with performance gains of 11.18–29.47% in MAE and 12.48–27.94% in RMSE, indicating improved adaptability to illumination variations. Full article
Show Figures

Figure 1

25 pages, 7527 KB  
Article
A Multifocal RSSeg Approach for Skeletal Age Estimation in an Indian Medicolegal Perspective
by Priyanka Manchegowda, Manohar Nageshmurthy, Suresha Raju and Dayananda Rudrappa
Algorithms 2025, 18(12), 765; https://doi.org/10.3390/a18120765 - 4 Dec 2025
Cited by 1 | Viewed by 892
Abstract
Estimating bone age is essential for accurate diagnoses, appropriate care based on biological age, and fairness in legal matters. In the Indian medicolegal context, determining age through a clinical approach involves analyzing multiple joints; however, the traditional method can be tedious and subjective, [...] Read more.
Estimating bone age is essential for accurate diagnoses, appropriate care based on biological age, and fairness in legal matters. In the Indian medicolegal context, determining age through a clinical approach involves analyzing multiple joints; however, the traditional method can be tedious and subjective, relying heavily on human expertise, which may lead to biased decisions in age-related legal disputes. Moreover, commonly used radiographs often exhibit pixel-level variations due to heterogeneous contrast, which complicate segmentation tasks and lead to inconsistencies and reduced model performance. The study presents a multifocal region-based symbolic segmentation technique to automatically retain the soft-tissue region that harbors a growth pattern of an ossification center. Experimental results demonstrate an 84.5% Jaccard similarity, an 81.4% Dice coefficient, an 88.3% precision, a 90.0% recall, and a 91.5% pixel accuracy on a novel multifocal dataset of Indian inhabitants. The proposed segmentation technique outperforms U-Net, Attention U-Net, TransU-Net, DeepLabV3+, Adaptive Otsu, and Watershed segmentation in terms of accuracy, indicating strong generalizability across joints and improving reliability. Compared with 86.4% without segmentation, the proposed integration of segmentation with VGG16 classification increases the overall accuracy to 93.8%, demonstrating that target-focused-region processing reduces unnecessary computations and improves feature discrimination without sacrificing accuracy. Full article
(This article belongs to the Special Issue Machine Learning in Medical Signal and Image Processing (4th Edition))
Show Figures

Figure 1

Back to TopTop