Search Results (9,719)

Search Parameters:
Keywords = image similarity

19 pages, 2437 KB  
Article
Effects of Agricultural Production Patterns on Surface Water Quality in Central China’s Irrigation Districts: A Case Study of the Four Lakes Basin
by Yanping Hu, Zhenhua Wang, Dongguo Shao, Rui Li, Wei Zhang, Meng Long, Kezheng Song and Xiaohuan Cao
Sustainability 2025, 17(19), 8838; https://doi.org/10.3390/su17198838 (registering DOI) - 2 Oct 2025
Abstract
To explore the coupling between agricultural farming models and the surface water environment in central China’s irrigation districts, this study focuses on the Four Lakes Basin within the Jianghan Plain, a key grain-producing and ecological protection area. Integrating remote sensing images, statistical yearbooks, and on-site monitoring data, the study analyzed the phased characteristics of the basin’s agricultural pattern transformation, the changes in non-point source nitrogen and phosphorus loads, and the responses of water quality in main canals and Honghu Lake to agricultural adjustments during the period 2010–2023. The results showed that the basin underwent a significant transformation in agricultural patterns from 2016 to 2023: the area of rice-crayfish farming increased by 14%, while the areas of dryland crops and freshwater aquaculture decreased by 11% and 4%, respectively. Correspondingly, the non-point source nitrogen and phosphorus loads in the Four Lakes Basin decreased by 11–13%, and the nitrogen and phosphorus concentrations in main canals decreased slightly by approximately 2 mg/L and 0.04 mg/L, respectively; however, the water quality of Honghu Lake continued to deteriorate, with nitrogen and phosphorus concentrations increasing by approximately 0.46 mg/L and 0.06 mg/L, respectively. This indicated that the adjustment of agricultural farming models was beneficial to improving the water quality of main canals, but it did not bring about a substantial improvement in the sustainable development of Honghu Lake. This may be related to various factors that undermine the sustainability of the lake’s aquatic ecological environment, such as climate change, natural disasters, internal nutrient release from sediments, and the decline in water environment carrying capacity. Therefore, to advance sustainability in this basin and similar irrigation districts, future efforts should continue optimizing agricultural models to reduce nitrogen/phosphorus inputs, while further mitigating internal nutrient release and climate disaster risks, restoring aquatic vegetation, and enhancing water environment carrying capacity. Full article

19 pages, 2476 KB  
Article
Deep Reinforcement Learning-Based DCT Image Steganography
by Rongjian Yang, Lixin Liu, Bin Han and Feng Hu
Mathematics 2025, 13(19), 3150; https://doi.org/10.3390/math13193150 - 2 Oct 2025
Abstract
In this article, we present a novel reinforcement learning-based framework in the discrete cosine transform to achieve better image steganography. First, the input image is divided into several blocks to extract semantic and structural features, evaluating their suitability for data embedding. Second, the Proximal Policy Optimization algorithm (PPO) is introduced in the block selection process to learn adaptive embedding policies, which effectively balances image fidelity and steganographic security. Moreover, the Deep Q-network (DQN) is used for adaptively adjusting the weights of the peak signal-to-noise ratio, structural similarity index, and detection accuracy in the reward formulation. Experimental results on the BOSSBase dataset confirm the superiority of our framework, achieving both lower detection rates and higher visual quality across a range of embedding payloads, particularly under low-bpp conditions. Full article
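The reward described in the abstract above combines peak signal-to-noise ratio, structural similarity, and detection accuracy. A minimal sketch of that idea follows. The function names, the fixed weights, and the PSNR cap are assumptions made for illustration; in the paper the weights are adjusted adaptively by a DQN, and the exact reward formula is not given in the abstract.

```python
import math


def psnr(cover, stego, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(cover) != len(stego):
        raise ValueError("images must have the same number of pixels")
    mse = sum((c - s) ** 2 for c, s in zip(cover, stego)) / len(cover)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


def reward(psnr_db, ssim, detection_accuracy,
           weights=(0.4, 0.3, 0.3), psnr_cap=60.0):
    """Hypothetical scalar reward: fidelity raises it, detectability lowers it."""
    w_psnr, w_ssim, w_det = weights
    psnr_term = min(psnr_db, psnr_cap) / psnr_cap  # squash PSNR into [0, 1]
    return w_psnr * psnr_term + w_ssim * ssim - w_det * detection_accuracy


# Toy 8-pixel block; three pixels changed by one grey level (illustrative only).
cover = [52, 55, 61, 66, 70, 61, 64, 73]
stego = [52, 54, 61, 67, 70, 61, 65, 73]
quality = psnr(cover, stego)
print(round(quality, 2))
print(round(reward(quality, ssim=0.99, detection_accuracy=0.55), 4))
```

The SSIM and detection-accuracy values are passed in as plain numbers here; in practice they would come from an SSIM implementation and a steganalysis classifier.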

20 pages, 10152 KB  
Article
In Vivo Comparison of Resin-Modified and Pure Calcium-Silicate Cements for Direct Pulp Capping
by Fatma Fenesha, Aonjittra Phanrungsuwan, Brian L. Foster, Anibal Diogenes and Sarah B. Peters
Appl. Sci. 2025, 15(19), 10639; https://doi.org/10.3390/app151910639 - 1 Oct 2025
Abstract
Introduction: Direct pulp capping (DPC) aims to preserve the vitality of the dental pulp by placing a protective biocompatible material over the exposed pulp tissue to facilitate healing. There are several calcium-silicate materials that have been designed to promote mineralization and the regulation of inflammation. These have strong potential for the repair and regeneration of dental pulp. Among them, Biodentine (BD) and EndoSequence RRM Putty (ES) have been found to promote in vitro and in vivo mineralization while minimizing some of the limitations of the first-generation calcium-silicate-based materials. Theracal-LC (TLC), a light-cured, resin-modified calcium-silicate material, is a newer product with potential to improve the clinical outcomes of DPC, but existing studies have reported conflicting findings regarding its biocompatibility and ability to support pulpal healing in direct contact with the pulp. A comprehensive assessment of the biocompatibility and pulpal protection provided by these three capping materials has not yet been performed. Aim: We aimed to quantify the inflammatory response, dentin bridge formation, and material adaptation following DPC using three calcium-silicate materials: ES, BD, and TLC. Materials and Methods: DPC was performed on the maxillary first molar of C57BL/6 female mice. Maxillae were collected and processed at 1 and 21 days post-DPC. The early inflammatory response was measured 24 h post-procedure using confocal imaging of anti-Lys6G6C, which indicates the extent of neutrophil and monocyte infiltration. Reparative mineralized bridge formation was assessed at 21 days post-procedure using high-resolution micro-computed tomography (micro-CT) and histology. Lastly, the homogeneity of the capping materials was evaluated by quantifying voids in calcium-silicate restorations using micro-CT. Results: DPC using TLC induced less infiltration of Lys6G6C+ cells at 24 h than BD or ES. BD promoted higher volumes of tertiary dentin than TLC, but TLC and ES showed no significant differences in volume. No differences were observed in material adaptation and void spaces among the three capping materials. Conclusions: All three materials under investigation supported pulp healing and maintained marginal integrity. However, TLC induced a lower inflammatory response on day 1 and induced similar levels of tertiary dentin to ES. These observations challenge the common perception that resin-based capping materials are not suitable for direct pulp capping. Our findings underscore the need to balance biological responses with physical properties when selecting pulp capping materials to improve long-term clinical success. Full article

33 pages, 4190 KB  
Article
Preserving Songket Heritage Through Intelligent Image Retrieval: A PCA and QGD-Rotational-Based Model
by Nadiah Yusof, Nazatul Aini Abd. Majid, Amirah Ismail and Nor Hidayah Hussain
Computers 2025, 14(10), 416; https://doi.org/10.3390/computers14100416 - 1 Oct 2025
Abstract
Malay songket motifs are a vital component of Malaysia’s intangible cultural heritage, characterized by intricate visual designs and deep cultural symbolism. However, the practical digital preservation and retrieval of these motifs present challenges, particularly due to the rotational variations typical in textile imagery. This study introduces a novel Content-Based Image Retrieval (CBIR) model that integrates Principal Component Analysis (PCA) for feature extraction and Quadratic Geometric Distance (QGD) for measuring similarity. To evaluate the model’s performance, a curated dataset comprising 413 original images and 4956 synthetically rotated songket motif images was utilized. The retrieval system featured metadata-driven preprocessing, dimensionality reduction, and multi-angle similarity assessment to address the issue of rotational invariance comprehensively. Quantitative evaluations using precision, recall, and F-measure metrics demonstrated that the proposed PCAQGD + Rotation technique achieved a mean F-measure of 59.72%, surpassing four benchmark retrieval methods. These findings confirm the model’s capability to accurately retrieve relevant motifs across varying orientations, thus supporting cultural heritage preservation efforts. The integration of PCA and QGD techniques effectively narrows the semantic gap between machine perception and human interpretation of motif designs. Future research should focus on expanding motif datasets and incorporating deep learning approaches to enhance retrieval precision, scalability, and applicability within larger national heritage repositories. Full article
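The abstract above reports retrieval quality as precision, recall, and F-measure. As a reminder of how those three numbers relate for a single query, here is a small sketch. The function name and the motif identifiers are hypothetical, and the paper's own averaging protocol across queries is not described in the abstract.

```python
def retrieval_scores(retrieved, relevant):
    """Return (precision, recall, F-measure) for one query.

    retrieved: identifiers returned by the retrieval system.
    relevant:  identifiers that are truly relevant to the query.
    """
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical query: five motifs retrieved, four are truly relevant, three overlap.
retrieved = ["m01", "m02", "m07", "m09", "m12"]
relevant = ["m01", "m02", "m03", "m09"]
precision, recall, f_measure = retrieval_scores(retrieved, relevant)
print(precision, recall, round(f_measure, 4))
```

A mean F-measure such as the one quoted in the abstract would then be an average of per-query values of this kind, under whatever query set the authors used.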

32 pages, 9105 KB  
Article
Development of Semi-Automatic Dental Image Segmentation Workflows with Root Canal Recognition for Faster Ground Tooth Acquisition
by Yousef Abo El Ela and Mohamed Badran
J. Imaging 2025, 11(10), 340; https://doi.org/10.3390/jimaging11100340 - 1 Oct 2025
Abstract
This paper investigates the application of image segmentation techniques in endodontics, focusing on improving diagnostic accuracy and achieving faster segmentation by delineating specific dental regions such as teeth and root canals. Deep learning architectures, notably 3D U-Net and GANs, have advanced the image segmentation process for dental structures, supporting more precise dental procedures. However, challenges like the demand for extensive labeled datasets and ensuring model generalizability remain. Two semi-automatic segmentation workflows, Grow From Seeds (GFS) and Watershed (WS), were developed to provide quicker acquisition of ground truth training data for deep learning models using 3D Slicer software version 5.8.1. These workflows were evaluated against a manual segmentation benchmark and a recent dental segmentation automated tool on three separate datasets. The evaluations were performed by the overall shapes of a maxillary central incisor and a maxillary second molar and by the region of the root canal of both teeth. Results from Kruskal–Wallis and Nemenyi tests indicated that the semi-automated workflows, more often than not, were not statistically different from the manual benchmark based on dice coefficient similarity, while the automated method consistently provided significantly different 3D models from their manual counterparts. The study also explores the benefits of labor reduction and time savings achieved by the semi-automated methods. Full article
(This article belongs to the Section Image and Video Processing)
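The comparison in the abstract above rests on the Dice similarity coefficient between a workflow's segmentation and the manual benchmark. A minimal sketch on flattened binary masks is shown below; the function name and the toy masks are illustrative assumptions, and real 3D Slicer label maps would first need to be exported and flattened.

```python
def dice_coefficient(mask_a, mask_b):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flat binary masks."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same number of voxels")
    intersection = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    total = sum(1 for a in mask_a if a) + sum(1 for b in mask_b if b)
    if total == 0:
        return 1.0  # both masks empty: treat as perfect agreement
    return 2.0 * intersection / total


# Toy 8-voxel example (illustrative only).
manual_mask = [0, 1, 1, 1, 0, 0, 1, 1]
semi_auto_mask = [0, 1, 1, 0, 0, 1, 1, 1]
print(dice_coefficient(manual_mask, semi_auto_mask))
```

Values near 1 mean the two segmentations label almost the same voxels; the statistical tests mentioned in the abstract are then applied to collections of such scores.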

18 pages, 4927 KB  
Article
Automated Grading of Boiled Shrimp by Color Level Using Image Processing Techniques and Mask R-CNN with Feature Pyramid Networks
by Manit Chansuparp, Nantipa Pansawat and Sansanee Wangvoralak
Appl. Sci. 2025, 15(19), 10632; https://doi.org/10.3390/app151910632 - 1 Oct 2025
Abstract
Color grading of boiled shrimp is a critical factor influencing market price, yet the process is usually conducted visually by buyers such as middlemen and processing plants. This subjective practice raises concerns about accuracy, impartiality, and fairness, often resulting in disputes with farmers. To address this issue, this study proposes a standardized and automated grading approach based on image processing and artificial intelligence. The method requires only a photograph of boiled shrimp placed alongside a color grading ruler. The grading process involves two stages: segmentation of shrimp and ruler regions in the image, followed by color comparison. For segmentation, deep learning models based on Mask R-CNN with a Feature Pyramid Network backbone were employed. Four model configurations were tested, using ResNet and ResNeXt backbones with and without a Boundary Loss function. Results show that the ResNet + Boundary Loss model achieved the highest segmentation performance, with IoU scores of 91.2% for shrimp and 87.8% for the color ruler. In the grading step, color similarity was evaluated in the CIELAB color space by computing Euclidean distances in the L (lightness) and a (red–green) channels, which align closely with human perception of shrimp coloration. The system achieved grading accuracy comparable to human experts, with a mean absolute error of 1.2, demonstrating its potential to provide consistent, objective, and transparent shrimp quality assessment. Full article
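The grading step in the abstract above compares colors by Euclidean distance over the L and a channels of CIELAB. The sketch below shows one way that comparison could look, assuming 8-bit sRGB input and a D65 white point. The ruler colors and grade labels are made up for illustration, and the paper's own pre-processing (for example averaging over the segmented shrimp region) is not reproduced.

```python
import math


def srgb_to_lab(rgb):
    """Convert an 8-bit sRGB triple to CIELAB (D65 reference white)."""
    def linearize(channel):
        c = channel / 255.0
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linearize(v) for v in rgb)
    x = 0.4124564 * r + 0.3575761 * g + 0.1804375 * b
    y = 0.2126729 * r + 0.7151522 * g + 0.0721750 * b
    z = 0.0193339 * r + 0.1191920 * g + 0.9503041 * b

    def f(t):
        delta = 6.0 / 29.0
        return t ** (1.0 / 3.0) if t > delta ** 3 else t / (3 * delta ** 2) + 4.0 / 29.0

    fx, fy, fz = f(x / 0.95047), f(y / 1.0), f(z / 1.08883)
    return 116.0 * fy - 16.0, 500.0 * (fx - fy), 200.0 * (fy - fz)


def la_distance(rgb_1, rgb_2):
    """Euclidean distance using only the L (lightness) and a (red-green) channels."""
    l1, a1, _ = srgb_to_lab(rgb_1)
    l2, a2, _ = srgb_to_lab(rgb_2)
    return math.hypot(l1 - l2, a1 - a2)


def nearest_grade(shrimp_rgb, ruler):
    """Return the ruler grade whose color is closest to the shrimp color."""
    return min(ruler, key=lambda grade: la_distance(shrimp_rgb, ruler[grade]))


# Hypothetical ruler: grade label -> mean sRGB color of that ruler patch.
ruler = {20: (240, 150, 120), 24: (235, 120, 90), 28: (225, 90, 60)}
print(nearest_grade((235, 120, 90), ruler))
```

Dropping the b channel, as the abstract describes, means that yellow-blue differences between shrimp and ruler do not influence the assigned grade.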

10 pages, 298 KB  
Article
Exploring Visualization of Beverage Consistency Through 2D and 3D Imaging Methods
by Jane Mertz Garcia, Edgar Chambers, Madison Ukele, Abby Brey Althauser and David Rehfeld
Beverages 2025, 11(5), 141; https://doi.org/10.3390/beverages11050141 - 1 Oct 2025
Abstract
Modification to a thin beverage consistency is frequently recommended when swallowing is impaired, but proper thickening during preparation is essential. Contemporary technologies provide innovative ways of addressing quality control due to ongoing challenges in the accuracy of beverage preparation. This study explored two-dimensional (2D) and three-dimensional (3D) methods for visualizing beverages representing different levels of consistency (thin, mildly thick, or moderately thick). A total of 48 adults with limited knowledge about swallowing and no experience with thickened beverages participated. They learned about levels of modification and then viewed 2D images (photos) and 3D virtual models of beverage content. Results showed that their ability to recognize beverage consistency and their decision confidence was generally similar across dimensions even though study participants conveyed a strong preference for viewing 3D models. Qualitative findings underscored the importance of beverage attributes, especially color. Participants differed in their perceptions in using a constant (2D) or multiple angles (3D) when evaluating visualizations. Results help inform about the potential role of visual content in developing instructional resources about thickened beverages prescribed for patients with special medical needs. Full article
(This article belongs to the Section Quality, Nutrition, and Chemistry of Beverages)

37 pages, 87459 KB  
Article
SYNOSIS: Image Synthesis Pipeline for Machine Vision in Metal Surface Inspection
by Juraj Fulir, Natascha Jeziorski, Lovro Bosnar, Hans Hagen, Claudia Redenbach, Tobias Herrfurth, Marcus Trost, Thomas Gischkat and Petra Gospodnetić
Sensors 2025, 25(19), 6016; https://doi.org/10.3390/s25196016 - 30 Sep 2025
Abstract
The use of machine learning methods for the development of robust and flexible visual inspection systems has shown promising results. However, their performance is highly dependent on the large amount and diversity of training data, which is difficult to obtain in practice. Recent developments in synthetic dataset generation have seen increasing success in overcoming these problems. However, the prevailing work revolves around the usage of generative models, which suffer from data shortages, hallucinations, and provide limited support for unobserved edge-cases. In this work, we present the first synthetic data generation pipeline that is capable of generating large datasets of physically realistic textures exhibiting sophisticated structured patterns. Our framework is based on procedural texture modelling with interpretable parameters, uniquely allowing us to guarantee precise control over the texture parameters as we generate a high variety of observed and unobserved texture instances. We publish the dual dataset used in this paper, presenting models of sandblasting, parallel, and spiral milling textures, which are commonly present on manufactured metal products. To evaluate the dataset quality, we go beyond final model performance comparison by measuring different image similarities between the real and synthetic domains. This uncovered a trend, indicating these metrics could be used to predict downstream detection performance, which can strongly impact future developments of synthetic data. Full article
(This article belongs to the Section Sensing and Imaging)
17 pages, 5039 KB  
Article
AI-Enhanced Lower Extremity X-Ray Segmentation: A Promising Tool for Sarcopenia Diagnosis
by Hyunwoo Park, Hyeonsu Kim and Junil Yoo
Healthcare 2025, 13(19), 2488; https://doi.org/10.3390/healthcare13192488 - 30 Sep 2025
Abstract
Background/Objectives: Sarcopenia, characterized by progressive loss of skeletal muscle mass and strength, significantly impacts physical function and quality of life in older adults. Traditional measurement methods like Dual-energy X-ray absorptiometry (DEXA) are often inaccessible in primary care. This study aimed to develop and validate an AI-driven auto-segmentation model for muscle mass assessment using long X-rays as a more accessible alternative to DEXA. Methods: This was a retrospective validation study using data from the Real Hip Cohort at Inha University Hospital in South Korea. 351 lower extremity X-ray images from 157 patients were collected and analyzed. AI-based semantic segmentation models, including U-Net, V-Net, and U-Net++, were trained and validated on this dataset to automatically segment muscle regions. Model performance was assessed using Intersection over Union (IoU) and Dice Similarity Coefficient (DC) metrics. The correlation between AI-derived muscle measurements and the DEXA-derived skeletal muscle index was evaluated using Pearson correlation analysis and Bland–Altman analysis. Results: The study analyzed data from 157 patients (mean age 77.1 years). The U-Net++ architecture achieved the best segmentation performance with an IoU of 0.93 and DC of 0.95. Pearson correlation demonstrated a moderate to strong positive correlation between the AI model’s muscle estimates and DEXA results (r = 0.72, *** p < 0.0001). Regression analysis showed a coefficient of 0.74, indicating good agreement with reference measurements. Conclusions: This study successfully developed and validated an AI-driven auto-segmentation model for estimating muscle mass from long X-rays. The model provides an accessible alternative to DEXA, with potential to improve sarcopenia diagnosis and management in community and primary care settings. Future work will refine the model and explore its application to additional muscle groups. Full article
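The abstract above evaluates agreement between AI-derived and DEXA-derived measurements with Pearson correlation and Bland–Altman analysis. The sketch below shows both computations on paired values. The function names and the five toy measurement pairs are assumptions for illustration only and are not data from the study.

```python
import math
import statistics


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def bland_altman(x, y):
    """Return (bias, lower limit, upper limit) using bias ± 1.96 * SD of differences."""
    diffs = [a - b for a, b in zip(x, y)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample standard deviation
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Toy paired measurements (illustrative only, arbitrary units).
ai_estimate = [6.1, 7.0, 5.4, 8.2, 6.8]
dexa_index = [6.0, 7.3, 5.1, 8.0, 7.1]
print(round(pearson_r(ai_estimate, dexa_index), 3))
print(tuple(round(v, 3) for v in bland_altman(ai_estimate, dexa_index)))
```

Correlation measures how well the two methods track each other, while the Bland–Altman limits describe how far individual paired measurements can be expected to differ, which is why the two analyses are usually reported together.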

17 pages, 4081 KB  
Article
A Novel Method to Determine the Grain Size and Structural Heterogeneity of Fine-Grained Sedimentary Rocks
by Fang Zeng, Shansi Tian, Hongli Dong, Zhentao Dong, Bo Liu and Haiyang Liu
Fractal Fract. 2025, 9(10), 642; https://doi.org/10.3390/fractalfract9100642 - 30 Sep 2025
Abstract
Fine-grained sedimentary rocks exhibit significant textural heterogeneity, often obscured by conventional grain size analysis techniques that require sample disaggregation. We propose a non-destructive, image-based grain size characterization workflow, utilizing stitched polarized thin-section photomicrographs, k-means clustering, and watershed segmentation algorithms. Validation against laser granulometry data indicates strong methodological reliability (absolute errors ranging from −5% to 3%), especially for particle sizes greater than 0.039 mm. The methodology reveals substantial internal heterogeneity within Es3 laminated shale samples from the Shahejie Formation (Bohai Bay Basin), distinctly identifying coarser siliceous laminae (grain size >0.039 mm, Φ < 8 based on Udden-Wentworth classification) indicative of high-energy depositional environments, and finer-grained clay-rich laminae (grain size <0.039 mm, Φ > 8) representing low-energy conditions. Conversely, massive mudstones exhibit comparatively homogeneous grain size distributions. Additionally, a multifractal analysis (Multifractal method) based on the S50bi/S50si ratio further quantifies spatial heterogeneity and pore-structure complexity, significantly enhancing facies differentiation and reservoir characterization capabilities. This method significantly improves facies differentiation ability, provides reliable constraints for shale oil reservoir characterization, and has important reference value for the exploration and development of the Bohai Bay Basin and similar petroliferous basins. Full article
(This article belongs to the Section Engineering)

19 pages, 12926 KB  
Article
Mapping Banana and Peach Palm in Diversified Landscapes in the Brazilian Atlantic Forest with Sentinel-2
by Victória Beatriz Soares, Taya Cristo Parreiras, Danielle Elis Garcia Furuya, Édson Luis Bolfe and Katia de Lima Nechet
Agriculture 2025, 15(19), 2052; https://doi.org/10.3390/agriculture15192052 - 30 Sep 2025
Abstract
Mapping banana and peach palm in heterogeneous landscapes remains challenging due to spatial heterogeneity, spectral similarities between crops and native vegetation, and persistent cloud cover. This study focused on the municipality of Jacupiranga, located within the Ribeira Valley region and surrounded by the Atlantic Forest, which is home to one of Brazil’s largest remaining continuous forest areas. More than 99% of Jacupiranga’s agricultural output in the 21st century came from bananas (Musa spp.) and peach palms (Bactris gasipaes), underscoring the importance of perennial crops to the local economy and traditional communities. Using a time series of vegetation indices from Sentinel-2 imagery combined with field and remote data, we used a hierarchical classification method to map where these two crops are cultivated. The Random Forest classifier fed with 10 m resolution images enabled the detection of intricate agricultural mosaics that are typical of family farming systems and improved class separability between perennial and non-perennial crops and banana and peach palm. These results show how combining geographic information systems, data analysis, and remote sensing can improve digital agriculture, rural management, and sustainable agricultural development in socio-environmentally important areas. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)

21 pages, 4397 KB  
Article
Splatting the Cat: Efficient Free-Viewpoint 3D Virtual Try-On via View-Decomposed LoRA and Gaussian Splatting
by Chong-Wei Wang, Hung-Kai Huang, Tzu-Yang Lin, Hsiao-Wei Hu and Chi-Hung Chuang
Electronics 2025, 14(19), 3884; https://doi.org/10.3390/electronics14193884 - 30 Sep 2025
Abstract
As Virtual Try-On (VTON) technology matures, 2D VTON methods based on diffusion models can now rapidly generate diverse and high-quality try-on results. However, with rising user demands for realism and immersion, many applications are shifting towards 3D VTON, which offers superior geometric and spatial consistency. Existing 3D VTON approaches commonly face challenges such as barriers to practical deployment, substantial memory requirements, and cross-view inconsistencies. To address these issues, we propose an efficient 3D VTON framework with robust multi-view consistency, whose core design is to decouple the monolithic 3D editing task into a four-stage cascade as follows: (1) We first reconstruct an initial 3D scene using 3D Gaussian Splatting, integrating the SMPL-X model at this stage as a strong geometric prior. By computing a normal-map loss and a geometric consistency loss, we ensure the structural stability of the initial human model across different views. (2) We employ the lightweight CatVTON to generate 2D try-on images that provide visual guidance for the subsequent personalized fine-tuning tasks. (3) To accurately represent garment details from all angles, we partition the 2D dataset into three subsets (front, side, and back) and train a dedicated LoRA module for each subset on a pre-trained diffusion model. This strategy effectively mitigates the issue of blurred details that can occur when a single model attempts to learn global features. (4) An iterative optimization process then uses the generated 2D VTON images and specialized LoRA modules to edit the 3DGS scene, achieving 360-degree free-viewpoint VTON results. All our experiments were conducted on a single consumer-grade GPU with 24 GB of memory, a significant reduction from the 32 GB or more typically required by previous studies under similar data and parameter settings. Our method balances quality and memory requirements, significantly lowering the adoption barrier for 3D VTON technology. Full article
(This article belongs to the Special Issue 2D/3D Industrial Visual Inspection and Intelligent Image Processing)

27 pages, 7020 KB  
Article
RPC Correction Coefficient Extrapolation for KOMPSAT-3A Imagery in Inaccessible Regions
by Namhoon Kim
Remote Sens. 2025, 17(19), 3332; https://doi.org/10.3390/rs17193332 - 29 Sep 2025
Abstract
High-resolution pushbroom satellites routinely acquire multi-tens-of-kilometers-scale strips whose vendors’ rational polynomial coefficients (RPCs) exhibit systematic, direction-dependent biases that accumulate downstream when ground control is sparse. This study presents a physically interpretable stripwise extrapolation framework that predicts along- and across-track RPC correlation coefficients for inaccessible segments from an upstream calibration subset. Terrain-independent RPCs were regenerated and residual image-space errors were modeled with weighted least squares using elapsed time, off-nadir evolution, and morphometric descriptors of the target terrain. Gaussian kernel weights favor calibration scenes with a Jarque–Bera-indexed relief similar to the target. When applied to three KOMPSAT-3A panchromatic strips, the approach preserves native scene geometry while transporting calibrated coefficients downstream, reducing positional errors in two strips to <2.8 pixels (~2.0 m at 0.710 m Ground Sample Distance, GSD). The first strip with a stronger attitude drift retains 4.589 pixel along-track errors, indicating the need for wider predictor coverage under aggressive maneuvers. The results clarify the directional error structure with a near-constant across-track bias and low-frequency along-track drift and show that a compact predictor set can stabilize extrapolation without full-block adjustment or dense tie networks. This provides a GCP-efficient alternative to full-block adjustment and enables accurate georeferencing in controlled environments. Full article
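The abstract above models residual errors with weighted least squares, where Gaussian kernel weights favor calibration scenes whose relief resembles the target. The sketch below reduces that idea to a single predictor (elapsed time) so the weighting is easy to follow. All names, the bandwidth, and the toy numbers are assumptions for illustration; the paper uses several predictors and its exact formulation is not given in the abstract.

```python
import math


def gaussian_weight(relief_index, target_index, bandwidth):
    """Gaussian kernel weight: close to 1 when the scene relief matches the target."""
    return math.exp(-((relief_index - target_index) ** 2) / (2.0 * bandwidth ** 2))


def weighted_line_fit(x, y, w):
    """Weighted least-squares fit of y = intercept + slope * x."""
    sw = sum(w)
    swx = sum(wi * xi for wi, xi in zip(w, x))
    swy = sum(wi * yi for wi, yi in zip(w, y))
    swxx = sum(wi * xi * xi for wi, xi in zip(w, x))
    swxy = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    denom = sw * swxx - swx ** 2
    if denom == 0:
        raise ValueError("predictor values are degenerate for these weights")
    slope = (sw * swxy - swx * swy) / denom
    intercept = (swy - slope * swx) / sw
    return intercept, slope


# Toy calibration scenes (illustrative only).
elapsed_s = [0.0, 5.0, 10.0, 15.0]      # elapsed time along the strip
along_track_bias = [1.0, 1.4, 2.1, 2.4]  # observed bias in pixels
relief = [0.8, 1.1, 2.5, 3.0]            # hypothetical relief index per scene
weights = [gaussian_weight(r, target_index=2.8, bandwidth=0.5) for r in relief]

intercept, slope = weighted_line_fit(elapsed_s, along_track_bias, weights)
predicted_bias = intercept + slope * 20.0  # extrapolate to a downstream scene
print(round(predicted_bias, 3))
```

With these weights the two scenes whose relief index is near 2.8 dominate the fit, so the extrapolated bias follows their trend rather than that of the flatter upstream scenes.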

14 pages, 1942 KB  
Article
Vocal Fold Disorders Classification and Optimization of a Custom Video Laryngoscopy Dataset Through Structural Similarity Index and a Deep Learning-Based Approach
by Elif Emre, Dilber Cetintas, Muhammed Yildirim and Sadettin Emre
J. Clin. Med. 2025, 14(19), 6899; https://doi.org/10.3390/jcm14196899 - 29 Sep 2025
Abstract
Background/Objectives: Video laryngoscopy is one of the primary methods used by otolaryngologists for detecting and classifying laryngeal lesions. However, the diagnostic process of these images largely relies on clinicians’ visual inspection, which can lead to overlooked small structural changes, delayed diagnosis, and interpretation errors. Methods: AI-based approaches are becoming increasingly critical for accelerating early-stage diagnosis and improving reliability. This study proposes a hybrid Convolutional Neural Network (CNN) architecture that eliminates repetitive and clinically insignificant frames from videos, utilizing only meaningful key frames. Video data from healthy individuals, patients with vocal fold nodules, and those with vocal fold polyps were summarized using three different threshold values with the Structural Similarity Index Measure (SSIM). Results: The resulting key frames were then classified using a hybrid CNN. Experimental findings demonstrate that selecting an appropriate threshold can significantly reduce the model’s memory usage and processing load while maintaining accuracy. In particular, a threshold value of 0.90 provided richer information content thanks to the selection of a wider variety of frames, resulting in the highest success rate. Fine-tuning the last 20 layers of the MobileNetV2 and Xception backbones, combined with the fusion of extracted features, yielded an overall classification accuracy of 98%. Conclusions: The proposed approach provides a mechanism that eliminates unnecessary data and prioritizes only critical information in video-based diagnostic processes, thus helping physicians accelerate diagnostic decisions and reduce memory requirements. Full article
(This article belongs to the Special Issue Artificial Intelligence and Deep Learning in Medical Imaging)
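SSIM-based frame summarization of the kind described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the paper's pipeline: it uses a single-window (global) SSIM rather than the usual sliding-window mean, treats each frame as a flat list of grayscale intensities, and assumes a frame is dropped when its SSIM against the last kept frame reaches the threshold. The function names are hypothetical.

```python
def global_ssim(a, b, data_range=255.0):
    """Single-window SSIM between two equal-length grayscale pixel lists."""
    if len(a) != len(b) or not a:
        raise ValueError("frames must be non-empty and the same size")
    n = len(a)
    c1 = (0.01 * data_range) ** 2  # standard SSIM stabilizing constants
    c2 = (0.03 * data_range) ** 2
    mu_a = sum(a) / n
    mu_b = sum(b) / n
    var_a = sum((x - mu_a) ** 2 for x in a) / n
    var_b = sum((y - mu_b) ** 2 for y in b) / n
    cov = sum((x - mu_a) * (y - mu_b) for x, y in zip(a, b)) / n
    numerator = (2 * mu_a * mu_b + c1) * (2 * cov + c2)
    denominator = (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2)
    return numerator / denominator

def select_key_frames(frames, threshold=0.90):
    """Return indices of frames to keep. A frame is kept only if it is less
    similar than `threshold` to the most recently kept frame (assumed rule)."""
    kept = []
    for i, frame in enumerate(frames):
        if not kept or global_ssim(frames[kept[-1]], frame) < threshold:
            kept.append(i)
    return kept

# Tiny 2x2 "frames" flattened to lists; the second repeats the first.
frame_a = [10, 20, 30, 40]
frame_b = [10, 20, 30, 40]
frame_c = [200, 50, 120, 5]
key_frames = select_key_frames([frame_a, frame_b, frame_c], threshold=0.90)
```

Under this rule, lowering the threshold discards more frames and raising it retains more, so the threshold trades memory and processing load against frame variety.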

15 pages, 20890 KB  
Article
Development of an XAI-Enhanced Deep-Learning Algorithm for Automated Decision-Making on Shoulder-Joint X-Ray Retaking
by Konatsu Sekiura, Takaaki Yoshimura and Hiroyuki Sugimori
Appl. Sci. 2025, 15(19), 10534; https://doi.org/10.3390/app151910534 - 29 Sep 2025
Abstract
Purpose: To develop and validate a two-stage system for automated quality assessment of shoulder true-AP radiographs by combining joint localization with quality classification. Materials and Methods: From the MURA “SHOULDER” subset, 2956 anteroposterior images were identified; 59 images with negative–positive inversion, excessive metallic implants, extreme exposure, or presumed fluoroscopy were excluded, yielding a class-balanced set of 2800 images (1400 OK/1400 NG). A YOLOX-based detector localized the glenohumeral joint, and classifiers operated on both whole images and detector-centered crops. To enhance interpretability, we integrated Grad-CAM into both whole-image and local classifiers and assessed attention patterns against radiographic criteria. Results: The detector achieved AP@0.5 = 1.00 and a mean Dice similarity coefficient of 0.967. The classifier attained AUC = 0.977 (F1 = 0.943) on a held-out test set. Heat map analyses indicated anatomically focused attention consistent with expert-defined regions, and coverage metrics favored local over whole-image models. Conclusions: The two-stage, XAI-integrated approach provides accurate and interpretable assessment of shoulder true-AP image quality, aligning model attention with radiographic criteria. Full article
(This article belongs to the Special Issue Advances in Medical Imaging: Techniques and Applications)
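The Dice similarity coefficient and F1 score reported in this abstract share the same formula, 2·overlap / (size A + size B). The sketch below shows both; it is a generic illustration, and the `box_pixels` helper, the box coordinates, and the counts are hypothetical rather than taken from the paper.

```python
def box_pixels(x0, y0, x1, y1):
    """Set of integer pixel coordinates inside a half-open box [x0,x1) x [y0,y1)."""
    return {(x, y) for x in range(x0, x1) for y in range(y0, y1)}

def dice_coefficient(pred, truth):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two sets of pixel coordinates."""
    pred, truth = set(pred), set(truth)
    if not pred and not truth:
        return 1.0  # two empty regions agree perfectly by convention
    return 2 * len(pred & truth) / (len(pred) + len(truth))

def f1_score(tp, fp, fn):
    """F1 = 2TP / (2TP + FP + FN); returns 0.0 when there are no positives."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

# Two 4x4 boxes that overlap in a 2x4 region (8 of 16 pixels each).
predicted = box_pixels(0, 0, 4, 4)
reference = box_pixels(2, 0, 6, 4)
dice = dice_coefficient(predicted, reference)
f1 = f1_score(tp=8, fp=8, fn=8)
```

Both values come out equal here because the overlap, the predicted-only pixels, and the reference-only pixels play the roles of true positives, false positives, and false negatives.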