J. Imaging, Volume 11, Issue 2 (February 2025) – 16 articles

  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list.
  • You may sign up for e-mail alerts to receive the table of contents of newly released issues.
  • PDF is the official format for papers published in both HTML and PDF forms. To view a paper in PDF format, click on the "PDF Full-text" link and open it with the free Adobe Reader.
14 pages, 3344 KiB  
Article
Robot-Based Procedure for 3D Reconstruction of Abdominal Organs Using the Iterative Closest Point and Pose Graph Algorithms
by Birthe Göbel, Jonas Huurdeman, Alexander Reiterer and Knut Möller
J. Imaging 2025, 11(2), 44; https://doi.org/10.3390/jimaging11020044 - 5 Feb 2025
Viewed by 225
Abstract
Image-based 3D reconstruction enables robot-assisted interventions and image-guided navigation, which are emerging technologies in laparoscopy. When a robotic arm guides a laparoscope for image acquisition, hand–eye calibration is required to determine the transformation between the camera and the robot flange. The calibration procedure is complex and must be conducted after each intervention (when the laparoscope is dismounted for cleaning). In the field, the surgeons and their assistants cannot be expected to do so. Thus, our approach is a procedure for robot-based multi-view 3D reconstruction without hand–eye calibration, relying on pose optimization algorithms instead. In this work, a robotic arm and a stereo laparoscope form the experimental setup. The procedure includes the stereo matching algorithm Semi Global Matching from OpenCV for depth measurement and the multiscale color iterative closest point algorithm from Open3D (v0.19), along with the multiway registration algorithm using a pose graph from Open3D (v0.19) for pose optimization. The procedure is evaluated quantitatively and qualitatively on ex vivo organs. The results are a low root mean squared error (1.1–3.37 mm) and dense point clouds. The proposed procedure leads to a plausible 3D model, and there is no need for complex hand–eye calibration, as this step can be compensated for by pose optimization algorithms. Full article
(This article belongs to the Special Issue Geometry Reconstruction from Images (2nd Edition))
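As an illustration of the pose-graph step described above, the minimal sketch below follows the generic Open3D (v0.19) multiway-registration pipeline: consecutive point clouds are aligned pairwise with ICP, the results are accumulated in a pose graph, and a global optimization refines all poses before merging. The `clouds` list, the distance parameter, and the use of point-to-point ICP are illustrative assumptions, not the authors' exact configuration.

import copy
import numpy as np
import open3d as o3d

def register_multiway(clouds, max_dist=5.0):
    # Pairwise ICP between consecutive clouds, accumulated into a pose graph.
    reg = o3d.pipelines.registration
    pose_graph = reg.PoseGraph()
    odometry = np.identity(4)
    pose_graph.nodes.append(reg.PoseGraphNode(odometry))
    for i in range(len(clouds) - 1):
        icp = reg.registration_icp(clouds[i], clouds[i + 1], max_dist, np.identity(4),
                                   reg.TransformationEstimationPointToPoint())
        info = reg.get_information_matrix_from_point_clouds(
            clouds[i], clouds[i + 1], max_dist, icp.transformation)
        odometry = icp.transformation @ odometry
        pose_graph.nodes.append(reg.PoseGraphNode(np.linalg.inv(odometry)))
        pose_graph.edges.append(reg.PoseGraphEdge(i, i + 1, icp.transformation,
                                                  info, uncertain=False))
    # Global optimization distributes the accumulated drift over all edges.
    reg.global_optimization(pose_graph,
                            reg.GlobalOptimizationLevenbergMarquardt(),
                            reg.GlobalOptimizationConvergenceCriteria(),
                            reg.GlobalOptimizationOption(
                                max_correspondence_distance=max_dist, reference_node=0))
    merged = o3d.geometry.PointCloud()
    for cloud, node in zip(clouds, pose_graph.nodes):
        merged += copy.deepcopy(cloud).transform(node.pose)
    return merged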
16 pages, 4076 KiB  
Article
Imaging and Image Processing Techniques for High-Resolution Visualization of Connective Tissue with MRI: Application to Fascia, Aponeurosis, and Tendon
by Meeghage Randika Perera, Graeme M. Bydder, Samantha J. Holdsworth and Geoffrey G. Handsfield
J. Imaging 2025, 11(2), 43; https://doi.org/10.3390/jimaging11020043 - 4 Feb 2025
Viewed by 388
Abstract
Recent interest in musculoskeletal connective tissues like tendons, aponeurosis, and deep fascia has led to a greater focus on in vivo medical imaging, particularly MRI. Given the rapid T2* decay of collagenous tissues, advanced ultra-short echo time (UTE) MRI sequences have proven useful in generating high-signal images of these tissues. To further these advances, we discuss the integration of UTE with Diffusion Tensor Imaging (DTI) and explore image processing techniques to enhance the localization, labeling, and modeling of connective tissues. These techniques are especially valuable for extracting features from thin tissues that may be difficult to distinguish. We present data from lower leg scans of 30 healthy subjects using a non-Cartesian MRI sequence to acquire axial 2D images to segment skeletal muscle and connective tissue. DTI helped differentiate aponeurosis from deep fascia by analyzing muscle fiber orientations. The dual-echo imaging methods yielded high-resolution images of deep fascia, with in-plane spatial resolutions ranging from 0.3 × 0.3 mm to 0.5 × 0.5 mm at a slice thickness of 3–5 mm. Techniques such as K-Means clustering, FFT edge detection, and region-specific scaling were most effective in enhancing images of deep fascia, aponeurosis, and tendon to enable high-fidelity modeling of these tissues. Full article
(This article belongs to the Special Issue Progress and Challenges in Biomedical Image Analysis)
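Two of the post-processing techniques named in the abstract, K-Means clustering and FFT edge detection, can be sketched roughly as below. `img` is a hypothetical 2D NumPy array holding one axial slice; the cluster count and frequency cutoff are illustrative assumptions rather than the authors' settings.

import numpy as np
from sklearn.cluster import KMeans

def kmeans_labels(img, n_clusters=3):
    # Cluster pixel intensities so thin bright connective tissue separates from muscle.
    flat = img.reshape(-1, 1).astype(np.float32)
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(flat)
    return km.labels_.reshape(img.shape)

def fft_edges(img, cutoff=0.05):
    # High-pass filter in the Fourier domain; suppressing low frequencies leaves edges.
    f = np.fft.fftshift(np.fft.fft2(img))
    rows, cols = img.shape
    y, x = np.ogrid[:rows, :cols]
    r = np.hypot(y - rows / 2, x - cols / 2)
    f[r < cutoff * min(rows, cols)] = 0
    return np.abs(np.fft.ifft2(np.fft.ifftshift(f)))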
13 pages, 1569 KiB  
Article
Dual-Model Synergy for Fingerprint Spoof Detection Using VGG16 and ResNet50
by Mohamed Cheniti, Zahid Akhtar and Praveen Kumar Chandaliya
J. Imaging 2025, 11(2), 42; https://doi.org/10.3390/jimaging11020042 - 4 Feb 2025
Viewed by 372
Abstract
In this paper, we address the challenge of fingerprint liveness detection by proposing a dual pre-trained model approach that combines VGG16 and ResNet50 architectures. While existing methods often rely on a single feature extraction model, they may struggle with generalization across diverse spoofing materials and sensor types. To overcome this limitation, our approach leverages the high-resolution feature extraction of VGG16 and the deep layer architecture of ResNet50 to capture a more comprehensive range of features for improved spoof detection. The proposed approach integrates these two models by concatenating their extracted features, which are then used to classify the captured fingerprint as live or spoofed. Evaluated on the Livedet2013 and Livedet2015 datasets, our method achieves state-of-the-art performance, with an accuracy of 99.72% on Livedet2013, surpassing existing methods like the Gram model (98.95%) and Pre-trained CNN (98.45%). On Livedet2015, our method achieves an average accuracy of 96.32%, outperforming several state-of-the-art models, including CNN (95.27%) and LivDet 2015 (95.39%). Error rate analysis reveals consistently low Bonafide Presentation Classification Error Rate (BPCER) scores with 0.28% on LivDet 2013 and 1.45% on LivDet 2015. Similarly, the Attack Presentation Classification Error Rate (APCER) remains low at 0.35% on LivDet 2013 and 3.68% on LivDet 2015. However, higher APCER values are observed for unknown spoof materials, particularly in the Crossmatch subset of Livedet2015, where the APCER rises to 8.12%. These findings highlight the robustness and adaptability of our simple dual-model framework while identifying areas for further optimization in handling unseen spoof materials. Full article
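A minimal sketch of the dual-backbone idea follows: global-average-pooled features from pretrained VGG16 and ResNet50 are concatenated and passed to a small live/spoof head. The 224 × 224 input size, the head layout, and the omission of per-backbone preprocessing are assumptions for illustration, not the authors' exact model.

import tensorflow as tf
from tensorflow.keras import layers, Model
from tensorflow.keras.applications import VGG16, ResNet50

inp = layers.Input(shape=(224, 224, 3))
# Each backbone's own preprocess_input step is omitted here for brevity.
vgg = VGG16(include_top=False, weights="imagenet", pooling="avg")(inp)
res = ResNet50(include_top=False, weights="imagenet", pooling="avg")(inp)
features = layers.Concatenate()([vgg, res])      # 512 + 2048 fused descriptor
x = layers.Dense(256, activation="relu")(features)
out = layers.Dense(1, activation="sigmoid")(x)   # 1 = live, 0 = spoof
model = Model(inp, out)
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])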
13 pages, 1645 KiB  
Technical Note
Pano-GAN: A Deep Generative Model for Panoramic Dental Radiographs
by Søren Pedersen, Sanyam Jain, Mikkel Chavez, Viktor Ladehoff, Bruna Neves de Freitas and Ruben Pauwels
J. Imaging 2025, 11(2), 41; https://doi.org/10.3390/jimaging11020041 - 2 Feb 2025
Viewed by 263
Abstract
This paper presents the development of a generative adversarial network (GAN) for the generation of synthetic dental panoramic radiographs. While this is an exploratory study, the ultimate aim is to address the scarcity of data in dental research and education. A deep convolutional GAN (DCGAN) with the Wasserstein loss and a gradient penalty (WGAN-GP) was trained on a dataset of 2322 radiographs of varying quality. The focus of this study was on the dentoalveolar part of the radiographs; other structures were cropped out. Significant data cleaning and preprocessing were conducted to standardize the input formats while maintaining anatomical variability. Four candidate models were identified by varying the critic iterations, number of features and the use of denoising prior to training. To assess the quality of the generated images, a clinical expert evaluated a set of generated synthetic radiographs using a ranking system based on visibility and realism, with scores ranging from 1 (very poor) to 5 (excellent). It was found that most generated radiographs showed moderate depictions of dentoalveolar anatomical structures, although they were considerably impaired by artifacts. The mean evaluation scores showed a trade-off between the model trained on non-denoised data, which showed the highest subjective quality for finer structures, such as the mandibular canal and trabecular bone, and one of the models trained on denoised data, which offered better overall image quality, especially in terms of clarity and sharpness and overall realism. These outcomes serve as a foundation for further research into GAN architectures for dental imaging applications. Full article
(This article belongs to the Special Issue Tools and Techniques for Improving Radiological Imaging Applications)
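The gradient-penalty term that distinguishes WGAN-GP training from a plain DCGAN can be sketched as below; the critic network itself and the radiograph data loader are assumed to exist elsewhere.

import torch

def gradient_penalty(critic, real, fake, device="cpu"):
    # Interpolate between real and generated radiographs, then penalize the critic
    # wherever the gradient norm of its output w.r.t. the interpolation deviates from 1.
    eps = torch.rand(real.size(0), 1, 1, 1, device=device)
    mixed = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(mixed)
    grads = torch.autograd.grad(outputs=scores, inputs=mixed,
                                grad_outputs=torch.ones_like(scores),
                                create_graph=True)[0]
    grad_norm = grads.view(grads.size(0), -1).norm(2, dim=1)
    return ((grad_norm - 1) ** 2).mean()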
14 pages, 2761 KiB  
Article
Validation of Novel Image Processing Method for Objective Quantification of Intra-Articular Bleeding During Arthroscopic Procedures
by Olgar Birsel, Umut Zengin, Ilker Eren, Ali Ersen, Beren Semiz and Mehmet Demirhan
J. Imaging 2025, 11(2), 40; https://doi.org/10.3390/jimaging11020040 - 31 Jan 2025
Viewed by 415
Abstract
Visual clarity is crucial for shoulder arthroscopy, directly influencing surgical precision and outcomes. Despite advances in imaging technology, intraoperative bleeding remains a significant obstacle to optimal visibility, with subjective evaluation methods lacking consistency and standardization. This study proposes a novel image processing system to objectively quantify bleeding and assess surgical effectiveness. The system uses color recognition algorithms to calculate a bleeding score based on pixel ratios by incorporating multiple color spaces to enhance accuracy and minimize errors. Moreover, 200 three-second video clips from prior arthroscopic rotator cuff repairs were evaluated by three senior surgeons trained on the system’s color metrics and scoring process. Assessments were repeated two weeks later to test intraobserver reliability. The system’s scores were compared to the average score given by the surgeons. The average surgeon-assigned score was 5.10 (range: 1–9.66), while the system scored videos from 1 to 9.46, with an average of 5.08. The mean absolute error between system and surgeon scores was 0.56, with a standard deviation of 0.50, achieving agreement ranging from [0.96,0.98] with 96.7% confidence (ICC = 0.967). This system provides a standardized method to evaluate intraoperative bleeding, enabling the precise detection of blood variations and supporting advanced technologies like autonomous arthropumps to enhance arthroscopy and surgical outcomes. Full article
(This article belongs to the Section Medical Imaging)
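A rough sketch of a pixel-ratio bleeding score in the spirit of the abstract is shown below: blood-like pixels are masked in two color spaces (HSV and CIELAB) and their fraction is mapped onto a 1–10-style scale. The thresholds and the scale mapping are illustrative assumptions, not the validated scoring system.

import cv2
import numpy as np

def bleeding_score(frame_bgr):
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    lab = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2LAB)
    # Red hue wraps around 0 in HSV, so combine two hue bands.
    red1 = cv2.inRange(hsv, (0, 80, 40), (10, 255, 255))
    red2 = cv2.inRange(hsv, (170, 80, 40), (180, 255, 255))
    # High a* in CIELAB also indicates red; requiring both spaces reduces false positives.
    red_lab = cv2.inRange(lab, (0, 150, 0), (255, 255, 255))
    mask = cv2.bitwise_and(cv2.bitwise_or(red1, red2), red_lab)
    ratio = mask.mean() / 255.0            # fraction of blood-like pixels
    return 1.0 + 9.0 * ratio               # rough mapping onto a 1-10 scale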
20 pages, 2884 KiB  
Article
Dimensional Accuracy Assessment of Medical Anatomical Models Produced by Hospital-Based Fused Deposition Modeling 3D Printer
by Kevin Wendo, Catherine Behets, Olivier Barbier, Benoit Herman, Thomas Schubert, Benoit Raucent and Raphael Olszewski
J. Imaging 2025, 11(2), 39; https://doi.org/10.3390/jimaging11020039 - 30 Jan 2025
Viewed by 539
Abstract
As 3D printing technology expands rapidly in medical disciplines, the accuracy evaluation of 3D-printed medical models is required. However, no established guidelines to assess the dimensional error of anatomical models exist. This study aims to evaluate the dimensional accuracy of medical models 3D-printed using a hospital-based Fused Deposition Modeling (FDM) 3D printer. Two dissected cadaveric right hands were marked with titanium Kirschner wires to identify landmarks on the heads and bases of all metacarpals and proximal and middle phalanges. Both hands were scanned using a Cone Beam Computed Tomography scanner. Image post-processing and segmentation were performed in 3D Slicer software. Hand models were 3D-printed using a professional hospital-based FDM 3D printer. Manual measurements of all landmarks marked on both pairs of cadaveric and 3D-printed hands were taken by two independent observers using a digital caliper. The Mean Absolute Difference (MAD) and Mean Dimensional Error (MDE) were calculated. Our results showed an acceptable level of dimensional accuracy. The overall study's MAD was 0.32 mm (±0.34), and its MDE was 1.03% (±0.83). These values fall within the recommended range of errors. A high level of dimensional accuracy of the 3D-printed anatomical models was achieved, suggesting their reliability and suitability for medical applications. Full article
(This article belongs to the Section Medical Imaging)
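The two summary metrics reported, MAD and MDE, can be computed from paired caliper readings as in the sketch below; the exact MDE normalization used by the authors is an assumption here (absolute difference divided by the cadaveric reference distance, expressed as a percentage).

import numpy as np

def mad_mde(reference_mm, printed_mm):
    ref = np.asarray(reference_mm, dtype=float)
    prt = np.asarray(printed_mm, dtype=float)
    diff = np.abs(prt - ref)
    mad = diff.mean()                      # mean absolute difference in mm
    mde = (diff / ref).mean() * 100.0      # mean dimensional error in percent
    return mad, mde

# Example with hypothetical readings: mad_mde([25.4, 31.2, 18.9], [25.1, 31.6, 19.2])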
19 pages, 1172 KiB  
Review
Machine Learning-Based Approaches for Breast Density Estimation from Mammograms: A Comprehensive Review
by Khaldoon Alhusari and Salam Dhou
J. Imaging 2025, 11(2), 38; https://doi.org/10.3390/jimaging11020038 - 26 Jan 2025
Viewed by 565
Abstract
Breast cancer, as of 2022, is the most prevalent type of cancer in women. Breast density—a measure of the non-fatty tissue in the breast—is a strong risk factor for breast cancer that can be estimated from mammograms. The importance of studying breast density is twofold. First, high breast density can be a factor in lowering mammogram sensitivity, as dense tissue can mask tumors. Second, higher breast density is associated with an increased risk of breast cancer, making accurate assessments vital. This paper presents a comprehensive review of the mammographic density estimation literature, with an emphasis on machine-learning-based approaches. The approaches reviewed can be classified as visual, software-, machine learning-, and segmentation-based. Machine learning methods can be further broken down into two categories: traditional machine learning and deep learning approaches. The most commonly utilized models are support vector machines (SVMs) and convolutional neural networks (CNNs), with classification accuracies ranging from 76.70% to 98.75%. Major limitations of the current works include subjectivity and cost-inefficiency. Future work can focus on addressing these limitations, potentially through the use of unsupervised segmentation and state-of-the-art deep learning models such as transformers. By addressing the current limitations, future research can pave the way for more reliable breast density estimation methods, ultimately improving early detection and diagnosis. Full article
(This article belongs to the Section Medical Imaging)
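As a toy illustration of the traditional machine-learning branch the review surveys, the sketch below cross-validates an SVM on made-up handcrafted mammographic features; the features and labels are placeholders, not data from any of the reviewed studies.

import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.random((200, 3))          # e.g. dense-pixel fraction, mean intensity, texture energy
y = rng.integers(0, 4, size=200)  # stand-in for BI-RADS A-D density labels
clf = SVC(kernel="rbf", C=1.0)
print(cross_val_score(clf, X, y, cv=5).mean())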
17 pages, 5511 KiB  
Article
Semantic-Guided Transformer Network for Crop Classification in Hyperspectral Images
by Weiqiang Pi, Tao Zhang, Rongyang Wang, Guowei Ma, Yong Wang and Jianmin Du
J. Imaging 2025, 11(2), 37; https://doi.org/10.3390/jimaging11020037 - 26 Jan 2025
Viewed by 383
Abstract
The hyperspectral remote sensing images of agricultural crops contain rich spectral information, which can provide important details about crop growth status, diseases, and pests. However, existing crop classification methods face several key limitations when processing hyperspectral remote sensing images. First, the images contain complex backgrounds: various background elements may have spectral characteristics similar to the crops, and this spectral similarity makes the classification model susceptible to background interference, thus reducing classification accuracy. Second, the differences in crop scales increase the difficulty of feature extraction. In different image regions, the scale of crops can vary significantly, and traditional classification methods often struggle to effectively capture this information. Additionally, due to the limitations of spectral information, especially against backgrounds with multi-scale variation, the extraction of crop information becomes even more challenging, leading to instability in the classification results. To address these issues, a semantic-guided transformer network (SGTN) is proposed, which aims to overcome the limitations of existing deep learning methods and improve crop classification accuracy and robustness. First, a multi-scale spatial–spectral information extraction (MSIE) module is designed that effectively handles the variations of crops at different scales in the image, thereby extracting richer and more accurate features and reducing the impact of scale changes. Second, a semantic-guided attention (SGA) module is proposed, which enhances the model's sensitivity to crop semantic information, further reducing background interference and improving the accuracy of crop area recognition. By combining the MSIE and SGA modules, the SGTN can focus on the semantic features of crops at multiple scales, thus generating more accurate classification results. Finally, a two-stage feature extraction structure is employed to further optimize the extraction of crop semantic features and enhance classification accuracy. The results show that on the Indian Pines, Pavia University, and Salinas benchmark datasets, the overall accuracies of the proposed model are 98.24%, 98.34%, and 97.89%, respectively. Compared with other methods, the model achieves better classification accuracy and generalization performance. In the future, the SGTN is expected to be applied to more agricultural remote sensing tasks, such as crop disease detection and yield prediction, providing more reliable technical support for precision agriculture and agricultural monitoring. Full article
(This article belongs to the Section Color, Multi-spectral, and Hyperspectral Imaging)
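The sketch below is a generic transformer classifier over hyperspectral pixel patches, intended only to illustrate the kind of token-based modeling involved; it does not reproduce the paper's MSIE or SGA modules, and all dimensions are assumptions.

import torch
import torch.nn as nn

class TinySpectralTransformer(nn.Module):
    def __init__(self, n_bands=200, patch=5, n_classes=16, dim=64):
        super().__init__()
        self.embed = nn.Linear(patch * patch, dim)        # one token per spectral band
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(dim, n_classes)

    def forward(self, x):                                  # x: (B, bands, patch, patch)
        tokens = self.embed(x.flatten(2))                  # (B, bands, dim)
        encoded = self.encoder(tokens).mean(dim=1)         # average over band tokens
        return self.head(encoded)

# logits = TinySpectralTransformer()(torch.randn(2, 200, 5, 5))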
18 pages, 18455 KiB  
Article
iForal: Automated Handwritten Text Transcription for Historical Medieval Manuscripts
by Alexandre Matos, Pedro Almeida, Paulo L. Correia and Osvaldo Pacheco
J. Imaging 2025, 11(2), 36; https://doi.org/10.3390/jimaging11020036 - 25 Jan 2025
Viewed by 406
Abstract
The transcription of historical manuscripts aims at making our cultural heritage more accessible to experts and also to the larger public, but it is a challenging and time-intensive task. This paper contributes an automated solution for text layout recognition, segmentation, and recognition to speed up the transcription process of historical manuscripts. The focus is on transcribing Portuguese municipal documents from the Middle Ages in the context of the iForal project, including the contribution of an annotated dataset containing Portuguese medieval documents, notably a corpus of 67 Portuguese royal charters. The proposed system can accurately identify document layouts, isolate the text, segment, and transcribe it. The layout recognition model achieved a mAP of 0.98 and a precision of 0.98, while the text segmentation model achieved a mAP of 0.91, detecting 95% of the lines. The text recognition model achieved an 8.1% character error rate (CER) and a 25.5% word error rate (WER) on the test set. These results can then be validated by palaeographers with less effort, contributing to achieving high-quality transcriptions faster. Moreover, the automatic models developed can be utilized as a basis for the creation of models that perform well for other historical handwriting styles, notably using transfer learning techniques. The contributed dataset has been made available on the HTR United catalogue, which includes training datasets to be used for automatic transcription or segmentation models. The models developed can be used, for instance, on the eScriptorium platform, which is used by a vast community of experts. Full article
(This article belongs to the Section Document Analysis and Processing)
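The two transcription metrics quoted above, CER and WER, are edit distances normalized by reference length; a self-contained sketch is given below.

def edit_distance(ref, hyp):
    # Classic Levenshtein distance over characters or word tokens.
    d = [[i + j if i * j == 0 else 0 for j in range(len(hyp) + 1)] for i in range(len(ref) + 1)]
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1,
                          d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1]))
    return d[len(ref)][len(hyp)]

def cer(ref, hyp):
    return edit_distance(list(ref), list(hyp)) / max(len(ref), 1)

def wer(ref, hyp):
    return edit_distance(ref.split(), hyp.split()) / max(len(ref.split()), 1)

# Example: cer("carta regia", "carta regla") -> ~0.09, wer("carta regia", "carta regla") -> 0.5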
28 pages, 49034 KiB  
Article
Revealing Gender Bias from Prompt to Image in Stable Diffusion
by Yankun Wu, Yuta Nakashima and Noa Garcia
J. Imaging 2025, 11(2), 35; https://doi.org/10.3390/jimaging11020035 - 24 Jan 2025
Viewed by 478
Abstract
Social biases in generative models have gained increasing attention. This paper proposes an automatic evaluation protocol for text-to-image generation, examining how gender bias originates and perpetuates in the generation process of Stable Diffusion. Using triplet prompts that vary by gender indicators, we trace representations at several stages of the generation process and explore dependencies between prompts and images. Our findings reveal that the bias persists throughout all internal stages of the generation process and manifests in the entire image. For instance, differences in object presence, such as different instruments and outfit preferences, are observed across genders and extend to overall image layouts. Moreover, our experiments demonstrate that neutral prompts tend to produce images more closely aligned with those from masculine prompts than with their feminine counterparts. We also investigate prompt-image dependencies to further understand how bias is embedded in the generated content. Finally, we offer recommendations for developers and users to mitigate this effect in text-to-image generation. Full article
(This article belongs to the Section AI in Imaging)
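A minimal sketch of the triplet-prompt setup is shown below using the Hugging Face diffusers API: the same prompt template is rendered with neutral, feminine, and masculine indicators under a fixed seed so the outputs can be compared. The model id, template, and seed handling are illustrative assumptions, not the paper's exact protocol.

import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16).to("cuda")

template = "a photo of a {} playing an instrument"
subjects = {"neutral": "person", "feminine": "woman", "masculine": "man"}

for name, subject in subjects.items():
    # Re-seed per prompt so every member of the triplet starts from the same noise.
    generator = torch.Generator("cuda").manual_seed(0)
    image = pipe(template.format(subject), generator=generator).images[0]
    image.save(f"{name}.png")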
30 pages, 3389 KiB  
Article
GCNet: A Deep Learning Framework for Enhanced Grape Cluster Segmentation and Yield Estimation Incorporating Occluded Grape Detection with a Correction Factor for Indoor Experimentation
by Rubi Quiñones, Syeda Mariah Banu and Eren Gultepe
J. Imaging 2025, 11(2), 34; https://doi.org/10.3390/jimaging11020034 - 24 Jan 2025
Viewed by 591
Abstract
Object segmentation algorithms have heavily relied on deep learning techniques to estimate the count of grapes, which is a strong indicator of grape yield. The issue with using object segmentation algorithms for grape analytics is that they are limited to counting only the visible grapes, thus omitting hidden grapes, which affects the true estimate of grape yield. Many grapes are occluded because of either the compactness of the grape bunch cluster or canopy interference. This introduces the need for models to be able to estimate the unseen berries to give a more accurate estimate of the grape yield by improving grape cluster segmentation. We propose the Grape Counting Network (GCNet), a novel framework for grape cluster segmentation, integrating deep learning techniques with correction factors to address challenges in indoor yield estimation. GCNet incorporates occlusion adjustments, enhancing segmentation accuracy even under conditions of foliage and cluster compactness, and setting new standards in agricultural indoor imaging analysis. This approach improves yield estimation accuracy, achieving an R² of 0.96 and reducing mean absolute error (MAE) by 10% compared to previous methods. We also propose a new dataset called GrapeSet, which contains visible imagery of grape clusters imaged indoors, along with their ground truth masks, total grape count, and weight in grams. The proposed framework aims to encourage future research in determining which features of grapes can be leveraged to estimate the correct grape yield count, equip grape harvesters with the knowledge of early yield estimation, and produce accurate results in object segmentation algorithms for grape analytics. Full article
(This article belongs to the Special Issue Deep Learning in Image Analysis: Progress and Challenges)
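The correction-factor idea and the two reported metrics can be sketched as below: the visible-berry count from segmentation is scaled by an occlusion factor before R² and MAE are computed against ground-truth counts. The factor value and the data are purely illustrative, not values from the paper.

import numpy as np

def corrected_yield(visible_counts, correction_factor=1.25):
    # Occluded berries are unobserved, so inflate the visible count by a learned factor.
    return np.asarray(visible_counts, dtype=float) * correction_factor

def r2_and_mae(true_counts, pred_counts):
    t, p = np.asarray(true_counts, float), np.asarray(pred_counts, float)
    ss_res = np.sum((t - p) ** 2)
    ss_tot = np.sum((t - t.mean()) ** 2)
    return 1 - ss_res / ss_tot, np.abs(t - p).mean()

# Example: r2, mae = r2_and_mae([120, 95, 150], corrected_yield([98, 80, 118]))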
18 pages, 1716 KiB  
Article
Investigating the Potential of Latent Space for the Classification of Paint Defects
by Doaa Almhaithawi, Alessandro Bellini, Georgios C. Chasparis and Tania Cerquitelli
J. Imaging 2025, 11(2), 33; https://doi.org/10.3390/jimaging11020033 - 24 Jan 2025
Viewed by 497
Abstract
Defect detection methods have greatly assisted human operators in various fields, from textiles to surfaces and mechanical components, by facilitating decision-making processes and reducing visual fatigue. This area of research is widely recognized as a cross-industry concern, particularly in the manufacturing sector. Nevertheless, each specific application brings unique challenges that require tailored solutions. This paper presents a novel framework for leveraging latent space representations in defect detection tasks, focusing on improving explainability while maintaining accuracy. This work delves into how latent spaces can be utilized by integrating unsupervised and supervised analyses. We propose a hybrid methodology that not only identifies known defects but also provides a mechanism for detecting anomalies and dynamically adapting to new defect types. This dual approach supports human operators, reducing manual workload and enhancing interpretability. Full article
(This article belongs to the Section AI in Imaging)
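One way to read the hybrid latent-space idea is sketched below: a PCA latent stands in for the learned representation, known defect classes get a nearest-centroid classifier, and a large reconstruction error flags samples as anomalies (potential new defect types). This is an interpretation under stated assumptions, not the paper's framework.

import numpy as np
from sklearn.decomposition import PCA

def fit_latent(features, labels, n_components=8):
    labels = np.asarray(labels)
    pca = PCA(n_components=n_components).fit(features)
    z = pca.transform(features)
    centroids = {c: z[labels == c].mean(axis=0) for c in np.unique(labels)}
    return pca, centroids

def classify(pca, centroids, x, anomaly_threshold=3.0):
    z = pca.transform(x.reshape(1, -1))[0]
    recon_err = np.linalg.norm(x - pca.inverse_transform(z.reshape(1, -1))[0])
    if recon_err > anomaly_threshold:       # far from the training manifold: new defect type
        return "anomaly"
    return min(centroids, key=lambda c: np.linalg.norm(z - centroids[c]))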
23 pages, 5040 KiB  
Article
Optimizing Deep Learning Models for Climate-Related Natural Disaster Detection from UAV Images and Remote Sensing Data
by Kim VanExel, Samendra Sherchan and Siyan Liu
J. Imaging 2025, 11(2), 32; https://doi.org/10.3390/jimaging11020032 - 24 Jan 2025
Viewed by 542
Abstract
This research study utilized artificial intelligence (AI) to detect natural disasters from aerial images. Flooding and desertification were the two natural disasters taken into consideration. The Climate Change Dataset was created by compiling various open-access data sources. This dataset contains 6334 aerial images from UAV (unmanned aerial vehicle) and satellite imagery. The Climate Change Dataset was then used to train Deep Learning (DL) models to identify natural disasters. Four different Machine Learning (ML) models were used: a convolutional neural network (CNN), DenseNet201, VGG16, and ResNet50. These ML models were trained on our Climate Change Dataset so that their performance could be compared. DenseNet201 was chosen for optimization. All four ML models performed well. DenseNet201 and ResNet50 achieved the highest testing accuracies of 99.37% and 99.21%, respectively. This research project demonstrates the potential of AI to address environmental challenges, such as climate change-related natural disasters. The novelty of this study lies in creating a new dataset, optimizing an ML model, performing cross-validation, and including desertification as one of the natural disasters targeted for DL detection. Three categories were used (Flooded, Desert, Neither). Our study relates to AI for Climate Change and Environmental Sustainability. Drone emergency response would be a practical application of our research project. Full article
(This article belongs to the Section AI in Imaging)
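A minimal transfer-learning sketch for the three-way Flooded / Desert / Neither task is given below with a frozen DenseNet201 backbone; the input size, head layout, and optimizer settings are assumptions, not the optimized configuration reported in the study.

import tensorflow as tf
from tensorflow.keras import layers, Model
from tensorflow.keras.applications import DenseNet201

base = DenseNet201(include_top=False, weights="imagenet",
                   input_shape=(224, 224, 3), pooling="avg")
base.trainable = False                           # train only the new head first
x = layers.Dense(128, activation="relu")(base.output)
out = layers.Dense(3, activation="softmax")(x)   # Flooded, Desert, Neither
model = Model(base.input, out)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])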
19 pages, 7485 KiB  
Article
Design of an Optimal Convolutional Neural Network Architecture for MRI Brain Tumor Classification by Exploiting Particle Swarm Optimization
by Sofia El Amoury, Youssef Smili and Youssef Fakhri
J. Imaging 2025, 11(2), 31; https://doi.org/10.3390/jimaging11020031 - 24 Jan 2025
Viewed by 471
Abstract
The classification of brain tumors using MRI scans is critical for accurate diagnosis and effective treatment planning, though it poses significant challenges due to the complex and varied characteristics of tumors, including irregular shapes, diverse sizes, and subtle textural differences. Traditional convolutional neural network (CNN) models, whether handcrafted or pretrained, frequently fall short in capturing these intricate details comprehensively. To address this complexity, an automated approach employing Particle Swarm Optimization (PSO) has been applied to create a CNN architecture specifically adapted for MRI-based brain tumor classification. PSO systematically searches for an optimal configuration of architectural parameters—such as the types and numbers of layers, filter quantities and sizes, and neuron numbers in fully connected layers—with the objective of enhancing classification accuracy. This performance-driven method avoids the inefficiencies of manual design and iterative trial and error. Experimental results indicate that the PSO-optimized CNN achieves a classification accuracy of 99.19%, demonstrating significant potential for improving diagnostic precision in complex medical imaging applications and underscoring the value of automated architecture search in advancing critical healthcare technology. Full article
(This article belongs to the Special Issue Learning and Optimization for Medical Imaging)
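The sketch below shows PSO searching over three architecture parameters (number of convolutional layers, filters, and dense units); `evaluate` is a stand-in for the real fitness function, which would build and train a candidate CNN and return its validation accuracy. All PSO coefficients and bounds are illustrative assumptions.

import numpy as np

def evaluate(params):                      # stub: replace with real train-and-validate
    layers_, filters, units = params
    return -((layers_ - 3) ** 2 + (filters - 64) ** 2 / 100 + (units - 128) ** 2 / 1000)

bounds = np.array([[1, 6], [16, 128], [32, 512]], dtype=float)
rng = np.random.default_rng(0)
pos = rng.uniform(bounds[:, 0], bounds[:, 1], size=(10, 3))
vel = np.zeros_like(pos)
pbest, pbest_val = pos.copy(), np.array([evaluate(p.round()) for p in pos])
gbest = pbest[pbest_val.argmax()].copy()

for _ in range(30):
    r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
    # Standard velocity update: inertia + cognitive pull + social pull.
    vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
    pos = np.clip(pos + vel, bounds[:, 0], bounds[:, 1])
    vals = np.array([evaluate(p.round()) for p in pos])
    improved = vals > pbest_val
    pbest[improved], pbest_val[improved] = pos[improved], vals[improved]
    gbest = pbest[pbest_val.argmax()].copy()

print("best architecture (layers, filters, units):", gbest.round())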
17 pages, 6472 KiB  
Article
A Method for Estimating Fluorescence Emission Spectra from the Image Data of Plant Grain and Leaves Without a Spectrometer
by Shoji Tominaga, Shogo Nishi, Ryo Ohtera and Hideaki Sakai
J. Imaging 2025, 11(2), 30; https://doi.org/10.3390/jimaging11020030 - 21 Jan 2025
Viewed by 581
Abstract
This study proposes a method for estimating the spectral images of fluorescence spectral distributions emitted from plant grains and leaves without using a spectrometer. We construct two types of multiband imaging systems with six channels, using ordinary off-the-shelf cameras and a UV light. A mobile phone camera is used to detect the fluorescence emission in the blue wavelength region of rice grains. For plant leaves, a small monochrome camera is used with additional optical filters to detect chlorophyll fluorescence in the red-to-far-red wavelength region. A ridge regression approach is used to obtain a reliable estimate of the spectral distribution of the fluorescence emission at each pixel point from the acquired image data. The spectral distributions can be estimated by optimally selecting the ridge parameter without statistically analyzing the fluorescence spectra. An algorithm for optimal parameter selection is developed using a cross-validation technique. In experiments using real rice grains and green leaves, the estimated fluorescence emission spectral distributions by the proposed method are compared to the direct measurements obtained with a spectroradiometer and the estimates obtained using the minimum norm estimation method. The estimated images of fluorescence emissions are presented for rice grains and green leaves. The reliability of the proposed estimation method is demonstrated. Full article
(This article belongs to the Special Issue Color in Image Processing and Computer Vision)
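A rough sketch of the ridge-regression estimation step is given below: each pixel's six channel responses c ≈ H s are inverted for the emission spectrum s with a ridge penalty, and the ridge parameter is chosen by leave-one-channel-out cross-validation. The system matrix H, the candidate λ grid, and the cross-validation scheme are assumptions standing in for the paper's procedure.

import numpy as np

def ridge_spectrum(H, c, lam):
    # Ridge solution of c = H @ s for the sampled emission spectrum s.
    n = H.shape[1]
    return np.linalg.solve(H.T @ H + lam * np.eye(n), H.T @ c)

def pick_lambda(H, c, lams=np.logspace(-4, 2, 25)):
    errs = []
    for lam in lams:
        err = 0.0
        for k in range(H.shape[0]):        # leave one camera channel out
            keep = np.arange(H.shape[0]) != k
            s = ridge_spectrum(H[keep], c[keep], lam)
            err += (H[k] @ s - c[k]) ** 2
        errs.append(err)
    return lams[int(np.argmin(errs))]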
13 pages, 3880 KiB  
Article
Remote Sensing Target Tracking Method Based on Super-Resolution Reconstruction and Hybrid Networks
by Hongqing Wan, Sha Xu, Yali Yang and Yongfang Li
J. Imaging 2025, 11(2), 29; https://doi.org/10.3390/jimaging11020029 - 21 Jan 2025
Viewed by 1052
Abstract
Remote sensing images have the characteristics of high complexity, being easily distorted, and having large-scale variations. Moreover, the motion of remote sensing targets usually has nonlinear features, and existing target tracking methods based on remote sensing data cannot accurately track remote sensing targets. In addition, obtaining high-resolution images by optimizing algorithms rather than acquiring new imagery can save considerable costs. To address the large tracking errors of current tracking algorithms on remote sensing targets, this paper proposes a target tracking method combined with a super-resolution hybrid network. Firstly, this method utilizes the super-resolution reconstruction network to improve the resolution of remote sensing images. Then, the hybrid neural network is used to estimate the target motion after target detection. Finally, identity matching is completed through the Hungarian algorithm. The experimental results show that the tracking accuracy of this method is 67.8%, and the identification F-measure (IDF1) value is 0.636. Its performance indicators are better than those of traditional target tracking algorithms, and it can meet the requirements for accurate tracking of remote sensing targets. Full article
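The identity-matching step named in the abstract can be sketched with SciPy's Hungarian solver as below, using a centroid-distance cost with a simple gating threshold; the super-resolution and motion-estimation networks are not reproduced, and the gate value is an illustrative assumption.

import numpy as np
from scipy.optimize import linear_sum_assignment

def match_tracks(prev_centroids, curr_centroids, max_dist=50.0):
    prev = np.asarray(prev_centroids, float)
    curr = np.asarray(curr_centroids, float)
    # Pairwise Euclidean distances between previous tracks and current detections.
    cost = np.linalg.norm(prev[:, None, :] - curr[None, :, :], axis=2)
    rows, cols = linear_sum_assignment(cost)
    # Discard assignments whose distance exceeds the gate; those become new/lost tracks.
    return [(r, c) for r, c in zip(rows, cols) if cost[r, c] <= max_dist]

# Example: match_tracks([(10, 12), (40, 80)], [(11, 14), (200, 300)]) -> [(0, 0)]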