MDPI - Publisher of Open Access Journals

24 pages, 64409 KB

Open Access Article

CA-DDPM: Conditionally Embedded Attention-Aided Denoising Diffusion Probabilistic Model for High-Quality SAR Image Generation

by Yang Zheng, Duhao Liu, Ruimin Li, Rongxu Wang, Junling Fan, Kaitai Guo and Jimin Liang

Remote Sens. 2026, 18(12), 1994; https://doi.org/10.3390/rs18121994 - 15 Jun 2026

Viewed by 161

Abstract

Deep learning-based automatic target recognition (ATR) for synthetic aperture radar (SAR) imagery requires large quantities of high-quality annotated data, yet real SAR samples are costly and difficult to obtain. Existing generative adversarial network (GAN)-based SAR generation methods often suffer from limited authenticity and insufficient diversity. To address these issues, we propose CA-DDPM, a conditionally embedded attention-aided denoising diffusion probabilistic model (DDPM) for high-quality multi-category SAR image generation. CA-DDPM employs a unified conditional embedding that fuses time-step and category information, injected into a U-Net backbone through a feature-wise linear modulation (FiLM)-based mechanism to achieve step-aware and class-aware denoising. Attention blocks are further incorporated to enhance the modeling of structural dependencies and fine scattering details. To evaluate generation quality, we develop a three-dimensional assessment framework that jointly examines authenticity, diversity, and utility in ATR. Authenticity is quantified using local and global similarity metrics under a unified Hungarian-matched statistical procedure, together with an SAR-adapted Fréchet inception distance (SAR-FID). Diversity is assessed through inter-category feature clustering, an SAR Inception Score (SAR-IS), and a newly proposed intra-category grayscale histogram-based metric. Utility is evaluated by hybrid training experiments across multiple ATR models. Experiments on the Moving and Stationary Target Acquisition and Recognition (MSTAR) dataset demonstrate that CA-DDPM produces more realistic and diverse SAR images than representative GAN- and DDPM-based baselines, and it effectively improves downstream ATR performance through data augmentation.
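
The conditioning idea in this abstract (a fused time-step and category embedding that modulates features through FiLM) can be illustrated with a minimal sketch. This is not the authors' implementation: the sinusoidal time embedding, the fusion by element-wise addition, and the function names `sinusoidal_embedding`, `fuse_condition`, and `film` are all assumptions made for illustration. In a real network the scale and shift would be produced from the fused condition by learned layers.

```python
import math


def sinusoidal_embedding(t, dim):
    """Sinusoidal embedding of an integer time step; dim is assumed even."""
    half = dim // 2
    freqs = [math.exp(-math.log(10000.0) * i / half) for i in range(half)]
    return [math.sin(t * f) for f in freqs] + [math.cos(t * f) for f in freqs]


def fuse_condition(t, class_embedding):
    """Fuse time-step and class information by element-wise addition.

    Addition is an assumption; the abstract only says the two are fused.
    """
    t_emb = sinusoidal_embedding(t, len(class_embedding))
    return [a + b for a, b in zip(t_emb, class_embedding)]


def film(features, gamma, beta):
    """FiLM: scale and shift each channel of `features` (channels x positions)."""
    return [
        [g * x + b for x in channel]
        for channel, g, b in zip(features, gamma, beta)
    ]


# Hypothetical usage: two channels, each with its own scale and shift.
modulated = film([[1.0, 2.0], [3.0, 4.0]], gamma=[2.0, 0.5], beta=[1.0, -1.0])
```

The point of FiLM is that the condition never touches the spatial layout of the features directly; it only rescales and offsets whole channels, which keeps the conditioning path cheap.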

(This article belongs to the Special Issue AI-Driven Remote Sensing Image Restoration and Generation)

25 pages, 7978 KB

Open Access Article

Machine Learning Approaches for Soil Moisture Prediction Using Ground Penetrating Radar: A Comparative Study of Tree-Based Algorithms

by Jantana Panyavaraporn, Paramate Horkaew, Rungroj Arjwech and Sitthiphat Eua-apiwatch

Earth 2025, 6(3), 98; https://doi.org/10.3390/earth6030098 - 16 Aug 2025

Cited by 2 | Viewed by 3185

Abstract

Accurate soil moisture estimation is critical for precision agriculture and water resource management, yet traditional sampling methods are time-consuming, destructive, and provide limited spatial coverage. Ground Penetrating Radar (GPR) offers a promising non-destructive alternative, but optimal machine learning approaches for GPR-based soil moisture prediction remain unclear. This study presents a comparative analysis of regression tree and boosted tree algorithms for predicting soil moisture content from GPR histogram features across 21 sites in Eastern Thailand. Soil moisture content was measured at multiple depths (0.5, 1.0, 1.5, 2.0, 2.5, and 3.0 m) using samples collected during Standard Penetration Test procedures. Feature extraction was performed using 16-bin histograms from processed GPR radargrams. A single regression tree achieved a cross-validation RMSE of 5.082 and an R² of 0.761, demonstrating superior training accuracy and interpretability. In contrast, the boosted tree ensemble achieved significantly better generalization performance, with a cross-validation RMSE of 4.7915 and an R² of 0.708, representing a 5.7% reduction in RMSE. Feature importance analysis revealed that specific histogram bins effectively captured moisture-related variations in GPR signal amplitude distributions. A comparative evaluation demonstrates that while single regression trees offer superior interpretability for research applications, boosted tree ensembles provide enhanced predictive performance that is essential for operational deployment in precision agriculture and hydrological monitoring systems.
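
Two quantities in this abstract are easy to make concrete: the 16-bin histogram feature vector and the 5.7% figure, which matches the relative drop in cross-validation RMSE from 5.082 to 4.7915. The sketch below assumes amplitudes normalized to the range [0, 1]; the function names and that normalization are hypothetical, not taken from the paper.

```python
def histogram_features(samples, bins=16, lo=0.0, hi=1.0):
    """Return a normalized histogram of `samples` as a feature vector."""
    if not samples:
        raise ValueError("samples must not be empty")
    counts = [0] * bins
    width = (hi - lo) / bins
    for s in samples:
        idx = int((s - lo) / width)
        idx = min(max(idx, 0), bins - 1)  # clamp values on or beyond the edges
        counts[idx] += 1
    return [c / len(samples) for c in counts]


def relative_improvement(baseline, improved):
    """Fractional reduction of an error metric relative to the baseline."""
    return (baseline - improved) / baseline


features = histogram_features([0.0, 0.5, 1.0, 0.99])
rmse_gain = relative_improvement(5.082, 4.7915)  # about 0.057, i.e. 5.7%
```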

20 pages, 7366 KB

Open Access Article

Histogram of Polarization Gradient for Target Tracking in Infrared DoFP Polarization Thermal Imaging

by Jianguo Yang, Dian Sheng, Weiqi Jin and Li Li

Remote Sens. 2025, 17(5), 907; https://doi.org/10.3390/rs17050907 - 4 Mar 2025

Cited by 1 | Viewed by 1520

Abstract

Division-of-focal-plane (DoFP) polarization imaging systems have demonstrated considerable promise in target detection and tracking in complex backgrounds. However, existing methods face challenges, including dependence on complex image preprocessing procedures and limited real-time performance. To address these issues, this study presents a novel histogram of polarization gradient (HPG) feature descriptor that enables efficient feature representation of polarization mosaic images. First, a polarization distance calculation model based on normalized cross-correlation (NCC) and local variance is constructed, which enhances the robustness of gradient feature extraction through dynamic weight adjustment. Second, a sparse Laplacian filter is introduced to achieve refined gradient feature representation. Subsequently, adaptive polarization channel correlation weights and the second-order gradient are utilized to reconstruct the degree of linear polarization (DoLP). Finally, the gradient and DoLP sign information are ingeniously integrated to enhance the capability of directional expression, thus providing a new theoretical perspective for polarization mosaic image structure analysis. The experimental results obtained using a self-developed long-wave infrared DoFP polarization thermal imaging system demonstrate that, within the same FBACF tracking framework, the proposed HPG feature descriptor significantly outperforms traditional grayscale {8.22%, 2.93%}, histogram of oriented gradient (HOG) {5.86%, 2.41%}, and mosaic gradient histogram (MGH) {27.19%, 18.11%} feature descriptors in terms of precision and success rate. The processing speed of approximately 20 fps meets the requirements for real-time tracking applications, providing a novel technical solution for polarization imaging applications.

(This article belongs to the Special Issue Recent Advances in Infrared Target Detection)

19 pages, 6005 KB

Open Access Article

Improving Dental Implant Outcomes: CNN-Based System Accurately Measures Degree of Peri-Implantitis Damage on Periapical Film

by Yi-Chieh Chen, Ming-Yi Chen, Tsung-Yi Chen, Mei-Ling Chan, Ya-Yun Huang, Yu-Lin Liu, Pei-Ting Lee, Guan-Jhih Lin, Tai-Feng Li, Chiung-An Chen, Shih-Lun Chen, Kuo-Chen Li and Patricia Angela R. Abu

Bioengineering 2023, 10(6), 640; https://doi.org/10.3390/bioengineering10060640 - 25 May 2023

Cited by 41 | Viewed by 7037

Abstract

As the popularity of dental implants continues to grow at a rate of about 14% per year, so do the risks associated with the procedure. Complications such as sinusitis and nerve damage are not uncommon, and inadequate cleaning can lead to peri-implantitis around the implant, jeopardizing its stability and potentially necessitating retreatment. To address this issue, this research proposes a new system for evaluating the degree of periodontal damage around implants using periapical film (PA). The system uses two convolutional neural network (CNN) models to detect the location of the implant and assess the extent of damage caused by peri-implantitis. One CNN model determines the location of the implant in the PA with an accuracy of up to 89.31%, while the other assesses the degree of peri-implantitis damage around the implant, achieving an accuracy of 90.45%. The system combines image cropping based on position information obtained from the first CNN with image enhancement techniques such as Histogram Equalization and Adaptive Histogram Equalization (AHE) to improve the visibility of the implant and gums. The result is a more accurate assessment of whether peri-implantitis has eroded to the first thread, a critical indicator of implant stability. To ensure the ethical and regulatory standards of our research, this proposal has been certified by the Institutional Review Board (IRB) under number 202102023B0C503. With no existing technology to evaluate peri-implantitis damage around dental implants, this CNN-based system has the potential to revolutionize implant dentistry and improve patient outcomes.
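
Histogram equalization, one of the enhancement steps named in this abstract, can be sketched in a few lines. The version below is the textbook global method for an 8-bit grayscale image stored as a list of rows; it is an illustration under that assumption and not the authors' pipeline, and the function name `equalize_histogram` is hypothetical. Adaptive variants such as AHE apply the same mapping per image tile instead of globally.

```python
def equalize_histogram(image, levels=256):
    """Globally equalize an integer grayscale image given as a list of rows."""
    flat = [p for row in image for p in row]
    hist = [0] * levels
    for p in flat:
        hist[p] += 1

    # Cumulative distribution of intensities.
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    n = len(flat)
    cdf_min = next(c for c in cdf if c > 0)
    if n == cdf_min:
        # A constant image has nothing to stretch; return an unchanged copy.
        return [row[:] for row in image]

    # Map each intensity so the output CDF is approximately linear.
    lut = [
        max(0, round((c - cdf_min) * (levels - 1) / (n - cdf_min)))
        for c in cdf
    ]
    return [[lut[p] for p in row] for row in image]


enhanced = equalize_histogram([[0, 64], [128, 255]])
```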

(This article belongs to the Special Issue Recent Advance in Biomaterials, Clinical Dentistry and Dental Diseases 2.0)

16 pages, 1198 KB

Open Access Article

A Swarming Meyer Wavelet Computing Approach to Solve the Transport System of Goods

by Zulqurnain Sabir, Tareq Saeed, Juan L. G. Guirao, Juan M. Sánchez and Adrián Valverde

Axioms 2023, 12(5), 456; https://doi.org/10.3390/axioms12050456 - 8 May 2023

Cited by 12 | Viewed by 2554

Abstract

This work presents numerical solutions of a reactive transport model for trucks carrying goods on roads, obtained with a stochastic procedure based on the Meyer wavelet (MW) neural network. An objective function is constructed by using the differential model and its boundary conditions. The objective function is optimized through the hybridization of global and local search procedures, i.e., swarming and interior point algorithms. Three different cases of the model are considered, and the exactness of the stochastic procedure is assessed by comparing the obtained solutions with the Adams solutions. The negligible absolute error supports the exactness of the proposed MW neural networks along with the hybridization of the global and local search schemes. Moreover, statistical interpretations based on different operators, histograms, and boxplots are provided to validate the constancy of the designed stochastic structure.

(This article belongs to the Special Issue Geometry and Nonlinear Computations in Physics)

14 pages, 2606 KB

Open Access Article

A Soft Computing Scaled Conjugate Gradient Procedure for the Fractional Order Majnun and Layla Romantic Story

by Zulqurnain Sabir and Juan L. G. Guirao

Mathematics 2023, 11(4), 835; https://doi.org/10.3390/math11040835 - 7 Feb 2023

Cited by 17 | Viewed by 2816

Abstract

The current study presents numerical results for the fractional order mathematical model based on the Majnun and Layla (FO-MML) romantic story. A stochastic computing numerical scheme based on scaled conjugate gradient neural networks (SCGNNs) is presented to solve the FO-MML. Fractional derivatives are used to achieve more accurate and realistic performances of the FO-MML romantic story model. The mathematical model is divided into four dynamics, and the exactness is authenticated by comparing the obtained results with reference Adams results. Moreover, the negligible absolute error supports the accuracy of the stochastic scheme. Fourteen neurons are used, and the data are divided into authorization, training, and testing sets of 12%, 77%, and 11%, respectively. The reliability, capability, and accuracy of the stochastic SCGNNs are assessed using regression, error histograms, correlation, and state transitions for solving the mathematical model.

23 pages, 5670 KB

Open Access Article

Masked Face Emotion Recognition Based on Facial Landmarks and Deep Learning Approaches for Visually Impaired People

by Mukhriddin Mukhiddinov, Oybek Djuraev, Farkhod Akhmedov, Abdinabi Mukhamadiyev and Jinsoo Cho

Sensors 2023, 23(3), 1080; https://doi.org/10.3390/s23031080 - 17 Jan 2023

Cited by 102 | Viewed by 16236

Abstract

Current artificial intelligence systems for determining a person’s emotions rely heavily on lip and mouth movement and other facial features such as eyebrows, eyes, and the forehead. Furthermore, low-light images are typically classified incorrectly because of the dark region around the eyes and eyebrows. In this work, we propose a facial emotion recognition method for masked facial images using low-light image enhancement and feature analysis of the upper features of the face with a convolutional neural network. The proposed approach employs the AffectNet image dataset, which includes eight types of facial expressions and 420,299 images. Initially, the facial input image’s lower parts are covered behind a synthetic mask. Boundary and regional representation methods are used to indicate the head and upper features of the face. Secondly, we effectively adopt a facial landmark detection method-based feature extraction strategy using the partially covered masked face’s features. Finally, the features, the coordinates of the landmarks that have been identified, and the histograms of the oriented gradients are then incorporated into the classification procedure using a convolutional neural network. An experimental evaluation shows that the proposed method surpasses others by achieving an accuracy of 69.3% on the AffectNet dataset.

(This article belongs to the Special Issue Application of Semantic Technologies in Sensors and Sensing Systems)

19 pages, 3220 KB

Open Access Article

APESTNet with Mask R-CNN for Liver Tumor Segmentation and Classification

by Prabhu Kavin Balasubramanian, Wen-Cheng Lai, Gan Hong Seng, Kavitha C and Jeeva Selvaraj

Cancers 2023, 15(2), 330; https://doi.org/10.3390/cancers15020330 - 4 Jan 2023

Cited by 65 | Viewed by 7489

Abstract

Diagnosis and treatment of hepatocellular carcinoma or metastases rely heavily on accurate segmentation and classification of liver tumours. However, due to the liver tumour’s hazy borders and wide range of possible shapes, sizes, and positions, accurate and automatic tumour segmentation and classification remains a difficult challenge. With the advancement of computing, new models in artificial intelligence have evolved. Following its success in natural language processing (NLP), the transformer paradigm has been adopted by the computer vision (CV) community. While there are already accepted approaches to classifying the liver, especially in clinical settings, there is room for advancement in terms of their precision. This paper applies a novel deep learning model for segmenting and classifying liver tumours. The model follows a three-stage procedure consisting of (a) pre-processing, (b) liver segmentation, and (c) classification. In the first phase, the collected Computed Tomography (CT) images undergo three stages of pre-processing, including contrast improvement via histogram equalization and noise reduction via the median filter. Next, an enhanced mask region-based convolutional neural network (Mask R-CNN) model is used to separate the liver from the CT abdominal image. To prevent overfitting, the segmented picture is fed into an Enhanced Swin Transformer Network with Adversarial Propagation (APESTNet). The experimental results show the superior performance of the proposed model on a wide variety of CT images, as well as its efficiency and low sensitivity to noise.

(This article belongs to the Special Issue Recent Advances in Deep Learning and Medical Imaging for Cancer Treatment)

22 pages, 3143 KB

Open Access Article

Enhancement of Mammographic Images Using Histogram-Based Techniques for Their Classification Using CNN

by Khalaf Alshamrani, Hassan A. Alshamrani, Fawaz F. Alqahtani and Bander S. Almutairi

Sensors 2023, 23(1), 235; https://doi.org/10.3390/s23010235 - 26 Dec 2022

Cited by 19 | Viewed by 4427

Abstract

In the world, one in eight women will develop breast cancer. Men can also develop it, but less frequently. This condition starts with uncontrolled cell division brought on by a change in the genes that regulate cell division and growth, which leads to the development of a nodule or tumour. These tumours can be either benign, which pose no health risk, or malignant, also known as cancerous, which put patients’ lives in jeopardy and have the potential to spread. The most common way to diagnose this problem is via mammograms. This kind of examination enables the detection of abnormalities in breast tissue, such as masses and microcalcifications, which are thought to be indicators of the presence of disease. This study aims to determine how histogram-based image enhancement methods affect the classification of mammograms into five groups: benign calcifications, benign masses, malignant calcifications, malignant masses, and healthy tissue, as determined by a CAD system of automatic mammography classification using convolutional neural networks. Both Contrast-Limited Adaptive Histogram Equalization (CLAHE) and Histogram Intensity Windowing (HIW) are used. These procedures modify the mammography histogram by improving the contrast between the image’s background, fibrous tissue, dense tissue, and sick tissue, which includes microcalcifications and masses. To help neural networks learn, the contrast has been increased to make it easier to distinguish between various types of tissue. The proportion of correctly classified images could rise with this technique. Using deep convolutional neural networks, a model was developed that classifies different types of lesions. The model achieved an accuracy of 62%, based on mini-MIAS data.
The final goal of the project is the creation of an update algorithm that will be incorporated into the CAD system and will enhance the automatic identification and categorization of microcalcifications and masses. This would increase the likelihood of early disease identification, which is important because early discovery increases the likelihood of a cure to almost 100%.

(This article belongs to the Special Issue Edge/Fog Computing and Blockchain for Reliable Time-Critical Applications)

14 pages, 8404 KB

Open Access Article

A Stochastic Bayesian Regularization Approach for the Fractional Food Chain Supply System with Allee Effects

by Basma Souayeh, Zulqurnain Sabir, Najib Hdhiri, Wael Al-Kouz, Mir Waqas Alam and Tarfa Alsheddi

Fractal Fract. 2022, 6(10), 553; https://doi.org/10.3390/fractalfract6100553 - 29 Sep 2022

Cited by 8 | Viewed by 2199

Abstract

The motive of the current research is to provide a stochastic platform based on artificial neural networks (ANNs) along with the Bayesian regularization approach for the fractional food chain supply system (FFSCS) with Allee effects. Investigations based on fractional derivatives are applied to achieve accurate and precise results for the FFSCS. The dynamical FFSCS is divided into a special predator category P(η), a top-predator class Q(η), and prey population dynamics R(η). Numerical results for three different variations of the dynamical FFSCS are provided by using the ANNs along with the Bayesian regularization approach. The data for the dynamical FFSCS are split into 78% for training and 11% each for testing and endorsement. The accuracy of the proposed ANNs along with the Bayesian regularization method is confirmed through comparison of the results. The rationality, ability, reliability, and exactness of the ANN procedure enhanced by the Bayesian regularization method are authenticated through regression measures, correlation values, error histograms, and state transition performances.

(This article belongs to the Special Issue Recent Advances in Fractional Differential Equations and Their Applications)

23 pages, 3880 KB

Open Access Article

DETECT-LC: A 3D Deep Learning and Textural Radiomics Computational Model for Lung Cancer Staging and Tumor Phenotyping Based on Computed Tomography Volumes

by Karma M. Fathalla, Sherin M. Youssef and Nourhan Mohammed

Appl. Sci. 2022, 12(13), 6318; https://doi.org/10.3390/app12136318 - 21 Jun 2022

Cited by 12 | Viewed by 4007

Abstract

Lung cancer is one of the primary causes of cancer-related deaths worldwide. Timely diagnosis and precise staging are pivotal for treatment planning, and thus can lead to increased survival rates. The application of advanced machine learning techniques helps in effective diagnosis and staging. In this study, a multistage neurobased computational model is proposed, DETECT-LC learning. DETECT-LC handles the challenge of choosing discriminative CT slices for constructing 3D volumes, using Haralick, histogram-based radiomics, and unsupervised clustering. ALT-CNN-DENSE Net architecture is introduced as part of DETECT-LC for voxel-based classification. DETECT-LC offers an automatic threshold-based segmentation approach instead of the manual procedure, to help mitigate this burden for radiologists and clinicians. Also, DETECT-LC presents a slice selection approach and a newly proposed relatively lightweight 3D CNN architecture to improve on the performance of existing studies. The proposed pipeline is employed for tumor phenotyping and staging. DETECT-LC performance is assessed through a range of experiments, in which DETECT-LC attains outstanding performance surpassing its counterparts in terms of accuracy, sensitivity, F1-score and Area under Curve (AuC). For histopathology classification, DETECT-LC average performance achieved an improvement of 20% in overall accuracy, 0.19 in sensitivity, 0.16 in F1-Score and 0.16 in AuC over the state of the art. A similar enhancement is reached for staging, where higher overall accuracy, sensitivity and F1-score are attained with differences of 8%, 0.08 and 0.14.

(This article belongs to the Special Issue Advances in Deep Learning-Based Medical Image Analysis)

24 pages, 2364 KB

Open Access Article

A Fully Automatic Procedure for Brain Tumor Segmentation from Multi-Spectral MRI Records Using Ensemble Learning and Atlas-Based Data Enhancement

by Ágnes Győrfi, László Szilágyi and Levente Kovács

Appl. Sci. 2021, 11(2), 564; https://doi.org/10.3390/app11020564 - 8 Jan 2021

Cited by 28 | Viewed by 3860

Abstract

The accurate and reliable segmentation of gliomas from magnetic resonance image (MRI) data has an important role in diagnosis, intervention planning, and monitoring the tumor’s evolution during and after therapy. Segmentation has serious anatomical obstacles like the great variety of the tumor’s location, size, shape, and appearance and the modified position of normal tissues. Other phenomena like intensity inhomogeneity and the lack of standard intensity scale in MRI data represent further difficulties. This paper proposes a fully automatic brain tumor segmentation procedure that attempts to handle all the above problems. Having its foundations on the MRI data provided by the MICCAI Brain Tumor Segmentation (BraTS) Challenges, the procedure consists of three main phases. The first pre-processing phase prepares the MRI data to be suitable for supervised classification, by attempting to fix missing data, suppressing the intensity inhomogeneity, normalizing the histogram of observed data channels, generating additional morphological, gradient-based, and Gabor-wavelet features, and optionally applying atlas-based data enhancement. The second phase accomplishes the main classification process using ensembles of binary decision trees and provides an initial, intermediary labeling for each pixel of test records. The last phase reevaluates these intermediary labels using a random forest classifier, then deploys a spatial region growing-based structural validation of suspected tumors, thus achieving a high-quality final segmentation result. The accuracy of the procedure is evaluated using the multi-spectral MRI records of the BraTS 2015 and BraTS 2019 training data sets. The procedure achieves high-quality segmentation results, characterized by average Dice similarity scores of up to 86%.
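
The Dice similarity score used to report segmentation accuracy here is defined as twice the overlap between the predicted and reference masks divided by the sum of their sizes. The sketch below computes it for flattened binary masks; the function name and the convention of returning 1.0 when both masks are empty are assumptions for illustration, not details from the paper.

```python
def dice_score(predicted, reference):
    """Dice similarity between two flattened binary masks of equal length."""
    if len(predicted) != len(reference):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(1 for p, r in zip(predicted, reference) if p and r)
    size_sum = sum(1 for p in predicted if p) + sum(1 for r in reference if r)
    if size_sum == 0:
        # Both masks are empty: treated here as perfect agreement (a convention).
        return 1.0
    return 2.0 * intersection / size_sum


# One overlapping pixel out of two labeled pixels in each mask.
score = dice_score([1, 1, 0, 0], [1, 0, 1, 0])
```

A score of 0.86, as reported in the abstract, means the overlap is 86% of the average size of the two masks.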

(This article belongs to the Section Electrical, Electronics and Communications Engineering)

24 pages, 11986 KB

Open Access Article

Boosting Multi-Vehicle Tracking with a Joint Object Detection and Viewpoint Estimation Sensor

by Roberto J. López-Sastre, Carlos Herranz-Perdiguero, Ricardo Guerrero-Gómez-Olmedo, Daniel Oñoro-Rubio and Saturnino Maldonado-Bascón

Sensors 2019, 19(19), 4062; https://doi.org/10.3390/s19194062 - 20 Sep 2019

Cited by 163 | Viewed by 4735

Abstract

In this work, we address the problem of multi-vehicle detection and tracking for traffic monitoring applications. We present a novel intelligent visual sensor for tracking-by-detection with simultaneous pose estimation. Essentially, we adapt an Extended Kalman Filter (EKF) to work not only with the detections of the vehicles but also with their estimated coarse viewpoints, directly obtained with the vision sensor. We show that enhancing the tracking with observations of the vehicle pose results in a better estimation of the vehicles' trajectories. For the simultaneous object detection and viewpoint estimation task, we present and evaluate two independent solutions. One is based on a fast GPU implementation of a Histogram of Oriented Gradients (HOG) detector with Support Vector Machines (SVMs). For the second, we modify and train the Faster R-CNN deep learning model in order to recover from it not only the object localization but also an estimation of its pose. Finally, we publicly release a challenging dataset, the GRAM Road Traffic Monitoring (GRAM-RTM), which has been especially designed for evaluating multi-vehicle tracking approaches within the context of traffic monitoring applications. It comprises more than 700 unique vehicles annotated across more than 40,300 frames of three videos. We expect GRAM-RTM to become a benchmark in vehicle detection and tracking, providing the computer vision and intelligent transportation systems communities with a standard set of images, annotations and evaluation procedures for multi-vehicle tracking. We present a thorough experimental evaluation of our approaches with the GRAM-RTM, which will be useful for establishing further comparisons. The results obtained confirm that the simultaneous integration of vehicle localizations and pose estimations as observations in an EKF improves the tracking results.

(This article belongs to the Section Intelligent Sensors)

16 pages, 5518 KB

Open Access Article

A Hybrid Proposed Fundus Image Enhancement Framework for Diabetic Retinopathy

by Imran Qureshi, Jun Ma and Kashif Shaheed

Algorithms 2019, 12(1), 14; https://doi.org/10.3390/a12010014 - 4 Jan 2019

Cited by 54 | Viewed by 8903

Abstract

Diabetic retinopathy (DR) is a complication of diabetes that causes visual impairment and is diagnosed across ethnicities in the working-age population worldwide. Fundus angiography is a widely applicable modality used by ophthalmologists and computerized applications to detect DR-based clinical features such as microaneurysms (MAs), hemorrhages (HEMs), and exudates (EXs) for early screening of DR. Fundus images are usually acquired using funduscopic cameras under varied lighting conditions and angles. Therefore, these images are prone to non-uniform illumination, poor contrast, transmission error, low brightness, and noise. This paper presents a novel, real-time fundus image enhancement mechanism for early grading of diabetic retinopathy, macular degeneration, retinal neoplasms, and choroid disruptions. The proposed system has two steps: (i) an RGB fundus image is converted into the lightness attribute (denoted J) of the CIECAM02 color appearance model to obtain image information in grayscale with bright light; (ii) the resulting J component is processed using a nonlinear contrast enhancement approach to improve the textural and color features of the fundus image without any further extraction steps. To test and evaluate the strength of the proposed technique, several performance and quality measures, namely peak signal-to-noise ratio (PSNR), contrast-to-noise ratio (CNR), entropy (content information), histograms (intensity variation), and the structural similarity index measure (SSIM), were applied to 1240 fundus images from two publicly available datasets, DRIVE and MESSIDOR. The experiments showed that the proposed enhancement procedure outperformed histogram-based approaches in terms of contrast, sharpness of fundus features, and brightness. They further indicated that it can be a suitable preprocessing tool for algorithms that segment and classify DR-related features.

34 pages, 20737 KB

Open AccessArticle

Infrared Image Enhancement Using Adaptive Histogram Partition and Brightness Correction

by Minjie Wan, Guohua Gu, Weixian Qian, Kan Ren, Qian Chen and Xavier Maldague

Remote Sens. 2018, 10(5), 682; https://doi.org/10.3390/rs10050682 - 27 Apr 2018

Cited by 71 | Viewed by 12308

Abstract

Infrared image enhancement is a crucial pre-processing technique in intelligent urban surveillance systems for Smart City applications. Existing grayscale mapping-based algorithms always suffer from over-enhancement of the background, noise amplification, and brightness distortion. To cope with these problems, an infrared image enhancement method based on adaptive histogram partition and brightness correction is proposed. First, the grayscale histogram is adaptively segmented into several sub-histograms by a locally weighted scatter plot smoothing algorithm and local minima examination. Then, the foreground and background sub-histograms are distinguished according to a proposed metric called grayscale density. The foreground sub-histograms are equalized using a local contrast weighted distribution to enhance local details, while the background sub-histograms keep their corresponding proportions of the whole dynamic range to avoid over-enhancement. Meanwhile, a visual correction factor that accounts for the properties of human vision is designed to reduce the effect of noise during grayscale re-mapping. Lastly, particle swarm optimization is used to correct the mean brightness of the output using a reference image. Both qualitative and quantitative evaluations on real infrared images demonstrate the superiority of our method over other conventional methods.

(This article belongs to the Special Issue Instrumenting Smart City Applications with Big Sensing and Earth Observatory Data: Tools, Methods and Techniques)
