Next Article in Journal
Analysis of Intestinal and Nasopharyngeal Microbiota of Children with Meningococcemia in Pediatric Intensive Care Unit: INMACS-PICU Study
Previous Article in Journal
All That Glitters in cfDNA Analysis Is Not Gold or Its Utility Is Completely Established Due to Graft Damage: A Critical Review in the Field of Transplantation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Learning-Based Classification and Feature Extraction for Predicting Pathogenesis of Foot Ulcers in Patients with Diabetes

by
V. Sathya Preiya
1,* and
V. D. Ambeth Kumar
2,*
1
Department of Computer Science and Engineering, Panimalar Engineering College, Anna University, Chennai 600123, India
2
Department of Computer Engineering, Mizoram University, Aizawl 796004, India
*
Authors to whom correspondence should be addressed.
Diagnostics 2023, 13(12), 1983; https://doi.org/10.3390/diagnostics13121983
Submission received: 18 May 2023 / Revised: 28 May 2023 / Accepted: 31 May 2023 / Published: 6 June 2023

Abstract

:
The World Health Organization (WHO) has identified that diabetes mellitus (DM) is one of the most prevalent disease worldwide. Individuals with DM have a higher risk of mortality, and it is crucial to prioritize the treatment of foot ulcers, which is a significant complication associated with the disease, as they lead to the development of plantar ulcers, which results in the need to amputate part of the foot or leg. People with diabetes are at risk of experiencing various complications, such as heart disease, eye problems, kidney dysfunction, nerve damage, skin issues, foot ulcers, and dental diseases. Unawareness of the risk associated with diabetic foot ulcers (DFU) is a significant contributing factor to the mortality of diabetic patients. Evolving technological advancements such as deep learning techniques can be used to predict the symptoms of diabetic foot ulcers as early as possible, which helps to provide effective treatment to DM patients. This research introduces a methodology for analyzing images of foot ulcers in diabetic patients, focusing on feature extraction and classification. The dataset used in this study was collected from historical medical records and foot images of patients with diabetes, who commonly experience foot ulcers as a major complication. The dataset was pre-processed and segmented, and features were extracted using a deep recurrent neural network (DRNN). Image and numerical/text data were extracted separately, and the normal and abnormal diabetes ranges were identified. Foot images of patients with abnormal diabetes ranges were separated and classified using a pre-trained fast convolutional neural network (PFCNN) with U++net. The classification procedure involves the analysis of foot ulcers to predict their pathogenesis. To assess the effectiveness of the proposed technique, the study presented simulation results, including a confusion matrix and receiver operating characteristic curve. These results specifically focused on predicting two classes: normal and abnormal diabetes foot ulcerations. The analysis yielded various parameters, including accuracy, precision, recall curve, and area under the curve. The main goal of the study was to introduce an novel technique for assessing the risk of foot ulceration development in patients with diabetes, leveraging the analysis of foot ulcer images. The researchers collected a dataset of foot images and medical data from historical records of patients with diabetes and pre-processed and segmented the data. They then used a deep recurrent neural network to extract features from the segmented data and identified normal and abnormal diabetes ranges based on numerical and text data. Foot images of patients with abnormal diabetes ranges were classified using a pre-trained fast convolutional neural network with U++net to examine foot ulcers and forecast the development of the risk of diabetic foot ulcers (DFU). The study assessed the accuracy of the proposed technique as 99.32% by simulating results for feature extraction and the classification of diabetic foot ulcers. A comparison was made between this proposed technique and existing approaches.

1. Introduction

High-throughput computing and developments in the field of biotechnology consistently contribute to affordable data production and a faster way of analysing data using big data technology in biological research. The aim is to build a faster-responding framework that considers the fast-growing biotic data and provides quick an probable responses to basic queries in the medical and biological areas. To achieve efficiency and reliability, choosing the correct approach to identify a pattern and create an efficient model from the dataset given is essential. Research on the prognosis and treatment of diseases that pose a threat to human life is crucial, and diabetes mellitus (DM) is one of the most significant diseases falling under this category. Statistical data from national science of medicine showed that, as of 2019, the number of individuals with diabetes in India was estimated to be 77 million, and is projected to exceed 134 million by 2045, with approximately 57% of cases remaining undiagnosed. Type 2 diabetes, which constitutes the majority of cases, can result in complications affecting multiple organs, which can be broadly categorized into micro-vascular and macro-vascular complications. These complications significantly contribute to premature morbidity and mortality rates among diabetic individuals, leading to a reduced life expectancy, financial burden, and other costs, which impose a profound economic burden on the Indian healthcare system. The cause of diabetesis not fully understood, although researchers suggest that both genetic and environmental factors may contribute to its development [1]. Although DM cannot be cured, medication and drugs can assist in its management. The timely detection and management of diabetes mellitus can help prevent complications and reduce the risk of severe health problems [2]. It can be diagnosed in two ways. The first way is manual diagnosed by a medical practitioner and the second is by using automated instruments. Each way has its own advantages and disadvantages. Manual diagnosis enables healthcare providers to depend on their skills without the need for machine intervention [3]. However, in the initial stages of DM, symptoms may be so mild that even an experienced doctor may not be able to identify them [4].
Dysregulated glycemia can lead to both acute and chronic complications. Diabetes mellitus is associated with an increased predisposition to diabetic heart disease (DHD), a cardiac pathology specific to individuals with diabetes. DHD is associated with several health issues, such as high cholesterol and high blood sugar, significantly increasing the risk of stroke or heart attack [5]. However, people with diabetes may not experience any symptoms due to the nerve damage that affects the nerves controlling the heart. DHD is a leading cause of death among individuals with diabetes, particularly women, who are often underdiagnosed for heart disease. Young or diabetic women are especially vulnerable, and many heart attacks in women are not accurately diagnosed [6]. Studies have found that women with diabetes have a risk of sudden death equivalent to half of that in men suffering from diabetes. Patients with type 2 diabetes are at from two- to four-fold higher risk of developing DHD as compared to those without diabetes [7].
Diabetic foot ulcers are a common issue among many diabetic patients, and if left untreated, they can result in partial or total amputation [8]. Early detection and treatment of these ulcers can prevent their development. However, traditional methods of diagnosis can be inadequate, as diabetic patients often experience a gradual loss of sensation, making self-examination difficult. Furthermore, currently, no automated system exists for the timely identification of diabetic foot ulcers. Prior research has demonstrated an association between elevated temperature and the onset of these ulcers [9].
Various algorithms are employed for the timely detection of diabetic foot ulcers via temperature evaluation. These algorithms encompass infrared images, liquid crystal thermography (LCT), and an infrared (IR) thermometer, and the temperature sensors are incorporated in the weighing scale. To predict DHD at an early stage, various data mining technologies and machine learning techniques are employed [10]. With the advancement of machine learning (ML) and artificial intelligence (AI), an automated program that assists in the early detection and diagnosis of diseases is becoming more viable and efficient than manual methods. The benefits of using automated programs include reducing the workload of medical professionals and minimizing the risk of human error [11]. Computer-based decision support systems can play a crucial rolein facilitating effective diagnosis and management.
The field of DM generates large amounts of data through laboratory evaluations, patient reports, treatments, follow-ups, medications, etc. However, manually organizing all these data can be challenging and lead to poor data quality. To address this issue, hospitals are now utilizing automated devices to perform collation and sharing through the information systems [12]. Automated systems are not only more efficient but also more reliable than manual discovery and analysis. Therefore, it is crucial to develop automated DM systems using machine learning or artificial intelligence techniques. While these methods have their own advantages and limitations, the use of explainable AI systems can improve trust in future AI systems. Causality is also a crucial factor, which can be assessed based on efficacy, usefulness, and transparency for end-users. In recent years, researchers have employed machine learning (ML) and artificial intelligence (AI) techniques to personalize the management and control of diabetes mellitus [13].
The following Section 2 in this paper presents a comprehensive overview of the existing research on DF images. The system model and finer points of the proposed approach are elaborated in Section 3, while Section 4 provides a discussion and results analysis of the model. Finally, a discussion of the conclusions of this study is presented, and potential directions for future research are outlined, in Section 5.

2. Related Works

Research studies indicate that skin temperature variations can be measured through the use of thermal images to detect diabetic foot ulcer infection. A number of authors have utilized thermal foot images to distinguish diabetic foot conditions characterized by sympathetic skin reaction from those without a sympathetic skin reaction [14]. Furthermore, the findings of this study demonstrate thermal foot imaging’s efficiency in detecting the presence of diabetic foot diseases, monitoring their progression and evaluating the effectiveness of related treatments [15]. A method of achieving this involves the division of images into six regions of interest (ROIs): arch, heel, forehead, lateral sole, hallux, and lesser toes, followed by the mean calculation of temperature differences for comparison with healthy participants’ temperatures. In a previous study, the authors proposed a system that utilized thermal images to diagnose diabetic foot conditions through the computation of corresponding point temperatures in both feet [16]. The findings indicated that the average temperature among patients with neuropathy ranged between 32.8 and 27.9 °C. Additionally, in another study, the author presented a system for detecting diabetic foot ulcers, which entailed the application of image-processing techniques such as segmentation, registration, and abnormality detection for thermal images. The process of segmenting the foot image involved the utilization of active contours, while background removal was achieved through the application of B-spline non-rigid registration, resulting in alignment [17]. The final step in the process involved the detection of abnormalities by subtracting the level of intensity in the corresponding position from the left foot to the right foot. Specifically, if the temperature difference exceeds a predefined threshold, it is deemed indicative of an abnormal region, as per the findings of this study.In another study, infrared images were employed for the discrimination of diabetic foot ulcer classes: those with no visible signs, those with local complications, and those with diffuse complications [18]. The mean temperature difference between the ipsilateral and contralateral foot can be calculated to accomplish this. The researcher utilized thermal imaging to detect abnormalities in diabetic foot conditions by correlating area temperatures with color codes. The approach entailed employing a rainbow palette, which was segmented into 10 distinct colors based on differences between them [19]. MATLAB mobile platform and Android smartphone were employed in the study to create an Android-based thermal system. Based on a comparison with a normal test image, the system predicted ulcers from four different ROI areas. Despite this, the study did not focus on cases that carried a high rate of complications, relying only on static thermal information in predicting ulcers in diabetics with grade-zero foot conditions [20]. The use of infrared images for detecting diabetic foot conditions through the application of image processing techniques was explored in a previous study [21], which reported the highest accuracy rate of 95.66%. However, the latitude of this research was limited to detect abnormalities. In the study conducted by [22], they utilized an alternate convolutional neural network to classify diabetic foot ulcers. Meanwhile, in [23], a thermal system was proposed to detect abnormalities in diabetic foot by extracting the textural and entropy features from the decomposed discrete wavelet transform (DWT) and higher-order spectra (HOS). However, the study was limited in that it could not distinguish between different types of diabetic foot conditions, achieving an identification rate of 89.39%. In [24], the authors focused on developing an ensemble model that utilized a majority voting technique to combine unweighted predictions from various machine learning models. On the other hand, the authors of [25] focused on discriminating techniques, where they combined the basic classifiers through a process that could adjust to the input observations and output requirements of each individual learning system.In the training set, the type of combination employed could be optimized by assigning weight to each classifier in order to improve the combined performance. In a study by the authors of [26], ensemble-based methods were recommended as the most effective approach for data stream classification problems. Furthermore, the authors compared ensemble learning with sixty other algorithms. Similarly, the author of [27] notes that in the realm of machine learning, multiple classifier systems have undergone considerable development in recent years, presenting promising solutions to a range of problems and enhancing precision.

3. Proposed Work

3.1. Architectural Framework

The technique presented in this research endeavors to identify the risk of diabetic foot ulcers in patients with diabetes through the analysis of foot ulcer images, as illustrated in Figure 1. In this study, the DFU2020 dataset was utilized to extract the feature set from foot ulcer images and classify them as either normal or abnormal for patients with diabetes. The dataset consisted of both numerical and image data, which were pre-processed and segmented, and feature vectors were extracted using advanced deep neural networks, including deep recurrent neural networks (DRNN) for numerical data and deep neural networks (DNN) for image data. To detect the level of diabetes, the numerical feature vectors were analyzed, and if the diabetes range was higher, appropriate treatment was provided. The diabetic foot images of the patients were subsequently employed for the classification task, aiming to predict the pathogenesis of foot ulcers. However, if the diabetes range was normal, the data were updated in the database for future reference to identify nerve damage and predict the development or progression of foot ulcers. The foot ulcer images of patients with abnormal diabetes ranges were classified using a pre-trained fast convolutional neural network (PFCNN) with U++net. The classification results helped to predict the risk of foot ulcers. Figure 1 illustrates the flow schematic diagram for foot ulcer classification and provides a comprehensive overview of the entire process.

3.2. Feature Extraction Using Deep Recurrent Neural Network (DRNN)

Utilizing the depiction of deep neural networks (DNN), we formulated the computational protocols for the forward and backward propagation, which serve as quintessential constituents of both the inference and training phases. A schematic of the DRNN architecture can be found in Figure 2.
As a general rule, an a-by-b block matrix and the weight matrix are assumed to be W R a × b for a fully connected layer in a deep neural network (DNN) block matrix. This matrix, W, comprises several diagonal sub-matrices of size p × p . Since a total of m p × n p sub-matrices of size m p × n p exist, every sub-matrix involved in the investigation possesses its individual permutation parameters, and each of them is assigned an index based on their position, which varies from 0 to the product of m p and n p subtracted by 1. The permutated parameter value for the sub-matrix located on the lth diagonal is denoted as k l . Consequently, any value of w i j in W can be represented in the following manner:
w i j = q k l × p + c if ( c + k l ) mod p d otherwise
The variables c and d are congruent to i and j modulo e, respectively. The value of l can be computed as the product of ( i / p ) and ( n / p ) + ( j / e ) . The vector m, which includes all non-zero elements of W, is represented as q = ( q 0 , q 1 , , q m n / p 1 ) . Feature extraction using a DRNN is illustrated in Figure 2.
In the FC layer’s inference phase, y = ψ ( a ) = ψ ( W x ) 4 is used for forward propagation. In this context, the activation function is represented by ψ ( · ) , and a = W x = ( a 1 , a 2 , , a m ) T , where x = ( x 1 , x 2 , , x n ) T denotes the input vector, and y = ( y 1 , y 2 , , y m ) T represents the output vector. The activation function plays a crucial role in this scenario. To streamline the calculation of a i using the block diagonal weight matrix X, which is trained and comprises entries as per Equation (2), equation g = 0 n / p 1 w i j x j is applied. The values of j are determined by the formula j ( i + k l ) mod p + g p , with l = g + ( i / p ) × ( n / p ) . During the inference phase, weight storage necessitates only the m n / p minus the length of the vector q that includes all the non-zero entries of X. This significantly simplifies the original computation, as represented by Equation (2).
a i = g = 0 n 1 w i j x j
In order to retain the diagonal structure of the fully connected layer in a deep neural network throughout each iteration of the training phase, it is essential to include backward propagation. During forward propagation, the value of y is ψ ( a ) and the a take the values of ( a 1 , a 2 , , a m ) T . During the backward propagation process used to compute the gradient, the equation of J w i j = x j J a i is utilized, where J denotes the loss function of the neural network. As per the backward propagation principle, the present layer’s t t h is transmitted to the prior layer as a ( t 1 ) . Consequently, using the above equation, the updating rule for weight to the fully connected layer of DRNN can be derived.
w i j w i j x j J a i , for any w i j 0
J x j = g = 0 m / p 1 w i j J a i
The calculation of i is determined by the congruence of ( j + p k l ) mod p and adding it with g p ; here, the l is calculated as the sum of the ratio of g n to p and the ratio of j to p. The rate of learning is represented as η . In Equation (3), by utilizing Equation (4), it is feasible to streamline the computation of J a i via the backward propagation of the current layer’s x i as a i to the preceding layer. It is worth highlighting that Equations (3) and (4) guarantee that the weight update procedure always culminates in a sparse network that exhibits a block diagonal structure. This, in turn, yields a practical and efficient end-to-end training solution.

3.3. Inference of CONV Layer in Forward Propagation

In convolutional neural networks (CNNs), the weight tensor of a CONV layer can be viewed as a “macro”-matrix where each entry represents a filter kernel. Inspired by the diagonal structure of weight matrices in fully connected (FC) layers, a similar approach can be applied to the 4D weight tensor of a CONV layer. This can introduce diagonal dimensions to both the inward and outward channels of the weight tensor. Using this approach, a diagonal matrix can be used to specify the forward propagation of the CONV layer, similar to the method used in the FC layer. The fundamental idea behind this approach is that a diagonal matrix can simplify the computation of the output layer, and this methodology is applicable to both FC and CONV layers.
Y ( i , x , y ) = a = 0 c 2 / p 1 w = 0 w 1 1 h = 0 h 1 0 F ( i , j , w , h ) ( j , x w , y h )
According to Equation (5), the convolutional layer involves an input tensor X belonging to R c 0 × w 0 × h 0 , an output tensor Y belonging to R c 2 × w 2 × h 2 , and a weight tensor F belonging to R c 0 × c 2 × w 1 × h 1 . Here, the width and height of the input, kernel, and output tensors are denoted by w i and h i , where i takes the value of 0, 1, and 2, respectively. Furthermore, c 0 indicates the number of input channels and c 2 indicates the number of output channels. It is also worth noting that l is calculated as g added to the product of ( i / p ) and ( n / p ) , where j is the congruence of ( i + k l ) mod p + g p .

3.4. Training the CONV Layer in Backward Propagation

For the preservation of the diagonal structure of the convolutional layer in DNN during the training phase, it is necessary to modify the weight updating process. This can be achieved by following a similar approach to the fully connected layer. Consequently, the rule for updating the weight to propagate backward in the convolutional layer can be expressed as follows.
F ( i , j , w , h ) F ( i , j , w , h ) x = 0 w 2 y = 0 h 2 ( i , x w , y h ) J X ( i , x , y ) , for any F ( i , j , w , h ) 0
J X ( i , j , x ) = g = 0 ( c o / ρ ) 1 w = 0 w 1 1 h = 0 h 1 1 F ( i , j , w , h ) J Y ( i , x + w , y + h )
In order to ensure that the diagonal structure of the convolutional layer is maintained while performing the training, the process of updating the weight needs to be re-designed. Equation (6) calculates J X ( i , x , y ) and the value of i is calculated as ( j + p k l ) mod p + g p and the value of l is calculated as g n p + j p . It should be noted that this calculation is supported by the calculation in Equation (6). According to Equation (7), every X ( j , x , y ) present in the current layer will be retroactively propagated to the prior layer as M ( j , x , y ) . The computation in Equation (6) is helpful in this calculation, since each current layer’s X ( j , x , y ) is propagated backward into the prior layer as M ( j , x , y ) . By updating the weights according to Equations (6) and (7), the sparsely trained network will maintain the block diagonal structure.
After the feature extraction from the image, the numerical/text data of the diabetes dataset are obtained using recurrent neural networks (RNNs). RNNs are similar to fully connected neural networks but with some of their layers refactored into a loop, as illustrated in Figure 3. There are two inputs to this loop, which is iteratively added or concatenated, and a matrix multiplication and non-linear function are applied.
The proposed neural network architecture for processing phrases involves three input layers. The first layer receives a feature vector that contains information about the words in the phrase. This vector can vary in size and is fed into an LSTM layer. The second layer captures contextual information and has a variable length. To capture the context, a window size of n is considered, where n words to the left and right of the phrase are concatenated and fed into the LSTM layer. The last layer is a fixed-length input layer that processes a vector containing general information about the phrase and its context. This vector includes the minimum and maximum coordinate-wise values of a word vector and binary features indicating the presence or absence of various attributes in the phrase. Using a fixed-length sparse vector is not feasible since the length and context of phrases vary. Thus, the proposed architecture employs recurrent neural networks for processing vectors of arbitrary length, ensuring optimal performance.

3.5. Pre-Trained Fast Convolution Neural Network (PFCNN) for Classification

The diabetic foot ulcer is classified into five stages, wherein stage 1 represents a normal foot, emphasizing the importance of maintaining healthy feet and using appropriate footwear while also stressing the need for regular foot care. Stage 2 indicates a high-risk foot, where foot ulcers require attention due to the possibility of their developing complications such as neuropathy, ischaemia, deformity, swelling, and callus. Stages 3, 4, and 5 represent ulcerated, infected, and necrotic feet, respectively. Starting from stage 3, the foot is considered ulcerated and requires immediate attention. Stage 3 represents damage to the peripheral nerve, increasing the likelihood of developing neuropathy-related risks, which may eventually lead to cardiovascular disease. Thus, stages 1 and 2 are considered normal, while stages 3–5 are classified as abnormal. For patients with foot images showing an abnormal diabetes range, a PFCNN is utilized as a linear classifier to predict the risk of foot ulcers. During training, the CNNs in the architecture the PFCNN architecture employs the ReLU activation and completes the dropout after each convolutional layer, as the PFCNN model itself uses weighted softmax cross entropy loss with an Adam optimizer. The seven frames of size 60 × 40, centered on the current frame, are used as inputs, with hardwired kernels applied to produce the multiple channels of information. The second layer generates 33 feature maps across five channels: optflow-x, gradient-x, gray, gradient-y and optflow-y. The convolutional layers in CNNs are succeeded by an output layer and one or more fully connected layers. Figure 4 depicts the architecture of the PFCNN. A label weight is calculated by inverting the occurrence of each label in the training set.
It is depicted that the input that goes into a layer s of a neural network I ( m ) . For a pre-trained convolutional layer m, the input is a 3D object with dimensions n 1 ( m 1 ) × n 2 ( m 1 ) × n 3 ( m 1 ) and n c ( m 1 ) channels, where I ( m 1 ) R n 1 ( m 1 ) × n 2 ( m 1 ) × n 3 ( m 1 ) . The elements of the input are represented by I i , j , k ( m , l ) , where i, j, and k are the index of the 3D volume and l picks the channel.
Let the volume of filters or channels produced by a convolutional layer m, denoted by n c ( s ) , along with its dimensions, i.e., n 1 ( s ) × n 2 ( s ) × n 3 ( s ) , define its output. The result of mth layer is produced by convolving its input with a specific filter. This process can be expressed as I ( m , l ) i , j , k = f tanh ( b ( m , l ) ) + i , j , k , l I ( m 1 , l ) i i , j j , k k W ( m l ) i , j , k , l . In this equation, the parameters W ( m , l ) and b ( m , l ) define the filter used for the lth feature map in layer m. The positions where the filters are assessed, which are represented by the variables i, j, and k in the equation I ( m , l ) i , j , k , as well as the dimensions of the filters, which are represented by the non-zero values of W ( m , l ) , are both determined by the network architecture parameters. Additionally, the activation function used is a hyperbolic tangent with f tanh ( a ) = tanh ( a ) .
By preserving the spatial arrangement of the convolutional layer’s input, it gradually constructs increasingly intricate representations of the input as more layers are added. The resulting convolutional layer’s output is subsequently fed into a fully connected network layer. Convolutional layers produce a unified vector output regardless of the spatial and channel structure. Here, the output of a fully connected layer is a 1D vector of I ( m ) , where the size of the vector is determined by the network architecture parameters. Then, the result of the ith neuron in layer m is represented using Equation (8).
I i ( m ) = f R e L U b ( m , i ) + j I j ( m 1 ) W j ( m , i )
Within this particular context, the neural network’s i t h neuron within m t h layer is depicted as the parameters W ( m , i ) and b ( m , i ) . Meanwhile, the summation over j encompasses all dimensions of the input. The rectified linear unit (ReLU) is selected as the activation function and represented as f R e L U ( · ) . It is worth noting that f R e L U ( a ) is calculated as the maximum of 0 and a.
ReLU is the rectified linear unit activation function, which is popularly utilized across various fields, and its effectiveness is especially noticeable when performing the classification operation because of the sparsity it introduces in the outputs. This sparsity, in turn, assists in creating a distinct boundary between the classes at the learning process. The ultimate input that is received by the output layer is acquired from the concluding fully connected layer, and the design and configuration of the output layer are subject to change based on the specific task that is being executed. In this context, two distinct types of output functions are considered. In problems where classification into K classes is required, the softmax function is commonly used as the output function. This can be represented using Equations (9) and (10).
f i = exp ( I i ( o ) ) j exp ( I j ( o ) )
I i ( o ) = b ( o , i ) + k = 1 k W k ( o , i ) I k ( N )
In this scenario, the index N designates the final fully connected layer. The parameters of i t h output layer are represented by b ( o , i ) and W ( o , i ) . Taking the inputs into account, the output of class i, denoted as f i , has a range of [ 0 , 1 ] and is considered to be the probability of the corresponding class. Moreover, an alternative to the logistic output function is also taken into consideration, as shown by Equation (11).
f = a + ( b a ) 1 + exp b ( o ) + j W j ( o ) I j ( N ) 1
Within this particular context, the continuous output function is represented by f and restricted between the values ( a , b ) through the use of the output function of scaled logistic function. Both parameters b ( o ) and W ( o ) play a major role in restricting the output range of the function. This logistic output function is commonly known as a scaled logistic output function.
It is worth mentioning that, in the event of a multi-class classification problem based on ranking, such as when predicting the distortion levels, the use of this output function is anticipated to yield superior results.
The following Table 1 details the stride size that was applied to the input volume of the image, from left to right. Once it reaches the right end, the stride is applied again from the leftmost corner to the right of the specified stride pixel, moving down the image from the current position until it has been applied to the entire volume of the image. As shown in the table, the rectified linear unit (ReLU) activation function was utilized to fit the results and achieve higher accuracy. In this case, the input was classified into two classes, which constitute the output from the dense layer.

4. Training

The primary goal of this research is to fine-tune the parameters of a pre-existing network architecture to best match the provided data. This is accomplished by specifying an objective function and using gradient-based optimization to determine the network parameters that minimize the objective function. The collection of training examples D may be augmented and comprises input n and output y, represented by D = n i , y i i = 1 D , for each i. The aforementioned system accepts input n, which has the potential to be a segmented CT scan, and generates an output y. The output y can function either as a classifier of the input as benign or malignant or as an indicator of the malignancy level. The biases b and weights W applied to each layer of the network are symbolized by Θ . Then, the objective function will take the form defined using Equation (12).
E ( Θ ) = i = 1 D L ( y i , f ( n i , Θ ) ) + λ E prior ( Θ )
In this context, the function f ( n i , Θ ) is utilized to compute the output of the network using input n i and the set of parameters Θ . L ( y i , f ( n i , Θ ) ) is the loss function, which is applied to assess the deviation among the expected output of the network y and the predicted output y ^ . Moreover, the function E prior ( Θ ) , calculated as | W | 2 , signifies the previous decay of weight, which aims to avoid overfitting by imposing a cost on the weight norm. The magnitude of λ determines the intensity of the prior regularization.
This study employs different objective functions based on the selected output function. When using the softmax output function, the function of the standard cross-entropy loss of L ( y i , y ^ ) = k = 1 k y k log ( y ^ k ) is applied; here, y represents a binary indicator vector, and y ^ represents a probability vector for each of the K classes. However, this loss function is limited by its inability to differentiate between different classification errors. To address this, the function of squared error loss is preferred to the function of scaled logistic function, where L ( y i , y ^ ) = | ( y i , y ^ ) | 2 , and both the y value and y ^ value are the real values. To update the parameters at iteration t, equation Θ t + 1 = Θ t + Δ Θ t + 1 , is utilized, with Δ Θ t + 1 = ρ Δ Θ t ϵ E t ( Θ t ) .
The equation shown here outlines the process of updating the network’s parameters while training. The momentum parameter ρ is set to a constant value of 0.9, and the momentum vector Δ Θ t + 1 is calculated using the previous momentum vector Δ Θ t , the learning rate ϵ , and the gradient of the objective function E t ( Θ t ) is the computation of this metric, which is predicated based on the selection of training examples during the iteration t. In the first iteration, the biases are set to 0, and the filter and weight values are initialized by uniformly sampling from the interval of · 6 f a n i n + f a n o u t n , · 6 f a n i n + f a n o u t n ; here, the value of f a n i n corresponds to the number of nodes in the prior hidden layer, while the value of f a n o u t corresponds to the number of nodes in the present layer. The training process entails 2000 epochs, wherein we initialize ϵ t to 0.01, and decrease the learning rate by 10% after every 25 epochs to guarantee convergence.

5. Performance Analysis

The DFUC2021 dataset was utilized to study the pathology of diabetic foot ulcers, with a specific focus on infection and ischemia. The dataset comprises a total of 15,683 DFU images, out of which 5955 images were used for training, 5734 images were used for testing, and the remaining 3994 images were left unlabelled. The study’s primary objective was to assess the performance of an image-based risk prediction model of diabetic foot ulcers. The results of this performance analysis are showcased in this section, with Figure 5 and Figure 6 presenting the confusion matrices for diabetic range detection in the test data and train data, respectively. These matrices provide a clear overview of the model’s accuracy in predicting the diabetic range, highlighting the number of true positives, false positives, true negatives, and false negatives.
The use of the DFUC2021 dataset in this research allowed for a comprehensive assessment of the performance of the proposed model, which could aid in developing better diagnostic tools for patients with diabetes. The availability of a large dataset such as DFUC2021 can help researchers better understand the complexities of diabetic foot ulcers and ultimately lead to more effective treatment strategies for patients suffering from this condition.
The ROC curve and precision-recall curve were generated to distinguish between normal and abnormal foot ulcers through the feature extraction and classification process based on the testing data of foot ulcers, as shown in Figure 7 and Figure 8, respectively. Similarly, Figure 9 and Figure 10 depict the ROC curve and precision-recall curve for distinguishing between normal and abnormal foot ulcers through the feature extraction and classification process based on the training data of foot ulcers.
The determination of the precision-recall curve and the receiver operating characteristic (ROC) curve for classifying the normal and abnormal foot ulcer based on foot ulcer testing data is presented in Figure 7 and Figure 8.
The precision-recall curve and the receiver operating characteristic (ROC) curve used to classify diabetic foot ulcers using foot ulcer training data are determined in Figure 9 and Figure 10, respectively. Table 2 displays the input foot ulcer images used for training, and presents the pre-processing and segmentation steps, followed by the resulting classified output.
To detect anomalous foot ulcers, a pre-trained fast convolutional neural network (PFCNN) incorporating U++net architecture was employed. Consequently, the prediction of foot ulcer pathogenesis becomes simplified. The proposed system achieves a classification performance with precision and recall rates of 0.9354, as revealed by the results.
Table 3 presents a detailed comparison of the proposed classification and feature extraction techniques and the existing methods. When compared with the other models, the proposed method of PFCNN and DRNN attained an accuracy of 99.32.

6. Conclusions

This research introduces a novel methodology to automatically identify the pathogenesis of diabetic foot ulcers through image feature extraction and classification techniques. In this study, a dataset of foot ulcer images was employed to extract features using a DRNN-based feature extraction approach. This enabled the differentiation between normal and abnormal diabetes ranges. For classifying foot ulcer images within the normal and abnormal diabetes range, a PFCNN-based classification method was utilized. The classification accuracy served as an indicator to identify the risk of foot ulcer pathogenesis. The proposed architecture presents a valuable tool for detecting abnormal diabetes ranges and assessing the risk of foot ulcers. Notably, the performance of the proposed technique demonstrates an improved classification accuracy, surpassing previously recorded values. This improvement can be attributed to the effective isolation technique that was employed and the increased number of statistical features. The objective of this study is to develop an automated approach for diagnosing the risk of diabetic foot ulcers using foot ulcer classification and feature extraction techniques. The proposed method utilizes DRNN-based feature extraction and PFCNN-based classification to distinguish between normal and abnormal diabetes ranges and predict the severity of foot ulcers. The results showcase a significant enhancement in classification accuracy, reaching 99.32%, when compared to previous studies. This improvement can be attributed to the augmented number of statistical features and the utilization of an effective isolation technique.

Author Contributions

Conceptualization, V.S.P.; Methodology, V.S.P.; Software, V.S.P.; Validation, V.S.P.; Formal analysis, V.S.P.; Investigation, V.S.P.; Resources, V.S.P.; Data curation, V.S.P.; Writing—original draft, V.S.P.; Writing—review & editing, V.D.A.K.; Visualization, V.D.A.K.; Supervision, V.D.A.K.; Project administration, V.D.A.K.; Funding acquisition, V.S.P. and V.D.A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The foot ulcer images obtained from Kaggle were utilized for the purpose of this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Fraiwan, L.; AlKhodari, M.; Ninan, J.; Mustafa, B.; Saleh, A.; Ghazal, M. Diabetic foot ulcer mobile detection system using smart phone thermal camera: A feasibility study. Biomed. Eng. Online 2017, 16, 117. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Thotad, P.N.; Bharamagoudar, G.R.; Anami, B.S. Diabetic foot ulcer detection using deep learning approaches. Sens. Int. 2023, 4, 100210. [Google Scholar] [CrossRef]
  3. Ahsan, M.; Naz, S.; Ahmad, R.; Ehsan, H.; Sikandar, A. A deep learning approach for diabetic foot ulcer classification and recognition. Information 2023, 14, 36. [Google Scholar] [CrossRef]
  4. Purnima, S.; Angelin, P.S.; Priyanka, R.; Subasri, G.; Venkatesh, R. Automated Detection of Diabetic Foot Using Thermal Images by Neural Network Classifiers. Int. J. Emerg. Trends Sci. Technol. 2017, 4, 6. [Google Scholar] [CrossRef]
  5. Xu, Y.; Han, K.; Zhou, Y.; Wu, J.; Xie, X.; Xiang, W. Classification of diabetic foot ulcers using class knowledge banks. Front. Bioeng. Biotechnol. 2021, 9, 811028. [Google Scholar] [CrossRef] [PubMed]
  6. Haque, F.; Reaz, M.B.I.; Chowdhury, M.E.H.; Ezeddin, M.; Kiranyaz, S.; Alhatou, M.; Ali, S.H.M.; Bakar, A.A.A.; Srivastava, G. Machine learning-based diabetic neuropathy and previous foot ulceration patients detection using electromyography and ground reaction forces during gait. Sensors 2022, 22, 3507. [Google Scholar] [CrossRef]
  7. Aiello, E.M.; Toffanin, C.; Messori, M.; Cobelli, C.; Magni, L. Postprandial glucose regulation via KNN meal classification in type 1 diabetes. IEEE Control Syst. Lett. 2018, 3, 230–235. [Google Scholar] [CrossRef]
  8. Raza, A.; Ayub, H.; Khan, J.A.; Ahmad, I.; Salama, A.S.; Daradkeh, Y.I.; Javeed, D.; Ur Rehman, A.; Hamam, H. A hybrid deep learning-based approach for brain tumor classification. Electronics 2022, 11, 1146. [Google Scholar] [CrossRef]
  9. Carracedo, J.; Alique, M.; Ramírez-Carracedo, R.; Bodega, G.; Ramírez, R. Endothelial extracellular vesicles produced by senescent cells: Pathophysiological role in the cardiovascular disease associated with all types of diabetes mellitus. Curr. Vasc. Pharmacol. 2019, 17, 447–454. [Google Scholar] [CrossRef]
  10. Sadiq, M.T.; Yu, X.; Yuan, Z.; Zeming, F.; Rehman, A.U.; Ullah, I.; Li, G.; Xiao, G. Motor imagery EEG signals decoding by multivariate empirical wavelet transform-based framework for robust brain–computer interfaces. IEEE Access 2019, 7, 171431–171451. [Google Scholar] [CrossRef]
  11. Ahmad, I.; Ullah, I.; Khan, W.U.; Ur Rehman, A.; Adrees, M.S.; Saleem, M.Q.; Cheikhrouhou, O.; Hamam, H.; Shafiq, M. Efficient algorithms for E-healthcare to solve multiobject fuse detection problem. J. Healthc. Eng. 2021, 2021, 9500304. [Google Scholar] [CrossRef]
  12. Naqvi, R.A.; Arsalan, M.; Rehman, A.; Rehman, A.U.; Loh, W.K.; Paul, A. Deep learning-based drivers emotion classification system in time series data for remote applications. Remote Sens. 2020, 12, 587. [Google Scholar] [CrossRef] [Green Version]
  13. Sparapani, R.; Dabbouseh, N.M.; Gutterman, D.; Zhang, J.; Chen, H.; Bluemke, D.A.; Joao, A.C.L.; Gregory, L.B.; Soliman, E.Z. Detection of Left Ventricular Hypertrophy Using Bayesian Additive Regression Trees: The MESA (Multi-Ethnic Study of Atherosclerosis). J. Am. Heart Assoc. 2019, 8, e009959. [Google Scholar] [CrossRef] [PubMed]
  14. Sadiq, M.T.; Yu, X.; Yuan, Z.; Fan, Z.; Rehman, A.U.; Li, G.; Xiao, G. Motor imagery EEG signals classification based on mode amplitude and frequency components using empirical wavelet transform. IEEE Access 2019, 7, 127678–127692. [Google Scholar] [CrossRef]
  15. Jaiswal, V.; Negi, A.; Pal, T. A review on current advances in machine learning based diabetes prediction. Prim. Care Diabetes 2021, 15, 435–443. [Google Scholar] [CrossRef] [PubMed]
  16. Muthuraja, M.; Shanthi, N. A Survey on Classification of Diabetic Foot Ulcer Using Machine Learning Algorithms. Ann. Rom. Soc. Cell Biol. 2021, 25, 1881–1894. [Google Scholar]
  17. Cruz-Vega, I.; Gutiérrez-Rodríguez, A.K.; Sánchez-Pérez, H.J.; García-Rodríguez, M.L. Deep learning classification for diabetic foot thermograms. Sensors 2020, 20, 1762. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Muñoz, P.L.; Rodríguez, R.; Montalvo, N. Automatic Segmentation of Diabetic foot ulcer from Mask Region-Based Convolutional Neural Networks. J. Biomed. Res. Clin. Investig. 2020, 1, 1006. [Google Scholar]
  19. Maheswari, D.; Kayalvizhi, M. A Hybrid Deep Learning Algorithms for Diabetes Mellitus Prediction Using Thermal Foot Images. Eur. J. Mol. Clin. Med. 2021, 7, 5176–5183. [Google Scholar]
  20. Islam, M.M.; Ferdousi, R.; Rahman, S.; Bushra, H.Y. Likelihood prediction of diabetes at early stage using data mining techniques. In Computer Vision and Machine Intelligence in Medical Image Analysis; Springer: Berlin/Heidelberg, Germany, 2020; pp. 113–125. [Google Scholar]
  21. Chaki, J.; Ganesh, S.T.; Cidham, S.K.; Theertan, S.A. Machine learning and artificial intelligence based Diabetes Mellitus detection and self-management: A systematic review. J. King Saud-Univ.-Comput. Inf. Sci. 2020, 34, 3204–3225. [Google Scholar] [CrossRef]
  22. Abaker, A.A.; Saeed, F.A. A Comparative Analysis of Machine Learning Algorithms to Build a Predictive Model for Detecting Diabetes Complications. Informatica 2021, 45, 1–10. [Google Scholar] [CrossRef]
  23. Ramsingh, J.; Bhuvaneswari, V. An efficient Map Reduce-Based Hybrid NBC-TFIDF algorithm to mine the public sentiment on diabetes mellitus—A big data approach. J. King Saud-Univ.-Comput. Inf. Sci. 2018, 33, 1018–1029. [Google Scholar] [CrossRef]
  24. Natarajan, S.; Jain, A.; Krishnan, R.; Rogye, A.; Sivaprasad, S. Diagnostic accuracy of community-based diabetic retinopathy screening with an offline artificial intelligence system on a smartphone. JAMA Ophthalmol. 2019, 137, 1182–1188. [Google Scholar] [CrossRef] [PubMed]
  25. Younus, M.; Munna, M.T.A.; Alam, M.M.; Allayear, S.M.; Ara, S.J.F. Prediction model for prevalence of type-2 diabetes mellitus complications using machine learning approach. In Data Management and Analysis; Springer: Berlin/Heidelberg, Germany, 2020; pp. 103–116. [Google Scholar]
  26. Mainenti, G.; Campanile, L.; Marulli, F.; Ricciardi, C.; Valente, A.S. Machine Learning Approaches for Diabetes Classification: Perspectives to Artificial Intelligence Methods Updating. In Proceedings of the 5th International Conference on Internet of Things, Big Data and Security (IoTBDS 2020), Prague, Czech Republic, 7–9 May 2020. [Google Scholar]
  27. Mwawado, R.H. Development of a Fast and Accurate Method for the Segmentation of Diabetic Foot Ulcer Images. Ph.D. Thesis, NM-AIST, Arusha, Tanzania, 2020. [Google Scholar]
Figure 1. Flow Schematic Diagram for Foot Ulcer Classification.
Figure 1. Flow Schematic Diagram for Foot Ulcer Classification.
Diagnostics 13 01983 g001
Figure 2. DRNN architecture.
Figure 2. DRNN architecture.
Diagnostics 13 01983 g002
Figure 3. RNN architecture.
Figure 3. RNN architecture.
Diagnostics 13 01983 g003
Figure 4. Architecture of PFCNN.
Figure 4. Architecture of PFCNN.
Diagnostics 13 01983 g004
Figure 5. Resulting Confusion matrix of Testing.
Figure 5. Resulting Confusion matrix of Testing.
Diagnostics 13 01983 g005
Figure 6. Resulting Confusion matrix of Training.
Figure 6. Resulting Confusion matrix of Training.
Diagnostics 13 01983 g006
Figure 7. Precision-Recall Curve for Training.
Figure 7. Precision-Recall Curve for Training.
Diagnostics 13 01983 g007
Figure 8. Receiver Operating Characteristic Curve for Training.
Figure 8. Receiver Operating Characteristic Curve for Training.
Diagnostics 13 01983 g008
Figure 9. Precision-Recall Curve for Training.
Figure 9. Precision-Recall Curve for Training.
Diagnostics 13 01983 g009
Figure 10. Receiver Operating Characteristic Curve for Training.
Figure 10. Receiver Operating Characteristic Curve for Training.
Diagnostics 13 01983 g010
Table 1. Architecture of the CNN model.
Table 1. Architecture of the CNN model.
LayerParamsActivationOutput
Input 28 × 28 × 28
Conv 1 5 × 5 × 5 ReLu 28 × 28 × 28 × 7
Max Pool 1 × 1 × 1 , stride 2 × 2 × 4 14 × 14 × 7 × 7
Conv 2 5 × 5 × 3 ReLu 14 × 14 × 7 × 17
Max Pool 2 × 2 × 2 , stride 1 × 1 × 0 6 × 6 × 3 × 17
Dense 256
Dense 2
Table 2. Output of Classification.
Table 2. Output of Classification.
S.NOInput ImagePre-ProcessingSegmentationClassified Output
Abnormal UlcerDiagnostics 13 01983 i001Diagnostics 13 01983 i002Diagnostics 13 01983 i003Diagnostics 13 01983 i004
Diagnostics 13 01983 i005Diagnostics 13 01983 i006Diagnostics 13 01983 i007Diagnostics 13 01983 i008
Diagnostics 13 01983 i009Diagnostics 13 01983 i010Diagnostics 13 01983 i011Diagnostics 13 01983 i012
Diagnostics 13 01983 i013Diagnostics 13 01983 i014Diagnostics 13 01983 i015Diagnostics 13 01983 i016
Normal (Healthy skin)Diagnostics 13 01983 i017Diagnostics 13 01983 i018Diagnostics 13 01983 i019Diagnostics 13 01983 i020
Diagnostics 13 01983 i021Diagnostics 13 01983 i022Diagnostics 13 01983 i023Diagnostics 13 01983 i024
Diagnostics 13 01983 i025Diagnostics 13 01983 i026Diagnostics 13 01983 i027Diagnostics 13 01983 i028
Diagnostics 13 01983 i029Diagnostics 13 01983 i030Diagnostics 13 01983 i031Diagnostics 13 01983 i032
Table 3. Comparison of Classification and Feature Extraction with Proposed Method and Existing Methods.
Table 3. Comparison of Classification and Feature Extraction with Proposed Method and Existing Methods.
AuthorClassification TechniqueFeature Extraction MethodAccuracy
Xu Y et al. [5]Class Knowledge Bank(CKB)DNN-MLP79%
Haque et al. [6]K-Means ClusteringChi-square, mrmr,
Relief Algorithm, FSCNCA
96%
Ahsan et al. [3]End to end CNNDFUNet98.49%
Thotad et al. [2]EfficientNetDFUNet98.97%
Proposed workPre-trained Fast Convolution
neural network (PFCNN)
Deep recurrent
neural network (DRNN)
99.32%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sathya Preiya, V.; Kumar, V.D.A. Deep Learning-Based Classification and Feature Extraction for Predicting Pathogenesis of Foot Ulcers in Patients with Diabetes. Diagnostics 2023, 13, 1983. https://doi.org/10.3390/diagnostics13121983

AMA Style

Sathya Preiya V, Kumar VDA. Deep Learning-Based Classification and Feature Extraction for Predicting Pathogenesis of Foot Ulcers in Patients with Diabetes. Diagnostics. 2023; 13(12):1983. https://doi.org/10.3390/diagnostics13121983

Chicago/Turabian Style

Sathya Preiya, V., and V. D. Ambeth Kumar. 2023. "Deep Learning-Based Classification and Feature Extraction for Predicting Pathogenesis of Foot Ulcers in Patients with Diabetes" Diagnostics 13, no. 12: 1983. https://doi.org/10.3390/diagnostics13121983

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop