Next Article in Journal
Atranorin, a Secondary Metabolite of Lichens, Exhibited Anxiolytic/Antidepressant Activity in Wistar Rats
Next Article in Special Issue
Identifying Tumor-Associated Genes from Bilayer Networks of DNA Methylation Sites and RNAs
Previous Article in Journal
The Combination of Whole-Brain Features and Local-Lesion Features in DSC-PWI May Improve Ischemic Stroke Outcome Prediction
Previous Article in Special Issue
An Automated Method of Causal Inference of the Underlying Cause of Death of Citizens
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Automatic Detection of Tuberculosis Using VGG19 with Seagull-Algorithm

1
Department of Computer Science and Engineering, Division of Research and Innovation, Saveetha School of Engineering, SIMATS, Chennai 602105, India
2
Department of Applied Data Science, Noroff University College, 4612 Kristiansand, Norway
3
Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman 346, United Arab Emirates
4
Department of Electrical and Computer Engineering, Lebanese American University, Byblos 1401, Lebanon
5
Faculty of Engineering, Imperial College London, London SW7 2AZ, UK
6
College of Arts, Media, and Technology, Chiang Mai University, Chiang Mai 50200, Thailand
*
Author to whom correspondence should be addressed.
Life 2022, 12(11), 1848; https://doi.org/10.3390/life12111848
Submission received: 8 September 2022 / Revised: 28 October 2022 / Accepted: 9 November 2022 / Published: 11 November 2022

Abstract

:
Due to various reasons, the incidence rate of communicable diseases in humans is steadily rising, and timely detection and handling will reduce the disease distribution speed. Tuberculosis (TB) is a severe communicable illness caused by the bacterium Mycobacterium-Tuberculosis (M. tuberculosis), which predominantly affects the lungs and causes severe respiratory problems. Due to its significance, several clinical level detections of TB are suggested, including lung diagnosis with chest X-ray images. The proposed work aims to develop an automatic TB detection system to assist the pulmonologist in confirming the severity of the disease, decision-making, and treatment execution. The proposed system employs a pre-trained VGG19 with the following phases: (i) image pre-processing, (ii) mining of deep features, (iii) enhancing the X-ray images with chosen procedures and mining of the handcrafted features, (iv) feature optimization using Seagull-Algorithm and serial concatenation, and (v) binary classification and validation. The classification is executed with 10-fold cross-validation in this work, and the proposed work is investigated using MATLAB® software. The proposed research work was executed using the concatenated deep and handcrafted features, which provided a classification accuracy of 98.6190% with the SVM-Medium Gaussian (SVM-MG) classifier.

1. Introduction

In recent years, the incidence rate of Infectious Disease (ID) has been gradually increasing in humans, and timely detection and treatment are essential to cure and control the spread of the disease. The ID usually affects people with a weak immune system, and this disease is commonly transmitted from one individual to another. Thus, it is regarded as a communicable disease as well. Furthermore, the ID which infects the inner body organ is harsher in comparison to the ID which infects the eternal body organ [1,2,3,4].
In hospitals, a unique diagnostic and treatment procedure is employed to cure/control the disease in internal organs infected with ID. As part of human physiology, the lung exchanges air between the outer atmosphere and other body parts, and any infection in the lung will affect this exchange. There is evidence that the lungs are commonly affected by infections such as pneumonia, tuberculosis, and COVID-19 [5,6,7]. Several diagnostic procedures exist to detect pneumonia with the help of medical images or chosen methods. Pneumonia is an infectious disease commonly affecting people with lower immunity (children and older adults). COVID-19 is also a pathogen that causes severe pneumonia in humans, and many medication procedures have been recommended to reduce the severity of COVID-19 infection in recent years [8,9,10].
As a result of TB infection, people will experience mild to severe breathing problems, and if TB is not detected, it can result in death. A common cause of tuberculosis is Mycobacterium tuberculosis (M. tuberculosis), a bacterium responsible for infecting the lungs and other soft tissues. As TB infection in the lung is a common illness, the clinical level diagnosis is performed using biomedical images of the chosen modality. A computed tomography (CT) or chest radiograph can be used to examine the lung section. X-ray images have been used in medical clinics to examine tuberculosis more frequently than CT images. Since tuberculosis is a communicable disease, and can spread quickly from one person to another, timely detection and treatment are essential [5].
As stated by the World Health Organization (WHO), tuberculosis is a serious disease and a significant cause of death worldwide. TB is considered among the top 10 deadliest diseases because the mortality rate is higher than HIV/AIDS. According to the WHO statement, 1.4 million people worldwide died of tuberculosis in 2019, while ten million cases of tuberculosis have been estimated for 2020. It is estimated that approximately 90% of the community’s adults are infected with tuberculosis (TB), with men contracting the disease at a significantly higher rate than women. Further, TB infection rates are lower in high-income countries than in low-income countries. TB will result in financial suffering, weakness, marginalization, and bias among TB-affected citizens. TB that is detected in a timely fashion is entirely curable, and >85% of those who develop TB can recover with a six-month prescription regimen [11,12,13].
Various clinical tests, including a bio-image-based examination, are used to diagnose TB at the clinical level. Using the bio-images (CT/X-ray) analysis, an experienced pulmonologist can determine the severity of the lung infection caused by tuberculosis. In addition, CT/X-ray-supported tuberculosis detection helps the pulmonologist confirm the disease’s severity, decision-making, and treatment execution processes.
There are several automated disease detection schemes proposed and employed to evaluate CT/X-rays to reduce the diagnostic burden on pulmonologists. Furthermore, modern hospitals utilize standard procedures to assist physicians during the diagnosis and treatment of diseases, including segmentation, Machine-Learning-Scheme (MLS), and Deep-Learning-Scheme (DLS) [14,15,16,17].
This research aims to develop an automated TB detection system using optimally selected Machine-Learning-Features (MLF) and Deep-Learning-Features (DLF) to examine chest X-ray images with better diagnostic accuracy. The proposed scheme consists of the following phases: (i) image pre-processing, (ii) implementing a pre-trained DLS and mining of the DLF, (iii) extracting the necessary MLF using Local-Binary-Pattern (LBP) and Discrete-Wavelet-Transform (DWT), (iv) optimization of DLF and MLF using Seagull-Algorithm (SA), (v) serial concatenation of DLF and MLF, and (vi) binary classifier implementation and validation.
In this work, 7000 test images are collected for assessment from the database supplied by Rahman et al. [18,19]. Initially, all these images are resized to 224 × 224 × 1 pixels, and the converted images are then evaluated using the pre-trained DLS existing in the literature. For the AlexNet, the chest X-ray images are resized to 227 × 227 × 1 pixels. Initially, the conventional DLF obtained from every DLS is considered to classify the chest X-ray images using the SoftMax classifier, and the performance is verified. This investigation confirms that the VGG19 helped improve classification accuracy (92.9048%). Then, this DLF is optimized using the SA and combined with the SA-optimized MLF (DLF+MLF). These concatenated features are once again considered for the classification task, and the binary classification obtained with SVM-Medium Gaussian (SVM-MG) classifier offered an accuracy of ≈98.62%. This result is then compared and validated with the earlier result by Rahman et al. [18].
The chief involvement of this research comprises.
(i)
Implementation of pre-trained DLS-based TB detection from chest X-ray;
(ii)
Generation of MLF using LBP and DWT;
(iii)
SA-based feature optimization and serial feature concatenation to obtain DLF+MLF.
Section 2 presents earlier research, Section 3 demonstrates the methodology, and Section 4 and Section 5 demonstrate the investigational result and conclusion, respectively.

2. Earlier Research

Several diagnostic procedures have been proposed to reduce the diagnostic burden on hospitals to ensure early and accurate detection of tuberculosis. Several chest X-rays supported automated detection methods for tuberculosis are discussed in the literature. This research section examines DLS-based TB diagnostic procedures discussed in the literature. Table 1 summarizes the methods used to detect tuberculosis in earlier studies.
The recent research of Rahman et al. [18] confirms that the ideally employed DLS will help to support reliable TB detection from chest X-ray images. Furthermore, this work also contributed an image database with 7000 test images (3500 normal and 3500 TB class). In the proposed research, the dataset supplied by Rahman et al. [19] is considered for the experimental investigation, and the performance of the pre-trained VGG19 is improved with the help of SA-based feature optimization and serial feature concatenation among the optimized values of DLF and MLF. Further, this work also employed various recent binary classifiers discussed in the literature.

3. Methodology

This section presents essential information about the developed procedure to examine the chest X-ray and detect the normal/TB class images.
In this work, the necessary images are collected from the dataset of Rahman et al. [18] and each picture are resized to a dimension of 224 × 224 × 1 pixels. Figure 1 depicts the various stages employed in the proposed scheme. The test images are initially processed with VGG19 to obtain the necessary deep features of dimension 1 × 1 × 1024 . These features are then reduced using the SA and the optimized DLF are then considered for the examination. The test images are then considered to extract the necessary MLF based on the LBP and DWT and are then combined (LBP+DWT) to obtain a single 1D feature vector and these features are then reduced using the SA to obtain the optimized MLF. The optimal values of DLF and MLF are then serially concatenated to obtain the hybrid feature vector, which is then considered to train and validate the binary classifiers considered to categorize the X-ray pictures. In this work, a 10-fold cross validation is employed to obtain better classification accuracy.

3.1. Image Dataset

The worth of the automatic infection analysis is then confirmed using the clinical ranking or benchmark medicinal information. In this work, the chest X-ray pictures provided by Rahman et al. [19] are considered. This database consists of 7000 test images in which 3500 pictures fit in normal class and remaining 3500 is with TB traces. From these images, 70% images (2450 images) are considered as training class (in which 490 images are chosen for validation) and the remaining 30% (1050 images) are considered as testing class, as depicted in Table 2. Figure 2 presents the sample test images of the collected X-ray database.

3.2. Pre-Trained VGG19

DLS based clinical data evaluation is a frequently considered method and most of these approaches are used to implement automatic image categorization tasks [5,27,28,29]. In this work VGG19 is considered to examine the database and to support better TB diagnosis. Initially, the VGG19 is trained using X-ray pictures with the following tasks: (i) conventional augmentation (rotation and zoom) to boost the training images, (ii) fixing the learning rate as 1 × 10−5 to improve the training and validation accuracy, (iii) training with Linear-Dropout-Rate (LDR) and Adam optimization. During this task, other vital parameters are assigned as follows; total iteration = 2000, total epochs = 50, dropout rates in fully connected layer = 50%, and classification with SoftMax unit using a 10-fold cross validation.

3.3. Feature Extraction

This part of research outlines the extraction of DLF and MLF with chosen technique.

3.3.1. Deep-Learning-Features

The proposed scheme initially aims to extract the DLF from VGG19. The final MaxPool layer section provides a feature vector of dimension 1 × 1 × 4096 and it is then passed through 3 numbers of fully connected layers with a dropout rate of 50% to obtain a reduced the DLF value to 1 × 1 × 1024 . This DLF is further reduced using the SA based optimization and the reduced feature is then combined with the MLF to obtain the hybrid feature vector.

3.3.2. Machine-Learning-Features

In the literature, it can be noted that the combined DLF and MLF helps to obtain a better diagnostic accuracy on various medical images and in this work, the necessary MLF is obtained using the LBP and DWT methods. Initially, the resized images are separately treated with the LBP and DWT and from these images, the necessary features are extracted.
During the LBP feature extraction task, the necessary LBP patterns are produced by assigning its weights as W = 1 , 2 , 3 ,   and   4 , and from these images, essential features with a dimension of 1 × 1 × 59 are extracted from every image. The LBP technique adopted in this work is found in [30,31,32]. During the DWT feature extraction process, the test images are processed with the DWT technique discussed in [33,34,35,36,37]. This technique separates the considered test image into four equal sections, such as approximation, vertical, horizontal, and diagonal images and from every image, the necessary feature are then extracted. The extracted LBP and DWT features are then combined to obtain the original MLF, which is then reduced with the SA to obtain the optimized MLF. The necessary images of the LBP and DWT can be found in Figure 3 and Figure 4, respectively. Figure 3a–d presents LBP with various weights and Figure 4 present the results of DWT. Figure 4a,b depicts the outcome for normal and TB class images.

3.4. Seagull-Algorithm Based Feature Optimization

Recently, Heuristic-Algorithm (HA) based techniques are widely adopted in the literature to find the finest solution for a variety of real-world problems. The performance of any automatic data examination system depends mainly on the features considered to train and validate the classifier and to avoid the over-fitting problem, it is necessary to reduce the raw feature set. Feature selection is a recommended procedure, in which a chosen mathematical procedure (ex. Student’s t-test) or heuristic algorithm approach is widely adopted. The implementation of mathematical expression is widely discussed in the literature [38,39] and to automate the feature selection process, the HA based techniques are also widely considered by the researchers [40,41,42]. In the proposed work, the SA is considered to find the finest features from the raw DLF and MLF by finding the maximal value of the Cartesian Distance (CD) between the healthy and TB features.
The SA was invented in 2019 by Dhiman and Kumar [43] to find the finest solution for a chosen industrial optimization problem. This algorithm is developed by mimicking the hunting behaviors found in Seagull birds. Due to its eminence, the SA is widely adopted to solve a variety of optimization tasks [43,44,45].
The basic operation in SA includes (i) migration as a group to find fish swarm (exploration) and (ii) attacking the fish (exploitation). Usually, the seagull lives as a group and always search for the food source (fish swarm) in sea surface. When it finds the food source, every seagull in the group attacks the fish swarm with its own approach. During the search, every group is led by an experienced leader and other birds will follow the leader without collision. When the group finds the fish swarm, the leader will decide the best food source and other birds follow the best source for attacking. The pictorial representation of the SA is depicted in Figure 5.
In the proposed work, traditional SA is considered for the feature selection problem and the various stages of this scheme are discussed below.

3.4.1. Migration as a Group to Find Fish Swarm (Exploration)

This part of the SA helps to recognize the movement of Seagull (agent) from one location to other in search of the fish group by satisfying the conditions, such as collision prevention, association towards best neighbor and being close to the best neighbor till the food source is detected.
The collision prevention task is mathematically depicted in Equation (1):
C s = A × P s ( x )
where A is considered to compute the new position of agent, P s ( x ) is current position, and C s denotes the updated position.
The value of A is gradually varied to adjust the positions of agents and this work is mathematically denoted in Equation (2):
A = f c ( x × ( f c / I t e r m a x ) )
where f c = 2, I t e r m a x = 3000 and x = 0 , 1 , 2 , ... , I t e r m a x .
During the exploration, the movement of an agent towards the best agent can be defined in Equation (3):
M s = B × ( P b s ( x ) P s ( x ) )
where M s is a new position of search agents is, P b s ( x ) denotes fittest agent, and B is a randomization parameter and is computed in Equation (4).
B = 2 × A 2 ×
where is a random value of range [0, 1].
During the exploration, every agent will closely fly with the best agent with a close distance as depicted in Equation (5):
The distance   D s = | C s + M s |

3.4.2. Attacking the Fish (Exploitation)

During this phase, the agent (Seagull) takes a spiral path to dive into the sea to catch the fish and also takes the inverted path to reach the sky from the sea as depicted in Figure 5. The operation in x, y, and z-plane is defined in Equations (6)–(9):
x 1 = r × cos ( k )
y 1 = r × sin ( k )
z 1 = r × k
r = u × e k v
where r is radius of the path, k is a random parameter [ 0 k 2 π ], other parameters are constants, and these values are assigned in the earlier works [43,44,45].
The necessary parameters of the SA are assigned as follows: number of Seagull (agent) = N = 30, search dimension (D) = 2, I t e r m a x = 3000 and stopping criteria=maximization of Cartesian Distance (CD) between features or I t e r m a x . In this work, identification of maximized CD is chosen as the objective value and the SA helps to find the individual features of healthy/TB class whose value of CD is maximal. The SA based optimization and serial concatenation work is depicted in Figure 6. After finding the optimal features, the necessary hybrid features are obtained using the serial feature concatenation procedure.
The number of DLF and MLF considered in this work is presented in Equations (10)–(12) and in this work, the SA is then used to reduce these features to a lower value.
The number of DF and HF available for the optimization is depicted in Equations (10)–(12).
D L F VGG 19   ( 1 × 1 × 1024 ) = V G G 19 ( 1 , 1 ) , V G G 19 ( 1 , 2 ) , ... , V G G 19 ( 1 , 1024 )
L B P ( 1 × 1 × 236 ) = L B P 1 + L B P 2 + L B P 3 + L B P 4
D W T ( 1 × 1 × 180 ) = D W T 1 + D W T 2 + D W T 3 + D W T 4
Initially, the DLF is optimized with the SA and this process helps to reduce the feature set to a value of 1 × 1 × 539 . Similar procedure is then implemented on the MLF ( ( 1 × 1 × 236 ) + ( 1 × 1 × 180 ) ) and this procedure helped to obtain a feature vector with dimension 1 × 1 × 107 . These two features are then considered to obtain a hybrid feature as depicted in Equation (13):
( D L F + M L F ) o p t i m i z e d = ( 1 × 1 × 539 ) + ( 1 × 1 × 107 ) = 1 × 1 × 6461 × 1 × 107
The feature vector shown in Eqn.13 is then used to train and validate the classifier and, in this research, well known classifiers, such as SoftMax, Naïve-Bayes, Logistic Regression, Decision-Tree (DT) variants, K Nearest Neighbors (KNN) variants, and SVM variants are considered to validate the performance of the proposed system [40,41,42].

3.5. Performance Validation

The merit of automatic disease discovery arrangement is to be confirmed by computing the essential performance values. In this work, the measures obtained from the confusion matrix are considered to substantiate the importance of the proposed system. These measures, includes True-Positive (TP), False-Negative (FN), True-Negative (TN), False-Positive (FP), Accuracy (ACC), Precision (PRE), Sensitivity (SEN), Specificity (SPE), and Negative-Predictive-Value (NPV). The mathematical expressions of these values are presented in Equations (14)–(18) [27,28].
A C C = T P + T N T P + T N + F P + F N
P R E = T P T P + F P
S E N = T P T P + F N
S P E = T N T N + F P
N P V = T N T N + F N

4. Results and Discussion

This section of research demonstrates the experimental result, and this work is performed by a workstation, Intel i7 2.9GHz processor with 20 GB RAM and 4GB VRAM equipped with Matlab®.
The resized X-ray images are initially tested using the VGG19 scheme. Initially, the pre-trained DLS is trained using the test images depicted in Table 2 and after getting the necessary accuracy, its performance is then validated with a 10-fold cross validation procedure.
Figure 7 depicts the outcome during the deep-feature extraction procedure with VGG19. Figure 7a presents the heat-map obtained for both the normal and TB class trial images, Figure 7b–f shows the sample outcome extracted from each convolutional layer. This procedure helps to obtain a DLF of dimension 1 × 1 × 1024 . This feature vector is then considered to verify the TB detection performance of VGG19 with the SoftMax based binary classification and the attained results are shown in Table 3. This table confirms that Trial 8 helped to obtain the better classification accuracy compared to other trials. A similar procedure is then repeated with other pre-trained schemes and the attained results are presented in Table 4. The results existing in Table 4 confirm that the classification accuracy achieved with VGG19 is better compared to other methods considered for the assessment.
Figure 8 graphically shows the accuracy achieved during the 10-fold cross validation and Figure 9 presents the Spider-Plot of Table 4 to verify the overall performance of the chosen schemes. The Spider-Plot is one of the graphical procedures normally considered to find the overall performance during the classification task. The plot which covers maximal area is considered superior compared to other plots. In Figure 9, it is noted that the plot area of VGG19 is more compared to other approaches, which confirms its superiority over other DLS. The obtained Confusion-Matrix (CM) for VGG19 with SoftMax classifier is depicted in Figure 10a,b presents the result by SVM- Medium Gaussian (SVM-MG) classifier.
The performance of the proposed scheme is initially verified using the optimized value of DLF and MLF, and the results are depicted in Table 5. This result confirms that the DLF-supported classification presented better accuracy (93.6667%) when considering KNN-Coarse. Similarly, the MLF-based classification achieved 82.1905% accuracy with SVM- Fine Gaussian. The confusion matrix achieved during this work is presented in Figure 10. Figure 10b,c depict the confusion matrix achieved with optimal DLF and MLF.
After confirming the performance of VGG19 with traditional features ( 1 × 1 × 1024 ), the classification of X-ray is once again repeated with the SA optimized hybrid features presented in Equation (13) (DLF+MLF). The classification achieved with the SoftMax helps to provide an accuracy of 95.09%. Further, the classification performance of VGG19 with other binary classifiers is verified using 10-fold cross validation and the attained results are presented in Table 6. This table confirms that the result obtained with SVM-Medium Gaussian (SVM-MG) classifier is better compared to other approaches and the CM attained with this classifier is depicted in Figure 10d.
To verify the overall performance of the results shown in Table 6, a Glyph-Plot is then constructed, and the result is demonstrated in Figure 11. This plot also confirms that the SVM-MG provides better outcome compared to other classifies adopted in this study. This result confirms that the VGG19 with concatenated feature helps to obtain a better overall result when SVM-MG classifier is employed.
The merit of the proposed scheme is then confirmed with a comparative analysis with the results provided by Rahman et al. [18]. The plot depicted in Figure 12 verifies that the accuracy by the proposed system with SVM-MG classifier is very close to DenseNet201 result of [18].
This research work automatically classified X-ray images using SA-optimized hybrid image features (DLF+MLF) with a VGG19 scheme. The classification task is executed using SoftMax and other binary classifiers existing in the literature to achieve better classification accuracy. The experimental result of this work confirms that the proposed scheme works well on chest X-ray images. In the future, this scheme can be considered to evaluate the clinically collected chest X-ray images. Further, the merit of this system can be tested and confirmed using other benchmark chest X-ray pictures of different lung irregularities. The main limitation of the proposed scheme is the integration of deep and handcrafted features, and this procedure can be improved by considering the ensemble of features technique in future.

5. Conclusions

The literature confirms that TB is a harsh illness in human communities, which extensively infects the lungs. It is a communicable disease and, hence, premature identification, and handling is necessary to reduce the harshness. Further, appropriate detection and suggested medication will help to heal the TB completely. Due to its importance, a substantial amount of automatic TB detection is performed by the research. This research aims to propose a DLS based automatic TB detection using the concatenated DLF and MLF. This work considered VGG19 to extract the DLF and the necessary MLF are obtained using LBP and DWT approaches. After extracting these features, SA based feature optimization is employed and the optimized features are then combined to obtain a hybrid feature vector (DLF+MLF). This feature vector is then used to test and validate the performance of the binary classifiers using a 10-fold cross validation and the investigational result of this study confirmed that the binary classification with SVM-MG classifier helped to achieve an accuracy of >98% for the considered chest X-ray images. This outcome is then evaluated and against the result of other DLS found in the literature. This research confirmed the merit of the proposed DLF+MLF based TB detection from the chest X-ray images. In future, the performance of the proposed scheme can be tested and validated with other chest X-ray image dataset available in the literature.

Author Contributions

R.M., S.K. and O.T.: conceptualization, investigation, methodology, resources, software, and writing—original draft. A.M., V.R. and O.T.: investigation, validation, and writing—review and editing: V.R. and R.M.: conceptualization, software, writing—review and editing; V.R., S.K. and O.T.: writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Acknowledgments

This research work was partially supported by Chiang Mai University, Lebanese American University and Imperial College London.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Rajinikanth, V.; Kadry, S.; Nam, Y. Convolutional neural network assisted segmentation and SVM classification of brain tumor in clinical MRI slices. Inf. Technol. Control 2021, 50, 342–356. [Google Scholar] [CrossRef]
  2. Ambikapathy, B.; Kirshnamurthy, K.; Venkatesan, R. Assessment of electromyograms using genetic algorithm and artificial neural networks. Evol. Intell. 2021, 14, 261–271. [Google Scholar] [CrossRef]
  3. Raghavendra, U.; Gudigar, A.; Rao, T.N.; Rajinikanth, V.; Ciaccio, E.J.; Yeong, C.H.; Satapathy, S.C.; Molinari, F.; Acharya, U.R. Feature-versus deep learning-based approaches for the automated detection of brain tumor with magnetic resonance images: A comparative study. Int. J. Imaging Syst. Technol. 2021, 32, 501–516. [Google Scholar] [CrossRef]
  4. Priya, S.J.; Rani, A.J.; Subathra, M.S.P.; Mohammed, M.A.; Damaševičius, R.; Ubendran, N. Local pattern transformation-based feature extraction for recognition of Parkinson’s disease based on gait signals. Diagnostics 2021, 11, 1395. [Google Scholar] [CrossRef] [PubMed]
  5. Rajakumar, M.P.; Sonia, R.; Maheswari, B.U.; Karuppiah, S.P. Tuberculosis detection in chest X-ray using Mayfly-algorithm optimized dual-deep-learning features. J. X-ray Sci. Technol. 2021, 11, 1395. [Google Scholar] [CrossRef]
  6. Kumar, V.; Singh, D.; Kaur, M.; Damaševičius, R. Overview of current state of research on the application of artificial intelligence techniques for COVID-19. PeerJ Comput. Sci. 2021, 7, e564. [Google Scholar] [CrossRef]
  7. Dey, N.; Zhang, Y.D.; Rajinikanth, V.; Pugalenthi, R.; Raja, N.S.M. Customized VGG19 architecture for pneumonia detection in chest X-rays. Pattern Recognit. Lett. 2021, 143, 67–74. [Google Scholar] [CrossRef]
  8. Akram, T.; Attique, M.; Gul, S.; Shahzad, A.; Altaf, M.; Naqvi, S.S.R.; Damaševičius, R.; Maskeliūnas, R. A novel framework for rapid diagnosis of COVID-19 on computed tomography scans. Pattern Anal. Appl. 2021, 24, 951–964. [Google Scholar] [CrossRef]
  9. Singh, D.; Kumar, V.; Kaur, M. Densely connected convolutional networks-based COVID-19 screening model. Appl. Intell. 2021, 51, 3044–3051. [Google Scholar] [CrossRef]
  10. Becker, A.S.; Blüthgen, C.; Van, V.D.P.; Sekaggya-Wiltshire, C.; Castelnuovo, B.; Kambugu, A.; Fehr, J.; Frauenfelder, T. Detection of tuberculosis patterns in digital photographs of chest X-ray images using Deep Learning: Feasibility study. Int. J. Tuberc. Lung Dis. 2018, 22, 328–335. [Google Scholar] [CrossRef]
  11. WHO. Tuberculosis. Available online: https://www.who.int/health-topics/tuberculosis#tab=tab_1 (accessed on 7 September 2022).
  12. WHO. Global Tuberculosis Report; WHO: Geneva, Switzerland, 2020. [Google Scholar]
  13. Harding, E. WHO global progress report on tuberculosis elimination. Lancet Respir. Med. 2020, 8, 19. [Google Scholar] [CrossRef]
  14. Al-Timemy, A.H.; Khushaba, R.N.; Mosa, Z.M.; Escudero, J. An efficient mixture of deep and machine learning models for COVID-19 and tuberculosis detection using X-ray images in resource limited settings. In Artificial Intelligence for COVID-19; Springer: Cham, Switzerland, 2021; pp. 77–100. [Google Scholar]
  15. Azmi, U.Z.M.; Yusof, N.A.; Abdullah, J.; Ahmad, S.A.A.; Faudzi, F.N.M.; Raston, N.H.A.; Suraiya, S.; Ong, P.S.; Krishnan, D.; Sahar, N.K. Portable electrochemical immunosensor for detection of Mycobacterium tuberculosis secreted protein CFP10-ESAT6 in clinical sputum samples. Mikrochim. Acta 2021, 188, 20. [Google Scholar] [CrossRef] [PubMed]
  16. Rajinikanth, V.; Kadry, S. Development of a Framework for Preserving the Disease-Evidence-Information to Support Efficient Disease Diagnosis. Int. J. Data Warehous. Min. 2021, 17, 63–84. [Google Scholar] [CrossRef]
  17. Diakogiannis, F.I.; Waldner, P.; Caccetta, C.; Wu, C. ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data. ISPRS J. Photogramm. Remote Sens. 2020, 162, 94–114. [Google Scholar] [CrossRef] [Green Version]
  18. Rahman, T.; Khandakar, A.; Kadir, M.A.; Islam, K.R.; Islam, K.F. Reliable tuberculosis detection using chest X-ray with deep learning, segmentation and visualization. IEEE Access 2020, 8, 191586–191601. [Google Scholar] [CrossRef]
  19. Rahman, T.; Khandakar, A.; Chowdhury, M.E.H. Tuberculosis (TB) Chest X-ray Database. Available online: https://ieee-dataport.org/documents/tuberculosis-tb-chest-x-ray-database (accessed on 7 September 2022).
  20. Rajaraman, S.; Antani, S.K. Modality-Specific Deep Learning Model Ensembles Toward Improving TB Detection in Chest Radiographs. IEEE Access 2020, 8, 27318–27326. [Google Scholar] [CrossRef]
  21. Hijazi, M.H.A.; Hwa, S.K.T.; Bade, A.; Yaakob, R.; Jeffree, M.S. Ensemble deep learning for tuberculosis detection using chest X-ray and canny edge detected images. IAES Int. J. Artif. Intell. 2019, 8, 429. [Google Scholar] [CrossRef]
  22. Wong, A.; Lee, J.R.H.; Rahmat-Khah, H.; Sabri, A.; Alaref, A.; Liu, H. TB-Net: A Tailored, Self-Attention Deep Convolutional Neural Network Design for Detection of Tuberculosis Cases From Chest X-ray Images. arXiv 2021, arXiv:2104.03165. [Google Scholar] [CrossRef]
  23. Hooda, R.; Sofat, S.; Kaur, S.; Mittal, A.; Meriaudeau, F. Deep-Learning: A Potential Method for Tuberculosis Detection Using Chest Radiography. In Proceedings of the 2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), Kuching, Malaysia, 12–14 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 497–502. [Google Scholar]
  24. Rohilla, A.; Hooda, R.; Mittal, A. TB detection in chest radiograph using deep learning architecture. In Proceedings of the 5th International Conference on Emerging Trends in Engineering, Technology, Science and Management (ICETETSM-17), Bengaluru, India, 20 August 2017; 2017; pp. 136–147. [Google Scholar]
  25. Nguyen, Q.H.; Nguyen, B.P.; Dao, S.D.; Unnikrishnan, B.; Dhingra, R.; Ravichandran, S.R.; Satpathy, S.; Raja, P.N.; Chua, M.C.H. Deep Learning Models for Tuberculosis Detection from Chest X-ray Images. In Proceedings of the 2019 26th International Conference on Telecommunications (ICT), Hanoi, Vietnam, 8–10 April 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 381–385. [Google Scholar]
  26. Afzali, A.; Mofradand, F.B.; Pouladian, M. Feature Selection for Contour-Based Tuberculosis Detection from Chest X-ray Images. In Proceedings of the 2019 26th National and 4th International Iranian Conference on Biomedical Engineering (ICBME), Tehran, Iran, 27–28 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 194–198. [Google Scholar]
  27. Rajinikanth, V.; Raj, A.N.J.; Thanaraj, K.P.; Naik, G.R. A Customized VGG19 Network with Concatenation of Deep and Handcrafted Features for Brain Tumor Detection. Appl. Sci. 2020, 10, 3429. [Google Scholar] [CrossRef]
  28. Kadry, S.; Srivastava, G.; Rajinikanth, V.; Rho, S.; Kim, Y. Tuberculosis detection in chest radiographs using spotted hyena algorithm optimized deep and handcrafted features. Comput. Intell. Neurosci. 2022, 2022, 9263379. [Google Scholar] [CrossRef]
  29. Saeed, F.; Khan, M.A.; Sharif, M.; Mittal, M.; Goyal, L.M.; Roy, S. Deep neural network features fusion and selection based on PLS regression with an application for crops diseases classification. Appl. Soft Comput. 2021, 103, 107164. [Google Scholar] [CrossRef]
  30. Gudigar, A.; Raghavendra, U.; Devasia, T.; Nayak, K.; Danish, S.M.; Kamath, G.; Samanth, J.; Pai, U.M.; Nayak, V.; Tan, R.S.; et al. Global weighted LBP based entropy features for the assessment of pulmonary hypertension. Pattern Recognit. Lett. 2019, 125, 35–41. [Google Scholar] [CrossRef]
  31. Song, T.; Xin, L.; Gao, C.; Zhang, T.; Huang, Y. Quaternionic extended local binary pattern with adaptive structural pyramid pooling for color image representation. Pattern Recognit. 2021, 115, 107891. [Google Scholar] [CrossRef]
  32. Tuncer, T.; Dogan, S.; Acharya, U.R. Automated EEG signal classification using chaotic local binary pattern. Expert Syst. Appl. 2021, 182, 115175. [Google Scholar] [CrossRef]
  33. Karadal, C.H.; Kaya, M.C.; Tuncer, T.; Dogan, S.; Acharya, U.R. Automated classification of remote sensing images using multileveled MobileNetV2 and DWT techniques. Expert Syst. Appl. 2021, 185, 115659. [Google Scholar] [CrossRef]
  34. Haweel, R.; Shalaby, A.; Mahmoud, A.; Seada, N.; Ghoniemy, S.; Ghazal, M.; Casanova, M.F.; Barnes, G.N.; El-Baz, A. A robust DWT–CNN-based CAD system for early diagnosis of autism using task-based fMRI. Med. Phys. 2021, 48, 2315–2326. [Google Scholar] [CrossRef]
  35. He, Z.; Lu, W.; Sun, W.; Huang, J. Digital image splicing detection based on Markov features in DCT and DWT domain. Pattern Recognit. 2012, 45, 4292–4299. [Google Scholar] [CrossRef]
  36. Cheong, K.H.; Tang, K.J.W.; Zhao, X.; Koh, J.E.W.; Faust, O.; Gururajan, R.; Ciaccio, E.J.; Rajinikanth, V.; Acharya, U.R. An automated skin melanoma detection system with melanoma-index based on entropy features. Biocybern. Biomed. Eng. 2021, 41, 997–1012. [Google Scholar] [CrossRef]
  37. Ramasamy, L.K.; Padinjappurathu, S.G.; Kadry, S.; Damaševičius, R. Detection of diabetic retinopathy using a fusion of textural and ridgelet features of retinal images and sequential minimal optimization classifier. PeerJ Comput. Sci. 2021, 7, e456. [Google Scholar] [CrossRef]
  38. Stewart, J.; Sprivulis, P.; Dwivedi, G. Artificial intelligence and machine learning in emergency medicine. Emerg. Med. Australas. 2018, 30, 870–874. [Google Scholar] [CrossRef]
  39. Acharya, U.R.; Fernandes, S.L.; WeiKoh, J.E.; Ciaccio, E.J.; Fabell, M.K.M.; Tanik, U.J.; Rajinikanth, V.; Yeong, C.H. Automated Detection of Alzheimer’s Disease Using Brain MRI Images– A Study with Various Feature Extraction Techniques. J. Med. Syst. 2019, 43, 1–14. [Google Scholar] [CrossRef] [PubMed]
  40. Rajinikanth, V.; Kadry, S.; Taniar, D.; Damasevicius, R.; Rauf, H.T. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features. In Proceedings of the 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII), Chennai, India, 25–27 March 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar]
  41. Khan, M.A.; Kadry, S.; Alhaisoni, M.; Nam, Y.; Zhang, Y.-D.; Rajinikanth, V.; Sarfaraz, M.S. Computer-Aided Gastrointestinal Diseases Analysis From Wireless Capsule Endoscopy: A Framework of Best Features Selection. IEEE Access 2020, 8, 132850–132859. [Google Scholar] [CrossRef]
  42. Ahmed, M.; Ramzan, M.; Khan, H.U.; Iqbal, S.; Khan, M.A.; Choi, J.-I.; Nam, Y.; Kadry, S. Real-Time Violent Action Recognition Using Key Frames Extraction and Deep Learning. Comput. Mater. Contin. 2021, 69, 2217–2230. [Google Scholar] [CrossRef]
  43. Dhiman, G.; Kumar, V. Seagull optimization algorithm: Theory and its applications for large-scale industrial engineering problems. Knowl.-Based Syst. 2018, 165, 169–196. [Google Scholar] [CrossRef]
  44. Dhiman, G.; Singh, K.K.; Soni, M.; Nagar, A.; Dehghani, M.; Slowik, A.; Kaur, A.; Sharma, A.; Houssein, E.H.; Cengiz, K. MOSOA: A new multi-objective seagull optimization algorithm. Expert Syst. Appl. 2021, 167, 114150. [Google Scholar] [CrossRef]
  45. Panagant, N.; Pholdee, N.; Bureerat, S.; Kaen, K.; Yıldız, A.R.; Sait, S.M. Seagull optimization algorithm for solving real-world design optimization problems. Mater. Test. 2020, 62, 640–644. [Google Scholar] [CrossRef]
Figure 1. Proposed TB detection scheme from X-ray using concatenated features.
Figure 1. Proposed TB detection scheme from X-ray using concatenated features.
Life 12 01848 g001
Figure 2. Sample X-ray pictures of healthy/TB group.
Figure 2. Sample X-ray pictures of healthy/TB group.
Life 12 01848 g002
Figure 3. Sample test images of normal/TB class treated with LBP. (a) W = 1; (b) W = 2; (c) W = 3; (d) W = 4.
Figure 3. Sample test images of normal/TB class treated with LBP. (a) W = 1; (b) W = 2; (c) W = 3; (d) W = 4.
Life 12 01848 g003
Figure 4. Sample test images of normal/TB class treated with DWT technique. (a) Normal-DWT patterns; (b) TB-DWT patterns.
Figure 4. Sample test images of normal/TB class treated with DWT technique. (a) Normal-DWT patterns; (b) TB-DWT patterns.
Life 12 01848 g004
Figure 5. Traditional Seagull algorithm’s operation.
Figure 5. Traditional Seagull algorithm’s operation.
Life 12 01848 g005
Figure 6. The SA based feature optimization process for DLF and MLF.
Figure 6. The SA based feature optimization process for DLF and MLF.
Life 12 01848 g006
Figure 7. Results extracted for a sample test image with VGG19 architecture. (a) Heat-map; (b) Convolution-outcome 1; (c) Convolution-outcome 2; (d) Convolution-outcome 3; (e) Convolution-outcome 4; (f) Convolution-outcome 5.
Figure 7. Results extracted for a sample test image with VGG19 architecture. (a) Heat-map; (b) Convolution-outcome 1; (c) Convolution-outcome 2; (d) Convolution-outcome 3; (e) Convolution-outcome 4; (f) Convolution-outcome 5.
Life 12 01848 g007
Figure 8. Performance evaluation of 10-fold validation and considered deep-learning systems.
Figure 8. Performance evaluation of 10-fold validation and considered deep-learning systems.
Life 12 01848 g008
Figure 9. Spider-plot to demonstrate the overall performance of chosen DNN schemes.
Figure 9. Spider-plot to demonstrate the overall performance of chosen DNN schemes.
Life 12 01848 g009
Figure 10. Confusion matrix achieved for VGG19 with DLF, optimal DLF, MLF, and optimized DLF+MLF.
Figure 10. Confusion matrix achieved for VGG19 with DLF, optimal DLF, MLF, and optimized DLF+MLF.
Life 12 01848 g010
Figure 11. Glyph-plot constructed to demonstrate the performance for DLF+MLF.
Figure 11. Glyph-plot constructed to demonstrate the performance for DLF+MLF.
Life 12 01848 g011
Figure 12. Performance comparison of proposed scheme with earlier works in the literature [18].
Figure 12. Performance comparison of proposed scheme with earlier works in the literature [18].
Life 12 01848 g012
Table 1. Summary of automated TB diagnosis using chest X-ray images.
Table 1. Summary of automated TB diagnosis using chest X-ray images.
ReferenceDeveloped Procedure
Rajaraman and Antani [20]Implementation of modified DLS is presented in this research to inspect the Shenzhen CXR images and this scheme offered a classification accuracy of 83.7% on the considered images.
Hwa et al. [21]Assessment of TB using the chest X-ray with ensemble DLS and Canny supported edge discovery is discussed and this investigation offered a categorization accuracy of 89.77%.
Wong et al. [22] This research implemented a novel DLS called TB-Net to examine the TV from X-ray pictures and this work helped to obtain improved performance (accuracy = 99.86%) measure on the chosen image database.
Hooda et al. [23]Implementation of a customized DLS with 7 convolutional (Conv) and 3 fully connected (FC) layer is discussed to recognise the TB from the chosen test pictures and this DLS offered an accuracy of 94.73%.
Rohilla et al. [24]This work implemented pre-trained DLS (AlexNet and VGG16) to automatically detect the chest radiographs and achieved better accuracy (>81%).
Nguyen et al. [25]Examination of X-ray pictures using the existing DLS is discussed, and its performance is compared using chest X-ray images.
Afzali et al. [26]This scheme employed the contour-based shape descriptor practice to distinguish the TB with a better accuracy (>92%).
Rahman et al. [18]This work employed Conv Neural Network (CNN) supported joint segmentation and classification to classify 7000 number of test images into healthy/TB class. With the proposed technique, this scheme achieved a best classification accuracy of 96.47% (without segmentation) and 98.6% (with segmentation), respectively.
Table 2. Chest X-ray images considered in this research.
Table 2. Chest X-ray images considered in this research.
ClassDimensionImages
TotalTrainingTesting
Normal 224 × 224 × 1 350024501050
TB 224 × 224 × 1 350024501050
Table 3. Performance values achieved with 10-fold cross validation for VGG19 with SoftMax Classifier.
Table 3. Performance values achieved with 10-fold cross validation for VGG19 with SoftMax Classifier.
TrialsTPFNTNFPACC%PRE%SEN%SPE%NPV%
Fold189215891313785.952486.686184.952486.952485.2474
Fold290714393211887.571488.487886.381088.761986.6977
Fold390414693911187.761989.064086.095289.428686.5438
Fold49589299410690.571490.037691.238189.904891.1197
Fold59648694810291.047690.431591.809590.285791.6828
Fold6973779658592.285791.966092.666791.904892.6104
Fold7979719727892.904892.620693.238192.571493.1927
Fold8961899668491.761991.961791.523892.000091.5640
Fold9954969609091.142991.379390.857191.428690.9091
Fold109421089519990.142990.489989.714390.571489.8017
Table 4. TB detection performance of the chosen DLS with SoftMax unit.
Table 4. TB detection performance of the chosen DLS with SoftMax unit.
SchemeTPFNTNFPACC%PRE%SEN%SPE%NPV%
VGG19979719727892.904892.620693.238192.571493.1927
VGG16966849668492.000092.000092.000092.000092.0000
AlexNet962889559591.285791.012391.619090.952491.5628
ResNet101956949717991.761992.367191.047692.476291.1737
ResNet509589294710390.714390.292291.238190.190591.1453
ResNet189609094810290.857190.395591.428690.285791.3295
Inception-v3951999579390.857191.092090.571491.142990.6250
Table 5. Performance of proposed framework with DLF and HCF.
Table 5. Performance of proposed framework with DLF and HCF.
FeatureClassifierTPFNTNFPACC%PRE%SEN%SPE%NPV%
DLFSoftMax973779787292.904893.110092.666793.142992.7014
Gaussian Naive Bayes975759807093.095293.301492.857193.333392.8910
Logistic Regression976749767492.952492.952492.952492.952492.9524
DT-Coarse982689797193.381093.257493.523893.238193.5053
DT-Medium981699836793.523893.606993.428693.619093.4411
DT-Fine978729807093.238193.320693.142993.333393.1559
KNN-Coarse981699866493.666793.875693.428693.904893.4597
KNN-Medium974769797193.000093.205792.761993.238192.7962
KNN-Fine977739826893.285793.492893.047693.523893.0806
SVM-Linear982689777393.285793.080693.523893.047693.4928
SVM-Coarse Gaussian980709836793.476293.600893.333393.619093.3523
SVM-Medium Gaussian979719777393.142993.060893.238193.047693.2252
SVM-Fine Gaussian982689816993.476293.434893.523893.428693.5176
MLFSoftMax84120986418681.190581.889080.095282.285780.5219
Gaussian Naive Bayes85319784820281.000080.853181.238180.761981.1483
Logistic Regression85619485119981.285781.137481.523881.047681.4354
DT-Coarse84920186218881.476281.870880.857182.095281.0913
DT-Medium85319785919181.523881.705081.238181.809581.3447
DT-Fine86318786118982.095282.034282.190582.000082.1565
KNN-Coarse86518585219881.761981.373582.381081.142982.1601
KNN-Medium86218886118982.047682.017182.095282.000082.0782
KNN-Fine85719386418681.952482.166881.619082.285781.7408
SVM-Linear85319786318781.714382.019281.238182.190581.4151
SVM-Coarse Gaussian85819286218881.904882.026881.714382.095281.7837
SVM-Medium Gaussian86118986418682.142982.235082.000082.285782.0513
SVM-Fine Gaussian86418686218882.190582.129382.285782.095282.2519
Table 6. Experimental outcome achieved with various binary classifiers.
Table 6. Experimental outcome achieved with various binary classifiers.
ClassifierTPFNTNFPACC%PRE%SEN%SPE%NPV%
SoftMax9935710044695.095295.572794.571495.619094.6277
Gaussian Naive Bayes10262410153597.190596.701297.714396.666797.6901
Logistic Regression10311910222897.761997.356098.190597.333398.1748
DT-Coarse995559935794.666794.581794.761994.571494.7519
DT-Medium10183210232797.190597.416396.952497.428696.9668
DT-Fine10074310024895.666795.450295.904895.428695.8852
KNN-Coarse10183210074396.428695.947296.952495.904896.9201
KNN-Medium10024810014995.381095.337895.428695.333395.4242
KNN-Fine10371310292198.381098.015198.761998.000098.7524
SVM-Linear1007439925895.190594.554095.904894.476295.8454
SVM-Coarse Gaussian10133710094196.285796.110196.476296.095296.4627
SVM-Medium Gaussian1029211042898.619099.228598.000099.238198.0245
SVM-Fine Gaussian10311910361498.428698.660398.190598.666798.1991
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mohan, R.; Kadry, S.; Rajinikanth, V.; Majumdar, A.; Thinnukool, O. Automatic Detection of Tuberculosis Using VGG19 with Seagull-Algorithm. Life 2022, 12, 1848. https://doi.org/10.3390/life12111848

AMA Style

Mohan R, Kadry S, Rajinikanth V, Majumdar A, Thinnukool O. Automatic Detection of Tuberculosis Using VGG19 with Seagull-Algorithm. Life. 2022; 12(11):1848. https://doi.org/10.3390/life12111848

Chicago/Turabian Style

Mohan, Ramya, Seifedine Kadry, Venkatesan Rajinikanth, Arnab Majumdar, and Orawit Thinnukool. 2022. "Automatic Detection of Tuberculosis Using VGG19 with Seagull-Algorithm" Life 12, no. 11: 1848. https://doi.org/10.3390/life12111848

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop