IoT-Based Framework for COVID-19 Detection Using Machine Learning Techniques

: Current advancements in the technology of the Internet of Things (IoT) have led to the proliferation of various applications in the healthcare sector that use IoT. Recently, it has been shown that voice signal data of the respiratory system (i


Introduction
In the voice analysis area, classification of an illness can be made possible by using particular features of voice signals [1].Hence, this area has become a significant topic for research since it can decrease laborious work in the classification of voice pathologies [2].The healthcare field is considered an active application for the IoT, where IoT technology has been employed in various medical applications due its many features.For instance, the IoT is employed in remote health surveillance; monitoring of blood pressure; monitoring of oxygen saturation; care of the elderly; monitoring of chronic diseases; management and control of wheelchairs; and fitness schedules.Figure 1 illustrates IoT devices recently used in the healthcare sector.In addition, several healthcare, diagnostic, and imaging devices can be regarded as smart devices which play a vital part in IoT [3].The incorporation of IoT into healthcare fields is predicted to minimize healthcare service charges, as well as enhance quality of life and improve the user experience.According to the perspective of healthcare organizations, the technology of IoT holds the prospect of reducing a device's downtime via remote service provision.Correspondingly, the IoT is able to precisely specify the times for reloading supplies for various devices for the purpose of performing continued and flexible processes.Additionally, the IoT delivers effective and practical scheduling of specified resources in order to guarantee the provision of the most suitable services and rest for patients [4].IoTs in the healthcare sector [5].
Due to the significance of the healthcare field, combining the cloud and IoT within healthcare holds the potential for extensive applications in both societal and daily life, with a particular focus on the healthcare domain [6].These applications can flourish by embracing cloud and IoT technologies.For instance, there are several uses for patients with chronic conditions who need long-term monitoring [7], when the provision of constant surveillance can be critical.Cloud computing can provide massive storage and high processing power while also ensuring security [8].
Along with IoT and cloud services in healthcare applications, machine learning (ML) introduces several approaches that may aid in addressing prognostic and diagnostic difficulties in various medical areas [9].These techniques have been employed in discriminating between two or more classes of disease [10].Also, ML techniques are extensively employed in diverse domains, for instance in image identification in the healthcare field [11]; language classification [12,13]; identification of voice pathology [14,15]; classification of spam emails [16]; and vehicle detection [17].
Furthermore, ML techniques have been used in many medical applications such as COVID-19 detection [18]; lung cancer detection [19]; voice pathology classification [20]; breast cancer detection [21]; and diabetes disease detection [22,23].One of the most dangerous illnesses facing the world recently is COVID-19 [24].On 11 March 2020 this virus was declared a worldwide pandemic by the World Health Organization (WHO) [25].COVID-19 is regarded as a new infectious illness and can cause death [26].Consequently, it is imperative to exploit the features of IoT, ML and the cloud for use in the healthcare sector generally and in COVID-19 detection in particular.The general symptoms of COVID-19 comprise loss of taste or smell, gastrointestinal symptoms, muscle pain, shortness of breath, and joint pain [27].The most prevalent symptoms of COVID-19 are dry coughs, fatigue, and fever [28].Most systems of COVID-19 detection are performed using x-ray images, whereas voice analysis of COVID-19 patients has not gained much attention.Due to the significance of the healthcare field, combining the cloud and IoT within healthcare holds the potential for extensive applications in both societal and daily life, with a particular focus on the healthcare domain [6].These applications can flourish by embracing cloud and IoT technologies.For instance, there are several uses for patients with chronic conditions who need long-term monitoring [7], when the provision of constant surveillance can be critical.Cloud computing can provide massive storage and high processing power while also ensuring security [8].
Along with IoT and cloud services in healthcare applications, machine learning (ML) introduces several approaches that may aid in addressing prognostic and diagnostic difficulties in various medical areas [9].These techniques have been employed in discriminating between two or more classes of disease [10].Also, ML techniques are extensively employed in diverse domains, for instance in image identification in the healthcare field [11]; language classification [12,13]; identification of voice pathology [14,15]; classification of spam emails [16]; and vehicle detection [17].
Furthermore, ML techniques have been used in many medical applications such as COVID-19 detection [18]; lung cancer detection [19]; voice pathology classification [20]; breast cancer detection [21]; and diabetes disease detection [22,23].One of the most dangerous illnesses facing the world recently is COVID-19 [24].On 11 March 2020 this virus was declared a worldwide pandemic by the World Health Organization (WHO) [25].COVID-19 is regarded as a new infectious illness and can cause death [26].Consequently, it is imperative to exploit the features of IoT, ML and the cloud for use in the healthcare sector generally and in COVID-19 detection in particular.The general symptoms of COVID-19 comprise loss of taste or smell, gastrointestinal symptoms, muscle pain, shortness of breath, and joint pain [27].The most prevalent symptoms of COVID-19 are dry coughs, fatigue, and fever [28].Most systems of COVID-19 detection are performed using x-ray images, whereas voice analysis of COVID-19 patients has not gained much attention.Therefore, this paper presents a new IoT framework based on machine learning techniques for the detection of COVID-19 by using breathing voice signals.The main aims of this paper are as follows:

•
Propose a new IoT framework based on machine learning techniques for the detection of COVID-19 by using breathing voice signals.

•
Use mel-frequency cepstral coefficients (MFCCs) to extract the needed features from breathing voice signals and the naïve Bayes (NB) algorithm to classify whether the input voice signal is positive or negative.

•
Evaluate the proposed work based on several of the most common evaluation measurements-accuracy, sensitivity, specificity, precision, F-Measure, and G-Mean.

•
Compare the proposed NB algorithm against the SVM and RF algorithms in the detection of the COVID-19 by using breathing voice signals.

•
Compare the performance of the proposed work, in terms of accuracy, with recent studies that used the same dataset.
The remainder of the current paper is organised as follows: Section 2 provides a review of previous work relevant to the current study.Section 3 presents the materials and proposed methods, and Section 4 presents the experiment setup and discussion of results.Finally, Section 5 illustrates the conclusion of this research.

Related Works
There are different types of data on the respiratory system, such as coughing, speech, breathing, and sneezing.These data can be analyzed and processed through ML techniques for the purpose of identifying diseases of the respiratory system [29].Here, we will present a brief review of ML methods and techniques that have been used in the detection of COVID-19.The short-time magnitude spectrogram features (SRMSF) and the ResNet18 algorithm were presented and implemented for the identification of COVID19 in [30].In this method, the cough voice samples of the Coswara database were used for the training and testing of the ResNet18 algorithm.The best area under curve (AUC) result was 0.72.
Furthermore, the work in [31] has been suggested as a method for detecting COVID-19 based on voice samples.In this work, mel-filter bank features (MFBF) technique is employed to extract audio features.Subsequently, these features are fed to a support vector machine (SVM) for the purpose of identifying the voice samples as either healthy class or COVID-19 class.The voice samples were collected from YouTube for training and testing the SVM classifier.According to the experimental findings, the highest obtained result for detection accuracy was 88.60%.
The study in [32], has been presented as a system of COVID-19 detection.In this system, several handcrafted features were implemented, while a logistic regression (LR) algorithm was used as a classifier to separate voice features into healthy or COVID-19.There were two types of voice samples used for training and testing the LR classifier.The first type of voice sample was for cough samples, whilst the second type of voice sample was for breath samples.These two types of voice samples were collected from a crowd-sourced database.The experimental outcomes demonstrated that the best results showing precision using cough voice samples and breath voice samples were 80.00% and 69.00%, respectively.
The authors in [33] studied and analyzed the sustained phonation of the vowel /i/, number counting, and deep breathing for the detection of COVID-19.In addition, these three types of voice samples were collected from the DiCOVA database.The authors used various features such as harmonics, super-vectors, MFCC, and formats.In addition, they used the SVM classifier for differentiating the COVID-19 samples from healthy samples.The experimental results showed that the SVM classifier was able to recognize COVID-19 samples with AUC scores of 0.734 and 0.717 for cross-validation and testing, respectively.However, the results obtained via this method were not encouraging.
The work in [34] was presented as a method to differentiate people who were infected by COVID-19 from healthy people.Voice samples for the healthy class and COVID-19 class were collected in India.In the database, there were 152 samples which were affected by COVID-19 (i.e., positive cases) and 1143 samples which were healthy cases (i.e., negative cases).Pre-processing was implemented and the features extracted using MFCC technique.The oversampling was also conducted on the extracted features for the purpose of increasing detection accuracy.According to the experimental outcomes, the highest achieved accuracy was 92%, which was obtained by K-NN classifier.
In [35], the authors conducted an experimental study to assess the effectiveness of bottleneck feature extraction and transfer learning in the identification of COVID-19 using audio recordings of coughs, breaths, and speech.Furthermore, they utilized a dataset that included various sounds such as coughs, sneezes, speech, and background noise, but lacked COVID-19 labels.Initially, the authors pre-trained three deep neural networks, namely CNN, LSTM, and Resnet50.Subsequently, they fine-tuned these pre-trained networks using smaller datasets that contained labelled instances of COVID-19 associated with coughing.In addition, they utilized these pre-trained networks as bottleneck feature extractors.The experimental findings indicated that a Resnet50 classifier trained through this transfer learning approach consistently demonstrated excellent or near-optimal performance across all datasets.It achieved an AUC of 0.98 for the cough set, 0.94 for the breath set, and 0.92 for the speech set.
Another study used a cough dataset for the assessment of COVID-19 [36].This study aimed to employ an automatic speech identification system based on hidden Markov models (HMMs) to analyze cough signals and determine whether they originated from individuals who were healthy or COVID-19 patients.The method was developed as a customizable model that utilizes HMMs, MFCC, Gaussian mixture models (GMMs), and a dataset of cough recordings from both healthy and sick volunteers.The proposed approach successfully categorized a dry cough with a sensitivity ranging from 85.86% to 91.57%.It could also distinguish between a dry cough and a cough associated with COVID-19 symptoms with a specificity ranging from 5% to 10%.
The work in [37] focused on distinguishing between healthy individuals and those infected with COVID-19 by utilizing cough recordings.It built upon the previously successful application of the bag-of-words (BoW) classifier in biometric scenarios involving ECG signals.The evaluation measurements included accuracy in discrimination, F1-score, AUC, and sensitivity.A comprehensive series of tests was carried out using two crowdsourced datasets: COUGHVID (consisting of 250 subjects per class) and COVID-19 Sounds (comprising 450 subjects per class, recorded in a medical setting).To enhance data quality, the authors applied specific preprocessing procedures to clean the recordings and selected audio segments that exclusively contained pure cough sounds.The work also introduced new developments related to the impact of both input and output fusion techniques on classification performance.Additionally, the proposed work has overtaken the CNN algorithm in identifying COVID-19.The results showed that the BoW algorithm achieved the highest accuracy of 74.3%, 71.4% sensitivity, 75.4% F1-score, and 82.6% AUC using the COUGHVID dataset.However, the obtained results are still not encouraging in terms of detecting and identifying COVID-19 instances.
The research in [38] proposed an automatic COVID-19 detection system using cough voice data.The proposed system was based on the utilization of several features extraction techniques such as spectral centroid (SC), zero-crossing rate (ZCR), mel-frequency cepstral coefficients (MFCC), and long short-term memory (LSTM) classifier.The assessment of the proposed system was conducted on a cough voice corpus that was collected from 20 COVID-19 patients (8 female and 12 male) and 60 healthy speakers (20 female and 40 male).In addition, the collected cough voice dataset was divided into 30% for testing and 70% for training.The experimental results showed that the best performance achieved was with the utilization of MFCC features and LSTM classifier, where it hits a precision of 99.30%.
Additionally, an application for the detection of COVID-19 was proposed in [39].The proposed application was built using the short-time Fourier transform (STFT) features and decision forest (DF) classifier.The evaluation of the proposed application was conducted based on speech data.The experiment's outcomes showed that the proposed application attained results with an accuracy of 73.17%.Table 1 illustrates the related work of the current study.Based on all the above-mentioned studies, we can summarize the limitations as follows: • The outcomes of most previous works are still not encouraging and require more enhancement regarding the accuracy rate.

•
Most of the previous studies have been evaluated based on limited evaluation measurements.

Proposed Method
The proposed method comprises four steps for the detection of COVID-19.The first step refers to the use of database voice samples.The second step is the use of the IoT framework.The third step refers to the feature extraction technique.Finally, the fourth step denotes the classification technique for identifying the COVID-19 class from the healthy class.Figure 2 depicts the overall flowchart of the proposed method for detecting COVID-19 using voice data.The following subsections will provide explanations for each of these steps individually.

•
Most of the previous studies have been evaluated based on limited evaluation measurements.

Proposed Method
The proposed method comprises four steps for the detection of COVID-19.The first step refers to the use of database voice samples.The second step is the use of the IoT framework.The third step refers to the feature extraction technique.Finally, the fourth step denotes the classification technique for identifying the COVID-19 class from the healthy class.Figure 2 depicts the overall flowchart of the proposed method for detecting COVID-19 using voice data.The following subsections will provide explanations for each of these steps individually.

Database
In this work, we trained and tested the proposed method using the corona hack respiratory sound dataset (CHRSD).The voice signals of this dataset were collected from Kaggle [35].This voice database comprises numerous types of respiratory sounds.For example, shallow breath, count fast; deep breath, count normal; heavy cough; and vowels (i.e., /a/, /e/, /o/).Each type of respiratory sound contains voice signals for COVID-19-infected people and healthy people.Furthermore, in our proposed method, we selected all breath samples for the detection of COVID-19 in both classes-healthy and COVID-19.We used a balanced voice database, with 78 voices for the COVID-19 category and 78 voices for the healthy category.Thus, the total number in the database was156.The voice database was split into 70% and 30% for training and testing processes, respectively.

Database
In this work, we trained and tested the proposed method using the corona hack respiratory sound dataset (CHRSD).The voice signals of this dataset were collected from Kaggle [35].This voice database comprises numerous types of respiratory sounds.For example, shallow breath, count fast; deep breath, count normal; heavy cough; and vowels (i.e., /a/, /e/, /o/).Each type of respiratory sound contains voice signals for COVID-19infected people and healthy people.Furthermore, in our proposed method, we selected all breath samples for the detection of COVID-19 in both classes-healthy and COVID-19.We used a balanced voice database, with 78 voices for the COVID-19 category and 78 voices for the healthy category.Thus, the total number in the database was156.The voice database was split into 70% and 30% for training and testing processes, respectively.

IoT Framework
The proposed IoT framework/model, merged with cloud computing, aimed to diagnose COVID-19 via testing obtained voice samples of the vowel /a/. Figure 3  procedures of the proposed IoT framework using IoT devices for capturing and uploading the user's voice to the cloud, where the voice signal was processed and analyzed.

presents the
Sci 2024, 6, x FOR PEER REVIEW 6 of 13

IoT Framework
The proposed IoT framework/model, merged with cloud computing, aimed to diagnose COVID-19 via testing obtained voice samples of the vowel /a/. Figure 3 presents the procedures of the proposed IoT framework using IoT devices for capturing and uploading the user's voice to the cloud, where the voice signal was processed and analyzed.The proposed IoT framework can be useful for patients who are suffering from COVID-19 to self-diagnose using their voice.Some patients may have difficulty getting to hospitals or clinics for a medical examination for various reasons, such as distance from home and problems obtaining appointments at short notice; others may think that their hoarseness is not life endangering.Nonetheless, early diagnosis is imperative for this kind disease in order to determine if there are any problems with vocal folds.Using our proposed IoT framework, recording the process of a patient's voice for voice diagnosis can be made easier by using IoT devices such as smartphones.Hence, the proposed IoT framework presents not only a simpler and easier process of self-diagnosis for patients but can also motivate them to regularly check their voice when they have any hoarseness or voice disorder.The recorded voice can then be sent to the cloud for further processing and classification into COVID-19 class or healthy class by using the machine learning technique.

Feature Extraction
In this study, voice features were extracted using the MFCC technique based on performing various stages.These stages included pre-emphasis, windowing, FFT (fast Fourier transform), mel filter bank, log, and DCT (discrete cosine transform).The diagram of the MFCC technique is shown Figure 4.In the pre-emphasis stage, the energy of the voice samples were increased at a higher frequency.The windowing stage refers to splitting the voice signals into frames.In addition, the FFT stage denotes converting the voice sample from the time field into a frequency field.Subsequently, the mel filter bank is involved for the purpose of The proposed IoT framework can be useful for patients who are suffering from COVID-19 to self-diagnose using their voice.Some patients may have difficulty getting to hospitals or clinics for a medical examination for various reasons, such as distance from home and problems obtaining appointments at short notice; others may think that their hoarseness is not life endangering.Nonetheless, early diagnosis is imperative for this kind of disease in order to determine if there are any problems with vocal folds.Using our proposed IoT framework, recording the process of a patient's voice for voice diagnosis can be made easier by using IoT devices such as smartphones.Hence, the proposed IoT framework presents not only a simpler and easier process of self-diagnosis for patients but can also motivate them to regularly check their voice when they have any hoarseness or voice disorder.The recorded voice can then be sent to the cloud for further processing and classification into COVID-19 class or healthy class by using the machine learning technique.

Feature Extraction
In this study, voice features were extracted using the MFCC technique based on performing various stages.These stages included pre-emphasis, windowing, FFT (fast Fourier transform), mel filter bank, log, and DCT (discrete cosine transform).The diagram of the MFCC technique is shown Figure 4.In the pre-emphasis stage, the energy of the voice samples were increased at a higher frequency.The windowing stage refers to splitting the voice signals into frames.

IoT Framework
The proposed IoT framework/model, merged with cloud computing, aimed to diagnose COVID-19 via testing obtained voice samples of the vowel /a/. Figure 3 presents the procedures of the proposed IoT framework using IoT devices for capturing and uploading the user's voice to the cloud, where the voice signal was processed and analyzed.The proposed IoT framework can be useful for patients who are suffering from COVID-19 to self-diagnose using their voice.Some patients may have difficulty getting to hospitals or clinics for a medical examination for various reasons, such as distance from home and problems obtaining appointments at short notice; others may think that their hoarseness is not life endangering.Nonetheless, early diagnosis is imperative for this kind of disease in order to determine if there are any problems with vocal folds.Using our proposed IoT framework, recording the process of a patient's voice for voice diagnosis can be made easier by using IoT devices such as smartphones.Hence, the proposed IoT framework presents not only a simpler and easier process of self-diagnosis for patients but can also motivate them to regularly check their voice when they have any hoarseness or voice disorder.The recorded voice can then be sent to the cloud for further processing and classification into COVID-19 class or healthy class by using the machine learning technique.

Feature Extraction
In this study, voice features were extracted using the MFCC technique based on performing various stages.These stages included pre-emphasis, windowing, FFT (fast Fourier transform), mel filter bank, log, and DCT (discrete cosine transform).The diagram of the MFCC technique is shown Figure 4.In the pre-emphasis stage, the energy of the voice samples were increased at a higher frequency.The windowing stage refers to splitting the voice signals into frames.In addition, the FFT stage denotes converting the voice sample from the time field into a frequency field.Subsequently, the mel filter bank is involved for the purpose of In addition, the FFT stage denotes converting the voice sample from the time field into a frequency field.Subsequently, the mel filter bank is involved for the purpose of converting frequencies of the voice signals from Hz to mel as shown in the following equation: The DCT was implemented to convert the log mel spectrum into a time field.Therefore, each voice signal was converted into a sequence of attributes.

Classification
The naïve Bayes (NB) algorithm was used in the classification process for the purpose of identifying voice samples into the COVID-19 category and the healthy category.It is worth stating that the NB algorithm has been implemented extensively in prediction and diagnostic tasks, where the performance of the NB algorithm has been proven efficient and shown to achieve higher results compared to other algorithms [41].Furthermore, the NB algorithm can perform well based on a small number of instances for determining subjects.In addition, the NB algorithm is easy to implement.In the NB algorithm, the attribute values of the feature in a category are presumed to be independent of others, known as class conditional independence.In the NB algorithm, the attributes of a class c are indicated as a group of {f 1 , f 2 . . .f n } = (x) which represents the feature vector of voice signal samples.In addition, the probability P(c|x) denotes the probability of a new voice signal.It can be computed by employing the Bayes equation: The parameters of the equation are: is the posterior probability of class (c) given predictor (x).P(c) is the prior probability of class.P(x|c) is the likelihood which is the probability of the predictor given class.P(x) is the prior probability of the predictor.
The process of classification can easily be defined in three simple steps: (a) create a frequency table from the dataset; (b) establish a likelihood table by specifying the probabilities; and (c) utilize the Bayesian equation to measure the post-class probability.The prediction outcome is the class with the highest posterior probability.

Experimental Results
The voice samples in this work were gathered from the CHRSD database for both classes, COVID-19 and healthy.In this database, there are different types of respiratory categories.In the proposed work, we used all breathing categories for both classes in order to identify COVID-19.The used in this work was balanced-there were 78 voice samples for each class.Hence, the total number of all samples in the database was 156.The voice database was split into 70% and 30% for training and testing processes, respectively.In the training set, there were 109 voice samples, with 47 voice samples in the testing set.The experiment was conducted utilizing the Python 3 programming language on a PC (Windows 10).The NB algorithm was assessed in terms of common evaluation measurements: accuracy (Acc), sensitivity (Sen), specificity (Spe), precision (Pre), F-Measure (F-M), and G-Mean (G-M).These performance measurements were computed using the following equations [42][43][44][45][46]: G-Mean = Specificity × Sensitivity (8) where TP refers to true positive; TN denotes true negative; and FP and FN indicate false positive and false negative, respectively.The experimental results in Table 2 demonstrate that the proposed method using the NB algorithm achieved 82.97% Acc, 75.86%Sen, and 94.44% Spe.Furthermore, the achieved results for the Pre, F-M, and G-M were 95.65%, 84.61%, and 84.64%, respectively.In addition, Figure 5 exhibits the confusion matrix for the NB algorithm.
G − Mean = √Specificity × Sensitivity (8) where  refers to true positive;  denotes true negative; and  and  indicate false positive and false negative, respectively.The experimental results in Table 2 demonstrate that the proposed method using the NB algorithm achieved 82.97% Acc, 75.86%Sen, and 94.44% Spe.Furthermore, the achieved results for the Pre, F-M, and G-M were 95.65%, 84.61%, and 84.64%, respectively.In addition, Figure 5 exhibits the confusion matrix for the NB algorithm.Moreover, additional experiments were conducted utilizing the SVM and RF algorithms on the same CHRSD database.Table 3 shows all the experimental results of both SVM and RF algorithms.The experimental outcomes in Table 3 reveal that the SVM algorithm achieved 76.60% Acc, 82.14% Sen, 68.42% Spe, 79.31% Pre, 80.70% F-M, and 74.97% G-M.Furthermore, the RF algorithm attained 72.34% Acc, 68.00% Sen, 77.27% Spe, 77.27% Pre, 72.34% F-M, and 72.49% G-M.Figures 6 and 7 show the confusion matrices for the SVM and RF algorithms.Further, Figure 8 depicts the comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.Moreover, additional experiments were conducted utilizing the SVM and RF algorithms on the same CHRSD database.Table 3 shows all the experimental results of both SVM and RF algorithms.The experimental outcomes in Table 3 reveal that the SVM algorithm achieved 76.60% Acc, 82.14% Sen, 68.42% Spe, 79.31% Pre, 80.70% F-M, and 74.97% G-M.Furthermore, the RF algorithm attained 72.34% Acc, 68.00% Sen, 77.27% Spe, 77.27% Pre, 72.34% F-M, and 72.49% G-M.Figures 6 and 7 show the confusion matrices for the SVM and RF algorithms.Further, Figure 8   Based on the results in Tables 2 and 3 and Figures 5-8, it is clear that the proposed NB algorithm outperformed both the SVM and RF algorithms.The highest accuracy rate achieved by the proposed NB algorithm was 82.97% while the highest accuracy rate achieved by the SVM and RF algorithms were 76.60% and 72.34%, respectively.
In addition, we evaluated and compared the NB algorithm with other methods in [47][48][49], in terms of the detection accuracy in identifying COVID-19.These methods have been presented in the detection of COVID-19 based on all breath samples.Also, these methods used the same database that we used in the proposed NB algorithm in order to perform a fair comparison.Table 4 shows the comparison between the proposed NB algorithm with other methods in the detection of COVID-19.The experimental findings demonstrate that the proposed NB algorithm outperformed its comparatives in the detection of COVID-19 based on all breath samples.Table 4. Comparison of accuracy between the proposed NB algorithm and other methods.

Accuracy Results
The proposed NB algorithm 82.97% CNN [47] 70.37% SVM [48] 81.50% RF [49] 75.17%XGBoost [50] 63.04% Even though the results in Table 3 demonstrate that the proposed NB algorithm outperformed all of its peers in terms of classification accuracy rate, there are still some limitations of the current work, which are as follows: The current work has been evaluated on a small dataset which may affect the detection accuracy rate.
The evaluation of the present work in terms of execution time has been ignored.This research focused on classifying the respiratory system voice data into two classes (COVID-19/healthy) only, and other lung illnesses were ignored.Based on the results in Tables 2 and 3 and Figures 5-8, it is clear that the proposed NB algorithm outperformed both the SVM and RF algorithms.The highest accuracy rate achieved by the proposed NB algorithm was 82.97% while the highest accuracy rate achieved by the SVM and RF algorithms were 76.60% and 72.34%, respectively.
In addition, we evaluated and compared the NB algorithm with other methods in [47][48][49], in terms of the detection accuracy in identifying COVID-19.These methods have been presented in the detection of COVID-19 based on all breath samples.Also, these methods used the same database that we used in the proposed NB algorithm in order to perform a fair comparison.Table 4 shows the comparison between the proposed NB algorithm with other methods in the detection of COVID-19.The experimental findings demonstrate that the proposed NB algorithm outperformed its comparatives in the detection of COVID-19 based on all breath samples.

Accuracy Results
The proposed NB algorithm 82.97% CNN [47] 70.37% SVM [48] 81.50% RF [49] 75.17%XGBoost [50] 63.04% Even though the results in Table 3 demonstrate that the proposed NB algorithm outperformed all of its peers in terms of classification accuracy rate, there are still some limitations of the current work, which are as follows: The current work has been evaluated on a small dataset which may affect the detection accuracy rate.
The evaluation of the present work in terms of execution time has been ignored.This research focused on classifying the respiratory system voice data into two classes (COVID-19/healthy) only, and other lung illnesses were ignored.

Conclusions
This paper presents a new IoT framework based on a machine-learning algorithm for the detection of COVID-19 using breathing voice signals.IoT devices have been used to capture voice signals from users and then send them to the cloud.In the cloud, voice signals can be processed and analyzed using naïve Bayes (NB), one of the machine learning techniques.The voice samples for this study were gathered from the CHRSD database, with a balanced number of samples for each class-healthy and COVID-19 with each class having 78 voice samples.Furthermore, the NB algorithm was evaluated using six different evaluation measurements: accuracy, sensitivity, specificity, precision, F-Measure, and G-Mean.The results demonstrated that the NB algorithm achieved 82.97% accuracy, 75.86%, sensitivity, 94.44% specificity, 95.65% precision, 84.61% F-Measure, and 84.64% G-Mean.However, the current work is still suffering from some limitations which can be illustrated as follows: (i) evaluation of the current work was based on a small dataset, and (ii) the present work has ignored the evaluation of the proposed NB algorithm in terms of execution time.Therefore, in future work we plan to evaluate the proposed work on a bigger dataset alongside an evaluation of execution time.In addition, we would consider the classification of other lung illnesses using respiratory system voice data.Moreover, we would test the use of different classifiers for the detection of COVID-19 based on various voice types of the respiratory system such as vowels, speech, and coughing.
Therefore, this paper presents a new IoT framework based on machine learning techniques for the detection of COVID-19 by using breathing voice signals.The main aims of this paper are as follows: • Propose a new IoT framework based on machine learning techniques for the detection of COVID-19 by using breathing voice signals.• Use mel-frequency cepstral coefficients (MFCCs) to extract the needed features from breathing voice signals and the naïve Bayes (NB) algorithm to classify whether the input voice signal is positive or negative.

Figure 3 .
Figure 3.The proposed IoT framework for capturing voice signals using IoT devices.

Figure 3 .
Figure 3.The proposed IoT framework for capturing voice signals using IoT devices.

Figure 3 .
Figure 3.The proposed IoT framework for capturing voice signals using IoT devices.

Figure 5 .
Figure 5.The confusion matrix of the NB algorithm.Additionally, based on the confusion matrix for the NB algorithm's best outcomes shown in Figure 5, we can observe the following: (a) it was a clear indication that the proposed NB algorithm on the CHRSD database was able to accurately classify 22 out of 23 COVID-19 samples, and (b) the proposed NB algorithm was capable of accurately classifying 17 out of 23 healthy samples.Moreover, additional experiments were conducted utilizing the SVM and RF algorithms on the same CHRSD database.Table3shows all the experimental results of both SVM and RF algorithms.The experimental outcomes in Table3reveal that the SVM algorithm achieved 76.60% Acc, 82.14% Sen, 68.42% Spe, 79.31% Pre, 80.70% F-M, and 74.97% G-M.Furthermore, the RF algorithm attained 72.34% Acc, 68.00% Sen, 77.27% Spe, 77.27% Pre, 72.34% F-M, and 72.49% G-M.Figures6 and 7show the confusion matrices for the SVM and RF algorithms.Further, Figure8depicts the comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.

Figure 5 .
Figure 5.The confusion matrix of the NB algorithm.Additionally, based on the confusion matrix for the NB algorithm's best outcomes shown in Figure 5, we can observe the following: (a) it was a clear indication that the proposed NB algorithm on the CHRSD database was able to accurately classify 22 out of 23 COVID-19 samples, and (b) the proposed NB algorithm was capable of accurately classifying 17 out of 23 healthy samples.Moreover, additional experiments were conducted utilizing the SVM and RF algorithms on the same CHRSD database.Table3shows all the experimental results of both SVM and RF algorithms.The experimental outcomes in Table3reveal that the SVM algorithm achieved 76.60% Acc, 82.14% Sen, 68.42% Spe, 79.31% Pre, 80.70% F-M, and 74.97% G-M.Furthermore, the RF algorithm attained 72.34% Acc, 68.00% Sen, 77.27% Spe, 77.27% Pre, 72.34% F-M, and 72.49% G-M.Figures6 and 7show the confusion matrices for the SVM and RF algorithms.Further, Figure8depicts the comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.
depicts the comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.

Figure 8 .
Figure 8.The comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.

Figure 8 .
Figure 8.The comparison results of the proposed NB algorithm against the SVM and RF in COVID-19 detection.

Table 1 .
Summary of related work.

Table 1 .
Summary of related work.

Table 2 .
The best results achieved by the proposed NB algorithm in the detection of COVID-19.

Table 2 .
The best results achieved by the proposed NB algorithm in the detection of COVID-19.

Table 4 .
Comparison of accuracy between the proposed NB algorithm and other methods.