Classiﬁcation of Children’s Sitting Postures Using Machine Learning Algorithms

: Sitting on a chair in an awkward posture or sitting for a long period of time is a risk factor for musculoskeletal disorders. A postural habit that has been formed cannot be changed easily. It is important to form a proper postural habit from childhood as the lumbar disease during childhood caused by their improper posture is most likely to recur. Thus, there is a need for a monitoring system that classiﬁes children’s sitting postures. The purpose of this paper is to develop a system for classifying sitting postures for children using machine learning algorithms. The convolutional neural network (CNN) algorithm was used in addition to the conventional algorithms: Naïve Bayes classiﬁer (NB), decision tree (DT), neural network (NN), multinomial logistic regression (MLR), and support vector machine (SVM). To collect data for classifying sitting postures, a sensing cushion was developed by mounting a pressure sensor mat (8 × 8) inside children’s chair seat cushion. Ten children participated, and sensor data was collected by taking a static posture for the ﬁve prescribed postures. The accuracy of CNN was found to be the highest as compared with those of the other algorithms. It is expected that the comprehensive posture monitoring system would be established through future research on enhancing the classiﬁcation algorithm and providing an effective feedback system.


Introduction
The sedentary lifestyle is prevalent in modern society. Among the sedentary behaviors, sitting in a chair is a routine behavior of office workers and students in their work environment. However, prolonged sitting behavior and awkward sitting postures have been reported to act as negative factors affecting health. Prolonged sitting has been known to be associated with premature mortality [1], chronic illness [2], metabolic syndrome [3], and obesity [4]. Moreover, sitting with poor posture could have a negative influence on the lumbar health [5]. An excessive flexion and extension of the lumbar are representative patterns of poor sitting postures, which cause high compression in the lumbar [6]. In particular, back pain in childhood caused by improper sitting posture can be a risk factor for lumbar disease in adulthood [7]. Therefore, it is important to form correct postural habits from childhood. To prevent the occurrence of these illnesses, one should maintain a good sitting posture and take a break at the appropriate time intervals when performing tasks. However, it is difficult for people to maintain a correct posture with constant awareness or to change one's incorrect postural habits by oneself.
On this basis, researches have been conducted for developing a posture monitoring system that helps people to maintain a correct posture. The dominant approach is to use pressure distribution data obtained from pressure sensors attached to chair structures such as a seat and backrest. To realize this system, it is necessary for the process of classifying the sitting posture accurately in real time to precede the development of the posture monitoring system. Previous studies have adopted several conventional algorithms for classifying sitting postures such as the Hidden Markov Models (HMM), Naïve Bayes (NB) classifier and k-nearest neighbor (kNN) classifier [8][9][10]. Conventional machine learning algorithms including neural network, support vector machine (SVM), and kNN are still being adopted in various research objectives such as fault diagnosis, wind speed prediction, and thermal anomalies identification [11][12][13][14]. Recently, it has been proven that high performance can be obtained by using deep learning in various research fields such as image processing, and speech recognition. The convolutional neural network (CNN) is a prevalent type of deep learning algorithm, and it has a structure that can fully utilize two-dimensional input data. Thus, it has been widely applied to various image recognition fields such as motor fault diagnosis, vision-based mobile robot navigation, physiological signal analysis in medical assessment and human emotion recognition [15][16][17][18]. To our best knowledge, however, the CNN algorithm has not been used for research on the classification of sitting postures for children. As the pressure distribution data has two-dimensional of image structure, it is reasonable to assume that the sitting posture presented based on the pressure distribution could be classified using a CNN algorithm.
Previous studies on the classification of sitting postures were not focused on a particular algorithm. The reason for applying various algorithms is that the classification performance is affected by both the characteristics of the algorithm and the data set. It can be inferred that it is necessary to identify the optimal classification algorithm for various experimental environments. Most of the previous studies have focused on adults only, not children. However, as children have different physical characteristics from adults and as children in the growth phase exhibit large variations within a group, the results of previous experiments may differ from those obtained in studies comprising adult participants [19]. Furthermore, in studies that perform postural classification, it is important to target children since the habit of sitting in the right posture in childhood is more important when compared with adulthood. First of all, inappropriate sitting postural habits formed in childhood are not easily corrected when they become adults [20]. It implies that it is most likely that the wrong posture habit will be maintained even after becoming an adult. In addition, back pain caused by inappropriate posture in childhood is likely to develop again [21]. Thus, the posture classification study for children can help to prevent lumbar disease in the long term by encouraging the correct postural habit formation during childhood.
Therefore, the objective of this study is to classify the sitting postures of children via conventional algorithms and deep learning-based algorithm using the body pressure distribution data from pressure sensors. The conventional machine learning algorithms used in this study are Naïve Bayes classifier (NB), decision tree (DT), neural network (NN), and SVM. Through a comparison of the application of CNN and conventional machine learning algorithms on the data set obtained from the pressure distribution data for children, we investigate the effectiveness of our approach wherein the CNN algorithm is applied while ensuring reliability through a validation analyses (average value of accuracy = 0.953). The remainder of this study is organized as follows. Section 2 contains a literature review of the related studies and machine learning algorithms. Section 3 describes the methods of the experiment and data processing. Section 4 presents the results of the validation analysis for each machine learning algorithm and the further analysis results of the CNN algorithm are presented. Section 5 discusses the results of the machine learning algorithms used in this study and compares them with the results of the previous studies. Finally, the conclusions are summarized in Section 6 along with the applications of this study and the scope for future study.

Sitting Posture Classification
Several methods have been adopted to measure sitting posture and classify it. Camera, is one of the method used to identify and capture sitting postures [22]. Sensors can also be used by being attached to the user's back to recognize sitting postures based on motion data [23][24][25]. However, the former introduces issues with respect to privacy and unnatural behavior owing to negative feelings caused by being monitored [26], and the latter has drawbacks of the possibility of inducing unnatural movements owing to the attached sensors [27]. The use of pressure sensors mounted on the parts of a chair (e.g., seat and backrest) has been employed to classify sitting postures to overcome these issues. Previous studies have been performed for classifying sitting postures of users by attaching pressure sensors to chair structures such as the seat and backrest as a method to overcome the aforementioned issues. Table 1 shows the previous studies in which posture classification was performed, including the number of pressure sensors used, the locations in which they were installed, participant information, the number of predicted sitting conditions, the algorithm used, and the overall accuracy.
Although the number of sensors used in previous studies varied, these studies can be largely divided into two types depending on the location of installation of the sensor. In one type, the sensor was attached to the seat only [26,28,29], and in the other type the sensor was attached to both the seat and backrest [9,[30][31][32]. In addition to the pressure sensor, Benocci et al. [8] used additional sensors such as kinetic related and temperature sensors to classify the sitting postures. Most of the previous studies attempted to classify the sitting upright posture in which the waist is straight, and the feet are placed flat on the floor, the postures in which the upper body is tilted forward, backward, left, or right, and the postures in which the left or right leg is crossed. In the case of the classification of various other postures, the postures in which the legs are crossed were segmented (e.g., one leg over the other, one leg over the other with one foot on the other, one foot on the seat under the other leg's thigh) [32]. In addition, there were also cases wherein the movement pattern of the upper body and whether the user was seated were predicted [28,31]. Various methods have been adopted for classifying the sitting postures, and they include the use of PCA [32], Naïve Bayes classifier [10], hybrid cascade sitting posture classifier [30], SVM [29,33], kNN [9], dynamic time warping-based classification [26], and density-based methods [28]. Ma et al. [30] have compared the classification performance obtained on applying DT, SVM, and multilayer perception.

Learning Algorithm
The LeNet-5 which is one of the early CNN model proposed by LeCun for use in character recognition is specialized in adaptive image processing [34]. CNN is easier to train than other types of feedforward artificial neural network techniques and has the advantage of using fewer parameters. For this reason, CNN has been applied in various fields [35][36][37]. The structure of a common CNN is shown in Figure 1. It consists of a series of layers. Each layer contains one or more planes. Each plane receives inputs from a small neighborhood in the plane of the previous layer. Each plane can be viewed as a feature map with a fixed feature detector that is convolved to a local window that retrieves the plane from the previous layer. Multiple layers are used on each layer to detect multiple features. The layer is called the convolutional layer. Once the feature is detected, the exact location becomes less important. As a result, the convolutional layer typically follows a different layer that typically performs local averages and sub-samples operations. These layers are referred to as sub-sample layers. Finally, the network has a fully connected network that performs cataloguing operations using features extracted from the previous layer. The network was uniformly trained by a backpropagation algorithm.

Learning Algorithm
The LeNet-5 which is one of the early CNN model proposed by LeCun for use in character recognition is specialized in adaptive image processing [34]. CNN is easier to train than other types of feedforward artificial neural network techniques and has the advantage of using fewer parameters. For this reason, CNN has been applied in various fields [35][36][37]. The structure of a common CNN is shown in Figure 1. It consists of a series of layers. Each layer contains one or more planes. Each plane receives inputs from a small neighborhood in the plane of the previous layer. Each plane can be viewed as a feature map with a fixed feature detector that is convolved to a local window that retrieves the plane from the previous layer. Multiple layers are used on each layer to detect multiple features. The layer is called the convolutional layer. Once the feature is detected, the exact location becomes less important. As a result, the convolutional layer typically follows a different layer that typically performs local averages and sub-samples operations. These layers are referred to as subsample layers. Finally, the network has a fully connected network that performs cataloguing operations using features extracted from the previous layer. The network was uniformly trained by a backpropagation algorithm. The NB classifier is a supervised machine learning classifier, and it is a probabilistic classification algorithm based on the Bayes theorem. In this model, it is generally assumed that all attributes are independent of each other, given the class [38] and it is widely adopted to reduce the number of parameters. Equations (1)-(3) represent the Bayes rule, assumption of conditional independence, and classification rule for the new input data. According to Zhang [39], less training data is required to accurately estimate the parameters required for the classification as compared to other complex graphical models.
, k n k k n j k n j P Y y P X X Y y P Y y x X P y y P X X y y The multinomial logistic regression (MLR) classifier was introduced by McFadden, and it is a classification method that can be used to predict more than two possible discrete variables [40]. Similar to binary logistic regression, the probability of categorical membership is evaluated based on The NB classifier is a supervised machine learning classifier, and it is a probabilistic classification algorithm based on the Bayes theorem. In this model, it is generally assumed that all attributes are independent of each other, given the class [38] and it is widely adopted to reduce the number of parameters. Equations (1)-(3) represent the Bayes rule, assumption of conditional independence, and classification rule for the new input data. According to Zhang [39], less training data is required to accurately estimate the parameters required for the classification as compared to other complex graphical models. P(Y = y k |x 1 , · · · X n ) = P(Y = y k )P(X 1 · · · X n |Y = y k ) Σ j P(y = y k )P X 1 · · · X n |y = y j (1) The multinomial logistic regression (MLR) classifier was introduced by McFadden, and it is a classification method that can be used to predict more than two possible discrete variables [40]. Similar to binary logistic regression, the probability of categorical membership is evaluated based on the maximum likelihood estimation. In addition, this model does not assume normality, linearity, or homoscedasticity [41].
The DT method is a regression analysis or classification method depending on whether the response variable is continuous or categorical. The DT learning algorithm was developed by Quinlan to construct decision tree from an obtained data set [42]. This model divides the space of the predictive variables and predicts the values of the dependent variables in the divided parts. The formation process of this model mainly consists of growth, pruning, feasibility assessment, interpretation, and forecasting. This algorithm forms a tree-like structure based on separation criterion concerning an impurity index such as entropy and the Gini index. According to Kamiński [43], this model consists of three types of nodes, which are the decision, chance, and end nodes. An example of a typical structure of this model is presented in Figure 2. the maximum likelihood estimation. In addition, this model does not assume normality, linearity, or homoscedasticity [41]. The DT method is a regression analysis or classification method depending on whether the response variable is continuous or categorical. The DT learning algorithm was developed by Quinlan to construct decision tree from an obtained data set [42]. This model divides the space of the predictive variables and predicts the values of the dependent variables in the divided parts. The formation process of this model mainly consists of growth, pruning, feasibility assessment, interpretation, and forecasting. This algorithm forms a tree-like structure based on separation criterion concerning an impurity index such as entropy and the Gini index. According to Kamiński [43], this model consists of three types of nodes, which are the decision, chance, and end nodes. An example of a typical structure of this model is presented in Figure 2. The NN is a statistical learning algorithm that is inspired by neural networks of the brain, which is an animal's central nervous system [44]. To realize an acceptable performance, this model requires the use of a large number of interconnected neurons [45]. Generally, models of NNs are divided into feed forward NNs and recurrent NNs in terms of signaling schemes. The representative structure of this model is presented in Figure 3.
The output value of each node is calculated as follows: where N is a dimension of the input vector, X is a value of the input, and W is the weight value. A function σ (•) is a sigmoid activation function, which facilitates expansion to a nonlinear model. A logistic function, hyperbolic tangent function, or probit function is commonly used as the activation function. In the training process, this algorithm determines the optimization weight value Wi. In a typical NN model consisting of multiple layers, the output value of the kth node in the nth layer is described as follows: The NN is a statistical learning algorithm that is inspired by neural networks of the brain, which is an animal's central nervous system [44]. To realize an acceptable performance, this model requires the use of a large number of interconnected neurons [45]. Generally, models of NNs are divided into feed forward NNs and recurrent NNs in terms of signaling schemes. The representative structure of this model is presented in Figure 3.
The output value of each node is calculated as follows: where N is a dimension of the input vector, X is a value of the input, and W is the weight value. A function σ (·) is a sigmoid activation function, which facilitates expansion to a nonlinear model. A logistic function, hyperbolic tangent function, or probit function is commonly used as the activation function. In the training process, this algorithm determines the optimization weight value W i . In a typical NN model consisting of multiple layers, the output value of the kth node in the nth layer is described as follows: where σ is the activation function. W ik n is the weight value connected from the jth node of the n − 1th layer to the kth node of the nth layer. The output value derived from the jth node of the n − 1th layer serves as an input value. The weights are usually found by using the backpropagation method during the learning phase. x output w (6) where σ is the activation function. Wik n is the weight value connected from the jth node of the n − 1th layer to the kth node of the nth layer. The output value derived from the jth node of the n − 1th layer serves as an input value. The weights are usually found by using the backpropagation method during the learning phase. The SVM is a supervised learning model for pattern recognition and data analysis. It is mainly used for classification and regression analysis [46]. This algorithm finds a classifying hyperplane that can maximize margins in the feature space wherein the original data is mapped [47]. By solving the dual problem of the original optimization, it is only required to calculate the kernel function values, , = , , the inner product of feature vectors instead of feature vectors themselves, where is a mapping function from the original space to the feature space. This characteristic dramatically increases flexibility of the mapping function and reduces computational cost. For example, the Gaussian (radial basis) kernel function, , = exp | | with the kernel scaling factor , can be employed to construct SVM models although the corresponding feature vector cannot be explicitly calculated because it is represented as an infinite dimensional vector. The representative structure of this model is presented in Figure 4. Both one versus one extension and one versus all extension are generally employed to extend the binary classification task to multiclass classification tasks for SVM. For more details on SVM, please see [47]. The SVM is a supervised learning model for pattern recognition and data analysis. It is mainly used for classification and regression analysis [46]. This algorithm finds a classifying hyperplane that can maximize margins in the feature space wherein the original data is mapped [47]. By solving the dual problem of the original optimization, it is only required to calculate the kernel function values, k(x, y) = φ(x), φ(y) , the inner product of feature vectors instead of feature vectors themselves, where φ(x) is a mapping function from the original space to the feature space.
This characteristic dramatically increases flexibility of the mapping function and reduces computational cost. For example, the Gaussian (radial basis) kernel function, k(x, y) = exp (−γ||x − y|| 2 ) with the kernel scaling factor γ, can be employed to construct SVM models although the corresponding feature vector φ(x) cannot be explicitly calculated because it is represented as an infinite dimensional vector. The representative structure of this model is presented in Figure 4. Both one versus one extension and one versus all extension are generally employed to extend the binary classification task to multiclass classification tasks for SVM. For more details on SVM, please see [47]. Appl

Methods
The overall study and analysis procedure is shown in Figure 5. Participants are introduced to the experiment and after that instructed to take the specified sitting posture. Then researchers acquire real-time pressure data for 5 min from 64 sensors (8 by 8) installed in the seat. After enough rest time, the same experiment is carried out for the next sitting posture. After that, the input size of the raw data is modified. Preprocessing procedure was applied to the modified input data with smoothing and normalization. Using the refined data, a total of six learning algorithms are applied to classify sitting postures. Finally, we compare the performance of each classification algorithm. Details are covered in sub-sections below.

Sensor Apparatus
To collect pressure data, a pressure sensor was installed inside the seat cushion of a child chair ( Figure 6). The pressure measuring area is 318 × 318 mm, and the polyethylene terephthalate (PET) film sensor has a force sensing resister (FSR) of 8 × 8. The data is transferred to an android smartphone via Bluetooth network with 12 bits.

Methods
The overall study and analysis procedure is shown in Figure 5. Participants are introduced to the experiment and after that instructed to take the specified sitting posture. Then researchers acquire real-time pressure data for 5 min from 64 sensors (8 by 8) installed in the seat. After enough rest time, the same experiment is carried out for the next sitting posture. After that, the input size of the raw data is modified. Preprocessing procedure was applied to the modified input data with smoothing and normalization. Using the refined data, a total of six learning algorithms are applied to classify sitting postures. Finally, we compare the performance of each classification algorithm. Details are covered in sub-sections below.

Methods
The overall study and analysis procedure is shown in Figure 5. Participants are introduced to the experiment and after that instructed to take the specified sitting posture. Then researchers acquire real-time pressure data for 5 min from 64 sensors (8 by 8) installed in the seat. After enough rest time, the same experiment is carried out for the next sitting posture. After that, the input size of the raw data is modified. Preprocessing procedure was applied to the modified input data with smoothing and normalization. Using the refined data, a total of six learning algorithms are applied to classify sitting postures. Finally, we compare the performance of each classification algorithm. Details are covered in sub-sections below.

Sensor Apparatus
To collect pressure data, a pressure sensor was installed inside the seat cushion of a child chair ( Figure 6). The pressure measuring area is 318 × 318 mm, and the polyethylene terephthalate (PET) film sensor has a force sensing resister (FSR) of 8 × 8. The data is transferred to an android smartphone via Bluetooth network with 12 bits.

Sensor Apparatus
To collect pressure data, a pressure sensor was installed inside the seat cushion of a child chair ( Figure 6). The pressure measuring area is 318 × 318 mm, and the polyethylene terephthalate (PET) film sensor has a force sensing resister (FSR) of 8 × 8. The data is transferred to an android smartphone via Bluetooth network with 12 bits. Appl. Sci. 2018, 8, x FOR PEER REVIEW 9 of 20

Participants
A total of 10 children aged from 7 to 11 years (Mean = 9.10, Std. = 1.73) participated in this experiment. We recruited participants weighing 20 to 40 kg (20-29 kg: three, 30-39 kg: three, 40-49 kg: four). All the children participated in the experiment together with their parents present.

Selected Sitting Postures
In this study, it was important to reflect the characteristic of the child's sitting behavior. Therefore, to contain any awkward postures reflected in the children's sitting behavior, an online survey was conducted of parents with young children. A total of 32 parents completed the questionnaire regarding their child's unusual sitting posture. As a result, two sitting postures were included in this experiment, namely sitting at the front of the chair and sitting crossed-legged on the chair. Finally, five postures were selected to reflect the representative postures in the previous studies and the sitting behavior characteristics of the children. The selected postures included sitting straight, lean left, lean right, sitting at the front of the chair and sitting crossed-legged on the chair. The description and images of these sitting postures were presented in Table 2.

Data Collection
The children sat in front of their desks while watching children's entertainment videos. They took predefined sitting postures in accordance with the instructions of one researcher. After the other researchers confirmed that the participants took their sitting postures well, the pressure data was collected. The experimental environment is shown in Figure 7. The stored pressure sensor data was converted to a heatmap using Plot.ly's in R, which is an open source library for data visualization. The data were measured for 5 min per posture. The total experiment time was 45 min and the participants were able to take rest sufficiently during the experiment. A total of 260 body pressure distribution data were collected for one sitting posture, and a total of 13,000 sitting posture data were collected.

Participants
A total of 10 children aged from 7 to 11 years (Mean = 9.10, Std. = 1.73) participated in this experiment. We recruited participants weighing 20 to 40 kg (20-29 kg: three, 30-39 kg: three, 40-49 kg: four). All the children participated in the experiment together with their parents present.

Selected Sitting Postures
In this study, it was important to reflect the characteristic of the child's sitting behavior. Therefore, to contain any awkward postures reflected in the children's sitting behavior, an online survey was conducted of parents with young children. A total of 32 parents completed the questionnaire regarding their child's unusual sitting posture. As a result, two sitting postures were included in this experiment, namely sitting at the front of the chair and sitting crossed-legged on the chair. Finally, five postures were selected to reflect the representative postures in the previous studies and the sitting behavior characteristics of the children. The selected postures included sitting straight, lean left, lean right, sitting at the front of the chair and sitting crossed-legged on the chair. The description and images of these sitting postures were presented in Table 2.

Data Collection
The children sat in front of their desks while watching children's entertainment videos. They took predefined sitting postures in accordance with the instructions of one researcher. After the other researchers confirmed that the participants took their sitting postures well, the pressure data was collected. The experimental environment is shown in Figure 7. The stored pressure sensor data was converted to a heatmap using Plot.ly's in R, which is an open source library for data visualization. The data were measured for 5 min per posture. The total experiment time was 45 min and the participants were able to take rest sufficiently during the experiment. A total of 260 body pressure distribution data were collected for one sitting posture, and a total of 13,000 sitting posture data were collected.

Sitting Posture Front and Side Views of Sitting Postures Description Heat Map Images of Bo Pressure Distribution
Sitting

Sitting Posture Front and Side Views of Sitting Postures Description Heat Map Images of Body Pressure Distribution
Sitting

Sitting Posture Classification Model: LeNet-5
All classification models are coded in Python. The CNN algorithm uses the library TensorFlow, and the remaining machine learning algorithms use the library scikit-learn. LeNet-5 was applied as the CNN algorithm for classifying the pressure distribution data according to the sitting posture. LeNet-5 is the CNN architecture generally used for handwritten digit recognition and was successfully used to solve other visual related problems [48,49]. LeNet-5 comprises seven layers that do not include the input. It consists of two types of convolutional layers, two types of subsampling layers, fully connected layer with ReLU (Rectified Linear Unit) and Softmax, and an output layer.
As the LeNet-5 model is a model used for small-sized datasets, some modifications have been made to this study. As the size of the image has increased and the number of images to be classified has changed, the model is required to be modified. The input layer has been modified to accommodate the size of the image, and the two fully connected layer has been modified accordingly. Detailed information of the structure of the applied model was shown in Figure 8. The NB classifier with a prior empirical distribution was used. For the DT algorithm, the minimum parent size was adjusted as 5, 10, 15, 20, 25, and 30. In addition, pruning was allowed for this method. In the case of NN analysis, a backpropagation algorithm was applied to the data training process. In the output layer of the last hidden layer, a Softmax function was used to determine the class labels. Excluding for this case, the hyperbolic tangent sigmoid function was used for all the activation functions. To achieve the optimal accuracy, one to three hidden layers were selected in this study, and the number of neurons per layer was set as 20, 30, and 40. For the SVM, we employed the Gaussian kernel function, , = exp | | , of which corresponding feature mapping is infinite dimensional. A kernel scaling factor, , was set as 0.01, 0.05, and 0.1. In addition, the

Sitting Posture Classification Model: LeNet-5
All classification models are coded in Python. The CNN algorithm uses the library TensorFlow, and the remaining machine learning algorithms use the library scikit-learn. LeNet-5 was applied as the CNN algorithm for classifying the pressure distribution data according to the sitting posture. LeNet-5 is the CNN architecture generally used for handwritten digit recognition and was successfully used to solve other visual related problems [48,49]. LeNet-5 comprises seven layers that do not include the input. It consists of two types of convolutional layers, two types of subsampling layers, fully connected layer with ReLU (Rectified Linear Unit) and Softmax, and an output layer.
As the LeNet-5 model is a model used for small-sized datasets, some modifications have been made to this study. As the size of the image has increased and the number of images to be classified has changed, the model is required to be modified. The input layer has been modified to accommodate the size of the image, and the two fully connected layer has been modified accordingly. Detailed information of the structure of the applied model was shown in Figure 8.

Sitting Posture Classification Model: LeNet-5
All classification models are coded in Python. The CNN algorithm uses the library TensorFlow, and the remaining machine learning algorithms use the library scikit-learn. LeNet-5 was applied as the CNN algorithm for classifying the pressure distribution data according to the sitting posture. LeNet-5 is the CNN architecture generally used for handwritten digit recognition and was successfully used to solve other visual related problems [48,49]. LeNet-5 comprises seven layers that do not include the input. It consists of two types of convolutional layers, two types of subsampling layers, fully connected layer with ReLU (Rectified Linear Unit) and Softmax, and an output layer.
As the LeNet-5 model is a model used for small-sized datasets, some modifications have been made to this study. As the size of the image has increased and the number of images to be classified has changed, the model is required to be modified. The input layer has been modified to accommodate the size of the image, and the two fully connected layer has been modified accordingly. Detailed information of the structure of the applied model was shown in Figure 8. The NB classifier with a prior empirical distribution was used. For the DT algorithm, the minimum parent size was adjusted as 5, 10, 15, 20, 25, and 30. In addition, pruning was allowed for this method. In the case of NN analysis, a backpropagation algorithm was applied to the data training process. In the output layer of the last hidden layer, a Softmax function was used to determine the class labels. Excluding for this case, the hyperbolic tangent sigmoid function was used for all the activation functions. To achieve the optimal accuracy, one to three hidden layers were selected in this study, and the number of neurons per layer was set as 20, 30, and 40. For the SVM, we employed the Gaussian kernel function, , = exp | | , of which corresponding feature mapping is infinite dimensional. A kernel scaling factor, , was set as 0.01, 0.05, and 0.1. In addition, the The NB classifier with a prior empirical distribution was used. For the DT algorithm, the minimum parent size was adjusted as 5, 10, 15, 20, 25, and 30. In addition, pruning was allowed for this method. In the case of NN analysis, a backpropagation algorithm was applied to the data training process. In the output layer of the last hidden layer, a Softmax function was used to determine the class labels. Excluding for this case, the hyperbolic tangent sigmoid function was used for all the activation functions. To achieve the optimal accuracy, one to three hidden layers were selected in this study, and the number of neurons per layer was set as 20, 30, and 40. For the SVM, we employed the Gaussian kernel function, k(x, y) = exp (−γ||x − y|| 2 ), of which corresponding feature mapping is infinite dimensional. A kernel scaling factor, γ, was set as 0.01, 0.05, and 0.1. In addition, the fractional error bound ν, the maximum fraction of misclassified instances, was set as 0.01 and 0.05 to prevent overfitting.

Training and Test Procedure
Individual validation was applied to the training and testing of the dataset. The data collected by each individual should not affect the learning because of the individual differences. Therefore, we used nine data as the training data and one data as the test data. Thus, a total of 10 training and test results were obtained. The 10 results included the accuracy of the classification results, the precision of the individual class, and the recall value. Thus, the individual validation results for individual participant data were calculated. Accuracy refers to the percentage of precisely fitting across the entire case. The precision refers to the accuracy of the detection. That is, it indicates how many actual objects are included among the detection results. The recall means detection rate which refers to how well the target objects are detected without being missed. The confusion matrix and equations of each value were described in Table 3.

Results Comparison with Individual Validation
The maximum and minimum accuracy for NB were 0.995 and 0.656, respectively. The average accuracy for NB was 0.871 (Std. = 0.113). For MLR, the minimum and maximum accuracies were 0.737 and 0.977, respectively. The average accuracy was 0.845 (Std. = 0.091). In case of DT, all the average accuracies were less than 80% for the selected parent sizes. The minimum and maximum average accuracies for each parent size were 0.787 and 0.794, respectively, with a difference of less than 1%. The highest average accuracy was 0.794 (Std. = 0.083) when the parent size was 15, and the smallest standard deviation was obtained for a parent size of 10 (Std. = 0.077). For NN, training and test accuracies were obtained under the conditions of number of layers and neurons. Except for the accuracy of the combination of one layer and 20 neurons, all the results showed that the classification performance was higher than 90%. The highest value was 0.921 for the combination of one layer and 40 neurons. In the case of this combination, the standard deviation also showed the smallest value (Std. = 0.057), which could indicate that this combination is optimal for this data. In case of SVM, the best accuracies of test were acquired under the condition of the combination of the values of c (lambda) and gamma. The average accuracy was found to be greater than 0.9 for all the conditions except for the gamma value of 0.5. The overall accuracy was highest at 0.942 (Std. = 0.057) for the condition in which the lambda was 0.05 and gamma was 0.05. Table 4 shows the results of the CNN model in comparison with the rest of the models used in this study. For DT, NN, and SVM, the accuracy is presented for combinations of parameters that showed the best results. The individual validation results of CNN showed that the accuracy was at least 0.902, and the maximum accuracy was 1.000. The average accuracy was 0.953 (Std. = 0.037). Among learning algorithms, the performance of DT was the lowest. NB and MLR showed an accuracy of more than 80% but not more than 90%. NN, SVM, and CNN showed an accuracy of more than 90%. The performance of CNN was better than all the other learning algorithms in terms of average accuracy and standard deviation.

Results of Confusion Matrix for CNN
Although the current results showed an accuracy of over 95%, further analysis of the classification pattern of each participant and posture and of the physical characteristics of the participants is required to improve the accuracy. Further analysis is presented only for CNN as the results of CNN were superior to those of the other algorithms.
The results of the confusion matrix for each participant using CNN is shown in Figure 9. As shown in Figure 9, the classification accuracy of all the participants was above 0.900. In particular, in the case of #10 participants, the classification accuracy for all the postures was 1.000.

Relationship between Group of Body Weight and Accuracy
The mean values of the precision and recall for each sitting posture of the posture classification results are shown in Figure 10.

Relationship between Group of Body Weight and Accuracy
The mean values of the precision and recall for each sitting posture of the posture classification results are shown in Figure 10. For the experimental analysis, the participants were divided into three groups based on their weight. The results for each group precision, recall, and accuracy are shown in Figure 11. All three performance metrics, precision, recall, and accuracy had the lowest values in the 20 kg group. In addition, the value of each index showed a tendency to increase with a group of a higher body weight. Figure 11. Results of the relationship between weight groups and performance metrics (precision, recall, and accuracy).
The results of the ANOVA by weight group are shown in Table 5. There was a significant difference in precision between the weight groups at the 90% confidence level for postures (c) and (d). In the case of recall, it was confirmed that only the E posture was significantly different at the 90% confidence level. Finally, in the case of the accuracy, the p-value was the lowest among all the indices (p = 0.007). For the experimental analysis, the participants were divided into three groups based on their weight. The results for each group precision, recall, and accuracy are shown in Figure 11. All three performance metrics, precision, recall, and accuracy had the lowest values in the 20 kg group. In addition, the value of each index showed a tendency to increase with a group of a higher body weight. For the experimental analysis, the participants were divided into three groups based on their weight. The results for each group precision, recall, and accuracy are shown in Figure 11. All three performance metrics, precision, recall, and accuracy had the lowest values in the 20 kg group. In addition, the value of each index showed a tendency to increase with a group of a higher body weight. Figure 11. Results of the relationship between weight groups and performance metrics (precision, recall, and accuracy).
The results of the ANOVA by weight group are shown in Table 5. There was a significant difference in precision between the weight groups at the 90% confidence level for postures (c) and (d). In the case of recall, it was confirmed that only the E posture was significantly different at the 90% confidence level. Finally, in the case of the accuracy, the p-value was the lowest among all the indices (p = 0.007). The results of the ANOVA by weight group are shown in Table 5. There was a significant difference in precision between the weight groups at the 90% confidence level for postures (c) and (d).
In the case of recall, it was confirmed that only the E posture was significantly different at the 90% confidence level. Finally, in the case of the accuracy, the p-value was the lowest among all the indices (p = 0.007).

Discussion
The objective of this study is to classify the sitting postures of children using pressure sensors installed on the seat using the CNN algorithm. To verify the applicability of this algorithm, several learning algorithms including NB, DT, NN, and SVM were applied to the same data set for the classification of the sitting postures. The accuracy of DT was 0.794, which was the lowest among those of the applied models. The performance of NB and MLR was 0.871 and 0.845, respectively. The CNN, NN, and SVM algorithms showed more than 90% classification accuracy, and the CNN model showed the highest classification accuracy. The classification performance of SVM was 0.942, which approximately had a 1% difference as compared to that of the CNN. Besides that, the standard deviation obtained using CNN was smaller than that of SVM (std. = 0.037). It is essential in the context of classification that value of accuracy is high and the standard deviation value is small [50]. Therefore, CNN is noted to have shown the best result through high average accuracy value and small accuracy deviation between the participants.
Previous researches can be classified into cases wherein a pressure sensor is used only on the seat and wherein the pressure sensor is attached to both the seat and backrest. According to Meyer et al. [9], the accuracy was low when only the pressure data obtained from the seat was utilized without using the pressure data of the backrest. From this result, it can be inferred that under the same experimental conditions, more data can be utilized if the pressure information of the backrest is obtained in addition to that of the seat, thus resulting in a better classification performance. Therefore, it was more challenging to classify the sitting posture by using pressure data only from the seat and not from the backrest. Nonetheless, the average classification accuracy of CNN was 95.3%, which was not significantly lower than those in previous studies using both seat and backrest data [31,32] (See Table 1). Indeed, the classification performance of this study is better than that presented in several other studies [10,30]. These two previous studies classified sitting postures using the NB classifier, which was also used in this study. Although the absolute comparison of the classification performance is limited owing to differences in experimental conditions, it can be inferred that the NB algorithm is not suitable for posture classification given that the results of NB are not high in this study (See Table 4). Moreover, it can be observed that the various types of algorithms have been utilized in diverse methods in the area of research of posture classification. This suggests that attempts have been made at using various algorithms to realize higher classification performance rather than focusing on specific algorithms based on optimal experimental setups. Therefore, according to these trends, the application of CNN, which has not been employed in such an application thus far, can be considered as a valuable new approach.
The classification performance obtained for familiar users is generally more than 90%; however, there were cases in which the performance of unfamiliar users was less than 80%. For instance, Tan et al. [31] reported 96% accuracy for a familiar user; but 79% accuracy for an unfamiliar user. In addition, in a study by Kamiya et al. [29], the result of the classification performance for unfamiliar users was more than 90%, but it was approximately 5% lower than that obtained for familiar users. Therefore, the classification of the sitting postures of unfamiliar users is a more challenging approach in the derivation of the classification accuracy. Furthermore, considering the case of the classification of the sitting postures of a new user after constructing the database in advance without utilizing the data of the existing user, an approach for predicting the unfamiliar user is required. Thus, this study adopted a more conservative verification method which identifies unfamiliar users.
In addition, in this study, the participants were children and, not adults, who were dominantly focused on in previous studies. The anthropometric characteristics of children are largely distributed as they are in the process of growing. In other words, the variance in the child's anthropometric variables, such as height and weight, is greater than that in adult groups. This encouraged us to conduct further analysis based on the characteristics of the participants, especially their weight. As a result, the classification performance of the low weight group was lower than that of the high weight group. This result is consistent with that obtained in a previous study [51]. The authors of this previous study divided the groups according to the level of body mass index (BMI) and compared the classification performance. The results showed that light intensity group (low BMI) had a lower classification performance than the moderate intensity group (normal BMI). Although this result was obtained using data from adults, it implies the same issue as this current study: when performing experiments on groups of similar age, anthropometric characteristics such as body weight may influence the classification accuracy. Therefore, it is necessary to consider the human body characteristics such as weight in the posture classification. Thinking in the other direction, it is expected that a higher accuracy would be obtained if weight is used as an input variable in the classification of sitting postures. Therefore, it would be necessary to construct a database and train the obtained data by weight groups.
The current paper has collected data for 10 children, but more data will be obtained in the future to utilize a larger data size for classifying the sitting posture. Since the posture can be sufficiently classified through the system constructed in this study, it is expected that the reliability of the result can be guaranteed by acquiring more data. In addition, future studies will expand the classification method and plan to use genetic algorithms considering the learning phase and techniques [52].

Conclusions
In this study, we developed a real-time sitting posture classification system for the correct sitting postural habit formation for children. Five sitting postures were selected for classification. The CNN algorithm performs well in this type of posture classification and has been shown to outperform other machine learning algorithms. In addition, as compared to previous studies, the average value of accuracy (0.953) for the sitting posture obtained using CNN was confirmed to be suitable for the development of a smart chair. As a result of examining the accuracy of the posture classification by the weight of the children, it was found that the accuracy is higher if the weight of the group is higher. This result could not be generalized owing to the limitation of this study being that the number of participants in the weight group is small. Thus, further research will be conducted to develop a sensing cushion that has an optimal accuracy for each weight group by adjusting the position or number of sensors. Because children have a higher correlation between body shape and body weight than adults, there is a difference in the active area of the pressure sensor depending on body weight. In addition, there are cases where the different sitting postures represent the same pressure pattern because the difference in body size is large depending on the body weight. For example, the pressure sensor pattern might be similar when a child in a small body size is sitting upright in a chair and when a large child is sitting in front of a chair. In this study, the pressure sensor mat which occupies most of the seat area is used, and the position of the individual pressure sensor is arranged at equal intervals. However, to overcome the misclassification due to the body characteristics according to the weight, further studies are required to discover the optimum pressure sensor position and number. Precisely classifying the sitting posture when using a chair helps prevent the occurrence of musculoskeletal disorders in users and improves their postural habit as well as efficiency in work or study. This study is expected to be used as a fundamental research for developing a smart chair that can induce a correct sitting posture in children by using a sensor-based cushion.