Next Article in Journal
Orbital Adipose Tissue: The Optimal Control for Back-Table Fluorescence Imaging of Orbital Tumors
Next Article in Special Issue
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundation Models
Previous Article in Journal
Comparison of Bioengineered Scaffolds for the Induction of Osteochondrogenic Differentiation of Human Adipose-Derived Stem Cells
Previous Article in Special Issue
Machine Learning for Biomedical Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning Approach for Predicting Hypertension Based on Body Composition in South Korean Adults

1
Digital Health Research Division, Korea Institute of Oriental Medicine, Daejeon 34504, Republic of Korea
2
KM Data Division, Korea Institute of Oriental Medicine, Daejeon 34504, Republic of Korea
*
Author to whom correspondence should be addressed.
Bioengineering 2024, 11(9), 921; https://doi.org/10.3390/bioengineering11090921
Submission received: 3 August 2024 / Revised: 28 August 2024 / Accepted: 12 September 2024 / Published: 14 September 2024
(This article belongs to the Special Issue Machine Learning Technology in Biomedical Engineering—2nd Edition)

Abstract

:
(1) Background: Various machine learning techniques were used to predict hypertension in Korean adults aged 20 and above, using a range of body composition indicators. Muscle and fat components of body composition are closely related to hypertension. The aim was to identify which body composition indicators are significant predictors of hypertension for each gender; (2) Methods: A model was developed to classify hypertension using six different machine learning techniques, utilizing age, BMI, and body composition indicators such as body fat mass, lean mass, and body water of 2906 Korean men and women; (3) Results: The elastic-net technique demonstrated the highest classification accuracy. In the hypertension prediction model, the most important variables for men were age, skeletal muscle mass (SMM), and body fat mass (BFM), in that order. For women, the significant variables were age and BFM. However, there was no difference between soft lean mass and SMM; (4) Conclusions: Hypertension affects not only BFM but also SMM in men, whereas in women, BFM has a stronger effect than SMM.

1. Introduction

South Korea is currently facing serious issues of low birth rates and an aging population. As the population of newborns decreases, the average age of society members rises, emphasizing the importance of health management for those actively participating in economic activities [1]. Consequently, there is a need for managing various chronic adult diseases. However, access to clinical assistance has limitations in terms of accessibility and cost, highlighting the need for methods that can be easily managed at home [2]. Common methods that can be relatively easily monitored and controlled in daily life include weight and blood pressure. More specifically, body composition measurement devices have been released, allowing for more detailed analysis of the composition of different parts of the body, thus facilitating easier health management.
Body composition refers to the components that make up the human body, serving as an indicator of individual health characteristics through an expanded interpretation at the molecular level [3]. The molecular components of body composition include body fat, body water, protein, and minerals. Various body composition analyzers are used to measure these components. For body composition analysis, a small constant current of microamperes is sent through the body, and the voltage is measured to calculate the impedance, which is then analyzed [4]. Compared to the body mass index (BMI), which is the ratio of weight to height, body composition analysis is known to have higher reliability and validity as it provides a more detailed breakdown of body components [5]. There are various indicators for body composition analysis, including skeletal muscle mass, fat mass, muscle mass by body part, and body water, which are known to be indicators that can identify the presence of various diseases [6]. In particular, fat mass and muscle mass have been shown in various prior studies to be potential predictors of metabolic syndrome diseases such as hypertension, as well as cardiovascular diseases like arteriosclerosis [7,8]. Among these metabolic and cardiovascular diseases, a key clinical risk factor is hypertension. Hypertension is a straightforward indicator that can be checked using a simple blood pressure monitor; however, its measurement is limited to the systolic and diastolic values and it does not provide insights into the internal causes and conditions of the body that reflect these blood pressure values [9]. Understanding how the conditions of fat or muscle at specific body locations influence these values requires additional statistical methods and clinical interpretations to prove causality. Therefore, to manage such cardiovascular diseases, it is necessary to design or plan clinical approaches and personal health management methods based on the correlation and connectivity of more detailed elements and hypertension. Various prior studies have been conducted to develop predictive models based on the correlation between body composition indicators and hypertension for this purpose. In a study that examined the associations of body composition with blood pressure and hypertension using indicators from dual-energy X-ray absorptiometry of body composition and total skeletal muscle, body fat percentage and fat distribution were suggested as effective features for the model in predicting hypertension [10]. Another prior study examined the correlation between body size and composition and elevated blood pressure in adolescent girls, confirming the association between fat-free mass and blood pressure [11]. Another study developed a machine learning-based model to predict hypertension using body composition features without distinguishing between genders. In this study, body fat mass was included as a feature in the model, which achieved up to 90% prediction accuracy [12]. These prior studies commonly identified fat mass among body composition elements as a common indicator for classifying or predicting hypertension.
In addition to common indicators, various variables are presented as factors of hypertension but the results of most hypertension prediction models vary depending on the gender and demographic characteristics of the subjects. In addition, it is necessary to develop a prediction model using indicators that affect the occurrence of hypertension. Therefore, in order to identify hypertension indicators that take into account the characteristics of Koreans, priority should be given to identifying variables that differ in each gender.
This study aims to develop prediction models that can classify the presence of hypertension in Korean adults aged twenty and above, using various machine learning techniques and distinguishing between the models for men and women. This study also aims to verify the importance of the features included in the models.

2. Materials and Methods

2.1. Data Sources and Participant Selection

In this study, data collected through cross-sectional research were used from April 2022 to December 2022 at five hospitals in the Republic of Korea, including Dongguk University Ilsan Oriental Hospital, Pusan National University Korean Medicine Hospital, Gachon University Gill Hospital, Dongshin University Korean Medicine Hospital, and Semyung University Korean Medicine Hospital. This study was approved by the Institutional Review Boards of each hospital and was conducted according to the Declaration of Helsinki (IRB No., Dongguk University Ilsan Oriental Hospital, DUIOH-2022-01-005; Pusan National University Korean Medicine Hospital, PNUKHIRB-2022-02-001; Gachon University Gill Hospital, GIRB-22-101; Dongshin University Korean Medicine Hospital, NJ-IRB-013; Semyung University Korean Medicine Hospital, SMJOH-2022-06). All participants signed an informed consent form prior to enrollment.
A total of 3000 participants were assessed for eligibility for inclusion in the study. Of the 3000 participants, 2986 completed the clinical study after excluding individuals who did not meet inclusion/exclusion criteria. The inclusion criteria were as follows: (1) adults aged 19 years or older; (2) those without cognitive disorder; (3) those who voluntarily decided to participate after fully understanding the study, and agreed to participate in the study via written consent. The exclusion criteria were as follows: (1) those who were unable to measure their health status with a device due to mobility difficulties; (2) those who were diagnosed by a doctor with diseases such as cardiovascular diseases, cerebrovascular diseases, malignant neoplasms, mental diseases, arthritis, or thyroid disease, and have been suffering from them since diagnosis; (3) women who were pregnant, might be pregnant, or were menstruating; (4) those without a mobile phone in their name; (5) persons judged by the principal investigator to be unsuitable for participation. Among 2986 participants who completed the study, 80 participants with errors in the measurements of body composition were excluded. Finally, 820 men and 2086 women were included in the analysis (Figure 1).
The hypertension group was defined by two questions: “Have you ever been diagnosed with hypertension by a physician?” and “Have you been suffering from hypertension since diagnosis”. Participants who answered “Yes” to both questions were classified into the hypertension group, and those who did not answer “Yes” to both questions were classified into the non-hypertension group.

2.2. Body Composition Measurement

Body composition indicators were calculated after measurement by a well-trained clinical research coordinator according to standardized protocols using a multifrequency bioelectric impedance analyzer (BWA 2.0; In-Body Co., Seoul, Republic of Korea). Subjects were asked to wear light clothes and to have a minimum fasting period of four hours for the measurement. The measurement was conducted after waiting for five minutes with the subject lying on the bed and wearing clamp-type sensors on both wrists and ankles. Individual variables for body fat mass, lean mass, and body water were obtained (Table 1).

2.3. Model Development and Assessment

All statistical analyses were implemented using R version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria) [13]. In all statistical tests, two-tailed tests were applied and a significance level of 0.05 was used. To take sex-specific differences in body composition indicators into account, all statistical analyses were performed by sex. Data were randomly split into training and test sets in a 7:3 ratio. The training set was used for training and validation of model development and the test set was used for final model evaluation. To compare general characteristics and body composition indicators between the training set and test set, independent two-sample t-tests and chi-square tests were used for continuous and categorical variables in each sex, respectively. To compare general characteristics and body composition indicators between hypertension and non-hypertension groups, independent two-sample t-tests were used in the training set by sex. In each sex, simple logistic regression analyses were used in the training set to identify the association of individual body composition indices with hypertension.
To build the models, previously developed machine learning algorithms were applied using the Caret R package [14,15,16]. The Caret package has several functions, including data splitting, data preprocessing, feature selection, automatic hyperparameter tuning, training, variable importance estimation, and model assessment, which can streamline the processes of model development and evaluation.
The hypertension classification models using the combined body composition indicators were developed in the training set through six machine learning algorithms: elastic-net-regularized logistic regressions (E-net), k-nearest neighbor (K-NN), random forest (RF), support vector machine (SVM), extreme gradient boosting (XGBoost), and neural network (NN). E-net is a penalized linear or logistic regression method that combines ridge and lasso regressions, and has functions of efficient shrinkage regularization and feature selection [17]. Additionally, E-net is an effective regression method that secures the stability and generality of the model by reducing the impact of multicollinearity when multiple highly correlated explanatory variables are included. K-NN is a non-parametric method for classification and regression that works based on the similarity of data points in a given dataset [18]. K-NN is a simple and easy-to-implement method. RF is an ensemble machine learning method for classification, regression, and unsupervised learning based on a set of multiple trees [19]. RF is known to be robust to data with missing values, handles large and complex data, and reduces overfitting. SVN is a machine learning algorithm for linear or non-linear classification and regression that solves classification problems by constructing hyperplanes with optimal separation and maximum margin between individual classes [20]. SVM is known to be effective for high-dimensional data and can handle complex or non-linear relationships using kernel functions. XGBoost is a decision-tree-based ensemble machine learning method for classification and regression [21]. XGBoost can control model complexity and overfitting by adding a regular term in the loss function. NN is a machine learning algorithm that works similarly to the neural pathways in the human brain [22]. NN is composed of three connected layers of input, hidden and output, and is known to be powerful in capturing complex patterns and relationships in complex data. The hyperparameters were tuned to obtain the maximum area under the receiver operating characteristic curve (AUROC) using 5-fold stratified cross-validation in the training set to be randomly divided into the training and validation subsets in a 4:1 ratio. To investigate the contribution of body composition variables selected in the models, the relative importance of the variables was calculated.
The performance of each model was evaluated by constructing a receiver operating characteristic (ROC) curve and a confusion matrix for binary classification in the test set [23,24,25]; AUROC was obtained from the ROC curve, and kappa, F1 score, precision, accuracy, sensitivity, and specificity were obtained from the confusion matrix. The confusion matrix depends on the chosen threshold value. To obtain the optimal threshold value for each model, Youden’s index was used. AUROC is obtained by calculating the area under the ROC curve. The ROC curve is a curve drawn according to different threshold settings from 0 to 1, with the vertical axis representing sensitivity and the horizontal axis representing 1-specificity. To compare the AUROC of the model with the best AUROC value to the AUROC of each of the other models, p values were derived using the z score [26,27]. The accuracy is calculated as the proportion of accurately predicted samples to the total number of samples. Sensitivity and specificity refer to the proportion of predicted positive samples to the total number of observed positive samples and the proportion of predicted negative samples to the total number of observed negative samples, respectively. Precision presents the proportion of observed positive samples to the total number of predicted positive samples. F1 score is a combined measure of precision and sensitivity, calculated as the harmonic mean of precision and sensitivity. Kappa indicates the degree of agreement between the predicted and observed outcomes, and is calculated taking into account coincidence. The measures of performance evaluation were presented along with the 95% confidence interval (CI) obtained from 2000 bootstrap repetitions.

3. Results

3.1. General Characteristics and Body Composition

The data were randomly divided into the training and test sets in a 7:3 ratio by sex. Of a total of 820 men, 575 were included in the training set and 245 were included in the test set. Among a total of 2086 women, 1462 were included in the training set and 624 were included in the test set. In both men and women, there were no significant differences between the training and test sets in the percentage of participants in the hypertension group, or general characteristics such as age, temperature, blood pressure, pulse rate, height, weight, BMI, and other body composition variables (Table 2).
Of a total of 575 men in the training set, 75 were included in the hypertension group and 500 were included in the non-hypertension group. Among the 1462 women in the training set, 108 were included in the hypertension group and 1354 were included in the non-hypertension group. For men, there were significant differences between the hypertension and non-hypertension groups in age, blood pressure, height, and BMI in the training set. For women, there were significant differences between the hypertension and non-hypertension groups in age, blood pressure, height, weight, and BMI in the training set. For both men and women, significant differences were found between the hypertension and non-hypertension groups in several variables related to body fat mass, lean mass, and body water as well as additional variables such as visceral fat level, visceral fat area, proportion of weight to ideal weight, proportion of total body water to fat-free mass, and the 50 kHz phase angle of the trunk in the training set (Table 3).

3.2. Association of Individual Body Composition Variables with Hypertension

To investigate the association between individual body composition variables and hypertension, crude analyses were conducted in the training set (Table 4). Older men and women were more likely to be classified into the hypertension group (odds ratio [95% CI]; men, 4.32 [3.09, 6.06]; women, 3.36 [2.57, 4.4]). Men and women with higher BMI were also more likely to be in the hypertension group (men, 1.5 [1.2, 1.89]; women, 1.82 [1.55, 2.14]). For both men and women, hypertension was positively associated with most body fat mass variables. Men and women with higher percentages of body fat mass to weight were more likely to be in the hypertension group (men, 2.7 [2.02, 3.61]; women, 2.2 [1.79, 2.71]). In men, hypertension was negatively associated with lean mass variables of soft lean mass and skeletal muscle mass, whereas in women, hypertension had no association with these variables. Men and women with higher percentages of skeletal muscle mass to weight were less likely to be in the hypertension group (men, 0.33 [0.24, 0.44]; women, 0.44 [0.35, 0.54]). In men, hypertension was negatively associated with total body water, intracellular water, and extracellular water, whereas in women, hypertension was not associated with any of those variables. Men and women with higher percentages of total body water to weight were less likely to belong to the hypertension group (men, 0.38 [0.29, 0.51]; women, 0.46 [0.37, 0.57]). Men and women with higher proportions of extracellular water to total body water were more likely to belong to the hypertension group (men, 2.08 [1.61, 2.69]; women, 1.62 [1.33, 1.99]). In both men and women, hypertension was positively associated with visceral fat level, visceral fat area, proportion of weight to ideal weight, and proportion of total body water to fat-free mass. Men and women with a higher 50 kHz phase angle of the trunk were less likely to be in the hypertension group (men, 0.36 [0.27, 0.49]; women, 0.69 [0.56, 0.84]).

3.3. Comparison of Performance of the Models

The ROC curve was drawn and the AUROC value was calculated for each model in the test set (Figure 2, Table 5). AUROC values of the six models ranged from 0.783 to 0.865 for men and 0.774 to 0.831 for women. For both men and women, the E-net model reported the highest AUROC (AUROC [95% CI]; men, 0.865 [0.806, 0.914]; women, 0.831 [0.78, 0.881]). This was followed by the NN model with an AUC of 0.853 [0.797, 0.904] and the XGBoost model with an AUC of 0.827 [0.761, 0.883] for men, and was followed by the XGBoost model with an AUC of 0.826 [0.772, 0.876] and the NN model with an AUC of 0.822 [0.771, 0.871] for women. For both men and women, the SVM model has the lowest AUROC (men, 0.783 [0.697, 0.857]; women, 0.774 [0.712, 0.831]). Comparing the AUROC of the model with the best AUROC value to the AUROC of each of the other models, there were significant differences in AUROC between the E-net model and the RF model and between the E-net model and the SVM model in men, while there were no significant differences in AUROC between the E-net model and each of the other models in women.
Kappa, F1 score, precision, accuracy, sensitivity, and specificity were calculated using the classification confusion matrix obtained based on the optimal threshold for each model (Table 5). In men, the NN model exhibited the best kappa, F1 score, precision, and accuracy and achieved a well-balanced sensitivity and specificity (kappa, 0.365 [0.257, 0.469]; F1 score, 0.489 [0.37, 0.586]; precision, 0.33 [0.232, 0.424]; accuracy 0.743 [0.69, 0.796], sensitivity 0.941 [0.833, 1], specificity 0.714 [0.653, 0.771]). This was followed by the E-net model and then the XGBoost model. In women, the XGBoost model had the highest kappa, F1 score, precision, and accuracy and a good balance between sensitivity and specificity (kappa, 0.265 [0.186, 0.345]; F1 score, 0.347 [0.262, 0.43]; precision, 0.223 [0.162, 0.291]; accuracy 0.79 [0.758, 0.821], sensitivity 0.78 [0.653, 0.896], specificity 0.791 [0.758, 0.824]). The XGBoost model was followed by the NN model and then the E-net model.
As a result of the model evaluation, the E-net model showed the best performance on most evaluation metrics for both men and women, followed by the XGBoost model and the NN model. For each final model, a selected subset of variables and their importance were obtained (Figure 3). In the E-net model, three variables were selected for men: age; body fat mass percentage of trunk; and percentage of total body water to weight. In the E-net model, nine variables were selected for women: age; visceral fat level; proportion of total body water to fat-free mass; BMI; proportion of weight to ideal weight; body fat mass of trunk; body fat mass percentage of the left leg; percentage of body fat mass to weight; and percentage of total body water to weight. In the XGBoost model and the NN model, age and the 50 kHz phase angle of the trunk were selected with relatively high importance for men, and age and proportion of total body water to fat-free mass were selected with relatively high importance for women.

4. Discussion

In this study, we examined the correlations and differences in various body composition elements measured in Korean adult men and women based on the presence or absence of hypertension in each group. We then assessed the hypertension prediction accuracy when variables selected for their importance were used as features in a hypertension prediction model.
Before developing the model using machine learning techniques, we identified the variable importance factors, finding differences not only between men and women but also among the machine learning techniques themselves. Age was the common importance variable selected across six different machine learning techniques. The correlation between blood pressure and age has already been confirmed in various studies. In the study “On the interpretability of machine learning-based model for predicting hypertension”, age was presented as the highest importance feature for predicting the high risk of hypertension [28]. As age increases, blood vessels thicken and lose elasticity, and the decrease in heart pump function tends to increase systolic blood pressure [29]. This correlation is similarly mentioned in epidemiological studies like the Framingham Heart Study and other prior research that examined age-related changes in the risk of high blood pressure [30,31].
In Figure 3, to classify hypertension and non-hypertension, we identified the characteristics of the selected variables in both male and female groups. The common important variables for predicting hypertension in men were age, followed by SMM_WT, and then BFMp. In women, it was the opposite, with BFMp first, followed by SMM_WT. Prior research indicates that blood pressure is inversely associated with muscle mass and positively with fat mass [32]. Generally, men have a higher proportion of skeletal muscle mass and less fat mass compared to women. Besides this influence, the SLM and SMM of the recruited women in this study showed no statistical difference between the hypertension and non-hypertension groups; only the percentage divided by weight, SMM_WT, showed a difference, which seems to reflect the characteristics of these groups.
In Table 3, we identified the body composition variables with significant differences between the hypertension and non-hypertension groups for both men and women. In both groups, there was a statistically significant difference in the overall variable of body fat mass. However, for lean mass and muscle mass, the difference was observed only in the male group, with the hypertension group’s SLM, SMM, and SMM_WT being relatively lower than in the non-hypertension group. Overall, hypertension affects not only body fat mass but also muscle mass in men, whereas in women, body fat mass has a more significant impact than muscle mass.
In Table 4, examining the odds ratio results from the logistic regression analysis, it is evident that for both men and women, the hypertension group has approximately double the body fat mass variables compared to the non-hypertension group. Looking at the muscle variables, in men, the muscle mass in the hypertension group is about 0.33 to 0.67 times that of the non-hypertension group. This indicates that, for men, muscle mass is a more critical factor, suggesting that efforts to increase muscle mass, in addition to reducing body fat, are necessary for preventing hypertension. According to prior research, when the percentage of body fat exceeds 35%, it has a significant impact compared to when it is below 35%, with this impact being relatively higher in women than in men [33].
In Table 5, the performance metrics for the test set, commonly used as a classification performance indicator, are shown in both the men’s and women’s groups, where the E-net model demonstrated the best performance.
Elastic-net regression is one of the regression methods developed to overcome the drawbacks of Ridge and Lasso regressions [34]. Elastic-net regression applies a combination of penalties and offers the advantage of performing both variable selection and model shrinkage simultaneously [35]. Additionally, the results of logistic regression can be quantitatively explained by using the odds ratio to interpret the weight of the variables between the two groups.
Among the men’s group, the AUROC values for the five models, excluding E-net, were relatively high, ranging from 0.7 to 0.8. In the validation results based on the test set, NN, XGBoost, and K-NN showed the highest performance in that order. All three machine learning techniques, except for RF and SVM, showed no significant difference in the AUROC test values compared to E-net. When examining other metrics besides the AUROC, such as F1 score, precision, accuracy, sensitivity, and specificity, NN demonstrated the highest performance and exhibited a similar performance to E-net. The NN method has advantages in classifying complex patterns in data that cannot be explained by simple linear models. It also has advantages in handling high-dimensional data. However, its shortcomings include the potential for overfitting and the need for complex computations. The most significant disadvantage is the lack of explainability, as it is limited in providing reasons for its classifications.
In the women’s group, the AUROC for E-net was the highest, similar to the models in the men’s group. E-net also showed the best performance in sensitivity, while other metrics favored XGBoost. This is likely due to the influence of the selected variables. In the men’s group, three variables were selected by E-net, whereas in the women’s group, nine variables were selected. Having fewer selected variables offers advantages such as reduced curse of dimensionality, prevention of overfitting, increased computational efficiency, and reduced noise. Therefore, E-net’s best performance can be attributed to the smaller number of selected variables. Unlike the models in the men’s group, the women’s group models showed no statistically significant difference in AUROC compared to E-net. This suggests that when evaluating model performance based on AUROC, using various models can achieve similar classification performance. While model performance can be assessed using various criteria such as AUROC and other metrics, using classification techniques like E-net, which offer better explainability, may be advantageous for evaluating the weights or influence of each variable.
Previous studies using body composition features to classify hypertension and non-hypertension groups have demonstrated the feasibility of logistic regression for this purpose. For instance, classification models using variables obtained from Bioelectrical Impedance Analysis (BIA) achieved classification accuracies between 83% and 90% [12]. Another study developed classification models incorporating general characteristics such as age, height, weight, and BMI alongside body composition variables, achieving over 70% classification accuracy [36]. When classification accuracy is ensured, logistic regression models can identify the weights of each variable, contributing to the classification and prediction of hypertension.
The limitation of this study is the imbalance in the number of subjects between the hypertension and non-hypertension groups. Additionally, there is a need for more detailed investigations considering age-specific group characteristics to better understand the occurrence of hypertension across different age groups. Future research aims to address these limitations by including a larger sample of hypertension subjects, allowing for more detailed analysis and model development. In follow-up studies, we plan to compensate for limitations through more detailed analysis and model development by comparing more hypertension groups.

5. Conclusions

This study aims to develop a prediction model that can classify the presence or absence of hypertension in Korean adults aged 20 or older using various machine learning techniques. A common variable of interest across various machine learning techniques to investigate the association between individual body composition variables and hypertension was “age”. For men, the most important variables were skeletal muscle mass and body fat mass, and for women, they were BFM and SMM, in that order. Therefore, it can be concluded that for men, SMM is a more important factor, and efforts to increase muscle mass while reducing fat mass are important to prevent hypertension. Additionally, among the various classification techniques, E-net demonstrated the best performance when evaluating models based on the AUROC in both the men’s and women’s groups. It is important to choose and use a classification technique that aligns with the characteristics of the data in the group being classified, considering factors such as model explainability, the potential for overfitting, and the selected variables.

Author Contributions

Conceptualization, M.H.Y.; methodology, J.-W.S.; validation, S.L.; formal analysis, M.H.Y.; investigation, S.L.; resources, S.L.; data curation, M.H.Y.; writing—original draft preparation, J.-W.S.; writing—review and editing, M.H.Y.; visualization, M.H.Y.; supervision, M.H.Y.; project administration, S.L.; funding acquisition, S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant (“KSN1923111” & “KSN1824130”) from the Korea Institute of Oriental Medicine.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board (IRB No., Dongguk University Ilsan Oriental Hospital, DUIOH-2022-01-005; Pusan National University Korean Medicine Hospital, PNUKHIRB-2022-02-001; Gachon University Gill Hospital, GIRB-22-101; Dongshin University Korean Medicine Hospital, NJ-IRB-013; Semyung University Korean Medicine Hospital, SMJOH-2022-06).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. These data are not publicly available because of privacy concerns.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kim, E.H.; Lee, J.; Lee, S.A.; Jung, Y.W. Impact of Maternal Age on Singleton Pregnancy Outcomes in Primiparous Women in South Korea. J. Clin. Med. 2022, 11, 969. [Google Scholar] [CrossRef] [PubMed]
  2. Genaro, L.E.; Marconato, J.V.; Tagliaferro, E.P.d.S.; Pinotti, F.E.; Valsecki Júnior, A.; Adas Saliba, T.; Rosell, F.L. Home Care for the Elderly: An Integrated Approach to Perception, Quality of Life, and Cognition. Int. J. Environ. Res. Public Health 2024, 21, 539. [Google Scholar] [CrossRef]
  3. Heymsfield, S.B.; Ebbeling, C.B.; Zheng, J.; Pietrobelli, A.; Strauss, B.J.; Silva, A.M.; Ludwig, D.S. Multi-Component Molecular-Level Body Composition Reference Methods: Evolving Concepts and Future Directions. Obes. Rev. 2015, 15, 282–294. [Google Scholar] [CrossRef] [PubMed]
  4. Kyle, U.G.; Bosaeus, I.; De Lorenzo, A.D.; Deurenberg, P.; Elia, M.; Gómez, J.M.; Heitmann, B.L.; Kent-Smith, L.; Melchior, J.C.; Pirlich, M.; et al. Bioelectrical impedance analysis--part I: Review of principles and methods. Clin. Nutr. 2004, 23, 1226–1243. [Google Scholar] [CrossRef] [PubMed]
  5. Borga, M.; West, J.; Bell, J.D.; Harvey, N.C.; Romu, T.; Heymsfield, S.B.; Leinhard, O.D. Advanced body composition assessment: From body mass index to body composition profiling. J. Investig. Med. 2018, 66, 5. [Google Scholar] [CrossRef]
  6. Pi-Sunyer, X. Changes in body composition and metabolic disease risk. Eur. J. Clin. Nutr. 2019, 73, 231–235. [Google Scholar] [CrossRef]
  7. Li, Z.; Tong, X.; Ma, Y.; Bao, T.; Yue, J. Relationship between Low Skeletal Muscle Mass and Arteriosclerosis in Western China: A Cross-Sectional Study. Front. Cardiovasc. Med. 2021, 8, 735262. [Google Scholar] [CrossRef]
  8. Cleymaet, R.; D’Hondt, M.; Scheinok, T.; Malbrain, L.; De Laet, I.; Schoonheydt, K.; Dits, H.; Van Regenmortel, N.; Mekeirele, M.; Cordemans, C.; et al. Comparison of Bioelectrical Impedance Analysis (BIA)-Derived Parameters in Healthy Volunteers and Critically Ill Patients. Life 2024, 14, 27. [Google Scholar] [CrossRef]
  9. Ceniccola, G.D.; Castro, M.G.; Piovacari, S.M.F.; Horie, L.M.; Corrêa, F.G.; Barrere, A.P.N.; Toledo, D.O. Current technologies in body composition assessment: Advantages and disadvantages. Nutrition 2019, 62, 25–31. [Google Scholar] [CrossRef]
  10. Ye, S.; Zhu, C.; Wei, C.; Yang, M.; Zheng, W.; Gan, D.; Zhu, S. Associations of Body Composition with Blood Pressure and Hypertension. Obesity 2018, 26, 1644–1650. [Google Scholar] [CrossRef]
  11. Kim, S.T.; Song, Y.-H. Nutrition, Body Composition, and Blood Pressure in Children and Adolescents from the Korea National Health and Nutrition Examination Survey. Int. J. Environ. Res. Public Health 2022, 19, 13272. [Google Scholar] [CrossRef] [PubMed]
  12. Nematollahi, M.A.; Jahangiri, S.; Asadollahi, A.; Salimi, M.; Dehghan, A.; Mashayekh, M.; Roshanzamir, M.; Gholamabbas, G.; Alizadehsani, R.; Bazrafshan, M.; et al. Body composition predicts hypertension using machine learning methods: A cohort study. Sci. Rep. 2023, 13, 6885. [Google Scholar] [CrossRef] [PubMed]
  13. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. 2022. Available online: https://www.R-project.org/ (accessed on 1 April 2024).
  14. Kuhn, M.; Wing, J.; Weston, S.; Williams, A.; Keefer, C.; Engelhardt, A.; Cooper, T.; Mayer, Z.; Kenkel, B.; Team, C. Package ‘caret’. R J. 2020, 223, 7. [Google Scholar]
  15. Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw. 2008, 28, 1–26. [Google Scholar] [CrossRef]
  16. Weston, S.; Williams, A.; Keefer, C.; Engelhardt, A.; Cooper, T.; Mayer, Z.; Kenkel, B.; Benesty, M.; Lescarbeau, R.; Ziem, A.; et al. (The R Core Team) Caret: Classification and Regression Training; R Package Version 6.0-80. Available online: https://cran.r-project.org/web/packages/caret/caret.pdf (accessed on 11 September 2024).
  17. Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. 2005, 67, 301–320. [Google Scholar] [CrossRef]
  18. Cover, T.; Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inform. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
  19. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  20. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  21. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
  22. Yao, X. Evolving artificial neural networks. Proc. IEEE 1999, 87, 1423–1447. [Google Scholar]
  23. Stehman, S.V. Selecting and interpreting measures of thematic classification accuracy. Remote Sens. Environ. 1997, 62, 77–89. [Google Scholar] [CrossRef]
  24. Provost, F.; Kohavi, R. Guest editors’ introduction: On applied research in machine learning. Mach. Learn. 1998, 30, 127–132. [Google Scholar] [CrossRef]
  25. Huang, J.; Ling, C.X. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 2005, 17, 299–310. [Google Scholar] [CrossRef]
  26. Hanley, J.A.; McNeil, B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982, 143, 29–36. [Google Scholar] [CrossRef] [PubMed]
  27. Hanley, J.A.; McNeil, B.J. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 1983, 148, 839–843. [Google Scholar] [CrossRef]
  28. Elshawi, R.; Al-Mallah, M.H.; Sakr, S. On the interpretability of machine learning-based model for predicting hypertension. BMC Med. Inform. Decis. Mak. 2019, 19, 146. [Google Scholar] [CrossRef]
  29. Oh, Y.S. Arterial stiffness and hypertension. Clin. Hypertens. 2018, 24, 17. [Google Scholar] [CrossRef]
  30. Mahmood, S.S.; Levy, D.; Vasan, R.S.; Wang, T.J. The Framingham Heart Study and the epidemiology of cardiovascular disease: A historical perspective. Lancet 2014, 383, 999–1008. [Google Scholar] [CrossRef]
  31. Cheng, W.; Du, Y.; Zhang, Q.; Wang, X.; He, C.; He, J.; Jing, F.; Ren, H.; Guo, M.; Tian, J.; et al. Age-related changes in the risk of high blood pressure. Front. Cardiovasc. Med. 2022, 9, 939103. [Google Scholar] [CrossRef]
  32. Han, T.S.; Al-Gindan, Y.Y.; Govan, L.; Hankey, C.R.; Lean, M.E.J. Associations of body fat and skeletal muscle with hypertension. J. Clin. Hypertens. 2019, 21, 230–238. [Google Scholar] [CrossRef]
  33. Al-Sendi, A.M.; Shetty, P.; Musaiger, A.O.; Myatt, M. Relationship between body composition and blood pressure in Bahraini adolescents. Br. J. Nutr. 2003, 90, 837–844. [Google Scholar] [CrossRef]
  34. Gana, R. Ridge Regression and the Elastic Net: How Do They Do as Finders of True Regressors and Their Coefficients? Mathematics 2022, 10, 3057. [Google Scholar] [CrossRef]
  35. De Mol, C.; De Vito, E.; Rosasco, L. Elastic-net regularization in learning theory. J. Complex. 2009, 25, 201–230. [Google Scholar] [CrossRef]
  36. George, C.; Goedecke, J.H.; Crowther, N.J.; Jaff, N.G.; Kengne, A.P.; Norris, S.A.; Micklesfield, L.K. The Role of Body Fat and Fat Distribution in Hypertension Risk in Urban Black South African Women. PLoS ONE 2016, 11, e0154894. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flow chart of study sample selection.
Figure 1. Flow chart of study sample selection.
Bioengineering 11 00921 g001
Figure 2. ROC curves of the six models.
Figure 2. ROC curves of the six models.
Bioengineering 11 00921 g002
Figure 3. Relative importance of selected variables in men (a) and women (b).
Figure 3. Relative importance of selected variables in men (a) and women (b).
Bioengineering 11 00921 g003
Table 1. Description of body composition indicators.
Table 1. Description of body composition indicators.
VariablesDescription
Body fat mass
BFMBody fat mass
PBFPercentage of body fat mass to weight
FMIFat mass index
BFMp_WB †Body fat mass (%) of whole body
BFM_RABody fat mass of right arm
BFMp_RA †Body fat mass (%) of right arm
BFM_LABody fat mass of left arm
BFMp_LA †Body fat mass (%) of left arm
BFM_TRBody fat mass of trunk
BFMp_TR †Body fat mass (%) of trunk
BFM_RLBody fat mass of right leg
BFMp_RL †Body fat mass (%) of right leg
BFM_LLBody fat mass of left leg
BFMp_LL †Body fat mass (%) of left leg
Lean mass
SLMSoft lean mass
SMMSkeletal muscle mass
SMM_WTPercentage of skeletal muscle mass to weight
Body water
TBWTotal body water
ICWIntracellular water
ECWExtracellular water
TBW_WTPercentage of total body water to weight
ECW_TBWProportion of extracellular water to total body water
ECW_TBW_RAProportion of extracellular water to total body water of right arm
ECW_TBW_LAProportion of extracellular water to total body water of left arm
ECW_TBW_TRProportion of extracellular water to total body water of trunk
ECW_TBW_RLProportion of extracellular water to total body water of right leg
ECW_TBW_LLProportion of extracellular water to total body water of left leg
Additional data
VFLVisceral fat level
VFAVisceral fat area
Obesity_DProportion of weight to ideal weight
TBW_FFMProportion of total body water to fat-free mass
PAngle50_TR50 kHz phase angle of trunk
† 100% refers to the appropriate amount of body fat mass at standard body weight.
Table 2. General characteristics and body composition indicators between the training set and test set.
Table 2. General characteristics and body composition indicators between the training set and test set.
VariablesMenWomen
Training SetTest SetpTraining SetTest Setp
Participants (n)575245 1462624
Hypertension 1 0.961
Yes75 (13.04)32 (13.06) 108 (7.39)45 (7.21)
No500 (86.96)213 (86.94) 1354 (92.61)579 (92.79)
Age (years)44.27 ± 14.3943.65 ± 14.570.57446.62 ± 12.2546.7 ± 12.450.887
Temperature (°C)36.51 ± 0.3336.51 ± 0.320.99836.70 ± 0.3236.71 ± 0.330.726
SBP (mmHg)119.59 ± 14.07120.09 ± 14.290.649108.39 ± 17.15109.24 ± 17.540.307
DBP (mmHg)70.03 ± 10.1470.17 ± 10.350.86463.22 ± 10.7364.02 ± 10.970.123
Pulse rate (bmp)68.93 ± 10.0467.77 ± 9.770.12371.54 ± 10.3170.89 ± 9.870.175
Height (cm)172.20 ± 6.33172.19 ± 6.160.983159.40 ± 5.64159.60 ± 5.620.450
Weight (kg)74.47 ± 11.4374.67 ± 10.610.80458.74 ± 9.3059.16 ± 9.350.340
BMI (kg/m2)25.07 ± 3.2225.16 ± 3.140.68123.12 ± 3.5123.24 ± 3.520.509
Body fat mass
BFM17.75 ± 6.4318.02 ± 6.170.56919.42 ± 6.5119.56 ± 6.290.643
PBF23.39 ± 5.8723.73 ± 5.860.45232.41 ± 6.3132.45 ± 6.110.908
FMI5.99 ± 2.146.09 ± 2.110.5337.67 ± 2.617.71 ± 2.510.753
BFMp_WB181.46 ± 64.69184.44 ± 64.110.543190.05 ± 82.71191.23 ± 79.730.760
BFM_RA1.05 ± 0.591.07 ± 0.560.6771.37 ± 0.641.38 ± 0.590.715
BFMp_RA178.93 ± 98.29182.73 ± 96.080.607153.51 ± 71.42154.20 ± 66.080.830
BFM_LA1.08 ± 0.591.10 ± 0.560.6481.40 ± 0.641.40 ± 0.590.776
BFMp_LA183.72 ± 98.44187.49 ± 96.420.611156.33 ± 71.81156.68 ± 66.150.915
BFM_TR9.11 ± 3.529.29 ± 3.430.4959.42 ± 3.359.50 ± 3.290.601
BFMp_TR220.63 ± 83.51225.34 ± 83.850.461188.17 ± 67.78189.44 ± 66.410.691
BFM_RL2.69 ± 0.862.72 ± 0.80.6973.10 ± 0.933.12 ± 0.890.669
BFMp_RL160.18 ± 50.85161.64 ± 48.90.699136.41 ± 41.79136.89 ± 39.760.803
BFM_LL2.67 ± 0.842.69 ± 0.80.7293.09 ± 0.923.11 ± 0.880.717
BFMp_LL158.72 ± 50.12160.3 ± 48.770.675135.88 ± 41.39136.33 ± 39.400.814
Lean mass
SLM53.59 ± 6.9353.53 ± 6.540.90537.02 ± 4.1637.28 ± 4.290.196
SMM31.80 ± 4.4731.74 ± 4.220.85821.10 ± 2.6421.27 ± 2.730.194
SMM_WT42.90 ± 3.5142.70 ± 3.510.43836.24 ± 3.4636.24 ± 3.330.998
Body water
TBW41.71 ± 5.3641.66 ± 5.070.91528.89 ± 3.2429.1 ± 3.330.188
ICW25.92 ± 3.4325.87 ± 3.240.86317.71 ± 2.0317.84 ± 2.090.197
ECW15.79 ± 1.9615.79 ± 1.870.99211.18 ± 1.2311.26 ± 1.260.180
TBW_WT56.35 ± 4.3256.10 ± 4.300.44749.68 ± 4.6349.64 ± 4.490.880
ECW_TBW †378.91 ± 7.56379.29 ± 8.030.526386.98 ± 6.02386.95 ± 6.270.918
ECW_TBW_RA †375.18 ± 4.3375.39 ± 4.710.548377.98 ± 3.63378.26 ± 3.580.103
ECW_TBW_LA †375.54 ± 4.55375.59 ± 4.930.897378.58 ± 3.67378.69 ± 3.600.525
ECW_TBW_TR †378.29 ± 7.46378.65 ± 7.870.545386.96 ± 5.86386.94 ± 6.100.944
ECW_TBW_RL †379.43 ± 9.38380.06 ± 10.470.419388.57 ± 7.73388.53 ± 8.340.911
ECW_TBW_LL †382.36 ± 9.74382.61 ± 9.720.736390.57 ± 7.56390.38 ± 7.650.604
Additional data
VFL7.02 ± 2.967.22 ± 3.050.4018.72 ± 3.668.78 ± 3.590.732
VFA75.34 ± 29.9777.43 ± 30.450.36692.14 ± 37.1692.68 ± 35.950.753
Obesity_D113.94 ± 14.62114.38 ± 14.240.688110.12 ± 16.73110.63 ± 16.760.525
TBW_FFM73.55 ± 0.2573.56 ± 0.250.68773.5 ± 0.2173.49 ± 0.230.278
PAngle50_TR7.38 ± 0.917.34 ± 0.880.5165.81 ± 0.525.79 ± 0.570.357
† The numbers represent values multiplied by 1000. Abbreviations: SBP, systolic blood pressure; DBP, diastolic blood pressure; BMI, body mass index; bmp, beats per minute; p, p-value. Continuous variables were summarized as mean ± standard deviation. p-values were obtained from independent two-sample t-tests for continuous variables and chi-square tests for categorical variables between the training and test sets by sex.
Table 3. Comparison of general characteristics and body composition indicators between hypertension and non-hypertension groups.
Table 3. Comparison of general characteristics and body composition indicators between hypertension and non-hypertension groups.
VariablesMenWomen
Non-HypertensionHypertensionpNon-HypertensionHypertensionp
Participants (n)50075 1354108
General characteristics
Age (years)41.96 ± 13.4259.68 ± 10.72<0.00145.76 ± 12.0957.36 ± 8.59<0.001
Temperature (°C)36.51 ± 0.3336.50 ± 0.340.70036.70 ± 0.3236.70 ± 0.310.952
SBP (mmHg)118.52 ± 13.45126.75 ± 16<0.001107.04 ± 16.58125.20 ± 15.24<0.001
DBP (mmHg)69.50 ± 9.7973.56 ± 11.710.00562.51 ± 10.5572.03 ± 9.08<0.001
Pulse rate (bmp)68.96 ± 10.1068.72 ± 9.630.83971.61 ± 10.2270.58 ± 11.430.365
Height (cm)172.63 ± 6.16169.36 ± 6.76<0.001159.56 ± 5.61157.41 ± 5.63<0.001
Weight (kg)74.29 ± 11.2175.66 ± 12.830.38358.34 ± 8.9963.65 ± 11.56<0.001
BMI (kg/m2)24.88 ± 3.1626.29 ± 3.380.00122.92 ± 3.3625.66 ± 4.33<0.001
Body fat mass
BFM17.21 ± 6.3121.37 ± 6.08<0.00119.06 ± 6.2523.95 ± 7.92<0.001
PBF22.72 ± 5.7527.89 ± 4.61<0.00132.06 ± 6.2036.88 ± 5.97<0.001
FMI5.78 ± 2.087.43 ± 1.97<0.0017.51 ± 2.499.67 ± 3.19<0.001
BFMp_WB174.92 ± 62.93225.05 ± 59.50<0.001184.98 ± 78.92253.62 ± 101.20<0.001
BFM_RA1.00 ± 0.581.38 ± 0.58<0.0011.34 ± 0.601.81 ± 0.91<0.001
BFMp_RA169.31 ± 95.04243.1 ± 96.04<0.001149.2 ± 66.58207.52 ± 102.03<0.001
BFM_LA1.03 ± 0.581.42 ± 0.59<0.0011.36 ± 0.601.84 ± 0.90<0.001
BFMp_LA173.82 ± 95.16249.72 ± 94.97<0.001152.03 ± 67.03210.24 ± 102.11<0.001
BFM_TR8.83 ± 3.4611.03 ± 3.33<0.0019.23 ± 3.2411.79 ± 3.76<0.001
BFMp_TR212.46 ± 81.49275.12 ± 76.53<0.001183.96 ± 65.24240.94 ± 76.61<0.001
BFM_RL2.62 ± 0.843.18 ± 0.78<0.0013.06 ± 0.893.70 ± 1.18<0.001
BFMp_RL154.88 ± 49.50195.51 ± 45.57<0.001134.01 ± 39.81166.44 ± 53.14<0.001
BFM_LL2.60 ± 0.833.16 ± 0.77<0.0013.05 ± 0.893.68 ± 1.15<0.001
BFMp_LL153.49 ± 48.80193.6 ± 44.82<0.001133.51 ± 39.49165.57 ± 52.09<0.001
Lean mass
SLM53.93 ± 6.7651.3 ± 7.650.00636.99 ± 4.1237.43 ± 4.670.348
SMM32.05 ± 4.3630.15 ± 4.880.00221.09 ± 2.6221.27 ± 2.940.538
SMM_WT43.34 ± 3.4139.99 ± 2.73<0.00136.44 ± 3.4033.76 ± 3.21<0.001
Body water
TBW41.96 ± 5.2339.99 ± 5.940.00828.87 ± 3.2029.23 ± 3.640.310
ICW26.11 ± 3.3524.65 ± 3.740.00217.70 ± 2.0117.84 ± 2.250.536
ECW15.86 ± 1.9115.34 ± 2.230.0611.16 ± 1.2211.39 ± 1.400.101
TBW_WT56.83 ± 4.2453.12 ± 3.39<0.00149.93 ± 4.5646.47 ± 4.38<0.001
ECW_TBW †378.19 ± 7.30383.71 ± 7.58<0.001386.77 ± 5.97389.62 ± 6.07<0.001
ECW_TBW_RA †374.87 ± 4.14377.28 ± 4.75<0.001377.92 ± 3.63378.71 ± 3.530.027
ECW_TBW_LA †375.24 ± 4.32377.60 ± 5.470.001378.54 ± 3.70379.09 ± 3.140.086
ECW_TBW_TR †377.56 ± 7.18383.13 ± 7.48<0.001386.75 ± 5.81389.52 ± 6.01<0.001
ECW_TBW_RL †378.59 ± 9.19385.05 ± 8.74<0.001388.30 ± 7.65392.01 ± 7.88<0.001
ECW_TBW_LL †381.48 ± 9.46388.19 ± 9.67<0.001390.28 ± 7.47394.28 ± 7.73<0.001
Additional data
VFL6.75 ± 2.868.85 ± 2.98<0.0018.49 ± 3.5411.55 ± 4.01<0.001
VFA72.56 ± 29.0093.85 ± 29.93<0.00189.89 ± 35.84120.32 ± 41.72<0.001
Obesity_D113.10 ± 14.34119.51 ± 15.280.001109.16 ± 16.00122.19 ± 20.64<0.001
TBW_FFM73.53 ± 0.2473.67 ± 0.22<0.00173.49 ± 0.2173.63 ± 0.19<0.001
PAngle50_TR7.49 ± 0.876.67 ± 0.85<0.0015.83 ± 0.525.64 ± 0.47<0.001
† The numbers represent values multiplied by 1000. Abbreviations: SBP, systolic blood pressure; DBP, diastolic blood pressure; BMI, body mass index; bmp, beats per minute; p, p-value. Continuous variables were summarized as mean ± standard deviation. p values were obtained from two-sample t-test between hypertension and non-hypertension groups by sex.
Table 4. Association of individual body composition indices with hypertension.
Table 4. Association of individual body composition indices with hypertension.
VariablesMen Women
Crude OR (95% CI)pCrude OR (95% CI)p
General characteristics
Age (years)4.32 (3.09, 6.06)<0.0013.36 (2.57, 4.4)<0.001
Height (cm)0.58 (0.45, 0.75)<0.0010.68 (0.56, 0.83)<0.001
Weight (kg)1.12 (0.89, 1.42)0.3331.58 (1.34, 1.85)<0.001
BMI (kg/m2)1.50 (1.20, 1.89)0.0011.82 (1.55, 2.14)<0.001
Body fat mass
BFM1.77 (1.41, 2.22)<0.0011.78 (1.51, 2.09)<0.001
PBF2.70 (2.02, 3.61)<0.0012.20 (1.79, 2.71)<0.001
FMI2.03 (1.60, 2.58)<0.0011.89 (1.60, 2.22)<0.001
BFMp_WB2.03 (1.60, 2.58)<0.0011.89 (1.60, 2.22)<0.001
BFM_RA1.68 (1.35, 2.09)<0.0011.63 (1.40, 1.89)<0.001
BFMp_RA1.86 (1.48, 2.33)<0.0011.70 (1.46, 1.98)<0.001
BFM_LA1.71 (1.37, 2.13)<0.0011.63 (1.40, 1.89)<0.001
BFMp_LA1.90 (1.51, 2.39)<0.0011.70 (1.46, 1.97)<0.001
BFM_TR1.78 (1.41, 2.25)<0.0011.88 (1.58, 2.23)<0.001
BFMp_TR2.04 (1.60, 2.61)<0.0012.00 (1.68, 2.38)<0.001
BFM_RL1.79 (1.43, 2.25)<0.0011.69 (1.44, 1.98)<0.001
BFMp_RL2.07 (1.63, 2.63)<0.0011.79 (1.53, 2.11)<0.001
BFM_LL1.80 (1.44, 2.26)<0.0011.69 (1.44, 1.98)<0.001
BFMp_LL2.08 (1.64, 2.64)<0.0011.8 0(1.53, 2.11)<0.001
Lean mass
SLM0.67 (0.51, 0.87)0.0021.11 (0.91, 1.34)0.295
SMM0.64 (0.49, 0.83)0.0011.07 (0.88, 1.30)0.495
SMM_WT0.33 (0.24, 0.44)<0.0010.44 (0.35, 0.54)<0.001
Body water
TBW0.68 (0.52, 0.88)0.0031.12 (0.92, 1.35)0.256
ICW0.64 (0.49, 0.83)0.0011.07 (0.88, 1.3)0.494
ECW0.76 (0.59, 0.98)0.0341.2 (0.99, 1.44)0.063
TBW_WT0.38 (0.29, 0.51)<0.0010.46 (0.37, 0.57)<0.001
ECW_TBW2.08 (1.61, 2.69)<0.0011.62 (1.33, 1.99)<0.001
ECW_TBW_RA1.74 (1.36, 2.22)<0.0011.25 (1.02, 1.52)0.029
ECW_TBW_LA1.66 (1.3, 2.11)<0.0011.16 (0.95, 1.42)0.133
ECW_TBW_TR2.09 (1.62, 2.69)<0.0011.61 (1.32, 1.97)<0.001
ECW_TBW_RL2.01 (1.56, 2.59)<0.0011.64 (1.34, 2)<0.001
ECW_TBW_LL2.05 (1.58, 2.67)<0.0011.72 (1.41, 2.11)<0.001
Additional data
VFL1.88 (1.5, 2.37)<0.0012.03 (1.7, 2.42)<0.001
VFA1.86 (1.48, 2.35)<0.0011.95 (1.65, 2.32)<0.001
Obesity_D1.5 (1.19, 1.9)0.0011.82 (1.55, 2.14)<0.001
TBW_FFM1.82 (1.4, 2.37)<0.0011.9 (1.56, 2.32)<0.001
PAngle50_TR0.36 (0.27, 0.49)<0.0010.69 (0.56, 0.84)<0.001
Abbreviations: SBP, systolic blood pressure; DBP, diastolic blood pressure; p, p-value; OR, odds ratio; CI, confidence interval. p values were obtained from simple logistic regression analyses for individual body composition variables by sex.
Table 5. Performance of the six models.
Table 5. Performance of the six models.
Men’ ModelsTraining Set
AUROC
Test Set
AUROC
(95% CI)
AUROC TestpKappa
(95% CI)
F1 Score
(95% CI)
Precision
(95% CI)
Accuracy
(95% CI)
Sensitivity
(95% CI)
Specificity
(95% CI)
E-net0.8740.865
(0.806, 0.914)
0.335
(0.228, 0.435)
0.466
(0.348, 0.563)
0.310
(0.215, 0.400)
0.718
(0.657, 0.771)
0.941
(0.833, 1.000)
0.686
(0.620, 0.742)
K-NN0.8760.81
(0.74, 0.873)
E-net vs.
K-NN
0.0750.264
(0.168, 0.362)
0.409
(0.301, 0.507)
0.271
(0.186, 0.355)
0.682
(0.620, 0.739)
0.848
(0.707, 0.966)
0.659
(0.593, 0.718)
RF10.793
(0.717, 0.862)
E-net vs. RF0.0450.203
(0.132, 0.286)
0.372
(0.275, 0.459)
0.231
(0.162, 0.303)
0.584
(0.522, 0.645)
0.941
(0.840, 1.000)
0.530
(0.462, 0.600)
SVM0.9590.783
(0.697, 0.857)
E-net vs. SVM0.0180.243
(0.154, 0.337)
0.395
(0.292, 0.493)
0.256
(0.177, 0.340)
0.653
(0.592, 0.710)
0.879
(0.750, 0.971)
0.619
(0.553, 0.682)
XGBoost 0.9560.827
(0.761, 0.883)
E-net vs. XGBoost0.4330.261
(0.185, 0.354)
0.413
(0.312, 0.514)
0.263
(0.188, 0.345)
0.645
(0.584, 0.706)
0.970
(0.893, 1.000)
0.597
(0.531, 0.659)
NN0.90.853
(0.797, 0.904)
E-net vs. NN0.7570.365
(0.257, 0.469)
0.489
(0.37, 0.586)
0.330
(0.232, 0.424)
0.743
(0.69, 0.796)
0.941
(0.833, 1.000)
0.714
(0.653, 0.771)
Women’s
Models
Training Set
AUROC
Test Set
AUROC
(95% CI)
AUROC TestpKappa
(95% CI)
F1 Score
(95% CI)
Precision
(95% CI)
Accuracy
(95% CI)
Sensitivity
(95% CI)
Specificity
(95% CI)
E-net0.8330.831
(0.78, 0.881)
0.153
(0.111, 0.201)
0.259
(0.197, 0.322)
0.149
(0.110, 0.192)
0.597
(0.558, 0.636)
0.979
(0.927, 1.000)
0.568
(0.527, 0.607)
K-NN0.8490.805
(0.747, 0.855)
E-net vs. K-NN0.3990.224
(0.142, 0.299)
0.312
(0.225, 0.391)
0.198
(0.136, 0.260)
0.768
(0.732, 0.798)
0.735
(0.600, 0.857)
0.770
(0.733, 0.802)
RF10.804
(0.745, 0.857)
E-net vs. RF0.4350.145
(0.104, 0.190)
0.251
(0.191, 0.312)
0.144
(0.107, 0.187)
0.591
(0.553, 0.628)
0.957
(0.889, 1.000)
0.563
(0.524, 0.603)
SVM10.774
(0.712, 0.831)
E-net vs. SVM0.1170.179
(0.122, 0.242)
0.276
(0.209, 0.349)
0.166
(0.12, 0.219)
0.692
(0.655, 0.728)
0.829
(0.708, 0.932)
0.683
(0.644, 0.720)
XGBoost0.9150.826
(0.772, 0.876)
E-net vs. XGBoost0.9100.265
(0.186, 0.345)
0.347
(0.262, 0.430)
0.223
(0.162, 0.291)
0.790
(0.758, 0.821)
0.780
(0.653, 0.896)
0.791
(0.758, 0.824)
NN0.8460.822
(0.771, 0.871)
E-net vs. NN0.7800.165
(0.117, 0.220)
0.268
(0.206, 0.334)
0.157
(0.116, 0.203)
0.633
(0.595, 0.671)
0.938
(0.850, 1.000)
0.611
(0.569, 0.649)
Abbreviations: CI, confidence interval; E-net, elastic net; K-NN, k-nearest neighbor; RF, random forest; SVM, support vector machine; XGBoosting, extreme gradient boosting; NN, neural network; CI, confidence interval; p, p-value. The 95% CIs were obtained from 2000 bootstrap repetitions. The optimal threshold was determined using Youden’s index.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Seo, J.-W.; Lee, S.; Yim, M.H. Machine Learning Approach for Predicting Hypertension Based on Body Composition in South Korean Adults. Bioengineering 2024, 11, 921. https://doi.org/10.3390/bioengineering11090921

AMA Style

Seo J-W, Lee S, Yim MH. Machine Learning Approach for Predicting Hypertension Based on Body Composition in South Korean Adults. Bioengineering. 2024; 11(9):921. https://doi.org/10.3390/bioengineering11090921

Chicago/Turabian Style

Seo, Jeong-Woo, Sanghun Lee, and Mi Hong Yim. 2024. "Machine Learning Approach for Predicting Hypertension Based on Body Composition in South Korean Adults" Bioengineering 11, no. 9: 921. https://doi.org/10.3390/bioengineering11090921

APA Style

Seo, J. -W., Lee, S., & Yim, M. H. (2024). Machine Learning Approach for Predicting Hypertension Based on Body Composition in South Korean Adults. Bioengineering, 11(9), 921. https://doi.org/10.3390/bioengineering11090921

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop