The dynamics and diversity of human gut microbiota that can remarkably influence the wellbeing and health of the host are constantly changing through the host’s lifetime in response to various factors. The aim of the present study was to determine a set of parameters that could have a major impact on classifying subjects into a single cluster regarding gut bacteria composition. Therefore, a set of demographical, environmental, and clinical data of healthy adults aged 25–50 years (117 female and 83 men) was collected. Fecal microbiota composition was characterized using Illumina MiSeq 16S rRNA gene amplicon sequencing. Hierarchical clustering was performed to analyze the microbiota data set, and a supervised machine learning model (SVM; Support Vector Machines) was applied for classification. Seventy variables from collected data were included in machine learning analysis. The agglomerative clustering algorithm suggested the presence of four distinct community types of most abundant bacterial phyla. Each cluster harbored a statistically significant different proportion of bacterial phyla. Regarding prediction, the most important features classifying subjects into clusters were measures of obesity (waist to hip ratio, BMI, and visceral fat index), total body water, blood pressure, energy intake, total fat, olive oil intake, total fiber intake, and water intake. In conclusion, the SVM model was shown as a valuable tool to classify healthy individuals based on their gut microbiota composition.
This is an open access article distributed under the Creative Commons Attribution License
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited