Next Article in Journal
Thyme Extract Alleviates High-Fat Diet-Induced Obesity and Gut Dysfunction
Next Article in Special Issue
Healthy Snack Project: Improving Healthy Choices through Multidisciplinary Food Education Actions
Previous Article in Journal
The Beneficial Effects of Dietary Interventions on Gut Microbiota—An Up-to-Date Critical Review and Future Perspectives
Previous Article in Special Issue
Food Composition Databases (FCDBs): A Bibliometric Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of a Two-Dimensional Mapping-Based Visualization Technique: Nutrient-Value-Based Food Grouping

1
Meiji Co., Ltd., 2-2-1 Kyobashi, Chuo-ku 104-9306, Tokyo, Japan
2
Computational Systems Biology Laboratory, Division of Information Science, Graduate School of Science and Technology & Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma 630-0192, Nara, Japan
3
Medicine Nutrition, Faculty of Nutrition, Kagawa Nutrition University, 3-9-21 Chiyoda, Sakado 350-0288, Saitama, Japan
*
Author to whom correspondence should be addressed.
Nutrients 2023, 15(23), 5006; https://doi.org/10.3390/nu15235006
Submission received: 23 October 2023 / Revised: 29 November 2023 / Accepted: 1 December 2023 / Published: 4 December 2023

Abstract

:
Worldwide, several food-based dietary guidelines, with diverse food-grouping methods in various countries, have been developed to maintain and promote public health. However, standardized international food-grouping methods are scarce. In this study, we used two-dimensional mapping to classify foods based on their nutrient composition. The Standard Tables of Food Composition in Japan were used for mapping with a novel technique—t-distributed stochastic neighbor embedding—to visualize high-dimensional data. The mapping results showed that most foods formed food group-based clusters in the Standard Tables of Food Composition in Japan. However, the beverages did not form large clusters and demonstrated scattered distribution on the map. Green tea, black tea, and coffee are located within or near the vegetable cluster whereas cocoa is near the pulse cluster. These results were ensured by the k-nearest neighbors. Thus, beverages made from natural materials can be categorized based on their origin. Visualization of food composition could enable an enhanced comprehensive understanding of the nutrients in foods, which could lead to novel aspects of nutrient-value-based food classifications.

1. Introduction

To maintain and promote public health, it is important to follow a healthy and balanced diet that has adequate nutrient levels. To encourage the adoption of balanced diets containing diverse foods, the World Health Organization (WHO) established a “Healthy diet” [1], and several food-based dietary guidelines (FBDGs) were developed worldwide. However, food-classification methods for the FBDGs are diverse and differ among countries [2]. In particular, there is a wide variability in nuts, legumes, and animal food sources. For example, legumes are categorized as protein foods, starchy staples, vegetables, and legumes. Nuts are categorized as protein-rich foods, including legumes and fats/oils. In some countries, especially Latin America and the Caribbean, animal-source foods include dairy; however, in most other countries, dairy is considered a separate food group. In Japan, an FBDG called “The Japanese Food Guide Spinning Top” was developed to promote the health policy [3]. The guideline recommends a balanced diet composed of grain dishes (rice, bread, noodles, and pasta), vegetable-based dishes (vegetables, mushrooms, potatoes, and algae), fish and meat dishes (meat, fish, eggs, and soybeans), fruits, and dairy (milk and milk products). These categories are based on Japanese dietary habits [4].
Detailed information on food composition is a prerequisite for implementing dietary guidelines [5]. In Japan, the Standard Tables of Food Composition were developed based on the idea that it is important to clarify the nutrient component of daily intake [6,7]. Since the release of the first edition in 1950, the Standard Tables of Food Composition in Japan have played a role in providing basic data on food components and have been widely used in various fields, such as research and food education. The Eighth Revised Version of the Standard Tables of Food Composition in Japan was published in 2020. These food-composition tables provide insight into the changes in Japanese gastronomic culture and eating habits [7,8] and include the following food groups: cereals, potatoes and starches; sugars and sweeteners; pulses, nuts and seeds; vegetables; fruits; mushrooms; algae; fish, mollusks and crustaceans; meat; eggs, milk and milk products; fats and oils; confectionaries; beverages; seasonings and spices; and prepared foods. In these food-composition tables, foods are categorized by considering Japanese dietary habits and convenience [9]. Therefore, the nutrient-value-based food categorization could improve our understanding of the characteristics of each food group.
The visualization of high-dimensional datasets, such as food-composition tables, increases awareness of the nutritional value of foods. Data-reduction techniques are useful for investigating high-dimensional datasets. Principal component analysis (PCA) is one of the techniques [10] that have been applied to food-composition tables [11,12,13]. Recently, a new technique called the t-distributed stochastic neighbor embedding (t-SNE) was developed for visualizing high-dimensional data [14]. A previous study showed that t-SNE performs better than PCA in visualizing a food-composition table [15]. However, food classification based on nutrient values has not yet been performed using the t-SNE method.
This study aimed to classify foods based on nutrient information in the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition). We used the t-SNE mapping method and the k-nearest neighbors (k-NN), which lead to the estimation of appropriate nutrient intake through FBDGs. This nutrient value-based evaluation of food would enable a comprehensive understanding of the relationship between nutrients and food.

2. Materials and Methods

2.1. Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition)

The Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition) is an open source publication from the Ministry of Education, Culture, Sports, Science and Technology, Japan, in which 2478 foods were divided into 17 food groups. In this study, 16 food groups, except for “Prepared foods”, were used for analysis; “Prepared foods” was included in meal. The 16 food groups and number of foods in each group are shown in Table 1. The standard tables comprise 54 nutrients.

2.2. Data Imputation

In Table 2, we cannot ignore the missing values of nutrients for foods, that is, missing values exist in 26–27% of foods for saturated fatty acids, n-6 PUFA and n-3 PUFA, and 1–8% of foods for vitamins D, E, K, B6, B12 and C; folic acid, pantothenic acid; zinc, copper, and manganese. Therefore, the imputation method was applied to estimate missing values using multidimensional scaling (MDS), which represents measurements of similarity among pairs of objects as distances between points of a low-dimensional multidimensional space, and random forest regression methods based on four steps (Figure 1).
Step 1: The distance between objects can be calculated using Equation (1).
d i , i = M U d i , i U
where d i , i U represents the distance between variables without missing values in the two objects and M is the total number of variables. Thus, the distance matrix D was calculated from the original dataset X. Step 2: MDS variables U for all objects based on the distance matrix D calculated by Equation (1) can be obtained based on the MDS. Here U has compressed information of X. Step 3: A regression model of jth variable of xj without missing values based on U was created by random forest. Step 4: Using the random forest regression models, missing values were calculated for jth variable. Steps 2 and 3 were performed for all variables X.

2.3. Missing Values

The accuracy of the missing values was validated by removing 50 known values and estimating the values for the imputation method as described in the Methods Section (Section 2.2). The average correlation coefficient for the 30 variables was 0.984 (range 0.957–0.997). Thus, the imputation values for missing values are expected to be highly accurate. Furthermore, as shown in Table 1, the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition) include 2428 entries in 16 food groups; however, these are not the complete data. Data imputation was performed to increase the amount of data available for mapping. In addition to energy and water, nutrients that had an estimated average requirement (EAR), recommended dietary allowance (RDA), adequate intake (AI), or a tentative dietary goal for preventing lifestyle-related diseases (DG) in the Japanese Dietary Reference Intakes [16,17] were chosen. However, nutrients with approximately 50% of the total data were excluded (Table 2). EAR is composed of proteins, vitamin A, vitamin B1, vitamin B2, niacin equivalent, vitamin B6, vitamin B12, folic acid, vitamin C, sodium, calcium, magnesium, iron, zinc, copper, iodine, selenium, and molybdenum. RDA is composed of proteins, vitamins A, B1, B2, niacin equivalents, B6, B12, folic acid, C, calcium, magnesium, iron, zinc, copper, iodine, selenium, and molybdenum. AI is composed of n-6 polyunsaturated fatty acids, n-3 polyunsaturated fatty acids, vitamin D, vitamin E, vitamin K, pantothenic acid, biotin, potassium, phosphorus, manganese, and chromium. DG is composed of proteins, fats, saturated fatty acids, carbohydrates, dietary fiber, sodium, and potassium. In this case, the selected nutrients were energy, water, protein, fat, saturated fatty acid, n-3 polyunsaturated fatty acid, n-6 polyunsaturated fatty acid, dietary fiber, carbohydrate, sodium, potassium, calcium, magnesium, phosphorus, iron, zinc, copper, manganese, vitamin A (retinol activity equivalents), vitamin D, vitamin E (alpha-tocopherol), vitamin K, vitamin B1, vitamin B2, niacin equivalent, vitamin B6, vitamin B12, folic acid, pantothenic acid, and vitamin C, whereas iodine, selenium, chromium, molybdenum, and biotin were excluded.

2.4. Mapping

Nutrient-density data were used for mapping to exclude the effect of water. Thus, the data for nutrients per 100 g of edible portion were converted to per 100 kcal. Water and energy data were excluded from the dataset during conversion. All nutrients were mapped. Furthermore, to evaluate the contribution of macronutrients (proteins, fats, and carbohydrates) to the mapping, we performed mapping using nutrients without them.
A two-dimensional map of the nutrient matrix consisting of nutrients per 100 kcal and 2221 foods was constructed using t-distributed stochastic neighbor embedding (t-SNE). t-SNE is a nonlinear dimensionality reduction method in which Gaussian probability distributions over a high-dimensional space are constructed and used to optimize a Student’s t-distribution in a low-dimensional space. The low-dimensional embedding descriptors, that is, nutrients per 100 kcal, can be obtained by minimizing the Kullback–Leibler divergence [18] between the distributions in high- and low-dimensional spaces, maintaining pair-wise similarity to the high-dimensional space. Therefore, it is a powerful tool for understanding the relationship among objects as a low-dimensional space because the objects close to each other on the low-dimensional surface represent states that are similar in a high-dimensional space [14,19]. The performance of t-SNE is robust with sufficiently large hyper parameters [14,19]. Mapping was performed using the R Package (version 4.2.2).

2.5. Food Classification

The k-NN, a type of machine-learning model [20,21], was applied to classify foods based on nutrient information (k = 3). Nutrient data per 100 kcal were used for food classification and mapping. The misclassification rate was calculated by dividing the number of incorrect predictions by the number of the total predictions.

3. Results

All data in the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition) were imputed. The numbers of foods consumed before and after data imputation are presented in Table 1. The original food-composition table included 2428 foods in 17 food groups; however, the data for only 1619 foods were analyzable without any missing values. After data imputation, the number of foods that contained all nutrients was 2221 (/100 kcal). The accuracy of the missing values was validated by removing 50 known values and estimating the values by using the imputation method described in the Methods Section (Section 2.3). The correlation coefficients for the 30 variables averaged 0.984 (range 0.957–0.997), which is very strong [22]. Thus, the imputation values for missing values are expected to be highly accurate. The accuracy of data imputation is presented in Supplementary Figure S1.
Two-dimensional mapping was performed using/100 kcal data after data imputation. The results of mapping all nutrients using t-SNE are shown in Figure 2. Figure 3 shows the effects of each nutrient (Figure 2). Macronutrients (proteins, fats, and carbohydrates) were distributed in three directions: proteins were located in the first quadrant, fat in the second quadrant, and carbohydrates in the third quadrant. In contrast, other nutrients were located near the center of the map. Phosphorus, zinc, vitamin D, vitamin B1, niacin equivalents, vitamin B12, and pantothenic acid were present in the first quadrant. Moreover, n-3 polyunsaturated fatty acids, n-6 polyunsaturated fatty acids, dietary fiber, sodium, potassium, calcium, magnesium, iron, copper, manganese, vitamins A, D, E, K, B2, B6, folic acid, and biotin were located in the fourth quadrant.
The k-NN method was used for ensuring classification accuracy. The number of foods misclassified using k-NN is shown in Table 3, and the average misclassification rate was 13%.
The results of the mapping with nutrients, excluding macronutrients (proteins, fats, and carbohydrates), are shown in Figure 4. Furthermore, Figure 5 shows the effects of each nutrient (Figure 4). n-6 polyunsaturated fatty acids, dietary fiber, manganese, and vitamin C are located in the first quadrant. Sodium is present in the second quadrant. n-3 polyunsaturated fatty acids, magnesium, phosphorus, iron, zinc, copper, vitamin D, vitamin B2, niacin equivalents, vitamin B6, vitamin B12, and pantothenic acid is located in the third quadrant. Saturated fatty acids, potassium, calcium, vitamins A, E, K, B1, and folic acid is located in the fourth quadrant.

4. Discussion

4.1. Discussion

This is the first visualization study which was undertaken using t-SNE with the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition). Furthermore, this study showed that visualization by two-dimensional mapping was possible based on nutrient information in food-composition tables, as shown in a previous study [15]. Previous research using PCA has shown that nuts, seeds, and pulses were well separated; however, fruits and vegetables were not separated adequately [13]. However, the results of this study showed that these foods were well-separated. In addition, as shown in a previous study that indicated that t-SNE was better than PCA for the visualization of food tables [15], this study using t-SNE showed that it was possible to classify many foods into distinct food groups by focusing on the nutrients in the dietary reference intake for the Japanese population.
The results showed that most foods formed clusters in accordance with the food groups in the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition), as indicated by the misclassification rate. This indicates that many food groups had common nutrient patterns and were categorized based on nutrient similarity. Furthermore, as shown in Figure 4, which was performed without protein, fat, or carbohydrates, many foods formed clusters in accordance with the food groups. This indicates that the characteristics of the food groups were determined by micro- and macronutrient (protein, fat, and carbohydrate) balance. Given that the results of this study are based on nutrient density, nutrient density may also contribute to the classification of foods.
In contrast, the beverages did not form large clusters and were scattered. The misclassification rate of beverages was 48%, with the highest rate among the food groups. This may be because this food group comprises a wide range of beverages, including soft drinks, beverages made from natural materials (green tea, black tea, kelp tea, and cocoa), and alcoholic drinks. In Figure 2, the coordinates of green teas (refined, powdered, and middle grade) and black tea were as follows: (26.8, −19.6), (27.7, −14.3), (27.0, −18.8), and (25.2, −18.5), which fell within the vegetables cluster. The k-NN results showed that green tea (refined and powdered) and black tea could be classified as vegetables. Green tea consumption has been reported to be beneficial to overall health [23]. Generally, tea is composed of the leaves and buds of Camellia sinensis and the difference between green tea and black tea lies in the manufacturing process [24]. In addition to tea, the coordinates of coffee and instant coffee were (5.5, −19.7) and (9.7, −15.6), which were close to the vegetables cluster. The coordinates of kelp tea, which was categorized in beverages, were (−2.5, −10.8), which were near the processed kelps (salted kelp and kelp boiled in soy sauce). In addition, the coordinates of cocoa (pure powder), an ingredient of dark chocolate [25], were (−3.6, 5.8), which were close to the pulses cluster. The food was classified into pulses using k-NN. Cocoa is rich in polyphenols and has beneficial effects [26,27,28]. Moreover, cocoa contains nitrogenous proteins and minerals, such as potassium, magnesium, phosphorus, iron, zinc, and copper [29]. Furthermore, pulses are good sources of polyphenols [30], proteins, and minerals, such as magnesium, iron, and zinc [31]. Although this study did not consider polyphenols, there are many common features between cocoa and pulses, as shown in Figure 2 in terms of those micronutrients. These results indicate that the original materials were reflected in the mapping results.
There is no international consensus on the definition of vegetables and fruits; thus, definitions vary among countries or cultures [32]. The Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition) categorized vegetables of herbaceous plants, such as banana, strawberry, melon, watermelon, and pineapple, into fruits because of their convenience and Japanese dietary habits. From the results of Figure 2, their coordinates were (6.2, −38.5), (7.9, −33.3), (6.7, −37.5), (5.5, −36.2), and (5.3, −37.2), respectively, and they were all within the fruits cluster. These results indicate that mapping distinguishes vegetables from fruits in terms of nutrients.
Figure 3 shows that the mapping shown in Figure 2 is largely affected by proteins, fats, and carbohydrates. This means that foods rich in protein are drawn in the first quadrant, foods rich in fat are drawn in the second quadrant, foods rich in carbohydrates are drawn in the third quadrant, and foods rich in micronutrients are drawn between the first and fourth quadrants. These results indicate that the intake of a variety of foods was associated with the intake of a variety of nutrients. It is important to consume a diet containing a variety of foods. For example, the Dietary Approaches to Stop Hypertension (DASH) diet is composed of rich fruits, vegetables, and low-fat dairy foods, which have significant effects on blood pressure [33,34]. In addition to a DASH diet, a Mediterranean diet rich in olive oil, fruits, nuts, vegetables, and cereals, moderate in fish, and poor in red and processed meats was associated with low cardiovascular risk [35]. The WHO recommends a diversified and balanced diet and intake of foods such as fruits, vegetables, nuts, and pulses to protect against malnutrition and noncommunicable diseases [1]. For Japanese people, consuming diverse foods and increasing the amount of fruit and pulses are inversely associated with mortality [36]. Based on the results of this study, these foods are located in areas where various micronutrients are found.
The Japanese Food Guide Spinning Top was developed by the Ministry of Health, Labour, and Welfare and the Ministry of Agriculture, Forestry, and Fisheries of Japan in 2005. A diet that adheres to the Japanese Food Guide Spinning Top decreases the risk of total mortality and mortality from cardiovascular disease [37]. Figure 2 shows that the foods categorized in each dish were situated close to each other. This indicates that the mapping of food-composition tables can explain the relationship between the Japanese Food Guide Spinning Top and nutrient-based foods.

4.2. Limitation

This study has some limitations. First, information on iodine, selenium, chromium, molybdenum, and biotin was not added to the dataset for data imputation and mapping because the number of available foods was insufficient. Secondly, details of bioactive compounds, such as phytochemicals and peptides, and the quality of nutrients, such as amino acid balance and fatty acid composition, were not included in the dataset of this study. Finally, this study employed nutrient density and did not reflect the actual nutrient quantity of real intake in the mapping. When data from the Standard Tables of Food Composition in Japan increase and nutrient data become available, more precise mapping will be performed. Analysis of the quality of nutrients or other compounds may lead to a better understanding of food characteristics.

5. Conclusions

In conclusion, this is the first report on t-SNE-based visualization of the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition), and the results showed that most foods formed clusters in accordance with the food groups in the Standard Tables of Food Composition in Japan, 2020 (Eighth Revised Edition) in terms of nutrient values. In contrast, beverages did not form a large cluster, and beverages made from natural materials such as green tea, black tea, coffee, or cocoa were categorized based on their origin. The visualization of food composition enables a comprehensive understanding of the nutrients in foods, thereby leading to novel aspects of food classifications based on nutrient values.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/nu15235006/s1, Figure S1: Accuracy of the imputed data.

Author Contributions

Conceptualization, R.W. and S.T.; methodology, R.W., S.T. and S.K.; formal analysis, R.W.; data curation, R.W.; writing—original draft preparation, R.W.; writing—review and editing, R.W., S.T., K.H. and S.K.; visualization, R.W. and S.K.; supervision, S.T. and S.K.; project administration, R.W.; funding acquisition, self-funded. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

This data can be found here: https://www.mext.go.jp/a_menu/syokuhinseibun/mext_01110.html (accessed on 30 November 2023).

Acknowledgments

R.W. would like to thank Shigeru Taniguchi, Keiko Kawahata, and Teruyuki Noma for their support during my work at Meiji Co., Ltd. R.W. is grateful to Tomohito Horimoto, Yoshie Saito, and Tao Yu for useful discussions.

Conflicts of Interest

Ryota Wakayama and Satoshi Takasugi are full-time employees of Meiji Co., Ltd. Keiko Honda and Shigehiko Kanaya have academic consulting agreements with Meiji Co., Ltd.

References

  1. Healthy Diet. Available online: https://www.who.int/news-room/fact-sheets/detail/healthy-diet (accessed on 7 June 2023).
  2. Herforth, A.; Arimond, M.; Álvarez-Sánchez, C.; Coates, J.; Christianson, K.; Muehlhoff, E. A Global Review of Food-Based Dietary Guidelines. Adv. Nutr. 2019, 10, 590–605. [Google Scholar] [CrossRef] [PubMed]
  3. Yoshiike, N.; Hayashi, F.; Takemi, Y.; Mizoguchi, K.; Seino, F. A New Food Guide in Japan: The Japanese Food Guide Spinning Top. Nutr. Rev. 2007, 65, 149–154. [Google Scholar] [CrossRef]
  4. Murphy, S.P.; Barr, S.I. Food guides reflect similarities and differences in dietary guidance in three countries (Japan, Canada, and the United States). Nutr. Rev. 2007, 65, 141–148. [Google Scholar] [CrossRef]
  5. Elmadfa, I.; Meyer, A.L. Importance of food composition data to nutrition and public health. Eur. J. Clin. Nutr. 2010, 64 (Suppl. S3), S4–S7. [Google Scholar] [CrossRef] [PubMed]
  6. Watanabe, T. Research on Formulation and Application of “Standard Tables of Food Composition in Japan”. Jpn. J. Nutr. Diet. 2011, 69, 214–228. [Google Scholar] [CrossRef]
  7. Standard Tables of Food Composition in Japan–2020–(Eighth Rivised Edition); Report of the Subdivision on Resources; The Council for Science and Technology, Ministry of Education, Culture, Sports, Science and Technology: Tokyo, Japan, 2020.
  8. Watanabe, T. Food Composition Tables of Japan and the Nutrient Table/Database. J. Nutr. Sci. Vitaminol. 2015, 61, S25–S27. [Google Scholar] [CrossRef]
  9. Q&A about Standard Tables of Food Composition in Japan. Available online: https://www.mext.go.jp/a_menu/syokuhinseibun/__icsFiles/afieldfile/2019/01/15/1357804_001.pdf (accessed on 15 June 2023).
  10. Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2016, 374, 20150202. [Google Scholar] [CrossRef]
  11. Balakrishna, Y.; Manda, S.; Mwambi, H.; van Graan, A. Identifying Nutrient Patterns in South African Foods to Support National Nutrition Guidelines and Policies. Nutrients 2021, 13, 3194. [Google Scholar] [CrossRef]
  12. Prado, S.B.R.D.; Giuntini, E.B.; Grande, F.; Menezes, E.W.D. Techniques to evaluate changes in the nutritional profile of food products. J. Food Compos. Anal. 2016, 53, 1–6. [Google Scholar] [CrossRef]
  13. Li, Y.; Bahadur, R.; Ahuja, J.; Pehrsson, P.; Harnly, J. Macro-and micronutrients in raw plant foods: The similarities of foods and implication for dietary diversification. J. Food Compos. Anal. 2021, 102, 103993. [Google Scholar] [CrossRef]
  14. van der Maaten, L.; Hinton, G. Viualizing data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
  15. Lunterova, A.; Spetko, O.; Palamas, G. Explorative Visualization of Food Data to Raise Awareness of Nutritional Value. In HCI International 2019—Late Breaking Papers; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; pp. 180–191. [Google Scholar] [CrossRef]
  16. Overview of the Dietary Reference Intakes for Japanese. 2020. Available online: https://www.mhlw.go.jp/content/10900000/000862500.pdf (accessed on 7 June 2023).
  17. Uenishi, K. Dietary Reference Intakes 2020. J. Cookey Sci. Jpn 2020, 53, 153–156. [Google Scholar] [CrossRef]
  18. Joyce, J.M. Kullback-Leibler Divergence. In International Encyclopedia of Statistical Science; Lovric, M., Ed.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 720–722. [Google Scholar] [CrossRef]
  19. Zhou, H.; Wang, F.; Tao, P. t-Distributed Stochastic Neighbor Embedding Method with the Least Information Loss for Macromolecular Simulations. J. Chem. Theory Comput. 2018, 14, 5499–5510. [Google Scholar] [CrossRef] [PubMed]
  20. Zhang, Z. Introduction to machine learning: K-nearest neighbors. Ann. Transl. Med. 2016, 4, 218. [Google Scholar] [CrossRef]
  21. Abu Alfeilat, H.A.; Hassanat, A.B.A.; Lasassmeh, O.; Tarawneh, A.S.; Alhasanat, M.B.; Eyal Salman, H.S.; Prasath, V.B.S. Effects of Distance Measure Choice on K-Nearest Neighbor Classifier Performance: A Review. Big Data 2019, 7, 221–248. [Google Scholar] [CrossRef]
  22. Schober, P.; Boer, C.; Schwarte, L.A. Correlation Coefficients: Appropriate Use and Interpretation. Anesth. Analg. 2018, 126, 1763–1768. [Google Scholar] [CrossRef] [PubMed]
  23. Abe, S.K.; Inoue, M. Green tea and cancer and cardiometabolic diseases: A review of the current epidemiological evidence. Eur. J. Clin. Nutr. 2021, 75, 865–876. [Google Scholar] [CrossRef]
  24. Cabrera, C.; Artacho, R.; Giménez, R. Beneficial effects of green tea—A review. J. Am. Coll. Nutr. 2006, 25, 79–99. [Google Scholar] [CrossRef]
  25. Montagna, M.T.; Diella, G.; Triggiano, F.; Caponio, G.R.; De Giglio, O.; Caggiano, G.; Di Ciaula, A.; Portincasa, P. Chocolate, “Food of the Gods”: History, Science, and Human Health. Int. J. Environ. Res. Public Health 2019, 16, 4960. [Google Scholar] [CrossRef]
  26. Magrone, T.; Russo, M.A.; Jirillo, E. Cocoa and Dark Chocolate Polyphenols: From Biology to Clinical Applications. Front. Immunol. 2017, 8, 677. [Google Scholar] [CrossRef] [PubMed]
  27. Zhao, B.; Gan, L.; Yu, K.; Männistö, S.; Huang, J.; Albanes, D. Relationship between chocolate consumption and overall and cause-specific mortality, systematic review and updated meta-analysis. Eur. J. Epidemiol. 2022, 37, 321–333. [Google Scholar] [CrossRef]
  28. Ding, E.L.; Hutfless, S.M.; Ding, X.; Girotra, S. Chocolate and prevention of cardiovascular disease: A systematic review. Nutr. Metab. 2006, 3, 2. [Google Scholar] [CrossRef] [PubMed]
  29. Latif, R. Chocolate/cocoa and human health: A review. Neth. J. Med. 2013, 71, 63–68. [Google Scholar]
  30. Singh, B.; Singh, J.P.; Shevkani, K.; Singh, N.; Kaur, A. Bioactive constituents in pulses and their health benefits. J. Food Sci. Technol. 2017, 54, 858–870. [Google Scholar] [CrossRef]
  31. Mudryj, A.N.; Yu, N.; Aukema, H.M. Nutritional and health benefits of pulses. Appl. Physiol. Nutr. Metab. 2014, 39, 1197–1204. [Google Scholar] [CrossRef]
  32. Thompson, F.E.; Willis, G.B.; Thompson, O.M.; Yaroch, A.L. The meaning of ‘fruits’ and ‘vegetables’. Public. Health Nutr. 2011, 14, 1222–1228. [Google Scholar] [CrossRef]
  33. Appel, L.J.; Moore, T.J.; Obarzanek, E.; Vollmer, W.M.; Svetkey, L.P.; Sacks, F.M.; Bray, G.A.; Vogt, T.M.; Cutler, J.A.; Windhauser, M.M.; et al. A clinical trial of the effects of dietary patterns on blood pressure. DASH Collaborative Research Group. N. Engl. J. Med. 1997, 336, 1117–1124. [Google Scholar] [CrossRef]
  34. Moore, T.J.; Conlin, P.R.; Ard, J.; Svetkey, L.P. DASH (Dietary Approaches to Stop Hypertension) diet is effective treatment for stage 1 isolated systolic hypertension. Hypertension 2001, 38, 155–158. [Google Scholar] [CrossRef] [PubMed]
  35. Estruch, R.; Ros, E.; Salas-Salvadó, J.; Covas, M.I.; Corella, D.; Arós, F.; Gómez-Gracia, E.; Ruiz-Gutiérrez, V.; Fiol, M.; Lapetra, J.; et al. Primary Prevention of Cardiovascular Disease with a Mediterranean Diet Supplemented with Extra-Virgin Olive Oil or Nuts. N. Engl. J. Med. 2018, 378, e34. [Google Scholar] [CrossRef] [PubMed]
  36. Kobayashi, M.; Sasazuki, S.; Shimazu, T.; Sawada, N.; Yamaji, T.; Iwasaki, M.; Mizoue, T.; Tsugane, S. Association of dietary diversity with total mortality and major causes of mortality in the Japanese population: JPHC study. Eur. J. Clin. Nutr. 2020, 74, 54–66. [Google Scholar] [CrossRef] [PubMed]
  37. Kurotani, K.; Akter, S.; Kashino, I.; Goto, A.; Mizoue, T.; Noda, M.; Sasazuki, S.; Sawada, N.; Tsugane, S. Quality of diet and mortality among Japanese men and women: Japan Public Health Center based prospective study. BMJ 2016, 352, i1209. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Imputation with MDS and random forest with 4 steps. Original matrix and distance matrix are determined by the numbers of foods (N) and nutrient value/kcal (M). In MDS matrix U, we set K as M in i and i’th objects, respectively.
Figure 1. Imputation with MDS and random forest with 4 steps. Original matrix and distance matrix are determined by the numbers of foods (N) and nutrient value/kcal (M). In MDS matrix U, we set K as M in i and i’th objects, respectively.
Nutrients 15 05006 g001
Figure 2. Mapping by t-SNE (with all nutrients). This figure shows the mapping based on all nutrients. The locations indicated by red numbers are the median of each food group: 1: cereals; 2: potatoes and starches; 3: sugars and sweeteners; 4: pulses; 5: nuts and seeds; 6: vegetables; 7: fruits; 8: mushrooms; 9: algae; 10: fish, mollusks, and crustaceans; 11: meat; 12: eggs; 13: milk and milk products; 14: fats and oils; 15: confectionaries; 16: beverages; and 17: seasonings and spices.
Figure 2. Mapping by t-SNE (with all nutrients). This figure shows the mapping based on all nutrients. The locations indicated by red numbers are the median of each food group: 1: cereals; 2: potatoes and starches; 3: sugars and sweeteners; 4: pulses; 5: nuts and seeds; 6: vegetables; 7: fruits; 8: mushrooms; 9: algae; 10: fish, mollusks, and crustaceans; 11: meat; 12: eggs; 13: milk and milk products; 14: fats and oils; 15: confectionaries; 16: beverages; and 17: seasonings and spices.
Nutrients 15 05006 g002
Figure 3. Effects of each nutrient in Figure 2 (with all nutrients). Effects of each nutrient in Figure 2 are shown. PROT: protein, FAT: fat, FASAT: saturated fatty acid, FAPUN3: n-3 polyunsaturated fatty acid, FAPUN6: n-6 polyunsaturated fatty acid, FIB: dietary fiber, CHOCDF: carbohydrate, NA: sodium, K: potassium, CA: calcium, MG: magnesium, P: phosphorus, FE: iron, ZN: zinc, CU: copper, MN: manganese, VITA_RAE: vitamin A (retinol activity equivalents), VITD: vitamin D, TOCPHA: vitamin E (alpha-tocopherol), VITK: vitamin K, THIA: vitamin B1, RIBF: vitamin B2, NE: niacin equivalent, VITB6A: vitamin B6, VITB12: vitamin B12, FOL: folic acid, PANTAC: pantothenic acid, VITC: vitamin C.
Figure 3. Effects of each nutrient in Figure 2 (with all nutrients). Effects of each nutrient in Figure 2 are shown. PROT: protein, FAT: fat, FASAT: saturated fatty acid, FAPUN3: n-3 polyunsaturated fatty acid, FAPUN6: n-6 polyunsaturated fatty acid, FIB: dietary fiber, CHOCDF: carbohydrate, NA: sodium, K: potassium, CA: calcium, MG: magnesium, P: phosphorus, FE: iron, ZN: zinc, CU: copper, MN: manganese, VITA_RAE: vitamin A (retinol activity equivalents), VITD: vitamin D, TOCPHA: vitamin E (alpha-tocopherol), VITK: vitamin K, THIA: vitamin B1, RIBF: vitamin B2, NE: niacin equivalent, VITB6A: vitamin B6, VITB12: vitamin B12, FOL: folic acid, PANTAC: pantothenic acid, VITC: vitamin C.
Nutrients 15 05006 g003
Figure 4. Mapping by t-SNE (without PFC). This figure shows mapping based on the nutrients without PFC. The location shown by red numbers is the median of each food group: 1: cereals; 2: potatoes and starches; 3: sugars and sweeteners; 4: pulses; 5: nuts and seeds; 6: vegetables; 7: fruits; 8: mushrooms; 9: algae; 10: fish, mollusks, and crustaceans; 11: meat; 12: eggs; 13: milk and milk products; 14: fats and oils; 15: confectionaries; 16: beverages; and 17: seasonings and spices.
Figure 4. Mapping by t-SNE (without PFC). This figure shows mapping based on the nutrients without PFC. The location shown by red numbers is the median of each food group: 1: cereals; 2: potatoes and starches; 3: sugars and sweeteners; 4: pulses; 5: nuts and seeds; 6: vegetables; 7: fruits; 8: mushrooms; 9: algae; 10: fish, mollusks, and crustaceans; 11: meat; 12: eggs; 13: milk and milk products; 14: fats and oils; 15: confectionaries; 16: beverages; and 17: seasonings and spices.
Nutrients 15 05006 g004
Figure 5. Effects of each nutrient in Figure 4 (without PFC). Effects of each nutrient in Figure 4 are shown. FASAT: saturated fatty acid, FAPUN3: n-3 polyunsaturated fatty acid, FAPUN6: n-6 polyunsaturated fatty acid, FIB: dietary fiber, NA: sodium, K: potassium, CA: calcium, MG: magnesium, P: phosphorus, FE: iron, ZN: zinc, CU: copper, MN: manganese, VITA_RAE: vitamin A (retinol activity equivalents), VITD: vitamin D, TOCPHA: vitamin E (alpha-tocopherol), VITK: vitamin K, THIA: vitamin B1, RIBF: vitamin B2, NE: niacin equivalent, VITB6A: vitamin B6, VITB12: vitamin B12, FOL: folic acid, PANTAC: pantothenic acid, VITC: vitamin C.
Figure 5. Effects of each nutrient in Figure 4 (without PFC). Effects of each nutrient in Figure 4 are shown. FASAT: saturated fatty acid, FAPUN3: n-3 polyunsaturated fatty acid, FAPUN6: n-6 polyunsaturated fatty acid, FIB: dietary fiber, NA: sodium, K: potassium, CA: calcium, MG: magnesium, P: phosphorus, FE: iron, ZN: zinc, CU: copper, MN: manganese, VITA_RAE: vitamin A (retinol activity equivalents), VITD: vitamin D, TOCPHA: vitamin E (alpha-tocopherol), VITK: vitamin K, THIA: vitamin B1, RIBF: vitamin B2, NE: niacin equivalent, VITB6A: vitamin B6, VITB12: vitamin B12, FOL: folic acid, PANTAC: pantothenic acid, VITC: vitamin C.
Nutrients 15 05006 g005
Table 1. Number of foods in food groups.
Table 1. Number of foods in food groups.
Food GroupNumber of Foods
Before Data ImputationAfter Data Imputation
All DataComplete Values/100 kcal
1 Cereals205145189
2 Potatoes and starches703760
3 Sugars and sweeteners30017
4 Pulses10874105
5 Nuts and seeds463846
6 Vegetables401246381
7 Fruits18379145
8 Mushrooms554455
9 Algae573956
10 Fish, mollusks, and crustaceans453377453
11 Meat310289310
12 Eggs232223
13 Milk and milk products594358
14 Fats and oils342831
15 Confectionaries185105176
16 Beverages611223
17 Seasonings and spices1484193
Total242816192221
This table shows the number of foods in the food group before and after data imputation. The original food tables contained 2428 foods (“all data”) and 1619 foods without missing values (“complete values”). After data imputation, values of 2221 foods per 100 kcal were available. “All data” means the number of foods in the original food tables and “complete values” means the number of foods in the original food without missing values.
Table 2. Number of foods in each nutrient group (the number of foods and percentage of total foods reported in each nutrient of the Standard Tables of Food Composition in Japan 2020).
Table 2. Number of foods in each nutrient group (the number of foods and percentage of total foods reported in each nutrient of the Standard Tables of Food Composition in Japan 2020).
Nutrient/EnergyNumber of FoodsPercentage of Missing Values (%)Selected in This Work
Energy24290
Water24290
Protein24270
Dietary FatsFat24270
Saturated fatty acid178726
n-6 PUFA177727
n-3 PUFA178327
CarbohydratesCarbohydrate24280
Dietary fiber23214
VitaminsFat-solubleVitamin A24051
Vitamin D23404
Vitamin E23204
Vitamin K22428
Water-SolubleVitamin B124210
Vitamin B224240
Niacin equivalent24280
Vitamin B623872
Vitamin B1223414
Folic acid24021
Pantothenic acid23902
Biotin124249
Vitamin C23981
MineralsMacroSodium24250
Potassium24260
Calcium24260
Magnesium24180
Phosphorus24260
MicroIron24260
Zinc24151
Copper24151
Manganese22806
Iodine123849
Selenium124949
Chromium124849
Molybdenum123849
Table 3. Comparison of the actual and the predicted food groups.
Table 3. Comparison of the actual and the predicted food groups.
Predictive Food GroupMR (%)
1234567891011121314151617
Actual food group1178000001000000010006
21560000100000002007
302130000000000020024
400010101000100001014
512013811000001010017
625010366410000000024
7141023130200000000210
810000614501000010018
902000602460000000018
1020011101144320000012
11100000000112980000004
120000000000022001004
1310000010022045160022
140000000001100290006
1563100010000020163007
1600010610000000112148
1720020710011100117519
This table shows the number of foods that were classified correctly or incorrectly using the k-NN method. The food groups included: 1: cereals; 2, potatoes and starches; 3, sugars and sweeteners; 4, pulses; 5, nuts and seeds; 6, vegetables; 7, fruits; 8, mushrooms; 9, algae; 10, fish; mollusks and crustaceans; 11, meat; 12, eggs; 13, milk and milk products; 14, fats and oils; 15confectionaries; 16, beverages; 17, seasonings and spices. MR: the misclassification rate for each food group.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wakayama, R.; Takasugi, S.; Honda, K.; Kanaya, S. Application of a Two-Dimensional Mapping-Based Visualization Technique: Nutrient-Value-Based Food Grouping. Nutrients 2023, 15, 5006. https://doi.org/10.3390/nu15235006

AMA Style

Wakayama R, Takasugi S, Honda K, Kanaya S. Application of a Two-Dimensional Mapping-Based Visualization Technique: Nutrient-Value-Based Food Grouping. Nutrients. 2023; 15(23):5006. https://doi.org/10.3390/nu15235006

Chicago/Turabian Style

Wakayama, Ryota, Satoshi Takasugi, Keiko Honda, and Shigehiko Kanaya. 2023. "Application of a Two-Dimensional Mapping-Based Visualization Technique: Nutrient-Value-Based Food Grouping" Nutrients 15, no. 23: 5006. https://doi.org/10.3390/nu15235006

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop