Mapping the Analytical Landscape of Gene–Diet Interactions in Epidemiology: From Classical Models to Causal and Multi-Omics Frameworks
Abstract
1. Introduction
2. Analytical Landscape of Gene–Diet Interactions in Epidemiology
2.1. Classical Statistical Models
2.2. Efficient Study Designs
2.3. Dietary Assessment and Measurement Error
2.4. Dietary Patterns, Mixture Models, and Non-Linear Methods
2.5. Genome-Wide, High-Dimensional, and Polygenic Approaches
2.6. Causal Inference Frameworks
2.7. Multi-Omics Integration
2.8. Machine Learning
3. Discussion
4. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Dietrich, S.; Jacobs, S.; Zheng, J.; Meidtner, K.; Schwingshackl, L.; Schulze, M.B. Gene-lifestyle interaction on risk of type 2 diabetes: A systematic review. Obes. Rev. 2019, 20, 1557–1571. [Google Scholar] [CrossRef]
- Sekar, P.; Ventura, E.F.; Dhanapal, A.C.T.A.; Cheah, E.S.G.; Loganathan, A.; Quen, P.L.; Appukutty, M.; Taslim, N.A.; Hardinsyah, H.; Md Noh, M.F.; et al. Gene-diet interactions on metabolic disease-related outcomes in Southeast Asian populations: A systematic review. Nutrients 2023, 15, 2948. [Google Scholar] [CrossRef]
- Favara, G.; Maugeri, A.; Magnano San Lio, R.; Barchitta, M.; Agodi, A. Exploring gene-diet interactions for mother-child health: A systematic review of epidemiological studies. Nutrients 2024, 16, 994. [Google Scholar] [CrossRef] [PubMed]
- McAllister, K.; Mechanic, L.E.; Amos, C.; Aschard, H.; Blair, I.A.; Chatterjee, N.; Conti, D.; Gauderman, W.J.; Hsu, L.; Hutter, C.M.; et al. Current challenges and new opportunities for gene-environment interaction studies of complex diseases. Am. J. Epidemiol. 2017, 186, 753–761. [Google Scholar] [CrossRef] [PubMed]
- Gauderman, W.J.; Mukherjee, B.; Aschard, H.; Hsu, L.; Lewinger, J.P.; Patel, C.J.; Witte, J.S.; Amos, C.; Tai, C.G.; Conti, D.; et al. Update on the state of the science for analytical methods for gene-environment interactions. Am. J. Epidemiol. 2017, 186, 762–770. [Google Scholar] [CrossRef]
- Mullins, V.A.; Bresette, W.; Johnstone, L.; Hallmark, B.; Chilton, F.H. Genomics in personalized nutrition: Can you “eat for your genes”? Nutrients 2020, 12, 3118. [Google Scholar] [CrossRef]
- Singar, S.; Nagpal, R.; Arjmandi, B.H.; Akhavan, N.S. Personalized nutrition: Tailoring dietary recommendations through genetic insights. Nutrients 2024, 16, 2673. [Google Scholar] [CrossRef]
- Aschard, H. A perspective on interaction effects in genetic association studies. Genet. Epidemiol. 2016, 40, 678–688. [Google Scholar] [CrossRef] [PubMed]
- Thomas, D. Gene-environment-wide association studies: Emerging approaches. Nat. Rev. Genet. 2010, 11, 259–272. [Google Scholar] [CrossRef]
- Pan, Z.; Liu, X.; Li, Y.; Wan, X.; Wu, Y.; Wu, F.; Zhang, Y.; Jiao, J. Effect of diet quality and genetic predisposition on hemoglobin A1c and type 2 diabetes risk: Gene-diet interaction analysis of 357,419 individuals. Diabetes Care 2021, 44, 2470–2479. [Google Scholar] [CrossRef]
- Wang, T.; Heianza, Y.; Sun, D.; Huang, T.; Ma, W.; Rimm, E.B.; Manson, J.E.; Hu, F.B.; Willett, W.C.; Qi, L. Improving adherence to healthy dietary patterns, genetic risk, and long-term weight gain: Gene-diet interaction analysis in two prospective cohort studies. BMJ 2018, 360, j5644. [Google Scholar] [CrossRef] [PubMed]
- Li, S.X.; Imamura, F.; Ye, Z.; Schulze, M.B.; Zheng, J.; Ardanaz, E.; Arriola, L.; Boeing, H.; Dow, C.; Fagherazzi, G.; et al. Interaction between genes and macronutrient intake on the risk of developing type 2 diabetes: Systematic review and findings from EPIC-InterAct. Am. J. Clin. Nutr. 2017, 106, 263–275. [Google Scholar] [CrossRef]
- Cole, J.B.; Florez, J.C.; Hirschhorn, J.N. Comprehensive genomic analysis of dietary habits in UK Biobank identifies hundreds of genetic associations and a causal role for obesity on diet composition. Nat. Commun. 2020, 11, 1467. [Google Scholar] [CrossRef]
- Larsson, S.C. Mendelian randomization as a tool for causal inference in human nutrition and metabolism. Curr. Opin. Lipidol. 2020, 31, 1–8. [Google Scholar] [CrossRef]
- Davey Smith, G.; Hemani, G. Mendelian randomization: Genetic anchors for causal inference in epidemiological studies. Hum. Mol. Genet. 2014, 23, R89–R98. [Google Scholar] [CrossRef]
- Knol, M.J.; VanderWeele, T.J. Recommendations for presenting analyses of effect modification and interaction. Int. J. Epidemiol. 2012, 41, 514–520. [Google Scholar] [CrossRef]
- Andersson, T.; Alfredsson, L.; Källberg, H.; Zdravkovic, S.; Ahlbom, A. Calculating measures of biological interaction. Eur. J. Epidemiol. 2005, 20, 575–579. [Google Scholar] [CrossRef]
- Willett, W.C.; Stampfer, M.J. Total energy intake: Implications for epidemiologic analyses. Am. J. Epidemiol. 1986, 124, 17–27. [Google Scholar] [CrossRef]
- Leite, M.L.C. Applying compositional data methodology to nutritional epidemiology. Stat. Methods Med. Res. 2016, 25, 3057–3065. [Google Scholar] [CrossRef] [PubMed]
- Zhao, J.; Li, Z.; Gao, Q.; Zhao, H.; Chen, S.; Huang, L.; Wang, W.; Wang, T. A review of statistical methods for dietary pattern analysis. Nutr. J. 2021, 20, 37. [Google Scholar] [CrossRef] [PubMed]
- Schulz, C.A.; Oluwagbemigun, K.; Nöthlings, U. Advances in dietary pattern analysis in nutritional epidemiology. Eur. J. Nutr. 2021, 60, 4115–4130. [Google Scholar] [CrossRef]
- Price, A.L.; Patterson, N.J.; Plenge, R.M.; Weinblatt, M.E.; Shadick, N.A.; Reich, D. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006, 38, 904–909. [Google Scholar] [CrossRef]
- Murcray, C.E.; Lewinger, J.P.; Gauderman, W.J. Gene-environment interaction in genome-wide association studies: A two-step approach. Am. J. Epidemiol. 2009, 169, 219–226. [Google Scholar] [CrossRef]
- Moore, R.; Casale, F.P.; Bonder, M.J.; Horta, D.; Franke, L.; Barroso, I.; Stegle, O. A linear mixed-model approach to study multivariate gene-environment interactions. Nat. Genet. 2019, 51, 180–186. [Google Scholar] [CrossRef] [PubMed]
- Dahl, A.; Nguyen, K.; Cai, N.; Gandal, M.J.; Flint, J.; Zaitlen, N. A robust method uncovers significant context-specific heritability in diverse complex traits. Am. J. Hum. Genet. 2020, 106, 71–91. [Google Scholar] [CrossRef] [PubMed]
- Lawlor, D.A.; Harbord, R.M.; Sterne, J.A.C.; Timpson, N.; Davey Smith, G. Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology. Stat. Med. 2008, 27, 1133–1163. [Google Scholar] [CrossRef] [PubMed]
- Moccia, C.; Moirano, G.; Popović, M.; Pizzi, C.; Fariselli, P.; Richiardi, L.; Ekstrøm, C.T.; Maule, M. Machine learning in causal inference for epidemiology. Eur. J. Epidemiol. 2024, 39, 1097–1108. [Google Scholar] [CrossRef]
- Argelaguet, R.; Velten, B.; Arnol, D.; Dietrich, S.; Zenz, T.; Marioni, J.C.; Buettner, F.; Huber, W.; Stegle, O. Multi-omics factor analysis—A framework for unsupervised integration of multi-omics data sets. Mol. Syst. Biol. 2018, 14, e8124. [Google Scholar] [CrossRef]
- Khorraminezhad, L.; Leclercq, M.; Droit, A.; Bilodeau, J.F.; Rudkowska, I. Statistical and machine-learning analyses in nutritional genomics studies. Nutrients 2020, 12, 3140. [Google Scholar] [CrossRef]
- Collins, G.S.; Reitsma, J.B.; Altman, D.G.; Moons, K.G.M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. Ann. Intern. Med. 2015, 162, 55–63. [Google Scholar] [CrossRef]
- Wolff, R.F.; Moons, K.G.M.; Riley, R.D.; Whiting, P.F.; Westwood, M.; Collins, G.S.; Reitsma, J.B.; Kleijnen, J.; Mallett, S. PROBAST: A tool to assess risk of bias and applicability of prediction model studies. Ann. Intern. Med. 2019, 170, 51–58. [Google Scholar] [CrossRef] [PubMed]
- Little, J.; Higgins, J.P.T.; Ioannidis, J.P.A.; Moher, D.; Gagnon, F.; von Elm, E.; Khoury, M.J.; Cohen, B.; Davey-Smith, G.; Grimshaw, J.; et al. STrengthening the REporting of Genetic Association studies (STREGA): An extension of the STROBE statement. PLoS Med. 2009, 6, e22. [Google Scholar] [CrossRef]
- von Elm, E.; Altman, D.G.; Egger, M.; Pocock, S.J.; Gøtzsche, P.C.; Vandenbroucke, J.P.; Initiative, S. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: Guidelines for reporting observational studies. Lancet 2007, 370, 1453–1457. [Google Scholar] [CrossRef]
- Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
- Storey, J.D.; Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA 2003, 100, 9440–9445. [Google Scholar] [CrossRef]
- Scott, J.G.; Berger, J.O. Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem. Ann. Stat. 2010, 38, 2587–2619. [Google Scholar] [CrossRef]
- Ročková, V.; George, E.I. The Spike-and-Slab LASSO. J. Am. Stat. Assoc. 2018, 113, 431–444. [Google Scholar] [CrossRef]
- Carvalho, C.M.; Polson, N.G.; Scott, J.G. The horseshoe estimator for sparse signals. Biometrika 2010, 97, 465–480. [Google Scholar] [CrossRef]
- Ren, J.; Zhou, F.; Li, X.; Ma, S.; Jiang, Y.; Wu, C. Robust Bayesian variable selection for gene-environment interactions. Biometrics 2023, 79, 684–694. [Google Scholar] [CrossRef]
- Wakefield, J. Reporting and interpretation in genome-wide association studies. Int. J. Epidemiol. 2008, 37, 641–653. [Google Scholar] [CrossRef]
- Freedman, L.S.; Schatzkin, A.; Midthune, D.; Kipnis, V. Dealing with dietary measurement error in nutritional cohort studies. J. Natl. Cancer Inst. 2011, 103, 1086–1092. [Google Scholar] [CrossRef]
- Carroll, R.J.; Ruppert, D.; Stefanski, L.A.; Crainiceanu, C.M. Measurement Error in Nonlinear Models: A Modern Perspective, 2nd ed.; Chapman & Hall/CRC: Boca Raton, FL, USA, 2006. [Google Scholar]
- Syring, N.; Martin, R. Calibrating general posterior credible regions. Biometrika 2019, 106, 479–486. [Google Scholar] [CrossRef]
- Kuo, L.; Tan, Y.; Wu, Y.; Qin, X.; Gong, H.; Zhao, Y.; Wu, T.; Chen, D.; Wang, M.; Wang, J.; et al. Diet-gene interaction between fruit intake and CMIP rs2925979 polymorphism in relation to type 2 diabetes: A family-based study in Northern China. Nutrients 2025, 17, 1789. [Google Scholar] [CrossRef]
- Devlin, B.; Roeder, K. Genomic control for association studies. Biometrics 1999, 55, 997–1004. [Google Scholar] [CrossRef] [PubMed]
- Kraft, P.; Yen, Y.C.; Stram, D.O.; Morrison, J.; Gauderman, W.J. Exploiting gene-environment interaction to detect genetic associations. Hum. Hered. 2007, 63, 111–119. [Google Scholar] [CrossRef]
- Kipnis, V.; Subar, A.F.; Midthune, D.; Freedman, L.S.; Ballard-Barbash, R.; Troiano, R.P.; Bingham, S.; Schoeller, D.A.; Schatzkin, A.; Carroll, R.J. Structure of dietary measurement error: Results of the OPEN biomarker study. Am. J. Epidemiol. 2003, 158, 14–21. [Google Scholar] [CrossRef]
- Subar, A.F.; Freedman, L.S.; Tooze, J.A.; Kirkpatrick, S.I.; Boushey, C.; Neuhouser, M.L.; Thompson, F.E.; Potischman, N.; Guenther, P.M.; Tarasuk, V.; et al. Addressing current criticism regarding the value of self-report dietary data. J. Nutr. 2015, 145, 2639–2645. [Google Scholar] [CrossRef]
- Lachat, C.; Hawwash, D.; Ocké, M.C.; Berg, C.; Forsum, E.; Hörnell, A.; Larsson, C.; Sonestedt, E.; Wirfält, E.; Åkesson, A.; et al. Strengthening the Reporting of Observational Studies in Epidemiology–Nutritional Epidemiology (STROBE-nut): An Extension of the STROBE Statement. PLoS Med. 2016, 13, e1002036. [Google Scholar] [CrossRef] [PubMed]
- Bailey, R.L. Overview of dietary assessment methods for measuring intakes of foods, beverages, and dietary supplements in research studies. Curr. Opin. Biotechnol. 2021, 70, 91–96. [Google Scholar] [CrossRef] [PubMed]
- Bai, J.; Bouwknegt, D.; Weersma, R.K.; Dijkstra, G.; van der Sloot, K.W.J.; Festen, E.A.M. Gene-environment interactions in inflammatory bowel disease: A systematic review of human epidemiologic studies. J. Crohns Colitis 2025, 19, jjaf061. [Google Scholar] [CrossRef]
- Tooze, J.A.; Kipnis, V.; Buckman, D.W.; Carroll, R.J.; Freedman, L.S.; Guenther, P.M.; Krebs-Smith, S.M.; Subar, A.F.; Dodd, K.W. A mixed-effects model approach for estimating the distribution of usual intake of nutrients: The NCI method. Stat. Med. 2010, 29, 2857–2868. [Google Scholar] [CrossRef]
- Shavelson, R.J.; Webb, N.M. Generalizability Theory: A Primer; SAGE Publications: Newbury Park, CA, USA, 1991. [Google Scholar]
- Brennan, R.L. Generalizability Theory; Springer: New York, NY, USA, 2001. [Google Scholar]
- Cullen, K.W.; Watson, K.; Himes, J.H.; Baranowski, T.; Rochon, J.; Waclawiw, M.; Sun, W.; Stevens, M.; Slawson, D.L.; Matheson, D.; et al. Evaluation of quality control procedures for 24-h dietary recalls: Results from the Girls health Enrichment Multisite Studies. Prev. Med. 2004, 38, S14–S23. [Google Scholar] [CrossRef]
- Stote, K.S.; Radecki, S.V.; Moshfegh, A.J.; Ingwersen, L.A.; Baer, D.J. The number of 24 h dietary recalls using the US Department of Agriculture’s automated multiple-pass method required to estimate nutrient intake in overweight and obese adults. Public Health Nutr. 2011, 14, 1736–1742. [Google Scholar] [CrossRef] [PubMed]
- Hu, F.B. Dietary pattern analysis: A new direction in nutritional epidemiology. Curr. Opin. Lipidol. 2002, 13, 3–9. [Google Scholar] [CrossRef]
- Carrico, C.; Gennings, C.; Wheeler, D.C.; Factor-Litvak, P. Characterization of weighted quantile sum regression for highly correlated data in a risk analysis setting. J. Agric. Biol. Environ. Stat. 2015, 20, 100–120. [Google Scholar] [CrossRef]
- Keil, A.P.; Buckley, J.P.; O’Brien, K.M.; Ferguson, K.K.; Zhao, S.; White, A.J. A quantile-based g-computation approach to addressing the effects of exposure mixtures. Environ. Health Perspect. 2020, 128, 047004. [Google Scholar] [CrossRef]
- Bobb, J.F.; Valeri, L.; Claus Henn, B.; Christiani, D.C.; Wright, R.O.; Mazumdar, M.; Godleski, J.J.; Coull, B.A. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics 2015, 16, 493–508. [Google Scholar] [CrossRef] [PubMed]
- Taylor, R.M.J.; Moore, J.A.; Griffiths, A.; Cousins, A.L.; Young, H.A. Unveiling dietary complexity: A scoping review and reporting guidance for network analysis in dietary pattern research. Nutrients 2025, 17, 3261. [Google Scholar] [CrossRef] [PubMed]
- Zhu, X.; Yang, Y.; Lorincz-Comi, N.; Li, G.; Bentley, A.R.; de Vries, P.S.; Brown, M.; Morrison, A.C.; Rotimi, C.N.; Gauderman, W.J.; et al. An approach to identify gene-environment interactions and reveal new biological insight in complex traits. Nat. Commun. 2024, 15, 3385. [Google Scholar] [CrossRef]
- Ma, Y.; Zhao, Y.; Zhang, J.F.; Bi, W. Efficient and accurate framework for genome-wide gene-environment interaction analysis in large-scale biobanks. Nat. Commun. 2025, 16, 3064. [Google Scholar] [CrossRef]
- Stephens, M. False discovery rates: A new deal. Biostatistics 2017, 18, 275–294. [Google Scholar] [CrossRef]
- Papadimitriou, N.; Kim, A.; Kawaguchi, E.S.; Morrison, J.; Diez-Obrero, V.; Albanes, D.; Berndt, S.I.; Bézieau, S.; Bien, S.A.; Bishop, D.T. Genome-wide interaction study of dietary intakes of fibre, fruits and vegetables and risk of colorectal cancer. eBioMedicine 2024, 102, 105146. [Google Scholar] [CrossRef]
- Westerman, K.E.; Walker, M.E.; Gaynor, S.M.; Wessel, J.; DiCorpo, D.; Ma, J.; Alonso, A.; Aslibekyan, S.; Baldridge, A.S.; Bertoni, A.G.; et al. Investigating gene-diet interactions impacting the association between macronutrient intake and glycemic traits. Diabetes 2023, 72, 653–665. [Google Scholar] [CrossRef]
- Lee, J.H.; Hong, K.W.; Park, B.; Choi, J.E.; Jung, D. Gene-diet interactions in high-density lipoprotein cholesterol-related polymorphisms and cardiovascular disease risk: Insights from the Korean Genome and Epidemiology Study. Nutrients 2025, 17, 778. [Google Scholar] [CrossRef]
- Chatterjee, N.; Shi, J.; García-Closas, M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat. Rev. Genet. 2016, 17, 392–406. [Google Scholar] [CrossRef] [PubMed]
- Martin, A.R.; Kanai, M.; Kamatani, Y.; Okada, Y.; Neale, B.M.; Daly, M.J. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 2019, 51, 584–591. [Google Scholar] [CrossRef] [PubMed]
- Janssens, A.C.; Ioannidis, J.P.; Bedrosian, S.; Boffetta, P.; Dolan, S.M.; Dowling, N.; Fortier, I.; Freedman, A.N.; Grimshaw, J.M.; Gulcher, J.; et al. Strengthening the reporting of genetic risk prediction studies (GRIPS): Explanation and elaboration. Eur. J. Clin. Investig. 2011, 41, 1010–1035. [Google Scholar] [CrossRef]
- Bowden, J.; Davey Smith, G.; Burgess, S. Mendelian randomization with invalid instruments: Effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015, 44, 512–525. [Google Scholar] [CrossRef] [PubMed]
- Qi, G.; Chatterjee, N. Mendelian randomization analysis using mixture models for robust and efficient estimation of causal effects. Nat. Commun. 2019, 10, 1941. [Google Scholar] [CrossRef]
- Verbanck, M.; Chen, C.Y.; Neale, B.; Do, R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 2018, 50, 693–698. [Google Scholar] [CrossRef]
- Burgess, S.; Foley, C.N.; Allara, E.; Staley, J.R.; Howson, J.M.M. A robust and efficient method for Mendelian randomization with hundreds of genetic variants. Nat. Commun. 2020, 11, 376. [Google Scholar] [CrossRef]
- Sanderson, E.; Davey Smith, G.; Windmeijer, F.; Bowden, J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int. J. Epidemiol. 2019, 48, 713–727. [Google Scholar] [CrossRef] [PubMed]
- Zhao, Q.; Chen, Y.; Wang, J.; Small, D.S. Powerful three-sample genome-wide design and robust statistical inference in summary-data Mendelian randomization. Int. J. Epidemiol. 2019, 48, 1478–1492. [Google Scholar] [CrossRef] [PubMed]
- Yang, Q.; Li, M.; Chen, P.; Dou, N.; Liu, M.; Lu, P.; Yu, C. Systematic evaluation of the impact of a wide range of dietary habits on myocardial infarction: A two-sample Mendelian randomization analysis. J. Am. Heart Assoc. 2025, 14, e035936. [Google Scholar] [CrossRef]
- Guo, J.; Xi, Z.; Wang, M.; Yang, X. Dietary patterns and risk of ischemic stroke: A two-sample Mendelian randomization study. Medicine 2025, 104, e45182. [Google Scholar] [CrossRef] [PubMed]
- Imai, K.; Keele, L.; Tingley, D. A general approach to causal mediation analysis. Psychol. Methods 2010, 15, 309–334. [Google Scholar] [CrossRef]
- VanderWeele, T.J. Explanation in Causal Inference: Methods for Mediation and Interaction; Oxford University Press: New York, NY, USA, 2015. [Google Scholar]
- Obón-Santacana, M.; Moratalla-Navarro, F.; Guinó, E.; Carreras-Torres, R.; Díez-Obrero, V.; Bars-Cortina, D.; Ibáñez-Sanz, G.; Rodríguez-Alonso, L.; Mata, A.; García-Rodríguez, A.; et al. Diet impacts on gene expression in healthy colon tissue: Insights from the BarcUVa-Seq study. Nutrients 2024, 16, 3131. [Google Scholar] [CrossRef]
- Shen, R.; Olshen, A.B.; Ladanyi, M. Integrative clustering of multiple genomic data types using a joint latent variable model. Bioinformatics 2009, 25, 2906–2912. [Google Scholar] [CrossRef]
- Singh, A.; Shannon, C.P.; Gautier, B.; Rohart, F.; Vacher, M.; Tebbutt, S.J.; Lê Cao, K.A. DIABLO: An integrative approach for identifying key molecular drivers from multi-omics assays. Bioinformatics 2019, 35, 3055–3062. [Google Scholar] [CrossRef]
- Langfelder, P.; Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 2008, 9, 559. [Google Scholar] [CrossRef]
- Li, K.; Tang, H.; Wang, Y.; Wang, X. Identification of hub genes involved in the pathogenesis of diabetic nephropathy: A multi-omics study integrating machine learning, Mendelian randomization and mediation analysis. Diabetes Obes. Metab. 2025, 27, 4927–4941. [Google Scholar] [CrossRef] [PubMed]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Chipman, H.A.; George, E.I.; McCulloch, R.E. BART: Bayesian additive regression trees. Ann. Appl. Stat. 2010, 4, 266–298. [Google Scholar] [CrossRef]
- Ritchie, M.D.; Hahn, L.W.; Roodi, N.; Bailey, L.R.; Dupont, W.D.; Parl, F.F.; Moore, J.H. Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am. J. Hum. Genet. 2001, 69, 138–147. [Google Scholar] [CrossRef] [PubMed]
- van Houwelingen, M.L.; Zhu, Y. Identifying and predicting dietary patterns in the Dutch population using machine learning. Eur. J. Nutr. 2025, 64, 305. [Google Scholar] [CrossRef] [PubMed]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NeurIPS), Red Hook, NY, USA, 4–9 December 2017. [Google Scholar]
- Aas, K.; Jullum, M.; Løland, A. Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. Artif. Intell. 2021, 298, 103502. [Google Scholar] [CrossRef]
- Strobl, C.; Boulesteix, A.L.; Zeileis, A.; Hothorn, T. Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinform. 2007, 8, 25. [Google Scholar] [CrossRef]
- Strobl, C.; Boulesteix, A.L.; Kneib, T.; Augustin, T.; Zeileis, A. Conditional variable importance for random forests. BMC Bioinform. 2008, 9, 307. [Google Scholar] [CrossRef]
- Greenland, S. Quantifying biases in causal models: Classical confounding vs collider-stratification bias. Epidemiology 2003, 14, 300–306. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Maugeri, A. Mapping the Analytical Landscape of Gene–Diet Interactions in Epidemiology: From Classical Models to Causal and Multi-Omics Frameworks. Nutrients 2026, 18, 880. https://doi.org/10.3390/nu18060880
Maugeri A. Mapping the Analytical Landscape of Gene–Diet Interactions in Epidemiology: From Classical Models to Causal and Multi-Omics Frameworks. Nutrients. 2026; 18(6):880. https://doi.org/10.3390/nu18060880
Chicago/Turabian StyleMaugeri, Andrea. 2026. "Mapping the Analytical Landscape of Gene–Diet Interactions in Epidemiology: From Classical Models to Causal and Multi-Omics Frameworks" Nutrients 18, no. 6: 880. https://doi.org/10.3390/nu18060880
APA StyleMaugeri, A. (2026). Mapping the Analytical Landscape of Gene–Diet Interactions in Epidemiology: From Classical Models to Causal and Multi-Omics Frameworks. Nutrients, 18(6), 880. https://doi.org/10.3390/nu18060880

