Next Article in Journal
Ultra-Processed Food and Frailty: Evidence from a Prospective Cohort Study and Implications for Future Research
Previous Article in Journal
Addressing Childhood Obesity in Children in Need in Greece: Policy Implementers’ Knowledge, Perceptions and Lessons for Effective Implementation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Genome-Wide Association Study of Circulating Serum Choline, Betaine, Dimethylglycine, and Their Ratios

1
Friedman School of Nutrition Science and Policy, Tufts University, Boston, MA 02111, USA
2
Nutritional Sciences and Toxicology, University of California, Berkeley, CA 94720, USA
3
School of Medicine and Health Sciences, George Washington University, Washington, DC 20037, USA
4
Think Healthy Group, LLC, Washington, DC 20001, USA
*
Author to whom correspondence should be addressed.
Nutrients 2025, 17(16), 2630; https://doi.org/10.3390/nu17162630
Submission received: 1 July 2025 / Revised: 1 August 2025 / Accepted: 5 August 2025 / Published: 14 August 2025
(This article belongs to the Section Nutritional Epidemiology)

Abstract

Background/Objectives: Genetic variation has been thought to alter the human dietary requirement for choline and subsequent circulating levels of its metabolites betaine and dimethylglycine (DMG). The aim of this genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) associated with serum choline, betaine, and dimethylglycine (DMG) as well as choline-to-betaine and betaine-to-DMG ratios. Methods: Data from the Collaborative Study of Genes, Nutrients and Metabolites (CSGNM; n = 2402) were used to model individual associations of choline, betaine, and DMG circulating metabolites and their ratios with 680,975 SNPs, using linear regression. Models were unadjusted (model 1), adjusted for age and sex (model 2), and further adjusted for selected principal components (model 3) and B12, B9, B6, and holotranscobalamin (model 4). Statistical significance was set at p < 5.0 × 10−5. Affected SNPs in the dbSNP (database of Single Nucleotide Polymorphisms) were then identified. Results: GWAS revealed both intuitive and novel results, including the recently described SLC25A48, several intronic variants in the gene encoding LYPLAL1, and a pair of SNPs present in the intronic region of PID1 related to serum choline. SNPs related to betaine and DMG included SLCA12, BHMT, DMGDH, and additional SLC family transporters that require further validation. While exploratory, GWAS of the choline-to-betaine and betaine-to-DMG ratios revealed common targets with direct links to choline and one-carbon metabolism. Conclusions: These results suggest that metabolic handling of choline has genetic determinants not formerly recognized in the scientific literature. Replication is needed in larger cohorts due to low statistical power.

1. Introduction

An adequate supply of choline is essential for supporting its diverse physiological functions, including maintenance of hepatic, cardiovascular, and neurological function [1,2]. Choline is a critical metabolic substrate for the synthesis of the following: phosphatidylcholine, a key component of membranes; lipoproteins, cell signaling compounds; acetylcholine, a critical neurotransmitter; and betaine, an osmolyte and methyl donor that supports one-carbon metabolism. Choline’s oxidation to betaine and use as a methyl donor by betaine-homocysteine methyltransferase (BHMT) regenerates methionine from homocysteine, yielding dimethylglycine (DMG), which can be further oxidatively demethylated and contribute one-carbon units to folate for use in one-carbon metabolism to support methylation reactions as well as the synthesis of nucleotides.
Maintenance of an adequate supply of choline to tissues is supported by the endogenous synthesis of choline through the phosphatidylethanolamine N-methyltransferase pathway, but it is insufficient to maintain plasma choline and betaine levels and organ function without an exogenous source of choline in the diet. When dietary choline intakes are adequate, plasma levels of choline, betaine, and DMG can still vary considerably among healthy individuals (7–20 μM, 20–60 μM, and 2.5–7.5 μM, respectively) [3]. In general, dietary intakes of choline and betaine only minimally impact plasma levels [4], suggesting other biological factors are likely to influence metabolite homeostasis. To date, little is known about variation in metabolic kinetics that might influence plasma metabolite concentrations, including variability in choline and betaine absorption, altered distribution of metabolites between plasma and tissue compartments, altered metabolism, and/or altered excretion.
It is thought that genetic factors may contribute to the metabolic homeostasis of choline, betaine, and DMG. Several common single nucleotide polymorphisms (SNPs) in genes encoding enzymes involved in choline and one-carbon metabolism have been thought to significantly modify the odds of developing organ dysfunction under conditions of deficiency [5]. However, this body of research comprises predominantly post hoc analyses of smaller trials that have employed a candidate gene approach, and variants in these candidate genes explain only a limited degree of the variation in circulating metabolite concentrations, suggestive of untapped genetic determinants.
The emergence of studies with high-throughput genomic data as well as metabolite analysis have allowed for the discovery of novel genetic determinants of circulating metabolites. These cohorts facilitate the discovery of SNPs that influence plasma metabolite levels and additionally have the potential to uncover novel proteins involved in metabolic homeostasis. Indeed, recent genome-wide association studies (GWAS) assessing plasma choline levels have uncovered candidate transporters for the ubiquitously expressed plasma membrane transporter not previously associated with choline dynamics [6]. Candidate SNPs identified in these analyses also serve as potential modifiers of nutrient intake requirements, broadening the rationale for precision nutrition investigations. While GWAS hold great potential, results from our systematic review of genetic determinants of 42 circulating choline-related metabolites, primarily trimethylamine N-oxide (TMAO), betaine, sphingomyelins, lysophosphatidylcholines, and phosphatidylcholines, identified a need to undertake GWAS analyses in well-phenotyped cohorts that assessed choline and its metabolites using targeted, absolute quantification (as opposed to the untargeted, relative quantification values common to metabolomic data) [7]. To this end, we have undertaken a GWAS of serum choline, betaine, and DMG utilizing data from the Collaborative Study of Genes, Nutrients and Metabolites (CSGNM) to expand on the genetic determinants of circulating metabolite concentrations and facilitate the discovery of candidates for functional analyses.

2. Materials and Methods

2.1. Study Sample

The CSGNM conducted genome-wide genotyping and serum metabolite quantification on 2402 ethnically Irish participants attending Trinity College Dublin, who were recruited between February 2003 and 2004 [8]. Participants provided a non-fasting blood sample for genetic and circulating metabolite quantification. Extracted DNA was genotyped using the HumanOmni1-Quad BeadChip (Illumina, San Diego, CA, USA). Circulating choline, betaine, and DMG were reported in micromoles per liter. Participants also completed a diet, lifestyle, and demographics questionnaire, which included information on age, sex, height, weight, medical conditions, smoking, dietary habits, and consumption of alcohol, fortified foods, and dietary supplements. Participants with non-Irish grandparents, major medical problems, or missing questionnaires were excluded from the cohort. We requested the CSGNM data for the present analyses from the dbGaP (database of Genotypes and Phenotypes) (n = 2232) [9]. The study has been described in detail elsewhere [8].

2.2. Genetic Quality Control

Genetic data quality control was conducted using PLINK (version 1.9) [10]. SNPs were limited to autosomal chromosomes. SNPs and individuals with a >20% missingness rate, call rate < 98%, minor allele frequency (MAF) < 0.05, or Hardy–Weinberg equilibrium p < 1 × 10−6 were excluded from our analyses. Sex discrepancies were checked; males with a homozygosity estimate < 0.08 and females with a homozygosity estimate > 0.2 were excluded from our analyses. Relatedness was investigated by calculating identity by descent of all sample pairs. If pi-hat was >0.2, the related pairs were excluded from our analyses. Individuals who deviated more than 3 SD from the heterozygosity rate mean were additionally excluded from our analyses. The 1000 Genomes Project data served as a reference to help define genetic ancestry [11]. Multidimensional scaling (MDS) analysis was conducted using PLINK to evaluate population structure and identify ethnic outliers by anchoring individual genetic data to the 1000 Genomes reference panel. Genotype data from CSGNM and 1000 Genomes were merged after harmonizing SNP sets, aligning reference alleles, and resolving strand inconsistencies. A pairwise identity-by-state (IBS) distance matrix was computed from linkage disequilibrium-pruned SNPs. An MDS plot was created using RStudio. We then performed MDS using only the Irish samples after excluding any potential ethnic outliers. The top 10 MDS components derived from the Irish samples were then extracted. To determine component inclusion, Scree and Scree-Eigen plots were created using the factoextra R package (version 1.0.7). Due to the small sample size, imputation and fine mapping were not performed in these analyses.

2.3. Genome-Wide Association and Statistical Analysis

The distribution of serum choline was visualized with a histogram and QQ plot and quantitatively tested using the Shapiro–Wilk test. Genome-wide association analysis was performed using linear regression models to test the individual association of serum choline, betaine, and DMG as well as choline-to-betaine and betaine-to-DMG ratio concentrations (in micromoles per liter) with 680,975 SNPs, under an additive genetic model. Serum choline models were unadjusted (model 1), adjusted for age and sex (model 2), and further adjusted for selected principal components (PCs) (model 3) and B12, B9, B6, and holotranscobalamin (HoloTC) (model 4). Models 3 and 4 were employed for serum betaine, DMG, and choline-to-betaine and betaine-to-DMG ratios. λ values were assessed in all choline-to-betaine and betaine-to-DMG ratio models. The analysis was conducted in R (version 4.2.0; R Foundation for Statistical Computing) on the Tufts University secure high-performance computing cluster.
QQ plots were used to visualize deviations of the observed p-values for each SNP under the null hypothesis. p < 5.0 × 10−5 was used as the genome-wide statistical significance threshold. False discovery rate (FDR) adjustments were performed for comparison using 0.05 as the significance threshold. Significant SNPs were searched for in the National Center for Biotechnology Information dbSNP (database of Single Nucleotide Polymorphisms) [12] to identify the associated gene. LocusZoom [13] was used to create a plot displaying association signals, linkage disequilibrium, and nearby genes for relevant loci from choline model 3. Significant SNPs from models 3 and 4 were searched on the National Institutes of Health Genotype-Tissue Expression (GTEx) Portal [14] to determine whether SNPs were significantly associated with altered gene expression levels in tissues. A final p < 5.0 × 10−5 was used as the genome-wide significance threshold. FDR correction was also applied as an additional method and used to compare against SNPs meeting the p < 5.0 × 10−5 threshold, using an FDR threshold of < 0.05. Finally, we compared SNPs identified in our GWAS with those within our previously published scoping review [7], which sought to identify and summarize reported SNPs present in the scientific literature prior to these analyses.

3. Results

3.1. Participant Characteristics

A summary of participant characteristics is displayed in Table 1. The total genotyping rate was 0.99. All participants passed the 98% call rate. We removed 3095 variants with a missingness rate > 20%, 71,983 variants with a MAF < 0.05, and 1524 variants with a Hardy–Weinberg equilibrium p < 1 × 10−6 from the dataset. Thus, there are a total of 604,373 SNPs included in our analyses. Since genetic data on participant sex were not available via dbGaP, phenotypic data were utilized in the analyses. No related individuals with a pi-hat value > 0.02 were identified in the dataset. We removed 12 participants from the dataset (8 with a heterozygosity rate deviating >3 SD from the mean and 4 with missing serum choline data). An MDS plot anchored to 1000 Genomes reference data is presented in Figure 1. For component inclusion, according to the Scree plot, there was a noticeable inflection point at component 4. The Scree-Eigen plot showed the first five components had eigenvalues > 1. Thus, components 1 to 4 were included as covariates based on information from the Scree (Figure S1) and Scree-Eigen (Figure S2) plots.

3.2. Metabolite Distributions

Based on histogram, QQ plot, and Shapiro–Wilk test results for serum choline (W = 0.96, p < 2.2 × 10−16), betaine (W = 0.98, p < 2.2 × 10−16), and DMG (W = 0.90, p < 2.2 × 10−16), the distribution of each metabolite was right skewed. Therefore, log-transformations were applied to normalize the distributions.

3.3. GWAS of Serum Choline, Betaine, and DMG

All SNP and regression results from model 3 that met the p < 5.0 × 10−5 significance threshold are presented in Table 2, Table 3 and Table 4. Model 4 results and FDR comparisons are presented in Tables S1–S3. QQ plots used to visualize deviations of the observed p-values for each SNP under the null hypothesis are presented in Figures S3, S4 and S5 for choline, betaine, and DMG (model 3), respectively. QQ plots for choline, betaine, and DMG (model 4) are presented in Tables S6, S7 and S8, respectively. All results from model 3 are visualized in the circular Manhattan plot presented in Figure 2. Model 3 results for SNPs in choline and folate metabolism that have previously been identified to modify the risk of organ dysfunction in post hoc analyses of controlled feeding studies that administer choline-deficient diets (<50 mg/d) to participants are presented in Table 5. None of these SNPs reached statistical significance (p < 5.0 × 10−5). A LocusZoom plot of chromosome 5 from the choline model results is presented in Figure 3. Data from GTEx that matched genes from SNP IDs meeting the significance threshold for model 3 are presented in Tables S7, S8, S10, S11, S13 and S14.

3.4. GWAS of Serum Choline-to-Betaine and Betaine-to-DMG Ratios

Both choline-to-betaine and betaine-to-DMG ratio distributions were right skewed. Therefore, for clarity of the interpretation of results, genomic control (λ) values were checked for both the choline-to-betaine (model 3: λ = 0.982; model 4: λ = 0.97) and betaine-to-DMG (model 3: λ = 1.01; model 4: λ = 1.01) ratio models. No transformations were performed, since λ values were deemed acceptable for both ratio models. SNP and regression results from model 3 with a p < 5.0 × 10−5 significance threshold for the choline-to-betaine and betaine-to-DMG ratio models are presented in Table 6 and Table 7, respectively. Model 4 results and FDR comparisons are presented in Tables S4 and S5. QQ plots used to visualize deviations of the observed p-values for each SNP under the null hypothesis are presented in Figures S9 and S10 for choline-to-betaine and betaine-to-DMG ratios (model 3), respectively. QQ plots for choline-to-betaine and betaine-to-DMG ratios (model 4) are presented in Tables S11 and S12, respectively. Data from GTEx that matched genes from SNP IDs meeting the significance threshold for model 3 are presented in Tables S18 and S19.

3.5. SNP Literature Comparison

In our recent scoping review, we identified 577 SNPs from prior GWAS of circulating choline metabolites, of which only 28 were identified in ≥2 unique cohorts. None of these SNPs previously identified in the scientific literature met our p < 5.0 × 10−5 significance threshold; however, 2 SNPS (rs1800588 and rs261332) reached the p < 5.0 × 10−3 threshold.

4. Discussion

Although wide variability in circulating choline, betaine, and DMG is observed in generally healthy populations, determinants of this variation are poorly characterized in the scientific literature to date. The results of this GWAS indicate the potential for significant genetic determinants of these key metabolites. While our results certainly require repetition in larger cohorts, they are useful as candidate genes for functional analyses to uncover novel cellular determinants of choline handling and metabolism and for future exploration in diet-x-genotype analyses aiming to advance ‘precision nutrition’.
The idea that genetic variants modify choline metabolism is generally supported by post hoc analyses of controlled feeding trials demonstrating that when individuals are fed choline-deficient diets, the carrier status for common SNPs in choline and folate metabolism dramatically modifies the risk of displaying choline deficiency–associated organ dysfunction [5,15]. Whether such genetic variants play a role in explaining the variation in circulating choline metabolites in the general population has received limited attention, in part due to the lack of large cohorts with complete genetic and metabolite data. In our analysis, classical variants described as modifiers of choline deficiency–associated phenotypes did not emerge as determinants of plasma choline (Table 5). Rather, SNPs nearby to genes not previously linked directly to choline metabolism emerged in our analysis, with the exception of the recently described solute carrier SLC25A48 locus that encodes the mitochondrial choline importer [16,17].
Several of these novel SNPs not previously linked to choline homeostasis, particularly those with multiple SNPs, may be worthy of additional functional characterization. While the emergence of SLC25A48 in this analysis is encouraging, we note caution in assuming observed relationships represent causal links to choline biology. Several intronic variants in the gene encoding lysophospholipase 1 (LYPLAL1), a cytosolic acyl thioesterase protein reported to de-palmitoylate its target proteins, were associated with serum choline concentrations; intriguingly, LYPLAL1 has been reported to lack lysophospholipase activity, a potential avenue toward a more direct link to choline homeostasis through lysophosphatidylcholine metabolism. LYPLAL1 has not previously been associated directly with choline metabolism; given its broad tissue expression profile and the limited description of its target proteins for de-palmitoylation, its link to choline metabolism remains speculative. Variants in LYPLAL1 have been associated with other metabolic phenotypes, including waist-to-hip ratio, fat distribution, and metabolic dysfunction-associated steatotic liver disease (MASLD; previously nonalcoholic fatty liver disease or NAFLD) [18,19,20,21,22,23], and results in sex-specific body composition phenotypes in LYPLAL1 knockout mice, which suggests that this protein’s de-palmitoylation activity has broad effects on metabolic homeostasis. Consistent with such sex-specific effects in the knockout mouse, an indirect link between LYPLAL1 and choline homeostasis may result from differences in estrogen signaling secondary to altered palmitoylation of its receptor [24]. An additional pair of SNPs present in the intronic region of phosphotyrosine interaction domain containing 1 (PID1) emerged in our analysis that have also been reported to regulate key metabolic processes, including the clearance of triglyceride-rich lipoproteins as well as antagonizing insulin receptor signaling, and common variants in PID1 are associated with obesity and cardiometabolic disease. Direct links between PID1 and choline metabolism have also not been reported but might theoretically occur due to altered lipoprotein fluxes and redistribution of choline across metabolic compartments.
Our GWAS results for serum betaine and DMG, the oxidative products of choline metabolism that shunt choline toward one-carbon metabolism, revealed both intuitive and novel results worthy of exploration in repetition cohorts and for further functional validation. Top hits directly related to betaine and DMG metabolism included the following: SLC6A12, the known sodium- and chloride-dependent betaine transporter; BHMT, catalyzing the use of betaine as a methyl donor to remethylate homocysteine; and dimethylglycine dehydrogenase (DMGDH), catalyzing the use of DMG as a methyl donor in mitochondrial folate metabolism. While directly linked to choline and one-carbon metabolism, it is intriguing that few variants in these genes have yet been explored as potential modifiers of the dietary choline requirement or in the heterogeneous metabolic responses to choline and betaine supplementation. Additional SLC family transporters also emerged from our analysis, including SLC6A13 and SLC22A3, that require further functional validation for links to one-carbon metabolism.
In addition to absolute concentrations of metabolites, we explored novel ratio outcome variables (choline-to-betaine and betaine-to-DMG) given their direct metabolic interrelationships and potential to inform upon metabolic partitioning. Ratios of choline metabolites have previously been explored in small, randomized trials as well as epidemiological cohorts, but they have received little attention in GWAS investigations. While exploratory, it is notable that this approach revealed common targets, including a broader array of SLC6A12 variants, and a novel lipid droplet-associated phosphatidylserine hydrolase (LDAH), of theoretical relevance given phosphatidylcholine’s role as a precursor to phosphatidylserine. Few other direct links to choline and one-carbon metabolism emerged, questioning the added benefit of this approach; further exploration in larger cohorts is warranted.
Our results have notable strengths and limitations worth highlighting. Our results build on the emerging analyses of cohorts with both whole-genome sequencing as well as metabolomic data, facilitating the uncovering of genetic determinants of metabolic homeostasis and potentially novel genes involved in metabolic regulation. The use of CSGNM data is particularly useful due to the targeted absolute quantification of several nutritional biomarkers, whereas other cohorts use metabolomic data that have semi-quantitative analyses (e.g., untargeted NMR metabolomics). We sought to leverage these strengths to undertake an exploratory GWAS that could consider not only predictors of absolute serum concentrations but also adjust for nutrient status indicators, a critical consideration given the many nutrient–nutrient interactions across choline and one-carbon metabolism. Our analyses, however, have significant limitations, largely owing to the relatively small sample size of the CSGNM cohort and resulting limited statistical power within our analyses. The use of a p-value threshold of 5.0 × 10−5, which is common in exploratory settings, increases the risk of false positives. While our analysis contained important internal positive controls, such as our observation that variants in SLC25A48, a recently recognized mitochondrial choline importer, are associated with circulating serum choline, betaine, DMG, choline-to-betaine, and/or betaine-to-DMG, not all variants observed are necessarily true genetic determinants and may indeed be false positives. The LD of the region could be indicated by a different gene located close to the signal. The wide array of targets without clear or direct links to choline or general metabolic processes may yield potential insights into cellular determinants of choline homeostasis, but they should generally be viewed with caution until repetition in larger cohorts provide similar signals for these loci and/or other plausible physiological links emerge. Furthermore, the generalizability of our results may be limited by changing in lifestyle, environment, or medical advances over time, given the CSGNM cohort data are 20 years old. We further note that the CSGNM cohort is comprised solely of individuals of Caucasian descent, which inhibits broad extrapolation of the results. Future research should replicate or validate our findings using more recent data.

5. Conclusions

In summary, the results of this GWAS suggest that metabolic handling of choline has genetic determinants not formerly recognized in the scientific literature. However, replication is needed in larger cohorts due to low statistical power.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/nu17162630/s1: Table S1: Choline—B-vitamin adjusted GWAS results. Table S2: Betaine—B-vitamin adjusted GWAS results. Table S3: Dimethylglycine—B-vitamin adjusted GWAS results. Table S4: Choline-to-betaine ratio—B-vitamin adjusted GWAS results. Table S5: Betaine-to-dimethylglycine—B-vitamin adjusted GWAS results. Table S6: Choline adjusted GWAS results. Table S7: Choline adjusted-GTEx GWAS results. Table S8: Choline and B-vitamin adjusted-GTEx GWAS results. Table S9: Betaine adjusted GWAS results. Table S10: Betaine adjusted-GTEx GWAS results. Table S11: Betaine and B-vitamin adjusted-GTEx GWAS results. Table S12: Dimethylglycine adjusted GWAS results. Table S13: Dimethylglycine adjusted-GTEx GWAS results. Table S14: Dimethylglycine and B-vitamin adjusted-GTEx GWAS results. Table S15: Betaine-to-dimethylglycine ratio adjusted GWAS results. Table S16: Choline-to-betaine ratio adjusted GWAS results. Table S17: Choline-to-betaine and B-vitamin adjusted-GTEx GWAS results. Table S18: Betaine-to-dimethylglycine adjusted-GTEx GWAS results. Table S19: Betaine-to-dimethylglycine and B-vitamin adjusted-GTEx GWAS results. Figure S1: Scree plot. Figure S2: Scree-Eigen plot. Figure S3: QQ plot for choline GWAS. Figure S4: QQ plot for betaine GWAS. Figure S5: QQ plot for dimethylglycine GWAS. Figure S6: QQ plot for choline GWAS (b-vitamin adjusted). Figure S7: QQ plot for betaine GWAS (b-vitamin adjusted). Figure S8: QQ plot for dimethylglycine GWAS (b-vitamin adjusted). Figure S9: QQ plot for choline-to-betaine ratio GWAS. Figure S10: QQ plot for betaine-to-dimethylglycine ratio GWAS.

Author Contributions

Conceptualization, T.C.W. and M.C.; methodology, L.E.L., K.C.K., T.C.W., J.M. and M.C.; software, L.E.L., K.C.K., T.C.W., J.M. and M.C.; validation, L.E.L., K.C.K., T.C.W., J.M. and M.C.; formal analysis, L.E.L.; investigation, L.E.L., K.C.K., T.C.W., J.M. and M.C.; resources, T.C.W., J.M. and M.C.; data curation, L.E.L., K.C.K., T.C.W., J.M. and M.C.; writing—original draft preparation, L.E.L.; writing—review and editing, L.E.L., K.C.K., T.C.W., J.M. and M.C.; visualization, L.E.L., K.C.K., T.C.W., J.M. and M.C.; supervision, T.C.W. and M.C.; project administration, T.C.W. and M.C.; and funding acquisition, T.C.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was primarily supported by an investigator-initiated unrestricted educational grant from the Egg Nutrition Center to Think Healthy Group, LLC. The data used in this study were supported by the Intramural Research Programs of the National Institutes of Health, the National Human Genome Research Institute, and the Eunice Kennedy Shriver National Institute of Child Health and Development. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health and by the National Cancer Institute, the National Human Genome Research Institute, the National Heart, Lung, and Blood Institute, the National Institute on Drug Abuse, the National Institute of Mental Health, and the National Institute of Neurological Disorders and Stroke. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are available at dbGaP study accession phs000789.v1.p1.

Conflicts of Interest

K.C.K. has received past research grants unrelated to this paper, funded by the Bill and Melinda Gates Foundation and Balchem, Corp. K.C.K. has served as an independent research consultant for Androvia LifeSciences and Chromadex. He is also a consultant clinical dietitian for Precision Preventive Cardiology. T.C.W. has received competitive unrestricted research grants related from the American Pulse Association, Balchem, Corp., Egg Nutrition Center, Florida Citrus, National Dairy Council, National Pork Board, Nestlé Health Sciences, New Capstone, Inc., Oak Ridge Institute for Science and Education (ORISE) through the U.S. Army Research Institute for Environmental Medicine, The Simply Good Foods Company, and Reach Global Strategies. T.C.W. receives royalties from the Academy of Nutrition and Dietetics for editing the Health Professionals Guide to Dietary Supplements. T.C.W. has received consulting fees from the Academy of Nutrition and Dietetics and the National Pork Board. T.C.W. has received honoraria for lectures from the American Society for Nutrition, Berry Health Benefits Symposium, Tea Institute, Amazentis, and Balchem, Corp. T.C.W. has received support for attending meetings from the Academy of Nutrition and Dietetics and Brightseed, Inc. T.C.W. is on the science advisory board for AHARA, Beli, Deerland Probiotics & Enzymes, Forbes Health, National Pork Board, and Produce for Better Health Foundation; is a member of the Brightseed Bioactives Coalition; and is a past member of the Academy of Nutrition and Dietetics Working Group on Flavan-3-ols. T.C.W. is the Editor-in-Chief of the Journal of Dietary Supplements, Deputy Editor-in-Chief of the Journal of the American Nutrition Association, and Nutrition Section Editor of Annals of Medicine. T.C.W. has received in-kind nutrient analyses services from Eurofins U.S. Food and is a Senior Fellow of the Center for Magnesium Education & Research. MC has received unrestricted education grants from Egg Nutrition Center, Florida Citrus, National Dairy Council, National Pork Board, and Nestle Health Sciences through Think Healthy Group, Inc. L.E.L. and J.M. have no conflicts to disclose.

Abbreviations

The following abbreviations are used in this manuscript:
BHMTbetaine-homocysteine methyltransferase
CSGNMCollaborative Study of Genes, Nutrients and Metabolites
dbGaPdatabase of Genotypes and Phenotypes
dbSNPdatabase of Single Nucleotide Polymorphisms
DMGdimethylglycine
DMGDHdimethylglycine dehydrogenase
FDRfalse discovery rate
GTExGenotype-Tissue Expression
GWASgenome-wide association studies
HoloTCholotranscobalamin
IBSidentity-by-state
LDAHlipid droplet-associated phosphatidylserine hydrolase
MAFminor allele frequency
MASLDmetabolic dysfunction-associated steatotic liver disease
MDSmultidimensional scaling
PCprincipal component
SLCsolute carrier
SNPsingle nucleotide polymorphism
TMAOtrimethylamine N-oxide

References

  1. Wallace, T.C.; Blusztajn, J.K.; Caudill, M.A.; Klatt, K.C.; Natker, E.; Zeisel, S.H.; Zelman, K.M. Choline: The underconsumed and underappreciated essential nutrient. Nutr. Today 2018, 53, 240–253. [Google Scholar] [CrossRef] [PubMed]
  2. Institute of Medicine. Dietary Reference Intakes for Thiamin, Riboflavin, Niacin, Vitamin B6, Folate, Vitamin B12, Pantothenic Acid, Biotin, and Choline. Available online: https://www.ncbi.nlm.nih.gov/books/NBK114310/ (accessed on 1 October 2024).
  3. Holm, P.I.; Ueland, P.M.; Kvalheim, G.; Lien, E.A. Determination of choline, betaine, and dimethylglycine in plasma by a high-throughput method based on normal-phase chromatography-tandem mass spectrometry. Clin. Chem. 2003, 49, 286–294. [Google Scholar] [CrossRef] [PubMed]
  4. de Zwart, F.J.; Slow, S.; Payne, R.J.; Lever, M.; George, P.M.; Gerrard, J.A.; Chambers, S.T. Glycine betaine and glycine betaine analogues in common foods. Food Chem. 2003, 83, 197–204. [Google Scholar] [CrossRef]
  5. Ganz, A.B.; Klatt, K.C.; Caudill, M.A. Common genetic variants alter metabolism and influence dietary choline requirements. Nutrients 2017, 9, 837. [Google Scholar] [CrossRef] [PubMed]
  6. Kenny, T.C.; Khan, A.; Son, Y.; Yue, L.; Heissel, S.; Sharma, A.; Pasolli, H.A.; Liu, Y.; Gamazon, E.R.; Alwaseem, H.; et al. Integrative genetic analysis identifies FLVCR1 as a plasma-membrane choline transporter in mammals. Cell Metab. 2023, 35, 1057–1071. [Google Scholar] [CrossRef] [PubMed]
  7. Louck, L.E.; Cara, K.C.; Klatt, K.; Wallace, T.C.; Chung, M. The relationship of circulating choline and choline-related metabolite levels with health outcomes: A scoping review of genome-wide association studies and Mendelian randomization studies. Adv. Nutr. 2024, 15, 100164. [Google Scholar] [CrossRef] [PubMed]
  8. Mills, J.L.; Carter, T.C.; Scott, J.M.; Troendle, J.F.; Gibney, E.R.; Shane, B.; Kirke, P.N.; Ueland, P.M.; Brody, L.C.; Molloy, A.M. Do high blood folate concentrations exacerbate metabolic abnormalities in people with low vitamin B-12 status? Am. J. Clin. Nutr. 2011, 94, 495–500. [Google Scholar] [CrossRef]
  9. National Center for Biotechnology Information. dbGaP (Database of Genotypes and Phenotypes): Accession phs000789.v1.p1. Available online: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000789.v1.p1 (accessed on 1 October 2024).
  10. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  11. Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; Abecasis, G.R. A global reference for human genetic variation. Nature 2015, 526, 68–74. [Google Scholar] [CrossRef] [PubMed]
  12. National Center for Biotechnology Information. dbSNP (Database of Single Nucleotide Polymorphisms). Available online: https://www.ncbi.nlm.nih.gov/snp/ (accessed on 1 October 2024).
  13. Boughton, A.P.; Welch, R.P.; Flickinger, M.; VandeHaar, P.; Taliun, D.; Abecasis, G.R.; Boehnke, M. LocusZoom.js: Interactive and embeddable visualization of genetic association study results. Bioinformatics 2021, 37, 3017–3018. [Google Scholar] [CrossRef]
  14. Genotype-Tissue Expression (GTEx) Project. GTEx Portal. Available online: https://gtexportal.org/home/ (accessed on 1 June 2024).
  15. Ganz, A.B.; Cohen, V.V.; Swersky, C.C.; Stover, J.; Vitiello, G.A.; Lovesky, J.; Chuang, J.C.; Shields, K.; Fomin, V.G.; Lopez, Y.S.; et al. Genetic variation in choline-metabolizing enzymes alters choline metabolism in young women consuming choline intakes meeting current recommendations. Int. J. Mol. Sci. 2017, 18, 252. [Google Scholar] [CrossRef] [PubMed]
  16. Khan, A.; Unlu, G.; Lin, P.; Liu, Y.; Kilic, E.; Kenny, T.C.; Birsoy, K.; Gamazon, E.R. Metabolic gene function discovery platform GeneMAP identifies SLC25A48 as necessary for mitochondrial choline import. Nat. Genet. 2024, 56, 1614–1623. [Google Scholar] [CrossRef] [PubMed]
  17. Verkerke, A.R.P.; Shi, X.; Li, M.; Higuchi, Y.; Yamamuro, T.; Katoh, D.; Nishida, H.; Auger, C.; Abe, I.; Gerszten, R.E.; et al. SLC25A48 controls mitochondrial choline import and metabolism. Cell Metab. 2024, 36, 2156–2166. [Google Scholar] [CrossRef] [PubMed]
  18. Randall, J.C.; Winkler, T.W.; Kutalik, Z.; Berndt, S.I.; Jackson, A.U.; Monda, K.L.; Kilpeläinen, T.O.; Esko, T.; Mägi, R.; Li, S.; et al. Sex-stratified genome-wide association studies including 270,000 individuals show sexual dimorphism in genetic loci for anthropometric traits. PLoS Genet. 2013, 9, e1003500. [Google Scholar] [CrossRef] [PubMed]
  19. Speliotes, E.K.; Yerges-Armstrong, L.M.; Wu, J.; Hernaez, R.; Kim, L.J.; Palmer, C.D.; Gudnason, V.; Eiriksdottir, G.; Garcia, M.E.; Launer, L.J.; et al. Genome-wide association analysis identifies variants associated with nonalcoholic fatty liver disease that have distinct effects on metabolic traits. PLoS Genet. 2011, 7, e1001324. [Google Scholar] [CrossRef] [PubMed]
  20. Di Costanzo, A.; Belardinilli, F.; Bailetti, D.; Sponziello, M.; D’Erasmo, L.; Polimeni, L.; Baratta, F.; Pastori, D.; Ceci, F.; Montali, A.; et al. Evaluation of polygenic determinants of non-alcoholic fatty liver disease (NAFLD) by a candidate genes resequencing strategy. Sci. Rep. 2018, 8, 3702. [Google Scholar] [CrossRef] [PubMed]
  21. Shungin, D.; Winkler, T.W.; Croteau-Chonka, D.C.; Ferreira, T.; Locke, A.E.; Mägi, R.; Strawbridge, R.J.; Pers, T.H.; Fischer, K.; Justice, A.E.; et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 2015, 518, 187–196. [Google Scholar] [CrossRef] [PubMed]
  22. Heid, I.M.; Jackson, A.U.; Randall, J.C.; Winkler, T.W.; Qi, L.; Steinthorsdottir, V.; Thorleifsson, G.; Zillikens, M.C.; Speliotes, E.K.; Mägi, R.; et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat. Genet. 2010, 42, 949–960. [Google Scholar] [CrossRef]
  23. Saliba-Gustafsson, P.; Justesen, J.M.; Ranta, A.; Sharma, D.; Bielczyk-Maczynska, E.; Li, J.; Najmi, L.A.; Apodaka, M.; Aspichueta, P.; Björck, H.M.; et al. A functional genomic framework to elucidate novel causal metabolic dysfunction-associated fatty liver disease genes. Hepatology 2024. published ahead of print. [Google Scholar] [CrossRef] [PubMed]
  24. Vohnoutka, R.B.; Kuppa, A.; Hegde, Y.; Chen, Y.; Pant, A.; Tohme, M.E.; Choi, E.K.; McCarty, S.M.; Bagchi, D.P.; Du, X.; et al. Knockout of murine Lyplal1 confers sex-specific protection against diet-induced obesity. J. Mol. Endocrinol. 2023, 70, e220131. [Google Scholar] [CrossRef]
Figure 1. Multidimensional scaling plot of the study cohort anchored to the 1000 Genomes reference panel. AFR = African ancestry; AMR = Admixed American ancestry; EUR = European ancestry; OWN = the study cohort.
Figure 1. Multidimensional scaling plot of the study cohort anchored to the 1000 Genomes reference panel. AFR = African ancestry; AMR = Admixed American ancestry; EUR = European ancestry; OWN = the study cohort.
Nutrients 17 02630 g001
Figure 2. A circular Manhattan plot visualizing all model 3 results (adjusted for age, sex, and PCs). Starting from the inner circle, the rings represent choline, betaine, DMG, and the choline-to-betaine and betaine-to-DMG ratios. The scales represent GWAS p-values, individual data points represent SNPs, and SNPs are aggregated by chromosome. Red circular lines mark the significance p value threshold.
Figure 2. A circular Manhattan plot visualizing all model 3 results (adjusted for age, sex, and PCs). Starting from the inner circle, the rings represent choline, betaine, DMG, and the choline-to-betaine and betaine-to-DMG ratios. The scales represent GWAS p-values, individual data points represent SNPs, and SNPs are aggregated by chromosome. Red circular lines mark the significance p value threshold.
Nutrients 17 02630 g002
Figure 3. LocusZoom plot of chromosome 5 from the choline GWAS (model 3), displaying association signals, linkage disequilibrium, and nearby genes. The purple diamond indicates the most significant SNP, and gray solid circles indicate SNPs not in NIH LDlink API database.
Figure 3. LocusZoom plot of chromosome 5 from the choline GWAS (model 3), displaying association signals, linkage disequilibrium, and nearby genes. The purple diamond indicates the most significant SNP, and gray solid circles indicate SNPs not in NIH LDlink API database.
Nutrients 17 02630 g003
Table 1. Participant characteristics.
Table 1. Participant characteristics.
VariableValue (n = 2220)
Age (y), mean ± SD (min, max)22.4 ± 1.69 (19.0, 28.0)
Sex, No. (%)
   Male918 (41.4)
   Female1302 (58.6)
Serum choline (µmol/L), mean ± SD (min, max)7.65 ± 1.81 (3.30, 19.7)
Table 2. GWAS of serum choline (model 3).
Table 2. GWAS of serum choline (model 3).
SNPLog (β)SEp-ValueGene
rs107793490.01410.003271.63 × 10−5LYPLAL1: intron variant
rs48465510.01650.003731.08 × 10−5LYPLAL1: intron variant
rs66719910.01820.003852.40 × 10−6LYPLAL1: intron variant
rs10205941−0.01400.003221.57 × 10−5None
rs6890039−0.02500.005618.96 × 10−6SLC25A48: intron variant
rs6596270−0.02500.005618.96 × 10−6SLC25A48: synonymous variant
rs4272127−0.01690.004154.79 × 10−5SLC25A48: intron variant
rs66011670.02280.005221.26 × 10−5None
rs226915−0.02290.005179.97 × 10−6LOC100507336: intron variant
rs22610330.01710.003992.00 × 10−5PRRC2A: intron variant
rs78121170.01700.004174.57 × 10−5NXPH1: intron variant
rs1050290−0.01620.003711.40 × 10−5MEOX2: 5′-UTR variant
LINC02587: 2KB upstream variant
rs740566−0.01670.003603.57 × 10−6LINC02587: 2KB upstream variant
MEOX2: 2KB upstream variant
rs12669299−0.01520.003562.03 × 10−5None
rs2155939−0.01890.004644.81 × 10−5NCALD: intron variant
rs108193410.02640.006202.11 × 10−5None
rs8828950.02510.006063.55 × 10−5None
rs7944388−0.05220.01263.82 × 10−5None
rs11787040.04550.01114.31 × 10−5PTPRQ: intron variant
LOC105369867: intron variant
rs12813355−0.01650.003882.21 × 10−5LOC105369867: intron variant
rs1495830.02470.005842.48 × 10−5ATXN8OS: intron variant
rs49048950.01670.003831.30 × 10−5LRFN5: intron variant
rs1227005−0.02070.004494.08 × 10−6PID1: intron variant
rs1362243−0.01610.003924.21 × 10−5PID1: intron variant
Model 3 adjusted for age, sex, and PCs.
Table 3. GWAS of serum betaine (model 3).
Table 3. GWAS of serum betaine (model 3).
SNPLog (β)SEp-ValueGene
rs10001408−0.03217370.007867154.48 × 10−5TMEM150C: intron variant
rs10016201−0.03360970.007969712.57 × 10−5TMEM150C: intron variant
rs10208223−0.02484630.006031893.94 × 10−5LOC730100: intron variant
rs1051104−0.03003260.006046927.34 × 10−7SLC6A12: intron variant
rs10944−0.02503230.005440364.44 × 10−6BHMT2: 3′-UTR variant
rs11062011−0.03314390.005613434.10 × 10−9SLC6A12
rs12208357−0.07953630.018395761.60 × 10−5SLC22A1: missense variant
LOC124901452: 2KB upstream variant
rs12566456−0.08917630.021742214.25 × 10−5KCNH1: intron variant
rs128270.082317390.020035064.13 × 10−5IQSEC3-AS1: intron variant
IQSEC3: 3′-UTR variant
rs13088151−0.03400560.008063132.57 × 10−5CNTN4: intron variant
rs1411120.032333180.007721662.93 × 10−5HLA-DOA
HLA-DPA1
rs1644004−0.02486090.005513936.87 × 10−6DMGDH: intron variant
rs16965026−0.05071240.012047632.66 × 10−5LOC105370772: intron variant
rs17692962−0.03164860.005731033.74 × 10−8IQSEC3: intron variant
IQSEC3-AS1: intron variant
rs183355−0.02501720.005686671.14 × 10−5SLC6A12: 3′-UTR variant
IQSEC3-AS1: 2KB upstream variant
rs185077−0.02381110.00543651.24 × 10−5DMGDH: intron variant
rs1902623−0.05343410.012374391.64 × 10−5RPS1
rs1903530−0.03323670.007700021.66 × 10−5RNA5SP173
rs1986680−0.03940490.009612034.29 × 10−5SYT16
rs20633450.029461260.006983752.56 × 10−5SLC22A3: intron variant
rs2075228−0.03053740.006299581.34 × 10−6SLC6A12: intron variant
rs2162440.021987860.00520522.50 × 10−5SLC6A12: intron variant
rs21749140.029277080.007023093.18 × 10−5SLC22A3: intron variant
rs22843290.024294310.005443888.50 × 10−6SLC6A12: intron variant
rs22843300.024607840.005301293.66 × 10−6SLC6A12: intron variant
rs2291922−0.04359990.006805521.82 × 10−10IQSEC3-AS1: noncoding transcript variant
IQSEC3: intron variant
rs233162−0.02699730.006630084.83 × 10−5ZNF385D: intron variant
rs24575750.029829370.006984432.03 × 10−5SLC22A3: intron variant
rs2568131−0.05900110.014377364.21 × 10−5NAV2: intron variant
LOC124902644: 2KB upstream variant
LOC124902644: 2KB upstream variant
rs26618340.029394910.006999232.78 × 10−5SLC22A3: intron variant
rs2702450−0.03159480.007720034.42 × 10−5RNA5SP173
rs28349320.028790380.007053784.63 × 10−5LOC100506403: intron variant
rs28584580.032209390.007726123.18 × 10−5HLA-DOA
HLA-DPA1
rs2876711−0.02700620.00576242.95 × 10−6LOC105370265
KCTD12
rs2937110.023572650.005759144.41 × 10−5BPIFB9P
rs31289240.027311470.006365481.86 × 10−5HLA-DPA2
COL11A2P1
rs31350210.034089740.008173013.15 × 10−5HLA-DPA1: intron variant
HLA-DPB1: intron variant
rs3775510.028714050.006944053.68 × 10−5SLC22A3: intron variant
rs3795690.032323360.007728323.00 × 10−5HLA-DOA
HLA-DPA1
rs3804680.032209390.007726123.18 × 10−5HLA-DOA
HLA-DPA1
rs3821980.032334880.007727722.97 × 10−5HLA-DOA
HLA-DPA1
rs3944870.028992070.006943063.09 × 10−5SLC22A3: intron variant
rs4120650.032223530.007728463.17 × 10−5HLA-DOA
HLA-DPA1
rs4265230.032209390.007726123.18 × 10−5HLA-DOA
HLA-DPA1
rs4398520.032209390.007726123.18 × 10−5HLA-DOA
HLA-DPA1
rs4448210.032209390.007726123.18 × 10−5HLA-DOA
HLA-DPA1
rs495360−0.02774660.005275051.58 × 10−7SLC6A12
rs49741570.024054870.005376858.08 × 10−6ERC2: intron variant
rs505387−0.03002130.006139141.08 × 10−6SLC6A12: 3′-UTR variant
rs510714−0.02578090.005958241.58 × 10−5SLC6A12: intron variant
rs542736−0.02915880.00583916.39 × 10−7SLC6A12: intron variant
rs557881−0.02812080.005177676.22 × 10−8SLC6A12: missense variant
rs562487−0.02475410.005355114.01 × 10−6BHMT: 2KB upstream variant
rs5699190.027609910.00679144.96 × 10−5SLC22A3
rs570680−0.03381190.005596111.79 × 10−9SLC6A13: intron variant
rs579292−0.02265180.005306372.05 × 10−5LOC105371956: intron variant
rs586199−0.02646210.005484111.49 × 10−6BHMT
rs6145640.026219310.006249982.84 × 10−5LOC105378088: 2KB upstream variant
rs631151−0.02531540.005520724.78 × 10−6DMGDH: intron variant
rs633159−0.02893330.005901651.02 × 10−6SLC6A12: 3′-UTR variant
rs64577090.031007790.007392512.84 × 10−5HLA-DPA1
rs6633100.032225850.00772853.17 × 10−5HLA-DOA
HLA-DPA1
rs70155220.049626820.010452992.19 × 10−6CARSIP2
rs7171449−0.05028940.011031035.43 × 10−6LOC105370772: intron variant
rs7599255−0.03013090.00729833.79 × 10−5ALK: intron variant
rs76138180.021249710.005226844.96 × 10−5ERC2: intron variant
rs77821950.024065080.00564712.12 × 10−5MAGI2: intron variant
rs7960096−0.0256640.005751398.52 × 10−6IQSEC3-AS1: intron variant
IQSEC3: 3′-UTR variant
rs7971499−0.02232230.005131521.42 × 10−5SLC6A12: 3′-UTR variant
rs8093227−0.0314310.007198591.32 × 10−5LOC105371956: intron variant
rs92937610.023683490.005479981.62 × 10−5ARSB
rs98734800.025833220.006099772.38 × 10−5ERC2: intron variant
Model 3 adjusted for age, sex, and PCs.
Table 4. Serum DMG model results adjusted for age, sex, and PCs.
Table 4. Serum DMG model results adjusted for age, sex, and PCs.
SNPLog (β)SEp-ValueGene
rs10043368−0.01633390.0040184.97 × 10−5ZNF366: intron variant
rs10229540.022309210.005414743.93 × 10−5MYO16: intron variant
rs10514154−0.03626160.008246161.15 × 10−5DMGDH: intron variant
rs10557490.021277310.00489981.47 × 10−5MCPH1-AS1: intron variant
MCPH1: 3′-UTR variant
rs10570910.022502770.004909494.83 × 10−6MCPH1: missense variant
MCPH1-AS1: intron variant
rs11252395−0.02386390.005171094.16 × 10−6LOC105376368
rs11996743−0.01578110.003858364.47 × 10−5ZFPM2-AS1: intron variant
rs12211424−0.03811970.008519448.05 × 10−6ZNF184: intron variant
rs1385961−0.03685560.008185617.07 × 10−6RNU6-214P
rs1385962−0.03602010.008138161.01 × 10−5RNU6-214P
rs1422096−0.0264880.006286292.61 × 10−5LOC105379107: intron variant
rs16876302−0.02801520.005871751.95 × 10−6DMGDH: intron variant
rs170543970.042192310.010325754.54 × 10−5ITK: 3′-UTR variant
rs17082203−0.03431620.008144042.61 × 10−5RNU6-214P
rs17082229−0.03449520.008142712.37 × 10−5RNU6-214P
rs17155846−0.02746970.006285381.30 × 10−5LOC105379107: intron variant
rs175511200.036252130.007993456.06 × 10−6STXBP5-AS1: intron variant
rs1805072−0.03734940.008244376.21 × 10−6DMGDH: synonymous variant
rs1805074−0.02781760.00587382.32 × 10−6DMGDH: missense variant
rs1872624−0.01585550.003838313.75 × 10−5B4GALT7: intron variant
rs19980840.017726140.004114221.72 × 10−5LOC100505664: intron variant
rs2034900−0.02811370.005873311.81 × 10−6DMGDH: intron variant
rs24313320.031436120.005675213.40 × 10−8DMGDH: intron variant
rs24331490.021812020.004908019.27 × 10−6MCPH1-AS1: intron variant
MCPH1: 3′-UTR variant
rs2505130.026764840.006186791.59 × 10−5DMGDH: intron variant
rs283260.031986950.007336471.36 × 10−5DMGDH: intron variant
rs37585620.020194450.004881163.65 × 10−5PRF1: intron variant
rs3792849−0.01783980.004274343.11 × 10−5B4GALT7: intron variant
rs38922450.042038650.010329464.87 × 10−5ITK: 3′-UTR variant
rs4365962−0.03702520.008189956.49 × 10−6RNU6-214P
rs44210870.020659350.004513124.97 × 10−6SNP not documented in dbGaP
rs4458584−0.01611640.003846082.90 × 10−5B4GALT7
rs4522−0.01756080.003950749.23 × 10−6HSBP1: 3′-UTR variant
rs4685654−0.02952620.006512266.10 × 10−6LRRN1
rs47504580.035750540.008261261.58 × 10−5FRMD4A: intron variant
rs4891832−0.02236090.005184451.68 × 10−5LOC105372180
rs5065000.021360810.004609623.80 × 10−6BHMT: intron variant
LOC124901012: intron variant
rs5858000.025957430.004988422.14 × 10−7BHMT: 3′-UTR variant
rs6044680−0.01792420.00415381.67 × 10−5RNU1-131P
rs6313050.042467290.007169963.66 × 10−9BHMT
rs6424310.0398440.006404115.88 × 10−10DMGDH: intron variant
BHMT2: 2KB upstream variant
rs6429340.0398440.006404115.88 × 10−10DMGDH: intron variant
BHMT2: 2KB upstream variant
rs6774615−0.03020610.00690941.29 × 10−5SLC6A6: intron variant
rs67962540.020491160.00501144.49 × 10−5LOC105376959
rs6942900.020724960.004613627.42 × 10−6BHMT: intron variant
rs72175600.024835430.005284012.76 × 10−6COPS3
rs72205770.020807980.00496882.93 × 10−5COPS3
rs731124−0.02750950.0058562.79 × 10−6DMGDH: noncoding transcript variant
rs7934785−0.01884170.004127375.27 × 10−6CNTN5: intron variant
rs8050249−0.01629090.004002414.86 × 10−5HS3ST4: intron variant
rs81347750.02727070.006002885.85 × 10−6LINC00310
rs93013070.015299680.003727734.20 × 10−5MYO16: intron variant
rs933684−0.02792840.005872222.10 × 10−6DMGDH: intron variant
rs9489944−0.03014320.007377814.55 × 10−5RNU6-214P
Model 3 adjusted for age, sex, and PCs.
Table 5. GWAS results for SNPs classically associated with choline and folate metabolism (model 3).
Table 5. GWAS results for SNPs classically associated with choline and folate metabolism (model 3).
GeneSNPLog (β)SEp-Value
MTHFRrs1801133−0.005870.003970.139
rs18011310.001720.004150.679
MTHFD1rs2236225NANANA
RFCrs1051266−0.006830.003440.0469
MTRrs18050870.001920.005600.731
MTRRrs1801394−0.002500.003380.460
CHKArs107919570.003650.003400.284
CHDHrs9001NANANA
rs12676NANANA
BHMTrs37338900.005140.004690.274
PEMTrs12325817NANANA
rs4646343NANANA
rs2266782−0.001020.003510.772
SLC44A1rs7873937NANANA
rs31999660.001820.009570.849
Abbreviation: NA, not available. Model 3 adjusted for age, sex, and PCs.
Table 6. GWAS of serum choline-to-betaine ratio (model 3).
Table 6. GWAS of serum choline-to-betaine ratio (model 3).
SNPβSEp-ValueGene
rs66815780.009910930.002369332.99 × 10−5CACHD1: intron variant
rs21647200.014673050.003302479.31 × 10−6LDAH: intron variant
rs130263090.006746950.001468674.60 × 10−6THADA: intron variant
rs31362470.011038630.002444636.65 × 10−6MSH6: intron variant
rs18009320.010746490.002436541.08 × 10−5MSH6: synonymous variant
rs77112530.01654640.003543813.21 × 10−6BDP1: intron variant
rs77079560.011253550.002712873.48 × 10−5No documented gene consequence
rs77060730.019206560.004679064.20 × 10−5LINC01950: intron variant
rs22880680.015674170.003765423.27 × 10−5CYFIP2: intron variant
rs173074780.015235640.00366023.27 × 10−5KIAA0319: intron variant
rs132151600.016267860.003884852.93 × 10−5GABRR1: intron variant
rs122083570.022627070.005095019.40 × 10−6SLC22A1: missense variant
LOC124901452: 2KB upstream variant
rs7565130.00670780.00150188.35 × 10−6No documented gene consequence
rs173642510.013943740.003316152.72 × 10−5No documented gene consequence
rs108187540.012779690.002956721.61 × 10−5RC3H2: 3′-UTR variant
rs112229600.02699770.006231051.54 × 10−5NTM: intron variant
rs22919220.011114520.001889714.69 × 10−9IQSEC3-AS1: noncoding transcript variant
IQSEC3: intron variant
rs176929620.007748250.001591491.20 × 10−6IQSEC3: intron variant
IQSEC3-AS1: intron variant
rs79600960.006917080.00159511.51 × 10−5IQSEC3-AS1: intron variant
IQSEC3: 3′-UTR variant
rs1833550.006982780.001577331.00 × 10−5SLC6A12: 3′-UTR variant
IQSEC3-AS1: 2KB upstream variant
rs6331590.008107860.001636177.77 × 10−7SLC6A12: 3′-UTR variant
rs5053870.007954510.001703023.18 × 10−6SLC6A12: 3′-UTR variant
rs5597590.006486290.001561543.40 × 10−5SLC6A12: 3′-UTR variant
rs79714990.006609710.00142233.56 × 10−6SLC6A12: 3′-UTR variant
rs10511040.007725040.001677964.39 × 10−6SLC6A12: intron variant
rs5427360.007682980.0016212.28 × 10−6SLC6A12: intron variant
rs20752280.007824150.00174858.04 × 10−6SLC6A12: intron variant
rs2284329−0.00650390.00151071.74 × 10−5SLC6A12: intron variant
rs2284330−0.00654950.00147048.84 × 10−6SLC6A12: intron variant
rs5578810.006689190.001438663.52 × 10−6SLC6A12: missense variant
rs4953600.00718130.001463649.96 × 10−7SLC6A12
rs110620110.008371560.001560268.93 × 10−8SLC6A12
rs5706800.008587840.001551163.46 × 10−8SLC6A13: intron variant
rs22201680.007896970.001927384.33 × 10−5ITPR2: intron variant
rs111743350.008893760.001892912.78 × 10−6TAFA2: intron variant
rs111743420.009024360.001956874.23 × 10−6TAFA2: intron variant
LOC124902950: intron variant
rs125787780.019923620.00477823.17 × 10−5ANO4: intron variant
rs169341320.006917790.00158421.32 × 10−5ACACB: intron variant
rs79765520.007449490.001635175.51 × 10−6ACACB: intron variant
rs169479780.020286510.004533058.02 × 10−6KSR2: intron variant
rs16816880.005841760.001428754.49 × 10−5TMEM132D: intron variant
rs170781720.015059290.00356162.45 × 10−5HMGA1P6
rs73352000.014717460.003530613.18 × 10−5HMGA1P6
rs19841630.009502160.002219041.93 × 10−5HMGA1P6
rs28767110.007247160.001598266.09 × 10−6LOC107984587
rs101351230.011754980.002891554.97 × 10−5URK1
rs71714490.012522070.003061174.46 × 10−5LOC105370772: intron variant
rs19026230.014174810.003432313.77 × 10−5RPS15P8
rs29296440.020906220.005037463.45 × 10−5LOC102724253
rs29296470.024165420.005412388.42 × 10−6LOC102724253
rs11540961−0.02143290.00491661.37 × 10−5NUBP2: missense variant
rs47803330.010573560.002475522.03 × 10−5CIITA: intron variant
rs169738220.016131070.003807862.37 × 10−5RBBP6: intron variant
rs42431340.013814040.003353063.93 × 10−5MON1B
SYCE1L
rs45692980.013723230.003353394.42 × 10−5MON1B
SYCE1L
rs80932270.009616460.001995421.54 × 10−6LOC105371956: intron variant
rs1893459−0.0071080.001662181.98 × 10−5NETO1: intron variant
rs129593920.006263310.001497332.99 × 10−5NETO1: intron variant
rs121509650.015036550.003382579.21 × 10−6CACNG8: 2KB upstream variant
rs13013933−0.0227550.005496933.61 × 10−5GPC1: 3′-UTR variant
rs130881510.009454970.002235712.44 × 10−5CNTN4: intron variant
rs98831030.007392010.001792333.86 × 10−5VEPH1: intron variant
LOC101928236: intron variant
rs68402050.009878810.002420794.65 × 10−5ATP10D: intron variant
rs168516810.009970560.002412523.72 × 10−5ATP10D: missense variant
rs119354230.01279630.003144134.87 × 10−5TRMT10A: intron variant
Model 3 adjusted for age, sex, and PCs.
Table 7. GWAS of serum betaine-to-DMG ratio (model 3) PCs.
Table 7. GWAS of serum betaine-to-DMG ratio (model 3) PCs.
SNPLog (β)SEp-ValueGene
rs6425992−0.23384710.053033531.09 × 10−5FTLP18
rs12772070.349102750.083972413.34 × 10−5AKNAD1: missense variant
rs75192170.325521110.077112052.53 × 10−5LOC343508
rs76037440.679319960.16474343.87 × 10−5LINC01250: intron variant
rs7228890.818807780.173930262.66 × 10−6RNA5SP94
rs18419910.298154760.051411087.62 × 10−9ARSB: intron variant
rs17175700.296043860.051259458.77 × 10−9ARSB: intron variant
rs92937610.298008740.050532374.27 × 10−9ARSB: intron variant
rs185077−0.26005910.049997532.16 × 10−7DMGDH: intron variant
rs248383−0.25867720.05108654.46 × 10−7DMGDH: intron variant
rs248385−0.26480060.051266392.62 × 10−7DMGDH: synonymous variant
rs631151−0.29046790.050928971.33 × 10−8DMGDH: intron variant
rs1644004−0.29207370.050885381.08 × 10−8DMGDH: intron variant
rs642431−0.3921910.083158442.55 × 10−6DMGDH: intron variant; BHMT2: 2KB upstream variant
rs642934−0.3921910.083158442.55 × 10−6DMGDH: intron variant; BHMT2: 2KB upstream variant
rs10944−0.29701960.050192583.78 × 10−9BHMT2: 3′-UTR variant
rs586199−0.304390.050607272.11 × 10−9BHMY
rs631305−0.41533240.093076358.52 × 10−6BHMY
rs562487−0.29299060.049406693.51 × 10−9BHMT: 2KB upstream variant
rs506500−0.35814450.05943611.97 × 10−9BHMT: intron variant
LOC124901012: intron variant
rs694290−0.35109830.059492194.16 × 10−9BHMT: intron variant
rs4421087−0.33349880.058166671.12 × 10−8SNP/gene not on file
rs585800−0.3332030.064629242.76 × 10−7BHMT: 3′-UTR variant
rs19157060.237322810.05310338.26 × 10−6BHMT
JMY
rs23645940.256715640.056160745.12 × 10−6BHMT
rs131583090.246629040.053412174.11 × 10−6JMY
BHMT
rs21211070.257350150.056150164.84 × 10−6JMY
BHMT
rs121892480.242163940.054127458.07 × 10−6LOC124901011: intron variant
LOC124900205: 500B downstream variant
rs100788150.249992070.05545296.88 × 10−6LOC124901011: intron variant
rs25913920.224647650.054297813.65 × 10−5JMY: intron variant
rs25913870.230577660.055675053.58 × 10−5JMY: intron variant
rs131825120.226354360.05342412.36 × 10−5JMY: missense variant
rs77119390.234097430.055703612.75 × 10−5JMY: intron variant
rs100748820.234121970.055716082.75 × 10−5JMY: intron variant
rs10623260.228087710.054382962.85 × 10−5JMY: 3′-UTR variant
LOC102724530: intron variant
rs27504730.339492930.07710351.12 × 10−5LOC107986623: intron variant
rs22973740.263718720.058113835.99 × 10−6SLC22A1: intron variant
rs119824320.377337780.08963632.66 × 10−5No documented gene consequence
rs763317−0.19915430.048909664.83 × 10−5EGFR: intron variant
rs125426090.639695670.15284592.96 × 10−5LOC100419761
rs69866720.480124650.104501384.59 × 10−6KIF13B: missense variant
rs69819560.480956130.104488754.40 × 10−6KIF13B: intron variant
rs2461063−0.24201740.056676952.04 × 10−5LOC107986893
rs1550734−0.24220150.054087957.92 × 10−6LOC107986893
rs7040466−0.6855110.167094384.24 × 10−5JKAMPP1
rs78739410.479656380.113244242.37 × 10−5JKAMPP1
rs169372050.72001780.170951812.64 × 10−5NAMPTP1
rs5869100.66293880.127287222.08 × 10−7PAMR1
FJX1
rs6594010.454424970.103931251.29 × 10−5PAMR1
FJX1
rs107681540.459098960.103955361.05 × 10−5PAMR1
FJX1
rs2291922−0.32517660.063190422.90 × 10−7IQSEC3-AS1: noncoding transcript variant
IQSEC3: intron variant
rs17692962−0.24726140.053145653.47 × 10−6IQSEC3: intron variant
IQSEC3-AS1: intron variant
rs633159−0.22391660.054708794.41 × 10−5SLC6A12: 3′-UTR variant
rs1051104−0.24495880.056033461.29 × 10−5SLC6A12: intron variant
rs542736−0.23306850.054133211.74 × 10−5SLC6A12: intron variant
rs2075228−0.26022250.058401378.78 × 10−6SLC6A12: intron variant
rs2162440.201645840.048184032.97 × 10−5SLC6A12: intron variant
rs557881−0.27400040.047855951.17 × 10−8SLC6A12: missense variant
rs495360−0.27367330.048777532.27 × 10−8SLC6A12
rs11062011−0.30623730.051913574.23 × 10−9SLC6A13
rs570680−0.29733860.051911321.16 × 10−8SLC6A13: intron variant
rs901510−0.19920710.048417484.03 × 10−5LOC105370224
rs71987240.580035130.136404242.20 × 10−5ZC3H18: intron variant
rs347909080.299659410.072835524.03 × 10−5TNFSF12: 2KB upstream variant
TNFSF12-TNFSF13: 2KB upstream variant
rs45115930.248312630.058967522.65 × 10−5TNFSF12: intron variant
TNFSF12-TNFSF13: intron variant
rs42270.282881830.068129763.42 × 10−5MPDU1: Noncoding transcript variant
SOX15: 500B downstream variant
rs80734980.234986410.053180781.04 × 10−5ATP1B2
TP53
rs75052060.207901430.050215273.60 × 10−5LOC107985171
rs12479693−0.20262920.049443854.32 × 10−5PTPRT: intron variant
LOC105372623: intron variant
rs72682020.506671270.119273522.25 × 10−5BCAS4
rs8709080.661925850.146970847.03 × 10−6RNA5SP117
rs100489700.760153860.167848916.25 × 10−6TAMM41: intron variant
rs15290420.202938120.049291113.98 × 10−5GPR15
rs901884−0.44976570.101365719.57 × 10−6NLGN1: intron variant
rs119408050.201490320.047897392.70 × 10−5LOC107986306: intron variant
Model 3 adjusted for age, sex, and PCs.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Louck, L.E.; Klatt, K.C.; Wallace, T.C.; Ma, J.; Chung, M. A Genome-Wide Association Study of Circulating Serum Choline, Betaine, Dimethylglycine, and Their Ratios. Nutrients 2025, 17, 2630. https://doi.org/10.3390/nu17162630

AMA Style

Louck LE, Klatt KC, Wallace TC, Ma J, Chung M. A Genome-Wide Association Study of Circulating Serum Choline, Betaine, Dimethylglycine, and Their Ratios. Nutrients. 2025; 17(16):2630. https://doi.org/10.3390/nu17162630

Chicago/Turabian Style

Louck, Lauren E., Kevin C. Klatt, Taylor C. Wallace, Jiantao Ma, and Mei Chung. 2025. "A Genome-Wide Association Study of Circulating Serum Choline, Betaine, Dimethylglycine, and Their Ratios" Nutrients 17, no. 16: 2630. https://doi.org/10.3390/nu17162630

APA Style

Louck, L. E., Klatt, K. C., Wallace, T. C., Ma, J., & Chung, M. (2025). A Genome-Wide Association Study of Circulating Serum Choline, Betaine, Dimethylglycine, and Their Ratios. Nutrients, 17(16), 2630. https://doi.org/10.3390/nu17162630

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop