Article

Identification and Reproducibility of Urinary Metabolomic Biomarkers of Habitual Food Intake in a Cross-Sectional Analysis of the Cancer Prevention Study-3 Diet Assessment Sub-Study

by Ying Wang 1,*, Rebecca A. Hodge 1, Victoria L. Stevens 1, Terryl J. Hartman 2 and Marjorie L. McCullough 1
1 Department of Population Science, American Cancer Society, Atlanta, GA 30303, USA
2 Department of Epidemiology, Rollins School of Public Health, Winship Cancer Institute, Emory University, Atlanta, GA 30322, USA
* Author to whom correspondence should be addressed.
Metabolites 2021, 11(4), 248; https://doi.org/10.3390/metabo11040248
Submission received: 19 March 2021 / Revised: 6 April 2021 / Accepted: 14 April 2021 / Published: 17 April 2021
(This article belongs to the Special Issue Nutritional Metabolomics)

Abstract

Previous cross-sectional metabolomics studies have identified many potential dietary biomarkers, mostly in blood. Few studies have examined urine samples, although urine is preferred for dietary biomarker discovery. Furthermore, little is known regarding the reproducibility of urinary metabolomic biomarkers over time. We aimed to identify urinary metabolomic biomarkers of diet and assess their reproducibility over time. We conducted a metabolomics analysis among 648 racially/ethnically diverse men and women in the Diet Assessment Sub-study of the Cancer Prevention Study-3 cohort to examine the correlation between >100 food groups/items [101 by a food frequency questionnaire (FFQ), and 105 by repeated 24 h diet recalls (24HRs)] and 1391 metabolites measured in 24 h urine sample replicates, six months apart. Diet–metabolite associations were examined by Pearson’s partial correlation analysis. Biomarkers were evaluated for prediction accuracy, assessed using the area under the curve (AUC) calculated from the receiver operating characteristic curve, and for reproducibility, assessed using intraclass correlation coefficients (ICCs). A total of 1708 diet–metabolite associations were identified after Bonferroni correction for multiple comparisons and restricting correlation coefficients to >0.2 or <−0.2 (1570 associations using the FFQ and 933 using 24HRs); 513 unique metabolites were correlated with 79 food groups/items. The median ICC of the 513 putative biomarkers was 0.53 (interquartile range 0.42–0.62). In this study, with comprehensive dietary data and repeated 24 h urinary metabolic profiles, we identified a large number of diet–metabolite correlations and replicated many found in previous studies. Our findings reveal the promise of urine samples for dietary biomarker discovery in a large cohort study and provide important information on biomarker reproducibility, which could facilitate their utilization in future clinical and epidemiological studies.

1. Introduction

Nutritional epidemiological studies have significantly advanced understanding of the relationships between diet and chronic diseases and have led to dietary guidelines for disease prevention in recent decades [1,2,3]. However, the field is still largely impeded by inconsistent findings from many studies. Most studies rely on self-reported dietary data, such as those collected from food frequency questionnaires (FFQs), which involve systematic and random measurement errors that could result in underestimated risk estimates [4]. Robust and reliable objective dietary biomarkers are important to estimate dietary intake or calibrate self-reported dietary data, thus holding promise to advancing research on diet and cancer and other health outcomes; however, such dietary biomarkers are limited to a few nutrients and do not exist for most foods and dietary patterns.
Metabolomics has shown great promise for identifying novel dietary biomarkers from human blood and urine samples in both cross-sectional and feeding studies [5]. Several large metabolomics analyses conducted in cohort studies employing a cross-sectional study design have identified hundreds of potential biomarkers of habitual food intakes [6,7,8,9,10,11,12,13] or dietary patterns [14,15]. Our previous metabolomics analyses of blood samples from the Cancer Prevention Study-II (CPS-II) Nutrition Cohort [6] and CPS-3 [13] identified more than 200 putative food-related metabolic markers, many of which replicated findings from other population and feeding studies. The most common metabolite class was xenobiotics, which were highly correlated with plant foods. Amino acids, lipids and their metabolic end products were commonly associated with animal products such as meat, poultry, fish and dairy. Few large cohort studies have examined archived urine samples for dietary biomarkers, although urine is considered a preferred biospecimen for dietary biomarker discovery [16]. Compared with blood, urine has lower levels of protein, which can interfere with biomarker measurements, and offers better coverage of dietary biomarkers, as more diet–metabolite associations were found in urine than in serum [11]. Furthermore, it is important to assess the reproducibility of biomarkers to determine their future use in epidemiological and clinical studies where limited samples may be available [17]. However, little is known about urinary metabolomic biomarker reproducibility over time [18].
We previously published results on biomarkers of food intake from fasting plasma samples and their reproducibility over six months from the CPS-3 Diet Assessment Sub-study (DAS), a 12-month diet validation study [13]. In the present study, we extended our previous research to urine by utilizing the resources from the CPS-3 DAS, including the post-study FFQ, repeated 24 h diet recalls (24HRs) and two 24 h urine samples collected six months apart. To fill this gap in the literature, we aimed to (1) identify urinary metabolites associated with individual food groups/items using untargeted metabolomics and (2) assess the reproducibility of the identified metabolites over six months.

2. Results

2.1. Participant Characteristics

Characteristics of the study participants of the CPS-3 DAS are shown in Table 1. Among the 648 participants included in the urinary metabolomics analysis, 60.5% were white, 24.4% were black, 15.1% were Hispanic. The majority (65.0%) were female. The mean age was 52.2 ± 9.4 years.

2.2. 24 h Urinary Metabolites Correlated with Habitual Dietary Intake Assessed by Post-FFQ and 24HRs

We identified a total of 1708 food–metabolite associations (Supplemental Table S1), with 1570 associations using the post-FFQ (p < 3.56 × 10⁻⁷ and |r| > 0.2, Supplemental Table S2) and 933 associations using the 24HRs (p < 3.42 × 10⁻⁷ and |r| > 0.2, Supplemental Table S3). A total of 513 unique urinary metabolites were associated with 79 food groups/items assessed using either the FFQ or 24HRs, as one metabolite could be correlated with multiple food groups/items and vice versa. The majority of the diet-related metabolites were xenobiotics (n = 152; 29.6%), amino acids (n = 71; 13.8%), or unknown (n = 178; 34.7%); the rest were lipids (n = 28; 5.5%), cofactors and vitamins (n = 14; 2.7%), peptides (n = 8; 1.6%), carbohydrates (n = 15; 2.9%), nucleotides (n = 14; 2.7%) and partially characterized molecules (n = 27; 5.3%).
The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was calculated to assess how well the diet-related metabolites discriminate top from bottom quartiles of dietary intake. The AUCs were generally higher when dietary intake was assessed using the FFQ than using 24HRs. The top 3 most predictive metabolites for each of the 79 food groups/items are shown in Table 2 (according to the post-FFQ assessment; if fewer than 3 metabolites were identified, the top metabolites according to 24HRs are presented). The most predictive metabolite usually also had the highest |r| with a food group/item.

2.2.1. Fruits

We identified 119 food–metabolite associations for 17 fruit groups/items estimated either from the FFQ or 24HRs, including 1 for grapes, 1 for prunes, 6 for bananas, 19 for avocado, 2 for apples or pears, 6 for apples (24HRs only), 23 for total citrus fruits and juices, 16 for oranges, 15 for orange juice, 2 for grapefruit, 1 for watermelon, 1 for cantaloupe, 10 for berries, 3 for strawberries, 11 for blueberries, 1 for peaches and plums (Supplemental Table S1); 84 associations were observed using the FFQ (Supplemental Table S2) and 82 using the 24HRs (Supplemental Table S3). The AUCs ranged from 0.6 for vanillactate predicting prune intake assessed using the 24HRs to 0.94 for stachydrine predicting total citrus fruit and juice intake assessed by the post-FFQ.

2.2.2. Vegetables

There are 150 associations for 15 vegetable groups or individual vegetables (119 associations using the FFQ, and 91 associations using the 24HRs), including 1 metabolite for ketchup and salsa, 9 for beans, 58 for all soy products, 8 for fermented soy products, 17 for soy milk, 6 for soy protein powder, 10 for cruciferous vegetables, 2 for leafy greens, 1 for iceberg or head lettuce, 2 for peppers, 7 for mushrooms, 5 for allium vegetables, 5 for onions, 17 for garlic, and 2 for garlic powder. The AUCs ranged from 0.58 for 4-acetylphenyl sulfate predicting fermented soy products assessed using the 24HRs to 0.91 for 4 metabolites predicting total bean intake assessed using the FFQ.

2.2.3. Grains

We identified 35 grain–metabolite associations for total whole grains (n = 10), whole-grain bread (n = 2), whole-grain cereals (n = 7), corn products (n = 8), popcorn (n = 2), other whole grains (n = 1) and refined grains (n = 5); 33 associations were identified using the FFQ, and 14 using 24HRs. The AUCs ranged from 0.74 for 3,5-dihydroxybenzoic acid predicting whole-grain bread intake assessed by 24HRs to 0.91 for 2,6-dihydroxybenzoic acid predicting total whole-grain intake assessed by the FFQ.

2.2.4. Proteins

We identified 404 diet–metabolite associations for 10 protein food groups/items (107 for red meat, 126 for processed meat, 119 for poultry, 7 for total fish, 6 for dark fish, 3 for shellfish, 10 for total nuts, 9 for peanuts, 9 for other nuts, and 8 for seeds); 376 associations were identified using the FFQ and 158 using 24HRs. Most metabolites correlated with red, processed meat and poultry had negative correlations with intake. The AUCs ranged from 0.69 for 3-carboxy-4-methyl-5-propyl-2-furanpropanoate (CMPF) and X-23587 predicting shellfish intake using the FFQ to 0.9 for two metabolites (tryptophan betaine and X-24412) predicting total nut intake using the FFQ.

2.2.5. Dairy/Dairy Alternatives

There were 98 diet–metabolite associations for 4 dairy/dairy alternative groups (20 for milk, 4 for almond milk or rice milk, 4 for total cheese, and 70 for cream); 93 associations were found using the FFQ, and 10 using the 24HRs. The AUCs ranged from 0.64 for X-13846 predicting almond milk or rice milk intake from 24HRs to 0.88 for heptenedioate (C7:1-DC) predicting total cheese intake from the FFQ.

2.2.6. Fats and Oils

Twenty-two associations were identified for 3 fats and oils (17 for creamy salad dressing, 1 for oil and vinegar salad dressing, and 4 for olive oil); 21 were found using the FFQ and only 1 found by 24HRs. The AUCs ranged from 0.70 for 2,6-dimethylphenol sulfate predicting cream to 0.80 for N-methyltaurine predicting olive oil (FFQ) and oil and vinegar salad dressing (24HRs).

2.2.7. Alcohol

Using either FFQ or 24HRs, we identified 443 associations for alcohol, including 136 for total alcohol, 53 for beer, 120 for wine, 104 for red wine, 24 for white wine, and 6 for liquor. 421 associations were found using the FFQ, and 243 associations were found using the 24HRs. The AUCs ranged from 0.66 for several metabolites as biomarkers of white wine intake to 0.99 for ethyl glucuronide as the biomarker of total alcohol. Ethyl glucuronide was also the most predictive metabolite for all subtypes of alcohol (beer, red wine, white wine and liquor).

2.2.8. Beverages

There were 359 associations for 9 beverage groups, including 145 for total coffee, 142 for caffeinated coffee, 5 for decaffeinated coffee, 24 for total tea, 8 for green tea, 13 for black tea, 6 for herbal tea, 10 for sugar-sweetened beverages and 6 for diet beverages, with 349 found from the FFQ and 304 from 24HRs. The AUCs ranged from 0.63 for X-17686 as a biomarker of herbal tea estimated from 24HRs to 1.0 for glucuronide of C19H28O4 (1) and citraconate/glutaconate as biomarkers of total coffee intake. Glucuronide of C19H28O4 (1) is also the most predictive metabolite for caffeinated (AUC = 0.98) and decaffeinated coffee (AUC = 0.66). For tea consumption, N-acetyltheanine was the most predictive biomarker for total tea, green tea, and black tea but was not correlated with herbal tea intake.

2.2.9. Miscellaneous

The remaining 78 associations were found for 8 miscellaneous food groups, including 22 for French fries, 20 for all chips, 12 for chocolate candies, 12 for dark chocolate, 2 for desserts, 3 for bars (breakfast, energy and high protein bars combined), 2 for soy sauce and 5 for artificial sweeteners. Acesulfame, sucralose, saccharin, erythritol and X-25785 that were associated with all artificial sweetener intake were also associated with diet beverages. The lowest AUC was 0.66 for erythritol as a biomarker of artificial sweetener intake (estimated from 24HRs); the highest AUC was 0.85 for pentose acid, abscisate for French fries (negative correlations) and X-12823 for chocolate candies (estimated from post-FFQ).

2.3. Reproducibility of the Identified Food Metabolites

Of the 513 metabolites that were significantly associated with food groups/items identified via FFQ or 24HRs, the median ICC for duplicate samples over six months was 0.53 (interquartile range: 0.42–0.62). By super pathway, the median ICC ranged from 0.40 for carbohydrates to 0.65 for energy metabolites.
Combining information on both prediction accuracy (AUC) and reproducibility over time (ICC) can inform the reliability of a biomarker for use in future studies. The combined information on AUC and ICC for the most predictive metabolites of the 79 food groups/items is shown in Figure 1a,b. Biomarkers in the upper right corner, with both high AUC and high ICC, are considered reliable, while those in the lower left corner, with both low AUC and low ICC, are less reliable. Reliable biomarkers were seen for several food groups/items including coffee, alcohol, nuts, fish, tea, processed meat, poultry, and chocolate candies. Because the DAS was designed to capture seasonal variation by collecting 24 h urine six months apart, the low ICCs of some metabolites might reflect true variation in dietary intake. We further investigated how consumption frequency relates to AUC and ICC. Biomarkers of foods with low consumption frequencies tended to have lower AUCs and ICCs (Figure 2). Exceptions included biomarkers for fish and alcohol.

3. Discussion

In this cross-sectional metabolomics study among 648 men and women in the CPS-3 DAS with comprehensive dietary data assessed using both FFQ and repeated 24HRs, and with two 24 h urine samples collected approximately 6 months apart, we identified 1708 diet–metabolite correlations after adjusting for multiple comparisons. More diet–metabolite correlations were found using FFQ than 24HRs. Reproducibility of the 513 unique metabolites over six months was good for a large proportion, with 28% of metabolites with an ICC > 0.6. The comparisons of urinary dietary biomarkers identified in the present study with our previous findings in fasting plasma samples in the same study [13] revealed several overlapping food biomarkers identified in both blood and urine and many more putative biomarkers identified in urine for further evaluation. This study also provided important information on the reproducibility of the urinary biomarkers, which could facilitate their utilization in future clinical and epidemiological studies.
Urine collection is less invasive, cheaper, and offers greater volumes than blood collection. Most food components (e.g., phytochemicals) are xenobiotics that will be transformed and eliminated quickly via urine or feces. Therefore, urine as a biospecimen could be very useful for identifying dietary biomarkers in large population studies. The usefulness of urine was recently highlighted by a population study comparing dietary biomarkers measured in blood and urine samples from the same individuals. Playdon et al. [11] identified more diet–metabolite correlations in urine than in blood and more than a third of the correlations found in blood were also found in urine with similar magnitude. We previously published findings of diet-related biomarkers identified in fasting plasma samples in the CPS-3 DAS [13]. Among 671 men and women with at least one fasting blood sample in the CPS-3 DAS, a total of 677 diet–metabolite associations were identified (238 metabolites were associated with 76 food groups/items). In the present study, among a similar number of participants with at least one 24 h urine sample we identified a greater number of associations (n = 1708). We also found many overlapping diet–metabolite correlations in urine as we found previously in fasting plasma samples in the same study. 
For example, the same plausible biomarkers (food constituents or derivatives) were found for apples or pears (4-allylphenol sulfate), citrus fruits and juices (stachydrine, N-methylhydroxyproline, N-methylproline), soy products (genistein glucuronide), cruciferous vegetables (S-methylcysteine or S-methylcysteine sulfoxide), garlic (alliin, N-acetylalliin), whole grains (2,6-dihydroxybenzoic acid, 2-acetamidophenol sulfate, 4-methoxyphenol sulfate, 2-aminophenol sulfate), poultry (3-methylhistidine), fish (CMPF), nuts (tryptophan betaine, 4-vinylphenol sulfate), milk (N,N,N-trimethyl-5-aminovalerate and galactonate), artificial sweeteners (acesulfame, saccharin, and erythritol), alcohol (ethyl glucuronide and ethyl α-glucopyranoside), coffee (e.g., quinate, 3-hydroxypyridine sulfate, trigonelline (N-methylnicotinate)), and diet beverages (acesulfame). We previously found theanine, a potentially specific biomarker of tea intake, in blood [6,13]. A derivative of theanine, N-acetyltheanine, was found to be the most predictive biomarker of tea in urine in the present study. The magnitude of the correlations was similar in blood and urine. We also observed similar ICCs for the same biomarkers measured in both blood and urine. The high consistency between blood and urine findings in the CPS-3 DAS is also likely influenced by the fact that 24 h urine samples were returned on the same day that fasting blood samples were collected from the same participants.
We additionally replicated many other plausible biomarkers found in previous feeding or population studies, from either blood or urine. For example, we replicated biomarkers for banana (dopamine 3-O-sulfate) [6], citrus fruits and juices (e.g., N-methylglutamate, chiro-inositol, naringenin 7-glucuronide) [11,19], berries (catechol sulfate) [20], soy products (daidzein, genistein, daidzein sulfate, genistein sulfate, daidzein glucuronide and genistein glucuronide) [21,22], cruciferous vegetables (sulforaphane, sulforaphane-N-acetyl-cysteine) [23], garlic (S-allylcysteine, N-acetyl-S-allyl-L-cysteine) [6,24,25], whole grains (3-methoxycatechol sulfate) [26], milk (phenylacetylglycine, 2,8-quinolinediol sulfate) [6], and coffee (citraconate/glutaconate, feruloylquinate, 2-furoylglycine) [6,12,27]. Many other potentially novel biomarkers identified in the present study need to be confirmed in other studies and further evaluated.
Reproducibility of food-based biomarkers, which is affected by many sources of variability, is very important to inform the application of such biomarkers in large-scale clinical and epidemiological studies [17]. Large within-person variation in a biomarker over time is a major source of measurement error that could lead to underestimated diet-disease risk estimates and inconsistent findings. Generally, we found lower reproducibility (ICCs) for urinary biomarkers than for blood biomarkers, with a median ICC of 0.53 vs. 0.56 [13]. This is likely because most urinary biomarkers are hydrophilic xenobiotics and amino acids, which have shorter half-lives than lipophilic biomarkers. Many polyphenol biomarkers have half-lives shorter than 24 h [28]. Metabolites with a short half-life tend to have higher within-person variation, and thus a lower ICC. However, some may still be useful to capture habitual diet if the food/beverage is consumed frequently in the population (e.g., coffee), as we observed a positive relationship between consumption frequency and reproducibility of the biomarkers. Although our goal is to identify reliable biomarkers for habitual dietary intake, sensitive and specific short-term biomarkers, such as isoflavones and their derivatives for soy products, are still useful in monitoring dietary compliance in intervention studies or in populations with higher frequency of consumption. On the other hand, lipophilic or erythrocyte-associated biomarkers have longer half-lives of weeks or months because of the equilibrium of biomarkers between blood and fatty tissues, or because of binding to red blood cells [5]; thus, they are useful as long-term biomarkers. For example, even though fish and alcohol were not frequently consumed among participants in the present study, their most predictive metabolites (CMPF and ethyl glucuronide, respectively) still had high reproducibility over the six-month period.
Plausible biomarkers should have positive correlations with food intake. Many metabolites were inversely correlated with foods such as red and processed meat and may not be good candidates for further evaluation. A large proportion of the diet-related metabolites are unknowns which need annotation in future studies. We reported the unknowns herein given their strong relationships with dietary factors, so they may be compared with future studies using this platform. Moving forward, more research is needed to systematically evaluate plausible food and food group biomarkers in multiple aspects such as robustness in different populations and study settings, half-lives, dose–response relationships over a range of intakes, and comparisons to benchmark biomarkers [29].
The present study has several strengths, including its large sample size, comprehensive dietary data collected using both an FFQ and repeated 24HRs, availability of 24 h urine samples, and metabolomic profile data measured by an untargeted and sensitive mass spectrometry-based approach. These rich resources enabled us to explore a large number of diet–metabolite correlations simultaneously. The repeated measures of 24 h urinary metabolic profiles make the study unique because most cohort studies did not collect urine samples or only collected spot urine and because the repeated measures allowed for an assessment of biomarker reproducibility over time. This study also has limitations. Metabolites with low correlation coefficients may not be ideal biomarkers, as they only explain a small portion of the variation in dietary intake. The low correlations do not exclude them from further evaluation as candidate dietary biomarkers, though, because diet was assessed in this study using self-reported instruments, whose measurement errors could attenuate the correlation estimates with biomarkers. We were not able to distinguish acute intake biomarkers from habitual dietary biomarkers, as the study was designed not to burden the participants by collecting 24HRs and biospecimens at the same time. Future studies need to confirm these biomarkers in spot urine samples, as 24 h urine collections are burdensome and generally not feasible in large population studies.

4. Materials and Methods

4.1. Study Population

The Diet Assessment Sub-study (DAS) was a one-year observational study among 745 men and women enrolled in the CPS-3, designed to evaluate the validity and reproducibility of the newly modified CPS-3 FFQ over a year. CPS-3 is a large prospective cohort study of 303,682 adults aged 30–65 residing in 35 states plus the District of Columbia and Puerto Rico, who were enrolled between 2006 and 2013 as described in detail elsewhere [30]. Briefly, at enrollment, participants provided a blood sample, had waist circumference measured and completed an enrollment survey. Most participants also completed a more comprehensive baseline survey that assessed extensive lifestyle, medical and other information. Follow-up questionnaires were sent in 2015 to those who completed the baseline survey after enrollment (n = 254,650) to update lifestyle and medical information and to assess diet using the CPS-3 FFQ for the first time.
To recruit participants to the DAS, CPS-3 participants living in 5 regions defined by Quest Diagnostics business units (Atlanta, GA, USA; Dallas, TX, USA; Auburn Hills, MI, USA; West Hills, CA, USA; San Jose, CA, USA) were invited. Enrolled participants were asked to complete the 2015 follow-up survey (to serve as the pre-FFQ), six telephone-administered 24HRs throughout the year, provide two fasting blood and two 24 h urine samples and complete a post-FFQ at the end of the study. The six 24HRs aimed to include four weekdays and two weekend days. Blood and urine samples were collected approximately six months apart to capture seasonal variation.
A total of 745 men and women met the minimum inclusion criteria of completing both pre- and post-FFQs and the first 24HR. For the urinary metabolomics analysis, we excluded participants who completed less than three 24HRs (n = 2), had poor post-FFQs (n = 20; defined as missing 2 or more sections, an entire page, >100 line items, or with daily energy intake <800 or >4500 kcal for men, and <600 or >3800 kcal for women), or had missing or invalid urine collections at both time points (n = 30). Invalid urine collections were defined as missed or spilled voiding ≥2 times, incorrect collection or flushing of the next morning samples, missing volume or extreme total volume (top and bottom 1% distribution), extreme urinary creatinine (top and bottom 1% distribution), or total collection period <20 or >28 h. We further excluded current smokers (n = 19), those whose body weight was missing at both urine collection appointments (n = 1) or weight change was >20 lbs between urine collections (n = 13), and pregnant women (n = 12). Finally, 648 men and women were included in the urinary metabolomics analysis (Supplemental Figure S1). Those with two eligible urine samples (n = 482) were included in the analysis of assessing reproducibility. The CPS-3 DAS protocol was approved by the Emory University (Atlanta, GA, USA) Institutional Review Board.
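The exclusion counts above can be tallied to confirm that they account for the final analytic sample. The following is a minimal sketch in Python, assuming the exclusions were applied sequentially and that each participant is counted under exactly one reason; the dictionary labels are illustrative and are not the study's variable names.

```python
# Sequential exclusions reported for the urinary metabolomics analysis.
# Assumption: each participant is counted under exactly one exclusion reason.
enrolled = 745
exclusions = {
    "fewer than three 24HRs": 2,
    "poor post-FFQ": 20,
    "missing or invalid urine at both time points": 30,
    "current smoker": 19,
    "body weight missing at both appointments": 1,
    "weight change > 20 lbs between collections": 13,
    "pregnant": 12,
}

remaining = enrolled
for reason, n in exclusions.items():
    remaining -= n
    print(f"after excluding {reason} (n = {n}): {remaining}")

analytic_sample = remaining  # should equal the 648 participants reported
```

Under these assumptions, the reported counts sum to 97 exclusions, which is consistent with 648 participants remaining from the 745 who met the minimum inclusion criteria.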

4.2. Diet Assessment

Diet was assessed using the newly modified CPS-3 FFQ as described elsewhere [31]. Briefly, the Willett FFQ [32,33] was modified for the CPS-3 study population, of which 17.3% were non-white participants. Modifications to the FFQ were informed by telephone-administered 24HRs, analyses of NHANES 2009–2010, and focus groups. The final modified FFQ included 191 line items. We defined 101 food groups/items from the FFQ as shown in Supplemental Table S4, generally consistent with the definitions in our previous analysis in the CPS-II Nutrition Cohort [6]. Comparable food groups were derived from the 24HRs to match those from the FFQ. We also created a few food groups using the 24HRs that are not asked about (e.g., mushrooms) or are asked about in combination with other foods (e.g., apples) on the FFQ. A total of 105 food groups/items were derived from the 24HRs. Only the post-FFQ was used in the present study, as it assessed average dietary intake in the past 12 months, the period during which the 24 h urine samples were collected.

4.3. 24 h Urine Collection and Processing

Participants were instructed to begin 24 h urine collections in the morning the day prior to their fasting blood collection appointment. Urine collection started after voiding the first specimen in the morning, and participants collected all urine for the next 24 h including the following morning’s first specimen. Urine was collected in 3 L unpreserved jugs, and participants were instructed to refrigerate or keep samples in a cooler with cool packs provided. The following morning, participants delivered their completed 24 h urine collection to a Quest Patient Service Center and volume was recorded. Urine specimens were then transported to a Quest Diagnostics regional processing laboratory where samples were aliquoted into 4 × 5 mL and 5 × 1.8 mL labeled cryovials. All aliquots were frozen and shipped on dry ice to an off-site biorepository (Fisher BioServices, Inc., Frederick, MD, USA) for long-term storage in the vapor phase of liquid nitrogen.

4.4. Metabolomics Analysis

Metabolomic profiling was conducted by Metabolon, Inc. (Durham, NC, USA) using ultrahigh performance liquid chromatography-tandem mass spectrometry (UPLC–MS/MS) described in detail elsewhere [34,35]. Briefly, 100 µL urine samples were treated with 450 µL of methanol to precipitate proteins using an automated liquid handling robot (Hamilton LabStar, Hamilton Robotics, Inc., Reno, NV, USA). Four sample fractions were dried and reconstituted in different solvents for measurement under four different platforms. Two aliquots were analyzed using two separate reverse phase (RP)/UPLC–MS/MS methods with positive ion mode electrospray ionization (ESI), one chromatographically optimized for more hydrophilic compounds and one for more hydrophobic compounds. Another aliquot was analyzed using RP/UPLC–MS/MS with negative ion mode ESI using a separate dedicated C18 column. The last aliquot was analyzed via hydrophilic interaction chromatography (HILIC)/UPLC–MS/MS with negative ion mode ESI. Mobile phases of the RP positive ion method consisted of 0.1% formic acid in water and 0.1% formic acid in methanol. Mobile phases of the RP negative ion method consisted of 6.5 mM ammonium bicarbonate in water (pH 8) and 6.5 mM ammonium bicarbonate in 95% methanol/5% water. Mobile phases of the HILIC method consisted of 10 mM ammonium formate in 15% water, 5% methanol, 80% acetonitrile and 10 mM ammonium formate in 50% water, 50% acetonitrile. For all methods, the injection volume was 5 µL and a 2× needle loop overfill was used. Individual metabolites were identified by comparison with a chemical library maintained by Metabolon that comprises more than 3300 authenticated standards and recurrent unknown entities, based on retention time/index, mass to charge ratio, and chromatographic data (including MS/MS spectral data).
A total of 1551 metabolites were detected in the 24 h urine samples. Metabolites that were below the detection limit in >90% of the samples were excluded (n = 147). Values for each sample were normalized by osmolality. To correct for day-to-day variation from the platform, each metabolite was then rescaled to set the median equal to 1. Lastly, missing values were imputed with the minimum. Triplicates of 44 participant samples were used as quality controls to assess inter- and intra-batch variation. Intraclass correlation coefficients (ICCs) were calculated among the quality control samples to test the reproducibility of the platforms. Metabolites with an ICC < 0.5 were further excluded from the analysis, leaving 1391 for diet–metabolite analysis. Of the 1391 included metabolites, the median technical ICC was 0.94 (interquartile range: 0.89 to 0.97), suggesting very high reproducibility of the platforms.
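The per-metabolite processing steps described above (detection filter, osmolality normalization, median rescaling, minimum imputation) can be sketched as follows. This is an illustrative Python sketch, not the pipeline used in the study: the function name `preprocess`, its arguments and the toy values are hypothetical, the median is taken over all samples rather than per run day, and "the minimum" is assumed to mean the smallest observed value of that metabolite.

```python
from statistics import median

def preprocess(values, osmolality, max_missing=0.90):
    """Return processed values for one metabolite, or None if it is dropped.

    values: raw intensities per sample, None where below the detection limit.
    osmolality: per-sample urine osmolality (hypothetical input name).
    """
    missing = sum(v is None for v in values)
    if missing / len(values) > max_missing:
        return None  # below the detection limit in >90% of samples
    # Normalize each observed value by the osmolality of its sample.
    normed = [None if v is None else v / o for v, o in zip(values, osmolality)]
    # Rescale so that the median of the observed values equals 1.
    med = median(v for v in normed if v is not None)
    scaled = [None if v is None else v / med for v in normed]
    # Assumption: impute missing values with the minimum observed value.
    floor = min(v for v in scaled if v is not None)
    return [floor if v is None else v for v in scaled]

# Toy data: five samples, one below the detection limit.
raw = [200.0, None, 400.0, 900.0, 100.0]
osm = [400.0, 500.0, 400.0, 600.0, 200.0]
processed = preprocess(raw, osm)
```

In this toy example the imputed second sample takes the smallest rescaled value, and the median of the four observed values is 1 after rescaling.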

4.5. Statistical Analysis

Metabolite and food variables (from FFQs and 24HRs) were generalized log transformed [36] and auto-scaled before all analyses. Metabolite levels were averaged for participants with two measurements. Pearson’s partial correlation was used to determine the food–metabolite correlations, controlling for age (continuous), gender, race/ethnicity (white, black, Hispanic), education (no college, college graduate, graduate school, unknown), smoking status (never, former), physical activity (metabolic equivalent hours per week (MET-h/wk): <5, 5–<10 or missing, 10–<15, ≥15), body mass index (kg/m², continuous), ethanol intake (g/d, continuous; except for alcohol-containing items), and energy intake (kcal/d, continuous). Associations were considered statistically significant if p values were below the Bonferroni-corrected threshold (0.05/1391/101 = 3.56 × 10^−7 for the FFQ; 0.05/1391/105 = 3.42 × 10^−7 for 24HRs). To minimize false-positive findings, we further required the absolute value of the correlation coefficient (|r|) to be greater than 0.2.
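The selection rule in this paragraph can be illustrated with a short Python sketch. It is not the authors' code: the function names are hypothetical, and the partial correlation uses the textbook recursive formula, which is convenient for illustration but slow with many covariates (a regression-residual approach would be the practical choice). The p value is taken as an input because computing it requires a t distribution, which the standard library does not provide.

```python
from math import sqrt


def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def partial_corr(x, y, covariates):
    """Partial correlation of x and y given a list of covariate sequences.

    Uses the recursion r_xy.z = (r_xy - r_xz * r_yz) /
    sqrt((1 - r_xz^2) * (1 - r_yz^2)), removing one covariate at a time.
    """
    if not covariates:
        return pearson(x, y)
    *rest, z = covariates
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def bonferroni_threshold(alpha, n_metabolites, n_foods):
    """Per-test significance threshold for all metabolite-by-food pairs."""
    return alpha / n_metabolites / n_foods


def is_candidate(r, p, threshold, min_abs_r=0.2):
    """Apply both criteria: p below the threshold and |r| above 0.2."""
    return p < threshold and abs(r) > min_abs_r
```

For example, `bonferroni_threshold(0.05, 1391, 101)` reproduces the FFQ threshold of about 3.56 × 10^−7 quoted above.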
Putative dietary biomarkers were further evaluated for predictive accuracy of discriminating top from bottom quartile of consumption (highest vs. lowest intake), assessed using the AUC calculated from the ROC curve using R package pROC [37]. AUC < 0.7 was considered to be low, 0.7–<0.8 to be moderate, and ≥0.8 to be high.
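As a rough illustration of this evaluation, the sketch below computes an empirical AUC for discriminating top-quartile from bottom-quartile consumers. It is a standard-library stand-in, not the pROC workflow used in the study: the function names are hypothetical, the quartile cut points come from `statistics.quantiles` with its default method, and the AUC is reported as max(AUC, 1 − AUC) so that inversely associated metabolites are scored by their discriminating ability (an assumption about how direction was handled).

```python
from statistics import quantiles


def quartile_auc(intake, marker):
    """Empirical AUC for separating top- from bottom-quartile intake.

    intake: food intake per participant; marker: metabolite level per
    participant, in the same order. Participants in the middle two
    quartiles are ignored.
    """
    q1, _, q3 = quantiles(intake, n=4)
    bottom = [m for i, m in zip(intake, marker) if i <= q1]
    top = [m for i, m in zip(intake, marker) if i >= q3]
    # Mann-Whitney form of the AUC: the share of (top, bottom) pairs in
    # which the top-quartile participant has the higher marker value,
    # counting ties as one half.
    wins = sum((t > b) + 0.5 * (t == b) for t in top for b in bottom)
    auc = wins / (len(top) * len(bottom))
    return max(auc, 1 - auc)


def classify_auc(auc):
    """Label an AUC using the cut-offs stated in the text."""
    if auc >= 0.8:
        return "high"
    if auc >= 0.7:
        return "moderate"
    return "low"
```

For foods with many non-consumers, the lower and upper cut points can coincide, so the two groups would need to be defined more carefully than in this sketch.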
The reproducibility of the identified food-related metabolites over six months was assessed using ICCs. ICCs were calculated as the ratio of between-person variance to the total variance among participants with repeated measures of urinary metabolic profiles. Between-person variance was estimated from a random effects model in which participant was modeled as a random effect. We considered ICCs > 0.6 to indicate good reproducibility and ICCs > 0.75 to indicate excellent reproducibility.
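The ICC definition above can be made concrete with a small Python sketch. It assumes balanced data (the same number of repeated measurements per participant) and uses the one-way ANOVA estimator of the variance components as a stand-in for the random effects model; the paper does not state which estimator was used, and the function names and the "below good" label are hypothetical.

```python
def icc_oneway(measurements):
    """ICC = between-person variance / total variance (one-way model).

    measurements: list of per-participant tuples, each holding the same
    number k >= 2 of repeated values of one metabolite.
    """
    n = len(measurements)
    k = len(measurements[0])
    means = [sum(row) / k for row in measurements]
    grand = sum(means) / n
    # Mean squares between and within participants.
    msb = k * sum((m - grand) ** 2 for m in means) / (n - 1)
    msw = sum((v - m) ** 2
              for row, m in zip(measurements, means)
              for v in row) / (n * (k - 1))
    # Variance components; a negative between-person estimate is set to 0.
    var_between = max((msb - msw) / k, 0.0)
    total = var_between + msw
    if total == 0:
        raise ValueError("no variation in the measurements")
    return var_between / total


def classify_icc(icc):
    """Label an ICC using the cut-offs stated in the text."""
    if icc > 0.75:
        return "excellent"
    if icc > 0.6:
        return "good"
    return "below good"
```

With two urine collections per participant, each tuple would hold the two values measured six months apart.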

5. Conclusions

In conclusion, in this large cross-sectional analysis of habitual diet and 24 h urinary metabolic profiles in a free-living population of 648 racially/ethnically diverse men and women, we identified many more potential dietary biomarkers in urine than in fasting blood samples from the same study, and replicated several found in previous studies. These findings provide information complementary to that from blood biomarkers, as well as important information on the reproducibility of urinary biomarkers. These candidate biomarkers warrant further evaluation, and reliable ones could be used in future clinical and epidemiological studies.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/metabo11040248/s1. Figure S1: study population exclusion; Table S1: Food–metabolite associations identified using either FFQ or 24 h diet recalls in the CPS-3 Diet Assessment Sub-study; Table S2: Food–metabolite associations identified using the FFQ in the CPS-3 Diet Assessment Sub-study; Table S3: Food–metabolite associations identified using the average 24 h diet recalls in the CPS-3 Diet Assessment Sub-study; Table S4: Food group definitions in the CPS-3 Diet Assessment Sub-study.

Author Contributions

Y.W. and M.L.M. designed the research; R.A.H. performed the statistical analysis; Y.W. wrote the paper; R.A.H., V.L.S., T.J.H. and M.L.M. critically reviewed the manuscript; Y.W. takes primary responsibility for the final content. All authors have read and agreed to the published version of the manuscript.

Funding

The American Cancer Society funds the creation, maintenance, and updating of the Cancer Prevention Study-3 cohort. This project was also funded by the American Cancer Society.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board of Emory University (IRB ID CR001-IRB00059007, approved on 10/23/2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data described in the manuscript and analytic code are not available to protect participant confidentiality and in adherence with institutional policies.

Acknowledgments

The authors express sincere appreciation to all CPS-3 Diet Assessment Sub-study participants and to each member of the study and biospecimen management group.

Conflicts of Interest

The authors declare no conflict of interest.

Disclaimer

The views expressed here are those of the authors and do not necessarily represent the American Cancer Society or the American Cancer Society—Cancer Action Network.

References

  1. Rock, C.L.; Thomson, C.; Gansler, T.; Gapstur, S.M.; McCullough, M.L.; Patel, A.V.; Andrews, K.S.; Bandera, E.V.; Spees, C.K.; Robien, K.; et al. American Cancer Society guideline for diet and physical activity for cancer prevention. CA Cancer J. Clin. 2020, 70, 245–271.
  2. US Department of Health and Human Services; US Department of Agriculture. 2015–2020 Dietary Guidelines for Americans. Available online: http://www.health.gov/DietaryGuidelines (accessed on 18 August 2020).
  3. World Cancer Research Fund; American Institute for Cancer Research. Diet, Nutrition, Physical Activity and Cancer: A Global Perspective. Continuous Update Project Expert Report 2018. Available online: https://www.wcrf.org/sites/default/files/Summary-of-Third-Expert-Report-2018.pdf (accessed on 18 August 2020).
  4. Brennan, L.; Hu, F.B. Metabolomics-Based Dietary Biomarkers in Nutritional Epidemiology-Current Status and Future Opportunities. Mol. Nutr. Food Res. 2019, 63, e1701064.
  5. Scalbert, A.; Brennan, L.; Manach, C.; Andres-Lacueva, C.; Dragsted, L.O.; Draper, J.; Rappaport, S.M.; van der Hooft, J.J.; Wishart, D.S. The food metabolome: A window over dietary exposure. Am. J. Clin. Nutr. 2014, 99, 1286–1308.
  6. Wang, Y.; Gapstur, S.M.; Carter, B.D.; Hartman, T.J.; Stevens, V.L.; Gaudet, M.M.; McCullough, M.L. Untargeted Metabolomics Identifies Novel Potential Biomarkers of Habitual Food Intake in a Cross-Sectional Study of Postmenopausal Women. J. Nutr. 2018, 148, 932–943.
  7. Pallister, T.; Jennings, A.; Mohney, R.P.; Yarand, D.; Mangino, M.; Cassidy, A.; MacGregor, A.; Spector, T.D.; Menni, C. Characterizing Blood Metabolomics Profiles Associated with Self-Reported Food Intakes in Female Twins. PLoS ONE 2016, 11, e0158568.
  8. Guertin, K.A.; Moore, S.C.; Sampson, J.N.; Huang, W.Y.; Xiao, Q.; Stolzenberg-Solomon, R.Z.; Sinha, R.; Cross, A.J. Metabolomics in nutritional epidemiology: Identifying metabolites associated with diet and quantifying their potential to uncover diet-disease relations in populations. Am. J. Clin. Nutr. 2014, 100, 208–217.
  9. Andersen, M.B.; Kristensen, M.; Manach, C.; Pujos-Guillot, E.; Poulsen, S.K.; Larsen, T.M.; Astrup, A.; Dragsted, L. Discovery and validation of urinary exposure markers for different plant foods by untargeted metabolomics. Anal. Bioanal. Chem. 2014, 406, 1829–1844.
  10. Zheng, Y.; Yu, B.; Alexander, D.; Steffen, L.M.; Boerwinkle, E. Human metabolome associates with dietary intake habits among African Americans in the atherosclerosis risk in communities study. Am. J. Epidemiol. 2014, 179, 1424–1433.
  11. Playdon, M.C.; Sampson, J.N.; Cross, A.J.; Sinha, R.; Guertin, K.A.; Moy, K.A.; Rothman, N.; Irwin, M.L.; Mayne, S.T.; Stolzenberg-Solomon, R.; et al. Comparing metabolite profiles of habitual diet in serum and urine. Am. J. Clin. Nutr. 2016, 104, 776–789.
  12. Edmands, W.M.; Ferrari, P.; Rothwell, J.A.; Rinaldi, S.; Slimani, N.; Barupal, D.K.; Biessy, C.; Jenab, M.; Clavel-Chapelon, F.; Fagherazzi, G.; et al. Polyphenol metabolome in human urine and its association with intake of polyphenol-rich foods across European countries. Am. J. Clin. Nutr. 2015, 102, 905–913.
  13. Wang, Y.; Hodge, R.A.; Stevens, V.L.; Hartman, T.J.; McCullough, M.L. Identification and Reproducibility of Plasma Metabolomic Biomarkers of Habitual Food Intake in a US Diet Validation Study. Metabolites 2020, 10, 382.
  14. Playdon, M.C.; Moore, S.C.; Derkach, A.; Reedy, J.; Subar, A.F.; Sampson, J.N.; Albanes, D.; Gu, F.; Kontto, J.; Lassale, C.; et al. Identifying biomarkers of dietary patterns by using metabolomics. Am. J. Clin. Nutr. 2017, 105, 450–465.
  15. McCullough, M.L.; Maliniak, M.L.; Stevens, V.L.; Carter, B.D.; Hodge, R.A.; Wang, Y. Metabolomic markers of healthy dietary patterns in US postmenopausal women. Am. J. Clin. Nutr. 2019, 109, 1439–1451.
  16. Maruvada, P.; Lampe, J.W.; Wishart, D.S.; Barupal, D.; Chester, D.N.; Dodd, D.; Djoumbou-Feunang, Y.; Dorrestein, P.C.; Dragsted, L.O.; Draper, J.; et al. Perspective: Dietary Biomarkers of Intake and Exposure-Exploration with Omics Approaches. Adv. Nutr. 2020, 11, 200–215.
  17. Sampson, J.N.; Boca, S.M.; Shu, X.O.; Stolzenberg-Solomon, R.Z.; Matthews, C.E.; Hsing, A.W.; Tan, Y.T.; Ji, B.T.; Chow, W.H.; Cai, Q.; et al. Metabolomics in epidemiology: Sources of variability in metabolite measurements and implications. Cancer Epidemiol. Biomark. Prev. 2013, 22, 631–640.
  18. Xiao, Q.; Moore, S.C.; Boca, S.M.; Matthews, C.E.; Rothman, N.; Stolzenberg-Solomon, R.Z.; Sinha, R.; Cross, A.J.; Sampson, J.N. Sources of variability in metabolite measurements from urinary samples. PLoS ONE 2014, 9, e95749.
  19. Davis, B.D.; Needs, P.W.; Kroon, P.A.; Brodbelt, J.S. Identification of isomeric flavonoid glucuronides in urine and plasma by metal complexation and LC-ESI-MS/MS. J. Mass Spectrom. 2006, 41, 911–920.
  20. Pimpao, R.C.; Ventura, M.R.; Ferreira, R.B.; Williamson, G.; Santos, C.N. Phenolic sulfates as new and highly abundant metabolites in human plasma after ingestion of a mixed berry fruit puree. Br. J. Nutr. 2015, 113, 454–463.
  21. Shelnutt, S.R.; Cimino, C.O.; Wiggins, P.A.; Ronis, M.J.; Badger, T.M. Pharmacokinetics of the glucuronide and sulfate conjugates of genistein and daidzein in men and women after consumption of a soy beverage. Am. J. Clin. Nutr. 2002, 76, 588–594.
  22. Shelnutt, S.R.; Cimino, C.O.; Wiggins, P.A.; Badger, T.M. Urinary pharmacokinetics of the glucuronide and sulfate conjugates of genistein and daidzein. Cancer Epidemiol. Biomark. Prev. 2000, 9, 413–419.
  23. Saha, S.; Hollands, W.; Teucher, B.; Needs, P.W.; Narbad, A.; Ortori, C.A.; Barrett, D.A.; Rossiter, J.T.; Mithen, R.F.; Kroon, P.A. Isothiocyanate concentrations and interconversion of sulforaphane to erucin in human subjects after consumption of commercial frozen broccoli compared to fresh broccoli. Mol. Nutr. Food Res. 2012, 56, 1906–1916.
  24. de Rooij, B.M.; Boogaard, P.J.; Rijksen, D.A.; Commandeur, J.N.; Vermeulen, N.P. Urinary excretion of N-acetyl-S-allyl-L-cysteine upon garlic consumption by human volunteers. Arch. Toxicol. 1996, 70, 635–639.
  25. Pratico, G.; Gao, Q.; Manach, C.; Dragsted, L.O. Biomarkers of food intake for Allium vegetables. Genes Nutr. 2018, 13, 34.
  26. Zhu, Y.; Wang, P.; Sha, W.; Sang, S. Urinary Biomarkers of Whole Grain Wheat Intake Identified by Non-targeted and Targeted Metabolomics Approaches. Sci. Rep. 2016, 6, 36278.
  27. Heinzmann, S.S.; Holmes, E.; Kochhar, S.; Nicholson, J.K.; Schmitt-Kopplin, P. 2-Furoylglycine as a Candidate Biomarker of Coffee Consumption. J. Agric. Food Chem. 2015, 63, 8615–8621.
  28. Manach, C.; Williamson, G.; Morand, C.; Scalbert, A.; Remesy, C. Bioavailability and bioefficacy of polyphenols in humans. I. Review of 97 bioavailability studies. Am. J. Clin. Nutr. 2005, 81, 230S–242S.
  29. Dragsted, L.O.; Gao, Q.; Scalbert, A.; Vergeres, G.; Kolehmainen, M.; Manach, C.; Brennan, L.; Afman, L.A.; Wishart, D.S.; Andres Lacueva, C.; et al. Validation of biomarkers of food intake-critical assessment of candidate biomarkers. Genes Nutr. 2018, 13, 14.
  30. Patel, A.V.; Jacobs, E.J.; Dudas, D.M.; Briggs, P.J.; Lichtman, C.J.; Bain, E.B.; Stevens, V.L.; McCullough, M.L.; Teras, L.R.; Campbell, P.T.; et al. The American Cancer Society’s Cancer Prevention Study 3 (CPS-3): Recruitment, study design, and baseline characteristics. Cancer 2017, 123, 2014–2024.
  31. Troeschel, A.N.; Hartman, T.J.; Flanders, W.D.; Wang, Y.; Hodge, R.A.; McCullough, L.E.; Mitchell, D.C.; Sampson, L.; Patel, A.V.; McCullough, M.L. The American Cancer Society Cancer Prevention Study-3 FFQ Has Reasonable Validity and Reproducibility for Food Groups and a Diet Quality Score. J. Nutr. 2020, 150, 1566–1578.
  32. Rimm, E.B.; Giovannucci, E.L.; Stampfer, M.J.; Colditz, G.A.; Litin, L.B.; Willett, W.C. Reproducibility and validity of an expanded self-administered semiquantitative food frequency questionnaire among male health professionals. Am. J. Epidemiol. 1992, 135, 1114–1126; discussion 1127–1136.
  33. Feskanich, D.; Rimm, E.B.; Giovannucci, E.L.; Colditz, G.A.; Stampfer, M.J.; Litin, L.B.; Willett, W.C. Reproducibility and validity of food intake measurements from a semiquantitative food frequency questionnaire. J. Am. Diet. Assoc. 1993, 93, 790–796.
  34. Evans, A.M.; DeHaven, C.D.; Barrett, T.; Mitchell, M.; Milgram, E. Integrated, nontargeted ultrahigh performance liquid chromatography/electrospray ionization tandem mass spectrometry platform for the identification and relative quantification of the small-molecule complement of biological systems. Anal. Chem. 2009, 81, 6656–6667.
  35. Evans, A.M.; Bridgewater, B.; Liu, Q.; Mitchell, M.; Robinson, R.; Dai, H.; Stewart, S.; DeHaven, C.; Miller, L.J.M. High resolution mass spectrometry improves data quantity and quality as compared to unit mass resolution mass spectrometry in high-throughput profiling. Metabolomics 2014, 4, 1.
  36. Huber, W.; Von Heydebreck, A.; Sültmann, H.; Poustka, A.; Vingron, M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002, 18, S96–S104.
  37. Robin, X.; Turck, N.; Hainard, A.; Tiberti, N.; Lisacek, F.; Sanchez, J.-C.; Müller, M. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 2011, 12, 77.
Figure 1. Metabolite prediction accuracy for food intake by metabolite reproducibility for the most predictive metabolite of 79 food groups/items in the Cancer Prevention Study-3 Diet Assessment Sub-study. (a) The most predictive metabolites for 71 food groups/items assessed using the food frequency questionnaire; (b) the most predictive metabolites for 60 food groups/items assessed using the average of 24 h diet recalls. Prediction accuracy was assessed by area under the curve (AUC) from the receiver operating characteristic curve, which indicates how well a metabolite could discriminate top quartile from bottom quartile intake of a food group/item. Reproducibility was assessed by intraclass correlation coefficients (ICCs), calculated as the ratio of between-person variance to the total variance among participants with repeated urinary metabolic profiles measured six months apart.
Figure 2. Metabolite prediction accuracy and reproducibility by food consumption frequency for the most predictive metabolite of 71 food groups/items assessed using the FFQ in the Cancer Prevention Study-3 Diet Assessment Sub-study. (a) Metabolite prediction accuracy, assessed by area under the curve (AUC) from the receiver operating characteristic curve, in relation to food consumption frequency; (b) metabolite reproducibility, assessed by intraclass correlation coefficients (ICCs) over six months, in relation to food consumption frequency.
Table 1. Characteristics of participants (n = 648) in the Cancer Prevention Study-3 Diet Assessment Sub-study 1.
| Characteristics | Men (n = 227) | Women (n = 421) |
|---|---|---|
| Age (year) | 52.3 ± 10.1 | 52.2 ± 9.1 |
| Race/ethnicity | | |
| White | 146 (64.3) | 246 (58.4) |
| Black | 38 (16.7) | 120 (28.5) |
| Hispanic | 43 (18.9) | 55 (13.1) |
| BMI at pre-FFQ (kg/m²) | 27.2 (5.0) | 27.8 (6.5) |
| Education | | |
| <College | 39 (17.2) | 104 (24.7) |
| College | 78 (34.4) | 137 (32.5) |
| ≥Graduate school | 101 (44.5) | 167 (39.7) |
| Unknown | 9 (4.0) | 13 (3.1) |
| Smoking status | | |
| Never | 178 (78.4) | 336 (79.8) |
| Former | 49 (21.6) | 85 (20.2) |
| Recreational physical activity (MET-h/wk) | | |
| 0–<5 | 41 (18.1) | 120 (28.5) |
| 5–<10 2 | 71 (31.3) | 143 (34.0) |
| 10–<15 | 49 (21.6) | 72 (17.1) |
| ≥15 | 66 (29.1) | 86 (20.4) |
| Ethanol intake (g/d) | 10.4 ± 14.1 | 6.8 ± 11.1 |
| Energy from post-FFQ (kcal/d) | 2134 ± 687 | 2001 ± 611 |
| Average energy intake from 24HRs (kcal/d) | 2198 ± 570 | 1724 ± 407 |
Abbreviations: BMI, body mass index; 24HR, 24 h diet recall; FFQ, food frequency questionnaire; MET-h, metabolic equivalent hour. 1 Values are the mean ± standard deviation for continuous variables, and frequency (%) for categorical variables. 2 Includes missing.
Table 2. Top three predictive metabolites for 79 food groups/items assessed using the CPS-3 FFQ and the average of 24 h diet recalls in the Cancer Prevention Study-3 Diet Assessment Sub-study 1.
Food Group/ItemsBiochemical Name 2Super PathwaySub PathwayPost-FFQAverage 24HRsICC 3
Rp ValueAUCRp ValueAUC
FRUITS
Grapesnaringenin 7-glucuronideXenobioticsFood Component/Plant0.101.52 × 10−20.730.211.42 × 10−70.700.25 (0.17, 0.34)
PrunesvanillactateAmino AcidTyrosine Metabolism0.117.29 × 10−30.630.211.68 × 10−70.600.67 (0.62, 0.72)
Bananadopamine 3-O-sulfateAmino AcidTyrosine Metabolism0.306.60 × 10−150.830.251.48 × 10−100.730.53 (0.47, 0.60)
X-24338 0.303.37 × 10−140.820.275.94 × 10−120.730.40 (0.33, 0.48)
ethyl pyruvateXenobioticsFood Component/Plant0.252.24 × 10−100.800.251.17 × 10−100.740.38 (0.31, 0.46)
Avocado3-methyladipateLipidFatty Acid, Dicarboxylate0.272.91 × 10−120.830.187.13 × 10−60.740.56 (0.50, 0.62)
homocitrateXenobioticsFood Component/Plant0.211.75 × 10−70.820.138.34 × 10−40.740.50 (0.44, 0.57)
X-17335 0.211.23 × 10−70.820.085.29 × 10−20.720.38 (0.31, 0.46)
Apples or pears4-allylphenol sulfateXenobioticsFood Component/Plant0.211.34 × 10−70.79 0.40 (0.32, 0.47)
xyloseCarbohydratePentose Metabolism0.211.74 × 10−70.77 0.34 (0.27, 0.43)
Apples 44-allylphenol sulfateXenobioticsFood Component/Plant 0.251.10 × 10−100.710.40 (0.32, 0.47)
xyloseCarbohydratePentose Metabolism 0.241.94 × 10−90.710.34 (0.27, 0.43)
X-25838 0.252.01 × 10−100.700.40 (0.32, 0.47)
Total citrus fruits and juicesstachydrineXenobioticsFood Component/Plant0.527.41 × 10−450.940.466.06 × 10−350.850.44 (0.37, 0.51)
N-methylglutamateAmino AcidGlutamate Metabolism0.464.00 × 10−340.900.391.90 × 10−240.820.47 (0.40, 0.54)
X-12111 0.404.24 × 10−260.900.409.72 × 10−260.820.40 (0.33, 0.48)
OrangesstachydrineXenobioticsFood Component/Plant0.305.24 × 10−150.820.275.55 × 10−120.760.44 (0.37, 0.51)
N-methylglutamateAmino AcidGlutamate Metabolism0.251.23 × 10−100.810.202.09 × 10−70.730.47 (0.40, 0.54)
X-19183 0.241.94 × 10−90.790.251.42 × 10−100.750.34 (0.27, 0.42)
Orange juicestachydrineXenobioticsFood Component/Plant0.369.49 × 10−210.890.359.29 × 10−200.800.44 (0.37, 0.51)
N-methylglutamateAmino AcidGlutamate Metabolism0.362.62 × 10−200.880.342.14 × 10−180.790.47 (0.40, 0.54)
X-12111 0.321.35 × 10−160.870.351.38 × 10−190.790.40 (0.33, 0.48)
GrapefruitstachydrineXenobioticsFood Component/Plant0.251.87 × 10−100.710.187.47 × 10−60.620.44 (0.37, 0.51)
N-methylglutamateAmino AcidGlutamate Metabolism0.218.50 × 10−80.710.151.22 × 10−40.610.47 (0.40, 0.54)
WatermelonX-25271 0.381.43 × 10−230.830.316.53 × 10−160.690.25 (0.17, 0.34)
CantaloupeX-25271 0.312.17 × 10−150.760.211.88 × 10−70.660.25 (0.17, 0.34)
BerriesquinateXenobioticsFood Component/Plant0.215.87 × 10−80.840.101.54 × 10−20.710.82 (0.79, 0.85)
4-allylphenol sulfateXenobioticsFood Component/Plant0.202.67 × 10−70.830.192.05 × 10−60.750.40 (0.32, 0.47)
X-24757 0.234.37 × 10−90.820.123.62 × 10−30.720.64 (0.59, 0.69)
StrawberriesxyloseCarbohydratePentose Metabolism0.198.56 × 10−70.780.223.30 × 10−80.740.34 (0.27, 0.43)
X-25523 0.151.56 × 10−40.780.217.47 × 10−80.720.49 (0.42, 0.56)
ursocholateLipidSecondary Bile Acid Metabolism−0.211.12 × 10−70.78−0.061.58 × 10−10.690.67 (0.61, 0.71)
BlueberriesX-23970 0.221.26 × 10−80.830.244.83 × 10−100.750.52 (0.46, 0.59)
X-25523 0.222.00 × 10−80.830.211.09 × 10−70.730.49 (0.42, 0.56)
catechol sulfateXenobioticsBenzoate Metabolism0.222.65 × 10−80.830.151.50 × 10−40.720.71 (0.66, 0.75)
Blackberries 4isocitric lactoneEnergyTCA Cycle 0.223.49 × 10−80.640.43 (0.36, 0.51)
Peaches or plumsxyloseCarbohydratePentose Metabolism0.092.16 × 10−20.760.223.62 × 10−80.710.34 (0.27, 0.43)
VEGETABLES
Ketchup and salsaX-25247 0.211.25 × 10−70.800.121.79 × 10−30.720.30 (0.23, 0.39)
BeansX-17365 0.237.88 × 10−90.910.184.68 × 10−60.710.50 (0.43, 0.56)
N-acetylalliinXenobioticsFood Component/Plant0.232.64 × 10−90.910.172.73 × 10−50.710.37 (0.30, 0.45)
X-23639 0.222.43 × 10−80.910.121.72 × 10−30.700.66 (0.61, 0.71)
Soy productsglycitein glucuronide (2) *XenobioticsFood Component/Plant0.396.26 × 10−250.800.395.00 × 10−250.740.38 (0.31, 0.46)
glycitein sulfate (2)XenobioticsFood Component/Plant0.351.52 × 10−190.790.397.49 × 10−240.750.46 (0.39, 0.53)
daidzein sulfate (2)XenobioticsFood Component/Plant0.356.36 × 10−200.790.351.54 × 10−190.740.44 (0.37, 0.52)
Fermented soy productscarnosineAmino AcidHistidine Metabolism−0.202.17 × 10−70.68−0.101.01 × 10−20.600.40 (0.33, 0.47)
isovalerylcarnitine (C5)Amino AcidLeucine, Isoleucine and Valine Metabolism−0.246.68 × 10−100.67−0.122.86 × 10−30.590.54 (0.47, 0.60)
N,N,N-trimethyl-5-aminovalerateAmino AcidLysine Metabolism−0.203.51 × 10−70.67−0.085.80 × 10−20.590.40 (0.33, 0.48)
Soy milkdaidzein sulfate (1)XenobioticsFood Component/Plant0.311.97 × 10−150.650.371.85 × 10−210.640.43 (0.36, 0.50)
X-18750 0.287.48 × 10−130.650.316.07 × 10−160.620.40 (0.33, 0.47)
glycitein sulfate (2)XenobioticsFood Component/Plant0.309.22 × 10−150.650.315.51 × 10−160.620.46 (0.39, 0.53)
Soy protein powderX-16649 0.219.86 × 10−80.640.121.70 × 10−30.600.53 (0.46, 0.59)
daidzein sulfate (1)XenobioticsFood Component/Plant0.202.28 × 10−70.640.101.16 × 10−20.600.43 (0.36, 0.50)
genisteinXenobioticsFood Component/Plant0.202.68 × 10−70.630.075.94 × 10−20.610.36 (0.28, 0.44)
Cruciferous vegetablesX-25217 0.378.67 × 10−220.870.198.94 × 10−70.740.23 (0.16, 0.33)
S-methylcysteine sulfoxideAmino AcidMethionine, Cysteine, SAM and Taurine Metabolism0.308.50 × 10−150.860.211.50 × 10−70.770.46 (0.39, 0.53)
X-24330 0.251.84 × 10−100.850.143.83 × 10−40.700.57 (0.51, 0.63)
Leafy greenscytosineNucleotidePyrimidine Metabolism, Cytidine containing0.222.39 × 10−80.830.084.02 × 10−20.690.43 (0.36, 0.50)
X-23970 0.191.07 × 10−60.810.211.35 × 10−70.740.52 (0.46, 0.59)
Iceberg or head lettucepentose acid *Partially Characterized MoleculesPartially Characterized Molecules−0.223.45 × 10−80.73−0.114.15 × 10−30.580.57 (0.50, 0.62)
PeppersX-23780 0.289.82 × 10−130.820.218.85 × 10−80.780.39 (0.31, 0.47)
X-17365 0.222.35 × 10−80.800.168.55 × 10−50.750.50 (0.43, 0.56)
Mushrooms 4N-methyltaurineAmino AcidMethionine, Cysteine, SAM and Taurine Metabolism 0.234.77 × 10−90.700.51 (0.44, 0.57)
X-17365 0.224.53 × 10−80.700.50 (0.43, 0.56)
N-acetylalliinXenobioticsFood Component/Plant 0.219.37 × 10−80.700.37 (0.30, 0.45)
Allium vegetablesX-17365 0.374.08 × 10−220.820.223.19 × 10−80.760.50 (0.43, 0.56)
N-methyltaurineAmino AcidMethionine, Cysteine, SAM and Taurine Metabolism0.342.14 × 10−180.820.247.67 × 10−100.750.51 (0.44, 0.57)
2,3-dimethylsuccinateAmino AcidLeucine, Isoleucine and Valine Metabolism0.291.79 × 10−130.810.202.10 × 10−70.740.36 (0.28, 0.44)
OnionX-17365 0.361.10 × 10−200.830.219.24 × 10−80.750.50 (0.43, 0.56)
N-methyltaurineAmino AcidMethionine, Cysteine, SAM and Taurine Metabolism0.332.65 × 10−170.830.232.79 × 10−90.730.51 (0.44, 0.57)
2,3-dimethylsuccinateAmino AcidLeucine, Isoleucine and Valine Metabolism0.281.12 × 10−120.810.204.48 × 10−70.720.36 (0.28, 0.44)
GarlicN-acetylalliinXenobioticsFood Component/Plant0.382.58 × 10−230.840.203.59 × 10−70.710.37 (0.30, 0.45)
N-methyltaurineAmino AcidMethionine, Cysteine, SAM and Taurine Metabolism0.239.41 × 10−90.820.218.04 × 10−80.730.51 (0.44, 0.57)
X-17365 0.295.77 × 10−140.820.211.49 × 10−70.720.50 (0.43, 0.56)
Garlic powderN-acetyl-S-allyl-L-cysteineXenobioticsFood Component/Plant0.235.31 × 10−90.740.042.72 × 10−10.690.46 (0.39, 0.53)
S-allylcysteineXenobioticsFood Component/Plant0.222.18 × 10−80.730.061.07 × 10−10.690.35 (0.28, 0.44)
GRAINS
Whole grains2,6-dihydroxybenzoic acidXenobioticsDrug-Topical Agents0.325.70 × 10−170.910.232.50 × 10−90.800.52 (0.45, 0.58)
2-acetamidophenol sulfateXenobioticsDrug-Analgesics, Anesthetics0.343.29 × 10−180.900.265.93 × 10−110.800.51 (0.44, 0.57)
4-methoxyphenol sulfateAmino AcidTyrosine Metabolism0.277.24 × 10−120.890.168.07 × 10−50.770.33 (0.26, 0.41)
Whole-grain bread3,5-dihydroxybenzoic acidXenobioticsFood Component/Plant0.311.42 × 10−150.840.281.53 × 10−120.740.52 (0.46, 0.59)
2-acetamidophenol sulfateXenobioticsDrug-Analgesics, Anesthetics0.232.99 × 10−90.810.191.67 × 10−60.710.51 (0.44, 0.57)
Whole-grain cereals2,6-dihydroxybenzoic acidXenobioticsDrug-Topical Agents0.362.30 × 10−200.860.351.62 × 10−190.850.52 (0.45, 0.58)
2-acetamidophenol sulfateXenobioticsDrug-Analgesics, Anesthetics0.361.07 × 10−200.850.312.85 × 10−150.820.51 (0.44, 0.57)
2-aminophenol sulfateXenobioticsChemical0.306.27 × 10−150.830.281.13 × 10−120.800.45 (0.39, 0.53)
Corn productsX-25247 0.321.19 × 10−160.860.121.93 × 10−30.730.30 (0.23, 0.39)
X-23680 0.262.38 × 10−110.860.115.39 × 10−30.730.56 (0.50, 0.62)
carnitine of C10H14O2 (2) *Partially Characterized MoleculesPartially Characterized Molecules0.215.12 × 10−80.850.051.95 × 10−10.720.39 (0.32, 0.47)
Popcornglucuronide of C12H20O3 (1) *Partially Characterized MoleculesPartially Characterized Molecules0.246.28 × 10−100.750.198.12 × 10−70.700.24 (0.16, 0.33)
X-25247 0.266.63 × 10−110.750.164.24 × 10−50.690.30 (0.23, 0.39)
Other whole grains3,5-dihydroxybenzoic acidXenobioticsFood Component/Plant0.202.33 × 10−70.78 0.186.67 × 10−60.690.52 (0.46, 0.59)
Refined grainsX-23680 0.211.78 × 10−70.85 0.113.85 × 10−30.840.56 (0.50, 0.62)
1,5-anhydroglucitol (1,5-AG)CarbohydrateGlycolysis, Gluconeogenesis, and Pyruvate Metabolism0.217.39 × 10−80.83 0.085.81 × 10−20.840.42 (0.35, 0.49)
N6-carbamoylthreonyladenosineNucleotidePurine Metabolism, Adenine containing0.211.86 × 10−70.83 0.017.49 × 10−10.830.64 (0.58, 0.69)
PROTEINS
Red meatisovalerylcarnitine (C5)Amino AcidLeucine, Isoleucine and Valine Metabolism0.318.67 × 10−160.890.252.07 × 10−100.770.54 (0.47, 0.60)
3,4-dihydroxyphenylacetate sulfateAmino AcidTyrosine Metabolism−0.285.22 × 10−130.88−0.266.04 × 10−110.790.54 (0.48, 0.61)
N,N,N-trimethyl-5-aminovalerateAmino AcidLysine Metabolism0.319.55 × 10−160.880.235.08 × 10−90.770.40 (0.33, 0.48)
Processed meat1-ribosyl-imidazoleacetate *Amino AcidHistidine Metabolism−0.348.25 × 10−190.86−0.264.02 × 10−110.790.66 (0.60, 0.71)
X-23970 −0.313.37 × 10−150.86−0.163.16 × 10−50.780.52 (0.46, 0.59)
pentose acid *Partially Characterized MoleculesPartially Characterized Molecules−0.312.70 × 10−150.85−0.186.15 × 10−60.780.57 (0.50, 0.62)
PoultryanserineAmino AcidHistidine Metabolism0.523.02 × 10−440.850.372.40 × 10−220.790.37 (0.30, 0.45)
3-methylhistidineAmino AcidHistidine Metabolism0.561.01 × 10−540.840.452.89 × 10−320.820.46 (0.39, 0.53)
X-13835 0.563.71 × 10−530.840.431.67 × 10−290.820.60 (0.54, 0.66)
Total fishCMPFLipidFatty Acid, Dicarboxylate0.395.89 × 10−240.820.281.25 × 10−120.710.82 (0.79, 0.85)
X-25419 0.311.96 × 10−150.800.242.02 × 10−90.710.55 (0.49, 0.61)
X-13835 0.312.68 × 10−150.770.172.64 × 10−50.660.60 (0.54, 0.66)
Dark meat fishCMPFLipidFatty Acid, Dicarboxylate0.382.03 × 10−230.820.232.64 × 10−90.710.82 (0.79, 0.85)
X-25419 0.293.68 × 10−140.780.171.28 × 10−50.730.55 (0.49, 0.61)
X-13835 0.241.51 × 10−90.770.151.48 × 10−40.680.60 (0.54, 0.66)
ShellfishX-25419 0.391.92 × 10−240.780.276.23 × 10−120.740.55 (0.49, 0.61)
CMPFLipidFatty Acid, Dicarboxylate0.241.43 × 10−90.690.172.08 × 10−50.700.82 (0.79, 0.85)
X-23587 0.234.70 × 10−90.690.139.62 × 10−40.680.56 (0.49, 0.62)
Total nutstryptophan betaineAmino AcidTryptophan Metabolism0.422.59 × 10−280.900.312.38 × 10−150.830.76 (0.72, 0.79)
X-24412 0.388.07 × 10−240.900.325.62 × 10−170.830.52 (0.46, 0.59)
X-23644 0.317.22 × 10−160.890.263.59 × 10−110.800.31 (0.24, 0.40)
PeanutsX-24412 0.436.25 × 10−300.860.381.01 × 10−220.770.52 (0.46, 0.59)
tryptophan betaineAmino AcidTryptophan Metabolism0.424.80 × 10−280.860.341.98 × 10−180.760.76 (0.72, 0.79)
4-vinylphenol sulfateXenobioticsBenzoate Metabolism0.402.88 × 10−250.860.253.70 × 10−100.710.41 (0.33, 0.48)
Other nutsX-25524 0.271.13 × 10−110.870.273.75 × 10−120.810.41 (0.33, 0.48)
X-25523 0.261.41 × 10−110.860.263.57 × 10−110.810.49 (0.42, 0.56)
X-23970 0.272.17 × 10−120.860.252.48 × 10−100.790.52 (0.46, 0.59)
SeedsX-11847 0.152.06 × 10−40.750.261.63 × 10−110.760.60 (0.54, 0.66)
X-11858 0.137.09 × 10−40.740.245.89 × 10−100.760.50 (0.43, 0.56)
X-18059 0.241.10 × 10−90.740.183.92 × 10−60.710.32 (0.24, 0.40)
DAIRY/DAIRY ALTERNATIVES
| Milk | phenylacetylglycine | Peptide | Acetylated Peptides | 0.40 | 2.79 × 10^−26 | 0.87 | 0.28 | 4.49 × 10^−13 | 0.79 | 0.53 (0.47, 0.59) |
|  | 2,8-quinolinediol sulfate | Xenobiotics | Food Component/Plant | 0.31 | 3.17 × 10^−15 | 0.84 | 0.19 | 1.24 × 10^−6 | 0.77 | 0.50 (0.43, 0.57) |
|  | N,N,N-trimethyl-5-aminovalerate | Amino Acid | Lysine Metabolism | 0.30 | 1.02 × 10^−14 | 0.83 | 0.28 | 1.67 × 10^−12 | 0.79 | 0.40 (0.33, 0.48) |
| Almond milk or rice milk | N,N,N-trimethyl-5-aminovalerate | Amino Acid | Lysine Metabolism | −0.16 | 5.33 × 10^−5 | 0.71 | −0.21 | 1.38 × 10^−7 | 0.69 | 0.40 (0.33, 0.48) |
|  | catechol sulfate | Xenobiotics | Benzoate Metabolism | 0.22 | 1.69 × 10^−8 | 0.71 | 0.22 | 1.20 × 10^−8 | 0.65 | 0.71 (0.66, 0.75) |
|  | X-25800 |  |  | 0.16 | 3.56 × 10^−5 | 0.69 | 0.21 | 1.20 × 10^−7 | 0.65 | 0.33 (0.26, 0.42) |
| Total cheese | heptenedioate (C7:1-DC) * | Lipid | Fatty Acid, Dicarboxylate | 0.24 | 2.07 × 10^−9 | 0.88 | 0.22 | 2.14 × 10^−8 | 0.78 | 0.49 (0.42, 0.55) |
|  | 4-methylhexanoylglutamine | Lipid | Fatty Acid Metabolism (Acyl Glutamine) | 0.24 | 1.87 × 10^−9 | 0.87 | 0.23 | 5.23 × 10^−9 | 0.78 | 0.51 (0.44, 0.57) |
|  | glutamine conjugate of C9H16O2 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.15 | 2.23 × 10^−4 | 0.86 | 0.22 | 1.26 × 10^−8 | 0.78 | 0.52 (0.46, 0.59) |
| Cream | glucuronide of C19H28O4 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.39 | 4.89 × 10^−24 | 0.80 | 0.13 | 1.22 × 10^−3 | 0.70 | 0.83 (0.79, 0.85) |
|  | X-25500 |  |  | 0.34 | 4.09 × 10^−19 | 0.79 | 0.12 | 1.80 × 10^−3 | 0.68 | 0.63 (0.57, 0.68) |
|  | X-12738 |  |  | 0.35 | 8.62 × 10^−20 | 0.78 | 0.10 | 1.22 × 10^−2 | 0.68 | 0.72 (0.67, 0.76) |
FATS AND OILS
| Creamy salad dressing | carnitine of C10H14O2 (2) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.26 | 4.09 × 10^−11 | 0.78 | 0.16 | 5.02 × 10^−5 | 0.67 | 0.39 (0.32, 0.47) |
|  | X-24363 |  |  | 0.26 | 4.26 × 10^−11 | 0.77 | 0.18 | 4.13 × 10^−6 | 0.67 | 0.56 (0.50, 0.62) |
|  | X-13693 |  |  | 0.22 | 1.04 × 10^−8 | 0.77 | 0.19 | 1.81 × 10^−6 | 0.67 | 0.47 (0.40, 0.54) |
| Oil and vinegar salad dressing | N-methyltaurine | Amino Acid | Methionine, Cysteine, SAM and Taurine Metabolism | 0.16 | 5.45 × 10^−5 | 0.76 | 0.20 | 2.63 × 10^−7 | 0.80 | 0.51 (0.44, 0.57) |
| Olive oil | N-methyltaurine | Amino Acid | Methionine, Cysteine, SAM and Taurine Metabolism | 0.24 | 4.33 × 10^−10 | 0.80 | 0.20 | 7.40 × 10^−7 | 0.76 | 0.51 (0.44, 0.57) |
|  | X-25419 |  |  | 0.24 | 4.25 × 10^−10 | 0.79 | 0.18 | 5.28 × 10^−6 | 0.74 | 0.55 (0.49, 0.61) |
|  | X-17733 |  |  | −0.21 | 6.36 × 10^−8 | 0.78 | −0.13 | 7.26 × 10^−4 | 0.74 | 0.50 (0.44, 0.57) |
MISCELLANEOUS
| French fries | pentose acid * | Partially Characterized Molecules | Partially Characterized Molecules | −0.27 | 2.28 × 10^−12 | 0.85 | −0.07 | 1.00 × 10^−1 | 0.72 | 0.57 (0.50, 0.62) |
|  | abscisate | Xenobiotics | Food Component/Plant | −0.28 | 6.40 × 10^−13 | 0.85 | −0.03 | 4.65 × 10^−1 | 0.72 | 0.47 (0.40, 0.54) |
|  | catechol sulfate | Xenobiotics | Benzoate Metabolism | −0.24 | 1.01 × 10^−9 | 0.84 | −0.09 | 2.68 × 10^−2 | 0.72 | 0.71 (0.66, 0.75) |
| Chips | glutamine conjugate of C8H12O2 (2) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.25 | 9.16 × 10^−11 | 0.80 | 0.25 | 1.45 × 10^−10 | 0.76 | 0.45 (0.38, 0.52) |
|  | glucuronide of C10H14O2 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.28 | 6.97 × 10^−13 | 0.80 | 0.18 | 8.55 × 10^−6 | 0.74 | 0.42 (0.35, 0.50) |
|  | X-23970 |  |  | −0.20 | 2.30 × 10^−7 | 0.79 | −0.10 | 1.45 × 10^−2 | 0.72 | 0.52 (0.46, 0.59) |
| Chocolate candy | X-12823 |  |  | 0.38 | 4.94 × 10^−23 | 0.85 | 0.30 | 3.18 × 10^−14 | 0.83 | 0.40 (0.33, 0.48) |
|  | 3-methylxanthine | Xenobiotics | Xanthine Metabolism | 0.32 | 1.98 × 10^−16 | 0.83 | 0.27 | 4.96 × 10^−12 | 0.82 | 0.60 (0.54, 0.66) |
|  | 7-methylurate | Xenobiotics | Xanthine Metabolism | 0.32 | 1.50 × 10^−16 | 0.83 | 0.28 | 3.94 × 10^−13 | 0.82 | 0.62 (0.56, 0.67) |
| Dark chocolate | theobromine | Xenobiotics | Xanthine Metabolism | 0.29 | 6.72 × 10^−14 | 0.79 | 0.23 | 3.38 × 10^−9 | 0.71 | 0.58 (0.52, 0.64) |
|  | X-12823 |  |  | 0.32 | 5.05 × 10^−17 | 0.79 | 0.25 | 1.36 × 10^−10 | 0.69 | 0.40 (0.33, 0.48) |
|  | 3,7-dimethylurate | Xenobiotics | Xanthine Metabolism | 0.30 | 2.03 × 10^−14 | 0.79 | 0.22 | 1.76 × 10^−8 | 0.69 | 0.56 (0.50, 0.62) |
| Desserts | X-24340 |  |  | 0.21 | 8.12 × 10^−8 | 0.80 | 0.15 | 1.96 × 10^−4 | 0.74 | 0.50 (0.44, 0.57) |
|  | 3,4-methylene heptanoylglycine | Lipid | Fatty Acid Metabolism (Acyl Glycine) | 0.21 | 1.32 × 10^−7 | 0.80 | 0.10 | 1.07 × 10^−2 | 0.73 | 0.56 (0.50, 0.62) |
| Bars | X-16649 |  |  | 0.21 | 5.23 × 10^−8 | 0.81 | 0.20 | 4.42 × 10^−7 | 0.73 | 0.53 (0.46, 0.59) |
|  | 2-(4-hydroxyphenyl)propionate | Xenobiotics | Benzoate Metabolism | 0.21 | 1.13 × 10^−7 | 0.80 | 0.15 | 1.13 × 10^−4 | 0.71 | 0.36 (0.29, 0.44) |
|  | sucralose | Xenobiotics | Food Component/Plant | 0.21 | 1.41 × 10^−7 | 0.80 | 0.11 | 5.11 × 10^−3 | 0.70 | 0.50 (0.43, 0.56) |
| Soy sauce | X-11847 |  |  | 0.17 | 9.43 × 10^−6 | 0.74 | 0.22 | 2.46 × 10^−8 | 0.67 | 0.60 (0.54, 0.66) |
|  | X-11849 |  |  | 0.18 | 2.84 × 10^−6 | 0.74 | 0.20 | 2.50 × 10^−7 | 0.66 | 0.63 (0.58, 0.69) |
| Artificial sweeteners | sucralose | Xenobiotics | Food Component/Plant | 0.31 | 5.88 × 10^−16 | 0.75 | 0.35 | 2.71 × 10^−19 | 0.77 | 0.50 (0.43, 0.56) |
|  | acesulfame | Xenobiotics | Food Component/Plant | 0.21 | 4.72 × 10^−8 | 0.75 | 0.25 | 2.78 × 10^−10 | 0.73 | 0.49 (0.43, 0.56) |
|  | X-25785 |  |  | 0.23 | 3.71 × 10^−9 | 0.72 | 0.17 | 1.59 × 10^−5 | 0.68 | 0.48 (0.41, 0.55) |
ALCOHOL
| Total alcohol | ethyl glucuronide | Xenobiotics | Chemical | 0.65 | 5.84 × 10^−78 | 0.99 | 0.59 | 1.08 × 10^−60 | 0.94 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.57 | 4.01 × 10^−56 | 0.97 | 0.48 | 2.15 × 10^−38 | 0.90 | 0.53 (0.46, 0.59) |
|  | 2,3-dihydroxyisovalerate | Xenobiotics | Food Component/Plant | 0.44 | 5.03 × 10^−31 | 0.92 | 0.42 | 1.22 × 10^−28 | 0.86 | 0.46 (0.39, 0.53) |
| Beer | ethyl glucuronide | Xenobiotics | Chemical | 0.45 | 5.34 × 10^−33 | 0.89 | 0.41 | 5.84 × 10^−27 | 0.86 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.43 | 4.82 × 10^−30 | 0.86 | 0.38 | 9.29 × 10^−24 | 0.84 | 0.53 (0.46, 0.59) |
|  | 2,3-dihydroxy-3-methylvalerate | Amino Acid | Leucine, Isoleucine and Valine Metabolism | 0.30 | 2.87 × 10^−14 | 0.83 | 0.28 | 7.16 × 10^−13 | 0.82 | 0.42 (0.35, 0.50) |
| Total wine | ethyl glucuronide | Xenobiotics | Chemical | 0.62 | 1.86 × 10^−68 | 0.94 | 0.49 | 7.76 × 10^−39 | 0.84 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.51 | 1.55 × 10^−43 | 0.91 | 0.39 | 7.03 × 10^−24 | 0.78 | 0.53 (0.46, 0.59) |
|  | X-17306 |  |  | 0.52 | 1.81 × 10^−45 | 0.89 | 0.48 | 5.30 × 10^−38 | 0.83 | 0.57 (0.51, 0.63) |
| Red wine | ethyl glucuronide | Xenobiotics | Chemical | 0.54 | 3.62 × 10^−50 | 0.88 | 0.41 | 1.48 × 10^−27 | 0.80 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.45 | 1.47 × 10^−33 | 0.84 | 0.32 | 2.12 × 10^−16 | 0.76 | 0.53 (0.46, 0.59) |
|  | 2,3-dihydroxy-3-methylvalerate | Amino Acid | Leucine, Isoleucine and Valine Metabolism | 0.44 | 7.17 × 10^−32 | 0.81 | 0.36 | 2.02 × 10^−20 | 0.77 | 0.42 (0.35, 0.50) |
| White wine | ethyl glucuronide | Xenobiotics | Chemical | 0.44 | 4.89 × 10^−31 | 0.78 | 0.36 | 2.24 × 10^−21 | 0.73 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.36 | 1.22 × 10^−20 | 0.75 | 0.31 | 1.76 × 10^−15 | 0.71 | 0.53 (0.46, 0.59) |
|  | X-17306 |  |  | 0.35 | 1.62 × 10^−19 | 0.74 | 0.36 | 6.39 × 10^−21 | 0.72 | 0.57 (0.51, 0.63) |
| Liquor | ethyl glucuronide | Xenobiotics | Chemical | 0.42 | 1.65 × 10^−28 | 0.79 | 0.25 | 1.26 × 10^−10 | 0.70 | 0.63 (0.57, 0.68) |
|  | ethyl alpha-glucopyranoside | Xenobiotics | Food Component/Plant | 0.36 | 2.17 × 10^−20 | 0.77 | 0.19 | 2.12 × 10^−6 | 0.67 | 0.53 (0.46, 0.59) |
|  | N-acetyltaurine | Amino Acid | Methionine, Cysteine, SAM and Taurine Metabolism | 0.27 | 7.24 × 10^−12 | 0.73 | 0.16 | 7.99 × 10^−5 | 0.68 | 0.63 (0.58, 0.69) |
BEVERAGES
| Total coffee | glucuronide of C19H28O4 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.83 | 0.56 × 10^−165 | 1.00 | 0.81 | 0.82 × 10^−145 | 0.99 | 0.83 (0.79, 0.85) |
|  | citraconate/glutaconate | Energy | TCA Cycle | 0.71 | 0.35 × 10^−100 | 1.00 | 0.69 | 2.65 × 10^−89 | 0.97 | 0.77 (0.73, 0.81) |
|  | feruloylquinate (3) | Xenobiotics | Food Component/Plant | 0.68 | 2.40 × 10^−86 | 0.99 | 0.66 | 4.46 × 10^−79 | 0.97 | 0.71 (0.67, 0.76) |
| Decaffeinated | glucuronide of C19H28O4 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.24 | 1.83 × 10^−9 | 0.66 | 0.20 | 4.26 × 10^−7 | 0.63 | 0.83 (0.79, 0.85) |
|  | quinate | Xenobiotics | Food Component/Plant | 0.21 | 1.36 × 10^−7 | 0.65 | 0.16 | 4.67 × 10^−5 | 0.63 | 0.82 (0.79, 0.85) |
|  | X-25666 |  |  | 0.21 | 9.93 × 10^−8 | 0.65 | 0.18 | 5.43 × 10^−6 | 0.63 | 0.72 (0.67, 0.76) |
| Caffeinated | glucuronide of C19H28O4 (1) * | Partially Characterized Molecules | Partially Characterized Molecules | 0.78 | 0.27 × 10^−129 | 0.98 | 0.76 | 0.68 × 10^−121 | 0.97 | 0.83 (0.79, 0.85) |
|  | 3-hydroxypyridine glucuronide | Xenobiotics | Chemical | 0.69 | 2.27 × 10^−91 | 0.98 | 0.66 | 2.06 × 10^−80 | 0.95 | 0.77 (0.73, 0.80) |
|  | 3-hydroxypyridine | Xenobiotics | Chemical | 0.72 | 0.71 × 10^−101 | 0.98 | 0.67 | 1.20 × 10^−84 | 0.95 | 0.76 (0.72, 0.80) |
| Total tea | N-acetyltheanine | Xenobiotics | Food Component/Plant | 0.52 | 9.90 × 10^−46 | 0.93 | 0.51 | 1.04 × 10^−43 | 0.88 | 0.55 (0.48, 0.61) |
|  | coumaroylquinate (1) | Xenobiotics | Food Component/Plant | 0.36 | 6.16 × 10^−21 | 0.83 | 0.35 | 1.02 × 10^−19 | 0.79 | 0.53 (0.47, 0.60) |
|  | 2-methoxyresorcinol sulfate | Xenobiotics | Chemical | 0.31 | 9.47 × 10^−16 | 0.81 | 0.33 | 1.20 × 10^−17 | 0.76 | 0.62 (0.56, 0.67) |
| Green tea | N-acetyltheanine | Xenobiotics | Food Component/Plant | 0.29 | 6.14 × 10^−14 | 0.74 | 0.34 | 7.48 × 10^−19 | 0.72 | 0.55 (0.48, 0.61) |
|  | S-adenosylhomocysteine (SAH) | Amino Acid | Methionine, Cysteine, SAM and Taurine Metabolism | −0.26 | 4.23 × 10^−11 | 0.73 | −0.26 | 4.40 × 10^−11 | 0.66 | 0.42 (0.35, 0.50) |
|  | X-12740 |  |  | 0.25 | 7.96 × 10^−11 | 0.72 | 0.22 | 2.62 × 10^−8 | 0.66 | 0.41 (0.33, 0.48) |
| Black tea | N-acetyltheanine | Xenobiotics | Food Component/Plant | 0.41 | 1.17 × 10^−27 | 0.81 | 0.47 | 4.63 × 10^−36 | 0.81 | 0.55 (0.48, 0.61) |
|  | 2-methoxyresorcinol sulfate | Xenobiotics | Chemical | 0.25 | 8.05 × 10^−11 | 0.73 | 0.27 | 5.46 × 10^−12 | 0.72 | 0.62 (0.56, 0.67) |
|  | 1,2,3-benzenetriol sulfate (2) | Xenobiotics | Chemical | 0.25 | 1.98 × 10^−10 | 0.73 | 0.26 | 4.95 × 10^−11 | 0.71 | 0.57 (0.51, 0.63) |
| Herbal tea | X-12306 |  |  | 0.24 | 1.75 × 10^−9 | 0.74 | 0.19 | 1.77 × 10^−6 | 0.65 | 0.66 (0.61, 0.71) |
|  | X-23423 |  |  | 0.22 | 2.43 × 10^−8 | 0.74 | 0.15 | 1.18 × 10^−4 | 0.63 | 0.51 (0.45, 0.58) |
|  | catechol sulfate | Xenobiotics | Benzoate Metabolism | 0.22 | 3.74 × 10^−8 | 0.73 | 0.16 | 7.34 × 10^−5 | 0.63 | 0.71 (0.66, 0.75) |
| Sugar-sweetened beverages | X-23970 |  |  | −0.22 | 3.64 × 10^−8 | 0.82 | −0.12 | 2.96 × 10^−3 | 0.77 | 0.52 (0.46, 0.59) |
|  | X-23424 |  |  | 0.20 | 2.10 × 10^−7 | 0.81 | 0.05 | 2.27 × 10^−1 | 0.76 | 0.28 (0.20, 0.37) |
|  | hydroxy-N6,N6,N6-trimethyllysine * | Amino Acid | Lysine Metabolism | 0.24 | 4.34 × 10^−10 | 0.81 | 0.08 | 3.37 × 10^−2 | 0.76 | 0.62 (0.56, 0.67) |
| Diet beverages | X-25785 |  |  | 0.47 | 4.73 × 10^−36 | 0.84 | 0.43 | 1.04 × 10^−29 | 0.81 | 0.48 (0.41, 0.55) |
|  | acesulfame | Xenobiotics | Food Component/Plant | 0.42 | 1.45 × 10^−28 | 0.83 | 0.32 | 1.15 × 10^−16 | 0.77 | 0.49 (0.43, 0.56) |
|  | sucralose | Xenobiotics | Food Component/Plant | 0.41 | 1.79 × 10^−27 | 0.81 | 0.28 | 7.01 × 10^−13 | 0.74 | 0.50 (0.43, 0.56) |
1. Diet–metabolite correlations in bold had p < 3.56 × 10^−7 for FFQ and p < 3.42 × 10^−7 for average 24 h diet recalls and |r| > 0.2 from Pearson’s partial correlation analysis. Adjusted for age, gender, race/ethnicity, education, smoking status, physical activity, body mass index, ethanol consumption (except for alcohol-containing items), and energy intake. CPS-3, Cancer Prevention Study-3; DAS, Diet Assessment Sub-study.
2. Biochemical name of metabolite correlated with respective food or food group. Metabolites starting with X are unnamed and the super pathway of these is unknown. Asterisk (*) represents putative identity that has not been officially confirmed based on a standard. (1) and (2) indicate that the metabolite differs from another with the same mass in the position of the R group. CMPF, 3-carboxy-4-methyl-5-propyl-2-furanpropanoate.
3. ICC, intraclass correlation coefficient, to assess the reproducibility of the identified food-related metabolites over six months.
4. Items are only available on 24 h-diet recalls.
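
The two p-value cutoffs in footnote 1 are consistent with a Bonferroni correction over every food item × metabolite pair, using the counts reported in the abstract (101 FFQ items, 105 24HR items, 1391 metabolites). The sketch below reproduces that arithmetic and applies the two-part selection rule to a pair of rows from the table. The family-wise α of 0.05 is an assumption (it is not stated in this section), and the function names are illustrative rather than taken from the study's analysis code.

```python
# Counts taken from the abstract of the article.
N_METABOLITES = 1391
N_FFQ_ITEMS = 101
N_24HR_ITEMS = 105

ALPHA = 0.05      # assumed family-wise error rate (not stated in this section)
MIN_ABS_R = 0.2   # correlation cutoff from footnote 1


def bonferroni_threshold(n_items, n_metabolites=N_METABOLITES, alpha=ALPHA):
    """Per-test p-value cutoff when every item x metabolite pair is tested."""
    return alpha / (n_items * n_metabolites)


def is_flagged(r, p, threshold, min_abs_r=MIN_ABS_R):
    """Apply both criteria: p below the cutoff and |r| above 0.2."""
    return p < threshold and abs(r) > min_abs_r


ffq_threshold = bonferroni_threshold(N_FFQ_ITEMS)
recall_threshold = bonferroni_threshold(N_24HR_ITEMS)

print(f"FFQ cutoff:  {ffq_threshold:.2e}")     # 3.56e-07
print(f"24HR cutoff: {recall_threshold:.2e}")  # 3.42e-07

# Seeds / X-11847: FFQ r = 0.15, p = 2.06e-4; 24HR r = 0.26, p = 1.63e-11
print(is_flagged(0.15, 2.06e-4, ffq_threshold))      # False
print(is_flagged(0.26, 1.63e-11, recall_threshold))  # True

# French fries / abscisate: FFQ r = -0.28, p = 6.40e-13; 24HR r = -0.03, p = 4.65e-1
print(is_flagged(-0.28, 6.40e-13, ffq_threshold))    # True
print(is_flagged(-0.03, 4.65e-1, recall_threshold))  # False
```

Because the table reports r rounded to two decimals, rows printed as exactly 0.20 or −0.20 cannot be classified reliably from the table alone; the check is only meaningful on unrounded coefficients.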
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

MDPI and ACS Style

Wang, Y.; Hodge, R.A.; Stevens, V.L.; Hartman, T.J.; McCullough, M.L. Identification and Reproducibility of Urinary Metabolomic Biomarkers of Habitual Food Intake in a Cross-Sectional Analysis of the Cancer Prevention Study-3 Diet Assessment Sub-Study. Metabolites 2021, 11, 248. https://doi.org/10.3390/metabo11040248
