1. Introduction
Commonly used dietary assessment methods relying on self-report, including the 24-h dietary recall (24HDR) and the food frequency questionnaire (FFQ), are subject to substantial recall bias [
1,
2,
3]. This presents analytical challenges for population monitoring and nutritional epidemiological studies by introducing bias into prevalence estimates and regression coefficients and by reducing statistical power to detect diet-health relationships [
4,
5]. Given these challenges, there is increasing interest in the development and validation of biomarkers to provide unbiased measures of intake of nutrients, foods, food groups, and dietary patterns [
6,
7,
8,
9]. Existing dietary biomarkers range from recovery/predictive biomarkers, from which absolute intake during a given time period may be determined based on a known quantitative relationship between the biomarker and intake level [
1,
5,
10], to concentration biomarkers, which although less quantitatively related to absolute intake, are more common and still have proven useful in reducing bias and increasing power to detect diet-health associations [
11].
Although dietary biomarkers are not subject to systematic error from recall bias, the use of short-term assessment methods, whether recall- or biomarker-based, to represent long-term usual intake introduces measurement error due to the presence of random, day-to-day variation in dietary intakes within individuals. In addition, many foods and some nutrients (e.g., pre-formed retinol) may be episodically consumed; that is, they are not consumed every day (or nearly every day) by most individuals in a study population. For dietary components falling into this category, the aforementioned challenge of within-individual variation (WIV) is exacerbated by the presence of excess zero values in dietary intake data, which further complicates statistical analyses [
12]. In areas of high agricultural productivity and/or where local patterns of consumption of produce are influenced by seasonal variations, episodic consumption and/or WIV in dietary intakes can partially be explained by season [
13,
14,
15,
16]. In the case of pregnancy, biological [
17,
18] or dietary changes [
19,
20,
21] related to gestation progression could introduce an additional source of WIV into biomarker measurements.
Measurement error models developed to account for random WIV in the estimation of a population’s usual dietary intake distributions [
22,
23,
24] and diet-health associations [
25,
26] have traditionally been applied to repeated 24HDR data but may be useful for biomarker data as well. Urine collections are a minimally invasive method with a relatively low participant burden (especially spot urine collections) that may be beneficial for measuring biomarkers of multiple dietary components from a single sample. Since many measured excretion products are likely to represent short-term intake, similar to 24HDR, repeated sample collections may allow for modeling of usual intake, which is of most interest in nutrition studies. The use of such modeling strategies relies on knowledge of the measurement error structure of the dietary intake data—namely, the within- and between-individual variance components. For episodically consumed dietary components, the probability of daily consumption can be modeled and used in conjunction with the amount consumed on consumption days to model average daily intake [
22].
Citrus fruits and juices represent one such dietary component that may be of interest to measure for population monitoring and/or nutritional epidemiological studies due to their high content of essential nutrients such as vitamin C, folate, and fiber, as well as other bioactive phytochemicals that may confer health benefits in humans (reviewed in [
27] and [
28]). Some epidemiological evidence supports a negative association between consumption of citrus fruit and/or the flavonoids present in them and inflammatory markers in women [
29] and ischemic stroke in men [
30], and randomized clinical trials have suggested a beneficial effect of orange juice consumption on endothelial function [
31,
32,
33]. Beyond the interest in measuring citrus intake specifically, biomarkers of citrus fruit consumption may serve as an important component of suites of biomarkers that aim to capture general dietary patterns or consumption of specific food groups (e.g., fruits and vegetables) [
34,
35,
36].
Proline betaine (Pro-B), also known as stachydrine, is a plant osmo-protective compound found in high concentrations in citrus fruits and juices that have emerged as a promising biomarker of citrus intake, given its high abundance in, and relatively high specificity to, these foods [
37,
38,
39,
40]. A number of exploratory studies have identified Pro-B as a known or potential biomarker of consumption of citrus fruit/juice [
41,
42,
43,
44,
45,
46,
47,
48], fruit [
8,
39,
49,
50], and/or certain healthy dietary patterns [
8,
49,
51,
52,
53,
54]. Dietary intervention studies have provided further validation of Pro-B as a direct biomarker of acute or short-term citrus intake [
37,
39,
55,
56]. However, while some researchers have proposed its use as a marker of habitual or long-term citrus intake [
46,
56,
57], excretion profiles of Pro-B after acute intake of orange juice demonstrate that, at least for this dietary source, most Pro-B is excreted within the first 24 or fewer hours [
37,
39,
56], with urinary concentrations peaking between 2 and 6 h and small elevations remaining up to 72 or 96 h after consumption [
56]. These data suggest that a single urinary Pro-B measurement should be considered a short- to medium-term biomarker. Thus, it is of interest to understand the degree of WIV that may be expected across repeated urinary Pro-B measurements, which would help clarify the need for multiple measurements per individual to account for this variation when assessing usual intake.
While exploratory metabolomics and acute feeding studies establish urinary Pro-B as an important dietary biomarker, scarce data exist on the quantitative relationship between this largely acute biomarker and reported usual intake in observational studies. Instead, observational studies assessing this quantitative relationship to date have either focused on a relatively short dietary recall period (1–4 days) [
39,
55,
58] or assessed only the relative abundance of Pro-B in biospecimens [
45,
46,
48,
57]. One multi-cohort study in pregnant women found correlations between Pro-B levels in serum samples and reported usual consumption of citrus fruit, citrus juice, or combined citrus fruit and juice (correlation coefficients ranging from
r = 0.29 to
r = 0.42) [
42]. However, to our knowledge, no studies to date have assessed Pro-B measured in urine samples as a dietary biomarker during pregnancy, nor have studies evaluated random day-to-day and other sources of WIV in Pro-B concentrations in pregnant or non-pregnant populations.
Understanding the nature of WIV in this biomarker is essential for accurately modeling long-term average, or usual, levels of citrus intake or Pro-B biological concentrations. The aims of the present study were therefore to (1) quantify within-individual, between-individual, gestational, and seasonal components of variance in urinary Pro-B concentrations during pregnancy, (2) determine the number of specimens required to estimate usual urinary Pro-B and rank individuals on usual Pro-B levels, and (3) determine the correlation between repeated measures of urinary Pro-B and reported usual consumption of citrus foods.
4. Discussion
This study aimed to inform the use of a urinary biomarker of citrus intake in pregnant women. Here, the magnitude of WIV in urinary Pro-B concentrations, measured by 1H-NMR spectroscopy in repeated spot or 24-h urine specimens, was quantified and potential sources of temporal variation during pregnancy (i.e., seasonal, gestational, and residual random variation) were identified. In parsing out sources of this variance, citrus season (December–May) was a significant predictor of greater urinary Pro-B concentrations, while gestational age was inversely associated with non-normalized Pro-B concentrations. In linear mixed effects models, a high degree of WIV as a percentage of the total variance in urinary Pro-B was discovered (≥69%) regardless of the normalization method, indicating that multiple samples per participant are likely needed when using this biomarker to assess usual citrus intake or Pro-B exposure. Finally, moderate correlations were found between usual citrus intake reported on a semi-quantitative FFQ and single or averaged repeated urinary Pro-B measurements, with stronger correlations found with repeated measures.
Limited literature exists on the WIV of urinary Pro-B in free-living populations. Wang et al. recently reported intraclass correlation coefficients (ICCs) of relative abundance of candidate dietary biomarkers, including Pro-B, measured in two repeat samples collected 6 months apart, and found a slightly lower proportion of variance from WIV than in the current study (0.56 vs. ≥0.69) [
48]. However, in addition to studying a different population, the methods used in that study differed from our study in several aspects; in particular, in the Wang et al. study, all specimens were from 24-h urine collections, relative abundance rather than absolute concentrations were measured, and values were normalized to osmolality [
48]. Here, we report relatively high proportions of variance from WIV in 1 to 5 urine specimens, 85% of which were spot collections, collected throughout pregnancy. A higher degree of WIV was found among the spot urines compared to the combined sample; thus, more repeated samples may be needed for studies collecting spot urines compared to 24-h urine specimens.
A number of potential sources of variation may explain the high level of WIV found in urinary Pro-B. First, given the short- to medium-term nature of this biomarker in urine and its strong postprandial response to the intake of citrus products [
37,
39,
56,
57], large fluctuations in Pro-B concentration are likely largely attributable to true variation in intake. However, the portion of variation reflective of true intake is a function not only of the quantity consumed, but also of the time since consumption (especially in spot urine collections) and, potentially, the type of citrus product eaten. Considerable variation in Pro-B content has been reported among citrus fruit varieties [
37,
39] and forms (e.g., 1316 ± 72 mg/L vs. 761 ± 89 mg/L vs. 251 ± 153 mg/L in orange juice from concentrate, orange, and lemon, respectively [
39]); thus, the specific products consumed may be influential on measured Pro-B concentrations. Apart from variations in the amount and type of citrus consumed, inter-individual differences or intra-individual changes in digestion, absorption, and metabolism of consumed foods and their components may influence the Pro-B concentration in a given urine sample. Although a large amount of ingested Pro-B is excreted unchanged, biotransformation products including sulfate and monoglucuronide derivatives have been identified in urine after consumption [
57], and it is possible these processes could be influenced by person-specific factors. More research is needed to determine the extent to which both person-specific and temporal factors may affect the metabolism and subsequent excretion of Pro-B.
Citrus season at the time of sample collection was found to be a significant predictor of Pro-B levels above 30 µmol/mmol creatinine and of urinary Pro-B concentration in continuous data analyses. In line with these results, a cohort study of women living in and around Grand Forks, North Dakota found higher reported citrus consumption during the winter and spring seasons [
71]. Seasonal variation was also observed in serum levels of β-cryptoxanthin, a carotenoid found in orange-flesh fruits and vegetables, among adults in a rural area of Japan, with related seasonal changes in reported Satsuma mandarin consumption [
72]. Evidence of seasonal variation in dietary patterns and nutrient intakes has been well established in rural settings where local agriculture is likely to influence food availability and/or dietary consumption patterns, or where wild foods are available seasonally [
13,
16,
73,
74]. In contrast, studies conducted in industrialized settings have yielded mixed results in demonstrating seasonal variation in dietary intakes of energy, nutrients, and/or food groups [
75,
76], which appears to be declining over time [
76], perhaps due to increased globalization of food markets allowing for year-round importation of seasonal products [
77]. However, as shown in a U.S. cohort, the consumption of specific fruits and vegetables may vary seasonally with no or incongruent fluctuations in overall food groups or nutrients [
71]. It may be particularly plausible that in locations of high agricultural productivity, such as the current study area, consumption of seasonal produce items grown in the area (e.g., citrus) could vary by season due to increased availability from local markets and residential fruit trees. Nevertheless, in the current analyses, season explained only a small proportion (3%) of the variation in urinary Pro-B, and its relationship with elevated Pro-B in logistic regression analyses did not reach statistical significance for non-normalized data (
Table 5). The latter result may point to the creatinine-normalized cutoff as a better indicator of recent citrus consumption. To summarize, these results indicate that even in industrialized settings, agricultural season may have some influence on dietary intakes of particular foods, and that these changes are detectable in a biomarker of citrus consumption in pregnant women living in Northern California. These results are likely context-specific; therefore, dietary surveys and epidemiological studies assessing diet should consider the local context and be designed to account for this potential source of variation.
A small, statistically significant negative association was found between gestational age and non-normalized urinary Pro-B concentration, suggesting a slight reduction in Pro-B over the course of gestation. In contrast to citrus season, which is assumed to influence Pro-B levels largely through true changes in diet, gestational age could potentially be associated with Pro-B levels either due to dietary changes associated with pregnancy progression or through changes in physiology affecting digestion, metabolism, or urinary excretion dynamics. With regard to diet, some evidence suggests variation in dietary intake of certain food groups [
19,
78] and nutrients [
78,
79] throughout pregnancy. For example, a study of Canadian women observed decreases in the fruit and vegetable sub-score of the Canadian Healthy Eating Index across pregnancy trimesters [
19]. This study also found decreases across trimesters in reported nausea (a well-known phenomenon), food cravings, and food aversions [
19], conditions that can influence dietary consumption behavior during pregnancy [
20,
78]. In particular, in a study assessing self-reported dietary changes in pregnancy and reasons thereof, craving was found to be the most commonly named reason for increasing intake of foods, with fruit among the top listed foods that were increased for this reason [
20]. Thus, changes in fruit consumption during pregnancy is plausible, although consistent trends across gestation may not be expected. Accordingly, we observed no association between gestational age and Pro-B above the defined thresholds, a likely indicator of recent citrus consumption, in our logistic regression analyses.
With respect to potential biological effects of gestation on urinary biomarker levels, changes in kidney anatomy and function occurring during pregnancy, e.g., increased glomerular filtration rate and kidney volume and decreased reabsorption of certain substances such as glucose (reviewed in [
17]), could conceivably influence Pro-B concentrations in spot and 24-h urine samples. However, although several studies point to Pro-B being minimally metabolized and rapidly excreted [
37,
39,
56,
80], an in-depth understanding of the renal mechanisms by which Pro-B is excreted in mammals appears to be lacking, pointing to the need for more comprehensive pharmacokinetic research [
81]. Given this research gap, the potential effects of changes in kidney function on urinary Pro-B concentration, as well as the appropriateness of adjustment for hydration status by creatinine normalization [
18,
82], remain unclear. Finally, it is important to consider that the beta coefficients reported here indicated only a small association between non-normalized Pro-B and gestational age and thus may not translate into meaningful biological or dietary changes. Based on the inconsistent results regarding this association for non-normalized vs. creatinine-normalized Pro-B levels and the unavailability of acute dietary intake data in most participants, additional research is needed to investigate whether gestational age augments the quantitative relationship between citrus consumption and Pro-B levels and whether it should be accounted for when using Pro-B as a dietary biomarker during pregnancy. Nevertheless, given the potential for dietary changes throughout pregnancy and the existence of critical windows for nutritional effects on fetal development [
83,
84,
85,
86,
87], biomarker measurements taken during multiple trimesters or at hypothesis-specific time points during pregnancy will likely remain relevant for dietary surveys and epidemiological studies.
Our study complements and extends upon previously reported data on the correlation between short- to medium-term citrus intake and urinary Pro-B levels [
39,
55,
58], as well as studies relating relative abundance of Pro-B to long-term reported intake [
45,
46,
48,
57]. First, we applied a previously reported threshold for creatinine-normalized urinary Pro-B relevant to recent citrus consumption [
56] and found it performed moderately well in a subset of samples for which 24HDR data were available, with similar results for a cutoff derived for non-normalized Pro-B. Given that for some misclassified samples it was discovered that items potentially containing citrus were reportedly consumed (although their citrus content could not be confirmed), our finding of 82.6% and 78.3% agreement between the biomarker cutoffs and reported intake is likely an underestimate. Second, by using repeated measures of quantified Pro-B in urine in relation to a measure of usual citrus intake during pregnancy, we have shown that reported usual citrus consumption, both in terms of frequency of consumption and average daily servings, is highly predictive of elevated urinary Pro-B in short-term urine specimens and moderately correlated with urinary Pro-B concentration averaged from repeated measurements, respectively. The strength of this correlation is likely impacted by several, uncorrelated sources of measurement error in the biomarker and dietary recall instrument. For example, while a FFQ is designed to directly capture usual intake within a specified period of time, substantial recall bias may occur due to social desirability bias, imperfect recollection, and other person-specific factors [
88,
89]. Furthermore, although urinary Pro-B is not subject to recall bias, this measure is more akin to a 24HDR in that it reflects more of the random day-to-day variation in dietary intake, which introduces error when used as a measure of usual intake. In partial support of this, the correlation coefficients observed in this study suggest stronger correlations when 2 vs. 1, and possibly 3 vs. 2, measures per individual are used (
Figure 2 and
Figures S3 and S4, Supplemental Materials). However, more research is needed to confirm whether the collection of >2 specimens leads to stronger correlations, given the small number of individuals with 3 or more repeats available in this study. Nevertheless, based on the estimated variance components, the number of required samples to estimate usual Pro-B level or rank individuals with a desirable degree of accuracy was higher than the number commonly collected or likely to be considered feasible to collect in large population studies (
Table 8); thus, modeling strategies to account for random day-to-day variation and other measurement error similar to those applied to 24HDR data [
22,
23,
90] may be preferable to averaged repeats when analyzing urinary Pro-B data to assess usual citrus exposure for population distributions or epidemiological studies. A second source of biomarker error may derive from the consumption of non-citrus sources of Pro-B [
37,
39,
62,
63,
64,
65,
66]. The importance of this source of error is likely population-specific as it depends on the consumption of a small number of specific food items with substantial Pro-B levels (e.g., gorgonzola cheese). In our study population, such items were not reportedly consumed among the subsample of participants for which 24HDR data were available, which in part could have been influenced by dietary safety recommendations during pregnancy advising against consumption of certain seafood and soft cheeses [
91]. Thirdly, the relationship between dietary intake and biomarkers is influenced by the digestion, absorption, metabolism, and excretion of the biomarker (or its dietary precursors), which could vary across individuals. To elucidate these factors for Pro-B in the context of citrus consumption, existing pharmacokinetic studies, which have focused largely on orange juice consumption [
37,
39,
56], should be expanded to investigate the aforementioned factors and include a range of commonly consumed citrus juices and fruits.
Several limitations should be considered in interpreting the results. First, the use of spot urine specimens collected at the same time of day (first-morning void) may not be ideal for representing citrus consumption over the course of a whole day. Some previous research has observed that most Pro-B excretion occurs in the first 14 h following orange juice consumption [
39]; thus, collecting spot urine specimens only at the first-morning void may systematically under-measure citrus consumed in the morning, since this timing may not capture the previous or current morning’s consumption. A solution to this problem might be to alternate the timing of spot urine collections at different times during the day to ensure the representativeness of morning, midday, and evening consumption among individuals. Second, while taking a
1H-NMR metabolomics approach to measuring Pro-B offers several advantages, including the ability to obtain absolute quantification of multiple compounds using a single experiment, this method is limited in its sensitivity compared to other approaches, such as mass spectrometry, which can detect concentrations in the nanomolar range. The resulting uncertainty in the quantification of Pro-B present in low concentrations, which occurred in a substantial proportion of samples in the present study, may have influenced the variance component estimates reported here. However, we also present a second approach to data analysis that avoids this uncertainty by considering a concentration threshold likely indicative of recent citrus intake. Employing a two-part, probability × amount model, as has previously been conducted in dietary recall analysis to account for the often episodic nature of food consumption [
22,
25] may allow researchers to maintain the efficiency of measuring multiple biomarkers simultaneously (as opposed to opting for a more sensitive method) while reducing this uncertainty in the quantitative data. A third potential limitation is the unknown effects of long storage times and up to three freeze-thaw cycles on the stability of Pro-B in urine samples. Although no previous literature was found to address these questions in the case of urine, one study reported high percent recoveries (means of 94–100%) of Pro-B in rat plasma after three freeze-thaw cycles or 30 days of storage at −20 °C [
92]. In the present study, no clear pattern in urinary Pro-B was observed by the number of freeze-thaw cycles or storage time. Measurement error in the estimation of gestational age is common and may have been present in this study; such error can introduce bias into measured associations under some circumstances [
93]. Finally, it should be noted that the FFQ form used in this study was not specifically validated for use in pregnant women to cover food intake during pregnancy; instead, the defined recall period was modified to cover the first or second half of pregnancy by way of written and/or oral instructions given to participants at the time of form administration (for question wording, see
Table S1, Supplemental Materials). This discrepancy might have introduced additional error into participants’ estimation of food intake for the period of interest and influenced the biomarker-FFQ correlation.
This study also has several strengths. We provide for the first time a quantitative analysis of the within- and between-individual variance components of urinary Pro-B concentration in pregnant women, an understudied population in terms of research on this biomarker. Analyzing up to 5 specimens per person covering all trimesters of pregnancy and seasons allowed us to differentiate random day-to-day variation from two potential sources of systematic WIV in biomarker levels—gestational age and citrus production season—and to test the effect of the number of samples per individual on the correlation with reported usual intake. Collecting >2 repeated measures also may have allowed for better variance component estimation than collecting fewer repeats and/or collecting repeats on a smaller subset of participants, although additional research is needed to determine the ideal timing between and the number of repeated measurements. Another strength was the inclusion of data from spot urine collections along with 24-h specimens, which allowed for the assessment of whether the collection method contributed to variation in the biomarker and for the observation of a higher degree of WIV in Pro-B among spot urine samples. In the context of large epidemiological studies, using spot urine samples carries the advantage of having a smaller participant burden relative to 24-h collections; thus, the results presented here can inform future studies aiming to use this measure to assess usual citrus intake in free-living populations.
Some additional research is warranted before relying on Pro-B as a quantitative measure of citrus intake, whether acute or usual, for the purposes of replacing, validating, or correcting for measurement errors in dietary recall instruments. Notably, recently developed calibration equations that successfully predicted mean total citrus intake over 4 consecutive days based on a single first void urine specimen in an Irish population [
55,
58] are promising and should be validated in other populations with varying dietary habits and genetic backgrounds, as well as in pregnant populations. From this angle, further defining the temporal limits of the predictive ability of a single urinary measurement would be of value (i.e., how many days’ intake can be quantitatively predicted from a single measurement?). While our study provides useful data towards the goal of quantifying the relationship between usual citrus consumption and urinary Pro-B concentrations in repeated urine specimens, given that long-term usual intake is not directly observable as a comparative measure, additional studies to understand the short-term excretion kinetics and percent recovery of dietary Pro-B from a variety of citrus products in diverse populations may further inform the development of (1) mathematical models for quantitative citrus intake prediction, and (2) urine sampling protocols to optimize the number and timing of repeated measures for estimating citrus intake for a given time period. (For example, one approach may involve the collection of 2 or 3 spot urine specimens collected throughout the day to better capture spikes in Pro-B levels and differentiate between recent consumption of a small portion and consumption of a larger portion several hours ago.) Importantly, the aforementioned research gaps apply to pregnant women as well as other subpopulations of interest for nutrition monitoring and epidemiology (e.g., non-pregnant adults, children, etc.).