Next Article in Journal
Brain and Cognitive Development in Adolescents with Anorexia Nervosa: A Systematic Review of fMRI Studies
Next Article in Special Issue
Association between Grain Intake, Nutrient Intake, and Diet Quality of Canadians: Evidence from the Canadian Community Health Survey–Nutrition 2015
Previous Article in Journal
Relationship between Nutrition Intake and 28-Day Mortality Using Modified NUTRIC Score in Patients with Sepsis
Previous Article in Special Issue
Differences in Dietary Intakes among Lebanese Adults over a Decade: Results from Two National Surveys 1997–2008/2009
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Early Experience Analyzing Dietary Intake Data from the Canadian Community Health Survey—Nutrition Using the National Cancer Institute (NCI) Method

1
Bureau of Food Surveillance and Science Integration, Health Canada, Ottawa, ON K1A 0K9, Canada
2
Office of Nutrition Policy and Promotion, Health Canada, Ottawa, ON K1A 0L2, Canada
*
Author to whom correspondence should be addressed.
Nutrients 2019, 11(8), 1908; https://doi.org/10.3390/nu11081908
Submission received: 18 April 2019 / Revised: 1 August 2019 / Accepted: 9 August 2019 / Published: 15 August 2019

Abstract

:
Background: One of the underpinning elements to support evidence-based decision-making in food and nutrition is the usual dietary intake of a population. It represents the long-run average consumption of a particular dietary component (i.e., food or nutrient). Variations in individual eating habits are observed from day-to-day and between individuals. The National Cancer Institute (NCI) method uses statistical modeling to account for these variations in estimation of usual intakes. This method was originally developed for nutrition survey data in the United States. The main objective of this study was to apply the NCI method in the analysis of Canadian nutrition surveys. Methods: Data from two surveys, the 2004 and 2015 Canadian Community Health Survey—Nutrition were used to estimate usual dietary intake distributions from food sources using the NCI method. The effect of different statistical considerations such as choice of the model, covariates, stratification compared to pooling, and exclusion of outliers were assessed, along with the computational time to convergence. Results: A flowchart to aid in model selection was developed. Different covariates (e.g., age/sex groups, cycle, weekday/weekend of the recall) were used to adjust the estimates of usual intakes. Moreover, larger differences in the ratio of within to between variation for a stratified analysis or a pooled analysis resulted in noticeable differences, particularly in the tails of the distribution of usual intake estimates. Outliers were subsequently removed when the ratio was larger than 10. For an individual age/sex group, the NCI method took 1 h–5 h to obtain results depending on the dietary component. Conclusion: Early experience in using the NCI method with Canadian nutrition surveys data led to the development of a flowchart to facilitate the choice of the NCI model to use. The ability of the NCI method to include covariates permits comparisons between both 2004 and 2015. This study shows that the improper application of pooling and stratification as well as the outlier detection can lead to biased results. This early experience can provide guidance to other researchers and ensures consistency in the analysis of usual dietary intake in the Canadian context.

1. Introduction

Health Canada relies on the statistical analysis of dietary surveillance data to help in establishing effective and evidence-based policies, regulations and standard-settings to promote overall health. The 2015 Canadian Community Health Survey—Nutrition [1] (2015 CCHS—See list of Abbreviations in Appendix A) represents the most recent national food consumption dataset. The survey also provides information with respect to nutrient from food and dietary supplements, physical measurements, household food insecurity, and food exclusion (e.g., excluding meat). An important goal of this national nutrition survey is to analyze the usual dietary intake of dietary components (e.g., calcium or fluid milk), by demographic variables of interest, and to estimate the proportion of the population with intakes above or below reference thresholds.
The usual dietary intake represents the long-term consumption of a dietary component; yet population-based nutrition surveys typically use only a few 24-h recalls of individuals to measure dietary variables [2]. These recalls collect information on the quantity and types of foods consumed on a given day. However, since individuals do not generally eat the same foods from one day to another, intra-individual variation is present and must be incorporated into the statistical analysis to avoid over- or under-estimation of nutritional intakes [2]. Hence, statistical methods are used to model usual dietary intake of population to account for the intra- and inter-individual variations present [3].
Since the previous CCHS-Nutrition survey was conducted in 2004 (2004 CCHS), advances in statistical methods for the analysis of usual dietary intake have occurred, and new methods have been proposed [3]. In particular, a joint technical working group comprised of statisticians from Health Canada and Statistics Canada was created in 2016 to evaluate existing statistical methods for estimating the usual dietary intake. After a review of available methods, the working group concluded that the National Cancer Institute (NCI) method [4,5,6] for estimating the usual intake would be recommended for the analysis of the 2015 CCHS data. Many advantages of the NCI method over competing methods were noted, including: Permitting estimation of usual intakes for episodically dietary components (e.g., red meats); permitting adjustment for important covariates of interest; and accounting for the correlation between probability of consumption and amount consumed [5,6,7].
The general reference documentation on the NCI method is available elsewhere [4] including macros for implementation. The NCI method is implemented in three separate collections of SAS macros. The first set models a single dietary component which may be episodically consumed; the second set models two components simultaneously and the third set models many components simultaneously. The focus of this article is to share insights with respect to the first set of macros for modelling a single dietary component.
Objective
The NCI method was originally designed for the National Health and Nutrition Examination Survey (NHANES) [8] conducted in the United States. With a different sampling design, limited advice has been published with respect to implementing the NCI method in the context of CCHS-Nutrition surveys [1,9]. While Statistics Canada has provided a tutorial on “how to” implement the NCI SAS macros in CCHS-Nutrition (available upon request), the objective of this article is to summarize results of early experience implementing the NCI method, and highlight important statistical considerations which arose during the analysis of usual dietary intake. In particular, we considered the following five questions:
A. Choice of Model:Which of the different NCI models should be implemented?
B. Covariates:Which covariates should be included in the model?
C. Stratification versus Pooling:Should age/sex groups be pooled to provide more precise estimates?
D. Outliers:How should outliers be identified and when should they be removed?
E. Computational Time:What is the computational time required by the NCI method?

2. Materials and Methods

2.1. Data Set and Variables of Interest

The analysis considered dietary consumption data from the 2004 CCHS [10] and the 2015 CCHS [1]—Share files. Both surveys collected detailed information on the consumption of foods, nutrient from foods, and dietary supplements among a representative sample of Canadians living in private households in the 10 provinces. The 2015 CCHS considered respondents aged one and older, whereas the 2004 CCHS sampled all individuals aged zero and older. In both surveys, consumption data were collected using 24-h recalls, in a similar way as the NHANES. Respondents were asked to list all foods and beverages consumed during the 24-h prior to the day of their interview (midnight to midnight), and interviewers used the automated multi-pass method with a five-step approach to help respondents remember their food and beverage choices [11]:
(i)
A quick list (respondents reported all foods and beverages consumed in whatever order they wished);
(ii)
questions about specific food categories and frequently forgotten foods;
(iii)
questions about the time of consumption and type of meal (for example, lunch, dinner);
(iv)
questions seeking more detailed, precise descriptions of foods and beverages and quantities consumed;
(v)
a final review of food choices.
A subsample of the population responded to a second 24-h recall 3–7 days later to help assess intra-person variations in the food and beverage intake. The energy and nutrient content of the foods were derived from Health Canada’s Canadian Nutrient File (CNF) Version 2001b Supplement for the 2004 CCHS, and from Health Canada’s CNF 2015 for the 2015 CCHS. In 2004, the initial recall was completed by 35,107 people, and a subsample of 10,786 completed the second recall, yielding response rates of 76.5% and 72.8%, respectively. Correspondingly in 2015, a total of 20,487 people completed the initial 24-h dietary recall, and a subsample of 7608 completed the second recall, yielding response rates of 61.6% and 68.6%, respectively. For this analysis, the CCHS Share files were used, consisting of respondents who agreed to share their data with government analysts for the purpose of research (n = 33,469 for 2004 and n = 19,673 for 2015). Other important differences between the two nutrition surveys are summarized elsewhere [1,9].
In the present analysis, one nutrient (dietary folate equivalents (DFE)) and one episodically consumed food (red meats) were chosen to represent intakes of daily and episodically consumed dietary components, respectively.
Descriptive statistics for the usual dietary intake (e.g., means, percentiles) and prevalence of inadequacy for DFE (i.e., percent below the estimated average requirement (EAR)) were estimated. Standard errors were computed using bootstrap weights supplied by Statistics Canada [1,10] to properly account for the sampling error in estimation.
Similarly, the means and percentiles of the distribution of usual intakes for other nutrients as well as the comparison of usual intakes to different dietary cut-offs can be found in the Compendium of Nutrient Intakes from Food [12].

2.2. Statistical Methods

The NCI method for a single dietary component utilizes two main SAS macros, MIXTRAN and DISTRIB [4]. The MIXTRAN macro fits statistical models to estimate both the probability of consumption and the amount consumed. This is referred to as the two-part model. The probability of consumption may be related to the amount consumed, in which case their correlation can be accounted for in MIXTRAN. For the case where the dietary component is ubiquitously consumed, the probability of consumption is assumed to be one. This case is known as the one-part (or amount-only) model. The DISTRIB macro uses the parameter estimates from MIXTRAN to estimate the distribution of usual intakes as well as the proportion of intakes below or above a certain cut-off [5,6].
For the implantation of the NCI method, a dietary component as well as strata (e.g., age/sex, province) and potential covariates must be identified from the onset. Then, the following questions should be addressed:
A. Choice of Model: To determine whether to use the one-part (amount) or two-part (uncorrelated or correlated) model, we considered a similar approach to the one proposed by Krebs-Smith et al. (2010) [2].
  • If less than 5% of the 24-h recalls (unweighted) within each stratum had zero intake of the dietary component, the amount-only model is used;
  • If greater than or equal to 5% of the 24-h recalls had zero intake, the two-part correlated model is fitted. The correlation between the probability of consumption and the amount consumed is assessed via its Fisher’s transformation using a smaller number of bootstrap replicates (e.g., 50). If it is found to be significant, the correlated model is selected. Otherwise, the uncorrelated model is chosen.
If convergence of the two-part models is not achieved, the amount-only model can be used only if the percentage of the 24-h recalls with zero intakes is between 5%–10%. If more than 10% of the 24-h recalls have zero intakes, the amount-only model over-estimates the usual intake distribution since observations with zero intakes are replaced by one half the minimum observed value. For this case, if the convergence of the two-part model is not achieved, other options should be explored, such as pooling of neighboring age/sex groups or expanding the scope of the dietary component (e.g., from red meats to all meats).
B. Covariates: Dietary component intakes vary by age and sex as well as weekday/weekend of the consumption day; these variables were included as covariates in the model. As per NCI recommendations [4], the sequential effect of the recall was considered in the models. Data from 2004 and 2015 CCHS were combined and the survey year was included as a covariate to distinguish between the two surveys and to obtain year-specific estimates. One advantage of including the survey year as a covariate is that it permits hypothesis testing of a difference in the mean usual intake between the two survey years.
Results included in this paper correspond to estimates at the national level. However, for the Compendium of Nutrient Intakes from Food [12], the province is included as a covariate and estimates are generated at the national and provincial levels.
Bootstrapping is used for the estimation of the standard error of covariate estimates. Statistics Canada’s recommendation is to use all of the 500 bootstraps provided with the survey data [1,10]. We tested the impact of the varying number of bootstraps in the size of the standard errors. We compared the use of 50, 100, 200, 300, 400 and 500 replicates.
C. Stratification compared to Pooling: The NCI method requires sufficient sample size to provide stable estimates of the variance components. This is sometimes difficult to achieve, particularly in the analysis of episodically consumed dietary components. To this end, previous NCI research has considered pooling categories of important covariates (e.g., age groups) so as to borrow information, lead to convergence [2] and generate more precise estimates.
The stratified analysis consists of applying the NCI method on each age-sex group of interest separately. We used the same age-sex groupings recommended by the Institute of Medicine of the National Academies Dietary Reference Intakes (DRI): 1–3 years, 4–8 years (both sexes combined); and separate analysis for males and females aged 9–13 years, 14–18 years, 19–30 years, 31–50 years, 51–70 years and 71 years and older [13].
When pooling the analysis for DFE, we used a similar method proposed by Krebs-Smith et al. (2010) [2], namely children <9 years old; females 9 years and older and males 9 years and older. For red meat, we pooled children <19, females 19+ and males 19+ in order to increase the sample size in the youngest group.
To assess how appropriate the pooling by age/sex groups is as opposed to computing stratified estimates, ratios of within-person to between-person (within-between) variance components were compared. A rule of thumb is that subgroups may be pooled if their estimated within-between variance ratios are similar. Alternatively, the estimated within-between variance ratios obtained from the stratified and from the pooled analysis should be similar under the equal variance assumption.
D. Outliers: Outliers were investigated for their impact on statistical estimates. Previous research has indicated that within-between variance ratios larger than 10 may attenuate diet-health relationships in studies with a limited number of dietary recalls [14]. Our proposed method considers domains where the ratio of within-between variance components is greater than 10, and flags observations if they fall more than ±3, ±2.5 or ±2 standard deviations away from the mean for the distribution of the difference between the first and second recalls. With this method, second day recalls of the flagged observation are removed, since second recalls are potentially biased due to the learning effect. The scenario which resulted in the greatest improvement in the within-between variance ratio and removed the fewest observations is selected.
E. Computational Time: The computational time associated with each model run was noted. In addition, the possibility of using smaller number of bootstrap replicates to reduce the computational time in different analysis was investigated.
Analysis was completed using the SAS Enterprise Guide 5.1 and the macros MIXTRAN and DISTRIB (Version 2) as available from the NCI website [4]. A significance level of 5% was used throughout.

3. Results

We developed a flowchart to help in the application of the NCI method for the case with a single dietary component (Supplementary Material). This flowchart provides a pathway through the selection of the model to use, the detection of outliers, and the estimation of the distribution of usual intakes.

3.1. Choice of Model

Following the choice of a dietary component and of strata of interest, the next step listed on the flowchart is to evaluate the percent of observations with zero intake.
For DFE, among all age/sex groups only 61 observations (less than 1%) had zero intake in either of the recalls, thus the amount-only model was used to estimate usual intakes (Steps 5A–8A of the flowchart).
For red meats, the percentage of zero intakes was greater than 40% for all DRI groups, thus the two-part model was selected (Step 5B). In all situations, the correlation between person-specific effects (Step 6C) was not significantly different from zero (p > 0.05). Estimates of the usual intakes were then obtained using the uncorrelated model (Steps 8B–10B).

3.2. Covariates

Covariates included in the modelling for DFE and for red meat were: Age/sex groups (when different age/sex groups are pooled), weekend/weekday of recall, and sequential effect of the recall (1st or 2nd recall) as well as survey year, similar to other published recommendations from the National Cancer Institute [4].
Figure 1 illustrates the 2004 and 2015 usual intakes of DFE for (a) females 9–13 and (b) for females 51–70. There is no significance difference between the mean usual intakes of DFE between the two cycles for females 9–13 (p = 0.40). Females 51–70 reported significantly larger mean usual consumptions of DFE in 2004 (p = 0.002).
Results showed that an increasing number of bootstrap replicates had minimal effect on the significance of the covariates in distinguishing the distributions of DFE from two survey cycles beyond 50 bootstrap replicates. As an example, the p-values presented in Figure 1 were calculated using 500 bootstrap replicates. When using only 50 bootstrap replicates instead, the p-values for the difference in mean usual intakes by survey year for females 9–13 is unchanged while for females 51–70 it was now found to be p = 0.004 as opposed to p = 0.002.

3.3. Stratification versus Pooling

Usual intakes for DFE and for red meat were obtained using the stratified approach and the pooling of age groups. Key percentiles of the usual intake distribution and estimates of the ratio of within-between variance for DFE and red meats for the 2015 CCHS data are presented in Table 1 and Table 2 respectively. Note that only results for children and females are presented. Results for males and for the 2004 CCHS were analogous.
When comparing estimates between the stratified and pooled approaches, we considered the relative difference in the mean and percentile estimates as well as the ratio of within-between variance. In particular for DFE intakes, we noted that the estimates of the mean and median were similar for almost all age/sex groups in the pooled or stratified analysis (differing by less than 4%, except for females 9–13) (Table 1). However, estimates of the 5th percentile differed by 17.3% (children 1–3 years) and 27.5% (females 9–13 years) between the pooled and stratified results. Similarly, large differences were noted at the 95th percentile (9% and 9.4% difference, respectively). Figure 2a illustrates the larger tails associated with the usual intake distribution for the pooled analysis over the stratified analysis for females 9–13 years. Figure 2b shows that the two methods yielded similar distributions for females 19–30. For age groups in which large differences were found, we noted the within to between ratio was much different for the stratified versus the pooled analysis (Table 1), indicating that the equal variance assumption may not hold and pooling these groups results in different estimates of usual intake distributions.
For red meats, original stratification yielded a within-between variance ratio >10 for children 2–3 even after excluding outliers (data not shown). As a result, the smallest strata where the within-between ratio was <10 was when children 2–3 years old were pooled with children 4–8. In Table 2, results listed for children 2–3 and for children 4–8 under “stratified” were obtained from pooling these age groups together. Listed under “pooled” for the same age groups are the results obtained when pooling children <19.
A similar situation occurred with females 19–30 where the initial within-between variance ratio was >10. In Table 2, results listed for females 19–30 and females 31–50 under “stratified” were obtained from pooling these two age/sex groups together. Listed under “pooled” for the same age groups are the results obtained when pooling females 19+.
Similar to DFE, with the exception of the 5th percentile, the largest differences were found for those with greater differences in the ratio of within/between variances (female 14–18 years and female 71+). Interpretation with caution is advised in all 5th percentiles reported due to the large variability of the estimates (i.e., CV in [16.6, 33.3]). We also compared the impact of using 250 and 500 bootstraps replicates. Table 3 provides estimates of standard errors obtained using the pooled and stratified approaches for DFE, using 250 and 500 bootstrap replicates. Standard errors from the stratified analysis were equivalent or larger than those from the pooled analysis. The pooled estimates are expected to be more precise as they are based on a larger sample and “borrow strength” from other observations to better estimate the variance components. When increasing the number of bootstrap replicates, minimal improvement to standard error estimates were found when more than 250 bootstrap replicates were used. This result is consistent with other published literature [6].

3.4. Outliers

For DFE, ratios of the within-between variance were less than 10 for all DRI groups except children 4–8. For this group, 17 observations were flagged as outliers and their second day of recall were removed. For red meats, no outliers were removed for the pooled analysis, whereas for the stratified analysis, 109 out of 6396 s recalls were removed over five DRI groups (9 for children 2–3; 32 for children 4–8; 23 for females 9–13; 21 for females 31–50; and 24 for females 51–70).

3.5. Computational Time

As expected, the two-part model requires more time to reach convergence as compared to the amount-only model. Pooled analysis requires more time as compared to the total computation time of all the strata analyzed separately. Based on our analysis, the computational time for the stratified analysis was found to be between 2–5 h per DRI age/sex group for red meat and 1–2 h for DFE. Considering the pooled analysis, there was a 50% increase in the total computational times of the two-part model over the one-part model. Additionally, the total time for DFE was 23 h for the stratified analysis and 44.4 h for the pooled analysis. For red meats, the corresponding total computational times were 48.2 h and 66.1 h, respectively.

4. Discussion

While various authors have analyzed usual dietary intakes from the CCHS-Nutrition survey using the NCI method [15,16,17,18,19], few authors have assessed the statistical considerations mentioned in the present article. The NCI method applied to the CCHS-Nutrition data offers computational advantages for estimation of usual dietary intake such as the ability to analyze episodically consumed foods (and nutrients) and the inclusion of covariates. However, increased analytical capability comes with an increased number of statistical assumptions, which should be verified as part of the analysis. In particular, statistical considerations such as the assumption of equal variance, choice of model, correlation of variance components and the presence of outliers should be acknowledged as part of an analysis of usual intakes using the NCI method.
We propose the following recommendations when analyzing dietary intake data from the 2004 and 2015 CCHS using the NCI method:
  • Consider the unweighted proportion of the 24-h recalls having zero intake of a nutrient or food in order to determine whether a one- or two-part model should be used.
  • If different categorical covariates are of interest (e.g., DRI age/sex groups; income levels), use the MIXTRAN macro to compute estimates of the parameters in two ways: (a) Pooling all levels by using a covariate and (b) separate analysis for every level of the covariate. In each case, compare the ratio of the within-between variance components, and consider a stratified analysis if this variance ratio is at least 30% higher or lower than for the other approach.
  • When analyzing dietary components that are rarely consumed (i.e., very episodic), there might not be enough respondents with non-zero intake to obtain stable estimates of the variance components. In addition, estimated distributions may show large variability (i.e., large CVs), particularly in the tails of the distribution. For such cases, a recommendation would be to retain broader nutrient and/or covariate categories. For example, an analyst could model all meats instead of red meats; or pooling with other age/sex groups is a viable option, although not always recommended.
  • Perform outlier detection by evaluating recalls which may affect equal variance assumptions. Remove outliers and re-fit using the MIXTRAN macro to evaluate the impact on parameter estimates.
Supplementary Material presents a flowchart for the univariate analysis of usual intakes using the NCI method, based on results from the present article.
Future research would include evaluation of the bivariate and multivariate NCI macros as applied to the CCHS-Nutrition surveys, which are available for more complex scenarios involving ratios of two or more dietary components [20,21,22].
Despite the many advantages of the NCI method, one difficulty is that the method requires a large sample size in order to generate reliable estimates, which in turn may necessitate pooling subgroups. This issue intensifies for episodically consumed dietary components. However, a consequence of pooling groups is that the pooled variance components may incorrectly estimate the variances within subgroups when the equal variance assumption does not hold. Consequently, the adjustment factor will be inaccurate, leading to bias in the estimation of the usual intake distribution. The other caveat is the decision of how to pool. In this research, the analyses were pooled by neighboring age/sex groups. However, pooling may be performed using other groups or variables, for example neighboring provinces. Choosing the type of pooling strategy is a balance between the adequate sample size, the equal variance assumption, and the appropriate research question.
Retaining observations flagged as outliers in the analysis may result in over-estimation of the ratio of the within-between variance. This, in turn, inflates the estimated standard errors to a point that the estimates are no longer publishable according to Statistics Canada’s guidelines (i.e., large CV’s). In this research, we proposed a method which removes outliers based on the difference between the two days of recall. Another approach recommended by Krebs-Smith et al. (2010) [2] is to Box-Cox transform the raw non-zero values and flag observations which were 2.5 times the interquartile range below the 25th and above the 75th percentiles, respectively. Our proposed method identifies outliers as those points having the greatest impact on the within-between variation whereas the second method characterizes observations which may violate normality assumptions.
Lastly, implementation of the NCI method for large surveys such as CCHS that requires bootstrapping is computationally intensive and may limit the ability to perform standard model diagnostics. We noted that increasing the number of bootstraps has minimal effect on the significance of the covariates or the correlation coefficient. To reduce the computational time, we recommend employing model building steps with fewer bootstraps weights (e.g., 50). After finalizing the model, results may be run for all the 500 bootstraps replicates to have precise estimates of standard errors.

5. Conclusions

The use of the NCI method is a valuable tool for Canadian nutrition surveys. Lessons learned from early experience using the NCI method lead us to the development of a flowchart to facilitate the choice of the NCI model to use. The ability of the NCI method to include covariates permits comparisons between both the 2004 and 2015 CCHS and informs health policymakers and regulators as to changes in the prevalence of nutrient inadequacy over time. This study shows that the improper application of pooling and stratification as well as outlier detection can lead to biased results. This early experience can provide guidance to other researchers and ensures consistency in analysis of usual dietary intake in the Canadian context.

Supplementary Materials

The following are available online at https://www.mdpi.com/2072-6643/11/8/1908/s1, Flowchart for Estimation of Usual Intakes using the univariate NCI method.

Author Contributions

The individual contributions of the authors are: “conceptualization, K.A.D., A.S. and D.I.; methodology, K.A.G., L.L., C.Q., A.S., M.V., K.C.W. and D.I.; software, A.G., C.Q.; validation, K.A.D., A.G., L.L., G.Q., A.S., M.V., K.C.W. and D.I.; formal analysis, K.A.D., A.G. and C.Q.; writing—original draft preparation, K.A.D., A.S. and D.I.; writing—review and editing, K.A.D., A.G., L.L., C.Q., A.S., M.V., K.C.W. and D.I.; supervision, D.I.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

List of Abbreviations:

AMPMAutomated Multi-Pass Method
CCHSCanadian Community Health Survey
CNFCanadian Nutrient File
CVCoefficient of Variation
DFEDietary folate equivalents
DRIDietary Reference Intake
EAREstimated Average Requirement
NCI methodNational Cancer Institute method
NHANESNational Health and Nutrition Examination Survey
SEStandard Error
SIDESoftware for Intake Distribution estimation

References

  1. Statistics Canada. Canadian Community Health Survey 2015—Nutrition User Guide; Statistics Canada: Ottawa, ON, Canada, 2017. [Google Scholar]
  2. Krebs-Smith, S.M.; Guenther, P.M.; Subar, A.M.; Kirkpatrick, S.I.; Dodd, K.W. Americans do not meet federal dietary recommendations. J. Nutr. 2010, 140, 1832–1838. [Google Scholar] [CrossRef] [PubMed]
  3. Dodd, K.W.; Guenther, P.M.; Freedman, L.S.; Subar, A.F.; Kipnis, V.; Midthune, D.; Tooze, J.A.; Krebs-Smith, S.M. Statistical methods for estimating usual intake of nutrients and foods: A review of the theory. J. Am. Diet. Assoc. 2006, 106, 1640–1650. [Google Scholar] [CrossRef] [PubMed]
  4. National Cancer Institute. Usual Dietary Intakes: The NCI Method. 2018. Available online: https://epi.grants.cancer.gov/diet/usualintakes/method.html (accessed on 30 May 2019).
  5. Tooze, J.A.; Midthune, D.; Dodd, K.W.; Freedman, L.S.; Krebs-Smith, S.M.; Subar, A.F.; Guenther, P.M.; Carroll, R.J.; Kipnis, V. A new statistical method for estimating the usual intake of episodically consumed foods with application to their distribution. J. Am. Diet. Assoc. 2006, 10, 1575–1587. [Google Scholar] [CrossRef] [PubMed]
  6. Tooze, J.A.; Kipnis, V.; Buckman, D.W.; Carroll, R.J.; Freedman, L.S.; Guenther, P.M.; Krebs-Smith, S.M.; Subar, A.F.; Dodd, K.W. A mixed-effects model approach for estimating the distribution of usual intake of nutrients: The NCI method. Stat. Med. 2010, 29, 2857–2868. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Nusser, S.M.; Carriquiry, A.L.; Dodd, K.W.; Fuller, W.A. A semi-parametric transformation approach to estimating usual nutrient intake distributions. J. Am. Stat. Assoc. 1996, 91, 1440–1449. [Google Scholar] [CrossRef]
  8. National Center for Health Statistics. National Health and Nutrition Examination Survey. Available online: https://www.cdc.gov/nchs/nhanes/index.htm (accessed on 5 July 2018).
  9. Health Canada. Reference Guide to Understanding and Using the Data—2015 Canadian Community Health Survey—Nutrition. 2017. Available online: https://www.canada.ca/en/health-canada/services/food-nutrition/food-nutrition-surveillance/health-nutrition-surveys/canadian-community-health-survey-cchs/reference-guide-understanding-using-data-2015.html (accessed on 30 May 2019).
  10. Statistics Canada. Canadian Community Health Survey (CCHS) Cycle 2.2 (2004) Nutrition—General Health (including Vitamin & Mineral Supplements) & 24-Hour Dietary Recall Components User Guide. 2008. Available online: http://www23.statcan.gc.ca/imdb-bmdi/document/5049_D24_T9_V1-eng.pdf (accessed on 30 May 2019).
  11. Moshfegh, A.J.; Rhodes, D.G.; Baer, D.J.; Murayi, T.; Clemens, J.C.; Rumpler, W.V.; Paul, D.R.; Sebastian, R.S.; Kuczynski, K.J.; Ingwersen, L.A.; et al. The US Department of Agriculture Automated Multiple-Pass Method reduces bias in the collection of energy intakes. Am. J. Clin. Nutr. 2008, 88, 324–332. [Google Scholar] [CrossRef] [PubMed]
  12. Health Canada. Compendium of Data Tables for Usual Intakes of Energy, Nutrients & Other Dietary Components from Food (2004 and 2015 CCHS-Nutrition) derived from Statistics Canada’s 2004 and 2015 Canadian Community Health Survey, Nutrition, Share file, Ottawa. 2019; in progress. [Google Scholar]
  13. Otten, J.; Hellwig, J.; Meyers, L. Dietary DRI Reference Intakes; National Academies Press: Washington, DC, USA, 2006. [Google Scholar]
  14. Huybrechts, I.; de Bacquer, D.; Cox, B.; Temme, E.H.M.; van Oyen, H.; de Backer, G.; de Henauw, S. Variation in energy and nutrient intakes among pre-school children: Implications for study design. Eur. J. Public Health 2008, 18, 509–516. [Google Scholar] [CrossRef] [PubMed]
  15. Black, J.L.; Billette, J.-M. Do Canadians meet Canada Food Guide’s recommendations for fruits and vegetables? Appl. Physiol. Nutr. Metab. 2013, 38, 234–242. [Google Scholar] [CrossRef] [PubMed]
  16. Black, J.L.; Billette, J.-M. Fast food intake in Canada: Differences among Canadians with diverse demographic, socio-economic and lifestyle characteristics. Can. J. Public Health 2015, 106, e52–e58. [Google Scholar] [CrossRef] [PubMed]
  17. Garriguet, D. Beverage consumption of Canadian adults. Health Rep. 2008, 19, 23–29. [Google Scholar] [PubMed]
  18. Kirkpatrick, S.I.; Dodd, K.W.; Parsons, R.; Ng, C.; Garriguet, D.; Tarasuk, V. Household Food Insecurity is a Stronger Marker of Adequacy of Nutrient Intakes among Canadian Compared to American Youth and Adults. J. Nutr. 2015, 145, 1596–1603. [Google Scholar] [CrossRef] [PubMed]
  19. McLaren, L.; Heidinger, S.; Dutton, D.J.; Tarasuk, V.; Campbell, N.R. A repeated cross-sectional study of socio-economic inequities in dietary sodium consumption among Canadian adults: Implications for national sodium reduction strategies. Int. J. Equity Health 2014, 13, 1–14. [Google Scholar] [CrossRef] [PubMed]
  20. Freedman, L.S.; Guenther, P.M.; Dodd, K.W.; Krebs-Smith, S.M.; Midthune, D. The population distribution of ratios of usual intakes of dietary components that are consumed every day can be estimated from repeated 24-hour recalls. J. Nutr. 2010, 140, 111–116. [Google Scholar] [CrossRef] [PubMed]
  21. Zhang, S.; Krebs-Smith, S.M.; Midthune, D.; Perez, A.; Buckman, D.; Kipnis, V.; Freedman, L.S.; Dodd, K.W.; Carroll, R.J. Fitting a bivariate measurement error model for episodically consumed dietary components. Int. J. Biostat. 2011, 7, 1. [Google Scholar] [CrossRef]
  22. Zhang, S.; Midthune, D.; Guenther, P.M.; Krebs-Smith, S.M.; Kipnis, V.; Dodd, K.W.; Buckman, D.W.; Tooze, J.A.; Freedman, L.; Carroll, R.J. A new multivariate measurement error model with zero-inflated dietary data, and its application to dietary assessment. Ann. Appl. Stat. 2011, 5, 1456–1487. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Comparison of estimates of usual intake distribution of dietary folate equivalents (DFE) (μg/day) between 2004 and 2015 for (a) females 9–13 years old; and (b) females 51–70 years old.
Figure 1. Comparison of estimates of usual intake distribution of dietary folate equivalents (DFE) (μg/day) between 2004 and 2015 for (a) females 9–13 years old; and (b) females 51–70 years old.
Nutrients 11 01908 g001
Figure 2. Comparison of estimates of usual intake distribution of DFE (μg/day) between pooled versus the stratified approach for (a) females 9–13 years old; and (b) females 19–30 years old. Data Sources: Statistics Canada, 2004 and 2015 Canadian Community Health Survey—Nutrition Share files.
Figure 2. Comparison of estimates of usual intake distribution of DFE (μg/day) between pooled versus the stratified approach for (a) females 9–13 years old; and (b) females 19–30 years old. Data Sources: Statistics Canada, 2004 and 2015 Canadian Community Health Survey—Nutrition Share files.
Nutrients 11 01908 g002
Table 1. Comparison of distributional estimates of dietary folate equivalents (μg/day) by the National Academies Dietary Reference Intakes (DRI) groups—National Cancer Institute (NCI) Method—Amount Model.
Table 1. Comparison of distributional estimates of dietary folate equivalents (μg/day) by the National Academies Dietary Reference Intakes (DRI) groups—National Cancer Institute (NCI) Method—Amount Model.
DRI Age/Sex GroupMethodRatio of Within/Between2015 Usual Intake Distribution
MeanPercentiles% < EAR
5th50th90th95th
Children 1–3 yearsPooled3.6281.7174.1275.8377.2408.5<3
Stratified1.6281144273.4403.8445.4F
% Diff1 0.217.30.97.19
Children 4–8 yearsPooled3.6401.1265.2394.6520.2560.4<3
Stratified5.5398.8284.3392.9500.1533.90
% Diff 0.67.20.43.94.7
Females 9–13 yearsPooled2.5421.9249.9410.1578.7634.25.01 E
Stratified6.5438.2318.5433.5540.4574.3<3
% Diff 3.927.55.76.69.4
Females 14–18 yearsPooled2.5412.7244.9400.8564.8620.124.87
Stratified1.9412.2219.1398.9588.5649.628.41
% Diff 0.110.50.54.24.8
Females 19–30 yearsPooled2.5399.6233.4387.4550.4607.325.95
Stratified2.1407.3226.7394.4572.5632.325.87
% Diff 1.92.91.844.1
Females 31–50 yearsPooled2.5401234.9389.5551.2604.125.11
Stratified3.5407.5255.8397.8545.2593.620.37 E
% Diff 1.68.92.11.11.7
Females 51–70 yearsPooled2.5369.8214358.2512.6563.535.05
Stratified1.9358.9201.9347.7502.3555.539.55
% Diff 2.95.72.921.4
Female 71+ yearsPooled2.5339.7193.1329.3473.1522.746.29
Stratified1.8332.2193.7323.3458501.248.52
% Diff 2.20.31.83.24.1
Data Sources: Statistics Canada, 2004 and 2015 Canadian Community Health Survey—Nutrition Share files. Symbol Legend: E Data with a coefficient of variation (CV) from 16.6% to 33.3%; interpret with caution; F Data with a coefficient of variation (CV) greater than 33.3% with a 95% confidence interval not entirely between 0 and 3%; suppressed due to extreme sampling variability; <3 Data with a coefficient of variation (CV) greater than 33.3% with a 95% confidence interval entirely between 0 and 3%; interpret with caution; 1 Absolute percent difference relative to Pooled Method computed for means and percentiles (e.g., | ( Pooled   Mean Stratified   Mean ) | 100 Pooled   Mean ). EAR, estimated average requirement.
Table 2. Comparison of distributional estimates of red meats (gram) by DRI groups—NCI Method —Two Part Model—Uncorrelated.
Table 2. Comparison of distributional estimates of red meats (gram) by DRI groups—NCI Method —Two Part Model—Uncorrelated.
DRI Age/Sex GroupMethodRatio of Within/BetweenMeanPercentiles
5th50th90th95th
Children 2–3 yearsPooled 18.117.55.6 E15.730.335.4
Stratified 27.416.34.2 E14.629.134.4
% Diff4 6.724.27.13.72.9
Children 4–8 yearsPooled8.124.58.2 E22.241.348
Stratified 27.422.76.36 E20.739.546.1
% Diff 7.423.97.24.54
Females 9–13 yearsPooled8.127.79.5E25.546.353.3
Stratified6.129.9F27.151.660.3 E
% Diff 7.6F611.413.1
Females 14–18 yearsPooled8.128.79.3 E2649.157.2
Stratified323.8F20.144.954.7
% Diff 17.2F22.98.64.3
Females 19–30 yearsPooled7.331.69.5 E28.854.763.5
Stratified 37.832.48.0 E2958.668.5
% Diff 2.515.40.77.17.8
Females 31–50 yearsPooled7.335.311.1 E32.459.969.1
Stratified 37.835.99.3 E32.863.373.2
% Diff 1.616.61.25.55.9
Females 51–70 yearsPooled7.338.512.5 E35.664.674.6
Stratified5.536.816.4 E34.557.165.1
% Diff 4.530.83.211.612.7
Females 71+ yearsPooled7.334.410.9 E31.858.167
Stratified331.3F27.258.369.9
% Diff 9.2F14.40.34.3
Data Sources: Statistics Canada, 2004 and 2015 Canadian Community Health Survey—Nutrition Share files. Symbol Legend: E Data with a coefficient of variation (CV) from 16.6% to 33.3%; interpret with caution; F Data with a coefficient of variation (CV) greater than 33.3%; suppressed due to extreme sampling variability; 1 Pooling strategy: Children and adolescents < 19; Males 19+ and Females 19+; 2 For children 2–3, a stratified analysis yielded a within-between variance ratio >10. So, a pooled estimate of 2–3 and 4–8 years is provided; 3 For females 19–30, a stratified analysis yielded a within-between variance ratio >10. So, a pooled estimate of females 19–30 and 19–50 is provided; 4 Absolute percent difference relative to pooling strategy computed for means and percentiles (e.g., | ( Pooled   Mean Stratified   Mean ) | 100 Pooled   Mean ).
Table 3. Comparison of standard errors (SE) of distributional estimates of dietary folate equivalents (μg/day)—Canadian Community Health Survey (CCHS) 2015—Nutrition.
Table 3. Comparison of standard errors (SE) of distributional estimates of dietary folate equivalents (μg/day)—Canadian Community Health Survey (CCHS) 2015—Nutrition.
DRI Age/Sex GroupNo. of Bootstrap ReplicatesPooledStratified
SE of MeanSE of 5th PercentileSE of 95th PercentileSE of MeanSE of 5th PercentileSE of 95th Percentile
Children
1–3 years
2506.914.217.58.512.516.3
5006.514.618.58.112.317
Children
4–8 years
2507.815.924.18.724.235.5
5007.916.725.38.924.635
Females
9–13 years
2506.68.114.410.321.627.7
5006.88.314.610.222.228.2
Females
14–18 years
2507.48.415.311.91528.7
5007.78.515.711.714.529.4
Females
19–30 years
2509.38.818.313.816.136.1
5008.88.417.813.615.934.9
Females
31–50 years
2506.98.114.110.220.935
5006.78.1149.920.834.7
Females
51–70 years
2504.77.311.16.311.418.2
5004.97.511.26.311.517.9
Females
71+ years
2504.36.710.75.78.415.3
5004.66.711.168.514.7
Data Sources: Statistics Canada, 2004 and 2015 Canadian Community Health Survey—Nutrition Share files.

Share and Cite

MDPI and ACS Style

Davis, K.A.; Gonzalez, A.; Loukine, L.; Qiao, C.; Sadeghpour, A.; Vigneault, M.; Wang, K.C.; Ibañez, D. Early Experience Analyzing Dietary Intake Data from the Canadian Community Health Survey—Nutrition Using the National Cancer Institute (NCI) Method. Nutrients 2019, 11, 1908. https://doi.org/10.3390/nu11081908

AMA Style

Davis KA, Gonzalez A, Loukine L, Qiao C, Sadeghpour A, Vigneault M, Wang KC, Ibañez D. Early Experience Analyzing Dietary Intake Data from the Canadian Community Health Survey—Nutrition Using the National Cancer Institute (NCI) Method. Nutrients. 2019; 11(8):1908. https://doi.org/10.3390/nu11081908

Chicago/Turabian Style

Davis, Karelyn A., Alejandro Gonzalez, Lidia Loukine, Cunye Qiao, Alireza Sadeghpour, Michel Vigneault, Kuan Chiao Wang, and Dominique Ibañez. 2019. "Early Experience Analyzing Dietary Intake Data from the Canadian Community Health Survey—Nutrition Using the National Cancer Institute (NCI) Method" Nutrients 11, no. 8: 1908. https://doi.org/10.3390/nu11081908

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop