Application of Resolution Regression and Resolution Graphs in Evaluating Probability Forecasts Generated Using Binary Choice Models
Abstract
1. Introduction
- (1)
- to evaluate the out-of-sample prediction success of logit models using resolution regression and resolution graphs, and
- (2)
- to compare the use of resolution regression and resolution graphs with conventional prediction-success metrics that rely on predetermined cut-off values. To achieve these objectives4, we estimate logit models associated with purchases/non-purchases of non-alcoholic beverages made by U.S. households. We subsequently retrieve out-of-sample predicted probabilities of purchase and non-purchase. In turn, these predicted probabilities are then evaluated based on expectation-prediction success as well as using resolution regression and resolution graphs.
2. Materials and Methods
- = vector of index values generated for each observation through regression of the binary choice variable on the vector of explanatory variables;
- = vector of explanatory variables in the logistic regression (price of each product and other socio-economic and demographic variables in this study);
- = vector of estimated regression coefficients associated with each explanatory variable;
- = associated probability generated through assuming the index variable, Z, has a logistic distribution;
- = cumulative distribution function.
- = predicted purchase probability generated from the binary choice model (logit model in this study);
- = vector of predictor variables used in the binary choice model (logit model in this study);
- = intercept term;
- = observed outcome index (zero or one) associated with predicted probabilities of the binary choice model;
- = estimated parameter associated with the outcome index;
- = error term of the resolution regression. This error term is distributed with a non-linear Beta distribution.
3. Results
4. Discussion
5. Conclusions, Limitations, and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
| Isotonic Drinks | Regular Soft Drinks | Diet Soft Drinks | |||||||
| Coef. | Std. Err. | p > |z| | Coef. | Std. Err. | p > |z| | Coef. | Std. Err. | p > |z| | |
| Price | 0.5659 | 0.1703 | 0.001 | −0.1336 | 0.1238 | 0.280 | 0.2075 | 0.1291 | 0.108 |
| price_2 | −0.0620 | 0.0258 | 0.016 | −0.0035 | 0.0129 | 0.783 | −0.0400 | 0.0191 | 0.036 |
| pov185 | −0.0831 | 0.1301 | 0.523 | 0.3084 | 0.1896 | 0.104 | −0.3136 | 0.1083 | 0.004 |
| agehh2529 | −2.0961 | 0.8768 | 0.017 | 1.3408 | 1.2209 | 0.272 | −0.2959 | 0.8190 | 0.718 |
| agehh3034 | −2.0345 | 0.8570 | 0.018 | 1.5795 | 1.1544 | 0.171 | 0.2469 | 0.8029 | 0.758 |
| agehh3544 | −1.8535 | 0.8476 | 0.029 | 1.2769 | 1.1158 | 0.252 | 0.4807 | 0.7944 | 0.545 |
| agehh4554 | −2.2248 | 0.8467 | 0.009 | 0.7770 | 1.1098 | 0.484 | 0.2789 | 0.7927 | 0.725 |
| agehh5564 | −2.4878 | 0.8493 | 0.003 | 0.4738 | 1.1112 | 0.670 | 0.5285 | 0.7943 | 0.506 |
| agehhgt64 | −2.8128 | 0.8548 | 0.001 | −0.1427 | 1.1167 | 0.898 | 0.2803 | 0.7977 | 0.725 |
| emphhpt | −0.1266 | 0.1261 | 0.315 | 0.1332 | 0.1942 | 0.493 | −0.0266 | 0.1118 | 0.812 |
| emphhft | −0.2457 | 0.1072 | 0.022 | −0.2434 | 0.1608 | 0.130 | 0.0149 | 0.0953 | 0.875 |
| eduhhhs | −0.0739 | 0.2394 | 0.757 | 0.0937 | 0.3383 | 0.782 | 0.0652 | 0.2015 | 0.746 |
| eduhhu | −0.2746 | 0.2342 | 0.241 | −0.0485 | 0.3268 | 0.882 | 0.1820 | 0.1965 | 0.354 |
| eduhhpc | −0.4562 | 0.2672 | 0.088 | −0.6164 | 0.3504 | 0.079 | 0.1891 | 0.2213 | 0.393 |
| reg_central | 0.2124 | 0.1333 | 0.111 | 0.0648 | 0.1754 | 0.712 | 0.2067 | 0.1140 | 0.07 |
| reg_south | 0.3229 | 0.1137 | 0.005 | 0.0991 | 0.1539 | 0.519 | −0.0128 | 0.0961 | 0.894 |
| reg_west | 0.1574 | 0.1331 | 0.237 | 0.0404 | 0.1700 | 0.812 | −0.0432 | 0.1103 | 0.695 |
| race_black | −0.2448 | 0.1338 | 0.067 | 0.6566 | 0.2029 | 0.001 | −1.0445 | 0.1058 | 0.000 |
| race_oriental | −0.4874 | 0.2742 | 0.075 | −0.3890 | 0.3388 | 0.251 | −0.7531 | 0.2178 | 0.001 |
| race_other | −0.3464 | 0.2047 | 0.091 | 0.4517 | 0.3807 | 0.235 | −0.3590 | 0.1823 | 0.049 |
| hisp_yes | 0.5823 | 0.1697 | 0.001 | 0.3200 | 0.3216 | 0.320 | −0.0165 | 0.1624 | 0.919 |
| agepclt6_only | 0.5880 | 0.2052 | 0.004 | −0.0428 | 0.2013 | 0.831 | |||
| agepc6_12only | 0.8456 | 0.1613 | 0.000 | 0.2768 | 0.1676 | 0.099 | |||
| agepc13_17only | 0.8644 | 0.1443 | 0.000 | 0.3591 | 0.1517 | 0.018 | |||
| agepclt6_6_12only | 0.6524 | 0.2261 | 0.004 | 0.0336 | 0.2335 | 0.885 | |||
| agepclt6_13_17only | 0.9476 | 0.4229 | 0.025 | −0.0680 | 0.4298 | 0.874 | |||
| agepc6_12and13_17only | 0.6141 | 0.1819 | 0.001 | −0.0672 | 0.1810 | 0.71 | |||
| agepclt6_6_12and13_17 | 0.1304 | 0.3921 | 0.739 | 0.1052 | 0.4004 | 0.793 | |||
| fhonly | −0.2975 | 0.1064 | 0.005 | −0.9039 | 0.1309 | 0.000 | −0.2864 | 0.0860 | 0.001 |
| mhonly | −0.3086 | 0.1587 | 0.052 | −1.1402 | 0.1631 | 0.000 | −0.9961 | 0.1196 | 0.000 |
| _constant | 0.0843 | 0.9066 | 0.926 | 2.4588 | 1.1612 | 0.034 | 0.2491 | 0.8350 | 0.765 |
| Number of Observations | 3820 | 3820 | 3820 | ||||||
| McFadden R2 | 0.0754 | 0.0925 | 0.0538 | ||||||
| Log Likelihood | −1867.50 | −1114.98 | −2333.56 | ||||||
| High-Fat Milk | Low-Fat Milk | Fruit Drinks | |||||||
| Coef. | Coef. | p > |z| | Coef. | Std. Err. | p > |z| | Coef. | Std. Err. | p > |z| | |
| price | −0.3177 | −0.3177 | 0.001 | 0.7734 | 0.1378 | 0.000 | 0.1828 | 0.0986 | 0.064 |
| price_2 | 0.0285 | 0.0285 | 0.016 | −0.0932 | 0.0212 | 0.000 | −0.0141 | 0.0117 | 0.229 |
| pov185 | 0.2431 | 0.2431 | 0.523 | −0.3362 | 0.1058 | 0.001 | 0.1430 | 0.1251 | 0.253 |
| agehh2529 | 0.2620 | 0.2620 | 0.017 | −0.3687 | 0.9085 | 0.685 | 0.9060 | 0.9316 | 0.331 |
| agehh3034 | 1.2414 | 1.2414 | 0.018 | −0.5077 | 0.8904 | 0.569 | 1.1157 | 0.8992 | 0.215 |
| agehh3544 | 0.9501 | 0.9501 | 0.029 | −0.4807 | 0.8824 | 0.586 | 0.8837 | 0.8784 | 0.314 |
| agehh4554 | 0.9357 | 0.9357 | 0.009 | −0.5101 | 0.8811 | 0.563 | 0.3185 | 0.8754 | 0.716 |
| agehh5564 | 1.0403 | 1.0403 | 0.003 | −0.4791 | 0.8823 | 0.587 | −0.1568 | 0.8761 | 0.858 |
| agehhgt64 | 0.9629 | 0.9629 | 0.001 | −0.2143 | 0.8855 | 0.809 | −0.5464 | 0.8797 | 0.534 |
| emphhpt | 0.2161 | 0.2161 | 0.315 | −0.0791 | 0.1085 | 0.466 | 0.1905 | 0.1291 | 0.140 |
| emphhft | 0.2629 | 0.2629 | 0.022 | −0.1247 | 0.0920 | 0.175 | −0.1608 | 0.1069 | 0.132 |
| eduhhhs | 0.0172 | 0.0172 | 0.757 | 0.5064 | 0.1972 | 0.010 | 0.1636 | 0.2228 | 0.463 |
| eduhhu | −0.3198 | −0.3198 | 0.241 | 0.7174 | 0.1927 | 0.000 | 0.0645 | 0.2168 | 0.766 |
| eduhhpc | −0.6136 | −0.6136 | 0.088 | 0.9201 | 0.2181 | 0.000 | −0.2107 | 0.2420 | 0.384 |
| reg_central | 0.2165 | 0.2165 | 0.111 | −0.1621 | 0.1099 | 0.140 | −0.0048 | 0.1238 | 0.969 |
| reg_south | 0.2485 | 0.2485 | 0.005 | −0.2899 | 0.0946 | 0.002 | −0.0390 | 0.1081 | 0.718 |
| reg_west | −0.0294 | −0.0294 | 0.237 | −0.2657 | 0.1091 | 0.015 | −0.1240 | 0.1224 | 0.311 |
| race_black | 0.2237 | 0.2237 | 0.067 | −0.8052 | 0.1053 | 0.000 | 1.0953 | 0.1535 | 0.000 |
| race_oriental | −0.0749 | −0.0749 | 0.075 | −0.3283 | 0.2200 | 0.136 | 0.1695 | 0.2752 | 0.538 |
| race_other | −0.1828 | −0.1828 | 0.091 | −0.3934 | 0.1777 | 0.027 | 0.2654 | 0.2310 | 0.251 |
| hisp_yes | 0.5899 | 0.5899 | 0.001 | −0.0579 | 0.1571 | 0.713 | 0.3246 | 0.1998 | 0.104 |
| agepclt6_only | 2.1597 | 2.1597 | 0.004 | 0.2563 | 0.2039 | 0.209 | |||
| agepc6_12only | 1.0907 | 1.0907 | 0.000 | 0.0637 | 0.1579 | 0.687 | |||
| agepc13_17only | 0.4222 | 0.4222 | 0.000 | 0.1760 | 0.1407 | 0.211 | |||
| agepclt6_6_12only | 1.1356 | 1.1356 | 0.004 | 0.2517 | 0.2325 | 0.279 | |||
| agepclt6_13_17only | 0.9242 | 0.9242 | 0.025 | −0.2017 | 0.4221 | 0.633 | |||
| agepc6_12and13_17only | 0.3981 | 0.3981 | 0.001 | 0.1064 | 0.1771 | 0.548 | |||
| agepclt6_6_12and13_17 | 0.5935 | 0.5935 | 0.739 | 0.4642 | 0.3997 | 0.246 | |||
| fhonly | −0.4079 | −0.4079 | 0.005 | −0.2874 | 0.0837 | 0.001 | −0.4075 | 0.0929 | 0.000 |
| mhonly | −0.7848 | −0.7848 | 0.052 | −0.5944 | 0.1192 | 0.000 | −1.0045 | 0.1236 | 0.000 |
| _constant | 1.0970 | 1.0970 | 0.926 | −0.4634 | 0.9183 | 0.614 | 0.7146 | 0.9107 | 0.433 |
| Number of Observations | 3820 | 3820 | 3820 | 3820 | |||||
| McFadden R2 | 0.0615 | 0.0615 | 0.0455 | 0.0862 | |||||
| Log Likelihood | −1706.05 | −1706.05 | −2437.65 | −1961.99 | |||||
| Fruit Juice | Bottled Water | ||||||||
| Coef. | Std. Err. | p > |z| | Coef. | Std. Err. | p > |z| | ||||
| price | 0.9083 | 0.1458 | 0 | 0.2251 | 0.0967 | 0.02 | |||
| price_2 | −0.0568 | 0.0142 | 0 | −0.0157 | 0.012 | 0.19 | |||
| pov185 | −0.1683 | 0.1881 | 0.371 | −0.4521 | 0.1108 | 0 | |||
| agehh2529 | −13.121 | 744.3962 | 0.986 | −0.5998 | 1.1273 | 0.595 | |||
| agehh3034 | −11.9996 | 744.3962 | 0.987 | −0.2816 | 1.1114 | 0.8 | |||
| agehh3544 | −13.0856 | 744.3961 | 0.986 | −0.6098 | 1.0996 | 0.579 | |||
| agehh4554 | −13.1990 | 744.3961 | 0.986 | −0.8437 | 1.0979 | 0.442 | |||
| agehh5564 | −13.1709 | 744.3961 | 0.986 | −0.9895 | 1.0987 | 0.368 | |||
| agehhgt64 | −12.8456 | 744.3961 | 0.986 | −1.4964 | 1.101 | 0.174 | |||
| emphhpt | 0.2945 | 0.239 | 0.218 | 0.1689 | 0.1165 | 0.147 | |||
| emphhft | −0.3136 | 0.1771 | 0.077 | 0.0948 | 0.0992 | 0.339 | |||
| eduhhhs | 0.3233 | 0.3133 | 0.302 | −0.1668 | 0.2128 | 0.433 | |||
| eduhhu | 0.7058 | 0.3062 | 0.021 | −0.1832 | 0.208 | 0.378 | |||
| eduhhpc | 0.6285 | 0.3589 | 0.08 | −0.2899 | 0.2334 | 0.214 | |||
| reg_central | −0.0387 | 0.224 | 0.863 | −0.1941 | 0.115 | 0.092 | |||
| reg_south | −0.3086 | 0.1917 | 0.107 | −0.1032 | 0.1009 | 0.307 | |||
| reg_west | −0.7758 | 0.2062 | 0 | 0.0894 | 0.1181 | 0.449 | |||
| race_black | 0.845 | 0.2575 | 0.001 | 0.3303 | 0.1211 | 0.006 | |||
| race_oriental | 0.3293 | 0.4855 | 0.498 | 0.1079 | 0.2617 | 0.68 | |||
| race_other | −0.3499 | 0.3239 | 0.28 | 0.4361 | 0.2171 | 0.045 | |||
| hisp_yes | 0.3386 | 0.3241 | 0.296 | 0.142 | 0.183 | 0.438 | |||
| agepclt6_only | 0.1246 | 0.2332 | 0.593 | ||||||
| agepc6_12only | 0.3047 | 0.1922 | 0.113 | ||||||
| agepc13_17only | 0.1678 | 0.1584 | 0.289 | ||||||
| agepclt6_6_12only | −0.0782 | 0.2547 | 0.759 | ||||||
| agepclt6_13_17only | 0.3251 | 0.5574 | 0.56 | ||||||
| agepc6_12and13_17only | 0.3115 | 0.2147 | 0.147 | ||||||
| agepclt6_6_12and13_17 | 0.5746 | 0.5035 | 0.254 | ||||||
| fhonly | −0.6982 | 0.1572 | 0 | −0.0720 | 0.0895 | 0.421 | |||
| mhonly | −1.3657 | 0.1849 | 0 | −0.7892 | 0.122 | 0 | |||
| _constant | 14.1394 | 744.3961 | 0.985 | 1.6086 | 1.122 | 0.152 | |||
| Number of Observations | 3820 | 3820 | |||||||
| McFadden R2 | 0.0864 | 0.0559 | |||||||
| Log Likelihood | −879.50 | −2187.69 | |||||||
| Coffee | Tea | ||||||||
| Coef. | Std. Err. | p > |z| | Coef. | Std. Err. | p > |z| | ||||
| price | −0.9775 | 0.1161 | 0.000 | 0.1480 | 0.0949 | 0.119 | |||
| price_2 | 0.0491 | 0.0148 | 0.001 | −0.0101 | 0.0112 | 0.368 | |||
| pov185 | −0.3628 | 0.1257 | 0.004 | 0.0616 | 0.1178 | 0.601 | |||
| agehh2529 | 0.4237 | 0.8358 | 0.612 | −0.1606 | 0.9200 | 0.861 | |||
| agehh3034 | 0.5214 | 0.8179 | 0.524 | 0.1605 | 0.9029 | 0.859 | |||
| agehh3544 | 1.2749 | 0.8096 | 0.115 | 0.1724 | 0.8932 | 0.847 | |||
| agehh4554 | 1.6393 | 0.8085 | 0.043 | 0.3366 | 0.8916 | 0.706 | |||
| agehh5564 | 1.9975 | 0.8111 | 0.014 | 0.1239 | 0.8926 | 0.890 | |||
| agehhgt64 | 2.3876 | 0.8175 | 0.003 | 0.0574 | 0.8964 | 0.949 | |||
| emphhpt | −0.0661 | 0.1281 | 0.606 | −0.0080 | 0.1214 | 0.947 | |||
| emphhft | −0.0031 | 0.1091 | 0.977 | −0.3663 | 0.1015 | 0.000 | |||
| eduhhhs | −0.5881 | 0.2945 | 0.046 | 0.2429 | 0.2088 | 0.245 | |||
| eduhhu | −0.6844 | 0.2875 | 0.017 | 0.3716 | 0.2034 | 0.068 | |||
| eduhhpc | −0.6825 | 0.3084 | 0.027 | 0.3356 | 0.2295 | 0.144 | |||
| reg_central | −0.3497 | 0.1289 | 0.007 | −0.9535 | 0.1206 | 0.000 | |||
| reg_south | 0.0480 | 0.1130 | 0.671 | −0.3878 | 0.1105 | 0.000 | |||
| reg_west | −0.3768 | 0.1252 | 0.003 | −0.7964 | 0.1211 | 0.000 | |||
| race_black | −0.6734 | 0.1151 | 0.000 | −0.0020 | 0.1169 | 0.986 | |||
| race_oriental | 0.0185 | 0.2454 | 0.94 | −0.2995 | 0.2268 | 0.187 | |||
| race_other | −0.1813 | 0.2144 | 0.398 | 0.1562 | 0.1964 | 0.426 | |||
| hisp_yes | 0.4499 | 0.1959 | 0.022 | −0.2554 | 0.1676 | 0.128 | |||
| agepclt6_only | −0.0085 | 0.2133 | 0.968 | 0.5069 | 0.2442 | 0.038 | |||
| agepc6_12only | 0.2355 | 0.1787 | 0.188 | 0.1340 | 0.1765 | 0.448 | |||
| agepc13_17only | 0.0344 | 0.1575 | 0.827 | 0.2465 | 0.1594 | 0.122 | |||
| agepclt6_6_12only | 0.2818 | 0.2541 | 0.267 | −0.1853 | 0.2408 | 0.442 | |||
| agepclt6_13_17only | −0.1492 | 0.4462 | 0.738 | 0.1725 | 0.5116 | 0.736 | |||
| agepc6_12and13_17only | −0.3532 | 0.1881 | 0.060 | −0.0493 | 0.1921 | 0.797 | |||
| agepclt6_6_12and13_17 | 0.9204 | 0.5173 | 0.075 | 0.4005 | 0.4706 | 0.395 | |||
| fhonly | −0.6690 | 0.0982 | 0.000 | −0.2944 | 0.0905 | 0.001 | |||
| mhonly | −1.0535 | 0.1322 | 0.000 | −0.8637 | 0.1216 | 0.000 | |||
| _constant | 2.7379 | 0.8684 | 0.002 | 1.0259 | 0.9240 | 0.267 | |||
| Number of Observations | 3820 | 3820 | |||||||
| McFadden R2 | 0.1415 | 0.0437 | |||||||
| Log Likelihood | −1892.47 | −2163.69 | |||||||
| Beverage | Area Under ROC Curve | Brier Score | Kullback–Leibler Information Criteria |
|---|---|---|---|
| Isotonic drinks | 0.67 | 0.15 | −0.47 |
| Regular soft drinks | 0.69 | 0.08 | −0.28 |
| Diet soft drinks | 0.64 | 0.22 | −0.62 |
| High-fat milk | 0.67 | 0.14 | −0.43 |
| Low-fat milk | 0.66 | 0.22 | −0.62 |
| Fruit drinks | 0.68 | 0.17 | −0.50 |
| Fruit juices | 0.74 | 0.06 | −0.21 |
| Bottled water | 0.63 | 0.20 | −0.58 |
| Coffee | 0.73 | 0.18 | −0.53 |
| Tea | 0.63 | 0.19 | −0.56 |
| Beverage | Yates Decomposition of the Brier Score | ||||
|---|---|---|---|---|---|
| Variance of Outcome Index | Minimum Variance Probabilities | Scatter | Bias | Covariance of Outcome Index and Probability | |
| Isotonic drinks | 0.16 | 0.0007 | 0.01 | 0.00048 | 0.02 |
| Regular soft drinks | 0.08 | 0.0001 | 0.004 | 0.000062 | 0.007 |
| Diet soft drinks | 0.23 | 0.0007 | 0.01 | 0.000025 | 0.02 |
| High-fat milk | 0.14 | 0.0003 | 0.008 | 0.00014 | 0.01 |
| Low-fat milk | 0.23 | 0.0009 | 0.01 | 0.00086 | 0.03 |
| Fruit drinks | 0.18 | 0.0007 | 0.01 | 0.00014 | 0.02 |
| Fruit juices | 0.06 | 0.0001 | 0.003 | 0.000038 | 0.005 |
| Bottled water | 0.21 | 0.0006 | 0.01 | 0.000034 | 0.02 |
| Coffee | 0.20 | 0.0040 | 0.03 | 0.00013 | 0.06 |
| Tea | 0.20 | 0.0003 | 0.009 | 0.000031 | 0.02 |
| 1 | A wide range of qualitative choice models are available, such as probit, logit, mixed logit, ordered probit, generalized extreme value, nested logit, multinomial probit, multinomial logit, etc. | ||||||||||||||||||||||||||||||||||||||||||
| 2 | The probit and logit models yield similar results in the case of binary choice models. Additionally, since the logistic density function closely resembles the t-distribution with seven degrees of freedom (Hanushek & Jackson, 1977), the logit and probit formulations are quite similar. The only difference is that the logistic density has a slightly heavier tail than the standard normal density. For probit models, the error term is assumed to follow a standard normal distribution, and for logit models, the error term is assumed to follow a logistic distribution. | ||||||||||||||||||||||||||||||||||||||||||
| 3 | Receiver Operating Characteristics (ROC) curves (Hsieh & Turnbull, 1996; Reiser & Faraggi, 1997) offer relief for this issue in classifying probabilities by calculating and plotting probability outcomes based on a wide range of cut-off probabilities. Alternatively, a log-likelihood function approach, which selects models closest to the true data-generating process based on the Kullback–Leibler Information Criterion, has been used to assess the performance of models generating probabilities (Stone, 1977; Shao, 1993; Norwood et al., 2004). Additionally, other techniques such as calibration, calibration graphs, and scoring rules such as the Brier Score (Brier, 1950) and the Yates partition of the Brier Score (Yates, 1982) have been used to measure the accuracy of these types of predictions (Zellner et al., 1991). In Appendix A, we provide two tables associated with these metrics applied to out-of-sample probabilities generated in this study. In this way, we provide a more inclusive picture of additional methods available to evaluate forecast performance of binary choice models other than the measures proposed in this study (resolution regression and resolution graphs). | ||||||||||||||||||||||||||||||||||||||||||
| 4 | Although we rely on the use of binary logit models, the analysis can nevertheless be carried out using binary probit models. Because no implicit ordering is associated with the decision to purchase non-alcoholic beverages, we rely on the use of binary logit models in this analysis. Also, we performed the same analysis using binary probit models using the same data. Since the probit model analysis resulted in very similar outcomes/results, for brevity, we do not produce the probit model results here. | ||||||||||||||||||||||||||||||||||||||||||
| 5 | Alternatives to selecting one cut-off probability value to correctly classify probabilities are the receiver operating characteristic curve (ROC) and the cumulative accuracy profile (CAP) chart. The use of ROCs and CAP charts is beyond the scope of this analysis and is reserved for future research. The theoretical framework for ROC and CAP charts can be found in Mann and Whitney (1947) and Bamber (1975). | ||||||||||||||||||||||||||||||||||||||||||
| 6 | The use of these data is for demonstration purposes only in this analysis. No claims are made regarding the current market behavior of households in the decision to purchase or not to purchase non-alcoholic beverages. | ||||||||||||||||||||||||||||||||||||||||||
| 7 | Alternatively, to impute prices, some studies used auxiliary regressions based on observed unit values as a function of household income, household size, and region. | ||||||||||||||||||||||||||||||||||||||||||
| 8 | The Pearson correlation coefficients of the weighted average unit value and the observed unit values of each non-alcoholic beverage are shown below. The null hypothesis of “no correlation” is soundly rejected. p-values are given in parentheses.
| ||||||||||||||||||||||||||||||||||||||||||
References
- Bamber, D. (1975). The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology, 12(4), 387–415. [Google Scholar] [CrossRef]
- Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1–3. [Google Scholar] [CrossRef]
- Dawid, A. P. (1986). Probability forecasting: Encyclopedia of statistical sciences (vol. 7). Wiley. [Google Scholar]
- Greene, W. H. (2012). Econometric analysis (7th ed.). Pearson. [Google Scholar]
- Hanushek, J., & Jackson, E. (1977). Statistical methods for social scientists. Academic Press. [Google Scholar]
- Hill, H., & Lynchehaun, F. (2002). Organic milk: Attitudes and consumption patterns. British Food Journal, 104(7), 526–542. [Google Scholar] [CrossRef]
- Hsieh, F., & Turnbull, B. W. (1996). Nonparametric and semiparametric estimation of the receiver operating characteristic curve. Annals of Statistics, 24(1), 25–40. [Google Scholar] [CrossRef]
- Kennedy, P. (2003). Limited dependent variables. In A guide to econometrics (5th ed.). MIT Press. [Google Scholar]
- Maddala, G. S. (1983). Limited dependent and qualitative variables in econometrics. Cambridge University Press. [Google Scholar]
- Mann, H. B., & Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics, 18(1), 50–60. [Google Scholar] [CrossRef]
- McFadden, D. (1984). Econometric analysis of qualitative response models. In Z. Griliches, & M. Intriligator (Eds.), Handbook of econometrics (vol. 2, pp. 1395–1457). Elsevier. [Google Scholar]
- Norwood, F. B., Lusk, J. L., & Brorsen, B. W. (2004). Model selection for discrete dependent variables: Better statistics for better steaks. Journal of Agricultural and Resource Economics, 29(3), 404–419. [Google Scholar]
- Pindyck, R. S., & Rubinfield, D. L. (1998). Econometric models and economic forecasts (4th ed.). McGraw-Hill. [Google Scholar]
- Reiser, B., & Faraggi, D. (1997). Confidence intervals for the generalized ROC criterion. Biometrics, 53, 644–652. [Google Scholar] [CrossRef] [PubMed]
- Sanders, F. (1963). On subjective probability forecasting. Journal of Applied Meteorology, 2(2), 191–201. [Google Scholar] [CrossRef]
- SAS Institute, Inc. (2022). SAS/STAT version 9.4, user guide. SAS Institute, Inc. [Google Scholar]
- Shao, J. (1993). Linear model selection by cross-validation. Journal of American Statistical Association, 88, 486–494. [Google Scholar] [CrossRef]
- Stock, J. H., & Watson, M. W. (2007). Introduction to econometrics (2nd ed.). Pearson Education Inc. [Google Scholar]
- Stone, M. (1977). An asymptotic equivalence of choice of model by cross-validation and Akaike’s criterion. Journal of Royal Statistical Society, Series B (Methodological), 39(1), 44–47. [Google Scholar] [CrossRef]
- Train, K. E. (2003). Discrete choice methods with simulation (1st ed.). Cambridge University Press. [Google Scholar]
- Yates. (1982). External correspondence: Decompositions of the maen probability score. Organizational Behavior and Human Performance, 30, 132–156. [Google Scholar] [CrossRef]
- Zellner, A., Hong, C., & Min, C. (1991). Forecasting turning points in international output growth rates using bayesian exponentially weighted autoregression, time-varying parameter, and pooling techniques. Journal of Econometrics, 49, 275–304. [Google Scholar] [CrossRef]


| Actual Outcome | |||
|---|---|---|---|
| 0 | 1 | ||
| Predicted Outcome | 0 | True Negative | False Negative |
| 1 | False Positive | True Positive |
| 0.5 Cut-Off Value | Market Penetration Cut-Off Value | |||||
|---|---|---|---|---|---|---|
| Beverage Category | Sensitivity | Specificity | Sum of Sensitivity and Specificity | Sensitivity | Specificity | Sum of Sensitivity and Specificity |
| Isotonics | 0.06 | 0.98 | 1.04 | 0.63 | 0.62 | 1.25 |
| Regular Soft Drinks | 1.00 | 0.00 | 1.00 | 0.68 | 0.64 | 1.35 |
| Diet Soft Drinks | 0.90 | 0.19 | 1.09 | 0.73 | 0.46 | 1.19 |
| High-Fat Milk | 1.00 | 0.00 | 1.00 | 0.51 | 0.71 | 1.22 |
| Low-Fat Milk | 0.87 | 0.27 | 1.15 | 0.55 | 0.67 | 1.22 |
| Fruit Drinks | 0.99 | 0.03 | 1.02 | 0.58 | 0.67 | 1.25 |
| Fruit Juices | 1.00 | 0.00 | 1.00 | 0.60 | 0.66 | 1.26 |
| Bottled Water | 0.98 | 0.07 | 1.05 | 0.66 | 0.54 | 1.20 |
| Coffee | 0.93 | 0.27 | 1.20 | 0.71 | 0.60 | 1.31 |
| Tea | 0.98 | 0.06 | 1.04 | 0.58 | 0.60 | 1.18 |
| Beverage Category | Intercept | Slope | Chi-Squared Statistics Associated with the Joint Test | Mean Probability for Purchases of Non-Alcoholic Beverages, Outcome Index = 1 | Mean Probability for Non-Purchases of Non-Alcoholic Beverages, Outcome Index = 0 |
|---|---|---|---|---|---|
| Regular Soft Drinks | 1.8479 a (0.0338) b [0.0001] c | 0.4185 (0.0355) [0.0001] | 16,519.82 [0.0001] | 0.91 | 0.84 |
| Diet Soft Drinks | 0.4527 (0.0150) [0.0001] | 0.2356 (0.0177) [0.0001] | 1994.99 [0.0001] | 0.67 | 0.61 |
| High-Fat Milk | 1.2580 (0.0204) [0.0001] | 0.2869 (0.0232) [0.0001] | 6220.09 [0.0001] | 0.82 | 0.77 |
| Low-Fat Milk | 0.2726 (0.0135) [0.0001] | 0.2613 (0.0164) [0.0001] | 2959.70 [0.0001] | 0.63 | 0.57 |
| Fruit Drinks | 0.8761 (0.0169) [0.0001] | 0.3191 (0.0197) [0.0001] | 3039.86 [0.0001] | 0.77 | 0.68 |
| Fruit Juices | 2.0894 (0.0497) [0.0001] | 0.5715 (0.0514) [0.0001] | 17,882.33 [0.0001] | 0.93 | 0.89 |
| Bottled Water | 0.7128 (0.0147) [0.0001] | 0.2471 (0.0174) [0.0001] | 2343.81 [0.0001] | 0.72 | 0.66 |
| Coffee | 0.5086 (0.0255) [0.0001] | 0.6736 (0.0297) [0.0001] | 539.81 [0.0001} | 0.85 | 0.62 |
| Tea | 0.8160 (0.0146) [0.0001] | 0.1991 (0.0168) [0.0001] | 3127.16 [0.0001] | 0.73 | 0.69 |
| Isotonic Drinks | −1.3167 (0.0114) [0.0001] | 0.3487 (0.0258) [0.0001] | 20,539.25 [0.0001] | 0.29 | 0.20 |
| Demographic Characteristics | Categories |
|---|---|
| Age of Household Head | Less than 25 years |
| 25–29 years | |
| 30–34 years | |
| 35–44 years | |
| 45–54 years | |
| 55–64 years | |
| At least 65 years | |
| Employment Status of Household Head | Household head employed full-time |
| Household head employed part-time | |
| Household head not employed | |
| Education Level of Household | Less than high school |
| High school level | |
| Undergraduate level | |
| Post-college level | |
| Region | Midwest |
| South | |
| West | |
| East | |
| Race of Household | Black |
| Asian | |
| White | |
| Other | |
| Hispanic Ethnicity of the Household Head | Hispanic Yes |
| Hispanic No | |
| Age and Presence of Children | Less than 6-years-old |
| 6–12-years-old | |
| 13–17-years-old | |
| Less than 6-years-old and 6–12-years-old | |
| Less than 6-years-old and 13–17-years-old | |
| Less than 6-years-old, 6–12, and 13–17-years-old | |
| No children under 18 years | |
| Gender of the Household Head | Household head male only |
| Household head female only | |
| Households with both male and female | |
| Poverty Status of the Household | Above poverty line of 185% |
| Below poverty line of 185% |
| Demographic Characteristics | Categories | Percentage Sample A | Percentage Sample B | p-Value for the Balance Tests |
|---|---|---|---|---|
| Age of Household Head | 25–29 years | 2.28 | 2.57 | 0.40 |
| 30–34 years | 5.84 | 6.50 | 0.23 | |
| 35–44 years | 21.80 | 20.56 | 0.18 | |
| 45–54 years | 27.88 | 27.31 | 0.57 | |
| 55–64 years | 22.62 | 23.88 | 0.18 | |
| At least 65 years | 19.40 | 18.80 | 0.50 | |
| Employment Status of Household Head | Employed full-time | 45.97 | 44.88 | 0.33 |
| Employed part-time | 15.84 | 17.02 | 0.15 | |
| Education Level of Household | High school level | 25.24 | 23.25 | 0.04 |
| Undergraduate level | 60.50 | 62.00 | 0.18 | |
| Post-college level | 10.92 | 11.02 | 0.88 | |
| Region | West | 21.13 | 21.21 | 0.93 |
| Midwest | 19.24 | 17.99 | 0.15 | |
| South | 38.53 | 39.17 | 0.54 | |
| Race of Household | Black | 12.36 | 13.62 | 0.10 |
| Asian | 2.60 | 3.14 | 0.15 | |
| Other | 6.10 | 6.78 | 0.22 | |
| Hispanic Ethnicity of the Household Head | Hispanic Yes | 8.32 | 7.72 | 0.33 |
| Age and Presence of Children | Less than 6-years-old 6-12-years-old 13-17-years-old Less than 6-years-old and 6-12-years-old Less than 6-years-old and 13-17-years-old 6-12 and 13-years-old Less than 6-years-old, 6-12, and 13-17-years-old | 3.64 5.76 7.23 2.75 0.65 4.66 0.89 74.42 | 3.46 6.49 7.12 3.11 0.42 4.14 1.00 74.26 | 0.66 0.19 0.86 0.34 0.15 0.26 0.63 |
| Gender of the Household Head | Household head male only | 27.85 | 27.26 | 0.85 |
| Household head female only | 10.52 | 10.66 | 0.55 | |
| Poverty Status of the Household | Below poverty line of 185% | 13.64 | 13.12 | 0.52 |
| Sample A Number of Households | Sample B Number of Households | |||||
|---|---|---|---|---|---|---|
| Beverage Category | Purchase | Non-Purchase | Market Penetration | Purchase | Non-Purchase | Market Penetration |
| Isotonic drinks | 846 | 2974 | 22% | 763 | 3056 | 20% |
| Regular Soft Drinks | 3444 | 376 | 90% | 3477 | 342 | 91% |
| Diet Soft Drinks | 2494 | 1326 | 65% | 2499 | 1320 | 65% |
| High-Fat Milk | 3121 | 699 | 82% | 3161 | 658 | 83% |
| Low-Fat Milk | 2332 | 1488 | 61% | 2440 | 1379 | 64% |
| Fruit Drinks | 2866 | 954 | 75% | 2923 | 896 | 77% |
| Fruit Juices | 3555 | 265 | 93% | 3584 | 235 | 94% |
| Bottled Water | 2693 | 1127 | 70% | 2681 | 1138 | 70% |
| Coffee | 2812 | 1008 | 74% | 2746 | 1073 | 72% |
| Tea | 2753 | 1067 | 72% | 2785 | 1034 | 73% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Dharmasena, S.; Bessler, D.A.; Capps, O., Jr. Application of Resolution Regression and Resolution Graphs in Evaluating Probability Forecasts Generated Using Binary Choice Models. Econometrics 2026, 14, 10. https://doi.org/10.3390/econometrics14010010
Dharmasena S, Bessler DA, Capps O Jr. Application of Resolution Regression and Resolution Graphs in Evaluating Probability Forecasts Generated Using Binary Choice Models. Econometrics. 2026; 14(1):10. https://doi.org/10.3390/econometrics14010010
Chicago/Turabian StyleDharmasena, Senarath, David A. Bessler, and Oral Capps, Jr. 2026. "Application of Resolution Regression and Resolution Graphs in Evaluating Probability Forecasts Generated Using Binary Choice Models" Econometrics 14, no. 1: 10. https://doi.org/10.3390/econometrics14010010
APA StyleDharmasena, S., Bessler, D. A., & Capps, O., Jr. (2026). Application of Resolution Regression and Resolution Graphs in Evaluating Probability Forecasts Generated Using Binary Choice Models. Econometrics, 14(1), 10. https://doi.org/10.3390/econometrics14010010
