Growing health awareness, the rising preference for vegan diets, and the frequent occurrence of lactose intolerance within the European population have increased consumers’ preferences for dairy-alternative products. Changing lifestyles and growing health concerns about the negative impact of saturated fatty acids of animal origin are among the main factors driving the growth of this sector. By consuming vegetable milk-substitute formulations, particularly soy milk, some of the potentially deleterious components of animal products, such as cholesterol, can be avoided. Moreover, the increasing prevalence of cow’s milk protein allergy is another factor driving the adoption of these types of beverages.
The dairy-alternative sector is mainly segmented on the basis of “type” into soy milk, oat milk, rice milk, almond milk, and others. According to Spanish regulation, the word milk cannot be used to refer to these beverages, with the exception of almond milk. However, they are widely known amongst consumers as vegetable milks. These products are also segmented on the basis of “formulation characteristic” into plain sweetened, plain unsweetened, flavored sweetened, flavored unsweetened, and others, along with several “flavor” and “format” descriptors within the food and beverages class. According to the Europe Dairy Alternative Market, the market for these products in Europe constitutes 21.2% of the global market and was valued at €1797.38 million in 2013, reaching €2765.54 million by 2018. The main dairy alternatives in the EU are soy milk (70.5%), almond milk (14.64%), rice milk (7.9%), and others (6.8%). The market has increasingly benefited in recent years from the perceived health and taste benefits of dairy-alternative products. Soy milk, and soy products in general, are rich in isoflavones, which have attracted consumers’ interest as potential health-promoting foods. Among France, Germany, Italy, Poland, Spain, and the UK, Spain reached the highest percentage of consumption. About 15% of European consumers avoid dairy products for a variety of reasons, including medical reasons such as lactose intolerance (LI), cow’s milk allergy (CMA), cholesterol issues, and phenylketonuria, as well as lifestyle choices such as a vegetarian/vegan diet or concerns about growth hormones or antibiotic residues in cow’s milk [7,8].
This sector is expected to continue to grow, which makes the analysis of consumers’ preferences for these products highly relevant in order to identify reliable patterns and suggest recommendations that may allow marketers to better target the different market segments and improve profitability in the dairy-alternative drinks industry. According to Oltenacu and Broom, the increased production in dairy cattle should be viewed with concern because the increase in milk yield has been accompanied by a decline in cows’ welfare, including fertility problems, increasing leg and metabolic diseases, decreased longevity, higher disease incidence, and modification of normal behavior. Improving animal welfare is important, as it is regarded by the public as indicative of sustainable systems and good product quality. Reducing the intensity of milk production farming would contribute to more sustainable consumption patterns. Therefore, taking into account resource availability and the consequences of functioning and the morality of action, we consider animal welfare as one of the relevant factors that contribute to more sustainable production systems.
In this context, the objective of this paper is twofold. The first objective is to analyze consumers’ preferences toward dairy-alternative products in Catalonia and, in particular, to identify the willingness to pay (WTP) for the most important attributes that consumers take into consideration when purchasing dairy-alternative drinks. The second is to study the determinant factors that affect the purchasing frequency of dairy-alternative products.
The application of the discrete choice experiment (DCE) involves the characterization of a product through a series of attributes and their levels that can be combined, following an experimental design, to create different scenarios of the product. These scenarios are presented in an array of “choice sets” representing different possible “states” of the product. Subjects are asked in a survey to select their preferred “product” within each choice set, or none of them. This approach allows us to understand the trade-offs that participants are willing to make among the descriptors of the product, thereby revealing their preference for certain characteristics. The DCE can be hypothetical (hypothetical discrete choice experiment, H-DCE), in which consumers are asked to select their preferred product while only simulating their behavior in a real marketplace. This approach may suffer from hypothetical bias, induced by the hypothetical nature of surveys. This bias is defined as the difference between what a respondent indicates he/she would purchase in a survey or interview and what he/she would actually do in the real market. According to Loomis, hypothetical bias in surveys reflects the old saying that “there is a difference between saying and doing”. In this context, hypothetical surveys are, in general, not incentive compatible. That is to say, truthfully revealing the real value that the product has to the consumer is not the respondent’s dominant strategy. Loomis summarized an array of different ex-ante and ex-post approaches to reduce hypothetical bias in surveys. One of the ex-ante approaches is to make the survey consequential to the respondent. This approach constitutes the basic form of the non-hypothetical discrete choice experiment (NH-DCE), which creates a “real shopping scenario” at the end of the survey [13,14]. Individuals who agree to participate are asked to purchase their preferred product from a randomly selected choice set and are obliged to pay its posted price.
In the NH-DCE, participants are, in general, rewarded with real money that at least covers the highest price level of the products presented in the choice sets. Both approaches (i.e., the H-DCE and NH-DCE) belong to the stated preference method, where consumers “state” their preferences in a survey.
In contrast to the stated preference approach, consumers’ preferences can be elicited using revealed preference data. In this case, the observation of purchasing behavior “reveals” what consumers prefer. Revealed preferences can be analyzed by means of scanner data, which are electronic records of transactions that establishments collect as part of the operation of their businesses, most commonly via the scanning of bar codes at the checkout lines of retail stores. Hence, scanner information constitutes a nontraditional data source for economic applications. Scanner data collected on consumer purchases come from two types of sources: point-of-sale (retail) or store scanner data, which use the universal product code (UPC) of products sold at retail checkout counters to identify the products, the quantities sold, and their prices; and household-based scanner data, which are derived from a sample of households that scan the UPCs of all purchased products after each shopping trip. In the latter case, even though the entire panel scans all products with a UPC, a subset of the panel also records purchases of random-weight or non-UPC products from stores other than the usual supermarkets (e.g., butcheries, corner stores, self-service stores, greengrocers, bakeries, and others); households scan them through special codes provided by the owner of the data sets in order to collect and record all possible food and beverage acquisitions. Although point-of-sale scanner data have been available to academic researchers since at least the early 1980s, household-based scanner data are a more recent innovation.
Scanner data can be integrated with stated choice experiment data, as was done in the seminal work of Adamowicz et al. and others [18,19,20,21] that combined revealed and stated preferences. Model estimates based on pooled data may reduce hypothetical bias and improve the goodness of fit of the models in preference analysis and prediction. In addition, scanner data have been used in economic studies for decades to answer a variety of questions about food consumption, food pricing, and the operation of retail food markets. Most applications have used retail data, and only a few have used household data or a combination of the two. Scanner data have been used most often to examine pricing behavior in particular product markets, including the influence of private label foods on name brand pricing, strategic pricing responses in markets, and the effect of political pressure on prices. Scanner data have also been used to measure the value of product attributes and to analyze seasonality in prices and consumption, as well as for policy-relevant food and nutrition research, such as studying the effects of mandatory nutrition labeling.
Our study fits within the preference analysis carried out exclusively on revealed choice data, similar to the applications of Guadagni and Little, Pancras and Dey, and Wasi and Keane, among others. Guadagni and Little calibrated a multinomial logit choice model on scanner data on coffee purchases, which computed the probability of choosing an alternative as a function of the attributes of all the alternatives available. This study permitted an explicit assessment of several explanatory variables, namely brand loyalty, size loyalty, presence/absence of store promotion, regular shelf price, and promotional price cut. However, it received some criticism because the purchase occasion was not modeled and because the coefficients of these variables were constrained to be the same for all coffee brand sizes. Pancras and Dey applied discrete choice modelling using the latent class and the generalized multinomial logit models to the AC Nielsen scanner panel data in the United States on the ketchup category from the largest retailer market. Wasi and Keane applied discrete choice modelling to frozen pizza choice using retailer scanner data, selecting random subsets of the full choice of 100 varieties of pizza in order to construct their choice sets. In this context, consumers’ purchasing behavior from scanner data allowed them to estimate consumers’ preferences and willingness to pay.
Home-scan data are detailed individual data on sales of consumer goods obtained by ‘scanning’ the bar codes of individual products using an electronic device within households. This type of data is scarce in academic research because of its high cost. However, it is worth the effort because this kind of information is very timely and precise, which may allow researchers to accurately analyze preferences in a revealed context. In fact, during the past two decades, studies using revealed data sets have increased considerably and become more prevalent, as they provide huge opportunities for consumers’ preference research and marketing decision making on the one hand, and a managerial tool for retailers on the other. This type of data collection has remarkable advantages over traditional data collection, as it avoids many of the hypothetical and strategic biases of questionnaires. It also offers opportunities to improve the stated techniques used to analyze consumer preferences by allowing researchers to investigate the gap between what people do in real life and what they say they do in surveys. It deals with the conflict of goals in reaching both the representativeness and the continuity of the observed items. In addition, this data type allows us to observe individual household decisions continuously while identifying their economic and sociodemographic characteristics. The simultaneous collection of data regarding price, quantity purchased, and frequency of purchase is another substantial advantage over conventional cross-section data.
The main issues with the use of home-scan data sets are related to the credibility and validity of the information reported, because the data are self-recorded and the recording process is time-consuming. Households who agree to participate in the sample might not be representative of the observed population, since the acceptance to participate is not fully random and is, in general, unbalanced with regard to technology education. Furthermore, households who agree to participate may not record their purchases accurately or may misreport some trip information, such as the store and date. According to Hardesty et al., scanner data collection may suffer from bias related to discrepancies between the shelf label price and the price charged by the scanning system. These inaccuracies in scanned prices may have relevant implications for marketing models and the interpretation of results. Furthermore, households in scanning panels are slightly more price-sensitive. Another issue is related to who participates in home-scan panels. According to Lusk and Brooks, home-scan data may suffer from sample selection and participation biases. Preference analysis based on home-scan data also omits the position of the product on store shelves, although shelf positioning at retail places plays an important role in the purchase decision. Accordingly, results based on home-scan data should be treated with care, taking into account the potential sources of bias.
2.1. Database
The data collection used in this study followed the ethical principles according to the Spanish and European regulations on protecting personal information and ensuring anonymity. Households included in this database signed a consent form and received an explanation about how they should collect the data. Participants were economically compensated for their participation. A data set including only the purchase of dairy-alternative drinks was separated from the original data set that included many product categories, e.g., oil, olives, meat, dried fruits, fruits and vegetables, eggs, honey, bread, etc., in order to fulfill the research objectives.
These data were extracted from a larger fixed sample of households, representative of the population of Spanish households, which allows acts of purchase to be analyzed and studied continuously. The company’s network in Spain consists of households selected by stratification according to several socioeconomic variables. The panel is proportional both demographically and by region, and its recruitment, selection, and maintenance procedures ensure quality control of the data, which come from 1935 polling points and 12,000 cooperating households, each equipped with a barcode reader that transmits complete information about food purchases. Even though the entire panel scans all products with the electronic device, a subset of the panel also records purchases of random-weight products (i.e., with no bar code) from stores other than the usual supermarkets (e.g., butcheries, corner stores, self-service stores, greengrocers, bakeries, and others). In this case, households scan products through special codes provided by the company, Kantar (Barcelona, Spain), in order to collect and record all possible food and beverage acquisitions in the data set. The resulting data set, representing all the dairy-alternative products purchased by households within the sample during one year, comprised 5746 observations. The panel provides rich information about every single act of purchase, as if it were recreated. In other words, detailed data about each of the families, the product purchased, its attributes, and the circumstances of the act of purchase are collected. It is worth highlighting that personal data (names, address, e-mail, or phone) were not recorded.
2.2. Consumers’ Preferences Analysis Approach
Consumers’ preferences and the willingness to pay were analyzed following the discrete choice experiment (DCE). The DCE is one of the most widely used methods because of its capacity to analyze complex goods such as food and beverage products and because of its ability to estimate the WTP and the marginal utility of a product’s attributes and levels. The DCE is derived from the consumer theory of Lancaster and the random utility theory. The former postulates that the utility of a product is obtained from the characteristics that the good possesses, rather than from the good per se. The latter suggests that subject $n$ chooses product $j$ from a set of alternative products according to a utility function $U_{nj}$ with a systematic component $V_{nj}$, a vector $X_{nj}$ of $k$ attributes, a random error term $\varepsilon_{nj}$, and socioeconomic characteristics $S_n$ of individual $n$.
Different discrete choice modeling approaches are available to predict the probability of individuals choosing one product from the choice set. The approaches differ in their assumptions about the distribution of the error term $\varepsilon$. The multinomial logit (MNL) developed by McFadden is one of the basic models. However, the MNL imposes homogeneity of preferences for observed attributes. Thus, the mixed or heterogeneous logit models (also known as random parameter logit, RPL) were introduced to relax this restriction [40,41].
The RPL model extends the MNL by introducing unobserved heterogeneity through random coefficients on the attributes. In this case, the utility to person $n$ from choosing alternative $j$ on purchase occasion (or in choice set) $t$ is given by:
$$U_{njt} = (\beta + \eta_n)' x_{njt} + \varepsilon_{njt}$$
Here, $x_{njt}$ is the vector of observed attributes of alternative $j$, $\beta$ is the vector of mean attribute utility weights in the population, and $\eta_n$ is the vector of person $n$-specific deviations from the mean. The idiosyncratic error $\varepsilon_{njt}$ is assumed to be independent and identically distributed (i.i.d.) extreme value, with a variance that has been implicitly normalized (to that of the standard extreme value distribution) to achieve identification. The researcher may specify any distribution for the $\eta_n$ vector, but in most applications it is assumed to be multivariate normal.
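To make the role of the random coefficients concrete, the following minimal sketch approximates RPL choice probabilities by averaging logit probabilities over simulated coefficient draws. It assumes independent normal coefficients; the function names, the three-alternative choice set, and all parameter values are hypothetical and are not taken from the estimation reported in this paper.

```python
import math
import random


def logit_probabilities(utilities):
    """Multinomial logit choice probabilities for one choice set."""
    max_u = max(utilities)  # subtract the maximum for numerical stability
    exp_u = [math.exp(u - max_u) for u in utilities]
    total = sum(exp_u)
    return [e / total for e in exp_u]


def simulated_rpl_probabilities(x_alternatives, beta_mean, beta_sd,
                                n_draws=500, seed=0):
    """Average logit probabilities over independent normal coefficient draws."""
    rng = random.Random(seed)
    averaged = [0.0] * len(x_alternatives)
    for _ in range(n_draws):
        # One simulated individual: mean weights plus a person-specific deviation.
        beta_n = [rng.gauss(m, s) for m, s in zip(beta_mean, beta_sd)]
        utilities = [sum(b * xk for b, xk in zip(beta_n, x))
                     for x in x_alternatives]
        for j, p in enumerate(logit_probabilities(utilities)):
            averaged[j] += p / n_draws
    return averaged


# Hypothetical choice set: each alternative is (price, effect-coded organic).
x_alternatives = [[1.0, 1.0], [1.5, -1.0], [2.0, 1.0]]
# Price weight fixed (sd = 0), organic weight random: illustrative values only.
probs = simulated_rpl_probabilities(x_alternatives,
                                    beta_mean=[-1.0, 0.3],
                                    beta_sd=[0.0, 0.5])
print([round(p, 3) for p in probs])
```

With a fixed negative price weight, the first alternative is chosen more often than the third, because both share the same organic level and differ only in price.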
Regardless of the model used to analyze the choice data, the marginal rate of substitution (MRS) between attributes can be calculated. Once the parameters are estimated and one of the attributes is expressed in monetary terms (i.e., the price), the willingness to pay (WTP) for each attribute level can be determined as the negative of the ratio of the coefficient of the nonmonetary attribute to the coefficient of the price.
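As a minimal sketch of this calculation, the snippet below computes the WTP as the negative ratio of an attribute coefficient to the price coefficient. The coefficient values and attribute names are hypothetical and serve only as an illustration; they are not estimates from this study.

```python
def willingness_to_pay(attribute_coef, price_coef):
    """Return WTP = -(attribute coefficient) / (price coefficient)."""
    if price_coef == 0:
        raise ValueError("price coefficient must be non-zero")
    return -attribute_coef / price_coef


# Hypothetical preference-space estimates (illustration only).
coefficients = {"soy": 0.50, "organic": -0.25}
price_coefficient = -1.25  # negative: utility falls as price rises

wtp = {name: willingness_to_pay(coef, price_coefficient)
       for name, coef in coefficients.items()}
print(wtp)
```

A positive value is read as a premium the consumer would pay for the attribute level, and a negative value as the discount required to accept it.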
According to Hole and Kolstad, the RPL makes it possible to account for heterogeneity in preferences that is unrelated to observed characteristics, and it has been shown that any discrete choice random utility model can be approximated by an appropriately specified RPL. When estimating the RPL, the researcher specifies that preferences follow a particular distribution, for instance a normal distribution. The parameters of this distribution, such as the mean and the standard deviation in the case of a normal distribution, are then estimated.
Since the WTP for an attribute is given by the ratio of the attribute coefficient to the monetary coefficient, the WTP from an RPL model is given by the ratio of two randomly distributed terms if the monetary coefficient enters the model as random. Depending on the choice of distributions for the coefficients, this can lead to WTP distributions that are heavily skewed and that may not even have defined moments. A common approach to dealing with this potential problem is to specify the monetary coefficient to be fixed (i.e., nonrandom). This is a convenient assumption, as in this case the distribution of the willingness to pay for an attribute is simply the distribution of the attribute coefficient scaled by the fixed price coefficient. The problem is that it is often unreasonable to assume that all individuals have the same marginal utility of income, so this approach implies an undesirable trade-off between realism and modeling convenience.
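The problem with a ratio of two random coefficients can be illustrated with a small simulation. The sketch below assumes normally distributed coefficients with hypothetical means and standard deviations (none of them estimated in this paper) and compares the largest absolute WTP obtained with a random price coefficient against the one obtained with a fixed price coefficient.

```python
import random

random.seed(0)
N_DRAWS = 10_000

# Hypothetical population parameters (illustration only, not study estimates).
ATTR_MEAN, ATTR_SD = 0.5, 0.2      # non-monetary attribute coefficient
PRICE_MEAN, PRICE_SD = -1.0, 0.5   # price coefficient

attr_draws = [random.gauss(ATTR_MEAN, ATTR_SD) for _ in range(N_DRAWS)]
price_draws = [random.gauss(PRICE_MEAN, PRICE_SD) for _ in range(N_DRAWS)]

# Random price coefficient: WTP is a ratio of two normal draws.
wtp_random_price = [-a / p for a, p in zip(attr_draws, price_draws)]
# Fixed price coefficient: WTP is the attribute draw scaled by a constant.
wtp_fixed_price = [-a / PRICE_MEAN for a in attr_draws]

max_abs_random = max(abs(w) for w in wtp_random_price)
max_abs_fixed = max(abs(w) for w in wtp_fixed_price)
print(f"largest |WTP|, random price coefficient: {max_abs_random:.2f}")
print(f"largest |WTP|, fixed price coefficient:  {max_abs_fixed:.2f}")
```

Price draws close to zero produce extreme WTP values in the first case, which is the source of the heavy tails; with a fixed price coefficient the WTP simply inherits the normal shape of the attribute coefficient.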
Train and Weeks suggest another solution to circumvent this problem: estimating the RPL in WTP space rather than in preference space (i.e., the approach described above). This involves estimating the distribution of willingness to pay directly by reformulating the model in such a way that the coefficients represent the WTP measures. The researcher then makes a priori assumptions about the distributions of WTP rather than of the attribute coefficients. In the RPL in WTP space, the utility that person (decision maker) $n$ derives from choosing product $j$ in choice situation $t$ is specified as a function of the price ($p_{njt}$) and the other nonmonetary attributes of the product ($x_{njt}$). Utility is specified as separable in price and nonprice attributes to facilitate discussion:
$$U_{njt} = -\alpha_n p_{njt} + \beta_n' x_{njt} + \varepsilon_{njt}$$
where $\alpha_n$ and $\beta_n$ are individual-specific coefficients for the price and the other attributes of the product, which vary randomly across decision makers (consumers), and $\varepsilon_{njt}$ follows an extreme value distribution, though the analysis is analogous for other distributions. Since the WTP for an attribute is the ratio of the attribute’s coefficient to the price coefficient, $w_n = \beta_n / \alpha_n$, the utility can be rewritten as:
$$U_{njt} = -\alpha_n p_{njt} + (\alpha_n w_n)' x_{njt} + \varepsilon_{njt} \qquad (5)$$
which is what Train and Weeks call the model in WTP space. Under this parameterization, the estimated coefficients are directly interpreted as WTP values. The WTP space model has been applied in several studies, in particular within the disciplines of environmental economics and marketing. These models were found to produce more realistic WTP measures.
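The reparameterization can be checked numerically. The sketch below evaluates the deterministic part of utility in preference space and in WTP space for one hypothetical individual and product, and shows that both give the same value. The names `alpha`, `beta`, and `wtp`, as well as all numbers, are illustrative assumptions.

```python
def utility_preference_space(alpha, beta, price, x):
    """Deterministic utility: -alpha * price + beta' x."""
    return -alpha * price + sum(b * xk for b, xk in zip(beta, x))


def utility_wtp_space(alpha, wtp, price, x):
    """Same utility written with WTP parameters: -alpha * price + (alpha * w)' x."""
    return -alpha * price + sum(alpha * w * xk for w, xk in zip(wtp, x))


# Hypothetical individual-specific parameters (illustration only).
alpha = 2.0                 # price coefficient, entered with a minus sign
beta = [1.0, -0.5]          # non-price attribute coefficients
wtp = [b / alpha for b in beta]  # implied WTP for each attribute

price = 1.5
x = [1.0, -1.0]             # effect-coded attribute levels of one product

u_pref = utility_preference_space(alpha, beta, price, x)
u_wtp = utility_wtp_space(alpha, wtp, price, x)
print(u_pref, u_wtp)
```

The two formulations are behaviorally equivalent; what changes is which quantities receive a distributional assumption during estimation.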
2.3. Factors Affecting the Purchase Frequency
This section specifies the model used to study the determinant factors that affect the frequency of purchasing dairy-alternative products. Specifically, the dependent variable measures how many times per month individuals purchased dairy-alternative products during one year. This variable is a count and can only take non-negative integer values. In this context, ordinary least squares (OLS) estimation can generate biased, inconsistent, and inefficient estimates. Thus, count data models are a better alternative.
Count data models (Poisson and negative binomial) are nonlinear regression models for counts estimated by maximum likelihood. The standard model for count data is the Poisson model, which is based on the Poisson distribution.
Consider a discrete random variable $Y$ observed over a period of length $T$, with observed frequencies $y_i$, $i = 1, \dots, N$, where $y_i$ is a non-negative integer count that represents the number of occurrences of the event of interest for customer $i$. Assume that $Y$ is distributed according to a Poisson distribution:
$$P(Y = y_i \mid x_i) = \frac{e^{-\lambda_i} \lambda_i^{y_i}}{y_i!}, \quad y_i = 0, 1, 2, \dots$$
where $\lambda_i$ is the expected purchase frequency. Covariates can be introduced into this model by relating them to the purchase frequency through a log-linear model: $\ln \lambda_i = x_i' \beta$, where $x_i$ denotes the vector of the subject’s characteristic variables and $\beta$ is the corresponding coefficient vector of $x_i$.
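A minimal sketch of the Poisson specification is given below. It assumes the standard Poisson probability function and a log-linear link between the covariates and the expected purchase frequency; the covariate vector and coefficients are hypothetical values chosen for illustration.

```python
import math


def poisson_pmf(y, lam):
    """P(Y = y) for a Poisson variable with mean lam."""
    return math.exp(-lam) * lam ** y / math.factorial(y)


def conditional_mean(x, beta):
    """Log-linear link: lambda_i = exp(x_i' beta)."""
    return math.exp(sum(b * xk for b, xk in zip(beta, x)))


# Hypothetical household: intercept plus one covariate (illustration only).
x_i = [1.0, 2.0]
beta = [0.2, 0.4]
lam_i = conditional_mean(x_i, beta)

# Probabilities of 0..49 purchases; the remaining tail is negligible here.
pmf = [poisson_pmf(y, lam_i) for y in range(50)]
mean = sum(y * p for y, p in enumerate(pmf))
variance = sum((y - mean) ** 2 * p for y, p in enumerate(pmf))
print(round(lam_i, 4), round(mean, 4), round(variance, 4))
```

The computed mean and variance coincide, which is the equidispersion property of the Poisson model.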
The standard application of the Poisson model is constrained by its equidispersion assumption, which imposes equality of the conditional mean and variance of the dependent variable (i.e., purchase frequency). In practice, the variance of the data is often greater than the mean (overdispersion), which can result in misleading inference when the Poisson likelihood is used. In general, overdispersion has little effect on the parameter estimates but leads to underestimation of the standard errors. To accommodate this overdispersion, one can assume a random-effects distribution for $\lambda_i$ instead of a deterministic log-linear model. This assumption results in a model capable of reflecting data with greater variation. When the random effects are assumed to be gamma distributed, the result is the negative binomial distribution (NBD) model, which adds a parameter to the Poisson (P) model that reflects unobserved heterogeneity and overcomes the problem of overdispersion. Accordingly, the probabilities in the negative binomial model are given by:
$$P(Y = y_i \mid x_i) = \frac{\Gamma(y_i + \theta)}{\Gamma(\theta)\, y_i!} \left(\frac{\theta}{\theta + \lambda_i}\right)^{\theta} \left(\frac{\lambda_i}{\theta + \lambda_i}\right)^{y_i}$$
where $\lambda_i$ is the conditional mean and $\theta$ is the overdispersion parameter. The connection between the two models is that the Poisson model results if $\theta \to \infty$.
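The relationship between the two count models can be sketched numerically. The code below assumes the parameterization in which the negative binomial variance equals the mean plus the squared mean divided by θ (the convention used by `glm.nb` in the R package MASS); under a different parameterization the limiting case would be stated differently. All numeric values are illustrative.

```python
import math


def neg_binomial_pmf(y, mean, theta):
    """NB probability with given mean and variance mean + mean**2 / theta."""
    log_p = (math.lgamma(y + theta) - math.lgamma(theta) - math.lgamma(y + 1)
             + theta * math.log(theta / (theta + mean))
             + y * math.log(mean / (theta + mean)))
    return math.exp(log_p)


def poisson_pmf(y, mean):
    """Poisson probability computed on the log scale."""
    return math.exp(-mean + y * math.log(mean) - math.lgamma(y + 1))


MEAN = 3.0          # hypothetical expected purchase frequency
COUNTS = range(200)  # the tail beyond 199 is negligible for these values

# Moderate theta: variance exceeds the mean (overdispersion).
nb_probs = [neg_binomial_pmf(y, MEAN, 2.0) for y in COUNTS]
nb_mean = sum(y * p for y, p in zip(COUNTS, nb_probs))
nb_var = sum((y - nb_mean) ** 2 * p for y, p in zip(COUNTS, nb_probs))

# Very large theta: the NB probabilities approach the Poisson ones.
max_gap = max(abs(neg_binomial_pmf(y, MEAN, 1e6) - poisson_pmf(y, MEAN))
              for y in COUNTS)
print(round(nb_mean, 4), round(nb_var, 4), max_gap)
```

With θ = 2 the variance is well above the mean, whereas for a very large θ the two probability functions are practically indistinguishable.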
2.4. Empirical Application
2.4.1. Consumers’ Preferences Analysis Approach
The first step in any empirical application of the DCE is to correctly identify the attributes and levels that constitute the main characteristics of the studied product. As we used revealed home-scan data, we were able to identify all the attributes and levels that appear in the description of each product purchased in a real situation. Because it was not feasible to handle all the attributes and levels, we selected the most important ones, focusing on the attributes that we were interested in and that we hypothesized consumers take into consideration when purchasing dairy-alternative drinks: the brand label, biological information, type of vegetable drink, flavor, additional ingredients, and price. In a subsequent step, we identified the different levels to consider. The attributes type, flavor, and additional ingredients had a high number of levels, which led to a large number of products or alternatives to be included in the choice sets, many with only a few purchases in the whole data set. Therefore, we aggregated the levels of rarely purchased products (less than 5%) and irrelevant levels. Finally, the levels of the “type” attribute were: soy, oats, rice, and other. For the “biological information” attribute, the levels were: organic or not organic. For the “flavor” attribute, the levels were: original non-dairy beverage flavor, chocolate, and other flavors. For the “additional ingredient” attribute, the levels were: without added ingredients, with added calcium and vitamins, with added natural calcium, and with other added ingredients. As a result, 56 real categories were identified, of which 26 were soy drinks, seven were rice drinks, 13 were oat drinks, and 10 were other vegetable-type drinks. Thus, we considered the 56 existing categories in the data set arranged into a single choice set.
The construction of choice sets using revealed data sets does not follow the traditional choice designs (full factorial, fractional factorial, and orthogonal or D-efficient designs). In this case, the choice set in the DCE calibrated on the home-scan data mimics the situation in which the consumer is at a point of purchase, aiming to buy non-dairy alternative beverages, and is faced with several products. The products available on the shelves represent a unique choice set from which the consumer has to choose his/her preferred product on every purchasing occasion. To construct the choice set, we first determined the number of available products to be included. Two main limitations were identified:
Firstly, the set of available products is not the same in each supermarket/hypermarket because they have different numbers of product references (brands and sub-brands) and different marketing strategies. This means that the revealed choice set to be constructed will not be the same among the different purchasing points available in the data set. For instance, a consumer who purchased in supermarket X would face a set of products different from the choice set offered in supermarket Y. This limitation was addressed by the aggregation procedure we followed, in which different references (products) were allocated to the same category. For instance, if product A is only offered by supermarket X and product B is only offered by supermarket Y, both products (A and B) are assigned to the same category when they share the same attribute levels, because the aggregation procedure was carried out on a common set of attributes that are the same in all purchasing points.
Secondly, the individual frequency of purchasing dairy alternatives was not equal among panelists during the year, which means that the number of choice sets to be constructed differs for each group of panelists (according to how many times they purchased during the year). For instance, a consumer who purchased only once during the year, and only one product per purchase, faces only one constructed choice set. Likewise, a consumer who purchased twice during the year, and only one product per purchase, faces two constructed choice sets, and so on. This limitation was handled by accounting for the unbalanced panel nature of the data set in the estimation procedure.
As a result, we considered all the aggregated categories in a unique choice set consistent with all the recorded purchases in all the points of purchase. For each observation (purchase action), we observed which choice was made and registered a “one” in the choice variable for the category to which the purchased product belongs, and a zero for all the other categories. Thus, within each block of 56 rows, the choice variable is one exactly once and zero otherwise. This arrangement is the standard data structure for estimating discrete choice models.
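The arrangement of the data can be sketched as follows. The code expands each recorded purchase into a block of 56 rows, one per aggregated category, with a 0/1 choice indicator. The household identifiers, category indices, and field names are hypothetical; only the block size of 56 comes from the text.

```python
N_CATEGORIES = 56  # aggregated categories forming the single choice set


def build_choice_rows(purchases, n_categories=N_CATEGORIES):
    """Expand each purchase into one row per category with a 0/1 choice flag."""
    rows = []
    for obs_id, (household_id, chosen_category) in enumerate(purchases):
        if not 0 <= chosen_category < n_categories:
            raise ValueError(f"unknown category index: {chosen_category}")
        for category in range(n_categories):
            rows.append({
                "obs_id": obs_id,              # one purchase occasion
                "household_id": household_id,  # links occasions of one panelist
                "category": category,
                "choice": 1 if category == chosen_category else 0,
            })
    return rows


# Hypothetical purchase records: (household, index of purchased category).
# Household hh_001 purchased twice, hh_002 once (an unbalanced panel).
purchases = [("hh_001", 3), ("hh_001", 10), ("hh_002", 55)]
rows = build_choice_rows(purchases)
print(len(rows), sum(row["choice"] for row in rows))
```

Keeping the household identifier on every row is what allows the estimation routine to treat a varying number of choice occasions per panelist.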
Regarding the price attribute, because of the aggregation of irrelevant levels and attributes, we obtained an array of different categories, each of which contained more than one real purchased product. Thus, for the price attribute, we calculated a monthly average of the individual prices of all the purchased products that belong to each grouped category. Price was introduced in the model estimation as a continuous variable, and the rest of the attributes were effect-coded. Effect codes use only ones, zeros, and minus ones to convey all of the necessary information, and they allow the estimated coefficient β to be interpreted as the difference in marginal utility that the respondent gets with respect to the conditional mean or the average utility of the alternative.
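Effect coding can be illustrated with the “type” attribute. In the sketch below, the choice of “other” as the reference level is an assumption made for illustration, since the reference levels are not stated in the text; the function name is likewise hypothetical.

```python
def effect_code(level, levels):
    """Effect-code `level`; the last entry of `levels` is the reference level."""
    if level not in levels:
        raise ValueError(f"unknown level: {level}")
    *coded_levels, reference = levels
    if level == reference:
        # The reference level takes -1 in every column.
        return [-1] * len(coded_levels)
    return [1 if level == coded else 0 for coded in coded_levels]


# Levels of the "type" attribute; "other" is assumed to be the reference.
TYPE_LEVELS = ["soy", "oats", "rice", "other"]

coded = {level: effect_code(level, TYPE_LEVELS) for level in TYPE_LEVELS}
for level, codes in coded.items():
    print(level, codes)
```

Because every column sums to zero across the levels, the coefficient of the reference level is not estimated directly but can be recovered as the negative of the sum of the estimated coefficients of the other levels.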
2.4.2. Factors Affecting the Purchasing Frequency of Dairy Alternatives
Different models were used to study the covariates that affect the frequency of purchasing, using the purchase frequency variable as revealed count data; the dependent variable measured how many times per month individuals purchased dairy-alternative drinks during one year.
The models estimated were the Poisson regression model (P) and the negative binomial model (NB), which were estimated with the packages msm and MASS, respectively, in the software R. As mentioned, the dependent variable was the purchasing frequency of dairy alternatives, and the independent variables included in the two models comprise expenditure on the shopping basket, expenditure on dairy-alternative drinks, number of purchased units, life cycle, general study of medias (GSM), class, social class, homemaker’s age, metropolitan habitat, municipal habitat, nationality, household members, presence of children, province, region, presence of dogs, body mass index (BMI), and number of visited purchase places.
3. Results and Discussion
Table 1 summarizes the sociodemographic characteristics of the sample, which consisted of 343 Catalan households.
3.1. Attribute Preferences of Dairy-Alternative Products
The results of the RPL model in WTP space are presented in Table 2. The goodness of fit, assessed through McFadden’s pseudo-R2, was highly acceptable. A pseudo-R2 of 0.3 represents a decent model fit for a discrete choice model, and values between 0.3 and 0.4 can be translated as an R2 of between 0.6 and 0.8 for the linear model equivalent.
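For reference, McFadden’s pseudo-R2 compares the log-likelihood of the estimated model with that of a null model. The sketch below assumes a null model in which all alternatives are equally likely (a constants-only null is another common choice) and uses hypothetical log-likelihood values, not the ones behind Table 2.

```python
import math


def mcfadden_pseudo_r2(ll_model, ll_null):
    """McFadden's pseudo-R2: 1 - LL(model) / LL(null)."""
    return 1.0 - ll_model / ll_null


def null_log_likelihood(n_choices, n_alternatives):
    """Log-likelihood when every alternative is equally likely."""
    return n_choices * math.log(1.0 / n_alternatives)


# Hypothetical example: 1000 choice occasions, 56 alternatives per choice set.
ll_null = null_log_likelihood(1000, 56)
ll_model = 0.7 * ll_null  # assumed fitted log-likelihood (illustration only)

pseudo_r2 = mcfadden_pseudo_r2(ll_model, ll_null)
print(round(ll_null, 2), round(pseudo_r2, 3))
```

A fitted log-likelihood equal to 70% of the null value therefore corresponds to a pseudo-R2 of 0.3.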
As mentioned before, this modelling approach allows the distribution of willingness to pay to be estimated directly by reformulating the model so that the coefficients represent the WTP measures. We therefore made a priori assumptions about the distributions of WTP rather than about the attribute coefficients. A salient feature of the WTP space model is that the estimated parameters are also the parameters of the implied WTP distributions. We estimated models with utility as specified in Equation (5), where the coefficient of each non-price attribute is the product of the WTP for that attribute and the price coefficient. The price coefficient was given a normal distribution, and the WTPs were also specified to be normal. The WTPs were assumed to be uncorrelated across attributes. Table 2 reports the estimates for the models parameterized in WTP space.
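As a reading aid, the reparameterization described above can be written out in a generic form. The notation is an assumption that follows the usual WTP-space formulation and may differ from the symbols used in Equation (5): $\lambda_n$ is the price coefficient (entered with a negative sign), $p_{njt}$ the price, $x_{njt}$ the vector of non-price attributes, $\beta_n$ the preference-space coefficients, and $w_n$ the vector of WTPs.

```latex
\begin{align}
U_{njt} &= -\lambda_n p_{njt} + \beta_n' x_{njt} + \varepsilon_{njt}
  && \text{(preference space)} \\
w_n &= \beta_n / \lambda_n
  && \text{(WTP as a ratio of coefficients)} \\
U_{njt} &= -\lambda_n p_{njt} + (\lambda_n w_n)' x_{njt} + \varepsilon_{njt}
  && \text{(WTP space)}
\end{align}
```

Substituting $\beta_n = \lambda_n w_n$ into the first line gives the third, so distributional assumptions can be placed on $w_n$ directly instead of being implied by a ratio of two random coefficients.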
All attributes had statistically significant estimates. For the biological information (organic) attribute, consumers preferred non-organic over organic products. They also preferred producer brands over private labels. For the flavor attribute, consumers preferred the original non-dairy beverage flavor over chocolate flavor, similar to the results obtained by Siegrist et al. This tendency could be taken into account in marketing strategies by producing more products in natural flavors without any additional taste. For the additional ingredients attribute, consumers preferred dairy-alternative drinks without additional compounds and those with added calcium and vitamins, whereas the selected households showed a negative preference for drinks with only added natural calcium. Finally, for the attribute representing the type of dairy alternative, soy drinks were preferred over rice and oat drinks.
Consumers who purchased a dairy-alternative drink were willing to pay an extra 0.18 €/unit for a producer-branded product, an additional 0.89 €/unit for the original flavor, and only 0.11 €/unit for the chocolate flavor. They also exhibited a WTP of 0.27 €/unit for a dairy alternative without additional ingredients, while they were willing to pay only an additional 0.16 €/unit for one with added calcium and vitamins. In addition, they were willing to pay an extra 0.56 €/unit for soy drinks and an extra 0.23 €/unit for oat drinks. However, they required a discount of 1.16 €/unit to purchase organic dairy-alternative drinks, 0.17 €/unit to purchase a dairy alternative with natural calcium, and 0.19 €/unit to purchase a rice drink.
3.2. Factors Affecting the Purchasing Frequency of Dairy Alternatives
The results of the estimated P and NB models are shown in Table 3. The categorical independent variables allowed us to determine the percentage increase or decrease in counts of one group versus the base level. For continuous independent variables we were able to interpret how a single unit increase or decrease in that variable is associated with a percentage increase or decrease in the counts of the dependent variable.
For every extra euro spent on the shopping basket and on the dairy-alternative drinks, the purchase frequency increased by 0.4% and 2%, respectively. For the life cycle, the incidence rates for single-parent households, adult couples without children, couples with children of middle age, couples with young children, and young couples without children were, respectively, 0.61, 0.69, 0.72, 0.59, and 0.48 times the incidence rate of the reference group, whereas the results obtained by Ellen et al. showed that living with a partner and the number of children were insignificant. The incidence rates for households within the low and medium GSM classes were both 1.21 times the incidence rate of the reference group (high GSM class). The lower middle social class had an incidence rate 1.20 times that of the reference level (upper and upper middle social class). Regarding the homemaker’s age range, for consumers aged between 35 and 49 years and between 50 and 64 years, compared to the reference level (homemaker’s age between 18 and 34 years), the number of purchases increased by 0.50 and 0.51 times, respectively, showing that older consumers buy dairy-alternative drinks more frequently, similar to the results obtained by Ellen et al. and in contrast with the results obtained by De Silva et al. Furthermore, the incidence rate for households in municipalities of between 30,001 and 100,000 inhabitants was 0.87 times the incidence rate for the reference group (municipalities of fewer than 10,000 inhabitants). Likewise, the incidence rate for households in municipalities of more than 500,000 inhabitants was 1.26 times the incidence rate for the reference group, holding the other variables constant.
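The link between a count-model coefficient, its incidence rate ratio (IRR), and the percentage change in expected counts can be sketched as follows. The two IRRs are the municipal-habitat values reported above; the function names and dictionary labels are illustrative.

```python
import math

def coefficient_to_irr(beta):
    """Incidence rate ratio implied by a log-link count-model coefficient."""
    return math.exp(beta)

def irr_to_percent_change(irr):
    """Percentage change in the expected count relative to the reference."""
    return (irr - 1.0) * 100.0

# IRRs quoted in the text for municipal habitat (reference: < 10,000 inhabitants)
reported_irr = {"30,001-100,000": 0.87, "> 500,000": 1.26}
changes = {label: round(irr_to_percent_change(irr), 1)
           for label, irr in reported_irr.items()}
```

Read this way, an IRR of 0.87 corresponds to about 13% fewer purchases than the reference group, and an IRR of 1.26 to about 26% more, other variables held constant.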
3.3. Consumers’ Heterogeneity Analysis
We identified consumer heterogeneity by carrying out a K-means cluster analysis on the aggregated data, using the average values per purchase occasion over the year of two variables: the total expenditure on the shopping basket and the expenditure on the dairy-alternative drinks. Table 4 presents the results of an ANOVA test on the K-means cluster analysis, which produced a three-cluster solution. The target variables were the two factors representing consumers’ expenditure.
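A minimal Python sketch of K-means on two expenditure variables is given below. It is a plain version of Lloyd's algorithm with fixed starting centroids so that the result is reproducible; the data points, starting centroids, and function names are hypothetical, and the software and settings actually used in the study are not stated here.

```python
def nearest(point, centroids):
    """Index of the centroid closest to `point` (squared Euclidean distance)."""
    distances = [(point[0] - c[0]) ** 2 + (point[1] - c[1]) ** 2 for c in centroids]
    return distances.index(min(distances))

def kmeans(points, centroids, max_iter=50):
    """Lloyd's algorithm for 2-D points, starting from the given centroids."""
    centroids = list(centroids)
    for _ in range(max_iter):
        clusters = [[] for _ in centroids]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # Move each centroid to the mean of its members; keep empty clusters in place
        updated = [
            (sum(m[0] for m in members) / len(members),
             sum(m[1] for m in members) / len(members)) if members else old
            for members, old in zip(clusters, centroids)
        ]
        if updated == centroids:  # assignments have stabilized
            break
        centroids = updated
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels

# Hypothetical (average basket value, average dairy-alternative spend) in euros
households = [(20, 1.5), (22, 1.6), (45, 2.0), (47, 2.2), (90, 3.0), (95, 3.4)]
centroids, labels = kmeans(households, [(20, 1.5), (45, 2.0), (90, 3.0)])
```

In this toy example the basket value dominates the distance because it is on a larger scale; whether the variables should be standardized first is a modelling choice that the sketch leaves open.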
Assuming a significance level of 5% (0.050), the significance statistic (Sig.) indicates that the null hypothesis was rejected for the average expenditure on the shopping basket (F = 858.40, Sig. = 0.000) and for the average expenditure on the dairy-alternative drinks per basket (F = 10.15, Sig. = 0.000). Hence, the test confirmed that the clusters were different. From Table 5, and based on both the average expenditure on the dairy-alternative drinks and the average shopping basket value, the consumers of the third cluster spent the most on both variables, followed by the first cluster and then the second.
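The F statistics quoted above come from a one-way ANOVA across clusters. The sketch below shows how such a statistic is computed as the ratio of between-cluster to within-cluster mean squares; the three groups of values are hypothetical and do not reproduce Table 4.

```python
def one_way_anova_f(groups):
    """F statistic of a one-way ANOVA: between-group over within-group mean square."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Hypothetical average basket values (euros) in three clusters
clusters = [[20, 22, 24], [45, 47, 49], [90, 92, 94]]
f_stat = one_way_anova_f(clusters)
```

A large F is expected for the clustering variables themselves, since K-means forms the groups precisely by separating households on those variables.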
As shown in Table 5, the households involved in the study were unequally distributed among the three clusters: 62% of the sample belonged to the second cluster, around 28% to the first, and the remaining 10% to the third. The segments were then described. First, an ANOVA was performed to check the differences between the three clusters using the cluster membership variable, together with a post hoc analysis, specifically the Games–Howell and Tukey tests. The significance of the statistic indicated that the null hypothesis of homogeneity of variances was rejected at the 5% significance level for “the number of purchases” and “the number of purchased units by purchase occasion” and was not rejected for “expenditure by unit”. Therefore, the tests confirmed heterogeneity of variances for the variables “the number of purchases” and “the number of purchased units by purchase occasion” and homogeneity of variance for the variable “expenditure by unit”.
Then, for the first two variables we applied robust tests of equality of means. The Welch and Brown–Forsythe tests were both significant at the 5% significance level for “the number of purchases” and “the number of purchased units by purchase occasion”; hence, there was a statistically significant difference between the clusters on these two variables. For the “expenditure by unit” variable, the two tests were non-significant, indicating no significant difference between the clusters.
As can be seen in Table 6, for “the number of purchases” variable the members of the first cluster did not show significant results, while the members of the other two clusters did. This indicates that the individuals in the second cluster made more purchases than the individuals in the third cluster, while the individuals in the first cluster did not differ from the other two clusters. Concerning “the number of purchased units by purchase occasion”, the Tukey test (Table 6) showed no difference between clusters 1 and 3 and a difference between these two clusters and cluster 2. In other words, the individuals in clusters 1 and 3 bought more units than the individuals in cluster 2.
Second, a chi-square test was applied to the sociodemographic characteristics. The variables life cycle, social class, and homemaker’s age range were statistically significant at the 5% significance level, which allowed us to reject the null hypothesis of independence between each of these variables and the cluster membership variable. After the test, each sociodemographic variable was cross-tabulated with the cluster membership variable to describe the three clusters.
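The chi-square test of independence on a cross-tabulation can be sketched in a few lines of Python. The contingency table is hypothetical (two age groups by three clusters) and does not reproduce Tables 7 to 9.

```python
def chi_square_independence(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Hypothetical counts: rows are two age groups, columns are clusters 1 to 3
crosstab = [[10, 20, 30],
            [30, 20, 10]]
statistic, dof = chi_square_independence(crosstab)

# 5% critical value of the chi-square distribution with 2 degrees of freedom
reject_independence = statistic > 5.991
```

Here the statistic exceeds the critical value, so independence between age group and cluster membership would be rejected at the 5% level for these illustrative counts.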
As shown in Table 7, the first cluster had the largest percentage of couples with young children among the three clusters; the second cluster had the largest percentage of adult couples without children and retirees; and the third included more couples with children of middle age and with older children.
Table 8 shows that the largest percentage within each of the three clusters belonged to the middle class, while the second cluster had the highest percentage of lower middle class and lower class households, and the third cluster had a high proportion of individuals from the upper and upper middle class.
Concerning the homemaker’s age range, the results in Table 9 show that more than half of the individuals in the first cluster were aged between 35 and 49 years, while the second cluster included a higher percentage of persons aged between 50 and 64 years and a relatively high number of older individuals. The third cluster included more young individuals than the other two clusters.
Taking all of the above-mentioned variables together, we conclude that the second cluster’s members spent less money on the dairy-alternative drinks and on food in general; they made a higher number of purchases but bought fewer units per purchase occasion. They belonged to the lower middle class and lower class and were generally adult couples without children and retirees, with an average homemaker’s age over 50 years.
In contrast, the third cluster’s members spent more money, devoting a higher budget to the dairy-alternative drinks and to food shopping in general; they made fewer purchases but bought a higher number of units per purchase than the second cluster. They belonged to the middle class and the upper and upper middle classes and were mostly couples with children of middle age and couples with older children, generally with a younger homemaker.
The first cluster was a mixture of individuals with intermediate sociodemographic characteristics and intermediate expenditure on the dairy-alternative drinks and on food. They were mainly middle class households, couples with young children, with a homemaker aged between 35 and 49 years.
As expected, the marginal utility of price had a negative sign, indicating that an increase in price decreases the utility of the dairy-alternative drinks offered to consumers. This result emphasizes that price is a relevant driving factor for producers of dairy-alternative drinks, as consumers are sensitive to price and may change their attitudes towards dairy-alternative drinks if they notice a price increase.
Another important attribute was flavor. The original non-dairy beverage flavor contributed more to consumers’ utility than the other flavors when purchasing dairy-alternative drinks. This finding could inform profitable marketing strategies, for instance by producing more products in natural flavors without any additional taste. Marketing strategies should promote products by focusing on the “original” and “pure” version of the product, without additional healthy ingredients and with a reduction of any undesirable compounds present.
In addition, consumers prefer manufacturer brands over private labels, which indicates to the industry that its own brands contribute more to consumers’ utility, as an added value, when making the purchase decision. Concerning the remaining attributes, results showed that consumers do not take the organic production alternative into account on the purchase occasion, which means that it is not a profitable tool in marketing strategies, and that they prefer soy drinks without additional ingredients.
Concerning the factors affecting the purchase frequency of the dairy-alternative products, results showed that for every extra euro spent on the shopping basket and on the dairy-alternative drinks, the purchase frequency increased by 0.4% and 2%, respectively. The purchase frequency increased significantly with age, highlighting the relevance of marketing strategies that focus on older consumers, who purchase dairy-alternative drinks more frequently.
Finally, these results are closely tied to the decision to aggregate the products with very low purchase frequency, which could have led to a loss of information. Alternative aggregation procedures and modeling approaches could be applied in further work to test the validity of our results.
Author Contributions
Conceptualization, M.L. and Z.K.; methodology, Z.K. and M.L.; software, M.L. and Z.K.; validation, Z.K.; formal analysis, M.L. and Z.K.; investigation, M.L. and Z.K.; data curation, M.L.; writing—original draft preparation, M.L.; writing—review and editing, Z.K.; supervision, Z.K.; project administration, Z.K.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
Ugidos-Rodríguez, S.; Matallana-González, M.C.; Sánchez-Mata, M.C. Lactose malabsorption and intolerance: A review. Food Funct. 2018, 9, 4056–4068.
Woodside, J.V.; Brennan, S.; Cantwell, M. Are Soy-Milk Products Viable Alternatives to Cow’s Milk? In Beverage Impacts on Health and Nutrition; Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 151–162.
Rangel, A.; Sales, D.; Urbano, S.; Galvao, J.; Andrade, J.; Macedo, C. Lactose intolerance and cow’s milk protein allergy. Food Sci. Technol. 2016, 32.
Jayne, V.W.; Michael, S.M. Are Soy-Milk Products Viable Alternatives to Cow’s Milk? In Beverages in Nutrition and Health; Wilson, T., Temple, N.J., Eds.; Humana Press Inc.: Totowa, NJ, USA, 2004.
Jago, D. Free from Foods—Mintel report. In FreeFrom Allergy and Intolerance 2011; FDIN Seminar: Daventry, UK, 2011.
Leatherhead Food Research. Food Allergies and Intolerances: Consumer Perceptions and Market Opportunities for ‘Free From’ Foods; Leatherhead Food International: Surrey, UK, 2010.
Oltenacu, P.A.; Broom, D.M. The impact of genetic selection for increased milk yield on the welfare of dairy cows. Anim. Welf. 2010, 19, 39–49.
Broom, D.M. The use of the concept Animal Welfare in European conventions, regulations and directives. Food Chain 2001, 2001, 148–151.
McGlone, J. Farm animal welfare in the context of other societal issues: Toward sustainable systems. Livest. Prod. Sci. 2001, 72, 75–81.
Loomis, J.B. Strategies for overcoming hypothetical bias in stated preference surveys. J. Agric. Resour. Econ. 2014, 39, 34–46.
Lusk, J.L.; Schroeder, T.C. Are choice experiments incentive compatible? A test with quality differentiated beef steaks. Am. J. Agric. Econ. 2004, 86, 467–482.
Chang, J.B.; Lusk, J.L.; Norwood, F.B. How closely do hypothetical surveys and laboratory experiments predict field behavior? Am. J. Agric. Econ. 2009, 91, 518–534.
Robert, C.F.; Matthew, M.S. Introduction to: Scanner Data and Price Indexes; University of Chicago Press: Chicago, IL, USA, 2003.
Nayga, R.M., Jr. Scanner Data in Supermarkets: Untapped Data Source for Agricultural Economists; Review of Marketing and Agricultural Economics; Australian Agricultural and Resource Economics Society: Melbourne, Australia, 1992; Volume 60, pp. 1–8.
Adamowicz, W.; Louviere, J.; Williams, M. Combining revealed and stated preference methods for valuing environmental amenities. J. Environ. Econ. Manag. 1994, 26, 271–292.
Whitehead, J.C.; Pattanayak, S.K.; Van Houtven, G.L.; Gelso, B.R. Combining revealed and stated preference data to estimate the nonmarket value of ecological services: An assessment of the state of the science. J. Econ. Surv. 2008, 22, 872–908.
Brooks, K.; Lusk, J.L. Stated and revealed preferences for organic and cloned milk: Combining choice experiment and scanner data. Am. J. Agric. Econ. 2010, 92, 1229–1241.
Helveston, J.P.; Feit, E.M.; Michalek, J.J. Pooling stated and revealed preference data in the presence of RP endogeneity. Transp. Res. Part B Methodol. 2018, 109, 70–89.
Ward, M.B.; Shimshack, J.P.; Perloff, J.M.; Harris, J.M. Effects of the private-label invasion in food industries. Am. J. Agric. Econ. 2002, 84, 961–973.
Vickner, S.S.; Davies, S.P. Estimating strategic price response using cointegration analysis: The case of the domestic black and herbal tea industries. Agribus. Int. J. 2002, 18, 131–144.
Cotterill, R.W.; Franklin, A.W. An estimation of consumer benefits from the public campaign to lower cereal prices. Agribus. Int. J. 1999, 15, 273–287.
Bonnet, C.; Simioni, M. Assessing consumer response to protected designation of origin labeling: A mixed multinomial logit approach. Eur. Rev. Agric. Econ. 2001, 28, 433–449.
Chevalier, J.A.; Kashyap, A.K.; Rossi, P.E. Why don’t prices rise during periods of peak demand? Evidence from scanner data. Am. Econ. Rev. 2004, 93, 15–37.
Mathios, A.D. The importance of nutrition labeling and health claim regulations on product choice: An analysis of the cooking oil market. Agric. Resour. Econ. Rev. 1998, 27, 159–168.
Guadagni, P.M.; Little, J.D.C. A Logit Model of Brand Choice Calibrated on Scanner Data. Mark. Sci. 1983, 2, 203–238.
Pancras, J.; Dey, D.K. A comparison of generalized multinomial logit and latent class approaches to studying consumer heterogeneity with some extensions of the generalized multinomial logit model. Appl. Stoch. Models Bus. Ind. 2011, 27, 567–578.
Wasi, N.; Keane, M. Estimation of Discrete Choice Models with Many Alternatives Using Random Subsets of the Full Choice Set: With an Application to Demand for Frozen Pizza; No. 2012-W13; Economics Group, Nuffield College, University of Oxford: Oxford, UK, 2010.
Hury, J.; Lamboray, C. The Use of Scanner Data in the Luxembourg CPI: First Lessons Learned; Institut national de la statistique et des études économiques du Grand-Duché de Luxembourg: Luxembourg, 2013.
Cohen, M.A.; Rysman, M. Payment Choice with Consumer Panel Data; Federal Reserve, Bank of Boston: Boston, MA, USA, 2013.
Einav, L.; Leibtag, E.; Nevo, A. On the Accuracy of Nielsen Homescan Data; ERR-69-56490; United States Department of Agriculture, Economic Research Service: Washington, DC, USA, 2008.
Hardesty, D.M.; Goodstein, R.C.; Grewal, D.; Miyazaki, A.D.; Kopalle, P. The accuracy of scanned prices. J. Retail. 2014, 90, 291–300.
Lusk, J.L.; Brooks, K. Who participates in household scanning panels? Am. J. Agric. Econ. 2011, 93, 226–240.
Gidlöf, K.; Anikin, A.; Lingonblad, M.; Wallin, A. Looking is buying. How visual attention and choice are affected by consumer preferences and properties of the supermarket shelf. Appetite 2017, 116, 29–38.
Lancaster, K. A new approach to consumer theory. J. Political Econ. 1966, 74, 132–157.
Train, K.E. Discrete Choice Methods with Simulation; Cambridge University Press: Cambridge, UK, 2003.
McFadden, D. Conditional logit analysis of qualitative choice behavior. In Frontiers in Econometrics; Zarembka, P., Ed.; Academic Press: New York, NY, USA, 1974.
McFadden, D.; Train, K. Mixed MNL models for discrete response. J. Appl. Econ. 2000, 15, 447–470.
Train, K. Discrete Choice Methods with Simulation, 2nd ed.; University Press: Cambridge, UK, 2009; ISBN 0-521-74738-4.
Ben-Akiva, M.; McFadden, D.; Abe, M.; Bockenholt, U.; Bolduc, D.; Gopinath, D.; Morikawa, T.; Ramaswamy, V.; Rao, V.; Revelt, D.; et al. Modeling Methods for Discrete Choice Analysis. Mark. Lett. 1997, 8, 273–286.
Fiebig, D.G.; Keane, M.P.; Louviere, J.; Wasi, N. The generalized multinomial logit model: Accounting for scale and coefficient heterogeneity. Mark. Sci. 2010, 29, 393–421.
Hole, A.R.; Kolstad, J.R. Mixed logit estimation of willingness to pay distributions: A comparison of models in preference and WTP space using data from a health-related choice experiment. Empir. Econ. 2012, 42, 445–469.
Train, K.; Weeks, M. Discrete choice models in preference space and willingness-to-pay space. In Applications of Simulation Methods in Environmental and Resource Economics; Alberini, A., Scarpa, R., Eds.; Kluwer Academic Publishers: Boston, MA, USA; Dordrecht, The Netherlands; London, UK, 2005; pp. 1–16.
Muñiz, C.; Rodríguez, P.; Suárez, M.J. Sports and cultural habits by gender: An application using count data models. Econ. Model. 2014, 36, 288–297.
Cameron, A.C.; Trivedi, P.K. Regression Analysis of Count Data; Cambridge University Press: Cambridge, UK, 2013; Volume 53.
Lichung, J.; Chou, C.H.; Greg, M.A. A Bayesian Approach to Modeling Purchase Frequency. Mark. Lett. 2003, 14, 5–20.
De-Magistris, T.; Gracia, A.; Nayga, R.M., Jr. On the use of honesty priming tasks to mitigate hypothetical bias in choice experiments. Am. J. Agric. Econ. 2013, 95, 1136–1154.
Siegrist, M.; Stampfli, N.; Kastenholz, H. Acceptance of nanotechnology foods: A conjoint study examining consumers’ willingness to buy. Br. Food J. 2009, 111, 660–668.
Van Loo, E.; Caputo, V.; Nayga, R.M., Jr.; Meullenet, J.-F.; Crandall, P.G.; Ricke, S.C.; Nayga, J. Effect of Organic Poultry Purchase Frequency on Consumer Attitudes Toward Organic Poultry Meat. J. Food Sci. 2010, 75, S379–S384.
De Silva, P.; Atapattu, N.; Sandika, A. A study of the socio-cultural parameters associated with meat purchasing and consumption pattern: A case of Southern Province, Sri Lanka. J. Agric. Sci. 2011, 5, 71–79.
Sociodemographic characteristics of the sample.