1. Introduction
With the advent of the era of big data, trading on e-commerce platforms develops rapidly [
1,
2]. Consumers can effectively access product information and make purchase transactions on an e-commerce platform without the restriction of time [
3,
4,
5]. For most consumers, the choice of purchasing product is related to the joy of product experience. With the help of the e-commerce platform, consumers can identify products based on their specific demands and then seek desirable ones. Nowadays, many e-commerce websites, e.g., Amazon (
https://www.amazon.cn/), Zol.com.cn (
http://www.zol.com.cn/), JD.com (
https://www.jd.com/) and Taobao.com (
https://www.taobao.com/) have provided platforms for consumers to select their products and share their online ratings or reviews on product experience. For instance, if a consumer wants to purchase a tablet computer due to occupational demand, before purchasing, he/she can browse an overall recommendation of tablet computers via Zol.com.cn which is an IT and business portal. Furthermore, e-commerce sites, such as Zol.com.cn, allow consumers who have already purchased tablet computers to post their personal evaluations publicly online, so that other consumers can get helpful references from these websites.
Figure 1 shows a list of tablet computers from Zol.com.cn (
http://detail.zol.com.cn/tablepc/good_pic.html). Therefore, it is of great significant to explore the product selection problem oriented on online ratings and reviews in e-commerce platform.
Currently, the evaluation or selection of products oriented to online ratings or reviews has been paid attention by many scholars globally. To support consumer’s selection and purchase decision, Fan et al. designed a novel method based on stochastic dominance and PROMETHEE-II method to rank the alternative products by using online ratings [
6]. Yang et al. proposed a method to synthesize rich and heterogeneous information and further used it to rank products with Electronic Word of Mouth (eWOM) score [
7]. Chen et al. visualized the market structure from different perspectives and formulated a Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS)-based product selection model [
8]. Najmi et al. presented a product ranking system that facilitates the online shopping experience according to online evaluations and description on the products [
9]. Wu et al. developed a two-stage consumer decision model from the risk perspective to understand the role of online reviews in the consumers’ Willingness-To-Pay (WTP) [
10]. In the field of automobile product selection, Liu et al. proposed an approach on ranking products with online reviews [
11]. The approach integrated the sentiment analysis technique and the intuitionistic fuzzy set theory. Using the mobile phone as research object, Peng et al. presented a fuzzy PROMETHEE method to rank alternative products based on online product reviews provided by consumers [
12]. To effectively analyze consumer reviews for the purpose of monitoring consumer satisfaction with mobile phones, Kang and Park proposed a sentiment analysis and Vlsekriterijumska Optimizacija I Kompromisno Resenje (VIKOR) method to select the desirable mobile application [
13]. A social appraisal mechanism (SAM) was proposed by Li et al. [
14] Such mechanism combines social network analysis (SNA), intuitionistic fuzzy sets (IFSs), and the TOPSIS method to achieve social decision support for online users. Li et al. investigated the determinants of consumer satisfaction in hospitality venues by analyzing online reviews [
15]. Li et al. introduced a fuzzy decision support technique based on Choquet Integral (CI), which is an aggregation function, into the multiple criteria decision making (MCDM) problem of hotel selection [
16]. With respect to restaurant selection, Zhang et al. [
17] established a decision support model to help independent tourists utilize social information on TripAdvisor.com.
Although existing literature has made significant contributions to product evaluation or selection problems based on online ratings or reviews, they mainly considered that the consumer is entirely rational. Actually, when the consumer realizes that selecting other products might have better results, he/she would feel regret selecting the current one. Otherwise, the decision maker may feel rejoice. Additionally, the methods to process online ratings or reviews are still rough. Since the scale of the online evaluations is massive, how to design an effective quantification method for the product selection problem is complicated and challenging. Therefore, it is essential to establish a reasonable model to address this problem with massive online evaluations.
Compared with traditional uncertainty problems, the evaluation or selection problem oriented on e-commerce sites with online evaluation data is more complex, because of the complexity of massive data processing. Therefore, this study aimed at proposing a decision support model to select the desirable product(s). To accomplish the overall objective for ranking and selecting products, there are two main tasks to be completed. First, the online evaluation data need to be crawled and normalized. Then, a reasonable model considering the regret behavior of consumers was built to support the product selection and purchase. Finally, a practical selection problem for the method was tested to demonstrate the feasibility of the proposed decision support model.
The remainder of this paper is organized as follows. In
Section 2, several basic preliminaries of stochastic variable and regret theory are introduced. In
Section 3, the problem formulation and the resolution process for selecting product are established based on online ratings. In
Section 4, a novel decision support model is proposed. According to online ratings of products associated with each evaluation attribute, the evaluations of products in format of stochastic variables are constructed. Then, the gain and loss degrees of each alternative over others are calculated. Considering the regret aversion behavior of the consumer, the perceived utility values of alternatives associated with each attribute can be computed. Based on the prior order of evaluation attributes and the perceived utility values, the prior weights of evaluation attributes are determined. Thus, we proposed an aggregating method to derive the ranking of the products. A practical example of selecting products, taking Zol.com.cn as the carrier, is presented to illustrate the practicability of the proposed method in
Section 5. Concluding remarks are given in
Section 6.
3. Problem Formulation and Resolution Procedure
3.1. Formulation of the Problem
Considering a problem that is selecting the most desirable product(s) in e-commerce website. By an advance screening, several acceptable products are determined, which are regarded as the alternatives set. However, different products have different advantages and disadvantages on different attributes. Thus, the consumer hesitates among the several alternative products. Besides, the consumer may have regret aversion behavior in the decision process. To select a desirable product, several evaluation attributes are considered, which are determined according to the consumer’s personal online ratings. To support the consumer’s selection and purchase, many online ratings of the alternative products concerning the attributes are crawled from the related e-commerce platform. The problem considered in this paper is how to rank the products based on the online ratings.
For convenience, throughout this paper, the following notations are used in the problem. Suppose , and are three sets of subscripts. Suppose with is a set of alternative products, where expresses the th alternative product, . Usually, the alternatives set can be predetermined by the consumer. with is a finite set of evaluation attribute, where denotes the th evaluation attribute, . Usually, the set can be determined by two sources. One is provided by consumer indirectly and the other is derived from online evaluations. Assume is a vector of attribute weights, where is the weight assigned to attribute with for , and . However, sometimes the weight vector of evaluation attribute is unknown. The prior order of evaluation attributes is provided by the consumer. In addition, suppose is the rating scales set used by an electronic platform, where is the th rating scale, . is the total number of online ratings for product associated with attribute . is a number that scale used as the online rating for product associated with attribute . In fact, and should be derived in this study.
The problem addressed in this study is how to select the most desirable alternative product(s) from the finite alternative set
based on the online ratings and evaluation attributes with prior order, consider the regret aversion behavior of the consumer. In
Figure 3, taking the selection of tablet computers as an example, the formulation of ranking and selection of tablet computers through online ratings is described concretely.
3.2. Framework and Processing for the Problem
To solve the above decision-making problem, a resolution process of selecting products is proposed as shown in
Figure 4. In
Figure 4, the resolution process can be divided into two parts: (1) preparatory phase, i.e. crawling the related online ratings data and constructing the stochastic evaluations of products; and (2) ranking phase, i.e. considering the behavior of the consumer, a ranking method for products is proposed. A brief illustration of each part is given below.
Preparatory Phase. Based on the alternative products and the evaluation attributes considered by the consumer, the online ratings of the alternative products concerning all attributes are crawled from the related website using web crawler software and are preprocessed using the probability theory. Further, based on the probabilistic theory method and online ratings of products, the evaluations of products in format of stochastic variables are constructed.
Ranking Phase. According to online ratings of products associated with each evaluation attribute, the evaluations of products in format of stochastic variables are constructed. Then, the gain and loss degrees of each alternative over others is calculated. Considering the regret aversion behavior of the decision maker, based on the regret–rejoice function, the perceived utility values of alternatives associated with each attribute is computed. Based on the prior order of evaluation attributes and the perceived utility values of the products, the prior weights of evaluation attributes are determined. Thus, the aggregating method obtains the ranking of the products.
4. The Proposed Stochastic Decision Model Considering Consumer’s Regret Behavior
Based on the resolution process shown in
Figure 4, a description of the proposed method for ranking and selecting products through online ratings is given in this section, where the description for determining the stochastic evaluations information is in
Section 4.1. Then, the method for calculating gain and loss degrees of alternative over others is established in
Section 4.2. The regret and rejoice perceived values of alternatives is proposed in
Section 4.3. Based on the stochastic evaluations and attribute prior order provided by consumer, the prior weight vector of attribute is determined in
Section 4.4. Finally, in
Section 4.5, the method for obtaining the ranking result of products is presented.
4.1. Determining the Stochastic Evaluation Information
In this paper, to provide a decision support model to select desirable(s) product, the online ratings for the products concerning all evaluation attributes should be considered. Thus, the preparatory phase can be further divided into two parts: (1) crawling online ratings for the alternative with respect to all attributes; and (2) preprocessing online ratings to alternatives’ assessments in format of stochastic variables. Thee detailed description of each part is given below.
Nowadays, some web crawlers have been presented, which can be used to derive the online evaluations. Currently, some websites encourage consumers to post their evaluations according to a pre-established framework of evaluation attributes. For example, Zol.com.cn constructs a framework on the attributes of products, and encourages the consumers to post their ratings and reviews according to the framework. In this paper, the crawler software named Octopus collector is used to obtain online ratings. According to the alternative products set, i.e.,
, the online ratings with respect to alternative products can be crawled and collected from the related website. Besides, a situation where ratings are posted according to the pre-established framework of attributes set
, for example, an online rating and review for Apple iPad Air 2 from Zol.com.cn, is shown in
Figure 5 (
http://detail.zol.com.cn/372/371503/review.shtml). In this figure, we can obtain the five attributes for selecting tablet computers, i.e., appearance, photograph, performance, endurance and cost performance. With respect to five attributes, the online ratings of tablet computers are provided by every consumer.
We suppose is a rating scales set used by an electronic platform. To accurately describe the differences, in this paper, the evaluations of alternative products are represented as evaluations in format of stochastic variables. After crawling the online ratings, based on Excel, the statistical data aare summarized. Suppose is the total number of online ratings for products associated with attribute . is the number scale used as the online ratings for products associated with attribute .
Let
,
, in format of stochastic variable, be the evaluation value for product
associated with attribute
. The probability that the alternative product
on attribute
is expressed as
is defined as
,
;
;
.
where
,
,
,
.
4.2. Calculation of the Gain and Loss Degrees
In the process of decision making, the consumer will compare alternative products to other products, If he/she selects an alternative product instead of a better one, he/she may feel regret for this selection. On the contrary, the consumer may feel rejoice. Thus, to measure the perceived utility values of alternatives to select the desirable alternative, the gain and loss degrees of alternatives with respect to others should be calculated. We suppose and are gain degree and loss degree of alternative with respect to on attribute .
For gain degree and loss degree , the following properties are provided.
Property 4. (Complementarity) For and , we obtain and .
Property 5. (Compensatory) Let and be expectations of stochastic variables and . For , we obtain . For , we obtain .
Proof. For
, based on Equations (5) and (6), we can obtain that
The expectations
and
of stochastic variables
and
are given as
Since stochastic variables
and
are independent [
18], we obtain
. Thus, it can be observed that
. ☐
Property 6. (Middle Boundary) For , we have if and only if . For , we obtain if and only if .
Proof. For , if , we obtain according to Property 4. According to Property 5, we obtain , i.e., . In contrast, if , we obtain , i.e., .
Similar to the above proof, we can prove if and only if . ☐
Property 7. (Transitivity) For and , if and , then .
Proof. For , if , i.e., , we obtain according to Property 6. If , i.e., , we obtain by Property 6; thus, . According to Property 6, it can be observed that , i.e., . ☐
Example 1. Let and be two interval stochastic variables whose probability distributions are shown as in Table 1. By using Equations (5) and (6), we could obtain the gain and loss degrees of over , respectively.
4.3. Determining the Stochastic Evaluation Information
In addition to comparing the selected alternative with other options, the consumer has a behavior characteristic of regret aversion. That is, the consumer is more sensitive with loss degree to the same magnitude of gain degree. To calculate the perceived values of alternatives, let
and
be the rejoice value and regret value of alternative
relative to
on attribute
. Based on the function of regret–rejoice [
24,
25],
and
can be obtained by
where
is the regret aversion parameter, which measures the level of regret aversion of the decision maker.
According to Equations (10) and (11), several interesting properties are summarized as follows:
Property 8. and .
Property 9. and increase monotonically as the gain degree and loss degree increase, respectively.
Furthermore, the perceived utility matrix on attribute can be established. In the matrix , the element , which combines rejoice degree and regret degree , denotes the perceived utility value of alternative over alternative for attribute which combines rejoice degree and regret degree . It is given by
Based on Equations (10)–(12), the following characteristics can be obtained.
Property 10. If , then alternative net superior to for attribute . If , then alternative net superior to for attribute .
After obtaining the perceived utility value
of alternative
over
on attribute
, the overall perceived utility value
of alternative
on attribute
can be integrated by
4.4. Determination of Evaluation Attributes’ Weights
In this study, for evaluation attributes set , we suppose the prior order of evaluation attributes is provided by consumer. Without loss of generality, we suppose . To aggregated the perceived utility values into overall perceived utility value , the prior weight vector should be determined. In the following, we determine the prior weight of evaluation attribute based on the perceived utility value of alternative on evaluation attribute .
Taking the alternative as an example, first, let the weight of highest evaluation attribute be . Then, the others relative prior weight of evaluation attribute is defined as
Here, we use a sigmoid non-linear transformation function to transform the perceived utility value
into
:
where
is the parameter of transformation function.
Then, we normalize the relative prior weight
of prior evaluation attribute
into prior weight
:
According to Equations (14)–(16), interesting properties are summarized in the following.
Property 11. For , , we have , and , i.e., increase monotonically as the perceived utility value .
Proof. Since , and , we have , that is , , increase monotonically as the perceived utility value . ☐
Property 12. For , , if , , we have . That is, the evaluation attribute weight of high prior is not less than the evaluation attribute weight of the lower prior.
Proof. Since and , if for , then we have based on Equation (14), and based on based on Equation (15). Thus, . Based on Equation (16), we have . That is the evaluation attribute weight of high prior is not less than the evaluation attribute weight of the lower prior. ☐
4.5. Ranking Alternatives
After determining the prior weights of evaluation attributes as well as the perceived utility values of alternatives on all attributes, the overall perceived utility value of alternative can be computed as
According to perceived utility value , the ranking list of alternatives can be derived. A larger value of indicates that is a better alternative.
Based on the above analysis, the decision procedures of the proposed method and flowchart (
Figure 6) for handling the selection problem with online ratings can be summarized as follows:
- Step 1.
Crawl the online ratings and construct evaluation in format of stochastic variable .
- Step 2.
Construct gain matrix and loss matrix on attribute according to Equations (5) and (6).
- Step 3.
Obtain perceived utility matrix on attribute according to Equations (10)–(12).
- Step 4.
Determine the prior weight vector associated with according to Equations (13)–(16).
- Step 5.
Calculate overall perceived utility value of alternative according to Equation (17).
- Step 6.
Determine the alternatives ranking result.
5. A Case Study
We used the case of selecting a desirable tablet computer through online ratings in ZOL website to explain the practicality of the proposed decision support model.
A consumer wants to buy a tablet computer because of the occupational requirement. However, since the consumer has limited knowledge on electronic products, it is difficult to select an optimal or satisfying alternative tablet computer in time. Thus, it is necessary to decide using an e-commerce platform. ZOL website (
http://www.zol.com.cn/) is one of the leading e-commerce platforms in China, which locates in the sales promotion of IT interactive portal. ZOL website collects product data for information technology on professional online video interactive marketing as one of the complex media. It is also the CBS interactive group interactive media company’s flagship media in China. It provides many kinds of online ratings and reviews about electronic products.
By limiting the brand, basic function and price of tablet computers, five alternative tablet computers were initially screened by the consumer on ZOL website. The consumer needs to select a desirable tablet computer from the following five alternative tablet computers.
: Apple iPad mini 2
: GALAXY Tab S T800
: MI Pad
: ASUS ZenPad 3S 10
: Apple iPad Air 2
Based on the information of the e-commerce services platform, the following five attributes associated with alternative tablet computers are considered: appearance (
), photograph (
), performance (
), endurance (
) and cost performance (
). The prior order of the attributes is
. The consumer provides the prior order of evaluation attributes as a weight vector of the five evaluation attributes, which are denoted as
. The proposed method in
Section 4 is applied to rank these five alternative tablet computers. The calculation processes and discussion are expressed below.
5.1. Methodology and Results
Because the original data obtained from e-commerce websites are complex and massive, we must further convert the original data into structural data. First, we used crawl software to extract the online ratings of five tablet computers on evaluation attributes from the website of ZOL.COM.CN. The total numbers of the online ratings for tablet computers are 471, 144, 138, 102 and 429, respectively. On this website, the scales set of ratings is
. Then, the total number of each scale for each tablet computer on each evaluation attribute can be counted. Thus, the stochastic evaluations
of alternative tablet computer
concerning attribute
are formatted as distributed information using Equation (4). Thus, we form the distribution linguistic decision matrix of five alternative tablet computers with respect to five attributes as shown in
Table 2,
Table 3,
Table 4,
Table 5 and
Table 6.
Then, according to Equations (5) and (6), based on the evaluations of alternatives in format of stochastic variables, the gain matrix
and loss matrix
on attribute
are constructed. Considering the regret behavior of consumer, the perceived utility matrix
can be calculated according to Equations (10)–(12). Here, we suppose the parameter
. Given space limitations in this paper, we only give the perceived utility matrices in the following:
Then, the perceived utility value
of alternative
on attribute
can be calculated by Equation (13). The perceived utility matrix is shown as follows:
Based on prior order of evaluation attributes and Equations (14)–(16), the prior weights vectors of evaluation attributes associated with product
are
,
,
,
, and
. By aggregating the perceived utility values of alternatives on different attributes using Equation (17), the overall perceived utility values are:
According to the descending order of the values of
, the ranking result of these five alternatives are obtained as follows:
5.2. Analysis on the Effect of the Parameter of Regret Aversion
In this section, to show the robustness of the proposed method, we design a sensitivity analysis for our proposal. In a real decision-making situation, due to the different focuses of the decision makers, their regret aversion degrees are different. The degrees of regret aversion are embodied in different regret aversion parameters. Thus, different regret aversion parameters might be used by different decision makers.
We suppose that the parameter
can be assigned different numbers:
,
,
and
. Different
s yield different results, as shown in
Table 7.
Thus, the ranking results under the different parameter are
Table 7, in which the ranking results are similar but not identical. If
or
, the ranking result is
. In the case of
or
, the ranking result is
. Thus, the parameter
reflects regret aversion degree of consumer, which directly leads to the different ranking results.
5.3. Comparison Analysis
In this subsection, to show the better feasibility of the proposed method, we take the example from Fan and coworkers’ study [
6]. Furthermore, we compare the proposed method with the previously designed ones by Fan et al. [
6] and Kang and Park [
13]. The problem formulation and comparison process are shown as follows.
Example. [6] A consumer wants to buy an SUV from the following five alternatives: : Jeep, Compass; : Mazda, CX5; : Subaru, Forester; : Toyota, Highlander; and : Chevolet, Kopacz. Based on the information provided by the e-commerce services platform (http://www.autohome.com.cn/), the following eight attributes associated with alternative SUVs are considered: Trunk space (), Power (), Control (), Fuel consumption (), Comfortability (), Appearance (), Cost performance () and Price (). First, the consumer provides the weight vector of attributes: . The corresponding online product ratings are extracted from the website of Autohome (http://www.autohome.com.cn/). Then, the evaluations of products are formulated as distribution information presented in Table 8. The methods developed by Fan et al. [
6] and Kang and Park [
13] as well as our method presented in
Section 4 are separately applied to rank these five alternative SUVs. Since first two methods only address the situation that consumer is entirely rational, to make the comparison worthwhile, we also suppose that the consumer is entirely rational. Consequently, we first assume that the regret–rejoice function is
. Then, the different degrees of consumer’s regret behavior are considered subsequently. The ranking results are listed in
Table 9.
As shown in
Table 9, the trend of sorting results by using the mentioned methods are similar, except the first ranking result is quite different from others. With respect to the first method provided by Kang and Park [
13], the assessments that are transformed from online reviews were slightly rough. Moreover, some information on different evaluation levels is partially ignored. Therefore, comparing the results, the method proposed in this study can avoid information loss when transforming the online reviews into crisp evaluations. Furthermore, by using our method, more accurate overall compromise values of alternative SUVs can be generated to obtain a more persuasive ranking result. With respect to the method based on stochastic dominance theory provided by Fan et al. [
6], the ranking result is completely consistent with the one given by this study. However, by stochastic dominance theory, the dominance relationship between any two alternatives can be determined. According to the dominance relationship, the dominance degree of a product over another one can be calculated subsequently. The method to determine the dominance relationship and dominance degrees has a higher computational complexity than the proposed method. Additionally, in this study, the different degrees of consumer’s regret behavior are considered, which improves the feasibility of the decision results in dealing with the realistic problem.
6. Conclusions
The decision-making problem for products selection oriented on online evaluations has important theoretical significance and practical application value. To facilitate the selection of products for consumers, an effective decision-making support model needs to be investigated. In this paper, with respect to selection of products, a decision support model is proposed based on massive online ratings. In the proposed method, by crawling the massive online ratings for products provided by different reviewers, quantitative information is prepared for decision making process. Then, considering the regret aversion behavior of consumer, the perceived utility values of consumer for alternatives is proposed. It provides one more valid tool for uncertain decision-making problem with stochastic variables information.
Compared with existing studies, this research has the following contributions. On the one hand, from a realistic perspective, the product selection problem oriented on massive online ratings is investigated to propose a novel decision model to support selection and purchase. Then, the regret aversion behavior of consumer is considered, which is more realistic than other methods based on online evaluations. On the other hand, from a theoretical point of view, to preserve the integrity of the raw data as much as possible, we processed the massive ratings into stochastic evaluations. This avoids the information loss or distortion in the existing methods. Then, a more accurate method of comparison any two stochastic variables is constructed to calculate rejoice values and regret values of alternatives, subsequently.
For future studies, to make the decision result more comprehensive, it is worth noting that online reviews can be considered as well as online ratings [
11,
12]. Further, since the evaluation attribute is unknown, mining evaluation attributes from online reviews needs to be investigated. Additionally, evaluation on green supply chain, population resources and environment by public participation [
26,
27,
28] can be incorporated into future investigations.