Next Article in Journal
Assessment of Lake Water Quality in Central Serbia—Using Serbian and Canadian Water Quality Indices on the Example of the Garaši Reservoir
Previous Article in Journal
Urban, Architectural, and Socioeconomic Factors Contributing to the Concentration of Potential Arbovirus Vectors and Arbovirosis in Urban Environments from a One Health Perspective: A Systematic Review
Previous Article in Special Issue
Exploring the Impact of the Green Marketing Mix on Environmental Attitudes and Purchase Intentions: Moderating Role of Environmental Knowledge in China’s Emerging Markets
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Source Data-Driven Personalized Recommendation and Decision-Making for Automobile Products Based on Basic Uncertain Information Order Weighted Average Operator

1
School of Advanced Interdisciplinary Studies, Hunan University of Technology and Business, Changsha 410205, China
2
Xiangjiang Laboratory, Changsha 410205, China
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(9), 4078; https://doi.org/10.3390/su17094078
Submission received: 9 April 2025 / Revised: 27 April 2025 / Accepted: 29 April 2025 / Published: 30 April 2025
(This article belongs to the Special Issue Sustainable Marketing: Consumer Behavior in the Age of Data Analytics)

Abstract

:
The extensive electronic word-of-mouth (eWOM) data generated by consumers encapsulates authentic product experience information. By leveraging advanced data analysis technologies, enterprises can extract sustainable consumer behavior preference knowledge, thereby supporting the optimization of their marketing and management strategies. However, existing data-driven product ranking processes predominantly focus on single-source eWOM data and rarely mine product insights from a multi-source perspective. Moreover, the quality of eWOM data cannot be overlooked. Consequently, this study uses automobile products as a case example and integrates rating eWOM data, complaint eWOM data, and safety test data to construct a multi-source data-driven personalized product ranking recommendation algorithm. Specifically, an evaluation index system is established for each of the three data types. To model information quality, these data are transformed into basic uncertain information (BUI), which incorporates scoring information and credibility metrics. The XLNet model is employed to convert complaint text data into scoring data, and three targeted credibility evaluation models are developed to assess the reliability of the three data types. Subsequently, BUI is aggregated using the BUI ordered weighted average (BUIOWA) aggregation operator. Based on this, a personalized product ranking method aligned with user preferences is proposed, offering consumers recommendation results that match their preferences. Finally, using automobile products as an illustrative example, this study elucidates the multi-source data-driven personalized product recommendation process and provides managerial implications for enterprises.

1. Introduction

Electronic word-of-mouth (eWOM) serves as a channel enabling individuals to instantly access information regarding the consumption experiences of products within consumer groups. Across various platforms, eWOM manifests in multiple forms, including star ratings, textual reviews, quality complaint texts, and feedback information, either individually or simultaneously. Through advanced data analysis tools, enterprises can extract insights into consumer behavior preferences, monitor market dynamics and evolving trends, and based on these findings, develop forward-looking marketing and management strategies to enhance market competitiveness and achieve sustainable development. Consumers, in turn, can leverage this multi-dimensional and authentic product information to assist them in making informed purchasing decisions. The core objective of automobile product ranking decision research driven by eWOM is to construct a group intelligence aggregation model through the integration of large-scale eWOM data, thereby providing personalized recommendation services for consumers. Initially, the primary focus of information aggregation was on user-provided star ratings, combined with statistical methods to generate product ranking results. However, existing research predominantly relies on eWOM data from a single platform, with relatively few studies adopting a cross-platform perspective. Moreover, the influence of quality-related eWOM, such as complaint texts and safety test information, on purchasing decisions is no less significant than that of satisfaction-related eWOM, such as ratings and reviews [1]. For instance, unlike other studies, one study incorporates quality-related eWOM, such as the number of faults per 100 vehicles, into the product ranking decision-making process alongside satisfaction-related eWOM, achieving superior outcomes [2]. Existing research remains limited to mining crowd intelligence knowledge from single-dimensional data sources, with few studies focusing on extracting and aggregating crowd intelligence knowledge for product ranking decisions from multi-source information, including rating eWOM, quality complaint eWOM, and safety test data [3]. The central issue addressed in this study is how to mine valuable knowledge from multi-source and multi-dimensional eWOM data and construct a comprehensive research framework for product ranking decision-making.
In large-scale data-driven product ranking research, the quality of eWOM data plays a critical role in determining the reliability of decision-making outcomes. Due to the low posting threshold for users and the high complexity of platform reviews, the credibility of eWOM disclosed by online service platforms varies significantly. The accumulation of a substantial amount of low-credibility eWOM has exacerbated information quality issues, severely undermining the quality of data-driven decision-making. For instance, nearly 90% of Uber reviews may be rated as 5 stars, which highlights the uneven quality of eWOM data. This inconsistency leads to reduced credibility in product ranking results, making it challenging to provide effective guidance for consumers [4]. Therefore, integrating information quality considerations into large-scale eWOM data-driven product ranking is essential for enhancing decision-making outcomes. Notably, the basic uncertainty information (BUI) model, capable of simultaneously characterizing evaluation information and credibility [5,6], offers a pivotal tool for addressing the challenges. In traditional multi-criteria decision-making (MCDM) processes, fuzzy set decision theory describes the fuzziness of object attributes through membership functions [7,8], while probability set decision theory relies on probability distributions to depict the objective occurrence frequency of random events [9,10]. In contrast, the latest BUI theory focuses on modeling information credibility, providing a novel research perspective for solving information quality problems in big data-driven MCDM. Currently, research on BUI primarily centers on aggregation operators and extended sets, leading to the development of a series of MCDM theories. Regarding aggregation operators, advancements such as the BUI ordered weighted average [11], BUI ordered weighted geometric average [12], and BUI generalized aggregation operator [13] have enriched the decision-making theory system based on BUI. In terms of theoretical expansion, scholars have integrated BUI with (interval) bipolar semantic sets [14,15], rough sets [16], and other frameworks, defining a range of BUI extension sets. A series of studies [16,17,18,19] have demonstrated that BUI theory holds significant advantages in modeling information quality within MCDM contexts, offering a robust theoretical foundation for product ranking research based on massive eWOM data in this study.
It is highly significant to incorporate the adjustment mechanism of users’ personalized preferences into the research on product ranking and recommendation systems. As the diversified characteristics and personalized demands of customers become increasingly prominent [20], consumer preferences in cities at different stages of development have gradually exhibited significant heterogeneous features [21]. Some recommendation models typically adopt a fixed parameter system, lacking sufficient flexibility for personalized adjustments tailored to individual users. This falls short of meeting the requirements of the “user autonomous choice mechanism” emphasized in certain regulations (e.g., China’s “Regulations on the Administration of Algorithmic Recommendation for Internet Information Services”). Fortunately, some studies have developed product ranking methods grounded in users’ personalized preferences, achieving commendable results [22,23]. Consequently, research on personalized product ranking methods based on multi-platform information aggregation holds substantial practical significance.
In conclusion, it is highly significant to mine valuable preference knowledge and develop product ranking and recommendation algorithms by leveraging data analysis tools to process the extensive variety of consumer preferences embedded in multi-source eWOM. However, most existing studies focus on single-dimensional eWOM data sources, with relatively limited exploration into integrated recommendations derived from multi-platform and multi-dimensional eWOM data. The information quality of eWOM substantially impacts the reliability of product ranking decision outcomes. Moreover, incorporating users’ personalized preference adjustment mechanisms into product ranking and recommendation systems aligns with societal development needs. Therefore, this study will commence by analyzing multi-source eWOM information, including rating satisfaction eWOM, user complaint eWOM, and security test data. It will introduce the BUI model as an information representation tool, integrate the user’s personalized adjustment mechanism, and conduct research on multi-source data-driven personalized product ranking and recommendation decision-making. Specifically, the study will (1) construct MCDM evaluation indicators for the three-dimensional data; (2) transform the three-dimensional data into BUI form through processes such as evaluation information mining and credibility conversion; (3) perform information aggregation using the ordered weighted average operator for basic uncertain information; and (4) propose a product ranking algorithm based on user personalization. This research can extract useful knowledge from multi-source eWOM data, assisting consumers in making informed shopping decisions while also guiding the formulation of enterprise management strategies.

2. Literature Review

In the study of ranking decisions for automobile products, compared to ordinary daily necessities, the ranking and recommendation of automotive products involve greater complexity. First, given their high price, users tend to exhibit greater caution during the purchasing process. Second, user purchase requirements are multifaceted, and the overall duration of the decision-making process is relatively prolonged. Research on automotive product ranking decisions can primarily be categorized into two aspects: scoring-driven ranking decisions and text-review-driven ranking decisions.
Research on ranking decision based on scoring. The distributed linguistic term set continues to serve as the core representation tool for converting score information, with PROMETHEE-II and TODIM being extended to the linguistic term set environment to propose a product ranking method [24,25]. To address the issue of data sparsity, scholars employed the grey correlation prediction method to forecast missing score data and introduced the cloud model to construct an automobile product recommendation algorithm [26]. A multi-level classification mechanism was designed, and a group score aggregation method was proposed, defining the IFINWIBM operator while describing the interaction degree between attributes through an operator parameter learning mechanism to avoid subjectivity-induced unscientificity. An online multi-dimensional rating aggregation decision model was developed to address product ranking challenges [27]. With a focus on reliability, research has been conducted on rating aggregation methods for cross-platform distributions, proposing a user weighting model based on feature information and an order preference technical model based on ideal solution similarity using BULI distance measurement to enhance product evaluation aggregation methodologies [19]. A novel dynamic three-way decision model was investigated, and an integrated framework for MCDM based on BULI was developed utilizing rough sets from decision theory [16]. Standard weights for electric vehicles were determined based on Shannon’s entropy, and electric vehicle rankings were established using TOPSIS [28].
Research on ranking decision-making based on text comments. The existing research primarily employs a variety of sentiment analysis tools to extract the sentiment tendencies from online text comments and develops product ranking methods grounded in theories such as fuzzy sets and distributed language term sets. To enhance the accuracy of sentiment intensity recognition, scholars have proposed an ideal-scheme-based ranking decision method by incorporating interval type-2 fuzzy sets [29]. To address the issue of single-feature information representation, BERT and q-rung orthopair fuzzy set theory are employed to construct a product ranking framework for online reviews concerning feature quality allocation [30]. In the literature, emotion analysis technology is utilized to output sentiment intensity levels and their frequencies, which are subsequently converted into trust values for different sentiment intensity levels. A product ranking decision-making method based on emotion analysis and evidence theory is then constructed [31]. Based on sentiment analysis technology, the sentiment orientation of each comment toward each alternative under each attribute is distinguished, and an information conversion mechanism is defined to facilitate the transformation of unstructured data into fuzzy numbers in pictures. The weight determination method of fuzzy entropy measurement in pictures is applied to determine attribute weights [32]. Sentiment analysis results are transformed into hesitant intuitionistic fuzzy elements (HIFE), and sorting outcomes are obtained through the extended ORESTE method based on hesitant intuitionistic fuzzy Chebyshev distance [33]. Considering the advantages of probabilistic linguistic term sets in representing emotional tendencies and their distribution forms, scholars use sentiment analysis technology to output five sentiment levels and integrate TODIM and evidence theory to construct relevant product ranking decision-making methods [34]. Multi-attribute decision-making for product ordering is combined with q-rung orthopair fuzzy set theory and prospect theory [35].
The aforementioned research status reveals that the current studies in the field of automotive product ranking are primarily conducted separately based on rating eWOM data and text eWOM data. Specifically, some studies have concentrated on rating eWOM data to evaluate the quality of automotive products by analyzing consumer ratings for these products. Another segment of research leverages text eWOM data to mine consumer attitudes and evaluation information embedded within the text for achieving product ranking. However, few studies have effectively integrated the knowledge derived from both rating eWOM data and text eWOM data, thereby limiting the comprehensiveness and accuracy of automotive product ranking research to a certain extent. Furthermore, in rating-driven car product ranking research, some scholars have introduced the BUI theory to address information quality issues. This theory can efficiently assess and handle low-credibility scoring data, thus enhancing the reliability of ranking outcomes. Conversely, in text-data-driven product ranking research, relatively limited attention has been paid to information quality. Given the unstructured nature and complex semantics of text data, information quality challenges can significantly impact the accuracy of sorting results. Therefore, introducing effective information quality assessment methods in this domain holds substantial research significance. Meanwhile, automotive safety test data play a pivotal role in automotive product recommendations. Safety performance is one of the key factors consumers consider when purchasing a vehicle. Automotive safety test data can objectively reflect the safety levels of automotive products and provide critical reference points for consumers. For instance, with the increasing prevalence of electric vehicles, battery-related safety incidents have become more frequent. Scholars have analyzed Chinese safety standards from multiple dimensions, including battery materials, cells, modules, and battery systems, offering valuable suggestions for establishing improved international battery safety standards [36].
To summarize, this study aims to integrate the knowledge of consumer behavior preferences embedded in rating eWOM, complaint text eWOM, and security test data. Specifically, the credibility of these three types of data will be assessed individually, and the BUI model will be constructed based on the evaluation outcomes. Through this model, the characteristics and information quality of different data sources can be comprehensively considered. Subsequently, personalized product ranking and recommendation algorithms will be developed to enhance the accuracy and effectiveness of automotive product ranking and recommendation, thereby providing stronger support for the development of the automotive industry and aiding consumers’ car purchase decisions.

3. Model Construction

3.1. Construction of Evaluation System

The problem of automotive product ranking based on online eWOM data can be formulated as follows: given large-scale, multi-dimensional eWOM data, a ranking and recommendation of the set of alternative automotive products A = A i i = 1 , 2 , , m is achieved through an aggregation decision evaluation method. A practical challenge in automotive product ranking research lies in the high similarity of eWOM scores, which leads to low differentiation and makes it difficult to intuitively distinguish between automotive products. To address this issue, this paper first constructs a comprehensive evaluation system from three key dimensions: rating word-of-mouth, complaint word-of-mouth, and safety test data.

3.1.1. Evaluation Indicators and Set Definition Based on Rating Word-of-Mouth

The set P 1 of online service platforms that provides the rating-based eWOM data and its corresponding weight set P W 1 are defined as follows:
P 1 = P 1 l l = 1 , 2 , , L 1 , P W 1 = p w 1 l l = 1 , 2 , , L 1 ; l = 1 L 1 p w 1 l = 1
There are seven evaluation indicators derived from the rated eWOM data, namely, space, configuration, cost-performance, interior, appearance, fuel consumption, and driving experience, as shown in Figure 1. The rating indicator set C 1 and its corresponding weight set C W 1 are formally defined as follows:
C 1 = C 1 j C 11 = Space , C 12 = Configuration , C 13 = Cos t - performance , C 14 = Interior , C 15 = Appearance , C 16 = Fuel   consumption , C 17 = Driving   experience , j = 1 , 2 , , J 1
C W 1 = c w 1 j j = 1 , 2 , , J 1 ; j = 1 J 1 c w 1 j = 1
The user set U 1 l i and its corresponding weight set U W 1 l i within the online service platform are formally defined as follows:
U 1 l i = u 1 k l i k = 1 , 2 , , K l i , U W 1 l i = u w 1 k l i k = 1 , 2 , , K l i ; k = 1 K l i u w 1 k l i = 1
Then, the user’s rating matrix is
G D i 1 l = α i j k l K l i × J 1 , i = 1 , 2 , , m ; l = 1 , 2 , , L 1
where α i j k l 1 , 5 represents the score assigned by user u 1 k l i on platform P 1 l l = 1 , 2 , , L 1 for attribute C 1 j j = 1 , 2 , , J 1 of product A i i = 1 , 2 , , m .

3.1.2. Evaluation Indicators and Set Definition Based on Complaint Word-of-Mouth

The set P 2 of online service platforms providing user complaints and eWOM data, along with its corresponding weight set P W 2 , is formally defined as follows:
P 2 = P 2 l l = 1 , 2 , , L 2 , P W 2 = p w 2 j j = 1 , 2 , , L 2 ; l = 1 L 2 p w 2 l = 1
There are eight evaluation indicators for customer complaints, namely, engine, transmission, steering, braking, tires, front and rear axle and suspension systems, body accessories, and electrical and service issues, as shown in Figure 2. The complaint indicator set C 2 and its corresponding weight set C W 2 are formally defined as follows:
C 2 = C 2 j C 21 = Engine , C 22 = Transmission , C 23 = Steering , C 24 = Braking , C 25 = Tires , C 26 = Front   and   rear   axle   and   suspension   systems , C 27 = Body   accessoriesand   electrical , C 28 = Service   issues , j = 1 , 2 , , J 2
C W 2 = c w 2 j j = 1 , 2 , , J 2 ; j = 1 J 2 c w 2 j = 1
The set U 2 l i j of users who provide complaint content on the service platform, along with its corresponding weight set U W 2 l i j , is formally defined as follows:
U 2 l i j = u 2 k l i j k = 1 , 2 , , K l i j , U W 2 l i j = u w 2 k l i j k = 1 , 2 , , K l i j ; k = 1 K l i j u w 2 k l i j = 1
Then, the user’s complaint risk level matrix is
G D i j 2 l = β i j k l K l i j × J 2 , i = 1 , 2 , , m ; l = 1 , 2 , , L 2 ; j = 1 , 2 , , J 2
where β i j k l 1 , 5 represents the risk level of user u 2 k l i j on platform P 2 l l = 1 , 2 , , L 2 for product A i i = 1 , 2 , , m with respect to attribute C 1 j j = 1 , 2 , , J 1 . It is important to note that a higher grade value corresponds to a lower risk level; specifically, a grade value of 1 indicates the highest risk, while a grade value of 5 indicates the lowest risk.
Note that since automatic transmission cars do not encounter clutch-related issues and the majority of consumers currently purchase automatic transmission vehicles, the problem of clutch failure will not be taken into consideration in this study. Section 3.2 presents a method leveraging natural language processing techniques to transform the original complaint text information into a quantifiable user complaint risk level β i j k l 1 , 5 .

3.1.3. Evaluation Indicators and Set Definition Based on Safety Test Data

The set P 3 of online service platforms providing security assessment data, along with their corresponding weight set P W 3 , is formally defined as follows:
P 3 = P 3 l l = 1 , 2 , , L 3 , P W 3 = p w 3 j j = 1 , 2 , , L 3 ; l = 1 L 3 p w 3 l = 1
There are seven evaluation indicators derived from the rated eWOM data, specifically, occupant protection, VRU protection, and active safety, as shown in Figure 3. The safety test indicator set C 3 and its corresponding weight set C W 3 are formally defined as follows:
C 3 = C 3 j C 31 = Occupant   protection , C 32 = VRU   protection , C 33 = Active   safety , j = 1 , 2 , , J 3
C W 3 = c w 3 j j = 1 , 2 , , J 3 ; j = 1 J 3 c w 3 j = 1
Then, the user’s safety classification matrix D 3 l is
D 3 l = γ i j l m × J 3 , l = 1 , 2 , , L 3
where γ i j l 1 , 5 represents the security level of platform P 3 l l = 1 , 2 , , L 3 for product A i i = 1 , 2 , , m with respect to attribute C 3 j j = 1 , 2 , , J 3 , and a higher security level value indicates greater security.

3.2. Information Acquisition

3.2.1. The Transformation of Complaint Text Information

As shown in Figure 1, Figure 2 and Figure 3, the rating eWOM and safety test data can readily yield quantifiable rating data. In contrast, complaint eWOM is predominantly described in textual form and lacks direct rating data. Consequently, a method is required to transform these textual descriptions into corresponding rating data for subsequent computational analysis. This study will employ a deep learning and neural network model based on artificial intelligence to address this issue.
XLNET is a natural language processing (NLP) model introduced by Google, employing a generalized autoregressive pre-training method. The literature [37] presents a comprehensive comparison of XLNet with models such as BERT and RoBERTa. Under identical data and hyperparameter configurations, XLNet demonstrates significantly superior performance compared to BERT in multiple natural language understanding tasks. When the experiments were extended to larger-scale datasets and more optimized hyperparameter settings were applied, XLNet exhibited outstanding performance on benchmark test sets, including SQuAD, RACE, and GLUE. In some tasks, it even outperformed RoBERTa, thereby showcasing its robust capability in handling complex natural language processing tasks. This study employs XLNet as the core model for complaint text risk scoring, primarily due to its several key characteristics that align well with the task requirements. Complaint texts are generally lengthy and contain dispersed information. XLNet, through its utilization of the long-range dependency mechanism in Transformer-XL and relative position encoding, effectively overcomes the text length limitations inherent in traditional models, enabling it to fully preserve risk signals across paragraphs. Furthermore, XLNet integrates the bidirectional global modeling capability of permutation language models, which resolves the pre-training and fine-tuning bias issues encountered by masked language models such as BERT. Consequently, XLNet is capable of more precisely capturing the intricate.
An autoregressive (AR) language model assumes a linear dependency among sequence data. Given a text sequence x = x 1 , , x T , AR language modeling is trained by maximizing the likelihood under forward autoregressive decomposition.
max θ log p θ x = t = 1 T log p θ x t x < t = t = 1 T log exp h θ x 1 : t 1 Τ e x t x exp h θ x 1 : t 1 Τ e x
where h θ x 1 : t 1 denote the context representation generated by the neural model and let e x represent the embedding of x . Since h θ x 1 : t 1 is conditioned only on position t (i.e., the token to the left), this characteristic plays a crucial role in the model’s behavior.
In this study, we engaged experts to annotate the risk levels of complaint texts and constructed a labeled dataset for subsequent model training. During the model training phase, a dataset comprising nearly ten thousand entries was utilized. To ensure the accuracy and validity of model learning, car names were anonymized in the data preprocessing stage to prevent potential biases in model learning that might arise from similarities in vehicle model names. Furthermore, during the training process, the F1 score was selected as the evaluation metric for assessing model performance. The F1 score provides a comprehensive consideration of precision and recall, enabling an objective reflection of the model’s performance in classification tasks and facilitating a more accurate evaluation of its strengths and weaknesses. The specific procedures are outlined as follows:
Step 1. Obtain the unmarked set T D j of the complaint indicator set C 2 = C 2 j j = 1 , 2 , , J 2 :
T D j = T e x t - β k j K j × J 2 , j = 1 , 2 , , J 2
Step 2. Obtain a manually annotated risk level matrix.
Step 2.1. Expert group E = e q q = 1 , 2 , , Q conducted manual labeling to derive the risk grade matrix:
T D j q = β k j q K j × J 2 , j = 1 , 2 , , J 2 ; q = 1 , 2 , , Q
Step 2.2. Develop a comprehensive, manually labeled risk level matrix:
T D j = β k j K j × J 2 , j = 1 , 2 , , J 2
where β k j = 1 Q q = 1 Q β k j q .
Step 3. The comprehensively manually labeled risk level matrix is divided into a training set and a test set in a 7:3 ratio.
Step 4. The model undergoes training, and its accuracy and precision are enhanced through parameter fine-tuning.
Step 5. After the model training process has been completed, the resulting trained model is utilized to generate the quantitative matrix:
G D i j 2 l = β i j k l K l i j × J 2 , i = 1 , 2 , , m ; l = 1 , 2 , , L 2 ; j = 1 , 2 , , J 2

3.2.2. Information Reliability Calculation

In the first step, the score data related to rated word of mouth, complaint word of mouth, and safety test were collected. Subsequently, the corresponding credibility information will be acquired.
(a)
Obtain platform credibility information.
Step 1. Develop a comprehensive platform credibility assessment system. Fifteen indicators are developed to assess the credibility of the platforms, including overall enterprise capability, honor index, Baidu index, negative news index, user approval rate, comprehensiveness of annotation rules, richness of content presentation, overall stability of reviews, intuitiveness of navigation links, efficiency of the retrieval system, aesthetics of page layout, coherence of overall style, effectiveness of anti-cheating algorithms, data acquisition efficiency, and code transparency. The set F of platform credibility assessment indicators and their corresponding weight set F W are presented, respectively.
F = F j , j = 1 , 2 , , J F
F W = f w j j = 1 , 2 , , J F ; j = 1 J F f w j = 1
Step 2. Obtain platform credibility information.
Step 2.1. Obtain a trustworthiness matrix for each platform.
F P 1 = f p l j 1 L 1 × J F , F P 2 = f p l j 2 L 2 × J F , F P 3 = f p l j 3 L 3 × J F
Step 2.2. Assess the reliability of various platforms.
f p l 1 = j = 1 J F f w j f p l j 1 , l = 1 , 2 , , L 1 f p l 2 = j = 1 J F f w j f p l j 2 , l = 1 , 2 , , L 2 f p l 3 = j = 1 J F f w j f p l j 3 , l = 1 , 2 , , L 3
(b)
Obtain indicator credibility information.
Step 1. Reliability assessment of rating evaluation indicators.
f c i j 1 l = λ 1 1 1 4 K l i k = 1 K l i α i j k l 1 K l i k = 1 K l i α i j k l + 1 λ 1 f p l 1
where f c i j 1 l represents the reliability of product A i i = 1 , 2 , , m on rating platform P 1 l l = 1 , 2 , , L 1 for attribute C 1 j j = 1 , 2 , , J 1 . The credibility of the rating evaluation index is determined by integrating the fluctuation degree of user scores under this attribute with the credibility of the platform f p l 1 .
Step 2. Reliability assessment of complaint evaluation indicators.
f c i j 2 l = λ 2 1 K l i j k = 1 K l i j T β i j k l + 1 λ 2 f p l 2
where f c i j 2 l represents the reliability of product A i i = 1 , 2 , , m on complaint platform P 2 l l = 1 , 2 , , L 2 for attribute C 2 j j = 1 , 2 , , J 2 , and T β i j k l denotes the progress index of risk information in complaint texts for category β i j k l , serving as a grading reference.
Step 3. Reliability assessment of safety testing evaluation indicators.
f c i j 3 l = λ 2 2 T γ i l + T γ j l + 1 λ 2 f p l 3
where f c i j 3 l represents the reliability of product A i i = 1 , 2 , , m on safety testing platform P 3 l l = 1 , 2 , , L 3 for attribute C 3 j j = 1 , 2 , , J 3 , T γ i l denotes the evaluation time of platform P 3 l l = 1 , 2 , , L 3 for automotive product A i i = 1 , 2 , , m , and T γ j l represents the evaluation depth index of the platform. The evaluation depth index reflects the number of secondary indexes assessed, where a higher number indicates a more comprehensive evaluation. This serves as a grading criterion for performance assessment.

3.3. Information Aggregation

3.3.1. Fundamental Concepts

To begin, this section introduces several fundamental concepts pertinent to this study.
(a)
The basic uncertainty information (BUI) [5,6] particle is represented as a pair of values x , c , where the first value x 0 , 1 corresponds to the input value (typically the evaluation score), and the second value c 0 , 1 represents the degree of certainty associated with the score, and 1 c 0 , 1 represents the degree of uncertainty related to the score. The set of all BUI particles is denoted by Β . The BUI aggregate function x , c : n Β for the input of an n-dimensional variable is formally defined as x , c = x , c i i = 1 n = x i , c i i = 1 n B n , where x : n 0 , 1 represents the value mapping function and c : n 0 , 1 denotes the confidence mapping function.
(b)
BUIOWA aggregation function.
The BUIOWA aggregation operator was first introduced in reference [11], which was inspired by the concept of the Choquet integral and processes BUI input via problem decomposition and integration techniques. Its design principle and underlying concept involve partitioning the BUI input into two components—certain E and uncertain n \ E —based on the degree of certainty. The traditional OWA operator and mean operator are then, respectively, employed for aggregation, while the integral approach is incorporated to facilitate continuous aggregation. This process enables effective processing and precise aggregation of the BUI input. For further details regarding the specific design process of this operator, please refer to reference [11].
This paper first presents the formal definition of an uncertain order weighted average (UOWA) operator, assuming constant input variables.
Definition 1
([11]). For any input function x : n 0 , 1 and a subset E n on which the input function values are certain (with its complement n \ E , on which the input function values are uncertain), an UOWA operator for n with a family of OWA weight vectors  W = w n n = 1 n , having orness α , is a mapping U O W A W : 0 , 1 × 2 n 0 , 1 such that
U O W A W x , E = E n O W A W E x | E + 1 E n m e a n n \ E x | n \ E
where the weight vectors are calculated by orness α = o r n e s s W = i = 1 n w i n n i n 1 , and the convention O W A w ϕ x | ϕ 0 and m e a n ϕ x | ϕ 0 holds true. The attached aggregation certainty A C : 2 n 0 , 1 (aggregation uncertainty A U : 2 n 0 , 1 ) is defined by
A C E = E / n A U E = 1 E / n
Next, in accordance with the principle and concept of UOWA, the definition of the basic uncertain information ordered weighted averaging (BUIOWA) aggregation operator is formally introduced.
Definition 2
([11]). A BUIOWA operator for n with a family of OWA weight vectors, having orness α , W = w n n = 1 , is a mapping B U I O W A W : Β n Β such that
B U I O W A W x , c = 0 1 U O W A W x , E t d t , 0 1 A C E t d t
where E t = k n : c k t .
The specific calculation process of the BUIOWA operator is elaborated upon in detail through an illustrative example provided in Appendix A.
In the process of aggregating three types of eWOM data to derive the respective comprehensive BUI scores, it is essential to consider the aggregation across three dimensions: users, metrics, and platforms. For the user dimension, this study employs an equal-weighted averaging method. This approach is straightforward and intuitive, ensuring that each user input is treated equally. For indicators and platform aggregation, the BUIOWA operator is utilized. Its weight assignment mechanism is intricately tied to its unique design principle, comprehensively reflecting the weight design concepts from both data-driven and preference-setting perspectives. On one hand, the BUIOWA operator achieves dynamic weight allocation by introducing a threshold-based partitioning method and employing differentiated calculation approaches for distinct deterministic BUI particles. This process relies entirely on the inherent credibility of the data, effectively mitigating the potential influence of low-credibility information on the aggregation outcomes, thereby demonstrating the scientific rigor of data-driven weight allocation. On the other hand, the BUIOWA operator provides decision-makers with the flexibility to customize aggregated weights by adjusting the orness value. The orness value, a critical parameter in the OWA operator, reflects the decision-maker’s preference for optimism or pessimism. By modulating the orness value, decision-makers can flexibly control the aggregation results’ sensitivity to larger or smaller input values. A higher orness value signifies a more optimistic inclination, tilting the aggregation results toward larger input values, whereas a lower orness value indicates a relatively more pessimistic stance, causing the aggregation results to lean toward smaller input values.

3.3.2. Aggregation of Rating Word-of-Mouth

This section presents a method for aggregating rated eWOM information, which enables us to aggregate the ratings of all users on multiple attributes of a product in each platform into platforms’ comprehensive BUI evaluation data. The data input of the rating eWOM aggregation algorithm is G D i 1 l = α i j k l K l i × J 1 , i = 1 , 2 , , m ; l = 1 , 2 , , L 1 .
Step 1. The multi-attribute rating matrix D 1 l for product A i i = 1 , 2 , , m on platform P 1 l l = 1 , 2 , , L 1 is derived by aggregating the group scores in a systematic manner.
D 1 l = α i j l m × J 1 , α i j l = k = 1 K l i u w 1 k l i α i j k l
Step 2. Aggregate multi-attribute ratings.
Step 2.1. Based on the rating reliability f c i j 1 l obtained through the rating data reliability calculation method presented in Section 3.2, the multi-attribute rating matrix D 1 l = α i j l m × J 1 is transformed into the BUI matrix:
B U I D 1 l = Θ i j 1 l m × J 1 , Θ i j 1 l = α ¯ i j l ; f c i j 1 l , α ¯ i j l = α i j l / 5 0 , 1
Step 2.2. The BUI value of the product on each platform was calculated by employing the BUIOWA operator:
Θ i 1 l = α ¯ i l ; f c i 1 l = B U I O W A Θ i 1 11 , Θ i 2 12 , , Θ i J 1 1 L 1 = 0 1 E t n O W A W E t α ¯ i j l | E t d t + 0 1 1 E t n m e a n n \ E t α ¯ i j l | n \ E t d t , 1 J 1 j = 1 J 1 f c i j 1 l
Step 3. The BUI values from multiple scoring platforms were aggregated to calculate the comprehensive BUI value of the product.
Θ i 1 = α ¯ i ; f c i 1 = B U I O W A Θ i 11 , Θ i 12 , , Θ i 1 L 1 = 0 1 E t n O W A W E t α ¯ i l | E t d t + 0 1 1 E t n m e a n n \ E t α ¯ i l | n \ E t d t , 1 L 1 l = 1 L 1 f c i 1 l
Finally, the comprehensive BUI results of each rating platform are output Θ i 1 i = 1 , 2 , , m .

3.3.3. Aggregation of Complaint Word-of-Mouth

This section presents a complaint information aggregation method utilizing the BUIOWA aggregation operator, which can consolidate the complaint rating data obtained from the method described in Section 3.2 into comprehensive BUI complaint rating data. The data input of the complaint eWOM aggregation algorithm is G D i j 2 l = β i j k l K l i j × J 2 , i = 1 , 2 , , m ; l = 1 , 2 , , L 2 ; j = 1 , 2 , , J 2
Step 1. The multi-attribute complaint rating matrix D 2 l on complaint platform P 2 l l = 1 , 2 , , L 2 is derived by aggregating the group scores in a systematic manner.
D 2 l = β i j l m × J 2 , β i j l = k = 1 K l i j u w 2 k l i j β i j k l , l = 1 , 2 , , L 2
Step 2. Multi-attribute aggregation.
Step 2.1. Based on the complaint rating reliability f c i j 2 l obtained through the complaint data reliability calculation method presented in Section 3.2, the multi-attribute complaint rating matrix D 2 l = β i j l m × J 2 is transformed into the BUI matrix:
B U I D 2 l = Θ i j 2 l m × J 2 , Θ i j 2 l = β ¯ i j l ; f c i j 2 l , β ¯ i j l = β i j l / 5 0 , 1
Step 2.2. The BUI value of the product on each complaint platform was calculated by employing the BUIOWA operator.
Θ i 2 l = β ¯ i l ; f c i 2 l = B U I O W A Θ i 1 21 , Θ i 2 22 , , Θ i J 1 2 L 2 0 1 E t n O W A W E t β ¯ i j l | E t d t + 0 1 1 E t n m e a n n \ E t β ¯ i j l | n \ E t d t , 1 J 2 j = 1 J 2 f c i j 2 l
Step 3. The BUI values from multiple complaint platforms were aggregated to obtain the comprehensive BUI value of the product.
Θ i 2 = β ¯ i ; f c i 2 = B U I O W A Θ i 21 , Θ i 22 , , Θ i 2 L 2 = 0 1 E t n O W A W E t β ¯ i l | E t d t + 0 1 1 E t n m e a n n \ E t β ¯ i l | n \ E t d t , 1 L 2 l = 1 L 2 f c i 2 l
Finally, the comprehensive BUI results of each complaint platform are output Θ i 2 i = 1 , 2 , , m .

3.3.4. Aggregation of Safety Testing Word-of-Mouth

This section presents a method for aggregating safety testing eWOM information, which enables us to aggregate the ratings of all safety testing platforms on multiple attributes of a product into comprehensive BUI evaluation data. The data input of the safety testing eWOM aggregation algorithm is D 3 l = γ i j l m × J 3 , l = 1 , 2 , , L 3 .
Step 1. Multi-attribute aggregation.
Step 1.1. Based on the safety testing rating reliability f c i j 3 l obtained through the safety testing data reliability calculation method presented in Section 3.2, the multi-attribute safety testing rating matrix D 3 l = γ i j l m × J 3 is transformed into the BUI matrix:
B U I D 3 l = Θ i j 3 l m × J 3 , Θ i j 3 l = γ ¯ i j l ; f c i j 3 l , γ ¯ i j l = γ i j l / 5 0 , 1
Step 1.2. The BUI value of the product on each safety testing platform was obtained using the BUIOWA operator.
Θ i 3 l = γ ¯ i l ; f c i 3 l = B U I O W A Θ i 1 3 l , Θ i 2 3 l , , Θ i J 3 3 l 0 1 E t n O W A W E t γ ¯ i j l | E t d t + 0 1 1 E t n m e a n n \ E t γ ¯ i j l | n \ E t d t , 1 J 3 j = 1 J 3 f c i j 3 l
Step 2. The BUI values from multiple safety testing platforms are aggregated to obtain the comprehensive BUI value of the product.
Θ i 3 = γ ¯ i l ; f c i 2 = B U I O W A Θ i 31 , Θ i 32 , , Θ i 3 L 3 = 0 1 E t n O W A W E t γ ¯ i l | E t d t + 0 1 1 E t n m e a n n \ E t γ ¯ i l | n \ E t d t , 1 L 3 l = 1 L 3 f c i 3 l
Finally, the comprehensive BUI results of each complaint safety testing platform are output Θ i 3 i = 1 , 2 , , m .

3.3.5. The Aggregation of eWOM Information on Multiple Platforms

This section presents a method for aggregating comprehensive BUI-type eWOM from the three types of online service platforms into comprehensive BUI information that holistically represents product evaluation. The algorithm’s input consists of the comprehensive BUI results Θ i 1 , Θ i 2 , Θ i 3 i = 1 , 2 , , m from the three platform types.
The BUI values from multiple platform types were aggregated to obtain the comprehensive BUI value.
Θ i = A M Θ i 1 , Θ i 2 , Θ i 3 = 1 3 α ¯ i + β ¯ i + γ ¯ i ; 1 3 f c i 1 + f c i 2 + f c i 3 = ϑ i ; f c i , i = 1 , 2 , , m
Finally, the comprehensive BUI results Θ i of multiple platforms are output.

3.4. Product Ranking Based on User Preferences

Based on the comprehensive BUI results from three types of platforms Θ i 1 , Θ i 2 , Θ i 3 i = 1 , 2 , , m and the aggregated BUI outcomes across multiple platforms Θ i , this section develops a personalized recommendation algorithm for automotive products.
Step 1. For each alternate vehicle, construct a four-element BUI group ¯ i = Θ i 1 , Θ i 2 , Θ i 3 , Θ i and its corresponding ordering vector P ¯ i .
Step 1.1. Construct a four-element BUI group ¯ i = Θ i 1 , Θ i 2 , Θ i 3 , Θ i for each alternative car.
Step 1.2. Based on the BUI sorting method, the sorting percentage vector for the four-element BUI group was computed.
P ¯ i = P Θ i 1 , P Θ i 2 , P Θ i 3 , P Θ i T
where P Θ i 1 , P Θ i 2 , P Θ i 3 , P Θ i , respectively, represents the ranking percentage of product A i i = 1 , 2 , , m on the rating platform, complaint platform, security platform, and multi-platform combination.
Step 2. Initial recommendation results based on users’ preferred ranking percentage.
Step 2.1. Input the set of user expectations regarding the percentage of the sorted four-element BUI result group:
P e r - P ¯ = P e r - P Θ 1 , P e r - P Θ 2 , P e r - P Θ 3 , P e r - P Θ T
where P e r - P Θ 1 , P e r - P Θ 2 , P e r - P Θ 3 , P e r - P Θ , respectively, denote the ranking percentages of the user’s expected rating platform, complaint platform, security platform, and multi-type platform.
Step 2.2. Based on P ¯ i and P e r - P ¯ , the initial set P e r - A 0 of personalized recommendations is determined.
P e r - A 0 = A i P Θ i 1 P e r - P Θ 1 , P Θ i 2 P e r - P Θ 2 , P Θ i 3 P e r - P Θ 3 , P Θ i P e r - P Θ
Step 3. The aggregated BUI value P e r - Θ i i = 1 , 2 , , m across multiple platforms, considering user preferences, was computed.
Step 3.1. Determine the user platform preference weights P e r - w 1 , P e r - w 2 , P e r - w 3 , which, respectively, represent the user’s preference weights for the rating platform, the complaint platform, and the security platform, while ensuring that P e r - w 1 + P e r - w 2 + P e r - w 3 = 1 .
Step 3.2. Compute the multi-platform aggregated BUI value P e r - Θ i by incorporating user preferences.
P e r - Θ i = W M P e r Θ i 1 , Θ i 2 , Θ i 3 = P e r - w 1 α ¯ i + P e r - w 2 β ¯ i + P e r - w 3 γ ¯ i ; P e r - w 1 f c i 1 + P e r - w 2 f c i 2 + P e r - w 3 f c i 3 = P e r - ϑ i ; P e r - f c i , i i A i P e r - A 0
Step 4. By applying the BUI ranking method, the ranking results of the schemes within the initial set of personalized recommendations are obtained.
A σ i A σ i + 1 , P e r - Θ σ i P e r - Θ σ i + 1 , i i A i P e r - A 0

4. Product Ranking Recommendation Algorithm Based on Multi-Type Platform eWOM Data

In this section, the overall process of the automotive product ranking algorithm based on rating eWOM data, complaint eWOM data, and safety test data is introduced, as shown in Figure 4.
Step 1. The alternative vehicle product set A , rating platform set P 1 , complaint platform set P 2 , and safety test platform set P 3 are identified.
Step 2. Relying on crawler technology, obtain the eWOM data of candidate automotive product set A on platform sets P 1 , P 2 , and P 3 .
Step 2.1. Construct the user rating matrix G D i 1 l for the i-th automobile product on the l-th platform in platform set P 1 .
Step 2.2. Construct a complaint text matrix T D i j l for the j-th attribute of the i-th automotive product on the l-th platform in platform set P 2 .
Step 2.3. Construct the security level matrix D 3 l for the l-th platform in platform set P 3 .
Step 3. To convert complaint text data into calculable numerical data, we follow the complaint text conversion method described in Section 3.2.1 First, the annotation set is manually annotated to obtain a comprehensive, manually annotated risk level matrix T D j , which can then be utilized in subsequent model training.
Step 4. Based on the complaint text conversion method described in Section 3.2.1, the XLNET model is trained using the manually annotated risk level matrix T D j obtained in Step 3. Subsequently, the trained model is utilized to convert the complaint text matrix T D i j l into a quantitative matrix G D i j 2 l .
Step 5. Based on the platform reliability calculation method presented in Section 3.2, three types of platform reliability are computed.
F P 1 = f p l j 1 L 1 × J F , F P 2 = f p l j 2 L 2 × J F , F P 3 = f p l j 3 L 3 × J F
Step 6. Based on the credibility calculation method for eWOM information presented in Section 3.2, the credibility of the rating eWOM index f c i j 1 l , the credibility of the complaint eWOM index f c i j 2 l , and the credibility of the safety test index f c i j 3 l are computed.
Step 7. Based on the information aggregation method presented in Section 3.3, the information aggregation results for rating eWOM Θ i 1 , complaint eWOM Θ i 2 , and safety test data Θ i 3 were computed using the BUIOWA aggregation operator.
Step 8. Based on the product ranking method for user personalized preferences presented in Section 3.4, automobile product ranking recommendations tailored to user personalized preferences are realized.

5. A Case Study of Product Ranking Driven by Multi-Source Data

This section will employ real-world cases to demonstrate the multi-source data-driven product ranking algorithm presented in Section 4. The data originate from three types of online service platforms: user ratings, user complaints, and security tests. Suppose that all data are fully available, and the data scale is sufficiently large. Furthermore, in this case analysis, we assume that decision-makers demonstrate a relatively low level of optimistic preference inclination. Specifically, during the application of the BUIOWA operator, the orness parameter is assigned a value of 0.25.
Step 1. The alternative vehicle product set A , the rating platform set P 1 , the complaint platform set P 2 , and the safety test platform set P 3 are defined as follows:
A = A 1 = BMW   3   Series , A 2 = Benz   GLB , A 3 = Volkswagen   ID . 3 , A 4 = Ford   Explorer , A 5 = Subaru   Forester , A 6 = Honda   CR - V , A 7 = Toyota   Corolla , A 8 = Modern   Sonata , A 9 = Honda   Accord
P 1 = P 11 = autohome . com , P 12 = dongchedi . com
P 2 = P 21 = 12365 auto . com , P 22 = 315 qc . com
P 3 = P 31 = c - ncap . org . cn , P 32 = euroncap . com
Step 2. Based on crawler technology, the real data for the candidate automotive product set A on platform sets P 1 , P 2 , and P 3 were obtained. The data scale is presented in Table 1. After organizing the data, the user rating matrix G D i 1 l , complaint text matrix T D i j l , and security level matrix D 3 l were constructed.
Due to the large volume of data, only a portion of the data is presented below.
Step 2.1. The user rating matrix G D i 1 l is constructed (using the BMW 3 series coefficient data from the Autohome.com platform as an example).
G D 1 11 = 5 4 5 5 5 5 5 5 5 5 5 4 5 5 3 5 5 5 4 4 4 4 5 5 5 4 1 5 4 5 5 5 5 5 5
Step 2.2. The complaint text matrix T D i j l is constructed. Table 2 presents a portion of the complaint text contents regarding the BMW 3 series from the 12365auto.com platform.
Step 2.3. Security level matrix D 3 l is constructed.
D 3 l Τ = 96.30 % 92.42 % 89.32 % 88.44 % 91.32 % 90.07 % 91.33 % 90.64 % 89.89 % 84.05 % 74.17 % 73.19 % 66.09 % 81.05 % 80.56 % 77.20 % 65.57 % 77.79 % 95.11 % 100 % 85.98 % 100 % 95.11 % 88.09 % 100 % 100 % 83.59 %
Step 3. Manually annotate the dataset to obtain a comprehensive, manually annotated risk level matrix T D j .
T D j Τ = 4 4 3 5 5 4 2
Step 4. The XLNet model presented in Section 3.2 was trained using the aforementioned synthetically and manually labeled risk level matrix T D j . The variation trends of model accuracy and loss function during the training process are illustrated in Figure 5 and Figure 6. Subsequently, the trained model was utilized to transform the complaint text matrix T D i j l into a quantitative matrix G D i j 2 l .
Step 5. Based on the platform reliability calculation method presented in Section 3.2, the reliability of various platforms was computed.
Step 5.1. Obtain a reliability matrix for each platform. For each platform, there are 15 indicators to evaluate its reliability.
F P 1 Τ = 1 0.75 0.75 0.75 0.75 0.75 0.75 0.5 0.75 0.75 0.75 0.75 0.75 0.5 0.75 1 1 1 0.75 0.75 1 0.75 0.75 1 1 1 1 0.75 0.75 0.75 , F P 2 Τ = 1 1 1 0.75 0.75 1 0.5 0.5 0.75 0.75 0.5 0.75 0.5 0.75 0.75 1 1 1 0.75 0.75 1 0.25 0.5 0.75 0.5 0.5 0.75 0.5 0.5 0.5 , F P 3 Τ = 1 1 1 0.75 0.5 1 0.5 0.75 0.75 0.75 0.75 0.75 0.5 0.5 0.5 1 1 1 0.75 0.5 1 0.25 0.75 0.25 0.25 0.5 0.75 0.5 0.25 0.5 ,
Step 5.2. Calculate the reliability of various platforms. The reliability of each platform is obtained by aggregating the corresponding reliability values of the 15 evaluation indicators for platform reliability.
f p 1 1 = 0.454 ,   f p 2 1 = 0.546 f p 1 2 = 0.523 ,   f p 2 2 = 0.477 f p 1 3 = 0.543 ,   f p 2 3 = 0.457
Step 6. Based on the credibility calculation method presented in Section 3.2, the credibility of the rating eWOM indicator f c i j 1 l , complaint eWOM indicator f c i j 2 l , and safety test indicator f c i j 3 l is calculated.
Step 6.1. The credibility of the rating eWOM indicator is calculated as follows:
f c i j 11 ; f c i j 12 Τ = 0.556 0.724 0.556 0.584 0.556 0.584 0.556 0.864 0.696 0.724 0.696 0.584 0.556 0.584 0.836 0.724 0.696 0.724 0.556 0.584 0.556 0.584 0.696 0.444 0.556 0.864 0.556 0.584 0.556 0.584 0.556 0.584 0.416 0.724 0.416 0.584 0.416 0.584 0.556 0.584 0.556 0.584 0.556 0.724 0.416 0.444 0.416 0.584 0.556 0.724 0.276 0.584 0.556 0.584 0.696 0.584 0.696 0.584 0.836 0.304 0.276 0.864 0.696 0.444 0.416 0.584 0.556 0.584 0.556 0.724 0.696 0.584 0.556 0.584 0.556 0.724 0.416 0.584 0.556 0.724 0.416 0.444 0.556 0.584 0.556 0.584 0.416 0.584 0.416 0.584 0.556 0.724 0.556 0.584 0.696 0.444 0.556 0.864 0.556 0.584 0.556 0.724 0.556 0.584 0.696 0.584 0.556 0.584 0.556 0.724 0.416 0.584 0.416 0.584 0.556 0.864 0.416 0.584 0.416 0.724 0.556 0.724 0.276 0.584 0.416 0.584
Step 6.2. The credibility of the complaint eWOM indicator is calculated as follows:
f c i j 21 ; f c i j 22 Τ = 0.676 0.493 0.507 0.493 0.587 0.537 0.735 0.493 0.332 0.493 0.682 0.668 0.682 0.493 0.740 0.493 0.787 0.668 0.834 0.493 0.626 0.493 0.776 0.668 0.682 0.493 0.332 0.493 0.755 0.668 0.834 0.493 0.783 0.493 0.839 0.668 0.687 0.493 0.629 0.493 0.857 0.668 0.698 0.439 0.322 0.439 0.733 0.668 0.733 0.493 0.733 0.493 0.787 0.668 0.682 0.493 0.629 0.493 0.857 0.668 0.719 0.493 0.404 0.493 0.807 0.668 0.787 0.493 0.710 0.493 0.804 0.668 0.750 0.493 0.668 0.493 0.682 0.563 0.682 0.493 0.332 0.563 0.733 0.668 0.733 0.493 0.857 0.563 0.734 0.668 0.733 0.493 0.708 0.563 0.857 0.563 0.628 0.493 0.332 0.563 0.733 0.668 0.733 0.493 0.857 0.563 0.734 0.668 0.755 0.493 0.682 0.581 0.682 0.668 0.682 0.493 0.332 0.581 0.682 0.668 0.783 0.493 0.857 0.581 0.682 0.668 0.733 0.493 0.682 0.493 0.587 0.537 0.682 0.493 0.332 0.493 0.783 0.668 0.755 0.493 0.682 0.493 0.787 0.668
Step 6.3. The credibility of the safety test indicator is calculated as follows:
f c i j 31 ; f c i j 32 Τ = 0.688 0.487 0.688 0.487 0.688 0.575 0.688 0.575 0.600 0.487 0.775 0.487 0.688 0.487 0.688 0.137 0.600 0.225 0.513 0.487 0.513 0.487 0.513 0.575 0.513 0.575 0.425 0.487 0.513 0.487 0.513 0.487 0.513 0.137 0.425 0.225 0.600 0.487 0.600 0.487 0.775 0.575 0.600 0.575 0.513 0.487 0.775 0.487 0.600 0.487 0.600 0.137 0.513 0.225
Step 7. Based on the information aggregation method presented in Section 3.3, the BUIOWA aggregation function was utilized with orness α = 0.25 to calculate the aggregated results of rating eWOM Θ i 1 , complaint eWOM Θ i 2 , and safety test Θ i 3 , respectively.
Step 7.1. The multi-attribute rating matrix D 1 l for product A i i = 1 , 2 , , 9 on platform P 1 l l = 1 , 2 is derived by aggregating the ratings provided by different user groups
D 11 ; D 12 Τ = 0.880 0.725 0.845 0.725 0.913 0.675 0.895 0.800 0.970 0.750 0.970 0.825 0.833 0.600 1.000 0.750 0.955 0.850 0.913 0.775 0.828 0.650 0.953 0.775 0.895 0.725 0.870 0.775 0.835 0.675 0.850 0.650 0.750 0.675 0.845 0.700 0.855 0.775 0.860 0.650 0.798 0.775 0.768 0.750 0.850 0.725 0.855 0.675 0.898 0.625 0.750 0.700 0.908 0.700 0.970 0.875 0.933 0.825 0.988 0.800 0.898 0.800 0.920 0.725 0.935 0.825 0.890 0.725 1.000 0.750 0.948 0.875 0.895 0.750 0.903 0.825 0.848 0.600 0.870 0.700 0.803 0.600 0.648 0.600 0.788 0.575 0.750 0.700 0.710 0.600 0.793 0.750 0.888 0.650 0.980 0.755 0.878 0.750 0.875 0.725 0.850 0.700 0.893 0.675 1.000 0.700 0.860 0.700 0.734 0.700 0.765 0.625 0.863 0.575 0.893 0.725 0.853 0.650 0.850 0.600 0.883 0.650 0.750 0.725 0.765 0.600
The multi-attribute BUI matrix is constructed by integrating the multi-attribute scoring matrix and the credibility matrix of the rating eWOM indicator into a binary unit. By applying the BUIOWA aggregation operator to aggregate the attribute dimensions, the BUI ratings Θ i 1 for nine automobile brands across two platforms can be obtained.
Θ i 1 l Τ = 0.839 , 0.556 0.846 , 0.556 0.886 , 0.596 0.857 , 0.516 0.881 , 0.536 0.841 , 0 . 556 0.845 , 0.556 0.842 , 0.496 0.853 , 0.536 0.749 , 0.664 0.682 , 0.604 0.699 , 0.504 0.732 , 0.824 0.705 , 0.544 0.672 , 0.624 0.627 , 0.624 0.705 , 0.655 0.686 , 0.604
The BUI values of multiple rating platforms were aggregated to determine the comprehensive BUI value of the product.
Θ i 1 = 0.772 , 0.610 0.723 , 0.580 0.746 , 0.550 0.763 , 0.670 0.749 , 0.540 0.714 , 0.590 0.682 , 0.590 0.739 , 0.570 0.728 , 0.570
Step 7.2. The multi-attribute complaint risk level matrix D 2 l for product A i i = 1 , 2 , , 9 on platform P 2 l l = 1 , 2 is derived by aggregating the ratings provided by different user groups
D 21 ; D 22 Τ = 0.547 0.688 0.611 0.417 0.250 0.565 0.610 0.550 0.636 0.750 0.720 0.500 0.735 0.614 0.609 0.500 0.553 0.500 0.758 0.636 0.824 1.000 0.819 0.750 0.756 0.857 0.768 0.708 0.768 0.710 0.825 0.736 0.793 0.250 0.805 0.528 0.560 0.528 0.288 0.375 0.375 0.250 0.379 0.550 0.603 0.500 0.485 0.417 0.425 0.517 0.428 0.438 0.508 0.359 0.903 0.641 0.950 0.611 0.898 0.667 0.992 0.607 0.917 0.750 0.921 0.708 0.915 0.676 0.917 0.350 0.918 0.558 0.287 0.583 0.548 0.000 0.750 0.445 0.548 0.445 0.625 0.445 0.696 0.750 0.639 0.250 0.317 0.445 0.523 0.643 0.750 0.750 0.682 0.668 0.750 0.668 0.650 0.750 0.547 0.668 0.745 0.750 0.712 0.500 0.640 0.668 0.675 0.588 0.540 0.625 0.583 0.332 0.417 0.000 0.250 0.000 0.350 0.332 0.220 0.750 0.283 0.250 0.275 0.332 0.270 0.365 0.398 0.607 0.350 0.250 0.250 0.443 0.570 0.250 0.514 1.000 0.522 0.250 0.390 0.500 0.424 0.250 0.428 0.438
The multi-attribute BUI matrix is constructed by integrating the multi-attribute complaint risk level matrix and the credibility matrix of the complaint eWOM indicator into a binary unit. By applying the BUIOWA aggregation operator to aggregate the attribute dimensions, the BUI ratings Θ i 2 for nine automobile brands across two platforms can be obtained.
Θ i 2 l Τ = 0.522 , 0.731 0.525 , 0.647 0.459 , 0.788 0.516 , 0.692 0.598 , 0.341 0.553 , 0 . 725 0.533 , 0.749 0.465 , 0.777 0.513 , 0.763 0.623 , 0.493 0.411 , 0.513 0.379 , 0.609 0.460 , 0.502 0.605 , 0.521 0.530 , 0.668 0.479 , 0.493 0.377 , 0.521 0.459 , 0.668
The BUI values of multiple complaint platforms were aggregated to determine the comprehensive BUI value of the product.
Θ i 2 = 0.547 , 0.612 0.440 , 0.580 0.399 , 0.699 0.474 , 0.597 0.600 , 0.431 0.536 , 0.697 0.493 , 0.621 0.399 , 0.649 0.473 , 0.716
Step 7.3. The multi-attribute security level matrix D 3 l for product A i i = 1 , 2 , , 9 on platform P 2 l l = 1 , 2 is obtained.
D 31 ; D 32 Τ = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0.75 0.75 0.75 0.75 0.75 0.75 1 0.75 1 0.75 0.75 1 0.75 0.25 0.75 0.5 1 0.75 1 0.75 1 1 1 1 1 0.75 1 0.75 1 0.75 1 0.5 1 1
The multi-attribute BUI matrix is constructed by integrating the multi-attribute security level matrix and the credibility matrix of the safety testing evaluation indicator into a binary unit. By applying the BUIOWA aggregation operator to aggregate the attribute dimensions, the BUI ratings Θ i 3 for nine automobile brands across two platforms can be obtained.
Θ i 3 l Τ = 1.000 , 0.600 0.833 , 0.600 0.883 , 0.659 0.883 , 0.600 1.000 , 0.513 1.000 , 0 . 688 0.883 , 0.600 0.883 , 0.600 0.903 , 0.513 0.903 , 0.487 0.822 , 0.487 0.883 , 0.575 0.883 , 0.575 0.822 , 0.487 0.822 , 0.487 0.903 , 0.487 0.583 , 0.137 0.807 , 0.225
The BUI values of multiple safety testing platforms were aggregated to determine the comprehensive BUI value of the product.
Θ i 3 = 0.927 , 0.544 0.825 , 0.544 0.883 , 0.617 0.883 , 0.588 0.867 , 0.500 0.867 , 0.588 0.888 , 0.544 0.658 , 0.369 0.831 , 0.369
Step 8. Based on the product ranking algorithm that incorporates users’ personalized preferences as described in Section 3.4, personalized product ranking recommendations are realized.
Step 8.1. Construct a quadruple BUI group ¯ i = Θ i 1 , Θ i 2 , Θ i 3 , Θ i along with its corresponding sorting percentage vector P ¯ i .
¯ i Τ = 0.772 , 0.610 0.723 , 0.580 0.746 , 0.550 0.763 , 0.670 0.749 , 0.540 0.714 , 0.590 0.682 , 0.590 0.739 , 0.570 0.728 , 0.570 0.547 , 0.612 0.440 , 0.580 0.399 , 0.699 0.474 , 0.597 0.600 , 0.431 0.536 , 0.697 0.493 , 0.621 0.399 , 0.649 0.473 , 0.716 0.927 , 0.544 0.825 , 0.544 0.883 , 0.617 0.883 , 0.588 0.867 , 0.500 0.867 , 0.588 0.888 , 0.544 0.658 , 0.369 0.831 , 0.369 0.749 , 0.589 0.663 , 0.568 0.676 , 0.622 0.707 , 0.618 0.739 , 0.490 0.706 , 0.625 0.688 , 0.585 0.599 , 0.529 0.677 , 0.522
P ¯ i Τ = 100 % 33 % 66 % 88 % 77 % 22 % 11 % 55 % 44 % 88 % 33 % 22 % 55 % 100 % 77 % 66 % 22 % 44 % 100 % 44 % 77 % 77 % 33 % 66 % 88 % 22 % 55 % 100 % 22 % 33 % 77 % 88 % 66 % 55 % 11 % 44 %
Step 8.2. Preliminary results derived from the users’ desired ranking ratio.
Let the set of user expectations regarding the percentage of the sorted four-element BUI result group be denoted as follows.
P e r - P ¯ = 22 % 33 % 33 % 44 %
Based on P ¯ i and P e r - P ¯ , the initial set of personalized recommendations can be determined as follows.
P e r - A 0 = A 1 , A 4 , A 5 , A 6 , A 9
Step 8.3. Assuming that users assign equal preference weights P e r - w 1 = P e r - w 2 = P e r - w 3 = 1 / 3 to the scoring platform, complaint platform, and security platform, the comprehensive BUI value P e r - Θ i i = 1 , 2 , , 5 of multi-type platforms, based on user preferences, is computed as follows.
P e r - Θ i = 0.749 , 0.589 , 0.707 , 0.618 , 0.739 , 0.490 , 0.706 , 0.625 , 0.677 , 0.522
Step 8.4. In accordance with the BUI ranking rule, the ranking of the five types of cars in the initial recommendation set is determined as A 1 > A 5 > A 4 > A 6 > A 9 , and the final optimal car recommendation is A 1 .

6. Comparative Analysis and Management Enlightenment

6.1. Weight Sensitivity Analysis

In Step 8.3 of Section 5, decision-makers can determine the weights of the three types of platforms according to their preferences, thereby obtaining a personalized comprehensive BUI value for multi-type platforms. In this section, sensitivity analysis will be conducted by designing various platform weight configurations. Table 3 presents seven weight allocation cases, where the first three cases indicate that decision-makers consider only the eWOM data from a single type of platform; Case 4 to Case 6 indicate that decision-makers consider the eWOM data from two types of platforms; and Case 7 indicates that decision-makers consider the eWOM data from all three types of platforms. Based on these seven personalized weight cases, the comprehensive BUI values and ranking results for multi-type platforms, derived from user preferences, were calculated, respectively. The obtained results are presented in Table 4.
The results indicate that the ranking outcomes under the seven scenarios exhibit significant differences, suggesting that rating eWOM data, complaint eWOM data, and safety test data encapsulate distinct product ranking knowledge. Focusing solely on a single type of eWOM data or failing to comprehensively integrate all three types of eWOM data will lead to incomplete ranking results, which are insufficient to serve as an effective reference for consumers. Only through the comprehensive consideration of rating eWOM data, complaint eWOM data, and safety test data can the most representative product ranking outcomes be achieved.

6.2. Theoretical Comparative Analysis

In the field of MCDM, traditional baseline methods such as TOPSIS and AHP have been extensively utilized for scheme ranking. Specifically, the TOPSIS method ranks alternatives by calculating their distances from the ideal solution, while the AHP method relies on hierarchical weight analysis to determine the relative importance of each criterion.
However, when addressing multi-source heterogeneous data (e.g., online eWOM data), these traditional methods exhibit significant limitations. Multi-source heterogeneous data often possess characteristics such as uncertainty, conflict, and quality variability, which traditional methods struggle to handle effectively. In contrast, fuzzy MCDM methods, through the integration of fuzzy theory, can more flexibly quantify subjective preferences and fuzzy information, thereby alleviating some of the shortcomings of traditional approaches. Nevertheless, this method still lacks a systematic evaluation mechanism for assessing information quality, which, to some extent, constrains its application in complex decision-making scenarios.
Against this backdrop, the large-scale group decision-making method based on the BUI model presented in this study demonstrates unique advantages.
From the perspective of information quality-driven decision-making, traditional MCDM methods solely rely on evaluation data for decision-making while neglecting the credibility of the evaluation information itself. The BUI model dynamically assesses the quality of multi-source data, enabling comprehensive consideration of both evaluation information and its credibility. This approach enhances the robustness of decisions and improves the reliability of the decision structure, leading to more scientific and reasonable outcomes.
Regarding large-scale group fusion, the BUIOWA operator achieves dynamic weight distribution. On the one hand, it simulates different decision-maker preferences by adjusting the orness parameter, such as maximizing pessimistic or optimistic criteria, thereby accommodating diverse decision-making scenarios. On the other hand, the BUIOWA operator effectively mitigates the influence of low-credibility information on aggregation results, thus enhancing the reliability of decision-making outcomes.
This decision-making mechanism, which integrates information quality analysis, enables BUIOWA to significantly outperform traditional MCDM methods and baseline models in scenarios such as product ranking that depend on multi-source eWOM data. It provides more reliable support for complex decision-making processes and assists decision-makers in achieving more scientific and reasonable decisions when confronted with complex multi-source heterogeneous data. Table 5 presents the comparative analysis results of the method proposed in this paper against other MCDM methods.

6.3. The Management Enlightenment of Enterprise Sustainable Development

In the era of big data, various online service platforms encompass diverse types of consumer behavior preference data, offering enterprises critical insights into market dynamics and consumer demands. For instance, among the three categories of eWOM data examined in this study, rated eWOM data quantitatively reflect consumers’ actual experiences with enterprise products. By integrating such data, enterprises can assess the overall reputation and competitiveness of their products in the market. Complaint eWOM data highlight areas where consumers are most dissatisfied with enterprise products. A thorough analysis of these data enables enterprises to precisely identify weaknesses in their operations. Safety test data, evaluated from a professional automotive safety perspective, demonstrate the safety performance of products and provide targeted directions for safety technology development and improvement. Compared to relying on a single type of eWOM data, comprehensively considering multi-type user eWOM data and extracting consumer behavior preferences is essential for enterprise sustainability. Consequently, enterprises should proactively adjust management strategies and prioritize the application of data analysis technologies in uncovering consumer behavioral preferences. Additionally, enterprises must establish a robust data analysis framework to ensure the effective extraction of consumer preference information from multi-dimensional eWOM data. By leveraging multi-source data-driven consumer behavior analysis capabilities, enterprises can enhance their sensitivity to market changes, promptly adjust strategic directions, improve product market competitiveness, and achieve sustainable development objectives.

7. Discussion and Conclusions

7.1. Discussion

This study utilizes multi-source eWOM data to extract reliable consumer behavior preferences for personalized product ranking and recommendation. Nevertheless, there are certain limitations in this study, along with potential future research directions, which are outlined below.
(1)
Algorithm update mechanism: The product ranking and recommendation system developed in this study is heavily reliant on data. Given the rapid advancement of new energy vehicles, novel types of user complaints may arise in critical areas such as battery performance and intelligent cabin functionalities. The existing risk assessment model may struggle to promptly adapt to and address these emerging issues. Consequently, it is essential to establish a systematic mechanism for continuous updates and maintenance to ensure the ongoing accuracy and applicability of the model.
(2)
Cross-platform data access and interoperability constraints: Currently, the strong independence of different platforms, coupled with the presence of anti-crawling mechanisms, renders the data acquisition process heavily reliant on manual intervention. This not only results in low data collection efficiency but also undermines the system’s cross-platform interaction capabilities and real-time performance. In the future, it is advisable to develop intelligent agents equipped with cross-platform interaction capabilities to facilitate automated access, parsing, and interaction of multi-platform data, thereby enhancing the system’s overall data integration capabilities.
(3)
Computational demands for large-scale model deployment: This study employs the XLNet model to handle the transformation of complaint text, which imposes a significant demand on computing resources, particularly in large-scale scenarios. With the advancement of general-purpose large language models, future approaches could involve leveraging pre-trained general models alongside lightweight fine-tuning techniques to facilitate rapid deployment, alleviate the computational burden, and improve the flexibility and cost-effectiveness of system scalability.
(4)
User privacy and data protection concerns: When processing sensitive data, such as user complaints, stringent privacy protection measures are required due to the substantial risk of data breaches. In the future, data anonymization techniques could be incorporated during the preprocessing phase, including desensitization (e.g., removing personally identifiable information like names, phone numbers, and license plates) and entity substitution (e.g., replacing user identity details with unified identifiers). Additionally, differential privacy mechanisms could be implemented by introducing controlled noise into the training datasets, thereby effectively preventing the model from reverse-engineering original data points and ensuring robust user privacy and data security.
(5)
Model explainability and result transparency: Complex models frequently exhibit a lack of explainability when generating outputs for non-technical stakeholders, thereby hindering their ability to support managerial decision-making or foster user trust. In the future, during the design of product ranking and recommendation systems, it is advisable to incorporate explainability modules such as attention visualization and decision path tracking. These techniques can enhance model transparency and improve its practical usability and reliability in real-world business contexts.
Scalability and cross-domain application: This study uses the recommendation process of car products as a case to illustrate the practical application of the multi-source data-driven personalized product ranking and recommendation approach. This method demonstrates broad applicability, enabling it to effectively support car product recommendations while also being adaptable to other domains that rely on user-generated online content for personalized recommendations, such as electronic consumer goods, tourism services, and online education platforms. When applying this method to different fields, it is essential to reassess the sources and types of multi-source reputation data, as these vary significantly across domains in terms of generation channels and presentation formats. Additionally, given the distinct characteristics of reputation data in various fields, it is necessary to reevaluate the MCDM indicators based on the evaluation categories of reputation data. This ensures the scientific rigor and effectiveness of the method when applied to personalized recommendations across diverse domains, thereby achieving precise product ranking and recommendation outcomes.

7.2. Conclusions

In the digital era, various online service platforms collect diverse types of eWOM data, which contain rich product experience information. This provides an essential foundation for enterprises to understand consumer behavior preferences and optimize their marketing and management strategies. However, existing data-driven product ranking research predominantly focuses on single-data-source analysis, failing to fully exploit the integration value of multi-source eWOM data. Additionally, data quality issues further limit its application potential. This study comprehensively incorporates rating eWOM data, complaint eWOM data, and safety test data, focusing on the development of a product personalized ranking recommendation algorithm driven by multi-source data. The main contributions of this study are as follows:
(1)
Construction of a multi-source data-driven evaluation index system: Based on the distinct characteristics of the three types of data, a scientific and comprehensive product evaluation index system is established for each data type. Specifically, seven product evaluation indicators are designed for rating eWOM data, eight product complaint indicators are formulated based on complaint eWOM data, and three safety performance evaluation indicators are developed using safety test data. This systematic design provides a solid foundation for subsequent data analysis and knowledge extraction.
(2)
Information quality modeling and data transformation: To address the issue of information quality, this study adopts the basic uncertain information (BUI) model as a standardized representation for eWOM information, encompassing two dimensions: rating information and credibility. For rating information, given that only complaint eWOM data is in textual form, the XLNet model is utilized to transform complaint texts into scoring data. Regarding credibility, a tailored reliability evaluation model is developed for the three types of data, enabling the standardized conversion of all data into the BUI format.
(3)
BUI aggregation and personalized ranking method: Effective aggregation of BUI is achieved through the BUI order weighted average (OWA) operator. On this foundation, a product ranking method incorporating user preferences is proposed. This approach enables comprehensive consideration of individual consumer needs and preferences, thereby providing users with the anticipated ranking outcomes.
(4)
Case analysis and managerial implications: Using automobile products as an example, this study details the multi-source data-driven personalized product ranking recommendation process and validates the effectiveness and feasibility of the proposed algorithm. Furthermore, this study highlights the managerial value of multi-source eWOM data: since such data can reflect consumers’ behavioral preferences from diverse perspectives, enterprises can leverage data analysis tools to precisely capture market dynamics, develop more targeted marketing strategies, and achieve sustainable development.

Author Contributions

Conceptualization, Y.Y. and J.P.; methodology, J.P. and Y.Y.; software, J.P. and M.J.; validation M.J.; formal analysis, M.J.; investigation, M.J.; data curation, J.P.; writing—original draft preparation, J.P., writing—review and editing, M.J.; visualization, M.J.; supervision, Y.Y.; project administration, M.J.; funding acquisition, Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Hunan Provincial Innovation Foundation for Postgraduate, China (Grant no. CX20240094), the General Project of Xiangjiang Laboratory, China (Grant no. 23XJ03002), the Interdisciplinary Research Project at Hunan University of Technology and Business, China (Grant no. 2023SZJ13), the Project of Hunan Natural Science Foundation (Grant no. 2024JJ5115).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the first author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Example A1 ([11]). Assume five initial BUI inputs B U I i = ( ( x ( i ) , c ( i ) ) ) i = 1 5 are, respectively, ( ( 0.7 , 0.5 ) , ( 0.3 , 0.8 ) , ( 0.8 , 0.0 ) , ( 0.3 , 0.2 ) , ( 0.5 , 0.5 ) ) for n = 5 ; that is, x = 0.7 , 0.3 , 0.8 , 0.3 , 0.5  and c = 0.5 , 0.8 , 0.0 , 0.2 , 0.5 .
Suppose we adopt a family of OWA weight vectors, having orness  α = 0.25 , W = { w ( s ) } s = 1 , which includes (only for  1 s 5 ): w ( 1 ) = ( 1 ) w ( 2 ) = ( 0.25 , 0.75 ) w ( 3 ) = ( 0.1 , 0.3 , 0.6 ) w ( 4 ) = ( 0.05 , 0.15 , 0.3 , 0.5 ) , and w ( 5 ) = ( 0.025 , 0.075 , 0.15 , 0.375 , 0.375 )
The computation can use the following factorization of  0 , 1  (as in the computation of discrete Choquet integral). Based on different threshold intervals  t , the deterministic sets containing BUI labels can, respectively, be obtained as   E t = [ 5 ] = 1 , 2 , 3 , 4 , 5 ( t { 0 } ) ,  E t = { 1 , 2 , 4 , 5 } ( t ( 0 , 0.2 ] ) ,  E t = { 1 , 2 , 5 } ( t ( 0.2 , 0.5 ] ) ,  E t = { 2 } ( t ( 0.5 , 0.8 ] ) , and  E t = ( t ( 0.8 , 1 ] ) .
Then, using Equation (11), when t 0 ,  UOWA W ( x | E 1 ) = 5 5
OWA w ( 5 ) ( x | [ 5 ] ) + ( 1 1 5 ) mean ( x | ) = OWA w ( 5 ) ( x | [ 5 ] ) = ( 0.025 ) ( 0.8 ) + ( 0.075 ) ( 0.7 ) + ( 0.15 ) ( 0.5 ) + ( 0.375 ) ( 0.3 ) + ( 0.375 ) ( 0.3 ) = 0.3725
UOWA W ( x | E 1 ) = 4 5 OWA w ( 4 ) ( x | { 1 , 2 , 4 , 5 } ) + ( 1 4 5 ) mean { 3 } ( x | { 3 } ) = 0.8 OWA w ( 4 ) ( 0.7 , 0.3 , 0.3 , 0.5 ) + 0.2 mean { 3 } ( 0.8 ) = 0.8 0.35 + 0.2 0.8 = 0.44
when t 0.2 , 0.5 ,
UOWA W ( x | E 1 ) = 3 5 OWA w ( 3 ) ( x | { 1 , 2 , 5 } ) + ( 1 3 5 ) mean { 3 , 4 } ( x | { 3 , 4 } ) = 0.6 OWA W 3 0.7 , 0.3 , 0.5 + 0.4 mean 3 , 4 0.8 , 0.3 = 0.6 0.1 0.7 + 0.3 0.5 + 0.6 0.3 + 0.4 mean 3 , 4 0.8 , 0.3 = 0.6 0.4 + 0.4 0.55 = 0.46
when t 0.5 , 0.8 ,
UOWA W ( x | E 1 ) = 1 5 OWA w ( 1 ) ( x | { 2 } ) + ( 1 1 5 ) mean { 1 , 3 , 4 , 5 } ( x | { 1 , 3 , 4 , 5 } ) = ( 0.2 ) OWA w ( 1 ) ( 0.3 ) + ( 0.8 ) mean { 1 , 3 , 4 , 5 } ( 0.7 , 0.8 , 0.3 , 0.5 ) = ( 0.2 ) ( 0.3 ) + ( 0.8 ) ( 0.575 ) = 0.52
when t 0.8 , 1 ,
UOWA W ( x | E 1 ) = 0 5 OWA w ( 0 ) ( x | ) + ( 1 0 5 ) mean { 5 } ( x | { 5 } ) = mean { 5 } ( x | { 5 } ) = 0.52
Then,
P x ( BUIOWA W ( ( x , c ) ) ) = 0 1 UOWA W ( x | E 1 ) d t = [ 0 ] UOWA W ( x | E 1 ) d t + ( 0 , 0.2 ] UOWA W ( x | E 1 ) d t + ( 0.2 , 0.5 ] UOWA W ( x | E 1 ) d t + ( 0.5 , 0.8 ] UOWA W ( x | E 1 ) d t + ( 0.8 , 1 ] UOWA W ( x | E 1 ) d t = 0 0.3725 + 0.2 0 0.44 + 0.5 0.2 0.46 + 0.8 0.5 0.52 + 1 0.8 0.52 = 0.486
Moreover,
P c ( BUIOWA W ( ( x , c ) ) ) = 1 5 i = 1 5 c ( i ) = 0.4
Consequently,
BUIOWA W ( ( x , c ) ) = ( 0.486 , 0.4 )

References

  1. Hájek, P.; Sahut, J. Mining behavioural and sentiment-dependent linguistic patterns from restaurant reviews for fake review detection. Technol. Forecast. Soc. 2022, 177, 121532. [Google Scholar] [CrossRef]
  2. Song, Y.; Li, G.; Li, T.; Li, Y. A purchase decision support model considering consumer personalization about aspirations and risk attitudes. J. Retail Consum. Serv. 2021, 63, 102728. [Google Scholar] [CrossRef]
  3. Fan, Z.; Li, G.; Liu, Y. Processes and methods of information fusion for ranking products based on online reviews: An overview. Inform. Fusion 2020, 60, 87–97. [Google Scholar] [CrossRef]
  4. Rocklage, M.; Rucker, D.; Nordgren, L. Mass-scale emotionality reveals human behaviour and marketplace success. Nat. Hum. Behav. 2021, 5, 1323–1329. [Google Scholar] [CrossRef]
  5. Jin, L.; Mesiar, R.; Borkotokey, S.; Kalina, M. Certainty aggregation and the certainty fuzzy measures. Int. J. Intell. Syst. 2018, 33, 759–770. [Google Scholar] [CrossRef]
  6. Mesiar, R.; Borkotokey, S.; Jin, L.; Kalina, M. Aggregation under uncertainty. IEEE Trans. Fuzzy Syst. 2018, 26, 2475–2478. [Google Scholar] [CrossRef]
  7. Kou, G.; Yang, P.; Peng, Y.; Xiao, H.; Xiao, F.; Chen, Y.; Alsaadi, F.E. A cross-platform market structure analysis method using online product reviews. Technol. Econ. Dev. Economy 2021, 27, 992–1018. [Google Scholar] [CrossRef]
  8. Zhou, S.; Meng, Z.; Huang, Z.; Zhang, H.; Wang, D. A Hybrid Prospect–Regret Decision-Making Method for Green Supply Chain Management Under the Interval Type-2 Trapezoidal Fuzzy Environment. Sustainability 2025, 17, 3323. [Google Scholar] [CrossRef]
  9. Nguyen, J.; Montserrat-Adell, J.; Agell, N.; Sánchez, M.; Ruiz, F.J. Fusing hotel ratings and reviews with hesitant terms and consensus measures. Neural Comput. Appl. 2020, 32, 15301–15311. [Google Scholar] [CrossRef]
  10. Luo, Y.; Zhang, X.; Qin, Y.; Yang, Z.; Liang, Y. Tourism attraction selection with sentiment analysis of online reviews based on probabilistic linguistic term sets and the IDOCRIW-COCOSO model. Int. J. Fuzzy Syst. 2021, 23, 295–308. [Google Scholar] [CrossRef]
  11. Jin, L.; Chen, Z.S.; Yager, R.R.; Senapati, T.; Mesiar, R.; Zamora, D.G.; Dutta, B.; Martínez, L. Ordered weighted averaging operators for basic uncertain information granules. Inform. Sci. 2023, 645, 119357. [Google Scholar] [CrossRef]
  12. Jin, L.; Mesiar, R.; Senapati, T.; Jana, C.; Ma, C.; García-Zamora, D.; Yager, R.R. Ordered weighted geometric averaging operators for basic uncertain information. Inf. Sci. 2024, 663, 120275. [Google Scholar] [CrossRef]
  13. Yang, Y.; Chen, Z.S.; Pedrycz, W.; Gómez, M.; Bustince, H. Using i-subgroup-based weighted generalized interval t-norms for aggregating basic uncertain information. Fuzzy Sets Syst. 2024, 476, 108771. [Google Scholar] [CrossRef]
  14. Chen, Z.S.; Martinez, L.; Chang, J.P.; Wang, X.J.; Xionge, S.H.; Chin, K.S. Sustainable building material selection: A QFD- and ELECTRE III-embedded hybrid MCGDM approach with consensus building. Eng. Appl. Artif. Intell. 2019, 85, 783–807. [Google Scholar] [CrossRef]
  15. Chen, Z.S.; Yang, L.L.; Chin, K.S.; Yang, Y.; Pedrycz, W.; Chang, J.P.; Martínez, L.; Skibniewski, M.J. Sustainable building material selection: An integrated multi-criteria large group decision making framework. Appl. Soft Comput. 2021, 113, 107903. [Google Scholar] [CrossRef]
  16. Yang, Y.; Jie, M.; Chen, Z. Dynamic three-way multi-criteria decision making with basic uncertain linguistic information: A case study in product ranking. Appl. Soft Comput. 2024, 152, 111228. [Google Scholar] [CrossRef]
  17. Jin, L.; Chen, Z.S.; Pedrycz, W.; Senapati, T.; Yatsalo, B.; Mesiar, R.; Beliakov, G.; Martínez, L. Aggregation of basic uncertain information with two-step aggregation frame. IEEE Trans. Emerg. Top. 2024, 8, 1102–1109. [Google Scholar] [CrossRef]
  18. Liu, L.; Shao, F.; He, C. Selection of Green Recycling Suppliers for Shared Electric Bikes: A Multi-Criteria Group Decision-Making Method Based on the Basic Uncertain Information Generalized Power Weighted Average Operator and Basic Uncertain Information-Based Best–Middle–Worst TOPSIS Model. Sustainability 2024, 16, 8647. [Google Scholar] [CrossRef]
  19. Yang, Y.; Xia, D.X.; Pedrycz, W.; Deveci, M.; Chen, Z.S. Cross platform distributed product online ratings aggregation approach for decision making with basic uncertain linguistic information. Int. J. Fuzzy Syst. 2024, 26, 1936–1957. [Google Scholar] [CrossRef]
  20. Zhang, X.; Chen, L.; Sheng, G.; Lu, X.; Ming, X. An innovation service system and personalized recommendation for customer-product interaction life cycle in smart product service system. J. Clean. Prod. 2023, 398, 136470. [Google Scholar] [CrossRef]
  21. Xiong, S.; Yuan, Y.; Yao, J.; Bai, B.; Ma, X. Exploring consumer preferences for electric vehicles based on the random coefficient logit model. Energy 2022, 263, 125504. [Google Scholar] [CrossRef]
  22. Bi, J.W.; Han, T.Y.; Yao, Y.; Li, H. Ranking hotels through multi-dimensional hotel information: A method considering travelers’ preferences and expectations. Inf. Technol. Tour. 2022, 24, 127–155. [Google Scholar] [CrossRef]
  23. Song, Y.; Li, G.; Zhu, H. Multisource Data Driven Product Ranking Model With Heterogeneous Customers. IEEE Trans. Eng. Manag. 2023, 71, 9440–9459. [Google Scholar] [CrossRef]
  24. Fan, Z.; Xi, Y.; Liu, Y. Supporting consumer’s purchase decision: A method for ranking products based on online multi-attribute product ratings. Soft Comput. 2018, 22, 5247–5261. [Google Scholar] [CrossRef]
  25. Wu, X.; Liao, H. Modeling personalized cognition of customers in online shopping. Omega 2021, 104, 102471. [Google Scholar] [CrossRef]
  26. Geng, X.; Yang, Z. Scheme recommendation based on grey correlation prediction and trust cloud hybrid algorithm. Comput. Integr. Manuf. Syst. 2020, 26, 980–988. [Google Scholar]
  27. Yang, Y.; Yang, F.; Yi, G.; Xia, D.; Li, J. Product online multidimensional ratings aggregation decision-making model based on group division and attribute interaction. Eng. Appl. Artif. Intel. 2023, 126, 106835. [Google Scholar] [CrossRef]
  28. Dwivedi, P.; Dilip, K. Evaluation and ranking of battery electric vehicles by Shannon’s entropy and TOPSIS methods. Math. Comput. Simulat. 2023, 212, 457–474. [Google Scholar] [CrossRef]
  29. Bi, J.W.; Liu, Y.; Fan, Z.P. Representing sentiment analysis results of online reviews using interval type-2 fuzzy numbers and its application to product ranking. Inform. Sci. 2019, 504, 293–307. [Google Scholar] [CrossRef]
  30. Yin, S.; Wang, Y.; Shafiee, S. Ranking products through online reviews considering the mass assignment of features based on BERT and q-rung orthopair fuzzy set theory. Expert Syst. Appl. 2022, 213, 119142. [Google Scholar] [CrossRef]
  31. Shifan, H.; Wang, Y. Evaluating new energy vehicles by picture fuzzy sets based on sentiment analysis from online reviews. Artif. Intell. Rev. 2023, 56, 2171–2192. [Google Scholar]
  32. Tian, Z.P.; Liang, H.M.; Nie, R.X.; Wang, X.K.; Wang, J.Q. Data-driven multi-criteria decision support method for electric vehicle selection. Comput. Ind. Eng. 2023, 177, 109061. [Google Scholar] [CrossRef]
  33. Yang, Z.; Li, Q.; Charles, V.; Xu, B.; Gupta, S. Supporting personalized new energy vehicle purchase decision-making: Customer reviews and product recommendation platform. Int. J. Prod. Econ. 2023, 265, 109003. [Google Scholar] [CrossRef]
  34. You, T.; Zhang, J.; Fan, Z. Multi-Attribute Online Review Decision Making Method Based on Sentiment Analysis and Evidence Theory. J. Syst. Manag. 2019, 28, 536–544. [Google Scholar]
  35. Liu, Y.; Jiang, C.; Zhao, H. Using contextual features and multi-view ensemble learning in product defect identification from online discussion forums. Decis. Support Syst. 2018, 105, 1–12. [Google Scholar] [CrossRef]
  36. Yu, Q.; Nie, Y.; Peng, S.; Miao, Y.; Zhai, C.; Zhang, R.; Han, J.; Zhao, S.; Pecht, M. Evaluation of the safety standards system of power batteries for electric vehicles in China. Appl. Energ. 2023, 349, 121674. [Google Scholar] [CrossRef]
  37. Yang, Z.; Dai, Z.; Yang, Y.; Carbonell, J.; Salakhutdinov, R.R.; Le, Q.V. XLNet: Generalized autoregressive pretraining for language understanding. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Brooklyn, NY, USA, 8–14 December 2019; pp. 5753–5763. [Google Scholar]
Figure 1. EWOM ratings and their corresponding evaluation indicator on the Autohome website.
Figure 1. EWOM ratings and their corresponding evaluation indicator on the Autohome website.
Sustainability 17 04078 g001
Figure 2. Complaint eWOM and their corresponding evaluation indicator on the 12365auto.com website.
Figure 2. Complaint eWOM and their corresponding evaluation indicator on the 12365auto.com website.
Sustainability 17 04078 g002
Figure 3. C-ncap platform and safety evaluation index.
Figure 3. C-ncap platform and safety evaluation index.
Sustainability 17 04078 g003
Figure 4. Flowchart of the multi-source data-driven product ranking algorithm.
Figure 4. Flowchart of the multi-source data-driven product ranking algorithm.
Sustainability 17 04078 g004
Figure 5. Variation trend of training accuracy for the XLNet model.
Figure 5. Variation trend of training accuracy for the XLNet model.
Sustainability 17 04078 g005
Figure 6. Variation trend of training loss function for the XLNet model.
Figure 6. Variation trend of training loss function for the XLNet model.
Sustainability 17 04078 g006
Table 1. The volume of data for individual car brands across various platforms.
Table 1. The volume of data for individual car brands across various platforms.
BrandRating PlatformComplaint PlatformSafety Test Platform
P 11 P 12 P 21 P 22 P 31 P 32
A 1 40002704234012311
A 2 2795643482411
A 3 13975122043311
A 4 144810632541911
A 5 25642173821211
A 6 40001322415110911
A 7 40001314430213511
A 8 40002831242111
A 9 40002682346012311
Table 2. A partial complaint text regarding A 1 is available on the 12365auto.com platform.
Table 2. A partial complaint text regarding A 1 is available on the 12365auto.com platform.
Complaint NumberBrandComplaint Text Contents
1,079,482 A 1 During the driving process, the vehicle frequently experiences shaking, similar to a person shivering. We request the manufacturer to address and resolve this issue.
1,074,623 A 1 During parking and driving, the steering wheel emits a creaking sound, as though something is shaking internally. Additionally, when braking at low speeds, a strong and harsh noise occurs. Given that I purchased this vehicle less than a week ago, the manufacturer is responsible for providing a satisfactory solution.
1,074,254 A 1 Having driven less than ten thousand kilometers, the car battery has already failed. Local repairs have detected that the battery does not hold a charge. Typically, a battery should last four or five years without issue. It is unclear why the BMW battery has performed so poorly, and a replacement is required.
1,069,589 A 1 The paint on the body contains burrs and defects, which indicate quality issues that need to be addressed.
1,064,656 A 1 At high speeds of 110 km/h, the car will produce noise, while no sound occurs at low speeds. This issue may affect usability and requires a resolution. At high speeds of 110 km/h, the car will produce noise, while no sound occurs at low speeds. This issue may affect usability and requires a resolution.
Table 3. Case design for user platform preference weight.
Table 3. Case design for user platform preference weight.
Case P e r - w 1 P e r - w 2 P e r - w 3
Case 1100
Case 2010
Case 3001
Case 41/21/20
Case 51/201/2
Case 601/21/2
Case 71/31/31/3
Table 4. Multi-platform comprehensive BUI values based on user preferences across diverse cases.
Table 4. Multi-platform comprehensive BUI values based on user preferences across diverse cases.
Brand A 1 A 4 A 5 A 6 A 9
Case 1BUI(0.772, 0.610)(0.763, 0.670)(0.749, 0.540)(0.714, 0.590)(0.728, 0.570)
Rank12354
Case 2BUI(0.547, 0.612)(0.474, 0.597)(0.600, 0.431)(0.536, 0.697)(0.473, 0.716)
Rank24135
Case 3BUI(0.927, 0.544)(0.883, 0.588)(0.867, 0.500)(0.867, 0.588)(0.831, 0.369)
Rank12334
Case 4BUI(0.660, 0.611)(0.619, 0.634)(0.675, 0.486)(0.625, 0.644)(0.601, 0.643)
Rank24135
Case 5BUI(0.850, 0.577)(0.823, 0.629)(0.808, 0.520)(0.791, 0.589)(0.780, 0.470)
Rank12345
Case 6BUI(0.737, 0.578)(0.679, 0.593)(0.734, 0.466)(0.702, 0.643)(0.652, 0.543)
Rank14235
Case 7BUI(0.749, 0.589)(0.707, 0.618)(0.739, 0.490)(0.706, 0.625)(0.677, 0.522)
Rank13245
Table 5. A comparative analysis of different MCDM methods.
Table 5. A comparative analysis of different MCDM methods.
MethodsData ApplicabilityUncertaintyInformation Quality
TOPSISStructural data
AHPHierarchical structure data
Fuzzy MCDMFuzzy data
Proposed methodMulti-source heterogeneous data
√ indicates that the method incorporates the factors.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, Y.; Jie, M.; Pan, J. Multi-Source Data-Driven Personalized Recommendation and Decision-Making for Automobile Products Based on Basic Uncertain Information Order Weighted Average Operator. Sustainability 2025, 17, 4078. https://doi.org/10.3390/su17094078

AMA Style

Yang Y, Jie M, Pan J. Multi-Source Data-Driven Personalized Recommendation and Decision-Making for Automobile Products Based on Basic Uncertain Information Order Weighted Average Operator. Sustainability. 2025; 17(9):4078. https://doi.org/10.3390/su17094078

Chicago/Turabian Style

Yang, Yi, Mengqi Jie, and Jiajie Pan. 2025. "Multi-Source Data-Driven Personalized Recommendation and Decision-Making for Automobile Products Based on Basic Uncertain Information Order Weighted Average Operator" Sustainability 17, no. 9: 4078. https://doi.org/10.3390/su17094078

APA Style

Yang, Y., Jie, M., & Pan, J. (2025). Multi-Source Data-Driven Personalized Recommendation and Decision-Making for Automobile Products Based on Basic Uncertain Information Order Weighted Average Operator. Sustainability, 17(9), 4078. https://doi.org/10.3390/su17094078

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop