1. Introduction
Group decision-making is a process that human beings must carry out on a daily basis. We can define a group decision-making process as the selection of one option from a set of options. We call these the alternatives [
1,
2]. In real life, decisions that can affect a large group of people are made, making it is necessary to have the support of computerised systems to assist group decision-making processes and the application of methods that allow an objective, organised and fair process to be carried out [
1]. Opinions are fundamental to almost all human activities because they are key factors that influence our behaviours. The acquisition of public and consumer opinions has long been a big business in itself for marketing, public relations and political campaigning companies [
3].
Social networks represent a rich source of data that can be accessed for study and analysis. Individuals and organisations are increasingly using its content to carry out decision-making processes. Among other things, new automatic tools need to be developed to extract information from the networks, analyse it and enable the system to understand the course of the debate. One of the major stumbling blocks in this process is that the information represented in these media uses natural language, which means that special methods are needed for its analysis.
Current decision-making methods that extract information from networks work with sentiment analysis methods. In this way, by analysing experts’ contributions to the debate, it is possible to determine the degree of an expert’s preference for an alternative. However, no measure of control over how that result has been extracted and no information other than the calculated preference is established. This can be a problem, as it is impossible to know whether the preference extracted is real or whether the text being analysed has been misinterpreted. Therefore, establishing measures of error or reliability of the information with which these types of automatic preference extraction methods work is very important if we want to be able to trust the results they provide. Within a group decision-making process, it is important that the group of experts obtain a high degree of consensus prior to the application of the selection process [
4]. In order to obtain a high level of consensus among the experts, it is advisable to provide the whole group of experts with some advice (feedback) on how far the group is from consensus, what are the most controversial issues (alternatives), what are the major disagreements with the rest of the group and how their change could influence the overall degree of consensus [
5].
In this article, we present a new group decision-making method that works on debates held on social networks. Our objective is to create a new group decision-making method that, based on a debate carried out in a social network, extracts the information from the debate, analyses it, obtains the preferences of the experts on the alternatives and, subsequently, determines the final decision reached. In addition, our method will be able to calculate the consensus reached among the participants. The extraction of preferences is done by using sentiment analysis techniques. This type of technique works on written texts using unstructured natural language. We will perform a dichotomous analysis of the texts (positive sentiments and negative sentiments), of the texts associated with each alternative mentioned by the expert. For the calculation of consensus we propose two new measures of consensus. One is based on the degree of subjectivity of the natural language expressions provided by the experts in the debate. The other measure will compare the preference values extracted for each expert and each alternative. Our method will also include the measurement of opinion reliability. This is calculated as the ratio of positive and negative words used in the discussion. For instance, if an expert uses both positive and negative words in the description of an alternative, then the preference extracted is not as clear as if only positive or negative words were used. In order to take this measure into consideration, we will use Z-numbers [
6] to represent the preference values, as this type of representation allows us to represent the reliability of a piece of data.
The article is organised as follows.
Section 2 details the basic concepts necessary to understand the method presented. In
Section 3, the proposed method is presented. In
Section 4, an applied example of the proposal is proposed using a real example in the social network X. In
Section 5, we make a comparison of our method with similar methods already published. We will end by showing some conclusions in
Section 6.
2. Preliminaries
In this section, the basics necessary to understand the proposed method are presented. In
Section 2.1, the basics of group decision-making are presented. In
Section 2.2, the sentiment analysis procedure is explained. Finally, in
Section 2.3, Z-numbers are presented, which we will use to establish the reliability of the preferences extracted from the discussion.
2.1. Group Decision-Making
In this section we describe the basic concepts of group decision-making processes. In these processes, two or more experts express their preferences over a set of alternatives in order to rank them according to their suitability for solving a given problem. A classic way for experts to provide preferences to the system is through the use of preference relations. A group decision-making problem using preference relations can be defined as follows: Starting from a set of preferences [
7],
, and a group of experts
, each expert,
, provide their preferences over
X using a preference relation,
, where
is the preference degree of alternative
over
for
[
8]. A group decision-making problem consists on sorting
X by using the preference values
, that experts have provided to the system.
Within group decision-making processes, two processes are of great importance: consensus and selection. The consensus process allows the experts to reach a final solution that is relatively agreed upon by all, i.e., that has an adequate level of agreement among all experts. The use of consensus measures turns the group decision-making process into a dynamic and iterative process, which may be composed of several rounds where experts express themselves, discuss the different alternatives and try to bring positions closer together. The selection process, on the other hand, uses all individual preferences to obtain a final ranking of alternatives. The alternative that comes first is the most appropriate solution to the problem.
Generally, to solve the formulated problem, most of the methods in the literature follow these steps [
2]:
Providing preferences: First, experts discuss the available alternatives. Next, the experts are asked to provide their preferences [
9].
Aggregating the information: All expert preferences are aggregated into a single piece of collective preference information representing the overall expert opinion [
10]. There are a multitude of techniques such as, for example, additive weighted aggregation (AWA) [
11]. This operator allows us to derive the ranking of the given alternatives in order to find the most desirable one. Another example is the ordered weighted average (OWA) family of operators [
12].
Calculating consensus: Calculating consensus measures can help us to know whether the experts have reached an agreement or, on the contrary, whether they need to discuss the issue further to reach a reasonable level of consensus. It is important, for the sake of improving the reliability of decision-making results, to allow experts to conduct a thorough discussion before reaching a final decision. Therefore, if the consensus is lower than desired, the experts may be asked to conduct a further round of discussion to bring them closer together. Generally, a maximum number of rounds is also established in case the experts do not reach consensus within a reasonable period of time [
13].
Calculating alternatives ranking: Using the collective preference matrix as a basis, the alternatives are ranked according to the level of preference provided by the experts [
10]. This step returns a ranking of alternatives ordered according to the experts’ preferences.
In
Figure 1, we can see a graphical representation of the defined process. Group decision-making processes have a strong presence in the current scientific literature. For example, in [
14], a consensus process for large-scale group decision-making is demonstrated, relying on limited trust and social networks. In [
15], a new model of expressed and private opinions is developed in a scenario where individual private views are updated by reasoning based on the neighbors’ private opinions derived from their actions, while the expressed opinions are influenced by social pressure. In [
16], a consensus cost-oriented method for group decision-making is created, which utilises incomplete probabilistic linguistic preference relations. In [
17], a group decision-making method driven by dual consistency relying on fuzzy preference relations among individuals and groups is introduced. In [
18], the authors present a new decision-making method where sentiment analysis is used for the detection of aggressive behaviour. Finally, in [
19], the authors present a new decision-making method that aims to determine the best hotel selection based on user reviews.
2.2. Sentiment Analysis
Traditionally, the most common way to reveal emotions to others was through speech and facial expressions. But with the advancement of technology and social media, people express their emotions through text messages and emoticons on social media. Although the advancement of technology allows users of social media platforms to show their emotions through multimedia content, text is still the most common form used for communication. People profess their emotions through their messages on them (e.g., statuses, comments, blogs, microblogs, …) [
20].
Sentiment analysis methods [
21] are concerned with identifying how feelings are expressed in texts. We can identify all kinds of concrete feelings: anger, surprise, sadness, joy, etc. We can also make a more general analysis. For example, if in a text we look for expressions that indicate positive (favourable) or negative (unfavourable) reactions towards the subject, we can study the positivity and negativity associated with the subject that the author is dealing with. Specifically, sentiment analysis involves the identification of the following:
Polarity refers to the overall tone of the expression, that is, positive, negative or neutral. Thanks to the polarity analysis of the expression, it is possible to know what is the opinion and sentiment that the writer has on the commented topic. Concretely, it is possible to know how the topic make them feel. The strength of the sentiment allows us to know how much importance the writer attaches to the subject matter.
Thanks to this analysis, we can extract information from free text. For example, we can analyse the opinion of users of online documents such as web pages, chat rooms and news articles, instead of doing special surveys with questionnaires that make the user have to provide information explicitly. The result of the sentiment analysis process is a confidence value and a label, such as positive, negative, mixed, neutral or unknown [
22]. Sentiment analysis methods provide a degree of occurrence of the sentiment being analysed, which allows us to identify what the author’s goal or sentiment was when writing the text.
In general, sentiment analysis has been investigated mainly at three levels: at document level, at sentence level and at aspect and entity level. Depending on the context and the topic under discussion, we can distinguish between two types of sentences [
3]:
Descriptive sentences: In this case, the user is describing a certain topic or element. Therefore, we can apply sentiment analysis to identify a sentiment or several sentiments based on what is stated.
Comparative sentences: This type of sentence compares several elements with each other. In this case, sentiment analysis must be able to identify the elements to be compared and look for comparative expressions that relate them. In this way, it is possible to identify which elements the user prefers.
Some of the most frequently applied techniques in the area are as follows [
23]:
Sentiment classification: This primarily involves categorizing opinions into three key groups: positive, negative or neutral. The result of many methods is typically a value that falls within a specific range. Various scales can be utilised to assess an opinion, such as the interval [−1, 1], with −1 representing the highest level of negativity and 1 denoting the highest level of positivity.
Subjectivity classification: This mainly consists of detecting whether a given sentence is subjective or not. The higher the percentage of positive and negative words used in the text compared to the total number of words, the higher the degree of subjectivity in the text.
Summary of opinions: It is especially focused on extracting the main features of a given topic from a set of texts. It also identifies the feelings associated with such texts.
Opinion mining: Efforts to obtain documents that convey a viewpoint on a specified query. In this kind of system, two evaluations are necessary for every document: the relevance to the query and the perspective on the query. Both are typically employed to assess the ranking of documents.
Sarcasm and irony: It focuses on detecting ironic and sarcastic content on the texts. There is great difficulty in defining reliable techniques that allow the computer to detect them [
24].
The following is a typical procedure for sentiment analysis in social networks [
2]:
We select the target feeling; for example, satisfaction, adoration, sadness.
We generate the list of words associated to the sentiment: .
We extract the texts to be analysed from the social networks. To select the texts, we can use keywords. In this way, we can identify if a text is relevant to the topic or if, on the contrary, it is irrelevant and not worth analysing.
The obtained texts, , are analysed.
Final results are shown. It is important to notice that each text can be related with more than one sentiment.
The application of sentiment analysis has generated very good results in fields such as marketing, psychology and information extraction. For example, in [
25], the authors present a new method based on sentiment analysis that allows measuring people’s perceptions by combining Neurolinguistic Programming (NLP) dictionaries together with machine learning. In [
26], the authors apply a business context-aware decision-making approach to select the most appropriate sentiment analysis technique in e-marketing situations. In [
27], the authors identify the different sentiment components that are vital to extract topics, sentiments and opinions from the analysed text. In [
28], they develop a transformer architecture for sentiment anlysis and clasiffications of hotels opinions. In [
29], authors present a review on prompt-based sentiment analysis methods. In [
30], they present a novel sentiment analysis approach that uses knowledge-aware dependency graph networks.
2.3. Z-Numbers
Much of the decisions that humans make are based on imprecise information or on the experience of people who express themselves using inexact concepts rather than numerical data. Conceptual information cannot be directly processed by the computer. Therefore, we need to use information representation models that can represent imprecise information and that, in turn, can be managed by a computational system. One of the methods used to represent imprecise information are the Z-numbers [
31].
Formally, Z can be represented as . The first component, A, is a restriction on the values that a real-valued fuzzy variable can take, X. The second component, B, is a measure of reliability or certainty of the first component. For our method, we decided to use the Z-numbers as they allow us to use the reliability component to represent the level of confidence we have in the preferences extracted from the discussion transcripts.
The presence of Z-numbers in scientific articles has grown exponentially in recent years. For example, in [
32], a review of the state of the art on z-numbers in group decision-making methods is presented. In [
33], authors develop a preference-based regret three-way decision method on multiple decision information systems that uses fuzzy numbers for information representation. Finally, in [
34], authors present a hybrid decision approach based on Z-numbers for evaluating blockchain implementation solutions in the sustainable supply chain.
3. Group Decision-Making with Consensus Based on Subjectivity and Sentiment Analysis
The newly designed group decision-making process and the associated consensus measures presented in this paper are shown in this section. Our method automatically extracts experts’ preferences by analysing the transcripts of the discussion conducted. For this purpose, we use sentiment analysis. Then, using the preferences and the information provided by the sentiment analysis process, we identify the degree of consensus among opinions. We can also measure the degree of subjectivity that experts have when giving their opinions. Finally, Z-numbers allow us to calculate a degree of reliability on the extracted preference value. Thus, if an expert provides both positive and negative expressions about a given alternative, then the degree of reliability will be low. If the expressions associated with an alternative are all positive or negative, then the degree of reliability will be high. The method developed follows these steps:
Decision-making process initialization: First, it is necessary to define the parameters of the decision-making process to be carried out. To do this, the topic of discussion, the set of possible alternatives, the participants, and the posts to be extracted are defined.
Selection of the data to be analysed: Next, we extract the data from social network X. We must identify which alternatives each text is referring to. To do this, we can make use of hashtags. Each text has the alternative and the expert it refers to assigned.
Preference extraction: Using sentiment analysis techniques, we extract the experts’ preferences from the texts. For this, we use two sets of words, one with positive and accepting expressions and the other with negative or rejecting expressions. We also use Z-numbers to obtain a measure of the reliability of the information obtained.
Consensus calculation: Based on the text provided by the experts, the results of the sentiment analysis process and the preferences obtained, we calculate the consensus reached among the experts. For this purpose, we propose two measures, one that calculates the consensus based on the preferences obtained and another that is based on the subjectivity of the comments provided to the system.
Final ranking of alternatives calculation: Once the consensus reached is high, we aggregate the preferences and, using the selection measures, the final ranking of the alternatives is calculated.
The proposed method has been implemented in R and tested using real data obtained from the social network X using the X API. X is a very interesting social network for this type of analysis as each post is limited to 140 characters. Thanks to this, users of the network are forced to be specific and concise about their contributions. In
Figure 2, we can see an outline of the process that the method carries out. In the following subsections, we will describe each of the mentioned steps in detail.
3.1. Decision-Making Process Initialization
First of all, we must define the decision environment in which we are going to work. To do so, we will follow these steps:
Defining the topics for discussion: The first thing to do is to define the use of a mechanism to obtain the experts’ texts from the social network. The social network X uses hashtags to identify different topics and elements to discuss. It is very common for users to engage in conversations on a particular topic identified by a particular hashtag. Each hashtag can also represent an alternative. In
Figure 3 we can see some selected texts by the hashtag #ClaroEcua.
Set the number of participants: The number of participants in a given social media discussion can be fixed or open. The more experts participate in a discussion, the more points of view it will have and, therefore, the more interesting the result will be. In the developed model, we do not limit the number of participants. Once the debate is over, a list of hashtags is made with the information they used, which will be original and unique for each of the alternatives to be analysed. We define the set of experts as .
It is possible to assign different weights to the experts, depending on their degree of knowledge or experience on the topic. In our proposed model, we adopt the same weight for all experts.
Establish the number of posts: The number of posts obtained depends on the input provided by experts on the proposed topics. They are collected for further analysis and use in the decision-making process.
3.2. Selection of the Data to Be Analysed
A post is a complex object (see
Figure 4 and
Figure 5), with a lot of associated properties [
35] such as Text, Username, Mention, Replies, Follower, RePost, Hashtag and Privacy. For the analysis of the proposed group decision-making process, we focused on the written text, as this is where the user presented the arguments for or against the different alternatives.
For the implementation and testing of the proposed method, we have used the R Software and the X social network API. The X API allows us to obtain the posts published by users and their corresponding metadata [
35,
36]. This API facilitates the extraction of data in a structured format that can be easily analysed. The R software contains a library that allows us to extract information from the X API and carry out the analysis process.
Before being able to extract preferences and carry out the group decision-making process with consensus, it is important to debug the text to remove all those elements that may hinder the analysis. Therefore we have to remove hashtags, connectors, conjunctions, words without semantic relevance, hyperlinks, the X(@) handle and URLs from the tags. Once this process is finished, the text is reduced to the most relevant set of words. This will make the extraction of preferences more efficient and reliable, as there is less text to analyse and the analysed text is relevant.
As we have already mentioned, hashtags allow us to identify which are the alternatives that experts are describing in their posts. Moreover, as each post is assigned to a user account, we can determine which post belongs to which user. The X API gives us access to this information in an organised way through a set of queries to a database.
Once the texts assigned to each alternative and expert have been obtained, in the next section we will see how to extract the preferences for subsequent use in the group decision-making process and the calculation of consensus and subjectivity.
3.3. Preference Extraction
After initialising the decision-making process and obtaining the data, our method must apply sentiment analysis to obtain the experts’ preferences for each of the alternatives. The reliability of the obtained preferences will also be calculated. This whole process is carried out through the following steps:
Obtain the sentiment score: Once we have selected the relevant texts from each of the experts, sentiment analysis must be applied for each of them. To do this, we will use functions from the R Syuzhet package [
37]. This package provides four sentiment dictionaries and a method to access the robust sentiment extraction tool developed in the Stanford NLP group. In addition, it allows the use of a dictionary that associates words with ten different emotions (anger, anticipation, disgust, fear, joy, sadness, surprise, trust, negative and positive) (see
Figure 6). For our research we will only consider positive and negative, since our aim is to determine whether the expert is for or against each of the alternatives.
Obtain the polarity of the post: In this step, the method obtains the polarity of a post
, i.e., the positive or negative degree
obtained from expert survey from the alternative of
sensitive. The following expression is used:
where
refers to the number of positive words contained in
and
the number of negative words.
The result of this process is a numerical value in the range [−1,1]. The closer the result is to the value 1, the more evident the positivity of the post will be. Conversely, the closer the result is to −1, the greater the disagreement with the alternative described. Finally, all those values close to 0 would imply that the expert has a neutral attitude towards the alternative, neither in favour nor against it. In
Figure 7, a post analysis example result is shown graphically. The score for each post, as mentioned above, is in the range of [−1,1], where 1 represents very positive polarity and −1 very negative.
Once the preference has been extracted, we change the domain of the value from the interval [−1,1] to the interval [0,1]. To do this, we apply the following expression:
Calculating the reliability of preferences: Once the preference value has been obtained, we can calculate its associated reliability. For this, as we have already mentioned, we will analyse the number of positive and negative words associated with each alternative. Therefore, if the alternative has many more positive words than negative words associated with it or vice versa, this will mean that the reliability is high. Conversely, if the expert rates the alternative with a similar amount of positivity and negativity, then the expert’s opinion regarding the alternative is unclear. Once the preference and its associated reliability value are obtained, we can create the Z-number representing the preference. For a clearer and more user-friendly representation of preferences that allows a more convenient analysis of expert opinion, we will use linguistic labels to represent each calculated preference value. We will associate each preference value with a linguistic label using the conversion described in the
Table 1. Once the linguistic label representing the preference value has been obtained, the associated reliability value is calculated. To accomplish this, we use the following expression:
where
is the number of positive words,
the number of negative words and
.
The more extreme, i.e., the closer it is to 1/−1, either positive or negative, the more polarised the expression is and, therefore, the clearer the expert’s positivity and negativity. Conversely, the less difference there is between positive and negative words, the less reliable the preference is and the less clear the expert’s opinion of the alternative is.
3.4. Group Decision-Making with Consensus Measures
Once the preferences have been calculated from the transcripts of the discussion text, the group decision-making process with consensus can begin. To do this, the method will follow the steps described below:
Preferences calculation. In the previous section, we used sentiment analysis on the debate transcripts to calculate the preferences of each expert on each alternative. To do this, we created sets of texts for each alternative and expert and extracted the preferences using a sentiment analysis process. The great advantage of this process is that the experts do not have to explicitly provide the preferences to the system. In the following, we will use this information to initiate a decision-making process. We will represent the expert’s preference over the alternative as . is a value located in the interval [0,1], with 0 indicating a total disagreement with the alternative and 1 a total agreement with it.
Consensus calculation. Once the alternatives have been defined, it is possible to calculate the consensus associated with the round of the decision-making process. To do this, we will measure the closeness of the experts’ opinions to obtain a degree of consensus. As we know, in practice, it is sometimes not possible to reach a full and unanimous consensus, but it is important to try to get experts to debate and bring positions closer together. The goal of consensus measures is to help decision-makers reach an agreement on the best solution to a decision-making problem. Therefore their role is increasingly important in solving group decision-making problems. In this article, we propose two different measures of consensus. One measures the closeness of opinions and the other compares experts according to the subjectivity of their opinions. To calculate consensus based on opinion, we use the following expression:
where:
where
n is the total number of alternatives and
m is the number of experts.
For the calculation of the consensus based on the subjectivity of their opinions, we can use the following expression:
Expression (
8) is for positive words and Expression (
9) is for negative ones. The measure of subjectivity in the debate is a key detail that can help us determine which experts show a more vehement attitude in the discussion and which, on the contrary, are more passive or indifferent to the subject under debate.
Once the consensus has been calculated, the following two actions can be carried out:
Thanks to these two actions, we try to involve all experts as much as possible in the debate. It is very important that they try to discuss and bring positions closer together in order for the final decision to be as consensual as possible. It is also important that all experts are involved in reaching it. To motivate experts to try to reach consensus, it is possible to set a threshold above which, if not exceeded, experts are allowed to conduct a longer discussion. Our method allows thresholds to be set for both measures. However, for the sake of brevity, we will set only a consensus threshold of for the opinion-based consensus measure.
Calculation of the collective preference matrix and final ranking. The preference values calculated for each of the experts are aggregated to calculate the collective preference value specific to each alternative. In this way, we can determine the overall expert opinion. Formally, we aggregate all the
values using Expression (
7).
As the values of lie between [0,1], the resulting collective preference values also lie in the same interval. With the values it is possible to score each of the alternatives, order them and generate the ranking that allows us to determine which solution are the best. It is important to remember that for our study, all experts are considered to be of equal importance in the analysis process.
A complete description of the presented method is shown in Algorithm 1.
Algorithm 1 Sentiment-based GDM with Consensus and Reliability Measures (SGDM-CR) |
Require: Alternatives A, Posts T, Sentiment Dictionaries D, Threshold Ensure: Final ranking, Consensus level, Expert feedback
- 1:
Initialise decision-making process (hashtags, experts, post collection) - 2:
Clean and preprocess posts (remove hashtags, links, stopwords) - 3:
for each post do - 4:
Identify associated alternative and expert - 5:
Compute sentiment score: - 6:
Normalise polarity: - 7:
Compute reliability: - 8:
end for - 9:
Aggregate values to get - 10:
Build preference matrix P and reliability matrix Z - 11:
Compute consensus - 12:
Compute subjectivity per expert: - 13:
if then - 14:
Compute collective preference - 15:
Rank alternatives by descending - 16:
else - 17:
Provide expert feedback and restart from preprocessing - 18:
end if - 19:
return Ranking, , subjectivity and reliability feedback
|
5. Discussion and State-of-the-Art Comparison
In this paper, we have presented a new group decision-making method that uses sentiment analysis to automatically extract the system preferences. In order to measure the consensus in the process, we present two different measures. One is based on the extracted preferences and the other is based on the subjectivity that experts show in the discussion. In addition, to measure the reliability of the preferences extracted from the debate, we use Z-numbers. The main advantages provided by the presented method are the following:
Preference extraction method based on sentiment analysis: Preference information is obtained directly from the texts that experts provide to the system during the discussion. This approach has two advantages. On the one hand, experts do not have to provide preferences to the system. On the other hand, the preferences are extracted using all the information provided by the experts during the discussion as a source. When we ask the expert to provide preferences, in the end, we are working with summarised information. However, sentiment analysis methods allow us to extract the information from the discussion so that everything that has been discussed in the discussion is taken into account.
Using Z-numbers to measure the reliability of the extracted information: Our paper uses Z-numbers to establish a measure of the reliability of the information provided by the experts in the debate. Thus, if an expert provides both positive and negative expressions about an alternative, then the extracted preference will be unreliable. Conversely, if the expert only uses positive or negative expressions to define an alternative, then the reliability of the extracted preference will be high.
Consensus-taking methods calculated directly from the text of the debate: Analysis of the debate makes it possible to establish measures of expert participation in the debate. Thus, the proposed method proposes consensus measures that take into account the involvement of the experts in the debate. That is, if an expert is very clear in their opinion and provides a high percentage of positive expressions depending on the text analysed, then they can be considered to be very vehement in their opinion. On the contrary, if this percentage is low, then it can be stated that the expert is not very involved or acts neutral. Our method uses this measure of subjectivity in opinion to create a new measure of consensus which, together with the measure of consensus calculated from preferences, gives us a lot of information about how the debate is developing and what the predominant positions are.
Calculation of the ranking of alternatives based on the information from the debate: Our method converts the information provided by the experts into preferences and calculates the ranking by using selection methods over the obtained preferences. Thanks to this process, our method is capable of determining which alternatives are preferred by the experts. As the extraction of preferences is automatic, this method works even in decision environments where there is a high number of experts and alternatives. Therefore, it is ideal for application in discussions conducted om social networks.
Applicable to any field: Our proposed model can extract preferences in any field of application, such as tourism, medicine, politics or any other topic where knowledge of users’ opinions and emotions can inform decisions. Although the method was initially designed for use in social networks, there is no reason why it cannot be applied in other types of environments, such as written transcripts of debates, closed forums or other types of decision support systems.
When there is a large amount of text provided by experts and these texts contain words that allow the positivity or negativity of each expert’s discourse to be established, then the proposed method works adequately. However, there may be cases where some experts have provided very little information to the system or where the texts they share are very neutral. In these cases, it is difficult to automatically extract their opinion. The method will assign that expert a neutral opinion value for the alternative, neither positive nor negative. To solve this, it is possible to ask the expert to provide new information to the system and suggest that they express their opinion more directly. The positivity and negativity of opinions is linked to experts using words collected in the dictionaries used for sentiment detection. There may be cases in which the dictionary used does not include the terminology associated with a particular topic or the information does not match to how a particular expert expresses themself. It should be noted that these cases will be minimal, as these dictionaries contain terms that are commonly used by everyone, regardless of the topic being discussed. However, if this occurs, specific dictionaries with the terms used in the topic being discussed or by the expert should be used. Another possible solution is to expand existing dictionaries with new information. Similarly, if we want to work with texts that are not in English or that use a specific tone, it is necessary to include dictionaries that contain the positive, negative and neutral terms for the required cases.
In recent state of the art on the topic, there are few papers that include consensus in a sentiment-analysis-related group decision-making process. For example, in [
38], the authors present a novel group decision-making method that uses a sentiment analysis approach to extract preferences from texts and, afterwards, apply consensus measures over the results. For representing the information, they employ distributed linguistic preference relations (DLPRs) for representing the information once it is extracted from the texts. A sentiment analysis approach using dictionaries is used for extracting the preferences. On [
39], authors study the phenomenon of echo chambers in group decision-making problems by applying sentiment analysis and consensus measures. They extract opinions from social media and calculate a sentiment polarity matrix. By the use of Aspect-Based Sentiment analysis (ABSA), they use a polarity analysis to calculate consensus and detect echo chambers. In [
18], authors present a novel group decision-making process that uses sentiment analysis for detecting aggressiveness and positivity in experts’ comments. Since the method is centred in dealing with a high amount of information, they use clusters of experts in order to be able to handle it.
Although there are methods like the ones mentioned above that use consensus measures and sentiment analysis in group decision-making methods, they do not include any reliability or subjectivity measures. Therefore, they extract less information from the debate and have less means to improve the decision process performance.
In [
40], authors use sentiment analysis to detect neutral opinions and improve sentiment analysis procedures in order to carry out a better automatic preference extraction process. They employ the Induced Ordered Weighted Averaging (IOWA) operator for aggregating the information. In order to calculate consensus, they employ the proximity function to the neutral point. This paper already worked on detecting neutrality and identifies the concepts of subjectivity and reliability. Nevertheless, the paper is centred on detecting and filtering all those neutrality contributions. In our paper, we developed two novel measures in order to detect and work with those two cases separately. Instead of filtering unreliable texts, we give the experts a chance to modify their way of expressing themselves in order to be clearer. In the case of experts being too neutral, we can recommend to them that they be more involved in the process.
In [
19], authors resolve the problem of hotel selection by using a hotel database review. They apply sentiment analysis to extract the preferences and group decision-making methods to select the best hotel. The information extracted is represented by using fuzzy preference relation matrices. Also, the consensus measures are applied over the preferences extracted. This paper is centred on solving one specific problem using a specific review database, while our proposed method is prepared to work over social media discussions. Therefore, it is capable of extracting the information in real time from social media and applying the group decision-making process.
In [
26], the authors use group decision-making techniques to identify the best sentiment analysis technique to apply in a given situation. They use a hierarchical Analytic Hierarchy Process (AHP) approach with an automatised learning method. Therefore, they do not use sentiment analysis for the same purposes as in our paper. Also, they do not provide consensus measures over the opinions found.
When searching in the literature, we could not find a single article that applies Z-numbers to sentiment-analysis-related group decision-making processes. There are several articles, such as [
41,
42], that apply Z-numbers to represent preferences in group decision-making methods. However, their reasons for its use differ from the ones proposed in our article. For consensus calculation in [
41], they employ an aggregation of preferences based on the extended OWA operator. On the contrary, in [
42], they employ the Z-number weighted Bonferroni mean (OZWBM).
In conclusion, the mentioned articles are the closest ones to the research carried out in this article that we could find in research databases. Therefore, we could not find any research that directly deals with the reliability and neutrality of experts’ contributions in group decision-making processes. The goal of this paper, therefore, has been to design novel consensus measures that provide a solution to this issue.
All the methods analysed are designed to be applied in different scenarios and cover different needs, making it difficult to make a fair comparison between them. Depending on the context and needs, some will be more suitable than others. For example, when dealing with large amounts of information, the method in [
18] may be the most suitable, because it implements tools that allow for the processing of large amounts of information. When detecting different patterns such as neutrality in opinions or the echo chamber effect, the most appropriate methods to use are [
39,
40], respectively. If we are interested in carrying out the decision-making process and obtaining measures of subjectivity and reliability in the extraction of comments, our method is the most appropriate, as it includes these measures.
With regard to the efficiency of the methods, it should be noted that the method used to represent preferences has a significant impact, since the simpler the representation of preferences, the faster the final calculation. Also, aggregation operations provide faster results. However, these faster methods end up providing less information about the decision-making process, since the more complex the representation of the information, the more details are represented and the more information we obtain at the end of the process. The use of Z-numbers means that the processes that handle them take longer to process the information. Specifically, it takes twice as long, because two values per preference have to be processed instead of one. However, their use allows the level of reliability in the extraction of the information provided to be identified and stored, which can be very useful when studying the reliability of a preference that has been automatically generated from a text.
Table 12 shows a comparison of the characteristics of all the models analysed. Finally, in
Table 13, the methods analysed in this section are compared based on a series of characteristics.
The presented method has some limitations that we will improve in the future. One of the main objectives of the method presented is to be able to carry out a group decision-making process based on information provided in social networks. For this reason, and given that we are interested in knowing whether a user is for or against an issue, we have opted for a dichotomous analysis of emotions. In this way, we determine whether the user is for or against the alternative being discussed. However, it is possible, in future work, to try to perform a deeper analysis of certain emotions that are common in texts written on social networks. For instance, joy, sadness, anger or surprise could be detected. Another limitation of the method is that it does not detect irony and sarcasm. However, as it is a scalable method, it is possible to include a phase in which an existing sarcasm and irony analyser is applied, for example [
43,
44], to detect these situations and be able to deal with them appropriately. Finally, in order to carry out an adequate analysis, it is necessary that the texts provided by the users are sufficiently informative, i.e., that they contain enough words considered positive and negative. Otherwise, the method considers the text to be neutral and it becomes very difficult to really understand how the user feels about the topic. In this case, a possible solution is to communicate to the user that it is not possible to extract their preferences and encourage them to provide longer and more informative texts to the system. Without sufficient information about what a user thinks about an alternative, the method cannot perform adequately. It is possible to determine whether a user has provided enough information by setting a minimum number of
. In this way, if the user’s texts about the alternative do not reach the required minimum, it is possible to stop the analysis and ask for more information.