5.1. Datasets
We conducted the experiments of sentiment leader identification in positive/negative sentiment systems using the data collected from Slashdot (
http://snap.stanford.edu/data/soc-sign-Slashdot090216.html, accessed on 8 July 2025), which is a technology-related news website known for its specific user community. Its news content covers a wide range of technological fields, including hardware, software, the internet, and more. What is special about Slashdot is that all the news on the website is voluntarily submitted by users, and after being reviewed and selected by editors, it is published on the homepage. Each news story is accompanied by a comments section, where users can actively participate in discussions and express their views. That is to say that Slashdot allows users to tag each other as friends or foes. The network contains friend/foe links between the users of Slashdot. We firstly built the social networks based on the trust relationships between users. Then, we considered their attitudes in the system regarding news in order to determine a user’s sentiment That is to say that they both share the same sentiment if he/she trusts another user’s review. Otherwise, they are in different sentiment communities.
Table 1 illustrates the data storage representation of a dataset based on the trust relationships between users.
Table 1 contains three columns (userIDs and the trust/distrust relationship). Each row denotes the trust/distrust relationship between two users. If two users trust reciprocally, they both are considered to share the same sentiments. Otherwise, they have different sentiments. For example, users “1”, “112”, “604”, and “605” share the same sentiment. They should be in the same sentiment community (for example,
). Since user “1” distrusts user “163” and “522”, they should be in different sentiment communities. Hence, users “163” and “522” are in another sentiment community (for example,
). Thus, user “483” and user “522” are in sentiment community
.
For the positive/negative sentiment representation system of Case 1,
Table 2 presents the statistics of Dataset I which contains 81,871 nodes and 545,671 edges, with 422,349 trust relationships and 123,322 distrust relationships between users.
Figure 4 shows the visualization of this social network. Obviously, the positive sentiment dominates the whole social network, rather than the negative sentiment.
For the five-star rating sentiment representation system of Case 2, we collect the product rating dataset (Dataset II) from TaoBao.com (
www.taobao.com, accessed on 8 July 2025), which is the largest E-commerce social networking site with the query keywords of the latest smartphone made by Apple Corporation “Apple iPhone 3GS White (16 GB) Smart phone”. Consequently, we obtained 110 reviews made by 110 customers. Consequently,
Figure 5 shows the visualization of this social network.
5.2. Detection of Sentiment Communities
The problem of detecting the sentiment communities is solved using the aforementioned SDP optimization methodology for two types of sentiment representation systems. Let us take
Table 1 as an example and obtain the two sentiment communities (positive and negative sentiment communities) under Case 1.
We run the SDP optimization approach of Case 1 on the Slashdot social network dataset to get the positive and negative sentiment communities.
Figure 6a shows the positive sentiment community, including 422,349 positive edges; i.e., the consumers have positive sentiment toward a product. On the contrary,
Figure 6b shows the negative sentiment community, including 123,320 negative edges; i.e., the consumers have a negative sentiment on the product.
As shown in
Figure 6, the negative sentiment community is quite sparse compared to the positive sentiment community. Hence, this is consistent with real-life sentiment distribution on a product; i.e., most customers hold a positive sentiment on a new product, while a small fraction of customers hold a negative sentiment on the product.
Figure 7 reports the detection results of sentiment communities of Dataset II. We run the SDP optimization approach of Case 2 on Dataset II and obtain five sentiment communities in terms of rating on “Apple iPhone 3GS White (16 GB) Smart phone".
5.4. Evaluation Metrics
To evaluate the performance of the proposed SentiRank algorithm from perspectives on the link structure and content, the one-step sentiment coverage and all-path sentiment coverage are introduced.
One-step sentiment coverage: One-step sentiment coverage is defined as the number of nodes that are directly infected via this set of nodes. This is a special case of sentiment coverage.
All-path sentiment coverage: All-path sentiment coverage is defined as the number of nodes that are directly or indirectly infected via this set of nodes. This is a general concept of sentiment coverage.
To obtain the sentiment coverage of the above algorithms, we incorporate the topological information of the social network; e.g., if two nodes are connected with a link and have the same sentiment, there exists a sentiment interaction/sentiment propagation between them. However, in the sentiment communities, this type of tropology is broken and divided into some substructures of the social network. For all these algorithms, we compare the sentiment coverage at a given time is subjected to different seed sets with the size ranging from 1 to 46 and with the step width = 5 in Dataset I. For Dataset II, we compare the sentiment coverage subject to different seed sets with the size ranging from 1 to 5 and with the step width = 1. Then, we evaluate the sentiment coverage of SentiRank with respect to sentiment communities for two datasets.
5.5. Experimental Results and Analysis
We performed independent t-tests to compare the mean sentiment coverage between SentiRank and each baseline method. The results are summarized in
Table 3.
For the experimental results presented in
Table 3, statistical analyses were conducted across multiple baselines. The homogeneity of variance (
p-values for homogeneity: 0.9887 for degree, 0.9441 for closeness, 0.9437 for betweenness, and 0.0339 for random) varied among the baselines. The T-statistics were +1.1004 (degree), +2.1566 (closeness), +1.7450 (betweenness), and +7.0184 (random). Corresponding p-values were 0.2857 (degree, non-significant), 0.0448 (closeness, significant at conventional levels), 0.0980 (betweenness, marginally non-significant), and 0.0000 (random, highly significant). Effect sizes (d) were +0.4921 (degree), +0.9644 (closeness), +0.7804 (betweenness), and +3.1387 (random). Overall, closeness and random baselines demonstrated significant effects (*), while degree did not reach significance, and betweenness was marginally non-significant, indicating differential impacts across these network-related baselines in the experiment.
Table 4 and
Table 5 present statistical analyses of experimental results for different baselines (degree, closeness, betweenness, and random) under
= 0.05. In
Table 4, for
Figure 8, the random baseline shows a significant result (marked with *) due to a low
p-value (0.0019), large t-statistic (+4.5596), and high effect size (+2.8838), while others have non-significant
p-values. In
Table 5, for
Figure 9, the random baseline also has a significant result (
p-value = 0.0104) with a notable t-statistic (+3.3265) and effect size (+2.1039), and the other baselines have higher
p-values, indicating non-significance. Overall, the random baseline demonstrates significant effects in both analyses compared to the other baselines.
We denote the sentiment community with five stars as
. The top five sentiment leaders are identified via
SentiRank.
.
Figure 9 and
Figure 10 show the one-step sentiment coverage and all-path sentiment coverage of different algorithms in terms of various
K values. In both cases, the performance of our proposed algorithm
SentiRank is the same as the degree-based algorithm [
44]. However, the closeness-based algorithm [
45] is worse than the degree-based algorithm and
SentiRank.
As shown in
Table 4 and
Table 5, the statistical analysis shows that the inter-group differences between the SentiRank and random methods are statistically significant (
p < 0.05) and have a large effect size, indicating the high reliability of the results. Therefore, their practical significance can be discussed in detail.
In the sentiment communities with three stars and with one star , there are 6 and 16 users within the corresponding sentiment community, respectively. In both cases, this customer is the sentiment leader in the corresponding sentiment community.
The problem addressed in this paper and the proposed solution are benefit to social marketing and social advertising. Identifying the sentiment leaders in social networks can help businesses understand what consumers are thinking about their products (
http://venturebeat.com/2011/03/20/why-sentiment-analysis-is-the-future-of-ad-optimization/, accessed on 10 March 2025). Nowadays, social measurement for companies is becoming pervasive in marketing organizations. Identifying the sentiment leaders can help businesses track in real time how a new product has been spreading and decide what opinions to re-promote in order to grease the wheels of message spread. By analyzing the sentiment of users and sentiment leaders, marketers are able to predict exactly how a marketing campaign will perform and, in real time, incorporate strategies into the campaign.
In addition, exploring whether sentiment leaders predominantly share positive or negative sentiments, and how their influence correlates with specific topics (e.g., politics, culture, or consumer trends), would indeed enrich the analysis. This could reveal whether their impact stems from reinforcing existing emotional biases or challenging them and whether certain themes inherently attract more emotional leadership. Such depth would clarify whether these leaders operate as niche emotional curators or cross-topic influencers, enhancing the understanding of their role in shaping network-wide emotional dynamics.