Detection of Hidden Communities in Twitter Discussions of Varying Volumes
Abstract
:1. Introduction
2. Related Work
2.1. Selecting Community Detection Algorithms
- The need to use additional metrics to estimate the distance between vertices, and, in the case of oriented graphs, this choice is very narrow;
- The limitations of computing on large-dimensional graphs (with more than 50,000 nodes) (in particular, the speed of searching for new clusters is significantly reduced, and, on super-large graphs (with more than 500,000 nodes), this class of algorithms cannot get a result without additional optimization);
- Limited application to sparse data [10].
2.2. Communities in Networked Discussions: A Social Scientific Perspective
3. Models and Methods in SNA
3.1. User Discussion Model
3.2. Description of Community Detection Methods
3.2.1. The Directed Louvain Algorithm
3.2.2. The Leiden Algorithm
- Local node movement;
- Improving partition;
- Enhanced partition-based network aggregation using a non-enhanced partition to create an initial partition for the aggregate network.
3.2.3. The Directed Label Propagation Algorithm
- The algorithm assigns a unique label to each node;
- Each node selects a label among its neighbors based on the frequency of occurrence;
- If the distribution of labels reaches a steady state, the algorithm stops, otherwise it returns to step 2;
3.2.4. The Infomap Algorithm
3.2.5. The Generalized K-Means Algorithm
- The initialization stage: k centroid vertices are randomly selected;
- The assignment stage: using Voronoi diagrams with centroids to divide the set of vertices into subsets;
- The update stage: building subgraphs, calculating PageRank for each subgraph, and updating the centroids.
3.2.6. The Order Statistics Local Optimization Method
- The search for significant clusters before convergence;
- Analysis of the resulting set of clusters to detect their internal structure or possible associations;
- Discovery of the hierarchical structure of clusters.
3.2.7. The Speaker–Listener Propagation Algorithm
- The memory of each node is initialized with the identifier of this node (unique label).
- The steps are repeated until the stopping criterion is met:
- one node is selected as a listener;
- each neighbor of the selected node sends one tag following a certain conversation rule, e.g., choosing a random tag from its memory with a probability proportional to the frequency of occurrence of this tag in memory;
- the listener accepts a label from a collection of labels received from neighbors following a specific listening rule, e.g., choosing the most popular label from what it has observed at the current stage.
- Finally, post-processing based on in-memory labels of nodes is used to display communities.
3.3. Evaluation Metrics for Community Detection
- The NEDindex [30] value ranges from 0 to 1. Cluster nodes are more strongly connected to the entire graph if NEDindex tends towards 1, and vice versa, cluster nodes are weakly connected if NEDindex is close to 0.This metric indicates the strength of the connection between the cluster vertices relative to the entire graph.
- Directed modularity [31] is an extended version of the modularity metric [32] for directed graphs. The higher the value, the better the result.Modularity reflects the concentration of edges within communities compared with random distribution of links between all nodes regardless of communities. Modularity also shows the effectiveness of the method to detect large communities.
- Clustering Coefficient is used in this paper as a version of the clustering coefficient [33] extended for directed and weighted graphs.In this metric, the average value of the ratio of existing triangles based on the vertex i to all kinds of triangles based on this vertex is considered, that is, the completeness of the relationship between the vertices is considered. The closer the metric value is to 1, the better the community detection.
- Conductance [34] is the proportion of the total number of edges outside the community for unweighted networks or the proportion of the total weight of such edges for weighted networks. This metric allows you to know the “conductivity” of the resulting community. The closer the conductance value to 0, the better the quality of the community.
- Contraction [34] measures the average number of edges per node within community C or the average weight per node of such edges. This metric shows how important this community is to the rest of the graph. The closer the contraction value to 1, the better the quality of the community.
- Expansion [34] measures the average number of edges (per node) outside the community C, or the average weight per node of such edges. This metric shows how strongly this community is connected to the rest of the graph. The lower the expansion value, the better the quality of the community.
- Community Fitness [35] calculates the ratio of the total indegree number of the community C to the total degree of α, where is α a positive number that controls the size of communities. This allows you to find out the density of detected communities. The higher the community fitness value, the better result we get.
4. Experiment
4.1. Experiment Description
4.2. The Datasets
5. Results
- The Infomap algorithm tends to highlight one large community, which can be used as a starting point for deeper research;
- Similar to the Infomap algorithm, GANXiS (both pure and overlapping versions) distinguishes one large society, but this set is smaller than for Infomap;
- The generalized K-means algorithm does not take into account the lack of connections between isolates, which is the reason for the unification of almost all small discussions (2-3-4 participants) into one large society;
- Directed Louvain shows good results. However, it was not able to get ahead in any metric, steadily holding on to the second or third place;
- Despite it only partially belonging to the directed graph clustering algorithms, Leiden (both the one that uses modularity and the one that uses the clique percolation method) shows good results, being ahead of the directed Louvain algorithm in almost every point. Its peculiarity is that it highlights the communities of average size and/or close to each other in size;
- DLPA allocates almost twice as many communities as other algorithms, all but the largest do not differ much in size;
- OSLOM identifies a large number of communities, which, moreover, can strongly overlap.
- Infomap showed a more even selection of communities than on the Biryulevo dataset, nevertheless retaining the emphasis on the largest of them;
- GANXiS continued to highlight one large community, and the emphasis on this community grew;
- As in the case of the Biryulevo dataset, generalized K-means has combined minor discussions into one large group;
- Directed Louvain performed well again, continuing to hold on to second or third place;
- Unlike for the first dataset, the sizes of communities after the Leiden algorithm differ depending on the metric (CPM or modularity). The first option showed a large number of fairly small communities, while the second showed a smaller number of moderately large communities;
- DLPA on the Cologne dataset showed the risk of going into “overflow” of one community, not giving any other one chances to grow while, again, having twice as many communities as other algorithms;
- OSLOM distinguished communities of not the best quality even in comparison with GANXiS.
- Infomap once again got four best metrics results, receiving, though, one large community;
- GANXiS kept the trend shown on small and medium datasets;
- The generalized K-Means algorithm has shown its inapplicability to sufficiently perform on large graphs, since both the computational cost and the memory cost made it impossible to use this algorithm;
- Directed Louvain retains its position relative to other algorithms;
- The Leiden algorithm kept the trend shown on the medium-sized dataset;
- DLPA identified more equal-sized communities than on the medium dataset, which demonstrates variability of the algorithm performance depending on the size of initial data;
- OSLOM got results worse than on medium dataset on every, except metric (8).
- Infomap showed a better division into same-size communities, retaining the emphasis on the largest of them;
- It revealed that the GANXiS algorithm is not applicable to extra-large networks;
- As in the case of the large dataset, generalized K-means is not applicable to the networks of this size;
- The directed Louvain continued to evenly allocate communities;
- Leiden shown dramatically grown difference between the size of CPM and modularity metric-based communities;
- DLPA on an extra-large dataset has allocated a lot of small, smaller than before, communities;
- OSLOM has got even worse results than on the large dataset on every metric.
6. Discussion and Conclusions
- How does the structure of hidden communities relate to the expected social, cultural, and/or political cleavages in the discussion?
- How can algorithmic detection of hidden communities come closer to detecting communities of views, as linked to communities of formal connections?
- Conduct semantic analysis of the quality of the results obtained using experts or NLP methods;
- Expand the work results by adding agglomerative clustering and Markov stopping moment for optimal clustering [10].
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
DirLouv | Directed Louvain |
GKM | Generalized K-means |
LeidMod | Leiden modularity |
LeidCPM | Leiden CPM |
GANXiSd | SLPA disjoint |
GANXiSo | SLPA overlapping |
DirMod | Directed modularity |
ClusCoe | Clustering coefficient |
NEDind | NEDindex |
Conduc | Conduction |
Contrac | Contraction |
Expans | Expansion |
Bir | the Biryulevo case |
Col | the Cologne case |
Fer | the Ferguson case |
ChE | the Charlie Hebdo case |
Appendix A. Summary Tables
DirMod | ClusCoe | ComFit | NEDind | Conduc | Contrac | Expans | ||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | Bir | Col | Fer | ChE | |
DLPA | 0.486 | - | - | - | 0.056 | 0.07 | 0.085 | 0.027 | 1.373 | 2.299 | 1.522 | 1.375 | 0.605 | 0.723 | 0.64 | 0.672 | 0.625 | 0.521 | 0.599 | 0.514 | 0.849 | 0.815 | 0.847 | 0.764 | 2.361 | 1.378 | 2.735 | 1.55 |
DirLouv | 0.574 | - | - | - | 0.039 | 0.028 | 0.027 | 0.012 | 6.548 | 7.753 | 6.318 | 4.395 | 0.793 | 0.917 | 0.949 | 0.953 | 0.286 | 0.234 | 0.208 | 0.163 | 0.766 | 0.691 | 0.666 | 0.604 | 0.715 | 0.459 | 0.371 | 0.256 |
Infomap | 0.088 | - | - | - | 0.034 | 0.024 | 0.026 | 0.012 | 7.877 | 8.468 | 6.476 | 4.381 | 0.94 | 0.949 | 0.959 | 0.942 | 0.209 | 0.201 | 0.199 | 0.166 | 0.685 | 0.653 | 0.655 | 0.608 | 0.375 | 0.344 | 0.34 | 0.268 |
OSLOM | - | - | - | - | 0.035 | 0.034 | 0.034 | 0.015 | 2.479 | 3.38 | 2.426 | 1.73 | 0.501 | 0.495 | 0.493 | 0.479 | 0.967 | 0.963 | 0.964 | 0.979 | 0.087 | 0.098 | 0.096 | 0.054 | 1.403 | 1.249 | 1.305 | 1.239 |
GKM | 0.39 | - | - | - | 0.0099 | 0.013 | - | - | 7.956 | 28.498 | - | - | 0.423 | 0.422 | - | - | 0.95 | 0.934 | - | - | 0.19 | 0.209 | - | - | 2.161 | 2.6 | - | - |
LeidMod | 0.584 | - | - | - | 0.039 | 0.028 | 0.027 | 0.011 | 6.686 | 7.764 | 6.244 | 4.408 | 0.806 | 0.916 | 0.945 | 0.955 | 0.278 | 0.233 | 0.212 | 0.162 | 0.761 | 0.693 | 0.67 | 0.603 | 0.686 | 0.459 | 0.38 | 0.253 |
LeidCPM | 0.571 | - | - | - | 0.04 | 0.033 | 0.035 | 0.016 | 5.052 | 4.379 | 4.494 | 3.351 | 0.676 | 0.689 | 0.74 | 0.758 | 0.365 | 0.397 | 0.345 | 0.276 | 0.828 | 0.793 | 0.782 | 0.713 | 1.001 | 1.030 | 1.062 | 0.663 |
GANXiSd | 0.406 | - | - | - | 0.04 | 0.031 | 0.031 | - | 2.538 | 3.057 | 3.303 | - | 0.661 | 0.693 | 0.772 | - | 0.526 | 0.501 | 0.422 | - | 0.841 | 0.807 | 0.773 | - | 1.681 | 1.798 | 1.26 | - |
GANXiSo | - | - | - | - | 0.049 | 0.042 | 0.034 | - | 3.295 | 3.551 | 4.064 | - | 0.643 | 0.67 | 0.799 | - | 0.55 | 0.526 | 0.397 | - | 0.848 | 0.829 | 0.756 | - | 2.282 | 2.290 | 1.294 | - |
Algorithm | Case | Count | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
---|---|---|---|---|---|---|---|---|---|
DLPA | Bir | 1159 | 741 | 205 | 171 | 115 | 111 | 108 | 105 |
Col | 2479 | 20,669 | 425 | 277 | 249 | 198 | 173 | 168 | |
Fer | 18,389 | 3059 | 1224 | 1044 | 913 | 877 | 708 | 662 | |
ChE | 84,409 | 1244 | 882 | 726 | 662 | 576 | 529 | 488 | |
DirLouv | Bir | 243 | 691 | 583 | 494 | 486 | 463 | 459 | 381 |
Col | 735 | 4356 | 4105 | 4091 | 3604 | 3447 | 1933 | 1886 | |
Fer | 4429 | 16,029 | 13,916 | 13,391 | 12,936 | 8827 | 4334 | 4053 | |
ChE | 26,415 | 68,217 | 45,678 | 44,734 | 40,369 | 26,807 | 22,583 | 18,774 | |
Infomap | Bir | 202 | 9276 | 232 | 86 | 41 | 33 | 28 | 26 |
Col | 673 | 27,016 | 3726 | 1145 | 590 | 465 | 445 | 338 | |
Fer | 4321 | 106,521 | 15,617 | 1651 | 1012 | 506 | 409 | 353 | |
ChE | 26,497 | 164,839 | 33,635 | 32,111 | 25,038 | 20,098 | 17,226 | 14,033 | |
OSLOM | Bir | 642 | 2637 | 1743 | 1425 | 847 | 763 | 737 | 619 |
Col | 1690 | 10,823 | 7336 | 3058 | 1881 | 1519 | 1466 | 921 | |
Fer | 11,546 | 16,417 | 15,372 | 10,438 | 8874 | 6587 | 4081 | 3074 | |
ChE | 67,296 | 34,960 | 27,735 | 26,061 | 17,319 | 10,912 | 7614 | 6962 | |
GKM | Bir | 200 | 3300 | 2815 | 1812 | 815 | 496 | 313 | 90 |
Col | 200 | 11,474 | 7737 | 4353 | 2971 | 2947 | 1859 | 1119 | |
Fer | - | - | - | - | - | - | - | - | |
ChE | - | - | - | - | - | - | - | - | |
LeidMod | Bir | 238 | 716 | 464 | 458 | 451 | 416 | 388 | 356 |
Col | 734 | 4453 | 3882 | 3540 | 3212 | 3181 | 2686 | 1739 | |
Fer | 4481 | 15,804 | 12,715 | 12,212 | 11,385 | 9162 | 4355 | 4260 | |
ChE | 26,335 | 56,201 | 39,371 | 33,976 | 27,120 | 25,745 | 20,270 | 17,705 | |
LeidCPM | Bir | 315 | 627 | 613 | 226 | 226 | 214 | 208 | 208 |
Col | 1221 | 2395 | 672 | 625 | 414 | 405 | 362 | 355 | |
Fer | 6226 | 1001 | 729 | 704 | 648 | 572 | 554 | 531 | |
ChE | 34,647 | 949 | 719 | 626 | 607 | 565 | 531 | 521 | |
GANXiSd | Bir | 627 | 4881 | 190 | 146 | 129 | 113 | 103 | 101 |
Col | 1864 | 24,911 | 128 | 110 | 108 | 94 | 89 | 80 | |
Fer | 8472 | 98,344 | 211 | 184 | 159 | 104 | 100 | 97 | |
ChE | - | - | - | - | - | - | - | - | |
GANXiSo | Bir | 647 | 4956 | 638 | 255 | 166 | 146 | 139 | 83 |
Col | 1964 | 25,365 | 458 | 210 | 162 | 128 | 110 | 83 | |
Fer | 7489 | 111,798 | 184 | 159 | 155 | 102 | 95 | 84 | |
ChE | - | - | - | - | - | - | - | - |
References
- Bruns, A.; Burgess, J. The use of Twitter hashtags in the formation of ad hoc publics. In Proceedings 6th European Consortium for Political Research (ECPR) General Conference 2011; Bruns, A., De Wilde, P., Eds.; The European Consortium for Political Research (ECPR): Colchester, UK, 2011; pp. 1–9. [Google Scholar]
- Bruns, A.; Burgess, J. Twitter hashtags from ad hoc to calculated publics. In Hashtag Publics: The Power and Politics of Discursive Networks [Digital Formations, Volume 103]; Rambukkana, N., Ed.; Peter Lang Publishing: New York, NY, USA, 2015; pp. 13–27. ISBN 978-1-4331-2899-8. [Google Scholar]
- Perliger, A.; Pedahzur, A. Social Network Analysis in the Study of Terrorism and Political Violence. PS Political Sci. Politics 2011, 44, 45–50. [Google Scholar] [CrossRef] [Green Version]
- Wang, S.; Gong, M.; Liu, W.; Wu, Y. Preventing epidemic spreading in networks by community detection and memetic algorithm. Appl. Soft Comput. 2020, 89, 106118. [Google Scholar] [CrossRef]
- Van Lierde, H.; Delvenne, J.-C.; Van Dooren, P.; Saerens, M. Spectral Clustering Algorithms for Directed Graphs. Available online: https://scholar.google.com/citations?view_op=view_citation&hl=fr&user=5VNLlR0AAAAJ&citation_for_view=5VNLlR0AAAAJ:u5HHmVD_uO8C (accessed on 10 October 2021).
- George, R.; Shujaee, K.; Kerwat, M.; Felfli, Z.; Gelenbe, D.; Ukuwu, K. A Comparative Evaluation of Community Detection Algorithms in Social Networks. Procedia Comput. Sci. 2020, 171, 1157–1165. [Google Scholar] [CrossRef]
- Ester, M.; Kriegel, H.-P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD’96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD’96), Portland, OR, USA, 2–4 August 1996; AAAI Press: Menlo Park, CA, USA, 1996; pp. 226–231. [Google Scholar] [CrossRef]
- Kriegel, H.-P.; Kröger, P.; Sander, J.; Zimek, A. Density-based clustering. WIREs Data Min. Knowl. Discov. 2011, 1, 231–240. [Google Scholar] [CrossRef]
- Lloyd, S.P. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
- Bodrunova, S.S.; Orekhov, A.V.; Blekanov, I.S.; Lyudkevich, N.S.; Tarasov, N.A. Topic Detection Based on Sentence Embeddings and Agglomerative Clustering with Markov Moment. Future Internet 2020, 12, 144. [Google Scholar] [CrossRef]
- Cauteruccio, F.; Corradini, E.; Terracina, G.; Ursino, D.; Virgili, L. Investigating Reddit to detect subreddit and author stereotypes and to evaluate author assortativity. J. Inf. Sci. 2020, 016555152097986. [Google Scholar] [CrossRef]
- Rosvall, M.; Axelsson, D.; Bergstrom, C.T. The map equation. Eur. Phys. J. Spéc. Top. 2009, 178, 13–23. [Google Scholar] [CrossRef]
- Blondel, V.D.; Guillaume, J.-L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef] [Green Version]
- Raghavan, U.N.; Albert, R.; Kumara, S. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 2007, 76, 036106. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Agreste, S.; De Meo, P.; Fiumara, G.; Piccione, G.; Piccolo, S.; Rosaci, D.; Sarne, G.M.L.; Vasilakos, A.V. An Empirical Comparison of Algorithms to Find Communities in Directed Graphs and Their Application in Web Data Analytics. IEEE Trans. Big Data 2017, 3, 289–306. [Google Scholar] [CrossRef] [Green Version]
- Deng, X.; Zhai, J.; Lv, T.; Yin, L. Efficient Vector Influence Clustering Coefficient Based Directed Community Detection Method. IEEE Access 2017, 5, 17106–17116. [Google Scholar] [CrossRef]
- Lancichinetti, A.; Radicchi, F.; Ramasco, J.J.; Fortunato, S. Finding Statistically Significant Communities in Networks. PLoS ONE 2011, 6, e18961. [Google Scholar] [CrossRef]
- Amati, G.; Angelini, S.; Cruciani, A.; Fusco, G.; Gaudino, G.; Pasquini, D.; Vocca, P. Topic Modeling by Community Detection Algorithms. In Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks, Virtual Space, 30 August–2 September 2021; Association for Computing Machinery: New York, NY, USA, 2021; pp. 15–20. [Google Scholar] [CrossRef]
- Yu-Liang, L.; Jie, T.; Jie, T.; Hao, G.; Yu, W. Infomap Based Community Detection in Weibo Following Graph. In Proceedings of the 2012 Second International Conference on Instrumentation, Measurement, Computer, Communication and Control, Harbin, China, 8–10 December 2012; IEEE Computer Society: Washington, DC, USA, 2012; pp. 1220–1222. [Google Scholar] [CrossRef]
- Mothe, J.; Mkhitaryan, K.; Haroutunian, M. Community Detection: Comparison of State of the Art Algorithms. In Proceedings of the 2017 Computer Science and Information Technologies (CSIT), Yerevan, Armenia, 25–29 September 2017; pp. 125–129. [Google Scholar] [CrossRef]
- Deitrick, W.; Hu, W. Mutually Enhancing Community Detection and Sentiment Analysis on Twitter Networks. J. Data Anal. Inf. Process. 2013, 01, 19–29. [Google Scholar] [CrossRef] [Green Version]
- Chen, H.; Yin, H.; Li, X.; Wang, M.; Chen, W.; Chen, T. People Opinion Topic Model: Opinion Based User Clustering in Social Networks. In Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, 3–7 April 2017; International World Wide Web Conferences Steering Committee: Geneva, Switzerland; pp. 1353–1359. [Google Scholar] [CrossRef] [Green Version]
- Xie, J.; Szymanski, B.K.; Liu, X. SLPA: Uncovering Overlapping Communities in Social Networks via a Speaker-Listener Interaction Dynamic Process. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops, Vancouver, BC, Canada, 11 December 2011; pp. 344–349. [Google Scholar] [CrossRef] [Green Version]
- Bodrunova, S.S.; Blekanov, I.; Smoliarova, A.; Litvinenko, A. Beyond Left and Right: Real-World Political Polarization in Twitter Discussions on Inter-Ethnic Conflicts. Media Commun. 2019, 7, 119–132. [Google Scholar] [CrossRef] [Green Version]
- Bodrunova, S.S.; Blekanov, I.S.; Maksimov, A. Measuring Influencers in Twitter Ad-Hoc Discussions: Active Users vs. Internal Networks in the Discourse on Biryuliovo Bashings in 2013. In Proceedings of the 2016 IEEE Artificial Intelligence and Natural Language Conference (AINL), St. Petersburg, Russia, 10–12 November 2016; pp. 1–10. [Google Scholar]
- Dugué, N.; Perez, A. Directed Louvain: Maximizing Modularity in Directed Networks. Ph.D. Thesis, Université d’Orléans, Orléans, France, 2015. [Google Scholar] [CrossRef]
- Traag, V.A.; Waltman, L.; Van Eck, N.J. From Louvain to Leiden: Guaranteeing well-connected communities. Sci. Rep. 2019, 9, 5233. [Google Scholar] [CrossRef]
- Li, X. Directed LPA: Propagating labels in directed networks. Phys. Lett. A 2018, 383, 732–737. [Google Scholar] [CrossRef]
- Hajij, M.; Said, E.; Todd, R. Generalized K-means for Metric Space Clustering Using PageRank. In Computer Graphics and Visual Computing (CGVC); The Eurographics Association: Goslar, Germany, 2020. [Google Scholar]
- Rahman, M.K. NEDindex: A New Metric for Community Structure in Networks. In Proceedings of the 2015 18th International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 21–23 December 2015; pp. 76–81. [Google Scholar] [CrossRef] [Green Version]
- Leicht, E.A.; Newman, M.E.J. Community Structure in Directed Networks. Phys. Rev. Lett. 2008, 100, 118703. [Google Scholar] [CrossRef] [PubMed]
- Newman, M.E.J. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 2006, 103, 8577–8582. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Fagiolo, G. Clustering in complex directed networks. Phys. Rev. E 2007, 76, 026107. [Google Scholar] [CrossRef] [Green Version]
- Chen, M.; Nguyen, T.; Szymanski, B. On Measuring the Quality of a Network Community Structure. In Proceedings of the 2013 International Conference on Social Computing, Alexandria, VA, USA, 1 September 2013; pp. 122–127. [Google Scholar] [CrossRef] [Green Version]
- Kaur, S.; Singh, S.; Kaushal, S.; Sangaiah, A. Comparative Analysis of Quality Metrics for Community Detection in Social Networks Using Genetic Algorithm. Neural Netw. World 2016, 26, 625–641. [Google Scholar] [CrossRef] [Green Version]
- Bodrunova, S.S.; Litvinenko, A.A.; Blekanov, I.S. Please Follow Us: Media Roles in Twitter Discussions in the United States, Germany, France, and Russia. Journal. Pract. 2018, 12, 177–203. [Google Scholar] [CrossRef]
- Bodrunova, S.S.; Blekanov, I.S. Power Laws in Ad Hoc Conflictual Discussions on Twitter. In Digital Transformation and Global Society. DTGS 2018. Communications in Computer and Information Science; Alexandrov, D., Boukhanovsky, A., Chugunov, A., Kabanov, Y., Koltsova, O., Eds.; Springer: Cham, Switzerland, 2018; Volume 859, pp. 67–82. [Google Scholar] [CrossRef]
DirMod | ClusCoe | ComFit | NEDind | Conduc | Contrac | Expans | |
---|---|---|---|---|---|---|---|
DLPA | 0.486 | 0.056 | 1.373 | 0.605 | 0.625 | 0.849 | 2.361 |
DirLouv | 0.574 | 0.039 | 6.548 | 0.793 | 0.286 | 0.766 | 0.715 |
Infomap | 0.088 | 0.034 | 7.877 | 0.940 | 0.209 | 0.685 | 0.375 |
OSLOM | - | 0.035 | 2.479 | 0.501 | 0.967 | 0.087 | 1.403 |
GKM | 0.390 | 0.0099 | 7.956 | 0.423 | 0.950 | 0.190 | 2.161 |
LeidMod | 0.584 | 0.039 | 6.686 | 0.806 | 0.278 | 0.761 | 0.686 |
LeidCPM | 0.571 | 0.040 | 5.052 | 0.676 | 0.365 | 0.828 | 1.001 |
GANXiSd | 0.406 | 0.040 | 2.538 | 0.661 | 0.526 | 0.841 | 1.681 |
GANXiSo | - | 0.049 | 3.295 | 0.643 | 0.550 | 0.848 | 2.282 |
Count | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
---|---|---|---|---|---|---|---|---|
DLPA | 1159 | 741 | 205 | 171 | 115 | 111 | 108 | 105 |
DirLouv | 243 | 691 | 583 | 494 | 486 | 463 | 459 | 381 |
Infomap | 202 | 9276 | 232 | 86 | 41 | 33 | 28 | 26 |
OSLOM | 642 | 2637 | 1743 | 1425 | 847 | 763 | 737 | 619 |
GKM | 200 | 3300 | 2815 | 1812 | 815 | 496 | 313 | 90 |
LeidMod | 238 | 716 | 464 | 458 | 451 | 416 | 388 | 356 |
LeidCPM | 315 | 627 | 613 | 226 | 226 | 214 | 208 | 208 |
GANXiSd | 627 | 4881 | 190 | 146 | 129 | 113 | 103 | 101 |
GANXiSo | 647 | 4956 | 638 | 255 | 166 | 146 | 139 | 83 |
DirMod | ClusCoe | ComFit | NEDind | Conduc | Contrac | Expans | |
---|---|---|---|---|---|---|---|
DLPA | - | 0.070 | 2.299 | 0.723 | 0.521 | 0.815 | 1.378 |
DirLouv | - | 0.028 | 7.753 | 0.917 | 0.234 | 0.691 | 0.459 |
Infomap | - | 0.024 | 8.468 | 0.949 | 0.201 | 0.653 | 0.344 |
OSLOM | - | 0.034 | 3.380 | 0.495 | 0.963 | 0.098 | 1.249 |
GKM | - | 0.013 | 28.498 | 0.422 | 0.934 | 0.209 | 2.600 |
LeidMod | - | 0.028 | 7.764 | 0.916 | 0.233 | 0.693 | 0.459 |
LeidCPM | - | 0.033 | 4.379 | 0.689 | 0.397 | 0.793 | 1.030 |
GANXiSd | - | 0.031 | 3.057 | 0.693 | 0.501 | 0.807 | 1.798 |
GANXiSo | - | 0.042 | 3.551 | 0.670 | 0.526 | 0.829 | 2.290 |
Count | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
---|---|---|---|---|---|---|---|---|
DLPA | 2479 | 20,669 | 425 | 277 | 249 | 198 | 173 | 168 |
DirLouv | 735 | 4356 | 4105 | 4091 | 3604 | 3447 | 1933 | 1886 |
Infomap | 673 | 27,016 | 3726 | 1145 | 590 | 465 | 445 | 338 |
OSLOM | 1690 | 10,823 | 7336 | 3058 | 1881 | 1519 | 1466 | 921 |
GKM | 200 | 11,474 | 7737 | 4353 | 2971 | 2947 | 1859 | 1119 |
LeidMod | 734 | 4453 | 3882 | 3540 | 3212 | 3181 | 2686 | 1739 |
LeidCPM | 1221 | 2395 | 672 | 625 | 414 | 405 | 362 | 355 |
GANXiSd | 1864 | 24,911 | 128 | 110 | 108 | 94 | 89 | 80 |
GANXiSo | 1964 | 25365 | 458 | 210 | 162 | 128 | 110 | 83 |
DirMod | ClusCoe | ComFit | NEDind | Conduc | Contrac | Expans | |
---|---|---|---|---|---|---|---|
DLPA | - | 0.085 | 1.522 | 0.640 | 0.599 | 0.847 | 2.735 |
DirLouv | - | 0.027 | 6.318 | 0.949 | 0.208 | 0.666 | 0.371 |
Infomap | - | 0.026 | 6.476 | 0.959 | 0.199 | 0.655 | 0.340 |
OSLOM | - | 0.034 | 2.426 | 0.493 | 0.964 | 0.096 | 1.305 |
GKM | - | - | - | - | - | - | - |
LeidMod | - | 0.027 | 6.244 | 0.945 | 0.212 | 0.670 | 0.380 |
LeidCPM | - | 0.035 | 4.494 | 0.740 | 0.345 | 0.782 | 1.062 |
GANXiSd | - | 0.031 | 3.303 | 0.772 | 0.422 | 0.773 | 1.260 |
GANXiSo | - | 0.034 | 4.064 | 0.799 | 0.397 | 0.756 | 1.294 |
Number | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
---|---|---|---|---|---|---|---|---|
DLPA | 18,389 | 3059 | 1224 | 1044 | 913 | 877 | 708 | 662 |
DirLouv | 4429 | 16,029 | 13,916 | 13,391 | 12,936 | 8827 | 4334 | 4053 |
Infomap | 4321 | 106,521 | 15,617 | 1651 | 1012 | 506 | 409 | 353 |
OSLOM | 11,546 | 16,417 | 15,372 | 10,438 | 8874 | 6587 | 4081 | 3074 |
GKM | - | - | - | - | - | - | - | - |
LeidMod | 4481 | 15,804 | 12,715 | 12,212 | 11,385 | 9162 | 4355 | 4260 |
LeidCPM | 6226 | 1001 | 729 | 704 | 648 | 572 | 554 | 531 |
GANXiSd | 8472 | 98,344 | 211 | 184 | 159 | 104 | 100 | 97 |
GANXiSo | 7489 | 111,798 | 184 | 159 | 155 | 102 | 95 | 84 |
DirMod | ClusCoe | ComFit | NEDind | Conduc | Contrac | Expans | |
---|---|---|---|---|---|---|---|
DLPA | - | 0.027 | 1.375 | 0.672 | 0.514 | 0.764 | 1.550 |
DirLouv | - | 0.012 | 4.395 | 0.953 | 0.163 | 0.604 | 0.256 |
Infomap | - | 0.012 | 4.381 | 0.942 | 0.166 | 0.608 | 0.268 |
OSLOM | - | 0.015 | 1.730 | 0.479 | 0.979 | 0.054 | 1.239 |
GKM | - | - | - | - | - | - | - |
LeidMod | - | 0.011 | 4.408 | 0.955 | 0.162 | 0.603 | 0.253 |
LeidCPM | - | 0.016 | 3.351 | 0.758 | 0.276 | 0.713 | 0.663 |
GANXiSd | - | - | - | - | - | - | - |
GANXiSo | - | - | - | - | - | - | - |
Number | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
---|---|---|---|---|---|---|---|---|
DLPA | 84,409 | 1244 | 882 | 726 | 662 | 576 | 529 | 488 |
DirLouv | 26,415 | 68,217 | 45,678 | 44,734 | 40,369 | 26,807 | 22,583 | 18,774 |
Infomap | 26,497 | 164,839 | 33,635 | 32,111 | 25,038 | 20,098 | 17,226 | 14,033 |
OSLOM | 67,296 | 34,960 | 27,735 | 26,061 | 17,319 | 10,912 | 7614 | 6962 |
GKM | - | - | - | - | - | - | - | - |
LeidMod | 26,335 | 56,201 | 39,371 | 33,976 | 27,120 | 25,745 | 20,270 | 17,705 |
LeidCPM | 34,647 | 949 | 719 | 626 | 607 | 565 | 531 | 521 |
GANXiSd | - | - | - | - | - | - | - | - |
GANXiSo | - | - | - | - | - | - | - | - |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Blekanov, I.; Bodrunova, S.S.; Akhmetov, A. Detection of Hidden Communities in Twitter Discussions of Varying Volumes. Future Internet 2021, 13, 295. https://doi.org/10.3390/fi13110295
Blekanov I, Bodrunova SS, Akhmetov A. Detection of Hidden Communities in Twitter Discussions of Varying Volumes. Future Internet. 2021; 13(11):295. https://doi.org/10.3390/fi13110295
Chicago/Turabian StyleBlekanov, Ivan, Svetlana S. Bodrunova, and Askar Akhmetov. 2021. "Detection of Hidden Communities in Twitter Discussions of Varying Volumes" Future Internet 13, no. 11: 295. https://doi.org/10.3390/fi13110295
APA StyleBlekanov, I., Bodrunova, S. S., & Akhmetov, A. (2021). Detection of Hidden Communities in Twitter Discussions of Varying Volumes. Future Internet, 13(11), 295. https://doi.org/10.3390/fi13110295