Next Article in Journal
Effects of the Number of Classes and Pressure Map Resolution on Fine-Grained In-Bed Posture Classification
Next Article in Special Issue
Cooperation Dynamic through Individualistic Indirect Reciprocity Mechanism in a Multi-Dynamic Model
Previous Article in Journal
Accuracy Analysis on Design of Stochastic Computing in Arithmetic Components and Combinational Circuit
Previous Article in Special Issue
A Novel Methodology Analyzing the Influence of Micro-Stresses on Human-Centric Environments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Building Political Hashtag Communities: A Multiplex Network Analysis of U.S. Senators on Twitter during the 2022 Midterm Elections

1
Center for the Study of Digital Society, North Dakota State University, Fargo, ND 58102, USA
2
Industrial and Manufacturing Engineering, North Dakota State University, Fargo, ND 58102, USA
*
Author to whom correspondence should be addressed.
Computation 2023, 11(12), 238; https://doi.org/10.3390/computation11120238
Submission received: 20 October 2023 / Revised: 19 November 2023 / Accepted: 20 November 2023 / Published: 1 December 2023
(This article belongs to the Special Issue Computational Social Science and Complex Systems)

Abstract

:
This study examines how U.S. senators strategically used hashtags to create political communities on Twitter during the 2022 Midterm Elections. We propose a way to model topic-based implicit interactions among Twitter users and introduce the concept of Building Political Hashtag Communities (BPHC). Using multiplex network analysis, we provide a comprehensive view of elites’ behavior. Through AI-driven topic modeling on real-world data, we observe that, at a general level, Democrats heavily rely on BPHC. Yet, when disaggregating the network across layers, this trend does not uniformly persist. Specifically, while Republicans engage more intensively in BPHC discussions related to immigration, Democrats heavily rely on BPHC in topics related to identity and women. However, only a select group of Democratic actors engage in BPHC for topics on labor and the environment—domains where Republicans scarcely, if at all, participate in BPHC efforts. This research contributes to the understanding of digital political communication, offering new insights into echo chamber dynamics and the role of politicians in polarization.

1. Introduction

The U.S. Senate stands as a polarized institution, marked by a party divide that fosters an environment of divisive political rhetoric. Daily, Democratic and Republican senators engage in contentious debates across a broad spectrum of issues. Concurrently, senators also utilize their media presence and direct communications with constituents as powerful platforms to launch political attacks against their opponents. While press releases have conventionally served as a means to reach a broader audience, the advent of social media, particularly Twitter, has ushered in an alternative communication method that circumvents traditional media channels [1,2,3,4,5].
In the midst of the ongoing and widespread political conflicts on Twitter, notable distinctions become evident in the level of participation exhibited by politicians. Some political figures are seen engaging in multiple subjects concurrently, while others concentrate their efforts on a single issue, and a few choose to remain disengaged from the politically charged discourse altogether. These dynamics have garnered significant attention in recent times. Scholars have shifted their focus from traditional sources of information to social media platforms (i.e., [4,6,7,8,9,10,11]). In line with this empirical trend, we undertake a specific analysis of the public Twitter activity of members of the U.S. Senate in the context of the 2022 midterm election.
Our work provides a comprehensive analysis of interactions among U.S. Senators by proposing an innovative way to model communication strategies among elites. First, instead of examining only the direct interactions on Twitter (i.e., follows, retweets, replies, or mentions), we delve deeper into the patterns of reciprocal hashtag usage to uncover another potential communication strategy among political figures. Second, contrary to a common approach that models elite Twitter interactions via monoplex networks, our study harnesses the advances of multiplex networks, a technique that political scientists have explored less. In our multiplex network construction, layers are defined by the reciprocal use of specific hashtags between pairs of accounts. Here, we introduce the concept of Building Political Hashtag Communities (BPHC). In the BPHC framework, the creation and maintenance of political communities on social media are not merely based on overt interactions like mentions or retweets, but more subtly through reciprocal and strategic hashtag use. The underlying premise of BPHC is that reciprocal involvement in hashtags is not coincidental but a deliberate communication strategy. By consistently echoing similar hashtags, politicians not only assert their standpoints but also subtly cultivate virtual communities around shared political concerns or values. This strategic echo, we argue, can critically serve to limit the information variety in their followers’ feeds, thereby reinforcing specific narratives or viewpoints. The continuous reciprocal involvement in these hashtags strengthens this echo chamber effect, establishing more cohesive and ideologically aligned online communities.
Do politicians strategically use hashtags to build their own communities? To address this question, we test our models on a real-world dataset capturing the Twitter interactions of American Senators during the midterm elections of 2022. Adding another layer of innovation, we introduce a novel topic modeling method, leveraging the capabilities of OpenAI’s Chat GPT-4. This approach empowers us to categorize a vast array of hashtags using both supervised and unsupervised techniques, effectively overcoming constraints seen in previous studies, such as sample restrictions or biased categorizations. Aligning with the current literature (i.e., [4,6,7,8,9,10,11]) on congressional tweets, we find that certain subgroups among senators (Democrats, incumbents, females, and representatives from the southern region, as opposed to Republicans, candidates, males, and those from the northern region) cultivate deeper connections and foster more dynamic lines of communication to varying degrees. However, a more nuanced examination using advanced multiplex community analysis offers fresh insights into elite behavior on Twitter. Our multiplex analysis of Democratic and Republican networks shows that, at a general level, Democrats heavily rely on BPHC. Yet, when disaggregating the network across layers, this trend does not uniformly persist. Specifically, while Republicans engage more intensively in BPHC discussions related to immigration, Democrats heavily rely on BPHC in topics related to identity and women. However, only a select group of Democratic actors engage in BPHC for topics on labor and the environment—domains where Republicans scarcely, if at all, participate in BPHC efforts. These nuances underscore the multifaceted nature of BPHC strategies employed by different political actors.
This article makes important contributions. First, the extant literature predominantly employs monoplex network models, offering a limited perspective on the multifaceted interactions occurring on these platforms. To the best of our knowledge, this is the first multiplex analysis to understand the strategic hashtag involvement of politicians. Second, relying on AI-driven hashtag modeling to construct network layers, our study provides a more nuanced and comprehensive understanding of digital political communication. Finally, our findings also speak to existing research on polarization. Conventional wisdom asserts that elites are the source of mass polarization [12]. Evidence, however, is still rare to depict how they strategically do that. We show how elites contribute to the echo chamber by strategically involving/not involving themselves in specific topics.
Following this introduction, the paper presents a literature review, delineating the theoretical framework and empirical background of the study. Subsequently, we elaborate on the methodology employed for data collection and analysis. The findings section provides a detailed account of the study’s results, followed by a discussion integrating these findings with the existing literature and theoretical framework. The paper concludes with remarks on the study’s implications and suggestions for future research.

2. Related Literature

Crafting and maintaining a positive public image has always been a critical component of a politician’s overall strategy. Prior to the digital age, political communication primarily relied on in-person interactions. However, social media has created new direct (arguably more intimate) channels of communication between politicians and the electorate. New communication strategies (e.g., sharing daily activities [13] or replicating messages across platforms [14]) have given politicians the ability to easily shape their public image [15] and clearly express their positions on diverse political issues [16].
The use of online platforms by politicians, however, has evolved significantly in the last decade (see [13,17,18,19,20]). Interactions between politicians, such as retweets, replies, mentions, and likes, are now more common than ever before, especially since the 2016 election cycle (for a review, see [21]). Today, these platforms are not just communication tools but also serve as informational platforms for local and global content [22]. Politicians commonly use these platforms to influence media coverage of political discussions (see [23]) and promote their policies and ideological stances, either by supporting their colleagues or attacking opposition [4].
These dynamics have gained substantial attention in recent years, as scholars—who are interested in examining the congressional ecosystem—shift their focus from traditional sources like roll call data [24], bill text [25], or dissemination of news on media [26] to social networks [27,28,29,30,31,32]. We follow in this empirical trend by specifically analyzing the public Twitter activity of U.S. Senate members in the context of the 2022 midterm election but offer three additional contributions.
First, recent works show that analyzing only direct interactions (i.e., follows, retweets, or mentions) misses a significant part of how politicians interact on Twitter: hashtags [30,31,33]. The importance of hashtag analysis comes from its ability to reveal underlying patterns in social media use that are not immediately apparent. Hashtag analysis is a powerful tool in social media conversations, providing insights into user interests, behaviors, and community dynamics [34]. Many of Twitter’s interactions unfold in this manner, with conversations defined by hashtags that enable users to engage with each other on particular topics directly (for a comprehensive review on hashtag usage research, see Pilar et al. [35]).
On Twitter, however, the use of a specific hashtag in a tweet does more than merely increase the visibility of that message. Hashtags can go beyond their bookmark function and serve as an implicit form of communication with other Twitter users who are utilizing the same hashtag, even if there is no direct connection between them. Unlike traditional follower-based networks, these hashtag-driven spaces for discourse do not necessitate mutual follower relationships, setting them apart from existing social media dynamics [36].
Recent research convincingly shows how hashtags can serve as symbols of communities, functioning as rallying points and banners where individuals with shared interests and concerns unite. This phenomenon has been termed the “imagined audience” in academic literature [37]. Yang et al. [38], for example, articulate the dual role of hashtags and show how hashtag involvement can capture adoption behavior by users and predict future engagements. It is evident that hashtags could serve as tools that facilitate the formation of conversational platforms where people can engage in discussions and contribute to the cultural interpretation of various subjects like racial justice, gender equality, and health matters [39]. Recent research by Hemphill et al. [30], for instance, identifies distinct “polarizing hashtags” that predict an author’s political party with high accuracy.
Building on this literature, we investigate whether political figures strategically employ the latter function of hashtags to establish their own communities, rather than solely using them as an effective means to increase the visibility of certain figures or merely express support for the ideas encapsulated within the hashtags. This communication strategy, which we termed “Building Political Hashtag Communities" (BPHC), not only provides politicians with a direct means to connect and engage with citizens but also facilitates the organization of their base into distinct online groups through deliberate reciprocal usage of hashtags. Coordination network analysis, for example, has proven helpful for learning about how disinformation campaigns operate (see, for example, [40]); however, the usefulness of these tools for political hashtag community building is still an open question. While politicians can coordinate along multiple dimensions, our study is the first to investigate the coordination of hashtags. In this article, instead of focusing on the detection of hashtags mainly used by politicians, constructing a coordination network by connecting senators if they use the same hashtag, we address coordinated activity classification to provide evidence about whether the activity of senators appears to be part of a BPHC.
Second, in studies examining the implications of politicians’ social networks, a common approach to model Twitter interactions among politicians is to construct a monoplex network based on following/follower relations [27], or networks based on either retweets [28,29] or explicit mentions indicated by the “@” character [32]. Following a smaller, but growing, literature (see [31,33]), we model multiplex networks, an area that political scientists have explored less. Recent progress in the field of multiplex networks has indicated that examining multiple monoplex connections together could reveal new insights that are not apparent when looking at each connection separately. Bonifazi et al. [33], for example, enhance this discussion by applying multilayer network analysis to investigate various social phenomena, including the spread of misinformation and public reaction during different phases of the COVID-19 pandemic. Their findings underscore the power of multilayer network representations in analyzing user interactions and the spread of content across social platforms. Similarly, Hanteer and Rossi [31] introduce a thematic multiplex model to capture the nuanced interactions among Danish politicians during the parliamentary elections of 2015. By using a multiplex network model, their study unveils distinct community dynamics based on shared hashtags that traditional interaction models do not reveal, highlighting the strategic use of hashtags in shaping political narratives. We build upon this literature and expand its scope to examine how politicians may strategically use hashtags to shape the information environment of their political base.
Finally, we also offer a new, relatively more effective layer-building strategy. Prior studies that used hashtags to create multiplex/multilayer networks mainly depended on qualitative analyses to identify politically polarizing hashtags (see [31,33]). This approach not only raised the potential for bias but also restricted the sample of hashtags that researchers could work with. Thanks to recent innovations in generative pre-trained transformers, we introduce a new topic modeling method. Utilizing OpenAI’s Chat GPT-4, this strategy enables the categorization of a vast array of hashtags using both supervised and unsupervised techniques, overcoming previous limitations. To our knowledge, there has been no prior research focused on using AI-driven topic modeling on social media platforms for the purpose of identifying interactions among politicians.

3. Multiplex Networks

Complicated systems do not emerge from isolated networks with connections of identical significance and implications. Instead, they involve layers of interactions that are difficult to comprehend all at once. For example, in transportation networks, diverse modes of transportation can be differentiated (bus, subway, train, etc.). Likewise, in molecular biology and neural networks, interactions can possess varying meanings, emphasizing the necessity of examining these systems using a comprehensive approach that acknowledges the distinctions among different types of interactions. Within social networks, various kinds of social connections can be identified (friends, coworkers, acquaintances, family connections, etc.). The concept of multilayer networks was initially introduced in the realm of social science to describe the various categories of social connections that exist among the nodes within a social network [41]. Multiplex networks, a special form of multilayer networks, are commonly employed in scenarios where a consistent group of nodes is linked by connections that represent distinct kinds of interactions. A multiplex network is denoted as M = ( N , L , V , E ) , where N refers to a set of nodes, L represents a set of layers, and ( V , E ) constitutes a graph. This structure adheres to the condition that for any edge ( n 1 , l 1 , n 2 , l 2 ) E , the layers l 1 and l 2 must be identical. In this study, we adopt a multiplex network representation of U.S. senators where each network layer is formed by a distinct hashtag relationship. Such a representation enables us to obtain macro- to micro-level analysis, such as comparison of layers using network metrics informing about similarity of layers, most sustained relationships through layers, revealing hidden communities accounting for all layers, and analyzing central senators across all layers.

3.1. Network Construction

We construct two main multiplex networks, one in a supervised fashion and the other in an unsupervised fashion. The supervised multiplex network has seven layers and the unsupervised one has six. The first multiplex network is formed in a supervised fashion with the hypothesis that some outstanding hashtags will be more informative to distinguish echo chambers, inter-party relationships, regional differences, and gender preferences. The second multiplex network is constructed in an unsupervised fashion using the capability of GPT4 in topic modeling. Our motivation is to contrast and learn from the two strategies, as the latter one provides a more universal and unbiased approach to topic selection. Layers are formed if any hashtag belonging to a layer is used by pairs of accounts reciprocally. If a hashtag is used by one account but not used by the other one, these accounts are not connected by an edge. Otherwise, an edge incident to these accounts is added to the network. At least one such relationship is required to add an edge to the network. The multiplicity of pairwise hashtags is not counted. Multiplex network constructions are illustrated in Figure 1.

3.2. Sub-Networks

While senator-level networks offer distinct information, investigations also extend to party-level, gender-level, and region-level networks. For the senator-level networks, nodes and edges are subset to retain only the Republican or Democratic party nodes while preserving the existing edges. A similar subsetting approach is employed to obtain regional, gender, and candidate networks.

3.3. Network Metrics

We employ descriptive network analysis metrics for the flattened and multiplex networks. The metrics used for the flattened networks are: density, number of connected components, and average shortest path length. As for multiplex network metrics, we use Jeffrey degree, Pearson degree, and correlations, facilitating a comprehensive examination of both dissimilarity and similarity aspects in the multiplex network. Jeffrey degree is a statistical metric employed to evaluate the correlation between node degrees in distinct layers of a multiplex network. This measure quantifies the ’dissimilarity’ between the joint distribution of degrees in two layers. We can express the dissimilarity using the following formula [42]:
k = 1 K f r k , l 1 log f r k , l 1 f r k , l 2 + k = 1 K f r k , l 2 log f r k , l 2 f r k , l 1
where f r k , l 1 represents the relative frequency of degree value k in layer 1. Higher values of the Jeffrey degree indicate increased dissimilarity between the degrees of nodes in the two layers, offering valuable insights into the structural differences across multiplex network strata. In contrast, Pearson degree measures the ’similarity’ between the degrees of nodes across each pair of layers, calculated through the computation of the Pearson correlation coefficient. We can express degree correlations using the following formula [42]:
[ p l 1 mean ( p l 1 ) ] · [ p l 2 mean ( p l 2 ) ] p l 1 mean ( p l 1 ) · p l 2 mean ( p l 2 )
where p l 1 represents the degree vector of layer 1. Positive values of Pearson degree indicate that nodes with similar degrees are connected (assortativity), while negative values suggest disassortativity. We used R igraph [43] and multinet [44] libraries for computing these metrics.

4. Overview of the Dataset

4.1. Data Collection

Exploring the intricate networks of the U.S. Senate on Twitter, this article delves into the patterns and dynamics exhibited by senators. Data used to construct this network were gathered from a period spanning one month before and one month after the election. Although the roots of polarization might be more easily discerned in the U.S. House, studies indicate that the Senate has progressively become a chamber marked by polarization, mirroring the House. According to American political literature, evidence over the years underscores the transformation of the Senate into a chamber increasingly characterized by polarization, akin to the House [4,5,45,46].
Twitter serves as a particularly valuable arena for examining the networks of the U.S. Senate for two main reasons. First, it is an influential platform that facilitates the dissemination of new information by Senators. Second, the extensive quantity and diversity of tweets from politicians enable an in-depth analysis of their readiness to engage in partisan conflicts. Additionally, Twitter has become a leading channel for political communication among Senators and a stage where partisan battles frequently unfold.
In the context of the 2022 midterm elections, our study aims to assess the extent to which BPHC strategy is followed by different senator networks constructed by partisan alignment, region, incumbency, and gender. This study encompasses each senator’s Twitter activity from 1 October 2022, to 12 December 2022. Our sample comprises 137 observations, encompassing various senatorial types, which provides a comprehensive perspective on different electoral outcomes and statuses. All descriptive details are documented in Table 1. The majority of our sample, 65 senators, served in the preceding Senate term and did not participate in the recent election yet continue their senatorial roles. Overall, 24 Senators from our sample served in the prior term, entered the election, secured a victory, and retained their seats. A subset, constituting 5 Senators, were incumbents who contested the election but faced defeat and subsequently no longer serve in the Senate. Another 11 politicians in our data did not serve in the preceding Senate term, contested and won the recent election, and are now serving as Senators. Finally, the dataset includes 32 individuals who were not Senators in the prior term, contested the recent election, faced defeat, and thus remained outside the Senate. It is noteworthy that our dataset does not encompass a distinct group of 6 Senators who were incumbents but opted for resignation, and hence no longer hold office. Out of the 137 Twitter users in our sample, we were unable to obtain data for 9 of them for various reasons, such as not having a Twitter account or not tweeting within the specified time frame.
Tweet data was collected using Twitter’s official Academic API, which enabled researchers to access Twitter’s historical data and advanced search features through API v2 (this free API for academics was discontinued by Twitter on 27 March 2023). To interact with the API, we employed the Tweepy [47] package, a user-friendly Python library that streamlines data extraction and parsing. The Tweepy package also features capabilities to handle rate limits imposed by Twitter. We created a Python codebase that saves the API data, in JSON format, into an SQLite database, a lightweight and efficient database management system well suited for small- to medium-sized projects. This allowed us to query our API data, whether in table format or JSON format. We stored fields such as entities, where hashtag data is stored, in JSON columns, while fields like tweet or user IDs were kept in standard SQLite columns. The combination of the Academic API, Tweepy, and SQLite (see Figure 2) enabled us to efficiently collect, manage, query, and analyze the dataset for our research. You can find all the code used in this project in the GitHub repository for our project (https://github.com/harunpirim/multilayer_polarization/ accessed on 21 November 2023).
In our research, we formed a collaboration with the Digital Society Project (http://digitalsocietyproject.org/ accessed on 21 November 2023), renowned for its extensive collection of election-centric data both in the US and globally. This project provided us with datasets containing the Twitter accounts of senatorial candidates. These datasets meticulously catalog the official Twitter handles of the candidates. Notably, we observed that some candidates had multiple official accounts listed. To maintain consistency in our analysis, we consolidated these multiple accounts into a single representation for each candidate. This approach was justified as most candidates predominantly used one main Twitter account during our study’s timeframe.
For senators who were not identified as candidates in the Digital Society Project’s dataset, we sourced their Twitter data from an alternative repository, found at https://github.com/unitedstates/congress-legislators (accessed on 21 November 2023). Leveraging these two data sources allowed us to enrich our tweet dataset with critical metadata elements, including Party Identity, Incumbency, Region, and Sex. These metadata features were instrumental in creating network layers for our analysis. To manage and process this data effectively, we employed the Python Pandas package [48], which enabled us to efficiently process query results from our SQLite database and seamlessly integrate the metadata, facilitating the construction of comprehensive and layered networks based on the tweet data.

4.2. Hashtag Topic Modeling

The process of data construction in our study is illustrated in Figure 2. Our approach involved constructing a network data model grounded in the shared use of hashtags. In this model, a connection, or ’edge,’ was established between two users when they utilized the same hashtag at least once. To extract meaningful insights from these networks, we categorized hashtags into groups based on their semantic content. This categorization was facilitated by the use of ChatGPT-4, an advanced language model known for its proficiency in classical NLP tasks such as named entity recognition, topic modeling, and summarization. Studies indicate that ChatGPT-4 performs on par with or surpasses current state-of-the-art models in these domains [49,50,51,52]. Our application of ChatGPT-4 involved feeding it hashtags, to which it responded by identifying their inherent meanings, thus enabling us to understand the underlying thematic connections among the hashtags used by different political figures. This methodology is pivotal in revealing the nature of political discourse and alignments within the Twitter sphere, linking directly back to the core objectives of our research.
In the hashtag clustering task we are interested in, there is a twist that classical NLP models, like topic modeling, cannot handle. Our goal is to cluster these hashtags according to their semantic relation with the tweets, which requires context information about the hashtags themselves. Training or fine-tuning a specific model like RoBERTa, or other state-of-the-art language models, for this task requires extensive training data, labor-intensive labeling, expertise in deep learning, and fine-tuning language models, which translates into significant budget and time concerns. Instead of taking this route, we crafted an innovative approach using a large language model (LLM), specifically ChatGPT-4, since it is the best performing one among others [53]. Considering its vast training dataset, our hypothesis was that using an LLM would enable us to capture the inherent semantic meanings needed for clustering these hashtags into meaningful groups, which would then be used to create semantic networks. ChatGPT-4 excels when the task requires context awareness with tailored prompting specific to the case [49]. To align with this approach, we used various prompts to achieve the desired results, which was to cluster as many hashtags as possible into groups. Before determining the right prompts for our task, we tested several (see Table A3 in Appendix A). To ensure the accuracy of our results, we carefully reviewed all hashtag clusterings with our expertise in polarization.
During our analysis, we faced a significant challenge with hashtags that had multiple spelling variants, such as ’#AZSen,’ ’#AZSEN,’ and ’#AZsen.’ This variation in spellings was problematic for accurately constructing our network, as it prevented us from linking users who used different variants of the same hashtag. To resolve this, we employed OpenRefine [54], a powerful tool designed for cleaning and transforming messy data. Through OpenRefine, we applied eight different algorithms specifically tailored to identify and unify similar texts. This data cleaning step was crucial; it reduced the total number of distinct hashtags from 1964 to 1660, significantly enhancing the accuracy of our network analysis. For example, prior to cleaning, the three variants of ’#AZSEN’ were treated as separate entities, hindering our ability to establish connections between users employing different versions of this hashtag. After unifying these variants into a single ’#AZSEN’ tag using OpenRefine, we could accurately link users, reflecting a more authentic representation of the online interactions. This process not only corrected mismatches but also enriched our network, increasing the total number of usable hashtags from 376 to 406. This improvement in data quality allowed us to create richer and more precise networks, capturing a more accurate picture of the hashtag-driven interactions among users.
We implemented two distinct methods for clustering hashtags: ’loosely supervised’ and ’unsupervised.’ In the loosely supervised approach, we directed ChatGPT-4 to consider key issues that are central to polarization in US politics. These issues included health care, immigration, climate change and environmental policy, gun control, economic policy, racial and social justice, abortion, and LGBTQ+ rights. This method allowed us to observe how hashtags aligned with these specific political topics. Conversely, for the unsupervised method, we did not provide ChatGPT-4 with any predefined context. Instead, we allowed the model to cluster hashtags based purely on their inherent patterns and relationships, offering insights into natural groupings that emerge from the data itself.
After the initial clustering by ChatGPT-4 in both methods, we conducted a second refinement phase. This step was crucial to further refine and make sense of the clusters, enhancing their relevance and coherence. For example, in this phase, we grouped similar themes like ’Appreciation Veterans’, ’Appreciation Youth’, and ’Awareness Adoption’ into a broader category labeled ’Awareness & Appreciation’. This process of refinement and categorization led to the creation of more meaningful networks. A detailed list of these refined clusters with details can be found in our GitHub repository (https://github.com/harunpirim/multilayer_polarization/tree/main/hashtags accessed on 21 November 2023). The results of our clustering efforts, encompassing both the supervised and unsupervised methods, are summarized in Table 2. This table provides an overview of the hashtag groups and their respective categorizations, illustrating the diverse ways in which political discourse manifests on Twitter.

5. Results

We begin our analysis by presenting the descriptive results of our analysis, comparing networks that utilize different modeling strategies (i.e., supervised and unsupervised) across various analytical levels (i.e., Party ID, Incumbency, Sex, and Region).
Figure 3 immediately makes apparent the intricate dynamics of Senate collaborations. (For a detailed breakdown of edge distribution across layers, along with confidence intervals for proportions, see Table A1 and Table A2 in Appendix A.) It reveals that certain groups consistently demonstrate denser BPHC networks than their counterparts. Notably, Democrats, incumbents, females, and representatives from the southern region (as opposed to Republicans, candidates, males, and those from the northern region) display heightened collaboration and interconnectedness across hashtags. These patterns prevail in both supervised and unsupervised networks, with a more pronounced prominence in the former. However, a notable exception is the similarity observed between incumbent and candidate senators within unsupervised networks.
A deeper examination through distinct layers provides further insights. While some layers maintain consistent patterns, others reveal subtle variations that underscore the multidimensional nature of polarization in U.S. politics. Economy-related hashtags in supervised networks and awareness-related hashtags in unsupervised networks seem to bolster in-group communication. Another interesting trend is that topics related to social justice—such as identity, labor, women, and rights—emerge as the primary sources of division and polarization among politicians.
When examining differences across various analytical categories, it becomes evident that the underlying network structures could shape the dynamics within distinct layers. For instance, while Democrats generally have denser connections than Republicans, this pattern is somewhat weakened in the economy layer, reversed in the immigration and state politics layers, and remained the same in other areas. Notably, metrics like density and the number of edges demonstrate that by constructing a multiplex network using supervised topic modeling with a focus on reciprocal hashtag usage, we can predict senators’ party affiliation with a minimum accuracy of 82% in the immigration layer and with accuracy levels ranging from 93% (i.e., Health layer) to a flawless 100% (i.e., Labor layer) across all other topics, excluding the economy.
The pattern for other analytical categories also exhibits interesting shifts. The trend of female senators forming denser networks than their male counterparts is mostly consistent but weakens in the media layer and inverts in the federal/state politics and environment-associated hashtag groups. In a similar vein, while incumbents typically have denser connections than candidates, this trend diminishes in the identity and federal politics layers and reverses in areas such as campaign, state politics, immigration, and women-centric hashtags. Finally, the propensity for southern senators to have denser networks than their northern peers is evident in most layers but diminishes in the campaign layer and flips in the environment, identity, women, and rights layers. It is noteworthy that senators from the South show no involvement in labor-related issues.
Overall, both supervised and unsupervised networks provide invaluable insights. They reinforce our initial hypothesis that elements such as gender, geography, incumbency, and party affiliation have a pronounced impact on the structure of politicians’ online networks. For the subsequent analysis, however, we will focus on the supervised networks among Democrats and Republicans to delve deeper. It is evident that adopting an unsupervised topic modeling strategy leads to a greater number of edges, signifying a more densely connected network than what is observed in supervised networks. This outcome is not so surprising, given that the hashtag sample size in unsupervised modeling is nearly twice as large as that in supervised modeling. However, the advantages of supervised networks are also clear. They offer a more detailed view of polarization dynamics among elites. For example, in networks utilizing supervised modeling, the density among Democrats is five times that of Republicans. In contrast, this ratio is only 1.8 times in models based on unsupervised modeling. Another compelling observation is that while networks based on supervised modeling capture several omitted layers (e.g., labor, women), those relying on unsupervised structures fail to discern such nuanced divisions. We conclude that these qualitative advantages could allow us to delve better into subtle insights into how specific thematic areas contribute to the formation of echo chambers and polarization in US politics.
We begin our multiplex analysis using Jeffrey degree and Pearson degree correlations for pairwise layer comparisons across political parties. These layer comparison matrices, visualized as heatmaps in Figure 4, offer further insights into the nuanced landscape of political communication and polarization within the U.S. Senate.
The Jeffrey degree dissimilarity function calculates differences between degree distributions of layers, with higher values indicating greater differences. Among Democrats, Jeffrey degree values are noticeably elevated when comparing the identity layer with those of women, labor, immigration, and especially environment. This suggests a unique assembly of senators engaging with identity-related discussions, underlining the range of perspectives within Democratic dialogues on identity. For Republicans, our data reveals pronounced Jeffrey degree values between the economy layer and most other layers, which implies that discussions related to the economy involve a distinct group of actors, distinct from those engaged in other topics.
Examining Pearson degree scores, which provide correlations between actors’ degrees across different layers, reveals more nuances. For Democrats, there is a predominant positive correlation across various topics, suggesting aligned actor activity levels and possibly indicating coordinated communication strategies or shared priorities. For instance, there are notable positive correlations between economy and other layers like health, identity, and labor, implying actors active in economy discussions are similarly engaged in these areas. On the other hand, the Republican network presents varied Pearson degree correlations. Positive correlations are observed between economy and health, and economy and immigration, while a negative correlation is evident between identity and almost all other layers. This variation suggests that actors active in economy discussions within the Republican network may not be as engaged in identity-related dialogues, pointing to divergent engagement patterns and potentially different focal points or priorities among Republican actors.
Figure 5 presents an overview of the flattened network that spans all layers, illustrating the distribution of party relations across diverse topic layers. A unique color illustrates each layer’s relationship. Notably, the ’Economy’ topic, represented by the purple edges, stands out as the predominant theme, making up 41.17% of the relationships. Following the economy, the ’Identity’ and ’Health’ topics emerge as the next most dominant, accounting for 25.89% and 22.53%, respectively. At the lower end, ’Women’, ’Environment’, and ’Labor’ surface as the least addressed topics, each representing less than 3% of the relationships. This observation suggests that while these topics retain importance, they may not be at the core of the BPHC communication strategy as much as the more prevalent subjects. The node sizes, on the other hand, are scaled based on their degrees, indicating the number of connections. Democrats are represented by blue nodes, Republicans by red, and the gray nodes likely depict Independents. The figure clearly underscores that Democratic senators are considerably more engaged in BPHC across various topics compared to their Republican peers.
Figure 6 offers additional insights by detailing the relationships either between or within parties for each layer individually. In the Economy layer, the dense clustering of both blue and red nodes reaffirms that economic issues garner significant attention from both parties, positioning it as a central policy discussion area. In the Health layer, both parties seem notably engaged, though there is a modest prevalence of blue nodes. Intriguingly, in the Immigration layer, we observe two distinct clusters. This implies that even though both parties engage with BPHC, they might be approaching from unique viewpoints or potentially targeting their respective communities with specific hashtags. For the remaining layers, which include Women, Environment, Identity, and Labor, there is an implicit pattern of more pronounced Democratic involvement. It is particularly noteworthy that the ’Identity’ layer, despite ranking as the second most significant in relationship density, is overwhelmingly dominated by blue nodes.
Figure 5 and Figure 6 shed light on the intricacies of the topics that feature prominently in the BPHC strategies across different political parties. They offer a multi-layered understanding of topic engagement but do not clearly indicate if parties pivot their BPHC strategies to break away from echo chambers, a phenomenon where ideas and beliefs are amplified by repeated internal communication. To obtain a clearer perspective on which topics serve as catalysts for inter-party dialogues, we further filtered the data to focus solely on inter-party edges, as presented in Figure 7. At first glance, it is evident that the Economy and Health layers act as significant platforms for fostering dialogue between the parties. These layers brim with interactions, underscoring their roles as central arenas for bipartisan discussions. In stark contrast, the Immigration and Labor layers tell a different story. They are notably devoid of any inter-party conversations, suggesting that, at least within the context of BPHC strategies, these topics might be approached with more insular perspectives by each party. The implications could be that discussions on these layers are more internally focused, potentially amplifying party-specific narratives.

6. Discussion

Hashtags play a crucial role on social media platforms, functioning not only as bookmarks for content but also as symbols representing communities. On the one hand, hashtags connect tweets with similar subjects, simplifying the process of identifying and categorizing discussions related to specific hashtags [55]. On the other hand, hashtags go beyond their functional utility—they serve as symbols of communities, functioning as rallying points and banners where individuals with shared interests and concerns unite [38].
This article investigates whether political figures strategically utilize the latter function of hashtags to establish their own communities, rather than solely using them as an effective means to increase the visibility of certain figures, launch attacks against opponents, or merely express support for the ideas encapsulated within the hashtags. This communication strategy, which we termed “Building Political Hashtag Communities" (BPHC), not only provides politicians with a direct means to connect and engage with citizens but also facilitates the organization of their base into distinct online groups through deliberate reciprocal usage of hashtags.
Our research sheds light on the complex dynamics within the Senate, particularly regarding the use of BPHC as a communication strategy. Our results reveal that Democrats, specifically incumbents, women, and representatives from the southern region, are more actively engaged in this BPHC strategy, leading to denser collaboration networks compared to their counterparts (namely Republicans, candidates, men, and representatives from the northern region). These findings align with prior research (i.e., [4,6,7,8,9,10,11]) on congressional tweets, confirming that certain subgroups among politicians could cultivate deeper connections and foster more immediate and dynamic lines of communication to varying degrees.
However, a more nuanced examination using advanced multiplex community analysis offers fresh insights into elite behavior on Twitter. Our multiplex analysis of Democratic and Republican networks reveals areas within the political spectrum, particularly in economic policy discussions, where the conventional wisdom of diverging characteristics may not entirely hold. In these contexts, the internal dynamics of interaction appear to be consistent across party lines, indicating that both Republicans and Democrats employ BPHC as their primary communication strategy. Furthermore, while Republicans engage more intensively in BPHC discussions related to immigration, Democrats heavily rely on BPHC topics related to identity and women. Only a select group of Democratic actors use BPHC for discussions on labor and the environment, areas where Republicans are rarely or never involved in BPHC efforts. More interestingly, using hashtag involvement detection algorithms, and without considering the content of the tweets, we are able to infer the political affiliation of each politician with up to 100.0% accuracy in the Senate. Considering the similar levels of Twitter activity (e.g., number of tweets sent, number of # usage) among party members, as shown in Table 1, these nuances underscore the multifaceted nature of the BPHC strategies employed by different political actors.
In conclusion, our study underscores the significance of BPHC as a communication strategy in the political arena. It reveals complex patterns of engagement and highlights the potential for cross-party collaboration within certain BPHC networks. As we continue to navigate the digital age, understanding the dynamics of BPHC and its implications for political discourse will be essential for both scholars and practitioners in the field of political communication. The enduring question of the impact of BPHC on public opinion warrants further exploration. While our study has primarily focused on the strategies employed by political actors, it remains an open question whether BPHC influences the beliefs and attitudes of regular social media users. Future research could delve into the extent to which engagement with BPHC content shapes public discourse and perceptions. Does exposure to BPHC discussions lead to greater polarization, or does it encourage more nuanced and informed perspectives among social media users? Additionally, the longevity of BPHC communities and their adaptability over time present intriguing possibilities. How do these communities evolve in response to changing political landscapes, emerging issues, and shifts in public sentiment? Can BPHC strategies remain effective in an environment where the social media landscape is constantly evolving? We also acknowledge a limitation in our study due to the absence of a time-based analysis distinguishing pre- and post-election periods. Future research could also explore behavior patterns before and after the election, offering a more comprehensive understanding of temporal dynamics in BPHC strategies. It would have been promising to understand how these dynamics evolve over time, particularly in response to shifting political climates, emerging policy challenges, and the continuous evolution of social media as a tool for political communication and public engagement.
Exploring these questions can provide valuable insights into the evolving nature of political communication and online communities. This research lays the groundwork for an expanded understanding of political interactions on social media, providing a lens through which we can better comprehend the nuanced dynamics of political communication in the digital age. To optimize our network construction strategy, we also introduce a new topic modeling method, utilizing OpenAI’s Chat GPT-4. This strategy enables the categorization of a vast array of hashtags using both supervised and unsupervised techniques, overcoming previous limitations (e.g., sample restrictions or biased categorizations). Future studies could potentially explore and analyze the opportunities provided by new LLM models, contributing to a global understanding of digital political communication and collaboration.

Author Contributions

Conceptualization, Y.E.O.; methodology, H.P.; software, H.P. and Y.A.; data collection Y.A.; analysis, H.P. and Y.E.O.; writing, Y.E.O., H.P. and Y.A.; visualization, Y.E.O., H.P. and Y.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data and analyses codes are available in our GitHub repository (https://github.com/harunpirim/multilayer_polarization/ accessed on 21 November 2023).

Acknowledgments

We thank the Digital Society Project for providing access to candidate data. We would also like to thank the reviewers and the editors of Computation, who provided extensive and constructive feedback to improve our manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1

Table A1. Supervised network: proportional scores.
Table A1. Supervised network: proportional scores.
CategoryLayerEdgesTotal Edgesp-ValueMeanLowerUpper
EconomyIncumbent3388130.4160.4160.3820.451
EnvironmentIncumbent258130.0310.0310.0200.046
HealthIncumbent2288130.2800.2800.2500.313
ImmigrationIncumbent178130.0210.0210.0130.034
IdentityIncumbent1798130.2200.2200.1920.251
LaborIncumbent208130.0250.0250.0150.038
WomenIncumbent68130.0070.0070.0030.017
EconomyCandidate1774380.4040.4040.3580.452
EnvironmentCandidate54380.0110.0110.0040.028
HealthCandidate684380.1550.1550.1230.193
ImmigrationCandidate204380.0460.0460.0290.071
IdentityCandidate1364380.3110.3110.2680.356
LaborCandidate64380.0140.0140.0060.031
WomenCandidate264380.0590.0590.0400.087
EconomyDemocrat31913830.2310.2310.2090.254
EnvironmentDemocrat4313830.0310.0310.0230.042
HealthDemocrat36113830.2610.2610.2380.285
ImmigrationDemocrat1513830.0110.0110.0060.018
IdentityDemocrat55713830.4030.4030.3770.429
LaborDemocrat5013830.0360.0360.0270.048
WomenDemocrat3813830.0270.0270.0200.038
EconomyRepublican2653680.7200.7200.6710.765
EnvironmentRepublican23680.0050.0050.0010.022
HealthRepublican263680.0710.0710.0480.103
ImmigrationRepublican723680.1960.1960.1570.241
IdentityRepublican13680.0030.0030.0000.017
WomenRepublican23680.0050.0050.0010.022
EconomyNorth16990.1620.1620.0980.252
EnvironmentNorth4990.0400.0400.0130.106
HealthNorth19990.1920.1920.1220.286
ImmigrationNorth2990.0200.0200.0040.078
IdentityNorth45990.4550.4550.3550.557
LaborNorth4990.0400.0400.0130.106
WomenNorth9990.0910.0910.0450.170
EconomySouth1472100.7000.7000.6320.760
EnvironmentSouth22100.0100.0100.0020.038
HealthSouth272100.1290.1290.0880.183
ImmigrationSouth292100.1380.1380.0960.194
IdentitySouth42100.0190.0190.0060.051
WomenSouth12100.0050.0050.0000.030
EconomyFemale852840.2990.2990.2470.357
EnvironmentFemale12840.0040.0040.0000.023
HealthFemale862840.3030.3030.2510.360
ImmigrationFemale32840.0110.0110.0030.033
IdentityFemale852840.2990.2990.2470.357
LaborFemale92840.0320.0320.0160.061
WomenFemale152840.0530.0530.0310.087
EconomyMale721460.4930.4930.4100.577
EnvironmentMale41460.0270.0270.0090.073
HealthMale361460.2470.2470.1810.326
ImmigrationMale31460.0210.0210.0050.064
IdentityMale271460.1850.1850.1270.259
LaborMale31460.0210.0210.0050.064
WomenMale11460.0070.0070.0000.043
Table A2. Unsupervised network: proportional scores.
Table A2. Unsupervised network: proportional scores.
CategoryLayerEdgesTotal Edgesp-ValueMeanLowerUpper
AwarenessCandidate57912910.4480.4480.4210.476
CampaignCandidate42212910.3270.3270.3010.353
Federal Pol.Candidate13112910.1010.1010.0860.120
MediaCandidate4212910.0330.0330.0240.044
RightsCandidate4412910.0340.0340.0250.046
State PoliticsCandidate7312910.0570.0570.0450.071
AwarenessIncumbent78412460.6290.6290.6020.656
CampaignIncumbent8812460.0710.0710.0570.087
Federal Pol.Incumbent16212460.1300.1300.1120.150
MediaIncumbent10312460.0830.0830.0680.100
RightsIncumbent10512460.0840.0840.0700.101
State PoliticsIncumbent412460.0030.0030.0010.009
AwarenessNorth1031900.5420.5420.4690.614
CampaignNorth271900.1420.1420.0970.202
Federal Pol.North231900.1210.1210.0800.178
MediaNorth61900.0320.0320.0130.071
RightsNorth261900.1370.1370.0930.196
State PoliticsNorth51900.0260.0260.0100.064
AwarenessSouth2574470.5750.5750.5280.621
CampaignSouth364470.0810.0810.0580.111
Federal Pol.South894470.1990.1990.1640.240
MediaSouth444470.0980.0980.0730.131
RightsSouth104470.0220.0220.0110.042
State PoliticsSouth114470.0250.0250.0130.045
AwarenessDemocrat89418170.4920.4920.4690.515
CampaignDemocrat28618170.1570.1570.1410.175
Federal Pol.Democrat29618170.1630.1630.1460.181
MediaDemocrat6818170.0370.0370.0290.047
RightsDemocrat25618170.1410.1410.1250.158
State PoliticsDemocrat1718170.0090.0090.0060.015
AwarenessRepublican51310390.4940.4940.4630.525
CampaignRepublican19910390.1920.1920.1680.217
Federal Pol.Republican22510390.2170.2170.1920.243
MediaRepublican6210390.0600.0600.0460.076
RightsRepublican1410390.0130.0130.0080.023
State PoliticsRepublican2610390.0250.0250.0170.037
AwarenessFemale2364460.5290.5290.4820.576
CampaignFemale1234460.2760.2760.2350.320
Federal Pol.Female254460.0560.0560.0370.083
MediaFemale184460.0400.0400.0250.064
RightsFemale394460.0870.0870.0640.119
State PoliticsFemale54460.0110.0110.0040.028
AwarenessMale1723280.5240.5240.4690.579
CampaignMale543280.1650.1650.1270.210
Federal Pol.Male523280.1590.1590.1220.204
MediaMale133280.0400.0400.0220.068
RightsMale173280.0520.0520.0310.083
State PoliticsMale203280.0610.0610.0390.094
Table A3. ChatGPT Prompts.
Table A3. ChatGPT Prompts.
Prompt CodePrompt
unsup 01This list consists of hashtags used by US politicians during the campaign period. Can you please categorize these hashtags into meaningful categories.
unsup 03This list consists of hashtags used by US Senators during the campaign period. Categorize these hashtags into politically meaningful categories, considering the polarizing aspects of US politics.
unsup 04This list consists of hashtags used by US Senators during the campaign period. Categorize these hashtags into politically meaningful categories, considering the polarizing aspects of US politics. Please ensure every hashtag is in only one category and be sure every hashtag is categorized.
unsup 05I have a list of hashtags that were used by US politicians during the 2022 midterm election campaign period. I’d like you to categorize these into distinct meaningful categories. Please present the results in a markdown table format with columns titled ’Hashtag’ and ’Category’.
unsup 05 stage 2 (unsuccesful)This is a request to recategorize hashtag groups you previously provided me. Recall, these hashtag categories are related to politicians’ use of Twitter during the campaign period. Please regroup all these categories into 10 groups maximum. You can also have an 11th group, labeled as “other”, for cases that you are not sure. The hashtag categories will be used to investigate the network among politicians using multilayer techniques. Each category will be considered as a network layer, and there will be a network between two nodes (politicians) if they both use any hashtag in the same category. Please provide the results in a table format with the following columns: previous category (the list I gave you above) and new category (the new list including 10 meaningful groups and other).
unsup 05_2This is a request to categorize US politicians’ hashtags during the campaign period into meaningful categories. The hashtags will be used to investigate the network among politicians using multilayer techniques. Each category will be considered as a network layer, and there will be a network between two nodes (politicians) if they both use any hashtag in the same category. Please provide the results in a table format with the following columns: hashtag and category.
unsup 06this list consists of hashtags and some example tweets used by us politicians during the campaign period. can you please categorize these hashtags into meaningful categories. give me the results as a table columns being: hashtag and category
sup 01-healthThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, health care. Those hashtags might include debates over the Affordable Care Act (ACA), universal healthcare, Medicare for All, and how to best ensure affordable access to healthcare.
sup 01-immiigrationThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, immigration. Those hashtags might include debates over border security, the treatment of undocumented immigrants, Deferred Action for Childhood Arrivals (DACA), and family separations at the border.
sup 01-climateThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, climate change and environmental policy. Those hashtags might include debates over climate change, renewable energy policies, and the role of government regulation in environmental protection.
sup 01-gun controlThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, gun control. Those hashtags might include debates over background checks, assault weapon bans, and concealed carry laws
sup 01-economic policy            This list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, economic policy. Those hashtags might include debates over taxation, government spending, the national debt, social welfare programs, and the role of government in regulating the economy.
sup 01-social justiceThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, Racial and Social Justice. Those hashtags might include debates over systemic racism, police reform, affirmative action, and the Black Lives Matter movement that have sparked intense debate.
sup 01-abortionThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, abortion. Those hashtags might Include debates over abortion rights, with strong feelings on both pro-life and pro-choice sides.
sup 01-LGBTQ+ RightsThis list consists of hashtags used by US Senators during the campaign period. I want you to detect hashtags related to one of the most polarized issues, LGBTQ+ Rights. Those hashtags might include debates over same-sex marriage, transgender rights, and anti-discrimination laws that continue to be contentious.
unsup detailed                           This is a request to categorize the hashtags used by US politicians during the campaign period into meaningful categories for the purpose of investigating the network among politicians. In the resulting multiplex network, the nodes will represent politician accounts, the edges will represent common hashtags, and the layers will represent the categories you provide. Each category will be considered a network layer, and there will be a network between two nodes (politicians) if they both use any hashtag in the same category. Please provide the results in a table format with the following columns: hashtag and category. When categorizing the hashtags, please ensure that the layers of the network (categories) are as uncorrelated as possible to maximize the network analysis effectiveness.

References

  1. Grimmer, J. Representational Style in Congress: What Legislators Say and Why It Matters; Cambridge University Press: New York, NY, USA, 2013. [Google Scholar] [CrossRef]
  2. Prior, M. Media and Political Polarization. Annu. Rev. Political Sci. 2013, 16, 101–127. [Google Scholar] [CrossRef]
  3. Straus, J.R.; Williams, R.T.; Shogan, C.J.; Glassman, M.E. Congressional Social Media Communications: Evaluating Senate Twitter Usage. Online Inf. Rev. 2016, 40, 643–659. [Google Scholar] [CrossRef]
  4. Russell, A. US Senators on Twitter: Asymmetric Party Rhetoric in 140 Characters. Am. Politics Res. 2018, 46, 695–723. [Google Scholar] [CrossRef]
  5. Gelman, J. Partisan Intensity in Congress: Evidence from Brett Kavanaugh’s Supreme Court Nomination. Political Res. Q. 2021, 74, 450–463. [Google Scholar] [CrossRef]
  6. Golbeck, J.; Auxier, B.; Bickford, A.; Cabrera, L.; McHugh, M.C.; Moore, S.; Hart, J.; Resti, J.; Rogers, A.; Zimmerman, J. Congressional twitter use revisited on the platform’s 10-year anniversary. J. Assoc. Inf. Sci. Technol. 2018, 69, 1067–1070. [Google Scholar] [CrossRef]
  7. Hemphill, L.; Otterbacher, J.; Shapiro, M. What’s congress doing on twitter? In Proceedings of the 2013 Conference on Computer Supported Cooperative Work, San Antonio, TX, USA, 23–27 February 2013; pp. 877–886. [Google Scholar] [CrossRef]
  8. Hemphill, L.; Russell, A.; Schöpke-Gonzalez, A.M. What drives US congressional members’ policy attention on Twitter? Policy Internet 2021, 13, 233–256. [Google Scholar] [CrossRef]
  9. Evans, H.K.; Clark, J.H. “You Tweet Like a Girl!”: How Female Candidates Campaign on Twitter. Am. Politics Res. 2015, 44, 326–352. [Google Scholar] [CrossRef]
  10. Gainous, J.; Wagner, K.M. Tweeting to Power: The Social Media Revolution in American Politics; Oxford University Press: New York, NY, USA, 2014. [Google Scholar] [CrossRef]
  11. Atkinson, M.L.; Windett, J.H. Gender stereotypes and the policy priorities of women in Congress. Political Behav. 2019, 41, 769–789. [Google Scholar] [CrossRef]
  12. Orhan, Y.E. The Relationship between Affective Polarization and Democratic Backsliding: Comparative Evidence. Democratization 2022, 29, 714–735. [Google Scholar] [CrossRef]
  13. Evans, H.K.; Cordova, V.; Sipole, S. Twitter Style: An Analysis of How House Candidates Used Twitter in Their 2012 Campaigns. PS Political Sci. Politics 2014, 47, 454–462. [Google Scholar] [CrossRef]
  14. Larsson, A.O. The EU Parliament on Twitter: Assessing the Permanent Online Practices of Parliamentarians. J. Inf. Technol. Politics 2015, 12, 149–166. [Google Scholar] [CrossRef]
  15. Golbeck, J.; Grimes, J.M.; Rogers, A. Twitter use by the U.S. Congress. J. Am. Soc. Inf. Sci. Technol. 2010, 61, 1612–1621. [Google Scholar] [CrossRef]
  16. Kruikemeier, S. How political candidates use Twitter and the impact on votes. Comput. Hum. Behav. 2014, 34, 131–139. [Google Scholar] [CrossRef]
  17. Auter, Z.J.; Fine, J.A. Social Media Campaigning: Mobilization and Fundraising on Facebook. Soc. Sci. Q. 2018, 99, 185–200. [Google Scholar] [CrossRef]
  18. Gelman, J.; Wilson, S.L.; Petrarca, C.S. Mixing messages: How candidates vary in their use of Twitter. J. Inf. Technol. Politics 2021, 18, 101–115. [Google Scholar] [CrossRef]
  19. Mechkova, V.; Wilson, S. Does gender still matter for politics? The Case of the 2018 US Elections on Twitter. Digital Society Project Working Paper. 2019.
  20. Stromer-Galley, J.; Rossini, P. Categorizing political campaign messages on social media using supervised machine learning. J. Inf. Technol. Politics 2023, 1–14. [Google Scholar] [CrossRef]
  21. Jungherr, A. Twitter use in election campaigns: A systematic literature review. J. Inf. Technol. Politics 2016, 13, 72–91. [Google Scholar] [CrossRef]
  22. Barberá, P. Birds of the Same Feather Tweet Together. Bayesian Ideal Point Estimation Using Twitter Data. Political Anal. 2015, 23, 76–91. [Google Scholar] [CrossRef]
  23. Hemsley, J. Followers Retweet! The Influence of Middle-Level Gatekeepers on the Spread of Political Information on Twitter. Policy Internet 2019, 11, 280–304. [Google Scholar] [CrossRef]
  24. Clinton, J.; Jackman, S.; Rivers, D. The Statistical Analysis of Roll Call Data. Am. Political Sci. Rev. 2004, 98, 355–370. [Google Scholar] [CrossRef]
  25. Poole, K.T.; Rosenthal, H. Patterns of Congressional Voting on JSTOR. Am. J. Political Sci. 1991, 35, 228–278. [Google Scholar] [CrossRef]
  26. Gerrish, S.; Blei, D. How They Vote: Issue-Adjusted Models of Legislative Behavior. In Advances in Neural Information Processing Systems; Pereira, F., Burges, C., Bottou, L., Weinberger, K., Eds.; Curran Associates: Red Hook, NY, USA, 2012; Volume 25. [Google Scholar]
  27. Livne, A.; Simmons, M.; Adar, E.; Adamic, L. The Party Is Over Here: Structure and Content in the 2010 Election. ICWSM 2011, 5, 201–208. [Google Scholar] [CrossRef]
  28. Conover, M.D.; Ratkiewicz, J.; Francisco, M.; Gonçalves, B.; Flammini, A.; Menczer, F. Political Polarization on Twitter. ICWSM 2011, 133, 89–96. [Google Scholar] [CrossRef]
  29. Cherepnalkoski, D.; Mozetic, I. Retweet networks of the European Parliament: Evaluation of the community structure. Appl. Netw. Sci. 2016, 1, 1–20. [Google Scholar] [CrossRef]
  30. Hemphill, L.; Culotta, A.; Heston, M. #Polar Scores: Measuring partisanship using social media content. J. Inf. Technol. & Politics 2016, 13, 365–377. [Google Scholar] [CrossRef]
  31. Hanteer, O.; Rossi, L. An Innovative Way to Model Twitter Topic-Driven Interactions Using Multiplex Networks. Front. Big Data 2019, 2, 463659. [Google Scholar] [CrossRef]
  32. Chamberlain, J.M.; Spezzano, F.; Kettler, J.J.; Dit, B. A Network Analysis of Twitter Interactions by Members of the U.S. Congress. ACM Trans. Soc. Comput. 2021, 4, 1–22. [Google Scholar] [CrossRef]
  33. Bonifazi, G.; Breve, B.; Cirillo, S.; Corradini, E.; Virgili, L. Investigating the COVID-19 vaccine discussions on Twitter through a multilayer network-based approach. Inf. Process. Manag. 2022, 59, 103095. [Google Scholar] [CrossRef]
  34. Bode, L.; Hanna, A.; Yang, J.; Shah, D.V. Candidate Networks, Citizen Clusters, and Political Expression: Strategic Hashtag Use in the 2010 Midterms. ANNALS Am. Acad. Political Soc. Sci. 2015, 659, 149–165. [Google Scholar] [CrossRef]
  35. Pilař, L.; Stanislavská, L.K.; Kvasnička, R.; Bouda, P.; Pitrová, J. Framework for Social Media Analysis Based on Hashtag Research. Appl. Sci. 2021, 11, 3697. [Google Scholar] [CrossRef]
  36. Bruns, A.; Burgess, J. The use of Twitter hashtags in the formation of ad hoc publics. In Proceedings of the 6th European Consortium for Political Research (ECPR) General Conference 2011, The European Consortium for Political Research (ECPR), Reykjavik, Iceland, 25–27 August 2011; pp. 1–9. [Google Scholar]
  37. Litt, E. Knock, Knock. Who’s There? The Imagined Audience. J. Broadcast. Electron. Media 2012, 56, 330–345. [Google Scholar] [CrossRef]
  38. Yang, L.; Sun, T.; Zhang, M.; Mei, Q. We know what@ you# tag: Does the dual role affect hashtag adoption? In Proceedings of the 21st International Conference on World Wide Web, Lyon, France, 16–20 April 2012; pp. 261–270. [Google Scholar] [CrossRef]
  39. Kuo, R. Racial justice activist hashtags: Counterpublics and discourse circulation. New Media Soc. 2016, 20, 495–514. [Google Scholar] [CrossRef]
  40. Vargas, L.; Emami, P.; Traynor, P. On the detection of disinformation campaign activity with network analysis. In Proceedings of the 2020 ACM SIGSAC Conference on Cloud Computing Security Workshop, Virtual Event, 9 November 2020; pp. 133–146. [Google Scholar]
  41. Bianconi, G. Multilayer Networks: Structure and Function; Oxford University Press: Oxford, UK, 2018. [Google Scholar] [CrossRef]
  42. Bródka, P.; Chmiel, A.; Magnani, M.; Ragozini, G. Quantifying layer similarity in multiplex networks: A systematic study. R. Soc. Open Sci. 2018, 5, 171747. [Google Scholar] [CrossRef] [PubMed]
  43. Csárdi, G.; Nepusz, T.; Traag, V.; Horvát, S.; Zanini, F.; Noom, D.; Müller, K. igraph: Network Analysis and Visualization in R, R Package Version 1.5.1; Zenodo, 2023. [CrossRef]
  44. Magnani, M.; Rossi, L.; Vega, D. Analysis of Multiplex Social Networks with R. J. Stat. Softw. 2021, 98, 1–30. [Google Scholar] [CrossRef]
  45. Fleisher, R.; Bond, J.R. The Shrinking Middle in the US Congress. Br. J. Political Sci. 2004, 34, 429–451. [Google Scholar] [CrossRef]
  46. Theriault, S.M. The Gingrich Senators: The Roots of Partisan Warfare in Congress; Oxford University Press: New York, NY, USA, 2013. [Google Scholar] [CrossRef]
  47. Roesslein, J. Tweepy: Twitter for Python! 2020. Available online: https://github.com/tweepy/tweepy (accessed on 21 November 2023).
  48. Pandas Development Team. Pandas-Dev/Pandas: Pandas. 2020. Available online: https://zenodo.org/records/10107975 (accessed on 21 November 2023).
  49. Kocoń, J.; Cichecki, I.; Kaszyca, O.; Kochanek, M.; Szydło, D.; Baran, J.; Bielaniewicz, J.; Gruza, M.; Janz, A.; Kanclerz, K.; et al. ChatGPT: Jack of all trades, master of none. Inf. Fusion 2023, 99, 101861. [Google Scholar] [CrossRef]
  50. Qin, C.; Zhang, A.; Zhang, Z.; Chen, J.; Yasunaga, M.; Yang, D. Is ChatGPT-4 a general-purpose natural language processing task solver? arXiv 2023, arXiv:2302.06476. [Google Scholar]
  51. Törnberg, P. Chatgpt-4 outperforms experts and crowd workers in annotating political twitter messages with zero-shot learning. arXiv 2023, arXiv:2304.06588. [Google Scholar]
  52. Sun, X.; Dong, L.; Li, X.; Wan, Z.; Wang, S.; Zhang, T.; Li, J.; Cheng, F.; Lyu, L.; Wu, F.; et al. Pushing the Limits of ChatGPT on NLP Tasks. arXiv 2023, arXiv:2306.09719. [Google Scholar]
  53. Chatbot Arena Leaderboard—A Hugging Face Space by Lmsys. Available online: https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard (accessed on 21 November 2023).
  54. Huynh, D. GitHub—OpenRefine/OpenRefine: OpenRefine Is a Free, Open Source Power Tool for Working with Messy Data and Improving It—github.com. Available online: https://github.com/OpenRefine/OpenRefine (accessed on 18 November 2023).
  55. Huang, J.; Katherine, M.T.; Efthimiadis, N. Conversational tagging in Twitter. In Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, Toronto, ON, Canada, 13–16 June 2010. [Google Scholar] [CrossRef]
Figure 1. Illustration of network constructions. In a topic layer (i.e., L1), if a hashtag (i.e., #H1) is used reciprocally, an edge is added to this layer. However, if in a topic layer (i.e., L2) a hashtag is used by only A or only B, an incident edge is not added to the layer.
Figure 1. Illustration of network constructions. In a topic layer (i.e., L1), if a hashtag (i.e., #H1) is used reciprocally, an edge is added to this layer. However, if in a topic layer (i.e., L2) a hashtag is used by only A or only B, an incident edge is not added to the layer.
Computation 11 00238 g001
Figure 2. Data pipeline.
Figure 2. Data pipeline.
Computation 11 00238 g002
Figure 3. This figure illustrates the level of involvement in BPHC among U.S. senators across various sub-groupings. The left panel displays the distribution of edges across networks derived from supervised modeling, whereas the right panel highlights the disparity among networks based on unsupervised modeling. The x-axis represents analytical network categories, and the y-axis denotes the number of edges. The colors within each bar represent the distribution of edges across various layers.
Figure 3. This figure illustrates the level of involvement in BPHC among U.S. senators across various sub-groupings. The left panel displays the distribution of edges across networks derived from supervised modeling, whereas the right panel highlights the disparity among networks based on unsupervised modeling. The x-axis represents analytical network categories, and the y-axis denotes the number of edges. The colors within each bar represent the distribution of edges across various layers.
Computation 11 00238 g003
Figure 4. These heat plots depict the Pearson degree (PD, upper panel) and Jeffrey degree (JD, lower panel) correlations across different layers among Democrats and Republicans. The darkest colorings represent higher values, and empty rows/columns imply no reciprocal hashtag involvement is detected.
Figure 4. These heat plots depict the Pearson degree (PD, upper panel) and Jeffrey degree (JD, lower panel) correlations across different layers among Democrats and Republicans. The darkest colorings represent higher values, and empty rows/columns imply no reciprocal hashtag involvement is detected.
Computation 11 00238 g004
Figure 5. Node colors represent two parties and independents: blue, Democrats; red, Republicans. Edge colors represent the topic layers.
Figure 5. Node colors represent two parties and independents: blue, Democrats; red, Republicans. Edge colors represent the topic layers.
Computation 11 00238 g005
Figure 6. BPHC strategy at each layer.
Figure 6. BPHC strategy at each layer.
Computation 11 00238 g006
Figure 7. Inter-party relationships at each layer.
Figure 7. Inter-party relationships at each layer.
Computation 11 00238 g007
Table 1. Data description.
Table 1. Data description.
GroupSub-Group# of Politicians# of Tweets# of Hashtags
Full Sample 12837,3611660
Party ID
Democrats6519,195929
Republicans6017,911886
Independent325519
Incumbency
Incumbent6012,009672
Candidate6825,3521205
Region
North9825,9421270
South3011,419585
Sex
Male9026,4191292
Female3810,942585
Notes: It is important to note that the number of tweets per user among Democrats and Republicans is almost the same, at around 295.
Table 2. Hashtag Groups.
Table 2. Hashtag Groups.
Modeling# of Layer# of Hashtags# of Tweets# of Ind. Users
Supervised
Economy3589077
Environment1231927
Health3590561
Immigration933621
Identity2846449
Women2243627
Labor1733225
Unsupervised
Appreciation1532785103
Campaign86741771
Federal Politics95829785
Media83140081
Rights54112453
State Politics168701562
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Orhan, Y.E.; Pirim, H.; Akbulut, Y. Building Political Hashtag Communities: A Multiplex Network Analysis of U.S. Senators on Twitter during the 2022 Midterm Elections. Computation 2023, 11, 238. https://doi.org/10.3390/computation11120238

AMA Style

Orhan YE, Pirim H, Akbulut Y. Building Political Hashtag Communities: A Multiplex Network Analysis of U.S. Senators on Twitter during the 2022 Midterm Elections. Computation. 2023; 11(12):238. https://doi.org/10.3390/computation11120238

Chicago/Turabian Style

Orhan, Yunus Emre, Harun Pirim, and Yusuf Akbulut. 2023. "Building Political Hashtag Communities: A Multiplex Network Analysis of U.S. Senators on Twitter during the 2022 Midterm Elections" Computation 11, no. 12: 238. https://doi.org/10.3390/computation11120238

APA Style

Orhan, Y. E., Pirim, H., & Akbulut, Y. (2023). Building Political Hashtag Communities: A Multiplex Network Analysis of U.S. Senators on Twitter during the 2022 Midterm Elections. Computation, 11(12), 238. https://doi.org/10.3390/computation11120238

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop