Article

Quantifying Bot Impact: An Information-Theoretic Analysis of Complexity and Uncertainty in Online Political Communication Dynamics

Department of Communication, University of California, 1 Shields Avenue, Davis, CA 95616, USA
*
Author to whom correspondence should be addressed.
Entropy 2025, 27(6), 573; https://doi.org/10.3390/e27060573
Submission received: 14 March 2025 / Revised: 12 May 2025 / Accepted: 23 May 2025 / Published: 28 May 2025
(This article belongs to the Special Issue Statistical Physics Approaches for Modeling Human Social Systems)

Abstract

Bots have become increasingly prevalent in the digital sphere and have taken up a proactive role in shaping democratic processes. While previous studies have focused on their influence at the individual level, their potential macro-level impact on communication dynamics remains underexplored. This study adopts an information-theoretic approach from dynamical systems theory to examine the role of political bots in shaping the dynamics of an online political discussion on Twitter. We quantify the components of this dynamic process in terms of its complexity, its predictability, and its entropy rate, i.e., the remaining uncertainty. Findings suggest that bot activity is associated with increased complexity and, simultaneously, with more uncertainty in the structural dynamics of online political communication. While our dataset features earlier-generation bots, the findings foreshadow the possibility of even more complex and uncertain online politics in the age of sophisticated and autonomous generative AI agents. Our presented framework showcases how this can be studied with information-theoretic measures from dynamical systems theory.

1. Introduction

The rise of bots in the digital age has transformed the online ecosystem in profound ways. Originally appearing in Internet Relay Chat networks in the early 1990s [1], bots have evolved into a diverse range of applications since then. From information-gathering bots that constantly scour the web [2] to algorithmic trading bots that engage in financial transactions [3], bots have become an integral part of the digital landscape. Among these, political bots have attracted particular attention due to their potential impact on public opinion and democratic processes [4].
Political bots are automated agents specifically tasked with public opinion manipulation. They are the epitome of the larger collection of so-called persuasive technology [5,6]. A multitude of studies have examined malicious bot activities and provided evidence of bot propaganda during elections and campaigns around the world [4,7,8,9,10]. Previous work suggests bots are capable of passing as humans and engaging in complex interactions with other users [11,12,13]. Among their manipulative strategies are spreading disinformation and amplifying divisive, polarizing content [14,15,16,17,18]. Although there is abundant evidence of bot engagement on Twitter, debates concerning the impact of their malicious activities remain unsettled [10,19,20].
While the majority of previous work focuses on their impact at the individual level, important questions remain at the macro level. For instance, how do bots influence the overall communication dynamics? Bots run on predictive algorithms, following (machine-learned) rules that algorithmically transform input to output. Does this make online political communication less uncertain, or more uncertain? Do bots simplify the resulting dynamics, or does the communication landscape become more complex with bots? These questions are becoming increasingly important, as the recent revolution in generative AI is unlikely to decrease the role of bots [21,22]. On the contrary, the new technological possibilities of generative AI are expected to lead to even more complex collective dynamics.
To contribute to a better understanding of these arising dynamics, in this work, we follow a long tradition that views communication as a dynamical process [23,24,25,26] and use tools rooted in information theory, i.e., Claude Shannon’s “mathematical theory of communication” [27,28], to answer these questions. The merger of information theory with theoretical computer science, whose roots go back to Kolmogorov [29,30] and Sinai [31], allows us to analyze dynamical systems by quantifying how much information and uncertainty are created and passed along a given process [32]. We adopt this methodological framework to analyze a case of online political discussion and quantify the arising dynamics of the system as a whole in terms of predictability, complexity, and remaining uncertainty. We then assess the relationship between political bot involvement and changes in the components of the process: the complexity and uncertainty levels of the communicative dynamics.
In line with related findings on editor bots in Wikipedia [33] and trading bots in the stock market [3], our findings reveal that political bots significantly impact the dynamics of online political communication, contributing both to its complexity and uncertainty. The presence of political bots is associated with a less predictable and stable online discourse, which can negatively affect public opinion formation around controversial topics. From a methodological perspective, applying a complex systems lens on political bots and online political communication highlights the need for further research into macro-level bot influence and the importance of considering communication processes as complex dynamic systems.

2. Literature Review

2.1. A Brief History of Bots

Bots are automated agents that perform tasks online [34,35]. They can run continuously without requiring any human intervention, though their agency is limited, and they can only act within the boundaries pre-defined in their scripts [36]. While they have surged in popularity and visibility in recent years, their existence goes back to the advent of computing technology [12,35], and they have come a long way since the introduction of Weizenbaum’s Eliza [37], the first chatbot. Today, bots are integral to a wide range of online applications, such as providing virtual customer service or automating content moderation [38,39,40]. Their evolution reflects a broader transformation in their use and overall impact, which greatly expanded with the rise of social media.
The emergence of social media platforms introduced social bots that automate content production and engagement, blurring the lines between genuine human communication and artificial interaction. Social bots can be defined as automated agents that imitate human behavior on social media [41]. They can serve benign purposes, like news bots that distribute news articles [38,42] or editor bots that oversee published content [36], or they can have more insidious purposes, such as spam attacks [43], identity theft [44], or public opinion manipulation [8,45].
A subgroup of social bots, political bots have garnered significant attention for their impact on online discourse [4]. These politically motivated agents have demonstrated their capacity to amplify specific narratives, contributing to public opinion polarization and posing challenges to the integrity of public discourse [4,8,15]. While several countries have attempted to enact regulations to curb computational propaganda efforts, these attempts have largely fallen short due to challenges with bot identification and overreliance on platform regulation [46,47]. Meanwhile, the underlying technology continues to evolve rapidly, outpacing regulatory efforts [10,12,48] and emphasizing the critical importance of ongoing research in this domain.

2.2. Social Bots with a Political Agenda

Although most social bots are automated to carry out simple, repetitive tasks [12], political bots are used for malicious purposes [49,50]. They can monitor user traffic while following a circadian rhythm to mimic real users [11]. The more they act human-like, the more likely they are to receive engagement from humans [13]. They can act alone or in coordination as botnets, amplifying or diminishing targeted viewpoints [48,51]. While their interference in political campaigns and electoral processes has been evidenced around the world [7,8,10,51,52,53,54,55,56,57], they have also been found to be actively polarizing across a range of social issues, such as immigration [58], climate change [59], the vaccine debates [14,60], the COVID-19 outbreak [61,62,63], and the war in Ukraine [64].
A large part of the previous work on algorithmic manipulation has focused on bot detection methods, which have evolved significantly since the initial attempts in 2010 [45,65,66]. Over time, researchers have explored different detection strategies, grouped under three main categories: the crowd-based approach, the network-based approach, and the feature-based approach [12,67]. Crowdsourcing approaches rely on human judgment to identify bots, but this comes with potential human annotation biases, in addition to issues with scalability and user privacy [12,68]. Network-based detection involves analyzing the structure of social networks and detecting automated agents through their connection patterns [69]. This approach assumes bots establish close connections with each other, but the literature suggests it may fail to capture automated agents integrated within genuine online communities [8]. Feature-based detection systems leverage a wide array of account characteristics to assess automation through supervised machine learning [70,71]; however, this approach necessitates continuous refinement to remain effective, given the ever-evolving nature of bots [72,73]. Recent work comparing bot detection strategies revealed that different approaches can produce markedly different results, which makes cross-study comparisons difficult and further compounds the issue [74,75]. Overall, bot detection remains an ongoing challenge, requiring constant adaptation from the research community.
Another major theme in the previous literature was bot influence over discussion networks. Bots were found to actively disseminate misinformation during elections [8,45], amplifying the reach of low-credibility content such as fake news [48]. Studies show that bots spread polarizing content, fostering fear, hate, and violence [51,76]. They are centrally positioned in the discussion networks to influence ongoing discourse through retweets, especially during divisive events [8,48,77,78]. Even weakly connected bots can impact discussions by engaging with influential users [51], alter network sentiments [79], sway network dynamics via their followers [80], and indirectly shape discussions by influencing search engines and recommender systems [81].
However, it should be noted that the effectiveness of bot manipulation is contested. Bastos and Mercea [7] suggested that their influence over the entire discussion may be overstated, while a more recent study showed that bots occupy less central positions within the discussion networks, compared to verified influential accounts [20]. There is evidence to suggest that, while bots do amplify certain divisive narratives, they fail at diffusion, as most of their content fails to reach new users beyond their existing audience [52]. Users that are already predisposed to extreme views are most likely to encounter bot-generated content, suggesting minimal shifts in overall public opinion and sentiment [82].
At the individual level, research demonstrates that individuals exhibit perceptual biases concerning the prevalence and impact of bots, exaggerating their presence and others’ susceptibility to their influence [83]. Partisan-motivated reasoning also seems to play an important role, as users are more likely to engage with human-like bot accounts when they share the same partisanship [13] and less likely to examine the account when they share the same stance [84]. Despite minimal direct interactions between bots and users on Twitter, a study of over 4000 users highlighted bots’ substantial influence on opinions through indirect exposure, especially on contentious topics [85]. This underscores a general lack of proficiency among users in distinguishing bots from humans [74], often leading to misidentification [86].
Although there is ample evidence regarding bot activity on Twitter, debates about the impact of these malicious activities are yet to be settled [19,20,80]. While some researchers have been warning about their increasing sophistication [41], and recent evidence suggests that political bots have the potential to become more advanced [65,87], others have found that the majority of currently available commercial services and tools only provide rather simplistic and repetitive automation [88]. Most of the Twitter bots in a recent study were found to be “spammers”, with no advanced capabilities and limited intelligence [88]. However, this does not necessarily indicate that their limited capabilities are harmless.
One possible impact of political bots could be on the overall communication process as a whole, a potential that has largely been underexplored in previous work.

2.3. Online Political Communication as a Dynamical Process

Political bots, like all other bots, are automated scripts that follow a predefined path and act within a defined rule set. In a way, they epitomize predictability by design. At first glance, this deterministic nature may suggest a potential to diminish uncertainty in online political communication, making it more predictable through repetitive actions governed by parameters preset by their developers. Bot activity, by its preprogrammed, repetitive essence, should logically reduce the complexity of online interactions. However, despite their inherently predictable behaviors, at the macro level, the presence of political bots on social media may also introduce a level of complexity and unpredictability into online political discourse, which can be understood as an emergent phenomenon.
To study this, we use Claude Shannon’s foundational framework [28] of information theory, which introduces the principle that the value of information within a message is inversely proportional to its predictability. In other words, the value of information is quantified not directly by the data transmitted, but by the surprise it delivers to the recipient. The more uncertainty gets reduced, the more information gets communicated. Information is the opposite of uncertainty, and uncertainty can be measured with probability theory. To illustrate, suppose that we are transmitting sequences that include only two characters ‘A’ and ‘B’. The message ‘ABABABA…’ is more predictable than a rather random series of ‘ABBBAABB…’. Repetitive and predictable structures convey less information. If it is more difficult to predict the next letter, there is more uncertainty.
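To make this intuition concrete, the following minimal sketch (our illustration, not part of the original analysis) estimates the conditional entropy of the next character given the previous one. The perfectly alternating string scores zero bits, because the next character is fully determined, while the irregular string retains uncertainty:

```python
from collections import Counter
from math import log2

def conditional_entropy(sequence: str) -> float:
    """H(next | previous): average uncertainty about the next symbol, in bits."""
    pairs = Counter(zip(sequence, sequence[1:]))   # counts of (previous, next) pairs
    prevs = Counter(sequence[:-1])                 # counts of the preceding symbol
    n = len(sequence) - 1
    return -sum((c / n) * log2(c / prevs[prev]) for (prev, _), c in pairs.items())

print(conditional_entropy("ABABABABABAB"))  # 0.0: next character fully predictable
print(conditional_entropy("ABBBAABBABAB"))  # > 0: residual uncertainty remains
```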
In this sense, information theory has long been applied to quantify predictable patterns in the analysis of dynamical systems. This makes it possible to quantify how much information and uncertainty are created and passed along at each step of a dynamical process. The methods derived have found applications in a variety of fields, such as physics [89], biology [90], neuroscience [91], psychology [92], and communication [3]. In this framework, the unpredictable aspect of a process is quantified with the traditional concept of remaining uncertainty (i.e., the entropy rate), as proposed by Shannon [28]. The amount of predictable structure inherent in the process can be quantified by predictive complexity. Computational mechanics [93] conceptualizes it as the ‘minimum size, maximally predictive machinery’ that can recreate the structural pattern. Its entropy is the predictive complexity, i.e., the complexity required to produce the predictable pattern [89].
Although this approach has relevance to various aspects of communication science, only a few scholars have explicitly utilized this framework to analyze the multi-level dynamics of online communication [94]. Consequently, despite the fact that Shannon’s original paper is literally called “A Mathematical Theory of Communication” [28], the application of information theory to communication as a dynamical system has been limited. In this study, we consider online political communication as an example of a dynamic process.

2.4. Research Questions

On one hand, the pre-programmed simplicity of traditional political bot behaviors, and the premise that they are used to elicit predictable outcomes, as evidenced in the previous work, might suggest a decrease in the complexity and uncertainty of communication dynamics. Most bots engage in simple amplification tasks, such as liking and resharing existing content to simulate social contact [73,78,88]. Considering that they can efficiently perform such repetitive tasks in bulk and at a much higher frequency than human users [78,95], it could be argued that political bots are likely to make the overall communication dynamics more predictable, thereby decreasing the complexity and uncertainty inherent in the communicative process.
Conversely, the evidence regarding their capability to trigger emotional contagion around negative topics [63], or in spreading polarizing, sensational content [48,62,68], suggests a compelling counterargument. Such activities by political bots may be introducing significant levels of complexity and unpredictability into online political communication, seemingly increasing the system entropy.
Viewed through the lens of information theory, the level of complexity in online political communication can be determined by the frequency of recurring patterns that emerge during the dynamic process [89], while the remaining uncertainty is the measure of average randomness that is not captured by the structured complexity of the process [28]. It has been previously shown that social bots can have an impact on dynamic processes. In studies on the communicative turns of edits on Wikipedia [33] and trading patterns in foreign exchange markets [3], Hilbert and Darmon reported that algorithmic bots made the structural dynamics of the communicative process more predictable, more complex and, at the same time, more uncertain. They found that bots resolved much of the previous uncertainty, but they also fine-grained the dynamic, both in space (variety) and time (patterns), which created new complexity and induced new uncertainties in the process.
However, editor and trading bots operate in an entirely different context than a Twitter bot with a political agenda, and as a result, their respective impact may vary significantly. In this work, we adopt the same approach and expand on this analytical framework to examine the role of political bot activity in online political discussions. Unlike previous studies, we delve into the underlying complexity and uncertainty these digital agents introduce into the online political discussion. By quantifying the information dynamics of online political interactions, we measure the complexity and uncertainty involved in the process and seek to answer the following questions:
RQ1: How is the presence of bots associated with the level of complexity in online political communication dynamics?
RQ2: How is the presence of bots associated with the level of remaining uncertainty in online political communication dynamics?

3. Data and Methods

In this study, we perform a multiple regression analysis with two core information-theoretic measures: predictive complexity (C), also known as statistical complexity [96,97], and remaining uncertainty (h), in the form of the sequences’ entropy rate [29,30]. They offer complementary insights into the temporal structure of online discourse. In simple terms, predictive complexity captures the amount of historical structure required to predict future behavior, while remaining uncertainty quantifies how much unpredictability remains even after accounting for previous patterns. Together, they allow us to quantify how structured, and yet how random, a conversation stream becomes over time.
To apply these measures, we first transform our tweet data into discrete categorical sequences based on emotional content. These sequences are then analyzed using the Causal State Splitting Reconstruction (CSSR) algorithm [98] in Darmon’s implementation [99], which infers the underlying patterns and randomness within each emotional stream. The resulting C and h values are expressed in bits, which allows us to compare how the presence of bots affects the structural complexity and uncertainty of online political discourse across 650 time-based emotional sequences.
Aside from the two information-theoretical measures as our dependent variables (quantifying how complex and how uncertain the communication dynamics are), we also include bot-level as our explanatory independent variable in our regression model and some potential confounders as controls, like word count, character count, word complexity, and time variance.
The data were collected in real time during the first Democratic presidential debate in 2019 by leveraging Twitter’s search API with a keyword approach. The dataset provides roughly 395 K tweets that include the keyword ‘#demdebate’. To represent the resulting stream of tweets quantitatively, we semantically analyzed them using IBM Watson Developer Cloud’s natural language understanding service, obtaining a representation of each tweet in terms of the five primary emotions conveyed, namely anger, fear, sadness, joy, and disgust [100]. The IBM Watson API is a natural language processing tool that employs machine learning algorithms to extract semantic information from text, including concepts, emotions, and sentiments. Rather than relying on traditional methods, the application utilizes a Recurrent Neural Network to dynamically capture and classify sentiments. The API has been trained on a diverse set of sources, including tweets, and has been used in numerous research projects [101,102].
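For readers wishing to replicate this scoring step, a minimal sketch using the ibm-watson Python SDK might look as follows. The version string, credentials, and example tweet are placeholders, and the response field names reflect the SDK’s documented format rather than our exact pipeline:

```python
from ibm_watson import NaturalLanguageUnderstandingV1
from ibm_watson.natural_language_understanding_v1 import Features, EmotionOptions
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

# Placeholder credentials; real values come from an IBM Cloud service instance.
nlu = NaturalLanguageUnderstandingV1(
    version="2022-04-07",
    authenticator=IAMAuthenticator("YOUR_API_KEY"),
)
nlu.set_service_url("YOUR_SERVICE_URL")

response = nlu.analyze(
    text="Watching #demdebate right now and I cannot believe this exchange.",
    features=Features(emotion=EmotionOptions()),
    language="en",
).get_result()

# Document-level scores in [0, 1] for the five primary emotions.
scores = response["emotion"]["document"]["emotion"]
print(scores)  # e.g. {'anger': ..., 'disgust': ..., 'fear': ..., 'joy': ..., 'sadness': ...}
```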
Since we worked with traditional information theory, we needed to convert the raw scores into categorical variables (otherwise, we would have had to work with differential/continuous entropy). We assigned each tweet to one of four categorical bins based on its emotional score from IBM Watson, which ranged from 0 to 1. This left us with five temporal sequences of roughly 395 K tweets, one for each emotion (i.e., anger, fear, sadness, joy, and disgust). To obtain statistically robust results, we needed more than five sequences; at the same time, each sequence needed to be sufficiently long, because our nonlinear information-theoretic measures converge rather slowly. We found a statistically robust sampling solution by creating sequences of 3000 consecutive tweets for each emotion, ending up with 650 temporal sequences in total. Beyond these sampling motivations, it is methodologically useful that each sequence is equally long, as longer sequences increase the likelihood of more structural diversity and more uncertainty, while shorter ones decrease it.
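As an illustration of this chunking step, the following sketch splits a time-ordered table of per-tweet emotion scores into equal-length sequences; the DataFrame layout and column names are our assumptions:

```python
import pandas as pd

EMOTIONS = ["anger", "fear", "sadness", "joy", "disgust"]
SEQ_LEN = 3000  # consecutive tweets per sequence

def build_sequences(df: pd.DataFrame) -> dict[str, list[pd.Series]]:
    """Split the time-ordered tweet stream into equal-length score sequences per emotion."""
    n_chunks = len(df) // SEQ_LEN  # drop any final partial chunk so all sequences are equal-length
    return {
        emotion: [df[emotion].iloc[i * SEQ_LEN:(i + 1) * SEQ_LEN] for i in range(n_chunks)]
        for emotion in EMOTIONS
    }
```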

3.1. Binning of Variables

To ensure statistical robustness and gain a nuanced understanding of the data distribution, we employed three different binning strategies in categorizing the emotional scores of each tweet in each sequence. This was deemed necessary to account for variations in the distribution of emotion scores, as how we bin the data into categorical variables might affect our analysis. The first binning strategy involves categorizing each raw value in the sequence by dividing it into quartiles, which creates four categories of approximately equal sizes, where the first category includes values that are less than the 25th percentile and the last category includes values greater than the 75th percentile. The second binning strategy involves sorting and ranking the values in each sequence and creating uniform, equal-sized groups for each of the four categories. The groups are then de-indexed, meaning they return to their original position in the sequence. The third binning strategy follows an exponential approach, in which the width of each one of the four categories is determined by an exponential function. Each subsequent category is wider than the previous one, creating an increasing distribution of the data across categories.
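A compact sketch of the three strategies is given below; the exponential bin widths (doubling per category) are our illustrative assumption, since the exact growth function is not essential to the description above:

```python
import numpy as np
import pandas as pd

def bin_quartile(scores: pd.Series):
    """Strategy 1: quartile binning; four approximately equal-sized categories by percentile."""
    return pd.qcut(scores, q=4, labels=False, duplicates="drop")

def bin_ranked_uniform(scores: pd.Series):
    """Strategy 2: rank the values, split the ranks into four equal-sized groups,
    then 'de-index' so each label sits at its original position in the sequence."""
    ranks = scores.rank(method="first") - 1            # 0-based ranks, ties broken by order
    return np.floor(ranks * 4 / len(scores)).astype(int)

def bin_exponential(scores: pd.Series):
    """Strategy 3: each category is wider than the previous one (here: twice as wide)."""
    widths = 2.0 ** np.arange(4)                       # 1, 2, 4, 8
    edges = np.cumsum(widths) / widths.sum()           # category boundaries on the 0-1 scale
    return np.digitize(scores, edges[:-1])             # labels 0..3
```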
We began our analysis by assessing the robustness of the three binning strategies used to transform continuous emotion scores into discrete categories to be used as input into the CSSR algorithm. Since our goal was to model the complexity and uncertainty of communication dynamics, it is important that the binning process preserves the essential distributional characteristics of the data without introducing artifacts. We therefore evaluated all three binning strategies (quartile-based, ranked uniform, and exponential) using standard statistical diagnostics, including assessments of normality, heteroscedasticity, and multicollinearity in the resulting regression models. Among the three, the quartile-based binning strategy consistently outperformed the others, producing more stable model assumptions and better-aligned distributions across our sequences. As such, we adopted the quartile-based strategy for all subsequent analyses. The results reported below are based on this approach.

3.2. Dependent Variables

We calculated C and h for each of the 650 temporal sequences. To derive them, we used an empirically validated algorithm, the Causal State Splitting Reconstruction (CSSR) algorithm [98], which reliably infers the statistical structure of a given dynamical process. We used Darmon’s validated implementation and adopted its default settings [97,99]. It produces three outputs: predictive information E, predictive complexity C, and remaining uncertainty h, though in this study, we are mostly concerned with complexity and uncertainty. All variables are measured in bits, which gives the optimal or average minimum number of yes-or-no questions that an observer would have to ask to completely reconstruct a system. The Venn diagram below, based on Crutchfield et al. [32], portrays the relationship between these three complementary measures along temporal sequences of past and future (Figure 1a).
Within this formalism, predictive information is the common information between past and future states, that is, the information preserved by the dynamical system over time. The predictive complexity (C) includes the information from the past that is necessary to predict the predictable component of the future (the information necessary for the ‘machinery’ to make predictions, hence the ‘complexity’), and the remaining uncertainty (h) is the amount of information about the future that cannot be derived from past behavior (possibly random, possibly predictable on the basis of information external to the model’s reach). To derive the statistical measures of the character sequences in our data, we used sliding windows of three characters, as depicted in Figure 1b, and measured the occurrence of unique character subsequences. These measures were calculated with the Python 3 implementation of CSSR by Darmon [99].
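The sliding-window step can be illustrated as follows. The plug-in entropy-rate estimate shown here is only a simplified stand-in for intuition; the reported values of C and h come from the CSSR implementation itself:

```python
from collections import Counter
from math import log2

def window_counts(symbols: str, width: int = 3) -> Counter:
    """Slide a window of `width` characters along the sequence and count each subsequence."""
    return Counter(symbols[i:i + width] for i in range(len(symbols) - width + 1))

def block_entropy(symbols: str, width: int) -> float:
    """Shannon entropy (bits) of the empirical distribution over length-`width` windows."""
    counts = window_counts(symbols, width)
    n = sum(counts.values())
    return -sum((c / n) * log2(c / n) for c in counts.values())

def entropy_rate_plugin(symbols: str, width: int = 3) -> float:
    """Naive estimate of h as H(width) - H(width - 1); CSSR refines this via causal states."""
    return block_entropy(symbols, width) - block_entropy(symbols, width - 1)
```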

3.2.1. Predictive Complexity (C)

Predictive complexity, expressed with C, is also known as statistical complexity [97] and approximates Kolmogorov complexity [96]. It quantifies the minimum amount of information from the past that is needed to predict the structured component of the future (the “predictable information”, E in Figure 1a). Essentially, it is measured as the number of bits needed to optimally predict the future of the process from its past [32]. For instance, C would be zero for an entirely random process. The higher the required number of bits to predict the future, the more complex the process. Complexity is therefore defined as the information required to describe a process “between order and chaos” [96]. It corresponds to the creation or preservation of information or structure in the signal. The mathematical beauty of the measure is that predictive complexity has been shown to be a sufficient statistic: the minimal-size representation of the structure that retains maximal predictability of what can be predicted about the process [97,104]. Hence, C is a practical approximation of the Kolmogorov complexity of a dynamic, in the sense that it measures the size of the smallest model with maximal predictive power for the time series [96]. Technically, it is measured as the entropy of the derived epsilon machine, i.e., the Shannon entropy over the causal states of the hidden Markov model that represents the dynamic: C = H[causal states].

3.2.2. Remaining Uncertainty (h)

The remaining uncertainty, expressed with the traditional measure of the entropy rate h, is the oldest and most fundamental information-theoretic measure of dynamical systems [29,31]. It quantifies the amount of uncertainty per symbol about the future that remains in a process, given all the information the previous state of the process can tell us about the future. It is essentially the rate of conditional entropy H calculated per symbol, h = H[X′|X], i.e., the uncertainty involved in predicting the next symbol X′ given the previous symbol X [28]. The resulting entropy rate measures the uncertainty of the next turn conditioned on the previous turns [105]. In short, the entropy rate “gives the source’s intrinsic randomness, discounting correlations that occur over any length scale” [96]. Naturally, the higher the entropy rate, or remaining uncertainty, the higher the probability of a prediction error, which makes the future of the process less predictable [101].
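To make both definitions concrete, the following sketch computes C and h for a hand-specified toy epsilon machine (a unifilar hidden Markov model of our own invention, not one inferred from the data): C is the entropy of the stationary distribution over causal states, and h is the state-weighted average of the per-state symbol uncertainty:

```python
import numpy as np

# Toy unifilar machine: each causal state emits a symbol with some probability,
# and the pair (state, symbol) determines the next state.
emit = {0: {"A": 0.9, "B": 0.1},   # state 0 emits 'A' most of the time
        1: {"A": 0.5, "B": 0.5}}   # state 1 is maximally uncertain per symbol
succ = {(0, "A"): 0, (0, "B"): 1,
        (1, "A"): 0, (1, "B"): 1}

# State-to-state transition matrix induced by the emission probabilities.
T = np.zeros((2, 2))
for s, dist in emit.items():
    for sym, p in dist.items():
        T[s, succ[(s, sym)]] += p

# Stationary distribution pi solves pi @ T = pi (left eigenvector for eigenvalue 1).
evals, evecs = np.linalg.eig(T.T)
pi = np.real(evecs[:, np.argmax(np.real(evals))])
pi /= pi.sum()

H = lambda ps: -sum(p * np.log2(p) for p in ps if p > 0)
C = H(pi)                                              # entropy over causal states
h = sum(pi[s] * H(emit[s].values()) for s in emit)     # bits of uncertainty per symbol
print(f"C = {C:.3f} bits, h = {h:.3f} bits/symbol")    # C ≈ 0.650, h ≈ 0.558
```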

3.3. Independent Variables

The main independent variable in our study is botness, or “bot level”, the extent to which an account is estimated to be a bot. To assess the automation level involved with each account, we used a publicly available bot detection service frequently used in previous work, the Botometer API [67,106]. Using machine learning algorithms, Botometer extracts over 1000 predictive features that identify numerous suspicious behaviors by characterizing an account’s profile, social network, friends, temporal activity patterns, language, and sentiment [70,106]. It then provides a classification score on a normalized scale, indicating the probability that a Twitter account is a bot. Scores closer to 1 indicate a higher probability of the account being a bot, while those closer to 0 suggest that the account is controlled by a human. We averaged the resulting scores across the tweets of each sequence. The variable bot level thus measures the average botness within each of the 650 temporal sequences of 3000 consecutive tweets (M = 0.045, SD = 0.008, min = 0.024, max = 0.081).
The continuity of this measure captures not only uncertainty about the conclusion that an account is a bot account, but also the fact that accounts vary in their botness, as in the case of accounts that tweet a mix of human and automated content. This continuous scale allows us to account for the presence of hybrid or semi-automated accounts that exhibit both human and bot-like behaviors, a common occurrence in political discourse. By using an averaged score across tweet sequences, we incorporate both subtle and overt forms of automation into our analysis, rather than relying on a binary classification threshold.
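A minimal sketch of querying Botometer with the botometer-python package is shown below; the credentials are placeholders, and the response field names follow the package’s documentation at the time of writing, so treat them as assumptions rather than our exact pipeline:

```python
import botometer

# Placeholder credentials for the Botometer Pro API (RapidAPI) and the Twitter API.
twitter_app_auth = {
    "consumer_key": "YOUR_CONSUMER_KEY",
    "consumer_secret": "YOUR_CONSUMER_SECRET",
    "access_token": "YOUR_ACCESS_TOKEN",
    "access_token_secret": "YOUR_ACCESS_TOKEN_SECRET",
}
bom = botometer.Botometer(wait_on_ratelimit=True,
                          rapidapi_key="YOUR_RAPIDAPI_KEY",
                          **twitter_app_auth)

result = bom.check_account("@example_account")
score = result["raw_scores"]["universal"]["overall"]  # 0-1 scale; field names assumed

# Per-sequence bot level: average the account scores over the 3000 tweets in a sequence,
# e.g. bot_level = sum(score_for(tweet.author) for tweet in sequence) / len(sequence)
```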
We also control for several potential confounding variables to account for the characteristics of tweets. One such confounding factor that could affect the complexity and uncertainty of a communication dynamic is the number of words used: longer tweets with more words can contain multiple clauses with complex sentence structures, potentially increasing the amount of information contained. The variable “word count” is measured by calculating the number of words used in each tweet and averaging the total count per temporal sequence (M = 21.219, SD = 0.456, min = 19.879, max = 24.18).
Another factor we consider is the expression complexity per word. Longer and more complicated words can indicate a more complex sentence structure, while shorter or simpler words can drastically reduce the amount of information conveyed. The variable “word complexity” is created by dividing the number of characters in each tweet by the number of words per tweet, averaged per sequence (M = 6.497, SD = 0.127, min = 6.318, max = 6.778).
Finally, we control for time variation between tweets, for a high time variance could make it more difficult to predict how future states will be related to the past, which can affect the overall uncertainty and predictability. We therefore measure the variable “time-variance” to control for the variation of time between the first and last tweets across each temporal sequence (M = 14.4, SD = 117.5, min = 0.477, max = 1338.7).
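For concreteness, a sketch of computing these per-sequence controls follows; the exact operationalization of time variance (here, the variance of inter-tweet intervals) is our reading of the description above, not a verbatim reproduction of our pipeline:

```python
import numpy as np

def sequence_controls(tweets: list[str], timestamps: np.ndarray) -> dict[str, float]:
    """Per-sequence control variables; operationalizations are our interpretation."""
    word_counts = np.array([len(t.split()) for t in tweets], dtype=float)
    char_counts = np.array([len(t) for t in tweets], dtype=float)
    gaps = np.diff(np.sort(timestamps))  # intervals between consecutive tweets
    return {
        "word_count": word_counts.mean(),                       # mean words per tweet
        "word_complexity": (char_counts / word_counts).mean(),  # mean characters per word
        "time_variance": float(gaps.var()),                     # spread of posting intervals
    }
```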

4. Results

We first assessed the relationship between bot presence and predictive complexity, reflecting how much information from the past is required to foresee the future state. Overall, the regression model is significant (F(5, 641) = 38.73, p < 0.001, R² = 0.232, Figure 2a). Our results suggest a significantly positive relationship between the presence of bots and predictive complexity (β = 0.151, p < 0.001, Figure 2a). Notably, higher levels of bot presence correspond with higher levels of complexity. Put simply, online political communication sequences with higher bot levels require more past information to predict future behavior. The presence of bots creates more complex patterns.
Additionally, we found that both the complexity of the language used in tweets, or word complexity (β = 0.517, p < 0.001), and the time variance (β = 0.003, p < 0.001) also positively and significantly contribute to the predictive complexity of the ongoing discussion, though the contribution of time variance is comparatively much smaller. Moreover, the results show a positive relationship between word count and predictive complexity, but this relationship lacked significance (β = 0.055, p = 0.23).
Turning to the remaining uncertainty involved in the online political discourse, our second model is also significant (F(5, 641) = 35.08, p < 0.001, R² = 0.214, Figure 2b). Notably, higher bot levels corresponded with increased levels of uncertainty (β = 0.113, p < 0.001). Controlling for all the other predictors we included in the model, higher bot levels significantly correlate with higher remaining uncertainty: the higher the bot levels in sequences of tweets, the larger the component of future behavior that cannot be predicted from past states. As for the control variables, we found a positive relationship between uncertainty and word complexity (β = 0.404, p < 0.001) and a small effect from time variance (β = 0.001, p < 0.001). However, the positive relationship between word count and uncertainty lacked significance (β = 0.04, p = 0.03).
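A sketch of the corresponding models using statsmodels is shown below. The table seq_df and its column names are our assumptions for one row per temporal sequence, and the values are synthetic placeholders loosely based on the reported descriptives, for illustration only; the five predictors match the reported F(5, 641):

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
# Synthetic stand-in for the real per-sequence table (650 rows).
seq_df = pd.DataFrame({
    "C": rng.normal(2.0, 0.5, 650),
    "h": rng.normal(1.2, 0.3, 650),
    "bot_level": rng.normal(0.045, 0.008, 650),
    "word_count": rng.normal(21.2, 0.46, 650),
    "char_count": rng.normal(138.0, 5.0, 650),
    "word_complexity": rng.normal(6.5, 0.13, 650),
    "time_variance": rng.exponential(14.4, 650),
})

predictors = "bot_level + word_count + char_count + word_complexity + time_variance"
model_C = smf.ols(f"C ~ {predictors}", data=seq_df).fit()  # complexity model (Figure 2a)
model_h = smf.ols(f"h ~ {predictors}", data=seq_df).fit()  # uncertainty model (Figure 2b)
print(model_C.summary())
print(model_h.summary())
```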

5. Discussion

This study sheds light on the structural changes in online political discussion dynamics attributable to bot activity by analyzing the relationship between bot presence, predictive complexity, and remaining uncertainty. Our results illuminate the role of political bots in macro-level dynamics of online political discourse, demonstrating that increased bot activity is associated with higher levels of structural complexity and uncertainty.
By applying Claude Shannon’s information theory [28] within a dynamical systems framework, we offer a novel lens through which to view online political communication. Contrary to their intended function of simplifying communication through automation and repetition [34,35,88], politically motivated bots’ presence is associated with increased complexity and uncertainty within online discourse. Our results suggest that higher bot activity is associated with a larger component of future behavior that cannot be predicted from the past states, indicating more uncertainty in the process.
It may seem counterintuitive that predictive complexity (C) and remaining uncertainty (h) both increase with bot presence: since C measures the structure of a process that allows for predictability, why would it rise alongside uncertainty? The solution lies in recognizing that uncertainty is not a finite quantity (like ‘a given box’) that can be saturated by predictability (like ‘filling up that given box’); rather, there is an unbounded amount of both possible uncertainty and predictable structure in reality (like a ‘box that grows with its own structure, as does the unfilled room inside it’). This co-occurrence is characteristic of many complex adaptive systems. In purely random systems, h is high but C is low, while in fully deterministic systems, both values can be low. However, semi-structured systems, such as political discourse shaped by both human behavior and automated interference, often exhibit both high structure and high unpredictability. In our case, bots may introduce organized temporal or emotional patterns (e.g., coordinated amplification) that increase C, while simultaneously destabilizing semantic flow or introducing unanticipated variation, which increases h.
Technically, C measures the entropy of the causal states of the minimal-size, maximally predictive hidden Markov model of the dynamic (its so-called epsilon machine) [96,97]. It thus quantifies both the number of causal states and the shape of their distribution: C increases when there are more states and when their distribution is more uniform. The entropy rate, h, is captured by the transition probabilities between the different causal states. It quantifies the uncertainty of transitioning between the states. If there are more states, there are often also more possibilities to transition between them. Hence, often (but not necessarily), both C and h increase in complex systems. Our chosen framework illustrates that C and h reflect distinct and complementary aspects of information flow in complex communicative environments.
In practical terms, the usual cues and patterns that individuals rely on to interpret messages and intentions online could become obscured. The presence of bots introduces noise and unpredictability, which could affect the ability of users to engage in reasoned debate or to follow the progression of events online. This may not only affect the immediate understanding of specific conversations or the perceptions of certain critical political events but could also erode trust in the digital communication environment as a whole, as users increasingly become uncertain about which interactions are genuine and which are manipulated to control public opinion [13,84,86].
Higher bot activity also requires more past information to predict the same amount of future behavior, indicating a greater degree of complexity in the process. This complexity is not necessarily a challenge in terms of the volume or density of information, but rather it shows a substantive change in the nature of the discourse itself. This additional layer of complexity could make it more difficult for both individuals and algorithms (such as those powering recommendation systems) to filter and understand the essential elements of the ongoing conversations. Earlier work shows that bots can substantially shape discussions by influencing search engines and recommender systems [81], and their impact on communication dynamics may exacerbate these effects. Moreover, bots’ potential ability to skew the semantic direction of the discourse may distort the public’s perception of consensus or controversy on critical issues, further polarizing audiences. The outcome is a further distorted representation of public opinion, where artificially amplified themes and emotions gain undue prominence, disrupting discussions and potentially misleading observers about the true nature of public sentiment around controversial topics.
Our analysis also reveals that the complexity of the language used in tweets (word complexity) and the intervals between posts (time variance) significantly influence both the complexity and uncertainty of online political discussions, though the influence of time variance is rather weak. Conversely, the volume of words used (simple word count) does not show significance. These results align with and support previous research emphasizing the value of quality engagement, over frequency or quantity of engagement, in sustaining the vitality and success of online communities and digital platforms [107,108,109,110].
The present study makes several important contributions. Our study’s focus on the macro-level dynamics of an online political discussion provides an important expansion of the current work on social bots. Previous research has largely focused on the prevalence, reach, detection, and influence of social bots, such as their role in spreading misinformation [8,15,17], interfering in political campaigns [7,8,10], and amplifying polarizing viewpoints [58,64]. While these studies provide crucial insights, they often overlook the broader structural implications of these automated entities on the digital ecosystem. By analyzing how bots contribute to the structural complexity and uncertainty of online political discourse, our work offers a holistic view of their influence, underscoring how bots fundamentally alter the dynamics of online political communication.
Our study also contributes to the growing literature on information theory in understanding communication processes. By extending the dynamical systems framework to political bots and using information-theoretic measures to quantify the complexity and uncertainty of online political communication, we expand on the previous work and provide a novel approach to studying the impact of political bots. This approach could be further extended to other domains, such as crisis communication and health communication, and could provide insights into the macro-level dynamics of online communication that previously went unexplored.
Furthermore, this study applies the Causal State Splitting Reconstruction (CSSR) algorithm [111] to analyze online political communication. This algorithm has previously been used in other fields to analyze dynamical systems, and our study demonstrates its potential for use in the analysis of online communication on social media platforms.
While our results demonstrate strong associations between bot presence and both increased complexity and uncertainty in political discourse, the causal direction of this relationship remains open to interpretation. Bots may actively shape communicative dynamics by introducing structural disruptions, but it is also possible that already fragmented or emotionally charged conversations attract greater bot activity. Moreover, external events, such as debate timing or real-world controversies, could simultaneously drive both complexity in user discourse and increased bot deployment. Although our sampling strategy and theoretical rationale support the interpretation that bots influence discourse dynamics, future research using longitudinal, experimental, or time-resolved network methods is needed to disentangle these causal pathways with greater precision.

6. Conclusions

This study provides evidence for the macro-level impact of political bots using a dynamical systems framework. By applying information-theoretic measures to sequences of emotionally coded tweets, we show that political bots are associated with higher levels of both structural complexity and unpredictability in online political discourse. These results highlight the importance of studying communication processes as complex and dynamic systems and the need for further research into the macro-level influence of bots.
It is important to note that we cannot establish a causal relationship among the variables we considered in this work, and there is a chance that a different variable, beyond those we control for in this study, plays a causal role. Furthermore, our study only provides a snapshot of an online political discussion, and the Twitter API we used in data collection only provides 1% of the total traffic on Twitter. This limited sample may not fully capture the dynamics of online political communication, and findings may not generalize across different platforms, events, or contexts. Each platform’s unique dynamics and algorithms can influence bot and user behavior differently. Further research is needed to understand the broader effects of bot activities on various social and political discussions.
In addition, while we rely on the well-established Botometer tool to estimate bot presence, we acknowledge that the reliability of bot detection can vary over time. As social media platforms evolve and new forms of automation emerge, the features used by detection algorithms may become outdated or less effective. Consequently, our botness scores should be interpreted as probabilistic indicators of automation, not definitive classifications. The use of continuous bot scores allows us to capture both overt and subtle forms of automated behavior across sequences, making our findings robust to minor classification fluctuations.
Despite these limitations, the results point to an urgent need to better understand, and design for, the structural consequences of bots on online political conversations. As bots become increasingly prevalent and sophisticated, their influence is likely to extend far beyond individual interactions, shaping the very fabric of collective discourse. This is especially true in an age of generative AI agents, which exhibit far greater autonomy now than ever before. By adopting a complex systems perspective on bots and online political communication, our presented framework showcases how this can be done. Dynamical systems approaches can provide a nuanced and comprehensive understanding of the interplay between technology and society in the digital age.

Author Contributions

Conceptualization, B.B. and M.H.; methodology, B.B.; software, B.B.; validation, B.B. and M.H.; formal analysis, B.B.; investigation, B.B.; resources, M.H.; data curation, B.B.; writing—original draft preparation, B.B.; writing—review and editing, B.B. and M.H.; visualization, B.B.; supervision, M.H.; project administration, B.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Holz, T. A Short Visit to the Bot Zoo [malicious Bots Software]. IEEE Secur. Priv. 2005, 3, 76–79. [Google Scholar] [CrossRef]
  2. Lebeuf, C.; Storey, M.-A.; Zagalsky, A. Software Bots. IEEE Softw. 2018, 35, 18–23. [Google Scholar] [CrossRef]
  3. Hilbert, M.; Darmon, D. How Complexity and Uncertainty Grew with Algorithmic Trading. Entropy 2020, 22, 499. [Google Scholar] [CrossRef]
  4. Woolley, S.C.; Howard, P.N. Political Communication, Computational Propaganda, and Autonomous Agents: Introduction. Int. J. Commun. Syst. 2016, 10. [Google Scholar]
  5. Fogg, B.J. Persuasive Technology: Using Computers to Change What We Think and Do. Ubiquity 2002, 2002, 2. [Google Scholar] [CrossRef]
  6. Yan, C.; Dillard, J.P.; Shen, F. Emotion, Motivation, and the Persuasive Effects of Message Framing. J. Commun. 2012, 62, 682–700. [Google Scholar] [CrossRef]
  7. Bastos, M.T.; Mercea, D. The Brexit Botnet and User-Generated Hyperpartisan News. Soc. Sci. Comput. Rev. 2019, 37, 38–54. [Google Scholar] [CrossRef]
  8. Bessi, A.; Ferrara, E. Social Bots Distort the 2016 US Presidential Election Online Discussion. First Monday 2016, 21, 11. [Google Scholar] [CrossRef]
  9. Ferrara, E. Disinformation and Social Bot Operations in the Run Up to the 2017 French Presidential Election. arXiv 2017, arXiv:1707.00086. [Google Scholar]
  10. Howard, P.N.; Woolley, S.; Calo, R. Algorithms, bots, and political communication in the US 2016 election: The challenge of automated political communication for election law and administration. J. Inf. Technol. Impact 2018, 15, 81–93. [Google Scholar] [CrossRef]
  11. Cai, M.; Luo, H.; Meng, X.; Cui, Y. Differences in Behavioral Characteristics and Diffusion Mechanisms: A Comparative Analysis Based on Social Bots and Human Users. Front. Phys. 2022, 10, 875574. [Google Scholar] [CrossRef]
  12. Ferrara, E.; Varol, O.; Davis, C.; Menczer, F.; Flammini, A. The rise of social bots. Commun. ACM 2016, 59, 96–104. [Google Scholar] [CrossRef]
  13. Wischnewski, M.; Ngo, T.; Bernemann, R.; Jansen, M.; Krämer, N. ‘I agree with you, bot!’ How users (dis)engage with social bots on Twitter. New Media Soc. 2022, 26, 1505–1526. [Google Scholar] [CrossRef]
  14. Broniatowski, D.A.; Jamison, A.M.; Qi, S.; AlKulaib, L.; Chen, T.; Benton, A.; Quinn, S.; Dredze, M. Weaponized Health Communication: Twitter Bots and Russian Trolls Amplify the Vaccine Debate. Am. J. Public Health 2018, 108, 1378–1384. [Google Scholar] [CrossRef] [PubMed]
  15. Bradshaw, S.; Howard, P.N.; Kollanyi, B.; Neudert, L.-M. Sourcing and Automation of Political News and Information over Social Media in the United States, 2016–2018. Political Commun. 2020, 37, 173–193. [Google Scholar] [CrossRef]
  16. Ferrara, E. What Types of COVID-19 Conspiracies are Populated by Twitter Bots? arXiv 2020, arXiv:2004.09531. [Google Scholar] [CrossRef]
  17. Vosoughi, S.; Roy, D.; Aral, S. The spread of true and false news online. Science 2018, 359, 1146–1151. [Google Scholar] [CrossRef] [PubMed]
  18. Woolley, S. The Political Economy of Bots: Theory and Method in the Study of Social Automation. In The Political Economy of Robots: Prospects for Prosperity and Peace in the Automated 21st Century; Kiggins, R., Ed.; Springer International Publishing: Cham, Switzerland, 2018; pp. 127–155. [Google Scholar]
  19. Duan, Z.; Li, J.; Lukito, J.; Yang, K.-C.; Chen, F.; Shah, D.V.; Yang, S. Algorithmic Agents in the Hybrid Media System: Social Bots, Selective Amplification, and Partisan News about COVID-19. Hum. Commun. Res. 2022, 48, 516–542. [Google Scholar] [CrossRef]
  20. González-Bailón, S.; De Domenico, M. Bots are less central than verified accounts during contentious political events. Proc. Natl. Acad. Sci. USA 2021, 118, e2013443118. [Google Scholar] [CrossRef]
  21. Fui-Hoon Nah, F.; Zheng, R.; Cai, J.; Siau, K.; Chen, L. Generative AI and ChatGPT: Applications, challenges, and AI-human collaboration. J. Inf. Technol. Case Appl. Res. 2023, 25, 277–304. [Google Scholar] [CrossRef]
  22. Shin, D.; Koerber, A.; Lim, J.S. Impact of misinformation from generative AI on user information processing: How people understand misinformation from generative AI. New Media Soc. 2024. [Google Scholar] [CrossRef]
  23. Cappella, J.N.; Planalp, S. Talk and Silence Sequences in Informal Conversations III: Interspeaker Influence. Hum. Commun. Res. 1981, 7, 117–132. [Google Scholar] [CrossRef]
  24. Ellis, D.G.; Fisher, B.A. Phases of Conflict in Small Group Development: A Markov Analysis. Hum. Commun. Res. 1975, 1, 195–212. [Google Scholar] [CrossRef]
  25. Fisher, B.A. Decision emergence: Phases in group decision-making. Speech Monogr. 1970, 37, 53–66. [Google Scholar] [CrossRef]
  26. Poole, M.S.; Roth, J. Decision Development in Small Groups IV: A Typology of Group Decision Paths. Hum. Commun. Res. 1989, 15, 323–356. [Google Scholar] [CrossRef]
  27. Kolmogorov, A.N. A new metric invariant of transient dynamical systems and automorphisms in Lebesgue spaces. Dokl. Akad. Nauk. SSSR 1958, 119, 861–864. [Google Scholar]
  28. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  29. Kolmogorov, A.N. Entropy per unit time as a metric invariant of automorphisms. Dokl Akad Nauk SSSR 1959, 124, 754–755. [Google Scholar]
  30. Kolmogorov, A.N. Three approaches to the quantitative definition of information. Int. J. Comput. Math. 1968, 2, 157–168. [Google Scholar] [CrossRef]
  31. Sinai, Y.G. On the concept of entropy of a dynamical system. Dokl Akad Nauk SSSR 1959, 124, 768–771. [Google Scholar]
  32. Crutchfield, J.P.; Ellison, C.J.; Mahoney, J.R. Time’s Barbed Arrow: Irreversibility, Crypticity, and Stored Information. Phys. Rev. Lett. 2009, 103, 094101. [Google Scholar] [CrossRef] [PubMed]
  33. Hilbert, M.; Darmon, D. Large-scale communication is more complex and unpredictable with automated bots. J. Commun. 2020, 70, 670–692. [Google Scholar] [CrossRef]
  34. Franklin, S.; Graesser, A. Is It an agent, or just a program? A taxonomy for autonomous agents. In Proceedings of the Intelligent Agents III Agent Theories, Architectures, and Languages, Budapest, Hungary, 12–13 August 1996; pp. 21–35. [Google Scholar]
  35. Gorwa, R.; Guilbeault, D. Unpacking the Social Media Bot: A Typology to Guide Research and Policy. Policy Internet 2020, 12, 225–248. [Google Scholar] [CrossRef]
  36. Tsvetkova, M.; García-Gavilanes, R.; Floridi, L.; Yasseri, T. Even good bots fight: The case of Wikipedia. PLoS ONE 2017, 12, e0171774. [Google Scholar] [CrossRef]
  37. Weizenbaum, J. ELIZA—A computer program for the study of natural language communication between man and machine. Commun. ACM 1966, 9, 36–45. [Google Scholar] [CrossRef]
  38. Hepp, A. Artificial companions, social bots and work bots: Communicative robots as research objects of media and communication studies. Media Cult. Soc. 2020, 42, 1410–1426. [Google Scholar] [CrossRef]
  39. Makhortykh, M.; Urman, A.; Münch, F.V.; Heldt, A.; Dreyer, S.; Kettemann, M.C. Not all who are bots are evil: A cross-platform analysis of automated agent governance. New Media Soc. 2022, 24, 964–981. [Google Scholar] [CrossRef]
  40. Schanke, S.; Burtch, G.; Ray, G. Estimating the Impact of ‘Humanizing’ Customer Service Chatbots. Inf. Syst. Res. 2021, 32, 736–751. [Google Scholar] [CrossRef]
  41. Boshmaf, Y.; Muslukhov, I.; Beznosov, K.; Ripeanu, M. The socialbot network: When bots socialize for fame and money. In Proceedings of the 27th Annual Computer Security Applications Conference, Orlando, FL, USA, 5–9 December 2011; Association for Computing Machinery: New York, NY, USA, 2011; pp. 93–102. [Google Scholar]
  42. Lokot, T.; Diakopoulos, N. News Bots. Digit. J. 2016, 4, 682–699. [Google Scholar] [CrossRef]
  43. Zhang, J.; Zhang, R.; Zhang, Y.; Yan, G. On the impact of social botnets for spam distribution and digital-influence manipulation. In Proceedings of the 2013 IEEE Conference on Communications and Network Security (CNS), Washington, DC, USA, 14–16 October 2013; pp. 46–54. [Google Scholar]
  44. Goga, O.; Venkatadri, G.; Gummadi, K.P. The Doppelgänger Bot Attack: Exploring Identity Impersonation in Online Social Networks. In Proceedings of the 2015 Internet Measurement Conference, Tokyo Japan, 28–30 October 2015; Association for Computing Machinery: New York, NY, USA, 2015; pp. 141–153. [Google Scholar]
  45. Ratkiewicz, J.; Conover, M.; Meiss, M.; Goncalves, B.; Flammini, A.; Menczer, F. Detecting and Tracking Political Abuse in Social Media. ICWSM 2011, 5, 297–304. [Google Scholar] [CrossRef]
  46. Jones, M.L. Silencing Bad Bots: Global, Legal and Political Questions for Mean Machine Communication. Commun. Law Policy 2018, 23, 159–195. [Google Scholar] [CrossRef]
  47. Stricklin, K.; McBride, M. Social Media Bots: Laws, Regulations, and Platform Policies; CAN—Strategy, Policy, Plans, and Programs Division (SP3); CNA: Singapore, 2020. [Google Scholar]
  48. Shao, C.; Ciampaglia, G.L.; Varol, O.; Yang, K.-C.; Flammini, A.; Menczer, F. The spread of low-credibility content by social bots. Nat. Commun. 2018, 9, 4787. [Google Scholar] [CrossRef] [PubMed]
  49. Stieglitz, S.; Brachten, F.; Berthelé, D.; Schlaus, M.; Venetopoulou, C.; Veutgen, D. Do Social Bots (Still) Act Different to Humans?—Comparing Metrics of Social Bots with Those of Humans. In Social Computing and Social Media. Human Behavior; Springer International Publishing: Cham, Switzerland, 2017; pp. 379–395. [Google Scholar]
  50. Woolley, S. Bots and computational propaganda: Automation for communication and control. In Social Media and Democracy; Cambridge University Press: Cambridge, UK, 2020. [Google Scholar] [CrossRef]
  51. Stella, M.; Ferrara, E.; De Domenico, M. Bots increase exposure to negative and inflammatory content in online social systems. Proc. Natl. Acad. Sci. USA 2018, 115, 12435–12440. [Google Scholar] [CrossRef] [PubMed]
52. Boichak, O.; Hemsley, J.; Jackson, S.; Tromble, R.; Tanupabrungsun, S. Not the Bots You Are Looking For: Patterns and Effects of Orchestrated Interventions in the U.S. and German Elections. Int. J. Commun. 2021, 15, 26. [Google Scholar]
  53. Bruno, M.; Lambiotte, R.; Saracco, F. Brexit and bots: Characterizing the behaviour of automated accounts on Twitter during the UK election. EPJ Data Sci. 2022, 11, 17. [Google Scholar] [CrossRef]
  54. Castillo, S.; Allende-Cid, H.; Palma, W.; Alfaro, R.; Ramos, H.S.; Gonzalez, C.; Elortegui, C.; Santander, P. Detection of Bots and Cyborgs in Twitter: A Study on the Chilean Presidential Election in 2017. In Social Computing and Social Media. Design, Human Behavior and Analytics; Springer International Publishing: Cham, Switzerland, 2019; pp. 311–323. [Google Scholar]
  55. Fernquist, J.; Kaati, L.; Schroeder, R. Political Bots and the Swedish General Election. In Proceedings of the 2018 IEEE International Conference on Intelligence and Security Informatics (ISI), Miami, FL, USA, 9–11 November 2018; pp. 124–129. [Google Scholar]
56. Santini, R.M.; Salles, D.; Tucci, G. Comparative Approaches to Mis/Disinformation|When Machine Behavior Targets Future Voters: The Use of Social Bots to Test Narratives for Political Campaigns in Brazil. Int. J. Commun. 2021, 15, 24. [Google Scholar]
  57. Uyheng, J.; Ng, L.H.X.; Carley, K.M. Active, aggressive, but to little avail: Characterizing bot activity during the 2020 Singaporean elections. Comput. Math. Organ. Theory 2021, 27, 324–342. [Google Scholar] [CrossRef]
  58. Nonnecke, B.; Perez de Acha, G.; Choi, A.; Crittenden, C.; Gutiérrez Cortés, F.I.; Martin Del Campo, A.; Miranda-Villanueva, O.M. Harass, mislead, & polarize: An analysis of Twitter political bots’ tactics in targeting the immigration debate before the 2018 U.S. midterm election. J. Inf. Technol. Politics 2022, 19, 423–434. [Google Scholar]
  59. Daume, S.; Galaz, V.; Bjersér, P. Automated Framing of Climate Change? The Role of Social Bots in the Twitter Climate Change Discourse During the 2019/2020 Australia Bushfires. Soc. Media + Soc. 2023, 9, 205630512311683. [Google Scholar] [CrossRef]
  60. Yuan, X.; Schuchard, R.J.; Crooks, A.T. Examining Emergent Communities and Social Bots Within the Polarized Online Vaccination Debate in Twitter. Soc. Media + Soc. 2019, 5, 2056305119865465. [Google Scholar] [CrossRef]
  61. Antenore, M.; Camacho Rodriguez, J.M.; Panizzi, E. A Comparative Study of Bot Detection Techniques with an Application in Twitter Covid-19 Discourse. Soc. Sci. Comput. Rev. 2023, 41, 1520–1545. [Google Scholar] [CrossRef]
62. Chang, H.-C.H.; Ferrara, E. Comparative analysis of social bots and humans during the COVID-19 pandemic. J. Comput. Soc. Sci. 2022, 5, 1409–1425. [Google Scholar] [CrossRef]
  63. Shi, W.; Liu, D.; Yang, J.; Zhang, J.; Wen, S.; Su, J. Social Bots’ Sentiment Engagement in Health Emergencies: A Topic-Based Analysis of the COVID-19 Pandemic Discussions on Twitter. Int. J. Environ. Res. Public Health 2020, 17, 8701. [Google Scholar] [CrossRef] [PubMed]
  64. Zhao, B.; Ren, W.; Zhu, Y.; Zhang, H. Manufacturing conflict or advocating peace? A study of social bots agenda building in the Twitter discussion of the Russia-Ukraine war. J. Inf. Technol. Politics 2023, 21, 176–194. [Google Scholar] [CrossRef]
  65. Cresci, S. A decade of social bot detection. Commun. ACM 2020, 63, 72–83. [Google Scholar] [CrossRef]
  66. Yardi, S.; Romero, D.; Schoenebeck, G.; Boyd, D. Detecting spam in a Twitter network. First Monday 2009, 15, 1–4. [Google Scholar] [CrossRef]
  67. Varol, O.; Ferrara, E.; Davis, C.B.; Menczer, F.; Flammini, A. Online Human-Bot Interactions: Detection, Estimation, and Characterization. Proc. Int. AAAI Conf. Web Soc. Media 2017, 11, 280–289. [Google Scholar] [CrossRef]
  68. Yan, H.Y.; Yang, K.-C.; Menczer, F.; Shanahan, J. Asymmetrical perceptions of partisan political bots. New Media Soc. 2021, 23, 3016–3037. [Google Scholar] [CrossRef]
  69. Cao, Q.; Sirivianos, M.; Yang, X.; Pregueiro, T. Aiding the detection of fake accounts in large scale social online services. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, San Jose, CA, USA, 25–27 April 2012; USENIX Association: Berkeley, CA, USA, 2012; p. 15. [Google Scholar]
70. Davis, C.A.; Varol, O.; Ferrara, E.; Flammini, A.; Menczer, F. BotOrNot: A System to Evaluate Social Bots. In Proceedings of the 25th International Conference Companion on World Wide Web, Montréal, QC, Canada, 11–15 April 2016; International World Wide Web Conferences Steering Committee: Geneva, Switzerland, 2016; pp. 273–274. [Google Scholar]
  71. Kudugunta, S.; Ferrara, E. Deep neural networks for bot detection. Inf. Sci. 2018, 467, 312–322. [Google Scholar] [CrossRef]
  72. Subrahmanian, V.S.; Azaria, A.; Durst, S.; Kagan, V.; Galstyan, A.; Lerman, K.; Zhu, L.; Ferrara, E.; Flammini, A.; Menczer, F. The DARPA Twitter Bot Challenge. Computer 2016, 49, 38–46. [Google Scholar] [CrossRef]
  73. Yang, K.-C.; Varol, O.; Davis, C.A.; Ferrara, E.; Flammini, A.; Menczer, F. Arming the public with artificial intelligence to counter social bots. arXiv 2019, arXiv:1901.00912. [Google Scholar] [CrossRef]
  74. Beatson, O.; Gibson, R.; Cunill, M.C.; Elliot, M. Automation on Twitter: Measuring the Effectiveness of Approaches to Bot Detection. Soc. Sci. Comput. Rev. 2023, 41, 181–200. [Google Scholar] [CrossRef]
  75. Martini, F.; Samula, P.; Keller, T.R.; Klinger, U. Bot, or not? Comparing three methods for detecting social bots in five political discourses. Big Data Soc. 2021, 8, 20539517211033566. [Google Scholar] [CrossRef]
76. Luceri, L.; Deb, A.; Badawy, A.; Ferrara, E. Red Bots Do It Better: Comparative Analysis of Social Bot Partisan Behavior. In Companion Proceedings of the 2019 World Wide Web Conference, San Francisco, CA, USA, 13–19 May 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 1007–1012. [Google Scholar]
  77. Ross, B.; Pilz, L.; Cabrera, B.; Brachten, F.; Neubaum, G.; Stieglitz, S. Are social bots a real threat? An agent-based model of the spiral of silence to analyse the impact of manipulative actors in social networks. Eur. J. Inf. Syst. 2019, 28, 394–412. [Google Scholar] [CrossRef]
  78. Schuchard, R.; Crooks, A.T.; Stefanidis, A.; Croitoru, A. Bot stamina: Examining the influence and staying power of bots in online social networks. Appl. Netw. Sci. 2019, 4, 55. [Google Scholar] [CrossRef]
  79. Hagen, L.; Neely, S.; Keller, T.E.; Scharf, R.; Vasquez, F.E. Rise of the Machines? Examining the Influence of Social Bots on a Political Discussion Network. Soc. Sci. Comput. Rev. 2022, 40, 264–287. [Google Scholar] [CrossRef]
  80. Keijzer, M.A.; Mäs, M. The strength of weak bots. Online Soc. Netw. Media 2021, 21, 100106. [Google Scholar] [CrossRef]
  81. Pescetelli, N.; Barkoczi, D.; Cebrian, M. Bots influence opinion dynamics without direct human-bot interaction: The mediating role of recommender systems. Appl. Netw. Sci. 2022, 7, 46. [Google Scholar] [CrossRef]
  82. Bail, C.A.; Guay, B.; Maloney, E.; Combs, A.; Hillygus, D.S.; Merhout, F.; Freelon, D.; Volfovsky, A. Assessing the Russian Internet Research Agency’s impact on the political attitudes and behaviors of American Twitter users in late 2017. Proc. Natl. Acad. Sci. USA 2020, 117, 243–250. [Google Scholar] [CrossRef]
83. Yan, H.Y.; Yang, K.-C.; Shanahan, J.; Menczer, F. Exposure to social bots amplifies perceptual biases and regulation propensity. Sci. Rep. 2023, 13, 20707. [Google Scholar] [CrossRef]
  84. Ngo, T.; Wischnewski, M.; Bernemann, R.; Jansen, M.; Krämer, N. Spot the bot: Investigating user’s detection cues for social bots and their willingness to verify Twitter profiles. Comput. Human Behav. 2023, 146, 107819. [Google Scholar] [CrossRef]
  85. Aldayel, A.; Magdy, W. Characterizing the role of bots’ in polarized stance on social media. Soc. Netw. Anal. Min. 2022, 12, 30. [Google Scholar] [CrossRef]
  86. Kenny, R.; Fischhoff, B.; Davis, A.; Carley, K.M.; Canfield, C. Duped by Bots: Why Some are Better than Others at Detecting Fake Social Media Personas. Hum. Factors 2024, 66, 88–102. [Google Scholar] [CrossRef] [PubMed]
  87. Luceri, L.; Deb, A.; Giordano, S.; Ferrara, E. Evolution of bot and human behavior during elections. First Monday 2019, 24, 9. [Google Scholar] [CrossRef]
  88. Assenmacher, D.; Clever, L.; Frischlich, L.; Quandt, T.; Trautmann, H.; Grimme, C. Demystifying Social Bots: On the Intelligence of Automated Social Media Actors. Soc. Media + Soc. 2020, 6, 2056305120939264. [Google Scholar] [CrossRef]
  89. Crutchfield, J.P.; Feldman, D.P. Regularities unseen, randomness observed: Levels of entropy convergence. Chaos 2003, 13, 25–54. [Google Scholar] [CrossRef]
  90. Frank, S.A. Natural selection. V. How to read the fundamental equations of evolutionary change in terms of information theory. J. Evol. Biol. 2012, 25, 2377–2396. [Google Scholar] [CrossRef]
  91. Gallistel, C.R.; Matzel, L.D. The neuroscience of learning: Beyond the Hebbian synapse. Annu. Rev. Psychol. 2013, 64, 169–200. [Google Scholar] [CrossRef]
  92. Heath, R.A. Nonlinear Dynamics: Techniques and Applications in Psychology; Psychology Press: East Sussex, UK, 2014. [Google Scholar]
  93. Crutchfield, J.P. The origins of computational mechanics: A brief intellectual history and several clarifications. arXiv 2017, arXiv:1710.06832. [Google Scholar]
  94. Waldherr, A.; Geise, S.; Mahrt, M.; Katzenbach, C.; Nuernbergk, C. Toward a Stronger Theoretical Grounding of Computational Communication Science: How Macro Frameworks Shape Our Research Agendas. Comput. Commun. Res. 2021, 3, 152–179. [Google Scholar] [CrossRef]
  95. Grimme, C.; Preuss, M.; Adam, L.; Trautmann, H. Social Bots: Human-Like by Means of Human Control? Big Data 2017, 5, 279–293. [Google Scholar] [CrossRef] [PubMed]
  96. Crutchfield, J.P. Between order and chaos. Nat. Phys. 2011, 8, 17–24. [Google Scholar] [CrossRef]
  97. Crutchfield, J.P.; Young, K. Inferring statistical complexity. Phys. Rev. Lett. 1989, 63, 105–108. [Google Scholar] [CrossRef] [PubMed]
  98. Darmon, D. Statistical Methods for Analyzing Time Series Data Drawn from Complex Social Systems. Doctoral Thesis, University of Maryland, College Park, MD, USA, 2015. [Google Scholar]
  99. Shalizi, C.R.; Shalizi, K.L. Blind Construction of Optimal Nonlinear Recursive Predictors for Discrete Sequences. arXiv 2004, arXiv:cs/0406011. [Google Scholar] [CrossRef]
  100. Ekman, P.; Sorenson, E.R.; Friesen, W.V. Pan-cultural elements in facial displays of emotion. Science 1969, 164, 86–88. [Google Scholar] [CrossRef]
  101. Hilbert, M.; Ahmed, S.; Cho, J.; Liu, B.; Luu, J. Communicating with Algorithms: A Transfer Entropy Analysis of Emotions-based Escapes from Online Echo Chambers. Commun. Methods Meas. 2018, 12, 260–275. [Google Scholar] [CrossRef]
  102. Zhang, W.; Xi, Y.; Chen, A. Why do replies appear? A multi-level event history analysis of online policy discussions. New Media Soc. 2020, 22, 1484–1504. [Google Scholar] [CrossRef]
  103. Jurgens, A.; Crutchfield, J.P. Taxonomy of Prediction. arXiv 2025, arXiv:2504.11371. [Google Scholar]
  104. Shalizi, C.R.; Crutchfield, J.P. Computational Mechanics: Pattern and Prediction, Structure and Simplicity. J. Stat. Phys. 2001, 104, 817–879. [Google Scholar] [CrossRef]
105. Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley & Sons, Inc.: New York, NY, USA, 1991. [Google Scholar]
106. Yang, K.-C.; Ferrara, E.; Menczer, F. Botometer 101: Social bot practicum for computational social scientists. J. Comput. Soc. Sci. 2022, 5, 1511–1528. [Google Scholar] [CrossRef]
  107. Butler, B.S. Membership size, communication activity, and sustainability: A resource-based model of online social structures. Inf. Syst. Res. 2001, 12, 346–362. [Google Scholar] [CrossRef]
108. Cunha, T.; Jurgens, D.; Tan, C.; Romero, D. Are All Successful Communities Alike? Characterizing and Predicting the Success of Online Communities. In The World Wide Web Conference; Association for Computing Machinery: New York, NY, USA, 2019; pp. 318–328. [Google Scholar]
  109. Kollock, P.; Smith, M. Managing the Virtual Commons: Cooperation and Conflict in Computer Communities. In Computer-Mediated Communication: Linguistic, Social, and Cross-Cultural Perspectives; Herring, S.C., Ed.; John Benjamins: Philadelphia, PA, USA, 1996; pp. 109–128. [Google Scholar]
  110. Panek, E.; Hollenbach, C.; Yang, J.; Rhodes, T. The Effects of Group Size and Time on the Formation of Online Communities: Evidence From Reddit. Soc. Media + Soc. 2018, 4, 205630511881590. [Google Scholar] [CrossRef]
  111. Darmon, D. A Python Implementation of the Causal State Splitting Reconstruction Algorithm. 2020. Available online: https://github.com/3tz/transCSSR (accessed on 22 May 2025).
Figure 1. Diagram depicting the relationship between predictive information E, predictive complexity C, and remaining uncertainty. (a) The amount of information communicated from the past to the future through repeating patterns is quantified by E, the predictable information (the mutual information between past and future). The amount of information required to produce E is quantified by the complexity of the pattern, C (also known as the statistical complexity). The remaining uncertainty, h, is the entropy rate: the portion of the future dynamic that cannot be predicted even after identifying all patterns contained in the past. Graph adapted from [32,103]. (b) Illustration of the sliding-window approach we employed to measure frequency statistics from character sequences. The observed frequencies of these subsequences, the fundamental components of the dynamic process, provide the statistics that outline the underlying temporal structure, with a focus on how information is preserved, created, and destroyed at each moment of the dynamic.
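In the standard notation of computational mechanics [89,96,104], the relationships in panel (a) can be summarized compactly; the following is a textbook restatement added for orientation, not a result specific to this study:

$$E = I\big[\overleftarrow{X};\overrightarrow{X}\big], \qquad h = \lim_{L \to \infty} \big[ H(L) - H(L-1) \big], \qquad C \ge E,$$

where $H(L)$ is the Shannon entropy of length-$L$ subsequences. For large $L$, $H(L) \approx E + hL$: the predictable information $E$ appears as the intercept and the entropy rate $h$ as the asymptotic slope of the block-entropy curve [89].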
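To make panel (b) concrete, here is a minimal sketch of the sliding-window frequency statistics and the resulting block-entropy estimates of h and E, assuming the discussion dynamic has already been encoded as a character sequence. It follows the entropy-convergence logic of [89]; the study's own estimates were obtained with the causal-state reconstruction implementation cited in [111], so this sketch is illustrative rather than a reproduction of the analysis pipeline.

```python
import math
from collections import Counter

def block_entropy(seq, L):
    """Shannon entropy (bits) of the empirical distribution of length-L windows."""
    counts = Counter(seq[i:i + L] for i in range(len(seq) - L + 1))
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def entropy_rate_and_predictable_info(seq, L_max=4):
    """Finite-L estimates: the entropy rate h is the slope of the block-entropy
    curve H(L), and the predictable information E is its intercept."""
    H_L = block_entropy(seq, L_max)
    H_prev = block_entropy(seq, L_max - 1)
    h = H_L - H_prev        # new uncertainty per symbol (bits)
    E = H_L - h * L_max     # past-future mutual information estimate (bits)
    return h, E

# Toy usage on a hypothetical binary-coded activity sequence.
seq = "01101011011010110110" * 20
h, E = entropy_rate_and_predictable_info(seq)
print(f"h = {h:.3f} bits/symbol, E = {E:.3f} bits")
```

In practice, the usable window length is bounded by the sequence length: length-L subsequences must recur often enough for their observed frequencies to be reliable, which is why such estimates stabilize only for windows much shorter than the series itself.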
Figure 2. Standardized regression coefficients for predicting (a) complexity and (b) remaining uncertainty, with 95% confidence intervals.
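For readers who want the flavor of how such coefficient plots are produced, the following is a hedged sketch using statsmodels; every variable name below is a hypothetical placeholder rather than one of the study's actual measures, and the data are random stand-ins:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200

# Placeholder frame: one row per discussion, with the two information-theoretic
# outcomes and illustrative predictors (all names are hypothetical).
df = pd.DataFrame({
    "complexity":   rng.normal(size=n),   # C, statistical complexity
    "uncertainty":  rng.normal(size=n),   # h, entropy rate
    "bot_activity": rng.normal(size=n),
    "n_users":      rng.normal(size=n),
    "n_tweets":     rng.normal(size=n),
})

# z-standardize all variables so coefficients are comparable across predictors.
z = (df - df.mean()) / df.std(ddof=0)

for outcome in ("complexity", "uncertainty"):
    X = sm.add_constant(z[["bot_activity", "n_users", "n_tweets"]])
    fit = sm.OLS(z[outcome], X).fit()
    beta = fit.params["bot_activity"]
    low, high = fit.conf_int().loc["bot_activity"]  # 95% CI by default
    print(f"{outcome}: beta(bot_activity) = {beta:.2f} [{low:.2f}, {high:.2f}]")
```

Because everything is z-standardized before fitting, each coefficient expresses the expected change in the outcome, in standard deviations, per standard-deviation change in the predictor, which is what makes panels (a) and (b) directly comparable.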
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
