1. Introduction
The advent of social media, the development of Web 3.0, and the massive amounts of data resulting from these processes have shifted how companies conduct their businesses and should also impact governments [
1]. The benefits of incorporating technology in public processes’ formulation and implementation [
2,
3] and how it increases citizen satisfaction and facilitates procedures [
4,
5] have already been proven. Social networks, microblogging, blogs, review websites, and online forums are a few digital resources governments have to share information and be closer to citizens [
6,
7].
Effective connections and accountability are some of the requirements from citizens when communicating with the public sector, and governments must provide services in transparent, collaborative, and participative manners to fulfill such requirements [
8,
9]. Ref. [
10] proved how the success of Facebook engagement in the public sector was directly related to citizens’ satisfaction in general.
The above facts illustrate the relevance of studies about how governments face, implement, and take advantage of technology through objective analysis and measurement that work in the public interest. The existing research gap indicates neglect regarding social media in governments. The existing literature is mainly related to the debate between media and politics [
11,
12] or the strategies and projections of X (formerly Twitter) in electoral processes [
13,
14]. There is an evident lack of focus on studies concerning the public communication scope itself [
10].
A persistent digital divide problem in Portugal [
15] has led the country to adopt a traditional approach to public administration. In addition, the low adhesion of Portuguese people to modernization efforts, combined with widespread political participation, aggravates the current state of social media [
16]. These facts exemplify the estimate that 8.5 million people (83.8% of the total population) were active on social networks in Portugal in January 2022 [
17]. Notably, only 2.6% (207,500) of these individuals followed the official government page on X, and 1.6% (130,430) on Facebook. As a light benchmark, contrasts tweet volumes with Spain (@desdelamoncloa) and France (@gouvernementFR); Portugal shows lower activity pre-2016 and a pandemic-era surge consistent with peers.
The study’s importance lies in understanding the adoption of new technologies in governance and fostering more meaningful connections with its citizens. This study also helps reduce the significant research gap in the Portuguese public sector’s data analysis. Studies that integrate theory and practice to generate innovative insights for the public sector are essential, as they offer solutions to challenges such as low citizen confidence, resource shortages, process simplification, and low public participation [
18]. Collecting and exploring real-world data, followed by applying the theories and concepts related to the public sector, can benefit society.
The data collected from a full-archive search of 14 Portuguese governmental X accounts aims to assess the communication quality. The research proposes validating this inspection by distancing from the sentiment analysis approach, which is widely used in X analytics studies [
19,
20]. This research employed the Data Science Process [
21] to structure the study and the TPCC Index [
22] to evaluate the data.
Therefore, this research intends to generally analyze both the theoretical and the practical aspects of public social media communication on X through the following specific objectives:
Study the importance of social media techniques for public sector communication and the determining factors for a successful strategy.
Use of analytical tools to identify patterns and deficiencies in governmental communication of the X Portuguese study case.
Calculate the TPCC Index [
22] for 14 governmental agencies’ tweets to classify the communication regarding their transparency, participation, collaboration, and comfort efforts.
This report is structured into six sections: Introduction, Literature Review, Methodology, Results and Discussion, and Conclusion. The introduction consists of an overview of the topic, its relevance, and the objectives of this research. The literature review presents all theoretical aspects, concepts, and related previous studies to support the ideas regarding public communication, social media, and X analytics. The methodology explains the processes and methods used. It encompasses all data processes, from collection to preprocessing. It utilizes five distinct techniques to enhance the understanding of X data, including time series, public metrics, and topic modeling, as well as TPCC index calculations.
Section 5 contains all the results and comparisons to previous studies. The conclusion resumes the paper and provides recommendations for future work.
3. Materials and Methods
The Data Science Process’s steps are similar across different research areas. Its singularity comes from the notion of data undergoing an exploratory analysis and the application of other statistical models [
21]. First, there is the collection of raw data from real-world situations, followed by the processing and filtering steps that complete the data preparation for the proposed analyses.
We adopt TPCC because it operationalizes Open Government principles (transparency, participation, collaboration, comfort) in a way that is platform-agnostic yet auditable; adapting it to X extends prior uses centered mainly on Facebook. Five different types of evaluations composed the exploratory analysis for the present research. While the transparency, participation, collaboration, and comfort index [
22] calculation was the statistical method applied. All results were then summarized, analyzed, and presented using data visualization techniques following this study’s methodology, illustrated in
Figure 2. The study did not employ the proprietary X analytics dashboard. We conduct social media analytics on Python 3.10.
The data were collected from the official Portuguese government X account (@govpt) and thirteen other governmental agencies’ accounts, described in
Appendix A. The collection was performed through the X Academic Research API. Python was used to process and manipulate the data from the collection step until the exploratory analysis calculations. The exploratory analysis was performed on the first sample, which consisted of 18,071 tweets from the @govpt account. At the same time, the TPCC Index calculation required a complete sample that contained all tweets from governmental agencies. Each step is presented in
Figure 2, and the data attributes are described in depth in the following subsections.
3.1. Data Collection
We focus on X, given its full-archive Academic Research API, its use for real-time governmental announcements in Portugal, and the novelty of adapting TPCC beyond Facebook. Data were collected in December 2021, encompassing a period from 28 May 2009 (1:13 pm) to December 2021. Our period ends in December 2021 to provide a stable, pre-ownership-change baseline (Twitter→X rebrand, and platform policy shifts commenced later). The results, therefore, characterize the pre-2022 regime; we explicitly caution against extrapolating beyond 2021.
The extensive time frame was due to the low activity levels on the accounts, with an average of 3.92 posts per day, spanning distinct mandates and the pandemic. A small sample required a complete analysis to extract valuable information from the data, for instance, the changes between different government mandates and the differences between accounts. The total sample description (59,036 tweets) is shown in
Table 1.
As shown in
Table 1, the @govpt (República Portuguesa) and @defesa_pt (Ministério da Defesa) were created seven years before the other accounts; therefore, they present more data. Due to this disparity, the present research’s objective was not to compare the accounts but to measure the quality of X communication.
From the total amount of tweets, it was also observed that 30,010 used images, and 14,646 were retweets from other accounts. The variables collected from each tweet are exposed in
Table 2, including a description and summary of statistics that characterize the data sample. All variables collected were used in the techniques employed in this research. The public metrics, which are retweets, replies, likes, and quote counts, represent the engagement a tweet stimulates and are essential in measuring communication success.
3.2. Preprocessing
The collection phase resulted in a .csv file, which was then organized on a Dataframe using the Pandas library [
68] to process and adequately analyze the data. This stage is crucial when handling user-generated content and unstructured data. Several preprocessing techniques were used, including X-specific methods like removing tweet tags, as well as more general approaches for preprocessing all linguistic data.
The main steps of the preprocessing were as follows:
Removal of links: such as “HTTP”, “bit.ly”, “pic.X”, and other links in general, to keep only text.
Removal of X specifics: RT tag and user identification (@).
Removal of special characters and punctuation.
Lower case normalization.
Removal of stop words: using NLTK, the Natural Language Tool Kit Portuguese dictionary. Removal of connectors and words that represent no value and could cause disturbance in the analysis.
Lemmatization: a process of normalization that reverts inflected words to their dictionary form.
Tokenization: Tokenizing words in a text separates each word into an individual object.
After the preprocessing techniques were performed, two fields were added to the original Dataframe: “clean_tokens”, consisting of the remaining words split into individual elements, and “clean_tweet”, where the clean text was kept in its string form. Since different analyses require different forms of preprocessing, the data were split into two different datasets to perform the exploratory analysis on the @govpt’s data only (18,071 tweets). A complete dataset, composed of all the accounts, was used for the TPCC Index measurements.
3.3. Descriptive and Exploratory Analysis
We conducted a descriptive and exploratory analysis (EDA) on @govpt to characterize longitudinal patterns and guide the subsequent TPCC evaluation. The TPCC computations then use the full 14-account dataset [
69]. It aggregates methods for primary analysis that can detect mistakes, check assumptions, determine the relationship between variables, and enable the correct selection of models and techniques for later use. We focused the EDA on @govpt to avoid over-weighting accounts with disparate lifespans/volumes and to keep the descriptive overview concise; all inferential classification (TPCC) uses the complete dataset. For this objective, the techniques employed were time series, public metrics, and topic modeling. Sentiment classification was intentionally excluded due to known challenges in Portuguese governmental communication, such as domain-specific vocabulary, irony and sarcasm, and politicized lexicons. This decision ensured methodological rigor and interpretability. Instead, we prioritized auditable, structure-based indicators (TPCC).
Time Series: In time series analysis, data are analyzed through observation dates and over an interval, allowing for a sequential report demonstrating how variables behave over time. According to [
70], time series is helpful for forecasting values, separating valuable information from noise, eliminating the seasonal component, and identifying external events that influence variables. It converged with the primary objective of this study, which was to understand how patterns shifted according to time.
The selected approach to visualize this time series analysis was the regression line, considering the intent was to establish the trend, that is, the relationship between two variables: time (in years) and the number of posts. The year of each publication could be isolated through the “created_at” field of the tweet’s object in DateTime. Subsequently, all tweets were grouped by year of creation and counted. The results were then prepared for visualization on a regression line using Matplotlib [
71].
Public Metrics: Metrics from X, also called engagement metrics, represent the interaction between users and tweets. Retweets, likes, replies, and quotes are essential metrics to identify public preferences and measure account interaction. They are public metrics, meaning users can see these values when browsing a public X account. The most interacted tweet for each category of interaction was identified and presented.
Topic Modeling: For the word frequency analysis, the visualization technique selected was the word cloud, where words are organized in a figure of different sizes representing the frequency with which each term is mentioned throughout the document. All terms in the sample were combined on a string and served as input for the built-in function from the Matplotlib package [
71] that generated the graph.
Since every text document is composed of words, topic modeling is a technique that identifies the highly related words to determine the topics of a document’s discourse [
72]. Therefore, each document has several topics, each composed of several words.
The method used in the present study was the Latent Dirichlet Allocation, also known as LDA. Besides choosing it because it is the most used method for topic detection in text mining, which employs probability to identify the share of each topic in the documents, we also selected it for its interpretability, stability over long time spans, and ease of integrating Portuguese stopwords and lemmatization. Transformer-based BERTopic and NMF were considered, but they would add complexity and model-selection overhead in a multilingual/bureaucratic lexicon without necessarily improving interpretability for policy audiences. In this method, several topics must be provided as input. In other words, it does not automatically infer K. To calculate the optimal number of topics for the documents that compose this research’s sample, the Coherence Score and “Elbow method” resulted in seven topics (
Figure 3).
3.4. TPCC Index
The original metrics and their corresponding factors proposed by [
22] were designed for Facebook data analysis. One of the contributions of the present research is the proposal of metrics adapted for X’s evaluation of the TPCC Index. In this process, the authors mentioned that some metrics changed and others had to be eliminated, specifically the metric that referred to calendar events, since X does not provide such a feature. The metrics used in this study are displayed in
Table 3.
The metrics calculated according to the measures presented in the table above were then used to calculate the index for each of the four factors described in
Section 2.4. Therefore, the factor’s index is the sum of all its corresponding metrics divided by the number of metrics.
3.4.1. Transparency
Following the Open Government principles presented previously, several authors related transparency to successful communication in the public sector [
73,
74]. In their framework development, Ref. [
22] emphasizes the importance of transparent government communication and information as public assets. Therefore, the transparency metrics relate to posts that are easily accessible and interpretable, such as multimedia and other social media posts.
To calculate the number of multimedia and broadcast posts, the entities on the Tweet object were accessed to uncover if the tags for images and videos existed and how frequently they existed. For the social media posts, it was necessary to count the word frequency of the names of the three leading social media platforms: “Facebook”, “Instagram”, and “YouTube”.
3.4.2. Collaboration
The Collaboration factor evaluates the government’s efforts to enable citizen engagement and co-creation. According to [
75], incorporating citizen knowledge and understanding is the main benefit a government can achieve by implementing collaboration into its processes. It can enhance public engagement and participation while taking advantage of new ideas and opinions.
Therefore, a word frequency analysis was performed to identify the posts that fit the metrics for collaboration. Through raw data observation, it was possible to list the most used terms for engagement and co-creation requests and count their frequencies. The terms were defined as follows:
Engagement: “visite”, “saiba”, “leia”, “consulte”, “acompanhe” e “assista”.
Co-Creation: “vote”, “escolha”, “colabore”, “submeta”, “ajude” and “participe”.
The collaboration lexicon was derived deductively from the Open-Government/participatory-governance literature and inductively from Portuguese imperative forms observed in the corpus. We operationalized collaboration using the following base lemmas (matched case-/diacritic-insensitively and counting common inflections/imperatives): participar/participe, responder/responda, votar/vote, inscrever/inscreva-se, enviar/envie, colaborar/colabore, co-criar/co-crie, partilhar/partilhe, comentar/comente, sugerir/sugira, propor/proponha, perguntar/pergunte, convidar/convide, contribuir/contribua. We acknowledge that keyword rules can miss context (e.g., polysemy, irony/sarcasm, negation). We therefore treat them as auditable signals rather than definitive intent. Tokens are normalized by lemma with diacritics stripped; obvious boilerplate/UI strings are excluded; counts are computed at the post level.
3.4.3. Participation
Participation enables citizens to engage in public topics and decisions. For that reason, the metrics of this factor are related to whether the e-government page allows users to post on it and the number of posts made by governmental agencies containing surveys to enquire about public opinion [
10].
The Portuguese government account allowed citizens to post and interact over their account, as with any other public X account. Users could reply, mention, and retweet the public agencies’ posts. Due to this, all accounts received a score of 1 on this Boolean value metric. The second metric was calculated by accessing the entities of each tweet object and checking the frequency of “poll” entities for each page.
3.4.4. Comfort
In comfort, the framework refers to the use of social media and the Internet for the sharing of information and resources by the government, providing an accessible and multi-faceted presence online. It allows citizens to connect more with the agencies, reinforcing the sense of trust. In this case, the metrics refer to how links are distributed throughout the governmental agencies’ X accounts. The first step was to assess whether links to governmental websites were included in the account descriptions. The results were 1 or 0, indicating the presence or absence of such redirection.
Secondly, posts containing links were analyzed to separate them into URLs redirected to government websites and those redirected to external ones. Due to the automatic URL shortener employed by X to save characters on a post, it was necessary to access the entities field to collect the “expanded_url” information. Considering that all governmental websites follow the “.gov.pt” domain. Lastly, the count of external and internal links was calculated through a separate term frequency analysis on the list of URLs.
4. Results and Discussion
This section summarizes the results from the exploratory analysis and the TPCC index evaluation, along with their interpretation.
4.1. Exploratory Analysis
The results from the primary data analysis of the @govpt account are summarized in this subsection.
4.1.1. Time Series
Figure 4 illustrates the increasing trend in publication frequency. It also shows low activity in the first six years (2009–2015), followed by an increase in the following six years (2016–2021). During 2013, there were no tweets in the count. However, the year with more tweets was 2021 (3321), which aligns with the digital communication activity during the COVID-19 pandemic.
The account was created in 2009, during the second mandate of Prime Minister José Sócrates. In the following year of his mandate, 2010 (1069), the activity almost tripled, while in 2011 (681), it decreased significantly. Increased efforts from the government in digitalization marked this mandate. The analysis concluded that this mandate was the one that most frequently used the term “internet” among the four mandates [
76].
When Pedro Passos Coelho was Prime Minister, the activity in the account was inadequately low, especially in the first two years, 2012 (5) and 2013 (0). However, 2014 (436) and 2015 (872) presented higher posts. From 2016 onwards, with António Costa as Prime Minister, the use of X increased compared to the previous periods, also influenced by a rise in social media usage. In addition to external events, such as the coronavirus pandemic, which can be considered responsible for the peaks during 2020 (2340) and 2021 (3523), governments intensely used the platform during these periods [
65].
The posting activity from the @govpt was compared to Spain’s @desdelamoncloa and France’s @gouvernementfr from 2013 to 2016 for benchmarking.
Figure 5 shows that the Portuguese account had the lowest activity throughout the analyzed period. All governmental accounts experienced a rise in activity in 2015, followed by a deceleration between 2018 and 2019, and then another increase in 2020 and 2021, which was related to the coronavirus pandemic [
65]. The final period depicted that the French government’s communication was the most positively impacted by the pandemic.
4.1.2. Public Metrics
The most retweeted tweet was also the most liked (
Table 4). It was a link to the Minister of Foreign Affairs’ pronouncement recognizing Juan Guaidó as the president of Venezuela in 2019, which received few replies or quotes. The average of retweets in the sample was 6.52, and the average of likes was 7.38. These values represented a low level of interaction with the account.
The average of replies in the sample was 0.66, a minimal mean, symbolizing the low engagement in conversations on the account since replies are the most connective of the public metrics. The most replied-to tweet (
Table 5) was the link to the video of Prime Minister António Costa’s speech about the European Council in the first days of the coronavirus pandemic, which depicted motivation from citizens to communicate with the government in moments of uncertainty.
The average of quotes and re-shares of a tweet with a comment was 0.55, again representing a low engagement rate. The most quoted tweet (
Table 6) discoursed about the National Health Service and had an overall low number on all public metrics, with only 105 quotes.
An important fact to notice is that all tweets presented above were recent. The most liked and retweeted post was from 2019, while the most quoted and replied-to tweets were from 2020. This number illustrated the growing interest of users to engage with the account once publication frequency was consistent. It also implied that Portuguese people prefer posts that do not involve heavy subjects. At the same time, the reply and quote features were mainly used on content that instigated opinions, which aligns with findings from previous studies regarding different countries’ social media communication [
10,
75].
Comparing these results with the most interacted-with posts from Spain’s and France’s governmental accounts, two detections were possible. First, the significantly lower number of interactions from the @govpt account. Secondly, the content from the most engaged posts was primarily related to the coronavirus pandemic from an emotional perspective. While in Portugal, the only mention of such a period in the tables above was a video of the first press conference from the pandemic, which was also the most-replied-to tweet.
4.1.3. Topic Modeling
This subsection presents the results in word clouds, first on a general inspection and second according to the main topics identified. Later, the topics were analyzed according to the political period Portugal was experiencing.
Through the analysis of the word cloud presenting the most used terms from the sample, displayed in
Figure 6, it was possible to identify a tendency to employ political and economic terms, such as “ministro”, “conselho”, “nacional”, “governo”, and “estado”. Moreover, the presence of the word “covid” carries a heavier meaning, as it was only mentioned for two of the twelve years in the time range. These findings align with [
65], who concluded that “covid” was the main topic during the first two years of the pandemic.
According to [
10], the terms used in communication point the behavior toward a traditional way of interacting. The focus is more on providing information than on stimulating participation and feedback from citizens.
The seven identified topics were titled according to the subject they mainly described. The main terms from each topic were arranged in Word Cloud form and presented in
Figure 7; each topic is further described.
Events: Composed mainly of expressions that refer to events, since the main terms were: “preside”, “ceremony”, “opening”, and “Lisboa.” The name “Pedro” indicates a relation to Pedro Passos Coelho’s mandate.
Press: This topic was mainly related to the press conferences of the government. It mentions António Costa and the coronavirus pandemic, probably due to the regular press conferences during the pandemic.
Election: This topic primarily focused on the election period and the definition of the State budget, as it involved terms such as “budget”, “economy”, and “debate.”
Minister’s Council: The most occurring terms were related to the “council of ministers”, its reports, and broadcasts (“directo”, “comunicado”, “portal”).
European Union: This topic contained words related to the European Union meetings, measures, and mentions of higher education.
Internal Administration: This topic presented general terms but explicitly mentioned the Internal Administration Ministry (MAI). Regarding the mentions of Rui and José, it could be referring to José Sócrates’ mandate and Rui Pereira, Minister of Internal Administration, in the same period.
Economy: The terms on this topic were primarily related to investment and public services. It mentions the economy, social security, and health.
The results illustrate a more substantial presence of terms related to the State and politicians than to public services or deeper sources of interest for citizens. In [
22], the seven most frequent topics were: “Technology Information”, “Islamic Communication”, “IT Workshop”, “Education”, “Competitions”, “Civil Status News”, and “Work”. From those, the most engaging topics were “Competitions” and “Technology Information”, which are themes involving citizens in some way. It corroborated with [
10], who identified higher interaction with posts about “Competitions and awards”, “Schools”, and “Zoo/Botanical Garden.”
The topics were then classified into a yearly time series to analyze the topic’s evolution over the years and different political mandates. The results are displayed in
Figure 8 and analyzed in the following paragraphs according to each Prime Minister since the creation date of the X accounts in 2009 during José Sócrates’ mandate (2009–2011), Pedro Passos Coelho (2011–2015), and António Costa (2016–2021).
The first period analyzed was 2009–2011, during José Sócrates’s mandate. Although activity during this period was sporadic, the analysis of available data reveals spikes in topics such as “José Sócrates”, “Economy”, and “Press”. Pedro Passos Coelho’s presidency was marked by economic measures designed to deal with the Troika effects and to fight austerity [
77], which could explain the emphasis on terms related to the economy and the prevalence of the topic “European Union” and “Economy.” While the topics “José Sócrates” and “Events”, which were the most mentioned topics in the previous period, suffered a decrease in share.
António Costa’s first government (2015–2020) was the first mandate with consistent tweet activity throughout the years. As the graph indicates, the communication was diverse in terms of the topics shared. The increase in topics “Public Services” and “European Union” in the first year reflected well the priorities of the socialist government. Even though the “Economy” topic decreased by 0.04 in share between 2016 and 2017, it was still the most relevant topic throughout the mandate. From 2019 onwards, mentions of the “European Union” topic decreased, while the “Minister’s Council” and “Press” increased; the latter, especially after 2020, when the coronavirus pandemic and press conferences about it became the focal theme of the X communication.
4.2. Transparency Index
Transparency Index’s metrics were the total number of posts in the sample, the number of posts that include multimedia, and the mentions of other social media platform accounts. The results by account are presented in
Table 7. The results indicate a significant difference between the total values of posts for the @govpt account, which ranked higher for all metrics except for the number of posts that refer to other social media platforms. For such metric, the accounts @defesa_pt, @ciencia_pt, and @saude_pt presented the highest values.
The percentage of posts that fit in the transparency factor is 38.7% on the @govpt account, the second lowest, only behind the @nestrangeiro_pt, on which it represents 38.6% of the total posts.
Concerning multimedia resources, the data implied a more frequent use of images than videos. Even though both are proven to stimulate more citizen engagement than plain text, videos are considered more efficient [
75]. The prevalence of images over videos also aligns with previous studies. For [
10], in a universe of 15,941 tweets from local German governments, 215 were images, 66 were videos, and 89 were social media integration [
5]. In comparison, Ref. [
22] study resulted in 530 images, 63 videos, and 57 social media posts. Therefore, this can be interpreted as wasted opportunities for conveying messages more effectively and increasing user engagement across different platforms.
As exposed previously, to calculate the Transparency Index according to the TPCC Framework, each account’s result was divided by the total value for each metric. For example, on the @govpt account, the total posts (18071) were separated by the total posts for all accounts (59036). Secondly, these four results were summed for each account and divided by the number of metrics on the transparency factor, which was four. The results from the described calculation can be verified in
Table 8.
The frequency of publications was directly related to the Transparency Index values, mainly due to the metric “Total number of published posts”, where the accounts @defesapt and @govpt presented the highest results. Compared to the Arabic countries’ official Facebook pages analyzed by [
22] and presented in
Appendix C, these two Portuguese accounts outperformed all Arab countries analyzed, since the highest value belonged to Tunisia with a Transparency Index of 0.620.
4.3. Collaboration Index
The Collaboration Index utilized word count values from two term categories: one aimed at stimulating citizen engagement and the other requesting co-creation. A word frequency count was performed on Python to identify the frequency of terms belonging to each category that were presented previously, and the results are displayed in
Table 9.
Through the percentage calculation of collaborative posts for each account, it was pointed out that engagement terms were used six times more than co-creation terms. In percentage terms, the account @iestruturas_pt ranked higher in collaboration (14.9% of total posts), followed by @ciencia_pt (5.6%). The accounts with more activity @govpt only employed collaborative terms in 3.4% of their posts.
The most used term to request engagement was “know”, written in 781 tweets. It was primarily used with links, encouraging readers to click on the URL and learn more about the topic described in the tweet. The most used term to request co-creation from citizens was “escolha” (98 times). This term invites readers to participate in public consultations and the government’s decision-making processes.
The values of engagement and co-creation terms were first divided by the total number of posts in the collaboration category, and then by two, the number of metrics. The results are displayed in
Table 10.
All values calculated for the Collaboration Index were lower than the transparency values, indicating that the accounts do not request citizen engagement and co-creation through X. Ref. [
10] also identified this problem, as only 14 tweets requested citizens to be involved with the government in the study. Compared to [
22], the Portuguese government accounts presented significantly lower Collaboration Index results.
4.4. Participation Index
The values of the Participation Index metrics are presented in
Table 11. The first metric refers to how the government enables citizens to post on the analyzed page. According to [
4], this metric would only be False (or 0) if the page had limitations on access, such as disabled comments or a private page, which did not happen for any accounts analyzed in the present study [
2]. Therefore, the value of 1 was added to all accounts concerning this metric.
Through the results for the second metric, posts containing surveys, it was possible to determine low employment of this resource by asking for citizens’ opinions, consistent with previous studies [
10,
75,
78]. Ref. [
4] Additionally, the limited use of polls was noted, with only 3 out of 12 countries utilizing the resource five times [
4].
Studies have proven that digital communication success is directly related to participation among actors [
2]. In the research developed by [
22], regarding Arab countries, some subjects presented higher results, with a maximum of 0.620 from Tunisia. Still, half of the sample obtained a null minimum. The final values for the Participation Index are displayed in
Table 12. The subjects that ranked higher besides @govpt were @trabalho_pt and @agricultura_pt.
4.5. Comfort Index
Regarding the last factor that comprises the TPCC index, the Comfort Index calculation was based on the presence of links redirected to governmental websites, the number of links to e-government websites, and external ones. Regarding the metric “Presence of link to e-government website”, 1 (True) was summed to all accounts except @ainterna_pt. The link on the account’s description was “portugal.gov.pt”. The results for the remaining metrics are shown in
Table 13.
The data revealed a pattern of link usage in messages, with a notable preference for URLs redirecting users to official governmental pages. These represent 90.6% of all posts that included links in the sample. The posts that were redirected to internal pages followed the same model, a type of news post with headlines and URLs that urged users to access the website for more information.
The results align with [
22], where 235 tweets contained internal links, and 103 contained external links. According to [
10], external links (2265) were more prevalent than governmental ones (1614) on Italian accounts [
5].
It was also observed that the pages with the most external links were @govpt and @saude_pt. However, the @govpt account surpassed all others regarding governmental (13,876) and external links (1682). These data highlighted the need to enhance connections between governmental agencies’ websites and social media accounts, as well as with pages from various sectors, ensuring transparency in cooperation and partnerships. The accounts analyzed by [
10] also presented low values of URL usage in general [
5].
The Comfort Index calculation is performed differently from other metrics, and the results (
Table 14) show a similar value between accounts, except for @ainterna_pt, which presented a considerably lower value because it did not sum a value of 1 for the presence of a governmental link on the account description. Ref. [
22] results presented an average of 0.080; in the present analysis, the average was 0.380.
4.6. TPCC Index
For the final estimation, all four indexes previously measured were summed and resulted in a TPCC Index for each account, shown in
Table 15. All results considered, the @govpt and @defesa_pt were the first accounts to be created and therefore had more weight in the sample, yet these were also the most consistent accounts in publications. The average of all TPCC results was 0.808, and the accounts @govpt, @defesa_pt, and @saude_pt pages were positioned above them. The remaining accounts could be considered under the average and, therefore, low levels of the TPCC index. For a compact, at-a-glance overview of per-account values (transparency, participation, collaboration, comfort, and TPCC), please see the
Appendix C tables. To avoid duplicating content and keep the main text concise, we direct readers to the full per-account breakdown.
Moreover, the official government account @govpt presented a higher TPCC index than twelve Arab countries’ Facebook accounts that underwent the same analysis by [
22], where Abu Dhabi (1647) and Bahrain (1278) presented the highest indexes.
The low frequency of posts, averaging 3.92 per day, was the greatest hindrance to communication success and a higher average of the TPCC Index. Analyzing the public metrics for the @govpt account (exploratory analysis), all posts that ranked higher on retweets, likes, quotes, and replies contained a link or multimedia resource, which was already pointed out as a primary driver of engagement and should be considered an essential factor in the planning of a social media strategy for the Portuguese government. Rankings are primarily driven by (i) account lifespan/volume (affecting transparency metrics), (ii) link usage shares (comfort), and (iii) polls/co-creation prompts (participation/collaboration). Earlier-created, consistently active accounts (@govpt, @defesa_pt) score higher. For within-sample comparisons, we classify TPCC using sample quartiles: Q1 = “low”, Q2 = “moderate−”, Q3 = “moderate+”, Q4 = “high.” These bands are descriptive (not absolute thresholds). Cross-study comparisons should adjust for platform and time period.
The results of the collaboration and participation factors displayed a preference for engagement over co-creation terms. However, both categories were more frequently employed than participative terms, which suggests that the Portuguese government did not utilize X as a trusted channel for surveys, inquiries, or information collection. This prevalence suggested that the Portuguese government fits Level 3 of the maturity model proposed by [
55]. The maturity level is directly related to the level of effort from the public sector to collaborate and involve citizens more deeply in its processes. It has not yet reached Level 4, which is Open Collaboration, and this is common in public sector accounts [
10,
50].
The exploratory analysis, developed over the @govpt account, used the main methods existing for X analytics and enabled a detailed summary of the Portuguese government practices, partly because this account represented an essential origin of the content and central point of interaction for all other accounts; therefore, comprehending how this account behaves, we can determine all other accounts’ activities. The time series analysis depicted that though the X account was created in 2009, it was only actively used from 2016 onwards.
While the results from the linguistic analysis presented on WordCloud and topic modeling showed the low adoption of terms outside of the governmental sphere, suggesting the Portuguese government used social media for its informational aspect rather than as a communication or collaboration channel. This fact aligned with the habit of the Portuguese population of using social media to consume and not to interact, which leads to low numbers of followers, engagement metrics, and interest in these accounts, which is worsened by the low frequency of posts and leads to low levels of transparency, participation, collaboration, and comfort.
The engagement-enabling features identified in this study remain modest. The higher TPCC relative to prior Facebook-based studies reflects different platforms/periods and metric compositions; we therefore interpret results as relative rather than absolute success.
At a glance, transparency and comfort are consistently the strongest dimensions across accounts, while participation and collaboration are comparatively less developed; accordingly, differences in overall TPCC are primarily explained by account longevity/volume and the frequency of link-sharing and interactive content. See
Appendix B for the exact per-account figures.
5. Conclusions
Many countries, including Portugal, have adopted the Open Government principles and are taking the proper steps toward a more transparent, participative, and collaborative governance. However, they still face several challenges in consolidating innovation in their communication on more specific and general terms of digital adaptation. This research is the first to investigate the communication facet of this problem in Portugal and resulted in a complete profile of the communication process, with the identification of patterns and missteps.
This study’s main objective was to comprehend the theoretical concepts governments had at hand to guide their interactions with citizens in the digital era. This question is addressed through the presentation of Open Government principles, followed by the identification of 31 resources to achieve these principles, as they are connected to the goals of social media use in the public sector. In that sense, the TPCC index proved to be a valuable measure of the adoption in governmental accounts.
It is unclear if there are norms regarding how the analyzed agencies should behave online. Still, due to similarities in the topics and content types, it is possible to imply a pattern in all accounts, which depicts joined efforts from agencies and the government to enhance the interaction with citizens. Although it was possible to visualize the evolution in the use of social media by the Portuguese government, some aspects of the Portuguese government and society hindered the success of this communication. Some of these factors include the old-fashioned manners of governing and processes, the nature of the population’s use of social media for content consumption rather than interaction, and the prevalence of other communication channels such as radio and television.
The exploratory analysis helped determine critical characteristics of the Portuguese government’s X communication: low and almost inadequate frequency of publications; preformatted post styles, focused on informing about the government’s day-to-day activities rather than engaging in conversations with the citizens; low use of essential resources such as videos, links, and connections to other social media platforms; little diversity on posted topics over time; no connections with external accounts while also presenting a low number of connections with internal accounts.
The second part was the application of the TPCC index calculations proposed by [
22], resulting in a “grade” for each of the four factors. This research successfully validated such calculations for X due to changes in the criteria proposed initially for Facebook. Moreover, although there is a lack of comparison for these measurements, it is possible to make more general assumptions about the quality of communication being put into practice by the agencies and identify the most significant challenges.
5.1. Theoretical Contributions
The present study contributes to the existing research on social media use by the public sector and, more specifically, analysis of governmental activity on X to build on previous studies [
22,
78,
79]. In the development of this research, the focus of the analysis was the Portuguese case study, selecting different subjects from the governmental scope to provide a comprehensive critical analysis of the topic.
As the studies suggest, effective social media strategies for governmental purposes revolve around perceptions of transparency, participation, and collaboration in society. In other words, modern governance involves citizens in every decision-making step, keeping them informed and engaged in conversations.
Above all, the main contributions of this research are situated in the Portuguese case study and the application of the TPCC Index methodology. First, it reinforces the notions about the Portuguese situation, given that the literature on this subject is scarce and previous studies focused either on the political aspect [
64] or the user characteristics [
16] creating a gap in the governmental angle of the matter. Secondly, a central theoretical contribution of this study is the adaptation of the TPCC index, initially developed for Facebook, to the platform X (formerly Twitter). This methodological innovation expands the applicability of the TPCC framework to different social media environments and provides a foundation for cross-platform and cross-country comparisons of governmental communication success.
5.2. Managerial Implications
In practical terms, the present research aimed to identify patterns and problems within public digital communication in Portugal and develop a deeper understanding of the critical factors influencing the public sector’s use of social media. Under the framework presented in the research, four principles explain the relevance and purpose of communicating online with citizens: transparency, participation, collaboration, and comfort. Close monitoring of all metrics related to these principles and their incorporation in the government’s communication planning proved essential to establishing successful and meaningful interactions.
For policymakers, this TPCC adaptation provides a replicable and auditable method to evaluate the effectiveness of governmental communication strategies on X. It allows governments to benchmark their performance and identify areas of improvement in transparency, participation, collaboration, and comfort. For scholars, the approach demonstrates how platform-specific adaptations of established indices can enrich comparative research and build cumulative knowledge in public communication studies.
The Portuguese case in this research highlighted the need to enhance various aspects of X communication to address issues like low engagement and interest in the accounts. The main identified issue was the low frequency of tweet posting, which is exacerbated by social media algorithms that determine what each user sees, thereby amplifying the reach of content from accounts that post regularly. Therefore, it is essential to maintain a high average of daily posts to stimulate interactions and create space for content diversification.
Another problem identified was the use of a template for most tweets in the sample, as the repetition of the tweet structure discourages citizens from interacting with it. This issue could be solved by using dynamic resources more frequently on posts, such as videos and broadcasts, instead of images, the leading multimedia resource. Besides implementing different types of content, the messages should be more engaging. The low level of collaborative and participative terms demonstrated this fact.
In conclusion, the last point of change for the Portuguese government towards successful communication on X is the element of network and conversation between different accounts and pages, which is nonexistent in the tweets analyzed. Raising the number of interactions with other accounts, especially outside of the governmental scope, and adopting techniques to answer the feedback presented by citizens could be other valuable forms to improve the account’s results.
5.3. Limitations and Recommendations for Future Research
Every research development is accompanied by limitations that sometimes lead to alternative paths. In the present study, the most significant limitation and, at the same time, motivation for the study was the research gap. Research has shown that there is a lack of interest in studies about the public sector, primarily due to the uncertainty of financial gains and the government’s limited investment in public development.
Moreover, the choice of the subjects for the analysis was also limited by a common problem: the small sample size due to the low activity on the Portuguese government accounts. For that matter, the decision to extend the timeframe of the data was taken. Although this achieved a good representation, as all available data were collected, it also became harder to obtain good insights, leading to the split of the sample for different analyses.
We note the well-known pitfalls of social media analytics. Activity is not influence. High posting frequency or raw engagement counts do not necessarily reflect persuasive power, reach into relevant publics, or policy impact; similarly, posting volume is only weakly coupled with meaningful engagement. Simple aggregates are easy to over-interpret, so we read TPCC as a descriptive benchmark of observable behaviors, not a causal estimate of impact.
Given non-independence of posts within accounts and unequal account lifespans, we treat TPCC as a descriptive benchmarking tool and refrain from between-account significance tests. This limitation could be addressed in future studies using account-level bootstrapping.
Another limitation is the temporal validity, which can produce confounding factors. The platform/policy changes post-2022, and the study provides a historical baseline rather than a real-time dashboard. Also, the covered period includes COVID-19, which likely altered posting and audience behavior. Future studies should analyze the behavior of these accounts, both before and after platform changes.
Additionally, for future studies, the recommendations would be to explore specific aspects of Portuguese government communication, such as economics, education, and health, while also measuring public engagement and contributing to understanding how the Portuguese people interact with these accounts. The development of further pattern recognition regarding engagement can help understand the topics, people’s responses to them, and the use of replies and quotes data to detect and analyze the conversational facet of social media communication.
Another element that requires further development is the measurement methods, such as the TPCC index. The further development of studies and analytical models that implement such calculations creates an environment for comparative and global indexation of public digital communication. The public sector’s lack of awareness about these methods contributes to the low number of productions and the limited attention to how the government can better engage with citizens and leverage social media features to enhance trust and democratic processes.