6.1. Analysis of the Social Networking and Activity Indexes of the Greek Political Blogs
Table 1 presents the descriptive statistics of the five indexes. Incoming links range from zero to 25. That is, there exits at least one blog that gathers at the most 25 out of 127 links (20% of the links). The mean equals 4.5. Outgoing links have a wider range, from zero to 30. The mean is equal to the mean of incoming links, but the standard deviation is greater. Outgoing links have a greater variance than incoming links do, so it seems that they are used by some blogs to a greater degree as a tool of networking. Normalized betweenness ranges from zero to 7.28 with a mean equal to 1.06 and a standard deviation equal to 1.75. There is a diversity of betweenness among the blogs.
Table 1.
Descriptive statistics of the five social network indexes.
Table 1.
Descriptive statistics of the five social network indexes.
| Minimum | Maximum | Mean | Std. deviation |
---|
Incoming links | 0 | 25 | 4.45 | 4.4 |
Outgoing links | 0 | 30 | 4.45 | 5.48 |
Normalized betweenness | 0 | 7.28 | 1.06 | 1.75 |
Number of 1-cliques a blog belongs to | 0 | 80 | 6.06 | 9.81 |
Size of Ego network | 0 | 38 | 8.64 | 6.72 |
The number of 1-cliques a blog belongs to ranges from zero to 80. That is, there are some blogs belonging to no 1-cliques, but some others are much more connected. Having in mind than the maximum value of 1-cliques a blog belongs to is 80, and since the mean equals 6.06, it becomes clear that most of the blogs belong to small groups of 1-cliques and only a minority of the blogs belongs to larger groups of 1-cliques. This property is associated with skewnness. Skewnness is a common property of many Web 2.0 applications and it represents the situation that most the studied entities have small values of a specific characteristic while only a few have the highest values. Regarding the size of ego network, the conclusions are similar but not the same. The range is smaller than that for the 1-cliques and the mean is greater than the one for 1-cliques. Yet, it seems that skewnness still holds even for this case.
A first attempt to explore whether the five indexes describe different aspects of the same property, that is, connectivity of the blogs, it is usual to calculate the correlation coefficients among the indexes [
23]. High correlations provide evidence that all of the indexes indeed measure aspects of the same thing. This property of indexes’ inter-correlations is of the essence for the graph-theoretic study of a network and in fact it provides evidence of the reliability of the proposed indexes. Conclusions are further supported in such a case of strong index inter-correlations, and there is also strong evidence that all the indexes measure network centrality and blog connectivity.
Table 2 presents the correlation coefficients among the indexes. All of them are positive and all, except one, are statistically significant. The non-significant one is the correlation between incoming links and number of 1-cliques a blog belongs to.
Table 2.
Correlation coefficients among the five social network indexes.
Table 2.
Correlation coefficients among the five social network indexes.
| Incoming links | Outgoing links | Normalized betweenness | Number of 1-cliques a blog belongs to |
---|
Outgoing links | 0.301 ** | | | |
Normalized betweenness | 0.531 ** | 0.750 ** | | |
Number of 1-cliques a blog belongs to | 0.163 | 0.737 ** | 0.413 ** | |
Size of Ego network | 0.192 * | 0.773 ** | 0.497 ** | 0.795 ** |
Since there is evidence that correlations exist among the proposed indexes, a usual approach is to perform Principal Components Analysis (PCA) with Varimax rotation. The method produces new variables that “summarize”, through a statistical procedure, the original variables. Although the new variables called Principal Components (PC) or Factors do not represent some actual measurement, their values or factor scores can be used to consequently describe values of the original variables, those which constructed the factors. For example, if factor loadings are positive, large values of the factors are associated with large values of the original variables.
By performing PCA two PCs are constructed, the first one explaining 53% of the total variance and the second one explaining 32% of the total variable (total variance explained = 85%). With reference to the factor loadings (
Table 3), PC1 refers to a blogs’ property that could be described as blogs’ attempt to construct ego networks, to belong to many 1-cliques, and to have many outgoing links. Loadings for size of ego network, number of 1-cliques a blog belongs to and outgoing links are over 0.8. This could be interpreted as the property of blogs to try to expand their territory by belonging to many groups and linking to many other blogs.
Table 3.
Factor loadings of the Principal Components constructed form the five social network indexes (Varimax rotation).
Table 3.
Factor loadings of the Principal Components constructed form the five social network indexes (Varimax rotation).
| PC1 (Total variance explained = 53%) | PC2 (Total variance explained = 32%) |
---|
Size of Ego network | 0.917 | 0.135 |
Number of 1-cliques a blog belongs to | 0.914 | 0.062 |
Outgoing links | 0.855 | 0.387 |
Incoming links | 0.013 | 0.924 |
Normalized betweenness | 0.494 | 0.745 |
PC2 refers to the property of blogs having many incoming links and normalized betweenness. Loadings for incoming links and normalized betweenness are 0.924 and 0.745, respectively. They are central blogs, regarding betweenness, mainly linked by others and not linking to others. Having in mind that by construction these two PCs are uncorrelated, it is reasonable to argue that they represent two distinct connectivity properties of the blogs. Some blogs serve as authority blogs having many incoming links and centrality, while others strive to expand their influence territory by having many outgoing links, and being members of many cliques or forming large ego-networks.
The findings are organized hereafter to respond to three related questions:
1. How are these properties associated with the blogs’ activity?
2. Which are the blogs that perform best regarding the two factor scores?
3. What is the connection, if any, between blogs that have the first properly, i.e., they try to expand their territory, with the blogs that have the second property, i.e., they are central and are recommended?
Activity regarding blogging is conceptualized in this paper, with reference to the bloggers’ activity and to the users’ community activity. They are measured by counting number of posts to represent blog activity and number of comments to represent community activity. Because both posts and comments may be so many, this paper follows the approach of Karpf [
53] and records them for a week interval. Measuring activity, although interesting, is not an end in itself. Rather, what is of interest in the analysis is to study the correlations between activity and networking practices.
Table 4 presents the correlation coefficients between number of posts and number of comments with the two factor scores. Only correlations between the second factor scores and the two activity indexes are both positive and statistically significant. There is, therefore, strong evidence that only blogs, which are central and recommended by others (with regard to incoming links), are also more active and attract more users’ activity and reaction.
Table 4.
Correlation coefficients between Principal Components (PC) scores and activity indexes.
Table 4.
Correlation coefficients between Principal Components (PC) scores and activity indexes.
| Number of posts | Number of comments |
---|
PC1 | −0.019 | 0.136 |
PC2 | 0.429 * | 0.408 * |
The construction of overall influence scores is practical and meaningful. By construction, factor scores have a mean equal to zero and a standard deviation equal to 1. It is a common procedure to separate cases (i.e., the blogs) using their factor scores. Blogs with factor scores greater than 1 are usually considered to be the ones that have the highest values of the factor scores and therefore of the proposed indexes, while those blogs with factor scores smaller than −1 are considered to have the lowest values of the factor scores. This finding also holds for the original variables, which served to form the factor. High factor scores provided that factor loadings are positive, and are associated with high values of the original indexes.
For the two PCs constructed by PCA, we can distinguish those blogs having factor scores greater than one for both PCs.
Table 5 presents the descriptive statistics for blogs having factor scores over 1 in PC1 and PC2. For convenience these blogs may be considered as best performing according to either PC1 or PC2. Twelve blogs, that is 9.5% of the 127 blogs, belong to the group of blogs having the property of striving to link to many other blogs. Twenty blogs, that is 15.7% of the 127 blogs, receive many incoming links, are more central and present the highest activity, regarding both blog activity and community activity. The 20 blogs having PC2 scores over unity are considered to be authority or central blogs, because they are heavily linked and they present high frequencies of posts and comments (
Table 5).
Table 5.
Descriptive statistics of the social network and activity indexes for best performing blogs according to PC1 and PC2.
Table 5.
Descriptive statistics of the social network and activity indexes for best performing blogs according to PC1 and PC2.
| Incoming links | Normalized betweenness | Outgoing links | Number of 1-cliques a blog belongs to | Size of ego network | Number of posts | Number of comments |
---|
PC1 scores > 1;
N = 12 |
Mean | 7.25 | 3.95 | 16.16 | 28.91 | 23 | 558.38 | 83.4 |
Std. Deviation | 3.72 | 2.72 | 7.77 | 18.25 | 6.48 | 619.98 | 208.79 |
PC2 scores > 1;
N = 20 |
Mean | 11.65 | 4.04 | 10.25 | 11.1 | 13.15 | 1104.94 | 523.3 |
Std. Deviation | 4.92 | 2.22 | 7.77 | 10.93 | 8.14 | 2150.1 | 1253.85 |
To answer to the third question, that is, what is the connection between best performing blogs regarding PC1 and best performing blogs regarding PC2, a useful and easy to comprehend approach is to turn to the blogs social network and corresponding adjacency matrix. First blogs in the rows of the matrix are labeled using two labels, blogs with PC1 scores over 1 and blogs with PC1 scores under 1, and similarly, blogs in the columns of the matrix are labeled as blogs with PC2 scores over 1 or blogs with PC2 scores under 1. Therefore, four groups of blogs are created and they can be presented in a kind of contingency-linkage table:
• blogs with PC1 scores > 1 that link to blogs with PC2 scores > 1;
• blogs with PC1 scores > 1 that link to blogs with PC2 scores < 1;
• blogs with PC1 scores < 1 that link to blogs with PC2 scores > 1;
• blogs with PC1 scores < 1 that link to blogs with PC2 scores < 1.
Then, we can count the number of actual links between blog groups, and calculate a percentage by dividing this number by the total number of links that could exist between blog groups, if all the blogs of group 1 linked to all the blogs of group 2. These percentages serve as measurements of intergroup linkages. Large percentages imply that many of the possible links between groups actually exist. Percentages are presented in
Table 6. It becomes clear that 31.67% of the possible links from blogs having PC1 scores over one are actually used to connect with blogs having PC2 scores over one. Similarly, 9.19% of all the possible links from blogs with PC1 scores over one are indeed used to connect with blogs having PC2 scores under one,
etc. It is quite clear that the biggest percentage of links exist between blogs best performing regarding PC1 and best performing regarding PC2. Blogs that strive to expand their network territory heavily link central and active blogs.
Table 6.
Percentages of hyperlinks between blog performance groups.
Table 6.
Percentages of hyperlinks between blog performance groups.
| PC2 scores > 1 | PC2 scores < 1 |
---|
PC1 scores > 1 | 31.67% | 9.19% |
PC1 scores < 1 | 6.83% | 1.75% |
6.2. An Exploration of the 20 Central-Authority Blogs
This section introduces an exploration of the 20 central (authority) blogs, located through the analysis of the previous section (blogs with PC2 scores over 1). Content analysis is followed to link the quantitative findings of this paper with the political framework and the context of the Greek political scene. Content analysis investigates content, political affiliation and topics discussed in the 20 blogs, using the NVIVO 8. For each blog, the following features were recorded in the content body:
1. The main title of the blog along with a summarized description.
2. The “about” part of each blog (when applicable), where the scope of the blog is given, the motivation of the blogger to create it, etc. Usually there is also information concerning the blogger, such as his/her interests, her/his aims, her/his academic background and her/his political beliefs.
3. The main part of this analysis, the posts of the political blogs, including the title of each post, its main body followed by its’ comments (those for November 2010).
The units of the media content were decided to be words or phrases. Synonyms or small phrases were grouped together.
Table 7 presents the political affiliation of the 20 central blogs. Most of them have no specific affiliation or have a Left affiliation. Also,
Figure 1 presents the percentages of negative, neutral or positive references that the central blogs make for the main Greek parties. From
Figure 1, it is obvious that for the period under study, half of the blogs make negative references for the Left or ND, but the majority of them make negative references for PASOK, which was the party in Governance for the period of the study. Neutral comments and references are more frequent for PASOK and ND than they are for parties of the Left, and positive references have equal frequencies for ND and PASOK (15%) and there are just a few for the Left. As a general conclusion we can say that the central blogs are generally critical to all main political parties. Of course they act within the general political context of the period (criticism for the government in relation to the economic crisis of Greece), but still they make either neutral or positive references to the parties.
Table 7.
Political affiliation of the 20 central blogs.
Table 7.
Political affiliation of the 20 central blogs.
Not specified | 8 |
Left | 6 |
PASOK | 3 |
ND | 2 |
LAOS | 1 |
Total | 20 |
Figure 1.
Percentages of the 20 central blogs with negative, neutral or positive references to the Greek political parties.
Figure 1.
Percentages of the 20 central blogs with negative, neutral or positive references to the Greek political parties.
Next, this analysis looked for specific areas of interest over which discussions occurred among the 20 central blogs through their posts/comments.
The analysis identified 11 main eParticipation areas that the central political blogs may discuss. These areas are:
1. Campaigning: Protest, lobbying, petitioning and other forms of collective action (except for election campaigns, see electioneering as topic),
2. Community building/collaborative environments: To promote individuals to come together to form communities, to progress shared agendas and to shape and empower such communities.
3. Deliberation/criticism: To support virtual, small and large-group discussions, to express thoughts or ideas, to comment on or criticize, allowing reflection and consideration of issues.
4. Discourse: To support analysis and representation of discourse.
5. Electioneering: To support politicians, political parties and lobbyists in the context of election campaigns.
6. Information provision: To structure, represent and manage information in participation contexts.
7. Polling: To measure public opinion and sentiment.
8. Concern creation: To make an impression about a fact/set of ideas and cause further uncertainty or suspicion.
9. Media and book reference: References to websites, magazines, newspapers etc. as well as books.
10. Environmental issues: References to environmental topics.
11. News.
Table 8 presents the percentages of blogs within the 20 central blogs and within the total 127 blogs, which make reference to the eParticipation areas. In general we can observe that in almost all eParticipation areas, the central blogs are more active than the overall Greek political blogosphere. To verify which are the main topics for which central blogs are most active compared to the blogosphere, chi-square tests are used. Statistically significant differences are found for the eParticipation areas to which central blogs also make references with the highest frequencies: information provision, media references, criticism, news, discourse and concern creation. Also there is a difference for environmental issues. These eParticipation areas may be used to describe a general profile of the central blogs. Thus, the central blogs can be characterized as being critical, informative, promoting discourse and concern. They are not only heavily linked but they also are active and very much involved in the political discourse.
Table 8.
Percentages of central blogs, which make references to the eParticipation areas.
Table 8.
Percentages of central blogs, which make references to the eParticipation areas.
| 20 Central blogs | 127 blogs (Total) | Chi-square significance |
---|
Information Provision | 95 | 54 | * |
Media references, publications | 85 | 52 | * |
Deliberation, criticism | 85 | 57 | * |
News | 80 | 50 | * |
Discourse | 60 | 32 | * |
Concern creation | 55 | 33 | * |
Campaigning | 40 | 32 | |
Environmental issues, concern | 25 | 10 | * |
Polling | 20 | 12 | |
Community building/Collaborative Environments | 10 | 13 | |
Electioneering | 10 | 6 | |