Next Article in Journal
Feedback Schemes for the Action-Dependent Wiretap Channel with Noncausal State at the Transmitter
Next Article in Special Issue
Investigating the Randomness of Passengers’ Seating Behavior in Suburban Trains
Previous Article in Journal
A Monotone Path Proof of an Extremal Result for Long Markov Chains
Previous Article in Special Issue
Matching Users’ Preference under Target Revenue Constraints in Data Recommendation Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bipartite Structures in Social Networks: Traditional versus Entropy-Driven Analyses

1
Department of Operations Research, FernUniversität in Hagen, 58097 Hagen, Germany
2
Department of Quantitative Methods, FernUniversität in Hagen, 58097 Hagen, Germany
3
Department of Business Administration, Private Hochschule für Wirtschaft und Technik, 49377 Vechta, Germany
*
Author to whom correspondence should be addressed.
Entropy 2019, 21(3), 277; https://doi.org/10.3390/e21030277
Submission received: 15 February 2019 / Revised: 6 March 2019 / Accepted: 7 March 2019 / Published: 13 March 2019
(This article belongs to the Special Issue Entropy and Information in Networks, from Societies to Cities)

Abstract

:
A special type of social networks is the so-called affiliation network, consisting of two modes of vertices: actors and events. Up to now, in the undirected case, the closeness of actors in such networks has been measured by their jointly-attended events. Indirect contacts and attenuated and directed links are of minor interest in affiliation networks. These flaws make a veritable estimation of, e.g., possible message transfers amongst actors questionable. In this contribution, first, we discuss these matters from a graph-theoretical point of view. Second, so as to avoid the identified weaknesses, we propose an up-and-coming entropy-based approach for modeling such networks in their generic structure, replacing directed (attenuated) links by conditionals: if-then. In this framework, the contribution of actors and events to a reliable message transfer from one actor to another—even via intermediaries—is then calculated applying the principle of maximum entropy. The usefulness of this new approach is demonstrated by the analysis of an affiliation network called “corporate directors”.

1. Introduction

Social Networks (SN) are actors and their mutual relationships. The aim of Social Network Analysis (SNA) is to record and interpret structures within the social fabric. How do different actors or groups of actors act within networks? What benefits do they gain from this, and what disadvantages do they suffer in interacting with other actors? Ever since Moreno [1] published his ground-breaking work, scientists have been able to support their analyses with graphic representations of these networks. Actors are displayed as nodes and relationships as either undirected or directed edges (arrows). They are a visual representation of symmetrical or asymmetrical relationships. Such simple social structures are generalized in multigraphs or hypergraphs. Whereas multigraphs show several relationships, hypergraphs focus on so-called hyperconnections; meaning connections that link up more than two actors.
Social affiliation networks are a specific type of social network. The term affiliation here describes a known membership or affinity. In other words, social affiliation networks are used to describe which actors belong to which clubs, associations, or interest groups or whether they meet at specific events. Probably the best-known, frequently-cited example of this is that of the “Southern Women” [2]. The authors describe which society ladies in a small town in the Southern States of the U.S. attend what events, and these observations are subject to a profound sociological investigation of the town’s society. Such, among others, affiliation networks typically have two modes, making them bipartite. The actors constitute one type of network node, and the events the other type. Only elements with different node types can form relationships. Once such a network has been selected, the actors can be investigated in relation to the type and number of their contacts, and the clubs can be examined in relation to their attractiveness or suitability for exchanging messages, knowledge, etc. Deviating from classical notation, from now on, we use the expression platform instead of clubs, associations, or events. Platforms are the loci of message transfer, be it notice-boards, newsgroups, or social media. Mind the fact that in reality, message transfers via platforms do not necessarily involve personal contacts. Very frequently, personal contacts are replaced by media channels, as mentioned above.
Interestingly enough, there are several applications of bipartite graphical structures in other fields of research, as well, such as collaboration networks joining, e.g., projects and project partners [3,4] or bibliographic networks joining, e.g., authors and papers or conferences, respectively [5,6].
Borgatti and Everett [7], Borgatti [8] described bipartite structures and their special features: the affiliation matrix, the sociomatrix that can be deduced from this, and different ways to determine the “proximity” of actors as an indicator of message transfer between them. They elaborated on how traditional indexes, such as network centralization or the centralities of actors and platforms, may only ever be applied after undergoing significant changes and outlined the level of modification required. Clearly, the centralization of a bipartite network cannot be the same as that of a traditional network, since connections cannot exist between nodes of the same type. The authors further investigated how the key feature of a two-mode system impacts the cohesive groups within networks, on structural and regular equivalence, and on many others.
One aspect that all traditional analyses of affiliation networks have in common is that links are always viewed as undirected. Their semantics consists of the non-directional connection between actors and platforms.
Therefore, such structures are only ever evaluated and analyzed in relation to the number of such links and the parameters resulting from this. “How often does actor i meet actor j at different clubs or associations?” “How many of the members of club k are also members of club l?” This is then used to determine the likelihood that, e.g., a message will be forwarded from i to j or to deduce the social proximity between k and l. Instead of focusing merely on certain transfers of messages or news, we also accept an attenuated version. The idea of an attenuated transfer was first mentioned by Katz [9] and referenced and developed further by Bonacich [10] and then by Bonacich and Lloyd [11]. Everett took up this idea and applied it to undirected affiliation networks [12]. The present paper makes several generalizations:
  • It focuses not on undirected, but instead on directed bipartite graphs. This takes into account the simple fact that, e.g., a message transfer from an actor to a platform does not necessarily occur with the same level of likelihood as vice versa. None of the quoted authors considered attenuated and directed affiliation networks.
  • Rödder et al. [13,14] studied an initial example of modeling general social networks using the principle of entropy. We will apply such probabilistic modeling in order to analyze attenuated directed social affiliation networks effectively. Such analysis then permits rankings of actors and platforms with respect to their influential power.
The Introduction is followed by Section 2, which outlines traditional affiliation network analysis, to provide a basis for presenting and defining the new method in subsequent sections. Section 2.1 formally presents affiliation networks, whereas Section 2.2 sets out direct and indirect contact frequencies between actors. Section 2.3 then outlines the transfer probabilities for messages that can be deduced based on the frequency of contact according to selected normalizations. A short example is then presented, and relating questions are addressed. Section 2.4 continues by generalizing to directed graphs. Section 3 is dedicated to the novel modeling method: Section 3.1 focuses on syntax and semantics and Section 3.2 on how to set up a probabilistic model for a bipartite network. Section 4 shows the potential of an entropy-based model by applying it to a network of 20 actors and 24 platforms. Finally, Section 5 offers a summary and attempts to present the prospects for further research.

2. Affiliation Networks and Traditional Analyses

2.1. Basic Concepts and Their Sociological Meanings

As outlined in the Introduction, affiliation networks have two types of entities: actors and platforms. Some actors share specific platforms: they might, for instance, go to the same clubs, use the same social media channels, etc., while others might not. The sociological literature consistently assumes that the frequency of joint platforms amongst actors supports their willingness to exchange knowledge or messages.
A (general) graph consists of a set of nodes V and a set of edges E : G = ( V , E ) . If two nodes v , w V are linked by an edge e E , they are called adjacent. If e is undirected, we write e = ( v , w ) . If it is directed, then e = v , w ; the direction in the latter case is from v to w . The sociological context of such graphs is well known; see Scott [15].
A graph G = ( V , E ) is bipartite if V = V 1 ˙ V 2 , V 1 , V 2 , and the following applies for each connection ( v , w ) , or v , w : if v V 1 , then w V 2 and vice versa. Nodes from V 1 are never mutually linked, and neither are nodes from V 2 . The cardinalities of these node sets are n 1 = | V 1 | and n 2 = | V 2 | . We will start by focusing on undirected bipartite graphs, typically used to describe affiliation networks. We will use V 1 to describe the actors and V 2 to describe the platforms. Figure 1 shows a generic affiliation network.
The first index of the nodes shows the entity type, and the second numbers them sequentially.

2.2. Contact Frequencies between Actors in Undirected Graphs

In an affiliation matrix for a network described in Section 2.1, rows will denote actors and columns will denote platforms; it has entry one in row v 1 i and column v 2 k if ( v 1 i , v 2 k ) E and zero, otherwise. This affiliation matrix A = A n 1 × n 2 hence is a rectangular matrix with n 1 rows and n 2 columns. Therefore, AA T is an n 1 × n 1 matrix, whose entries are the actors’ mutual contact frequencies. In the same way, A T A is an n 2 × n 2 matrix, whose entries constitute the number of joint memberships in platforms.
Borgatti [8], Everett [12] showed a block matrix that combines both views, calling this a “bipartite adjacency matrix B ”.
B = 0 n 1 × n 1 A n 1 × n 2 A n 2 × n 1 T 0 n 2 × n 2 Key : Zero refers to matrices filled with zero entries .
B is square ( n 1 + n 2 ) × ( n 1 + n 2 ) . B · B then gives:
B 2 = A A T 0 0 A T A
and consequently, both the actors’ contact frequencies and joint memberships in platforms. B 3 , B 4 , as well as greater powers can be formed directly, and this, e.g., gives:
B 3 = 0 A A T A A T A A T 0
B 4 = A A T A A T 0 0 A T A A T A .
While the entries in B 2 show the actors’ contact frequencies and joint memberships in platforms (see above), B 3 demonstrates the impossibility of linking actors with actors or platforms with platforms in three steps. Finally, B 4 shows the contact frequencies between actors also using an intermediary and indirect joint memberships. In other words, actor i might not be a member of the same platform as actor j, but both have the same membership as a third party. This can now be applied to platforms and extrapolated to matrices for greater powers.

2.3. From Contact Frequencies to Transfer Probabilities

If we assume that the contact frequencies between actors indicate their disposition towards passing on messages or news, this would be in line with the basic idea put forward by the respective literature [7,8,16]. We will now present a short example to make this idea more transparent.
Let us take a look at the affiliation network in Figure 2.
We chose this example because:
(i)
it includes actors not present in all platforms
(ii)
some pairs of actors share several platforms
(iii)
some pairs of actors can only contact each other via an intermediary
(iv)
some pairs of actors can only contact each other via two intermediaries.
These verbal characterizations can now be visualized in matrices, with explanations provided.
Entropy 21 00277 i001
Entropy 21 00277 i002
Entropy 21 00277 i003
Entropy 21 00277 i004
The traditional literature only uses direct contacts such as those contained in the matrix AA T to calculate transfer probabilities. Such probabilities are estimated based on relative frequencies applying various scaling rules. If ( u i j ) n 1 × n 1 are the elements of the matrix AA T , then we can scale as follows according to Borgatti [8]:
u i j n 2
u i j min ( u i i , u j j )
u i j u i i + u j j u i j
With reference to Formulas (9)–(11) and AA T as per (6), we obtain the following transfer probabilities:
0.5 0.5 0 0 0.5 0.5 0 0 0.5 0.5 0.25 0 0 0 0.25 0.25 0 0 0 0.25
1 1 0 0 1 1 0 0 1 1 0.5 0 0 0 0.5 1 0 0 0 1
1 0.67 0 0 1 0.67 0 0 0.67 0.67 0.25 0 0 0 0.25 0.5 0 0 0 0.5
The missing entries along the diagonal are self-explanatory.
All scalings can be applied easily, but nevertheless, the question arises which of these transfer probabilities should be used in further analyses.
Using the examples of (7) and (8), this section demonstrates that indirect contacts can exist via one, two, or several intermediaries. Similar transfer probability patterns could also easily be estimated from AA T AA T or AA T AA T AA T , but the problem of how to choose the most suitable scaling would remain.
Keep in mind that the entries in the matrices (6)–(8) describe walks of lengths of 2, 4, or 6 and that this might involve multiple contacts between nodes or repeated runs along the same edge. By way of example, from the 12 walks from v 11 to v 12 in (7), four come back to v 11 before reaching v 12 ; and 10 are not even trails, as they run twice along the same edge. We verify that there are only two paths from v 11 to v 12 with one intermediary, in this case, v 13 . What finally remains as a suitable set of direct or indirect contacts is an open question and is highly context-dependent.
If we accept that the messages can also be transferred indirectly via intermediaries, we must still determine how direct and indirect contacts or contact frequencies should be weighted in relation to each other. What is the significance of a contact via one or several intermediaries compared to a direct contact?

2.4. Contact Frequencies and Transfer Probabilities in Directed Graphs

The bipartite adjacency matrix—if modified slightly—is also ideal for representing directed graphs. Why directed graphs? As already mentioned in the Introduction, a message transfer from an actor to a platform does not necessarily occur with the same level of likelihood as vice versa. Imagine a club has set up a newsletter to inform its members about upcoming fundraising campaigns or future meetings of members, etc. Undoubtedly, the frequency of reaching a member when sending a message is different from the likelihood of an actor to reveal an item of information to the club. Hence, directed graphs are a must in affiliation network analysis. The upper part of (12) shows the adjacencies of actors in relation to platforms, while the lower part describes that of platforms to actors. In general, such matrices are not transposed with each other, of course. We can write the general version as:
B ¯ = 0 A A ¯ 0 .
B ¯ again is square ( n 1 + n 2 ) × ( n 1 + n 2 ) . B ¯ · B ¯ now results in:
B ¯ 2 = A A ¯ 0 0 A ¯ A .
In addition, B ¯ 3 , B ¯ 4 , and greater powers can also be formed, which, e.g., creates:
B ¯ 3 = 0 A A ¯ A A ¯ A A ¯ 0
B ¯ 4 = A A ¯ A A ¯ 0 0 A ¯ A A ¯ A .
The entries in B ¯ 2 again show the contact frequencies between actors (upper part) and joint memberships (lower part). B ¯ 3 and B ¯ 4 can be interpreted accordingly; for details, see Section 2.2. These frequencies are of course significantly limited by the direction of arrows.
Figure 3 contains all connections from platforms to actors as in Figure 2, now directed, but it only has three connections from actors to platforms. We can now specify A , A ¯ , A A ¯ , A ¯ A , A A ¯ A A ¯ , A ¯ A A ¯ A .
A = 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 A ¯ = 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 1
A A ¯ = 0 0 0 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 1 0 0 0 0 0 A ¯ A = 1 0 1 0 1 0 1 0 0 0 1 1 0 0 0 1
A A ¯ A A ¯ = 0 0 0 0 0 1 1 2 1 0 0 0 1 2 1 0 0 0 1 1 0 0 0 0 0 A ¯ A A ¯ A = 1 0 2 1 1 0 2 1 0 0 1 2 0 0 0 1
The estimation of transfer probabilities from contact frequencies follows the same logic as in Section 2.3 and results in similar difficulties as for non-directional bipartite structures. Therefore, this is omitted here.
To sum up, this section shows a severe problem in determining transfer probabilities by means of direct or indirect contact frequencies, be the respective graphs undirected or directed. Due to the great variety of calculating such probabilities, the analyst might choose the wrong aggregation method and hence obtain a biased result. To the best of our knowledge, the only probabilistic method that leads to an unbiased representation of contact frequencies uses the principle of maximum entropy. All available data concerning direct and indirect contact frequencies are “married” in a probabilistic conditional-logical framework; for an axiomatic justification, cf. Kern-Isberner [17].

3. Entropy-Driven Bipartite Network Analysis

3.1. Syntax and Network Load

The details in this section are based on the representation of knowledge processing in social networks in Rödder et al. [13], Brenner et al. [18]. They are repeated here and applied to affiliation networks in the subsequent Section 3.2.
Let us take a set of n nodes { v 1 , , v n } . Each node v i is represented by a binary variable V i with the values V i = v i and v i = 1 / 0 . Therefore, v = { v 1 , , v n } are the respective configurations. For pairs of nodes, V j = 1 V i = 1 are conditionals; ∣ is the conditional operator. For a detailed discussion on conditionals, see, e.g., Calabrese [19] or also Rödder et al. [13].
The semantics of these symbols is as follows: V i = 1 / 0 is the proposition; node v i either knows the message ( 1 ) or not ( 0 ). The conditionals describe potential transfer: if v i has the message, then it probably has v j . Therefore, conditionals replace weighted arrows in graphs.
Let us assume that sociological inquiries have only provided transfer probabilities p i j for several pairs of nodes v i , v j , and not for others. As such, this network consists of a set N { 1 , , n } × { 1 , , n } , and the related conditionals and probabilities:
V j = 1 V i = 1 with p i j for ( i , j ) N .
Now, we look for a probability distribution Q on { v } , which takes account of the transfer probabilities:
Q ( V j = 1 V i = 1 ) = p i j for ( i , j ) N .
Such a distribution is called a network load.
If p i j are entered consistently, then (21) yields a particular distribution on a network.
Q = arg min R ( Q , P 0 ) = v Q ( v ) log 2 Q ( v ) P 0 ( v ) subject to Q ( V j = 1 V i = 1 ) = p i j , ( i , j ) N .
Equation (21) respects all p i j and creates the distribution Q of Minimal Relative Entropy (MinREnt) or Kullback–Leibler divergence R from the uniform distribution P 0 on { v } . As is well known, the minimization in (21) is equivalent to the maximization of the entropy H = v Q ( v ) log 2 Q ( v ) . Therefore, we call Q a Maximum Entropy (MaxEnt) load on the net. Q is a distribution used in artificial intelligence as a knowledge base for the entire network structure [14]. Keep in mind that (21) has a strict axiomatic justification; see again Kern-Isberner [17].
For solving optimization Problem (21), an algorithmic framework is needed. Two frameworks are, e.g., LEXMED [20] and SPIRIT [21]. In the remainder of this paper, we focus on the latter, which was developed at the FernUniversität in Hagen, Germany. For its functionalities, see [21].
Once a distribution is calculated, what is the impact of a specific message sent by node v i : V i : = 1 ? If node v i sends the message, it will penetrate the network according to the probabilistic conditional structure. This process can be performed in SPIRIT [22] by solving the equation:
Q = arg min R ( Q , Q ) = v Q ( v ) log 2 Q ( v ) Q ( v ) subject to Q ( V i = 1 ) = 1 .
Q is the distribution on the network of minimum divergence from Q subject to the condition that V i = 1 . Therefore, this means conditioning a distribution as a whole. If we have Q , then Q ( V j = 1 ) can be calculated for other j i . Q ( V j = 1 ) is the probability that v j receives the message if v i sends it, i.e., actor j’s reception probability. According to the considerations in this section, v j can be a neighbor of v i , or not. Q ( V j = 1 ) even applies to v j , which can only be reached in the network via one or several intermediaries. Rödder et al. [23] also dealt with the extent to which such reception probabilities are only vague conjectures or resilient estimates.
The probabilistic conditional-logical framework developed so far will be applied to two-mode networks in the next section.

3.2. MaxEnt Distributions in Two-Mode Networks

Now, we consider the sets V 1 = { v 11 , , v 1 i , , v 1 n 1 } , V 2 = { v 21 , , v 2 k , , v 2 n 2 } and the related variables { V 11 , , V 1 i , , V 1 n 1 } , { V 21 , , V 2 k , , V 2 n 2 } .
Conditionals can then be displayed as follows:
  • V 1 i = 1 V 2 k = 1 [ p k i ] for transfers from platforms to actors and
  • V 2 k = 1 V 1 i = 1 [ p i k ] for transfers from actors to platforms,
with the probabilities p k i and p i k .
Let us refer to Figure 3 to illustrate the relationships, and assume, first, that no transfer probabilities are known. Therefore, the set of conditionals in (21) is empty, and according to Figure 4 (top), the set of all nodes yields the marginal distributions P 0 ( V 1 i = 1 ) = P 0 ( V 2 k = 1 ) = 0 . 5 . The conditioning process, i.e., solving Equation (22) for specific nodes, can be realized in SPIRIT through clicking, e.g., V 12 : = 1 results in Figure 4 (bottom). As expected, the marginal distributions for all nodes—except V 12 —have not changed.
Exemplary probabilities are now assigned to the conditionals as shown in Table 1.
This can be depicted compactly as matrices of transfer probabilities:
P = 0 0 0 0 0.8 0 0 0 0 0 0.8 0 0 0 0 0.8 0 0 0 0 P ¯ = 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 1
These probabilities mean that the transfer from platforms to actors is certain (=1), whereas actors are less likely to communicate (=0.8). If we enter the conditionals and these transfer probabilities in SPIRIT, then upon solving Equation (21), we come up with Q , whose marginal probabilities are shown in the variables in Figure 5.
Now, in contrast to Figure 4 (top), the marginal distributions have changed. They enable us to make a priori estimates of message transfers in the network, merely based on the structure. We notice lower probabilities for V = 1 for nodes more likely to be senders and higher probabilities for V = 1 for nodes more likely to be receivers. A first attempt to make these observations more transparent are the following information-theoretical considerations.
For V : = 1 as in (22), R ( Q , Q ) measures the change of the conditional structure from Q to Q , cf. Brenner et al. [18]. Rödder et al. [13] called this number the diffusion potential of a node, and in Theorem 2 on page 7975, it is shown that:
R ( Q , Q ) = log 2 Q ( V = 1 ) .
The entire conditional structure change in the network is already anticipated in (24).
Therefore, log 2 ( Q ( V 1 i = 1 ) ) is the measure of diffusion in the network for each actor i. The deeper his/her message penetrates the network, the higher his/her diffusion.
log 2 ( Q ( V 2 k = 1 ) ) is also the measure of diffusion for platform k in the network. The deeper a message known there penetrates the network—through direct or indirect contacts—the higher is its diffusion potential.
The greater log 2 Q ( V = 1 ) , the smaller is Q ( V = 1 ) for each node in the network; either for an actor or a platform. The MaxEnt distribution assigns low probabilities to related V = 1 correctly, whose network penetration—direct or indirect contacts to other nodes—is high. In the same way, it assigns high probabilities to nodes with low network penetration.
SPIRIT allows marginal probabilities to be switched to negative logarithms. Figure 6 shows these logarithms for Figure 5. As such, V 12 = 1 has the greatest diffusion potential of all actors, and V 22 = 1 has the greatest diffusion potential of all platforms; this is in line with the intuitive conditional structure of the network.
Going back to the example in Table 1 and now clicking on the value 1 for the actor i = 2 or the platform k = 2 , we get the probabilities of reception as shown in Figure 7 and Figure 8 for all actors and platforms. Obviously, they confirm the values of preset probabilities. They also demonstrate that, if actor i = 2 sends the message, this also increases the probability of reception for actors with only indirect contacts. Therefore, the knowledge processing concept implemented in SPIRIT incorporates impacts on the probabilities of reception via one or several intermediaries.
The conditional probabilities of reception as presented in Figure 7 and Figure 8 are unbiased estimates of reception opportunities for all actors and platforms after the message has been sent. For instance, the probabilities for V 21 = 1 V 12 = 1 [ 0.8 ] in Figure 7 or for V 11 = 1 V 22 = 1 [ 1.0 ] in Figure 8 are preassigned, and other conditional probabilities of reception result from the MinREnt model, as in (22). Furthermore, we notice that the probability of reception for V 24 = 1 has only increased slightly compared to the a priori probability; see Figure 5 and Figure 7.
The next section examines how the concepts developed so far can be used to analyze a real medium-sized network.

4. Analysis of the Network “Corporate Directors”

Barnes and Burkett [24] described an affiliation network called “corporate directors”. As the name suggests, this is a group of—in this case—20 directors and their memberships in 24 different institutions, such as clubs, management boards, supervisory boards, etc. The 99 affiliations are presented in the affiliation matrix in Figure A1 in Appendix A.
The preliminary column shows directors = ^ actors and the preliminary row shows institutions = ^ platforms. For reasons of consistency, we will call the directors v 1 , 1 v 1 , 20 and the institutions v 2 , 1 v 2 , 24 . Figure 9 clearly shows the bipartite structure in SPIRIT.
The aim of this section is to demonstrate the usefulness of the entropy-driven analysis with respect to the transfer of messages and knowledge between actors and/or institutions. As in Section 3.2, we assume a directional bipartite structure with the following properties:
  • Message transfer in the direction from institution to director is highly probable. Platforms are set up in order to make messages and news available to its members, where possible. As already stated in the Introduction, such message transfer might be realized via notice-boards, newsgroups, or social media. For our purposes, we choose the respective transfer probabilities to be a fictitious 0.9. Statistical analysis might help to verify such a 90% page view rate.
  • The probabilities of message transfer from actors to institutions are even more difficult to survey due to the unknown willingness of persons to share information with others. We thwart this flaw using random numbers between 0.5 and one for the transfer probabilities. A first step to predicting the posting behavior of individuals the reader might find in Kim et al. [25].
These transfer probabilities are entered into SPIRIT in a two-step process: first, for institutions to directors, then for directors to institutions. The conditionals for the former part can be deduced from adjacencies in Figure A1 in Appendix A plus 0.9 probability. The conditionals for the latter part are shown in Table A1 in Appendix A. Rödder et al. [21] set out in sufficient detail how this two-step learning process in SPIRIT can be implemented. The results are summarized in Figure 10, and the perspective according to information theory—diffusion—introduced in Section 3.2 and Equation (24) is shown in Figure 11.
As expected, diffusions in institutions tend to be greater than for directors, but some directors are also fairly communicative, e.g., directors v 1 , 11 and v 1 , 2 .
The results in Figure 11 allow us to rank actors and institutions based on their diffusion. Here, high numbers mean much influence in the network, and low numbers mean less influence. Let us look at two actors and two institutions to further demonstrate the network analysis made possible using this model. Actor v 1 , 11 has the highest diffusion and actor v 1 , 20 the lowest. The lowest and highest diffusions for institutions are those for v 2 , 11 and, e.g., v 2 , 20 ; please refer back to Figure 11. If we now click on the respective values V = 1 for these nodes, this means—as explained in detail above—that the message has been sent throughout the entire network.
Table 2 and Table 3 allow comparing the prior distribution to those after message posting.
His/her transfer probabilities make actor v 1 , 11 an opinion leader in the network. He/she reaches all other actors and raises their probability of reception significantly. This is entirely different for actor v 1 , 20 . Now, the probabilities of reception are only minimally above a priori probabilities, which indicates only marginal influence in the network. The conditionals in Table A1 might help to clarify this issue.
As was already made apparent by the diffusions for institutions, we do not expect a strong heterogeneity with regard to message transfer. The examples of the two institutions v 2 , 11 and v 2 , 20 confirm this. Nevertheless, in most cases, we can see a clear increase in the probabilities of reception compared to those of the prior distribution in the net.
To summarize, the entropy-driven approach opens up new possibilities for analysis, which were not previously available in traditional graph-based methods.

5. Summary and Prospects

This paper considered social affiliation networks. The respective graphs have a bipartite structure; the node set is bi-modal: e.g., actors and clubs. First, we present the traditional approach towards analyzing such networks; this is based on the frequencies of mutual affiliations between actors and joint memberships in clubs. Indirect connections between actors—intermediaries—and indirect connections between clubs—an intermediary club might have members from either side—are formulated mathematically, and their potential for analysis is studied. Then, directed bipartite structures are depicted mathematically and differentiated from undirected structures.
A new type of probabilistic-conditional modeling is ideally suited to analyzing directional bipartite networks. Even weighted directional edges can be formulated as probabilistic conditionals. If an actor or a club is aware of a message or has specific knowledge, then he/she or it transfers it with preset probabilities. An entropy-driven information processing grounded in artificial intelligence supports such analyses. Even actors or clubs not in direct contact with the sender of such a message receive it via intermediaries. The software SPIRIT allows for calculation of respective transfer probabilities. The power of the new model is demonstrated analyzing a well-known example called “corporate directors”. The underlying network counts 20 directors and their memberships in 24 institutions. The new method permits a ranking of actors, as well as clubs with respect to their influential power. This kind of analysis is applicable to any bipartite network structure.
There are interesting prospects for further research on this issue:
  • What are the consequences for the whole network if actors or groups of actors disappear (due to disease or career change)?
  • Might indices like centrality and centralization suitably be defined in entropy-driven analyses of bipartite social networks?
  • Can these analyses also apply to more complex structures like multigraphs or hypergraphs?
We hope that articles on these topics might stimulate our research.

Author Contributions

Conceptualization, W.R. and A.D.; methodology, W.R. and A.D. and F.K. and E.R.; software, F.K. and E.R.; formal analysis, S.L. and A.D.; visualization, S.L.; writing–original draft preparation, W.R. and A.D.; writing—review and editing, A.D. and S.L.

Funding

This research received no external funding.

Acknowledgments

This work was supported by FernUniversität in Hagen and Private Hochschule für Wirtschaft und Technik.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1 contains all conditionals from directors to institutions. Those from institutions to directors are not shown, but can be deduced from the affiliation matrix in Table A1, in conjunction with the fact outlined earlier that institutions will send messages to directors, always with a transfer probability 0.9.
Table A1. Conditionals of the Directors network from actors to institutions.
Table A1. Conditionals of the Directors network from actors to institutions.
V 25 2 = 1 V 11 2 = 1 [ 0.76 ] V 215 = 1 V 11 2 = 1 [ 0.76 ] V 218 = 1 V 11 2 = 1 [ 0.76 ] V 221 = 1 V 11 2 = 1 [ 0.76 ] V 222 = 1 V 11 2 = 1 [ 0.76 ] V 25 2 = 1 V 12 2 = 1 [ 0.91 ] V 29 2 = 1 V 12 2 = 1 [ 0.91 ] V 210 = 1 V 12 2 = 1 [ 0.91 ] V 217 = 1 V 12 2 = 1 [ 0.91 ] V 222 = 1 V 12 2 = 1 [ 0.91 ] V 21 2 = 1 V 13 2 = 1 [ 0.90 ] V 212 = 1 V 13 2 = 1 [ 0.90 ] V 21 2 = 1 V 14 2 = 1 [ 0.87 ] V 212 = 1 V 14 2 = 1 [ 0.87 ] V 221 = 1 V 14 2 = 1 [ 0.87 ] V 29 2 = 1 V 15 2 = 1 [ 0.58 ] V 213 = 1 V 15 2 = 1 [ 0.58 ] V 23 2 = 1 V 16 2 = 1 [ 0.70 ] V 29 2 = 1 V 16 2 = 1 [ 0.70 ] V 221 = 1 V 16 2 = 1 [ 0.70 ] V 22 2 = 1 V 17 2 = 1 [ 0.70 ] V 23 2 = 1 V 17 2 = 1 [ 0.70 ] V 26 2 = 1 V 17 2 = 1 [ 0.70 ] V 29 2 = 1 V 17 2 = 1 [ 0.70 ] V 221 = 1 V 17 2 = 1 [ 0.70 ] V 222 = 1 V 17 2 = 1 [ 0.70 ] V 25 2 = 1 V 18 2 = 1 [ 0.61 ] V 27 2 = 1 V 18 2 = 1 [ 0.61 ] V 210 = 1 V 18 2 = 1 [ 0.61 ] V 217 = 1 V 18 2 = 1 [ 0.61 ] V 218 = 1 V 18 2 = 1 [ 0.61 ] V 221 = 1 V 18 2 = 1 [ 0.61 ] V 223 = 1 V 18 2 = 1 [ 0.61 ] V 211 = 1 V 19 2 = 1 [ 0.54 ] V 215 = 1 V 19 2 = 1 [ 0.54 ] V 221 = 1 V 19 2 = 1 [ 0.54 ] V 222 = 1 V 19 2 = 1 [ 0.54 ] V 22 2 = 1 V 110 = 1 [ 0.76 ] V 26 2 = 1 V 110 = 1 [ 0.76 ] V 29 2 = 1 V 110 = 1 [ 0.76 ] V 221 = 1 V 110 = 1 [ 0.76 ] V 222 = 1 V 110 = 1 [ 0.76 ] V 223 = 1 V 110 = 1 [ 0.76 ] V 27 2 = 1 V 111 = 1 [ 0.99 ] V 211 = 1 V 111 = 1 [ 0.99 ] V 215 = 1 V 111 = 1 [ 0.99 ] V 217 = 1 V 111 = 1 [ 0.99 ] V 219 = 1 V 111 = 1 [ 0.99 ] V 222 = 1 V 111 = 1 [ 0.99 ] V 25 2 = 1 V 112 = 1 [ 0.57 ] V 27 2 = 1 V 112 = 1 [ 0.57 ] V 211 = 1 V 112 = 1 [ 0.57 ] V 215 = 1 V 112 = 1 [ 0.57 ] V 219 = 1 V 112 = 1 [ 0.57 ] V 221 = 1 V 112 = 1 [ 0.57 ] V 222 = 1 V 112 = 1 [ 0.57 ] V 21 2 = 1 V 113 = 1 [ 0.70 ] V 29 2 = 1 V 113 = 1 [ 0.70 ] V 210 = 1 V 113 = 1 [ 0.70 ] V 213 = 1 V 113 = 1 [ 0.70 ] V 214 = 1 V 113 = 1 [ 0.70 ] V 216 = 1 V 113 = 1 [ 0.70 ] V 219 = 1 V 113 = 1 [ 0.70 ] V 221 = 1 V 113 = 1 [ 0.70 ] V 222 = 1 V 113 = 1 [ 0.70 ] V 29 2 = 1 V 114 = 1 [ 0.77 ] V 211 = 1 V 114 = 1 [ 0.77 ] V 215 = 1 V 114 = 1 [ 0.77 ] V 216 = 1 V 114 = 1 [ 0.77 ] V 221 = 1 V 114 = 1 [ 0.77 ] V 222 = 1 V 114 = 1 [ 0.77 ] V 29 2 = 1 V 115 = 1 [ 0.65 ] V 215 = 1 V 115 = 1 [ 0.65 ] V 222 = 1 V 115 = 1 [ 0.65 ] V 24 2 = 1 V 116 = 1 [ 0.85 ] V 28 2 = 1 V 116 = 1 [ 0.85 ] V 29 2 = 1 V 116 = 1 [ 0.85 ] V 217 = 1 V 116 = 1 [ 0.85 ] V 218 = 1 V 116 = 1 [ 0.85 ] V 220 = 1 V 116 = 1 [ 0.85 ] V 221 = 1 V 116 = 1 [ 0.85 ] V 222 = 1 V 116 = 1 [ 0.85 ] V 224 = 1 V 116 = 1 [ 0.85 ] V 21 2 = 1 V 117 = 1 [ 0.77 ] V 29 2 = 1 V 117 = 1 [ 0.77 ] V 212 = 1 V 117 = 1 [ 0.77 ] V 24 2 = 1 V 118 = 1 [ 0.81 ] V 28 2 = 1 V 118 = 1 [ 0.81 ] V 219 = 1 V 118 = 1 [ 0.81 ] V 220 = 1 V 118 = 1 [ 0.81 ] V 224 = 1 V 118 = 1 [ 0.81 ] V 24 2 = 1 V 119 = 1 [ 0.87 ] V 29 2 = 1 V 119 = 1 [ 0.87 ] V 214 = 1 V 119 = 1 [ 0.87 ] V 25 2 = 1 V 120 = 1 [ 0.56 ] V 211 = 1 V 120 = 1 [ 0.56 ] V 219 = 1 V 120 = 1 [ 0.56 ] V 221 = 1 V 120 = 1 [ 0.56 ] V 222 = 1 V 120 = 1 [ 0.56 ]
Figure A1. Affiliation matrix of the network “corporate directors”.
Figure A1. Affiliation matrix of the network “corporate directors”.
Entropy 21 00277 g0a1

References

  1. Moreno, J. Who Shall Survive: A New Approach to the Problem of Human Interrelations; Nervous and Mental Disease Publishing Co.: Washington, DC, USA, 1934. [Google Scholar]
  2. Davis, A.; Gardner, B.; Gardner, M. Deep South: A Social Anthropological Study of Caste and Class; University of Chicago Press: Chicago, IL, USA, 1941. [Google Scholar]
  3. Berardo, R. Bridging and Bonding Capital in Two-Mode Collaboration Networks. Policy Stud. J. 2014, 42, 197–225. [Google Scholar] [CrossRef]
  4. Nita, A.; Rozylowicz, L.; Manolache, S.; Ciocănea, C.; Miu, I.; Popescu, V. Collaboration Networks in Applied Conservation Projects across Europe. PLoS ONE 2016, 11, 1–16. [Google Scholar] [CrossRef] [PubMed]
  5. Dai, T.; Zhu, L.; Cai, X.; Pan, S.; Yuan, S. Explore semantic topics and author communities for citation recommendation in bipartite bibliographic network. J. Ambient Intell. Humaniz. Comput. 2018, 9, 957–975. [Google Scholar] [CrossRef]
  6. Zaïane, O.; Chen, J.; Goebel, R. DBconnect: mining research community on DBLP data. In Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, San Jose, CA, USA, 12 August 2007; pp. 74–81. [Google Scholar] [CrossRef]
  7. Borgatti, S.P.; Everett, M.G. Network analysis of 2-mode data. Soc. Netw. 1997, 19, 243–269. [Google Scholar] [CrossRef]
  8. Borgatti, S.P. 2-Mode Concepts in Social Network Analysis. In Encyclopedia of Complexity and System Science; Meyers, R.A., Ed.; Springer: New York, NY, USA, 2009. [Google Scholar]
  9. Katz, L. A new status index derived from sociometric analysis. Psychometrika 1953, 18, 39–43. [Google Scholar] [CrossRef]
  10. Bonacich, P. Power and centrality: A family of measures. Am. J. Sociol. 1987, 92, 1170–1182. [Google Scholar] [CrossRef]
  11. Bonacich, P.; Lloyd, P. Eigenvector-like measures of centrality for asymmetric relations. Soc. Netw. 2001, 23, 191–201. [Google Scholar] [CrossRef] [Green Version]
  12. Everett, M.G. Centrality and the dual-projection approach for two-mode social network data. Methodol. Innov. 2016, 9, 1–8. [Google Scholar] [CrossRef]
  13. Rödder, W.; Brenner, D.; Kulmann, F. Entropy based evaluation of net structures—Deployed in Social Network Analysis. Expert Syst. Appl. 2014, 41, 7968–7979. [Google Scholar] [CrossRef]
  14. Rödder, W.; Kulmann, F.; Dellnitz, A. A new rationality in network analysis—Status of actors in a conditional-logical framework. In Computational Models of Rationality; Beierle, C., Brewka, G., Thimm, M., Eds.; College Publications: London, UK, 2016; Volume 20, pp. 348–364. [Google Scholar]
  15. Scott, J. Social Network Analysis; Sage Publications: London, UK, 2000. [Google Scholar]
  16. Borgatti, S.P. Centrality and network flow. Soc. Netw. 2005, 27, 55–71. [Google Scholar] [CrossRef] [Green Version]
  17. Kern-Isberner, G. Characterizing the principle of minimum cross-entropy within a conditional-logical framework. Artif. Intell. 1998, 98, 169–208. [Google Scholar] [CrossRef] [Green Version]
  18. Brenner, D.; Dellnitz, A.; Kulmann, F.; Rödder, W. Compressing strongly connected subgroups in social networks: An entropy-based approach. J. Math. Sociol. 2017, 41, 84–103. [Google Scholar] [CrossRef]
  19. Calabrese, P. Deduction and inference using conditional logic and probability. In Conditional Logic in Expert Systems; Goodman, I., Gupta, M., Nguyen, H., Rogers, G., Eds.; North-Holland: Amsterdam, The Netherlands, 1991; pp. 71–100. [Google Scholar]
  20. Schramm, M.; Ertel, W. Reasoning with Probabilities and Maximum Entropy: The System PIT and its Application in LEXMED. In Proceedings of the Operations Research Proceedings, Magdeburg, Germany, 1–3 September 1999; pp. 274–280. [Google Scholar]
  21. Rödder, W.; Reucher, E.; Kulmann, F. Features of the Expert-System-Shell SPIRIT. Logic J. IGPL 2006, 14, 483–500. [Google Scholar] [CrossRef]
  22. SPIRIT. 2011. Available online: http://www.xspirit.de (accessed on 15 February 2019).
  23. Rödder, W.; Dellnitz, A.; Gartner, I.; Litzinger, S. Weight prediction on missing links in social networks: A cross-entropy-based approach. J. Appl. Logics 2019, 1, 95–116. [Google Scholar]
  24. Barnes, R.; Burkett, T. Structural Redundancy and Multiplicity in Corporate Networks. Connections 2010, 30, 4–20. [Google Scholar]
  25. Kim, E.; Lee, J.A.; Sung, Y.; Choi, S.M. Predicting selfie-posting behavior on social networking sites: An extension of theory of planned behavior. Comput. Hum. Behav. 2016, 62, 116–123. [Google Scholar] [CrossRef]
Figure 1. Generic undirected affiliation network.
Figure 1. Generic undirected affiliation network.
Entropy 21 00277 g001
Figure 2. A concrete affiliation network.
Figure 2. A concrete affiliation network.
Entropy 21 00277 g002
Figure 3. A concrete directional affiliation network.
Figure 3. A concrete directional affiliation network.
Entropy 21 00277 g003
Figure 4. Marginal distributions for an empty set of conditionals before and after having evidentiated.
Figure 4. Marginal distributions for an empty set of conditionals before and after having evidentiated.
Entropy 21 00277 g004
Figure 5. A bipartite network in SPIRIT.
Figure 5. A bipartite network in SPIRIT.
Entropy 21 00277 g005
Figure 6. Information values for Figure 5.
Figure 6. Information values for Figure 5.
Entropy 21 00277 g006
Figure 7. Conditional probabilities of reception under V 12 = 1 .
Figure 7. Conditional probabilities of reception under V 12 = 1 .
Entropy 21 00277 g007
Figure 8. Conditional probabilities of reception under V 22 = 1 .
Figure 8. Conditional probabilities of reception under V 22 = 1 .
Entropy 21 00277 g008
Figure 9. Bipartite structure of corporate directors in SPIRIT.
Figure 9. Bipartite structure of corporate directors in SPIRIT.
Entropy 21 00277 g009
Figure 10. Marginal distribution across institutions and directors in SPIRIT.
Figure 10. Marginal distribution across institutions and directors in SPIRIT.
Entropy 21 00277 g010
Figure 11. Information theory perspective in relation to institutions and directors in SPIRIT.
Figure 11. Information theory perspective in relation to institutions and directors in SPIRIT.
Entropy 21 00277 g011
Table 1. A specific affiliation network.
Table 1. A specific affiliation network.
V 11 = 1 V 21 = 1 [ 1.0 ] V 11 = 1 V 22 = 1 [ 1.0 ] V 12 = 1 V 21 = 1 [ 1.0 ] V 12 = 1 V 22 = 1 [ 1.0 ] V 13 = 1 V 21 = 1 [ 1.0 ] V 13 = 1 V 22 = 1 [ 1.0 ] V 13 = 1 V 23 = 1 [ 1.0 ] V 14 = 1 V 23 = 1 [ 1.0 ] V 14 = 1 V 24 = 1 [ 1.0 ] V 15 = 1 V 24 = 1 [ 1.0 ] V 21 = 1 V 12 = 1 [ 0.8 ] V 23 = 1 V 13 = 1 [ 0.8 ] V 24 = 1 V 14 = 1 [ 0.8 ]
Table 2. A priori probabilities and probabilities of message reception for actors with evidence v 1 , 11 and v 1 , 20 .
Table 2. A priori probabilities and probabilities of message reception for actors with evidence v 1 , 11 and v 1 , 20 .
v 1 , 1 v 1 , 2 v 1 , 3 v 1 , 4 v 1 , 5 v 1 , 6 v 1 , 7 v 1 , 8 v 1 , 9 v 1 , 10 A priori 0.68 0.52 0.59 0.60 0.87 0.76 0.76 0.87 0.91 0.66 Ev . v 1 , 11 0.98 0.93 0.82 0.89 0.90 0.95 0.97 0.98 0.90 0.94 Ev . v 1 , 20 0.69 0.53 0.60 0.61 0.87 0.76 0.76 0.87 0.91 0.67 v 1 , 11 v 1 , 12 v 1 , 13 v 1 , 14 v 1 , 15 v 1 , 16 v 1 , 17 v 1 , 18 v 1 , 19 v 1 , 20 A priori 0.39 0.91 0.73 0.66 0.81 0.57 0.70 0.65 0.59 0.92 Ev . v 1 , 11 0.97 0.97 0.99 0.98 0.95 0.94 0.96 0.89 0.94 Ev . v 1 , 20 0.40 0.91 0.74 0.67 0.81 0.58 0.70 0.65 0.60
Table 3. A priori probabilities and probabilities of message reception for actors with evidence v 2 , 11 and v 2 , 20 .
Table 3. A priori probabilities and probabilities of message reception for actors with evidence v 2 , 11 and v 2 , 20 .
v 1 , 1 v 1 , 2 v 1 , 3 v 1 , 4 v 1 , 5 v 1 , 6 v 1 , 7 v 1 , 8 v 1 , 9 v 1 , 10 A priori 0.68 0.52 0.59 0.60 0.87 0.76 0.76 0.87 0.91 0.66 Ev . v 2 , 11 0.88 0.77 0.74 0.79 0.89 0.89 0.90 0.94 0.90 0.84 Ev . v 2 , 20 0.81 0.68 0.69 0.73 0.88 0.84 0.85 0.91 0.90 0.78 v 1 , 11 v 1 , 12 v 1 , 13 v 1 , 14 v 1 , 15 v 1 , 16 v 1 , 17 v 1 , 18 v 1 , 19 v 1 , 20 A priori 0.39 0.91 0.73 0.66 0.81 0.57 0.70 0.65 0.59 0.92 Ev . v 2 , 11 0.71 0.96 0.89 0.94 0.92 0.81 0.86 0.84 0.78 0.94 Ev . v 2 , 20 0.54 0.93 0.84 0.79 0.87 0.79 0.80 0.85 0.73 0.92

Share and Cite

MDPI and ACS Style

Rödder, W.; Dellnitz, A.; Kulmann, F.; Litzinger, S.; Reucher, E. Bipartite Structures in Social Networks: Traditional versus Entropy-Driven Analyses. Entropy 2019, 21, 277. https://doi.org/10.3390/e21030277

AMA Style

Rödder W, Dellnitz A, Kulmann F, Litzinger S, Reucher E. Bipartite Structures in Social Networks: Traditional versus Entropy-Driven Analyses. Entropy. 2019; 21(3):277. https://doi.org/10.3390/e21030277

Chicago/Turabian Style

Rödder, Wilhelm, Andreas Dellnitz, Friedhelm Kulmann, Sebastian Litzinger, and Elmar Reucher. 2019. "Bipartite Structures in Social Networks: Traditional versus Entropy-Driven Analyses" Entropy 21, no. 3: 277. https://doi.org/10.3390/e21030277

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop