# Structural Patterns in Complex Systems Using Multidendrograms

## Abstract

## 1. Introduction

## 2. Multidendrograms Algorithm

1. Initialize n singleton clusters, each containing one individual. Initialize the distances between clusters with the distances between individuals.
2. Find the minimum distance separating two different clusters.
3. Select two clusters separated by this minimum distance and merge them into a new supercluster.
4. Compute the distances between the new supercluster and each of the other clusters. Depending on the criterion used to compute these distances, different agglomerative hierarchical clusterings are obtained; single linkage, complete linkage, unweighted average, weighted average, unweighted centroid, weighted centroid, and Ward's method are the most commonly used.
5. If not all individuals are in the same cluster yet, go back to Step 2.
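The five steps above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: it assumes the unweighted average criterion, the function name `pair_group_upgma` is hypothetical, and ties are broken arbitrarily by taking the first minimum found, which is exactly the ambiguity multidendrograms are meant to remove.

```python
from itertools import combinations


def pair_group_upgma(dist):
    """Pair-group agglomerative clustering with unweighted average linkage.

    dist: symmetric n x n list of lists with distances between individuals.
    Returns the merge history as (cluster_a, cluster_b, height) tuples.
    """
    # Step 1: n singleton clusters, each a frozenset of individual indices.
    clusters = [frozenset([i]) for i in range(len(dist))]

    def average_distance(a, b):
        # Unweighted average: mean over all pairs of individuals (one from each cluster).
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    merges = []
    while len(clusters) > 1:  # Step 5: repeat until a single cluster remains.
        # Steps 2 and 3: pick a pair at minimum distance.
        # min() returns the FIRST minimum, so ties are broken by list order.
        a, b = min(combinations(clusters, 2), key=lambda p: average_distance(*p))
        height = average_distance(a, b)
        # Step 4: distances to the new supercluster are recomputed on demand
        # from the original matrix rather than stored.
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
        merges.append((a, b, height))
    return merges
```

Recomputing the average from the original matrix keeps the sketch short; a practical implementation would update a stored distance matrix instead.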

- When there are no ties, multidendrograms give the same result as the pair-group algorithm.
- Multidendrograms always give a uniquely determined solution thanks to the implementation of the variable-group algorithm.
- In the multidendrogram representation of the results, ties that occur during the agglomerative process are explicitly visible, and they yield a measure of the degree of heterogeneity inside the tied clusters.
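To make the variable-group idea concrete, the following sketch performs a single tie-aware merge step: every pair of clusters at the minimum distance is collected, and each connected group of tied clusters is merged at once. The function name `variable_group_step`, the `tol` parameter, and the reporting of a `(low, high)` distance band per merged group as a heterogeneity summary are assumptions made for illustration, not details confirmed by this text.

```python
from itertools import combinations


def variable_group_step(clusters, distance, tol=0.0):
    """One tie-aware merge step (illustrative sketch).

    clusters: list of frozensets of individuals.
    distance: callable returning the distance between two clusters.
    Returns (new_clusters, merged), where each entry of merged is
    (member_clusters, low, high).
    """
    pairs = list(combinations(range(len(clusters)), 2))
    d = {p: distance(clusters[p[0]], clusters[p[1]]) for p in pairs}
    d_min = min(d.values())
    tied = [p for p in pairs if d[p] - d_min <= tol]

    # Union-find over cluster indices: connected components of tied pairs.
    parent = list(range(len(clusters)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in tied:
        parent[find(i)] = find(j)

    groups = {}
    for i in range(len(clusters)):
        groups.setdefault(find(i), []).append(i)

    new_clusters, merged = [], []
    for members in groups.values():
        if len(members) == 1:
            # Cluster not involved in any tie at the minimum: keep as is.
            new_clusters.append(clusters[members[0]])
            continue
        # Assumed heterogeneity summary: span of distances inside the group.
        high = max(d[p] for p in combinations(members, 2))
        new_clusters.append(frozenset().union(*(clusters[i] for i in members)))
        merged.append(([clusters[i] for i in members], d_min, high))
    return new_clusters, merged
```

Because all tied clusters are merged together, the result does not depend on the order in which the pairs are examined. When no ties occur, each step merges exactly one pair, matching the pair-group behaviour.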

## 3. Applications

#### 3.1. Case Study: Vertex Similarity in Networks

Ravasz-Barabasi hierarchical network of 25 nodes; Zachary's karate club network [24].

Multidendrograms obtained using Leicht and Jaccard similarities, respectively. The similarities between clusters are calculated with the standard unweighted average method (equivalent to the unweighted pair group method with arithmetic mean, UPGMA, but using variable groups instead of pairs; see [18]).
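As a small illustration of a vertex similarity, the Jaccard index of two nodes can be computed from their neighbor sets as the size of the intersection divided by the size of the union. The sketch below assumes an undirected graph stored as a dictionary of neighbor sets and uses the common definition that excludes the nodes themselves; the exact variant used for the figures is not specified here, and the toy graph is hypothetical.

```python
def jaccard_similarity(adjacency, u, v):
    """Jaccard similarity of nodes u and v from their neighbor sets.

    adjacency: dict mapping each node to the set of its neighbors.
    Returns 0.0 when both nodes are isolated (empty union).
    """
    neighbors_u, neighbors_v = adjacency[u], adjacency[v]
    union = neighbors_u | neighbors_v
    return len(neighbors_u & neighbors_v) / len(union) if union else 0.0


# Hypothetical toy graph: a triangle 0-1-2 with a pendant node 3 attached to 2.
toy_graph = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
similarity_0_1 = jaccard_similarity(toy_graph, 0, 1)  # shared {2} out of {0, 1, 2}
similarity_0_3 = jaccard_similarity(toy_graph, 0, 3)  # shared {2} out of {1, 2}
```

Because similarities built from small integer counts take few distinct values, ties are frequent, which is why such data are a natural use case for multidendrograms.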

#### 3.2. Case Study: Modular Node Similarity in Networks

**Figure 2.** (**b**) Distances network of white berry varieties in the Spanish grapevine cultivars given in Table 2 of Ibáñez et al. [28], and its unweighted average multidendrogram.

#### 3.3. Case Study: Distance Similarities in Complete Weighted Networks

**Table 1.** Number of different binary trees obtained for the grapevine cultivars data in Table 2 of Ibáñez et al. [28], using distinct hierarchical clustering methods. Although the resolution of the data is equal to 3 significant digits, we show the effect that increasing the precision has on the number of possible binary trees.

| Method | | | |
|---|---|---|---|
| Unweighted Average | 17,900 | 2,208 | 2,124 |
| Weighted Average | 9,859 | 1,709 | 1,762 |
| Complete Linkage | $>10^{8}$ | $>10^{8}$ | $>10^{8}$ |
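The dependence on precision can be illustrated with a toy computation: rounding distances to fewer significant digits makes more of them coincide, and every coincidence is a potential tie for a pair-group algorithm. The sketch below uses made-up values (not the grapevine data) and hypothetical helper names; it only counts repeated distance values and does not enumerate binary trees.

```python
from collections import Counter


def round_significant(x, digits):
    """Round x to the given number of significant digits."""
    return float(f"{x:.{digits}g}")


def count_tied_values(distances, digits):
    """Number of distinct rounded values shared by more than one distance."""
    counts = Counter(round_significant(d, digits) for d in distances)
    return sum(1 for c in counts.values() if c > 1)


# Hypothetical distances, for illustration only.
toy_distances = [0.12341, 0.12349, 0.12352, 0.2]
ties_at_3_digits = count_tied_values(toy_distances, 3)
ties_at_5_digits = count_tied_values(toy_distances, 5)
```

With these toy values, one rounded value is repeated at 3 significant digits and none at 5, which mirrors the general trend that higher precision leaves fewer ties to break.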

## 4. Conclusions

## Acknowledgments

## Conflicts of Interest

