Weighted Homology of Bi-Structures over Certain Discrete Valuation Rings

Andrei Bura; Qijun He; Christian Reidys

doi:10.3390/math9070744

,

and

¹

Biocomplexity Institute and Initiative, University of Virginia, Charlottesville, VA 22904-4298, USA

²

Mathematics Department, University of Virginia, Charlottesville, VA 22904-4137, USA

^*

Author to whom correspondence should be addressed.

^†

Current address: Physical Address: Biocomplexity Institute, Town Center Four, 994 Research Park Boulevard, Charlottesville, VA 22904-4298, USA.

Mathematics2021, 9(7), 744;https://doi.org/10.3390/math9070744

This article belongs to the Section A: Algebra and Logic

Version Notes

Order Reprints

Abstract

An RNA bi-structure is a pair of RNA secondary structures that are considered as arc-diagrams. We present a novel weighted homology theory for RNA bi-structures, which was obtained through the intersections of loops. The weighted homology of the intersection complex X features a new boundary operator and is formulated over a discrete valuation ring, R. We establish basic properties of the weighted complex and show how to deform it in order to eliminate any 3-simplices. We connect the simplicial homology,

H_{i} (X)

, and weighted homology,

H_{i, R} (X)

, in two ways: first, via chain maps, and second, via the relative homology. We compute

H_{0, R} (X)

by means of a recursive contraction procedure on a weighted spanning tree and

H_{1, R} (X)

via an inflation map, by which the simplicial homology of the 1-skeleton allows us to determine the weighted homology

H_{1, R} (X)

. The homology module

H_{2, R} (X)

is naturally obtained from

H_{2} (X)

via chain maps. Furthermore, we show that all weighted homology modules

H_{i, R} (X)

are trivial for

i > 2

. The invariant factors of our structure theorems, as well as the weighted Whitehead moves facilitating the removal of filled tetrahedra, are given a combinatorial interpretation. The weighted homology of bi-structures augments the simplicial counterpart by introducing novel torsion submodules and preserving the free submodules that appear in the simplicial homology.

Keywords:

weighted simplicial complex; weighted homology; modules over PIDs (Principal Ideal Domain); torsion; embedding; spanning sub-trees

1. Introduction

This paper is concerned with the weighted homology of the nerve complex of RNA bi-structures [1]. The weighted nerve complex of such bi-structures provides a framework for studying the computational complexity of algorithms for identifying sequences that are thermodynamically stable with respect to a given bi-structure. Such sequences hold the key for evolutionary optimization [2].

The weighted homology introduced here augments the simplicial homology by making the weights of the simplices an integral part of the theory. In our situation, these weights encode the cardinality of intersections of loops in a bi-structure. Along the lines of [3], the notion of a weighted complex is introduced. Differently from [3], the weights of simplices in this paper are taken from a discrete valuation ring that contains a copy of the integers as units and in which the weight of a simplex divides the weight of its faces.

1.1. Background

RNA sequences are linear, single-stranded sequences composed of the bases A,U,G, and C. In contrast to double-stranded DNA, RNA sequences fold into structures (see Figure 1). A particularly important class of RNA structures is that of RNA secondary structures [4]. Secondary structures can be represented as planar, non-crossing arc diagrams [5,6]. By means of its folded structure, RNA facilitates a plethora of important functional roles [7]. The boundary components of an RNA secondary structure, when viewed as a fatgraph [8], are called loops, and they determine the minimum free energy of the structure.

Figure 1. Left: a planar RNA secondary structure with a closing base pair

r = (0, 20)

. Right: its diagram representation on the set of vertices

[19]

with two additional vertices, 0 and 20, forming the rainbow

r = (0, 20)

, the loop

s = [3, 4] \cup [8, 12] \cup [16, 17]

(shaded), and the corresponding intervals (underlined). The arc

x = (3, 17)

is the maximal arc of s,

α_{s} = x

.

Secondary structures have been studied from the perspectives of enumerative combinatorics [9,10,11,12], algebraic combinatorics [13], matrix models [14,15], and topology [16,17,18].

In [10], a bijection between linear trees and secondary structures was presented. In [13], the authors enumerated k-non-crossing RNA structures by employing a bijection between k-non-crossing partial matchings and walks in the interior of the Weyl-chamber

C_{0}

based on a certain bijection between oscillating tableaux and matchings [19,20,21]. The geometrical realization of a certain complex of secondary structures was studied in [22].

A key determinant of a sequence–structure pair is its thermodynamic stability, which is measured via its free energy. Given a sequence

σ

, RNA structure prediction algorithms identify structures by assuming minimal or low free energy with respect to

σ

[23]. Given any RNA sequence–structure pair

(a, S)

, the free energy

η (a, S)

is computed by adding all loop energy contributions [24,25], i.e.,

η (a, S) = \sum_{s \in S} η (a, s)

. Here,

η (a, s)

depends on the loop type of s (hairpin, interior, multi-branch, etc.) and the particular nucleotides of the sequence a that appear in the loop s. Details on loop decompositions, partition functions, and recursive constructions for the folded structure—derived via polynomial-time dynamic programming (DP)—can be found in [9,23,26,27,28]. In [12], the notion of bi-secondary structures was introduced. Bi-secondary structures are central to identifying sequences that can realize two mutually exclusive minimum free energy (mfe) conformations. Such sequences naturally appear in the context of evolutionary transitions [29] and of RNA riboswitches, i.e., sequences that exhibit two distinct, stable configurations [30].

1.2. Motivation

This paper was motivated by the problem of computing thermodynamically stable sequences for two distinguished secondary structures, S and T [2]. The problem can be formalized as the computation of a sequence a that minimizes

η (a, S, T)

, where

η (a, S, T) = \sum_{s \in S} η (a, s) + \sum_{t \in T} η (a, t) .

The sub-problems of the underlying DP routine are associated with sets of loops and recursively constructed by adding one loop at a time, where subsequently added nucleotides affect the energy calculation if they appear in multiple loops simultaneously; see Figure 2. This leads to the consideration of all such intersections as a simplicial complex, X, whose homology provides insights into the above optimization problem.

Figure 2. Top: a bi-structure decomposed into a distinguished sub-structure and its complement. The vertices contained in both (white) control the complexity of the dynamic programming (DP) routine in [2]. Bottom: the recursive step of the DP routine. Note that the recursion changes the set of white vertices by removing some and adding others.

In [1,31], we computed the simplicial homology groups of X and proved that only the second homology group carries information. This group was identified to be free, and its generators were identified as the crossing components within the diagram representation. All known RNA riboswitches exhibit one such crossing component, while random pairs of structures contain multiple crossing components.

The simplicial homology of X in [1] only allows us to express the existence of loop intersections. However, the crucial determinant for the algorithmic complexity of the DP routine in [2] is the size of such intersections; see Figure 2. The recursion is over the set of loops, successively examining one loop at a time. Each such examination affects certain vertices that the currently examined loop shares with the unprocessed loops.

It is thus natural to employ a homology theory that accounts for the intersection size. The notion of weighted homology as originally formulated [3], assigns a certain weight to each simplex, imposing specific divisibility conditions on these weights.

Our framework represents a variation on weighted homology as follows: We consider homology with coefficients in a discrete valuation ring, which, in fact, makes any divisibility constraints obsolete and allows us to deal with the cardinality of intersections in a natural way. The boundary operator introduced here is new, and so is our approach to connecting the weighted and simplicial homologies. More specifically, the construction of an inflation map relating

H_{1} (X^{1})

and

H_{1, R} (X^{1})

is a new technique. The computation of the weighted homology groups,

H_{0, R}

,

H_{1, R} (X)

, and

H_{2, R} (X)

, is also novel. Specifically, the idea of obtaining a combinatorial interpretation for the invariant factors in the structure theorems for

H_{0, R}

and

H_{1, R} (X)

via particular

X^{1}

-spanning trees is original. The chain maps between

C_{n} (X)

and

C_{n, R} (X)

, which relate the

H_{i} (X)

and

H_{i, R} (X)

introduced here, are new and nontrivial. In particular, the injection

inj : H_{2} (X) \to H_{2, R} (X)

that induces the isomorphism

H_{2, R} (X) ≅ R \otimes_{Z} H_{2} (X)

is a result of the fact that, in the expressions for the generators of

H_{2, R} (X)

, any triangle has a corresponding intersection size of one.

1.3. Organization

The paper is organized as follows: In Section 2, the simplicial loop complex of a bi-structure is introduced and the splicing of nucleotides is recalled [1]. In Section 3, the weighted complex of a bi-structure and a modified splicing procedure are discussed. In Section 4, the weighted homology and simplicial collapses [3,32] are introduced. In Section 5, a relative point of view is adopted by relating the simplicial and weighted homologies. Specifically, we introduce the inflation map, which maps classes of unweighted 1-cycles into corresponding weighted 1-cycles. In Section 6, we construct certain spanning trees that will be instrumental for proving our main results in Section 7. Here, we compute the weighted homology modules of bi-structures and provide combinatorial interpretations. The torsion module emerging in the zeroth homology stems from a recursive contraction scheme on the distinguished weighted spanning trees of the complex. The torsion of the first homology module can be understood via particular weighted edges that reference the aforementioned spanning tree. We conclude the paper with the computation of the second homology module, with all others being trivial. We integrate our results and outline future work in Section 8.

2. The Simplicial Loop Complex of a Bi-Structure

An RNA diagram S over the set

[n] : = {1, \dots, n}

is a vertex-labeled graph—whose vertices represent nucleotides—drawn on the horizontal axis and labeled with elements of the set

[n]

. An arc

α = (i, j), i < j

, is an ordered pair of vertices, which represents the base pairing between the i-th and j-th nucleotides. Each S-vertex can be paired with at most one other vertex, and the arc that connects them is drawn in the upper half-plane. An RNA diagram over the set

[n]

is augmented with two additional vertices associated with positions 0 and

n + 1

, together with the arc

(0, n + 1)

, which is called the rainbow, representing the closing base pair of the external loop of the RNA structure; see [23]. The set of vertices

{i, i + 1, \dots, j - 1, j}

is called an interval, and is denoted by

[i, j]

. The set

[0, n + 1] : = {0, 1, \dots, n, n + 1}

is referred to as the backbone of the diagram. Two arcs,

(i, j)

and

(p, q)

with

i < p

, are called crossing if and only if

i < p < j < q

. S is called a secondary structure if it does not contain any crossing arcs. A loop s in a secondary structure S is represented as the disjoint union of intervals

s = {⋃^{˙}}_{i = 1}^{k} [a_{i}, b_{i}]

, such that

(a_{1}, b_{k})

and

(b_{i}, a_{i + 1})

, for

1 \leq i \leq k - 1

, are arcs, and any other interval-vertices are unpaired. We denote by

α_{s}

the unique, maximal arc

(a_{1}, b_{k})

of the loop s, and we have a correspondence between loops and their maximal arcs; see Figure 1.

S-arcs and loops can be endowed with a partial order:

(k, l) ≺_{S} (i, j) \Leftrightarrow i < k < l < j

. Abusing the notation, for the two loops

s_{1}, s_{2} \in S

, we write

s_{1} ≺_{S} s_{2}

if

α_{s_{1}} ≺_{S} α_{s_{2}}

. Accordingly, (a) any unpaired vertex is contained in exactly one loop, (b) any non-rainbow arc appears in exactly two loops, and (c) the Hasse diagram of

(S, ≺_{S})

is a rooted tree

Tr (S)

with the rainbow arc as the root.

Given two secondary structures S and T, we refer to

R = (S, T)

as a bi-secondary structure (bi-structure). Abusing the notation, we let

R = S \dot{\cup} T

be the loop set of R. We represent a bi-structure

R = (S, T)

with the S-arcs in the upper and the T-arcs in the lower half-plane along the same horizontal backbone.

Let

A = {A_{0}, A_{1}, \dots, A_{m}}

be a collection of finite sets. We call

B = {A_{i_{0}}, \dots, A_{i_{d}}} \subseteq A

a d-simplex of A if

⋂_{k = 0}^{d} A_{i_{k}} \neq ⌀

. We set

Ω (B) = ⋂_{k = 0}^{d} A_{i_{k}}

and let

ω (B) = | Ω (B) | \neq 0

. Let

K_{d} (A)

be the set of all d-simplices of A. Then, the complex (nerve) of A is

K (A) = ⋃_{d = 0}^{\infty} K_{d} (A) \subseteq 2^{A} .

A

d^{'}

-simplex

B^{'} \in K (A)

is called a

d^{'}

-face of B if

d^{'} \leq d

and

B^{'} \subseteq B

. Let

B^{'}

be a

d^{'}

-face of a maximal simplex B, where

d^{'} < d

. Then,

B^{'}

is B-free if no other maximal simplex of

K (A)

contains

B^{'}

as a face. By construction,

K (A)

is a simplicial complex. For a secondary structure S,

K (S) ≅ Tr (S)

is a tree.

Let

R = (S, T)

be a bi-structure with the loop complex

X = {⋃^{˙}}_{d = 0}^{\infty} K_{d} (R)

. Then, a simplicial order,

≺_{R}

, is given by

r_{1} ≺_{R} r_{2} \Leftrightarrow \{\begin{matrix} r_{1}, r_{2} \in T and r_{1} ≺_{T} r_{2} \\ r_{1}, r_{2} \in S and r_{1} ≺_{S} r_{2} \\ r_{1} \in S, r_{2} \in T \end{matrix} .

We specify a d-simplex,

σ \in X

, by the ordered

d + 1

-tuple

σ = [r_{i_{0}}, r_{i_{1}}, . . ., r_{i_{d}}]

, where

r_{i_{0}} ≺_{R} r_{i_{1}} ≺_{R} \dots ≺_{R} r_{i_{d}}

.

A 1-simplex

τ = [r_{i_{0}}, r_{i_{1}}] \in X

is called pure if

r_{i_{0}}

and

r_{i_{1}}

are loops in the same secondary structure and mixed otherwise; see Figure 3.

Figure 3. Left: a bi-structure

R = (S, T)

with S-loops

a, b

, S-rainbow arc

α_{a}

, T-loops

c, d

, and T-rainbow arc

α_{c}

. Right: its weighted loop nerve

K_{0} (R) = {a, b, c, d}, K_{1} (R) = {[b, a], [b, c], [b, d], [d, c], [a, c], [a, d]}

(with

[b, a]

and

[d, c]

being pure edges, and any other edge being mixed), and

K_{2} (R) = {[b, a, c], [b, d, c], [b, a, d], [a, d, c]}

. We have

ω (a) = 7

,

ω (b) = 7

,

ω (c) = 8

,

ω (d) = 6

,

ω ([b, a, c]) = ω ([b, d, c]) = ω ([b, a, d]) = ω ([a, d, c]) = 1

, while the weights of 1-simplices are displayed directly in the figure.

By construction, any 2-simplex

Δ \in X

contains exactly one pure and two mixed edges. Furthermore, X cannot contain simplices of a dimension greater than or equal to four, as that would imply that three or more loops in the same secondary structure intersect non-trivially. Moreover, for any

σ = [s_{0}, s_{1}, t_{0}, t_{1}] \in K_{3} (X)

,

1 \leq ω (σ) \leq 2

and

Ω (σ) \subset P = {p \in [n] ∣ \deg (p) = 4}

.

In [31], we investigated the effect of splicing a nucleotide p into two adjacent nucleotides

q_{1}, q_{2}

such that the two arcs incident to p were resolved into two non-crossing arcs. In the case

ω (σ) = 2

, splitting does not change X, and if

ω (σ) = 1

, splitting induces a specific alteration: the removal of

σ

from X, as well as a distinguished free edge,

τ^{σ}

, and exactly two free triangles,

Δ_{1}^{σ}, Δ_{2}^{σ}

, glued at

τ^{σ}

. We refer to

(τ^{σ}, Δ_{1}^{σ}, Δ_{2}^{σ})

as a

σ

-butterfly; see Figure 4.

Figure 4. Splicing and removing tetrahedra together with their corresponding butterflies.

Lemma 1.

Let

R = (S, T)

be a bi-structure with complex X; then, there exists a simple bi-structure

R = (S^{'}, T^{'})

with a complex

X^{'} < X

, where

X^{'}

is derived from X by successively removing any 3-simplex, σ, together with an associated σ-butterfly.

Thus, successively splicing all nucleotides in P induces a bi-structure

(S^{'}, T^{'})

, together with an associated complex

X^{'} < X

, which is obtained by removing all

σ

-butterflies and their corresponding

σ

s from X.

Removing a

σ

-butterfly and its corresponding

σ

is equivalent to

X ↘ X^{'}

—a simplicial collapse consisting of two elementary collapses [32,33]. The first removes

σ

and

Δ_{1}^{σ}

, and the second removes

τ^{σ}

and

Δ_{2}^{σ}

.

Proposition 1.

Let X be the loop complex of a bi-structure

R = (S, T)

, and let

X^{'} < X

be an X-sub complex obtained by removing all 3-simplices, σ, from X, together with corresponding σ-butterflies. Then,

X^{'}

is the complex of a bi-structure

R^{'} = (S^{'}, T^{'})

that satisfies

{(X^{'})}^{2} = X^{'}

for any

k \geq 0

,

H_{k} (X^{'}) ≅ H_{k} (X)

.

Note that after splicing, we have simplified X to a complex of a bi-structure,

X^{'}

, thus reducing its maximum simplex dimension and maintaining its homology. In the following, we generalize this to weighted complexes.

3. $μ$ -Splicings and the Weighted Complex of a Bi-Structure

We begin by modifying the splicing introduced in Section 2 as follows: Instead of replacing a nucleotide

p \in P

by the two nucleotides

q_{1} < q_{2}

, we substitute it by three nucleotides,

q_{1} < a < q_{2}

, such that the arcs in R that share p as an endpoint now have endpoints

q_{1}

and

q_{2}

, which are non-crossing, and a is unpaired. We call this a

μ

-splicing of p.

Note that

μ

-splicing and splicing all

p \in P

have exactly the same effect on X—namely, eventually removing any 3-simplex

σ

and a corresponding

σ

-butterfly. In terms of bi-structures, from

(S, T)

,

μ

-splicing P produces the bi-structure

(S^{'}, T^{'})

augmented by a set of distinguished, unpaired nucleotides

A = {a_{1}, \dots, a_{n}}

, which only appear in 0- and 1-simplices. By construction, the complex

X^{'}

of

(S^{'}, T^{'})

does not contain any 3-simplices. Furthermore, after

μ

-splicing P, any distinguished nucleotide produced is flanked by two distinct endpoints of non-crossing arcs. Thus, for any simplex

σ \in X^{'}

, we have

h (σ) \geq 0

, where

h (σ)

denotes the difference between the number of non-distinguished and distinguished nucleotides contained in

σ

. Note that h is a natural generalization of

ω

in the context of complexes of bi-structures. Accordingly, we define a weighted complex associated with a bi-structure as follows.

First, we recall that a ring R is called a discrete valuation ring if it is a Principal Ideal Domain (PID) with exactly one non-zero maximal ideal. Any irreducible element is a generator of this ideal and is called a uniformizer for R, and furthermore, any two such elements differ only up to multiplication with a unit.

Definition 1.

Let X be a complex of a bi-structure

(S, T)

such that for any

σ \in X

, we have

h (σ) \geq 0

. Let

R \supset Z

be a discrete valuation ring with uniformizer π. We define the weight function

v : X \to R

,

v (σ) = π^{h (σ)}

, and call

(X, v)

the weighted complex of

(S, T)

.

Lemma 2.

Let

(S, T)

be a bi-structure with no distinguished vertices and complex X. Let

(S^{'}, T^{'})

be derived from

(S, T)

via μ-splicing P, and we denote its complex by

X^{'}

. Then, there exists an embedding

ϵ : (X^{'}, v) ⟶ (X, v),

where

ϵ (X^{'})

is a X-subcomplex obtained by removing any 3-simplices and associated butterflies. Moreover, ϵ is an embedding of weighted complexes:

σ = ϵ (σ^{'}) \Rightarrow v (σ^{'}) = v (σ)

.

Proof.

Successively

μ

-splicing P generates the series of complexes

X_{| P |} = X > \dots > X_{0} = X^{'}

. By construction, given

X_{i}

and a 3-simplex

σ

, the corresponding

μ

-splicing at p replaces p with the triple

q_{1}^{i} < a_{i} < q_{2}^{i}

. Since

a_{i}

is unpaired,

a_{i}

is contained in exactly two loops,

λ_{i - 1}, μ_{i - 1}

, the only two loops that change size in terms of number of nucleotides when passing to

X_{i - 1}

. Since the distinguished nucleotide contributes a

- 1

, we note that

v (λ_{i - 1}) = v (λ_{i})

and

v (μ_{i - 1}) = v (μ_{i})

; furthermore,

v ([λ_{i}, μ_{i}]) = v ([λ_{i - 1}, μ_{i - 1}])

. Any other

X_{i}

-simplex—differently from

σ

and its

σ

-butterfly, which are removed—is not affected by the splicing, and consequently retains its weight in

X_{i - 1}

, whence the lemma. □

4. Weighted Homology

Consider

\partial_{n}^{v} : C_{n, R} (X) \to C_{n - 1, R} (X)

, where

C_{n, R} (X)

denotes the free R-module generated by all n-simplices contained in X. Let

\partial_{n}^{v}

be given by

\partial_{n}^{v} (σ) = \sum_{i = 0}^{n} \frac{v ({\hat{σ}}_{i})}{v (σ)} \cdot {(- 1)}^{i} {\hat{σ}}_{i},

where the symbol

\frac{v ({\hat{σ}}_{i})}{v (σ)} = \frac{π^{h ({\hat{σ}}_{i})}}{π^{h (σ)}}

is defined to be

π^{h ({\hat{σ}}_{i}) - h (σ)}

. This is, in view of

h (σ) \leq h ({\hat{σ}}_{i})

, a non-negative power of the uniformizer

π

and, hence, still an element of the ring R. As a result, the boundary operator

\partial_{n}^{v}

is well defined. Since

\frac{v ({\hat{σ}}_{i, j})}{v ({\hat{σ}}_{i})} \cdot \frac{v ({\hat{σ}}_{i})}{v (σ)} = \frac{v ({\hat{σ}}_{j, i})}{v ({\hat{σ}}_{j})} \cdot \frac{v ({\hat{σ}}_{j})}{v (σ)}

, we obtain

\partial_{n - 1}^{v} (\partial_{n}^{v} (σ)) = 0

, i.e.,

\partial_{n}^{v}

is a boundary map.

Let

H_{n, R} (X)

denote the

\partial_{n}^{v}

-homology groups. For

(Y, v) \leq (X, v)

, we have

\begin{matrix} 0 ⟶ & C_{n, R} (Y) & \overset{I}{⟶} & C_{n, R} (X) & \overset{J}{⟶} & C_{n, R} (X, Y) & ⟶ 0, \\ ↓ \partial^{v} & ↓ \partial^{v} & ↓ {\bar{\partial}}^{v} \\ 0 ⟶ & C_{n - 1, R} (Y) & \overset{I}{⟶} & C_{n - 1, R} (X) & \overset{J}{⟶} & C_{n - 1, R} (X, Y) & ⟶ 0 \end{matrix}

and we denote the relative homology groups by

H_{n, R} (X, Y)

. These are connected via the long exact sequence

\dots ⟶ H_{n, R} (X, Y) \overset{{\hat{\partial}}_{j}^{v}}{⟶} H_{n - 1} (Y) \overset{I_{*}}{⟶} H_{n - 1} (X) \overset{J_{*}}{⟶} H_{n - 1} (X, Y) \overset{{\hat{\partial}}_{n - 1}^{v}}{⟶} \dots .

Lemma 2 guarantees that, for any weighted complex

(X, v)

of a bi-structure

(S, T)

, there exists a bi-structure

(S^{'}, T^{'})

with a weighted complex

(X^{'}, v)

such that

(X^{'}, v) < (X, v)

and where

{X^{'}}^{2} = X^{'}

. Thus, the relative homology groups

H_{k, R} (X, X^{'})

and their associated long sequence are well defined.

However, a weighted complex does not have to be induced by a bi-structure. The previous definitions still hold as long as

σ^{'} \subseteq σ \Rightarrow v (σ) | v (σ^{'})

for an arbitrary

(X, v)

-pair. Weighted complexes represent a general combinatorial-algebraic framework that enhances the simplicial homology for a wide range of applications. The next proposition extracts the algebraic core of

μ

-splicings; it represents a variation of [32], and it also appears in some form in [3].

Proposition 2.

Let

(X, v)

be a weighted complex and let

τ, σ \in X

such that: (a) τ is σ-free; (b)

v (τ) = v (σ)

. Let Y be the X-subcomplex obtained by removing any

τ \subset τ^{'} \subset σ

(

X ↘ Y

). Then,

\forall n \geq 0; H_{n, R} (X) ≅ H_{n, R} (Y) .

Proof.

Claim 1. It suffices to prove the Lemma for a σ-free τ with the property

dim (τ) + 1 = dim (σ)

.

We may assume that

σ = [v_{0}, \dots, v_{k}]

and

τ = [v_{0}]

. The lattice of sub-simplices of

σ

is isomorphic to a binary k-cube, where cover relations are induced by face inclusions.

We prove by induction on k that a k-cube can be recursively decomposed into a sequence of pairs

Σ_{k} = {(σ_{i}, τ_{i})}_{1 \leq i \leq 2^{k - 1}}

, where

σ_{i}

is a maximal simplex with a free face

τ_{i}

. The induction basis

Σ_{1} = ([1], [0])

is immediate.

For the induction step, note that a k-cube decomposes into two disjoint copies of a

(k - 1)

-cube based on if the last coordinate is zero or one. By the induction hypothesis, for a

(k - 1)

-cube, there exists a sequence,

Σ_{k - 1} = {(σ_{i}, τ_{i})}_{1 \leq i \leq 2^{k - 2}}

, with the desired properties; appending one or zero as the last coordinate yields two mappings from the

k - 1

cube into the k cube. By construction, these form a family of disjoint-type embeddings, i.e., their images are injective copies of the

k - 1

-cube that are vertex disjoint in the k-cube. Accordingly, we obtain

Σ_{k - 1}^{1} = {(σ_{i}^{1}, τ_{i}^{1})}_{i}

and

Σ_{k - 1}^{0} = {(σ_{i}^{0}, τ_{i}^{0})}_{i}

. As the two sub-cubes are disjoint, we can immediately construct the desired sequence for the k-cube,

Σ_{k} = Σ_{k - 1}^{1} Σ_{k - 1}^{0}

; the claim follows.

Claim 2. Let

X ↘ Y

, where

σ \in X

maximal, τ is σ-free, and

dim (τ) + 1 = dim (σ) = k

. Then, for any

n \geq 0

,

H_{n, R} (X, Y) = 0

.

Note that only

C_{k, R} (X, Y)

and

C_{k - 1, R} (X, Y)

are non-trivial.

C_{k + 1, R} (X, Y) = 0

implies

Im ({\hat{\partial}}_{k + 1}^{v}) = 0

; hence,

H_{k, R} (X, Y) = Ker ({\hat{\partial}}_{k}^{v})

. Note that

C_{k, R} (X, Y) = {⟨ σ ⟩}_{R}

, so

{\hat{\partial}}_{k}^{v} (r \cdot σ) = 0

implies

r = 0

; thus,

H_{k, R} (X, Y) = 0

. Similarly,

C_{k - 1, R} (X, Y) = {⟨ τ ⟩}_{R}

, so

Ker ({\hat{\partial}}_{k - 1}^{v}) \subset C_{k - 1, R} (X, X^{'})

is also generated by

τ

. Now,

{\hat{\partial}}_{k}^{v} (σ) = v (τ) / v (σ) \cdot τ = τ

, and so

H_{k - 1, R} (X, Y) = 0

. Finally, for

n \neq k, k - 1

,

Ker ({\hat{\partial}}_{n}^{v}) \subset C_{n, R} (X, Y) = 0

, so

H_{n, R} (X, Y) = 0

for any

n \neq k, k - 1

, and Claim 2 follows.

The proposition is implied in view of Claim 2 and the following long exact sequence:

⟶ H_{n + 1, R} (X, Y) \overset{\partial_{n}^{v}}{⟶} H_{n, R} (Y) \overset{I_{*}}{⟶} H_{n, R} (X) \overset{J_{*}}{⟶} H_{n, R} (X, Y) ⟶ .

□

In view of

X ↘ X^{'}

and

v (τ^{σ}) = v (Δ_{1}^{σ}) = v (Δ_{2}^{σ}) = v (σ)

, for any

σ \in X^{3}

and its corresponding

σ

-butterfly, Proposition 2 implies the following.

Corollary 1.

Let

(X, v)

be the complex of the bi-structure

(S, T)

, and let

(X^{'}, v)

be the complex of

(S^{'}, T^{'})

that is obtained by completely μ-splicing

(S, T)

. Then,

\forall k \geq 0, H_{k, R} (X^{'}) ≅ H_{k, R} (X) .

5. The Inflation Map

In view of Corollary 1, we can assume that X is the complex of a bi-structure with the property

X^{2} = X

, where

X^{k}

denotes the simplicial complex induced by all k-simplices in X [34]. Clearly, the natural embedding

X^{1} \to X

is an embedding of weighted complexes, and we have the exact sequence

\begin{matrix} 0 ⟶ & C_{n, R} (X^{1}) & \overset{I}{⟶} & C_{n, R} (X) & \overset{J}{⟶} & C_{n, R} (X, X^{1}) & ⟶ 0, \\ ↓ \partial^{v} & ↓ \partial^{v} & \overset{{\bar{\partial}}^{v}}{↓} \\ 0 ⟶ & C_{n - 1, R} (X^{1}) & \overset{I}{⟶} & C_{n - 1, R} (X) & \overset{J}{⟶} & C_{n - 1, R} (X, X^{1}) & ⟶ 0 \end{matrix}

producing the long exact sequence

0 \overset{I_{*}}{⟶} H_{2, R} (X) \overset{J_{*}}{⟶} H_{2, R} (X, X^{1}) \overset{{\hat{\partial}}_{2}^{v}}{⟶} H_{1, R} (X^{1}) \overset{I_{*}}{⟶} H_{1, R} (X) ⟶ 0 .

(1)

We now adopt a relative point of view by relating the simplicial homology

H_{k} (X)

to the weighted homology

H_{k, R} (X)

. To this end, we consider the following two segments of the long homology sequences

\begin{matrix} 0 ⟶ & H_{2, R} (X) & \overset{J_{*}}{⟶} & H_{2, R} (X, X^{1}) & \overset{{\hat{\partial}}_{2}^{v}}{⟶} & H_{1, R} (X^{1}) & \overset{I_{*}}{⟶} & H_{1, R} (X) & ⟶ 0, \\ inj ↑ & inj ↑ & ? ↑ \\ 0 ⟶ & H_{2} (X) & \overset{J_{*}}{⟶} & H_{2} (X, X^{1}) & \overset{{\hat{\partial}}_{2}}{⟶} & H_{1} (X^{1}) & \overset{I_{*}}{⟶} & ⟶ 0 \end{matrix}

where the surjectivity of

{\hat{\partial}}_{2}

is given by

H_{1} (X) = 0

[1].

While these sequences belong to different categories, the first two vertical maps are natural injections and relate simplicial and weighted homology modules.

Indeed, the first vertical injection is induced by chain maps of the form

θ_{n} : C_{n} (X) \overset{}{\to} C_{n, R} (X)

,

θ (\sum_{i} n_{i} σ_{i}) = \sum_{i} n_{i} v (σ_{i}) σ_{i}

, which make the following diagram commutative:

\begin{matrix} \dots ⟶ & C_{n} (X) & \overset{\partial_{n}}{⟶} & C_{n - 1} (X) & ⟶ \dots \\ ↓ θ_{n} & ↓ θ_{n - 1} \\ \dots ⟶ & C_{n - 1} (X) & \overset{\partial_{n}^{v}}{⟶} & C_{n - 1, R} (X) & ⟶ \dots \end{matrix}

The second vertical injection is given by the fact that, since

Z \subset R

,

C_{2} (X)

is an R-submodule of

C_{2, R} (X)

and, by construction,

H_{2} (X, X^{1}) = C_{2} (X)

and

H_{2, R} (X, X^{1}) = C_{2, R} (X)

.

This motivates us to ask if there exists a

Z

-linear map that makes the above diagram commutative. It turns out that such a map exists and will prove useful for understanding

H_{1, R} (X)

.

Lemma 3.

The mapping

infl : H_{1} (X^{1}) \to H_{1, R} (X^{1}), infl (c) = {\hat{\partial}}_{2}^{v} (\sum_{i} Δ_{i}),

where

c = \hat{\partial_{2}} (\sum_{i} Δ_{i})

, is well defined and

Z

-linear; the following diagram is commutative:

\begin{matrix} H_{2, R} (X, X^{1}) & \overset{{\hat{\partial}}_{2}^{v}}{⟶} & H_{1, R} (X^{1}) \\ inj ↑ & infl ↑ \\ H_{2} (X, X^{1}) & \overset{{\hat{\partial}}_{2}}{⟶} & H_{1} (X^{1}) . \end{matrix}

Proof.

By construction, it suffices to show that infl is well defined. Observe that, in view of sequence (1),

{\hat{\partial}}_{2}

is surjective. Hence, any

c \in H_{1} (X^{1})

can be expressed as

c = \sum_{i} {\hat{\partial}}_{2} (Δ_{i})

, where

\sum_{i} Δ_{i}

is unique, modulo elements of

Ker ({\hat{\partial}}_{2})

. Secondly, in [31], we showed that

Ker ({\hat{\partial}}_{2}) = ⨁_{β} Z \nabla_{β},

where

\nabla_{β} = \sum_{j} Δ_{j}

;

Δ_{j}

is contained in the

(S, T)

-crossing component

β

and

v (Δ_{j}) = 1

.

Any face e contained in

{\hat{\partial}}_{2} (\nabla_{β})

appears with opposite signs in two distinct the 2-simplices

Δ_{e}

and

Δ_{e}^{'}

, where

v (Δ_{e}) = v (Δ_{e}^{'}) = 1

. Therefore,

{\hat{\partial}}_{2}^{v} (\nabla_{β}) = \sum_{e \in {\hat{\partial}}_{2} (\nabla_{β})} (\frac{v (e)}{v (Δ_{e})} \cdot e - \frac{v (e)}{v (Δ_{e}^{'})} \cdot e) = 0,

i.e.,

inj (\nabla_{β}) \in Ker ({\hat{\partial}}_{2}^{v})

; infl is well defined, and a lemma follows. □

We next employ

infl : H_{1} (X^{1}) \to H_{1, R} (X^{1})

to obtain information about

H_{1, R} (X^{1})

.

X^{1}

is connected, and for a spanning tree

T_{X^{1}}

, the free generators of

H_{1} (X^{1})

are in bijection with

κ

’s mixed

X^{1}

cycle closing edges for

T_{X^{1}}

. A generator

c_{κ} \in H_{1} (X^{1})

is a sum of edges

u_{j} \in T_{X^{1}}

and the closing edge

κ

, with each such edge appearing exactly once. We fix

T_{X^{1}}

as follows: Consider the trees

T_{S}, T_{T}

of the sub-complexes for S and T, respectively, and let

T_{X^{1}}

be the tree obtained by connecting the

T_{S}, T_{T}

-roots with the mixed edge

ω

, corresponding to the two rainbow loops.

Lemma 4.

The following sequence of R modules is exact:

0 ⟶ R \otimes_{Z} infl {(H_{1} (X^{1}))}^{\overset{inj}{⟶}} H_{1, R} (X^{1}) \overset{I_{*}}{⟶} H_{1, R} (X) ⟶ 0 .

Proof.

With

T_{X^{1}}

as above, consider a generator

c_{κ} = \sum_{j} u_{j} \in H_{1} (X^{1})

where, for

1 < j < n

,

u_{j}

are pure S- or T-edges and

u_{1} = ω

while

u_{n} = κ

. Since

{\hat{\partial}}_{2}

is surjective, we can also write

c_{κ} = \sum_{i} {\hat{\partial}}_{2} (Δ_{i}) = \sum_{i} (e_{0, i} - e_{1, i} + e_{2, i}) .

This involves additional mutually canceling edges, k, all of which are mixed and appear as faces of pairs of 2-simplices,

(Δ_{k}, Δ_{k}^{'})

. Without loss of generality (w.l.o.g.), we can traverse the cycle

c_{κ}

beginning at

ω

via the pure S-edges until we arrive at

κ

, after which we return back to

ω

via the pure T-edges. Any

(Δ_{k}, Δ_{k}^{'})

pair satisfies either

v (Δ_{k}) = v (Δ_{k}^{'})

or

v (Δ_{k}) \neq v (Δ_{k}^{'})

, and we accordingly categorize the k edges into

k_{α}

- and

k_{β}

-edges, respectively. As a result, in

infl (c_{κ}) = \sum_{i} {\hat{\partial}}_{2}^{v} (Δ_{i}) = \sum_{i} (\frac{v (e_{0, i})}{v (Δ_{i})} \cdot e_{0, i} - \frac{v (e_{1, i})}{v (Δ_{i})} \cdot e_{1, i} + \frac{v (e_{2, i})}{v (Δ_{i})} \cdot e_{2, i}),

k_{β}

-edges persist with the coefficient, say

f (k_{β}) \in R

, while all

k_{α}

-edges cancel. Hence,

infl (c_{κ}) = \sum_{j} \frac{v (u_{j})}{v (Δ_{j})} \cdot u_{j} + \sum_{k_{β}} f (k_{β}) \cdot k_{β},

where

u_{j} \in {\hat{\partial}}_{2} (Δ_{j})

. Note that only

κ

and

ω

are mixed

c_{κ}

-edges, and so

{u_{j}} \cap {k_{β}} = ⌀

.

Any set M of

c_{κ}

-cycles induces a set of distinct mixed faces

ω, κ_{1}, \dots, κ_{h}

. In the two sub-trees of S and T formed by the respective pure edges of the

c_{κ}

-paths, these mixed faces connect an S-leaf with a T-minimal vertex or a T-leaf with an S-minimal vertex. By construction, these edges are distinct from any of the

k_{β}

produced by any M-cycle, and the coefficients of any of these

c_{κ_{i}}

1 \leq i \leq h

in a linear combination

\sum_{c_{κ} \in M} r_{κ} \cdot infl (c_{κ}) = 0

are zero. Iterating this argument, we arrive at

\sum_{κ} r_{κ} \cdot infl (c_{κ}) = 0 \Rightarrow r_{κ} = 0 .

Accordingly,

{infl (c_{κ}) ∣ κ}

freely generate

{⟨ {infl (c_{κ}) ∣ κ} ⟩}_{R}

, and the following diagram is commutative:

\begin{matrix} H_{2, R} (X, X^{1}) & \overset{{\hat{\partial}}_{2}^{v}}{⟶} & H_{1, R} (X^{1}) & \overset{I_{*}}{⟶} & H_{1, R} (X) & ⟶ 0 \\ inj ↑ & infl ↑ \\ H_{2} (X, X^{1}) & \overset{{\hat{\partial}}_{2}}{⟶} & H_{1} (X^{1}) & \overset{I_{*}}{⟶} & 0 \\ ↑ \\ 0 \end{matrix}

Since

Im ({\hat{\partial}}_{2}^{v}) = R \otimes_{Z} Im (infl)

, we have

H_{1, R} (X) ≅ H_{1, R} (X^{1}) / (R \otimes_{Z} Im (infl)),

whence

0 ⟶ R \otimes_{Z} infl (H_{1} (X^{1})) \overset{inj}{⟶} H_{1, R} (X^{1}) \overset{I_{*}}{⟶} H_{1, R} (X) ⟶ 0 .

□

6. The Rank of $H_{1, R} (X^{1})$

Next, we compute the rank of

H_{1, R} (X^{1})

by constructing a basis. Note that an element in

H_{1} (X^{1})

is a sum of cycles, and it is natural to ask what the analogue of a cycle in

H_{1, R} (X^{1})

is.

Let

c^{'} = \sum_{i} e_{i} \in H_{1} (X^{1})

be minimal in terms of the number of edges it contains. A straightforward computation shows that there exist

r_{i} \in R

with

\partial_{1}^{v} (\sum_{i} r_{i} \cdot e_{i}) = 0

, where the multi-set

{(r_{i})}_{i}

is unique up to a scalar factor. This shows that, except for a scalar multiple, minimal

H_{1} (X^{1})

cycles determine the distinguished

H_{1, R} (X^{1})

cycles, and we shall call a cycle

c = \sum_{i} r_{i} e_{i} \in H_{1, R} (X^{1})

elementary if

gcd ({r_{i}}) = 1

and

\sum_{i} e_{i} \in H_{1} (X^{1})

is a minimal cycle. An elementary cycle is precisely the

H_{1, R} (X^{1})

-analogue of a minimal

H_{1} (X^{1})

-cycle.

In view of

X^{1}

being connected, a basis of

H_{1} (X^{1})

can be comprised of all minimal cycles derived from a fixed

X^{1}

-spanning tree

T

: Any

X^{1}

non-tree edge added to this tree results in a unique minimal cycle in the basis that is labeled by the edge. One key observation here is that

{\partial_{1} (e) | e \in T}

generates

\partial_{1} (C_{1} (X^{1}))

for an arbitrary spanning tree

T

of

X^{1}

.

To derive a basis of

H_{1, R} (X^{1})

in an analogous fashion, an arbitrary choice of an

X^{1}

-spanning tree is no longer sufficient because of the weights involved. However, in the following, we construct a distinguished tree

T_{X^{1}}^{#}

that is particularly well suited to the understanding of the embedding of

H_{1} (X^{1})

into

H_{1, R} (X^{1})

.

Lemma 5.

There exists an

X^{1}

-spanning tree,

T_{X^{1}}^{#}

, such that the following sequence is exact:

0 ⟶ C_{1, R} (T_{X^{1}}^{#}) \overset{\partial_{1}^{v}}{⟶} C_{0, R} (X) \overset{proj}{⟶} H_{0, R} (X) ⟶ 0 .

Proof.

It suffices to construct

T_{X^{1}}^{#}

such that

\partial_{1}^{v} ({⟨ T_{X^{1}}^{#} ⟩}_{R})

= \partial_{1}^{v} ({⟨ X^{1} ⟩}_{R})

. To this end, we fix an arbitrary

X^{1}

-spanning tree

T_{0}

and let

M = X^{1} ∖ T_{0}

. We examine M-edges one by one, yielding a chain of processed edges,

⌀ = M_{0} \subset, \dots, \subset M_{k - 1} \subset M_{k}

, and a sequence of trees,

T_{0}, \dots, T_{k - 1}, T_{k}

. We claim that these have the property

\partial_{1}^{v} ({⟨ T_{k} ⟩}_{R}) = \partial_{1}^{v} ({⟨ M_{k} \cup T_{0} ⟩}_{R})

.

The above holds trivially for

k = 0

, and we assume that

M_{k - 1}

and

T_{k - 1}

are constructed with

\partial_{1}^{v} ({⟨ T_{k - 1} ⟩}_{R}) = \partial_{1}^{v} ({⟨ M_{k - 1} \cup T_{0} ⟩}_{R})

. Let

e_{k} \in M ∖ M_{k - 1}

and set

M_{k} = M_{k - 1} ⋃ {e_{k}}

. Then, there exist

r_{e} \in R

such that

\sum_{e \in {e_{k}} \cup T_{k - 1}} r_{e} \partial_{1}^{v} (e) = 0

. Letting

r = gcd {r_{e}}

and

r_{e} = r r_{e}^{'}

leads to

\sum_{e \in {e_{k}} \cup T_{k - 1}} r_{e}^{'} \partial_{1}^{v} (e) = 0,

where, by construction, at least one edge has a coefficient of one. In case the

r_{e_{k}}^{'} = 1

, set

T_{k} = T_{k - 1}

. Otherwise, remove a

T_{k - 1}

-edge with coefficient one, and add

e_{k}

to

T_{k - 1}

, obtaining

T_{k}

. Note then that

\partial_{1}^{v} ({⟨ T_{k} ⟩}_{R}) = \partial_{1}^{v} ({⟨ {e_{k}} \cup T_{k - 1} ⟩}_{R}) = \partial_{1}^{v} ({⟨ {e_{k}} ⟩}_{R}) + \partial_{1}^{v} ({⟨ M_{k - 1} \cup T_{0} ⟩}_{R}) = \partial_{1}^{v} ({⟨ M_{k} \cup T_{0} ⟩}_{R}) .

By induction, when the processes terminates, it yields the desired

T_{X^{1}}^{#}

. □

T_{X^{1}}^{#}

induces a new set of closing edges

κ^{#}

and the associated minimal

H_{1} (X^{1})

-cycles

c_{κ^{#}}

. We shall show that the corresponding elementary cycles constitute a basis of

H_{1, R} (X^{1})

.

Proposition 3.

There exists an isomorphism

H_{1, R} (X^{1}) ≅ R \otimes_{Z} H_{1} (X^{1}) .

Furthermore,

H_{1, R} (X^{1})

is freely generated by the set of elementary cycles

c_{κ^{#}}

and has rank

e - v + 1

, where e and v are the numbers of 1- and 0-simplices in

X^{1}

.

Proof.

H_{1, R} (X^{1})

is free, as it is a sub-module of the finitely generated free module over the PID R and

C_{1, R} (X^{1})

. By construction, to each

c_{κ^{#}} \in H_{1} (X^{1})

, there corresponds a unique elementary

H_{1, R} (X^{1})

-cycle

c_{κ^{#}}^{v}

, whose

κ^{#}

coefficient is one. Furthermore,

c_{κ^{#}}^{v}

is non-torsion in

H_{1, R} (X^{1})

; if

r \cdot c_{κ^{#}}^{v} = 0

, R being an integral domain implies

r = 0

.

Claim. Any

c \in H_{1, R} (X^{1})

has the representation

c = \sum_{κ} r_{κ^{#}} c_{κ^{#}}^{v}

with a unique

r_{κ^{#}} \in R

.

We proceed by induction on the number of

T_{X_{1}}^{#}

-closing edges in c.

To establish the induction basis, let

r_{κ^{#}} κ^{#}

be the unique term that contains

κ^{#}

in c. Using the fact that

κ^{#}

appears with a coefficient of one in

c_{κ^{#}}^{v}

, we derive

c = r_{κ^{#}} c_{κ^{#}}^{v}

.

For the induction step, distinguishing

r_{κ_{1}^{#}} κ_{1}^{#}

as a closing edge summand in c, we have

c - r_{κ_{1}^{#}} \cdot c_{κ_{1}^{#}}^{v} = \tilde{c} \in H_{1, R} (X^{1})

with

κ_{1}^{#} \notin \tilde{c}

. By inductive hypothesis,

\tilde{c} = \sum_{κ^{#} \in \tilde{c}} r_{κ^{#}} \cdot c_{κ^{#}}^{v}

with unique coefficients

r_{κ^{#}}

, hence the claim.

From the claim and the fact that the

c_{κ^{#}}^{v}

are non-torsion, we conclude that

H_{1, R} (X^{1}) ≅ ⨁_{κ} R c_{κ^{#}}^{v} ≅ R \otimes_{Z} H_{1} (X^{1}),

hence the proposition. □

7. The Modules of the Weighted Homology

By Lemma 5, we have

rnk (Im (\partial_{1}^{v})) = rnk (\partial_{1}^{v} ({⟨ T_{X^{1}}^{#} ⟩}_{R})) = v - 1,

from which

rnk (C_{0, R} (X) / Im (\partial_{1}^{v})) = 1

.

Since

C_{0, R} (X) / Im (\partial_{1}^{v})

is a module over the PID R, we arrive at

H_{0, R} (X) ≅ C_{0, R} (X) / Im (\partial_{1}^{v}) ≅ M_{π} \oplus R,

where

n_{1} \geq \dots n_{v - 1} \geq 0

, and

M_{π} = ⨁_{j = 1}^{v - 1} R / (π^{n_{j}})

is the

π

-torsion module of

H_{0, R} (X)

.

We shall proceed by computing

M_{π}

. Note that the

T_{X^{1}}^{#}

-tree is a spanning tree of

X^{1}

, and

{\partial_{1}^{v} (e) | e \in T_{X^{1}}^{#}}

generates

Im (\partial_{1}^{v})

. Thus, we can compute

H_{0, R} (X)

via

T_{X^{1}}^{#}

. To this end, we process

T_{X^{1}}^{#}

as follows: We select a

T_{X^{1}}^{#}

-vertex

u_{i}

and an incident edge

e_{i} = (u_{j}, u_{i})

for which

{log}_{π} \frac{v (u_{i})}{v (e_{i})}

is minimal. We contract

e_{i}

onto

u_{j}

by removing

u_{i}

and replacing any edges of the form

(u_{k}, u_{i})

with

(u_{k}, u_{j})

while maintaining their original weights

v (u_{k}, u_{j}) : = v (u_{k}, u_{i})

. Recursively contracting all

(v - 1)

edges in

T_{X^{1}}^{#}

produces a sequence of trees

T_{X^{1}}^{#} = T_{0}, \dots, T_{v - 1}

, where

T_{v - 1} = u_{v}

is a single vertex at which point the process terminates. Relabeling the

T_{X^{1}}^{#}

-vertices, we may assume that

u_{i}

is removed at the ith step of this process.

Theorem 1.

There exists an isomorphism

H_{0, R} (X) ≅ (⨁_{i = 1}^{v - 1} R / (π^{n_{i}})) \oplus R,

where

n_{i} = {log}_{π} \frac{v (u_{i})}{v (e_{i})}

, as described above.

Proof.

Let

E_{m} = {e_{i} = (u_{a_{i}}, u_{i}) ∣ 0 \leq i \leq m}

denote the set of recursively contracted edges up to and including the mth step. Set

e_{i} = (u_{i}, u_{a_{i}})

,

x_{i} = u_{i} - \frac{v (u_{a_{i}})}{v (u_{i})} u_{a_{i}}

, and

x_{v} = u_{v}

, where the formal fractions are well defined by the minimality of the denominator exponent. Note then that

\partial_{1}^{v} (e_{i}) = \frac{v (u_{i})}{v (e_{i})} u_{i} - \frac{v (u_{a_{i}})}{v (e_{i})} u_{a_{i}} = \frac{v (u_{i})}{v (e_{i})} x_{i} : = w_{i} x_{i}

.

Claim 1.

\partial_{1}^{v} ({⟨ E_{i} \cup T_{i} ⟩}_{R}) = Im (\partial_{1}^{v})

for any

0 \leq i \leq v - 1

.

We proceed by induction on i, where Lemma 5 implies the induction basis

i = 0

. Consider the contraction at step i:

E_{i} = E_{i - 1} \cup {e_{i} = (u_{a_{i}}, u_{i})}

. A

T_{i - 1}

-edge of the form

f_{k} = (u_{k}, u_{i})

induces the

T_{i}

-edge

f_{k}^{'} = (u_{k}, u_{a_{i}})

with

v (f_{k}) = v (f_{k}^{'})

. Then,

\partial_{1}^{v} (f_{k}^{'}) - \partial_{1}^{v} (f_{k}) = \frac{v (u_{k})}{v (f_{k}^{'})} u_{k} - \frac{v (u_{a_{i}})}{v (f_{k}^{'})} u_{a_{i}} - [\frac{v (u_{k})}{v (f_{k})} u_{k} - \frac{v (u_{i})}{v (f_{k})} u_{i}] = \partial_{1}^{v} (e_{i})

implies

\partial_{1}^{v} ({⟨ E_{i - 1} \cup T_{i - 1} ⟩}_{R}) = \partial_{1}^{v} ({⟨ E_{i} \cup T_{i} ⟩}_{R})

, and Claim 1 follows by induction.

Claim 2.

C_{0, R} (X) = ⨁_{i = 1}^{v} R x_{i}

.

Let

z = \sum_{j = 1}^{v} r_{j}^{'} u_{j} \in C_{0, R} (X)

, where

u_{i}

is removed from

T_{i - 1}

at the ith contraction step. We rewrite the z in terms of the

x_{i}

. This is done recursively, starting with

u_{1}

, as follows: Let

C_{p} (u_{q})

be the coefficient of

u_{q}

at step p. Then,

x_{i} = u_{i} - \frac{v (u_{a_{i}})}{v (u_{i})} u_{a_{i}}

implies

C_{i} (u_{i}) u_{i} = C_{i} (u_{i}) x_{i} + C_{i} (u_{i}) \frac{v (u_{a_{i}})}{v (u_{i})} u_{a_{i}}

, and we obtain the updated

u_{a_{i}}

-coefficient

C_{i + 1} (u_{a_{i}}) = C_{i} (u_{a_{i}}) + C_{i} (u_{i}) \frac{v (u_{a_{i}})}{v (u_{i})} .

Processing all

u_{i}

, we arrive at

z = \sum_{= 1}^{v - 1} r_{i} x_{i} + r_{v} x_{v}

, and Claim 2 follows.

Claim 3.

H_{0, R} (X) ≅ (⨁_{i = 1}^{v - 1} R {\bar{x}}_{i}) \oplus R x_{v}

with

{\bar{x}}_{i} = x_{i} + {⟨ \partial_{1}^{v} (e_{i}) ⟩}_{R}

.

By Claim 2, we have

H_{0, R} (X) = (⨁_{i = 1}^{v} R x_{i}) / Im (\partial_{1}^{v})

, and we make the following ansatz:

φ : H_{0, R} (X) ⟶ (⨁_{i = 1}^{v} R {\bar{x}}_{i}) \oplus R x_{v}, φ ((\sum_{i = 1}^{v - 1} r_{i} x_{i} + r_{v} x_{v}) + Im (\partial_{1}^{v})) = \sum_{i = 1}^{v - 1} r_{i} {\bar{x}}_{i} + r_{v} x_{v} .

First,

φ

is a well-defined homomorphism: Suppose

\sum_{i} r_{i} x_{i} - \sum_{i} r_{i}^{'} x_{i} \in Im (\partial_{1}^{v})

. Since

x_{v}

is linearly independent from

x_{i}

,

1 \leq i \leq v - 1

, this immediately implies that

r_{v} = r_{v}^{'}

. Claim 1 guarantees that

Im (\partial_{1}^{v}) = \partial_{1}^{v} ({⟨ E_{v - 1} ⟩}_{R})

, and consequently, there exist unique coefficients,

α_{i}

, such that

\sum_{i = 1}^{v - 1} r_{i} x_{i} - \sum_{i = 1}^{v - 1} r_{i}^{'} x_{i} = \partial_{1}^{v} (\sum_{i = 1}^{v - 1} α_{i} e_{i}) .

In view of

\partial_{1}^{v} (e_{i}) = n_{i} x_{i}

, we obtain

\sum_{i = 1}^{v - 1} r_{i} x_{i} - \sum_{i = 1}^{v - 1} r_{i}^{'} x_{i} = \sum_{i = 1}^{v - 1} α_{i} \partial_{1}^{v} (e_{i}) = \sum_{i = 1}^{v - 1} s_{i} n_{i} x_{i} .

The linear independence of

x_{i}

, in turn, implies that

(r_{i} - r_{i}^{'}) = s_{i} n_{i}

for any i, from which

(r_{i} - r_{i}^{'}) x_{i} = s_{i} \partial_{1}^{v} (e_{i})

, and thus,

r_{i} {\bar{x}}_{i} = r_{i}^{'} {\bar{x}}_{i}

.

φ

is, by construction, surjective and does not have a nontrivial kernel from which Claim 3 follows.

By construction,

R x_{v} ≅ R

, and as for the direct summands

R {\bar{x}}_{k}

, for

1 \leq k \leq v - 1

, we consider the mapping

h : R {\bar{x}}_{k} \to R / R n_{k}

,

h (r {\bar{x}}_{k}) = r mod n_{k}

. Suppose that

(r_{k} - r_{k}^{'}) x_{k} = s_{k} \partial_{1}^{v} (e_{k})

; then,

\partial_{1}^{v} (e_{k}) = n_{k} x_{k}

implies that

(r_{k} - r_{k}^{'}) = s_{k} n_{k}

, so h is well defined. Clearly h is surjective, and it remains to check the injectivity.

h (r {\bar{x}}_{k}) = 0

implies that

r = s n_{k}

and

s n_{k} x_{k} = s \partial_{1}^{v} (e_{k})

implies that

r {\bar{x}}_{k} = \bar{0}

, hence the theorem. □

We proceed by analyzing

H_{1, R} (X)

. We begin with the expression for

infl (c_{κ^{#}})

in Lemma 4:

infl (c_{κ^{#}}) = \sum_{j} \frac{v (u_{j})}{v (Δ_{j})} \cdot u_{j} + \sum_{k_{β}} f (k_{β}) \cdot k_{β},

where

Δ_{j}

is the 2-simplex containing the edge

u_{j}

, and

k_{β}

are mixed edges that appear when transitioning between 2-simplices with different weights that share a

k_{β}

face.

The “troublemakers” here are the off-diagonal terms of the embedding,

k_{β}

, which stem from passing from trivial to nontrivial crossing components and vice versa. We shall eliminate such transitions by appropriately deforming X without affecting its homology. We then have to check that the deformation gives rise to an inflation map that is just a restriction of the original inflation map for X. Let

Δ \in X

be a 2-simplex corresponding to a non-crossing arc, i.e.,

Δ

is maximal in X. W.l.o.g., we can assume that

Δ = [s_{1}, s_{2}, t]

with

s_{1}, s_{2} \in S, t \in T

. Then,

τ = [s_{1}, s_{2}]

is

Δ

-free, and by construction,

v (τ) = v (Δ) = 2

. Removing

τ

and

Δ

from X produces a sub-complex

X^{'}

with

X ↘ X^{'}

. By Proposition 2, we may remove all 2-simplices contained in X corresponding to non-crossing arcs together with their pure faces without changing the homology. Accordingly, the resulting

\tilde{X}

is an X-sub-complex such that

H_{i, R} (\tilde{X}) ≅ H_{i, R} (X)

. Note that

\tilde{X}

is, in general, not a complex induced by a bi-structure.

Lemma 6.

There exists an

{\tilde{X}}^{1}

-spanning tree,

T_{{\tilde{X}}^{1}}^{#}

, with

\partial_{1}^{v} ({⟨ T_{{\tilde{X}}^{1}}^{#} ⟩}_{R}) = \partial_{1}^{v} ({⟨ X^{1} ⟩}_{R})

.

Proof.

Let

T_{0}

be a fixed

{\tilde{X}}^{1}

-spanning tree. We then continue with

{\tilde{X}}^{1} ∖ T_{0}

-edge replacements, as in Lemma 5, and the process terminates with the

{\tilde{X}}^{1}

-spanning tree

T_{k} = T_{{\tilde{X}}^{1}}^{#}

, which has the property

\partial_{1}^{v} ({⟨ T_{{\tilde{X}}^{1}}^{#} ⟩}_{R}) = \partial_{1}^{v} (C_{1, R} (\tilde{X}))

.

Note that a pure edge e that is not in

\tilde{X}

can be written as

e = \partial_{2}^{v} (Δ_{e}) + e - \partial_{2}^{v} (Δ_{e})

, where

Δ_{e} \in X^{2}

is the unique 2-simplex containing the pure edge e. Since

v (e) = v (Δ_{e})

, we have

(e - \partial_{2}^{v} (Δ_{e})) \in C_{1, R} (\tilde{X})

or

(\partial_{2}^{v} (Δ_{e}) + e) \in C_{1, R} (\tilde{X})

, depending on the sign of e in

\partial_{2}^{v} (Δ_{e})

. Consequently,

\partial_{1}^{v} (C_{1, R} (\tilde{X})) = \partial_{1}^{v} (C_{1, R} (X))

, and the lemma follows. □

Note that

H_{1, R} (\tilde{X}) ≅ Ker (\partial_{1}^{v} (\tilde{X})) / Im (\partial_{2}^{v} (\tilde{X}))

and

Ker (\partial_{1}^{v} (\tilde{X})) = H_{1, R} ({\tilde{X}}^{1})

. Furthermore, by Proposition 3,

H_{1, R} ({\tilde{X}}^{1})

is generated by elementary cycles induced by

T_{{\tilde{X}}^{1}}^{#}

. Hence, we can exploit the structure of

T_{{\tilde{X}}^{1}}^{#}

in order to compute

H_{1, R} (\tilde{X}) ≅ H_{1, R} (X)

.

Theorem 2.

H_{1, R} (X)

is an R-torsion module, and there exists an isomorphism

H_{1, R} (X) ≅ ⨁_{κ^{#}} R / (π^{| κ^{#} | - 1}),

where

κ^{#}

is a closing edge of a minimal cycle

c_{κ^{#}} \in H_{1} (\tilde{X})

with respect to

T_{{\tilde{X}}^{1}}^{#}

.

Proof.

We begin by establishing that the inflation map naturally restricts to

\tilde{X}

.

Claim 1. We have the commutative diagram

\begin{matrix} 0 ⟶ & H_{1} (X^{1}) & \overset{infl}{⟶} & H_{1, R} (X^{1}) & \overset{I_{*}}{⟶} & H_{1, R} (X) & ⟶ 0 . \\ I_{*} ↑ & I_{*} ↑ & ≅ ↑ \\ 0 ⟶ & H_{1} ({\tilde{X}}^{1}) & \overset{infl | H_{1} ({\tilde{X}}^{1})}{⟶} & H_{1, R} ({\tilde{X}}^{1}) & \overset{I_{*}}{⟶} & H_{1, R} (\tilde{X}) & ⟶ 0 \end{matrix}

Clearly,

H_{2} (X^{1}, {\tilde{X}}^{1}) = 0

and

H_{2, R} (X^{1}, {\tilde{X}}^{1}) = 0

, and by the long homology sequences,

I_{*} : H_{1} ({\tilde{X}}^{1}) \to H_{1} (X^{1})

and

I_{*} : H_{1, R} ({\tilde{X}}^{1}) \to H_{1, R} (X^{1})

are embeddings. Lemma 6 shows that

T_{{\tilde{X}}^{1}}^{#}

is also a

T_{X^{1}}^{#}

-tree, from which

H_{1} (X^{1})

is obtained from

H_{1} ({\tilde{X}}^{1})

by just adding all elementary cycles induced by the pure edges that are not contained in

\tilde{X}

. Consequently, removing these edges from

X^{1}

is equivalent to restricting the inflation map

infl : H_{1} (X^{1}) \to H_{1, R} (X^{1})

to the sub-complex

{\tilde{X}}^{1}

, and Claim 1 follows.

Claim 2. For

{infl |}_{H_{1} ({\tilde{X}}^{1})} : H_{1} ({\tilde{X}}^{1}) \to H_{1, R} ({\tilde{X}}^{1})

, we have

{infl |}_{H_{1} ({\tilde{X}}^{1})} (c_{κ^{#}}) = π^{| κ^{#} | - 1} c_{κ^{#}}^{v}

. As in Lemma 4:

infl (c_{κ^{#}}) = \sum_{j} \frac{v (u_{j})}{v (Δ_{j})} \cdot u_{j} + \sum_{k_{β}} f (k_{β}) \cdot k_{β},

where

Δ_{j}

is the 2-simplex containing the edge

u_{j}

, and

k_{β}

are mixed edges that appear when transitioning between 2-simplices with different weights that share a

k_{β}

face. By construction, for any

Δ \in {\tilde{X}}^{2}

,

v (Δ) = π^{1}

, and so no

k_{β}

edges emerge in the inflation images. In particular,

κ^{#}

has the coefficient

v (κ^{#}) / v (Δ_{κ^{#}}) = π^{| κ^{#} | - 1}

in

infl (c_{κ^{#}})

. By Lemma 6

\partial_{1}^{v} ({⟨ T_{{\tilde{X}}^{1}}^{#} ⟩}_{R}) = \partial_{1}^{v} ({⟨ \tilde{X} ⟩}_{R})

, and so the corresponding elementary cycle

c_{κ^{#}}^{v}

has the coefficient 1 at

κ^{#}

. Since

H_{1, R} ({\tilde{X}}^{1})

is freely generated by all elementary cycles, Claim 2 follows.

H_{1, R} ({\tilde{X}}^{1})

is freely generated by

M = {c_{κ^{#}}^{v} ∣ κ^{#}}

and is free of rank

| M |

. By Lemma 4,

| M | = rnk (infl (H_{1} ({\tilde{X}}^{1})))

, from which

H_{1, R} (\tilde{X}) ≅ H_{1, R} ({\tilde{X}}^{1}) / (R \otimes_{Z} infl (H_{1} ({\tilde{X}}^{1})))

is, by the structure theorem of finitely generated modules over PIDs, a full-torsion module. The exact sequence

0 ⟶ R \otimes_{Z} infl {(H_{1} ({\tilde{X}}^{1}))}^{\overset{inj}{⟶}} H_{1, R} ({\tilde{X}}^{1}) \overset{I_{*}}{⟶} H_{1, R} (\tilde{X}) ⟶ 0

implies that

R \otimes_{Z} infl (H_{1} ({\tilde{X}}^{1})) = {⟨ {π^{| κ^{#} | - 1} c_{κ^{#}}^{v} ∣ κ^{#}} ⟩}_{R}

, and clearly,

H_{1, R} (X) ≅ H_{1, R} (\tilde{X}) ≅ ⨁_{κ^{#}} R / (π^{| κ^{#} | - 1}) .

□

Theorem 3.

The injection

inj : H_{2} (X) \to H_{2, R} (X)

induces an isomorphism of R-modules:

H_{2, R} (X) ≅ R \otimes_{Z} H_{2} (X) .

Proof.

The long homology sequence for the weighted homology induces the exact sequence

0 ⟶ H_{2, R} (X) \overset{J_{*}}{⟶} H_{2, R} (X, X^{1}) \overset{{\hat{\partial}}_{2}^{v}}{⟶} Im ({\hat{\partial}}_{2}^{v}) ⟶ 0,

where

H_{2, R} (X) ≅ Ker (\partial_{2}^{v})

and

H_{2, R} (X, X^{1}) = C_{2, R} (X)

. Hence,

Im (J_{*}) = Ker (\partial_{2}^{v}) = Ker ({\hat{\partial}}_{2}^{v})

, and the sequence is exact. In view of

Im (\hat{\partial_{2}^{v}}) \leq_{R} H_{1, R} (X^{1})

and Proposition 3, all modules in the sequence are free and, thus, projective; hence,

rnk (H_{2, R} (X)) + rnk (Im ({\hat{\partial}}_{2}^{v})) = rnk (H_{2, R} (X, X^{1}))

. Analogously, the exact sequence

0 ⟶ H_{2} (X) \overset{J_{*}}{⟶} H_{2} (X, X^{1}) \overset{{\hat{\partial}}_{2}}{⟶} H_{1} (X^{1}) \overset{I_{*}}{⟶} 0

of free modules implies that

rnk (H_{2} (X)) + rnk (H_{1} (X^{1})) = rnk (H_{2} (X, X^{1}))

. Since

R \otimes_{Z} H_{2} (X, X^{1})) ≅ H_{2, R} (X, X^{1})

and

rnk (Im ({\hat{\partial}}_{2}^{v})) = rnk (R \otimes_{Z} infl (H_{1} (X^{1}))) = rnk (H_{1} (X^{1}))

, we can conclude that

rnk (H_{2, R} (X)) = rnk (H_{2} (X)) .

Finally, since both

H_{2, R} (X)

and

R \otimes_{Z} H_{2} (X)

are free finitely generated modules over the PID R, the theorem follows. □

Remark 1.

Since

X^{2} = X

, by construction,

C_{i > 2, R} (X) = 0

. As such,

H_{i > 2, R} (X) = 0

, and, in view of Corollary 1, this holds for weighted complexes stemming from arbitrary bi-structures.

8. Conclusions and Future Work

In this paper, we introduced the weighted homology of the nerve complex X of an RNA bi-structure. We demonstrated that the weighted homology of a bi-structure distinctively augments the simplicial homology.

In the simplicial homology, only the zeroth and second homology groups are nontrivial [1]. Since the complex of a bi-structure is, by construction, connected, the simplicial homology carries key information exclusively via

H_{2} (X)

.

The weighted homology not only conserves the information provided by the simplicial homology, but it also supplies additional information. Specifically, connectivity is still picked up by the free submodule of

H_{0, R} (X)

; however, additional torsion now emerges. This torsion is given a concrete combinatorial interpretation via the compression algorithm on the spanning tree

T_{X^{1}}^{#}

. In fact, the compression procedure naturally generates the free submodule of rank one at its final step.

In the case of

H_{2, R} (X)

, we consider

H_{2} (X)

and observe that the nontrivial chain maps

θ_{n} : C_{n} (X) \overset{}{\to} C_{n, R} (X)

,

θ (\sum_{i} n_{i} σ_{i}) = \sum_{i} n_{i} v (σ_{i}) σ_{i}

, produce an injection

inj : H_{2} (X) \to H_{2, R} (X)

, in which all coefficients are one, since any 2-simplex contained in the expression of an

H_{2} (X)

-generator (i.e., appearing in a crossing component) has a weight of one. As a result, passing to the ring R preserves the information present in

H_{2} (X)

.

In the computation of

H_{1, R} (X)

, we employed the simplicial homology in a different way. Here, we consider the relative homology and transcend the information from the 1-skeleton,

X^{1}

. While

H_{1} (X)

is trivial [1],

H_{1} (X^{1})

carries information, which we can utilize by extracting from

\begin{matrix} 0 ⟶ & H_{2, R} (X) & \overset{J_{*}}{⟶} & H_{2, R} (X, X^{1}) & \overset{{\hat{\partial}}_{2}^{v}}{⟶} & H_{1, R} (X^{1}) & \overset{I_{*}}{⟶} & H_{1, R} (X) & ⟶ 0 \\ inj ↑ & inj ↑ & infl ↑ \\ 0 ⟶ & H_{2} (X) & \overset{J_{*}}{⟶} & H_{2} (X, X^{1}) & \overset{{\hat{\partial}}_{2}}{↑} & H_{1} (X^{1}) & \overset{I_{*}}{⟶} & 0 \\ ↑ \\ 0 \end{matrix}

the embedded exact sequence of Lemma 4. After resolving some technicalities, this allows us to compute

H_{1, R} (X)

. Note that passing to the deformation

\tilde{X}

greatly simplifies the inflation map, and we observe that it acts on

\tilde{X}

as

θ_{1}

would on the 1-skeleton.

The structure theorems for the weighted homology enable the classification of bi-structures by their R-modules. Such classifications can provide insights into the algorithmic complexity of problems involving bi-structures. For instance, the problem of computing thermodynamically stable sequences for a bi-structure can be efficiently solved in polynomial time when the second homology R-module

H_{2, R} (X)

is trivial [2].

As for future work, we shall extend the homological analysis to RNA–RNA interaction structures [35]. An interaction structure can be represented as a diagram with two backbones drawn horizontally on top of each other such that both intra-molecular and inter-molecular bonds are non-crossing; see Figure 5.

Figure 5. Left: an interaction structure with two interaction arcs, x and y. Right: the corresponding interaction complex.

The inter-molecular arcs naturally induce an equivalence relation

ϕ

on the vertices of the two backbones. The intersection of two loops is accordingly defined to be the set of

ϕ

-equivalence classes of vertices that they share. A bi-secondary structure can then be viewed as a particular type of interaction structure—namely, an interaction structure with two identical backbones, where interaction arcs connect any pair

(i, i)

for

0 \leq i \leq n + 1

. In contrast to a bi-structure, an interaction structure can exhibit nontrivial first-simplicial homology and requires revisiting the notion of a crossing component for interaction structures. The analysis involves many more topologies—in particular, surfaces and Mayer–Vietoris sequences [34].

The framework of the weighted homology is by no means restricted to bi- or interaction structures. It also gives rise to the consideration of the dissimilarity complex of a finite set of genomic sequences, which we briefly discuss in the following. A multiple sequence alignment W of genomic sequences is defined to be a finite set of words of equal length m over the finite alphabet

A = {a_{1}, \dots, a_{s}}

. For any

k \in {1, \dots, m}

, let

f_{k} : W \overset{}{\to} A

be the map that returns the symbol at position k of a word, namely,

f_{k} (w) = a_{w_{k}}

for

w = a_{w_{1}} \dots a_{w_{m}} \in W

. A

d + 1

subset

σ \subseteq W

forms a d-simplex if there exists at least one position j such that

| {f_{j} (w) | w \in σ} | = d + 1

. In other words, all

d + 1

sequences in

σ

contain mutually different nucleotide types at the site j. For a fixed

σ

, the number of different positions j for which the above holds is called the dissimilarity of

σ

. This leads to the aforementioned dissimilarity complex for genomic sequences; see Figure 6. This complex is, by definition, low dimensional; its highest dimension is

s - 1

. For

d + 1 = 2

, the 1-skeleton of the dissimilarity complex captures the well-known Hamming distance—the number of positions in which two genetic sequences differ. It may be worth pointing out that this framework integrates well-known generalizations of distances, as these appear naturally as weights in the dissimilarity complex. Triangle inequalities generalize to tetrahedron inequalities, etc., and this could enhance our understanding of genetic sequences. This is because basic constructs—e.g., trees, such as the tree of life—that reflect the ancestral relations between sequences depend on the notion of Hamming distance. Our preliminary investigations suggest that the homology of the dissimilarity complex captures evolutionary events. For example, the free rank of the first homology module gives bounds of the possible recombinations within the sequence set.

Figure 6. A multiple-sequence alignment of three sequences

X, Y, Z

, their corresponding weighted simplices, and the associated dissimilarity complex.

Author Contributions

Conceptualization, A.B., Q.H. and C.R.; writing, A.B., Q.H. and C.R.; review, A.B., Q.H. and C.R.; investigation: A.B., Q.H. and C.R. All authors have contributed equally to this work. All authors have read and agreed to the possible publication of the manuscript.

Funding

This research received no external funding; the authors received no external financial support for the research, authorship, and/or publication of this article.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We gratefully acknowledge the comments from and discussions with Fenix Huang and Thomas Li. We also appreciate the constructive feedback provided by the anonymous reviewers.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bura, A.C.; He, Q.; Reidys, C.M. Loop Homology of Bi-secondary Structures. Discret. Math. 2021, 344, 112371. [Google Scholar] [CrossRef]
Huang, F.W.; Barrett, C.L.; Reidys, C.M. The energy-spectrum of bicompatible sequences. arXiv 2019, arXiv:1910.00190. [Google Scholar]
Ren, S.; Wu, C.; Wu, J. Weighted persistent homology. Rocky Mt. J. Math. 2018, 48, 2661–2687. [Google Scholar] [CrossRef]
Holley, R.W.; Apgar, J.; Everett, G.A.; Madison, J.T.; Marquisee, M.; Merrill, S.H.; Penswick, J.R.; Zamir, A. Structure of a ribonucleic acid. Science 1965, 147, 1462–1465. [Google Scholar] [CrossRef] [PubMed]
Thirumalai, D.; Lee, N.; Woodson, S.A.; Klimov, D. Early events in RNA folding. Annu. Rev. Phys. Chem. 2001, 52, 751–762. [Google Scholar] [CrossRef]
Fresco, J.R.; Alberts, B.M.; Doty, P. Some molecular details of the secondary structure of ribonucleic acid. Nature 1960, 188, 98–101. [Google Scholar] [CrossRef] [PubMed]
Darnell, J.E. RNA: Life’s Indispensable Molecule; Cold Spring Harbor Laboratory Press: New York, NY, USA, 2011. [Google Scholar]
Chapuy, G. A new combinatorial identity for unicellular maps, via a direct bijective approach. Adv. Appl. Math. 2011, 47, 874–893. [Google Scholar] [CrossRef]
Waterman, M.S. Secondary structure of single-stranded nucleic acids. Adv. Math. Suppl. Stud. 1978, 1, 167–212. [Google Scholar]
Schmitt, W.R.; Waterman, M.S. Linear trees and RNA secondary structure. DIscrete Appl. Math. 1994, 51, 317–323. [Google Scholar] [CrossRef]
Hofacker, I.L.; Schuster, P.; Stadler, P.F. Combinatorics of RNA secondary structures. Discret. Appl. Math. 1998, 88, 207–237. [Google Scholar] [CrossRef]
Haslinger, C.; Stadler, P.F. RNA structures with pseudo-knots: Graph-theoretical, combinatorial, and statistical properties. Bull. Math. Biol. 1999, 61, 437–467. [Google Scholar] [CrossRef] [PubMed]
Jin, E.Y.; Qin, J.; Reidys, C.M. Combinatorics of RNA structures with pseudoknots. Bull. Math. Biol. 2008, 70, 45–67. [Google Scholar] [CrossRef] [PubMed][Green Version]
Orland, H.; Zee, A. RNA folding and large N matrix theory. Nucl. Phys. B 2002, 620, 456–476. [Google Scholar] [CrossRef]
Andersen, J.E.; Chekhov, L.O.; Penner, R.; Reidys, C.M.; Sułkowski, P. Topological recursion for chord diagrams, RNA complexes, and cells in moduli spaces. Nucl. Phys. B 2013, 866, 414–443. [Google Scholar] [CrossRef]
Bon, M.; Vernizzi, G.; Orland, H.; Zee, A. Topological classification of RNA structures. J. Mol. Biol. 2008, 379, 900–911. [Google Scholar] [CrossRef] [PubMed]
Andersen, J.E.; Penner, R.C.; Reidys, C.M.; Waterman, M.S. Topological classification and enumeration of RNA structures by genus. J. Math. Biol. 2013, 67, 1261–1278. [Google Scholar] [CrossRef] [PubMed]
Huang, F.W.; Reidys, C.M. Shapes of topological RNA structures. Math. Biosci. 2015, 270, 57–65. [Google Scholar] [CrossRef]
Chen, W.; Deng, E.; Du, R.; Stanley, R.; Yan, C. Crossings and nestings of matchings and partitions. Trans. Am. Math. Soc. 2007, 359, 1555–1575. [Google Scholar] [CrossRef]
Stanley, R.P. Enumerative Combinatorics; Wadsworth Publ.: Belmont, CA, USA, 1986. [Google Scholar]
Sundaram, S. The Cauchy Identity for sp (2n). J. Comb. Theory Ser. A 1990, 52, 209–238. [Google Scholar] [CrossRef]
Penner, R.; Waterman, M.S. Spaces of RNA secondary structures. Adv. Math. 1993, 101, 31–49. [Google Scholar] [CrossRef]
Zuker, M.; Sankoff, D. RNA secondary structures and their prediction. Bull. Math. Biol. 1984, 46, 591–621. [Google Scholar] [CrossRef]
Gralla, J.; Crothers, D.M. Free energy of imperfect nucleic acid helices: II. Small hairpin loops. J. Mol. Biol. 1973, 73, 497–511. [Google Scholar] [CrossRef]
Turner, D.H.; Mathews, D.H. NNDB: The nearest neighbor parameter database for predicting stability of nucleic acid secondary structure. Nucleic Acids Res. 2009, 38, D280–D282. [Google Scholar] [CrossRef] [PubMed]
Nussinov, R.; Jacobson, A.B. Fast algorithm for predicting the secondary structure of single-stranded RNA. Proc. Natl. Acad. Sci. USA 1980, 77, 6309–6313. [Google Scholar] [CrossRef]
Waterman, M.S.; Smith, T.F. Rapid dynamic programming algorithms for RNA secondary structure. Adv. Appl. Math. 1986, 7, 455–464. [Google Scholar] [CrossRef]
McCaskill, J.S. The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolym. Orig. Res. Biomol. 1990, 29, 1105–1119. [Google Scholar] [CrossRef]
Reidys, C.M. Random induced subgraphs of generalizedn-cubes. Adv. Appl. Math. 1997, 19, 360–377. [Google Scholar] [CrossRef][Green Version]
Flamm, C.; Hofacker, I.L.; Maurer-Stroh, S.; Stadler, P.F.; Zehl, M. Design of multistable RNA molecules. RNA 2001, 7, 254–265. [Google Scholar] [CrossRef]
Bura, A.C.; He, Q.; Reidys, C.M. Loop homology of bi-secondary structures II. arXiv 2019, arXiv:1909.01222. [Google Scholar]
Whitehead, J. Simplicial Spaces, Nuclei and m-Groups. Proc. Lond. Math. Soc. 1939, 2, 243–327. [Google Scholar] [CrossRef]
Cohen, M.M. A Course in Simple-Homotopy Theory, 2nd ed.; Springer: Berlin, Germany, 1973. [Google Scholar]
Hatcher, A. Algebraic Topology; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
Andersen, J.E.; Huang, F.W.; Penner, R.C.; Reidys, C.M. Topology of RNA-RNA interaction structures. J. Comput. Biol. 2012, 19, 928–943. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Left: a planar RNA secondary structure with a closing base pair

r = (0, 20)

. Right: its diagram representation on the set of vertices

[19]

with two additional vertices, 0 and 20, forming the rainbow

r = (0, 20)

, the loop

s = [3, 4] \cup [8, 12] \cup [16, 17]

(shaded), and the corresponding intervals (underlined). The arc

x = (3, 17)

is the maximal arc of s,

α_{s} = x

.

Figure 2. Top: a bi-structure decomposed into a distinguished sub-structure and its complement. The vertices contained in both (white) control the complexity of the dynamic programming (DP) routine in [2]. Bottom: the recursive step of the DP routine. Note that the recursion changes the set of white vertices by removing some and adding others.

Figure 3. Left: a bi-structure

R = (S, T)

with S-loops

a, b

, S-rainbow arc

α_{a}

, T-loops

c, d

, and T-rainbow arc

α_{c}

. Right: its weighted loop nerve

K_{0} (R) = {a, b, c, d}, K_{1} (R) = {[b, a], [b, c], [b, d], [d, c], [a, c], [a, d]}

(with

[b, a]

and

[d, c]

being pure edges, and any other edge being mixed), and

K_{2} (R) = {[b, a, c], [b, d, c], [b, a, d], [a, d, c]}

. We have

ω (a) = 7

,

ω (b) = 7

,

ω (c) = 8

,

ω (d) = 6

,

ω ([b, a, c]) = ω ([b, d, c]) = ω ([b, a, d]) = ω ([a, d, c]) = 1

, while the weights of 1-simplices are displayed directly in the figure.

Figure 4. Splicing and removing tetrahedra together with their corresponding butterflies.

Figure 5. Left: an interaction structure with two interaction arcs, x and y. Right: the corresponding interaction complex.

Figure 6. A multiple-sequence alignment of three sequences

X, Y, Z

, their corresponding weighted simplices, and the associated dissimilarity complex.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Weighted Homology of Bi-Structures over Certain Discrete Valuation Rings

Abstract

1. Introduction

1.1. Background

1.2. Motivation

1.3. Organization

2. The Simplicial Loop Complex of a Bi-Structure

3. $μ$ -Splicings and the Weighted Complex of a Bi-Structure

4. Weighted Homology

5. The Inflation Map

6. The Rank of $H_{1, R} (X^{1})$

7. The Modules of the Weighted Homology

8. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

Weighted Homology of Bi-Structures over Certain Discrete Valuation Rings

Abstract

1. Introduction

1.1. Background

1.2. Motivation

1.3. Organization

2. The Simplicial Loop Complex of a Bi-Structure

3. μ -Splicings and the Weighted Complex of a Bi-Structure

4. Weighted Homology

5. The Inflation Map

6. The Rank of H 1 , R ( X 1 )

7. The Modules of the Weighted Homology

8. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

3. $μ$ -Splicings and the Weighted Complex of a Bi-Structure

6. The Rank of $H_{1, R} (X^{1})$