Simpliﬁcation of Reaction Networks, Conﬂuence and Elementary Modes

Guillaume Madelaine; Elisa Tonello; Cédric Lhoussaine; Joachim Niehren

doi:10.3390/computation5010014

,

and

¹

CRIStAL—Centre de Recherche en Informatique Signal et Automatique de Lille—CNRS UMR 9189, Université de Lille, F-59000 Lille, France

²

School of Mathematical Sciences, University of Nottingham, NG7 2RD Nottingham, UK

³

INRIA—French Institute for Research in Computer Science and Automation, 59000 Lille, France

⁴

This paper is an extended version of our paper published in the Proceedings of the International Conference on Computational Methods in Systems Biology 2016, Cambridge, UK, 21

Computation2017, 5(1), 14;https://doi.org/10.3390/computation5010014

This article belongs to the Special Issue Multiscale and Hybrid Modeling of the Living Systems

Version Notes

Order Reprints

Abstract

Reaction networks can be simplified by eliminating linear intermediate species in partial steady states. In this paper, we study the question whether this rewrite procedure is confluent, so that for any given reaction network with kinetic constraints, a unique normal form will be obtained independently of the elimination order. We first show that confluence fails for the elimination of intermediates even without kinetics, if “dependent reactions” introduced by the simplification are not removed. This leads us to revising the simplification algorithm into a variant of the double description method for computing elementary modes, so that it keeps track of kinetic information. Folklore results on elementary modes imply the confluence of the revised simplification algorithm with respect to the network structure, i.e., the structure of fully simplified networks is unique. We show, however, that the kinetic rates assigned to the reactions may not be unique, and provide a biological example where two different simplified networks can be obtained. Finally, we give a criterion on the structure of the initial network that is sufficient to guarantee the confluence of both the structure and the kinetic rates.

Keywords:

simpliﬁcation; conﬂuence; reaction network; ordinary differential equations; deterministic semantics; elementary modes; system biology; rewriting rules

1. Introduction

Chemical reaction networks are widely used in systems biology for modeling the dynamics of biochemical molecular systems [1,2,3,4]. A chemical reaction network has a graph structure that can be identified with an (unmarked) Petri net [5]. Beside of this, it assigns to each of its reactions a kinetic rate that models the reaction’s speed. Chemical reaction networks can either be given a deterministic semantics in terms of ordinary differential equations (Odes), which describes the evolution of the average concentrations of the species of the network over time, or a stochastic semantics in terms of continuous time Markov chains, which defines the evolution of molecule distributions of the different species over time. In this paper, we focus on the deterministic semantics.

Reaction networks modeling molecular biological systems—see, e.g., the examples in the BioModels database [6]—may become very large if modeling sufficient details. Therefore, biologists like to abstract whole subnetworks into single black-box reactions, usually in an adhoc manner that ignores kinetic information [7,8]. The absence or loss of kinetic information, however, limits the applicability of formal analysis techniques. Therefore, much effort has been spent on simplification methods for reaction networks that preserve the kinetic information (see [9] for an overview).

The classical example for a structural simplification method is Michaelis-Menten’s reduction of enzymatic networks with mass-action kinetics [10]. It removes the intermediate species—the complex C and enzyme E—under the assumption that their concentrations

C (t)

and

E (t)

are quasi steady, i.e., approximately constant for all time points t after a short initial phase. Segel [11] shows how to infer Michaelis-Menten’s simplification from the assumptions that

C (t)

is constant and that the conservation law

C (t) + E (t) = E (0)

holds. This is equivalent to our exact steadiness assumption for both

C (t)

and

E (t)

.

S + E \underset{k_{2} C}{\overset{k_{1} S E}{⇌}} C \overset{k_{3} C}{\to} P + E simplifies to S \overset{k_{3} (E (0) + C (0)) \frac{S}{\frac{k_{2} + k_{3}}{k_{1}} + S}}{\to} P

The Odes for C inferred from this network jointly with exact steady state assumptions for C and E entail that the concentration of substrate S must be constant too, even if the network is used in a bigger context where the intermediate C is neither produced nor consumed. In the literature, this consequence is usually mentioned but ignored when considering the production rate of product P as a function of the concentration of S for the enzymatic network in isolation (see e.g., [12]). This oversimplification can be avoided when studying the enzymatic network in the context of a larger network. For instance, the steady state assumptions for C, E, and thus S can be satisfied in the context of the reaction network with the reaction

\emptyset \overset{k_{4}}{\to} S

which produces S with constant speed

k_{4}

, and the reaction

P \overset{k_{5} P}{\to} \emptyset

which degrades P with mass-action kinetics with rate constant

k_{5}

. In this context, the concentration of P will saturate quickly under exact steady state assumptions for C, E, and thus S, as illustrated in Figure 1, while in other contexts it may grow without bound or even oscillate. The Michaelis-Menten simplification of the enzymatic network indeed preserves the dynamics of a network in any context which does not produce nor consume the intermediates E and C, under the assumption that E and C are exactly in steady state with respect to the network in the context.

Figure 1. Evolution of the concentration of

S

,

E

,

C

and

P

in enzymatic network with mass-action kinetics with the parameters

k_{1} = k_{2} = k_{3} = 1

, the initial concentrations

E (0) = 1

,

C (0) = 2

,

S (0) = 4

,

P (0) = 0

, and in the context of the network with a reaction

\emptyset \overset{k_{4}}{\to} S

which produces S with constant speed

k_{4} = 2

, and a reaction

P \overset{k_{5} P}{\to} \emptyset

which degrades P with parameter

k_{5} = 0.2

.

Whether exact steady state assumptions are realistic is an interesting question since the concentrations may be at most close to steady in practice. In the literature it has been argued that the Michaelis-Menten simplification yields a good approximation under appropriate conditions [11,13,14], which typically depend on the context. Whether such properties can be extended to more general simplification methods as developed in the present article is an interesting question but out of the scope of the paper.

Alternatively, much work was spent on simplifying the Odes inferred from a given reaction network [15,16], rather than the reaction network by itself. Indeed, any structural simplification method on the network level, that preserves the kinetic information with respect to the deterministic semantics, must induce a reduction method on the Ode level. The opposite must not be true, since some Odes may not be derivable from any reaction network or may be inferred from many different ones [17]. Furthermore, it is not clear what it could mean for an Ode simplification method to be contextual. Therefore, Ode simplification alone cannot be understood as a simplification of biological systems.

A general structural simplification algorithm for reaction networks with deterministic semantics was first presented by Radulescu et al. They proposed yet another method [18] for simplifying reaction networks with kinetic expressions in partial steady states. Their method assumes the same linearity restriction considered in this paper, preserves exactly the deterministic semantics, but uses different algorithmic techniques. Their simplification algorithm is based on a graph of intermediate species. It computes cycles for simplifying the network structure rather than on elementary modes, and spanning trees for simplifying the kinetic expressions. A set of intermediate species is eliminated in one step, leading to a unique result, that is included in the results found with the algorithm of the present paper. We have not understood yet what distinguishes this result from the others obtained with our algorithm; a clarification of this point might shed light on the relationship between the two methods. In the same paper, the authors also observe that applying the method iteratively to intermediates one by one leads to different results even with different structure. The reason is that dependency elimination is lost in this manner.

A purely structural simplification algorithm method for reaction networks without kinetic rates was proposed in [19]. The method allows to remove some intermediate species by combining the reaction producing and consuming them. For instance, one can simplify the network with the following two reactions on the left into the single reaction on the right, by removing the intermediate species B:

\begin{matrix} A_{1} + \dots + A_{n} \to B \\ B \to C_{1} + \dots + C_{m} \end{matrix}\} simplifies to A_{1} + \dots + A_{n} \to C_{1} + \dots + C_{m}

Since no partial steady state assumptions can be imposed in a kinetics free framework, the intermediate elimination rules need some further restrictions. Given these, the simplification steps were shown correct with respect to the attractor semantics. contextual equivalence relation was obtained by instantiating the general framework for observational program semantics from [20]. Rather than being based on termination as observable for concurrent programs, it relies on the asymptotic behaviours of the networks represented by the terminal connected components, which are often called attractors.

Outline

We first recall some basic notions on confluence, multisets, and commutative semigroups in Section 2. In Section 3 we recall the basics on reaction networks without kinetics and elementary flux modes. In Section 4, we present the rewrite rules for intermediate elimination, illustrate the failure of confluence, and propose a rewrite rule for eliminating dependent reactions, which however turns out to be non-confluent on its own. In Section 5 we present the refined algorithm in the case without kinetics based on the notion of flux networks for representing reaction networks, and prove its confluence by reduction to a folklore result on elementary flux modes. In Section 6, we introduce reaction networks with kinetic expressions, and extend them with kinetic constraints. In Section 7, we lift the revised algorithm to constrained flux networks with kinetics. In Section 8 we present a linearity restriction, that is preserved by reductions, and thus structurally confluent. In Section 9, we present a counter example that shows that full confluence is still not achieved, and present a further syntactic restriction based on elementary modes avoiding this problem. Section 10 provides a biological example of non-confluence with kinetics. Section 11 studies the relation between the simplification and the underlying Odes simplification. Finally, we conclude in Section 12.

2. Preliminaries

We recall basic notions on confluence of binary relations, on multisets, and more general commutative semigroups. We will denote the set of all natural numbers including 0 by

N

and the set of integers by

Z

.

2.1. Confluence Notions

We recall the main confluence notions and their relationships from the literature.

Let

(S, \sim)

be a set with an equivalence relation and

\to \subseteq S \times S

a binary relation. In most cases, ∼ will be chosen as the equality relation of the set S, which is

=^{S} = {(s, s) ∣ s \in S}

. We define

\to^{0} = \sim

and

\to^{k} = \to \circ \to^{k - 1}

for all

k \in N \ {0}

. The relation

\to^{*} = \cup_{k \in N} \to^{k}

is called the reflexive transitive closure of →.

Definition 1.

We say that a binary relation → on

(S, \sim)

is confluent if

\leftarrow^{*} \circ \to^{*} \subseteq \to^{*} \circ^{*} \leftarrow

and locally confluent if

\leftarrow \circ \to \subseteq \to^{*} \circ^{*} \leftarrow

. We say that two binary relations ⇒ and → on S commute if

\Leftarrow \circ \to \subseteq \to \circ \Leftarrow

.

The confluence notions are illustrated by the diagrams in Figure 2. Clearly, a confluence of relation → is confluent if its reflexive transitive closure

\to^{*}

commutes with itself. It is also obvious that local confluence implies confluence, and well known that the converse does not hold. In this paper, we will always use binary relations that are terminating, i.e., for any

s \in S

there exists a

k \in N

such that

{s^{'} ∣ s \to^{k} s^{'}} = \emptyset

, i.e., the length k of sequences of reduction steps starting with s is bounded. It is well known that locally confluent and terminating relations are confluent (Newman’s lemma).

Figure 2. Confluence, local confluence, and commutation.

Lemma 1.

If a binary relation → on

(S, \sim)

is confluent and commutes with ∼, then the binary relation

\sim \circ \to \circ \sim

on

(S, =_{S})

is confluent.

Definition 2.

Let

(S, \sim, \to)

and

(S^{'}, \approx, \Rightarrow)

be two sets each endowed with two binary relations. A function

T : S \to S^{'}

is called a simulation from

(S, \sim, \to)

to

(S^{'}, \approx, \Rightarrow)

if for any

s_{1}, s_{2} \in S

, if

s_{1} \sim s_{2}

then

T (s_{1}) \approx T (s_{2})

, and if

s_{1} \to s_{2}

then

T (s_{1}) \Rightarrow T (s_{2})

.

The conditions that have to be satisfied by simulations are illustrated by the diagrams in Figure 3.

Figure 3. Simulation diagrams.

2.2. Multisets

Let R be a finite set. A multiset M with elements in R is a function

M : R \to N

. For any

r \in R

we call

M (r)

the number of occurrences of r in M. We say that r is a member of multiset M and write

r \in M

if

M (r) \neq 0

. We denote by

M_{R}

the set of all multisets (over R), and will simply write

M

if the set R is clear from the context.

Given numbers

k, n_{1}, \dots, n_{k} \in N

and a subset

{r_{1}, \dots, r_{k}} \subseteq R

with k different elements, we denote by

M = n_{1} r_{1} + \dots + n_{k} r_{k} = \sum_{i = 1}^{n} n_{i} r_{i}

the multiset that for any

1 \leq i \leq k

contains

M (r_{i}) = n_{i}

occurrences of

r_{i}

and

M (r) = 0

occurrences of all other elements in R.

The sum of two multisets

M_{1} +^{M} M_{2}

is the multiset M that satisfies

M (r) = M_{1} (r) +^{N} M_{2} (r)

for all

r \in R

. The empty multiset

0^{M}

is the function that maps all elements of R to 0. The algebra of multisets

(M, +^{M}, 0^{M})

over a given set R is a commutative semigroup with a neutral element.

It should be noticed that our notation may give rise to some ambiguities, since we will also write + for the addition of natural numbers instead of

+^{N}

. This may be problematics if

R = N

. In this case, the notation introduced below we will permit us to write

{(n_{1} r_{1} + \dots + n_{k} r_{k})}^{M} = {(\sum_{i = 1}^{n} n_{i} r_{i})}^{M}

for sums of multisets and

{(n_{1} r_{1} + \dots + n_{k} r_{k})}^{N} = {(\sum_{i = 1}^{n} n_{i} r_{i})}^{N}

for sums of natural numbers.

2.3. Commutative Semigroups

Let

(G, +^{G}, 0^{G})

and

(F, +^{F}, 0^{F})

be two semigroups with neutral element. Beside of the algebras of multisets (depending on the choice of R) we are interested in the algebra of vectors of naturals

(N^{n}, +^{N^{n}}, 0^{N^{n}})

for any

n \in N

.

A homomorphism between two semigroups is a function

h : G \to F

such that

h (g_{1} +^{G} g_{2}) = h (g_{1}) +^{F} h (g_{2})

for all

g_{1}, g_{2} \in G

and

h (0^{G}) = 0^{F}

. A homomorphism

h : M_{R} \to F

on multisets is determined by the values of h on singleton multisets in

M_{R}

via the equation:

h (n_{1} r_{1} + \dots + n_{k} r_{k}) = \underset{n_{1} times}{\underset{︸}{h (1 r_{1}) +^{F} \dots +^{F} h (1 r_{1})}} +^{F} \dots +^{F} \underset{n_{k} times}{\underset{︸}{h (1 r_{k}) +^{F} \dots +^{F} h (1 r_{k})}} .

Given a homomorphism

h : M_{R} \to F

, we define the interpretation

M^{F} = h (M)

for all multisets

M \in M_{R}

. Clearly, the interpretation depends on the homomorphism h, even though only its co-domain

F

appears in our notation. This works smoothly since there will never be any ambiguity about the homomophism that is chosen. If

R = F

, then we use the homomorphism

e v a l_{F} : M_{F} \to F

with

e v a l_{F} (1 f) = f

for all elements

f \in F

. In this case, any multiset

n_{1} f_{1} + \dots + n_{k} f_{k}

with elements in

F

is evaluated to a single element

{(n_{1} f_{1} + \dots + n_{k} f_{k})}^{F} = e v a l_{F} (n_{1} f_{1} + \dots + n_{k} f_{k})

and thus by the above equation:

{(n_{1} f_{1} + \dots + n_{k} f_{k})}^{F} = \underset{n_{1} times}{\underset{︸}{f_{1} +^{F} \dots +^{F} f_{1}}} +^{F} \dots +^{F} \underset{n_{k} times}{\underset{︸}{f_{k} +^{F} \dots +^{F} f_{k}}} .

If

F = M_{R}

then we use the identity homomorphism

i d_{M_{R}} : M_{R} \to M_{R}

with

i d_{M_{R}} (M) = M

for all

M \in M_{R}

. In this case we have that

{(n_{1} r_{1} + \dots + n_{k} r_{k})}^{M_{R}} = n_{1} r_{1} + \dots + n_{k} r_{k}

is the multiset itself.

We will also use this notation in order to distinguish the operator + of multisets in

M_{N}

from the operator + of natural numbers in

N

which we overloaded (as stated earlier). For instance, if

n, m \in N

, then

{(2 n + 5 m)}^{M_{N}}

is a multiset of natural numbers while

{(2 n + 5 m)}^{N}

is a natural number. Note also that different multisets may have the same interpretation. For instance if

n = 3

and

m = 4

, then

{(2 n + 5 m)}^{N} = 26 = {(n 2 + m 5)}^{N}

where we use

e v a l_{N}

as homomorphism while

{(2 n + 5 m)}^{M_{N}} \neq {(n 2 + m 5)}^{M_{N}}

where we use

i d_{M_{N}}

as homomorphism.

For any subset

G \subseteq G

of a semigroup, we can define the (positive integer convex) cone of G, as the set of all positive integer linear combinations of elements of G:

cone (G) = {{(n_{1} g_{1} + \dots + n_{k} g_{k})}^{G} ∣ k \in N, g_{1} \dots g_{k} \in G, n_{1} \dots n_{k} \in N} .

Here we use

e v a l_{G}

as homomorphism.

3. Reaction Networks without Kinetics

Let

Spec

be a finite set of species that is totally ordered. A (chemical) solution with species in

Spec

is a multiset of species

s : Spec \to N

. A (chemical) reaction with species in

Spec

is a function

r : Spec \to Z

, which assigns to each species A the stoichiometry of A in r. A chemical reaction r consumes the chemical solution

C o n s_{r} = - r_{| {A \in Spec ∣ r (A) < 0}}

and produces the chemical solution

P r o d_{r} = r_{| {A \in Spec ∣ r (A) > 0}}

. Clearly

r (A) = P r o d_{r} (A) - C o n s_{r} (A)

for all species A, while

C o n s_{r}

and

P r o d_{r}

are disjoint multisets in chemical reactions r (since their definition is based on stoichiometries).

We will freely identify a reaction r with the pair of chemical solutions consumed and produced by r. We will denote such pairs as

C o n s_{r} \to P r o d_{r}

. For instance,

B + 2 C \to A

is the chemical reaction r with

r (A) = 1

,

r (B) = - 1

, and

r (C) = - 2

. Note also that we do not consider

2 A + B \to 3 A + 2 C

as a chemical reaction, since the species A belongs to the chemical solutions on both sides. When removing

2 A

on both sides, we obtain a chemical reaction

B \to A + 2 C

. The rewrite relation of a chemical reaction r contains all pairs of chemical solutions

(s, s^{'})

such that

s^{'} (A) = s (A) +^{N} r (A)

for all species A.

Definition 3.

A reaction network (without kinetics) over

Spec

is a finite set of chemical reactions over

Spec

, with a total order.

To any reaction network N with total order < we assign a unique vector of reactions

r = (r_{1}, \dots, r_{n})

such that

N = {r_{1}, \dots, r_{n}}

and

r_{1} < \dots < r_{n}

. Conversely, for any tuple of distinct reactions

r = (r_{1}, \dots, r_{n})

, we write

N_{r}

for the reaction network

{r_{1}, \dots, r_{n}}

with the total order

r_{1} < \dots < r_{n}

.

Any reaction network can be represented by a bipartite graph as for a a Petri net, with a node for each species and a node of a different type for each reaction. We will draw species nodes with ovals and reaction nodes with squares. An arrow labeled by k from the node of a species A to the node of a reaction r means that A is consumed k times by r, i.e.,

r (A) = - k

. Conversely, an arrow with label k from the node of a reaction r to the node of a species A means that A is produced k times by r, i.e.,

r (A) = k

. We will freely omit the labels

k = 1

.

Example 1.

Consider the reaction network presented in Figure 4. It has

m = 2

species

Spec = {X, Y}

and

n = 4

reactions

{r_{1}, \dots, r_{4}}

in that order. Reaction

r_{1}

produces two molecules of species X out of nothing, reaction

r_{2}

transforms an X into a molecule Y, while

r_{3}

transforms a molecule Y back into a molecule X. Reaction

r_{4}

degrades a molecule X.

Figure 4. A reaction network and the associated graph and stoichiometry matrix.

The set of chemical reactions defines an algebra

(R, +^{R}, 0^{R})

where

0^{R}

is the empty reaction →, and

+^{R}

is the addition of integer valued functions on

Spec

. Note that

s^{'} \to s +^{R} s \to s^{'} = 0^{R}

for any two disjoint chemical solutions s and

s^{'}

. By interpretation in this algebra (that is using the identity homomorphism), we can evaluate each multiset of chemical reactions M as a chemical reaction

M^{R}

itself, as shown in Section 2.2.

Definition 4.

An invariant of a reaction network N without kinetics is a multiset M of reactions of N such that

M^{R} = 0^{R}

. We denote the set of all invariants of N by

inv (N)

.

The reaction network in Figure 4 has the set of invariants

{{(n_{1} M_{1} + n_{2} M_{2})}^{M} ∣ n_{1}, n_{2} \in N}

where

M_{1} = r_{1} + 2 r_{4}

and

M_{2} = r_{2} + r_{3}

. We next relate the notion of invariants of a reaction network to the kernel of its stoichiometry matrix.

3.1. Stoichiometry Matrices

The stoichiometry information of a reaction network is usually collected in its stoichiometry matrix. For this we consider a set of species

Spec = {A_{1}, \dots, A_{m}}

and a reaction network

N = {r_{1}, \dots, r_{n}}

, such that both sets are totally ordered by the indices of their elements.

The stoichiometry matrix S of N is the

m \times n

matrix of integers, such that the entry of S at row i and column j is equal to

r_{j} (A_{i})

for all

1 \leq i \leq m

and

1 \leq j \leq n

. Note that reaction

r_{j}

contributes in the

j^{'} t h

column, while species

A_{j}

contributes the j’s row of S. For instance, the stoichiometry matrix of the reaction network in Figure 4 is given on the right.

It can now be noticed that, for any vector

v = (n_{1}, \dots, n_{n})

of natural numbers, the multiset

n_{1} r_{1} + \dots + n_{n} r_{n}

is an invariant of reaction network N if and only if its stoichiometry matrix satisfies

S v = 0

, i.e., if v belongs to the kernel of the stoichiometry matrix. Therefore, we define the (positive integer) kernel of a matrix S by:

\ker_{+} (S) = {v \in N^{n} ∣ S v = 0} .

3.2. Elementary Modes

The support of a vector

v = (n_{1}, \dots, n_{k})

is the subset of indices i such that

n_{i}

is non-null, i.e.,

supp (v) = {i \in {1, \dots, k} ∣ n_{i} \neq 0}

.

Definition 5.

An elementary mode of an

m \times n

matrix S over

Z

is a vector

v \in \ker_{+} (S) \ {0^{N^{n}}}

such that:

v is on an extreme ray: there exists no $v^{'} \in \ker_{+} (S) \ {0^{N^{n}}}$ such that $supp (v^{'}) ⊊ supp (v)$ , and
v is factorised: there exists no $v^{″} \in \ker_{+} (S)$ such that $v = k v^{″}$ for some natural number $k \geq 2$ .

The condition

v \in \ker_{+} (S)

means that an elementary mode must be a (positive integer) steady state of S. Geometrically, the set of all positive integer steady states forms a pointed cone, that is generated by convex combinations of its extreme rays. The first condition states that any elementary flux mode v must belong to some extreme ray of the cone. The second condition requires that an elementary mode is maximally factorised, i.e., it is the vector on the extreme ray with the smallest norm.

Theorem 1

(Folklore [21]). Let S be an

m \times n

matrix of integers. Then the set E of all elementary modes of S has finite cardinality and satisfies

\ker_{+} (S) = cone (E)

.

The intuition is

\ker_{+} (S)

is a cone with a finite number of extreme rays, so that these extreme ray generated the cone. The set of elementary modes E contains exactly one point on each of the extreme rays of

\ker_{+} (S)

. Therefore,

\ker_{+} (S) = cone (E)

, i.e., the set of elementary modes is a finite generator of

\ker_{+} (S)

.

Let us point out two differences between the definition of elementary mode considered here and in [21]. First, we added condition 2. Without this condition, any multiple of an elementary mode would be an elementary mode, so that there would be infinitely many. The double-description method as recalled there, however, computes the set of elementary modes in the above sense, so this difference is minor. Second, note that [21] considers a slightly more general problem, where some of the coordinates of v may be negative. This corresponds to the addition of reversible reactions that we do not consider in the present paper.

3.3. Elementary Flux Modes

We next lift the concept of elementary modes from matrices to reaction networks, via the stoichiometry matrix. Given a vector of reactions

r = (r_{1}, \dots, r_{n})

and a vector

v = (n_{1}, \dots, n_{n})

of natural numbers we define the multiset of reactions

v r

and the corresponding reaction

r_{v}

as follows:

v r = n_{1} r_{1} + \dots + n_{n} r_{n} and r_{v} = {(v r)}^{R} .

Definition 6.

An elementary flux mode of a reaction network

N = N_{r}

is a multiset of reactions

v r

such that the vector v is an elementary mode of the stoichiometry matrix of N.

The kernel condition

v \in \ker_{+} (S)

of elementary modes v yields that any elementary flux mode

v r

satisfies

r_{v} = 0^{R}

, i.e., the reaction defined by the elementary flux mode must be empty. For instance, reconsider the reaction network in Example 1 with

m = 2

species and

n = 4

reactions

r = (r_{1}, r_{2}, r_{3}, r_{4})

in that order. Its stoichiometry matrix has two elementary modes: the vectors

v_{1} = (1, 0, 0, 2)

and

v_{2} = (0, 1, 1, 0)

. The corresponding elementary flux modes are the multisets of reactions

v_{1} r = r_{1} + 2 r_{4}

and

v_{2} r = r_{2} + r_{3}

illustrated in Figure 5 by the arrows coloured in apricot and aquamarine respectively. First consider the multiset

r_{1} + 2 r_{4}

: the first reaction

r_{1}

produces

2 X

which are then degraded by

2 r_{4}

. So the reaction

r_{v_{1}} = {(r_{1} + 2 r_{4})}^{R} = 0^{R}

is indeed empty. Consider now the multiset of reactions

r_{2} + r_{3}

: its first reaction

r_{2}

transforms X to Y and its second reaction

r_{3}

does the inverse. Thus,

r_{v_{2}} = {(r_{2} + r_{3})}^{R} = 0^{R}

is the empty reaction too. The intuition is that applying to a chemical solution at the same time all reactions of an elementary flux mode with their multiplicities does not have any effect.

Figure 5. The elementary modes of the reaction network in Figure 4.

It should be noticed that the vector

v = (1, 1, 1, 2)

is also a solution of the steady state equation

S v = 0

, and thus the multiset of reactions

r_{1} + r_{2} + r_{3} + 2 r_{4}

is also an invariant of the example network. It is the multiset sum of two elementary flux modes

v_{1} r +^{M} v_{2} r

which is also equal to

(v_{1} +^{N^{4}} v_{2}) r

.

4. Simplifying Reaction Networks without Kinetics

We study the question whether the step-by-step intermediate elimination relation proposed in [22] is confluent in the case of reaction networks without kinetics. We present a counter example against the confluence and illustrate the reason for this problem.

4.1. Intermediate Elimination

Let

I \subseteq Spec

be a finite set of species that we will call intermediate species or intermediates for short. The simplification procedure will remove all intermediates from a given reaction network, step-by-step and in arbitrary order.

Our objective is to remove an intermediate

X \in I

from a network N by merging any pair of reactions of N, a reaction r that produces X and another reaction

r^{'}

that consumes it. This is done by the (Inter) rule in Figure 6, and is based on the merge operation

{r ⋄}_{X} r^{'}

which returns a linear combination of

r and r^{'}

and thus of the reactions in the initial network:

{r ⋄}_{X} r^{'} = {(- r^{'} (X) r + r (X) r^{'})}^{R} .

Figure 6. Simplification of reaction networks without kinetics with respect to a set

I

of intermediate species.

Since

r

produces

r (X)

molecules X while

r^{'}

consumes

r^{'} (X)

molecules X, we have (

{r ⋄}_{X} r^{'}

) (X) = 0. Therefore, X is not present in the solutions consumed and produced by reaction

{r ⋄}_{X} r^{'}

.

In Example 2 below, we will denote vectors

(n_{1}, \dots, n_{n})

of natural numbers by

1^{n_{1}} \dots n^{n_{n}}

, while freely omitting components

i^{n_{i}}

with

n_{i} = 0

and simplifying component

j^{1}

to j. For instance if

r = (r_{1}, \dots, r_{4})

, we can write

r_{14^{2}}

instead of

r_{(1, 0, 0, 2)}

.

Example 2.

We consider the network

N

in Figure 7 with species

Spec = {A, B, X, Y}

and reaction vector

r = (r_{1}, \dots, r_{4})

. We consider the elimination of the intermediates in

I = {X, Y}

in both possible orders. On the top, we first eliminate the intermediate species X from

N

, obtaining network

N_{X}

. We have to combine reaction

r_{1}

producing 2 X molecules with reaction

r_{2}

which consumes 1 X molecule. We obtain the reaction

r_{1} ⋄_{X} r_{2} = r_{12^{2}} = {r_{1} + 2 r_{2})}^{R}

that transforms one A molecule into 2 Y molecules. We proceed in the same way with the other 3 pairs of reactions that produce and consume X. Then, we can remove the intermediate species Y from network

N_{X}

and obtain the network

N_{X Y}

in the top right. Note that we keep empty reactions such as

r_{23}

. At the bottom, we show network

N_{Y}

, obtained by eliminating the intermediate species Y first. The only reaction producing Y in N is

r_{2}

and the only reaction consuming Y is

r_{3}

. Merging them produces reaction

r_{23}

. When eliminating intermediate X from

N_{Y}

, we obtain network

N_{Y X}

on the bottom right.

Figure 7. Elimination of intermediates X and Y in reaction network

N

in both possible orders, leading to two different final results

N_{X Y}

and

N_{Y X}

.

It turns out that

N_{X Y}

and

N_{Y X}

differ in that the former contains the reaction

r_{12^{2} 3^{2} 4^{2}}

in addition to the reactions

r_{14^{2}}

and

r_{23}

shared by both networks.

4.2. Eliminating Dependent Reactions

Example 2 shows that intermediate elimination with the (Inter) rule alone is not confluent, given that it may produce two different networks that cannot be simplified any further,

N_{X Y}

and

N_{Y X}

, depending on whether we first eliminate the intermediate X or the intermediate Y. The reaction network

N_{X Y}

contains an additional reaction, which is a linear combination of two other reactions:

r_{12^{2} 3^{2} 4^{2}} = {(r_{14^{2}} + 2 r_{23})}^{R} .

In order to solve this non-confluence problem, we propose the new simplification rule (Dep) in Figure 6. It eliminates a reaction that is a positive linear combination of other reactions of the network, i.e., some reaction

s_{v} = {(n_{1} r_{1} + \dots + n_{k} r_{k})}^{R}

where

s = (r_{1}, \dots, r_{k}) \in N^{k}

and

v = (n_{1}, \dots, n_{k}) \in N^{k}

for some

k \in N

.

Unfortunately, the simplification relation with rules (Inter) and (Dep) is still not confluent. The problem is that even applying rule (Dep) alone fails to be confluent as shown by the following counter example.

Example 3.

Consider the network

N^{″}

in Figure 8 in the absence of intermediates, i.e., where

I = \emptyset

. There are two ways of applying rule (Dep) to this network, since

r_{4} = {(r_{1} + 2 r_{2})}^{R}

and

r_{2} = {(r_{3} + r_{4})}^{R}

. We can thus either eliminate

r_{4}

leading to

N_{r_{4}}^{″}

or

r_{2}

leading to

N_{r_{2}}^{″}

. The two results are different even though they contain no more dependencies.

Figure 8. Dependency elimination is not confluent.

This example shows that general dependency elimination cannot be done in a confluent manner. On the other hand, what we need in order to solve the confluence problem for intermediate elimination as illustrated in Figure 7, is a little more restricted: it is sufficient to remove those dependent reactions that were introduced by intermediate elimination. Such dependencies can be identified from the vectors of natural numbers that we used to name the reactions. In the example, we have

r_{12^{2} 3^{2} 4^{2}} = {(r_{14^{2}} + 2 r_{23})}^{R}

, so the dependency of this reaction follows from the dependency of the vectors

12^{2} 3^{2} 4^{2} = {((14^{2}) + 2 (23))}^{N^{4}}

.

5. Simplifying Flux Networks

We next introduce vector representations of reaction networks without kinetics, called flux networks, and show that the simplification of such representations can indeed be done in a confluent manner.

For the reminder of this section, we fix an n-tuple

r

of distinct reactions and a subset of species

I \subseteq Spec

.

5.1. Vector Representations of Reaction Networks

The objective is to simplify the initial reaction network

N_{r}

by removing the intermediates from

I

. The iterative elimination of intermediate species generates a sequence of networks with reactions in

cone (N_{r}) = {r_{v} ∣ v \in N^{n}}

. The idea is now to use the vectors

v \in N^{n}

as representations of reactions

r_{v}

. These vectors will tell us about the provenance of the reaction obtained when simplifying the network

N_{r}

.

The mapping of vectors

v \in N^{n}

to reactions

r_{v} \in R

is a homomorphism between commutative semigroups, whose image is

cone (N_{r})

. It should be noticed, however, that it is not an isomorphism since any element of

\ker_{+} (S)

will be mapped to

0^{R}

, where S is the stoichiometry matrix of

N_{r}

. Therefore, it makes a difference whether we will work with vectors in

N^{n}

representing a reaction or with the reactions itself. Intuitively, the difference is that we know where the reaction does come from.

Definition 7.

An n-ary flux network V is a finite subset of vectors in

N^{n}

that is totally ordered.

Any n-ary flux network V defines a reaction network

r_{V} = {r_{v} ∣ v \in V}

, that we call the reaction network represented by V. The total order of the reactions in network

r_{V}

is the one induced by the total order of V.

5.2. Simplification Rules

Let

I \subseteq Spec

be a finite set of species that we call intermediates. In Figure 9, we rewrite the simplification rules (F-Inter) and (F-Dep) so that they apply to flux networks. For this we have to lift the merge operation from reactions to vectors that represent them. For any

v_{1}, v_{2} \in N^{n}

we define:

v ⋄_{X} v^{'} = {(- r_{v^{'}} (X) v + r_{v} (X) v^{'})}^{N^{n}} .

Figure 9. Simplifying flux networks for an initial n-tuple of reactions

r

and a set of intermediate species

I

.

In the rule for the dependency elimination, we now use a notation for linear combinations of vectors in

N^{n}

. Given a vector

v = (v_{1}, \dots, v_{k})

of vectors in

N^{n}

and a vector

v = (n_{1}, \dots, n_{k})

of natural numbers we define:

v_{v} = {(n_{1} v_{1} + \dots + n_{k} v_{k})}^{N^{n}} .

The counter example for the non-confluence of dependency elimination can no more be applied in this way, since rule (F-Dep) is not based on the dependency of the reactions as with (Dep) but on the dependencies of the vectors that define the reactions.

5.3. Factorization

The simplification relation with axioms (F-Inter) and (F-Dep) is still not confluent, as shown in Example 4.

Example 4.

We consider the vector of initial reactions

r = (r_{1}, r_{2}, r_{3})

of the network

N^{″}

in Figure 10. Let

I = {X, Y, Z}

be the set of intermediate species. Note that

N^{″} = r_{V_{3}}

where

V_{3} = {(1, 0, 0), (0, 1, 0), (0, 0, 1)} \subseteq N^{3}

is the flux network to which we apply the simplification algorithm. If we remove the species X first from

V_{3}

, we obtain a flux network representing the reaction network

N_{X}^{″}

, and from that we get a flux network representing

N_{X Y Z}^{″}

by eliminating Y and Z (in any order). This flux network has only one flux vector which is

1^{2} 2^{2} 3^{2} = {(2 (123))}^{N^{3}}

. If we remove Y first we obtain a flux network representing

N_{Y}^{″}

, and from that a flux network representing

N_{Y X Z}^{″}

by removing X and Z. The latter flux network is the singleton with the flux vector 123.

Figure 10. Elimination of intermediate species from flux networks in different orders is not confluent without factorization.

What is needed is a rule for the factorization of scalar multiples

{(k v)}^{N^{n}}

of vector v. This is done by the rule (F-Fact) in Figure 9, which, in the previous example, allows to simplify

N_{X Y Z}^{″}

into

N_{Y X Z}^{″}

.

We first note a consequence of the Folklore Theorem 1 and the following Lemma that is equally well known.

Lemma 2.

Let

v, v^{'} \in N^{n}

be two elementary modes of the same matrix S. If

supp (v) = supp (v^{'})

then

v = v^{'}

.

Proof.

Suppose that

supp (v) = supp (v^{'})

. Write

v = (n_{1}, \dots, n_{n})

and

v^{'} = (n_{1}^{'}, \dots, n_{n}^{'})

. Let

i \in supp (v)

be such that

n_{i} / n_{i}^{'}

is maximal. Without loss of generality we can assume that

n_{i} / n_{i}^{'} \geq 1

since otherwise, we can exchange v and

v^{'}

. Consider the vector of integers

w = n_{i} v^{'} - n_{i}^{'} v

. For any

j \in supp (v)

we have:

n_{i} n_{j}^{'} - n_{i}^{'} n_{j} = n_{i}^{'} n_{j}^{'} (n_{i} / n_{i}^{'} - n_{j} / n_{j}^{'}) \geq 0 .

Therefore,

w \in N^{n}

, and thus

w \in \ker_{+} (S)

. Furthermore,

i \notin supp (w)

so that

supp (w) ⊊ supp (v)

. Since v is an elementary mode of S this implies that

w = 0

, and thus

n_{i} v^{'} = n_{i}^{'} v

. Without loss of generality we can assume that

n_{i}

and

n_{i}^{'}

have no common prime factors. If

n_{i} = n_{i}^{'} = 1

we are done. Otherwise

n_{i} \geq 2

since

n_{i} / n_{i}^{'} \geq 1

. Thus v can be factorized by

n_{i}

, contradiction. ☐

Corollary 1.

Let S be an

m \times n

matrix of integers and

E \subseteq N^{n}

the set of all elementary modes of S. For any set

E^{'} \subseteq N^{n}

such that

cone (E^{'}) = \ker_{+} (S)

, if

E^{'}

is irreducible by

⇛_{F - FACT}

and

⇛_{F - DEP}

then

E^{'} = E

.

Proof.

By Theorem 1, we have

cone (E) = \ker_{+} (S) = cone (E^{'})

.

We first show that

E \subseteq E^{'}

. Let

v \in E

. Since

v \neq 0^{N^{n}}

and

E \subseteq cone (E^{'})

, v is of the form

v = n^{1} v^{1} + \dots + n^{k} v^{k}

for some

k \geq 1

, factorized

v^{i} \in E^{'} \ {0^{N^{n}}}

and

n^{i} \in N \ {0}

. Since all

n^{i}

and

v^{i}

are positive, it follows that

supp (v^{i}) \subseteq supp (v)

for all

1 \leq i \leq k

. Consider

i = 1

. Since v is an elementary mode and

v^{1} \in \ker_{+} (S)

, this implies that

supp (v^{1}) = supp (v)

. Since

v^{1}

is factorized, and a member of

\ker_{+} (S)

with minimal support, it is also an elementary mode of S. Lemma 2 thus implies that

v^{1} = v

, and so

v \in E^{'}

. (It also follows that

k = n^{1} = 1

).

We next show that

E^{'} \subseteq E

. Let

v \in E^{'}

. Since

E^{'} \subseteq cone (E^{'}) = cone (E)

, vector v has the form

v = n^{1} v^{1} + \dots + n^{k} v^{k}

for some

v^{i} \in E

. Since

E \subseteq E^{'}

and

E^{'}

is closed by rule (F-Dep) it follows that

k = 1

. Hence,

v = n^{1} v^{1}

. Since

E^{'}

is closed by rule (F-Fact) it follows that

n^{1} = 1

. Hence

v = v^{1} \in E

. ☐

5.4. Proving Confluence via Elementary Modes

Given a tuple of initial reactions

r

of size n and a set of intermediates

I \subseteq Spec

as parameters, we obtain a simplification relation on flux networks:

⇛_{F} =_{df} (⇛_{F - INTER} \cup ⇛_{F - DEP} \cup ⇛_{F - FACT}) .

We now show that this relation is confluent for all possible choices of the parameters. The proof is by reduction to the Corollary 1 of the folklore Theorem 1 on elementary modes. We start with an fundamental property of the diamond operator

⋄_{X}

, that we formulate in a sufficiently general manner so that is can be reused later on.

Lemma 3 (Diamond).

Let

(G, +^{G}, 0^{G}, \cdot^{G}, 1^{G})

be a commutative semi-ring and

h : N^{n} \to G

a semi-group homomorphism with respect to addition. Given a tuple

(v_{1}, \dots, v_{k})

of vectors in

N^{n}

, a tuple

(g_{1}, \dots, g_{k})

of elements of

G

, and a species

X \in Spec

, we define:

\begin{matrix} P & = & {p \in {1 \dots k} ∣ r_{v_{p}} (X) > 0}, & prod & = & {(\sum_{p \in P} r_{v_{p}} (X) g_{p})}^{G}, \\ C & = & {c \in {1 \dots k} ∣ r_{v_{c}} (X) < 0}, & cons & = & {(\sum_{c \in C} - r_{v_{c}} (X) g_{c})}^{G} \end{matrix}

It then holds that:

\begin{matrix} \sum_{p \in P}^{G} \sum_{c \in C}^{G} g_{p} \cdot^{G} g_{c} \cdot^{G} h (v_{p} ⋄_{X} v_{c}) = \sum_{p \in P}^{G} g_{p} \cdot^{G} cons \cdot^{G} h (v_{p}) +^{G} \sum_{c \in C}^{G} g_{c} \cdot^{G} prod \cdot^{G} h (v_{c}) \end{matrix}

Proof.

We use some elementary rules of commutative semi-rings to distribute and factorize the sums contained in the definition of the diamond:

\begin{matrix} \sum_{p \in P}^{G} & \sum_{G c \in C} g_{p} \cdot^{G} g_{c} \cdot^{G} h (v_{p} ⋄_{X} v_{c}) \\ = \sum_{p \in P}^{G} \sum_{c \in C}^{G} g_{p} \cdot^{G} g_{c} \cdot^{G} ({(- r_{v_{c}} (X) h (v_{p}))}^{G} +^{G} {(r_{v_{p}} (X) h (v_{c}))}^{G}) \\ = \sum_{p \in P}^{G} \sum_{c \in C}^{G} g_{p} \cdot^{G} g_{c} \cdot^{G} {(- r_{v_{c}} (X) h (v_{p}))}^{G} +^{G} \sum_{p \in P}^{G} \sum_{c \in C}^{G} g_{p} \cdot^{G} g_{c} \cdot^{G} {(r_{v_{p}} (X) h (v_{c}))}^{G} \\ = \sum_{p \in P}^{G} g_{p} \cdot^{G} {(\sum_{c \in C} - r_{v_{c}} (X) g_{c})}^{G} \cdot^{G} h (v_{p}) +^{G} \sum_{c \in C}^{G} g_{c} \cdot^{G} {(\sum_{p \in P} r_{v_{p}} (X) g_{p})}^{G} \cdot^{G} h (v_{c}) \\ = \sum_{p \in P}^{G} g_{p} \cdot^{G} c o n s \cdot^{G} h (v_{p}) +^{G} \sum_{c \in C}^{G} g_{c} \cdot^{G} p r o d \cdot^{G} h (v_{c}) \end{matrix}

☐

Our next objective is to show that the simplification preserves the invariants, when relativised to

r

. For any flux network V, we therefore define the set of relatived invariants of V as follows:

{inv}_{r} (V) = {{(n_{1} v_{1} + \dots + n_{k} v_{k})}^{N^{n}} r ∣ n_{1} r_{v_{1}} + \dots + n_{k} r_{v_{k}} \in inv (r_{V})} .

For

V_{n} = {(1, 0, \dots, 0), \dots, (0, \dots, 0, 1)} \subseteq N^{n}

with the vectors ordered in the way they are enumerated, note that we have

r_{V_{n}} = N_{r}

and

{inv}_{r} (V_{n}) = inv (N_{r})

. We next show that such relativised invariants are preserved by the simplification of flux networks.

Lemma 4.

If

V ⇛_{F} V^{'}

then

{inv}_{r} (V) = {inv}_{r} (V^{'})

.

Proof.

We assume

V ⇛_{F} V^{'}

and first show the inclusion

{inv}_{r} (V) \subseteq {inv}_{r} (V^{'})

.

Let

n_{1} v_{1} + \dots + n_{k} v_{k} \in {inv}_{r} (V)

. Then

n_{1} r_{v_{1}} + \dots + n_{k} r_{v_{k}} \in inv (r_{V})

. This means

{(n_{1} r_{v_{1}} + \dots + n_{k} r_{v_{k}})}^{R} = 0^{R}

. Since

⇛_{F}

is the union

⇛_{F - FACT} \cup ⇛_{F - DEP} \cup ⇛_{F - INTER}

, three cases are to be considered.

Case: $V ⇛_{F - FACT} V^{'}$ . Suppose that (F-Fact) replaces vector $v_{1}$ by vector $v_{1}^{'}$ so that $v_{1} = k^{'} v_{1}^{'}$ for some $k^{'} \neq 0$ . Hence $n_{1} k^{'} r_{v_{1}^{'}} + n_{2} r_{v_{2}} + \dots + n_{k} r_{v_{k}} \in inv (r_{V^{'}})$ . And thus, ${(n_{1} k^{'} v_{1}^{'} + n_{2} v_{2} + \dots + n_{k} v_{k})}^{N^{n}} r \in {inv}_{r} (V^{'})$ , which is equivalent to ${(n_{1} v_{1} + \dots + n_{k} v_{k})}^{N^{n}} r \in {inv}_{r} (V^{'})$ as required.
Case: $V ⇛_{F - DEP} V^{'}$ . By rule (F-Dep) there exist $k \in N$ , $v \in V^{k}$ and $v \in N^{k}$ such that $V = V^{'} ⊎ {v_{v}}$ . If all $v_{i}$ are distinct from $v_{v}$ then trivially $n_{1} r_{v_{1}} + \dots + n_{k} r_{v_{k}} \in inv (r_{V^{'}})$ . Otherwise, we can assume without loss of generality that $v_{1} = v_{v}$ with v and $v$ as in rule (F-Dep). Suppose that these have the forms $v = (m_{1}, \dots, m_{l})$ and $v = (w_{1}, \dots, w_{l})$ . Since $r_{v_{v}} = {(m_{1} r_{w_{1}} + \dots + m_{l} r_{w_{l}})}^{R}$ , it follows that:

$n_{1} m_{1} r_{w_{1}} + \dots + n_{1} m_{l} r_{w_{l}} + n_{2} r_{v_{2}} + \dots + n_{k} r_{v_{k}} \in inv (r_{V^{'}}) .$

This yields ${(n_{1} m_{1} w_{1} + \dots + n_{1} m_{l} w_{l} + n_{2} v_{2} + \dots + n_{k} v_{k})}^{N^{n}} r \in {inv}_{r} (V^{'})$ . Since $v_{1} = v_{v} = {(m_{1} w_{1} + \dots + m_{l} w_{l})}^{N^{n}}$ this is is equivalent to ${(n_{1} v_{1} + \dots + n_{k} v_{k})}^{N^{n}} r \in {inv}_{r} (V^{'})$ as required.
Case: $V ⇛_{F - INTER} V^{'}$ . Suppose that the intermediate species $X \in I$ was eliminated thereby. Recall that $\sum_{i = 1}^{k} n_{i} r_{v_{i}} \in inv (r_{V})$ . We can assume without loss of generality that $n_{i} \neq 0$ for all $1 \leq i \leq k$ . Let P, C, $prod$ , and $cons$ be as introduced in the Diamond Lemma 3, where $G = N^{n}$ , homomorphism h the identity on $N^{n}$ , and $g_{i} = n_{i}$ for all $1 \leq i \leq k$ . The lemma then yields:

$\begin{matrix} {(\sum_{p \in P} \sum_{c \in C} n_{p} n_{c} (v_{p} ⋄_{X} v_{c}))}^{N^{n}} = {(\sum_{p \in P} n_{p} cons v_{p} + \sum_{c \in C} n_{c} prod v_{c})}^{N^{n}} \end{matrix} .$

Since ${(\sum_{i = 1}^{k} n_{i} r_{v_{i}})}^{R} = 0^{R}$ it follows that $prod = cons$ . Furthermore, $prod \neq 0$ since otherwise $P = C = \emptyset$ so that (F-Inter) could not be applied. Since $cons = prod$ , this tuple is equal to $prod {(\sum_{p \in P} n_{p} v_{p} + \sum_{c \in C} n_{c} v_{c})}^{N^{n}}$ . With $M = {m \in {1 \dots k} ∣ r_{v_{m}} (X) = 0}$ we get:

$\begin{matrix} {(\sum_{p \in P} \sum_{q \in C} n_{p} n_{c} (v_{p} ⋄_{X} v_{c}) + \sum_{m \in M} prod n_{m} v_{m})}^{N^{n}} r = {(\sum_{i = 1}^{k} prod n_{i} v_{i})}^{N^{n}} r \end{matrix} .$

This multiset is an invariant, since ${(\sum_{i = 1}^{k} n_{i} r_{v_{i}})}^{R} = 0^{R}$ . It follows that:

$\begin{matrix} {(\sum_{p \in P} \sum_{c \in C} n_{p} n_{c} (v_{p} ⋄_{X} v_{c}) + \sum_{m \in M} n_{m} prod v_{m})}^{N^{n}} r \in inv (r_{V^{'}}) \end{matrix} .$

This implies ${(\sum_{i = 1}^{k} prod n_{i} v_{i})}^{N^{n}} r \in {inv}_{r} (V^{'})$ . Since $prod \neq 0$ and since ${inv}_{r} (V^{'})$ is closed by factorization with nonzero factors, it follows that $\sum_{i = 1}^{k} n_{i} r_{v_{i}} \in {inv}_{r} (V^{'})$ as required.

The proof of the inverse inclusion

{inv}_{r} (V) \supseteq {inv}_{r} (V^{'})

differs in that the Diamond Lemma is not needed. Let

n_{1} v_{1}^{'} + \dots + n_{k} v_{k}^{'} \in {inv}_{r} (V^{'})

. Then

n_{1} r_{v_{1}^{'}} + \dots + n_{k} r_{v_{k}^{'}} \in inv (r_{V^{'}})

. This means

{(n_{1} r_{v_{1}^{'}} + \dots + n_{k} r_{v_{k}^{'}})}^{R} = 0^{R}

. We distinguish three cases depending on which rule was applied:

Case: $V ⇛_{F - FACT} V^{'}$ . Suppose that (F-Fact) replaces vector $v_{1}$ by vector $v_{1}^{'}$ so that $v_{1} = k^{'} v_{1}^{'}$ for some $k^{'} \neq 0$ . Since ${(k^{'} n_{1} r_{v_{1}^{'}} + \dots + k^{'} n_{k} r_{v_{k}^{'}})}^{R} = 0^{R}$ we have $n_{1} r_{v_{1}} + n_{2} k^{'} r_{v_{2}^{'}} + \dots + n_{k} k^{'} r_{v_{k}^{'}} \in inv (r_{V})$ . And thus, ${(n_{1} v_{1} + n_{2} k^{'} v_{2}^{'} + \dots + n_{k} k^{'} v_{k}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ , which is equivalent to ${(n_{1} k^{'} v_{1}^{'} + \dots + n_{k} k^{'} v_{k}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ , and thus ${(n_{1} v_{1}^{'} + \dots + n_{k} v_{k}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ as required.
Case: $V ⇛_{F - DEP} V^{'}$ . By rule (F-Dep) there exist $k \in N$ , $v \in V^{k}$ and $v \in N^{k}$ such that $V = V^{'} ⊎ {v_{v}}$ . If all $v_{i}^{'}$ are distinct from $v_{v}$ then trivially $n_{1} r_{v_{1}^{'}} + \dots + n_{k} r_{v_{k}^{'}} \in inv (r_{V})$ . Otherwise, we can assume without loss of generality that $v_{1}^{'} = v_{v}$ with v and $v$ as in the rule. Suppose that these have the forms $v = (m_{1}, \dots, m_{l})$ and $v = (w_{1}, \dots, w_{l})$ . Since $r_{v_{v}} = {(m_{1} r_{w_{1}} + \dots + m_{l} r_{w_{l}})}^{R}$ , it follows that:

$n_{1} m_{1} r_{w_{1}} + \dots + n_{1} m_{l} r_{w_{l}} + n_{2} r_{v_{2}^{'}} + \dots + n_{k} r_{v_{k}^{'}} \in inv (r_{V}) .$

This yields ${(n_{1} m_{1} w_{1} + \dots + n_{1} m_{l} w_{l} + n_{2} v_{2}^{'} + \dots + n_{k} v_{k}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ . Since $v_{1}^{'} = v_{v} = {(m_{1} w_{1} + \dots + m_{l} w_{l})}^{N^{n}}$ this is is equivalent to ${(n_{1} v_{1}^{'} + \dots + n_{k} v_{k}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ as required.
Case: $V ⇛_{F - INTER} V^{'}$ . Suppose that the intermediate species $X \in I$ was eliminated thereby. We recall that $\sum_{i = 1}^{k} n_{i} r_{v_{i}^{'}} \in inv (r_{V^{'}})$ . Without loss of generality, we can assume that all elements of $V^{'}$ occur exactly once in this sum. Let $V = {v_{1}, \dots, v_{l}}$ , $P = {p ∣ r_{v_{p}} (X) > 0}$ , $C = {c ∣ r_{v_{c}} (X) < 0}$ , and $M = {m ∣ r_{v_{m}} (X) = 0}$ . If $v_{i}^{'} = v_{p} ⋄_{X} v_{c}$ for $p \in P$ and $c \in C$ , we note $o_{p c} = n_{i}$ . Otherwise, if $v_{i}^{'} = v_{m}$ with $m \in M$ , we note $o_{m} = n_{i}$ . By the rule (F-Inter) we have:

$\begin{matrix} {(\sum_{i = 1}^{k} n_{i} v_{i}^{'})}^{N^{n}} & = & {(\sum_{p \in P} \sum_{c \in C} o_{p c} v_{p} ⋄_{X} v_{c} + \sum_{m \in M} o_{m} v_{m})}^{N^{n}} \\ = & {(\sum_{p \in P} (\sum_{c \in C} o_{p c} r_{v_{c}} (X)) v_{P} + \sum_{c \in C} (\sum_{p \in P} o_{p c} r_{v_{p}} (X)) v_{C} + \sum_{m \in M} o_{m} v_{m})}^{N^{n}} \end{matrix}$

Hence ${(\sum_{i = 1}^{k} n_{i} v_{i}^{'})}^{N^{n}} r \in {inv}_{r} (V)$ .

☐

We start the reminder of the proof with the case where all species are intermediates so that

I = Spec

.

Lemma 5.

If

I = Spec

and V is irreducible by

⇛_{F - INTER}

then

{v_{v} r ∣ v \in V^{k}, v \in N^{k}, k \in N} = {inv}_{r} (V)

.

Proof.

Given that V is irreducible by

⇛_{F - INTER}

, all intermediates species must be eliminated in all reactions of

r_{V}

. Since

I = Spec

this implies that all species are eliminated in all reactions of

r_{V}

, so for all

v \in r_{V}

it follows that

r_{v} = 0^{R}

. Thus for any

v \in V

and

v \in V^{k}

we have

{(v_{v})}^{R} = 0^{R}

, so that

v_{v} \in {inv}_{r} (V)

. Hence

{v_{v} r ∣ v \in V^{k}, v \in N^{k}, k \in N} \subseteq {inv}_{r} (V)

. The inverse inclusion holds trivially. ☐

Proposition 1.

Let

I = Spec

and

V_{n} ⇛_{F}^{*} V

such that V irreducible for

⇛_{F - INTER}

. Then

cone (V) = \ker_{+} (S)

, where S is the stoichiometry matrix of

N_{r}

.

Proof.

By Lemmas 4 and 5 we have:

{v_{v} r ∣ v \in V^{k}, v \in N^{k}, k \in N} = {inv}_{r} (V) = {inv}_{r} (V_{n}) = inv (N_{r})

. This yields

{v_{v} ∣ v \in V^{k}, v \in N^{k}, k \in N} = \ker_{+} (S)

, i.e.,

cone (V) = \ker_{+} (S)

. ☐

Theorem 2.

Consider the simplification relation for flux networks

⇛_{F}

that is parametrised by

I = Spec

and a tuple of initial reactions

r

. If

V_{n} ⇛_{F}^{*} V

for some flux network V that is irreducible for

⇛_{F}

, then

V = E

, where E is the set of elementary modes of the stoichiometry matrix of

N_{r}

.

Proof.

From Proposition 1 it follows that

cone (V) = \ker_{+} (S)

where S is the stoichiometry matrix of

N_{r}

. Furthermore, V is irreducible with respect to

⇛_{F - FACT} \cup ⇛_{F - DEP}

, so that Corollary 1 implies

V = E

. ☐

Corollary 3.

The simplification relation

⇛_{F}

restricted to flux networks in the set

{V ∣ V_{n} ⇛_{F}^{*} V}

is confluent.

Proof.

We notice that

⇛_{F}

is terminating, since (F-Inter) reduces the number of intermediate species

X \in I

for which there exists a vector v such that

r_{v} (X) \neq 0

, (F-Dep) reduces the number of vectors in the set, and (F-Fact) reduces the norm of one of the vectors.

We first consider the case

Spec = I

. Let V be such that

V_{n} ⇛_{F}^{*} V

, where

⇛_{F}

is parametrised by

I

and a tuple

r

of initial reactions. Suppose that

V ⇛_{F}^{*} V_{1}

and

V ⇛_{F} V_{2}

. Since

⇛_{F}

is terminating there exist

V_{1}^{'}

and

V_{2}^{'}

that are irreducible with

⇛_{F}

such that

V_{1} ⇛_{F}^{*} V_{1}^{'}

and

V_{1} ⇛_{F}^{*} V_{2}^{'}

. Theorem 2 proves that

V_{1}^{'} = E = V_{2}^{'}

, where E is the set of elementary modes of the stoichiometry matrix of

r_{V_{n}} = N_{r}

.

We next reduce the general case where

Spec \subseteq I

to the case

Spec = I

. We define

r_{| I}

by restricting all reactions in the tuple

r

to

I

, i.e., if

r = (r_{1}, \dots, r_{n})

then

r_{| I} = ({r_{1}}_{| I}, \dots, {r_{n}}_{| I})

. We then observe that the relation

⇛_{F}

with respect to

r

coincides with the relation

⇛_{F}

with respect to

r_{| I}

. Hence the confluence result from the case

I = Spec

can be applied. ☐

As shown by Theorem 2, the exhaustive simplification of flux networks V with

⇛_{F}

can be used to compute the set of elementary modes of the stoichiometry matrix of the reaction network

r_{V}

. Interestingly, this algorithm is essentially the same as the double description method, as recalled for instance in [21]. The correspondence comes from the fact that any reaction network can be identified with its stoichiometry matrix, so that the algorithm can be formulated either for the one or the other representation. Still there is a minor difference between this algorithm and the one in [21]. The algorithm presented here is slightly more flexible, in that the rule (F-Dep) can be applied at any stage of the simplification while in the double description method as described in [21], the rule (F-Dep) is applied at the same time as the rule (F-Inter). However, as we have shown with the confluence Theorem 2, this additional freedom in the application order of the rules does not affect the final result.

6. Reaction Networks with Deterministic Semantics

We now consider reactions with kinetic expressions, and recall some basic definitions. We first define expressions and networks with kinetics. Then we recall how to associate a system of equations to a reaction network. Finally we use this system of equations to define the deterministic semantics of reaction networks.

6.1. Kinetic Expressions

We now define a class of kinetic expressions. Their syntax is the same as that of arithmetic expressions, by their semantics is by interpretation as functions of type

R_{+} \to R

,

Let

Param

be a set of parameters of type

R_{+}

. As set of variables of type

R_{+} \to R_{+}

, we will use the set

Spec

. A variable

A \in Spec

is intended to represent the temporal evolution of the concentration of A over time.

We define the set of expressions

Expr

by the terms with the abstract syntax in Figure 11. Expressions describe functions of type

R_{+}

to

R

. They are built from species A of type

R_{+}

to

R_{+}

, and constant functions defined by parameters

k \in R_{+}

, constants

c \in R

, and expressions

e (0)

, standing for the value of e at time 0. Beside of these, expressions can be constructed by addition, subtraction, multiplication, and division. For convenience, we will use parenthesis

(e)

whenever the priority of the operators might not be clear. For any species e we denote by

p Spec (e)

the subset of species that occur properly in e, that is outside of a sub-expression

e (0)

. So for instance

p Spec (B = A (0)) = {B}

.

Figure 11. Expressions where

A \in Spec, k \in Param, c \in R, and n \in N

.

The semantics of expressions is parametrised by a function

β : Param \to R_{+}

that interprets all parameters as positive real numbers. In order to simplify the notation, we assume that β is fixed, but notice that our simplification algorithms will be correct for any interpretation β.

The value of an expression

〚 e 〛_{α} \in (R_{+} \to R_{+}) \cup {⊥}

is specified in Figure 11 for any variable assignment

α : Vars \to (R_{+} \to R_{+})

. It may either be a function of type

R_{+} \to R

or undefined ⊥. The latter is necessary for the interpretation of

〚 1 / e 〛_{α}

, which is defined only if

〚 e 〛_{α} (t) \neq 0

for any time point t. We call an expression e nonnegative if

e \geq 0

is valid, i.e., if for all non-negative assignment α and all time points

t \in R_{+}

, we have

〚 e 〛_{α} (t) \geq 0

.

Definition 8.

A kinetic expression is a nonnegative expression

e \in Expr

.

6.2. Constrained Flux Networks

The next objective is to add kinetic expressions to reactions and flux networks. Furthermore, we need to be able to express constraints about these kinetic expressions in order to express partial steady state hypothesis and conservation laws. This will lead us to the notion of constrained flux networks. A reaction with kinetics expressions is a pair

r; e

where r is a reaction without kinetics and e is a kinetic expression. As before we now use flux reaction to represent a reaction but now with a kinetic expression.

Definition 9.

An n-ary flux reaction with kinetic expression is a pair

v; e

composed of a vector

v \in N^{n}

and a kinetic expression

e \in Expr

. Given a tuple of reactions

r = (r_{1}, \dots, r_{n})

, the flux reaction

v; e

represents the reaction

r_{v}; e

.

The set

C

of constraints on kinetic functions is defined in Figure 12. A constraint

C \in C

is a conjunction of atomic constraints. The first kind is an equation

e = e^{'}

stating that the expressions e and

e^{'}

must have the same value but different from ⊥. The atomic constraint

cst (e)

requires that e is a constant function,

e \neq 0

that e may never becomes equal to zero, and

e \geq 0

that e is always non-negative. More formally, we define in Figure 12 the interpretation

〚 C 〛_{α} \in B \cup {⊥}

of a constraint

C

for a given variable assignment α, where

B = {true, false}

is the set of boolean values.

Figure 12. Constraints on kinetic functions.

Definition 10.

An n-ary constrained flux network is a pair

W = V & C

where V is a set of n-ary flux reactions with kinetic expressions and

C

a constraint.

Let

W = V & C

be a constrained flux network. We denote by

Expr (W)

the set of kinetic expressions e such that

v; e \in V

or such that e occurs in the constraint

C

. We set:

p Spec (W) = {A ∣ v; e \in V, r_{v} (A) \neq 0 or A \in p Spec (e)} \cup p Spec (C) .

6.3. Systems of Constrained Equations with ODEs

We now recall how to assign systems of equations to constrained flux networks. Note that systems constrained equations may contain both constraints and ordinary differential equations (Odes) in particular.

The set of systems of constrained equations is defined in Figure 13. They are conjunctions of constraints C and Odes

\dot{A} = e

where

A \in Spec

and

e \in Expr

. Note that the constraints may subsume the non-differential arithmetic equations

e = e^{'}

. We denote by

Spec (E)

the set of (free) variables occurring in E, and by

Expr (E)

the set of expressions contained in E.

Figure 13. Systems of constrained equations with Odes.

The denotation of a system of constrained equations E is a value in

〚 E 〛_{α} \in B \cup {⊥}

as defined in Figure 13. The set of solutions of

sol (E)

is the set of assignments

α : Spec \to R_{+}

that make E true, i.e.,

sol (E) = {α ∣ 〚 E 〛_{α} = t r u e} .

We say that a constrained equation E logically implies another

E^{'}

and write

E ⊧ E^{'}

if

sol (E) \subseteq sol (E^{'})

. For instance,

true ⊧ k A + k B = k (A + B)

,

e \neq 0 ⊧ e / e = 1

,

cst (A) ⊧ \dot{A} = 0

.

Definition 11.

Two constrained equation systems E and

E^{'}

are called logically equivalent, denoted

E ⧦ E^{'}

, if they have the same solutions, i.e.,

E ⧦ E^{'} iff sol (E) = sol (E^{'}) .

Clearly,

E ⧦ E^{'}

if and only if E and

E^{'}

logically imply each other, i.e.,

E ⊧ E^{'}

and

E^{'} ⊧ E

.

6.4. Deterministic Semantics

We assign to any constrained flux network

W = V & C

a system of constrained equations

E (W)

in Figure 14. Note that

E (W)

does depend on the tuple of initial reactions

r

and the set of intermediates

I

. The system contains an Ode for any species A stating that the change of the concentration of A is equal to the sum of the rates

r_{v} (A) e

of the flux reactions

v; e \in V

. The factor

r_{v} (A)

makes the rate negative if A is consumed and positive if A is produced. It also takes care of the multiplicities of consumption and production. Finally, the constraint

C

of the constrained flux network is added to the system of constrained equations.

Figure 14. System of constrained equations of a constrained flux network

V & C

.

Example 5.

We consider the flux network for the classical Michaelis-Menten example [10]. Its system of constrained equations is then represented in Figure 15. It contains four ODEs and two constant constraints.

Figure 15. System of constrained equations for Michaelis-Menten.

6.5. Contextual Equivalence

Two constrained flux networks W and

W^{'}

are non-contextually equivalent, denoted

W ≃ W^{'}

, if their systems of constrained equations are logically equivalent:

W ≃ W^{'} iff E (W) ⧦ E {(W)}^{'} .

We now extend the definition to a contextual equivalence. The idea is that networks can be exchanged with equivalent networks in any context, without affecting the semantics. As contexts, we use flux networks themselves

W^{'} = V^{'} & C^{'}

. We define the combination of a network

W = V & C

and the context

W^{'}

as follows:

W ∣ W^{'} =_{df} V \cup V^{'} & C \land C^{'} .

We now assume a set of intermediate species

I \subseteq Spec

and call a context

W^{'}

compatible if

p Specs (W^{'}) \cap I = \emptyset

.

Definition 12.

Two constrained flux networks W and

W^{'}

are (contextually) equivalent if they have the same solutions in any compatible context, that is:

W \sim W^{'} iff \forall W^{″} compatible . W ∣ W^{″} ≃ W^{'} ∣ W^{″} .

We note that the definition of equivalence of constrained flux networks has two parameters:

r

and

I

. The equivalence ∼ depends on the tuple of initial reactions

r

, since the non-contextual equivalence relation ≃ relies on the deterministic semantics of constrained flux networks, which in turn depends on

r

. The equivalence relation ∼ also depends on the set of intermediates

I

since the notion of compatibility depends on it.

Our simplification algorithm will rewrite constrained flux networks up to logical equivalence of constraints. Therefore, we can hope for confluence only up to logical equivalence. More formally, we defined the similarity relation ≅ as the least equivalence relation on constrained flux networks that satisfies the following two inference rules for all

C, C^{'}, V, e, e^{'}

:

\frac{C ⊧ e = e^{'}}{{v; e} \cup V & C ≅ {v; e^{'}} \cup V & C}, \frac{C ⧦ C^{'}}{V & C ≅ V & C^{'}} .

The first rule states that expressions that are logically equivalent under the constraints of the constrained flux network can be replaced by each other. The second rule allows to exchange logically equivalent constraints by each other. Similar networks are trivially equivalent:

Lemma 6.

Similarity

W ≅ W^{'}

implies contextual equivalence

W \sim W^{'}

.

Proof.

Straightforward from the definitions. ☐

7. Simplification of Constrained Flux Networks

Our next objective is to simplify constrained flux networks by lifting the confluent simplification algorithm for flux networks to the case with kinetic expressions. This will require to impose partial steady state and linearity restrictions on the constrained flux networks, since otherwise, we would not know how to remove intermediates from the constrained equations assigned to the constrained flux network.

7.1. Linear Steadiness of Intermediate Species

The following restriction will allow us to eliminate an intermediate species from the constraint equations of a constrained flux network.

Definition 13.

We say that a species

X \in I

is linearly steady in a constrained flux network

V & C

if it satisfies the following four conditions:

Partial steady state: the concentration of X is steady, i.e., $C ⊧ cst (X)$ .
Linear consumption: if a reaction in V consumes X then its kinetic expression is linear in X, that is: if $v; e \in V$ such that $r_{v} (X) < 0$ then $C ⊧ e = X e^{'}$ for some expression $e^{'}$ such that $X \notin p Specs (e^{'})$ .
Independent production: if a reaction in V produces X then its kinetic expression does not contain X except for subexpressions $X (0)$ : for any $v; e \in V$ , if $r_{v} (X) > 0$ then $X \notin p Specs (e)$ .
Nonzero consumption: the consumption of X is nonzero: $C ⊧ \sum {e ∣ v; e \in V, r_{v} (X) < 0} \neq 0$ .

Suppose that X is linearly steady in

W = V & C

. Since X is in partial steady state, we have

C ⊧ cst (X)

and hence

C ⊧ \dot{X} = 0

. The constrained equations of W thus imply that the production and consumption of X are equal:

E (W) ⊧ prod = cons where \{\begin{matrix} prod = \sum {r_{v} (X) e ∣ v; e \in V, r_{v} (X) > 0} \\ cons = \sum {- r_{v} (X) e ∣ v; e \in V, r_{v} (X) < 0} \end{matrix}

The linear consumption of X imposes that

C ⊧ cons = X e

for some expression e such that

X \notin p Specs (e)

. The independent production of X imposes that

X \notin p Specs (prod)

. Because of nonzero consumption, we have:

E (W) ⊧ X = X \frac{prod}{cons} = \frac{p r o d}{e} .

where the expression

\frac{p r o d}{e}

does not contain the species X properly. Therefore, we can eliminate the variable X from the constrained equation

E (W)

by substituting X by

\frac{p r o d}{e}

. This give us hope that we can also eliminate linearly steady intermediate species from the constrained flux networks too by adapting the rule (F-Inter) to kinetic expressions.

7.2. Simplification

We now lift the simplification rules for flux networks to constrained flux networks. The lifted rules are presented in Figure 16. They define the simplification relation for constrained flux networks:

⇛_{C} =_{df} ⇛_{C - INTER} \cup ⇛_{C - MOD} \cup ⇛_{C - DEP} .

Figure 16. Simplification rules for n-ary constrained flux networks, with

I

the set of intermediate species and

r

the n-tuple of initial reactions.

The first rule (C-Inter) eliminates a linearly steady intermediate species X, by merging any pair of reactions, so that the one produces and another consumes X. The rule can be applied only under the hypothesis that the constraints of the network imply that X is linearly steady, and so that X is in partial steady state in particular. It should also be noticed that the conditions on the initial value of X are preserved by the constraint

X (0) = X \frac{prod}{cons}

. As argued above, the linear steadiness of X implies that the latter is equivalent to some other expression that does not contain species X properly. So except for constraints on the initial value

X (0)

, the species X got removed from the constrained flux network. The rule also replaces X by

X (0)

in all the kinetic expressions of reactions, in which X is used as a modifier, i.e.,

v; e \in V

such that

r_{v} = 0

and

X \in p Specs (e)

. Furthermore, the same substitution is applied to the constraints of the flux network.

The rule (C-Mod) removes an intermediate that is never a reactant or a product of a reaction, and replaces X with its initial value

X (0)

. Then the rule (C-Dep) removes a dependent reaction. In contrast to the case without kinetics, the kinetic expressions of the remaining reactions need to be modified. The last rule (C-Sim) states that simplification is applied modulo similarity of constraint flux reaction networks.

The simplification defined here is sound for the contextual equivalence relation of constrained flux networks:

Proposition 2.

Given a constrained flux network W, if

W ⇛_{C} W^{'}

then

W \sim W^{'}

.

The proof is given in Appendix C. The arguments are direct from the definitions, except that the Diamond Lemma 3 is needed in for

⇛_{C - INTER}

.

7.3. Michaelis-Menten

We illustrate the simplification on the classical Michaelis-Menten example [10].

We consider the simplification of a three-step enzymatic scheme with mass-action kinetics into a single reaction with Michaelis-Menten kinetics. In the initial network,

M M n e t

depicted in Figure 17, a substrate S can bind to an enzyme E and form a complex C. The complex can either dissociate back to S and E, or produce a product P, while releasing E. We assume here that the enzyme E and the complex C are intermediate species, i.e., they are at steady-state and cannot interact with the context. Therefore, the intermediate species E and C are linearly steady in this network.

Figure 17. Reaction networks for the Michaelis-Menten example.

M M n e t_{E}

and

M M n e t_{C}

are obtained from the initial network

M M n e t

after removing E and C respectively.

M M n e t_{C E}

is obtained after removing both C and then E in this order.

M M n e t_{E C}

is obtained by inverting the order of elimination.

We first look at the elimination of the intermediate C with (C-Inter). To this end, we merge each reaction that produces C (that is, reaction

r_{1}

) with each reaction that consumes C (reactions

r_{2}

and

r_{3}

) and obtain the network

M M n e t_{C}

. Thus, merging reactions

r_{1}

and

r_{2}

(resp.

r_{1}

and

r_{3}

) of

M M n e t

(Figure 17) results in the reaction

r_{12}

(resp.

r_{13}

) of

M M n e t_{C}

. The simplification also replaces the atomic constraint

cst (C)

with

cst (k_{1} S E / (k_{2} + k_{3}))

. Since we also have the constraint S, and the parameters are constant too, we can rewrite

cst (k_{1} S E / (k_{2} + k_{3}))

into the similar

cst (S)

. We also add the constraint

C (0) = k_{1} S E / (k_{2} + k_{3})

.

To remove E before the elimination of C, one would merge

r_{3}

with

r_{1}

,

r_{2}

with

r_{1}

, and obtain the network

M M n e t_{E}

. At this point, in both networks we have an intermediate species that is neither a product nor a reactant of any reaction, but is used as a modifier. We can then remove it with (C-Mod), replacing E with

E (0)

(resp. C with

C (0)

). We obtain respectively the networks

M M n e t_{C E}

and

M M n e t_{E C}

. Note that these networks are similar. We can rewrite

C (0) = k_{1} S E (0) / (k_{2} + k_{3})

into

E (0) = (k_{2} + k_{3}) C (0) / (k_{1} S)

, and use this equation to rewrite the kinetic rate. Additionally, we can also use it to transform the kinetic expression of

r_{12}

into the usual one for Michaelis-Menten, using the following transformation. First, we have:

\begin{matrix} E (0) & = (E (0) + C (0)) - C (0) \\ = (E (0) + C (0)) - k_{1} S E (0) / (k_{2} + k_{3}) . \end{matrix}

We can then rewrite it into:

E (0) = \frac{(E (0) + C (0)) (k_{2} + k_{3})}{k_{2} + k_{3} + k_{1} S} .

By replacing

E (0)

with this expression in the kinetic expressions of

r_{13}

in

M M n e t_{C E}

, we obtain after basic rewriting the classical rate:

k_{3} (E (0) + C (0)) \frac{S}{\frac{k_{2} + k_{3}}{k_{1}} + S} .

The following diagram illustrates the confluence of the simplifications on these networks.

8. Preservation of Linear Steadiness

As we have seen in the previous section, to remove an intermediate species X, we need to impose that X is linearly steady. When removing a set of intermediate species

I

, we then need that any

X \in I

is linearly steady, and moreover than when we remove one intermediate species, the other species in

I

remain linearly steady. We therefore introduce the following additional conditions, and denote by

LinNets

the set of networks that satisfy these conditions. We then prove that the set

LinNets

is stable under the simplification, i.e., that the simplification of a network in

LinNets

is still a network in

LinNets

.

8.1. $LinNets$

We first define the new conditions, and present some examples to motivate them. For any flux

v; e x

, we note

C o n s_{I} (r_{v}) = {X \in I ∣ r_{v} (X) < 0}

and

P r o d_{I} (r_{v}) = {X \in I ∣ r_{v} (X) > 0}

.

Definition 14.

We denote by

LinNets

the set of constrained flux networks W such that W is similar to a constrained flux network

V & C

and that for all intermediates

X \in I

:

1.: Either X is linearly steady in $V & C$ , or X is only a modifier, that is for any $v; e \in V$ we have $r_{v} (X) = 0$ .
2.: No intermediate species different from X occurs in the kinetic expression of a reaction that consumes X: for any $v; e \in V$ , if $X \in C o n s_{I} (r_{v})$ , then $Specs (e) \cap I \subseteq {X}$ .
3.: The rate of a reaction that produces X but does not consume an intermediate species does not depend on the concentration of any intermediate species: for any $v; e \in V$ , if $X \in P r o d_{I} (r_{v})$ and $C o n s_{I} (r_{v}) = \emptyset$ , then $I \cap Specs (e) = \emptyset$ .
4.: The total stoichiometry of the intermediate species in the reactant (resp. product) of a reaction is never greater than one: for any $v; e \in V$ , $| C o n s_{I} (r_{v}) | \leq 1$ and $| P r o d_{I} (r_{v}) | \leq 1$ .

Note that, as a consequence of the stoichiometry condition, the sets

C o n s_{I} (r_{v})

and

P r o d_{I} (r_{v})

are either empty or consist of a single intermediate species.

We illustrate the motivations for these new conditions on the following examples.

Let us first consider the case where a reaction consuming X has a kinetic rate that depends on another intermediate (here Y in the kinetic rate of

r_{2}

), so that Condition 2 is not satisfied. It is illustrated in Figure 18.

Figure 18. Example illustrating the need of Condition 2.

If we remove Y first by merging

r_{3}

and

r_{4}

, then we compute the expression

Y = \frac{k_{3}}{k_{4}} X

. We replace Y with this expression in the kinetic rate of

r_{2}

, obtaining a reaction with a non-linear kinetic expression

\frac{k_{2} k_{3}}{k_{4}} X^{2}

.

Similarly, consider a reaction producing X with a kinetic rate that depends on Y, i.e., a network where Condition 3 is not satisfied (Figure 19).

Figure 19. Example illustrating the need of Condition 3.

If we remove the intermediate Y, the kinetic expression of

r_{1}

becomes

\frac{k_{1} k_{3}}{k_{4}} A X

. We obtain a reaction producing X, with a kinetic expression depending on X. Therefore, the differential equation for X will not have the required form:

0 = \dot{X} = X (\frac{k_{1} k_{3}}{k_{4}} A - (k_{2} + k_{3}))

, and we cannot compute an expression for X.

This kind of situation may also appear as the result of the simplification of reactions where one intermediate has a stoichiometry greater than one, i.e., Condition 4 is not satisfied (Figure 20).

Figure 20. Example illustrating the need of Condition 4.

In this network, reaction

r_{3}

produces two molecules of Y. If we remove Y, the merging of

r_{3}

and

r_{4}

is a reaction that produces one molecule of X (and one of C), with kinetic expression

k_{3} X

.

Finally, if we have two intermediate species that are both reactants (or both products) in the same reactions (Condition 4 again), then the stoichiometry of one intermediate can become greater than one as a result of the elimination of the other intermediate (Figure 21).

Figure 21. Example illustrating the need of Condition 4.

If we remove Y, the merging of

r_{3}

and

r_{4}

is a reaction with two molecules of X as reactants.

8.2. Stability of $LinNets$

Now, we prove that

LinNets

is stable for our simplification.

We first consider the following proposition.

Proposition 3.

Let

W, W_{0}

be reaction networks such that

W_{0} \in LinNets

and

W_{0} ⇛_{C}^{*} W

. Let

v; e \in W

be a flux that depends on

v_{1}; e_{1}, \dots, v_{k}; e_{k} \in W

. Then:

there exists an index i such that $C o n s_{I} (r_{v_{i}}) = C o n s_{I} (r_{v})$ , $P r o d_{I} (r_{v_{i}}) = P r o d_{I} (r_{v})$ , and
for any $j \neq i$ , $P r o d_{I} (r_{v_{j}}) = C o n s_{I} (r_{v_{j}}) = \emptyset$ .

The proof of this proposition is quite long and requires some new notions and definitions, and is given in Appendix B.

We now prove that

LinNets

is stable for the simplification.

Proposition 4.

The set of networks

LinNets

is stable for the simplification, that is if

W \in LinNets

and

W ⇛ W^{'}

, then

W^{'} \in LinNets

.

Proof.

If the simplification is done with (C-Mod) or (C-Sim), then the conditions of

LinNets

are trivially preserved.

Let us assume that the simplification is done with the rule (C-Dep), removing a flux

v; e

that depends on

v_{i}; e_{i}

with coefficient

a_{i}

. So the simplified network contains the fluxes

v_{i}; e_{i} + a_{i} e

. By Proposition 3, there is an i such that

C o n s_{I} (r_{v_{i}}) = C o n s_{I} (r_{v})

and

P r o d_{I} (r_{v_{i}}) = P r o d_{I} (r_{v})

, and for any other

j \neq i

,

C o n s_{I} (r_{v_{j}}) = P r o d_{I} (r_{v_{j}}) = \emptyset

.

The fourth condition on the stoichiometry is trivially preserved by the simplification.

For

j \neq i

,

C o n s_{I} (r_{v_{j}}) = P r o d_{I} (r_{v_{j}}) = \emptyset

implies that the conditions on the kinetic expressions are directly satisfied. If

v_{i}; e_{i} + a_{i} e

consumes X, then since

C o n s_{I} (r_{v_{i}}) = C o n s_{I} (r_{v})

, the flux

v; e

consumes X too. Then by induction, e and

e_{i}

are linear in X, and no other intermediate species occurs in them. Therefore, this is also the case for

e_{i} + a_{i} e

.

If

v_{i}; e_{i} + a_{i} e

produces X without consuming any other intermediate, then it is also the case for

v; e

. Then by induction, e,

e_{i}

, and

e_{i} + a_{i} e

do not depend on the concentration of any intermediate species. Therefore, the kinetic conditions are satisfied, and

W^{'} \in LinNets

.

Finally, consider the case of a rule (C-Inter) applied on a species X. We denote by

prod

and

cons

the expressions defined in the rule. Since

W \in LinNets

, note that for any

Z \in I

, we have

Z \notin Vars (cons)

.

Let

v; e \in W^{'}

be a reaction such that

C o n s_{I} (r_{v}) = {Y}

. We consider the second condition, on the linearity of Y in e. If

v; e \in W

(that is the flux has not been changed at all by the simplification rule), then the linearity condition is trivially preserved by induction.

v; e

cannot be the simplification of a flux

v; e^{'}

with X as modifier, since that would contradict the linearity condition in W. So now assume the

v; e

is the merging of a flux

v_{p}; e_{p}

and

v_{c}; e_{c} \in W

. Then we have

C o n s_{I} (r_{v_{p}}) = {Y}

and

P r o d_{I} (r_{v_{p}}) = C o n s_{I} (r_{v_{c}}) = {X}

. Therefore, the linearity condition implies

e_{p} = Y e_{p}^{'}

and

e_{c} = X e_{c}^{'}

, with for any

Z \in I

,

Z \notin Spec (e_{p}^{'}), Spec (e_{c}^{'})

. Then we have

e = Y e_{p}^{'} e_{c}^{'} / cons

, and the linearity condition is satisfied in

W^{'}

.

Now let

v; e \in W^{'}

be a reaction such that

C o n s_{I} (r_{v}) = \emptyset

and

P r o d_{I} (r_{v}) = {Y}

, and consider the third linearity condition. By linearity,

v; e

cannot be the simplification of a reaction of W with X as modifier. If we had

v; e \in W

, then the condition is satisfied in W by induction, and therefore in

W^{'}

too. Assume that

v; e

is the merging of a reaction

v_{p}; e_{p}

and

v_{c}; e_{c} \in W

. Then we have

C o n s_{I} (r_{v_{p}}) = \emptyset

,

P r o d_{I} (r_{v_{p}}) = C o n s_{I} (r_{v_{c}}) = {X}

, and

P r o d_{I} (r_{v_{c}}) = {Y}

. Therefore, the linearity conditions on W imply

e_{c} = X e_{c}^{'}

, with for any

Z \in I

,

Z \notin Spec (e_{p}), Spec (e_{c}^{'})

. Then we have

e = e_{p} e_{c} / cons

, and the condition is satisfied in

W^{'}

.

Finally, consider the stoichiometric condition. We only have to verify this property for new fluxes

v; e

that are the merging of a flux

v_{p}; e_{p}

and

v_{c}; e_{c}

. We have

P r o d_{I} (r_{v_{p}}) = C o n s_{I} (r_{v_{c}}) = {X}

. Moreover, by normalization, we have

\begin{matrix} P r o d_{I} (r_{v_{p} ⋄ v_{c}}) & = & P r o d_{I} (r_{v_{c}}) \ C o n s_{I} (r_{v_{p}}) . \end{matrix}

Since

| P r o d_{I} (r_{v_{c}}) | \leq 1

, we have

| P r o d_{I} (r_{v_{p} ⋄ v_{c}}) | \leq 1

, and similarly for

C o n s_{I} (r_{v_{p} ⋄ v_{c}})

. ☐

Then the set

LinNets

is stable for the simplification. This directly implies that the simplification can remove every intermediate species in

I

.

9. Confluence of the Simplification Relation

We now study the confluence of the simplification relation. We first show that the structural confluence, that is the confluence of the fluxes without kinetics, is a direct consequence of the previous results. We next present an example that illustrates that, however, the distribution of the kinetics between the fluxes can be different. Finally, we give a criterion on the modes of a network, that guarantees the full confluence, that is confluence of the structure and the rates.

In the following, we only consider networks in

LinNets

.

9.1. Structural Confluence

We say that two constrained flux networks

W = V & C

and

W^{'} = V^{'} & C^{'}

are structurally similar, denoted

W ≅^{s t r u c} W^{'}

, if they have the same structure, that is the same fluxes when neglecting the kinetic expressions:

{v ∣ \exists e . v; e \in V} = {v^{'} ∣ \exists e^{'} . v^{'}; e^{'} \in V^{'}} .

Theorem 3 (Structural confluence).

The relation

⇛_{C}

on

(LinNets, ≅^{s t r u c})

is confluent.

Proof.

This is a direct consequence of the stability of the

LinNets

(Proposition 4) and of the confluence of the simplification without kinetics (Theorem 2). ☐

9.2. Non-Confluence of the Kinetic Rates

Let us consider the reaction network W, depicted on Figure 22, with 7 species and 6 fluxes. The intermediate species are X, Y and Z. Initially, the kinetics are all mass-action.

Figure 22. Network W and its simplifications. (top left) Network W. (top right) Network

W_{X Y Z}

after eliminating X, Y and Z (in this order). (bottom left) Network

W_{X Z Y}

after eliminating X, Z and Y. (bottom right) Network

W_{X Y Z d}

after eliminating X, Y, Z, and the dependent reaction. The new parameter is

K = k_{2} k_{3} + k_{3} k_{4} + k_{4} k_{5}

.

We can remove the intermediate species in different orders with (C-Inter). If we start by eliminating X, followed by Y and finally Z, we obtain the network

W_{X Y Z}

, while if we first eliminate X, then Z and Y, we obtain

W_{X Z Y}

. These networks are different, since

W_{X Y Z}

has one additional flux. This illustrates the necessity of the rule (C-Dep) to obtain the same network structure.

Indeed, the additional flux

v_{123456}

in

W_{X Y Z}

is dependent on

v_{123}

and

v_{456}

and can therefore be removed with (C-Dep), while updating the kinetic expressions. We then obtain exactly the network

W_{X Z Y}

. However,

v_{123456}

also depends on

v_{25}

and

v_{1346}

. Therefore, we could as well remove it while updating these reactions. In that case, we obtain the different network

W_{X Y Z d}

. This network has the same structure as

W_{X Z Y}

, but not the same distribution of rates between the fluxes.

9.3. Criterion for the Full Confluence

We now give a criterion that guaranties the full confluence of the simplification.

Definition 15.

A vector of reactions

r = (r_{1}, \dots, r_{n})

is uniquely decomposable if any mode

v \in \ker_{+} (S)

has an unique decomposition in elementary modes, where S is the stoichiometric matrix of

N_{r_{| I}}

.

Example 6.

Consider the network W represented in the Figure 22, with

I = {X, Y, Z}

. It has 4 different elementary modes:

v_{1} = (1, 1, 1, 0, 0, 0)

,

v_{2} = (1, 0, 1, 1, 0, 1)

,

v_{3} = (0, 1, 0, 0, 1, 0)

and

v_{4} = (0, 0, 0, 1, 1, 1)

. Then the mode

v = (1, 1, 1, 1, 1, 1)

can be decompose in either

v_{1} + v_{4}

, or in

v_{2} + v_{3}

. The two decompositions are illustrated in Figure 23.

r

is not uniquely decomposable, and the simplification is not confluent for the kinetic rates, as we have seen before.

Figure 23. Two decompositions of the mode

(1, 1, 1, 1, 1, 1)

in the network W.

Theorem 4 (Confluence).

If the initial vector of reactions

r

is uniquely decomposable, then the relation

⇛_{C}

on

(LinNets, ≅)

is confluent, for both the structure and the kinetic rates.

The theorem is the consequence of the following lemmas, that analyze the different critical pairs. That is, if a network W can be simplified in two different manners into

W_{1}

and

W_{2}

, then these two networks can be simplified into

W_{1}^{'}

and

W_{2}^{'}

such that

W_{1}^{'} ≅ W_{2}^{'}

.

Lemma 7.

Assume

r

is uniquely decomposable. Let W be a network such that

W ⇛_{C - DEP} W_{i}

for

i \in {1, 2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

Let

v_{i} e_{i}

be the dependent flux removed when simplifying W into

W_{i}

, for

i \in {1, 2}

, with

v_{i}; e_{i}

dependent on

v_{i}^{1}; e_{i}^{1}, \dots, v_{i}^{k_{i}}; e_{i}^{k_{i}}

with coefficients

a_{i}^{1}, \dots, a_{i}^{k_{i}}

.

If

v_{1} = v_{2}

, since

r

is uniquely decomposable, we have

{v_{1}^{1}, \dots, v_{1}^{k_{1}}} = {v_{2}^{1}, \dots, v_{2}^{k_{2}}}

. So the simplified networks are trivially the same, that is

W_{1} = W_{2}

.

Assume

v_{1} \neq v_{2}

, and that for any

i \in {1, 2}

, for any j,

v_{i} \neq v_{3 - i}^{j}

, that is

v_{1}

does not depend on

v_{2}

and reciprocally. Then we can still remove

v_{1}

in

W_{2}

, and

v_{2}

in

W_{1}

, and we find the same network modulo similarity:

W_{1}^{'} ≅ W_{2}^{'}

.

If

v_{1}

depends on

v_{2}

, and

v_{2}

depends on

v_{1}

, then since dependencies are positive linear combinations, that directly implies

v_{1} = v_{2}

.

Finally, if

v_{1} \neq v_{2}

, and

v_{1}

depends on

v_{2}

, with coefficient a, but

v_{2}

does not depend on

v_{1}

(or conversely), we have

v_{1} = \sum_{\begin{matrix} j \end{matrix}} a_{1}^{j} v_{1}^{j} + a v_{2}

and

v_{2} = \sum_{\begin{matrix} j \end{matrix}} a_{2}^{j} v_{2}^{j}

. If we remove

v_{1}

,

v_{2}; e_{2}

becomes

v_{2}; e_{2} + a e_{1}

, and can be removed. The fluxes obtained are of the form

v^{j}; e^{j} + a_{1}^{j} e_{1} + a_{2}^{j} (e_{2} + a e_{1})

. If we remove

v_{2}

first, then we can remark that

v_{1}

is still dependent, with

v_{1} = \sum_{\begin{matrix} j \end{matrix}} a_{1}^{j} v_{1}^{j} + a (\sum_{\begin{matrix} j \end{matrix}} a_{2}^{j} v_{2}^{j})

, and can be removed. We obtain the fluxes

v^{j}; e^{j} + a_{2}^{j} e_{2} + (a_{1}^{j} + a a_{2}^{j}) e_{1}

. Then the simplified fluxes are similar, and

W_{1}^{'} ≅ W_{2}^{'}

. ☐

Lemma 8.

Let W be a network such that

W ⇛_{C - DEP} W_{1}

and

W ⇛_{C - MOD} W_{2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

This case is quite trivial. Let

v; e

be the dependent flux, X the modifier, and

v^{'}; e^{'}

another reaction such that v depends on

v^{'}

with factor a. If we remove X first and then v, the flux

v^{'}; e^{'}

is simplified into

v^{'}; e^{'} [X : = X (0)] + a e [X : = X (0)]

. Otherwise, it is simplified into

v^{'}; (e^{'} + a e) [X : = X (0)]

. The two expressions are trivially similar. ☐

Lemma 9.

Let W be a network such that

W ⇛_{C - DEP} W_{1}

and

W ⇛_{C - INTER} W_{2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

The full proof is given in Appendix C. The main idea is to use Proposition 3 to prove that if X is in the dependent flux

v_{d}

, it is also in one of the fluxes

v_{i}

that

v_{d}

depends on. Therefore, if we eliminate X and combine

v_{d}

with another flux

v^{'}

, we also merge

v_{i}

with

v^{'}

. Then

v_{d} ⋄_{X} v^{'}

is still dependent on

v_{i} ⋄_{X} v^{'}

and other fluxes, and can be removed. ☐

Lemma 10.

Let W be a network such that

W ⇛_{C - MOD} W_{i}

for

i \in {1, 2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

This case is trivial, since the substitutions commute. ☐

Lemma 11.

Let W be a network such that

W ⇛_{C - MOD} W_{1}

and

W ⇛_{C - INTER} W_{2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

Once again, since the two removed species cannot be the same, this case is trivial. ☐

Lemma 12.

Let W be a network such that

W ⇛_{C - INTER} W_{i}

for

i \in {1, 2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

The full proof is given in Appendix C. The idea is that after removing one intermediate species, we can still remove the other one, either with (C-Mod) or with (C-Inter). In the second case, some dependent fluxes are generated, that we can eliminate to find the same simplified network, whatever the order of elimination of the intermediate species. ☐

10. An Example from the BioModels Database

We have shown that the simplification system that we presented can exhibit non-confluence of the rates, even in a simple scenario with a small number of intermediates. To find if such a situation occurs in practice, we investigated the SBML models in the curated BioModels database [6]. We were thus able to find a network, for the model BIOMD0000000173, that does not verify the confluence criterion, and such that two different simplified networks can be identified. Note that this was the only model not satisfying the criterion, when considering every model of the BioModels database with mass-action kinetics and with three or four linear intermediate species.

The network identified is a model of the

Smad

-based signal transduction mechanisms from the cell membrane to the nucleus, presented in [23]. We only consider here a sub-network W of this model, sufficient to illustrate the non-confluence. It is represented in Figure 24.

Figure 24. Sub-network W of the

Smad

-based signal transduction model from [23].

In this network, a molecule of

S 4_{c}

, that represents the species

Smad 4

in the cytoplasm, can bind with either a molecule of

Smad 2

in a phosphorylated form (

pS 2_{c}

) and form the complex

S 24_{c}

(reaction

r_{5}

), or with a molecule of G in a phosphorylated form (

{pG}_{c}

), and form the complex

G 4_{c}

(reaction

r_{22}

). These two reactions are reversible (

r_{5^{'}}

and

r_{22^{'}}

). The same transformations can occur in the nucleus (reactions

r_{6}

,

r_{6^{'}}

,

r_{23}

and

r_{23^{'}}

). The species

Smad 4

can also move from the cytoplasm to the nucleus, or reciprocally (

r_{1}

and

r_{1^{'}}

). Finally, the complex of

Smad 2

and

Smad 4

can move from the cytoplasm to the nucleus (

r_{7}

).

We assume that

I = {S 4_{c}, S 24_{c}, S 4_{n}, S 24_{n}}

. The network is in

LinNets

. Therefore, we can consider the elimination of the four intermediate species. According to the order of the simplification, we can then obtain two different networks, with the same structure, but with different kinetic expressions. They are represented in Figure 25. The network

W_{1}

is obtained by removing

S 4_{n}

first, then

S 24_{n}

, then

S 24_{c}

, and finally

S 4_{c}

and the dependent fluxes. The network

W_{2}

is obtained by removing

S 4_{n}

, then

S 24_{n}

,

S 4_{c}

,

S 24_{c}

and the dependent fluxes.

Figure 25. Simplified networks from W. Both networks have the same structure, and the kinetic expressions are defined in the table. The network

W_{1}

is obtained by removing in order

S 4_{n}

,

S 24_{n}

,

S 24_{c}

,

S 4_{c}

and the dependent fluxes. The network

W_{2}

is obtained by removing in order

S 4_{n}

,

S 24_{n}

,

S 4_{c}

,

S 24_{c}

and the dependent fluxes.

We now show that the criterion is not satisfied in the initial network, i.e., that W is not uniquely decomposable. We consider the following mode:

v = v_{22^{'}} + v_{1} + v_{1^{'}} + v_{5} + v_{7} + v_{6^{'}} + v_{23} .

This mode has two possible decompositions into elementary modes,

{v_{r e d}, v_{b l u e}}

and

{v_{g r e e n}, v_{m a g e n t a}}

, with:

\begin{matrix} v_{r e d} = & v_{22^{'}} + v_{5} + v_{7} + v_{6^{'}} + v_{23}, \\ v_{b l u e} = & v_{1} + v_{1^{'}}, \\ v_{g r e e n} = & v_{22^{'}} + v_{1} + v_{23}, \\ v_{m a g e n t a} = & v_{1^{'}} + v_{5} + v_{7} + v_{6^{'}} . \end{matrix}

We represent these fluxes in Figure 26, where we omit the non-intermediate species and the kinetic expressions for the sake of simplicity.

Figure 26. In red the elementary mode

v_{r e d}

, in blue

v_{b l u e}

, in green

v_{g r e e n}

, and in magenta

v_{m a g e n t a}

.

11. Simplification of Systems of Equations

In this section, we study the relation between the simplification

⇛_{C}

on reaction networks and a simplification ⇒ on systems. We show that the assignment of a system

E (W)

to a network W is a simulation for the simplifications.

11.1. Simplification of Systems of Equations

The simplification of systems is illustrated in Figure 27. The first rule replaces a constant variable x with its initial value

x (0)

, and

\dot{x}

with 0. The second rule extends the simplification to similar systems. We define the simplification:

\Rightarrow =_{df} \Rightarrow_{E - INTER} .

Figure 27. Simplification rules for systems of equations.

Lemma 13.

The simplification is correct for the equivalence, that is:

E \Rightarrow E^{'} implies E ≃ E^{'} .

Theorem 5.

The relation ⇒ on

(Syst, ≅)

is uniformly confluent.

Proof.

It is trivial, since the substitutions commute. ☐

11.2. Simulation

The assignment of a system

E (W)

to a network W is a simulation from

(LinNets, ⇛_{C})

to

(S y s t e m s, \Rightarrow^{*})

.

Lemma 14.

Given a network

W \in LinNets

, if

W ⇛_{C} W^{'}

, then

E (W) \Rightarrow^{*} E (W^{'})

.

Proof.

The rule (C-Sim) for the networks is directly imitated by the rule (E-Sim) for the systems.

For the rule (C-Mod), if

W ⇛_{C - MOD} W^{'}

, then we directly have

E (W) \Rightarrow_{E - INTER} E (W^{'})

.

For the rule (C-Inter), assume that we remove a species X from W to obtain

W^{'}

. Then

E (W) ⊧ cst (x_{X})

, therefore, we can simplify

E (W)

into a system

E^{'}

. We have to prove that

E^{'} \sim E (W^{'})

. First, observe that the systems have the same variables.

Consider a differential equation of

E (W)

, for a species

A \neq X

:

\begin{matrix} \dot{A} & = & \sum_{\begin{matrix} v; e \in W \end{matrix}} r_{v} (A) e \\ = & \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) > 0} \end{matrix}} r_{v} (A) e + \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) < 0} \end{matrix}} r_{v} (A) e + \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e . \end{matrix}

We define

prod = \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) > 0} \end{matrix}} e

and

cons = \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) < 0} \end{matrix}} e

. The differential equation in

E (W^{'})

becomes:

$\dot{A}$	=	$\sum_{\begin{matrix} v; e \in W^{'} \end{matrix}} r_{v} (A) e$
	=	$\sum_{\begin{matrix} {v; e, v^{'}; e^{'} \in W ∣ r_{v} (X) > 0, r_{v^{'}} (X) < 0} \end{matrix}} r_{v ⋄_{X} v^{'}} (A) \frac{e e^{'}}{cons} + \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)]$
	=	$\sum_{\begin{matrix} {v; e, v^{'}; e^{'} \in W ∣ r_{v} (X) > 0, r_{v^{'}} (X) < 0} \end{matrix}} (r_{v} (A) + r_{v^{'}} (A)) \frac{e e^{'}}{cons}$ $+ \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)]$
	=	$\sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) > 0} \end{matrix}} r_{v} (A) \frac{e}{cons} (\sum_{\begin{matrix} {v^{'}; e^{'} \in W ∣ r_{v^{'}} (X) < 0} \end{matrix}} e^{'}$ $) + \sum_{\begin{matrix} {v^{'}; e^{'} \in N ∣ r_{v^{'}} (X) < 0} \end{matrix}} r_{v^{'}} (A) \frac{e^{'}}{cons} (\sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) > 0} \end{matrix}} e)$
		$+ \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)]$
	=	$\sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) > 0} \end{matrix}} r_{v} (A) \frac{e cons}{cons} + \sum_{\begin{matrix} {v^{'}; e^{'} \in W ∣ r_{v^{'}} (X) < 0} \end{matrix}} r_{v^{'}} (A) \frac{e^{'} prod}{cons}$ $) + \sum_{\begin{matrix} {v; e \in W ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)]$ .

The system

E (W^{'})

also contains the constraint

X (0) = X \frac{prod}{cons}

. In addition, by the linearity conditions, we know that for any

v; e \in W

such that

r_{v} (X) > 0

, we have

X \notin Vars (e)

, so

e = e [X : = X (0)]

. For any

v^{'}; e^{'} \in W

such that

r_{v^{'}} (X) < 0

, we have

e^{'} = X e^{″}

, with

X \notin Vars (e^{″})

. Therefore,

\frac{e^{'} prod}{cons} = e^{″} \frac{X prod}{cons} = e^{″} X (0) = e^{'} [X : = X (0)]

. So we can rewrite the previous differential equation into:

\dot{A} = \sum_{\begin{matrix} v; e \in W \end{matrix}} r_{v} (A) e [X : = X (0)] .

In

E^{'}

, we directly have

\dot{A} = \sum_{\begin{matrix} v; e \in W \end{matrix}} r_{v} (A) e [X : = X (0)] .

Moreover, in

E^{'}

, the equation

\dot{X} = prod - cons

is replaced by

0 = prod [X : = X (0)] - cons [X : = X (0)]

. We then have

prod [X : = X (0)] = prod

, while

cons = X e

for some e such that

X \notin Vars (e)

. Then

cons [X : = X (0)] = X e [X : = X (0)] = X (0) e = \frac{X (0)}{X} cons

. So we can rewrite the equation

0 = prod [X : = X (0)] - cons [X : = X (0)]

into

0 = prod - \frac{X (0)}{X} cons

, and then into

X (0) = \frac{X prod}{cons}

. Therefore, the two systems

E^{'}

and

E (W^{'})

have the same differential equations and the same constraint, and they are similar.

Finally, consider the rule (C-Dep). Let

v; e

be the removed reaction, depending on

v_{1}; e_{1}, \dots, v_{k}; e_{k}

, with coefficients

a_{1}, \dots, a_{k}

. We write

V^{'}

for the set of the other fluxes in W. Let A be a species. The ordinary differential equation for A in

E (W)

is:

\dot{A} = r_{v} (A) e + \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} r_{v_{i}} (A) e_{i} + \sum_{\begin{matrix} v^{'}; e^{'} \in V^{'} \end{matrix}} r_{v^{'}} (A) e^{'} .

Since we have

r_{v} = \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} a_{i} r_{v_{i}}

, the equation is similar to:

\begin{matrix} \dot{A} & = & \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} a_{i} r_{v_{i}} (A) e + \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} r_{v_{i}} (A) e_{i} + \sum_{\begin{matrix} v^{'}; e^{'} \in V^{'} \end{matrix}} r_{v^{'}} (A) e^{'} \\ = & \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} r_{v_{i}} (A) (e_{i} + a_{i} e) + \sum_{\begin{matrix} v^{'}; e^{'} \in V^{'} \end{matrix}} r_{v^{'}} (A) e^{'} . \end{matrix}

This is the equation for A in the system

E (W^{'})

for the simplified network. Therefore,

E (W) ≅ E (W^{'})

. ☐

12. Conclusions

We have first shown that when neglecting the kinetic expressions, the elimination of linear intermediate species and dependent reactions is a reformulation of the double description method, that computes the elementary modes, and therefore that the network structure of simplified networks is unique. In a second time, when considering kinetic expressions, we provided a biological example illustrating that the simplification can produce two networks with the same structure but different kinetics. We then gave a sufficient criterion on the network structure of the initial network that guarantees the confluence of both the structure and the rates.

Note that the criterion seems to be satisfied in most cases in practice. When looking at the networks with mass-action kinetics from the BioModels database [6], and considering at most four intermediate species, only the

Smad

-model BIOMD0000000173 was identified as not satisfying the criterion. On the other hand, the linearly steadiness as well as the conditions required for a network to be in

LinNets

(such as the stoichiometry conditions, etc.) are not always satisfied in real biological networks, and these are therefore a real restriction on our simplification approach.

Acknowledgments

This work has been funded by the French National Research Agency research grant Iceberg ANR-IABI-3096.

Author Contributions

All authors contributed equally to this work and to the writing of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Soundness of the Simplification Rules for Constrained Flux Networks

We prove Proposition 2, stating that the simplification is sound for the congruence:

W ⇛_{C} W^{'}

implies

W \sim W^{'}

.

We prove, for each simplification rule, that for any context

W^{″}

with

Spec (W^{″}) \cap I = \emptyset

and for any assignment α, we have

α \in sol (E (W ∣ W^{″}))

iff

α \in sol (E (W^{'} ∣ W^{″}))

. Let us assume that

W = V & C

,

W^{'} = V^{'} & C^{'}

, and

W^{″} = V^{″} & C^{″}

. We therefore have

E (W ∣ W^{″}) = (\underset{A \in Spec}{⋀} \dot{A} = e_{A}^{V} + e_{A}^{V^{″}}) \land C \land C^{″}

where

e_{A}^{V} = \sum_{v; e \in V} r_{v} (A) e and e_{A}^{V^{″}} = \sum_{v; e \in V^{″}} r_{v} (A) e .

(C-Mod) Suppose that the removed species is

X \in I

. Since

Spec (W^{″}) \cap I = \emptyset

for any

A \in Spec

,

X \notin Vars (e_{A}^{V^{″}}) \cup Vars (C^{″})

and

e_{X}^{V^{″}} = 0

. Moreover,

\forall v; e \in V . r_{r_{v}} (X) = 0

, thus

e_{A}^{V} = 0

and the ODE for X in

E (W ∣ W^{″})

is

\dot{X} = 0

, which is also the equation for X in

E (W^{'} ∣ W^{″})

. As a consequence any solution α should verify

X = X (0)

. In addition, since

E (W ∣ W^{″})

and

E (W^{'} ∣ W^{″})

only differ by the substitution of

X (0)

for X, they have the same solutions.

(C-Dep) Let

V = V_{0} \cup \{v; e\} \cup {v_{i}; e_{i} ∣ 1 \leq i \leq n}

where

v = \sum_{1 \leq i \leq n} a_{i} v_{i}

, i.e., v depends on

v_{1}, \dots, v_{n}

with coefficients

a_{1}, \dots, a_{n}

. For any species A, we can write its ODE in

E (W ∣ W^{″})

as

\begin{matrix} \dot{A} & = & \sum_{\begin{matrix} v^{'}; e^{'} \in V_{0} \cup V^{''} \end{matrix}} r_{v^{'}} (A) e^{'} + r_{v} (A) e + \sum_{\begin{matrix} 1 \leq i \leq n \end{matrix}} r_{v_{i}} (A) e_{i} \\ = & \sum_{\begin{matrix} v^{'}; e^{'} \in V_{0} \cup V^{''} \end{matrix}} r_{v^{'}} (A) e^{'} + (\sum_{\begin{matrix} 1 \leq i \leq n \end{matrix}} a_{i} r_{v_{i}} (A)) e + \sum_{\begin{matrix} 1 \leq i \leq n \end{matrix}} r_{v_{i}} (A) e_{i} \\ = & \sum_{\begin{matrix} v^{'}; e^{'} \in V_{0} \cup V^{''} \end{matrix}} r_{v^{'}} (A) e^{'} + \sum_{\begin{matrix} 1 \leq i \leq n \end{matrix}} r_{v_{i}} (A) (a_{i} e_{i} + e) \end{matrix}

which is exactly the ODE for A in

E (W^{'} ∣ W^{″})

. In addition, the constraints in

E (W ∣ W^{″})

and

E (W^{'} ∣ W^{″})

are the same. Therefore, their solutions are identical.

(C-Sim) The soundness of this rule comes directly from Lemma 6.

(C-Inter) Suppose that the removed species is

X \in I

. We note that

e_{X}^{V^{″}} = 0

.

Let

(v_{1}, \dots, v_{k})

and

(e_{1}, \dots, e_{k})

be such that

V = {v_{1}; e_{1}, \dots, v_{k}; e_{k})

. Let P, C,

prod

, and

cons

be as in the Diamond Lemma 3, with

G

the set of kinetic expressions,

g_{i} = e_{i}

for all

1 \leq i \leq k

, and

h : N^{n} \to G

the homomorphism with

h (v) = r_{v} (A)

. P is the set of indices for fluxes that produce X, C the set of indices for fluxes that consume X, while

prod

is the total rate of production of X, and

cons

its total rate of consumption.

The Ode for X in

E (W ∣ W^{″})

is

\dot{X} = prod - cons

. Since X is linearly steady in W, it follows that

C ⊧ (prod = cons) \land cons \neq 0

. Therefore, we have that

C ⧦ C [X : = X (0)] \land X (0) = X \frac{prod}{cons}

. Let

A \neq X

be a species. The Ode for A in

E (W ∣ W^{″})

writes as

\begin{matrix} \dot{A} = & \sum_{\begin{matrix} p \in P \end{matrix}} r_{v_{p}} (A) e_{p} + \sum_{\begin{matrix} c \in C \end{matrix}} r_{v_{c}} (A) e_{c} + \sum_{\begin{matrix} {v; e \in V \cup V^{''} ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e . \end{matrix}

We next consider the Ode for A in

E (W^{'} ∣ W^{″})

and show that it can be rewritten to obtain the same result. We have

\begin{matrix} \dot{A} & = & \sum_{\begin{matrix} p \in P \end{matrix}} \sum_{\begin{matrix} c \in C \end{matrix}} r_{v_{p} ⋄_{X} v_{c}} (A) \frac{e_{p} e_{c}}{cons} + \sum_{\begin{matrix} m \in M \end{matrix}} r_{v_{m}} (A) e_{m} [X : = X (0)] + \sum_{\begin{matrix} v; e \in V^{''} \end{matrix}} r_{v} (A) e \\ = & \frac{1}{cons} \sum_{\begin{matrix} p \in P \end{matrix}} \sum_{\begin{matrix} c \in C \end{matrix}} e_{p} e_{c} h (v_{p} ⋄_{X} v_{c}) + \sum_{\begin{matrix} {v; e \in V \cup V^{''} ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)] (X \notin p Specs (V^{''}) by compatibility) \\ = & \frac{1}{cons} (\sum_{\begin{matrix} p \in P \end{matrix}} e_{p} cons h (v_{p}) + \sum_{\begin{matrix} c \in C \end{matrix}} e_{c} prod h (v_{c})) + \sum_{\begin{matrix} {v; e \in V \cup V^{''} ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)] (Diamond Lemma 3) \\ = & \sum_{\begin{matrix} p \in P \end{matrix}} e_{p} r_{v_{p}} (A) + \sum_{\begin{matrix} c \in C \end{matrix}} e_{c} r_{v_{c}} (A) + \sum_{\begin{matrix} {v; e \in V \cup V^{''} ∣ r_{v} (X) = 0} \end{matrix}} r_{v} (A) e [X : = X (0)] (C ⊧ prod = cons \land cons \neq 0) . \end{matrix}

Without the substitution

[X : = X (0)]

, this is indeed the equation of A in

E (W ∣ W^{″})

. The substitution is permitted since we argue modulo similarity of constrained flux networks, and since

C ⊧ cst (X)

. Therefore,

E (W ∣ W^{″}) ⧦ E (W^{'} ∣ W^{″})

so that

W \sim W^{'}

.

Appendix B. Proofs for the Stability of LinNets

We first need to introduce some new notions.

We write

r = (r_{1}, \dots, r_{n})

for the initial vector of reactions, and

(v_{1}, \dots, v_{n})

for the corresponding unit vectors, that is, for any i,

r_{v_{i}} = r_{i}

. Let

W_{0} \in LinNets

be a constrained flux network, and W a network obtained by simplifying

W_{0}

, that is

W_{0} ⇛_{C}^{*} W

.

We first introduce the notion of paths. We will then relate them to the fluxes in W. Note that we need to distinguish between the case of circular and the case of non-circular path.

Definition A1.

Let

r

be the initial vector of reactions.

a path $\tilde{v} = v_{1} \dots v_{k}$ is a (non empty) sequence of unit vectors $v_{i} \in N^{n}$ , such that for any $1 \leq i < k$ , we have $P r o d_{I} (r_{v_{i}}) = C o n s_{I} (r_{v_{i + 1}}) \neq \emptyset$ ;
we denote the vector of a path by $\sum \tilde{v} = \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} v_{i}$ ;
a path is circular if $P r o d_{I} (r_{v_{k}}) = C o n s_{I} (r_{v_{1}}) \neq \emptyset$ , and non-circular otherwise;
for a circular path $\tilde{v}$ , we denote the number of intermediate species occurring in the path by $\tilde{v} (X) = | {1 \leq i \leq k ∣ P r o d_{I} (r_{v_{i}}) = {X}} |$ ;
for a non-circular path $\tilde{v}$ , we denote its beginning and its end by $C o n s_{I} (r_{\tilde{v}}) = C o n s_{I} (r_{v_{1}})$ and $P r o d_{I} (r_{\tilde{v}}) = P r o d_{I} (r_{v_{k}})$ . In addition, we define the multiset $\tilde{v} (X) = | {1 \leq i < k ∣ P r o d_{I} (r_{v_{i}}) = C o n s_{I} (r_{v_{i + 1}}) = {X}} |$ . Note that we do not count $C o n s_{I} (r_{\tilde{v}})$ and $P r o d_{I} (r_{\tilde{v}})$ in this multiset.

Example A1.

For instance, consider the initial network

W_{0}

, with

I = {X, Y, Z}

, in Figure A1 (left). We denote by

v_{i}

the unit vector for reaction

r_{i}

.

The path

v_{1} v_{2} v_{3}

is non-circular. It has for vector

(1, 1, 1, 0, 0)

. We have

C o n s_{I} (r_{v_{1} v_{2} v_{3}}) = \emptyset

and

P r o d_{I} (r_{v_{1} v_{2} v_{3}}) = Z

. The multiset is defined by

v_{1} v_{2} v_{3} (X) = v_{1} v_{2} v_{3} (Y) = 1

, and

v_{1} v_{2} v_{3} (Z) = 0

(since Z is the end of the path, and not an intermediate node).

The path

v_{3} v_{4}

is circular. It has for vector

(0, 0, 1, 1, 0)

. The multiset is defined by

v_{3} v_{4} (X) = 0

and

v_{3} v_{4} (Y) = v_{3} v_{4} (Z) = 1

.

Figure A1. Networks

W_{0}

(left); and W (right).

We will see later that if a path

\tilde{v}

satisfies some conditions w.r.t. some reaction network W, then there is a corresponding flux v in W such that

\sum \tilde{v} = v

. The reciprocal property also holds. Such a particular path, called a flux-path, is formally defined as follows.

Definition A2.

Let

W, W_{0}

be reaction networks such that

W_{0} ⇛_{C}^{*} W

,

a non-circular flux-path $\tilde{v}$ is a non-circular path in W such that for any intermediate species X with $\tilde{v} (X) > 0$ , we have $X \in Spec (W_{0}) \ Spec (W)$ (meaning that one of the simplification steps $W_{0} ⇛_{C}^{*} W$ removes X from $W_{0}$ ), and such that $C o n s_{I} (r_{\tilde{v}})$ and $P r o d_{I} (r_{\tilde{v}})$ are either the empty solution ∅, or the intermediate species that are still in W,
a circular flux-path $\tilde{v}$ is a circular path in W if there is at most one intermediate species X such that $\tilde{v} (X) > 0$ and $X \in Spec (W)$ (i.e., X is not yet simplified),
we call flux-path a path that is either a circular or a non-circular flux-path.
a flux-path $\tilde{v}$ is said to correspond to a flux v in W if $\sum \tilde{v} = v$ .

Example A2.

Let us consider the simplified network W in Figure A1 (right).

The non-circular path

v_{1} v_{2} v_{3}

is not a non-circular flux-path, since

v_{1} v_{2} v_{3} (X) > 0

and X has not been removed. The path

v_{2} v_{3}

is a non-circular flux-path, and corresponds to the flux

v_{23}

.

The path

v_{3} v_{4}

is a circular flux-path, since there is a unique intermediate species, Z, that has not been removed and such that

v_{3} v_{4} (Z) > 0

. It corresponds to

v_{34}

.

We first prove the following lemma on the dependent flux.

Lemma A1.

Let

v; e

be a flux that depends on

v_{1}; e_{1}, \dots, v_{k}; e_{k}

with coefficients

a_{1}, \dots, a_{k}

. For any i, let

{\tilde{v}}_{i}

be a corresponding flux-path for

v_{i}

. Then:

v = \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} a_{i} \sum {\tilde{v}}_{i} .

Proof.

We directly have

v = \sum_{\begin{matrix} 1 \leq i \leq k \end{matrix}} a_{i} v_{i}

and for any i,

v_{i} = \sum {\tilde{v}}_{i}

. ☐

We can now prove the key lemma of this section.

Lemma A2.

Let

W, W_{0}

be reaction networks such that

W_{0} \in LinNets

and

W_{0} ⇛_{C}^{*} W

. Then the following properties hold:

1.

for any flux

v; e \in W

, there is a corresponding flux-path

\tilde{v}

for W such that

\sum \tilde{v} = v

. Moreover, if v is not dependent then, for any intermediate species X, we have

\tilde{v} (X) \leq 1

;

2.

for any flux-path

\tilde{v}

for W such that for any X,

\tilde{v} (X) \leq 1

, there is a corresponding flux

v; e \in W

, that is

\sum \tilde{v} = v

;

3.

if

v; e \in W

depends on

v_{1}; e_{1}, \dots, v_{k}; e_{k} \in W

, then

there exists an index i such that $C o n s_{I} (r_{v_{i}}) = C o n s_{I} (r_{v})$ , $P r o d_{I} (r_{v_{i}}) = P r o d_{I} (r_{v})$ , and
for any $j \neq i$ , $P r o d_{I} (r_{v_{j}}) = C o n s_{I} (r_{v_{j}}) = \emptyset$ and any flux-path $\tilde{v_{j}}$ that corresponds to v is circular.

Proof.

We proceed by induction on the simplification steps. We start by proving each conclusion of the Lemma for the base case, that is

W = W_{0}

.

(1): for flux $v; e$ in the initial network $W_{0}$ , v is necessarily a unary vector $v_{i}$ for some i. So we can directly associate the flux-path $\tilde{v} = v_{i}$ that trivially corresponds to v. Since it is a unary vector, the flux v is also necessarily not dependent. Because a flux-path of size 1 is always non-circular, we also have, for any intermediate species X, $\tilde{v} (X) = 0$ , and thus $\tilde{v} (X) \leq 1$ as required.
(2): any flux-path $\tilde{v}$ for $W_{0}$ is necessarily of size 1. Otherwise, for $\tilde{v}$ being a non-circular flux-path, there would exist some $X \in Specs (W_{0}) \ Specs (W_{0}) = \emptyset$ . And for $\tilde{v}$ being a circular flux-path, there would exist at least two species X and Y such that $\tilde{v} (X) > 0$ and $\tilde{v} (Y) > 0$ , which contradicts the definition of circular flux-path. Then there exists $v_{i}; e \in W_{0}$ such that $v_{i} = \tilde{v}$ .
(3): as said above, a flux $v; e \in W_{0}$ v can not be dependent.

Now, considering the inductive case, we assume that the Lemma is true for a network

W^{'}

such that

W_{0} ⇛_{C}^{k} W^{'}

(for some

k > 0

) and

W^{'} ⇛_{C} W

. If

W^{'} ⇛_{C - MOD} W

or

W^{'} ⇛_{C - SIM} W

, only the kinetics are modified between

W^{'}

and W. Therefore, the Lemma is still true in W. It remains to investigate the cases

W^{'} ⇛_{C - DEP} W

and

W^{'} ⇛_{C - INTER} W

.

(C-Dep) Assuming that

W^{'} ⇛_{C - DEP} W

, we prove that each point of the Lemma is satisfied by W.

(1): Let $v; e \in W$ , then it is the case that $v; e^{'} \in W^{'}$ for some expression $e^{'}$ because the rule (C-Dep) only removes a dependent flux and modifies some kinetic expressions. By induction hypothesis, there is a flux-path $\tilde{v}$ for $W^{'}$ such that $\sum \tilde{v} = v$ . Also, because $Specs (W) = Specs (W^{'})$ , any flux-path for $W^{'}$ is also a flux-path for W, which proves that $\tilde{v}$ is a flux-path for v. Finally, if v is dependent in W, it is necessarily dependent in W and satisfies $\forall X \in I . \tilde{v} (X) \leq 1$ by induction hypothesis.
(2): Let $\tilde{v}$ be a flux-path for W such that, for any intermediate species X, $\tilde{v} (X) \leq 1$ . Again, because $Specs (W) = Specs (W^{'})$ , $\tilde{v}$ is also a flux-path for $W^{'}$ . By induction hypothesis, there is a corresponding flux $v; e \in W^{'}$ . If $v; e$ is not the flux that is removed by the application of (C-Dep), then this flux still occurs in W (possibly with an updated kinetic) and we conclude directly. We now show that it can not actually be otherwise, and more precisely, that assuming v removed by (C-Dep) contradicts $\tilde{v} (X) \leq 1$ .
Suppose that v is removed by (C-Dep), then v depends on some fluxes $v_{1}; e_{1}, \dots v_{k}; e_{k}$ in $W^{'}$ . For the sake of simplicity, we only consider the case where $C o n s_{I} (r_{\tilde{v}}) = X$ for some intermediate $X \in W$ . The other case works similarly. We necessarily have $k > 1$ , because $k = 1$ contradicts the linearity assumption. By induction hypothesis and Point 3, there exists in particular at least one j such that $P r o d_{I} (r_{v_{j}}) = C o n s_{I} (r_{v_{j}}) = \emptyset$ . Again, by induction hypothesis, there exists $\tilde{v^{'}} = v_{1}^{'}, \dots, v_{l}^{'}$ a circular flux-path corresponding to $v_{j}$ in $W^{'}$ . Since it is circular, there exist some intermediate species $X_{1}, \dots, X_{l}$ such that for any $1 \leq i < l$ , $C o n s_{I} (r_{v_{i}^{'}}) = X_{i}$ and $P r o d_{I} (r_{v_{i}^{'}}) = X_{i + 1}$ , and $C o n s_{I} (r_{v_{l}^{'}}) = X_{l}$ , $P r o d_{I} (r_{v_{l}^{'}}) = X_{1}$ . Note that we also have $X_{i} \neq X$ , since, by definition of flux-path, $X_{i} \notin W^{'}$ for any i, while $X \in W$ . Since $\sum \tilde{v} = v = \sum v_{k}$ and $v_{j} = \sum \tilde{v^{'}} = \sum_{\begin{matrix} i \end{matrix}} v_{i}^{'}$ , the unit vectors $v_{i}^{'}$ also appear in the flux-path $\tilde{v}$ . So the intermediate species $X_{i}$ are present at least one time in $\tilde{v}$ . Moreover, there is at least one i such that in $\tilde{v}$ , the unit vector $v_{i}$ is preceded by a unit vector that is not one of $\tilde{v^{'}}$ . Therefore, $X_{i}$ is produced by another flux, i.e., $\tilde{v} (X) > 1$ , which contradicts the hypothesis.
(3): Let $v; e \in W$ be a flux dependent on $v_{1}; e_{1}, \dots, v_{n}; e_{n} \in W$ , that is, in particular, $v = \sum_{1 \leq i \leq k} n_{i} v_{i}$ for some $n_{i} > 0$ . Since (C-Dep) removes one flux and possibly modifies some kinetics, there is a flux $v; e^{'} \in W$ in $W^{'}$ that either depends on $v_{1}; e_{1}^{'}, \dots, v_{n}; e_{n}^{'} \in W$ or on $v_{0}; e_{0}^{'}, v_{1}; e_{1}^{'}, \dots, v_{n}; e_{n}^{'} \in W$ where $v_{0}; e_{0}^{'}$ is the flux removed by (C-Dep). The latter case is not possible, since it would imply that $v = \sum_{1 \leq i \leq k} n_{i} v_{i} = n_{0} v_{0} + \sum_{1 \leq i \leq k} n_{i} v_{i}$ for some $n_{0} > 0$ and unary vector $v_{0}$ . Thus, we conclude that $v; e^{'} \in W$ depends on $v_{1}; e_{1}^{'}, \dots, v_{n}; e_{n}^{'} \in W$ that, by induction hypothesis, satisfies the conditions of Point 3.

(C-Inter) Now, assuming that

W^{'} ⇛_{C - INTER} W

, we again prove that each point of the Lemma is satisfied by W.

(1)

Let

v; e

be in W. Either there is a corresponding flux

v; e^{'} \in W^{'}

, and we conclude directly by induction hypothesis, or

v; e

is the result of merging some

v_{p}; e_{p} \in W^{'}

that produces X and some

v_{c}; e_{c} \in W^{'}

that consume it. In this case, by induction hypothesis, there are some corresponding flux-paths

\tilde{v_{p}}

and

\tilde{v_{c}}

. The concatenation

\tilde{v} = \tilde{v_{p}} \tilde{v_{c}}

of these paths is a flux-path. Indeed,

the production of $\tilde{v_{p}}$ coincides with the consumption of $\tilde{v_{c}}$ because there is an intermediate species X such that $P r o d_{I} (r_{v_{p}}) = C o n s_{I} (r_{v_{c}}) = {X}$ , $P r o d_{I} (r_{\tilde{v_{p}}}) = P r o d_{I} (r_{v_{p}})$ and $C o n s_{I} (r_{\tilde{v_{c}}}) = C o n s_{I} (r_{v_{c}})$ .
if $\tilde{v}$ is non-circular, for any intermediate species Y such that $\tilde{v} (Y) > 0$ , either $Y = X$ and $Y \in Specs (W_{0}) \ Specs (W)$ , or, $\tilde{v_{p}} (Y) > 0$ or $\tilde{v_{c}} (Y) > 0$ and by induction hypothesis, $Y \in Specs (W_{0}) \ Specs (W^{'})$ that is $Y \in Specs (W_{0}) \ Specs (W)$ .
if $\tilde{v}$ is circular, there exists an intermediate species Y which is both consumed by $v_{p}$ and produced by $v_{c}$ . The flux-path $\tilde{v_{c}}$ cannot be circular, as this would imply $C o n s_{I} (r_{v_{c}}) = P r o d_{I} (r_{v_{c}}) = \emptyset \neq X$ , and similarly for $\tilde{v_{p}}$ . By definition of non-circular flux-path, X and Y are the only two species in $\tilde{v_{c}}$ and $\tilde{v_{p}}$ such that $X \in Specs (W^{'})$ and $Y \in Specs (W^{'})$ . Thus, Y is the only non-eliminated intermediate species in $\tilde{v}$ w.r.t. W, meaning again that $\tilde{v}$ is indeed a circular flux-path.

Moreover,

\tilde{v}

trivially corresponds to v. We prove with Point 3 that if

\tilde{v}

is non-dependent, then

\tilde{v} (X) \leq 1

for any X.

(2)

Let

\tilde{v}

be a flux-path for W such that for any Y,

\tilde{v} (Y) \leq 1

. Let X be the intermediate species removed by (C-Inter), in particular

\tilde{v} (X) \leq 1

, hence either

\tilde{v} (X) = 0

or

\tilde{v} (X) = 1

. Since

Specs (W) = Specs (W^{'}) \ {X}

, if

\tilde{v} (X) = 0

, then

\tilde{v}

is also a flux-path for

W^{'}

, so by induction there is a corresponding flux

v; e^{'} \in W^{'}

. Since

\tilde{v} (X) = 0

, we still have a flux

v; e \in W

, that corresponds to

\tilde{v}

. If

\tilde{v} (X) = 1

then we can decompose

\tilde{v}

into

\tilde{v_{p}}

producing X and

\tilde{v_{c}}

consuming X such that

\tilde{v} = \tilde{v_{p}} \tilde{v_{c}}

.

\tilde{v_{p}}

and

\tilde{v_{c}}

can not be circular, as this would imply that X is both consumed and produced by

\tilde{v_{p}}

and by

\tilde{v_{c}}

, contradicting the fact that

\tilde{v} (X) = 1

. Therefore

\tilde{v_{p}} (X) = \tilde{v_{c}} (X) = 0

. Again, because

Specs (W) = Specs (W^{'}) \ {X}

and X is the species removed by (C-Inter) and

\tilde{v}

is a flux-path,

\tilde{v_{p}}

and

\tilde{v_{c}}

are (non-circular) flux-paths for

W^{'}

. We can then apply the induction hypothesis and infer that there are some corresponding fluxes

v_{p}, v_{c} \in W^{'}

, the first one that produces X and the second one that consumes it. Consequently, there is a flux

v; e \in W

that is the merging of

v_{p}

and

v_{c}

, and that corresponds to

\tilde{v}

.

(3)

Let

v; e \in W

be a flux that depends on

v_{1}; e_{1}, \dots, v_{k}; e_{k} \in W

and X be the intermediate species removed by (C-Inter). We distinguish two cases: either (case 1)

v; e

is the simplification of some flux

v; e^{'}

(meaning that v does neither produce nor consume X) or (case 2) it results from merging fluxes that produce and consume X.

(Case 1) By induction hypothesis and Point 1, there exists a flux-path

\tilde{v}

corresponding to v for

W^{'}

. We have

\tilde{v} (X) = 0

since X has not been removed. Suppose that there is

i \in {1, \dots, k}

such that

v_{i}

is the merging, by (C-Inter), of fluxes that produce and a consume X. In this case, for any

\tilde{v_{i}}

corresponding to

v_{i}

,

\tilde{v_{i}} (X) > 0

and, by the Lemma A1, we would have that

\tilde{v} (X) > 0

, which contradicts

\tilde{v} (X) = 0

. Therefore, none of the

v_{i}

s are the merging of other fluxes by (C-Inter), therefore, there are

v_{1}; e_{1}^{'}, \dots, v_{k}; e_{k}^{'} \in W^{'}

such that

v; e^{'}

depends on those fluxes. Since, by induction hypothesis, Point 3 is satisfied for

v; e^{'}

in

W^{'}

, it is also satisfied for

v; e

in W.

(Case 2) Let

v_{p}; e_{p} \in W^{'}

be a flux that produces X, and

v_{c}; e_{c} \in W^{'}

a flux that consumes it and

v; e \in W

their merging. Let

\tilde{v_{p}}

and

\tilde{v_{c}}

be flux-paths in

W^{'}

for, respectively,

v_{p}

and

v_{c}

(such flux paths exist by induction hypothesis). Any corresponding path

\tilde{v}

is then the concatenation of corresponding paths

\tilde{v_{p}}

and

\tilde{v_{c}}

.

We first prove that, if either

v_{p}; e_{p}

or

v_{c}; e_{c}

is dependent in

W^{'}

, then

v; e

is also dependent in W. By induction, if

v_{p}; e_{p}

depends on

v_{1}^{'}; e_{1}^{'}, \dots, v_{ℓ}^{'}; e_{ℓ}^{'}

, there exists a unique i such that

C o n s_{I} (r_{v_{i}^{'}}) = C o n s_{I} (r_{v_{p}})

,

P r o d_{I} (r_{v_{i}^{'}}) = P r o d_{I} (r_{v_{p}}) = {X}

, and, for any

j \neq i

,

C o n s_{I} (r_{v_{j}^{'}}) = C o n s_{I} (r_{v_{j}^{'}}) = \emptyset

. Then, there is a flux

v_{i}^{″}; e_{i}^{″}

in W that is the merging of

v_{i}^{'}

and

v_{c}

, and there are fluxes

v_{j}^{'}; e_{j}^{'}

in W for

j \neq i

. Then

v; e

depends on

v_{i}^{″}; e_{i}^{″}

and the

v_{j}^{'}; e_{j}^{'}

.

If both

v_{p}; e_{p}

and

v_{c}; e_{c}

are not dependent, by induction hypothesis, for any

Y \neq X

, in the corresponding flux-path, we have

\tilde{v_{p}} (Y) \leq 1

and

\tilde{v_{c}} (Y) \leq 1

. If

\tilde{v_{p}} (Y) + \tilde{v_{c}} (Y) \leq 1

, then

\tilde{v} (Y) \leq 1

. We also have

\tilde{v} (X) = 1

(indeed, X is the removed species, so

\tilde{v_{p}} (X) = \tilde{v_{c}} (X) = 0

and the X produced by

\tilde{v_{p}}

is merged with the X consumed by

\tilde{v_{c}}

in

\tilde{v}

). Therefore, in this case Point 1 is satisfied. If there is a Y such that

\tilde{v_{p}} (Y) = \tilde{v_{c}} (Y) = 1

, then Y occurs twice in

\tilde{v}

, and there is an intermediate species Z (that can possibly be Y if no other intermediate species occurs more than once between both occurrences of Y) such that

\tilde{v} (Z) = 2

and such that there is a circular flux-path

\tilde{v_{c y c}}

, subpath of

\tilde{v}

that begins and end with Z:

Then using Point 2, there is a corresponding flux

v_{c y c} \in W

, with

C o n s_{I} (r_{v_{c y c}}) = P r o d_{I} (r_{v_{c y c}})

. We can repeat the same operation on the remaining path

\tilde{v_{1}} \tilde{v_{2}}

, and obtain at each step a new (circular) flux. We stop when we obtain a remaining path

\tilde{v_{r e m}}

, such that for any Y, we have

\tilde{v_{r e m}} (Y) \leq 1

. Then there is a corresponding flux

v_{r e m}

with

C o n s_{I} (r_{v_{r e m}}) = C o n s_{I} (r_{v})

, and

P r o d_{I} (r_{v_{r e m}}) = P r o d_{I} (r_{v})

. Then v is dependent on

v_{r e m}

and the set of circular fluxes we obtained in this process. Therefore, in this case Point 3 is satisfied.

Now assume that

v_{p}; e_{p}

and

v_{c}; e_{c}

are dependent, and so that

v; e

is dependent too. By induction and using Point 3,

v_{p}; e_{p}

depends on a flux

v_{p, i}; e_{p, i}

with

C o n s_{I} (r_{v_{p, i}}) = C o n s_{I} (r_{v_{p}})

and

P r o d_{I} (r_{v_{p, i}}) = P r o d_{I} (r_{v_{p}})

, and on other fluxes such that

C o n s_{I} (r_{v_{p, j}}) = C o n s_{I} (r_{v_{p}}) = \emptyset

, and similarly for

v_{c}; e_{c}

. Then any

v_{p, j}; e_{p, j}

and any

v_{c, j}; e_{c, j}

is still in W, while

v_{p, i}; e_{p, i}

is merged with

v_{c, i}; e_{c, i}

, forming a new flux

v_{i}; e_{i}

. Then

v; e

depends on

v_{i}; e_{i}

, the

v_{p, j}; e_{p, j}

and the

v_{c, j}; e_{c, j}

. Since

P r o d_{I} (r_{v_{i}}) = P r o d_{I} (r_{v_{c, i}}) = P r o d_{I} (r_{v_{c}}) = P r o d_{I} (r_{v})

, and similarly

C o n s_{I} (r_{v_{i}}) = C o n s_{I} (r_{v})

, Point 3 is satisfied.

☐

Then Proposition 3 is a direct corollary of Point 3 of Lemma A2.

Appendix C. Proofs of the Full Confluence of the Simplification

We prove Lemmas 9 and 12.

Lemma A3.

Let W be a network such that

W ⇛_{C - DEP} W_{1}

and

W ⇛_{C - INTER} W_{2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

Let X be the intermediate species, and

v_{d}; e_{d}

the dependent reaction, that depends on

v_{1}; e_{1}, \dots, v_{n}; e_{n}

, with coefficients

a_{1}, \dots, a_{n}

.

The main idea is to use the Proposition 3 to prove that if X is in the dependent flux

v_{d}

, it is also in one of the fluxes

v_{i}

whose

v_{d}

depends on. Therefore, if we eliminate X and combine

v_{d}

with another flux

v^{'}

, we also merge

v_{i}

with

v^{'}

. Then

v_{d} ⋄_{X} v^{'}

is still dependent on

v_{i} ⋄_{X} v^{'}

and other fluxes, and can be removed.

Let us first assume that X is not involved in

v_{d}

(that is,

X \notin P r o d_{I} (r_{v_{d}}) \cup C o n s_{I} (r_{v_{d}})

). By Lemma A2, Point 3, X is not involved in the

v_{i}

either. Then

v_{d}

will still be dependent in

W_{2}

, so we can still remove it after removing X. Reciprocally, X can still be removed in

W_{1}

. We now show that the simplification of the fluxes

v_{i}, e_{i}

are the same in

W_{1}^{'}

and

W_{2}^{'}

. The case for the other fluxes is trivial. In

W_{1}^{'}

, we obtain the flux

v_{i}; (e_{i} + a_{i} e_{d}) [X : = X (0)]

. In

W_{2}^{'}

, we obtain

v_{i}; e_{i} [X : = X (0)] + a_{i} e_{d} [X : = X (0)]

. Therefore, these two expressions are similar, and

W_{1}^{'} ≅ W_{2}^{'}

.

Assume that

X \in v_{d}

, for instance

P r o d_{I} (r_{v_{d}}) = {X}

. Then, again by Lemma A2 and Point 3, there is a flux

v_{i}; e_{i}

with

P r o d_{I} (r_{v_{i}}) = {X}

, and for any other

j \neq i

,

P r o d_{I} (r_{v_{j}}) = C o n s_{I} (r_{v_{j}}) = \emptyset

. We denote by

prod

and

cons

the expressions as defined in the rule (C-Inter), by

V_{p r o d}

the fluxes that produce some X, and

V_{c o n s}

the ones that consume it.

In

W_{2}

after removing X, we obtain the fluxes:

the combination of $v_{d}$ and the consuming fluxes: ${v_{d} ⋄ v_{c o n s}; e_{d} e_{c o n s} / cons ∣ v_{c o n s}; e_{c o n s} \in V_{c o n s}}$ ,
the combination of $r_{i}$ and the consuming reactions: ${v_{i} ⋄ v_{c o n s}; e_{i} e_{c o n s} / cons ∣ v_{c o n s}; e_{c o n s} \in V_{c o n s}}$ ,
the other combined fluxes: ${v_{p r o d} ⋄ v_{c o n s}; e_{p r o d} e_{c o n s} / cons ∣ v_{p r o d}; e_{p r o d} \in V_{p r o d}, v_{c o n s}; e_{c o n s} \in V_{c o n s}}$ ,
the remaining fluxes not combined: ${v_{j}; e_{j} [X : = X (0)]}_{j \neq i}$ ,
the other fluxes that are not in $V_{p r o d}$ , $V_{c o n s}$ , $v_{j}$ , where we substitute X by $X (0)$ .

Since

v_{d}

was dependent on

v_{1}, \dots, v_{n}

, we have that any flux

v_{d} ⋄ v_{c o n s}

in the first set is dependent on a flux

v_{i} ⋄ v_{c o n s}

in the second set and the fluxes

v_{j}

. Therefore, we can recursively remove those fluxes with the rule (C-Dep). We obtain the network

W_{2}^{'}

with the fluxes:

${v_{i} ⋄ v_{c o n s}; e_{i} e_{c o n s} / cons + e_{d} e_{c o n s} / cons}$ ,
${v_{p r o d} ⋄ v_{c o n s}; e_{p r o d} e_{c o n s} / cons}$ ,
${v_{j}; e_{j}) [X : = X (0)] + \sum_{\begin{matrix} e_{c o n s} \end{matrix}} a_{i} e_{d} e_{c o n s} / cons}$ ,
the other fluxes that are not in $V_{p r o d}$ , $V_{c o n s}$ , $v_{j}$ , where we substitute X by $X (0)$ .

Now, if we first remove

v_{d}

, in

W_{1}

we obtain the fluxes:

$v_{i}; e_{i} + e_{d}$
${v_{j}; e_{j} + a_{j} e_{d}}_{j \neq i}$
$V_{c o n s} \ {v_{d}, v_{i}}$ ,
$V_{p r o d}$ ,
the other fluxes that are not in $V_{p r o d}$ , $V_{c o n s}$ , $v_{j}$ .

We can still remove X, we obtain the reactions:

${v_{i} ⋄ v_{c o n s}; (e_{i} + e_{d}) e_{c o n s} / cons}$ ,
${v_{p r o d} ⋄ v_{c o n s}; e_{p r o d} e_{c o n s} / cons}$ ,
${v_{j}; (e_{j} + a_{j} e_{d}) [X : = X (0)]}$ ,
the other fluxes that are not in $V_{p r o d}$ , $V_{c o n s}$ , $v_{j}$ , where we substitute X by $X (0)$ .

Note that we have:

\begin{matrix} e_{j} [X : = X (0)] + \sum_{\begin{matrix} e_{c o n s} \end{matrix}} a_{j} e_{d} e_{c o n s} / cons & ≅ & e_{j} [X : = X (0)] + a_{j} e_{d} \\ ≅ & (e_{j} + a_{j} e_{d}) [X : = X (0)] . \end{matrix}

The fluxes of the two simplified networks are therefore similar, that is

W_{1}^{'} ≅ W_{2}^{'}

. The case with

C o n s_{I} (r_{v_{d}}) = {X}

is similar. ☐

Lemma A4.

Let W be a network such that

W ⇛_{C - INTER} W_{i}

for

i \in {1, 2}

. Then

\exists W_{i}^{'}

such that

W_{i} ⇛_{C}^{*} W_{i}^{'}

and

W_{1}^{'} ≅ W_{2}^{'}

.

Proof.

The main idea here is that after removing one intermediate species, we can still remove the other one, either with (C-Mod) or with (C-Inter). In the second case, some dependent fluxes are generated, that we can eliminate to find the same simplified network, whatever the order of elimination of the intermediate species.

Let X and Y be the intermediate species removed to obtain

W_{1}

and

W_{2}

. We can partition the fluxes of W into:

$V_{X} = {v_{X}; e_{X} ∣ X \in P r o d_{I} (r_{v_{X}}), Y \notin v_{X}}$ , the fluxes producing X without Y,
$V_{X^{'}} = {v_{X^{'}}; X e_{X^{'}} ∣ X \in C o n s_{I} (r_{v_{X^{'}}}), Y \notin v_{X^{'}}}$ , the fluxes consuming X without Y,
$V_{m o d (X)} = {v_{m o d (X)}; e_{m o d (X)} ∣ X \notin P r o d_{I} (r_{v_{m o d (X)}}) \cup C o n s_{I} (r_{v_{m o d (X)}}), X \in Vars (e_{m o d (X)}), Y \notin v_{m o d (X)}}$ , the fluxes with modifier X and without Y,
$V_{Y} = {v_{Y}; e_{Y} ∣ Y \in P r o d_{I} (r_{v_{Y}}), X \notin v_{Y}}$ , the fluxes producing Y without X,
$V_{Y^{'}} = {v_{Y}; Y e_{Y^{'}} ∣ Y \in C o n s_{I} (r_{v_{Y}}), X \notin v_{Y}}$ , the fluxes consuming Y without X,
$V_{m o d (Y)} = {v_{m o d (Y)}; e_{m o d (Y)} ∣ Y \notin P r o d_{I} (r_{v_{m o d (Y)}}) \cup C o n s_{I} (r_{v_{m o d (Y)}}), Y \in Vars (e_{m o d (Y)}), X \notin v_{m o d (Y)}}$ , the fluxes with modifier Y and without X,
$V_{X Y^{'}} = {v_{X Y^{'}}; Y e_{X Y^{'}} ∣ X \in P r o d_{I} (r_{v_{X Y^{'}}}), Y \in C o n s_{I} (r_{r_{X Y^{'}}})}$ , the fluxes producing X and consuming Y,
$V_{X^{'} Y} = {v_{X^{'} Y}; X e_{X^{'} Y} ∣ Y \in P r o d_{I} (r_{v_{X^{'} Y}}), X \in C o n s_{I} (r_{v_{X^{'} Y}})}$ , the fluxes producing Y and consuming X,
$V_{m o d (X Y)} = {v_{m o d (X Y)}; e_{m o d (X Y)} ∣ X, Y \notin v_{m o d (X Y)}}$ , the fluxes with modifier X and Y.

We define the following variables:

$T_{X} = \sum_{\begin{matrix} V_{X} \end{matrix}} e_{X}$ $T_{X^{'}} = \sum_{\begin{matrix} V_{X^{'}} \end{matrix}} e_{X^{'}}$
$T_{Y} = \sum_{\begin{matrix} V_{Y} \end{matrix}} e_{Y}$ $T_{Y^{'}} = \sum_{\begin{matrix} V_{Y^{'}} \end{matrix}} e_{Y^{'}}$
$T_{X^{'} Y} = \sum_{\begin{matrix} V_{X^{'} Y} \end{matrix}} e_{X^{'} Y}$ $T_{X Y^{'}} = \sum_{\begin{matrix} V_{X Y^{'}} \end{matrix}} e_{X Y^{'}}$

Let first remove X. We obtain the following combined fluxes:

$V_{X} ⋄ V_{X^{'}} = {v_{X} ⋄ v_{X^{'}}; e_{X} e_{X^{'}} / (T_{X^{'}} + T_{X^{'} Y})}$ ,
$V_{X} ⋄ V_{X^{'} Y} = {v_{X} ⋄ v_{X^{'} Y}; e_{X} e_{X^{'} Y} / (T_{X^{'}} + T_{X^{'} Y})}$ ,
$V_{X Y^{'}} ⋄ V_{X^{'}} = {v_{X Y^{'}} ⋄ v_{X^{'}}; e_{X Y^{'}} e_{X^{'}} Y / (T_{X^{'}} + T_{X^{'} Y})}$ ,
$V_{X Y^{'}} ⋄ V_{X^{'} Y} = {v_{X Y^{'}} ⋄ v_{X^{'} Y}; e_{X Y^{'}} e_{X^{'} Y} Y / (T_{X^{'}} + T_{X^{'} Y})}$ .

The fluxes with X as modifier become (with

X (0) = (T_{X} + Y T_{X Y^{'}}) / (T_{X^{'}} + T_{X^{'} Y})

:

$V_{m o d (X)}^{'} = {v_{m o d (X)}; e_{m o d (X)} [X : = X (0)]}$ ,
$V_{m o d (X Y)}^{'} = {v_{m o d (X Y)}; e_{m o d (X Y)} [X : = X (0)]}$ .

Finally, some fluxes are not modified:

$V_{Y} = {v_{Y}; e_{Y}}$ ,
$V_{Y^{'}} = {v_{Y}; Y e_{Y^{'}}}$ ,
$V_{m o d (Y)} = {v_{m o d (Y)}; e_{m o d (Y)}}$ .

There are now two cases to consider. First, it is possible that Y is now only a modifier in

W_{1}

. This means that

V_{X} = V_{X^{'}} = V_{Y} = V_{Y^{'}} = 0

, that is any flux with X as reactant (resp. product) also admits Y as product (resp. reactant), and reciprocally. We can then apply (C-Mod) on

W_{1}

, and obtain the fluxes (with

X (0) = Y (0) T_{X Y^{'}} / T_{X^{'} Y}

):

$V_{X Y^{'}} ⋄ V_{X^{'} Y} = {v_{X Y^{'}} ⋄ v_{X^{'} Y}; e_{X Y^{'}} e_{X^{'} Y} Y (0) / T_{X^{'} Y}}$ .
$V_{m o d (X)}^{'} = {v_{m o d (X)}; e_{m o d (X)} [X : = X (0)]}$ ,
$V_{m o d (Y)} = {v_{m o d (Y)}; e_{m o d (Y)} [Y : = Y (0)]}$ ,
$V_{m o d (X Y)}^{'} = {v_{m o d (X Y)}; e_{m o d (X Y)} [x_{X} : = X (0)] [Y : = Y (0)]}$ .

Using the constraint

X (0) = Y (0) T_{X Y^{'}} / T_{X^{'} Y}

to rewrite the first kinetic expression into

e_{X Y^{'}} e_{X^{'} Y} X (0) / T_{X Y^{'}}

, we can see by symmetry that we obtain similar fluxes by removing Y first (with (C-Inter)) and then X (with (C-Mod)).

In the other case, we can still remove Y with (C-Inter). We first compute the sum

U_{Y}

(resp.

U_{Y^{'}}

) of the kinetics of the fluxes that produced (resp. consumed) Y:

U_{Y} = \frac{T_{X^{'}} T_{Y} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T_{X^{'}} + T_{X^{'} Y}} U_{Y^{'}} = \frac{T_{X^{'}} T_{Y^{'}} + T_{Y^{'}} T_{X^{'} Y} + T_{X^{'}} T_{X Y^{'}}}{T_{X^{'}} + T_{X^{'} Y}} .

We write

T = T_{X^{'}} T_{Y^{'}} + T_{Y^{'}} T_{X^{'} Y} + T_{X^{'}} T_{X Y^{'}}

. We obtain the following combined fluxes:

$(V_{X} ⋄ V_{X^{'} Y}) ⋄ (V_{X Y^{'}} ⋄ V_{X^{'}}) = {v_{X} ⋄ v_{X^{'} Y} ⋄ v_{X^{'}} ⋄ v_{X Y^{'}}; \frac{e_{X} e_{X^{'}} e_{X Y^{'}} e_{X^{'} Y}}{(T_{X^{'}} + T_{X^{'} Y}) T}}$ ,
$(V_{X} ⋄ V_{X^{'} Y}) ⋄ V_{Y^{'}} = {v_{X} ⋄ v_{X^{'} Y} ⋄ v_{Y^{'}}; \frac{e_{X} e_{Y^{'}} e_{X^{'} Y}}{T}}$ ,
$V_{Y} ⋄ (V_{X Y^{'}} ⋄ V_{X^{'}}) = {v_{Y} ⋄ v_{X Y^{'}} ⋄ v_{X^{'}}; \frac{e_{Y} e_{X Y^{'}} e_{X^{'}}}{T}}$ ,
$V_{Y} ⋄ V_{Y^{'}} = {v_{Y} ⋄ v_{Y^{'}}; \frac{e_{Y} e_{Y^{'}} (T_{X^{'}} + T_{X^{'} Y})}{T}}$ .

The fluxes with Y as modifier become:

${(V_{X Y^{'}} ⋄ V_{X^{'} Y})}^{'} = {v_{X Y^{'}} ⋄ v_{X^{'} Y}; \frac{T_{X^{'}} T_{Y} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T (T_{X^{'}} + T_{X^{'} Y})} e_{X Y^{'}} e_{X^{'} Y}}$ ,
$V_{m o d (X)}^{″} = {v_{m o d (X)}; e_{m o d (X)} [X : = \frac{T_{X} T_{Y^{'}} + T_{X} T_{X Y^{'}} + T_{Y} T_{X Y^{'}}}{T}]}$ ,
$V_{m o d (X Y)}^{″} = {v_{m o d (X Y)}; e_{m o d (X Y)} [X : = \frac{T_{X} T_{Y^{'}} + T_{X} T_{X Y^{'}} + T_{Y} T_{X Y^{'}}}{T}] [Y : = \frac{T_{Y} T_{X^{'}} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T}]}$ ,
$V_{m o d (Y)}^{'} = {v_{m o d (Y)}; e_{m o d (Y)} [x_{Y} : = \frac{T_{Y} T_{X^{'}} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T}]}$ .

Finally, some fluxes do not involve Y:

$R_{X} ⋄ R_{X^{'}} = {v e c (r_{X}) + v e c (r_{X^{'}}); e_{X} e_{X^{'}} / (T_{X^{'}} + T_{X^{'} Y})}$

Now we can observe that we obtained some dependent fluxes. Any flux in

(V_{X} ⋄ V_{X^{'} Y}) ⋄ (V_{X Y^{'}} ⋄ V_{X^{'}})

is the combination of a flux from

{(V_{X Y^{'}} ⋄ V_{X^{'} Y})}^{'}

and

V_{X} ⋄ V_{X^{'}}

. We can then remove the first flux, while modifying the kinetics of the others. Finally, after simplifying the kinetic expressions by similarity, the simplified network has the following fluxes:

$V_{X} ⋄ V_{X^{'}} = {v_{X} ⋄ v_{X^{'}}; \frac{e_{X} e_{X^{'}} (T_{Y^{'}} + T_{X Y^{'}})}{T}}$ ,
$V_{Y} ⋄ V_{Y^{'}} = {v_{Y} ⋄ v_{Y^{'}}; \frac{e_{Y} e_{Y^{'}} (T_{X^{'}} + T_{X^{'} Y})}{T}}$ ,
$(V_{X} ⋄ V_{X^{'} Y}) ⋄ V_{Y^{'}} = {v_{X} ⋄ v_{X^{'} Y} ⋄ v_{Y^{'}}; \frac{e_{X} e_{Y^{'}} e_{X^{'} Y}}{T}}$ ,
$V_{Y} ⋄ (V_{X Y^{'}} ⋄ V_{X^{'}}) = {v_{Y} ⋄ v_{X Y^{'}} ⋄ v_{X^{'}}; \frac{e_{Y} e_{X Y^{'}} e_{X^{'}}}{T}}$ ,
$V_{m o d (X)}^{″} = {v_{m o d (X)}; e_{m o d (X)} [x_{X} : = \frac{T_{X} T_{Y^{'}} + T_{X} T_{X Y^{'}} + T_{Y} T_{X Y^{'}}}{T}]}$ ,
$V_{m o d (Y)}^{'} = {v_{m o d (Y)}; e_{m o d (Y)} [x_{Y} : = \frac{T_{Y} T_{X^{'}} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T}]}$ ,
${(V_{X Y^{'}} ⋄ V_{X^{'} Y})}^{'} = {v_{X Y^{'}} ⋄ v_{X^{'} Y}; \frac{(T_{Y} + T_{X}) e_{X Y^{'}} e_{X^{'} Y}}{T}}$ ,
$V_{m o d (X Y)}^{″} = {v_{m o d (X Y)}; e_{m o d (X Y)} [x_{X} : = \frac{T_{X} T_{Y^{'}} + T_{X} T_{X Y^{'}} + T_{Y} T_{X Y^{'}}}{T}] [x_{Y} : = \frac{T_{Y} T_{X^{'}} + T_{Y} T_{X^{'} Y} + T_{X} T_{X^{'} Y}}{T}]}$ ,

We can observe that:

the 2 first sets are symmetric to each other, in the sense that if we switch X and Y in the first set, we obtain the second one,
the 2 following sets are symmetric to each other,
the 2 following sets are symmetric to each other too,
the following set is symmetric in X and Y,
the last set is symmetric in X and Y (since the substitutions commute).

Therefore, by symmetry, we can obtain exactly the same network if we first remove Y, then remove X and the dependent fluxes. We conclude that

W_{1}^{'} ≅ W_{2}^{'}

.

☐

References

Feinberg, M. Chemical reaction network structure and the stability of complex isothermal reactors—I. The deficiency zero and deficiency one theorems. Chem. Eng. Sci. 1987, 42, 2229–2268. [Google Scholar] [CrossRef]
Hucka, M.; Finney, A.; Sauro, H.M.; Bolouri, H.; Doyle, J.C.; Kitano, H.; Arkin, A.P.; Bornstein, B.J.; Bray, D.; Cornish-Bowden, A.; et al. The systems biology markup language (SBML): A medium for representation and exchange of biochemical network models. Bioinformatics 2003, 19, 524–531. [Google Scholar] [CrossRef] [PubMed]
Calzone, L.; Fages, F.; Soliman, S. BIOCHAM: An environment for modeling biological systems and formalizing experimental knowledge. Bioinformatics 2006, 22, 1805–1807. [Google Scholar] [CrossRef] [PubMed]
Kuttler, C.; Lhoussaine, C.; Nebut, M. Rule-based modeling of transcriptional attenuation at the tryptophan operon. In Transactions on Computational Systems Biology XII; Springer: New York, NY, USA, 2010; pp. 199–228. [Google Scholar]
Chaouiya, C. Petri net modelling of biological networks. Brief. Bioinform. 2007, 8, 210–219. [Google Scholar] [CrossRef] [PubMed]
Juty, N.; Ali, R.; Glont, M.; Keating, S.; Rodriguez, N.; Swat, M.J.; Wimalaratne, S.M.; Hermjakob, H.; Le Novère, N.; Laibe, C.; et al. BioModels: Content, Features, Functionality and Use. CPT Pharmacomet. Syst. Pharmacol. 2015, 4, 55–68. [Google Scholar] [CrossRef] [PubMed]
Mäder, U.; Schmeisky, A.G.; Flórez, L.A.; Stülke, J. SubtiWiki—A comprehensive community resource for the model organism Bacillus subtilis. Nucleic Acids Res. 2012, 40, 1278–1287. [Google Scholar] [CrossRef] [PubMed]
Niehren, J.; John, M.; Versari, C.; Coutte, F.; Jacques, P. Qualitative Reasoning about Reaction Networks with Partial Kinetic Information. In Computational Methods for Systems Biology; Lecture Notes in Computer Science; Springer: New York, NY, USA, 2015; Volume 9308, pp. 157–169. [Google Scholar]
Radulescu, O.; Gorban, A.N.; Zinovyev, A.; Noel, V. Reduction of dynamical biochemical reactions networks in computational biology. Front. Genet. 2012, 3, 131. [Google Scholar] [CrossRef] [PubMed]
Michaelis, L.; Menten, M.L. Die kinetik der invertinwirkung. Biochem. Z. 1913, 49, 333–369. (In German) [Google Scholar]
Segel, L.A. On the validity of the steady state assumption of enzyme kinetics. Bull. Math. Biol. 1988, 50, 579–593. [Google Scholar] [CrossRef] [PubMed]
Cornish-Bowden, A. Fundamentals of Enzyme Kinetics; Wiley: Hoboken, NJ, USA, 2013. [Google Scholar]
Heineken, F.; Tsuchiya, H.; Aris, R. On the mathematical status of the pseudo-steady state hypothesis of biochemical kinetics. Math. Biosci. 1967, 1, 95–113. [Google Scholar] [CrossRef]
Segel, L.A.; Slemrod, M. The quasi-steady-state assumption: A case study in perturbation. SIAM Rev. 1989, 31, 446–477. [Google Scholar] [CrossRef]
King, E.L.; Altman, C. A schematic method of deriving the rate laws for enzyme-catalyzed reactions. J. Phys. Chem. 1956, 60, 1375–1378. [Google Scholar] [CrossRef]
Chou, K.-C.; Forsen, S. Graphical rules of steady-state reaction systems. Can. J. Chem. 1981, 59, 737–755. [Google Scholar]
Fages, F.; Gay, S.; Soliman, S. Inferring reaction systems from ordinary differential equations. Theor. Comput. Sci. 2015, 599, 64–78. [Google Scholar] [CrossRef]
Sáez, M.; Wiuf, C.; Feliu, E. Graphical reduction of reaction networks by linear elimination of species. J. Math. Biol. 2017, 74, 195–237. [Google Scholar] [CrossRef] [PubMed]
Madelaine, G.; Lhoussaine, C.; Niehren, J. Attractor Equivalence: An Observational Semantics for Reaction Networks. In Formal Methods in Macro-Biology; Springer: New York, NY, USA, 2014. [Google Scholar]
Schmidt-Schauss, M.; Sabel, D.; Niehren, J.; Schwinghammer, J. Observational Program Calculi and the Correctness of Translations. J. Theor. Comput. Sci. 2015, 577, 98–124. [Google Scholar] [CrossRef]
Gagneur, J.; Klamt, S. Computation of elementary modes: A unifying framework and the new binary approach. BMC Bioinform. 2004, 5, 175. [Google Scholar] [CrossRef] [PubMed]
Madelaine, G.; Lhoussaine, C.; Niehren, J.; Tonello, E. Structural simplification of chemical reaction networks in partial steady states. Biosystems 2016, 149, 34–49. [Google Scholar] [CrossRef] [PubMed]
Schmierer, B.; Tournier, A.L.; Bates, P.A.; Hill, C.S. Mathematical modeling identifies Smad nucleocytoplasmic shuttling as a dynamic signal-interpreting system. Proc. Natl. Acad. Sci. USA 2008, 105, 6608–6613. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Evolution of the concentration of

S

,

E

,

C

and

P

in enzymatic network with mass-action kinetics with the parameters

k_{1} = k_{2} = k_{3} = 1

, the initial concentrations

E (0) = 1

,

C (0) = 2

,

S (0) = 4

,

P (0) = 0

, and in the context of the network with a reaction

\emptyset \overset{k_{4}}{\to} S

which produces S with constant speed

k_{4} = 2

, and a reaction

P \overset{k_{5} P}{\to} \emptyset

which degrades P with parameter

k_{5} = 0.2

.

Figure 2. Confluence, local confluence, and commutation.

Figure 3. Simulation diagrams.

Figure 4. A reaction network and the associated graph and stoichiometry matrix.

Figure 5. The elementary modes of the reaction network in Figure 4.

Figure 6. Simplification of reaction networks without kinetics with respect to a set

I

of intermediate species.

Figure 7. Elimination of intermediates X and Y in reaction network

N

in both possible orders, leading to two different final results

N_{X Y}

and

N_{Y X}

.

Figure 8. Dependency elimination is not confluent.

Figure 9. Simplifying flux networks for an initial n-tuple of reactions

r

and a set of intermediate species

I

.

Figure 10. Elimination of intermediate species from flux networks in different orders is not confluent without factorization.

Figure 11. Expressions where

A \in Spec, k \in Param, c \in R, and n \in N

.

Figure 12. Constraints on kinetic functions.

Figure 13. Systems of constrained equations with Odes.

Figure 14. System of constrained equations of a constrained flux network

V & C

.

Figure 15. System of constrained equations for Michaelis-Menten.

Figure 16. Simplification rules for n-ary constrained flux networks, with

I

the set of intermediate species and

r

the n-tuple of initial reactions.

Figure 17. Reaction networks for the Michaelis-Menten example.

M M n e t_{E}

and

M M n e t_{C}

are obtained from the initial network

M M n e t

after removing E and C respectively.

M M n e t_{C E}

is obtained after removing both C and then E in this order.

M M n e t_{E C}

is obtained by inverting the order of elimination.

Figure 18. Example illustrating the need of Condition 2.

Figure 19. Example illustrating the need of Condition 3.

Figure 20. Example illustrating the need of Condition 4.

Figure 21. Example illustrating the need of Condition 4.

Figure 22. Network W and its simplifications. (top left) Network W. (top right) Network

W_{X Y Z}

after eliminating X, Y and Z (in this order). (bottom left) Network

W_{X Z Y}

after eliminating X, Z and Y. (bottom right) Network

W_{X Y Z d}

after eliminating X, Y, Z, and the dependent reaction. The new parameter is

K = k_{2} k_{3} + k_{3} k_{4} + k_{4} k_{5}

.

Figure 23. Two decompositions of the mode

(1, 1, 1, 1, 1, 1)

in the network W.

Figure 24. Sub-network W of the

Smad

-based signal transduction model from [23].

Figure 25. Simplified networks from W. Both networks have the same structure, and the kinetic expressions are defined in the table. The network

W_{1}

is obtained by removing in order

S 4_{n}

,

S 24_{n}

,

S 24_{c}

,

S 4_{c}

and the dependent fluxes. The network

W_{2}

is obtained by removing in order

S 4_{n}

,

S 24_{n}

,

S 4_{c}

,

S 24_{c}

and the dependent fluxes.

Figure 26. In red the elementary mode

v_{r e d}

, in blue

v_{b l u e}

, in green

v_{g r e e n}

, and in magenta

v_{m a g e n t a}

.

Figure 27. Simplification rules for systems of equations.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

Simpliﬁcation of Reaction Networks, Conﬂuence and Elementary Modes

Abstract

1. Introduction

Outline

2. Preliminaries

2.1. Confluence Notions

2.2. Multisets

2.3. Commutative Semigroups

3. Reaction Networks without Kinetics

3.1. Stoichiometry Matrices

3.2. Elementary Modes

3.3. Elementary Flux Modes

4. Simplifying Reaction Networks without Kinetics

4.1. Intermediate Elimination

4.2. Eliminating Dependent Reactions

5. Simplifying Flux Networks

5.1. Vector Representations of Reaction Networks

5.2. Simplification Rules

5.3. Factorization

5.4. Proving Confluence via Elementary Modes

6. Reaction Networks with Deterministic Semantics

6.1. Kinetic Expressions

6.2. Constrained Flux Networks

6.3. Systems of Constrained Equations with ODEs

6.4. Deterministic Semantics

6.5. Contextual Equivalence

7. Simplification of Constrained Flux Networks

7.1. Linear Steadiness of Intermediate Species

7.2. Simplification

7.3. Michaelis-Menten

8. Preservation of Linear Steadiness

8.1. LinNets

8.2. Stability of LinNets

9. Confluence of the Simplification Relation

9.1. Structural Confluence

9.2. Non-Confluence of the Kinetic Rates

9.3. Criterion for the Full Confluence

10. An Example from the BioModels Database

11. Simplification of Systems of Equations

11.1. Simplification of Systems of Equations

11.2. Simulation

12. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendix A. Soundness of the Simplification Rules for Constrained Flux Networks

Appendix B. Proofs for the Stability of LinNets

Appendix C. Proofs of the Full Confluence of the Simplification

References

Article Metrics

Citations

Article Access Statistics

8.1. $LinNets$

8.2. Stability of $LinNets$