Abstract
Metabolic pathway analysis is a key method to study a metabolism in its steady state, and the concept of elementary fluxes (EFs) plays a major role in the analysis of a network in terms of non-decomposable pathways. The supports of the EFs contain in particular those of the elementary flux modes (EFMs), which are the support-minimal pathways, and EFs coincide with EFMs when the only flux constraints are given by the irreversibility of certain reactions. Practical use of both EFMs and EFs has been hampered by the combinatorial explosion of their number in large, genome-scale systems. The EFs give the possible pathways in a steady state but the real pathways are limited by biological constraints, such as thermodynamic or, more generally, kinetic constraints and regulatory constraints from the genetic network. We provide results on the mathematical structure and geometrical characterization of the solution space in the presence of such biological constraints (which is no longer a convex polyhedral cone or a convex polyhedron) and revisit the concept of EFMs and EFs in this framework. We show that most of the results depend only on very general properties of compatibility of constraints with vector signs: either sign-invariance, satisfied by regulatory constraints, or sign-monotonicity (a stronger property), satisfied by thermodynamic and kinetic constraints. We show in particular that the solution space for sign-monotone constraints is a union of particular faces of the original polyhedral cone or polyhedron and that EFs still coincide with EFMs and are just those of the original EFs that satisfy the constraint, and we show how to integrate their computation efficiently in the double description method, the most widely used method in the tools dedicated to EFs computation. We show that, for sign-invariant constraints, the situation is more complex: the solution space is a disjoint union of particular semi-open faces (i.e., without some of their own faces of lesser dimension) of the original polyhedral cone or polyhedron and, if EFs are still those of the original EFs that satisfy the constraint, their computation cannot be incrementally integrated into the double description method, and the result is not true for EFMs, that are in general strictly more numerous than those of the original EFMs that satisfy the constraint.
Keywords:
metabolic pathway analysis; steady state; flux cone; flux polyhedron; elementary flux modes; elementary flux vectors and points; extreme vectors and points; double description method; thermodynamic constraints; kinetic constraints; regulatory constraints; sign-monotonicity; sign-invariance; support-minimality; conformal non-decomposability; conformal support-wise non-strict-decomposability 1. Introduction
1.1. Metabolic Networks
In order to ensure this paper is self-contained and has no prerequisite to be read, we summarize in this introduction the state-of-the-art related to the subject and fix the notations adopted throughout the paper. The results quoted being known, are thus given without proof and the reader is invited to refer to the works in [1,2,3,4,5,6,7], in addition to the references in the text, for the proofs or surveys.
A metabolic network is made up of a set of r biochemical enzymatic reactions. Each reaction consumes certain metabolites (called substrates of the reaction) and produces other metabolites (called products of the reaction). Each metabolite is assigned a coefficient in the reaction, its stoichiometric coefficient (counted negatively for substrates and positively for products). We distinguish internal from external metabolites w.r.t. the system under study (e.g., a bacterium, an eukaryotic cell), the reactions involving both internal and external metabolites being transfer reactions. If m is the number of internal metabolites (), the network is thus given by its stoichiometric matrix , where coefficient is the stoichiometric coefficient of internal metabolite j in reaction i (positive if j is a product of reaction i and negative if it is a substrate). A state of the network at a given time t is given by the net rates (or fluxes) in each of its reactions at t, i.e., by a flux vector (or rate vector, or flux distribution) . Denoting by the vector of the concentrations of internal metabolites at t, the time evolution of the network is thus given by
1.2. Steady-State Behavior and Flux Subspace
We are interested in the steady-state behavior of the network. The steady-state assumption means that the concentrations of internal metabolites remain constant along time (no accumulation or reduction of internal metabolites inside the system, an approximation which is valid for short time periods, e.g., a few minutes) and leads thus to the fundamental equation
With only this assumption, the solution space , i.e., the space of all admissible flux vectors , is thus the linear subspace of given by the kernel, or nullspace [8], of :
The dimension of the flux subspace is given by . Often, possibly after a preprocessing to eliminate its linearly dependent rows, is assumed to be of full rank and then .
The support of a vector is defined by
The support of a flux vector has thus an important biological signification as it represents the reactions involved in the subnetwork (that we shall call pathway) defined by (i.e., those through which the flux given by is not null).
1.3. Irreversible Reactions, Flux Cones and Polyhedral Cones
In addition to the homogeneous equality constraints provided by the steady-state assumption, the flux vectors, in general, must also satisfy homogeneous inequality constraints corresponding to reactions known as irreversible, whose set will be noted :
This means that fluxes in irreversible reactions are constrained to be non-negative (in reversible reactions, fluxes may be either positive, or negative or null and the direction fixed as positive is arbitrary, the role between substrates and products being able to switch). If , with , is the number of irreversible reactions, the solution space is the intersection of the linear subspace with non-negative half-spaces, it is thus a particular case of a convex polyhedral cone, called s-cone (subspace cone or special cone) or flux cone, noted :
A (general) convex polyhedral cone is defined implicitly (or by intension) by finitely many homogeneous linear inequalities:
with a suitable matrix (called a representation matrix of C) and is thus the intersection of n half-spaces whose frontiers contain the origin. The dimension of C, noted , is defined as the dimension of its affine span. A flux cone corresponds thus to a particular matrix given by
where is the extension of the identity matrix by columns of zeros corresponding to reversible reactions. This means that, for a flux cone, the homogeneous linear inequalities are of a special type: part of these (m) are actually equalities defining a lower-dimensional subspace given by the nullspace of the stoichiometric matrix , the others () being non-negativity constraints regarding some single coordinate variables (given by the irreversible reactions) corresponding thus to particular half-spaces defined by such positive coordinate axes.
Conversely, to any convex polyhedral cone C in defined by a representation matrix , we can associate a flux cone of the same dimension in defined by
Using this correspondence (which defines a bijection of C onto ), several properties, proven for flux cones, can actually be lifted to general convex polyhedral cones.
From the definition of a cone, for every nonzero element of C, the whole half-line is contained in C. This is called a ray of C. Thus a flux vector is defined up to a positive scalar multiplication. The lineality space of C is the union of all lines of C, i.e., . If C is defined by a representation matrix , its lineality space thus equals the nullspace of , i.e., . For a flux cone given by Equation (6), its lineality space is thus constituted by the flux vectors such that , i.e., flux vectors involving only reversible reactions (and thus the global flux can go in either one direction or the other). The cone C is called pointed if it does not contain a line, i.e., if its lineality space is reduced to . For example, if C is contained in a closed orthant, it is pointed (where the closed orthants are defined by for an operator vector ). In particular, a flux cone with only irreversible reactions () is necessarily pointed as it is included in the positive r-orthant (i.e., of dimension r). Actually, reversible flux vectors very rarely occur in metabolic networks, which therefore often give rise to pointed flux cones.
1.4. Extreme Vectors and Generating Sets
We are interested in finding an explicit (by extension) representation of C in the form of a (minimal) set of generators. A nonzero vector is called extreme (or extreme pathway [9,10] if is a flux vector), if
If is extreme, then is called an extreme ray of C as all its nonzero elements are extreme (and thus for simplifying notations, we will not distinguish extreme vectors and extreme rays when it does not create confusion). In fact, C has an extreme ray if and only if C is pointed and, in this case, the extreme rays are the edges (faces of dimension one) of C and, according to Minkowski’s theorem, constitute the unique minimal (finite) set of generators of C for conical (i.e., non-negative linear) combination:
where the ’s, (finite index set), are representatives of the extreme rays (unique up to positive scalar multiplication) and is the conical hull. More precisely, we get an upper bound for the number of extreme vectors that are sufficient to decompose any given nonzero vector : with . This result can be demonstrated first for a flux vector of a pointed flux cone with an upper bound given by and then for a vector of a general pointed polyhedral cone C by using the correspondence (9) between C and which maps the extreme vectors of C onto the extreme vectors of . Extreme vectors of a pointed cone C will be noted ExVs.
The Double Description (DD) method [11,12], known as Fourier–Motzkin, is an incremental algorithm (which processes one by one each linear inequality ) to build an explicit description of a pointed cone C, as a minimal generating matrix (whose columns are in 1-to-1 correspondence with the extreme rays), from an implicit description of C by a representation matrix, i.e., to enumerate its extreme rays.
If C is not pointed, it is still finitely generated:
with (not unique this time) minimal set of generators consisting of basis vectors of the lineality space and suitable vectors not in the lineality space (e.g., the extreme vectors of the pointed cone obtained by intersecting C with the orthogonal complement of its lineality space). Actually, Minkowski–Weyl theorem for cones states that it is equivalent for a set C to being a polyhedral cone (7) or to being a finitely generated cone, i.e., the conical hull of a finite set of vectors (as the ’s, ’s and ’s).
1.5. Elementary Vectors and Conformal Generating Sets
Now, if the existence and uniqueness of a minimal set of generators for conical decomposition in a pointed polyhedral cone C is satisfactory for an explicit geometric description of C, it is not in general meaningful for a flux cone representing the steady-state flux vectors of a metabolic network. In fact, for a metabolic pathway, only a conical decomposition without any cancellations is biochemically meaningful, as a reversible reaction cannot have a net rate in opposite directions in the contributing pathways. Indeed, the second law of thermodynamics states that a reaction can only carry flux in the direction of negative Gibbs free energy of the reaction, which is imposed by the values of the concentrations of the metabolites. This means that, when decomposing a flux vector, only so-called conformal sums, i.e., sums without cancellations, are biochemically admissible. A sum of vectors is called conformal if, for all :
An equivalent definition is to define a sum as conformal if
where the sign vector is defined by applying the sign function component-wise, i.e., , and the partial order ≤ on is defined by applying component-wise the partial order on induced by and . For example, there is a one-to-one mapping between closed orthants O and sign vectors , defined by (O will be called defined by and noted ), which induces a one-to-one mapping between closed r-orthants and full support (i.e., with only nonzero components) sign vectors. For sign vectors and a vector, we say that conforms to if and that conforms to if . We call two vectors conformal if for a certain sign vector or, equivalently, if for all i (so, is a conformal sum if and only if and are conformal). A conformal sum , i.e., verifying Equation (14), will be noted . It is therefore natural to look for generators as conformally non-decomposable vectors, where a nonzero vector of a convex polyhedral cone C is called conformally non-decomposable, if
A vector (resp., flux vector ) of a convex polyhedral cone C (resp., a flux cone ) is called elementary [7] if it is conformally non-decomposable. All nonzero elements of the ray defined by an elementary vector are elementary, i.e., elementary vectors are unique up to positive scalar multiplication (and thus we will often not distinguish elementary vectors and elementary rays). Elementary vectors (resp., flux vectors) will be noted EVs (resp., EFVs).
The elementary rays constitute the unique minimal (finite) set of conformal generators of C, i.e., generators for conformal conical sum:
where the ’s, (finite index set), are representatives of the elementary rays (unique up to positive scalar multiplication) and is the conical conformal hull. More precisely, we get an upper bound for the number of elementary vectors that are sufficient to decompose any given nonzero vector : with . This result can be demonstrated first for a flux vector of a flux cone with an upper bound given by and then for a vector of a general polyhedral cone C by using the correspondence (9) between C and which maps the elementary vectors of C onto the elementary flux vectors of .
It follows from (10) and (15) that an extreme vector of C is elementary, i.e., ExVs ⊆ EVs, but the converse is generally false as a conformally non-decomposable vector may be conically decomposable. Nevertheless, if C is contained in a closed orthant, there is identity between extreme vectors and elementary vectors, i.e., ExVs = EVs. More precisely, for nonzero and O a closed orthant with , then is elementary in C if and only if it is extreme in . It results that the elementary vectors of C are the extreme vectors of intersections of C with any closed orthant:
Elementary vectors of C can thus be obtained by using algorithms, such as DD, to compute extreme vectors ExVs of the pointed polyhedral cones . By doing this, it is convenient to select in Equation (17) only a minimal subset of closed orthants O in order to avoid equality or inclusion between the ’s (nonempty intersection can obviously not be avoided as orthants are closed). It is clearly enough to consider only the closed r-orthants of maximal dimension r, but this does not avoid equality or inclusion. Let be the maximal (for the partial order defined above on ) sign vectors of . It is then enough in Equation (17) to consider the closed orthants , and there is no equality or inclusion between the ’s, where . The ’s are called topes, noted Ts (flux topes [13] noted FTs for a flux cone ). C is thus decomposed into topes and Equation (17) can be rewritten as
Note that, for a flux cone Equation (6), a FT is defined by specifying a maximal subset of reactions with fixed directions (thus fixing the directions of reversible reactions), the others having necessarily a zero flux. This simplifies if is consistent, i.e., without unused reaction, which means that every reaction, in every possible direction for reversible reactions, is supported by a flux vector: and . We can always assume consistent after a preprocessing step (practically, this can be achieved by using flux variability analysis [14]) that removes all reactions that cannot carry nonzero steady-state flux and changes all reversible reactions that cannot carry flux in both directions into irreversible ones. In this case, every remaining reaction in every possible direction is supported by a flux vector with full support (i.e., with nonzero flux in any reaction) and all FTs have full support, i.e., the ’s have full support or, equivalently, the ’s are r-orthants. An obvious upper bound for the number of FTs is thus .
Now, for a flux cone , another commonly used method is, at the extreme opposite, to have it included into a single (positive) orthant in a higher dimension by splitting each reversible reaction i into a forward and a backward irreversible reaction. This means decomposing a flux in i as with and . Columns of the stoichiometric matrix corresponding to reversible reactions i are negated (which means exchanging the roles of substrates and products in i) and appended to as new columns to form the new stoichiometric matrix , where is the new number of reactions and all reactions are now irreversible, . is in one-to-one correspondence with vectors of such that for i reversible. In particular, the fluxes of the form with all other components being null are obtained as extreme vectors of but represent futile cycles (involving reactions and ) without biological reality and must be eliminated. Finally, EFVs () are in one-to-one correspondence with ExVs () ∖{futile cycles} (called at the origin extreme currents in stoichiometric network analysis [15]). is included in the positive -orthant and has thus only one FT.
We therefore have two opposite ways of dealing with reversible reactions for computing EFVs of a flux cone : either splitting each reversible reaction into two irreversible ones, such that is reduced to a single FT at the price of an increase in the space dimension by (which can cause serious efficiency problems to algorithms such as DD) or keeping the reversible reactions unchanged and decomposing into FTs, in each of which the directions of reversible reactions are fixed, at the price of a potentially exponential (in terms of ) number of FTs to consider. All intermediate cases, where only a subset of reversible reactions are split into irreversible ones and the others are processed by decomposition into FTs, are obviously possible. Independently of the solution adopted, we will work most of the time in a given FT for , defined by a (full support if is consistent) sign vector , and the EFVs of which conform to are thus given by the ExVs of this FT .
1.6. Elementary Modes
The null value 0 plays a component-wise crucial role in definitions of the support of a vector (4), of a flux cone (6) and of a conformal sum Equations (13) and (14). A close relationship results between support-minimal vectors and elementary vectors in a flux cone. A nonzero vector is called support-minimal, if
A nonzero vector (resp., flux vector ) of a convex polyhedral cone C (resp., a flux cone ) is called elementary mode (resp., elementary flux mode) if it is support-minimal (the concept of elementary flux mode was first introduced, under the name of elementary vector [16], for a subspace of , i.e., for a flux linear subspace Equation (3) and then [17] for a flux cone with a definition actually closer to that of a support-wise non-decomposable vector). All nonzero elements of the ray defined by an elementary mode are elementary modes having the same support, i.e., elementary modes are unique up to positive scalar multiplication (and we will therefore in general identify two positively proportional elementary modes). Elementary modes (resp., flux modes) will be noted EMs (resp., EFMs). The concept of EFM in a flux cone is biologically significant as it represents a minimal pathway operating in a steady state, i.e., with all reactions involved necessarily active (with a nonzero net rate), which means that no proper sub-pathway can operate in a steady state.
Note that if is an EFM, then is a minimal (for the partial order defined above on ) nonzero element of and, conversely, it is shown that a minimal nonzero sign vector determines an EFM with by . There is thus a one-to-one mapping between EFMs and minimal nonzero sign vectors of . Comparing with the one-to-one mapping between FTs and maximal sign vectors of , we see that EFMs and FTs are dual concepts.
Now, for a flux cone , support-minimality and conformal non-decomposability are equivalent properties, i.e., there is identity between elementary flux modes and elementary flux vectors: EFMs = EFVs. For metabolic pathways in a flux cone , there is therefore identity between minimal (for support inclusion) pathways and non-decomposable (for conformal sum) pathways. From Equation (16), the EFMs constitute a conformal generating set (i.e., generating set for conformal sum) for , and in fact the unique minimal such set (for that matter one way of proving Equation (16) for flux cones is to prove it with EFMs as a conformal generating set and to prove that EFMs = EFVs). From Equation (18), for any maximal sign vector of , the EFMs of which conform to are the EFMs of the FT and coincide with the ExVs of the said FT, this result being the basis of methods for computing EFMs [18,19]. EFMs are thus decomposed into subsets according to the decomposition of into flux topes [13]: the EFMs of the FT correspond to the ’s, where the ’s are the minimal nonzero sign vectors of such that .
For a general polyhedral cone C, there is no direct relationship between elementary modes and elementary vectors. Nevertheless, from Equation (16), it follows that, for any EM in C, there is an EV with the same support. Thus, all minimal support patterns of vectors appear in the set of supports of elementary vectors and are actually the minimal elements in this set for subset inclusion:
In addition, for a general polyhedral cone C, the correspondence (9) between C and the higher dimensional flux cone maps the elementary vectors of C onto the elementary flux vectors of , i.e., the elementary flux modes of :
with given by Equation (9).
Moreover, it remains true that any vector which is support-minimal in a given is actually support minimal in C, as it depends only on the convexity of C: if and are vectors in a convex set with , then a vector exists in this convex set with and (we take with minimal in such that for a certain i with ). Thus, EMs of C can be computed tope by tope:
See Figure 1 for an illustration of the concepts of extreme vectors, elementary vectors and elementary modes in a convex polyhedral cone.
Figure 1.
On the left, we consider the convex polyhedral cone shaded in gray. The extreme vectors are the blue vectors. The elementary vectors are all the blue or red vectors and coincide with the extreme vectors for all topes (here we have two topes: the subcone with and the subcone with ). The elementary mode is the red vector (common to both topes). Flux cones being defined by linear inequalities of the form , elementary modes coincide with elementary vectors in this case (at right).
1.7. Inhomogeneous Linear Constraints and Polyhedra
Additionally, in this standard setting, fluxes may be constrained by other constraints, typically lower and upper bounds regarding reaction rates:
or, more generally, any set of inhomogeneous linear constraints, noted , that can be written in the general form
where is a matrix and a vector with nonzero components, defining a general inhomogeneous convex polyhedron . The solution space is thus now a s-polyhedron or flux polyhedron noted and defined by
This is a particular case of (general) convex polyhedron that is defined implicitly by finitely many linear inequalities:
with a suitable matrix and vector and is thus the intersection of n (affine) half-spaces ( is equipped with both its structure of affine space, with origin 0, and its underlying structure of vector space and we will identify a point in the affine space with the corresponding vector in the vector space). Its dimension is defined as the dimension of its affine span. In this way, a flux polyhedron corresponds to a particular matrix and vector given by
meaning that the inequalities that are homogeneous actually divide into m equalities defining a lower-dimensional subspace given by the nullspace of the stoichiometric matrix and into non-negativity constraints regarding certain single coordinate variables (given by the irreversible reactions). A polyhedral cone (resp., flux cone) is a special case of polyhedron (resp., flux polyhedron) where (resp., ).
To any nonempty polyhedron P given by Equation (26) is associated its so-called recession cone , which is the polyhedral cone containing all unbounded directions (rays) of P (if P is a polyhedral cone, then ). A bounded polyhedron is called a polytope and thus P is a polytope if and only if its recession cone is trivial: . P is called pointed if its recession cone is pointed, i.e., if its lineality space , also called the lineality space of P (as it contains all unbounded lines of P), is trivial. For a flux polyhedron , we have: (thus can be a polytope without being so and be pointed without either or being so). Note that is not in general a flux cone.
1.7.1. Extreme Points and Vectors and Generating Sets
A vector is called an extreme point, if it cannot be written as a convex combination of two distinct vectors of P:
Note that, as P is convex, it is enough to consider the midpoint of and , i.e., to take . Extreme points coincide with vertices of P, where a vertex of P is defined as a face of dimension 0. P is pointed if and only if it has a vertex and in this case, according to Minkowski’s theorem, the vertices of P and the extreme rays of constitute the unique minimal (finite) set of “bounded” and “unbounded” generators of P for convex and conical combination, respectively:
where the ’s, (finite index set), are the extreme points (vertices) of P, noted ExPs, and the ’s, (finite index set), are the extreme vectors of (unique up to positive scalar multiplication), noted ExVs, and is the convex hull (if P is a pointed polyhedral cone, then it has only one vertex, which is the zero vector, and the Equation (29) reduces to Equation (11)). More precisely, we get an upper bound for the number of extreme points and vectors that are sufficient to decompose any given vector : with . This result can be deduced from result (11) for a pointed flux cone by using the following correspondence between P and such a flux cone.
In fact, to any convex polyhedron P in defined by a matrix and vector Equation (26), we can associate a flux cone in a higher dimension defined by
This introduces a correspondence between vectors of P and vectors of , which maps vertices of P onto extreme vectors of with component , and between vectors of and vectors of , which maps extreme vectors of onto extreme vectors of with component . Thanks to this correspondence, several properties, proven for flux cones, can be lifted to general convex polyhedra.
For the particular case where P is a flux polyhedron given by Equation (25), the correspondence (30) simplifies by associating to the flux cone in dimension defined by
The DD method builds an explicit description of a pointed polyhedron P, in the form of two generating matrices whose columns are respectively the ’s and the ’s, from an implicit description of P as in Equation (26), i.e., enumerates its vertices ExPs and extreme vectors ExVs.
If P is not pointed, it is still finitely generated:
with a (not unique this time) minimal set of generators consisting of basis vectors of the lineality space and suitable vectors and (e.g., the vertices and extreme vectors of the pointed polyhedron obtained by intersecting P with the orthogonal complement of its lineality space; if P is a non-pointed polyhedral cone, there is no nonzero and Equation (32) reduces to Equation (12)). In fact, Minkowski–Weyl theorem for polyhedra states that it is equivalent for a set P to be a polyhedron (26) or to be finitely generated, i.e., to be the Minkowski sum of the convex hull of a finite set of vectors (as the ’s) and of the conical hull of a finite set of vectors (as the ’s, ’s and ’s).
1.7.2. Elementary Points and Vectors and Conformal Generating Sets
However, such a decomposition into a finite set of generators is not in general satisfactory for a flux polyhedron as only a decomposition without any cancellations is biochemically meaningful, as was stipulated for a flux cone in Section 1.5. In the same way as we replaced, as generators for a polyhedral cone, extreme vectors by conformally non-decomposable vectors, we will replace, as generators for a polyhedron P, extreme points (vertices) by convex-conformally non-decomposable vectors (and, for its recession cone , extreme vectors by conformally non-decomposable vectors). A vector of a polyhedron P is called convex-conformally non-decomposable, if
Given a polyhedron P (resp., flux polyhedron ), a vector (resp., flux vector) is called an elementary point (resp., elementary flux point)—also called “bounded” elementary vector—of P if is convex-conformally non-decomposable and is called an elementary vector (resp., elementary flux vector)—also called “unbounded” elementary vector—of P if is conformally non-decomposable (it is unique only up to positive scalar multiplication) [7]. Elementary points (resp., flux points) will be noted EPs (resp., EFPs) and elementary vectors (resp., flux vectors) will be noted EVs (resp., EFVs), which is consistent with the same notation for polyhedral cones and flux cones. We will note Es = EPs ∪ EVs (resp., EFs = EFPs ∪ EFVs) the elementary elements (resp., elementary fluxes) of P (resp., ).
The elementary points and the elementary rays constitute the unique minimal (finite) set of conformal generators of P, i.e., generators for convex-conformal (for elementary points) and conformal (for elementary vectors) sum:
where the ’s, (finite index set), are the elementary points and the ’s, (finite index set), are the elementary vectors (unique up to positive scalar multiplication), and is the convex conformal hull. More precisely, we get an upper bound for the number of elementary points and vectors that are sufficient to decompose any given vector : with . This result can be demonstrated from result (16) for a flux cone by using the correspondence (30) between P and which maps the elementary points of P onto the elementary flux vectors of with component and the elementary vectors of P onto the elementary flux vectors of with component .
We already know that an extreme vector of is elementary, and is therefore an elementary vector of P, i.e., ExVs ⊆ EVs. It follows from Equations (28) and (33) that an extreme point (vertex) of P is an elementary point of P, i.e., ExPs ⊆ EPs, but the converse is generally false as a convex-conformally non-decomposable vector may be convex decomposable. Nevertheless, if P is contained in a closed orthant (and thus too), any sum of vectors of P (resp., ) is conformal and thus there is identity between extreme points (vertices) and elementary points (resp., between extreme vectors and elementary vectors), i.e., ExPs = EPs and ExVs = EVs. More precisely, for (resp., and nonzero) and O a closed orthant with , then is an elementary point (resp., elementary vector) of P if and only if it is a vertex in (resp., an extreme vector in ). It follows that the elementary points (resp., elementary vectors) of P are the vertices (resp., extreme vectors) of intersections of P (resp., ) with any closed orthant, which are pointed subpolyhedra (resp., pointed subcones):
Note in particular that, if , then EPs. Elementary points and vectors can therefore be obtained by using algorithms, such as DD, to compute vertices ExPs and extreme vectors ExVs of the pointed polyhedra . It is obvious that considering only the closed r-orthants is enough. Now, as for polyhedral cones, decomposing the polyhedron into topes is better:
If is the closed orthant defined by , then the corresponding tope for P is and, as seen for polyhedral cones, is a tope for the recession cone . Examine how the equality , for an arbitrary sign vector, can be expressed in terms of topes. Note first that any is dominated by at least one for the partial order ≤ on : , which means that is a sub-orthant of , and we have , expressing the relation between the topes for the recession cone of the polyhedron and the recession cones of certain of the polyhedron topes (precisely those topes for which dominates an , necessarily unique). More generally, the recession cone of any tope for P can be expressed as a subcone of a tope for the recession cone of P by , where is the greatest (it is necessarily unique) sign vector in dominated by (thus, if does not dominate any , is not a tope for ; this is the case for example if P is not a polytope but is, implying that and that is not a tope for ).
For the particular case of a flux polyhedron (25), we have and , for any sign vector . Any flux tope for can thus be expressed as , for a certain flux tope for FC, i.e., by applying the constraints to a flux tope for . The same holds for the recession cone: , i.e., by applying the homogeneous counterparts of constraints to a flux tope for . If is assumed to be consistent (same definition as for a flux cone, i.e., without unused reaction, always with a zero flux), all FTs then have full support, i.e., the ’s have full support or, equivalently, the ’s are r-orthants. An obvious upper bound for the number of FTs is thus . Note that to be consistent is a sufficient (but not necessary) condition for to be consistent, and that, if is consistent, so is (but can be inconsistent even if both and are consistent).
The method used to include a flux cone into a single (positive) orthant in higher dimension, by splitting each reversible reaction i into a forward and a backward irreversible reaction, applies as well to a flux polyhedron . Matrix is extended into matrix in the same way that is extended into , where is the new number of reactions and all reactions are now irreversible, . is in one-to-one correspondence with vectors of such that for i reversible. In particular, the fluxes with for a certain i, which are obtained as vertices or extreme vectors of , are futile (involving a net rate in opposite directions in reactions and ) without biological reality and must be eliminated (they are not generally limited to futile cycles as in flux cones). Finally, the set of elementary fluxes EFs is in one-to-one correspondence with (ExPs () ∪ ExVs ()) ∖{futile fluxes}. As is included in the positive -orthant, it has only one FT.
As for flux cones, there are two opposite ways of dealing with reversible reactions for computing EFs = EFPs ∪ EFVs of a flux polyhedron : either splitting each reversible reaction into two irreversible ones, reducing to a single FT in higher dimension, or keeping the reversible reactions unchanged and decomposing into FTs without increasing the dimension; all intermediate cases are possible. Whatever the solution adopted, EFs are obtained as the union of ExPs and ExVs for each flux tope for and thus we will generally work in a given FT for , defined by a (full support if is consistent) sign vector , and the EFs of which conform to are thus given by the ExPs and ExVs of this FT .
1.7.3. Elementary Modes
For a general polyhedron P (and even for a flux polyhedron ), there is no direct relationship between support-minimal Equation (19) vectors and elementary elements as it was the case for flux cones . Nevertheless, from Equation (34), it follows that, for any support-minimal nonzero vector in P (still called elementary mode EM), there is an elementary element with the same support. Thus, Equation (20) generalizes to the case of polyhedra and all minimal support patterns of vectors appear in the set of supports of elementary elements and are actually the minimal nonempty elements in this set for subset inclusion:
In particular, for a metabolic network, this means that any set of reactions involved in a minimal pathway (EFM) appears as the set of reactions involved in a certain elementary flux (EF).
In addition, for a general polyhedron P, the correspondence (30) between P and the higher dimensional flux cone maps the elementary points (resp., elementary vectors) of P onto the elementary flux vectors, i.e., the elementary flux modes, of with component (resp., with component ):
with given by Equation (30). The same formula holds for the particular case of a flux polyhedron (25) with given by Equation (31).
Moreover, as seen in the proof of Equation (22), any vector which is support-minimal in a certain is actually support minimal in P, thus EMs of P can be computed tope by tope:
See Figure 2 for an illustration of the concepts of extreme points and vectors, elementary points and vectors and elementary modes in a convex polyhedron.
Figure 2.
We consider the convex polyhedron in gray and its associated recession cone delimited by dashed lines. The extreme points are the vertices in blue and the extreme vectors are the blue vectors. The elementary points are all the blue or red points, and the elementary vectors are all the blue or red vectors, and they coincide, respectively, with the extreme points and extreme vectors for all topes (here we have four topes, corresponding to the subpolyhedra in the four quadrants defined by x and y). The elementary modes are the purple segment and half-line and they do not have a direct relationship with elementary elements, but their two possible supports, and , are the minimal nonempty supports of the elementary elements.
1.8. Complexity Results
Enumerating the vertices of an unbounded polyhedron has been proven to be an NP-hard enumeration problem [20]. However, the hardness of vertex generation for bounded polyhedra, and also the complexity of enumerating together vertices and extreme rays of polyhedra, are open problems. Consequently, the complexity of enumerating all EFs or EFMs of a metabolic network remains an open problem [21]. Nevertheless, enumerating all extreme rays of a flux cone that contain a given reaction in their support has been proven not to be in polynomial total time unless P = NP [21], which means that the output cannot be generated in time polynomial in the combined size of the input and the output. The problem of enumerating the extreme vectors of a polyhedral cone or the vertices of a polytope with a polynomial delay (i.e., the time between one output item and the next is bounded by a polynomial function in terms of the input size), or the even weaker question whether this enumeration can be done in polynomial total time, is an open problem too. Therefore, the possibility of enumerating EFs or EFMs of a metabolic network with a polynomial delay is an open problem [22] (except for a flux cone with all reactions being reversible, i.e., a flux linear subspace (3), as the EFMs are then the circuits of a linear matroid [16]). Given a flux cone and two reactions, deciding if there exists an extreme ray of the cone that has both reactions in its support is NP-complete [21]. Thus, given a metabolic network, deciding if an EF or EFM exists with a given support of size at least two is an NP-complete problem. The same result holds for deciding if an EF or EFM exists whose support size is bounded above by a given positive integer [22].
Regarding now the number of EFs or EFMs, counting the extreme rays of a polyhedral cone or the vertices of a polytope has been proven to be a #P-complete problem [23]. Thus, counting the number of EFs or EFMs of a metabolic network is also a #P-complete problem [22]. The McMullen’s upper bound theorem [24] states that, for any fixed positive integers d and n, the maximum number of j-faces of a d-polytope with n facets (i.e., faces of dimension ) is attained by the dual cyclic polytope for all . A consequence is that the maximum number of vertices of a d-polytope with n facets is given by . We thus obtain that the number of EFs or EFMs in a metabolic network (after having split each reversible reaction into two irreversible ones) is bounded above by a quantity approximately equal to with (so varies between r and ). If the number m of internal metabolites is small compared to the total number of reactions (after having split reversible reactions), the number of EFs or EFMs is then bounded above by a quantity of order . If m is close to , this number is bounded above by a quantity of order . The worst case occurs when m is close to with an upper bound approximately equal to . We obtain the same results with r instead of if we do not split reversible reactions but fix their signs arbitrarily, i.e., if we consider EFs or EFMs in an arbitrary closed r-orthant O, i.e., in an arbitrary flux tope for (or ). However, note that in practice the actual number of EFs or EFMs of a metabolic network is likely to be much smaller than these upper bounds.
2. Metabolic Pathways in the Presence of Biological Constraints
Although the current improved implementations of the DD method [25,26] allow the computation of millions, even billions, of EFMs or EFs, tackling genome-scale metabolic models (GSMMs) is still beyond our reach. Moreover, most of the computed EFMs or EFs are not biologically valid, because only stoichiometry and certain flux constraints, such as irreversibility of reactions or bounds on reaction rates, are taken into account. For both scaling up to large systems and limiting the number of biologically invalid solutions, it is necessary to consider additional biological constraints, such as thermodynamic, kinetic or regulatory constraints.
2.1. Biological Constraints
2.1.1. Thermodynamic Constraints
Assuming constant pressure and a closed system, according to the second law of thermodynamics, a reaction i proceeds spontaneously only in the direction of its negative Gibbs free energy [27], given by
where is the standard Gibbs free energy of reaction i, R the molar gas constant, T the absolute temperature, the (positive) concentration of metabolite j and is the stoichiometric coefficient of metabolite j in reaction i (i.e., is the extension of the stoichiometric matrix to the external metabolites). This means that , (resp., ) is a necessary condition to get (resp., ), which can be expressed by the constraint , and that a flux vector is thermodynamically feasible [28] if and only if all its components satisfy such a constraint (it is enough to consider those as the constraint is trivially satisfied when ). The thermodynamic constraint for , that depends on the vector of metabolite concentrations, can thus be defined as
As at equilibrium is null, we obtain , where is the equilibrium constant of reaction i. Thus, can be rewritten as and the thermodynamic constraint as
Often, the concentrations of external metabolites can be measured and included in the constraint as known parameters, keeping only a dependency of the constraint on the concentrations of internal metabolites. The formula can be rewritten, by dividing the numerator and denominator of the fraction by the terms dealing with external metabolites, as , where is the apparent equilibrium constant of the reaction i and (resp., ) the vector of internal (resp., external) metabolite concentrations. The thermodynamic constraint (42) can thus be rewritten as
For given metabolite concentrations vector (resp., internal metabolite concentrations vector ), let (resp., ) be the fixed thermodynamic sign vector defined by:
Then, the thermodynamic constraint (resp., ) can be rewritten as
Thus, the set (resp., ) of vectors satisfying the constraint (resp., ), given by (resp., ), is the closed orthant (resp., ) defined by (resp., ), of dimension r if (resp., ) has full support, i.e., (resp., ) for all i, of lesser dimension otherwise.
Lemma 1.
Given metabolite concentrations (resp., internal metabolite concentrations ), the set (resp., ) of vectors in satisfying the thermodynamic constraint (resp., ) is the set of vectors that conform to the thermodynamic sign vector (resp., ) Equation (44), i.e., the closed orthant (resp., ) defined by (resp., ).
2.1.2. Kinetic Constraints
Metabolic reactions are catalyzed by enzymes. The catalytic mechanisms of key enzymes have been investigated in great detail and described by mathematical formulas. However, many kinetic equations are still unknown and have to be substituted by standard rate laws such as mass-action kinetics, power laws, reversible Hill kinetics, lin-log kinetics, convenience kinetics, generic rate equations or TKM rate laws [29]. What is important for our study is that these modular rate laws share the general form
where is the (non-negative) level of the enzyme catalyzing the reaction i (given either as an amount or as a concentration, in which case the rate law is pre-multiplied by the compartment volume) and depends on the concentrations of the metabolites occurring in i (reactants of i) and on reaction i stoichiometry, rate law considered, allosteric regulation and parameters. In the general form of , is the thermodynamic numerator (which can be written in the compact form with turnover rate parameters ) that gives its sign to (and thus to ) and reflects the relationship between chemical potentials and reaction directions and ensures that the rate vanishes at chemical equilibrium, and , both positive, implement enzyme regulation (partial or complete for the first one, specific for the second) and is the (positive) kinetic denominator, a polynomial of scaled reactant concentrations whose terms correspond to different binding states of the enzyme (reducing the enzyme amount available for catalysis), which depends on the rate law considered.
The kinetic constraint for depends both on the vector of enzyme concentrations and on the vector of metabolite concentrations and can thus be defined as
where ∘ is the component-wise product of vectors: . This means that the flux vector is a component-wise linear function of the vector of enzyme concentrations (and a nonlinear function of the metabolite concentrations vector). For nonzero , the sign of gives the direction in which reaction i proceeds, so in this sense the kinetic constraint includes the thermodynamic constraint.
This can be highlighted on a widely-used rate law, the reversible Michaelis–Menten kinetics [30]. In the simple case of a reaction i where the enzyme can only exist in one of three distinct states: free, all substrates bound, or all products bound, it can be written as
where and are the maximal forward and backward rates in reaction i per unit of enzyme and the ’s are the Michaelis constants. Equating the numerator to zero at equilibrium, we obtain . This gives
that is, the product of three terms: the positive capacity term per unit of enzyme, the positive (smaller than one) fractional saturation term depending on and the thermodynamic term, which can be rewritten as and gives the sign of (and thus the sign of ): , i.e., the thermodynamic constraint. Thus, and we will assume that this equality holds for all kinetic laws we consider.
Note that, for given metabolite and enzyme concentrations and , the kinetic constraint defines completely and uniquely the only vector that satisfies it: .
2.1.3. Regulatory Constraints
Coupling metabolic networks with Boolean transcriptional regulatory networks allows us to express the additional constraints imposed by gene regulatory information on a metabolic network and to take them into account when computing EFMs [31,32,33]. In all generality, such a constraint may be given by an arbitrary Boolean formula in terms of the reactions i, viewed as propositional symbols, i.e., the positive literal i meaning that reaction i is active (nonzero flux) and the negative literal meaning it is inactive (zero flux). Thus, when applying this Boolean constraint, noted , to a flux vector , the positive literal i is interpreted as , i.e., (4), and the negative literal as , i.e., .
The regulatory constraint for induced by can thus be defined as
Note that coupled reactions as used by Flux Coupling Analysis (FCA) [34,35,36] can be easily represented by such constraints. For example, i directionally coupled to j, meaning that zero flux through i implies zero flux through j, is expressed by , and i partially coupled to j, meaning that zero flux through i is equivalent to zero flux through j, is expressed by .
By rewriting the Boolean constraint in DNF (Disjunctive Normal Form), , the set of vectors satisfying the constraint is a union of the solution spaces for each disjunct (and this union can be assumed to be disjoint by taking the disjuncts two by two inconsistent). Now a disjunct is a conjunction of literals, where a negative literal corresponds to the constraint and a positive literal i to the constraint , which can be rewritten as the disjunctive constraint . Finally, a propositional symbol i that does not appear in the disjunct corresponds to an absence of constraint on , which can be rewritten as the disjunction . Thus, the solution space for a disjunct is itself the disjoint union of subspaces each one defined by constraints of type 0 for all i, with , that is, defined by for a given sign vector , i.e., an open orthant (which, for , is topologically open in the vector subspace it spans and is the interior in this subspace of the closed orthant , which is the closure of in ). In summary, is thus the disjoint union of such open orthants. Now, instead of keeping this partition of in open orthants, it can be more practical to generalize this concept and deal with what we will call semi-open orthants , i.e., orthants O without some of their faces of lesser dimension (open orthant is thus a particular case of semi-open orthant, without any facet thus without any face of lesser dimension). To do this, we can group together with any given open orthant in , in a same cluster , all other open orthants in such that , i.e., is a face of , and, for any arbitrary orthant with (which is equivalent to ), then . In this case is a semi-open orthant . We can iteratively process like this by considering each time the open orthants in that have not yet been selected in any cluster, which guarantees that the semi-open orthants built in this way are disjoint. In addition, by choosing as open orthant, , to start with at each iteration, a maximal one among those that remain, we are sure that no two of the semi-open orthants thus built can be grouped together to constitute a bigger semi-open orthant, i.e., the collection obtained is minimal in this sense. We finally obtain that can be written as a disjoint union of semi-open orthants, with no merging possible between any two of them. However, note that such a decomposition is not unique and that an inclusion can still exist between the closures of two such semi-open orthants. If we consider the particular case of a Boolean constraint which is an arbitrary disjunct D, then for any closed r-orthant , i.e., for any full support sign vector , is a semi-open orthant, which is the face of defined by for all i such that is a literal of D, without its facets of equation for all i such that i is a literal of D.
Lemma 2.
The set of vectors in satisfying the regulatory constraint is a disjoint union of open orthants . These can be grouped together such that is a disjoint union of semi-open orthants , i.e., orthants O without some of their faces, such that no two of them can be grouped together to constitute a bigger semi-open orthant. If is a conjunction of literals and an arbitrary full support sign vector, then is itself a semi-open orthant.
2.2. Characterizing the Solution Space
We start from a flux cone (6) (or more generally a flux polyhedron (25)) representing the flux vectors of a metabolic network in a steady state, satisfying the stoichiometric equations, the inequalities regarding irreversibility of reactions (and possibly some inhomogeneous linear constraints). Then, we consider additional biological constraints, such as those described in Section 2.1. In all generality, these constraints will be noted , where represents a flux vector in and a vector of biochemical quantities involved in the constraint (typically, metabolite concentrations and enzyme concentrations). Some parameters (such as stoichiometric coefficients), that are not made explicit in the notation for sake of simplicity, are also present. In the following, the solution space in of an arbitrary constraint will be noted , defined as . For given values of quantities , the solution space is thus the constrained flux cone subset (not necessarily a cone, depending on ):
or, more generally, the constrained flux polyhedron subset (not necessarily a polyhedron, depending on ):
Now, very often, most of the values of quantities , if not all, are unknown. We then only require consistency of the set of constraints , i.e., the existence of values for variables such that the constraints are satisfied. This means that we replace the constraint by the constraint whose solution space is . The solution space becomes
or
If some but not all of the values of quantities are known, concerns only those whose value is unknown, the known values being integrated in as parameters to simplify the notations.
We will obtain examples of such constraints from the cases of thermodynamic constraint and kinetic constraint .
Depending on the constraint considered, the solution space is in general no longer a cone or polyhedron and no longer convex. Nevertheless, the definitions of extreme vectors and extreme points, of elementary flux vectors and elementary flux points, and of elementary flux modes are still valid, as, respectively, non-decomposable and convex non-decomposable vectors, conformally non-decomposable and convex-conformally non-decomposable vectors, and support-minimal vectors, where decomposition and support minimality have to be understood w.r.t. vectors of the solution space. It is then clear that if a vector of (resp., ) satisfies one of those properties in (resp., ) and satisfies the given constraint, then it satisfies the same property in the solution space, i.e.,
For EFMs, the above formulas are valid whatever the structure of . It is also the case for ExPs and EFPs but, in practice, only really meaningful if (resp., ) is a convex polyhedron. Finally, for ExVs and EFVs, this is only meaningful if (resp., ) is a convex polyhedral cone and (resp., ) is a convex polyhedron. This is the case when (resp., ) is itself a convex polyhedral cone and we will see that, for almost all common biological constraints, this solution space is actually a union of such cones or of semi-open cones and thus the formulas will apply for each conical component. In this case we have . This can be generalized when is a convex polyhedron, and thus and too, with and , by replacing by in the formulas above regarding ExVs and EFVs.
However, in all the above cases, we generally do not have the reciprocal subset inclusion. This is what we will study for particular constraints.
2.2.1. Application to Thermodynamics
Assume first that the concentrations of the metabolites (resp., of the internal metabolites) are known and given. From Equations (51) and (52) and Lemma 1, we obtain
, are thus flux cones and , flux polyhedra (possibly equal to or empty (for polyhedra) if the flux directions imposed by the concentrations of the metabolites and by the second law of thermodynamics are incompatible with the steady-state assumption). When nonempty, they consist of a single flux tope. From Equation (18), (36) and (39), it follows that
and the same for and . Now, considering the decomposition of (resp., ) into flux topes, we can proceed flux tope by flux tope for the computation of the sets above. Starting from a FT (resp., ), we have (resp., ) with inf (where inf is defined component-wise with inf) and the four sets above are equal to ExVs (), ExPs (), ExVs () and EFMs () respectively. This is in particular the case if we split each reversible reaction into two irreversible ones as, in higher dimension, (resp., ) is reduced to a single flux tope defined by , the all-positive sign vector, and thus is obtained from by changing each − into 0. Note also that (resp., ) consistent does not imply that (resp., ) is consistent ( being full support does not imply that is full support).
Proposition 1.
Given metabolite concentrations , the space of flux vectors in (resp., ) satisfying the thermodynamic constraint is a flux cone (resp., a flux polyhedron), made up of vectors of (resp., ) which conform to the thermodynamic sign vector given in Equation (44). Its elementary flux vectors, identical to elementary flux modes, (resp., elementary flux points, elementary flux vectors and elementary flux modes) are exactly those of (resp., FP) that satisfy the constraint, i.e., that conform to . The same result holds for internal metabolite concentrations and thermodynamic constraint with .
Let us now consider the more usual case where the concentrations of the metabolites (at least those of internal metabolites) are unknown. From Equation (45), we obtain one or the other form for the thermodynamic constraint (existentially quantified on metabolite concentrations):
Though the metabolite concentrations are unknown, some lower bounds and upper bounds on these concentrations are often known. They are thus added as additional constraints:
The solution space in of these constraints is thus (idem for ) and (idem for ). From the result of Lemma 1, it follows that and (idem for and with ). This is obviously enough to take the union on the maximal ’s (resp., ’s) when varies. As these are at most , the union is finite.
Lemma 3.
The set (resp., ) of vectors in satisfying the thermodynamic constraint (resp., ) is a union of closed orthants. More precisely, it is the (finite) union for all (resp., all bounded ) of the sets of vectors that conform to , i.e., ’s. The same result holds for and with M and .
It follows from Equations (53), (54), (56) and (57) that
and the same for and with M and , where all unions are finite.
From this and Equations (18), (36) and (39), we get
and the analog for , and . Now, considering the decomposition of (resp., ) into flux topes, we can proceed flux tope by flux tope for the computation of the sets above. Starting from a FT (resp., ), we have (resp., ) where are the maximal sign vectors in {inf and the four sets above are equal to ExVs (), ExPs (), ExVs () and EFMs (), respectively. This is in particular the case if we split each reversible reaction into two irreversible ones as, in higher dimension, (resp., ) is reduced to a single flux tope defined by . The analog holds for , and .
Proposition 2.
The space of flux vectors in (resp., ) satisfying the thermodynamic constraint , or , is a finite union of flux cones (resp., flux polyhedra), obtained as those vectors of (resp., ) which conform to , for a certain , or bounded . It is thus no longer convex but made up of particular faces (resp., ) of each flux tope for (resp., for ), with . Its elementary flux vectors, identical to elementary flux modes, (resp., elementary flux points, elementary flux vectors and elementary flux modes) are exactly those of (resp., FP) that satisfy the constraint, i.e., that conform to a certain , and coincide thus with the extreme vectors (resp., the extreme points, extreme vectors and elementary modes) of the ’s (resp., ’s). The same result holds for , or , with M and .
We will now refine this result in order to characterize the faces involved. Consider the case of a flux cone where the concentrations of external metabolites are given and assume first there are no bounds on the concentrations of the internal metabolites [37]. As and , let us begin by studying . We have from Equation (44). By applying the monotonic logarithm function and noting the vector whose components are given by , we obtain . We will now make use of Gale’s theorem (or Kuhn–Fourier theorem), which is a form of Farkas’ duality lemma (for two vectors , , means for all i):
Gale’s theorem (a form of Farkas’ lemma). For any and , exactly one of the following statements holds:
- (a)
- there exists such that ;
- (b)
- there exists such that , and .
Applying this theorem in the formula above to (where we note the subspace of vectors having a null component outside the support of ), given by , and , given by , provides where is defined by: .
We thus obtain . This can be rewritten as , where is an open half-space with the hyperplane as a frontier. Considering the decomposition of into FTs , this is equivalent to for all ’s maximal sign vectors in sign(). Now, note that, in this formula, is the minimal face (for set inclusion) of containing . We finally obtain finally:
That is to say, is the union of all the maximal faces of the FTs for such that or, equivalently, the union of all maximal faces of the ’s for all r-orthants O such that (see Figure 3).
Figure 3.
Structure of the solution space in the presence of the thermodynamic constraint (without bounds on the metabolite concentrations). We consider a given flux tope and represent the section of the flux cone with an affine hyperplane (polytope shaded in pink). Vertices to thus represent the EFMs. When we add the thermodynamic constraint, the solution space is the union of all the maximal faces of the flux cone that are entirely contained in the open half-space (left side of frontier H in the section), represented in red. In particular, the thermodynamically feasible EFMs are given by the vertices to that belong to this half-space. The set of these EFMs is decomposed into LTCSs, made up of the EFMs of the maximal faces above (in the 2D shape, these LTCSs are thus given by ).
A topological consequence of this characterization is that is a connected set (actually arc-connected). Another consequence, regarding EFMs (or, equivalently, EFVs), is:
i.e., the elementary flux modes in the (non-convex) cone of those flux vectors in satisfying the thermodynamic constraint are exactly those elementary flux modes in that satisfy , i.e., that belong to the open half-space . We can equivalently write
where is the polyhedral cone obtained from merely by adding the homogeneous linear inequality [38]. Each flux vector of is a conformal conical sum of these EFMs, but the converse is false as is not convex. Thus the set of EFMs, considered globally, does not characterize the solution space . More precisely, we have
where is as in Equation (66), i.e., is characterized by the decomposition of the set of EFMs into the (non-disjoint) subsets . Now, the ’s are exactly the maximal subsets of EFMs included in a given flux tope for (i.e., in a given r-orthant O) and whose conical hull is included in , i.e., all vectors in this hull must satisfy the constraint : maximal such that EFMs and with O r-orthant and . Such an is called a largest thermodynamically consistent set (LTCS) of EFMs [39] in O (or, equivalently, in the flux tope defined by O).
We could want to estimate the ratio of thermodynamically feasible EFMs on all EFMs, i.e., the ratio of EFMs on EFMs . If all reactions are reversible (), then the function maps the EFMs on one side of hyperplane onto the EFMs on the other side. Thus, if we neglect the EFMs that might belong to this hyperplane, it means that 50% of the EFMs are thermodynamically feasible (and thus only 50% eliminated). Now, intuitively, irreversible reactions given in come from an expert knowledge that can be seen as a form of compiled thermodynamic knowledge, as we saw that it is thermodynamics which imposes the direction in which a reaction may proceed. Therefore, we can assume, provided the adequacy of the model for some given environment (such as the concentrations of external metabolites, supposed here to be known), that any thermodynamically feasible EFM satisfies these irreversibility constraints. Under this assumption, the irreversibility constraints rule out only thermodynamically unfeasible EFMs so if we then apply the thermodynamic constraint we only eliminate at most than 50% of the remaining EFMs (from 50% when all reactions are reversible to 0% when all are irreversible, without splitting any reversible reaction into two irreversible ones).
If inhomogeneous linear constraints are added, we have , with Equation (25) and we obtain in the same way, for all ’s maximal sign vectors in sign()
That is to say, is the union of all the such that is a maximal face of a FT for verifying or, equivalently, the union of the for all maximal faces of the ’s for all r-orthants O such that . Take care because the ’s involved are faces of included in , but not any face of included in is included in (as such a face may be defined by inhomogeneous equality constraints coming from and not only by nullity constraints of the form ). is no longer a connected set in general.
We can sum up these results as follows.
Proposition 3.
The space of flux vectors in satisfying the thermodynamic constraint is a finite union of flux cones, obtained as all the maximal faces of all the flux topes that are entirely contained (except the null vector) in the open half-space . The thermodynamically feasible EFMs are thus those EFMs which belong to this half-space and can be simply computed as the EFMs of the flux cone obtained from by adding to it the homogeneous linear inequality (and removing those EFMs that would belong to the frontier hyperplane of ). The set of these EFMs can be decomposed into (non-disjoint) maximal subsets of EFMs belonging to a same flux tope (i.e., a same r-orthant) and whose conical hull is made up of thermodynamically feasible vectors, each of these subsets thus representing the set of EFMs of one of the maximal faces above. At most 50% of the EFMs can thus be ruled out as thermodynamically infeasible, if we assume that no thermodynamically feasible flux vector violates given irreversibility constraints. In the presence of additional inhomogeneous linear constraints on flux vectors given by , the space of flux vectors in FP satisfying TC is a finite union of flux polyhedra, obtained as intersections of the flux cones above with the polyhedron defined by . All these results hold for constraint TC by just replacing the ’s by the ’s.
This applies in particular to and (after splitting each reversible reaction into two irreversible ones) with the simplification, as (resp., ) is reduced to a single flux tope defined by , that we must just consider the maximal faces of that are entirely contained (except the null vector) in the open half-space .
Consider now the case where certain bounds on the concentrations of internal metabolites are known, i.e., the case of the thermodynamic constraint . We have . From what precedes, we obtain . Applying Gale’s theorem in this formula to , with , and , with , provides where is defined by . Let , where and . is a flux cone in of dimension and . We thus obtain . This is equivalent to: for all flux topes for . This can be rewritten as , where we note the open half-space with hyperplane as a frontier. We have and . Now, note that, in the formula above, is a face of the cone (we note the sign vector of dimension obtained by concatenation of of dimension r and + of dimension , thus ) whose intersection with is equal to , the minimal face of the tope containing . Actually, among all the faces of whose intersection with is equal to , this is the maximal one (without any constraint on the factor). Thus, finally,
.
Note that any given by Equation (71) is a face of a certain given by Equation (66) (i.e., ). is no longer a connected set in general. A consequence, for EFMs (or, equivalently, EFVs), is:
where is the maximal face of s.t. . This means that the elementary flux modes in the (non-convex) cone of those flux vectors in satisfying the thermodynamic constraint are exactly those elementary flux modes in (or in ) that satisfy , i.e., such that the maximal face of whose intersection with is equal to the ray is included in . Note that EFMs is thus given by
Each flux vector of is a conformal conical sum of these EFMs, but the converse is false as is not convex. More precisely, we have
where is as in Equation (71), i.e., is characterized by the decomposition of the set of EFMs into the (non-disjoint) subsets . Now, the ’s are exactly the maximal subsets of EFMs included in a given flux tope for (i.e., in a given r-orthant O) and whose conical hull is included in , i.e., all vectors in this hull must satisfy the constraint : maximal such that EFMs and with O r-orthant and . Such an is called a largest thermodynamically consistent (for bounded concentrations of internal metabolites) set (LTCbS) of EFMs [39] in O (or, equivalently, in the flux tope defined by O) and is included in a certain LTCS as in Equation (69).
If inhomogeneous linear constraints are added, we have, for all ’s maximal sign vectors in sign():
with as in Equation (71). That is to say, is the union of all the such that is given as in Equation (71).
We can sum up these results as follows.
Proposition 4.
For a flux cone in , let us define its lift to as the flux cone . Thus, . For any flux tope for and any face of , its lift to is defined as the maximal face of whose intersection with is equal to . The space of flux vectors in satisfying the thermodynamic constraint is a finite union of flux cones, obtained as all the maximal faces of all the flux topes whose lifts to are entirely contained (except vectors from ) in the open half-space , where . The thermodynamically feasible EFMs in are those EFMs in whose lifts (as rays) are contained (except vectors from ) in this half-space. The set of these EFMs can be decomposed into (non-disjoint) maximal subsets of belonging to a same flux tope (i.e., a same r-orthant) and whose conical hull is made up of thermodynamically feasible vectors, each of these subsets representing thus the set of EFMs of one of the maximal faces above. In the presence of additional inhomogeneous linear constraints on flux vectors given by , the space of flux vectors in FP satisfying TCb is a finite union of flux polyhedra, obtained as intersections of the flux cones above with the polyhedron defined by .
2.2.2. Application to Kinetics
In the same way, we obtain for the kinetic constraint in the absence of knowledge regarding values of concentrations of enzymes and metabolites:
Once more we can add optional lower and upper bounds and on metabolite concentrations and/or lower and upper bounds and on enzyme concentrations if they are known, and also a global enzymatic resource constraint, which is often considered, as , where is a constant positive vector of size r and W a positive constant:
We can also consider intermediate constraints, existentially quantified only on metabolite concentrations if enzyme concentrations are known:
possibly with bounds on metabolite concentrations in the quantification:
or only on enzyme concentrations if metabolite concentrations are known:
possibly with enzymatic resource constraint and bounds on enzyme concentrations in the quantification:
The solution space in of the constraint is , as the sign of is the thermodynamic sign vector . It is thus the same as the solution space of the thermodynamic constraint and is the closed orthant . It follows that the solution space of the constraint , given by , is the same as the solution space of the thermodynamic constraint and is the finite union of the closed orthants . Similarly, if the bounds are only on the metabolite concentrations, i.e., there are no bounds on enzyme concentrations. From , it follows that the flux vectors in are defined by the linear inequalities: for those i’s such that , for those i’s such that , for those i’s such that and . Thus is a convex polyhedron (a parallelepiped contained in the closed orthant , truncated by a hyperplane). Now, if , i.e., in the absence of positive lower bounds on enzyme concentrations, this polyhedron has as a vertex and is nothing else than , i.e., the closed orthant , truncated as a parallelepiped by the faces given by for those i’s such that and by the hyperplane . Consequently, is equal to a certain truncation of , defined as the finite union of truncations of the closed orthants (each becoming a parallelepiped, after being truncated by hyperplanes according to equations if (resp., if ), where (resp., ) applies to those such that , which gives a polyhedron, but also, in the presence of an enzymatic resource constraint, by an algebraic, nonlinear, surface, which gives in this case a local solution space in that is no longer a polyhedron and is not necessarily convex).
Proposition 5.
Given metabolite concentrations , the kinetic constraint is identical to the thermodynamic constraint and thus the set of vectors in satisfying is the closed orthant defined by the thermodynamic sign vector . The kinetic constraint is identical to the thermodynamic constraint and thus the set of vectors in satisfying is a union of closed orthants. In the presence of bounds only on metabolite concentrations (and not on enzyme concentrations), the kinetic constraint is identical to the thermodynamic constraint and thus the set of vectors in satisfying is a union of closed orthants. Therefore, these kinetic constraints boil down to thermodynamic constraints and the results regarding the geometrical structure of the corresponding spaces of flux vectors and the characterization of elementary flux vectors (or elementary flux modes) given by Propositions 1–4 apply: in particular, is a flux cone and and are finite unions of flux cones. For a given and in the presence of bounds on enzyme concentrations, the set of vectors in satisfying is a convex polyhedron and is thus a flux polyhedron. In the particular case where positive lower bounds on enzyme concentrations are absent, is just the parallelepiped obtained by truncating the flux cone by hyperplanes originating from the upper bounds on enzyme concentrations and by a hyperplane originating from the enzymatic resource constraint and coincides thus with the said flux cone in a certain adequate neighborhood of , i.e., for sufficiently small values of the fluxes. In this case, is thus a truncation (by an algebraic surface) of and coincides with this union of flux cones in a certain adequate neighborhood of and the characterization of elementary flux vectors remains valid in this neighborhood. These results extend in the presence of additional inhomogeneous linear constraints on flux vectors given by by intersecting the solution spaces above with the polyhedron defined by , giving rise to flux polyhedra (results in a neighborhood of holding only if h < 0).
However, the geometric structure of , of and of in the presence of positive lower bounds on enzyme concentrations, depends on the kinetic function and , and are generally neither polyhedra nor convex.
Proposition 5 has important consequences on constrained enzyme allocation problems in kinetic metabolic networks. Considering a kinetic metabolic network, with possible bounds on metabolite concentrations, but not on enzyme concentrations, i.e., with solution space , or , which is a finite union, for varying, of the flux cones , the generic enzyme allocation problem consists in maximizing the specific flux (or specific rate, i.e., rate expressed per unit of biomass amount) of a given reaction, say k, defined as , where is a flux vector in or and denotes the total protein content in the system. is expressed in all generality as a fixed weighted sum of the enzyme concentrations, (the ’s being given positive weights that denote the fraction of the resource used per unit of enzyme), able to encode different enzymatic constraints (such as limited cellular or membrane surface space) as well as other constraints regarding the abundance of certain enzymes. Likewise, the steady-state flux component may stand for diverse metabolic processes, ranging from the synthesis rate of a particular product within a specific pathway to the rate of overall cellular growth. The formation rate of a metabolic product expressed per gram of biomass and the specific growth rate of a cell are both examples of such specific rates. We look for maximization in the solution space by varying the metabolite concentrations (inside their bounds, if any) and the enzyme concentrations (without bounds), which gives rise to a complex non-convex optimization problem. Now, maximizing is equivalent to fixing the rate to a positive value, e.g., to 1, and minimizing the needed to attain this level of . This means minimizing the function by varying (with possible bounds) and, for each given , the flux vector in (without bounds on as we assume there are no bounds on enzyme concentrations). If all ’s are empty, the problem is unsolvable, i.e., is incompatible with the kinetics. Otherwise, i.e., when the problem is solvable, we consider successively each nonempty . Such an is a flux polyhedron whose elementary flux points (which are equal to the extreme points or vertices) correspond to the intersections of the hyperplane with extreme rays (edges) of , i.e., EFMs of or , and whose elementary vectors (equal to extreme vectors), if any, correspond to the extreme rays of . As, for fixed, the function to minimize is linear in , its minimum on is reached on a lower-dimensional face of (as , and is included in a closed orthant, this face is necessarily a convex hull of certain extreme points of even if it is not a polytope, i.e., no extreme vector may occur as one of the generators of this face), and thus reached in particular at at least one extreme point, i.e., at an EFM. Now, as the total number of EFMs is finite, so is the number of those for which the minimum of the function occurs for any given , considering all nonempty ’s. Therefore, when varies in its domain, we obtain the result that the maximum of the specific flux occurs (at least) at an EFM of or . In the case of and assuming the ’s are continuous, this maximum is attained at an EFM at finite metabolite concentrations as then varies in the compact set . In the case of , the maximum might not be attained at finite metabolite concentrations. This is the result already obtained in [40,41] and we followed a similar proof, but relying this time on a precise characterization of the solution space or and of the EFMs given by the Proposition 5, which was not the case in the above-quoted works. Finally, the enzyme allocation problem can theoretically be solved by computing all the thermodynamically feasible EFMs having k in their support (i.e., satisfying the Boolean constraint , see next subsection), say (all components of which are fixed by ), and, for each one, by computing the minimum (if it exists) of for varying in its domain, such that for all , which is a much simpler optimization problem than the initial one. The global minimum, if it exists, is then the smallest of these local minima, for all ’s. We then obtain the maximum value of the specific flux , an EFM where this maximum occurs and the values of the metabolite concentrations for which it occurs.
Proposition 6.
Given a kinetic metabolic network with possible bounds on metabolite concentrations, but not on enzyme concentrations, i.e., with solution space or , if the enzyme allocation problem, which consists in maximizing the specific flux (rate per unit of biomass amount) in a given reaction k, i.e., , where is a flux vector in or and is the total protein content in the system (the ’s being fixed positive weights), has an optimal solution (which is always the case for if the problem is solvable), then this solution is reached in particular at an EFM of or .
2.2.3. Application to Regulatory Constraints
From Lemma 2, we obtain that (resp., ) is the disjoint union of ’s (resp., ’s), for certain open orthants and, after merging, the disjoint union of ’s (resp., ’s), for certain semi-open orthants (orthants without some of their faces), without any further merging possible.
We will call open polyhedral cone (resp., open polyhedron) in dimension an r-polyhedral cone C (resp., r-polyhedron P) without its facets, and we will note it (resp., ) as it coincides with the topological interior of C (resp., P) in the affine span of C (resp., P). Reciprocally, C (resp., P) is the topological closure of (resp., ) in . (resp., ) is defined as in Equation (7) (resp., in Equation (26)) but with strict inequalities (and equalities for defining its affine space). We will in particular consider open faces of a cone C or polyhedron P as for F a face of C or P. C (resp., P) is the disjoint union of its open faces (by convention, a face of dimension 0, i.e., a vertex, is equal to its open face).
It follows from these definitions that, for any closed r-orthant O and any open orthant , (resp., ) is an open face of (resp., a disjoint union of open faces of , as we have to keep faces corresponding to ). In all, we obtain that
(resp., ) is a disjoint union of open faces of the flux cone (resp., the flux polyhedron ) or, equivalently, that, for any flux tope FC&η for FC(resp., FP&η for FP), (resp., ) is a disjoint union of open faces of FC&η (resp., FP&η). As we grouped together open orthants into semi-open orthants in Lemma 2, we can also group together with such an open face
all those other open faces in question where is a face of F to obtain thus a (minimal) disjoint union of semi-open polyhedral cones (resp., semi-open polyhedra). Here, we call semi-open polyhedral cone (resp., semi-open polyhedron a polyhedral cone C (resp., polyhedron P) without certain (between zero and all) of its faces of lesser dimension, that can be thus expressed as a disjoint union of certain (between all and only , resp.,) of the open faces of C (resp., P). We have: , resp., ).
Proposition 7.
Given an arbitrary Boolean constraint Bc, the solution space (resp., ) for the regulatory constraint is a finite disjoint union of open polyhedral cones (resp., open polyhedra), which are certain open faces of all the flux topes (resp., ). Grouping together certain of these open faces according to Lemma 2, we obtain a disjoint union of semi-open (i.e., without certain of their faces of lesser dimension) faces of the ’s (resp., ’s), without any possible further merging of two of them to make up a bigger semi-open face.
Note that the rules presented in the proof of Lemma 2 to group together open faces into semi-open faces are applied globally to all flux topes. If we choose to apply them separately for each flux tope, then the union of semi-open faces obtained is no longer disjoint in general as two such semi-open faces for two different flux topes may have a nonempty intersection (as (resp., ) and (resp., ), for two different closed r-orthants O and O′ may have open faces in common). Note also that, even after this merging, there may exist a strict inclusion relationship between the closures of two semi-open faces.
From this geometrical structure of the solution space, we will now deduce what its EFMs and its EFVs (or EFPs) are. Let us begin with a preliminary remark. We know that, by definition of EFVs, we have, for a flux cone , EFVs EFVs (and idem for a flux polyhedron with EFPs and EFVs), which remains true for any constrained flux cone subset (or constrained flux polyhedron subset ) whatever the biological constraint is. We saw that this equality was also satisfied by EFMs for : EFMs EFMs (and idem for ) and remained true for for any thermodynamic constraint or any kinetic constraint of the form , or (in the absence of bounds on enzyme concentrations), thus allowing in these cases to decompose the identification of EFMs flux tope by flux tope, as for EFVs. However this is no longer true for regulatory constraints. Actually, counter-examples can be found in the presence of reversible reactions, used in a given direction in an EFM local to a certain flux tope and in the other direction in an EFM local to another flux tope, choosing the Boolean constraint such that it imposes a strict set inclusion between the supports of these two local EFMs.
Example 1.
Consider the simple network comprising one reaction , where A and B are the two internal metabolites, and four transfer reactions , , , , and assume that R and are reversible (we will note par rev() the reversed reaction), the three other reactions being irreversible (see Figure 4). It is obvious that , made up of , , made up of , and , made up of , are EFMs of . Consider now the Boolean constraint . Then, is an EFM of , belonging to . Moreover, any with is an EFM of (obviously, the sub-pathways and of do not belong to ). As and , we have and thus is not an EFM of .
Figure 4.
Simple network of Example 1 with R and reversible. are EFMs. When we add the regulatory constraint , is still an EFM, which is not the case of any positive combination of and , which has a larger support. Nevertheless, the latter is an EFM in the flux tope defined by reverse fluxes in R and .
Now, from a biological point of view, it is not relevant to compare supports of two pathways with a certain reaction in a given direction in the first support and in the other direction in the second support (case of R and in the example). This means that the useful concept concerning minimality is not support-minimality, but sign-minimality (exactly in the same way as, concerning decomposition, we saw that the useful concept is not non-decomposability but conform non-decomposability), which is equivalent to comparing supports separately for each closed r-orthant, i.e., for each flux tope. We will thus identify the EFMs flux tope by flux tope (note that this is analog to distinguishing a positive flux from a negative flux in a regulatory constraint, e.g., distinguishing the constraints and in the example above, which could be done by adopting a tri-valued logic instead of a Boolean logic to represent these constraints; this is obviously done automatically when splitting each reversible reaction into two irreversible ones, where only the positive r-orthant has to be considered).
Therefore, we will consider in the following an arbitrary closed r-orthant O given by a full support sign vector (with for ) and consider the part of the solution space limited to this orthant, i.e., (resp., ), thus we can limit ourselves to sign vectors that are maximal in (resp., ). We saw that we could rewrite the Boolean constraint as a disjunction of two by two exclusive disjuncts, , decomposing thus the solution space (resp., ) into the disjoint solution spaces for each disjunct, (resp., ). The elementary flux vectors (i.e., faces of dimension one) of that satisfy the constraint are obviously elementary flux vectors of and the reciprocal is also true: if a flux vector of the semi-open polyhedral cone is not an elementary flux vector of , i.e., is not a face of dimension one, then it belongs to the interior of a face of of dimension at least two and is thus conformally decomposable in this face, i.e., is not elementary in . It follows that the elementary flux vectors of are made up of all the elementary flux vectors of the ’s. We can sum up the results regarding EFVs as:
This means that EFVs can be computed flux tope by flux tope and constraint-disjunct by constraint-disjunct. Moreover, the result holds also for EFPs (by considering vertices of ) and EFVs of .
However, for EFMs, we have to take care that a phenomenon similar to that described in the example above still arises and that, even in a given flux tope, an EFM of is not necessarily an EFM of .
Example 2.
Consider the network comprising three irreversible reactions and one internal metabolite A: , , , and the Boolean constraint , decomposed as , with and (see Figure 5). Take . Then, , made up of , which is the only EFM of , is also the only EFM of . However any with , made up of , is an EFM of and is not an EFM of . Note that the way the constraint is decomposed matters. With the decomposition with and , the result for is identical to that for , but .
Figure 5.
Simple network of Example 2. When we add the regulatory constraint , is the only EFM, and also the only one for the disjunct of the constraint, but no combination is an EFM, while it is an EFM for the disjunct .
This means that, if it is natural to study each (resp., ) separately in order to characterize the solution space and if EFVs are obtained in this way by collecting all the local EFVs, it is not the case for EFMs and, after collecting all local EFMs, we must only keep those with minimal support:
Proposition 8.
Given an arbitrary regulatory constraint with , where the disjuncts are taken two by two inconsistent, and the associated constrained flux cone subset (resp., constrained flux polyhedron subset ), its EFVs (resp., EFPs and EFVs) are obtained by collecting these for each flux tope (i.e., in each r-orthant ) and for each disjunct, i.e., for each (resp., ), and are nothing else than the EFVs (resp., EFPs and EFVs) of (resp., ) satisfying the constraint . This is not the case for EFMs. First, an EFM of is not necessarily an EFM of , but actually the biologically relevant minimality concept being sign-minimality and not support-minimality, we will consider EFMs for each flux tope . Second, an EFM of is not necessarily an EFM of . EFMs of are actually obtained by collecting EFMs of for all k and keeping only those with minimal support.
This being said, we will now focus on an arbitrary disjunct D of the form with . Thus, (resp., ) is the semi-open face of (resp., ), obtained from the face F defined by (i.e., , idem with ) by removing its facets for all . Let’s note EFMs EFMs those EFMs of that satisfy the constraint and EFMs EFMs the EFMs of the part of the solution space in . Obviously, EFMs EFMs . If , and EFMs EFMs , so we will consider the case . In this case, and contrary to what happens for thermodynamic and kinetic (as described in Proposition 5) constraints, there is generally no longer identity between EFMs and EFMs [42].
Example 3.
Consider the network of Example 1 (thus ) and let and be the EFM of made up of and . Then, for any , , positive conformal combination of the two EFMs and of , has support and belongs to (see Figure 6).
Figure 6.
Simple network of Example 3. and are EFMs. When we add the regulatory constraint , any positive combination of and is now an EFM, that is not obtained as an original EFM satisfying the constraint.
EFMs correspond to the faces of dimension one (edges or extreme rays), if any, of the semi-open face , i.e., the edges of the face F that have not been removed, which means that they are not included in any hyperplane of equation , for a certain , or equivalently that their supports contain I. Now, consider a face of F, of dimension at least two, such that but no facet of has its interior included in (i.e., any facet of is included in an hyperplane of equation , for a certain ), if any. Note that several such may exist, but none can be included in another one, i.e., they are not comparable for inclusion in the lattice of the faces of F. Then, all vectors of have the same minimal support, i.e., EFMs EFMs . If (with ) are representatives of the extreme vectors of , we have (which is actually the same as as we are in orthant ) and thus and the common minimal support of all vectors of is . Conversely, if EFMs , let be the minimal face of F containing . If has dimension one (extreme ray), then and EFMs . If has dimension at least two, then no facet of has its interior included in , because if it were the case for one facet, then any vector of its interior would have its support strictly included in the support of , which would contradict the minimality of the latter. Finally, and all vectors of have the same support as and belong to EFMs EFMs . We have thus characterized both EFMs and EFMs EFMs . We stipulate now, for the latter one, the decomposition of its vectors into EFMs EFMs , in order to compute their supports, which is generally the only useful information (the precise decomposition into fluxes not often being very relevant). Therefore, we consider a face of F, of dimension at least two, such that but no facet of has its interior included in and a minimal set of vectors in EFMs such that . Note that, for any k, (and, as we have seen, ), because if for a certain we had , then would belong to all hyperplanes of equation for , thus to all facets of , which is impossible for a non-null vector. Let us note and, for any k, , . Then, for any k, , because if for a certain we had , then the vectors of would verify the constraint and have their supports included in , thus equal to it as it is minimal for vectors in , which would contradict the minimality of . Finally, as by construction and for , we obtain that constitutes a partition of I and is a set of vectors in EFMs EFMs EFMs such that by construction and for any , otherwise, by the same reasoning as above, could be suppressed from the set , contradicting its minimality. Finally, we obtain that the support of any vector in EFMs EFMs can be written as , where , , are vectors in EFMs verifying and for all k and , where is a partition of I (note that we have the same result for EFMs by taking ). Now, a given is not necessarily minimal among the whole collection when we vary and . It is also possible that it strictly contains the support of a vector in EFMs . Therefore, to obtain exactly the supports of vectors in EFMs EFMs , we must only keep the minimal elements for inclusion w.r.t. the whole collection extended by EFMs .
Example 4.
Let us continue with the network of Examples 1 and 3, so and . We have and (see Figure 4). The only partition of with a size is given by: and . The only vector in whose support contains and not is and the only one whose support contains and not is . Thus, . Actually, we have from Example 3: (see Figure 6).
Consider now the following network comprising two internal metabolites and seven irreversible reactions, six of which are transfer reactions, , , , , , , (see Figure 7). Let and . Let made up of , made up of , made up of , made up of , made up of , made up of , made up of and made up of . We have and . The only partition of with a size is given by and . The vectors in whose support contains and not are and and those whose support contains and not are and . We have , , and . Each one of the four supports obtained is minimal in this collection, but the first three contain . Thus, . Actually, .
Figure 7.
Second network of Example 4. to are all the EFMs. When we add the regulatory constraint , only remains an EFM, with support , but new EFMs appear, made up of any positive combination of and , all with the same support .
Finally, consider another network comprising three internal metabolites and seven irreversible reactions, five of which are transfer reactions, , , , , , , . Let and (see Figure 8). Let made up of , made up of , made up of , made up of , made up of and made up of . We have and . There are four partitions of with a size . The partition gives nothing because there is no vector in whose support contains but neither nor . For the partition and , the vector in whose support contains and not is and those whose support contains and not are , and , providing , and . For the partition and , the vector in whose support contains and not is and those whose support contains and not are , and , providing , and . For the partition and , the vector in whose support contains and not is and those whose support contains and not are and , providing and . The minimal elements of this collection of supports are , and . Minimizing also w.r.t. gives thus . Actually, .
Figure 8.
Third network of Example 4. to are all the EFMs. When we add the regulatory constraint , only remains an EFM, with support , but new EFMs appear, made up of any positive combination of and , all with the same support .
We have thus proved the following result.
Proposition 9.
Given a Boolean constraint D as a conjunction of literals and an arbitrary closed r-orthant , let be the constrained flux cone subset for the regulatory constraint in . Let F be the face of the flux cone defined as , then is the semi-open flux cone obtained from F by removing its facets for all . Let be the EFMs of that satisfy and be the EFMs of . We get but, unlike the case of thermodynamic and kinetic (as in Proposition 5) constraints, there is generally no longer identity between and . correspond to the extreme rays (faces of dimension one) of , i.e., the edges of F whose support contains I. are the vectors of the ’s for all faces of F of dimension at least two, such that (i.e., is not included in any hyperplane with ) but no facet of has its interior included in (i.e., any facet of is included in a certain hyperplane with ). The supports of those vectors in are obtained as , where , , are vectors in verifying and for all k and , where is a partition of I (they are actually the minimal elements, for subset inclusion and including the supports of when checking minimality, obtained like this, for any partition of I, of size at least two, and any choice of as above).
This characterization of EFMs for flux cones in the presence of regulatory constraints does not hold as simply for flux polyhedra, because the added inhomogeneous linear constraints , given by , may not respect the structure of at all and may cut the interior of flux cones; we already mentioned that there is no direct relationship between EFMs and extreme or elementary fluxes for a flux polyhedron. Nevertheless, the ideas developed above for flux cones can be applied to flux polyhedra , where certain of their faces have to be removed due to the constraint D because they are included in certain hyperplanes {vi = 0}, in order to determine \. Moreover, in practice, ILC is generally only used to bound (below and/or above) fluxes, in which case each extreme ray of is still partly present in as for example an edge [α−,α+]e, defined by the two vertices α−e and α+e. The results above can then be transposed by using convex-conformal decomposition into vertices and conformal decomposition into elementary vectors to characterize \.
We are now interested in looking for vectors in the solution space that are (in some sense) non-decomposable while not being support-minimal, and in characterizing them, if any. We could think of EFVs (or EFPs) but, from the study above regarding EFMs, we note that, for a flux cone constrained by the regulatory constraint D, the vectors in EFMs EFMs are necessarily conformally decomposable, as they can be described as the interiors of polyhedral cones in of dimension at least two (for example, in Example 3 can be decomposed as ). More straightforwardly, we can notice that EFMs is equal to EFVs , i.e., precisely the conformally non-decomposable vectors. As already pointed out, what matters more than the precise decomposition into fluxes is the decomposition of the supports (in the decomposition above of , the supports of the components are unchanged, i.e., equal to ). It follows that a relevant concept for a solution vector is not to be conformally non-decomposable but, less strictly, to be (conformally) support-wise non-decomposable, in the sense that the vector cannot be conformally decomposed into two vectors of different (necessarily not greater) supports. Now, for all faces of F of dimension at least two such that , not covered by Proposition 9, i.e., owning at least one facet with interior included in , it so happens that all vectors of are actually support-wise decomposable, as each such vector can always be decomposed into a vector of of same support and into a vector of of smaller support. Therefore, in the decomposition, we do not authorize the same support for one of the component vectors. Thus the really relevant (less strict) concept is that the vector cannot be conformally decomposed into two vectors of smaller supports and we call a nonzero vector of a convex polyhedral cone C as (conformally) support-wise non-strictly-decomposable, if
The (conformally) support-wise non-strictly-decomposable vectors in C (resp., flux vectors in a flux cone ) will be noted swNSDVs (resp., swNSDFVs). Note that we could have defined support-wise non-strictly decomposability more generally without imposing a conformal decomposition, but, as already pointed out, this is not relevant for biological fluxes and we will only use this concept in a certain tope. It is obvious that, in such a tope (resp., flux tope), EMs, ExVs = EVs ⊆ swNSDVs (resp., ExVs = EFVs = EFMs = swNSDFVs, so that all four definitions coincide). We will introduce a similar definition with a convex combination for a polyhedron P (the previous definition and the associated relationships above being valid for its recession cone ). A vector of a polyhedron P is called support-wise convex(-conformally) non-strictly-decomposable, if
The support-wise convex(-conformally) non-strictly-decomposable vectors in P (resp., flux vectors in a flux polyhedron ) will be called support-wise non-strictly-decomposable points (resp., support-wise non-strictly-decomposable flux points) and noted swNSDPs (resp., swNSDFPs). It is obvious that, in any given tope (resp., flux tope), EMs, ExPs = EPs ⊆ swNSDPs (resp., EFMs, ExPs = EFPs ⊆ swNSDFPs).
With the notations of Proposition 9, let swNSDFVs swNSDFVs and swNSDFVs swNSDFVs . We have thus EFMs swNSDFVs EFMs swNSDFVs but, unlike the case of thermodynamic and kinetic (as in Proposition 5) constraints for flux cones, not only there is no longer identity between swNSDFVs and swNSDFVs (consequence of the non-identity between EFMs and EFMs ), but we will now see that there is generally no longer identity between EFMs and swNSDFVs .
Example 5.
Continuing with the network of Example 3, . More precisely, we have (by considering only a representative for each ray) (see Figure 9).
Figure 9.
Simple network of Examples 3 and 5. The EFMs are . When we add the regulatory constraint , only remains an EFM, with support , but new EFMs appear, made up of any positive combination of and , all with the same support . Moreover, new swNSDFVs also appear, which are not EFMs: any positive combination of and , all with the same support , and any positive combination of and , all with the same support .
Therefore, we extend the results of Proposition 9 by refining the structure in F of swNSDFVs . Let be representatives of the extreme vectors of F, thus . We get the following structure for regarding EFMs and swNSDFVs, given in the form of an algorithm.
- Let . Then EFMs . Note that R can vary from ∅ to M, thus EFMs from ∅ to EFMs . If , then EFMs EFMs swNSDFVs EFMs and the analysis is done (if has been chosen maximal in , this case corresponds to ). We consider the case here below.
- Let us consider successively all faces of F of dimension at least two, such that , i.e., is not included in any hyperplane with (the lattice of faces of F can be explored for example in a way such that a sub-face is visited before a super-face; once such an has been found, all faces of F containing it are also suitable). Let , with , be representatives of the extreme vectors of . Thus, and . For each of these , three exclusive cases can now be distinguished.
- If no facet of has its interior included in , i.e., any facet of is included in a certain hyperplane with (a necessary but insufficient condition is ), then EFMs EFMs and is a minimal support for vectors in .
- If exactly one facet of has its interior included in , i.e., it is the only facet not included in any hyperplane with , then swNSDFVs EFMs and is a non-strictly-decomposable non-minimal support for vectors in (non-strictly-decomposable support means that any vector which is a conical sum of the ’s having this support is support-wise non-strictly-decomposable, independently of the choice of the non-negative coefficients fixing the contribution of each in the distribution of the fluxes). This result follows immediately from the facts that one facet is not enough to decompose a certain vector in strictly in terms of supports and that the support of the vectors in the interior of the facet in question is strictly included in the support of the vectors in .
- If at least two facets of have their interior included in , i.e., these facets are not included in any hyperplane with , then let , with , be representatives of the extreme vectors of all these facets (note that we have then necessarily , i.e., ). Thus, the strict conical sum of the interiors of these facets, which is equal to , is not empty in (as there are at least two such facets) and is made up of the support-wise strictly-decomposable vectors of (by construction): swNSDFVs . Two subcases must therefore be distinguished.
- If , i.e., the strict conical sum of the interiors of these facets is equal to , then swNSDFVs and is a strictly-decomposable support for vectors in (which means that any vector which is a conical sum of the ’s having this support is support-wise strictly-decomposable, independently of the choice of the non-negative coefficients fixing the contribution of each in the distribution of the fluxes).
- If , then is split into two nonempty subsets: swNSDFVs and swNSDFV EFMs (note that is made up of the vectors of the conical decomposition of which on the ’s requires at least one with ). This means that part of the vectors of are support-wise non-strictly-decomposable and part are support-wise strictly-decomposable, while having the same non-minimal support (still equal to ). This proves that support-wise strict decomposability generally depends not only on the support, but also on the positive values of the fluxes and that, unlike the particular cases above, we cannot speak of a strictly-decomposable or non-strictly-decomposable support. Note that, in this subcase, the support-wise non-strictly-decomposable vectors of constitute the complementary, in the open cone , of the open sub-cone which is thus a finite disjoint union of semi-open cones, each of which is conically generated (with positive or non-negative coefficients according to faces that are present or not) by extreme vectors ’s with either or , thus in any case with , i.e., by extreme vectors EFMs EFMs .
We have finally proved the following result (keeping the notations of Proposition 9).
Proposition 10.
Let be the support-wise non-strictly-decomposable vectors of . We obtain but, unlike the case of thermodynamic and kinetic (as in Proposition 5) constraints, there is in general no longer identity between and . Consider all faces of F of dimension at least two, such that (i.e., is not included in any hyperplane with ), and let . Result of Proposition 9 can be stated as . Now, we have . Note that the ’s considered here necessarily own at least one facet that is not included in any hyperplane with . There are actually two cases: For those ’s which own exactly one such facet (thus ), we obtain and the common support of vectors in is thus a non-strictly-decomposable (independently of the choice of the distribution of the fluxes) non-minimal support for vectors in . For those ’s which own at least two such facets , we obtain and , thus part of the vectors of (consisting of a finite disjoint union of semi-open cones) are support-wise non-strictly-decomposable and part (consisting of an open cone) are support-wise strictly-decomposable (depending on the choice of the distribution of the fluxes), while having the same non-minimal support.
Example 6.
Let us continue with the network of Examples 3 and 5 (see Figure 9). is a pointed cone of dimension 3 included in the positive orthant of with axes in this order. We have . has 3 extreme rays (EFMs seen above) and 3 facets (see Figure 10). When we add the Boolean constraint , then and the solution space is the semi-open cone , i.e., F without its two edges corresponding to and . The only edge remaining in provides with support . There are three faces of F of dimension two with their interiors included in : , and . is the only one to have no facet with its interior included in , i.e., such that , thus . This means that the new EFMs that appear are all the positive combinations of and , all with the same minimal support . Regarding and , they both have as their only facet with interior included in , thus and and are both included in . and are thus non-strictly-decomposable (independently of the respective values of the fluxes in and or in and , respectively) non-minimal supports. The last face of F with its interior included in is F itself, of dimension three, with . As all the facets of F have their interior included in , we obtain . Thus, the pathways with nonzero fluxes in all five reactions, i.e., with support , are exactly the support-wise strictly-decomposable ones.
Figure 10.
Simple network of Examples 3, 5 and 6. The flux cone of dimension 3 has 3 extreme rays, which are EFMs , and 3 facets (in color in the picture that represents the projection of on the subspace spanned by , the fluxes in , respectively). When we add the regulatory constraint , the solution space becomes without its two edges corresponding to and (in gray), thus a disjoint union of the following open cones: extreme ray , the interiors of the red, blue and green facets, and the interior of . Thus, only remains an EFM, with support , but there are new EFMs, namely all positive combinations of and (the interior of the red facet), all with the same minimal support . Moreover, there are new swNSDFVs with non-minimal support: all positive combinations of and (the interior of the blue facet), all with the same non-strictly-decomposable support , and all positive combinations of and (the interior of the green facet), all with the same non-strictly-decomposable support . All vectors in the interior of , which have as a support, are support-wise strictly-decomposable.
Example 7.
Let us consider the simple network comprising one reaction , where A and B are the two internal metabolites, and four transfer reactions , , , , and assume the five reactions irreversible (see Figure 11). is a pointed cone of dimension 3 included in the positive orthant of with axes in this order. We have: . has 4 facets and 4 extreme rays. Representatives of these extreme rays () are , , and , defined by their supports: , , and , respectively.
Figure 11.
Simple network of Example 7. Flux cone of dimension 3 has 4 extreme rays, which are EFMs , and 4 facets (in color in the picture that represents the projection of on the subspace spanned by , the fluxes in , respectively). When we add the regulatory constraint , the solution space becomes without its two facets in yellow and orange, thus a disjoint union of the following open cones: the extreme ray , the interiors of the blue and purple facets, and the interior of . Thus, only remains an EFM, with support and there are no new EFMs, but the following new swNSDFVs occur. First, any positive combination of and (the interior of the blue facet), all with the same support , and any positive combination of and (the interior of the purple facet), all with the same support , these supports being thus non-strictly-decomposable (independently of the positive values of the fluxes) though not minimal. Second, the complementary in the interior of of the open sub-cone generated by the positive combinations of (i.e., the interior of the cone with blue, purple and grey facets), that is the original network with nonzero fluxes in all five reactions, thus with support , in the case where input flux y in is not smaller than output flux z in , obtained as a positive combination of (the interior of the cone with yellow, orange and grey facets) if , or of (the interior of the gray cone) if . On the other hand, when y is smaller than z, the global pathway can be decomposed into two swNSDFVs of non-strictly-decomposable supports and in an infinite number of ways depending on a parameter (decomposition of a vector in the interior of the cone with blue, purple and gray facets into two vectors in the interiors of the blue and purple facets respectively). This shows that in general support-wise strict-decomposability does not depend only on the support of the pathway but on the distribution of the fluxes.
Let us consider the Boolean constraint . Then, and the solution space is the semi-open cone , i.e., F without its two facets and .
The only EFM still present in is , thus with support .
There are two faces of F of dimension two with their interiors included in : and , both with as their only facet with interior included in . Thus, , , EFMs and . and are thus non-strictly-decomposable (independently of the respective values of the fluxes in and or in and , respectively) non-minimal supports.
The last face of F with its interior included in is F itself, of dimension three, with defined in by . F has exactly two facets with their interiors included in , namely, and and thus . The support-wise strictly-decomposable vectors constitute the open sub-cone of defined by : . They are the pathways of support such that the flux in is smaller than the flux in ; actually any vector with can be decomposed as , with arbitrary k, , i.e., into two support-wise non-strictly-decomposable vectors respectively in with support and in with support .
We thus have . The support-wise non-strictly-decomposable vectors constitute the semi-open sub-cone of defined by : . They are the pathways of support such that the flux in is not smaller than the flux in . Any vector with is equal to and belongs to if and to if .
However, in the case of a general constraint , take care that, if the support-wise strictly-decomposable vectors of each are support-wise strictly-decomposable in , it is not true for support-wise non-strictly-decomposable vectors and some of them for may be decomposable in .
This characterization of support-wise non-strictly-decomposable vectors in the presence of regulatory constraints does not extend directly from flux cones to flux polyhedra. The reason is that, due to inhomogeneous linear constraints , certain faces of are not defined by equalities of the form and thus play no role in the definition of the support of the vectors they contain. Consequently, the basis of the reasoning above, namely, that the support of vectors of the interior of any face is larger than the supports of vectors of the interior of any facet of this face, does not hold. Nevertheless, the ideas developed for flux cones for dealing with faces included in a certain hyperplane with , and thus removed from the solution space due to the Boolean constraint considered, can be applied. However, it is necessary at each step, when considering an arbitrary face, to distinguish its facets resulting from , not involved in the definition of the support, its facets included in a certain hyperplane with that are still present and contribute to the definition of the support, and its facets included in a certain hyperplane with that are removed.
2.2.4. Case of Several Types of Constraints
In general, when analyzing a metabolic pathway, all known biological constraints will have to be taken into account together, typically kinetic constraints and regulatory constraints. For two such constraints (say, a kinetic constraint , equivalent to , or in the absence of bounds on enzyme concentrations, equivalent to ) and (say a regulatory constraint ), the solution space, in the case of a flux cone (the reasoning would be similar for a flux polyhedron ) is given by . Now, from Propositions 2 and 5, is a finite union of flux cones , which are certain particular faces of each flux tope of and the constraint can be applied to each one as to an original flux cone: . From Proposition 7, each is a disjoint union of particular open faces of . In all, the solution space is a disjoint union of particular open faces of the flux topes of . From propositions above and Proposition 8, we can also conclude that the EFVs of are exactly the EFVs of that satisfy both constraints and .
Proposition 11.
The space of flux vectors in (resp., ) satisfying both the kinetic constraint (or in the absence of bounds on enzyme concentrations) and the regulatory constraint is a finite disjoint union of open polyhedral cones (resp., open polyhedra) which are certain open faces of the flux topes of (resp., ). The elementary flux vectors (resp., elementary flux points and vectors) are those of (resp., ) that satisfy both constraints.
Note that Proposition 10 applies also, by starting from each instead of each flux tope of , to determine elementary flux modes and support-wise non-strictly-decomposable vectors.
3. General Case of Sign-Compatible Constraints
We are interested in determining, for general biological constraints , what can be said about the structure of the solution spaces or . More precisely, in identifying certain general features regarding constraints (some of which are present in particular in biological constraints Equations (42), (47) and (50) we have considered so far), allowing us to clarify the mathematical and geometrical structure of the solution space, to determine the EFs (conformal non-decomposable fluxes) or EFMs (support-minimal fluxes) for this space and whether they characterize it and lastly how to compute them efficiently, in particular by checking the integration of this computation into the DD method. We will focus on deducing pertinent geometrical characteristics of the solution space only from general properties of the compatibility of the constraints with vector signs (i.e., with vector supports in each closed r-orthant or flux tope) and we will see that part of the results obtained above for thermodynamic, kinetic and regulatory constraints actually depends only on these general global properties.
3.1. Sign-Invariant Constraints
A constraint (resp., ), is said to be sign-invariant if it depends only on , i.e., on the signs of the ’s but not on their values (in the formulas below, free variables are assumed universally quantified):
It follows from these definitions that, if the ’s are sign-invariant for all , then is sign-invariant. Note that the property for a constraint to be support-invariant (i.e., to depend only on ) is stronger than the property to be sign-invariant. Actually, in any flux tope (or in any given closed orthant), having the same sign for two vectors is equivalent to having the same support and thus being sign-invariant for a constraint means being support-invariant in each flux tope independently.
Example 8.
It follows then from definitions Equations (41), (42) or (43) that the thermodynamic constraint or is sign-invariant. Thus this is also the case for and Equation (60) and for and Equation (61).
On the other hand, the kinetic constraint Equation (47) is not sign-invariant, and this is also the case of Equation (78), because is bounded below and above (e.g., for Michaelis–Menten kinetics, ) and thus, for a given nonzero enzyme concentration , there is no metabolite concentrations vector allowing an arbitrary flux value in reaction i (as soon as the value of is outside the bounds).
The kinetic constraint Equation (80) is however sign-invariant. This follows from the linear dependency of on . Actually, if is satisfied (for a certain ) and , then, for all , and have the same sign; thus, it is also the case for and . Therefore, by taking when , and when , is satisfied (for ). This sign-invariant property thus holds also for Equation (76) and for Equation (77) if the only bounds are on metabolite concentrations. However, it does not hold for (81) and for (77) in the presence of bounds on enzyme concentrations.
Finally, by definition, the regulatory constraint Equation (50) depends only on , so is support-invariant and thus sign-invariant.
Lemma 4.
All thermodynamic constraints are sign-invariant. Only kinetic constraints and are sign-invariant, as well as with bounds only on metabolite concentrations (but not on enzyme concentrations, thus in particular without enzymatic resource constraint). The regulatory constraints are support-invariant, thus sign-invariant.
The structure of the solution space of a sign-invariant constraint follows directly from its definition. Actually, if satisfies a given sign-invariant constraint (resp., ), then is included in and thus is a disjoint union of open orthants. This result can be compared to Lemma 2. More precisely, if we consider a support-invariant constraint , the same reasoning applies and gives , where denotes the set of vectors of support , where we code a support as a binary vector (1 coding support membership and 0 non-membership). A support-invariant constraint is thus equivalent (in extension) to a family of such support sets and, as there are possible binary vectors, there are thus different support-invariant constraints. Now, a given is equivalent to the Boolean constraint and thus an arbitrary support-invariant constraint is equivalent to the Boolean constraint , thus to the regulatory constraint . Thus support-invariant constraints identify with regulatory constraints and all properties we have demonstrated for the latter apply to the first. For example, by associating with any sign vector its support (i.e., the support of any vector having the given sign), given by if or − and if , we deduce that, for any binary vector , there are sign vectors with support , given by or − if and 0 else, and we obtain and , i.e., that is a disjoint union of open orthants, which is precisely the statement of Lemma 2. Note that a support-invariant constraint is entirely defined by its restriction to an arbitrary closed r-orthant , i.e., for an arbitrary full support sign vector, as can be reconstituted from . Moreover, as to be sign-invariant and to be support-invariant coincide in any closed r-orthant, a sign-invariant constraint is nothing but a constraint whose restriction to any closed r-orthant is support-invariant, i.e., which coincides in any closed r-orthant with a well-defined regulatory constraint (but such regulatory constraints differ in general from one r-orthant to another, while obviously coinciding on their intersection). A sign-invariant constraint is equivalent to a family of open orthants and, as there are possible sign vectors, there are thus different sign-invariant constraints. Applying the merging method described in the proof of Lemma 2, these open orthants can be grouped together in order to obtain a family of semi-open orthants, without any possible further merging between any two of them.
Proposition 12.
Support-invariant constraints coincide with Boolean constraints, i.e., regulatory constraints. They are completely characterized by their restriction to any closed r-orthant and number . Sign-invariant constraints coincide with constraints whose restriction to any closed r-orthant is given by a regulatory constraint (with identity of such regulatory constraints on the intersections of any two of these orthants). They number . The set of vectors in satisfying the sign-invariant constraint (resp., ) is a disjoint union of open orthants, which can be grouped together according to Lemma 2 to provide a disjoint union of semi-open orthants without any possible further merging between any two of them.
An important consequence is that, if we reason for each closed r-orthant separately, i.e., flux tope by flux tope in the solution space, then all results demonstrated in Section 2.2.3 for regulatory constraints apply to sign-invariant constraints. With the previous definitions of open or semi-open polyhedral cones and open or semi-open polyhedra and, more generally, of open or semi-open faces of polyhedral cones or polyhedra, we get the following result.
Theorem 1.
Let (resp., ) be a sign-invariant constraint and (resp., ) be the associated constrained flux cone subset (resp., the associated constrained flux polyhedron subset). Then, (resp., ) is a finite disjoint union of open polyhedral cones (resp., open polyhedra), which are certain open faces of the flux topes (resp., ) for all maximal sign vectors in (resp., ). They can be grouped together according to Lemma 2 to provide a disjoint union of semi-open faces of the ’s (resp., ’s) without any possible further merging between any two of them. Elementary fluxes are obtained by collecting those for each flux tope (which is not the case for elementary flux modes where, after collecting them, only those with minimal support have to be kept) and are nothing but the elementary fluxes of (resp., ) that satisfy the constraint (which again is not the case for elementary flux modes):
Proof of Theorem 1.
The disjoint decomposition of the solution space into open faces of the flux topes (resp., ) results from Proposition 12 or directly from the fact that all vectors of the strict conical hull and of the strict convex hull of given vectors in a flux tope, thus in a closed orthant (so the sum is conformal), have the same support , thus the same sign, and from the fact that an open face of a polyhedron is the Minkowski sum of the strict convex hull of its vertices and the strict conical hull of its extreme vectors. Thus, an open face of a flux tope (resp., ) either does not intersect the solution space or is completely included in it. We have seen (55) that any elementary flux or any elementary flux mode of (resp., ) that satisfies the constraint is an elementary flux or an elementary flux mode of the solution space (the conditions of validity of (55) for elementary fluxes are satisfied from the structure of the solution space). These elementary fluxes or elementary flux modes actually identify with the extreme vectors of the ’s (resp., extreme points of the ’s and extreme vectors of the ’s) that satisfy the constraint, i.e., whose sign satisfies the constraint. Reciprocally, any vector of the solution space that is not in this case necessarily belongs to an open face of a certain (or ) of dimension at least two (resp., of a certain of dimension at least one) and is thus conformally (resp., convex-conformally) decomposable in this open face and consequently cannot be an elementary vector (resp., elementary point) of the solution space (obviously, the decomposition involves two vectors of same support, thus nothing can be deduced regarding elementary flux modes). This gives the result for elementary fluxes. □
Theorem 1 applies in particular to all thermodynamic constraints, regulatory constraints and those kinetic constraints described in Lemma 4. Especially, we directly obtain the results of Propositions 7 and 8 for regulatory constraints.
Note that our only knowledge of the EFVs for (resp., the EFPs and EFVs for ) is really a long way from characterizing the solution space (resp., ): actually they are just the ExVs of each tope (resp., the ExPs and ExVs of each ), i.e., the only one-dimension open polyhedral cones, i.e., edges (resp., zero-dimension polyhedra, i.e., vertices, and edges for the recession cones) among all the open cones (resp., open polyhedra) of any dimension that constitute (resp., ).
As in each closed r-orthant, i.e., in each flux tope, a sign-invariant constraint identifies with a regulatory constraint, Propositions 9 and 10 demonstrated for regulatory constraints apply thus to sign-invariant constraints. Nevertheless, they are stated for constraints that are conjunctions of literals and, even if a decomposition into such disjuncts is always possible from the identity above, it is not necessarily natural for an arbitrary sign-invariant constraint and a global characterization for each flux tope is preferable, in particular for what concerns support-wise non-strictly-decomposable vectors. The difference is that one has to deal for with an arbitrary family of open faces of instead of a single semi-open face of , obtained specifically as a face of without certain of its facets, for .
Let us adopt notations similar to those used in Propositions 9 and 10, i.e., let EFMs EFMs be the elementary flux modes of that satisfy (i.e., the elementary flux vectors of ), EFMs EFMs be the elementary flux modes of and swNSDFVs swNSDFVs be the support-wise non-strictly-decomposable vectors of . We have thus EFMs EFMs swNSDFVs . The proof of Proposition 9 adapts straightforwardly: EFMs correspond to the open faces of dimension one (extreme rays) of in and EFMs EFMs are made up of all the open faces in where F is a face of dimension at least two of but not any proper (i.e., with positive dimension, less than the dimension of F) face of F is such that is in . The result is based on the properties that all vectors of an open face in have the same support, and the common support of vectors of , where is a face of F, is strictly included in that of vectors of . In the same way, the proof of Proposition 10 is easily adapted for what concerns swNSDFVs EFMs . For this we consider successively all the open faces in where F is a face of dimension at least two of which owns at least one proper face such that is in . For each such given , let be the family of all such . Let be representatives of the extreme vectors of F and , with , be representatives of the extreme vectors of all these . Thus, and and we obtain swNSDFVs , as by construction those vectors of which belong to are precisely those in which are support-wise strictly-decomposable in , the decomposition being achieved along vectors in the ’s, whose support is strictly included in the common support of vectors in . Therefore, three different exclusive cases can be distinguished for :
- , i.e., the ’s are all included in a same facet of F, that is in a same hyperplane for a certain coordinate i of the affine span of F, which is still equivalent to (this is for example the case if ). Then, swNSDFVs EFMs is entirely made up of support-wise non-strictly-decomposable vectors and is a non-strictly-decomposable non-minimal support for vectors in .
- , i.e., . Then, swNSDFVs is entirely made up of support-wise strictly-decomposable vectors and is a strictly-decomposable support for vectors in .
- , i.e., and not all ’s are included in a same facet of F, which means that, for all i coordinates of the affine span of F, there exists such that is not included in the hyperplane , or equivalently, there exists such that , which is still equivalent to . Then, is split into two nonempty subsets: swNSDFVs , made up of support-wise strictly-decomposable vectors, and swNSDFVs EFMs , made up of support-wise non-strictly-decomposable vectors (note that is made up of those vectors of the conical decomposition of which on the ’s requires at least one with ). As all vectors of have the same non-minimal support , this proves that support-wise strict decomposability does not generally only depend on the support but also on the positive values of the fluxes and that, unlike the two particular cases above, we cannot speak of a strictly-decomposable or non-strictly-decomposable support. Note that the support-wise non-strictly-decomposable vectors of are obtained as the complementary, in this open cone, of the open sub-cone , which is thus a disjoint union of semi-open cones (its connected components), each of which being conically generated both by certain extreme vectors with (that necessarily do not belong to EFMs ), with non-negative coefficients, and by certain extreme vectors with (that may belong or not to EFMs ), with positive coefficients (in order to keep the concerned facets common with ).
We have finally proved the following result (while keeping the notations of Theorem 1).
Theorem 2.
For a sign-invariant constraint, a flux tope and the associated subset of the solution space (disjoint union of open faces of from Theorem 1), let be the elementary flux modes of that satisfy (equal to ), be the elementary flux modes of , and be the support-wise non-strictly-decomposable vectors of , with . correspond to the open faces of dimension one (extreme rays) of belonging to . Consider now successively all the open faces in where F is a face of dimension at least two of , and for each such given F, let be representatives of the extreme vectors of F and , with , be representatives of the extreme vectors of all proper (i.e., with positive dimension, less than the dimension of F) faces of F such that belongs to . Thus, and let . Consequently, if , then ; if and , then and is a non-strictly-decomposable non-minimal support for vectors in ; if , then and is a strictly-decomposable support for vectors in ; if and , then , a non-empty open cone, constitutes the support-wise strictly-decomposable vectors in and , a non-empty finite disjoint union of semi-open cones, constitutes the support-wise non-strictly-decomposable vectors in , all having the same non-minimal support (decomposability depending in this case not only on the support but also on the distribution of the fluxes). We finally obtain and by collecting these vectors for each .
Note that even the knowledge of swNSDFVs is not enough to completely reconstruct the tope of the solution space.
In order to deal with the usual enzymatic resource constraint (more generally capacity constraints) in the kinetic constraints, we generalize the sign-invariance criterion. A constraint (resp., ), is said to be contracting-sign-invariant if, when satisfied by one vector, it is satisfied by any vector having the same sign which is not greater (on each component), i.e., that belongs to the open rectangle parallelepiped defined by the null vector and the given vector:
Obviously a sign-invariant constraint is contracting-sign-invariant and, if the ’s are contracting-sign-invariant for all , then is contracting-sign-invariant.
Example 9.
The kinetic constraints Equation (81) and Equation (77) are contracting-sign-invariant in the absence of positive lower bounds on enzyme concentrations (i.e., when ). Actually, if is satisfied (for a certain verifying and ) and with for all i, we saw that, by taking when , and when , then is satisfied (for ). Now, as , we have and thus and , so is satisfied.
Lemma 5.
The kinetic constraints and are contracting-sign-invariant in the absence of positive lower bounds on enzyme concentrations (i.e., when ).
We will call -star domain a subset of which has the property that the whole open segment joining to any element of is included in : . Any cone is a -star domain.
Theorem 3.
If (or ) is contracting-sign-invariant, then is a -star domain and there exists a neighborhood of for a certain such that is a finite disjoint union of N-truncated open polyhedral cones, which are the intersection with N of open faces of the flux topes . In addition, results (88) regarding and hold in N, i.e., for sufficiently small fluxes. The same holds for with the flux topes if is an interior point of , and in this case results (89) regarding and hold in N.
Proof of Theorem 3.
The property of being a -star domain is a direct consequence of the definition: if a contracting-sign-invariant constraint is satisfied by a vector , it is satisfied by for all . From the proof of Theorem 1, it follows that if a nonzero vector verifies a given contracting-sign-invariant constraint, then any vector of the minimal open face of containing (thus having the same sign as ) and belonging to with (contracting condition) also verifies the constraint. The result is obtained by taking for the minimum of the ’s on all open faces of the ’s (whose number is smaller than the number of possible signs, i.e., ). The proof is still valid for but the result is meaningful only if N is included in , i.e., if is an interior point of , which means that the inhomogeneous linear constraints defining , given by , verify . □
Theorem 3 tells us that the result of Theorem 1 for sign-invariant constraints regarding the geometrical structure of the solution space applies identically for contracting-sign-invariant constraints locally in a neighborhood of , i.e., when considering only pathways with sufficiently small amounts of fluxes. This applies in particular to those kinetic constraints described in Lemma 5.
3.2. Sign-Monotone Constraints
We now consider a property of compatibility of a constraint with signs that is stronger than sign-invariance. A constraint (resp., ), is said to be sign-monotone if, when satisfied by a vector, it is satisfied by any other vector with a smaller or equal sign (for the partial order on signs):
It follows from these definitions that, if the ’s are sign-monotone for all , then is sign-monotone and that any sign-monotone constraint is sign-invariant. Note that, in any given closed orthant, thus in any flux tope, is equivalent to and thus being sign-monotone for a constraint means being support-monotone in each flux tope. Note also that if a sign-monotone constraint has a solution, then the null vector is a solution.
Example 10.
It then follows from definitions (41), (42) or (43) that the thermodynamic constraint or is sign-monotone. This is thus also the case for and Equation (60) and for and Equation (61).
The argument given previously to establish that the kinetic constraint Equation (80) is sign-invariant proves in fact that it is sign-monotone. This sign-monotone property holds thus also for Equation (76) and for Equation (77) in the absence of bounds on enzyme concentrations.
The regulatory constraint Equation (50) is not generally sign-monotone: consider for example reduced to a positive literal. Actually, is sign-monotone if and only if in DNF contains no positive literal, which is a very special case, requiring only certain fluxes to be zero but unable to express that a given flux is nonzero (in this particular case, it is support-monotone, which is stronger than sign-monotone).
Lemma 6.
All thermodynamic constraints are sign-monotone. Only those kinetic constraints and are sign-monotone, as well as in the absence of bounds on enzyme concentrations. The regulatory constraint is not sign-monotone, except if in DNF contains only negative literals (i.e., assigns only zero fluxes, but no nonzero fluxes), in which case it is support-monotone.
The structure of the solution space of a sign-monotone constraint follows directly from its definition.
Lemma 7.
The set of vectors in satisfying the sign-monotone constraint (resp., ) is a union of closed orthants.
Actually, if satisfies , then is included in and thus . Obviously, we can keep only those ’s for which is maximal, in order to avoid any inclusion between the closed orthants. This result can be compared to Lemmas 1, 3 and first part of Proposition 5, which are obviously more precise but it shows that the mathematical structure of the solution space of these thermodynamic and kinetic constraints is mainly the only consequence of their sign-monotonicity.
Theorem 4.
Consider indifferently a sign-monotone constraint or (e.g., if is sign-monotone for any ), and let us name it and the associated constrained flux cone subset (resp., the associated constrained flux polyhedron subset ). Then, (resp., ) is a finite union of polyhedral cones (resp., polyhedra), which are faces of the flux topes (resp., ) for all maximal sign vectors in (resp., ) and we obtain
Proof of Theorem 4.
The (non-disjoint) decomposition of the solution space into faces of the flux topes (resp., ) follows from Lemma 7 or directly from the fact that all vectors of the conical hull and of the convex hull of given vectors in a flux tope, thus in a closed orthant (so the sum is conformal), have their supports included in , which is the support of any vector in or , thus their signs being less than or equal to the sign of , and from the fact that a face of a polyhedron is the Minkowski sum of the convex hull of its vertices and the conical hull of its extreme vectors. Thus, if a vector of a flux tope (resp., ) belongs to the solution space, the minimal face of (resp., ) containing is completely included in it. As a sign-monotone constraint is sign-invariant, the results of Theorem 1 apply and provide formulas for EFVs and EFPs. Thus remains only the case of EFMs. Now, if an elementary flux mode of the solution space were not support-minimal in , a nonzero vector would exist in with . Moreover, we could choose such that is nonzero, belongs to and verifies and . From the sign-monotonicity property of the constraint, would also satisfy the constraint and would thus belong to the solution space, which would contradict the fact that is an elementary flux mode in this space. Therefore, is an elementary flux mode in and we get the result for EFMs. □
Of course, in the decomposition of or (resp., or ) as a union of certain faces of the ’s (resp., ’s), we can only keep those faces which are maximal. Theorem 4 applies in particular to all thermodynamic constraints, to those kinetic constraints and to those very few regulatory constraints described in Lemma 6. In particular, we directly obtain (except of course the reference to ) Proposition 2 for thermodynamic constraints and , as well as the structure of the solution space as a union of flux cones (resp., flux polyhedra) for kinetic constraints and (in the absence of bounds on enzyme concentrations) and the characterization of elementary fluxes and elementary flux modes given by Proposition 5, which proves that these results depend only on the fact that these constraints are sign-monotone.
Note that the knowledge of EFVs, equal to EFMs, of (resp., and of EFPs of ) is not good enough to reconstruct the structure of the solution space (resp., ) as a union of polyhedral cones (resp., polyhedra). For this, it is necessary to know the (non-disjoint) decomposition of these EFVs (resp., and EFPs) as extreme vectors (resp., and extreme points) of the faces of the flux topes (resp., ) that constitute the solution space. The ’s are exactly the maximal subsets of EFVs (resp., and EFPs) whose conformal conical hull (resp., conformal convex hull) is entirely contained in the solution space: (resp., and ). By analogy with LTCS, we will call each such a largest -consistent set of EFVs or EFMs (resp., and of EFPs), noted LCS .
In order to deal with enzymatic capacity constraints, we generalize the sign-monotonicity criterion exactly in the same way we generalized the sign-invariance criterion. A constraint, (resp., ), is said to be contracting-sign-monotone if, when satisfied by one vector, it is satisfied by any vector with a smaller or equal sign which is not greater (on each component) than the vector itself, i.e., by any vector that belongs to the rectangle parallelepiped defined by the null vector and the given vector:
Obviously a sign-monotone constraint is contracting-sign-monotone and, if the ’s are contracting-sign-monotone for all , then is contracting-sign-monotone. Furthermore, any contracting-sign-monotone constraint is contracting-sign-invariant.
Example 11.
Lemma 8.
The kinetic constraints and are contracting-sign-monotone in the absence of positive lower bounds on enzyme concentrations (i.e., when ).
Theorem 5.
If (or ) is contracting-sign-monotone, then is a -star domain and there exists a neighborhood of for a certain such that is a finite union of N-truncated polyhedral cones, which are the intersection with N of faces of the flux topes . In addition, results (94) regarding and hold in N, i.e., for sufficiently small fluxes. The same holds for with flux topes if is an interior point of , and in this case results (95) regarding and hold in N.
Proof of Theorem 5.
The proof becomes identical to that of Theorem 3 by using the proof of Theorem 4. □
Theorem 5 tells us that the result of Theorem 4 for sign-monotone constraints regarding the geometrical structure of the solution space and the determination of elementary fluxes and elementary flux modes applies identically for contracting-sign-monotone constraints locally in a neighborhood of , i.e., when considering only pathways with sufficiently small amounts of fluxes. It applies in particular to those kinetic constraints described in Lemma 8, providing the result for the structure of and quoted in the last part of Proposition 5.
Finally, Proposition 11 extends straightforwardly to the case where we have to deal with both one sign-monotone constraint and one sign-invariant constraint. As a sign-monotone constraint is sign-invariant the conjunction of the two constraints is sign-invariant and Theorem 1 applies.
Theorem 6.
The space of flux vectors in (resp., ) satisfying both a sign-monotone constraint and a sign-invariant constraint is a finite disjoint union of open polyhedral cones (resp., open polyhedra) which are certain open faces of the flux topes of (resp., ). The elementary flux vectors (resp., elementary flux points and vectors) are those of (resp., ) that satisfy both constraints.
Note that, in the case of , Theorem 2 applies also to determine elementary flux modes and support-wise non-strictly-decomposable vectors.
3.3. Consequences on the Computation of Elementary Fluxes
From the results above, we will see that the computation of elementary fluxes in the presence of a sign-monotone constraint can be efficiently performed with the Double Description (DD) method.
First let us briefly remember the principle of the DD method [11]. This is an algorithm that takes as input the implicit description of a pointed convex polyhedral cone C as its representation matrix, i.e., a finite set of homogeneous linear inequalities defining C as the intersection of the corresponding vector half-spaces, and produces as output the explicit description of C as a (minimal) generating matrix, i.e., the set of extreme rays of C. More generally, it can deal in the same way with a pointed convex polyhedron P producing, from a finite set of linear inequalities defining P as the intersection of the corresponding affine half-spaces, the explicit description of P as two generating matrices providing respectively the vertices of P and the extreme rays of . As the first are obtained as the extreme rays of the pointed cone obtained from P by adding one dimension to the space and considering the conical hull of P from an origin of this extended vector space, it is enough to explain how the DD method works on pointed convex polyhedral cones. The DD method is an incremental algorithm that processes one by one each of the n homogeneous linear inequalities defining C. At each step , it builds the intermediate extreme rays of the intermediate current cone defined by the i first linear inequalities from the knowledge of the extreme rays of built at previous steps and of the ith linear inequality. At the end, for , and the extreme rays are thus obtained. The ith linear inequality defines a half-space with a vector hyperplane as a frontier. All the extreme rays of that are on the right side of , i.e., belong to , are extreme rays of and thus kept; all those that are on the wrong side do not belong to and will be discarded; the new extreme rays at step i appear from the intersection of with and are obtained as the intersection with of the 2-faces of defined by two adjacent extreme rays, one on each side of . We therefore just need to keep and update at each step this list of pairs of adjacent extreme rays and, when the ith linear inequality is processed, each such pair with elements on both sides of determines with an extreme ray for . The key fact is that each new extreme ray built at step i is the conical sum of two extreme rays existing at step and, as we will proceed by flux tope, i.e., in a given closed r-orthant, this sum is conformal and the support of this new extreme ray is the union of the supports of two previously existing extreme rays. Moreover, similarly, this new extreme ray, if involved in the next steps as a member of a relevant pair of adjacent rays, in order to build a new extreme ray, will have its support included in the support of the latter one. Finally, all we need to do is initialize . For a flux cone defined by Equation (6), what has been shown as the most efficient initialization is to start from a basis of the nullspace of (the authors of [43] proposed a method to compute EFMs directly as linear combinations of the vectors of a basis of this nullspace but it turned out to be less efficient than DD), and this basis (of size by assuming of full rank) can be chosen so that of the r linear inequalities are satisfied (we consider here the worst case corresponding to the highest number of such inequalities, i.e., , which happens in particular if each reversible reaction is split into two irreversible ones). There thus remain m linear inequalities to satisfy, i.e., there are steps in the DD algorithm.
Let us now consider a sign-monotone constraint . From Theorem 4, the EFVs (identical to EFMs) of the constrained flux cone subset are simply those of , i.e., of all its flux topes successively, that satisfy constraint (and the same is true for EFPs of ). Therefore, we just have to build the extreme rays of each by using the DD algorithm as presented above and filter those that satisfy . Obviously a filtering at the end, once all extreme rays built, will not at all improve the efficiency. However, by exploiting the key fact that any newly built extreme ray at a certain step, if used in other next steps to build other new extreme rays will have necessarily its support included in each support of the latter ones, and exploiting the sign-monotonicity of , we can conclude that, each time a newly built extreme ray does not satisfy the constraint , it can be immediately discarded because not only it has no use at the step of its discovery but also no use at the future steps as all extreme rays, in the building of which it could be involved, would have a larger support and thus would thus not satisfy either. Thus, the computation of elementary fluxes or elementary flux modes in the presence of a sign-monotone constraint can be fully integrated into the incremental DD algorithm, the filtering of the extreme rays satisfying the constraint being achieved at each step when they are newly created. The extra-cost is having to check all intermediate extreme rays built for satisfiability by (for most constraints, this can be practically instantaneous) and the gain is that all intermediate extreme rays for which this checking gives a negative answer are discarded.
Proposition 13.
Given a sign-monotone constraint , the (or ) of the constrained flux cone subset are obtained by the DD algorithm as the extreme rays of each flux tope by filtering out at each step all those newly built extreme rays that do not satisfy .
This result applies in particular to thermodynamic constraints and [44,45] and to kinetic constraints and (in the absence of bounds on enzyme concentrations). Actually, for (or which is identical from Proposition 5), Proposition 3 demonstrated that the criterion for an extreme ray to be thermodynamically satisfiable is to belong to a given fixed open half-space delimited by a vector hyperplane. Checking this satisfiability thus boils down to computing the scalar product of the extreme ray with a fixed vector (normal to the hyperplane and oriented towards the half-space) and verifying it is positive. In this case, as it has been shown, it is much simpler to integrate this thermodynamic constraint as one supplementary th homogeneous linear inequality and we can choose for example that it is processed first by the DD algorithm. However, as it has been already pointed out, providing the list of the extreme flux vectors of does not provide its structure in terms of a union of polyhedral cones. For this, we have to identify all the LCS ’s, i.e., the maximal subsets of those extreme flux vectors whose conformal conical hull is entirely contained in . For each flux tope , this amounts to computing all maximal upper bounds for sets of signs of those extreme rays built from this flux tope such that any vector of this tope with a sign equal to such an upper bound satisfies constraint , as the vectors of the strict conical hull of a subset of extreme rays have as a common sign the upper bound for the signs of these extreme rays. To each such upper bound identified is associated the LCS made up of all the extreme rays whose signs are smaller than, or equal to, this upper bound.
For sign-invariant constraints, and thus for regulatory constraints, no such incremental filtering is possible as a newly built intermediate extreme ray may have a sign that does not satisfy the constraint but may participate at some later stage in the construction of other new extreme rays (with necessarily larger supports) whose sign will satisfy the constraint. Thus, such an extreme ray cannot be discarded. In addition, we saw in any case there is no longer identity between those EFMs of that satisfy the constraint, the EFMs of and the support-wise non-strictly-decomposable vectors in and, as only the last two are biologically relevant, computing only the first ones would be of limited interest.
4. Conclusions
The analysis of metabolic networks in a steady state takes classically into account only stoichiometric and reactions irreversibility constraints. In this context, well-known pathways have been introduced, such as elementary flux modes EFMs, expressing the property of minimality, or elementary flux vectors EFVs, expressing the property of non-decomposability (which coincide in this standard framework), that are biologically relevant in their own and from which all pathways, whose structure is a convex polyhedral flux cone , can be reconstructed. However, first, the computation of these EFMs or EFVs is hampered by the combinatorial explosion of their number in genome-scale metabolic models, and, second, most of them are biologically irrelevant, because other important biological constraints are not taken into account. With the objective of both enumerating only biologically feasible EFMs or EFVs (without filtering them afterwards) and, as there are considerably fewer of them, improving the scalability of this computation, we took into consideration in this paper on one side thermodynamic and, more generally, kinetic constraints and on the other side regulatory constraints, and we tackled the problem of revisiting in this new extended framework the concept of EFMs and EFVs and, more largely, of characterizing the geometry of the solution space. Actually, we considered a more general conceptual framework for constraints, namely their property to be compatible with vector signs (i.e., with vector supports separately in each closed r-orthant), because most properties of the geometrical structure of the solution space happen to depend only on this very general sign-compatibility of constraints. This is how we demonstrated, for constraints which are sign-monotone (i.e., once satisfied by a certain vector are satisfied by any vector with smaller or equal sign), which is the case of thermodynamic constraints and of kinetic constraints in the absence of bounds on enzyme concentrations, that the solution space is a union of convex polyhedral cones, which are certain faces of the flux topes of , and that the EFMs, which still coincide with the EFVs, are simply those of that satisfy the constraint. In addition, we showed that their computation can be efficiently integrated into the classical double description algorithm, as each newly built intermediate extreme ray that does not satisfy the constraint can be filtered out at each incremental step. For the specific case of thermodynamic constraints or of kinetic constraints in the absence of bounds on enzyme concentrations, we demonstrated that their solution spaces are identical and, when there are also no bounds on metabolite concentrations, made up of those maximal faces of the flux topes of which are entirely contained in a fixed open vector half-space and thus that the EFMs are simply those of belonging to this half-space and thus computable by adding one single homogeneous linear inequality to the initial (stoichiometric and reactions irreversibility) ones. The situation is more complex, and challenging from a computational perspective, for constraints which are only sign-invariant (i.e., once satisfied by a certain vector are satisfied by any vector with the same sign), which is the case of regulatory constraints. We showed that in fact sign-invariant constraints are constraints that are support-invariant in each closed r-orthant separately and that support-invariant constraints coincide with regulatory (Boolean) constraints, therefore regulatory constraints are prototypical of sign-invariant constraints. For such sign-invariant constraints, we demonstrated that the solution space is a finite disjoint union of semi-open convex polyhedral cones, obtained from certain faces of the flux topes of by removing certain of their own faces of lesser dimension, and that the EFVs are simply those of (equal to the EFMs of ) that satisfy the constraint. However, these cannot be efficiently computed by the double description algorithm because they cannot be filtered out during the incremental process. In addition it is no longer true that EFVs identify with EFMs, as it exists in general minimal solutions that are decomposable, and that EFMs identify with support-wise non-strictly-decomposable vectors (a new property that we introduced and which is the proper one to express non-decomposability in this general framework, in terms of support non-decomposability), as there are in general such support-wise non-strictly-decomposable solutions that are not minimal: there are usually strict inclusions between these three sets and we provided again a complete characterization of the two latter ones. Finally, we extended all these results to the case where inhomogeneous linear constraints (expressing for example capacity constraints or bounds on fluxes) exist, dealing thus with a convex flux polyhedron instead of a flux cone . Basically, most of the results regarding the geometrical structure of the solution space in the presence of the above biological constraints remain the same with cones replaced by polyhedra.
Future work will be carried out along two paths. First, the present theoretical work will be extended to minimal cut sets (MCSs). Such MCSs are defined as minimal (for set inclusion) sets of reactions whose deletion will block the operation of given objectives or target reactions (as, e.g., those producing some toxic or undesirable product), i.e., removal of an MCS (that is, the knockout of its reactions) implies a zero flux for the target reactions in a steady state. MCSs are important for computing intervention strategies, e.g., for metabolic engineering. It is obvious from this definition that, for a metabolic network modeled in a steady state by a flux cone , the MCSs are the minimal hitting sets of the set of target EFMs (identical to EFVs of the given metabolic network (i.e., the set of EFMs that comprise at least one of the target reactions), where a hitting set of the target EFMs is a set (of reactions) that has a nonempty intersection with each one of these EFMs. Moreover, this generalizes to a metabolic network modeled by a flux polyhedron by using EFPs. This gives an indirect method for computing MCSs [46,47] from the preliminary computation of EFVs (or EFPs) that can be applied in the presence of biological constraints by using the results obtained in this paper. However, it is known that there is also a method for computing MCSs directly as the EFVs of a dual network [48,49], obtained basically by transposing the stoichiometric matrix of the original matrix (thus in some sense, exchanging reactions and metabolites). We will therefore study how to define this dual operation properly for metabolic networks in the presence of biological constraints in order to preserve this result (or most of it). Second, the algorithms described in this paper will be implemented and testing conducted on metabolic networks described in the literature for which biological (thermodynamic, kinetic or regulatory) data are known. As it has been shown, the computation of elementary flux vectors (or elementary flux points) boils down to filtering those of extreme rays (or vertices) in each flux tope which satisfy the biological constraints. Moreover, for sign-monotone constraints as thermodynamic constraints or kinetic constraints in the absence of bounds on enzyme concentrations, this filtering can be easily integrated at each step of the incremental double description method, so it is natural to rely first on this method which benefits from very efficient implementations. In the absence of bounds on metabolite concentrations, the advantage is obvious as it is just necessary to add one step (i.e., one homogeneous linear constraint to deal with) in the DD algorithm. It has nevertheless been shown that at most 50% of the EFVs can be ruled out that way, which is clearly insufficient to scale up to GSMMs. Adding bounds on metabolite concentrations has already proved capable of ruling out a higher percentage of EFVs. This is achieved by checking the extreme rays at each step of the DD algorithm thanks to a call to a linear programming solver and it will be interesting to compare its efficiency with the method given by Proposition 4 which does not need using an LP solver but has the disadvantage of requiring reasoning in a higher dimension. As the structure of the solution space cannot be deduced from merely the knowledge of the EFVs but requires the identification of the largest consistent (w.r.t. the constraints) sets of EFVs, efficient computation of these LCS ’s will be studied. We can reasonably think that scaling up will be obtained only by dealing with all biological constraints together. However, as it has been shown, handling regulatory constraints poses serious problems as these constraints are not sign-monotone and thus filtering of the EFVs that satisfy them cannot be integrated incrementally into the DD algorithm. In addition the structure of the solution space as union of semi-open polyhedral cones (or semi-open polyhedra) is more complex and pathways of interest for biologists, such as elementary flux modes or support-wise non-strictly-decomposable vectors, no longer coincide with EFVs and require more complex computations to be identified. Algorithms will be carefully studied for maximal efficiency and novel ways of using the DD method or the use of other methods recently proposed such as local reverse search or satisfiability based methods [42,50] will be investigated.
Funding
This research received no external funding.
Data Availability Statement
Data sharing not applicable.
Acknowledgments
I would like to express my sincere thanks to my colleagues Sabine Peres, who introduced me a few years ago to bioinformatics and more particularly to metabolic pathway analysis, that was a field completely unknown to me, and Antoine Deza, who explained to me all I had to know about convex geometry and the double description algorithm. Without their help and the huge number of hours we spent working and discussing together, this work could never have been accomplished. I also warmly thank Rosemary Patricot for having proofread the English.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Urbanczik, R.; Wagner, C. An improved algorithm for stoichiometric network analysis: Theory and applications. Bioinformatics 2005, 21, 1203–1210. [Google Scholar] [CrossRef] [PubMed]
- Wagner, C.; Urbanczik, R. The Geometry of the Flux Cone of a Metabolic Network. Biophys. J. 2005, 89, 3837–3845. [Google Scholar] [CrossRef] [PubMed]
- Trinh, C.T.; Wlaschin, A.; Srienc, F. Elementary mode analysis: A useful metabolic pathway analysis tool for characterizing cellular metabolism. Appl. Microbiol. Biotechnol. 2009, 81, 813–826. [Google Scholar] [CrossRef] [PubMed]
- Llaneras, F.; Picó, J. Which Metabolic Pathways Generate and Characterize the Flux Space? A Comparison among Elementary Modes, Extreme Pathways and Minimal Generators. J. Biomed. Biotechnol. 2010, 2010, 753904. [Google Scholar] [CrossRef]
- Zanghellini, J.; Ruckerbauer, D.E.; Hanscho, M.; Jungreuthmayer, C. Elementary flux modes in a nutshell: Properties, calculation and applications. Biotechnol. J. 2013, 8, 1009–1016. [Google Scholar] [CrossRef]
- Müller, S.; Regensburger, G. Elementary Vectors and Conformal Sums in Polyhedral Geometry and their Relevance for Metabolic Pathway Analysis. Front. Genet. 2016, 7, 90. [Google Scholar] [CrossRef]
- Klamt, S.; Regensburger, G.; Gerstl, M.P.; Jungreuthmayer, C.; Schuster, S.; Mahadevan, R.; Zanghellini, J.; Müller, S. From elementary flux modes to elementary flux vectors: Metabolic pathway analysis with arbitrary linear flux constraints. PLoS Comput. Biol. 2017, 13, e1005409. [Google Scholar] [CrossRef]
- Schilling, C.H.; Palsson, B.O. The underlying pathway structure of biochemical reaction networks. Proc. Natl. Acad. Sci. USA 1998, 95, 4193–4198. [Google Scholar] [CrossRef]
- Schilling, C.H.; Letscher, D.; Palsson, B.O. Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. J. Theor. Biol. 2000, 203, 229–248. [Google Scholar] [CrossRef]
- Jevremovic, D.; Trinh, C.T.; Srienc, F.; Boley, D. On Algebraic Properties of Extreme Pathways in Metabolic Networks. J. Comput. Biol. 2010, 17, 107–119. [Google Scholar] [CrossRef]
- Motzkin, T.S.; Raiffa, H.; Thompson, G.L.; Thrall, R.M. The double description method. In Contributions to the Theory of Games II, Annals of Mathematics Studies; Kuhn, H.W., Tucker, A.W., Eds.; Princeton University Press: Princeton, NJ, USA, 1953; Volume 28. [Google Scholar]
- Fukuda, K.; Prodon, A. Double Description Method Revisited. In Combinatorics and Computer Science; Deza, M., Euler, R., Manoussakis, I., Eds.; Springer: Berlin/Heidelberg, Germany, 1996; Volume 1120, pp. 91–111. [Google Scholar]
- Gerstl, M.P.; Müller, S.; Regensburger, G.; Zanghellini, J. Flux tope analysis: Studying the coordination of reaction directions in metabolic networks. Bioinformatics 2019, 35, 266–273. [Google Scholar] [CrossRef]
- Gudmundsson, S.; Thiele, I. Computationally efficient flux variability analysis. BMC Bioinform. 2010, 11, 489. [Google Scholar] [CrossRef]
- Clarke, B.L. Stoichiometry network analysis. Cell Biophys. 1988, 12, 237–253. [Google Scholar] [CrossRef]
- Rockafellar, R.T. The elementary vectors of a subspace of RN. In Combinatorial Mathematics and Its Applications; University of North Carolina Press: Chapel Hill, NC, USA, 1969; pp. 104–127. [Google Scholar]
- Schuster, S.; Hilgetag, C. On elementary flux modes in biochemical reaction systems at steady state. J. Biol. Syst. 1994, 2, 165–182. [Google Scholar] [CrossRef]
- Gagneur, J.; Klamt, S. Computation of elementary modes: A unifying framework and the new binary approach. BMC Bioinform. 2004, 5, 175. [Google Scholar] [CrossRef]
- de Figueiredo, L.F.; Podhorski, A.; Rubio, A.; Kaleta, C.; Beasley, J.E.; Schuster, S.; Planes, F.J. Computing the shortest elementary flux modes in genome-scale metabolic networks. Bioinformatics 2009, 25, 3158–3165. [Google Scholar] [CrossRef]
- Khachiyan, L.; Boros, E.; Borys, K.; Elbassioni, K.; Gurvich, V. Generating all vertices of a polyhedron is hard. Discret. Comput. Geom. 2008, 39, 174–190. [Google Scholar] [CrossRef]
- Acuña, V.; Marchetti-Spaccamela, A.; Sagot, M.F.; Stougie, L. A note on the complexity of finding and enumerating elementary modes. BioSystems 2010, 99, 210–214. [Google Scholar] [CrossRef] [PubMed]
- Acuña, V.; Chierichetti, F.; Lacroix, V.; Marchetti-Spaccamela, A.; Sagot, M.F.; Stougie, L. Modes and cuts in metabolic networks: Complexity and algorithms. BioSystems 2009, 95, 51–60. [Google Scholar] [CrossRef] [PubMed]
- Dyer, M.E. The complexity of vertex enumeration methods. Math. Oper. Res. 1983, 8, 381–402. [Google Scholar] [CrossRef]
- McMullen, P. The maximum numbers of faces of a convex polytope. Mathematika 1970, 17, 179–184. [Google Scholar] [CrossRef]
- Terzer, M.; Stelling, J. Large-scale computation of elementary flux modes with bit pattern trees. Bioinformatics 2008, 24, 2229–2235. [Google Scholar] [CrossRef]
- van Klinken, J.B.; Willems van Dijk, K. FluxModeCalculator: An efficient tool for large-scale flux mode computation. Bioinformatics 2016, 32, 1265–1266. [Google Scholar] [CrossRef] [PubMed][Green Version]
- Atkins, P.; de Paula, J. Physical Chemistry, 10th ed.; Freeman: Alexandria, VA, USA, 2014. [Google Scholar]
- Henry, C.S.; Broadbelt, L.J.; Hatzimanikatis, V. Thermodynamics-based metabolic flux analysis. Biophys. J. 2007, 92, 1792–1805. [Google Scholar] [CrossRef]
- Liebermeister, W.; Uhlendorf, J.; Klipp, E. Modular rate laws for enzymatic reactions: Thermodynamics, elasticities and implementation. Bioinformatics 2010, 26, 1528–1534. [Google Scholar] [CrossRef]
- Noor, E.; Flamholz, A.; Liebermeister, W.; Bar-Even, A.; Milo, R. A note on the kinetics of enzyme action: A decomposition that highlights thermodynamic effects. FEBS Lett. 2013, 587, 2772–2777. [Google Scholar] [CrossRef] [PubMed]
- Jungreuthmayer, C.; Ruckerbauer, D.E.; Zanghellini, J. Utilizing gene regulatory information to speed up the calculation of elementary flux modes. arXiv 2012, arXiv:1208.1853. [Google Scholar]
- Jungreuthmayer, C.; Ruckerbauer, D.E.; Zanghellini, J. regEfmtool: Speeding up elementary flux mode calculation using transcriptional regulatory rules in the form of three-state logic. BioSystems 2013, 113, 37–39. [Google Scholar] [CrossRef] [PubMed]
- Jungreuthmayer, C.; Ruckerbauer, D.E.; Gerstl, M.P.; Hanscho, M.; Zanghellini, J. Avoiding the enumeration of infeasible elementary flux modes by including transcriptional regulatory rules in the enumeration process saves computational costs. PLoS ONE 2015, 10, e0129840. [Google Scholar] [CrossRef] [PubMed][Green Version]
- Burgard, A.P.; Nikolaev, E.V.; Schilling, C.H.; Maranas, C.D. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004, 14, 301–312. [Google Scholar] [CrossRef]
- Larhlimi, A.; David, L.; Selbig, J.; Bockmayr, A. F2C2: A fast tool for the computation of flux coupling in genome-scale metabolic networks. BMC Bioinform. 2012, 13, 57. [Google Scholar] [CrossRef] [PubMed]
- Röhl, A.; Goldstein, Y.; Bockmayr, A. EFM–Recorder—Faster Elementary Mode Enumeration via Reaction Coupling Order. Adv. Syst. and Synth. Biol. 2015, 1, 91–99. [Google Scholar]
- Peres, S.; Jolicœur, M.; Moulin, C.; Dague, P.; Schuster, S. How important is thermodynamics for identifying elementary flux modes? PLoS ONE 2017, 12, e0171440. [Google Scholar] [CrossRef] [PubMed]
- Peres, S.; Schuster, S.; Dague, P. Thermodynamic constraints for identifying the elementary flux modes. Biochem. Soc. Trans. 2018, 46, 641–647. [Google Scholar] [CrossRef] [PubMed]
- Gerstl, M.P.; Jungreuthmayer, C.; Muller, S.; Zanghellini, J. Which sets of elementary flux modes form thermodynamically feasible flux distributions? FEBS J. 2016, 283, 1782–1794. [Google Scholar] [CrossRef]
- Müller, S.; Regensburger, G.; Steuer, R. Enzyme allocation problems in kinetic metabolic networks: Optimal solutions are elementary flux modes. J. Theor. Biol. 2014, 347, 182–190. [Google Scholar] [CrossRef]
- Wortel, M.T.; Peters, H.; Hulshof, J.; Teusink, B.; Bruggeman, F.J. Metabolic states with maximal specific rate carry flux through an elementary flux mode. FEBS J. 2014, 281, 1547–1555. [Google Scholar] [CrossRef]
- Morterol, M.; Dague, P.; Peres, S.; Simon, L. Minimality of Metabolic Flux Modes under Boolean Regulation Constraints. In Proceedings of the 12th International Workshop on Constraint-Based Methods for Bioinformatics (WCB’16), Toulouse, France, 5 September 2016. [Google Scholar]
- Wagner, C. Nullspace Approach to Determine the Elementary Modes of Chemical Reaction Systems. J. Phys. Chem. B 2004, 108, 2425–2431. [Google Scholar] [CrossRef]
- Gerstl, M.P.; Jungreuthmayer, C.; Zanghellini, J. tEFMA: Computing thermodynamically feasible elementary flux modes in metabolic networks. Bioinformatics 2015, 31, 2232–2234. [Google Scholar] [CrossRef]
- Gerstl, M.P.; Ruckerbauer, D.E.; Mattanovich, D.; Jungreuthmayer, C.; Zanghellini, J. Metabolomics integrated elementary flux mode analysis in large metabolic networks. Sci. Rep. 2015, 5, 8930. [Google Scholar] [CrossRef]
- Haus, U.U.; Klamt, S.; Stephen, T. Computing knock-out strategies in metabolic networks. J. Comput. Biol. 2008, 15, 259–268. [Google Scholar] [CrossRef] [PubMed]
- Jungreuthmayer, C.; Nair, G.; Klamt, S.; Zanghellini, J. Comparison and improvement of algorithms for computing minimal cut sets. BMC Bioinform. 2013, 14, 318. [Google Scholar] [CrossRef] [PubMed]
- Ballerstein, K.; von Kamp, A.; Klamt, S.; Haus, U. Minimal cut sets in a metabolic network are elementary modes in a dual network. Bioinformatics 2012, 28, 381–387. [Google Scholar] [CrossRef]
- von Kamp, A.; Klamt, S. Enumeration of Smallest Intervention Strategies in Genome-Scale Metabolic Networks. PLoS Comput. Biol. 2014, 10, e1003378. [Google Scholar] [CrossRef] [PubMed]
- Mahout, M.; Carlson, R.P.; Peres, S. Answer Set Programming for Computing Constraints-Based Elementary Flux Modes: Application to Escherichia coli Core Metabolism. Processes 2020, 8, 1649. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).