Next Article in Journal
Black Hole Horizons and Thermodynamics: A Quantum Approach
Next Article in Special Issue
Using Quantum Computers for Quantum Simulation
Previous Article in Journal
Entropy and Information Approaches to Genetic Diversity and its Expression: Genomic Geography
Previous Article in Special Issue
Measurement Back-Action in Quantum Point-Contact Charge Sensing
Open AccessArticle

# Roofs and Convexity

by Armin Uhlmann
Institute for Theoretical Physics, University of Leipzig, PB 100 920, D-04009 Leipzig, Germany
Entropy 2010, 12(7), 1799-1832; https://doi.org/10.3390/e12071799
Received: 5 June 2010 / Revised: 5 July 2010 / Accepted: 9 July 2010 / Published: 20 July 2010

## Abstract

Convex roof extensions are widely used to create entanglement measures in quantum information theory. The aim of the article is to present some tools which could be helpful for their treatment. Sections 2 and 3 introduce into the subject. It follows descriptions of the Wootters' method, of the "subtraction procedure", and examples on how to use symmetries.
Keywords:

## 1. Introduction

One often is in the position to know a quantity for pure states of a quantum system, say g, without a definite meaning in classical physics. Then one looks for a method to extend g to all mixed states, see [1,2,3,4]. Let G denote such a possible extension.
A reasonable approach is certainly to extend g “as linearly as possible” or, more correctly, “as affine as possible”. The mathematical model for such a demand is a “roof” as presented in Section 2.
As it is difficult, to say the least, to imagine higher dimensional geometry, let us look at an elementary example, the real qubits. They fill a disk bounded by a circle. The circle represents the pure states. Considering $g ( π )$, with π pure, as the height of a wall at the point π. An extension G of g to the disk provides a covering of the ground floor. Let $G ( ω )$ be a point of that covering. To be a roof in the sense of Section 2, we like to have: There is either a straight line or a plane coincident with the roof point $ω , G ( ω )$ and resting on the wall at two or at three points $π j , g ( p i j )$. If the line or plane is parallel to the ground floor, it is called flat. An uncomfortable feature is the rather large arbitrariness: There are plenty of roof extensions allowing, however, as a bonus, much room for playing. Under rather weak assumptions, there is a minimal as well as a maximal roof extension (Proposition 2.1).
Physically stronger motivated seem extensions G of g which are either convex or concave. Heuristically, the convex ones try to suppress the “classical noise” or else the “classical information” by the tendency to attach lower values to the states “far” from the pure ones. The concave extension stress the “classical part”, possessing lower values in the vicinity of the pure states.
It is a well known fact that there are maximal convex and minimal concave extensions. If it exists, the minimal roof extension is equal to the maximal convex extension and this facilitate calculations sometimes.
The maximal convex extension $g ∪$ of g can be gained by
$g ∪ ( ω ) = inf ∑ p j g ( π j )$
where the inf is running through all convex decompositions
$ω = ∑ p j π j , p j ≥ 0 . ∑ p j = 1$
with pure states $π j$, . In quantum information theory this is nowadays a common procedure to define entanglement measures. Its first appearance is in the important paper  by C. H. Bennett, D. DiVincenzo, J. Smolin, and W. Wootters, and has been called entanglement of formation and is denoted by $E ( ω )$. These authors considered states ω of a bipartite quantum system. The role of g plays $S ( Tr b π )$, the entropy S of the partial trace Trb, as a function of a pure bipartite states π.
In the present paper a trace preserving positive map T from the states of a quantum system into another one is called a stochastic map. A channel is a completely positive stochastic map. If in the definition of entanglement of formation, E, a stochastic channel T is used in place of the partial trace Trb, we call the resulting quantity the entanglement of T and denote it by $E T$. For a channel, $E T$ is equivalent (in many ways) to the restriction of E onto a face of a bipartite state space of sufficiently high dimension, . In this way, T is seen as a sub-channel of the partial trace: The tensor product is partitioned into subspaces on which the sub-channels are defined. A 1-qubit channel, for example, can be represented by a $2 × m$ bipartite quantum systems as the restriction of the partial trace over the larger dimensional part onto the density operators supported by a suitable 2-dimensional subspace.
Clearly, $E T$ is of interest in its own as part of the $χ *$-function $χ * = S T − E T$ with $( S T ) ( ω ) = S ( T ( ω ) )$ the maximum of which is the Holevo capacity  of T.
Since the importance of E has been realized in , several other measures of similar structure have been introduced and discussed, replacing the von Neumann entropy S by another function on the state space of the output system. The perhaps mostly discussed examples are the “tangle” $τ T$ and the “concurrence” $C T$. The connection between them is $C T ( π ) 2 = τ T ( π )$ for all pure states π. Sometimes one also needs a minimal concave extension (“entanglement of assistance”). All these quantities have been defined by global variational problems of type (1), (2).
A further, even earlier source for the said procedure roots in the problem of defining a “quantum dynamic entropy”, generalizing the Kolmogorov–Sinai one. In the approach  of A. Connes, H. Narnhofer, and W. Thirring several similar global variational problems wait to be solved. One of them is the search for the convex roof defined by $g ( ω ) = S ( D n ( ω ) )$, S again von Neumann’s entropy and $D n$ the diagonal map, setting all off-diagonal entries of a matrix to zero. The problem initiated the paper  of F. Benatti, H. Narnhofer, and A. Uhlmann and further ones.
It is true that complete solutions for these extensions are only known in the lowest non-trivial dimensions. In the present paper general tools are presented to facilitate the treatment of convex or concave roofs. The general aspects are mainly in the Section 2 and Section 3. It includes the foliation of the input state space into “convex leaves” onto which the roof becomes affine and, in particular nice situations, even constant, [10,11].
Section 4 is devoted to the Wootters’ way [12,13,14] of presenting the entanglement of formation explicitly, see also .
Section 5 shows a more recent way to compute the concurrence of a 1-qubit stochastic map by a substraction procedure. For the concurrence of $2 × m$ quantum systems it allows to compute the concurrence of any rank two quantum state. An elegant way to do so was opened by Hildebrand, [16,17], using the so-called “S-lemma”of Yakobovich, see . Another one has been chosen by Hellmund and Uhlmann, , who use the description of general 1-qubit maps given by Gorini and Sudershan, .
For the tangles the pioneering work goes back to Coffman, Kundu, and Wootters, , who already remarked that optimal decompositions of length two should be sufficient in the 1-qubit case. That the “substraction procedure” works can be read off from a paper of Osborne and Verstraete, . Here we describe analytical results for axial symmetric channels. There are also results for the 3-tangle roof problem, .
Most results, if not numerically, are found by the help of symmetries. Besides the already quoted ones, an essential step has been done by K. G. Vollbrecht and R. F. Werner, , and B. M .Terhal and K. G. Vollbrecht . Meanwhile it became a very large domain of research, exceeding the frame of the present paper. Hence, in Section 6, only some aspects, connected with maximal symmetric states, are touched.
Some notations: We use $H$ for Hilbert spaces, $B ( H )$ for its algebra of operators, $Ω ( H )$ for the set of density operators supported by $H$. We also say “state” for “density operator” and “state space” for $Ω ( H )$. As a convex set, $Ω ( H )$ is embedded in Herm$( H )$, the real linear space of Hermitian operators. The symbol Ω is also used for a general compact convex sets in a real linear space. Following , the extremal points of a convex set will be called “pure” ones mostly. They are usually symbolized by the letter π. We follow  in using $η ( x ) = − x log x$ and $S ( ω ) = Tr η ( ω )$, the von Neumann entropy.

## 2. Roofs, Roof Extensions

We are now going to give an exact meaning to the word “roof”. For this purpose we assume G to be a real valued function on a compact convex set Ω, contained in a finite dimensional real linear space.
Definition 2.1a: Roof points
$ω ∈ Ω$ is called a roof point of G, if there is at least one extremal convex decomposition
$ω = ∑ p j π j , π j ∈ Ω pure$
such that
$G ( ω ) = ∑ p j G ( π j )$
If this takes place, we call the decomposition (3) optimal with respect to G or, equivalently, G-optimal. The number of terms in (3) with $p j ≠ 0$ is the length of the decomposition.
Definition 2.1b: Flat roof points
A roof point ω of G is called flat, if there is a G-optimal decomposition (3) fulfilling
$G ( ω ) = G ( π 1 ) = G ( π 2 ) = ⋯$
i.e. all the values $G ( π j )$ are equal one to another .
Let $f ( x )$ be a real function defined on the range of G. The main merit of a flat roof point ω of G is the simple fact, that it remains a flat roof point for $f ( G )$: $f ( G ( π j ) ) = f ( G ( ω ) )$ for all j and (4) remains true for $f ( G )$. In other words, the flat points of $ρ → G ( ρ )$ are also flat roof points of $ρ → f ( G ( ρ ) )$.
Definition 2.2: Roofs, flat roofs
A real function G on Ω is a roof if every $ω ∈ Ω$ is a roof point of G.
G is called a flat roof if all roof points are flat ones.
In important applications the point of view is a bit different: A function $π → g ( π )$ is given on the set $Ω pure$ of pure states, or, more generally, on the set of extremal states of an arbitrary compact convex set. Then one asks for meaningful extensions G which coincides with g on $Ω pure$. In such a generality the problem is too arbitrarily posed and one asks for restrictions to such an extension. Remarkable ones are the roof extensions. They interpolate the values of g “as linearly (or as affine) as possible”.
Definition 2.3: Roof extensions
Let $π → g ( π )$ be a real function on $Ω pure$. A roof G is called a roof extension of g, if $G ( π ) = g ( π )$ for pure states.
Now observe the following simple fact: The maximum (the minimum)
$max { G 1 , ⋯ , G n } respectively min { G 1 , ⋯ , G n }$
of finitely many roof extensions of g is a roof again.
Indeed, assume for roof extensions $G 1 , ⋯ , G n$ of g and $ω ∈ Ω$ the value $G 1 ( ω )$ is not less than the other values $G j ( ω )$. Then one selects a $G 1$-optimal decomposition for $max { G 1 , ⋯ , G n }$.
The reasoning above fails for infinitely many roof extensions. The proof of the following is postponed to that of proposition 3.5.
Proposition 2.1
Let g be a real continuous function on $Ω pure ( H )$ and G a roof extension of g. For all $ω ∈ Ω ( H )$ there exist optimal decompositions (3), (4) the length of which does not exceed $( dim H ) 2 + 1$.
There is a minimal and a maximal roof extension of g.

#### 2.1. Examples

Roof extensions exist in abundance. To see this and also the difficulties, which we may get into, let us consider a few examples showing some typical constructions. We remain within the state space $Ω ( H )$, $dim H = d$ and we start with $d = 2$.

#### Example 2.1: A Bloch ball construction

Seen from convex analysis, the space Ω of all 1-qubit density operators is a 3-dimensional ball, the Bloch ball. $Ω pure$ is the Bloch sphere, the surface of the Bloch ball.
Now assume there is a function g on the Bloch sphere and we like to find roof extensions G of g. Particular nice ones are gained as following: We take a bundle of straight lines such that every point ω of the Bloch ball is coincident with exactly one line, say $L ω$, of the bundle. If ω is not pure, then $L ω$ hits the Bloch sphere at exactly two points, say $π 1$ and $π 2$. Hence, ω is a convex combination of them. Now we define
$G ( ω ) = p g ( π 1 ) + ( 1 − p ) g ( π 2 ) if ω = p π 1 + ( 1 − p ) π 2$
Because there is just one line from our bundle going through ω, we get a well-defined roof.
Let us now specify our example by choosing a bundle of parallel lines. It belongs exactly one main axis of the Bloch ball to the bundle. We may assume that it is the $x 3$-axis with respect to the Bloch coordinates $x 1 , x 2 , x 3$ of a general Pauli representation
$ω = 1 2 ( 1 + x 1 σ 1 + x 2 σ 2 + x 3 σ 3 )$
The line $L ω$ is now fixed by the values $x 1$ and $x 2$, letting the third Bloch coordinate arbitrary. $L ω$ crosses the Bloch Sphere at the pure states $π ±$ with the Bloch coordinates $x 1$, $x 2$, $y 3 = ± ( 1 − x 1 2 − x 2 2 ) 1 / 2$. Hence
$π ± = 1 2 ( 1 + x 1 σ 1 + x 2 σ 2 ± [ 1 − x 1 2 − x 2 2 ] 1 / 2 σ 3 ) , ω = p π + + ( 1 − p ) π −$
with $0 ≤ p ≤ 1$. Expressing now g in terms of Bloch coordinates, we obtain the roof
$G ( ω ) = G ( x 1 , x 2 ) = p g ( x 1 , x 2 , + 1 − x 1 2 − x 2 2 ) + ( 1 − p ) g ( x 1 , x 2 , − 1 − x 1 2 − x 2 2 )$
The example is further specified by choosing for g the Shannon entropy of the diagonal elements of ω. Then G becomes the roof
$E T ( ω ) = H ( 1 + 1 − x 1 2 − x 2 2 2 , 1 − 1 − x 1 2 − x 2 2 2 )$
$T ( ω )$ denotes the diagonal part of ω and H the Shannon entropy of the diagonal elements. That this is the solution for the entanglement of the diagonal channel, has been suggested by Levitin,  and Thirring . (10) provides a roof extension as described above. Indeed, it is a flat roof depending on $1 − x 1 2 − x 2 2$, hence a function of the Euclidean distance $x 1 2 + x 2 2$ of ω from the $x 3$-axis. It is
$〈 0 | ω | 1 〉 = x 1 − i x 2 2 , x 1 2 + x 2 2 = 4 | 〈 0 , ω | 1 〉 | 2$
Therefore, the concurrence of the diagonal map for qubits can be written
$C T ( ω ) = | 〈 0 | ω | 1 〉 |$
It is easily to be seen that C is a flat convex roof. It possesses quite large domains where it is just affine: Cut the Bloch ball with a plane containing the $x 3$-axis. We get a disk, cut by the $x 3$-axis into two half-disks. The concurrence is affine on every such half-disk.
Indeed, if T is a 1-qubit channel with two different pure fix points, one can observe a similar phenomenon. (The disks will be cut by the axis through the fix points.)

#### Example 2.2

There is a constructions, similar to the previous one: Given any function $f ( x 3 )$ on the $x 3$-axis. We construct a roof by
$A ( ω ) = f ( x 3 ) , x 3 = 〈 0 | ω | 0 〉 − 〈 1 | ω | 1 〉$
More generally, given a bundle of planes so that every point of the Bloch ball is coincident with one and only one of these planes. A function, constant on planes, is a roof. This can be achieved by choosing a function f on the $x 3$-axis and attaching to every plane the value of f at its crossing with this axis.
Similar constructions are possible with higher dimensional balls (ellipsoids). However, on higher dimensional state spaces things are essential more complicated due to the subtle structure of the set of their pure states.

#### Example 2.3: Affine functions on the state space

An affine function on Herm$( H )$ is of the form
with an Hermitian operator A and a real number a. It is sometimes useful to write (14) in the form
The function (14) is trivially a roof: If we have any extremal decomposition (3) one immediately gets $l ( ρ ) = ∑ p j l ( π j )$. Thus, every extremal decomposition is optimal.
We easily conclude that with G also $G + l$ is a roof.
A bit more tricky is the assertion that every function (14) is a flat roof on the state space $Ω ( H )$.
We prove this by induction to the dimension of the Hilbert space. We start with the qubit case. We can choose a basis in $H$ such that $A = a + a ′ σ 3$ and
$ω = 1 2 ( 1 + x 1 σ 1 + x 3 σ 3 )$
It suffices to prove the assertion for $A = σ 3$, resulting in $l ( X ) = Tr σ 3 X = x 3$. With an unimodular number ϵ we consider the pure states
$ω ± = 1 2 ( 1 + x 1 σ 1 ± ϵ 1 − x 1 2 − x 3 2 σ 2 + x 3 σ 3 )$
so that our affine function is constant at the segment $p ω + + ( 1 − p ) ω −$. The segment contains ω for $p = 1 / 2$.
Varying ϵ we see the following: To every affine function l as in (14) there is an axis through the Bloch ball such that l is constant on every plane perpendicular to that axis. We can say something more: Given $ω ∈ Ω$ and two affine functions, $l j$, $j = 1 , 2$, we use the constants $l j ( ω ) = c j$ to define two planes by $l j ( X ) = c j$. Because the two planes contain ω, they intersect along a line, say $L ω$, containing ω. The intersection of $L ω$ with the Bloch sphere provides two pure states which define a flat optimal decomposition of ω.
Proposition 2.2
Affine functions on $Ω ( H )$, $dim H < ∞$, are flat roofs. Given two affine functions and a density operator ω, there is a common flat optimal decomposition of ω.
Proof. The 2-dimensional case has already been proved. Let $l j ( X ) = a j + A j X$, $j = 1 , 2$, denote two affine functionals and choose $ω ∈ Ω ( H d )$. Assume the assertion is true for $dim H < d$. If ω is from the boundary of $Ω ( H d )$ we are done by our induction hypothesis. Otherwise we consider the linear subspace $L$ of Herm$( H d )$ orthogonal to $A 1$ and $A 2$, i.e., of all Y satisfying $A j Y = 0$, $j = 1 , 2$. Consider the affine space $L ω = ω + L$. It contains ω and it is $l j ( Y ) = l j ( ω )$ for all $Y ∈ L ω$. The intersection $K = Ω ∩ L ω$ is compact and its extremal points belong to the boundary of $Ω ( H d )$. Therefore, we get a decomposition
$ω = ∑ p j ω j , rank ( ω j ) < d$
and, furthermore, $l j ( ω ) = l j ( ω j )$ because of $ω j ∈ K$. However, the support of any $ω j$ is of dimension less than d. Therefore, there are extremal decompositions
$ω j = ∑ k p j k π j k , l j ( ω j ) = l ( π j k )$
for all j and all k with pure states $π j k$. Thus
$ω = ∑ j p j ∑ k p j k π j k$
is a flat optimal decomposition of ω for $l 1$ as well for $l 2$. Thus the proposition is proved.
The following is a corollary.
Proposition 2.3
Let $f ( x 1 , x 2 )$ be a function defined on the range of two affine functions, $l 1$ and $l 2$. Then
$F ( ω ) : = f ( l 1 ( ω ) , l 2 ( ω ) )$
is a flat roof.

#### Example 2.4: An application to the diagonal map

We like to apply the proposition above to the diagonal map
$X → D n ( X ) = diag ( X )$
which which is a channel on $Ω ( H )$. diag$( X )$ denotes the diagonal part of X obtained by replacing all off-diagonal elements of X by zeros. Proposition 2.3 proves that the von Neumann entropy
$ω → S ( D n ( ω ) ) = ∑ η ( 〈 j | ω | j 〉 )$
is a flat roof for $n = 2 , 3$. For $n = 3$ one replaces the third diagonal element, $x 33$, by $1 − x 11 − x 22$ to see that proposition 2.3 suffices to verify the assertion.
We like to prove the flatness of (17) for all dimensions. This will be done in example 3.2 later on.

#### Example 3

A general way to obtain roof extensions in state spaces is presented next. It allows for further modifications, but it is difficult to be controlled explicitly.
Let g be a continuous real function on $Ω pure$ and $ω ∈ Ω$. Every basis $| ψ 1 〉 , ⋯ , | ψ d 〉$ gives rise to a decomposition
$ω = ∑ ω | ψ j 〉 〈 ψ j | ω$
of positive rank one operators. After normalization of the rank one operators we get extremal convex decomposition of ω,
$ω = ∑ p j ≠ 0 p j ω | ψ j 〉 〈 ψ j | ω 〈 ψ j | ω | ψ j 〉 , p j = 〈 ψ j | ω | ψ j 〉$
In the following definitions we vary over all bases.
$G 1 ( ω ) = min bases ∑ p j ≠ 0 p j g ( ω | ψ j 〉 〈 ψ j | ω 〈 ψ j | ω | ψ j 〉 )$
where $p j$ is as in (19). Because continuity of g is assumed, and the set of bases is compact, the minimum in (20) will be attained. Thus, $G 1$ is a roof extension of g.
One gets another roof extension by
$G 2 ( ω ) = max bases ∑ p j ≠ 0 p j g ( ω | ψ j 〉 〈 ψ j | ω 〈 ψ j | ω | ψ j 〉 )$
One cannot guaranty the roof property without continuity of g.
(21) and (20) are global optimization problems for which there are no explicit expressions known in most cases. There are, however, algorithms to approximate them numerically.

## 3. Roofs and Convexity

We start with some definitions and elementary, mostly well-known statements around maximal convex and minimal concave extensions. The second subsection is concerned with sufficient conditions for the existence of convex and concave roofs and their main properties.

#### 3.1. Convex and Concave Extensions

Let Ω be a compact convex set and g a real function defined on the set $Ω pure$ of its extremal points.
Definition 3.1: Convex (concave) extensions
A convex function G is called a convex (respectively concave) extension of g if G is convex (respectively concave) and coincides on $Ω pure$ with g.
Between any convex, concave or roof extension of a function g the inequalities
$G convex ≤ G roof ≤ G concave$
are valid. Indeed, with an optimal decomposition (3) for $G roof$ we get
$G roof ( ω ) = ∑ p j G roof ( π j ) = ∑ p j G convex ( π j )$
because the extensions coincide on $Ω pure$. The right sum cannot be smaller than $G convex ( ω )$ by convexity. Similarly one argues in the concave case.
The proof of (22) is valid pointwise, leading to
Proposition 3.1
Let $G convex$ be a convex, $G concave$ a concave, and G any extension of g. If ω is a roof point of G, then
$G convex ( ω ) ≤ G ( ω ) ≤ G concave ( ω )$
Let $G convex$ and $G concave$ be as in the above proposition and assume in addition equality in (23), $G convex ( ω ) = G concave ( ω )$. For all convex combination
$ω = ∑ p j ω j , ω j ∈ Ω , p j > 0 ,$
we conclude
$∑ p j G convex ( ω j ) ≥ G convex ( ω ) = G concave ( ω ) ≥ ∑ p j G concave ( ω j )$
because of (22) the conclusion is $G convex ( ω j ) = G concave ( ω j )$ for all j.
To express this finding we need some standard terminology. A subset $K ⊂ Ω$ is called a face of Ω if $ω ∈ K$ and (24) necessarily implies $ω j ∈ K$ for all j. Faces are convex subsets of Ω. If not only Ω but also $Ω pure$ is compact then faces are compact and any face K is convexly generated by $Ω pure ∩ K$.
The intersection of faces is either empty or a face again. The smallest face containing ω will be called ω-face of Ω and denoted by face$ω [ Ω ]$. If K is a face and ω not a point of the boundary of K, then K is the ω-face of Ω.
Now we express the finding above by
Proposition 3.2
Coincide a convex and a concave extension of g at $ω ∈ Ω$, then they coincide on the ω-face of Ω.
The least upper bound of a set of convex functions is convex again. Hence, given g on $Ω pure$, there is a unique largest convex extension of g from $Ω pure$ to Ω. Similarly there exists a smallest concave extension of g. It is convenient to introduce an extra notation for these extensions:
Definition 3.2: $g ∩$ and $g ∪$
Let g be a real function on $Ω pure$. We denote by $g ∪$ the largest convex and by $g ∩$ the smallest concave extension of g to Ω,
$g ∪ = largest convex extension of g from Ω pure to Ω ,$
$g ∩ = smallest concave extension of g from Ω pure to Ω .$
We also write $G = G ∪$ (or $G = G ∩$) if G is the largest convex (or the smallest concave) extension of the restriction of G onto $Ω pure$.
Proposition 3.3
Let G be an extension of g and ω one of its roof points. If G is convex, then $G ( ω ) = g ∪ ( ω )$. If G is concave, then $G ( ω ) = g ∪ ( ω )$.
If a convex (resp. concave) roof extension of g exists, then it is unique. Because G is convex, $G ≤ g ∪$. Because ω is a roof point, (23) asserts $G ( ω ) ≥ g ∪ ( ω )$.
Is there a convex extension at all for a given g ? If there is one then there is also a largest one, i.e. $g ∪$ exists. The answer to the question is affirmative and has been given in  by a variational characterization which is well known in quantum information theory as a recipe to construct entanglement measures:
$g ∪ ( ω ) = inf ∑ p j g ( π j )$
$g ∩ ( ω ) = sup ∑ p j g ( π j )$
where the “inf”, respectively “sup”, is running over all extremal convex decompositions
$ω = ∑ p j π j , π j ∈ Ω pure$
of ω.
Indeed, if G is an extension of g which is convex, the right side of (25) must be always larger than $G ( ω )$. On the other hand, given $ω 1$ and $ω 2$, one can find decompositions (27) for them differing an arbitrary small amount $ϵ > o$ from $g ∪ ( ω 1 )$ respectively $g ∪ ( ω 2 )$. They may be composed from the pure states $π i , j$ and probabilities $p i , j$, $i = 1 , 2$. Then
$2 ϵ + p g ∪ ( ω 1 ) + ( 1 − p ) g ∪ ( ω 2 ) ≥ ∑ p p 1 , j g ( π 1 , j ) + ∑ ( 1 − p ) g ( π 2 , k )$
and this is not smaller than $g ∪ ( p ω 1 + ( 1 − p ) ω 2 )$. Because ϵ can be arbitrary near to zero, $g ∪$ is convex. The concave case can be settled by a similar reasoning or by
$− g ∪ = ( − g ) ∩ , − g ∩ = ( − g ) ∪$
Remark: Hulls of functions
The convex hull of G is the largest convex function which is smaller than G. The concave hull of G is the smallest concave function which is larger than G. In (25) and (26) one uses the values of G at the pure states only. Because the hull construction must respect values on the whole of Ω, there are more constraints to be fulfilled.
The expressions (25) and (26) are similarly structured as those of the convex and the concave hulls of a function G on Ω. One mimics the proofs and gets
$conv [ G ] ( ω ) = inf ∑ p j G ( ω j )$
$conc [ G ] ( ω ) = sup ∑ p j G ( ω j )$
where, as in (24), one has to run through all convex combinations
$ω = ∑ p j ω j , ω j ∈ Ω , p j > 0$
Obviously, conv$[ G ]$ is smaller than $G ∪$ and conc$[ G ]$ is larger than $G ∩$. This quite simple reasoning provides also
Proposition 3.4
If G is a concave or a roof extension of g then $g ∪ = conv [ G ]$.
If G is a convex or a roof extension of g then $g ∩ = conc [ G ]$.
As a matter of fact one can do similar hull constructions with any subset of Ω which convexly generates Ω. This has been emphasized in .

#### 3.2. Convex and Concave Roofs

If there is a convex roof extension of g, then it equal to $g ∪$. There is a sufficient condition to guaranty the roof property of $g ∪$ and of $g ∩$.
Proposition 3.5
Let Ω be a convex set. Assume both, Ω and $Ω pure$, are compact and g continuous on $Ω pure$. Then $g ∪$ and $g ∩$ are roofs. According to proposition 3.3 they are the minimal respectively maximal roof extensions of g.
Remember that $Ω pure$ is compact if $Ω = Ω ( H )$ and $H$ is finite dimensional. The requirement of continuity of g is often satisfied in physically motivated applications, though not always. A counter example is the Schmidt number in bipartite quantum systems. In this case it is not known whether $g ∪$ and $g ∩$ are roofs. Nevertheless. the assumptions needed are rather weak ones. We met them already in proposition 2.1.
The proof will be “constructive” in a certain sense. It is arranged to sharpen “theorem 1” in : If we know an optimal decomposition with pure states $π 1 , π 2 , ⋯$, then every convex combination of them is optimal. In particular, the (convex or concave) roof is affine on the convex set generated by the pure states $π 1 , π 2 , ⋯$. Restricted to this this set, the graph of G is a piece of an affine space. The whole graph of G appears as composed of flat pieces . If one would know a covering of Ω by those “convex leaves”, one could compute $g ∪$ from the values of g at $Ω pure$. Things are similar for $g ∩$.
To start proving propositions 3.5 and 2.1 let us repeat the assumptions. Ω is a compact convex set in a real linear space $L$ of finite dimension, the set $Ω pure$ of all pure (i.e., extremal) points of Ω is compact. g is a real continuous function on $Ω pure$. The dimension of Ω as a set in $L$ is denoted by n. It is the dimension of the affine space generated by Ω.
Remark: The space $Ω ( H )$ of density operators is embedded in Herm$( H )$. The latter is of dimension $d 2$ if $dim H = d$. The affine space generated by $Ω ( H )$ is the hyperplane of Hermitian operators of trace one. The dimension of $Ω ( H )$ is $n = d 2 − 1$.
We enlarge $L$ to the linear space $L ′ = L ⊕ I R$. Its elements, $X ⊕ λ$, will be written in vector form ${ X , λ }$ with two components, $X ∈ L$ and $λ ∈ I R$. We need the set
$E = { π , g ( π ) } , π ∈ Ω pure$
E is a compact set by our assumption. Hence its convex hull, denoted by $Ω [ g ]$, is a compact convex set. The set of extremal points of $Ω [ g ]$ is E. (If one of the elements of E would be a convex combination of the others, the same would be true for the corresponding pure states, contradicting our assumptions.)
Choose $ω ∈ Ω$ and consider in $L ⊕ I R$ the straight line consisting of the points ${ ω , λ }$, $λ ∈ I R$. The line intersects with $Ω [ g ]$ along a compact segment
${ ω , λ } ∈ Ω [ g ] ⇔ λ 0 ( ω ) ≤ λ ≤ λ 1 ( ω )$
λ satisfies (32) if and only if there is an extremal decomposition
${ ω , λ } = ∑ p j { π j , g ( π j ) } = { ω , ∑ p j g ( π j ) } , π j ∈ Ω pure$
Therefore,
$g ∪ ( ω ) = λ 0 ( ω ) ≤ λ ≤ λ 1 ( ω ) = g ∩ ( ω )$
and there exist extremal decompositions of ω with equality in (25) respectively (26). Therefore, $g ∪$ and $g ∩$ are roofs and proposition 3.5 has been proved.
If G is a roof extension of g, the point ${ ω , G ( ω ) }$ is contained in $Ω [ g ]$. Hence it can be represented by a convex combination of elements from E. As the dimension of $Ω [ g ]$ is $n + 1$, there are, by a theorem of Carathèodory, pure convex decomposition of length $n + 2$. This proves proposition 2.1: For $Ω = Ω ( H )$ it follows $n + 2 = ( d 2 − 1 ) + 2$.
On the other hand, a point ${ ω , g ∪ ( ω ) }$ belongs to a face of the boundary of $Ω [ g ]$. Its dimension cannot exceed n. Thus there are, again by Carathèodory, pure decompositions of length $n + 1$. For $Ω ( H )$ this gives an achievable length $d 2$, an often used fact .
Definition 3.2: convex leaves
Let G be a real function on Ω. A subset $K ⊂ Ω$ is called a convex leaf of G if
(a) K is compact and convex,
(b) K is the convex hull of $K ∩ Ω pure$,
(c) G is convexly linear on K, i.e.
$G ( ∑ p j ρ j ) = ∑ p j G ( ρ j ) if all ρ j ∈ K$
K is called complete, if all pure states which can appear in an optimal decomposition of any $ρ ∈ K$ are contained in K.
Proposition 3.6
With the assumptions of proposition 3.5 it holds: For $g ∪$ (respectively $g ∩ )$ and any $ω ∈ Ω$ there are complete convex leaves containing ω.
It makes sense to call the set of all complete convex leaves of G the convex foliation of G or, shortly, the G-foliation.
Proof. The proposition will be proved for $g ∪$. The case of $g ∩$ is similar. There is an affine functions l such that
$l ≤ g ∪ , l ( ω ) = g ∪ ( ω )$
We define a subset K of Ω by
$K = { ρ | l ( ρ ) = g ∪ ( ρ ) }$
Proposition 3.6.a: (36) is a complete $g ∪$-leaf.
Clearly, $ω ∈ K$. Let us choose $ρ ∈ K$. For an optimal pure decomposition, $ρ = ∑ p j π j$, one obtains
$∑ p j g ∪ ( π j ) = g ∪ ( ρ ) = l ( ρ ) = ∑ p j l ( π j )$
However, $l ( π j ) ≤ g ∪ ( π j )$ by assumption. By the equation above, all these inequalities must be equalities. Hence, the pure composers $π j$ of every optimal decomposition of any $ρ ∈ K$ are contained in K,i.e., K is complete. Now choose another $ρ ′ ∈ K$ and let $ρ ′ = ∑ p k ′ π k ′$ be an optimal decomposition. For $0 < p < 1$ we get, applying first our assumption and then convexity of $g ∪$,
$l ( p ρ + ( 1 − p ) ρ ′ ) ≤ g ∪ ( p ρ + ( 1 − p ) ρ ′ ) ≤ p g ∪ ( ρ ) + ( 1 − p ) g ∪ ( ρ ′ )$
By the assumption the right hand side can be written
$p l ( ρ ) + ( 1 − p ) l ( ρ ′ ) = l ( p ρ + ( 1 − p ) ρ ′ )$
and is equal to the left expression. Hence, equality must hold and K must be a convex set. We have seen already that every $ρ ∈ K$ can be represented by a convex combination of elements from $K ∩ Ω pure$. Because g is continuous, the set of all π satisfying $g ( π ) = l ( π )$ is compact. Hence, K is compact. Indeed, it is convexly generated by a compact set.
We repeat a further standard notation. An element ρ of a convex set K is called K-inner or “convexly inner” if for any $ν ∈ K$ follows $( 1 + s ) ρ − s ν ∈ K$ and small enough positive s. Geometrically, the line segment from ν to ρ can be prolonged a bit without leaving K. There is also a topological characterization: A K-inner point is an inner point of K with respect to the affine space generated by K. (Example: The invertible density operators are the convexly inner points of $Ω ( H )$.)
The intersection of complete convex leaves is either empty or it is a complete convex leaf. Hence there is a minimal complete convex leaf containing a given $ω ∈ Ω$, the ω-leaf of $g ∪$. It is convexly generated by all those $π ∈ Ω pure$ which can appear in an optimal decomposition of ω.
The ω-leaf is the largest convex leaf containing ω as convexly inner point.
Now let $K 1$ be the $ω 1$-leaf and $K 2$ that of $ω 2$. If $ω 1$ is a convexly inner point of $K 2$, then $K 1 ⊂ K 2$. In particular, $K 1 = K 2$ if and only if they contain a point which is commonly inner. Let us draw a corollary:
If $K 2$ is properly larger than $K 1$, the convex dimension of $K 2$ must be strictly larger than that of $K 1$.
A chain $K 1 ⊂ K 2 ⊂ ⋯$, consisting of different complete $g ∪$-leaves, cannot contain more than $n + 1$ members. As above, n denotes the convex dimension of Ω. The maximal number of different leaves in any chain is called the depth of the $g ∪$-foliation. As said above, the $g ∪$-foliation consists of all complete $g ∪$-leaves.
In the case $Ω = Ω ( H )$, the depth if bounded by $d 2$. If the roof is an affine function, see example 2.3, exactly the faces of Ω are its leaves and the bound is reached.
Let us look again to the setting above in more geometric terms. We shall see that the ω-leaves of $g ∪$ and $g ∩$ correspond uniquely to the faces of $Ω [ g ]$.
The triple ${ Ω [ g ] , Ω , Π }$ is a fiber bundle with bundle space $Ω [ g ]$, base space Ω, and projection
$Π : { ω , λ } → ω$
In this scheme a roof G becomes a cross section, say $s G$, by setting
$ω → s G ( ω ) = { ω , G ( ω ) } ∈ Ω [ g ]$
we get $Π ( s G ( ω ) ) = ω$, which is necessary for a bundle structure.
The boundary, $∂ Ω [ g ]$, of $Ω [ g ]$ is the union of three disjunct sets:
$∂ 0 Ω [ g ] = { { ω , g ∪ ( ω ) } | g ∪ ( ω ) = g ∩ ( ω ) }$
$∂ − Ω [ g ] = { { ω , g ∪ ( ω ) } | g ∪ ( ω ) ≠ g ∩ ( ω ) }$
$∂ + Ω [ g ] = { { ω , g ∩ ( ω ) } | g ∪ ( ω ) ≠ g ∩ ( ω ) }$
The cross section (38) with $G = g ∪$ maps Ω onto $∂ 0 Ω [ g ] ∪ ∂ − Ω [ g ]$ while the cross section with $G = g ∩$ maps the base space onto $∂ 0 Ω [ g ] ∪ ∂ + Ω [ g ]$.
The fibres degenerate to a point at the boundary part (39) and can be identified with a subset of Ω. By proposition 3.2 that subset consists of the pure point and possibly of some faces at which $g ∪ = g ∩$, and all roof extensions of g coincide and are affine.
Let us consider a face $K ˜$ contained in the “lower” part $∂ − Ω [ g ]$ of the boundary (40). $K ˜$ is convex by definition and compact because of the compactness of $Ω [ g ] pure$. Therefore, the projection K of $K ˜$ to Ω, $Π K ˜ = K$, is convex and compact, see (37). We use the supposed face property: $ω ˜ ∈ K ˜$ implies
$ω ˜ = ∑ p j ω ˜ j ⇒ ω ˜ j ∈ K ˜$
if all $p j > 0$. Writing this out in the manner $ω ˜ = { ω , g ∪ ( ω ) }$, and so on, we arrive at
${ ω , g ∪ ( ω ) } = ∑ p j { ω j , g ∪ ( ω j ) } = { ω , ∑ p j g ∪ ( ω j ) }$
Now we can state
Proposition 3.7
There is a one-to-one correspondence between the faces of $Ω [ g ]$ contained in $∂ 0 Ω [ g ] ∪ ∂ − Ω [ g ]$ and the complete convex leaves of $g ∪$. The cross section $s G$, with $G = g ∪$, maps complete convex leaves of $g ∪$ onto faces of $Ω [ g ]$. The bundle projection Π returns them back to Ω.
For the concave roof things are similar.

#### Example 3.1: Minimum and maximum of g

On $Ω pure$ let $g min$ be the minimum of g and $g max$ its maximum. The convex hull of the set
${ π ∈ Ω pure | g ( π ) = g min }$
is a complete convex leaf of $g ∪$. The convex hull of the set
${ π ∈ Ω pure | g ( π ) = g max }$
is a complete convex leaf of $g ∩$.
Let us again consider the diagonal map $D ( X ) = diag ( X )$ as in (16) and its von Neumann entropy $S ( D ( . ) )$, see (17). $g ( π ) = S ( D ( π ) )$ is the output entropy of the pure state π. The Hilbert space dimension is denoted by d. As well known, the minimum output entropy is zero and the maximal one $log d$.
Things become more refined by restricting the channel onto a face of Ω. As an example we take a short look at the $( d − 1 )$-dimensional subspace $H 0$ which is orthogonal to the vector $| φ 〉 = d − 1 / 2 ∑ | j 〉$. $H 0$ consists of vectors $∑ a j | j 〉$ such that $∑ a j = 0$. $H 0$ supports some pure states satisfying $D ( π ) = d − 1 1$ and the maximal output entropy is $log d$ again.
There is a reasonable conjecture, saying that the minimal output entropy is independent of d and equal to $log 2$. There are $d ( d − 1 ) / 2$ pure states $π j k$, $j < k$. The matrix elements $a n m$ of $π j k$ are $1 / 2$ for $n = m = j$ and $n = m = k$. They are $− 1 / 2$ for $n = j , m = k$ and $m = j , n = k$, and all other entries are zeros. Hence it is evident that $S ( diag ( π j k ) ) = log 2$ and, therefore,
$E D ( ρ ) ≤ log 2 , ρ = 1 d − 1 ( 1 − | φ 〉 〈 φ | )$
because we can represent ρ by a convex combination of the pure states $π j k$. The conjecture asserts that the decomposition is an optimal one. The conjecture rests on the fact that for no other state than $π j k$, supported by $H 0$, the output of the diagonal map is of rank two. Then one applies a theorem of Michelson and Jozsa, see appendix of , reducing in the case at hand the minimization of the entropy to that of minimizing the second elementary symmetric function or, equivalently, to the minimization of the concurrence. For $d = 2$ this is trivial, for $d = 3$ it can be done, for $d > 3$ it is yet a conjecture.
Under the assumption, the conjecture is true, the convex set generated by the $π j k$ is a complete leaf. The minimal length of an optimal decomposition of ρ is equal to $d ( d − 1 ) / 2$.

#### Example 3.2: Again the diagonal map

We return to the example 2.4 to prove the flatness of the Entropy (17) of the diagonal map (16). Hence, since we know from von Neumann’s work the concavity of $ω → S ( D ( ω ) )$, flatness of (17) proves the diagonal map a flat concave roof for all dimensions d of $H$. In other words, for
$g ( π ) = S ( diag ( π ) ) , π ∈ Ω ( H )$
it follows for any density operator ω
$g ∩ ( ω ) = S ( diag ( ω ) )$
Proposition 3.8
Choose $ω ∈ Ω ( H )$. The set $K ω$ of all states ρ with $diag ( ρ ) = diag ( ω )$ is a complete convex leaf of (47). $g ∩$ is a flat concave roof.
The set $K ω$ is compact, convex, and $S ( D ( . ) )$ is constant on it. Transversal to the sets $K ω$ the function $S ( D ( . ) )$ is strictly concave. This excludes that two density operators with different diagonals can belong to a convex leaf. Hence, the sets $K ω$ are convex leaves.

## 4. Wootters’ Method

We are going to describe the fundamental idea in , see also , and its generalizations . After a short introduction to anti-linearity, which is on the heart of the method, we present slightly simplified proofs for a class of convex and concave roofs. For reasons of uniqueness we sometimes write $〈 . , . 〉$ for the scalar product instead of Dirac’s $〈 . | . 〉$.
The use of anti-linearity  goes back, at least in physics, to Wigner. He applied it to the time reversal symmetry  and he discovered the structure of anti-unitary operators, . A highlight in the further development of this line of thinking is the proof of the CPT-theorem within Wightman’s axiomatic quantum field theory.

#### 4.1. Anti-Linearity in Short

We start with some elementary remarks. An anti-linear operator, say ϑ, obeys the rule
$ϑ ( a 1 | ϕ 1 〉 + a 2 | ϕ 2 〉 ) = a 1 * ϑ | ϕ 1 〉 + a 2 * ϑ | ϕ 2 〉$
An important fact follows immediately: Because of $c ϑ = ϑ c *$ the eigenvalues of ϑ form a set of circles. Indeed, if $| x 〉$ is an eigenvector of ϑ with eigenvalue a, then $ϵ | x 〉$, $| ϵ | = 1$, is an eigenvector with eigenvalue $ϵ * a$. Consequently, most of the unitary invariants of linear operators are undefined for anti-linear ones. The trace, for example, does not exist for anti-linear operators.
The Hermitian adjoint $ϑ †$ of an anti-linear operator ϑ is defined by
$〈 ϕ 1 , ϑ † ϕ 2 〉 = 〈 ϕ 2 , ϑ ϕ 1 〉$
There is to set a caution mark: Do not apply an anti-linear operator to a bra in the usual Dirac manner! By (49) one may get absurd results.
A useful class of anti-linear operators are the Hermitian ones. By (49) any matrix representation must result in a complex symmetric matrix. About symmetric matrices see . It follows that the Hermitian anti-linear operators constitute a complex linear space of dimension $d ( d + 1 ) / 2$ if $dim H = d$.
An anti-unitary, V, is an anti-linear operator which is unitary, i.e., satisfies $V † = V − 1$. A conjugation, Θ, is an anti-unitary which is Hermitian. It implies $Θ 2 = 1$. In accordance with what has been said about eigenvalues, one can find an orthogonal basis $| ϕ 1 〉 , ⋯$ such that $Θ | ϕ j 〉 = ϵ j | ϕ j 〉$ with arbitrarily chosen unimodular numbers $ϵ j$.
A conjugation Θ distinguishes a real Hilbert subspace $H Θ$ of $H$ consisting of all Θ-real vectors, $Θ | ψ 〉 = | ψ 〉$.
There is a polar decomposition, $ϑ = V | ϑ |$, for any anti-linear operator ϑ. $| ϑ |$ denotes the positive root $( ϑ † ϑ ) 1 / 2$ and V is an anti-unitary operator. The proof is similar to the linear case .
Now we turn to the case of an anti-linear Hermitian operator $ϑ = ϑ †$. It commutes with the positive (linear!) operator $ϑ 2$ and, therefore, with $| ϑ | = ( ϑ 2 ) 1 / 2$. With a non-singular ϑ we perform $Θ = ϑ − 1 | ϑ |$ the square of which is $1$. As it is Hermitian too, it is a conjugation. We conclude
$ϑ = Θ | ϑ | = | ϑ | Θ , Θ = Θ † = Θ − 1$
By continuity, or by a more detailed analysis, (50) can be verified for all Hermitian and anti-linear ϑ.

#### 4.2. Building Roofs with an Anti-Linear Hermitian ϑ

Let ϑ be Hermitian and anti-linear on a Hilbert space $H$ of dimension d. This setting provides a function
$g ( π ) = | 〈 ψ , ϑ ψ 〉 | , π = | ψ 〉 〈 ψ |$
on the pure states of $Ω ( H )$. Now $g ∩$ and $g ∪$ are well defined and we shall prove:
Proposition 4.1
Let g be as in (51). If ${ λ 1 ≥ λ 2 ≥ ⋯ }$ denote the eigenvalues of $| ω ϑ ω |$, then
$g ∪ ( ω ) = max { 0 , λ 1 − ∑ j > 1 λ j } ,$
$g ∩ ( ω ) = ∑ λ j$
$g ∪$ and $g ∩$ are flat roofs.
At first we simplify the assertion by starting with
$ϑ ω : = ω ϑ ω , | ϑ ω | = ω ϑ ω ϑ ω 1 / 2$
Up to normalization every pure decomposition of ω can be gained from a decomposition of the unit operator $1$,
$ω = ∑ ω π j ω , 1 = ∑ π j$
(51) is 1-homogeneous on the positive rank one operators. Comparing (55) with (25) and (26), it can be seen that
$g ∪ ( ω ) = inf ∑ | 〈 ψ j , ϑ ω ψ j 〉 | , g ∩ ( ω ) = sup ∑ | 〈 ψ j , ϑ ω ψ j 〉 |$
where we have to run through all rank one decompositions of $1$,
$∑ | ψ j 〉 〈 ψ j | = 1$
Now $ϑ ω$ is Hermitian and anti-linear. Therefore there is to any chosen set of phase factors $ϵ 1 , ϵ 2 , ⋯$ a basis $φ 1 , φ 2 , ⋯$ satisfying
$| ϑ ω | | φ j 〉 = λ j | φ j 〉 , ϑ ω | φ j 〉 = λ j ϵ j | φ j 〉$
and the conjugation Θ in the polar decomposition multiplies the j-th basis vector by $ϵ j$.
For the next step we assume the existence of a real $d × d$ Hadamard matrix. Then we can choose a basis ${ | χ i 〉 }$ fulfilling
$| χ i 〉 = 1 d ∑ j a i j | φ j 〉 , a i j = ± 1$
because of the orthogonality and $a k i 2 = 1$ we get for all k
$d 〈 χ k , ϑ ω χ k 〉 = ∑ i j a k i a k j 〈 φ i , ϑ ω φ j 〉 = ∑ ϵ j λ j$
Therefore, by (51), we get
$g ∪ ( ω ) ≤ | ∑ ϵ j λ j | ≤ g ∩ ( ω )$
By varying the unimodular numbers $ϵ j$, which could be chosen arbitrarily, one arrives at
$g ∪ ( ω ) ≤ max { 0 , λ 1 − ∑ j > 1 λ j } , g ∩ ( ω ) ≥ ∑ λ j$
Assuming equality in (62), we see from (60) that ω is a flat point of $g ∩$ and of $g ∪$. If $g ∪ ( ω ) > 0$, then we choose $ϵ 1 = 1$ and $ϵ j = − 1$ for $j > 1$ to obtain from (58) an optimal basis ${ | χ k }$. In the concave case we set $ϵ j = 1$ for all j.
Now we are going to prove equality in (62), starting with $g ∪$. We clearly get an estimation from below in (56) by
$inf | ∑ ξ k | , ξ k = 〈 ψ k , ϑ ω ψ k 〉 |$
Sandwiching with the eigenbasis (58) it yields
$∑ ξ j = ∑ j k 〈 ψ k , ϑ ω φ j 〉 〈 φ j , ψ k 〉 = ∑ j k ϵ j λ j 〈 ψ k , φ j 〉 〈 φ j , ψ k 〉$
and this shows, summing first over k,
$| ∑ k ξ k | = | ∑ k ϵ j λ j |$
Its minimum is attained by the largest of the two numbers 0 and $λ 1 − λ 2 − ⋯$ and the first equation in (54) is true.
Now we prove equality in (62) for $g ∩$. the second equation of (56). It is
$| 〈 ψ k , ϑ ω ψ k 〉 | = | 〈 ψ k , | ϑ ω | ψ k ′ 〉 | , | ψ k ′ 〉 = Θ | ψ k 〉$
We apply Cauchy’s inequality. The result is
$| 〈 ψ k , | ϑ ω | ψ k ′ 〉 | 2 ≤ 〈 ψ k , | ϑ ω | ψ k 〉 〈 ψ k ′ , | ϑ ω | ψ k ′ 〉$
and, because Θ is an involution, hence anti-unitary, we arrive at
$| 〈 ψ k , ϑ ω ψ k 〉 | ≤ 〈 ψ k , | ϑ ω | ψ k 〉$
Summing up we get the trace of $| ϑ ω |$ which upper bounds $g ∩ ( ω )$.
Up to now the proof of proposition 4.1 rests on particular bases ${ | χ k 〉 }$. They exist if there is a real $d × d$ Hadamard matrix, $d = dim H$. To overcome the restriction we go to a larger Hilbert space, $H ⊕ H 0$, for which the proposition has been proved. Then we restrict to the face of density operators supported by the original Hilbert space $H$. Because a (flat) roof remains a (flat) roof if restricted to a face, the proof will become complete.
To do so, we choose $d ′ = dim H ⊕ H 0$ sufficiently large and extend ϑ to $ϑ ′$ by requiring $ϑ ′ | ψ 0 〉 = 0$ for all $| ψ 0 〉 ∈ H 0$. We then choose any conjugation $Θ 0$ on $H 0$ and use $Θ ′ = Θ ⊕ Θ 0$. Now, if there is a real $d ′ × d ′$ Hadamard matrix, we are done.
Hadamard matrices exist in dimensions $d ′ = 2 m$. This suffices for the proof.
Proposition 4.2
Let $g ( π ) = | 〈 ψ , ϑ ψ 〉 |$. Then $g ∪$ and $g ∩$ allow for flat optimal decompositions of length $d ′$ where $d ≤ d ′ = 2 m$. More generally, if $d ≤ d ′$ and there is a real $d ′ × d ′$ Hadamard matrix, then there are flat optimal decompositions of length $d ′$.

#### 4.3. Cases of Application

Indeed, the question is now: How to find a suitable anti-linear Hermitian operator ϑ to calculate concurrence, tangle, and entanglement entropy (as particular cases of entanglement of formation) in $2 × n$ systems. Clearly, this can be fully successful for flat roofs only.
Let T be a trace preserving positive map from the states
$ω ∈ Ω d : = Ω ( H ) , dim H = d$
into the 1-qubit state space $Ω ( H 2 )$. Because the trace of the output, $T ( ω )$ is one, $T ( ω )$ is characterized, up to a unitary transformation, by one variable. It is common to use $4 det T ( ω )$ or its square root to be this variable. Let us abbreviate the convex roofs on $Ω d$, playing a role below. By
$C T = 2 ( det T ) ∪ , ( det T ) ( ω ) = det T ( ω )$
$τ T = 4 ( det T ) ∪$
$E T = ( S T ) ∪ , S T ( ω ) = S ( T ( ω ) )$
(65) is the concurrence, (66) the 1-tangle, and (67) the entanglement of T. (67) is the entanglement of formation if T is a partial trace of a bipartite quantum system. By its very definition we need only the values of $det T$ and of $S T$ for pure input states.
There are some general relation between these three roofs. The first is typical also for more general settings: $C T$ is a positive convex function and so does its square. For pure input states the tangle and the squared concurrence coincide. Hence, because $τ T$ is maximal within all convex extensions, it is not less than $C T 2$. On the other hand, if ω turns out to be a flat point of $C T$, than this remains true for its square. Thus,
Proposition 4.3
For stochastic maps with 1-qubit outputs it holds
$τ T ( ω ) ≥ C T ( ω ) 2$
and equality takes place for flat roof points of $C T$.
Let us now switch to $S T$. With use the abbreviations $η ( x ) = − x log x$ and
$ξ ( x ) = η 1 − y 2 + η 1 + y 2 , 1 = x 2 + y 2$
ξ is defined and continuous for $− 1 ≤ x ≤ 1$ and it is strictly convex. Therefore, ξ is the sup of a family of functions $a x + b$. Inserting a convex function C defined on any convex set with values $− 1 ≤ C ≤ 1$, we get $ξ ( C )$ by a sup of convex functions $a C + b$. Therefore, $ξ ( C )$ is a convex function on the domain of definition of C.
We apply this fact to the concurrence $C T$ yielding:
$ω → ξ T ( ω ) : = ξ ( C T ( ω ) )$
is a convex function on $Ω d$. $C T$ for pure states π we get $2 det T ( π )$. We insert in (70),
$ξ T ( π ) = η ( 1 − 1 − 4 det T ( π ) 2 ) + η ( 1 + 1 − 4 det T ( π ) 2 )$
and one identifies the arguments in η as the two eigenvalues of $T ( π )$.
$ξ T ( π ) = S ( T ( π ) ) , π ∈ Ω d , pure$
proves $ξ T$ to be a convex extension of the pure states output entropies. Reasoning as for proposition 4.3 results in
Proposition 4.4
For stochastic maps with 1-qubit outputs it holds
$E T ( ω ) ≥ ξ ( C T ( ω ) )$
Equality takes place if ω is a flat roof point.
It should be underlined hat there are more and different estimations for concurrence and entanglement of formation in higher dimensions, see [40,41,42,43,44]. In  there is an application to states with only two different, but arbitrarily degenerated eigenvalues.

#### 4.4. How to Find ϑ

There is a general recipe to get the wanted anti-linear operator for channels T mapping the states of a quantum system $H d$ to 1-qubit states. Assume a Kraus representation
$T ( X ) = ∑ A j X A j † , A j : H d ↦ H 2$
There is an additional condition to be fulfilled: T must be Kraus representable with not more than two Kraus operators. But at first we remain within the more general (73).
The key to the following is the existence of the “time reversal” or “spin flip” anti-unitary operator $θ f$ on $H 2$,
$θ f ( c 0 | 0 〉 + c 1 | 1 〉 ) = c 1 * | 0 〉 − c 0 * | 1 〉$
Apart from the obvious
$θ f † = θ f − 1 = − θ f$
the relation
$θ f Y † θ f Y = − ( det Y ) 1 2$
is valid for all operators $Y ∈ B ( H 2 )$. Up to a multiplicative constant, only the spin flip commutes with all $U ∈$ SU(2), It is really a very special anti-unitary operator.
The task is in inserting $Y = T ( X )$ into (75) and to get something similar for $det T$. This goes through particulary nice if X is of rank one, $X = | ψ 2 〉 〈 ψ 1 |$. Calculations show
$det T ( | ψ 2 〉 〈 ψ 1 | ) = ∑ j < k 〈 ψ 1 , ϑ j k ψ 1 〉 〈 ψ 2 , ϑ j k ψ 2 〉 * , | ψ i 〉 ∈ H d$
The anti-linear Hermitian operators $ϑ j k$ are defined by
$ϑ j k = 1 2 A j * θ f A k − A k * θ f A j$
using the Kraus operators $A j$ from any Kraus representation (73) of T. The operators $ϑ j k$ are Hermitian and anti-linear.
In the lucky case of channels (73) with just two Kraus operators, $A 1$ and $A 2$, we get only one operator ϑ by (75) and, hence,
$det T ( | ψ 〉 〈 ψ | ) = | 〈 ψ , ϑ ψ 〉 |$
and, by proposition 4.1, we are done.
Wether and how one can replace the operation $X → ϑ X ϑ$ by an anti-linear stochastic map to obtain a more general roof construction, is unknown. An (implicit) attempt is in  by taking $X → ( X ) 1 − X$ as an higher dimensional substitute for the flip operation.

#### 4.5. Applications

The partial trace of a 2-quibt system can be represented by two Kraus operators: Looking at the operators over $H 2 ⊗ H 2$ as block matrices, the partial trace over the second part is the map
$X = X 00 X 01 X 10 X 11 ⟶ X 00 + X 11 = Y$
One immediately sees a possible choice for the Kraus operators,
$A 1 = 1 2 { 1 2 , 0 2 } , A 2 = 1 2 { 0 2 , 1 2 }$
Now we can compute ϑ according to (77). We get, eventually up to a sign, Wootters’ conjugation, $ϑ = θ w$,
$4 ϑ = θ w = 0 θ f − θ f 0 = θ f ⊗ θ f$
Remark: The concurrence of a 2-qubit system is a flat convex roof. Generally, complete convex leaves consists of a set of flat ones.
Let us choose two operators, $A 1 = A$, $A 2 = B$, from $B ( H 2 )$ so that (73) becomes a 1-qubit channel. Then there are only two eigenvalues, $λ 1$, $λ 2$, of $| ω ϑ ω |$ to respect and (52) simplifies to
$g ∪ ( ω ) = | λ 2 − λ 1 |$
One has to solve a quadratic equation to get the general expression
$1 4 C T ( ω ) 2 = ( ω ϑ ω ϑ ) − 2 ( det X ) ( det ϑ 2 ) 1 / 2$
There are standard forms for 1-qubit channels, [20,47,48,49]. For channels with two Kraus operators one can assume
$A = a 00 0 0 a 11 , B = 0 b 01 b 10 0$
up to unitary equivalence. With this choice ϑ acts as
$ϑ ( c 0 | 0 〉 + c 1 | 1 〉 ) = ( b 10 a 00 c 0 ) * | 0 〉 − ( b 01 a 11 c 1 ) * | 1 〉$
After inserting in the relevant expressions one observes that the concurrence is the restriction of a semi-norm to the state space. Indeed, one obtains, [16,17,
$C T ( X ) = | | b 10 a 00 | x 00 + | b 01 a 11 | x 11 + z x 10 − z * x 01 |$
z is one of the roots of
$z 2 = a 00 * a 11 b 01 b 10 *$

## 5. A Subtraction Procedure

We start with a simple example not covered by Wootters’ method and showing the principle. The idea is to subtract from $det T ( X )$ a suitable multiple of $det ( X )$ to get the squared concurrence or the tangle of a stochastic 1-qubit map T. We choose for T the map
$x 00 x 01 x 10 x 11 ⟶ x 00 + ( 1 − γ ) x 11 0 0 γ x 11$
and consider
$γ x 00 x 11 + γ ( 1 − γ ) x 11 2 − w ( x 00 x 11 − x 01 x 10 )$
With $w = γ$, we arrive at the squared concurrence of T,
$1 4 C T ( X ) 2 = γ ( 1 − γ ) x 11 2 + γ x 01 x 10$
At first (90) is a positive semi-definite quadratic form implying that its square root is convex. Secondly, by its very construction it coincides for pure states π with $4 det T ( π )$. Finally, on the state space, it is a roof. To indicate a general way to proof the roof property we polarize (90) and get
$( X , Y ) T : = γ ( 1 − γ ) x 11 y 11 + 1 2 γ ( x 01 y 10 + x 10 y 01 )$
which is a positive semi-definite bilinear form in the space of Hermitian matrices. For the pure state $π 0 = | 0 〉 〈 0 |$ (90) becomes zero. Applying the Schwarz inequality we get $( X , π 0 ) = 0$ for all X. Hence, with any pure state π,
$ω s = ( 1 − s ) π 0 + s π ⟹ ( ω s , ω s ) = s 2 ( π , π )$
Taking the root we see that $s → ω s$, $0 ≤ s ≤ 1$, is a convex leaf of $C T$ for every π. Now the assertion is proved and, the more, $C T$ is not a flat roof.
One may ask, whether one can diminish w a bit without destroying convexity on Ω. Let us use in (89) the new value $w ′ = γ 2$. Then (89) becomes
$γ x 00 x 11 + γ ( 1 − γ ) x 11 2 − γ 2 ( x 00 x 11 − x 01 x 10 )$
Different to the former case we restrict ourselves to $Ω ( H 2 )$ and respect the condition $x 00 + x 11 = 1$. After some manipulations we arrive at a convex roof which coincides with $det T ( π )$ for pure states. Up to a factor it must be the tangle of T on Ω.
$1 4 τ T ( X ) = γ ( 1 − γ ) x 11 + γ 2 x 01 x 10$
The tangle is affine on the set of density operators with $x 01 =$ constant. As the square of the concurrence is equal to the tangle for pure states, we have the inequality $τ T ( ω ) > C T ( ω ) 2$ for mixed states.
That the ansatz (89) is working generally for the concurrence has been shown first in [16,17,52], by means of the “S-lemma of Yakobovich” and, using the explicit expression for general stochastic 1-qubit maps of , in . The case of the tangle can be read off from .

#### 5.1. Concurrence of Stochastic 1-Qubit Maps

Let T be a stochastic, i.e., a positive trace preserving map. We prove
Proposition 5.1
There is a real number $0 ≤ w ≤ 1$ such that for all $ρ ∈ Ω ( H 2 )$
$1 4 C T ( ρ ) 2 = det T ( ρ ) − w det ρ$
At first we show the w-bounds. With $w < 0$, (94) becomes the sum of two concave functions on Ω and $C T$ could not be convex. To prove $1 ≥ w$ we insert $ρ 0 = ( 1 / 2 ) 1$ and get $det T ( ρ 0 ) ≥ w / 4$. However, $det T ( ρ 0 ) ≤ 1 / 4$ is required by stochasticity. $w > 1$ would be a contradiction.
Next, we consider the expression (94) on the Bloch space of all Hermitian operators of trace one,
$X = 1 2 ( 1 + x 1 σ 1 + x 2 σ 2 + x 3 σ 3 )$
The determinant of X is a quadratic function
$det X = 1 4 ( 1 − x 1 2 − x 2 2 − x 3 2 )$
in the Bloch coordinates. The same is with $det T ( X )$. In the terminology of the S-lemma, the quadratic function $det T ( X )$ is called co-positive with$det X$ because from $det X ≥ 0$ it follows $det T ( X ) ≥ 0$ under the constraint $X = 1$.
The S-lemma, , states the following: Let $q 1$ and $q 2$ be two quadratic functions on $I R n$, not necessarily homogeneous. If $q 1 ( x ) ≥ 0$ implies $q 2 ( x ) ≥ 0$ then $q 2$ is co-positive with $q 1$.
S-lemma
Let $q 1$ be strictly positive at least at one point. Then there exists a real number w such that
$q 2 ( x ) − w q 1 ( x ) ≥ 0 for all x ∈ I R n$
if and only if $q 2$ is co-positive with $q 1$.
Now it is proved: The right expression of (94) can be made non-negative with suitable numbers w for all Hermitian trace one operators. Then, substituting $x j → x j / x 0$ and multiplying by $x 0 2$, we see that
$det Tr ( X ) − w det X ≥ 0$
becomes a positive semi-definite and homogeneous polynomial in the variables $x 0 , x 1 , x 2 , x 3$ for some w. Polarizing(97) as in (91) yields a positive semi-definite symmetric form $( X , Y ) w$ satisfying
$( X , X ) w = det T ( X ) − w det X$
We now may assume (97) for all values w which are bounded by $1 ≥ w 1 ≥ w ≥ w 2 ≥ 0$. We can further assume that $( X , Y ) w$ is degenerated for $w = w 1$ and for $w = w 2$. But wether degenerate or not, (97) implies Cauchy’s inequality
$| ( X , Y ) w | 2 ≤ ( X , X ) w ( Y , Y ) w$
In particular, if $Y = ν$ is in the null-space, $( ν , ν ) w = 0$, then $det T ( ν ) = w det ν$ and $det T ( ν ) ≥ w ′ det ν$ for all allowed $w ′$, i.e.,
$( w − w ′ ) det ν ≥ 0 , w 1 ≥ w ′ ≥ w 2$
Furthermore, $( ν , X ) w = 0$ for all Hermitian X.
We like to show: If $w = w 2$ every density operator ρ is a roof point.
We have to distinguish several cases. The first one is $ν = 0$. Then, with $ρ ∈ Ω$, we define
$ρ s = ρ + s ν$
On this line $( ρ s , ρ s ) W$ is independent of s. Therefore, because of (94), its intersection with Ω is a flat convex leaf of $C T$.
Being Hermitian and not identical zero, $ν = 0$ results in $det ν < 0$. Because of (100) one concludes $w = w 2$.
For the next cases we suppose $ν = 1$ and start with
$ρ s = ( 1 − s ) ν + s ρ$
Then $( ρ s , ρ s ) W = s 2$. If the sign of s does not change while $ρ s$ is inside the Bloch ball, we can take the root and get a convex leaf. This takes place if ν is not an inner point of Ω, i.e., not a properly mixed state.
The condition is certainly satisfied if $det ν = 0$ and ν is a pure state. The condition implies the w-independence of $( ν , ν ) w$ which becomes equal to $det T ( ν )$. As there is only one convex roof $C T$ it is $w 1 = w 2$ necessarily .
ν is outside Ω if $det ν ≠ 0$. We conclude $w = w 2$ from (100) and we get convex leaves by intersecting the lines (102) with the Bloch ball.
In addition one observes: $det ν ≠ 0$ is necessary for $w 1 > w 2$. Indeed, by (100) we see $( ν , ν ) w = 0$ can be satisfied if either $w = w 1$ and $det ν > 0$ or by $w = w 2$ and $det ν < 0$.
Now, all relevant cases are discussed and proposition 5.1 is proved.
In the course of the proof one obtains two more general insights:
Proposition 5.2
The concurrence of stochastic 1-qubit maps is the restriction of a Hilbert semi-norm to the state space.
Every state allows for an optimal decomposition of length two.

#### 5.2. Axial Symmetric Maps, Concurrence

We shall list the subtraction parameter w for the class of axial symmetric stochastic maps. A standard form for them reads
$T ( X ) = α x 00 + ( 1 − γ ) x 11 β x 01 β x 10 γ x 11 + ( 1 − α ) x 00$
with real non-negative parameters $α , β , γ$. The trace preserving is obvious. Positivity requires $0 ≤ α ≤ 1$, $0 ≤ γ ≤ 1$, and
$β 2 ≤ β max 2 : = 1 + 2 α γ − α − γ + 2 α ( 1 − α ) γ ( 1 − γ )$
T is a channel, hence completely positive, if $β 2 ≤ α γ$. To express w one needs the “critical” $β c$
$β c : = 1 + 2 α γ − α − γ − 2 α ( 1 − α ) γ ( 1 − γ )$
Then, see ,
$w = max { β 2 , β c 2 }$
At the bifurcation point $β = β c$ the T-concurrence is affine on Ω. For $β ≥ β c$ the roof is flat. Otherwise it looks similar to the particular one (88). See  for more details.

#### 5.3. Axial Symmetric Maps, Tangle

The tangle for stochastic 1-qubit maps can be found in [16,17]. For 1-qubit channels it is already in . These tangles always allow for optimal decompositions of length two.
The axial symmetric maps (103) can be treated explicitly. The following is due to .
Case A: If $| β | > | α + γ − 1 |$ one has to use $w = β 2$. It results in
$1 4 τ T ( X ) = 1 − β 2 − ( α − γ ) 2 − 2 ( α − γ ) ( α + γ − 1 ) x 3 + [ β 2 − ( α − γ − 1 ) 2 ] x 3 2$
Case B: Here $| β | = | α + γ − 1 |$, a bifurcation point in the parameter space. $w = β 2$ results in
$1 4 τ T ( X ) = 1 − β 2 − ( α − γ ) 2 − 2 ( α − γ ) ( α + γ − 1 ) x 3$
and the tangle becomes affine on the Bloch ball.
Case C: If $| β | < | α + γ − 1 |$ then $w = ( α + γ − 1 ) 2$ and we obtain
$1 4 τ T ( X ) = 1 − ( α + γ − 1 ) 2 − ( α − β ) 2 − 2 ( α − γ ) ( α + γ − 1 ) x 3 + [ ( α + γ − 1 ) 2 − β 2 ] ( x 1 2 + x 2 2 )$

## 6. Symmetries

The use of symmetries is almost obligatory in the treatment of roofs. We present only a small, hopefully helpful, part of it, mainly abstracted from [10,26,27,54,55]. See also .
Let $Ω ( H )$ be the space of states supported by the Hilbert space $H$ of dimension d, and g a real continuous function on $Ω pure$.
A symmetry of Ω is a transformation
$ω → ω V : = V ω V − 1$
V is a unitary or an anti-unitary operator inducing the symmetry (110).
We need the group Γ of all V such that
$π ∈ Ω pure ⟹ g ( π V ) = g ( π )$
Γ is the invariance group of g. A quite obvious statement reads
Proposition 6.1:
If Γ is the invariance group of g, then
$g ∪ ( ω V ) = g ∪ ( ω ) , g ∩ ( ω V ) = g ∩ ( ω )$
for all $V ∈ Γ$ and all $ω ∈ Ω$.
Let K be a convex leaf of $g ∪$. Obviously,
$K V = { ω V | ω ∈ K }$
is a convex leaf of $g ∪$ again. Hence, $K → K V$ permutes the convex leaves.
An interesting subgroup of Γ is the stabilizer group
$Γ K = { V ∈ Γ | K V = K }$
This is a compact group with an invariant Haar measure. say $d K V$. We can perform the invariant integration (“twirling”) over $Γ K$,
$ω ⟶ ω K = ∫ ω V d K V$
There is only one $Γ K$-invariant element in the convex hull of all $ω V$. It is $ω K$.
The map $ω → ω K$ contracts K onto the set
$K stable = { ω ∈ K | ω = ∫ ω V d K V }$
This set, being convex and compact, is the convex hull of its extremal invariant states. Extremal invariant states can be represented by $π K$ with pure $π ∈ K$. Invariant states which are not extremal, cannot be represented in such a way.
Proposition 6.2:
Let K be a convex leaf of $g ∪$ and $Γ K$ its stabilizer group. Then $K ∩ Ω pure$ consists of $Γ K$-orbits.
Every extremal $Γ K$-invariant state of K is of the form $π K$, π pure.
Every $Γ K$-invariant states of K is a convex combination of extremal $Γ K$-invariant states.

#### 6.1. Entanglement of the Diagonal Channel

In example 3.2 we considered the concave roof of (46),
$g ( π ) = S ( diag ( π ) ) , π ∈ Ω ( H )$
Now we look at the convex roof $g ∪$, i.e., at the entanglement $E D$ of $D ( ω ) = diag ( ω )$.
With the exception of $d = 2$ one does not know the structure of $E D$. But there are some insights on highly symmetric, “isotropic” quantum states.
The channel D will be described by the help of a basis $| j 〉$, $j = 1 , ⋯ , d$. To it we associate the vector
$| ψ 〉 = 1 d ∑ | j 〉$
The invariance group, Γ, of (117) consists of the permutations of the chosen basis, eventually followed by the conjugation Θ defined by $Θ | j 〉 = | j 〉$ for all j. The density operator ω commutes with Γ if its matrix elements satisfy
$〈 j | ω | j 〉 = 1 d , 〈 j | ω | k 〉 = x d$
for all j and all $k ≠ j$. x is a real number in the range
$− 1 d − 1 ≤ x ≤ 1$
The restriction is due to the positivity of ω. One often uses the fidelity parameter, F,
$0 ≤ F : = 〈 ψ | ω | ψ 〉 = ( d − 1 ) x + 1 d − 1 ≤ 1$
and we denote the corresponding Γ-invariant density operator by $ω F$.
We choose an allowed value F and write $K ω$ or $K ( F )$ for the complete convex leaf of $ω F$ with respect to $E D$. Now we state the following:
$ρ ∈ K ω ⇒ Θ ρ = ρ Θ$
Θ is the conjugation about the basis ${ | j 〉 }$.
Proof: It suffices to prove the assertion for pure states. We assume that the pure states π and $π ′ = Θ π Θ$ are both in $K ω$. The diagonal parts of them and of $ρ = ( 1 / 2 ) ( π + π ′ )$ are the same. It follows $S D ( ρ ) = S D ( π ) = S D ( π ′ )$ and, because we are inside a convex leaf, we get also $g ∪ ( ρ ) = S D ( ρ )$. Hence, by proposition 3.2, $S D = E D$ on the whole face containing π and $π ′$. This is a contradiction if $π ≠ π ′$.
Indeed, we proved something more:
Proposition 6.3:
Any two different pure states contained in a convex leaf of $E D$ must have different diagonal parts.
The Γ-invariant density operators are ordered by the fidelity parameters as indicated by (120). The more, $F → ω F$ is convexly linear in F. This fact allows to apply proposition 6.2.
Proposition 6.4:
Let $K ( F )$ denote the compact convex leaf of $E D$ belonging to the Γ-invariant density operator $ω F$. There is a subset R of the unit interval as follows:
a) Either we have $F ∈ R$. Then $K ( F )$ consists of flat roof points only. There is no other maximal symmetrical state in $K ( F )$ than $ω F$.
b) Or there are $F − , F + ∈ R$ such that $K ( F )$ is the convex hull of $K ( F − ) ∩ K ( F + )$.
In the case $d = 3$ and if
$1 2 ≤ F ≤ F * * , F * * = 8 9$
or $F = 0$ case a) is true. There is an optimal vector of the form $a | 1 〉 + b | 2 〉 + b | 3 〉$ for any of these F-values. $K ω = K ( F )$ contains not more than three pure states. They become permuted by the action of the invariance group Γ.
For more details, also in higher dimensions, see .

#### 6.2. An Embedding

There are numerous relations between different channels. Some of them provide insight in roof structures. Our main interest is again in the diagonal channels.
Let $H$ and $H ′$ be Hilbert spaces of dimensions d and $d ′ > d$. There are embeddings of $H$ into $H ′$ which relate the entanglement $E D$ and $E D ′$ of the corresponding diagonal maps, D and $D ′$.
For their description we first choose d integers, $m 1 , ⋯ , m d$, such that $d ′ = m 1 + ⋯ + m d$ and enumerate the basis vectors of $H ′$ as
$| j k 〉 , j = 1 , ⋯ , d , k = 1 , ⋯ , m j$
We further choose numbers
$y j , k , j = 1 , ⋯ , d , k = 1 , ⋯ , m j$
satisfying
$∑ k = 1 m j | y j k | 2 = 1 , j = 1 , ⋯ , d$
These data provide a unitary embedding
$| j 〉 → V | j 〉 = ∑ j = 1 d y j k | j , k 〉$
From
$diag ( X ) = { x 11 , ⋯ , x d d }$
we get
$diag ( V X V † ) = { | y 11 | 2 x 11 , ⋯ , | y 1 m 1 | 2 x 11 , | y 21 | 2 x 22 , ⋯ | y 2 m 1 | 2 x 22 , ⋯ }$
One obtains for the entropy of the diagonal channel
$S D ′ ( V X V † ) = ∑ k = 1 m 1 η ( | y 1 k | 2 x 11 ) + ∑ k = 1 m 2 η ( | y 2 k | 2 x 22 ) + ⋯$
The functional equation $η ( x y ) = y η ( x ) + x η ( y )$ allows to rewrite the first sum in (128) into the form
$∑ | y 1 k | 2 η ( x 11 ) + x 11 ∑ η ( | y 1 k | 2 )$
and we get finally
$S D ′ ( V ω V † ) = S D ( ω ) + ∑ j = 1 m 〈 j | ω | j 〉 ∑ k = 1 m j η ( | y j k | 2 )$
A convex roof remains a convex roof if we add a function linear in ω. Plugging (129) into (25) directly provides:
Proposition 6.5:
If $H$ is embedded unitarily in $H ′$ according to (125), then the entanglements of the diagonal channels are related by
$E D ′ ( V ω V † ) = E D ( ω ) + l ( ω )$
and the linear function is given by
$l ( ω ) = ∑ j = 1 m 〈 j | ω | j 〉 ∑ k = 1 m j η ( | y j k | 2 )$
Optimal decompositions are mapped onto optimal decomposition and convex leaves onto convex leaves.
A particular simple example is the embedding
$V | 0 〉 = | 1 〉 , V | 1 〉 = 1 2 ( | 2 〉 + | 3 〉 )$
of $H 2$ into $H 3$. Pairs of pure state vectors, yielding flat optimal decompositions for the entanglement of $D 2$, are
$a 0 | 0 〉 + a 1 | 1 〉 , a 1 * | 0 〉 + a 0 * | 1 〉$
A particular case is $a 0 = 1 / 3$, $a 1 = 2 / 3$. Applying the map (132) results in the optimal pair
$1 3 ( | 1 〉 + | 2 〉 + | 3 〉 ) , 2 3 | 1 〉 + 1 6 ( | 2 〉 + | 3 〉 )$
Call $π 0$ and $π 1$ the pure states determined by the vectors (134). Then
$diag ( π 0 ) = { 1 3 , 1 3 , 1 3 } , diag ( π 1 ) = { 2 3 , 1 6 , 1 6 }$
and the fidelity parameters (120) are 1 and $F * * = 8 / 9$. Returning to the remarks after proposition 6.4, we see a reason why this value should be a bifurcation point for the behavior of maximal symmetric states and their convex leaves.

#### 6.3. A Further Embedding

There are strong relations between the entanglement of the diagonal channels and the entanglement of formation, governed by embedding procedures. A nice and quite simple one is
$V | j 〉 = | j j 〉 = | j 〉 ⊗ | j 〉$
V is a unitary map of $H$ onto the subspace $H ′$ of $H a ⊗ H b$ with basis ${ | j j 〉 }$. The embedding is of interest because of
$D ( X ) : = diag ( X ) = Tr b ( V X V † ) , X ∈ B ( H )$
The relation implies
$E ( ( V ω V † ) ) = E D ( ω ) , ω ∈ Ω ( H )$
and relates the entanglement $E D$ of a diagonal map to the entanglement of formation E for bipartite states supported by $H ′$. This is true for all dimensions $d = dim H$. The vector (117) becomes a completely entangled one, say $| e 〉$, if transformed by (136).
The crucial point is now that invariance group $Γ e$ of $| e 〉 = V | ψ 〉$ is much larger than $V Γ V †$, the invariance group of $| ψ 〉$. $Γ e$ contains the local unitary operators $U ⊗ U ¯$ and the swap operation. The involution $V Θ 〉 † = Θ ⊗ Θ$ is defined originally only on $H ′$. The canonical extension to an involution $Θ e$ of all $H a ⊗ H b$ can be gained by
$Θ e | j k 〉 = | k j 〉$
and the requirement of anti-linearity. $Θ e$ satisfies
$Θ e ( X ⊗ 1 ) | e 〉 = ( X † ⊗ 1 ) | e 〉$
for all $X ∈ B ( H a )$ .
Now one tries to enlarge convex leaves in $H ′$ by transforming them with operators from a suitable larger group $Γ ′ ⊂ Γ e$. The involution $Θ e$ is a symmetry of the entanglement of formation. If a bipartite state ρ commutes with $Θ e$, all elements of the convex leaf of ρ must commute with $Θ e$.
One can understand quite well why $E D$ for maximal symmetric states are so similar to the entanglement of formation for isotropic states. A detailed discussion is not in the frame of the present paper. However, an essential point in the considerations above is in the relation between the entanglement of diagonal channels and the entanglement of formation. This may be of use in future research: It seems easier to imagine the structure of the diagonal channel as that of the partial trace. Nevertheless, the degree of difficulty is about the same.

## 7. Summary and Outlook

Given values $g ( π )$ for pure states π, the direct way of solving the convex roof problem is the search for optimal decompositions. The most prominent and successful examples are the concurrence and the entanglement of formation for 2-qubit bipartite quantum systems. The method goes back to Wootters and is described in section 4. With it one gets analytical expressions and flat optimal decompositions. The flatness of the convex (and concave) roofs inherited from Wootters’ method is rendering its use in higher dimensions.
A quite different way is to look for a maximal convex extension G of g : If for any other extension $G ′$ of g we find $G ′ ( ω ) > G ( ω )$ for a state $ω ∈ Ω$, then $G ′$ cannot be convex.
A further, and more efficient reformulation of the convex roof problem asks for roof points, (see definition 2.1a), of a convex extension G of g. At a roof point ω of a convex extension G one gets $G ( ω ) = g ∪ ( ω )$ and the problem is solved for the particular state ω. Similarly, if G would be a concave extension of g, then $G ( ω ) = g ∩ ( ω )$ for a roof point of G.
This way of proving is used in the chapter on the “subtraction procedure”. One of its merits is the control on the concurrence and on the 1-tangle of any rank two density operator of a $2 × m$ bipartite quantum system. The same is with the slightly more general class of stochastic, (just positive and trace preserving), 1-qubit channels.
Therefore, there is some hope to get the concurrence (and the 1-tangle) for all states of any $2 × m$ bipartite quantum system explicitly.
But even if this becomes true, it does not provide us with the entanglement of formation of a $2 × 3$ system: The concurrence ceases to be flat. However, by proposition 4.1 one can obtain reasonable lower bounds.
Wootters’ and the subtraction method seem to be quite different in spirit. Uniting the strength of both, would be very useful. Also one should look at the subtraction method in more general terms. One can get at least lower bounds on the concurrence for higher dimensional system as shown by two examples in . A more systematic study of the issue seems prospective.
The use of symmetries is well know and efficient in general. In convex roof construction the symmetries of g as of a function on the pure states is what counts. If a state ω is invariant with respect to a symmetry group Γ, its convex leaf, (see definitions 3.2), is the convex hull of a set of Γ-orbits consisting of pure states. The shapes of the leaves can be quite different. However, one would suppose a smooth change of the leaves with the exception of some bifurcation points (or lines ...) at which the dimension of the leaves is changing. Some help comes from embedding a lower dimensional problem into a higher dimensional one. This is shown for the entanglement of the diagonal channel: $E T$ can be computed in any dimension on 2-dimensional subspaces which contain at least one pure diagonal state. On these subspaces $E T$ is the sum of a flat convex roof and a linear function. The study of more examples is certainly desirable.
At this point we can return to the concurrence of the stochastic 1-qubit maps. For them $C T$ is the restriction of a Hilbertian semi-norm to the state space. The proof of proposition 5.1 shows a one to one correspondence between the structure of the foliation and the null-space of the semi-norm. Two stochastic 1-qubit maps come with the same pattern of their convex leaves if the null-spaces of their semi-norms are identical. Indeed, it is the first class of channels and roofs with a complete classification of their convex leaves and their foliation.

## Acknowledgements

I like to thank M. Hellmund for many valuable discussions. I wish to acknowledge the helpful and valuable advices of the referees and of P. Harremoës, Editor-in Chief of Entropy.

## References and Notes

1. Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
2. Vidal, G. Entanglement monotones. J. Mod. Opt. 2000, 47, 355. [Google Scholar] [CrossRef]
3. Bengtsson, I.; Zyczkowski, K. Geometry of Quantum States; Cambridge University Press: Cambridge, UK, 2007. [Google Scholar]
4. Horodecki, R.; Horodecki, P.; Horodecki, M.; Horodecki, K. Quantum entanglement. Rev. Mod. Phys. 2009, 81, 865–942. [Google Scholar] [CrossRef]
5. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 1970. [Google Scholar]
6. Bennett, C.; DiVincenzo, D.; Smolin, J.; Wootters, W. Mixed state entanglement and quantum error correction. Phys. Rev. A 1996, 54, 3824–3851. [Google Scholar] [CrossRef] [PubMed]
7. Hayden, P.; Leung, D.; Winter, A. Aspects of general entanglement. Comm. Math. Phys. 2007, 265, 95117. [Google Scholar]
8. Holevo, A.S. Quantum probability and quantum statistics (in Russian). Probl. Perdeachi Inf. 1973, 9, 3. [Google Scholar]
9. Connes, A.; Narnhofer, H.; Thirring, W. Dynamical entropy of C*-algebras and von Neumann algebras. Comm. Math. Phys. 1987, 112, 681–719. [Google Scholar] [CrossRef]
10. Benatti, F.; Narnhofer, H.; Uhlmann, A. Decompositions of quantum states with respect to entropy. Rep. Math. Phys. 1996, 38, 123–141. [Google Scholar] [CrossRef]
11. Uhlmann, A. Entropy and optimal decompositions of states relative to a maximal commutative subalgebra. Open Sys. Inf. Dyn., 1998, 5, 209–227. [Google Scholar] [CrossRef]
12. Hill, S.; Wootters, W.K. Entanglemant of a pair of quantum bits. Phys. Rev. Lett. 1997, 78, 5022–5025. [Google Scholar] [CrossRef]
13. Wootters, W.K. Entanglement of formation of an arbitrary state of two qubits. Phys. Rev. Lett. 1998, 80, 2243–2248. [Google Scholar] [CrossRef]
14. Uhlmann, A. Fidelity and concurrence between conjugated states. Phys. Rev. A 2000, 62, 032307. [Google Scholar] [CrossRef]
15. Wootters, W.K. Entanglement of formation and concurrence. Quantum Inf. Comput. 2002, 1, 27–47. [Google Scholar]
16. Hildebrand, R. Concurrence of Lorentz-positive maps. arXiv: quant-ph/0612064. 2006. [Google Scholar]
17. Hildebrand, R. Concurrence revisted. J. Math. Phys. 2008, 48, 102108. [Google Scholar] [CrossRef]
18. Pòlik, I.; Terlaky, T. A Survey of the S-Lemma. SIAM Rev. 2007, 3, 371–418. [Google Scholar] [CrossRef]
19. Hellmund, M.; Uhlmann, A. Concurrence of stochastic 1-qubit maps. arXiv:0802.209 (quant-ph).
20. Gorini, V.; Sudarshan, E.C.G. Extreme affine transformations. Commun. Math. Phys. 1976, 46, 43–52. [Google Scholar] [CrossRef]
21. Coffman, V.; Kundu, J.; Wootters, W.K. Distributet entanglement. Phys. Rev. A 2000, 61, 052306. [Google Scholar] [CrossRef]
22. Osborne, T.; Verstraete, F. General monogamy inequaltiy for bipartite qubit entanglement. Phys. Rev. Lett. 2006, 96, 22503. [Google Scholar] [CrossRef]
23. This does not affect the 1-tangle. Though (93) results from a subtraction procedure, it is not an application of the S-lemma: τT becomes negatve for x00 → ∞.
24. If the null space is 2-dimensional the convex leaves of CT become 2-dimensional too. A 3-dimensional null-space forces CT to be constant.
25. Osterloh, A.; Siewert, J.; Uhlmann, A. Tangles of superpositions and the convex-roof extension. Phys. Rev. A 2008, 77, 032310. [Google Scholar] [CrossRef]
26. Vollbrecht, K.G.; Werner, R.F. Entanglement measures under symmetry. Phys. Rev. A 2001, 64, 0623307. [Google Scholar] [CrossRef]
27. Terhal, B.M.; Vollbrecht, K.G. The entanglement of formation for isotropic states. Phys. Rev. Lett. 2000, 85, 2625–2628. [Google Scholar] [CrossRef] [PubMed]
28. Ohya, M.; Petz, D. Quantum Entropy and Its Use; Springer: New York, NY, USA, 1993. [Google Scholar]
29. A flat roof point can allow for several optimal decompositions, some of them flat and some of them not flat.
30. Levitan, L.B. Entropy defect and information for two quantum states. Open Sys. Inf. Dyn. 1994, 2, 319–329. [Google Scholar] [CrossRef]
31. Thirring, W. 1994; unpublished.
32. This is the very reason the present author introduced the notation “convex roof”.
33. If g is not continuous, an optimal decomposition may not exist, even if we allow for infinite length.
34. Mitchison, G.; Jozsa, R. Towards a geometrical interpretation of quantum information compression. Phys. Rev. A 2004, 69, 032304. [Google Scholar] [CrossRef]
35. Anti-linear Operators are also called a “conjugate linear” ones.
36. Wigner, E.P. Über die Operation der Zeitumkehr in der Quantenmechanik. Nachr. Ges. Wiss. Göttingen, Math.-Physikal. Klasse 1932, 31, 546–559. [Google Scholar]
37. Wigner, E.P. Normal form of anitunitary operators. J. Math. Phys. 1960, 1, 409–413. [Google Scholar] [CrossRef]
38. Horn, R.A.; Johnson, C.R. Matrix Analysis; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
39. The polar decomposition for anti-linear operators is equivalent to the Takaki decomposing of symmetric matrices.
40. Chen, K.; Albeverio, S.; Fei, Sh.-M. Entanglement of formation of bipartite quantum states. Phys. Rev. Lett. 2005, 95, 21051. [Google Scholar] [CrossRef]
41. Chen, K.; Albeverio, S.; Fei, Sh.-M. Concurrence of arbitrary dimensional bipartite quantum states. Phys. Rev. Lett. 2005, 95, 040504. [Google Scholar] [CrossRef] [PubMed]
42. Łoziński, A.; Buchleitner, A.; Życzkowski, K.; Wellens, T. Entanglement of 2×K quantum systems. Europhys. Lett. 2003, 62, 168. [Google Scholar] [CrossRef]
43. Mintert, F.; Kuś, M.; Buchleitner, A. Concurrence of mixed bipartite quantum states of arbitrary dimensions. Phys. Rev. Lett. 2005, 95, 260502. [Google Scholar] [CrossRef] [PubMed]
44. Osborne, T. Entanglement for rank-2 mixed states. Phys. Rev. A 2005, 72, 022399. [Google Scholar] [CrossRef]
45. Fei, S.; Li-Jost, X. R-function related to entanglement of formation. Phys. Rev. A 2006, 73, 024302. [Google Scholar] [CrossRef]
46. Rungta, R.; Buzek, V.; Caves, C.M.; Hillery, M.; Milburn, G.J.; Wootters, W.K. Universal state inversion and concurrence in arbitrary dimensions. Phys. Rev. A 2001, 64, 042315. [Google Scholar] [CrossRef]
47. Fujiwara, A.; Algoet, P. One-to-one parametrization of quantum channels. Phys. Rev. A 1999, 59, 3290–3294. [Google Scholar] [CrossRef]
48. Ruskai, M.B.; Szarek, S.; Werner, E. An analysis of completely-positive trace-preserving maps on 2×2 matrices. Lin. Alg. Appl. 2002, 347, 159–187. [Google Scholar] [CrossRef]
49. King, C.; Ruskai, M.B. Minimal entropy of states rmerging from noisy quantum channels. IEEE Trans. Info. Theor. 2002, 47, 1–19. [Google Scholar]
50. Uhlmann, A. On concurrence and entanglement of rank two channels. Open Sys. & Inf. Dyn. 2005, 12, 1–14. [Google Scholar]
51. Uhlmann, A. Concurrence and foliations induced by some 1-qubit channels. Int. J. Theor. Phys. 2003, 42, 983–999. [Google Scholar] [CrossRef]
52. Hellmund, M.; Uhlmann, A. Concurrence and entanglement entropy for stochastic 1-qubit maps. Phys. Rev. A 2009, 70, 052319. [Google Scholar] [CrossRef]
53. Hellmund, M. personal communication, 2009.
54. Benatti, F.; Narnhofer, H.; Uhlmann, A. Optimal decompositions with respect to entropy and symmetries. Lett. Math. Phys. 1999, 47, 237–253. [Google Scholar] [CrossRef]
55. Benatti, F.; Narnhofer, H.; Uhlmann, A. Broken symmetries in the entanglement of formation. Int. J. Theor. Phys. 2003, 42, 983–999. [Google Scholar] [CrossRef]
56. Fei, S.; Jost, J.; Li-Jost, X.; Wang, G. Entanglement of formation for a class of quantum states. Phys. Lett. A 2003, 310, 333. [Google Scholar] [CrossRef]
57. Θe is the so-called modular conjugation with defining vector |e〉.