Next Article in Journal
The Optimality Region for a Single-Machine Scheduling Problem with Bounded Durations of the Jobs and the Total Completion Time Objective
Previous Article in Journal
Optimal Design of Isothermal Sloshing Vessels by Entropy Generation Minimization Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cutting Convex Polytopes by Hyperplanes

1
Department of Pure and Applied Mathematics, Graduate School of Information Science and Technology, Osaka University, Toyonaka, Osaka 560-0043, Japan
2
Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
*
Author to whom correspondence should be addressed.
Mathematics 2019, 7(5), 381; https://doi.org/10.3390/math7050381
Submission received: 5 February 2019 / Revised: 20 April 2019 / Accepted: 21 April 2019 / Published: 26 April 2019

Abstract

:
Cutting a polytope is a very natural way to produce new classes of interesting polytopes. Moreover, it has been very enlightening to explore which algebraic and combinatorial properties of the original polytope are hereditary to its subpolytopes obtained by a cut. In this work, we devote our attention to all the separating hyperplanes for some given polytope (integral and convex) and study the existence and classification of such hyperplanes. We prove the existence of separating hyperplanes for the order and chain polytopes for any finite posets that are not a single chain, and prove there are no such hyperplanes for any Birkhoff polytopes. Moreover, we give a complete separating hyperplane classification for the unit cube and its subpolytopes obtained by one cut, together with some partial classification results for order and chain polytopes.
MSC:
Primary 52B05; Secondary 06A07

1. Introduction

Let P R n be a convex polytope of dimension d and P its boundary. If H R n is a hyperplane, then we write H ( + ) and H ( ) for the closed half-spaces of R n with H ( + ) H ( ) = H . We say that H cuts P if H ( P \ P ) and if each vertex of the convex polytopes P H ( + ) and P H ( ) is a vertex of P . When H ( P \ P ) , it follows that H cuts P if and only if, for each edge e = conv ( { v , v } ) of P , where v and v are vertices of P , and not e H , one has H e { v , v } . Cutting a polytope is a very natural way to produce new classes of interesting polytopes. For example, the hypersimplices are obtained from cutting the unit n cube by hyperplanes of the form x 1 + + x n = k , k + 1 , for some integer 0 k < n , which is a class of very interesting and well-studied polytopes (see for example [1,2,3]). A similar class of interesting polytopes obtained from cutting permutahedrons and in general any graphical zonotopes are studied in [4]. In general, it is a very interesting problem to explore which algebraic and combinatorial properties of P are hereditary to P H ( + ) and P H ( ) . For example, in [5] the study on separating hyperplanes of the edge polytope P G of a finite connected simple graph G is achieved and it is shown that P G is normal if and only if each of P G H ( + ) and P G H ( ) is normal.
In this paper, we look at the problem from another perspective, focusing more on the hyperplane that cuts the polytope. We are interested in the existence and classification of such hyperplanes.
If H cuts P , then we call H a separating hyperplane of P . If H is a separating hyperplane of P , then the decomposition of P via H is
P = ( P H ( + ) ) ( P H ( ) ) .
For example, if [ 0 , 1 ] 3 R 3 is the unit cube, then the hyperplane H R 3 defined by the equation x i + x j = 1 with 1 i < j 3 is a separating hyperplane of [ 0 , 1 ] 3 .
Unless n = d , where d = dim P , two different separating hyperplanes H and H of P might yield the same decomposition of P . For example, if [ 0 , 1 ] 2 R 3 is the square, then its separating hyperplane defined by x 1 + x 2 + x 3 = 1 and that defined by x 1 + x 2 x 3 = 1 clearly yield the same decomposition of [ 0 , 1 ] 2 .
An integral convex polytope is a convex polytope whose vertices have integer coordinates. Let P R n be an integral convex polytope of dimension d and suppose that P Z n is the set of vertices of P . It then follows that a hyperplane H R n is a separating hyperplane of P if and only if each of the subpolytopes P H ( + ) and P H ( ) is integral of dimension d.
The study of existence and classification for any general internal convex polytopes can be very hard. In the present paper, we focus our study on the following classes of polytopes: The unit cube and its subpolytopes cut by one hyperplane, order and chain polytopes, and Birkhoff polytopes. We prove the existence of separating hyperplanes for the order and chain polytopes for any finite posets that are not a single chain (Theorem 2), and prove there are no separating hyperplanes for any Birkhoff polytopes (Theorem 3). Moreover, we give a complete separating hyperplane classification for the unit cube and its subpolytopes cut by one hyperplane (Section 1), together with partial classification results for order and chain polytopes (Section 2).

2. The Unit Cube

Let [ 0 , 1 ] d R d be the unit cube with d 2 . In the study of its separating hyperplane H it is assumed that H passes through the origin of R d . First, we discuss the question when a hyperplane H of R d passing through the origin
H : a 1 x 1 + + a d x d = 0 ,
where each a i Q , is a separating hyperplane of [ 0 , 1 ] d .
Lemma 1.
A hyperplane (1) is a separating hyperplane of [ 0 , 1 ] d if and only if there exist p and q with a p > 0 and a q < 0 and all nonzero coefficients of H have the same absolute value.
Proof. 
(“If”) Let e be an edge of [ 0 , 1 ] d . Then there is 1 i d with
e = { ( x 1 , , x d ) : 0 x i 1 and x j = ε j for all j i } ,
where each ε j belongs to { 0 , 1 } . Suppose that there exist p and q with a p > 0 and a q < 0 and that all nonzero coefficients of H have the same absolute value. By relabeling the subscripts (if necessary), we may assume that
H : x 1 + + x s x s + 1 x s + t = 0 ,
where s > 0 , t > 0 and s + t d . If i > s + t , then either H e = e or H e = . If i s + t , then either H e = or H e is a vertex of [ 0 , 1 ] d . Thus, each of [ 0 , 1 ] d H ( + ) and [ 0 , 1 ] d H ( ) is integral. Moreover, since s > 0 and t > 0 , it follows that H ( 0 , 1 ) d . Hence H is a separating hyperplane of [ 0 , 1 ] d .
(“Only if”) If every coefficient a i of (1) is nonnegative, then H [ 0 , 1 ] d consists only of the origin. Hence H cannot be a separating hyperplane of [ 0 , 1 ] d . Thus, there exists p and q with a p > 0 and a q < 0 .
Now, suppose that there exist i j with a i 0 , a j 0 and | a i | | a j | . Let, say, | a i | < | a j | . Let e be the edge defined by x i = 1 and x k = 0 for all k with k { i , j } . If a i a j < 0 , then 0 < a i / a j < 1 and v = ( v 1 , , v n ) e with v j = a i / a j belongs to H . Thus, H cannot be a separating hyperplane of [ 0 , 1 ] d . Hence a i a j > 0 . In particular | a p | = | a q | . Let 1 k d with a k > 0 . Then, since a k a q < 0 , it follows that | a k | = | a q | . Similarly, if a k < 0 , then | a k | = | a p | . Consequently, | a k | = | a p | ( = | a p | ) for all k with a k 0 , as desired. □
By virtue of Lemma 1, it follows that up to permuting the coordinates, a separating hyperplane of [ 0 , 1 ] d passing through the origin is of the form
x 1 + + x s x s + 1 x s + t = 0
with s > 0 , t > 0 and s + t d . Moreover, in (2), by replacing x s + i with 1 x s + i for 1 i t , we can work with a separating hyperplane of [ 0 , 1 ] d of the form
x 1 + + x s + x s + 1 + + x s + t = t .
Finally, Equation (3) can be rewritten as
x 1 + + x k = , 2 k d , 1 < k .
If H k , is a separating hyperplane defined by (4) of [ 0 , 1 ] d , then
[ 0 , 1 ] d H k , ( + ) = { ( x 1 , , x d ) [ 0 , 1 ] d : x 1 + + x k } ,
[ 0 , 1 ] d H k , ( ) = { ( x 1 , , x d ) [ 0 , 1 ] d : x 1 + + x k } .
In [ 0 , 1 ] d H k , ( ) , again, by replacing x i with 1 x i for 1 i k , it follows that since 0 < k < k , each of the subpolytopes [ 0 , 1 ] d H k , ( ± ) is, up to unimodular equivalence, of the form
Q k , = { ( x 1 , , x d ) [ 0 , 1 ] d : x 1 + + x k } , 2 k d , 1 < k .
It follows easily that when ( k , ) ( k , ) , two subpolytopes Q k , and Q k , cannot be unimodularly equivalent. Hence
Corollary 1.
The number of convex polytopes of the form [ 0 , 1 ] d H ( ± ) , where H is a separating hyperplane of [ 0 , 1 ] d is, up to unimodular equivalence, d ( d 1 ) / 2 .
We now turn to the problem of finding separating hyperplanes of the subpolytope Q k , of [ 0 , 1 ] d . We say that a separating hyperplane of Q k , (of the form (5)) is a second separating hyperplane of [ 0 , 1 ] d following the hyperplane H k , .
Lemma 2.
Each of the separating hyperplanes of [ 0 , 1 ] d is of the form
i I x i j J x j = h ,
where I [ d ] , J [ d ] , I J = and where h 0 is an integer with 0 h < ( I ) .
Proof. 
Let v = ( v 1 , , v d ) be a vertex of [ 0 , 1 ] d . Let I [ d ] and J [ d ] with I J = [ d ] and I J = such that v i = 0 if i I and v j = 1 if j J . Let
H : a 1 x 1 + + a d x d = j J a j ,
with each a i Q , be a separating hyperplane of [ 0 , 1 ] d passing through v. In (6) replace x j with 1 x j for j J , and the hyperplane
H : i I a i x i + j J ( a j ) x j = 0
is a separating hyperplane of [ 0 , 1 ] d passing through the origin. It then follows from Lemma 1 that all nonzero coefficients of (7) have the same absolute value. Thus, each of a i ’s and a j ’ belongs to { 0 , ± 1 } .
It turns out that Equation (6) is
H : p I x p q J x q = h ,
where I [ d ] , J [ d ] , I J = and where h 0 is an integer. If h ( I ) , then [ 0 , 1 ] d H ( + ) or [ 0 , 1 ] d H ( ) . Hence 0 h < ( I ) . If 0 h < ( I ) , then
( H ( + ) \ H ) [ 0 , 1 ] d , ( H ( ) \ H ) [ 0 , 1 ] d .
Thus, H is, in fact, a separating hyperplane of [ 0 , 1 ] d . □
Let H R d be a second separating hyperplane of [ 0 , 1 ] d following (4). Clearly H is a separating hyperplane of [ 0 , 1 ] d . It then follows from Theorem 2 that
H : i I x i j J x j = h ,
where I [ d ] , J [ d ] , I J = and where h 0 is an integer with 0 h < ( I ) .
Theorem 1.
A hyperplane H of (8) is a second separating hyperplane of [ 0 , 1 ] d following (4) if and only if one of the following conditions is satisfied:
  • ( J ) + h + k ( X ) ;
  • ( I ) h + k ( Y ) ,
where X = I [ k ] and Y = J [ k ] .
Proof. 
Let P R d denote the subpolytope (5) of [ 0 , 1 ] d . Then a hyperplane H of (8) is a second separating hyperplane of [ 0 , 1 ] d following (4) if and only if one has H [ 0 , 1 ] d P .
(“If”) Let v = ( v 1 , , v d ) [ 0 , 1 ] d belong to H , i.e., i I v i j J v j = h . If ( J ) + h + k ( X ) , then
v 1 + + v k = i X v i + i [ k ] \ X v i i I v i + i [ k ] \ X v i = h + j J v j + i [ k ] \ X v i h + ( J ) + k ( X ) .
Hence v P . If ( I ) h + k ( Y ) , then
v 1 + + v k = i Y v i + i [ k ] \ Y v i i J v i + i [ k ] \ Y v i = h + j I v j + i [ k ] \ Y v i h + ( I ) + k ( Y ) .
Hence v P .
(“Only if”) Let ( J ) + h + k ( X ) > and ( I ) h + k ( Y ) > . We claim the existence of v = ( v 1 , , v d ) [ 0 , 1 ] d with v H such that v 1 + + v k > .
Let ( I ) ( J ) + h . Then 0 h < ( I ) ( J ) + h . Thus, there is v [ 0 , 1 ] d belonging to H with v i = 1 for all i I such that if j Y and j J \ Y then v j v j . Such v H [ 0 , 1 ] d can be chosen with v i = 1 for all [ k ] \ ( X Y ) . Then
v 1 + + v k ( X ) + min { ( I ) h , ( Y ) } + ( [ k ] \ ( X Y ) ) = ( X ) + min { ( I ) h , ( Y ) } + k ( X ) ( Y ) = min { ( I ) h , ( Y ) } + k ( Y ) = min { ( I ) h + k ( Y ) , k } .
Since ( I ) h + k ( Y ) > and k > , it follows that v 1 + + v k > .
Let ( I ) > ( J ) + h . Then there is v [ 0 , 1 ] d belonging to H with v j = 1 for all j J such that if i X and i I \ X then v i v i . Such v H [ 0 , 1 ] d can be chosen with v i = 1 for all [ k ] \ ( X Y ) . Then
v 1 + + v k ( Y ) + min { ( J ) + h , ( X ) } + ( [ k ] \ ( X Y ) ) = ( Y ) + min { ( J ) + h , ( X ) } + k ( X ) ( Y ) = min { ( J ) + h , ( X ) } + k ( X ) = min { ( J ) + h + k ( X ) , k } .
Since ( J ) + h + k ( X ) > and k > , it follows that v 1 + + v k > . □
Corollary 2.
Let
x 1 + + x d = , 1 < d
be a separating hyperplane of [ 0 , 1 ] d . Then a hyperplane
x 1 + + x s = x s + 1 + + x s + t + h , 0 h < s
is a second separating hyperplane of [ 0 , 1 ] d following (9) if and only if one of the following conditions is satisfied:
  • d s ( t + h ) ;
  • d ( t + h ) s .

3. Order and Chain Polytopes

Let P = { ξ 1 , , ξ d } be a finite partially ordered set ( poset for short). To each subset W P , we associate ρ ( W ) = ξ i W e i R d , where e 1 , , e d are the unit coordinate vectors of R d . In particular ρ ( ) is the origin of R d . A poset ideal of P is a subset I of P such that for all ξ i and ξ j with ξ i I and ξ j ξ i , one has ξ j I . An antichain of P is a subset A of P such that ξ i and ξ j belonging to A with i j are incomparable. We say that ξ j covers ξ i if ξ i < ξ j and ξ i < ξ k < ξ j for no ξ k P . A chain ξ j 1 < ξ j 2 < < ξ j of P is called saturated if ξ j q covers ξ j q 1 for 1 < q . A maximal chain is a saturated chain such that ξ j 1 is a minimal element and ξ j is a maximal element of the poset.
The order polytope of P is the convex polytope O ( P ) R d which consists of those ( x 1 , , x d ) R d such that 0 x i 1 for every 1 i d together with
x i x j
if ξ i ξ j in P.
The chain polytope of P is the convex polytope C ( P ) R d which consists of those ( x 1 , , x d ) R d such that x i 0 for every 1 i d together with
x i 1 + x i 2 + + x i k 1
for every maximal chain ξ i 1 < ξ i 2 < < ξ i k of P.
One has dim O ( P ) = dim C ( P ) = d . The number of vertices of O ( P ) is equal to that of C ( P ) . Moreover, the volume of O ( P ) and that of C ( P ) are equal to e ( P ) / d ! , where e ( P ) is the number of linear extensions of P ([6] (Corollary 4.2)). It also follows from [6] that the facets of O ( P ) are the following:
  • x i = 0 , where ξ i P is maximal;
  • x j = 1 , where ξ j P is minimal;
  • x i = x j , where ξ j covers ξ i .
And that the facets of C ( P ) are the following:
  • x i = 0 for all ξ i P ;
  • x i 1 + + x i k = 1 , where ξ i 1 < < ξ i k is a maximal chain of P.
Moreover, we have the following descriptions for vertices, which will be used frequently in this section.
Lemma 3 ([6]).
(1) 
Each vertex of the order polytope O ( P ) is of the form ρ ( I ) , where I is a poset ideal of P.
(2) 
Each vertex of the chain polytope C ( P ) is of the form ρ ( A ) , where A is an antichain of P.
Please note that in [6], “dual poset ideals” are employed instead of poset ideals. However, no essential difference arises.

3.1. Existence of Separating Hyperplanes for Order and Chain Polytopes

In this subsection, we study the existence of separating hyperplanes of order polytopes and chain polytopes (Theorem 2). First we need an explicit description of edges in terms of vertices.
Recall that the comparability graph Com ( P ) of P is the finite simple graph on the vertex set { ξ 1 , , ξ d } whose edges are those { ξ i , ξ j } with i j for which ξ i and ξ j are comparable in P. In general, we say that a nonempty subset Q = { ξ k 1 , , ξ k q } of P is connected inP if the induced subgraph of Com ( P ) on { ξ k 1 , , ξ k q } is connected.
Lemma 4.
Let I and J be poset ideals of P with I J . Then conv ( { ρ ( I ) , ρ ( J ) } ) forms an edge of O ( P ) if and only if I J and J \ I is connected in P.
Proof. 
If there exists a maximal element ξ i of P not belonging to I J , then conv ( { ρ ( I ) , ρ ( J ) } ) lies in the facet x i = 0 . If there exists a minimal element ξ j of P belonging to I J , then conv ( { ρ ( I ) , ρ ( J ) } ) lies in the facet x j = 1 . Hence, working with induction on d, we may assume that I J = P and I J = .
Let neither I = nor J = . Then P is the disjoint union of I and J. Now, suppose that conv ( { ρ ( I ) , ρ ( J ) } ) is an edge of O ( P ) . Then there exists a supporting hyperplane H of O ( P ) defined by the equation h ( x ) = i = 1 d a i x i = 1 with each a i Q such that H O ( P ) = conv ( { ρ ( I ) , ρ ( J ) } ) . Since x i I a i = x j J a j = 1 , one has i = 1 d a i = 2 . In particular h ( ρ ( P ) ) > 1 and h ( ) < 1 . Thus, H cannot be a supporting hyperplane of P. In other words, conv ( { ρ ( I ) , ρ ( J ) } ) cannot be an edge of P.Hence, if conv ( { ρ ( I ) , ρ ( J ) } ) is an edge of P, then either I = or J = . Let I = and J = P . Suppose that P is disconnected and that conv ( { ρ ( ) , ρ ( P ) } ) is an edge of P. Again, there exists a supporting hyperplane H of O ( P ) defined by the equation h ( x ) = i = 1 d a i x i = 0 with each a i Q such that H O ( P ) = conv ( { ρ ( ) , ρ ( P ) } ) . Let, say, h ( ρ ( I ) ) > 0 for those poset ideals I with I and I P . Since P is disconnected, there exist poset ideals I and J with I J = and I J = P . Since h ( ρ ( I ) ) > 0 and h ( ρ ( J ) ) > 0 , it follows that h ( ρ ( P ) ) = h ( ρ ( I ) ) + h ( ρ ( J ) ) > 0 , a contradiction. Thus, P must be connected.
Conversely, suppose that I = and J = P and that P is connected. Let ξ i 1 , , ξ i q be the maximal elements of P and A i j the set of those elements y P with y < ξ i j . Let k { i 1 , , i q } . Then we write b k for the number of i j ’s with ξ k A i j . Let b i j = ( A i j ) . We then claim that the hyperplane H of R d defined by the equation h ( x ) = i = 1 d b i x i = 0 is a supporting hyperplane of O ( P ) with H O ( P ) = conv ( { ρ ( ) , ρ ( P ) } ) . Clearly h ( ρ ( P ) ) = h ( ρ ( ) ) = 0 . Let I be a poset ideal of P with I and I P . What we must prove is h ( ρ ( I ) ) > 0 . To simplify the notation, suppose that I { ξ i 1 , , ξ i q } = { ξ i 1 , , ξ i r } , where 0 r < q . If r = 0 , then h ( ρ ( I ) ) > 0 . Let 1 r < q and J = j = 1 r ( A i j ξ i j ) . Then J is a poset ideal of P and h ( ρ ( J ) ) h ( ρ ( I ) ) . We claim h ( ρ ( J ) ) > 0 . One has h ( ρ ( J ) ) 0 . Moreover, h ( ρ ( J ) ) = 0 if and only if no z J belongs to A i r + 1 A i q . Now, since P is connected, if follows that there exists z J with z A i r + 1 A i q . Hence h ( ρ ( J ) ) > 0 . Thus, h ( ρ ( I ) ) > 0 , as desired. □
Lemma 5.
Let A and B be antichains of Pwith A B . Then conv ( { ρ ( A ) , ρ ( B ) } ) forms an edge of C ( P ) if and only if ( A \ B ) ( B \ A ) is connected in P.
Proof. 
If A B P and if ξ i A B , then conv ( { ρ ( A ) , ρ ( B ) } ) lies in the facet x i = 0 . Furthermore, if A B = P and A B , then ξ j A B is isolated in P and ξ j itself is a maximal chain of P. Thus, conv ( { ρ ( A ) , ρ ( B ) } ) lies in the facet x j = 1 . Now, suppose that A B = P and A B = . Then ( A \ B ) ( B \ A ) = A B = P .
Let conv ( { ρ ( A ) , ρ ( B ) } ) be an edge of C ( P ) and H a supporting hyperplane of C ( P ) defined by h ( x ) = i = 1 d a i x i = 1 , where each a i Q , with H C ( P ) = conv ( { ρ ( A ) , ρ ( B ) } ) and C ( P ) H ( + ) . If P is disconnected and if A 1 B 1 and A 2 B 2 are antichains of P, where A is the disjoint union of A 1 A 2 and B is the disjoint union of B 1 B 2 , then h ( ρ ( A 1 B 1 ) ) < 1 and h ( ρ ( A 2 B 2 ) ) < 1 . Hence h ( ρ ( A B ) < 2 . However, since h ( ρ ( A ) ) = 1 and h ( ρ ( B ) ) = 1 , one has h ( ρ ( A B ) ) = 2 , a contradiction. Thus, conv ( { ρ ( A ) , ρ ( B ) } ) cannot be an edge of C ( P ) . Hence P must be connected if conv ( { ρ ( A ) , ρ ( B ) } ) is an edge of C ( P ) .
Now, suppose that P is connected. If there exist x , x A and y , y B with x < y and y < x , then P cannot be connected. We assume y < x if x A and y B are comparable. For each ξ i A we write a i for the number of elements y B with y < ξ i . For each ξ j B we write b j for the number of elements z A with ξ j < z . Clearly ξ i A a i = ξ j B b j = q , where q is the number of pairs ( x , y ) with x A , y B and x < y . Let h ( x ) = ξ i A a i x i + ξ j B b j x j and H the hyperplane of R d defined by h ( x ) = d . Then h ( ρ ( A ) ) = h ( ρ ( B ) ) = q . We claim that for any antichain C of P with C A and C B , one has h ( ρ ( C ) ) < q . Let C = A B with A A and B B . Since P = A B is connected and since C is an antichain of P, it follows that ξ i A a i + ξ j B b j < q . Thus, h ( ρ ( C ) ) < q , as desired. □
Now we ask the question whether there exists a separating hyperplane of an order polytope as well as that of a chain polytope.
Lemma 6.
Let ξ i , ξ j P with ξ i ξ j and H i , j the hyperplane of R d defined by the equation x i = x j . Then the following conditions are equivalent:
(i)
H i , j is a separating hyperplane of O ( P ) ;
(ii)
H i , j intersects the interior of O ( P ) ;
(iii)
ξ i and ξ j are incomparable in P.
Proof. 
The implication (i) ⇒ (ii) is obvious. Suppose (ii). Then there exist poset ideals I and J of P with ρ ( I ) H i , j ( + ) \ H i , j and ρ ( J ) H i , j ( ) \ H i , j . In other words, there exist poset ideals I and J of P with ξ i I \ J and ξ j J \ I . Thus, in particular ξ i and ξ j are incomparable in P. Hence (ii) ⇒ (iii) follows.
Suppose (iii). Let I be the poset ideal of P consisting of those y P with y ξ i and J the poset ideal of P consisting of those y P with y ξ j . Since ξ i and ξ j are incomparable in P, it follows that ξ i J and ξ j I . Thus ρ ( I ) H i , j ( + ) \ H i , j and ρ ( J ) H i , j ( ) \ H i , j . Hence H i , j intersects the interior of O ( P ) . Let, in general, I and J be poset ideals of P with ρ ( I ) H i , j ( + ) \ H and ρ ( J ) H i , j ( ) \ H . In other words, ξ i I \ J and ξ j J \ I . Hence I ¬ J and J ¬ I . Lemma 4 then guarantees that conv ( { ρ ( I ) , ρ ( J ) } ) cannot be an edge of O ( P ) . Hence H i , j is a separating hyperplane of O ( P ) , as desired. □
Lemma 7.
Let H be the hyperplane of R d defined by the equation i = 1 d x i 1 = 0 . Then the following conditions are equivalent:
(i)
H is a separating hyperplane of C ( P ) ;
(ii)
H intersects the interior of C ( P ) ;
(iii)
P is not a chain.
Proof. 
The implication (i) ⇒ (ii) is obvious. Suppose (ii). Since the origin ρ ( ) of R d belongs to H ( ) \ H , there is an antichain A of P with ρ ( A ) H ( + ) \ H . Then ( A ) 2 . Thus, P cannot be a chain. Hence (ii) ⇒ (iii) follows.
Suppose (iii). One has an antichain A of P with ( A ) 2 . Then ρ ( A ) H ( + ) \ H and ρ ( ) H ( ) \ H . Hence H intersects the interior of C ( P ) . Clearly ρ ( ) is a unique vertex of C ( P ) belonging to H ( ) \ H . Let B be an antichain of P with ρ ( B ) H ( + ) \ H . Thus, ( B ) 2 . Since B = ( \ B ) ( B \ ) is disconnected in P, Lemma 5 says that conv ( { ρ ( ) , ρ ( B ) } ) cannot be an edge. Hence H is a separating hyperplane of C ( P ) , as desired. □
By virtue of Lemmas 6 and 7, it follows immediately that
Theorem 2.
Let P be a finite poset, but not a chain. Then each of the order polytope O ( P ) and the chain polytope C ( P ) possesses a separating hyperplane.

3.2. Description of Separating Hyperplanes for Order and Chain Polytopes

In this subsection, we study the necessary and sufficient conditions for the hyperplane H defined by
h ( x ) = c 1 x 1 + c 2 x 2 + + c d x d = 0
to become a separating hyperplane for certain order polytopes and for certain chain polytopes. This study can be very difficult for general posets. Therefore, we focus on the following three basic posets: Disjoint chains; binary trees (assume connected); and zigzag posets (assume connected, a zigzag poset is a poset where its graph looks like a zigzag path, see examples of zigzag posets in Proposition 3). Notice that there are no “X” shape in all the three classes of posets, therefore their chain polytopes and order polytopes are unimodular equivalent ([7]). In this subsection, we will focus on order polytopes, and all results are also true for chain polytopes.
First, by the definition of separating hyperplanes, together with Lemmas 3 and 4 about the descriptions of the vertices and edges for order polytopes, we have the following description.
Lemma 8.
H is a separating hyperplane for O ( P ) if and only if the following two properties are satisfied:
1.
there exist two poset ideals I and J such that h ( ρ ( I ) ) > 0 and h ( ρ ( J ) ) < 0 (getting two nontrivial subpolytopes);
2.
h ( ρ ( I ) ) h ( ρ ( J ) ) 0 , for each pair of poset ideals I and J such that ( I \ J ) ( J \ I ) is connected in P.
We call a pair of poset ideals I and J that does not satisfy the second property in Lemma 8 a bad pair for h, i.e., h ( ρ ( I ) ) h ( ρ ( J ) ) < 0 and ( I \ J ) ( J \ I ) is connected in P. In the rest of this subsection, we will prove most necessary conditions for being a separating hyperplane by constructing bad pairs. We are looking for posets which have the following property.
Consider the following three properties of the hyperplane H defined by h ( x ) = 0 .
Property 1.
1.
There exist two minimal elements ξ i and ξ j such that c i > 0 and c j < 0 ;
2.
All nonzero coefficients have the same absolute value, i.e., c i { 0 , 1 , 1 } after rescaling, for all i = 1 , 2 , , d ;
3.
All the coefficients c i , where ξ i is minimal element of P, uniquely determine the other coefficients.
In what follows, we discuss whether Property 1 can be a necessary and sufficient condition for H to be a separating hyperplane of the order polytope O ( P ) .
Note that once Property 1 is a necessary and sufficient condition for H to be a separating hyperplane of O ( P ) , we can easily check whether a given hyperplane is a separating hyperplane of O ( P ) . Moreover, assume all coefficients for minimal elements are not zero, then the total number of separating hyperplanes of O ( P ) will be 2 # { of min elements in P } , since each minimal element can only choose to be positive or negative, and then all other coefficients are uniquely determined by the minimal elements. Among the three classes of posets we mentioned: disjoint chains, connected binary trees and connected zigzag posets, only disjoint chains satisfy Property 1. We will provide counter examples for the other two posets and give the best possible results under certain conditions.
Proposition 1.
For the order polytope O ( P ) , where P consists of disjoint chains, Property 1 is a necessary and sufficient condition for H to be a separating hyperplane.
Proof. 
We first prove that all three conditions listed in Property 1 are necessary for H to be a separating hyperplane.
  • By Lemma 8 (1), there exists one order ideal I of P , such that h ( ρ ( I ) ) > 0 . We assume I is connected, otherwise we look at the chain decomposition of P = C 1 C r and consider I C i , for i = 1 , , r . At least one of the intersections is nonempty and satisfies h ( ρ ( I C i ) ) > 0 . Now back to the case when I is connected. Since I is a chain, there exists a unique minimal element i in I. We claim that c i 0 , where c i is the coefficient of x i in H . In fact, if c i < 0 , I and J = { i } is a bad pair. Actually, here we can assume c i > 0 , since in the case c i = 0 , we can simply throw this element away from the poset and look at the new minimal element in the subposet P \ { i } . Since the whole I cannot have all coefficients zero, we will just assume c i 0 . Similarly, we also have another minimal element j with c j < 0 .
  • We first prove that nonzero coefficients of the minimal elements need to have the same absolute value. For example, consider the following poset.
    Mathematics 07 00381 i001
    Without loss of generality, pick c a > 0 , c b < 0 . Suppose | c b | > | c a | . Let I = { a } , J = { b , a } . Then ( I , J ) is a bad pair. So, we need | c b | = | c a | . Considering all pairs of minimal elements with opposite signs, it follows that their coefficients have the same absolute value.
    Now consider the pair I = { a } , J = { a , d } , to make ( I , J ) not bad, we need c d c a = c b . Consider the pair I = { b } , J = { b , a , d } , we have c d 0 . Then consider the pair I = { a , d } , J = { b , a , d } , since we want to avoid zero coefficient, assume c d 0 , therefore we have c d c a . Therefore, we need c d = c a . For the same reason, we have c g = c a . Now consider c e . Similar to above, the pairs ( { a } , { a , d , e } ) , ( { b } , { b , a , d , e } ) and ( { a , d , e } , { b , a , d , e } ) provide c e = c a . Continuing this way, we can show that the signs along each chain need to alternate and their coefficients have the same absolute value.
  • We have just shown in the previous part that given the coefficients of the minimal elements, there exists a unique way to extend the coefficients to other elements (assume avoiding zero coefficients), which is exactly Property 1 (3).
Now we want to show that if a hyperplane H satisfies the three conditions listed in Property 1, then H is a separating hyperplane. Condition (1) guarantees part (1) in Lemma 8. Now we want to show that there is no bad pair. For any pair of poset ideals ( I , J ) , if J \ I is connected, then J \ I is a segment in a chain. By the necessary conditions on the coefficients of H , i J \ I c i { 1 , 0 , 1 } . As a result, no matter what the value of h ( v I ) is, we always have h ( v I ) h ( v J ) 0 . □
Proposition 2.
For binary trees, the following are true:
1.
Property 1 (1) is necessary.
2.
Property 1 (2) is not necessary.
3.
Assume a separating hyperplane H satisfying Property 1 (1) and (2), then (3) is also necessary.
4.
However, all three conditions in Property 1 together are not sufficient for a hyperplane to be a separating hyperplane.
Proof. 
  • We want to show that there exist two minimal elements i and j such that c i > 0 and c j < 0 . The argument in the proof for the disjoint union of chains also works here. The key point is that for any connected poset ideal I in the binary tree and one of its minimal element i, I \ { i } is still connected in P .
  • The argument that all the minimal elements have the same absolute value still holds as in the disjoint union of chains. However, it is possible that not all elements have the same absolute value. For example, consider the hyperplane as the following labelled represented poset, where the label for an element i in P is the coefficient c i in H . We can check that there are no bad pairs for H , thus H is a separating hyperplane. However, not all coefficients in H have the same absolute value.
    Mathematics 07 00381 i002
  • Now assume all coefficients have the same absolute value, and thus can only take value from { 1 , 0 , 1 } after rescaling. So here we only need to talk about the sign for an element i in P (+ refers to c i = 1 and − refers to c i = 1 ). Now we want to show that the sign of an element is determined by the sign of its two children. Here “the sign of the child" refers to the sign of the poset ideal generated by that child. In particular, there are exactly six local sign patterns:
    Mathematics 07 00381 i003
    Notice that 0 appears if and only if its children have a + and a −. For two elements a , b with a common parent d,
    (a)
    suppose c b = c a = 1 . Let e be a minimal element with c e = 1 . Then by the pair ( I = { e } , J = d , e ) (J is the poset ideal generated by d and e), we have h ( ρ ( J ) ) 0 , and thus c d = 1 . This corresponds to the second tree above, and the same for the first tree.
    (b)
    suppose c b = c a > 0 . Then by the pair ( { b } , { a , b , d } ) and ( { a } , { a , b , d } ) , we have c d = 0 , which corresponds to the third tree above.
    (c)
    suppose c b > 0 and c a = 0 . This indicates that a is larger than some minimal element e with c e < 0 . Then by the pair ( { e } , d ) and ( { b } , d ) , we have h ( ρ ( d ) ) = 0 , thus c d = c b , which corresponds to the forth tree above. The fifth and the sixth tree can be obtained in a similar way.
  • Following the above rule will not always result in a separating hyperplane. For example, consider the hyperplane represented by the following labelled poset.
    Mathematics 07 00381 i004
    One can easily check that the above hyperplane follows the six local rules listed above as well the other two conditions in Property 1. However, for example, I = g , e , f and J = a , b , c , d , e , f is a bad pair.
 □
Proposition 3.
For the zigzag posets, Property 1 (1) is not necessary for H to be a separating hyperplane. However, for any hyperplane H with Property 1 (1), the other two conditions listed in Property 1 are necessary and sufficient conditions for H to be a separating hyperplane.
Proof. 
The following example is a separating hyperplane but does not satisfy Property 1 (1).
Mathematics 07 00381 i005
Now assume H is a hyperplane satisfying Property 1 (1). We first prove that if H is a separating hyperplane, then both Property 1 (2) and (3) are true.
  • We want to prove that all the nonzero coefficients in any separating hyperplane for a zigzag poset have the same absolute value. First notice that all the minimal elements have the same absolute value, as proved in Proposition 1. Following the same proposition, all the non-maximal elements (if nonzero) have the same absolute value. As for the maximal elements, let us have a closer look at the zigzag poset. One maximal element m covers at most two minimal elements p , q . For the case m only covers one minimal element, the coefficient c m needs to have the same absolute value for the same reason as disjoint chains proved in Proposition 1. Now there are two cases when m covers two minimal elements p , q :
    (a)
    c p · c q < 0 . Let I = m be the poset ideal generated by m. Consider the pair I and J, where J = { p } or { q } . We have h ( ρ ( I ) ) = 0 , which implies | c m | 1 .
    (b)
    c p · c q > 0 . Say c p = c q = 1 . Let n be a maximal element adjacent to m that covers two minimal elements with different signs. For example,
    Mathematics 07 00381 i006
    Consider the poset ideal I = m , n . Similar to the previous case, we have h ( ρ ( I ) ) = 0 , which still implies | c m | 1 .
  • Since H satisfies conditions (1) and (2) in Property 1, once we fix the signs of all the minimal elements, all elements except those maximal are uniquely determined the same way as the disjoint chains (Proposition 1). As for the maximal elements, they are uniquely determined by the signs of their two children the same as the binary trees (Proposition 2).
Now we want to prove that any hyperplane h ( x ) = 0 satisfying the three conditions listed in Property 1 is a separating hyperplane. The condition (1) in Property 1 implies condition (1) in Lemma 8. Now we want to show that there are no bad pairs. Notice that by the rules described above, any connected component has value sum to { 1 , 0 , 1 } . In the case I J , if h ( ρ ( I ) ) < 0 , then h ( ρ ( J ) ) = h ( ρ ( I ) ) + h ( ρ J \ I ) < 1 , since h ( ρ ( J \ I ) ) < 1 . Now we claim that for the zigzag poset, the condition that ( I \ J ) ( J \ I ) is connected, implies that I \ J or J \ I is empty. Consider a generic connected subposet S = ( I \ J ) ( J \ I ) . We want to show that S I or S J . If S only has one maximal element, then it is clear that all the elements belong to the same order ideal as the maximal element (either I or J). If there are more than one maximal element, see the following example.
Mathematics 07 00381 i007
Consider two adjacent maximal elements (they are a and b in the example). These two maximal elements cover a common minimal element d, because this subposet is connected. Then d belongs to the same poset ideal as both a and b. Therefore, both a and b belong to the same poset ideal. This shows that S belongs to either I or J. □

4. Birkhoff Polytopes

The Birkhoff polytope B n is defined to be the convex hull of all n × n nonnegative real matrices with row sum and column sum equal to one. These matrices are known as the doubly stochastic matrices. Here we consider an n × n matrix as a n 2 -vector. The Birkhoff polytope is a well-studied polytope and have many applications, in combinatorial optimization and Bayesian statistics, for example [8,9]. In this section, we look for separating hyperplanes for B n (Theorem 3).
In the rest of the section, we assume the hyperplanes have the form
h ( x ) = c 1 x 1 + + c n 2 x n 2 = 0 ,
but actually all the results hold for general hyperplanes h ( x ) = r for any constant r. We start with the following known properties of the Birkhoff polytope B n . Here we use both the one-line notation and the cycle notation for a permutation. For example, w = 34256187 is the one-line notation for the permutation sending 1 3 , 2 4 , 3 2 , 4 5 , 5 6 , 6 1 , 7 8 and 8 7 . The cycle notation for w is ( 132456 ) ( 78 ) , thus w has two cycles.
  • dim B n = ( n 1 ) 2 ;
  • B n has n ! vertices, which are the n × n permutation matrices;
  • permutations w and u form an edge in B n if and only if w 1 u has one cycle (excluding the fixed points), [10].
In particularly, for n = 3 , w 1 u has one cycle for any w , u S 3 with w u . In other words, the skeleton graph for B 3 is the complete graph K 6 . Therefore, there are no separating hyperplanes for B 3 . Moreover, we have
Lemma 9.
B 4 has no separating hyperplanes.
Proof. 
Suppose there exists a separating hyperplane with coefficients indicated in the following matrix:
a b c d e f g h i j k m n o p .
We use x w to represent the vector corresponding to the permutation matrix for a permutation w. By symmetry, assume h ( x id ) > 0 . The identity permutation is connected with all other permutations except for three with two cycles ( 12 ) ( 34 ) , ( 13 ) ( 24 ) and ( 14 ) ( 23 ) . Then for any permutation w that is not the above three, we have h ( x w ) 0 , and the only possible u’s with h ( x u ) < 0 are among the above three. Without loss of generality, assume h ( x ( 12 ) ( 34 ) ) < 0 . Then note that the permutation ( 12 ) ( 34 ) is connected to all other permutations except for id, ( 13 ) ( 24 ) and ( 14 ) ( 23 ) . Therefore, h ( x v ) = 0 for all permutations v with one cycle.
Now notice that h ( x ( 12 ) ( 34 ) ) + h ( x ( 13 ) ( 24 ) ) = h ( x 2143 ) + h ( x 3412 ) = ( e + b + o + ) + ( i + n + c + h ) = ( e + n + c + ) + ( i + b + o + h ) = h ( x 2413 ) + h ( x 3142 ) = h ( x ( 1243 ) ) + h ( x ( 1342 ) ) = 0 , therefore, h ( x ( 13 ) ( 24 ) ) > 0 . Similarly, we can get h ( x ( 14 ) ( 23 ) ) > 0 . However, then 0 < h ( x ( 13 ) ( 24 ) ) + h ( x ( 14 ) ( 23 ) ) = ( i + n + c + h ) + ( m + j + g + d ) = ( i + n + g + d ) + ( m + j + c + h ) = h ( x ( 1324 ) ) + h ( x ( 1423 ) ) = 0 , a contradiction. Therefore, there does not exist any separating hyperplane. □
Remark 1.
Even though Lemma 9 is a special case of Theorem 3, we still state it separately as a lemma, since its proof provides a good example for Theorem 3.
Theorem 3.
B n has no separating hyperplanes.
Proof. 
Assume there is a hyperplane h ( x ) = 0 . By symmetry, assume h ( x id ) > 0 . Since all permutations with one cycle are connected with id, we have h ( x u ) 0 for all u with one cycle. Suppose h ( x v ) < 0 for some permutation v with k cycles. Assume k is the smallest such number. In other words, h ( x w ) 0 , for all w with fewer than k cycles. Notice that k > 1 . First notice that h ( x σ ) = 0 , for all σ connected with v, and have fewer cycles than v. In fact, since σ has fewer than k cycles, we have h ( x σ ) 0 . On the other hand, since σ is connected with v, h ( x σ ) > 0 cannot happen. Therefore, h ( x σ ) = 0 .
Now we apply the method in Lemma 9 to show that h ( x v ) < 0 cannot happen. Write in cycle notation v = ( C 1 ) ( C 2 ) ( C 3 ) ( C k ) , where each C i is some sequence of numbers. Without loss of generality, assume C 1 = 125 A and C 2 = 34 B , where A and B are sequences of numbers. First consider the permutation τ 1 = ( 325 A ) ( 14 B ) C 3 C k . Notice that
h ( x v ) + h ( x τ 1 ) = h ( x τ 2 ) + h ( x τ 3 ) ,
where τ 2 = ( 125 A 34 B ) C 3 C k and τ 3 = ( 325 A 14 B ) C 3 C k . One can check that τ 2 and τ 3 are both connected with v, in fact τ 2 differs with v by ( 13 ) and τ 3 differs with v by ( 24 ) . Since τ 2 and τ 3 also have fewer than k cycles, we proved earlier that h ( x τ 2 ) = 0 and h ( x τ 3 ) = 0 . Therefore, h ( x τ 1 ) > 0 .
Now consider the permutation σ 1 = ( C 3 ) ( C k ) . Since it has fewer than k cycles, we have h ( x σ 1 ) 0 . Notice that
h ( x τ 1 ) + h ( x σ 1 ) = h ( x σ 2 ) + h ( x σ 3 ) ,
where σ 2 = ( 325 A ) C 3 C k and σ 3 = ( 14 B ) C 3 C k . One can check that σ 2 and σ 3 are both connected with v. Since σ 2 and σ 3 both have fewer cycles than v, we have h ( x ( σ 2 ) ) = 0 and h ( x σ 3 ) = 0 . This is a contradiction, since h ( x τ 1 ) > 0 and h ( x σ 1 ) 0 . □

Author Contributions

Both authors contribution equally on the result.

Funding

The first author is partially supported by JSPS KAKENHI 19H00637.

Acknowledgments

The authors are grateful for reviewers’ comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, N. Ehrhart h*-vectors of hypersimplices. Discret. Comput. Geom. 2012, 48, 847–878. [Google Scholar] [CrossRef]
  2. Lam, T.; Postnikov, A. Alcoved polytopes I. Discret. Comput. Geom. 2007, 38, 453–478. [Google Scholar] [CrossRef]
  3. Stanley, R. Eulerian partitions of a unit hypercube. In Higher Combinatorics; Aigner, M., Ed.; Reidel: Dordrecht, The Netherlands; Boston, MA, USA, 1977; p. 49. [Google Scholar]
  4. Li, N.; Postnikov, A. Slices of graphical zonotopes. in preparation.
  5. Hibi, T.; Li, N.; Zhang, Y. Separating hyperplanes of edge polytopes. J. Comb. Theory Ser. A 2013, 120, 218–231. [Google Scholar] [CrossRef] [Green Version]
  6. Stanley, R. Two poset polytopes. Discret. Comput. Geom. 1986, 1, 9–23. [Google Scholar] [CrossRef]
  7. Hibi, T.; Li, N. Unimodular equivalence of order and chain polytopes. Math. Scand. 2016, 118, 5–12. [Google Scholar] [CrossRef]
  8. Linderman, S.; Mena, G.; Cooper, H.; Paninski, L.; Cunningham, J. Reparameterizing the Birkhoff Polytope for Variational Permutation Inference. In Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), Lanzarote, Canary Islands, Spain, 9–11 April 2018. [Google Scholar]
  9. Lim, C.; Wright, S. Beyond the Birkhoff Polytope: Convex Relaxations for Vector Permutation Problems. In Proceedings of the Neural Information Processing Systems Conference 2014, Montreal, QC, Canada, 8–13 December 2014. [Google Scholar]
  10. Brualdi, R.; Gibson, P. Convex polyhedra of doubly stochastic matrices. I. Applications of the permanent function. J. Comb. Theory Ser. A 1977, 22, 194–230. [Google Scholar] [CrossRef] [Green Version]

Share and Cite

MDPI and ACS Style

Hibi, T.; Li, N. Cutting Convex Polytopes by Hyperplanes. Mathematics 2019, 7, 381. https://doi.org/10.3390/math7050381

AMA Style

Hibi T, Li N. Cutting Convex Polytopes by Hyperplanes. Mathematics. 2019; 7(5):381. https://doi.org/10.3390/math7050381

Chicago/Turabian Style

Hibi, Takayuki, and Nan Li. 2019. "Cutting Convex Polytopes by Hyperplanes" Mathematics 7, no. 5: 381. https://doi.org/10.3390/math7050381

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop