1. Introduction
The polyhedral cage, a p-cage for short, is a relatively new concept. It was first introduced to describe the geometry of an artificial protein cage [
1]. Made out of 24 seemingly regular hendecagons, the geometry of the experimentally obtained cage looked impossible, but it turned out that the faces are actually nearly regular with deformations so small that they cannot be observed with the naked eye. Nevertheless, despite this irregularity, such a structure can be made experimentally.
This observation led us to formalise the concept of polyhedral cages to identify which other geometries could be generated experimentally [
2]. A p-cage consists of an assembly of regular or nearly regular, but planar, polygons with holes between them. The polygons share some of their edges with their neighbour faces, like the face of polyhedron, but some edges are adjacent to a hole. To describe objects that are cage-like, we depend upon two further requirements. First, we require that each face has at least three neighbours. This is to avoid chicken wire-like structures. We also require that of two adjacent polygon edges, at most one of them, can be shared with a neighbour face. A p-cage is said to be regular if all its faces are regular polygons. On the other hand, if some of its faces are nearly regular, the p-cage is said to be a near-miss. We also restrict ourselves to convex p-cages.
Our main aim was to identify potential geometries for artificial protein cages. All the virus capsids found in nature essentially have the structure of Platonic solids [
3]. Artificial protein cages have, on the other hand, more general symmetries or near-symmetries [
1,
4,
5,
6,
7]. J. Heddle has created in his laboratory several artificial protein cages made from TRAP proteins [
8]. These TRAP proteins are modified so as to form so called rings made out of typically 11, 12 or 13 TRAP proteins which exhibit the geometry of polygons with 11, 12 or 13 edges [
9]. These rings assemble to form several structures. For example, the 11 ring forms protein cages made out of 24 rings but also smaller ones made out of 12 rings [
1,
5]. While chemical and physical properties of the assembling proteins must obviously be taken into account [
10,
11,
12], identifying the possible geometries for protein nano-cages will help the selection of protein candidates to form such cages.
The motivation for making such artificial protein cages is to develop new methods of targeted drug delivery by enclosing the desired drug inside a protein capsid which will contain, on its outer side, receptors which will in turn allow it to penetrate inside the target type of cells [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23] such as cancer cells, for example [
24]. Once inside the cell, the protein cage will open to release its cargo [
25,
26]. The advantage of using artificial capsids is that they provide a larger number of options but also reduce the risk of the protein cage being recognised by the immune system of the patient [
24,
27,
28,
29,
30].
Joining proteins to form artificial cages is not easy and requires suitable geometries and binding properties between them [
31,
32,
33,
34]. Some protein cages are built using symmetry properties [
35,
36,
37,
38,
39], while others break known symmetries [
40].
A p-cage is homogeneous symmetric [
41] if for any pair of faces there is a congruent automorphism of the p-cage that maps one face onto the other. This means that, as for Platonic solids, all the faces are equivalent. We later defined bi-symmetric p-cages as p-cages made out of two families of faces, so that for any two faces belonging to the same family, there is a congruent automorphism of the p-cage that maps one face onto the other [
2]. Homogeneous bi-symmetric p-cages are made out of the same type of faces, but most bi-symmetric p-cages are made of two different types of polygons.
Polyhedral cages can be seen as a generalisation of regular polyhedra or polytopes [
42]. With the restrictions which we have imposed, most p-cages are sphere-like with small holes and so are good candidates for encapsulating some cargo. Mathematically, it is interesting to compute their Euler characteristic
C, which is defined as [
43]
where
V is the number of vertices,
E the number of edges, and
F the number of faces. For a polyhedron
C is always 2. This implies that for a p-cage,
, where
H is the number of holes of the p-cages. Indeed, if a p-cage is such that each of its faces share only one edge per hole, the hole can be considered as the face of a containing polyhedron. If
denotes the number of faces of the p-cage, we have
and so
To generate more general p-cages, we must add one or more vertices to the edges of the above p-cage. Doing so increases
V and
E by 1 for each extra vertex while keeping
and
H unchanged, and so, by (
1),
C remains unchanged, proving that
for all p-cages.
All the homogeneous symmetric p-cages with deformations not exceeding 10% were identified in [
2,
41]. In [
44], we identified all the bi-symmetric p-cages such that the faces of a given family are only adjacent to the faces of the other family. In [
45,
46], we have identified all the bi-symmetric p-cages with faces adjacent to four other faces that contribute to, at most, three edges to each hole. In this paper, we complete this later study by identifying all the bi-symmetric p-cages for which the faces of one family have three neighbours each, while the other faces have three, four, five or six neighbour faces. To avoid p-cages with large holes, we also restrict ourselves to p-cages where its faces contribute to at most only three of the edges to any given hole.
It is worthwhile at this point to describe how nano-bio-chemical engineers should use the results of this study, as well as those described in our previous papers. The results presented in this study are purely geometric and describe the structures that can be realised with nano-protein cages. If a research group has two types of polygon-shaped proteins with
and
edges, one can identify from the lists we provide which p-cage shapes can be achieved, if any. For example, there are none for
and
but there are quite a few for
and
. The Heddle lab [
9] can make
,
, and
polygons with TRAP proteins, but there is no reason why other polygons cannot be achieved with other proteins.
The 10% deformation criteria was chosen in [
2] to be large enough to include all potentially realisable geometries. In [
5,
9] the deformations of some of the obtained protein cages were shown to be above 3%.
The large number of geometries for p-cages implies that there is potentially a very large number of different protein nano-cages that can be realised. They will have to be realised with a range of different proteins and that diversity should lead to a larger range of cargo that can be encapsulated inside these nano-cages. This will obviously require many further experimental works.
Many aspect of the protein nano-cages are bio-chemical problems well beyond the scope of our study. For example, the holes can either be plucked with receptors if left as they are, but this is for the designer to decide.
Drug encapsulation is also a bio-chemical problem related directly to the type of protein used to make nano-cages and so it is not covered at all in our study which is purely geometric. To determine the actual geometry of protein nano-cages, see the paper from the Heddle lab [
4,
8,
9].
2. Methodology
The main method to categorise and classify p-cages is to join the faces at the centre of a p-cage with lines; these lines define a polyhedron which corresponds to the dual of the p-cage. This is illustrated for a specific p-cage made out of nanogons and decagons in
Figure 1a. We call these polyhedra the hole-polyhedra and they describe the characteristics of the p-cages as follows. The faces describe the structure of the p-cage hole. So a triangle corresponds to a hole formed by 3 faces and a square to a hole made out from 4 faces…. This is represented by red lines in
Figure 1a. The vertices of the hole-polyhedra correspond to the p-cage faces while the edges describe how these faces are connected together. So if a vertex joins 4 edges, it corresponds to a p-cage face, sharing 4 of its edges with neighbour faces.
The hole-polyhedra can be projected onto a plane to obtain the corresponding planar graph, as illustrated in
Figure 1b. A p-cage made out of 2 families of equivalent families will then correspond to a planar graph made out of 2 families of nodes such that each node of a given family can be mapped to any other node of the same family via an automorphism of the graph.
As we do not allow the adjacent edges of the p-cage faces to be both shared with neighbours, each face must have a number of edges adjacent to holes. A given face can contribute an arbitrary number of its edges to any hole, aside from the fact that in order for the p-cage faces to be equivalent the symmetry of the planar graph must impose some restrictions. In
Figure 1b we have indicated in the corners of the planar graph the number of edges contributing to each hole for the p-cage shown in
Figure 1a.
The equivalence between the p-cage faces of each family suggests that the automorphisms mapping the nodes of the planar graphs must also leave the number of hole-edges, denoted in the graph corners, as invariant.
To classify the p-cage faces with two families of equivalent faces, we must consider all the planar graphs made out of two families of equivalent nodes. Then we must assign a number of hole-edges to each corner of these planar graphs so that the equivalence between the nodes is preserved. As we will illustrate with a few examples later, this can lead to a very large number of configurations to consider.
Planar graphs where all the nodes are equivalent are called Cayley graphs [
47] and Maschke proved [
48] that the only planar Cayley graphs are the graphs of the regular solids, i.e., the prism, antiprism, Platonic solids, and Archimedean solids. Planar graphs made out of 2 families of equivalent nodes were classified in [
49] where it was shown that there are over 400 such graphs when considering only those with
faces no larger than decagons.
In [
44] we have classified all the bi-symmetric p-cages where all the faces of a given family are only adjacent to faces of the other family. There were only 9 graphs to consider and most of them lead to p-cages with very large holes. In [
46] we have classified all the bi-homogenous symmetric p-cages where each face has at least 4 neighbours and their holes are made out of at most 4 faces.
In this paper we complete that study by characterising all the bi-homogenous symmetric p-cages where the faces of one family has 3 neighbours faces while the others have 3, 4, 5 or 6 neighbours faces. We consider all the graphs corresponding to such p-cages [
49] one by one. Each planar graph corresponds to a hole-polyhedron and we start by constructing the dual of that polyhedron which we call the tilling polyhedron. Notice that these tilling polyhedron are made out of 2 types of planar non-regular polygons. Moreover, in most cases, they are not unique as the faces can be deformed while preserving the equivalence between the faces of each family. We call this polyhedra the tilling polyhedra because the p-cage can be obtained by inserting the polygonal face onto its faces.
Because the p-cage faces belong to 2 families of equivalent faces, we only need to consider one representative for each family, which we call the reference faces, with the other faces being obtained by applying the rotation that maps these faces together.
We denote
and
as the number of edges for the face belonging to family I and II, respectively. Following [
44,
45], the family of type I is chosen to be the one with 3 neighbours.
The vertices of the reference faces are denoted
for
, ordered anticlockwise when seen from outside the p-cage. The normal to the reference face of type
j,
, is chosen to point towards the outside of the p-cage. The edges are then given by the vectors
so that the edge lengths are
. We can then determine the inside angles between adjacent edges,
, by evaluating
and
As we are only interested in convex faces we must have
. For a regular P-gon, the angles between the adjacent edges are
and the edges all have the same length. To minimise the deformations of the p-cage face we must evaluate all the
and
and adjust the coordinates of the reference face so as to minimise the difference between the
and
and the corresponding values for a regular polygon. We achieve this by minimising the energy
where
where
,
, and
and weight factors which, as will be shown later, can be adjusted to obtain p-cages with different levels of deformations. In (
6), the first term evaluates the deviation of the edge lengths
from the target length
L while the second term does the same for
deviations from that of a regular P-gon.
is given explicitly by
where
is the Heaviside function. Notice that for a concave polygon (
7) is 0.
is obtained by computing
, the centre of face
i and by choosing 2 adjacent faces,
and
, with normal vectors
and
, respectively. The p-cage is convex if for all pairs of adjacent faces, the distance between the centres of the 2 faces is smaller than the distance between
and
. This allows us to define
as
where
is the set of faces, of any type, adjacent to the reference face
j. For convex p-cages
is 0.
To obtain p-cages that are minimally irregular but convex, we used a simulated annealing algorithm [
50], varying the parameters of the p-cages to minimise
E and hence determine optimal configurations for each p-cage.
To ensure that the p-cage and its faces are all convex, we have taken very large values for
and
so that the last two terms in (
5),
, and
dominate the energy contribution when they are non null. In (
5), we divide the energy by
to make the parameterisation of the numerical optimising algorithm easier to tune.
To describe the regularity of the p-cages, we use the following two quantities:
To balance the edge length and angle deformations, the weight
and
were varied but with the constraints
. We started with the values
and perform a full simulated annealing relaxation. We then adjusted
and
using a gradient algorithm to minimise
, performing a full simulated annealing relaxation for each value of
and
. In units of energy (
5), the temperature decreased from
to
by the multiplicative factor
, using 2000 randomisations per degree of freedom for each temperature.
The reference faces can be described by a vector
normal to the face, where
is used to label the two faces’ families, as well as 2 orthonormal basis vectors
and
. Any point on the plane span by the face can then be described as
. These vectors can be parameterised by the angles
and
as well as the length
as follows:
As all the faces
k of a given family are related to the corresponding reference via a rotation
R, the normal and basis vectors for this plane are given by
Notice that
is the unit matrix for the reference faces.
The edges of the p-cage faces are obtained by parameterising the lines intersecting the plane span by any two pairs of adjacent faces. Given
,
,
and
,
,
, the vectors spanning two adjacent planes, we first define
where
and
and the normal vectors are normalised to 1, and
is a vector parallel to the line intersecting the 2 planes.
Multiplying (
12) by
, we identify the point
on the intersecting line
which is perpendicular to
. This leads to a relation between the parameters
and
as well as
and
. The values of
are obtained by multiplying (
12) by
. Inserting the result back into (
12), we obtain
Given the intersecting lines between 2 pairs of adjacent faces, we will also need to determine the intersecting point between these 2 lines:
We achieve this by imposing the condition
Then, multiplying (
15) by
and also
leads to the following system of two linear algebraic equations
This can easily be solved to find
The most efficient way to perform the p-cage configuration optimisation is to determine the coordinates of the tilling solid so that the initial face configuration can be obtained by tilling the solid faces. The reason is that it can often be tricky to find the tilling solid coordinates from the hole-polyhedron planar graph and we only need to perform this once for every planar graph considered rather than for every possible p-cage corresponding to that planar graph (there can be a thousand of such p-cages to consider).
Some of the hole-polyhedra are regular solids or the duals of a regular solid. In this case the tilling solid is known. The other hole-polyhedra are either axially symmetric solids or their planar graphs are obtained by adding nodes to the planar graph of a regular solid [
49]. The construction of the tilling solids was detailed in [
46] and we shall not repeat it here.
Given a tilling solid, the normal of its reference face is used as the initial normal vector for the p-cage reference faces.
Choosing different polygons for the p-cage faces leads to a number of p-cages for each hole-polyhedron graph. When a
P-gon face shares
n edges with as many neighbours there remain
edges to distribute between the holes. This repartition of the hole-edges must preserve the equivalence between the faces of each family. We can determine how this can be performed for each graph by labelling the number of hole-edges using the symbols
a,b,c,d,e and
A,D,C,D,E,F for, respectively, the type I and II families. The is carried out via construction, as illustrated in a few example below. The algorithm used to list all the hole-edge configurations for a given graph, avoiding duplications, was the same as in [
46]. The python program used for this is available on zenodo as part of all the computer programs used to generate the p-cages (see the zenodo reference at the end).
We illustrate this in 3 cases shown in
Figure 2. In the first case, the hole-polyhedron is a triangular pyramid where the top node is of type II while the 3 sides are of type I. The dual of the hole-polyhedron, the tilling solid, is also a triangular pyramid where the base is of type II and the side of type I.
Figure 2a corresponds to the planar graph of the tilling solid, so we must place the hole-edge labels in the corner of each face. The symmetry is a rotation around the main axis of the pyramid. As a result, the hole-edge labels for the type II face must be identical so that each of the type I faces are adjacent with the same configuration. We then place an
A at each corner of the base face. The type I faces are mapped onto each other by a rotation around the main pyramid axis and so they must all have the same label at the top vertex which we name
a. We then place the remaining labels
b and
c anticlockwise around each type I face.
The second hole-polyhedron is a triangular prism,
Figure 2b. The type I and II faces cover, respectively, both bases of the prism. In this case, it is easy to see that the tilling solid consists of two triangular pyramids joined at their bases which we label in the hole-polyhedron planar graph. The symmetry is again a rotation around the main axis of the prism. The three nodes of type I must then have the same edge label on the top base of the prism, similarly for type II nodes and the bottom base. We then label them, respectively, as
a and
A. The other labels are then placed anticlockwise around each node.
The third hole-polyhedron has the symmetry of a square prism,
Figure 2c. The type II nodes are the 8 nodes of the square prism graph while the type I faces are on the side of the prism. The symmetry is again another rotation symmetry around the main axis of the prism as well as a
rotation around the centre of the prism faces. The type II faces must then have the same label on the prism base, which we call
A, and we then label the other corners around these type II nodes anticlockwise. We label the three corners around one of the type I nodes anticlockwise
a,
b, and
c. Applying the rotation around the main axis of the prism to these labels, we obtain the labelling of 3 more nodes. We can then apply the
rotation to these first four nodes to obtain the labelling of the remaining four. The tilling solid corresponding to the hole-polyhedron is shown in
Figure 2d.
The fourth hole-polyhedron is a cuboctahedron where a truncated pyramid has been added to the square faces; see
Figure 2e. The type II and type I faces correspond, respectively, to the nodes of the cuboctahedron and the nodes of the truncated part of the pyramids. The symmetries are a
rotation around the axis of the square faces/truncated pyramids and a
rotation around the axis going through the triangular faces of the cuboctahedron. The labels at the top of the truncated pyramids must be the same,
a, and we add the other labels anticlockwise around the type I nodes. The labels on the triangular faces must also be identical,
A, and we label the other corners anticlockwise around each type II nodes. The tilling solid for that hole-polyhedron is shown in
Figure 2f. It corresponds to a rhombic dodecahedron where 4 of the rhombic faces are split in 2 triangles.
Before providing some examples of p-cage derivations, it is useful to describe the naming convention which we use to refer to these p-cages. The names are of the format GRAPH_P_P_a_b_c_d[_e]-A_B_C_D[_E_F] where the italic symbols are replaced by actual values. GRAPH stands for the nickname of the hole-polyhedron graph while , and the hole-edge labels are self explanatory. We also use the following simplification to avoid excessively long names: when n successive hole-edge labels have the same values, , we replace the sequence by nx. As an example, p5-2pyr_P10_P10_1_1_2_2-1_1_1_1_1 becomes p5-2pyr_P10_P10_2x1_2x2-5x1.
The graphs that we have considered are all listed in
Appendix A together with the planar graph names taken from [
49], the p-cage names, the number of configurations that we have tested, as well as the number of p-cages found with deformations not exceeding 10%, 5% and regular p-cages (
).
3. Results
The first example is derived from the planar graph
3pyr (
33_F4_1-3-0_3-2-1_V3_1 in [
49]). It corresponds to the planar graph of a triangular pyramid where the top vertex is of type II and the other three are of type I. The dual of that solid is also a triangular pyramid where we fit the type II face while the side faces are tilled with the type I p-cage faces. The hole-edge repartition is shown in
Figure 1a. This results in 12, 6 and 3 p-cages with deformations below, respectively, 10%, 5% or equal to 0%. The regular p-cages consist in the simple embedding of hexagons, nanogons or decagons on the four faces of a tetrahedron, as shown in
Figure 3a. The non-regular p-cages have a regular base, while the type II faces are non-regular polygons on the sides (
Figure 3b,c).
The second example is derived from the planar graph
p3-tb (
33_F5_1-0-3_1-3-0_3- 2-2_V3_3 in [
49]). It corresponds to the planar graph of a triangular prism where the vertices of a given base belong to the same family, as shown in
Figure 1b. The dual of that solid consists of two pyramids joined at their bases where the face of a pyramid are all of the same type.
The hole-edge repartition is shown in
Figure 1b. This results in 65, 20, and 5 p-cages with deformations below, respectively, 10%, 5% or equal to 0%. The regular p-cages consist in two identical pyramids (
Figure 4a,b).
Figure 4c shows a p-cage made of two different pyramids: three decagons and three hendecagons.
The third example is derived from the planar graph
p4-12sqr (
34_F14_2-0-4_12-2-2_ V8_8 in [
49]). It corresponds to the planar graph of a square prism where the vertical edges have been split, as shown in
Figure 1c which describes the hole-edge repartition for that graph. The dual of that solid corresponds to a rhombic dodecahedron where four of the squares have been split into two triangles, as shown in
Figure 1d.
This results in 25, 5 and 1 p-cages with deformations below, respectively, 10%, 5% or equal to 0%. The regular p-cage consists of a square prism where the base are tiled with four octagons and the edges with two octagons (
Figure 5a). This is a degenerate p-cage as the side octagons are touching those on the adjacent side. The difference is clear when we compare it with the p-cage shown in
Figure 5b. This second p-cage is made out of eight 13-gons (on the prism base) and eight decagons (on the prism sides). This p-cage has a 4.3% deformation).
The fourth example is derived from the planar graph
p4-bsplit (
34_F8_4-1-2_4-2-2_ V4_4 in [
49]). The graph has an axial symmetry but is not directly related to the planar graph of a regular solid. The hole-edge repartition is shown in
Figure 6a. This results in 101 and 14 p-cages with deformations below, respectively, 10%, 5% and no regular p-cages. We present two examples in
Figure 6b,c with deformations of 2.4% and 4.3%.
The fifth example is derived from the planar graph
DAco (
34_F12_12-2-2_V8_6 in [
49]). It corresponds to the planar graph of the rhombic dodecahedron, the dual solid of cuboctahedron. As a result, the tilling solid is the cuboctahedron and its planar graph with the hole-edge repartition is shown in
Figure 7a. The symmetry of the graph imposes strong constraints on the hole-edge values. As a results this leads to nine regular p-cages and no near-miss p-cages. The simplest one,
DAco_P6_P8_3x1-4x1, is shown in
Figure 7b; it corresponds to the truncated cuboctahedron where the squares are the p-cage holes. The p-cage shown in
Figure 7c,
DAco_P9_P8_3x2-4x1, is another example.
The sixth example is derived from the planar graph
Da5 (
35_F10_10-3-1_V10_2 in [
49]). It corresponds to the planar graph of the dual of the pentagonal antiprism. As a result, the tilling solid is the pentagonal antiprism and the hole-edge repartition is shown in
Figure 8a.
This results in 4, 5 and 1 p-cages with deformations below respectively 10%, 5% or equal to 0%. The regular p-cage
Da5_P10_P10_3_1_3-5x1, shown in
Figure 8b is degenerate as described above. Most of the near-miss p-cages of this type look like ice cream sandwiches, as shown in
Figure 8c.
The seventh example is derived from the planar graph
DAid (
35_F30_30-2-2 _V20_12 in [
49]). It corresponds to the planar graph of the dual of the icosidodecahedron. The tilling solid is hence the icosidodecahedron and the hole-edge repartition for that planar graph shown in
Figure 9a. This is another example where the hole-edge numbers are very constrained.
This results in five regular p-cages and no near-miss one. The simplest one
DAid_P6 _P10_3x1-5x1,
Figure 9b, corresponds to the icosidodecahedron where the square are the p-cage holes. The second example, shown in
Figure 9c, is one of the few examples of a three-neighbours family p-cage which is sphere-like in shape and has small holes.
The eighth example is derived from the planar graph
Aco-6tr-4pyr (
36_F38_6-4-0_8-0-3_24-2-2_V24_12 in [
49]). It corresponds to the planar graph of the cuboctahedron on which truncated pyramids have been placed on its square faces.
This results in no regular p-cages but seven and two p-cages with deformations below, respectively, 10% and 5%. As an example, the p-cage
Aco-6tr-4pyr_P6_P18_3x1-6x2 is shown in
Figure 10a.
The ninth example is derived from the planar graph
Asc-8-3pyr (
36_F54_6-0-4_24-0-3_24-1-2_V8_24 in [
49]). It corresponds to the planar graph of the snubcube on which triangular pyramids have been placed on the eight triangles located at the corner of the underlying cube. The hole-edge repartition is presented in
Figure 10b.
This results in three p-cages with deformation below 10% but none with deformations below 5%, let alone regular p-cages. An example,
Asc-8-3pyr_P6_P20_3x1-3x3_2x1_3, is presented in
Figure 10c. It has a deformation exceeding 8% as is clearly seen from the shape of the icosagons.