1. Introduction
The lack of gene-delivery vectors [1] is the main limiting factor in the field of personalized gene therapy [2]. Viral vectors have a low efficiency. Synthetic gene-delivery agents do not possess the required efficacy [3]. In recent years, a variety of capable polymers have been designed specifically for gene delivery. Understanding the mechanisms of polymer-mediated gene delivery will help in the design of polymer-based gene-delivery systems, which could thus become essential tools for human gene therapy. The design criteria for the construction of useful delivery vectors are continuously evolving. Reverse drug design, fragment-based drug design, and virtual screening have all failed to identify a candidate class. Most of these methods explore the same drug space as a template compound. Thus, a mathematical model, expressed as a function, will enlarge the druggability space. Such a model will go beyond the respective class of compounds. One way of generating such a model is to explore the Cartesian coordinates, close contacts, dihedral angle values, and internal coordinates of a given molecule. Cartesian coordinates provide a better starting point for projection into a vast range of mathematical spaces than internal coordinates and close contacts. One can distinguish linear spaces and topological spaces. Linear spaces lack 3D dimensionality due to their algebraic nature. Linear operations performed in a linear space lead to straight lines. Their dimensionality is defined as the maximal number of independent vectors.
Topological spaces that are analytic in nature lead to continuous functions. A topological space is hard to define. An algebraic approach is used in most cases.
Polynomial equations and their geometric properties are widely used in algebraic geometry. A major characteristic that makes polynomial equations a valuable tool in topology is that they are defined using only basic arithmetic operations, namely addition and multiplication. These operations, when used, yield smooth and Riemannian manifolds.
Manifolds are a type of topological space that locally resembles Euclidean space. The concept of manifolds is a keystone in geometry, as it allows complex structures to be characterized in terms of their local topological properties. The Cartesian coordinates and dihedral angles of a structure can be viewed as carrying these topological properties.
A smooth manifold is not an actual space. Furthermore, topological manifolds are defined as smooth manifolds that are finite linear spaces. In other words, the surface of an ellipsoid can be represented as a smooth manifold.
Every real structure has its own Euclidean space representation. Riemann manifolds can also be considered to be Euclidean spaces.
Mathematical spaces, usually those of complex analysis, coexist with drug spaces both in real roots and complex ones [4]. The significant drawback of these techniques, which are based on molecular topology, is their dependence on the actual force field, i.e., the level of theory used to optimize the respective compound.
Statistical and mathematical models are essential tools in predicting molecular properties. The quantitative structure–activity relationship (QSAR) methodology uses a set of molecules to predict one distinct molecular property: a set of descriptors is computed, and a QSAR equation is generated from them. This equation is then used to predict specific properties. The molecular descriptors used in QSAR models are derived from experimental data, such as logP, molar refractivity, and polarizability, and from theoretical descriptors, which are generally symbolic representations of a molecule. The theoretical descriptors characterize the constitution of the molecule, its structural fragments, the graph invariants, the 3D properties (molecule size, volume), and the properties derived from the grid-based molecular force field (GRID), comparative molecular field analysis (CoMFA), and similar methods. Both types of descriptors, experimental and theoretical, are used in establishing a prediction model. Such models involve a specific chemical space.
Furthermore, the more diverse the molecular set is (i.e., the more the molecules differ with respect to their molecular formula and structure), the larger the chemical space the model will cover. Thus, by using a diverse set of molecules, one will obtain a better prediction model. Accordingly, the set of molecules used in this study is diverse (i.e., distinct molecular formulas and distinct structures), so that it can accurately characterize a chemical space, making the model applicable to novel molecules.
In this study, an inverse methodology was proposed in order to identify polymers with gene transfer capabilities. Instead of building a QSAR model that will explore a fraction of the chemical space, the chemical space was investigated with the help of the Cartesian coordinates. The equations derived from the Cartesian coordinates were used to compute chemical spaces—Riemannian spaces.
Theoretically, a model built in such a way provides an extensive overview of the drug space, both in real numbers and in imaginary complex number spaces. Such equations (polynomial equations)/models (Riemann spaces) can work as “reverse engineering” and lead to suitable gene-carrying polymers, with good cytotoxicity and biocompatibility profiles.
2. Materials and Methods
A set of 29 monomers (Table 1), retrieved from the gene transfer literature [5], was used to compute a mathematical model. The in silico models of the polymer structures were generated in the mol2 format using the Chemoffice 2005 software package [6]. The models were optimized at the 6-31G level of theory [7]. The Cartesian coordinates (Figure 1) of the polymer structures were used to develop derived equations of coordinates (Table 2 and Table 3), using Mathematica 5.0 software [8]. The Riemann surfaces for each derived coordinate equation were solved and computed using the same software (Mathematica). The branch point for each equation was computed and represented. A 2D representation (complex map) and a 3D representation (Riemann surface) were used to present the results. In addition, a single hypothesis QSAR model was computed using the Schrödinger 2009 software package [9]. The pharmacophores were computed using compounds 8 and 21 as templates; these compounds were chosen because of their proven gene transfer properties. Moreover, a combined pharmacophore hypothesis was developed. The branching points were compared against the known gene transfer properties. The methodology is exemplified for compounds 8 and 21.
3. Results
Table 2 lists the initial derived equations generated using the Cartesian coordinates of each molecule.
The equations in Table 2 were transformed using the following formula:
where a = free term; b = coefficient; z = Lambert W function.
The equations for each polymer are shown in Table 3.
The Riemann surfaces obtained for compounds 8 and 21 that showed promising experimental results are shown in Figure 2.
The branching points computed for each structure are shown in Figure 3.
The QSAR models, developed as a single hypothesis for compounds 8 and 21, and the merge hypothesis are shown in Figure 4.
The Cartesian coordinates of each hypothesis are listed here. Hypothesis 1: D1D2H3H4; D1 (x −4.18, y 2.07, z −0.84); D2 (x −4.48, y −2.07, z 0.84); H3 (x −5.97, y 2.62, z 0.00); H4 (x −2.69, y −2.69, z 0.00).
Hypothesis 2: A1A2A3A4H5H6; A1 (x −3.11, y 3.66, z 0.00); A2 (x 2.06, y −3.25, z 0.00); A3 (x −2.32, y 5.77, z 0.00); A4 (x −2.32, y 5.77, z 0.00); H5 (x −1.38, y 1.91, z 0.00); H6 (x 0.35, y −1.50, z 0.00).
Merge hypothesis: A1A2D2D3H5H6; A1 (x −3.11, y 3.66, z 0.00); A2 (x 2.06, y −3.25, z 0.00); D2 (x −4.48, y −2.07, z 0.34); D3 (x −4.18, y 2.07, z −0.84); D4 (x −4.48, y −2.07, z 0.84); H5 (x −5.97, y 2.62, z 0.00); H6 (x −2.69, y −2.62, z 0.00). As observed above, the merge hypothesis has elements in common with both hypotheses 1 and 2, but leaves out D1 and H4 from hypothesis 1 and A3 and A4 from hypothesis 2.
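Pharmacophore hypotheses of this kind are often compared through the distances between their features rather than through raw coordinates. The following is a minimal sketch, not part of the original workflow, that computes the pairwise Euclidean distances for Hypothesis 1 from the coordinates listed above. The names `hypothesis_1` and `pairwise_distances` are illustrative, and the distances are expressed in whatever length unit the source coordinates use.

```python
import math
from itertools import combinations

# Feature coordinates for Hypothesis 1 (D1D2H3H4), copied from the text.
hypothesis_1 = {
    "D1": (-4.18, 2.07, -0.84),
    "D2": (-4.48, -2.07, 0.84),
    "H3": (-5.97, 2.62, 0.00),
    "H4": (-2.69, -2.69, 0.00),
}


def pairwise_distances(features):
    """Return {(name_a, name_b): Euclidean distance} for every feature pair."""
    return {
        (a, b): math.dist(features[a], features[b])
        for a, b in combinations(features, 2)
    }


distances = pairwise_distances(hypothesis_1)
for (a, b), d in distances.items():
    print(f"{a}-{b}: {d:.2f}")
```

The same function can be applied to the feature dictionaries of Hypothesis 2 and the merge hypothesis to compare their geometries.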
4. Discussion
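Symmetric spaces are pseudo-Riemann manifolds [10]. A complete, simply connected Riemann manifold is a symmetric space if and only if its curvature tensor is invariant under parallel transport. More generally, a Riemann manifold is symmetric if, for each of its points, there exists an isometry that fixes the point and acts on the tangent space at that point as minus the identity. Furthermore, every symmetric space is complete.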
Riemann symmetric spaces arise in physics, mathematics, and chemistry. They have a central role in holonomy theory. Examples of Riemann symmetric spaces include Euclidean spaces, hyperbolic spaces, and projective spaces. Riemann symmetric spaces are classified into the Euclidean type, the compact type, and the non-compact type [11].
A branch point of an analytic function is a point in the complex plane whose complex argument can be mapped from a single point in the domain to multiple points in the range [12]. For example, consider the behavior of the point $z = 0$ under the power function $f(z) = z^a$, where $a$ is a complex non-integer ($a \in \mathbb{C}$, $a \notin \mathbb{Z}$). Writing $z = e^{i\theta}$ and taking $\theta$ from 0 to $2\pi$ results in:
$$f(e^{0i}) = e^{0} = 1, \qquad f(e^{2\pi i}) = e^{2\pi i a},$$
so that the values of the function $f(z)$ at $\theta = 0$ and $\theta = 2\pi$ are different, which makes $z = 0$ a branch point [12,13,14,15].
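The behavior of the power function around z = 0 can be checked numerically. The sketch below is an illustration only, with hypothetical function names: it follows f(z) = z^a continuously along the unit circle, where z = e^{iθ} gives f = e^{iaθ}, and compares the values at θ = 0 and θ = 2π for a non-integer and an integer exponent.

```python
import cmath
import math


def power_on_unit_circle(a, theta):
    """Value of f(z) = z**a at z = exp(i*theta), continued along the circle."""
    return cmath.exp(1j * a * theta)


def returns_to_start(a, tol=1e-12):
    """True if f takes the same value after one full turn around z = 0."""
    start = power_on_unit_circle(a, 0.0)
    end = power_on_unit_circle(a, 2 * math.pi)
    return abs(end - start) < tol


# Non-integer exponent: the value changes after one loop, so z = 0 is a branch point.
print(returns_to_start(0.5))
# Integer exponent: the value is recovered, so there is no branching at z = 0.
print(returns_to_start(2))
```

For a = 0.5 the value goes from 1 to e^{iπ} = −1 after one turn, while for a = 2 it returns to 1.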
For Riemann surfaces, the notion of a branch point is defined for a holomorphic function ƒ: X → Y from a compact connected Riemann surface X to a compact Riemann surface Y (usually the Riemann sphere). If ƒ is not constant, ƒ will be a covering map onto its image at all but a finite number of points. The points of X where ƒ fails to be a covering map are the ramification points of ƒ, and the image of a ramification point under ƒ is called a branch point.
For any point P ∈ X and Q = ƒ(P) ∈ Y, there are holomorphic local coordinates z for X near P and w for Y near Q, in terms of which the function ƒ(z) is given by $w = z^k$ for some integer k. This integer is called the ramification index of P. Usually, the ramification index equals one; if the ramification index is not equal to one, then P is, by definition, a ramification point, and Q is a branch point.
If Y is just the Riemann sphere, and Q is in the finite part of Y, then there is no need to select special coordinates. The ramification index can be determined explicitly from Cauchy’s integral formula. Let γ be a simple rectifiable loop in X around P. The ramification index of ƒ at P (where P is any point in the space with local holomorphic coordinates) is given by Equation (3):
$$e_P = \frac{1}{2\pi i}\int_{\gamma} \frac{f'(z)}{f(z) - f(P)}\,dz \tag{3}$$
This integral is the number of times that ƒ(γ) winds around the point Q. As above, P is a ramification point, and Q is a branch point, if $e_P > 1$.
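The contour integral for the ramification index can be approximated numerically. The following sketch is an illustration under stated assumptions rather than the procedure used in this study: it takes γ to be a small circle around P, evaluates (1/2πi)∮ ƒ′(z)/(ƒ(z) − ƒ(P)) dz with a simple Riemann sum, and uses the test function ƒ(z) = z³, which is not taken from the source. The function name and its parameters are hypothetical.

```python
import cmath
import math


def ramification_index(f, df, p, radius=0.5, n=2000):
    """Approximate (1 / 2*pi*i) * contour integral of f'(z) / (f(z) - f(p)) dz.

    The contour is the circle |z - p| = radius, which must not enclose any
    other point where f takes the value f(p).
    """
    q = f(p)
    total = 0j
    for k in range(n):
        t = 2 * math.pi * k / n
        z = p + radius * cmath.exp(1j * t)
        # dz = i * radius * exp(i*t) dt, with dt = 2*pi / n
        dz = 1j * radius * cmath.exp(1j * t) * (2 * math.pi / n)
        total += df(z) / (f(z) - q) * dz
    return total / (2j * math.pi)


def f(z):
    return z ** 3


def df(z):
    return 3 * z ** 2


# At P = 0 the map z -> z**3 is three-to-one nearby, so P is a ramification point.
index_at_zero = round(ramification_index(f, df, 0).real)
# At P = 1 the map is locally one-to-one, so the index is one.
index_at_one = round(ramification_index(f, df, 1).real)
print(index_at_zero, index_at_one)
```

Rounding the real part is a design choice that absorbs the small discretization error of the sum; the exact integral is always an integer.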
A Riemann surface is a surface-like configuration that covers the complex plane with several, and in general infinitely many, “sheets.” These sheets can have very intricate structures and interconnections. Riemann surfaces are one way of representing multiple-valued functions; another way is to use branch cuts. The plot in Figure 1 shows the Riemann surfaces as the solutions of the equation:
with d = 2, 3, 4, and 5, where w(z) is the Lambert W-function.
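Because the Lambert W function is central to the transformed equations, a small numerical illustration may help. The sketch below is not the Mathematica routine used in this study: it evaluates the principal real branch of W, defined by W(z)·e^{W(z)} = z, with a plain Newton iteration. The function name, starting guess, and tolerances are assumptions made for this example. The iteration is restricted to z > −1/e, since z = −1/e is the branch point of W, where the derivative factor (w + 1) vanishes.

```python
import math


def lambert_w0(z, tol=1e-14, max_iter=100):
    """Principal real branch of the Lambert W function for real z > -1/e.

    Solves w * exp(w) = z by Newton iteration.
    """
    if z <= -1.0 / math.e:
        raise ValueError("this sketch only handles z > -1/e")
    w = math.log1p(z)  # starting guess log(1 + z), valid because 1 + z > 0
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - z) / (ew * (w + 1.0))
        w -= step
        if abs(step) < tol:
            break
    return w


w = lambert_w0(1.0)
# The defining identity w * exp(w) = z serves as a self-check.
print(w, w * math.exp(w))
```

The other real branch and the complex branches, which correspond to the additional sheets of the Riemann surface, are outside the scope of this sketch.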
The Riemann surface S of the function field K is the set of non-trivial discrete valuations on K. Here, the set S corresponds to the ideals of the ring A of integers of K over C(z). Riemann surfaces provide a geometric visualization of the function elements and their analytic continuations.
Schwarz proved, at the end of the nineteenth century, that the automorphism group of a compact Riemann surface of genus g ≥ 2 is finite; Hurwitz then showed that the order of this group is at most 84(g − 1).
In light of the computational results, polymers 4, 8, 11, 16, and 21 are the best candidates for feasible gene transfer. Experimentally, only polymers 8 and 21 showed good and acceptable results. A threshold regarding the branching point and its correlation with bioactivity was observed (Figure 2). A branching point of ~1 has an excellent correlation with gene transfer capability, cytotoxicity, and biocompatibility.
The single hypothesis QSAR models for compounds 8 and 21 revealed the importance of topology for bioactivity. The merge hypothesis presents the features shared by compounds 8 and 21. Hydrogen-bond acceptor groups (A) and donor groups (D) are critical in the chemical space of the compounds [16]. Furthermore, the pharmacophores showed a relatively diverse set of functional groups for each hypothesis, a finding that suggests a lack of specificity in describing a common pharmacophore.
Lastly, using the characteristics of symmetric mathematical space [17], one can characterize distinct molecular properties, as shown in Table 4, where the Riemann spaces for compounds 8 and 21 are represented with their complex aspects (see Supplementary Material S2) [18,19,20].
5. Conclusions
A suitable mathematical model based on Riemann surfaces can be built in order to screen and characterize polymers with gene transfer properties. The branching point of Riemann surfaces can aid in assessing such surfaces, their connection with the drug space, and their connective properties.