Möbius Transformation-Induced Distributions Provide Better Modelling for Protein Architecture

Mohammad Arashi; Najmeh Nakhaei Rad; Andriette Bekker; Wolf-Dieter Schubert

doi:10.3390/math9212749

Abstract

Proteins are found in all living organisms and constitute a large group of macromolecules with many functions. Proteins achieve their operations by adopting distinct three-dimensional structures encoded within the sequence of the constituent amino acids in one or more polypeptides. New, more flexible distributions are proposed for the MCMC sampling method for predicting protein 3D structures by applying a Möbius transformation to the bivariate von Mises distribution. In addition to this, sine-skewed versions of the proposed models are introduced to meet the increasing demand for modelling asymmetric toroidal data. Interestingly, the marginals of the new models lead to new multimodal circular distributions. We analysed three big datasets consisting of bivariate information about protein domains to illustrate the efficiency and behaviour of the proposed models. These newly proposed models outperformed mixtures of well-known models for modelling toroidal data. A simulation study was carried out to find the best method for generating samples from the proposed models. Our results shed new light on proposal distributions in the MCMC sampling method for predicting the protein structure environment.

Keywords:

bioinformatics; cosine model; mixture distributions; Möbius transformation; sine model; toroidal data

1. Introduction

Proteins constitute a diverse set of biological macromolecules that are often referred to as the workhorses of cells because of their central role in most biological processes. Chemically, proteins are biopolymers consisting of linear sequences of amino acids covalently linked by peptide bonds, such that each polypeptide is a single large molecule. Nineteen of the natural amino acids (all but proline) have an amino group (–NH_{2}), a carboxylic acid group (–COOH), an amino acid-specific side-chain, and a hydrogen atom attached to a central carbon atom (

C_{α}

). Each peptide bond links the carboxylate group of one amino acid to the amino group of the next. Protein structure is often described in terms of four levels of organisation. The primary structure is the sequence of amino acids. The secondary structure refers to the local folding of the polypeptide backbone into helices, strands, or loops. The tertiary structure describes the complex three-dimensional folding of a polypeptide. Finally, the quaternary structure describes the involvement of one or more polypeptides in creating a functional protein. The amino nitrogen,

C_{α}

, and the carbonyl carbon of all residues constitute the protein backbone.

The 3D coordinates of proteins, as provided by electron microscopy, NMR, or X-ray crystallography, directly reveal the conformation of the backbone atoms, with knowledge of standard chemical bond angles and lengths incorporated during the refinement process. Generally, the backbone conformation is analysed using the backbone torsion or the dihedral angles, denoted by

ϕ

,

ψ

, and

ω

, as introduced by Ramachandran [1] (Figure 1A), where

ω

is usually close to

180^{\circ}

or occasionally

0^{\circ}

. Alternatively, virtual bond and torsion angles

θ

and

τ

may be used to describe a protein backbone representation based on only

C_{α}

positions (Figure 1B).

Figure 1. Two representations of protein backbone structures based on torsion or pseudo-torsion angles.

A major challenge in molecular biology and computational biochemistry involves predicting protein 3D structure. The encoding gene provides the primary structure of a protein, and the secondary structure may be predicted computationally with high reliability using artificial neural networks [2], based on the propensity of amino acids to form different secondary structures.

However, predicting the 3D structure of a protein, especially if it is larger than 100 amino acids or if a homologue with a known structure and significant sequence identity is not available, remains challenging. This challenge is addressed by de novo structure prediction, which requires parametrized physical force fields. The probability of observing a particular conformation

x

of the molecule,

p (x | β)

is considered and expressed as the Boltzmann distribution:

p (x | β) = \frac{exp (- β U (x))}{Z_{β}},

where

Z_{β}

is the normalization constant,

U (x)

is the potential energy of the molecule,

β = {(k_{b} T)}^{- 1}

is the thermodynamic beta,

k_{b}

is the Boltzmann constant, and T is the temperature. The 3D structure of a molecule can be derived from

p (x | β)

by determining the mode of the distribution. Molecular dynamics (MD) is a simulation-based method used to probe for the mode of distribution. However, many millions to trillions of steps are required to simulate a single folding event. By contrast with MD, Monte Carlo (MC)-based methods are more time-efficient. In the Markov Chain Monte Carlo (MCMC) method, a Markov chain is constructed using the Metropolis–Hastings (MH) algorithm ([3,4]), with

p (x | β)

as the stationary distribution. A symmetric proposal distribution is utilized in the MH algorithm.

Choosing a good proposal distribution is one of the challenges in MCMC-based simulation. Gaussian perturbations are the most straightforward proposal distributions that can be used [5]. The results are more accurate when the proposal distribution is closer to the stationary distribution; therefore, protein structural information is incorporated into most proposal distributions. Using the information on angles and bond lengths observed in real proteins is a simple way to define a suitable proposal distribution. Fragment libraries for backbone angles and rotamer libraries for side-chain angles can be selected as default choices for proposal distributions [6,7,8].
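The role of a symmetric proposal in the Metropolis–Hastings scheme can be illustrated with a minimal sketch. It assumes a single angular coordinate, a hypothetical toy energy U(x) = -cos(x), and a wrapped Gaussian perturbation as the proposal; the names `metropolis_circle` and `toy_potential` are illustrative only and do not come from any protein-sampling package.

```python
import math
import random


def metropolis_circle(potential, beta, n_steps, step_sd=1.0, x0=0.0, seed=0):
    """Random-walk Metropolis sampler for p(x) proportional to exp(-beta * U(x)).

    The Gaussian perturbation is symmetric, so the Hastings ratio reduces to
    the ratio of Boltzmann weights.
    """
    rng = random.Random(seed)
    x = x0
    u = potential(x)
    samples = []
    for _ in range(n_steps):
        # Symmetric proposal: Gaussian perturbation wrapped onto [-pi, pi).
        x_new = (x + rng.gauss(0.0, step_sd) + math.pi) % (2.0 * math.pi) - math.pi
        u_new = potential(x_new)
        # Accept with probability min(1, exp(-beta * (U_new - U_old))).
        if u_new <= u or rng.random() < math.exp(-beta * (u_new - u)):
            x, u = x_new, u_new
        samples.append(x)
    return samples


def toy_potential(x):
    """Hypothetical single-well energy on the circle with its minimum at 0."""
    return -math.cos(x)


chain = metropolis_circle(toy_potential, beta=2.0, n_steps=20000)
mean_cos = sum(math.cos(x) for x in chain) / len(chain)
```

With this toy energy, the stationary distribution is a von Mises distribution with concentration β, so the chain should concentrate around 0. A proposal closer to the target would reduce the rejection rate, which is the motivation for structure-informed proposals.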

Various tractable statistical distributions for modelling protein dihedral angles are briefly reviewed. These models can be used as proposal distributions for MCMC protein sampling. They can also be utilized as a prior for determining a protein structure from data. However, these models do not generate folded proteins because they work under some simplifying assumptions, both in terms of their functional form and dependency structure (see [9]). The ultimate goal of our contribution is to propose more flexible models for the proposal distribution.

1.1. Brief Overview

An overview of the models available for toroidal data that forms the departure point for the investigation in this paper follows (see [10]).

The first probability distribution on the torus was proposed by Mardia in [11]. It is the bivariate von Mises distribution:

\begin{matrix} f (θ_{1}, θ_{2}) & = & C exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) \\ + & (cos (θ_{1} - ι_{1}), sin (θ_{1} - ι_{1})) A {(cos (θ_{2} - ι_{2}), sin (θ_{2} - ι_{2}))}^{T}), \end{matrix}

where C is the normalizing constant,

ι_{1}, ι_{2} \in [- π, π)

are location parameters,

κ_{1}, κ_{2} \geq 0

are concentration parameters, and matrix

A_{2 \times 2}

is the circular–circular dependence parameter. To move beyond the complexity created by the large number of parameters in this founding distribution, a few special cases in the literature have been considered. Rivest in [12] introduced the subclass:

\begin{matrix} f (θ_{1}, θ_{2}) & \propto & exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) + α cos (θ_{1} - ι_{1}) cos (θ_{2} - ι_{2}) \\ + & β sin (θ_{1} - ι_{1}) sin (θ_{2} - ι_{2})), \end{matrix}

(1)

where

α, β \in R

. Singh et al. in [13] proposed the sine model as a special case of (1) with one less parameter, letting

α = 0

and

β = κ_{3}

:

\begin{matrix} f (θ_{1}, θ_{2}) = C exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) + κ_{3} sin (θ_{1} - ι_{1}) sin (θ_{2} - ι_{2})), \end{matrix}

(2)

where

C^{- 1} = 4 π^{2} \sum_{i = 0}^{\infty} (\binom{2 i}{i}) {(\frac{κ_{3}^{2}}{4 κ_{1} κ_{2}})}^{i} I_{i} (κ_{1}) I_{i} (κ_{2}),

(3)

where

I_{α} (z)

is the modified Bessel function of the first kind of order

α

. Another submodel of (1), the cosine model, was introduced by Mardia et al. in [14] by setting

α = β = - κ_{3}

:

\begin{matrix} f (θ_{1}, θ_{2}) = C exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) - κ_{3} cos (θ_{1} - ι_{1} - θ_{2} + ι_{2})), \end{matrix}

(4)

where

C^{- 1} = 4 π^{2} \{I_{0} (κ_{1}) I_{0} (κ_{2}) I_{0} (κ_{3}) + 2 \sum_{i = 1}^{\infty} I_{i} (κ_{1}) I_{i} (κ_{2}) I_{i} (κ_{3})\} .

(5)
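As a numerical sanity check, the series in (3) for the sine model can be compared with a direct quadrature of the unnormalized density in (2). The sketch below is illustrative only: the function names are hypothetical, the Bessel function is evaluated by its power series (adequate for moderate arguments), and the series form assumes that κ_{1} and κ_{2} are strictly positive.

```python
import math


def bessel_i(order, x, terms=80):
    """Modified Bessel function of the first kind, integer order, by power series."""
    half = 0.5 * x
    term = half ** order / math.factorial(order)
    total = term
    for j in range(terms):
        term *= half * half / ((j + 1) * (j + 1 + order))
        total += term
    return total


def sine_model_inv_const(k1, k2, k3, terms=60):
    """C^{-1} of the sine model from the series in (3); requires k1, k2 > 0."""
    ratio = k3 * k3 / (4.0 * k1 * k2)
    total = 0.0
    for i in range(terms):
        total += math.comb(2 * i, i) * ratio ** i * bessel_i(i, k1) * bessel_i(i, k2)
    return 4.0 * math.pi ** 2 * total


def sine_model_inv_const_grid(k1, k2, k3, n=128):
    """C^{-1} by the rectangle rule on an n x n grid with location parameters at 0."""
    step = 2.0 * math.pi / n
    total = 0.0
    for a in range(n):
        t1 = -math.pi + a * step
        for b in range(n):
            t2 = -math.pi + b * step
            total += math.exp(k1 * math.cos(t1) + k2 * math.cos(t2)
                              + k3 * math.sin(t1) * math.sin(t2))
    return total * step * step
```

The rectangle rule is used because it converges very quickly for smooth periodic integrands; for κ_{3} = 0 the series collapses to 4π² I_{0}(κ_{1}) I_{0}(κ_{2}), the product of two von Mises constants.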

It is worth noting that Kent et al. in [15] introduced another version of the cosine model, with a negative interaction given by:

\begin{matrix} f (θ_{1}, θ_{2}) = C exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) - κ_{3} cos (θ_{1} - ι_{1} + θ_{2} - ι_{2})), \end{matrix}

with the same normalizing constant as for the model with a positive interaction in (4). Kent et al. in [15] also introduced a submodel of (1), which is a hybrid between the sine and cosine models, given by:

\begin{matrix} f (θ_{1}, θ_{2}) & \propto exp (κ_{1} cos (θ_{1} - ι_{1}) + κ_{2} cos (θ_{2} - ι_{2}) \\ + β \{(cosh λ - 1) cos (θ_{1} - ι_{1}) cos (θ_{2} - ι_{2}) + sinh λ sin (θ_{1} - ι_{1}) sin (θ_{2} - ι_{2})\}), \end{matrix}

(6)

where

κ_{1}, κ_{2} \geq 0

,

λ \in R

, and for simplicity,

β = 1

. Mardia and Frellsen in [16] compared the properties of the three submodels in (2), (4), and (6). The multivariate extensions of the sine model can be found in [17]. In another attempt to expand the platform of toroidal distributions, Wehrly and Johnson in [18] used a marginal specification approach to construct bivariate models with more flexible specified circular marginals. Later, Jones et al. in [19] obtained various toroidal models using the general form in [18]. Along the same lines, Fernández-Durán in [20] proposed another general toroidal model based on a copula pdf, on which García-Portugués imposed periodic restrictions in [21]; Jones et al. [19] called such a pdf a circula, arguing that it is characterised by a circular uniform distribution. For more details, see [22].

The main incentive for defining toroidal models in recent years has been the demand from other sciences, especially bioinformatics, to model dihedral angles in order to analyse protein structures ([13,14,23,24]). However, toroidal data can also be observed in other fields, for example, in meteorology (wind directions at two different times of day) and medicine (peak systolic blood pressure during two separate time periods). For the interested reader, some applications of toroidal models can be found in [25,26,27,28,29].

Most of the proposed toroidal models are pointwise symmetric, whereas the data that they model usually represent asymmetric patterns. This inspired Ameijeiras-Alonso and Ley [24] to introduce bivariate sine-skewed distributions (

B S S

):

f_{B S S} (θ_{1}, θ_{2}) = f (θ_{1} - μ_{1}, θ_{2} - μ_{2}) (1 + λ_{1} sin (θ_{1} - μ_{1}) + λ_{2} sin (θ_{2} - μ_{2})),

(7)

where

f (., .)

is a toroidal density that is pointwise symmetric about

(μ_{1}, μ_{2})

with

- π \leq μ_{1}, μ_{2} < π

, and the skewness parameters

- 1 ⩽ λ_{i} ⩽ 1

,

i = 1, 2,

satisfy

| λ_{1} | + | λ_{2} | ⩽ 1

.
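The sine-skewing construction in (7) can be sketched in a few lines. As an assumption made purely for illustration, the symmetric base density here is a product of two wrapped Cauchy densities, chosen because it has a closed form; the helper names are hypothetical.

```python
import math


def wrapped_cauchy(theta, r):
    """Wrapped Cauchy density centred at 0 with concentration r in [0, 1)."""
    return (1.0 - r * r) / (2.0 * math.pi * (1.0 + r * r - 2.0 * r * math.cos(theta)))


def base_density(t1, t2, r1=0.4, r2=0.2):
    """Illustrative pointwise-symmetric toroidal density: f(-t1, -t2) = f(t1, t2)."""
    return wrapped_cauchy(t1, r1) * wrapped_cauchy(t2, r2)


def sine_skewed(base, t1, t2, mu1, mu2, lam1, lam2):
    """Bivariate sine-skewed density (7) built from a symmetric base density."""
    if abs(lam1) + abs(lam2) > 1.0:
        raise ValueError("skewness parameters must satisfy |lam1| + |lam2| <= 1")
    skew = 1.0 + lam1 * math.sin(t1 - mu1) + lam2 * math.sin(t2 - mu2)
    return base(t1 - mu1, t2 - mu2) * skew


def torus_integral(func, n=200):
    """Rectangle-rule integral of func over [-pi, pi)^2."""
    step = 2.0 * math.pi / n
    grid = [-math.pi + i * step for i in range(n)]
    return sum(func(a, b) for a in grid for b in grid) * step * step
```

Because the base density is pointwise symmetric and the sine terms are odd, the skewing factor does not change the normalizing constant, while the constraint on the skewness parameters keeps the density non-negative.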

In this paper, the Möbius transformation forms the foundation for the construction of competitive models. A map

T : C \to C

is a Möbius transformation if it has the following form:

T (z) = \frac{a z + b}{c z + d},

where

C

is the set of complex numbers,

a, b, c, d \in C

are complex numbers, and

a d - b c \neq 0

. Let

S \subset C

be the unit circle. A Möbius transformation that preserves S then maps a point on the unit circle

θ

onto another

\tilde{θ}

. Jones in [30] subsequently applied the Möbius transformation to introduce a new family of distributions on the disc. Kato and Jones in [31] used the Möbius transformation to introduce a new distribution on the circle by transforming the von Mises distribution. Wang and Shimizu in [32] applied the Möbius transformation to cardioid random variables. Kato and Pewsey in [33] employed this transformation to define the unimodal bivariate wrapped Cauchy distribution by transforming the bivariate circular distribution in [34]:

\begin{matrix} f (θ_{1}, θ_{2}) & = c (c_{0} - c_{1} cos (θ_{1} - μ_{1}) - c_{2} cos (θ_{2} - μ_{2}) - c_{3} cos (θ_{1} - μ_{1}) cos (θ_{2} - μ_{2}) \\ {- c_{4} sin (θ_{1} - μ_{1}) sin (θ_{2} - μ_{2}))}^{- 1}, \end{matrix}

(8)

where

c = (1 - ρ^{2}) (1 - r_{1}^{2}) (1 - r_{2}^{2}) / (4 π^{2})

,

c_{0} = (1 + ρ^{2}) (1 + r_{1}^{2}) (1 + r_{2}^{2}) - 8 | ρ | r_{1} r_{2}

,

c_{1} = 2 (1 + ρ^{2}) r_{1} (1 + r_{2}^{2}) - 4 | ρ | (1 + r_{1}^{2}) r_{2}

,

c_{2} = 2 (1 + ρ^{2}) r_{2} (1 + r_{1}^{2}) - 4 | ρ | (1 + r_{2}^{2}) r_{1}

,

c_{3} = - 4 (1 + ρ^{2}) r_{1} r_{2} + 2 | ρ | (1 + r_{1}^{2}) (1 + r_{2}^{2})

,

c_{4} = 2 ρ (1 - r_{1}^{2}) (1 - r_{2}^{2})

,

μ_{1}, μ_{2} \in [- π, π)

,

r_{1}, r_{2} \in [0, 1)

, and

- 1 < ρ < 1

. Kato and McCullagh in [35] introduced the Cauchy distribution on the sphere by using a Möbius transformation.
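The density in (8) can be evaluated directly from its coefficients. In the sketch below, which is illustrative only, c_{2} is taken as the mirror image of c_{1} with the indices 1 and 2 interchanged, so that the density factorizes into two wrapped Cauchy densities when ρ = 0; the function names are hypothetical.

```python
import math


def bwc_density(t1, t2, mu1, mu2, r1, r2, rho):
    """Bivariate wrapped Cauchy density (8) with coefficients c, c_0, ..., c_4."""
    a = abs(rho)
    c = (1 - rho**2) * (1 - r1**2) * (1 - r2**2) / (4 * math.pi**2)
    c0 = (1 + rho**2) * (1 + r1**2) * (1 + r2**2) - 8 * a * r1 * r2
    c1 = 2 * (1 + rho**2) * r1 * (1 + r2**2) - 4 * a * (1 + r1**2) * r2
    c2 = 2 * (1 + rho**2) * r2 * (1 + r1**2) - 4 * a * (1 + r2**2) * r1
    c3 = -4 * (1 + rho**2) * r1 * r2 + 2 * a * (1 + r1**2) * (1 + r2**2)
    c4 = 2 * rho * (1 - r1**2) * (1 - r2**2)
    x1, x2 = t1 - mu1, t2 - mu2
    denom = (c0 - c1 * math.cos(x1) - c2 * math.cos(x2)
             - c3 * math.cos(x1) * math.cos(x2) - c4 * math.sin(x1) * math.sin(x2))
    return c / denom


def torus_integral(func, n=200):
    """Rectangle-rule integral of func over [-pi, pi)^2."""
    step = 2.0 * math.pi / n
    grid = [-math.pi + i * step for i in range(n)]
    return sum(func(a, b) for a in grid for b in grid) * step * step
```

Integrating the density over the torus for a few parameter choices is a quick way to catch a mistyped coefficient, since any such error typically breaks the normalization.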

1.2. Our Contribution

In this paper, two new distributions are introduced on the torus by applying a restricted version of the Möbius transformation developed by Kato and Pewsey in [33], namely the circular Möbius transformation that transforms

\tilde{θ}

into

θ

through the following mapping:

θ = ℧ (\tilde{θ}, μ, ν, r) = μ + ν + 2 arctan \{\frac{1 - r}{1 + r} tan (\frac{\tilde{θ} - ν}{2})\},

(9)

where

- π \leq μ

,

ν \leq π

,

r \in [0, 1)

, and

μ

is the rotation parameter. When

μ = 0

,

ν

and r attract the point

θ

towards

ν

. By increasing r, the concentration of the points around

ν

increases. If

r = 0

, the transformation is the identity mapping, and when

r \to 1

,

℧ (\tilde{θ}, μ, ν, r)

tends to

ν

. More details about the circular Möbius transformation can be found in [29,36]. The inverse of (9) can be obtained as follows:

\tilde{θ} = ν + 2 arctan \{\frac{1 + r}{1 - r} tan (\frac{θ - μ - ν}{2})\} .
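The mapping in (9) and its inverse can be sketched as follows. The sketch is illustrative: the names `wrap`, `mobius`, and `mobius_inverse` are hypothetical, and `atan2` is used in place of arctan(ω tan(·)) as an implementation choice to avoid the overflow of the tangent near ±π.

```python
import math


def wrap(angle):
    """Wrap an angle onto [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi


def mobius(theta_tilde, mu, nu, r):
    """Forward map (9): theta = mu + nu + 2 arctan{(1-r)/(1+r) tan((theta_tilde-nu)/2)}."""
    half = 0.5 * wrap(theta_tilde - nu)
    w = (1.0 - r) / (1.0 + r)
    # For half in [-pi/2, pi/2), atan2(w*sin, cos) agrees with arctan(w*tan(half)).
    return wrap(mu + nu + 2.0 * math.atan2(w * math.sin(half), math.cos(half)))


def mobius_inverse(theta, mu, nu, r):
    """Inverse map: theta_tilde = nu + 2 arctan{(1+r)/(1-r) tan((theta-mu-nu)/2)}."""
    half = 0.5 * wrap(theta - mu - nu)
    w = (1.0 + r) / (1.0 - r)
    return wrap(nu + 2.0 * math.atan2(w * math.sin(half), math.cos(half)))
```

For μ = 0 and r = 0 the forward map leaves angles unchanged, and for r close to 1 it sends most angles to a small neighbourhood of ν, in line with the description above.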

More specifically, our novel contribution includes the following highlights:

New Möbius transformation-induced toroidal distributions are developed, acting as alternatives for existing models and efficiently outperforming them in the data application in this paper;
The proposed distributions reflect the protein structure more accurately than the existing models and can serve as proposal distributions for MCMC sampling of proteins since we should incorporate protein structure information into proposal distributions to obtain more accurate results;
Sine-skewed versions of these proposed models are introduced to meet the increasing demand for the modelling of asymmetric toroidal data;
The marginals of the new models lead to new multimodal circular distributions.

The remainder of this paper is organised as follows. Section 2 introduces two new distributions emanating from the sine and cosine models in (2) and (4), respectively. Section 3 introduces the sine-skewed versions of the newly proposed transformed sine and cosine models. Section 4 outlines the maximum likelihood method for obtaining the parameter estimates for the proposed models. Three real datasets, including information on angles in protein structures, are analysed in Section 5 to determine the performance of the proposed models relative to known competitors and to demonstrate their suitability as models for toroidal data. In Section 6, a simulation study is conducted for two reasons: (1) to explore the best method of generating samples from the newly transformed sine and cosine models, and (2) to evaluate the numerical method used to obtain the maximum likelihood estimates (MLEs) of the parameters.

2. Two New Models on the Torus

This section highlights two new flexible models for toroidal data, obtained by transforming the sine and cosine models in (2) and (4) via a Möbius transformation.

2.1. Transformed Cosine Model

Let

({\tilde{Θ}}_{1}, {\tilde{Θ}}_{2})

have pdf (4) with

ι_{1} = ι_{2} = 0

. Suppose that

(Θ_{1}, Θ_{2}) = (℧ ({\tilde{Θ}}_{1}, μ_{1}, ν_{1}, r_{1}), ℧ ({\tilde{Θ}}_{2}, μ_{2}, ν_{2}, r_{2})),

where

℧ (.)

is defined in (9),

μ_{1}, μ_{2}, ν_{1}, ν_{2} \in (- π, π]

,

r_{1}, r_{2} \in [0, 1)

and without loss of generality

ν_{1} = ν_{2} = 0

. Then,

(Θ_{1}, Θ_{2})

has a pdf of

\begin{matrix} f (θ_{1}, θ_{2}) = \frac{C (1 - r_{1}^{2}) (1 - r_{2}^{2})}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} \\ \times exp \{\frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} (C_{0} + C_{1} cos (θ_{1} - μ_{1}) \\ + C_{2} cos (θ_{2} - μ_{2}) + C_{3} cos (θ_{1} - μ_{1}) cos (θ_{2} - μ_{2}) + C_{4} sin (θ_{1} - μ_{1}) sin (θ_{2} - μ_{2}))\}, \end{matrix}

(10)

where

κ_{1}, κ_{2} \geq 0

,

κ_{3} \in R

, C is defined in (5), and

\begin{matrix} C_{0} & = - 2 κ_{1} r_{1} (1 + r_{2}^{2}) - 2 κ_{2} r_{2} (1 + r_{1}^{2}) - 4 κ_{3} r_{1} r_{2}, \\ C_{1} & = κ_{1} (1 + r_{1}^{2}) (1 + r_{2}^{2}) + 2 κ_{3} r_{2} (1 + r_{1}^{2}) + 4 κ_{2} r_{1} r_{2}, \\ C_{2} & = κ_{2} (1 + r_{1}^{2}) (1 + r_{2}^{2}) + 2 κ_{3} r_{1} (1 + r_{2}^{2}) + 4 κ_{1} r_{1} r_{2}, \\ C_{3} & = - 2 κ_{1} r_{2} (1 + r_{1}^{2}) - 2 κ_{2} r_{1} (1 + r_{2}^{2}) - κ_{3} (1 + r_{1}^{2}) (1 + r_{2}^{2}), \\ C_{4} & = - κ_{3} (1 - r_{1}^{2}) (1 - r_{2}^{2}), \end{matrix}

(11)

where

μ_{1}, μ_{2} \in [- π, π)

are location parameters,

κ_{1}, κ_{2} \geq 0

are concentration parameters,

κ_{3}

is the circular–circular dependence parameter, and

r_{1}

and

r_{2}

regulate the concentrations of the marginal distributions. In (10), when

r_{1} = r_{2} = 0

, the cosine model (4) is obtained. If

κ_{1} = κ_{2} = κ_{3} = 0

, (10) reduces to a bivariate wrapped Cauchy distribution with independent components, that is,

Θ_{1} ⊥ Θ_{2}

. The pdf and contour plots of (10) are shown in Figure 2 for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

and

r_{2}

, and reveal unimodal and bimodal behaviour.
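The coefficients in (11) can be checked against a direct composition of the cosine model (4) with the inverse of (9). The sketch below is illustrative only: it works with centred angles x_j = θ_j - μ_j, takes ν_1 = ν_2 = 0, compares exponents only (the constant C and the Jacobian factor are left out), and uses hypothetical function names.

```python
import math


def cosine_coefficients(k1, k2, k3, r1, r2):
    """Coefficients C_0, ..., C_4 of the transformed cosine model, as listed in (11)."""
    a1, a2 = 1.0 + r1 * r1, 1.0 + r2 * r2
    c0 = -2 * k1 * r1 * a2 - 2 * k2 * r2 * a1 - 4 * k3 * r1 * r2
    c1 = k1 * a1 * a2 + 2 * k3 * r2 * a1 + 4 * k2 * r1 * r2
    c2 = k2 * a1 * a2 + 2 * k3 * r1 * a2 + 4 * k1 * r1 * r2
    c3 = -2 * k1 * r2 * a1 - 2 * k2 * r1 * a2 - k3 * a1 * a2
    c4 = -k3 * (1 - r1 * r1) * (1 - r2 * r2)
    return c0, c1, c2, c3, c4


def exponent_from_coefficients(x1, x2, k1, k2, k3, r1, r2):
    """Exponent of (10) at centred angles x_j = theta_j - mu_j."""
    c0, c1, c2, c3, c4 = cosine_coefficients(k1, k2, k3, r1, r2)
    d1 = 1 + r1 * r1 - 2 * r1 * math.cos(x1)
    d2 = 1 + r2 * r2 - 2 * r2 * math.cos(x2)
    num = (c0 + c1 * math.cos(x1) + c2 * math.cos(x2)
           + c3 * math.cos(x1) * math.cos(x2) + c4 * math.sin(x1) * math.sin(x2))
    return num / (d1 * d2)


def exponent_by_composition(x1, x2, k1, k2, k3, r1, r2):
    """Same exponent, obtained by mapping back with the inverse of (9) (nu = 0)."""
    t1 = 2.0 * math.atan((1 + r1) / (1 - r1) * math.tan(0.5 * x1))
    t2 = 2.0 * math.atan((1 + r2) / (1 - r2) * math.tan(0.5 * x2))
    return k1 * math.cos(t1) + k2 * math.cos(t2) - k3 * math.cos(t1 - t2)
```

The two routes should agree for any centred angles strictly inside (-π, π), and for r_1 = r_2 = 0 the coefficients reduce to (0, κ_1, κ_2, -κ_3, -κ_3), which recovers the exponent of the cosine model (4).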

Figure 2. Pdf and contour plots of the transformed cosine model (10) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

r_{2}

.

Proposition 1.

Assuming the transformed cosine model (10), when

r_{1}, r_{2} \to 0

, then

(Θ_{1}, Θ_{2})

has approximately a bivariate normal distribution if and only if

κ_{3} \leq \frac{κ_{1} κ_{2}}{κ_{1} + κ_{2}}

.

Proof.

See Appendix A. □

In the following, the marginal pdf and conditional pdf of the transformed cosine model (10) and their properties are discussed. The marginal pdf of

Θ_{1}

for the transformed cosine model in (10) is as follows:

\begin{matrix} f_{Θ_{1}} (θ_{1}) = \frac{2 π C (1 - r_{1}^{2})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})} I_{0} (h (θ_{1})) exp \{\frac{κ_{1} (1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 κ_{1} r_{1}}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}\}, \end{matrix}

(12)

where

h (θ_{1}) = {\{κ_{2}^{2} + κ_{3}^{2} - \frac{2 κ_{2} κ_{3} ((1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 r_{1})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}\}}^{1 / 2},

(13)

and C is as defined in (5). The marginal pdf of

Θ_{1}

in (12) is symmetric about

μ_{1}

. For small values of

κ_{3}

, it approximates the transformed von Mises distribution [31], and for

r_{1} = 0

it simplifies to the marginal pdf of the cosine model [14]. It is clear that for

r_{1} = 0

and small values of

κ_{3}

, the von Mises distribution is approximated. If

κ_{1} = κ_{2} = κ_{3} = 0

in (12), then the Möbius-transformed uniform distribution is obtained. For

κ_{1} = κ_{2} = κ_{3} = r_{1} = 0

, the distribution is uniform. When

κ_{3} = 0

in (12), the distribution is the transformed von Mises distribution [31], and when

κ_{3} = r_{1} = 0

, the von Mises distribution is obtained. The plots of this generalized marginal pdf of

Θ_{1}

are shown in Figure 3 (left) for

μ_{1} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}

and

r_{1}

, reflecting unimodal and bimodal graphs. In the following corollary, the modality of the marginal density function of

Θ_{1}

is addressed.

Figure 3. Plots of the marginal pdf of

Θ_{1}

in (12) (left) and in (18) (right) for

μ_{1} = 0

and different parameter values.

Corollary 1.

The marginal distribution of

Θ_{1}

in (12) is symmetric around

θ_{1} = μ_{1}

and unimodal (with mode at

μ_{1}

) if and only if

\frac{A (∣ κ_{2} - κ_{3} ∣)}{∣ κ_{2} - κ_{3} ∣} \leq (\frac{2 r_{1} {(1 - r_{1})}^{2}}{{(1 - r_{1}^{2})}^{2}} + κ_{1}) / (κ_{2} κ_{3})

, where

A (κ) = I_{1} (κ) / I_{0} (κ)

. Moreover, the marginal distribution of

Θ_{1}

in (12) is bimodal (with the modes at

μ_{1} - θ_{1}^{*}

and

μ_{1} + θ_{1}^{*}

) if and only if

\frac{A (∣ κ_{2} - κ_{3} ∣)}{∣ κ_{2} - κ_{3} ∣} > (\frac{2 r_{1} {(1 - r_{1})}^{2}}{{(1 - r_{1}^{2})}^{2}} + κ_{1}) / (κ_{2} κ_{3})

, and

θ_{1}^{*}

is the root of

κ_{2} κ_{3} {(1 - r_{1}^{2})}^{2} A (h (θ_{1}^{*})) / h (θ_{1}^{*}) - 2 r_{1} (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1}^{*} - μ_{1})) - κ_{1} {(1 - r_{1}^{2})}^{2} = 0

, where

h (θ)

is as defined in (13).

Proof.

See Appendix A. □

The conditional pdf

f (θ_{2} ∣ Θ_{1} = θ_{1})

results in the transformed von Mises distribution [31] given by the following:

\begin{matrix} f (θ_{2} ∣ Θ_{1} = θ_{1}) = \frac{1 - r_{2}^{2}}{2 π I_{0} (h (θ_{1}))} \frac{1}{1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2})} \\ \times exp \{\frac{h (θ_{1}) cos τ ((1 + r_{2}^{2}) cos (θ_{2} - μ_{2}) - 2 r_{2}) + h (θ_{1}) sin τ (1 - r_{2}^{2}) sin (θ_{2} - μ_{2})}{1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2})}\}, \end{matrix}

(14)

where

h (θ_{1})

is as defined in (13), and

tan τ = \frac{- κ_{3} (1 - r_{1}^{2}) sin (θ_{1} - μ_{1})}{κ_{2} (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) - κ_{3} ((1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 r_{1})}

(15)

Note that for

r_{1} = r_{2} = 0

, (14) simplifies to the von Mises distribution with the parameters

τ

and

h (θ_{1})

in (13).

2.2. Transformed Sine Model

Let

({\tilde{Θ}}_{1}, {\tilde{Θ}}_{2})

have a bivariate pdf (2), with

ι_{1} = ι_{2} = 0

. Suppose that

(Θ_{1}, Θ_{2}) = (℧ ({\tilde{Θ}}_{1}, μ_{1}, ν_{1}, r_{1}), ℧ ({\tilde{Θ}}_{2}, μ_{2}, ν_{2}, r_{2})),

where

℧ (.)

is as defined in (9),

μ_{1}, μ_{2}, ν_{1}, ν_{2} \in (- π, π]

,

r_{1}, r_{2} \in [0, 1)

, and without loss of generality

ν_{1} = ν_{2} = 0

. Then,

(Θ_{1}, Θ_{2})

has a pdf as follows:

\begin{matrix} f (θ_{1}, θ_{2}) = \frac{C (1 - r_{1}^{2}) (1 - r_{2}^{2})}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} \\ \times exp \{\frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} (C_{0} + C_{1} cos (θ_{1} - μ_{1}) \\ + C_{2} cos (θ_{2} - μ_{2}) + C_{3} cos (θ_{1} - μ_{1}) cos (θ_{2} - μ_{2}) + C_{4} sin (θ_{1} - μ_{1}) sin (θ_{2} - μ_{2}))\}, \end{matrix}

(16)

where

κ_{1}, κ_{2} \geq 0

,

κ_{3} \in R

, C is as defined in (3), and

\begin{matrix} C_{0} & = - 2 κ_{1} r_{1} (1 + r_{2}^{2}) - 2 κ_{2} r_{2} (1 + r_{1}^{2}), \\ C_{1} & = κ_{1} (1 + r_{1}^{2}) (1 + r_{2}^{2}) + 4 κ_{2} r_{1} r_{2}, \\ C_{2} & = κ_{2} (1 + r_{1}^{2}) (1 + r_{2}^{2}) + 4 κ_{1} r_{1} r_{2}, \\ C_{3} & = - 2 κ_{1} r_{2} (1 + r_{1}^{2}) - 2 κ_{2} r_{1} (1 + r_{2}^{2}), \\ C_{4} & = κ_{3} (1 - r_{1}^{2}) (1 - r_{2}^{2}), \end{matrix}

(17)

where

μ_{1}, μ_{2} \in [- π, π)

are location parameters,

κ_{1}, κ_{2} \geq 0

are concentration parameters,

κ_{3}

is the circular–circular dependence parameter, and

r_{1}

and

r_{2}

regulate the concentrations of the marginal distributions. If

r_{1} = r_{2} = 0

in (16), then the sine model in (2) follows. The pdf and contour plots of (16) are shown in Figure 4 for

μ_{1} = μ_{2} = 0

and for different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

and

r_{2}

. As can be seen, this transformed sine pdf (16) can have both unimodal and bimodal forms.

Figure 4. Pdf and contour plots of the transformed sine model (16) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

r_{2}

.

Proposition 2.

Assuming the transformed sine model in (16), when

r_{1}, r_{2} \to 0

, then

(Θ_{1}, Θ_{2})

has an approximately bivariate normal distribution if and only if

κ_{3}^{2} < κ_{1} κ_{2}

.

Proof.

The proof is similar to that of Proposition 1, using the results in [13]. □

In this case, the marginal pdf of

Θ_{1}

for the transformed sine model in (16) is as follows:

\begin{matrix} f_{Θ_{1}} (θ_{1}) = \frac{2 π C (1 - r_{1}^{2})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})} I_{0} (h (θ_{1})) exp \{\frac{κ_{1} (1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 κ_{1} r_{1}}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}\}, \end{matrix}

(18)

where

h (θ_{1}) = {\{κ_{2}^{2} + {(\frac{κ_{3} (1 - r_{1}^{2})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})})}^{2} {sin}^{2} (θ_{1} - μ_{1})\}}^{1 / 2},

(19)

and C is as defined in (3). The marginal pdf of

Θ_{1}

is symmetric around

μ_{1}

. If

κ_{3} = 0

, the distribution is the transformed von Mises distribution [31]. If

r_{1} = 0

in (18), the marginal distribution of the sine model [13] is obtained. The plots of the marginal pdf of

Θ_{1}

in (18) are shown in Figure 3 (right) for

μ_{1} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}

, and

r_{1}

. As can be seen, the distribution can be both unimodal and bimodal. In the following corollary, the modality of the marginal pdf of

Θ_{1}

in (18) is explored.

Corollary 2.

The marginal distribution of

Θ_{1}

in (18) is symmetric around

θ_{1} = μ_{1}

and unimodal (with mode at

μ_{1}

) if and only if

\frac{A (κ_{2})}{κ_{2}} \leq (\frac{2 r_{1} {(1 - r_{1})}^{2}}{{(1 - r_{1}^{2})}^{2}} + κ_{1}) / κ_{3}^{2}

, where

A (κ) = I_{1} (κ) / I_{0} (κ)

. Moreover, the marginal distribution of

Θ_{1}

in (18) is bimodal (with the modes at

μ_{1} - θ_{1}^{*}

and

μ_{1} + θ_{1}^{*}

) if and only if

\frac{A (κ_{2})}{κ_{2}} > (\frac{2 r_{1} {(1 - r_{1})}^{2}}{{(1 - r_{1}^{2})}^{2}} + κ_{1}) / κ_{3}^{2}

, and

θ_{1}^{*}

is the root of

κ_{3}^{2} {(1 - r_{1}^{2})}^{2} cos θ_{1}^{*} A (h (θ_{1}^{*})) / h (θ_{1}^{*}) - 2 r_{1} (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1}^{*} - μ_{1})) - κ_{1} {(1 - r_{1}^{2})}^{2} = 0

, where

h (θ)

is as defined in (19).

Proof.

See Appendix A. □

The conditional pdf

f (θ_{2} ∣ Θ_{1} = θ_{1})

is given by:

\begin{matrix} f (θ_{2} ∣ Θ_{1} = θ_{1}) = \frac{1 - r_{2}^{2}}{2 π I_{0} (h (θ_{1}))} \frac{1}{1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2})} \\ \times exp \{\frac{h (θ_{1}) cos τ ((1 + r_{2}^{2}) cos (θ_{2} - μ_{2}) - 2 r_{2}) + h (θ_{1}) sin τ (1 - r_{2}^{2}) sin (θ_{2} - μ_{2})}{1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2})}\}, \end{matrix}

(20)

where

h (θ_{1})

is as defined in (19), and

tan τ = \frac{κ_{3}}{κ_{2}} \frac{(1 - r_{1}^{2}) sin (θ_{1} - μ_{1})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}

(21)

Interestingly, the conditional distribution is the transformed von Mises distribution [31]. When

r_{1} = r_{2} = 0

in (20), the von Mises distribution with parameters

τ

and

h (θ_{1})

is obtained.
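Because the conditional distribution in (20) is a Möbius-transformed von Mises distribution, one possible way to draw Θ_2 given Θ_1 = θ_1 is to sample a von Mises angle with mean direction τ and concentration h(θ_1) and then apply (9) with ν = 0. The sketch below is an illustration under that reading, not the sampling scheme evaluated in the paper; the function names are hypothetical, and τ is resolved with `atan2` using h cos τ = κ_2 and h sin τ = κ_3 sin of the back-transformed first angle, which is one branch consistent with (19) and (21).

```python
import math
import random


def conditional_parameters(theta1, mu1, k2, k3, r1):
    """Return h(theta_1) from (19) and tau from (21) for the transformed sine model."""
    x1 = theta1 - mu1
    d1 = 1.0 + r1 * r1 - 2.0 * r1 * math.cos(x1)
    s = k3 * (1.0 - r1 * r1) * math.sin(x1) / d1  # equals h * sin(tau)
    h = math.hypot(k2, s)
    tau = math.atan2(s, k2)                       # uses h * cos(tau) = k2 >= 0
    return h, tau


def sample_theta2_given_theta1(theta1, mu1, mu2, k2, k3, r1, r2, rng):
    """Draw Theta_2 | Theta_1 = theta1 by transforming a von Mises draw with (9)."""
    h, tau = conditional_parameters(theta1, mu1, k2, k3, r1)
    # random.vonmisesvariate expects a mean angle in [0, 2*pi) and returns one too.
    t = rng.vonmisesvariate(tau % (2.0 * math.pi), h)
    half = 0.5 * ((t + math.pi) % (2.0 * math.pi) - math.pi)
    w = (1.0 - r2) / (1.0 + r2)
    theta2 = mu2 + 2.0 * math.atan2(w * math.sin(half), math.cos(half))
    return (theta2 + math.pi) % (2.0 * math.pi) - math.pi
```

A call such as `sample_theta2_given_theta1(0.3, 0.3, -0.8, 2.0, 1.0, 0.2, 0.3, random.Random(1))` returns one angle in [-π, π); repeated draws at θ_1 = μ_1 should scatter symmetrically around μ_2, since τ = 0 in that case.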

3. Sine-Skewed Transformed Sine and Cosine Distributions

In practice, it is possible to have skewed toroidal datasets, despite the well-known toroidal distributions being pointwise symmetric. Therefore, it would be interesting to extend this methodology to the recent model of Ameijeiras-Alonso and Ley in [24]. In this section, the skewed versions of the proposed transformed sine and cosine models in (16) and (10) are introduced. In addition, Abe and Pewsey’s skew model in [37] is applied to extend models on the circle manifold using marginal density functions.

By substituting (10) in (7), the sine-skewed transformed cosine (

B S S T C

) distribution can be defined as follows:

\begin{matrix} f_{B S S T C} (θ_{1}, θ_{2}) = \frac{C (1 - r_{1}^{2}) (1 - r_{2}^{2})}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} \\ \times exp \{\frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} (C_{0} + C_{1} cos (θ_{1} - μ_{1}) \\ + C_{2} cos (θ_{2} - μ_{2}) + C_{3} cos (θ_{1} - μ_{1}) cos (θ_{2} - μ_{2}) + C_{4} sin (θ_{1} - μ_{1}) sin (θ_{2} - μ_{2}))\} \\ \times (1 + λ_{1} sin (θ_{1} - μ_{1}) + λ_{2} sin (θ_{2} - μ_{2})), \end{matrix}

(22)

where

κ_{1}, κ_{2} \geq 0

,

κ_{3} \in R

, C is as defined in (5), and

C_{0}

–

C_{4}

are as defined in (11). The pdf and contour plots of the sine-skewed transformed cosine model for

κ_{1} = 0.2

,

κ_{2} = 0.3

,

κ_{3} = 0.2

,

r_{1} = 0.2

,

r_{2} = 0.1

,

μ_{1} = μ_{2} = 0

, and different values of

λ_{1}

and

λ_{2}

are shown in Figure 5 (top).

Figure 5. Pdf and contour plots of the sine-skewed transformed cosine model in (22) (top) and the sine-skewed transformed sine model in (24) (bottom) for different values of

λ_{1}

and

λ_{2}

.

The marginal pdf of

Θ_{1}

for

B S S T C

in (22) is as follows:

\begin{matrix} f_{Θ_{1}; B S S} (θ_{1}) = \frac{C (1 - r_{1}^{2})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})} exp \{\frac{κ_{1} (1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 κ_{1} r_{1}}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}\} \\ \times \{2 π I_{0} (h (θ_{1})) (1 + λ_{1} (1 - r_{1}^{2}) sin (θ_{1} - μ_{1}) / (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1}))) \\ + λ_{2} A (h (θ_{1})) cos (τ + μ_{2})\} \end{matrix}

(23)

where

h (θ_{1})

and

τ

are obtained from (13) and (15), respectively. When

λ_{2} = 0

,

f_{Θ_{1}; B S S} (θ_{1})

is the Möbius-transformed sine-skewed version [37] of the marginal pdf of the cosine model. The plots of the skewed pdf in (23) are shown in Figure 6 (left) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}, λ_{1}

, and

λ_{2}

. As can be observed, the distribution can be both unimodal and bimodal.

Figure 6. Plots of the marginal pdf of

Θ_{1}

for

B S S T C

(left) and

B S S T S

(right) for

μ_{1} = μ_{2} = 0

and different parameter values.

Similarly, from (16) and (7), the sine-skewed transformed sine (

B S S T S

) distribution can be obtained as follows:

\begin{matrix} f_{B S S T S} (θ_{1}, θ_{2}) = \frac{C (1 - r_{1}^{2}) (1 - r_{2}^{2})}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} \\ \times exp \{\frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2} - μ_{2}))} (C_{0} + C_{1} cos (θ_{1} - μ_{1}) \\ + C_{2} cos (θ_{2} - μ_{2}) + C_{3} cos (θ_{1} - μ_{1}) cos (θ_{2} - μ_{2}) + C_{4} sin (θ_{1} - μ_{1}) sin (θ_{2} - μ_{2}))\} \\ \times (1 + λ_{1} sin (θ_{1} - μ_{1}) + λ_{2} sin (θ_{2} - μ_{2})), \end{matrix}

(24)

where

κ_{1}, κ_{2} \geq 0

,

κ_{3} \in R

, C is as defined in (3), and

C_{0}

–

C_{4}

are defined in (17). The pdf and contour plots of the sine-skewed transformed sine model for

κ_{1} = 2

,

κ_{2} = 0.6

,

κ_{3} = 2

,

r_{1} = 0.1

,

r_{2} = 0.1

,

μ_{1} = μ_{2} = 0

, and different values of

λ_{1}

and

λ_{2}

are shown in Figure 5 (bottom).

The marginal pdf of

Θ_{1}

for

B S S T S

is of the same density as in (23), where

h (θ_{1})

and

τ

are obtained from (19) and (21). When

λ_{2} = 0

,

f_{Θ_{1}; B S S} (θ_{1})

is the Möbius-transformed sine-skewed version [37] of the marginal pdf of the sine model. The plots of the skewed pdf in (23) are shown in Figure 6 (right) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}, λ_{1}

, and

λ_{2}

. Figure 6 illustrates that the distribution can have both unimodal and bimodal forms.

To expand the skewed circular models, the following models are introduced based on the k sine-skewed model of [37]. The skewed version of the marginal distribution of

Θ_{1}

in (12) is the following:

\begin{matrix} f_{S S} (θ_{1}) & = \frac{2 π C (1 - r_{1}^{2})}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})} I_{0} (h (θ_{1})) exp \{\frac{κ_{1} (1 + r_{1}^{2}) cos (θ_{1} - μ_{1}) - 2 κ_{1} r_{1}}{1 + r_{1}^{2} - 2 r_{1} cos (θ_{1} - μ_{1})}\} \\ \times (1 + λ sin (k (θ_{1} - μ_{1}))), \end{matrix}

(25)

where C is as defined in (5),

h (θ_{1})

is as defined in (13), and

- 1 \leq λ \leq 1

.

λ > 0

leads to left-skewed distributions, and

λ < 0

provides right-skewed distributions. The plots of the skewed pdf in (25) are shown in Figure 7 (left) for

k = 1

,

μ_{1} = 0

, and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

λ

.

Figure 7. Plots of the sine-skewed versions of marginal pdfs of

Θ_{1}

in (12) (left) and (18) (right) for

k = 1

,

μ_{1} = 0

and different parameter values.

Similarly, the sine-skewed version [37] of the marginal pdf of

Θ_{1}

in (18) is of the same density as in (25), where C is as defined in (3),

h (θ_{1})

is as defined in (19), and

- 1 \leq λ \leq 1

. The plots of the sine-skewed version of the marginal pdf in (18) are shown in Figure 7 (right) for

k = 1

,

μ_{1} = 0

, and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

λ

. As can be seen, the distribution can be either unimodal or bimodal. Multimodal shapes result for

k > 1

.
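To see how k affects the shape, the following Python sketch applies the factor 1 + λ sin(k(θ_1 − μ_1)) from (25) to a symmetric circular density and counts local maxima on a grid. A von Mises-type kernel is used as a hypothetical stand-in for the symmetric marginals in (12) and (18), whose constants are defined elsewhere in the paper; the helper names and the parameter values are illustrative assumptions.

```python
import math

def k_sine_skew(base_pdf, mu, lam, k=1):
    """Multiply a density symmetric about mu by 1 + lam * sin(k * (theta - mu))."""
    if not -1.0 <= lam <= 1.0:
        raise ValueError("need -1 <= lam <= 1")
    return lambda t: base_pdf(t) * (1.0 + lam * math.sin(k * (t - mu)))

def count_modes(pdf, m=2000):
    """Count strict local maxima of pdf on an m-point grid over the circle."""
    h = 2.0 * math.pi / m
    vals = [pdf(-math.pi + i * h) for i in range(m)]
    # vals[i - 1] wraps around for i = 0, so the grid is treated as circular
    return sum(1 for i in range(m)
               if vals[i] > vals[i - 1] and vals[i] > vals[(i + 1) % m])

# Hypothetical symmetric base: unnormalised von Mises-type kernel (the
# normalising constant does not affect the number of modes).
def base(t):
    return math.exp(0.5 * math.cos(t))

modes_k1 = count_modes(k_sine_skew(base, 0.0, 0.8, k=1))
modes_k3 = count_modes(k_sine_skew(base, 0.0, 0.9, k=3))
```

Comparing `modes_k1` with `modes_k3` gives a quick numerical check of how the number of modes grows with k for a given base density.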

4. Maximum Likelihood Estimation

In this section, the maximum likelihood method is outlined to obtain the estimates of parameters for both the transformed cosine and sine models. Suppose that

ζ = {(μ_{1}, μ_{2}, κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2})}^{T}

are the parameters associated with the transformed cosine model (10). The log-likelihood function of the transformed cosine model is represented as follows:

\begin{matrix} l (ζ) = n log C + n log (1 - r_{1}^{2}) + n log (1 - r_{2}^{2}) - \sum_{i = 1}^{n} log (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1 i} - μ_{1})) \\ - \sum_{i = 1}^{n} log (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2 i} - μ_{2})) + \sum_{i = 1}^{n} \frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1 i} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2 i} - μ_{2}))} \\ \times (C_{0} + C_{1} cos (θ_{1 i} - μ_{1}) + C_{2} cos (θ_{2 i} - μ_{2}) + C_{3} cos (θ_{1 i} - μ_{1}) cos (θ_{2 i} - μ_{2}) \\ + C_{4} sin (θ_{1 i} - μ_{1}) sin (θ_{2 i} - μ_{2})), \end{matrix}

(26)

where C is as defined in (5), and

C_{0}

–

C_{4}

are as defined in (11). The MLE of the parameters,

\hat{ζ} = {({\hat{μ}}_{1}, {\hat{μ}}_{2}, {\hat{κ}}_{1}, {\hat{κ}}_{2}, {\hat{κ}}_{3}, {\hat{r}}_{1}, {\hat{r}}_{2})}^{T}

, can be determined by maximizing (26) with respect to

ζ = {(μ_{1}, μ_{2}, κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2})}^{T}

.

Supposing that

ζ = {(μ_{1}, μ_{2}, κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2})}^{T}

are the parameters associated with the transformed sine model (16), the log-likelihood function of the transformed sine model can be represented as follows:

\begin{matrix} l (ζ) = n log C + n log (1 - r_{1}^{2}) + n log (1 - r_{2}^{2}) - \sum_{i = 1}^{n} log (1 + r_{1}^{2} - 2 r_{1} cos (θ_{1 i} - μ_{1})) \\ - \sum_{i = 1}^{n} log (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2 i} - μ_{2})) + \sum_{i = 1}^{n} \frac{1}{(1 + r_{1}^{2} - 2 r_{1} cos (θ_{1 i} - μ_{1})) (1 + r_{2}^{2} - 2 r_{2} cos (θ_{2 i} - μ_{2}))} \\ \times (C_{0} + C_{1} cos (θ_{1 i} - μ_{1}) + C_{2} cos (θ_{2 i} - μ_{2}) + C_{3} cos (θ_{1 i} - μ_{1}) cos (θ_{2 i} - μ_{2}) \\ + C_{4} sin (θ_{1 i} - μ_{1}) sin (θ_{2 i} - μ_{2})), \end{matrix}

(27)

where C is as defined in (3), and

C_{0}

–

C_{4}

are as defined in (17). The maximization of (27) with respect to

ζ = {(μ_{1}, μ_{2}, κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2})}^{T}

results in the MLE of the parameters,

\hat{ζ} = {({\hat{μ}}_{1}, {\hat{μ}}_{2}, {\hat{κ}}_{1}, {\hat{κ}}_{2}, {\hat{κ}}_{3}, {\hat{r}}_{1}, {\hat{r}}_{2})}^{T}

.
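The log-likelihoods (26) and (27) share one functional form and differ only in how C and the coefficients C_0 to C_4 depend on ζ. A minimal Python sketch of that shared form is given below. Because (3), (5), (11), and (17) are defined earlier in the paper, the sketch takes log C and the five coefficients as inputs supplied by the caller instead of re-deriving them; the function name `transformed_loglik` and its argument layout are hypothetical.

```python
import math

def transformed_loglik(data, mu1, mu2, r1, r2, log_C, coef):
    """Log-likelihood of the common form in (26) and (27).

    data  : iterable of (theta1, theta2) pairs in radians
    log_C : log of the normalising constant C, from (5) or (3)
    coef  : (C0, C1, C2, C3, C4), from (11) or (17), computed by the caller
    """
    c0, c1, c2, c3, c4 = coef
    n = 0
    total = 0.0
    for t1, t2 in data:
        a1, a2 = t1 - mu1, t2 - mu2
        d1 = 1.0 + r1 ** 2 - 2.0 * r1 * math.cos(a1)
        d2 = 1.0 + r2 ** 2 - 2.0 * r2 * math.cos(a2)
        inner = (c0 + c1 * math.cos(a1) + c2 * math.cos(a2)
                 + c3 * math.cos(a1) * math.cos(a2)
                 + c4 * math.sin(a1) * math.sin(a2))
        total += -math.log(d1) - math.log(d2) + inner / (d1 * d2)
        n += 1
    # terms that do not depend on the individual observations
    return n * (log_C + math.log(1.0 - r1 ** 2) + math.log(1.0 - r2 ** 2)) + total

# Usage with placeholder coefficients (illustrative values, not from the paper)
example = transformed_loglik([(0.1, -0.2), (1.0, 0.5)], 0.0, 0.0, 0.3, 0.1,
                             log_C=0.0, coef=(0.5, 1.0, 0.8, 0.2, 0.4))
```

Passing the negative of this function to a numerical optimizer then yields the estimates, once the mapping from ζ to C and C_0 to C_4 has been coded from the earlier definitions.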

By setting the partial derivatives of the log-likelihood functions in (26) and (27) with respect to

ζ

to zero, the MLEs of

ζ = {(μ_{1}, μ_{2}, κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2})}^{T}

can be derived for the transformed cosine and sine models. Because no closed-form expressions exist, numerical methods are needed to obtain the MLEs. Operationally, the maximization of (26) and (27) with respect to

ζ

is carried out with the DEoptim package in R [38], which implements the differential evolution (DE) algorithm [39]. Extensive studies have validated the performance of DE as a global optimization algorithm for continuous numerical minimization problems [40]. The same package was also used to obtain the MLEs of the parameters for the sine-skewed versions and for the mixtures of transformed cosine and sine models.
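For readers unfamiliar with DE, the following Python sketch shows a bare-bones DE/rand/1/bin scheme in the spirit of [39]. It is not the DEoptim implementation: the control parameters (population size, F, CR, number of generations), the in-place population update, and the toy objective are all assumptions chosen for illustration. In the setting of this section the objective would be the negative of (26) or (27).

```python
import random

def differential_evolution(objective, bounds, pop_size=30, F=0.8, CR=0.9,
                           generations=300, seed=0):
    """Minimise `objective` over a box using a simple DE/rand/1/bin scheme."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    cost = [objective(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # three distinct members, all different from the target i
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated coordinate
            trial = []
            for j, (lo, hi) in enumerate(bounds):
                if j == j_rand or rng.random() < CR:
                    v = pop[a][j] + F * (pop[b][j] - pop[c][j])
                    v = min(max(v, lo), hi)  # clip back into the box
                else:
                    v = pop[i][j]
                trial.append(v)
            trial_cost = objective(trial)
            if trial_cost <= cost[i]:  # greedy selection
                pop[i], cost[i] = trial, trial_cost
    best = min(range(pop_size), key=cost.__getitem__)
    return pop[best], cost[best]

# Toy objective with a known minimum at (1, -2); a stand-in for a negative
# log-likelihood, used only to show the call pattern.
def toy(x):
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2

best_x, best_cost = differential_evolution(toy, [(-5.0, 5.0), (-5.0, 5.0)])
```

A population-based search of this kind needs only function evaluations and box constraints, which suits likelihoods such as (26) and (27) where the parameters are bounded (for example, 0 ≤ r_1, r_2 < 1) and the surface may have several local optima.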

5. Protein Structure Application

To demonstrate the performance of the proposed models in modelling the dihedral angles and the planar and torsion angles in a protein structure, three datasets are considered, which are available at http://scop.mrc-lmb.cam.ac.uk/scop/. SCOP.1 contains 10,188 planar and torsion angles

(θ, τ)

(see Figure 1A) for about 63 protein domains that were randomly selected from three remote protein classes in the structural classification of proteins (SCOP). SCOP.3 includes 4607 planar and torsion angles

(θ, τ)

from approximately 40 protein chains, and the TCBIG.VAL.right set consists of 2673 dihedral angles

(ϕ, ψ)

(see Figure 1B) [41]. The Ramachandran plots [1] for each dataset are presented in Figure 8. As can be seen, the datasets are at least bimodal, so bimodal or mixture distributions are natural candidates for fitting.

Figure 8. Ramachandran plots for each dataset.

The transformed sine and cosine models in (16) and (10) were fitted to the SCOP.1 and SCOP.3 datasets, along with their competitors: the sine model and a mixture of sine models (see (2); [13]), the cosine model and a mixture of cosine models (see (4); [14]), and a mixture of bivariate wrapped Cauchy models (see (8); [33]). A mixture distribution with two components was investigated as follows:

g_{M} (θ_{1}, θ_{2}) = p f_{1} (θ_{1}, θ_{2}) + (1 - p) f_{2} (θ_{1}, θ_{2})

where

p \in [0, 1]

and

f_{1} (., .)

and

f_{2} (., .)

are two toroidal distributions. Parameter estimation, identifiability, and the choice of the number of mixing components and parameters are among the well-known challenges in the application of mixture distributions. Furthermore, when the empirical density of the data is highly asymmetric, fitting a mixture can result in misleading statistical inference about the parameters [42]. Multimodal distributions, which accommodate several modes within a single component, can provide a better fit. This is observed here using the bimodal transformed sine model.
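The two-component mixture g_M can be sketched in Python as follows. The component densities here are products of cardioid densities, chosen only because they are simple, properly normalised toroidal densities; they are hypothetical stand-ins for the sine, cosine, or wrapped Cauchy components used in the paper, and the helper names are illustrative.

```python
import math

def mixture_pdf(p, f1, f2):
    """Two-component mixture p * f1 + (1 - p) * f2 of toroidal densities."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("mixing proportion p must lie in [0, 1]")
    return lambda t1, t2: p * f1(t1, t2) + (1.0 - p) * f2(t1, t2)

def cardioid_product(m1, m2, rho1, rho2):
    """Product of two cardioid densities (requires |rho| <= 1/2); stand-in component."""
    def pdf(t1, t2):
        return ((1.0 + 2.0 * rho1 * math.cos(t1 - m1))
                * (1.0 + 2.0 * rho2 * math.cos(t2 - m2)) / (4.0 * math.pi ** 2))
    return pdf

def integrate_torus(pdf, m=100):
    """Midpoint rule on an m x m grid over [-pi, pi)^2."""
    h = 2.0 * math.pi / m
    pts = [-math.pi + (i + 0.5) * h for i in range(m)]
    return sum(pdf(a, b) for a in pts for b in pts) * h * h

f1 = cardioid_product(-1.0, 2.0, 0.4, 0.3)
f2 = cardioid_product(1.5, -0.5, 0.2, 0.45)
g = mixture_pdf(0.35, f1, f2)
mass = integrate_torus(g)  # a convex combination of densities is again a density
```

When p equals 0 or 1 the mixture reduces to a single component, which is the boundary case that makes the mixing proportion hard to interpret in practice.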

The sine-skewed versions of the aforementioned distributions [24] form part of these evaluations. The results, including the MLEs of parameters, log-likelihood, Akaike information criterion (AIC), and Bayesian information criterion (BIC), are shown in Table 1 and Table 2. Based on these results, the bimodal transformed sine model in (16) provides the best fit for the data, and it performs better than the mixture models for these datasets. Based on the symmetry test of Ameijeiras-Alonso and Ley [24] and the log-likelihood values in Table 1 and Table 2, there is no evidence against the hypothesis that the underlying distributions for SCOP.1 and SCOP.3 are pointwise symmetric. The results of the mixture of transformed sine and the mixture of transformed cosine models are not reported in Table 1 and Table 2 because

\hat{p} \approx 1

, that is, the fitted mixture effectively reduces to a single component. Scatter plots of the data, together with contour plots of the fitted distributions, are provided in Figure 9 and Figure 10.
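The information criteria used for model comparison follow the standard definitions AIC = 2k − 2 log L and BIC = k log n − 2 log L, where k is the number of free parameters and n is the sample size. A short Python sketch is given below; the log-likelihood value in the usage line is a made-up placeholder, not a value from Table 1, while k = 7 corresponds to the seven parameters in ζ and n = 10,188 is the SCOP.1 sample size.

```python
import math

def aic(loglik, k):
    """Akaike information criterion for a model with k free parameters."""
    return 2.0 * k - 2.0 * loglik

def bic(loglik, k, n):
    """Bayesian information criterion for k free parameters and n observations."""
    return k * math.log(n) - 2.0 * loglik

# Placeholder log-likelihood (hypothetical), k = 7 parameters, n = 10188 points
example_aic = aic(-100.0, 7)
example_bic = bic(-100.0, 7, 10188)
```

Smaller values indicate a better trade-off between fit and complexity; since log n exceeds 2 for these sample sizes, BIC penalises each additional mixture component more heavily than AIC does.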

Table 1. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for SCOP.1 (

n = 10,188

).

Table 2. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for SCOP.3 (

n = 4607

).

Figure 9. Contour plots of fitted pdfs together with scatter plot for SCOP.1 (

n = 10,188

). The last row includes the proposed models.

Figure 10. Contour plots of fitted pdfs together with scatter plot for SCOP.3 (

n = 4607

). The last row includes the proposed models.

For the last dataset, TCBIG.VAL.right, the single-component distributions did not give good results, which suggests that a mixture model might offer a solution. Consequently, only mixtures of the aforementioned distributions were considered. For comparison, goodness-of-fit was evaluated for mixtures of transformed sine and cosine models and for mixtures of existing models. The results are listed in Table 3. As can be seen, the mixture of transformed sine models provides the best fit to the data. Scatter plots of the data and contour plots of the fitted distributions are shown in Figure 11.

Table 3. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for TCBIG.VAL.right (

n = 2673

).

Figure 11. Contour plots of fitted pdfs together with scatter plots for TCBIG.VAL.right (

n = 2673

). The last row includes the proposed models.

The kernel density plots of the three datasets and the best-fit models obtained for each dataset are shown in Figure 12. Comparing the contour levels of the kernel density estimates with those of the fitted models indicates that the proposed models fit the data closely.

Figure 12. Kernel density plots of the data, and the best-fit models.

6. Simulation Study

The authors of Ref. [16] explored suitable methods for generating samples from cosine (with positive interaction) and sine models. They found that both Gibbs and rejection sampling approaches performed well, but the latter was more efficient. To simulate a sample from the newly proposed transformed sine and transformed cosine distributions in (16) and (10), four packages in R, which are generally based on rejection sampling, including MCMCpack [43], gibbs.met [44], LearnBayes [45], and MHadaptive [46], were used and the results were compared. These packages are based on Metropolis sampling, random walk Metropolis sampling, Metropolis-Hastings MCMC sampling, and Gibbs sampling with Metropolis steps. First, a sample of size

n = 1000

was generated with each package from the transformed sine model in (16), with the parameters

κ_{1} = 2.1585

,

κ_{2} = 0.3489

,

κ_{3} = 3.1712

,

r_{1} = 0.6036

,

r_{2} = 0.0131

,

μ_{1} = 1.8573

, and

μ_{2} = 2.4321

(the best-fit model for the SCOP.1 dataset in the previous section). The results, including scatter plots of simulated samples with contour plots of the distribution, trace plots, and compare-partial plots [47], which use the last 10 percent of the chain, are shown in Figure 13. The runtime of each method is shown in Figure 14 (left) for a sample size of

n = 100

[48] (system: Intel(R) Core(TM) i7-8550U CPU @ 1.80 GHz, 8.00 GB RAM). Second, the MLEs of the parameters, together with the bias and the mean squared error (MSE) of the estimates, were calculated for each method using the Monte Carlo method, with 500 replications and n = 100 and n = 1000. The results are listed in Table 4.
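The samplers compared here all rely on an accept/reject step. As a rough illustration of the idea, and not of the internals of any of the R functions named above, the following Python sketch runs a random-walk Metropolis chain on the torus with a wrapped Gaussian proposal. The target is the untransformed sine model, written up to its normalising constant as exp{κ_1 cos(θ_1 − μ_1) + κ_2 cos(θ_2 − μ_2) + κ_3 sin(θ_1 − μ_1) sin(θ_2 − μ_2)}, used as a stand-in because the transformed density needs the coefficients in (17); the step size, burn-in length, and parameter values are assumptions.

```python
import math
import random

def wrap(angle):
    """Wrap an angle to the interval [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi

def log_sine_model(t1, t2, k1=2.0, k2=0.6, k3=1.0, mu1=0.0, mu2=0.0):
    """Unnormalised log-density of the untransformed sine model (stand-in target)."""
    return (k1 * math.cos(t1 - mu1) + k2 * math.cos(t2 - mu2)
            + k3 * math.sin(t1 - mu1) * math.sin(t2 - mu2))

def rw_metropolis_torus(log_target, n, step=0.8, burn_in=1000, seed=1):
    """Random-walk Metropolis on the torus; returns n draws and the acceptance rate."""
    rng = random.Random(seed)
    state = (0.0, 0.0)
    log_p = log_target(*state)
    draws, accepted = [], 0
    for it in range(burn_in + n):
        # symmetric proposal: Gaussian step wrapped back onto the torus
        proposal = (wrap(state[0] + rng.gauss(0.0, step)),
                    wrap(state[1] + rng.gauss(0.0, step)))
        log_q = log_target(*proposal)
        if rng.random() < math.exp(min(0.0, log_q - log_p)):
            state, log_p = proposal, log_q
            if it >= burn_in:
                accepted += 1
        if it >= burn_in:
            draws.append(state)
    return draws, accepted / n

draws, acc_rate = rw_metropolis_torus(log_sine_model, n=1000)
```

Only ratios of the target enter the acceptance step, so the normalising constant C never has to be evaluated during sampling; trace plots of `draws` play the same diagnostic role as those in Figure 13.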

Figure 13. Scatter, trace, and compare-partial plots of the simulated data from the transformed sine model using “gibbs_met” in the “gibbs.met” package (first row), “MCMCmetrop1R” in the “MCMCpack” package (second row), “met_gaussian” in the “gibbs.met” package (third row), “Metro_Hastings” in the “MHadaptive” package (fourth row), and “rwmetrop” in the “LearnBayes” package (fifth row).

Figure 14. Execution times for generating a sample size of

n = 100

from a transformed sine model (left) and a transformed cosine model (right) for each method.

Table 4. Maximum likelihood estimates of parameters and bias, and the MSE of the estimates for the simulated data obtained from each method.

Similarly, for the transformed cosine model in (10) with parameters

κ_{1} = 3.9891

,

κ_{2} = 0.6532

,

κ_{3} = 1.7911

,

r_{1} = 0.2305

,

r_{2} = 0.5046

,

μ_{1} = - 1.5651

, and

μ_{2} = 0.9878

, the aforementioned R packages were applied, first to generate a sample size of

n = 1000

. The results, including scatter plots of simulated samples with contour plots of the distribution, trace plots, and compare-partial plots [47], are shown in Figure 15. The runtime of each method is presented in Figure 14 (right) for a sample size of n = 100 [48]. Then, the MLEs of the parameters, together with the bias and the MSE of the estimates, were calculated for each method using the Monte Carlo method, with 500 replications and n = 100 and n = 1000. The results, listed in Table 4, support the selected approach for obtaining the MLEs of the parameters. As shown in Figure 14, MCMCmetrop1R is the fastest method and gibbs_met is the slowest. According to the results in Table 4, rejection sampling provides accurate results. Gibbs sampling with Metropolis steps (gibbs_met) is also precise despite its low speed. Bias and MSE decrease as n increases.
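The bias and MSE reported in Table 4 are Monte Carlo averages over replicated estimates. A minimal Python sketch of that bookkeeping is shown below. To keep it self-contained, the "estimator" is simply the sample mean of Gaussian draws, a toy stand-in for the simulate-then-fit step of the study; the true value 0.6, the seed, and the helper names are assumptions. For an angular parameter such as μ_1, the differences would additionally need to be wrapped to [−π, π).

```python
import random

def bias_and_mse(estimates, true_value):
    """Monte Carlo bias and mean squared error of replicated estimates."""
    m = len(estimates)
    bias = sum(estimates) / m - true_value
    mse = sum((e - true_value) ** 2 for e in estimates) / m
    return bias, mse

def replicate(n, true_value, replications=500, seed=0):
    """Toy stand-in for 'simulate a sample of size n, then estimate'."""
    rng = random.Random(seed)
    return [sum(rng.gauss(true_value, 1.0) for _ in range(n)) / n
            for _ in range(replications)]

bias_100, mse_100 = bias_and_mse(replicate(100, 0.6), 0.6)
bias_1000, mse_1000 = bias_and_mse(replicate(1000, 0.6), 0.6)
```

Comparing `mse_100` with `mse_1000` mirrors the comparison between the two sample sizes in Table 4.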

Figure 15. Scatter, trace, and compare-partial plots of the simulated data from the transformed cosine model using “gibbs_met” in the “gibbs.met” package (first row), “MCMCmetrop1R” in the “MCMCpack” package (second row), “met_gaussian” in the “gibbs.met” package (third row), “Metro_Hastings” in the “MHadaptive” package (fourth row), and “rwmetrop” in the “LearnBayes” package (fifth row).

7. Conclusions

In MCMC protein sampling for predicting the 3D structure, the results are more accurate when the proposal distribution is closer to the stationary distribution. A suitable proposal distribution can therefore be defined using the angles and bond lengths observed in natural proteins, and statistical distributions for modelling protein dihedral angles can be used as proposal distributions for MCMC protein sampling. We gave a brief overview of the existing symmetric models, (2) and (4), that form the basis of the models proposed in this paper. In addition, new Möbius transformation-induced toroidal distributions, together with skewed versions, were developed in this study as alternative proposal distributions for the MCMC sampling of proteins. We demonstrated their performance with three protein datasets of toroidal nature and graphically illustrated their flexible behaviour. The AIC and BIC confirmed the better performance of our proposed models in comparison with the existing models. These newly proposed models even outperformed mixtures of well-known models for modelling toroidal data. In comparison with the existing toroidal models, the proposed models reflect the protein structural information better and should be incorporated into proposal distributions. Lastly, to meet the need for sampling from the proposal distribution in the MCMC algorithm, suitable methods for generating samples from these new models were explored using different types of Metropolis sampling. In the future, one can investigate the use of the Möbius transformation to obtain new cylindrical distributions.

Author Contributions

Conceptualization, M.A. and A.B.; methodology, M.A., N.N.R., A.B. and W.-D.S.; validation, M.A., N.N.R. and A.B.; formal analysis, M.A., N.N.R. and A.B.; investigation, M.A., N.N.R., A.B. and W.-D.S.; writing—original draft preparation, N.N.R.; writing—review and editing, M.A., N.N.R. and A.B.; visualization, M.A., N.N.R., A.B. and W.-D.S.; supervision, M.A. and A.B.; project administration, N.N.R.; funding acquisition, N.N.R. and A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Research Foundation, grant numbers 71199, 109214, and 120839.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available at http://scop.mrc-lmb.cam.ac.uk/scop/ (accessed on 20 August 2020).

Acknowledgments

We would like to sincerely thank the three anonymous reviewers for their constructive comments that improved the paper. This work was based on research supported in part by the National Research Foundation (NRF) of South Africa, SARChI Research Chair UID: 71199; Ref.: IFR170227223754 grant No. 109214; Ref.: SRUG190308422768 grant No. 120839, STATOMET at the Department of Statistics at the University of Pretoria, and DSI-NRF Centre of Excellence in Mathematical and Statistical Sciences (CoE-MaSS), South Africa. The opinions expressed and conclusions arrived at are those of the authors and are not necessarily attributed to the CoE-MaSS or the NRF.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of Proposition 1

When

r_{1}, r_{2} \to 0

, pdf (10) tends to the cosine distribution, which for large values of

κ_{1}

and

κ_{2}

is concentrated near 0. Suppose

μ_{1} = μ_{2} = 0

(without loss of generality). Then, according to Theorem 1 in [14] and using Taylor expansions, approximately

(Θ_{1}, Θ_{2}) \sim N_{2} (0, Σ)

, where

Σ^{- 1} = (\begin{matrix} κ_{1} - κ_{3} & κ_{3} \\ κ_{3} & κ_{2} - κ_{3} \end{matrix})

, with

κ_{3} \leq \frac{κ_{1} κ_{2}}{κ_{1} + κ_{2}}

.
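The bound on κ_3 can be read off from the determinant of the inverse covariance matrix. As a short check (a sketch that assumes κ_1 + κ_2 > 0 and asks only for a non-negative determinant):

\det Σ^{- 1} = (κ_{1} - κ_{3}) (κ_{2} - κ_{3}) - κ_{3}^{2} = κ_{1} κ_{2} - κ_{3} (κ_{1} + κ_{2}) \geq 0 \iff κ_{3} \leq \frac{κ_{1} κ_{2}}{κ_{1} + κ_{2}}

A strictly positive determinant, and hence a non-degenerate bivariate normal limit, requires the strict inequality.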

Appendix A.2. Proof of Corollary 1

Without loss of generality, we consider

μ_{1} = 0

. According to (12), we conclude that:

\begin{matrix} \frac{f_{Θ_{1}}^{'} (θ_{1})}{f_{Θ_{1}} (θ_{1})} & = \{\frac{- 2 r_{1}}{(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})} + \frac{κ_{2} κ_{3} {(1 - r_{1}^{2})}^{2} A (h (θ_{1}))}{h (θ_{1}) {(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})}^{2}} \\ - \frac{κ_{1} {(1 - r_{1}^{2})}^{2}}{{(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})}^{2}}\} sin θ_{1} \\ = g (θ_{1}) sin θ_{1} . \end{matrix}

(A1)

In (A1), if

κ_{3} < 0

, then

g (θ_{1}) < 0

. Therefore, for

θ_{1} \in [0, π)

,

f_{Θ_{1}}^{'} (θ_{1}) < 0

, and for

θ_{1} \in [- π, 0)

,

f_{Θ_{1}}^{'} (θ_{1}) \geq 0

. Thus,

f_{Θ_{1}} (θ_{1})

is increasing in

[- π, 0)

and decreases from 0 to

π

. In addition,

f_{Θ_{1}} (θ_{1}) = f_{Θ_{1}} (- θ_{1})

, which means that

f_{Θ_{1}} (θ_{1})

is symmetric around 0; thus, for

κ_{3} < 0

,

f_{Θ_{1}} (θ_{1})

is unimodal. If

κ_{3} > 0

,

h (θ_{1})

decreases from

- π

to 0 and increases from 0 to

π

, and

h (0) = ∣ κ_{2} - κ_{3} ∣

and

h (- π) = κ_{2} + κ_{3}

. From Lemma 1 in [13],

A (t) / t

is a decreasing function of t; therefore,

A (h (θ_{1})) / h (θ_{1})

is increasing in

[- π, 0)

and decreases from 0 to

π

. It can be concluded that

g (θ_{1})

is decreasing in

[0, π)

and increasing in

[- π, 0)

; hence, if

- 2 r_{1} + κ_{2} κ_{3} \frac{A (∣ κ_{2} - κ_{3} ∣)}{∣ κ_{2} - κ_{3} ∣} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 - r_{1})}^{2}} - κ_{1} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 - r_{1})}^{2}} < 0

, then

f_{Θ_{1}}^{'} (θ_{1}) \geq 0

for

θ_{1} \in [- π, 0)

and

f_{Θ_{1}}^{'} (θ_{1}) < 0

for

θ_{1} \in [0, π)

; which means that

f_{Θ_{1}} (θ_{1})

is unimodal. If

- 2 r_{1} + κ_{2} κ_{3} \frac{A (∣ κ_{2} - κ_{3} ∣)}{∣ κ_{2} - κ_{3} ∣} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 - r_{1})}^{2}} - κ_{1} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 - r_{1})}^{2}} > 0

and

- 2 r_{1} + κ_{2} κ_{3} \frac{A (κ_{2} + κ_{3})}{κ_{2} + κ_{3}} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 + r_{1})}^{2}} - κ_{1} \frac{{(1 - r_{1}^{2})}^{2}}{{(1 + r_{1})}^{2}} \leq 0

, then

f_{Θ_{1}} (θ_{1})

is first increasing and then decreasing in

[- π, 0)

, which, by the symmetry around 0, means that

f_{Θ_{1}} (θ_{1})

is bimodal. A more detailed proof is provided by the authors upon request.
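The monotonicity of A(t)/t used in this proof can be checked numerically. The Python sketch below assumes that A(t) denotes the ratio I_1(t)/I_0(t) of modified Bessel functions of the first kind, as is usual in this literature (the definition is given earlier in the paper and in [13]); the Bessel functions are evaluated from their power series so that no external library is needed, and the grid of t values is an arbitrary choice.

```python
import math

def bessel_i(nu, t, terms=50):
    """Modified Bessel function I_nu(t) for integer nu >= 0 via its power series."""
    return sum((t / 2.0) ** (2 * m + nu) / (math.factorial(m) * math.factorial(m + nu))
               for m in range(terms))

def A(t):
    """Assumed definition: A(t) = I_1(t) / I_0(t)."""
    return bessel_i(1, t) / bessel_i(0, t)

# Evaluate A(t) / t on a grid t = 0.1, 0.2, ..., 10.0
grid = [0.1 * i for i in range(1, 101)]
ratios = [A(t) / t for t in grid]
is_decreasing = all(x > y for x, y in zip(ratios, ratios[1:]))
```

Such a check does not replace the lemma, but it helps when evaluating the unimodality and bimodality conditions above for specific parameter values.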

Appendix A.3. Proof of Corollary 2

Suppose

μ_{1} = 0

(without loss of generality). According to (18), the following result can be obtained:

\begin{matrix} \frac{f_{Θ_{1}}^{'} (θ_{1})}{f_{Θ_{1}} (θ_{1})} & = \{\frac{- 2 r_{1}}{(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})} - \frac{κ_{1} {(1 - r_{1}^{2})}^{2}}{{(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})}^{2}} + \frac{κ_{3}^{2} {(1 - r_{1}^{2})}^{2} A (h (θ_{1}))}{h (θ_{1}) {(1 + r_{1}^{2} - 2 r_{1} cos θ_{1})}^{4}} \\ \times (({(1 + r_{1}^{2})}^{2} + 4 r_{1}^{2}) cos θ_{1} - 4 r_{1} (1 + r_{1}^{2}) {cos}^{2} θ_{1} - 2 r_{1} (1 + r_{1}^{2}) {sin}^{2} θ_{1})\} sin θ_{1} \\ = g (θ_{1}) sin θ_{1} . \end{matrix}

(A2)

In (A2), if

cos θ_{1} \leq 0

, then

g (θ_{1}) < 0

and the sign of (A2) depends on the sign of

sin θ_{1}

. Hence, for

θ_{1} \in (- π, - π / 2]

,

f_{Θ_{1}}^{'} (θ_{1}) > 0

and for

θ_{1} \in [π / 2, π]

,

f_{Θ_{1}}^{'} (θ_{1}) \leq 0

. Thus,

f_{Θ_{1}} (θ_{1})

is increasing in

(- π, - π / 2]

and decreasing from

π / 2

to

π

. In addition,

f_{Θ_{1}} (θ_{1}) = f_{Θ_{1}} (- θ_{1})

, which means that

f_{Θ_{1}} (θ_{1})

is symmetric around 0; therefore,

f_{Θ_{1}} (θ_{1})

is unimodal. For

θ_{1} \in [0, π / 2]

,

h (θ_{1})

is an increasing function of

θ_{1}

, and according to Lemma 1 in [13],

A (h (θ_{1})) / h (θ_{1})

is a decreasing function of

θ_{1}

. We can conclude that if

- 2 r_{1} {(1 - r_{1})}^{2} - κ_{1} {(1 - r_{1}^{2})}^{2} + κ_{3}^{2} {(1 - r_{1}^{2})}^{2} \frac{A (κ_{2})}{κ_{2}} < 0

, then

f_{Θ_{1}} (θ_{1})

is a decreasing function from 0 to

π / 2

, and because

f_{Θ_{1}} (θ_{1})

is symmetric around 0, it increases from

- π / 2

to 0. If

- 2 r_{1} {(1 - r_{1})}^{2} - κ_{1} {(1 - r_{1}^{2})}^{2} + κ_{3}^{2} {(1 - r_{1}^{2})}^{2} \frac{A (κ_{2})}{κ_{2}} > 0

, then

f_{Θ_{1}} (θ_{1})

first increases and then decreases in

[0, π / 2]

and

[- π / 2, 0]

(because it is symmetric around 0), which states that

f_{Θ_{1}} (θ_{1})

is bimodal.

References

1. Ramachandran, G.T.; Sasisekharan, V. Conformation of polypeptides and proteins. Adv. Protein Chem. 1968, 23, 283–437.
2. Holley, L.H.; Karplus, M. Protein secondary structure prediction with a neural network. Proc. Natl. Acad. Sci. USA 1989, 86, 152–156.
3. Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H.; Teller, E. Equation of state calculations by fast computing machines. J. Chem. Phys. 1953, 21, 1087–1092.
4. Hastings, W.K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970, 57, 97–109.
5. Irbäck, A.; Mohanty, S. PROFASI: A Monte Carlo simulation package for protein folding and aggregation. J. Comput. Chem. 2006, 27, 1548–1555.
6. Jones, D.T. Successful ab initio prediction of the tertiary structure of NK-lysin using multiple sequences and recognized supersecondary structural motifs. Proteins Struct. Funct. Bioinform. 1997, 29, 185–191.
7. Jones, T.A.; Thirup, S. Using known substructures in protein model building and crystallography. Embo J. 1986, 5, 819–822.
8. Simons, K.T.; Kooperberg, C.; Huang, E.; Baker, D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J. Mol. Biol. 1997, 268, 209–225.
9. Ley, C.; Verdebout, T. Applied Directional Statistics: Modern Methods and Case Studies; CRC Press: Boca Raton, FL, USA, 2018.
10. Ley, C.; Verdebout, T. Modern Directional Statistics; CRC Press: Boca Raton, FL, USA, 2017.
11. Mardia, K.V. Statistics of directional data. J. R. Stat. Soc. Ser. B (Methodol.) 1975, 37, 349–371.
12. Rivest, L.P. A distribution for dependent unit vectors. Commun. Stat.-Theory Methods 1988, 17, 461–483.
13. Singh, H.; Hnizdo, V.; Demchuk, E. Probabilistic model for two dependent circular variables. Biometrika 2002, 89, 719–723.
14. Mardia, K.V.; Taylor, C.C.; Subramaniam, G.K. Protein bioinformatics and mixtures of bivariate von-Mises distributions for angular data. Biometrics 2007, 63, 505–512.
15. Kent, J.T.; Mardia, K.V.; Taylor, C.C. Modelling strategies for bivariate circular data. In Proceedings of the Leeds Annual Statistical Research Conference; The Art and Science of Statistical Bioinformatics, Leeds University Press: Leeds, UK, 2008; pp. 70–73.
16. Mardia, K.V.; Frellsen, J. Statistics of bivariate von Mises distributions. In Bayesian Methods in Structural Bioinformatics; Springer: Berlin/Heidelberg, Germany, 2012; pp. 159–178.
17. Mardia, K.V.; Hughes, G.; Taylor, C.C.; Singh, H. A multivariate von Mises distribution with applications to bioinformatics. Can. J. Stat. 2008, 36, 99–109.
18. Wehrly, T.E.; Johnson, R.A. Bivariate models for dependence of angular observations and a related Markov process. Biometrika 1980, 67, 255–256.
19. Jones, M.C.; Pewsey, A.; Kato, S. On a class of circulas: Copulas for circular distributions. Ann. Inst. Stat. Math. 2015, 67, 843–862.
20. Fernández-Durán, J.J. Models for circular–linear and circular–circular data constructed from circular distributions based on nonnegative trigonometric sums. Biometrics 2007, 63, 579–585.
21. García-Portugués, E.; Crujeiras, R.M.; González-Manteiga, W. Exploring wind direction and SO2 concentration by circular–linear density estimation. Stoch. Environ. Res. Risk Assess. 2013, 27, 1055–1067.
22. Pewsey, A.; García-Portugués, E. Recent advances in directional statistics. TEST 2021, 30, 1–58.
23. Di Marzio, M.; Panzera, A.; Taylor, C.C. Kernel density estimation on the torus. J. Stat. Plan. Inference 2011, 141, 2156–2173.
24. Ameijeiras-Alonso, J.; Ley, C. Sine-skewed toroidal distributions and their application in protein bioinformatics. Biostatistics 2020. Available online: https://doi.org/10.1093/biostatistics/kxaa039 (accessed on 20 January 2021).
25. Kato, S.; Shimizu, K.; Shieh, G.S. A circular–circular regression model. Stat. Sin. 2008, 18, 633–645.
26. Shieh, G.S.; Johnson, R.A. Inferences based on a bivariate distribution with von-Mises marginals. Ann. Inst. Stat. Math. 2005, 57, 789–802.
27. Shieh, G.S.; Zheng, S.; Johnson, R.A.; Chang, Y.F.; Shimizu, K.; Wang, C.C.; Tang, S.L. Modeling and comparing the organization of circular genomes. Bioinformatics 2011, 27, 912–918.
28. Liu, D.; Peddada, S.D.; Li, L.; Weinberg, C.R. Phase analysis of circadian-related genes in two tissues. BMC Bioinform. 2006, 7, 87.
29. Downs, T.D.; Mardia, K.V. Circular regression. Biometrika 2002, 89, 683–697.
30. Jones, M.C. The Möbius distribution on the disc. Ann. Inst. Stat. Math. 2004, 56, 733–742.
31. Kato, S.; Jones, M.C. A family of distributions on the circle with links to, and applications arising from, Möbius transformation. J. Am. Stat. Assoc. 2010, 105, 249–262.
32. Wang, M.Z.; Shimizu, K. On applying Möbius transformation to cardioid random variables. Stat. Methodol. 2012, 9, 604–614.
33. Kato, S.; Pewsey, A. A Möbius transformation-induced distribution on the torus. Biometrika 2015, 102, 359–370.
34. Kato, S. A distribution for a pair of unit vectors generated by Brownian motion. Bernoulli 2009, 15, 898–921.
35. Kato, S.; McCullagh, P. Some properties of a Cauchy family on the sphere derived from the Möbius transformation. Bernoulli 2020, 26, 3224–3248.
36. McCullagh, P. Möbius transformation and Cauchy parameter estimation. Ann. Stat. 1996, 24, 787–808.
37. Abe, T.; Pewsey, A. Sine-skewed circular distributions. Stat. Pap. 2011, 52, 683–707.
38. Mullen, K.; Ardia, D.; Gil, D.L.; Windover, D.; Cline, J. DEoptim: An R package for global optimization by differential evolution. J. Stat. Softw. 2011, 40, 1–26.
39. Storn, R.; Price, K. Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 1997, 11, 341–359.
40. Price, K.; Storn, R.M.; Lampinen, J.A. Differential Evolution: A Practical Approach to Global Optimization; Springer: Berlin/Heidelberg, Germany, 2006.
41. Najibi, S.M.; Maadooliat, M.; Zhou, L.; Huang, J.Z.; Gao, X. Protein structure classification and loop modeling using multiple Ramachandran distributions. Comput. Struct. Biotechnol. J. 2017, 15, 243–254.
42. Moghimbeygi, M.; Golalizadeh, M. Spherical logistic distribution. Commun. Math. Stat. 2020, 8, 151–166.
43. Martin, A.D.; Quinn, K.M.; Park, J.H.; Park, M.J.H. MCMCpack: Markov Chain Monte Carlo (MCMC) Package; Version 1.5-0; R Package: Vienna, Austria, 2020; Available online: https://cran.r-project.org/web/packages/MCMCpack/index.html (accessed on 25 August 2020).
44. Li, L. gibbs.met: Naive Gibbs Sampling with Metropolis Steps; Version 1.1-3; R Package: Vienna, Austria, 2015; Available online: https://cran.r-project.org/web/packages/gibbs.met/index.html (accessed on 25 August 2020).
45. Albert, J. LearnBayes: Functions for Learning Bayesian Inference; Version 2.15.1; R Package: Vienna, Austria, 2018; Available online: https://cran.r-project.org/web/packages/LearnBayes/index.html (accessed on 25 August 2020).
46. Chivers, C.; Chivers, M.C. MHadaptive: General Markov Chain Monte Carlo for Bayesian Inference Using Adaptive Metropolis-Hastings Sampling; Version 1.1-8; R Package: Vienna, Austria, 2015; Available online: https://cran.r-project.org/web/packages/MHadaptive/index.html (accessed on 25 August 2020).
47. Fernández-i-Marín, X. ggmcmc: Analysis of MCMC samples and Bayesian inference. J. Stat. Softw. 2016, 70, 1–20.
48. Mersmann, O. Microbenchmark: Accurate Timing Functions; Version 1.4-7; R Package: Vienna, Austria, 2019; Available online: https://www.rdocumentation.org/packages/microbenchmark/versions/1.4-7/topics/microbenchmark (accessed on 25 August 2020).

Figure 1. Two representations of protein backbone structures based on torsion or pseudo-torsion angles.

Figure 2. Pdf and contour plots of the transformed cosine model (10) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

r_{2}

.

Figure 3. Plots of the marginal pdf of

Θ_{1}

in (12) (left) and in (18) (right) for

μ_{1} = 0

and different parameter values.

Figure 4. Pdf and contour plots of the transformed sine model (16) for

μ_{1} = μ_{2} = 0

and different values of

κ_{1}, κ_{2}, κ_{3}, r_{1}

, and

r_{2}

.

Figure 5. Pdf and contour plots of the sine-skewed transformed cosine model in (22) (top) and the sine-skewed transformed sine model in (24) (bottom) for different values of

λ_{1}

and

λ_{2}

.

Figure 6. Plots of the marginal pdf of

Θ_{1}

for

B S S T C

(left) and

B S S T S

(right) for

μ_{1} = μ_{2} = 0

and different parameter values.

Figure 6. Plots of the marginal pdf of

Θ_{1}

for

B S S T C

(left) and

B S S T S

(right) for

μ_{1} = μ_{2} = 0

and different parameter values.

Figure 7. Plots of the sine-skewed versions of marginal pdfs of

Θ_{1}

in (12) (left) and (18) (right) for

k = 1

,

μ_{1} = 0

and different parameter values.

Figure 7. Plots of the sine-skewed versions of marginal pdfs of

Θ_{1}

in (12) (left) and (18) (right) for

k = 1

,

μ_{1} = 0

and different parameter values.

Figure 8. Ramachandran plots for each dataset.

Figure 9. Contour plots of fitted pdfs together with scatter plot for SCOP.1 (

n =

10,188). The last row includes the proposed models.

Figure 10. Contour plots of fitted pdfs together with scatter plot for SCOP.3 ($n = 4607$). The last row includes the proposed models.

Figure 11. Contour plots of fitted pdfs together with scatter plots for TCBIG.VAL.right ($n = 2673$). The last row includes the proposed models.

Figure 12. Kernel density plots of the data, and the best-fit models.

Figure 13. Scatter, trace, and compare-partial plots of the simulated data from the transformed sine model using “gibbs_met” in the “gibbs.met” package (first row), “MCMCmetrop1R” in the “MCMCpack” package (second row), “met_gaussian” in the “gibbs.met” package (third row), “Metro_Hastings” in the “MHadaptive” package (fourth row), and “rwmetrop” in the “LearnBayes” package (fifth row).

Figure 14. Execution times for generating a sample of size $n = 100$ from a transformed sine model (left) and a transformed cosine model (right) for each method.

Figure 15. Scatter, trace, and compare-partial plots of the simulated data from the transformed cosine model using “gibbs_met” in the “gibbs.met” package (first row), “MCMCmetrop1R” in the “MCMCpack” package (second row), “met_gaussian” in the “gibbs.met” package (third row), “Metro_Hastings” in the “MHadaptive” package (fourth row), and “rwmetrop” in the “LearnBayes” package (fifth row).
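
The samplers compared in Figures 13 and 15 are Metropolis-type MCMC routines from R packages. As a minimal illustration of the idea only, the Python sketch below runs a random-walk Metropolis chain on the torus for the plain (untransformed) bivariate von Mises sine model, whose density in its usual parameterisation is proportional to $\exp\{κ_{1}\cos(θ_{1} - μ_{1}) + κ_{2}\cos(θ_{2} - μ_{2}) + κ_{3}\sin(θ_{1} - μ_{1})\sin(θ_{2} - μ_{2})\}$. It is a stand-in, not the transformed model in (16) and not a port of any of the R functions named in the captions; the function names, parameter values, step size, and burn-in length are all hypothetical.

```python
import math
import random


def sine_model_log_density(theta1, theta2, kappa1, kappa2, kappa3, mu1, mu2):
    """Unnormalised log-density of the bivariate von Mises sine model."""
    return (kappa1 * math.cos(theta1 - mu1)
            + kappa2 * math.cos(theta2 - mu2)
            + kappa3 * math.sin(theta1 - mu1) * math.sin(theta2 - mu2))


def wrap(angle):
    """Wrap an angle to the interval [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi


def random_walk_metropolis(log_density, n_samples, step=1.0, burn_in=1000, seed=0):
    """Random-walk Metropolis on the torus with a wrapped Gaussian proposal.

    The proposal is symmetric, so the acceptance ratio reduces to the
    ratio of target densities. Returns the samples and the acceptance rate.
    """
    rng = random.Random(seed)
    current = (0.0, 0.0)
    current_lp = log_density(*current)
    samples = []
    accepted = 0
    for i in range(burn_in + n_samples):
        proposal = (wrap(current[0] + rng.gauss(0.0, step)),
                    wrap(current[1] + rng.gauss(0.0, step)))
        proposal_lp = log_density(*proposal)
        # Accept with probability min(1, target(proposal) / target(current)).
        if rng.random() < math.exp(min(0.0, proposal_lp - current_lp)):
            current, current_lp = proposal, proposal_lp
            if i >= burn_in:
                accepted += 1
        if i >= burn_in:
            samples.append(current)
    return samples, accepted / n_samples


# Hypothetical parameter values chosen only for illustration.
def target(theta1, theta2):
    return sine_model_log_density(theta1, theta2, 2.0, 2.0, 1.0, 0.0, 0.0)


samples, acceptance_rate = random_walk_metropolis(target, n_samples=5000, seed=1)
mean_cos_theta1 = sum(math.cos(t1) for t1, _ in samples) / len(samples)
print(f"acceptance rate = {acceptance_rate:.2f}, mean cos(theta1) = {mean_cos_theta1:.2f}")
```

Trace plots of `samples` and the acceptance rate play the same diagnostic role here as the trace and compare-partial panels in the figures; the step size would need tuning for a more concentrated or multimodal target.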

Table 1. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for SCOP.1 ($n$ = 10,188).

Model	$\hat{ρ}$	${\hat{κ}}_{1}$	${\hat{κ}}_{2}$	${\hat{κ}}_{3}$	${\hat{r}}_{1}$	${\hat{r}}_{2}$	${\hat{μ}}_{1}$	${\hat{μ}}_{2}$	${\hat{λ}}_{1}$	${\hat{λ}}_{2}$	$\hat{p}$	Log-Likelihood	AIC	BIC
Sine [13]	–	25.2085	0.3679	7.3700	–	–	1.8976	2.4624	–	–	–	−15,890.80	31,790.16	31,827.74
Sine-skewed sine [24]	–	18.8058	0.0852	4.8449	–	–	1.8701	−3.1415	0.4051	−0.3718	–	−18,089.71	36,193.42	36,244.02
Mixture of sine	–	4.9938	0.4603	2.3512	–	–	2.0560	2.5011	–	–	0.3476	−15,719.26	31,460.22	31,540.04
	–	0.0217	0.0413	−4.4594	–	–	1.0912	−1.8997	–	–	0.6524
Cosine [14]	–	11.6274	$6.7 \times 10^{-17}$	0.6507	–	–	1.8807	−0.8652	–	–	–	−19,919.04	39,848.07	39,884.22
Sine-skewed cosine [24]	–	11.6274	$1.7 \times 10^{-8}$	0.6507	–	–	1.8807	−0.8651	−0.7557	0.0789	–	−19,919.04	39,852.07	39,902.68
Mixture of cosine [14]	–	9.6015	2.6459	0.0087	–	–	1.7967	0.8676	–	–	0.5266	−18,120.09	36,262.18	36,341.70
	–	8.4761	0.0820	2.3228	–	–	2.1309	0.9647	–	–	0.4734
Mixture of bivariate wrapped Cauchy [33]	−0.2892	–	–	–	0.9551	0.5649	1.6129	1.5337	–	–	0.4463	−17,099.36	34,220.72	34,300.24
	−0.1289	–	–	–	0.8513	0.5433	2.1128	−2.6980	–	–	0.5537
Transformed sine	–	2.1585	0.3489	3.1712	0.6036	0.0131	1.8573	2.4321	–	–	–	−15,558.98	31,131.97	31,182.56
Sine-skewed transformed sine	–	2.1582	0.3487	3.1712	0.6037	0.0131	1.8573	2.4321	−0.1894	0.0556	–	−15,558.98	31,135.97	31,201.02
Transformed cosine	–	4.5122	$1.9 \times 10^{-16}$	2.7905	0.2632	0.4164	1.8806	−0.6888	–	–	–	−16,920.43	33,854.86	33,905.46
Sine-skewed transformed cosine	–	4.4704	$4.2 \times 10^{-5}$	2.8185	0.2656	0.4228	1.8805	−0.6871	0.6225	−0.1849	–	−16,920.43	33,858.86	33,923.92
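
The AIC and BIC columns can be recovered from the log-likelihood, the sample size, and the number of free parameters. As a rough check, the sketch below applies the standard definitions $\mathrm{AIC} = 2k - 2\ell$ and $\mathrm{BIC} = k \ln n - 2\ell$ to the Transformed sine row of Table 1, assuming $k = 7$ free parameters ($κ_{1}, κ_{2}, κ_{3}, r_{1}, r_{2}, μ_{1}, μ_{2}$). The function name `information_criteria` and the parameter count are assumptions made for illustration and are not taken from the paper.

```python
import math


def information_criteria(log_likelihood, n_params, n_obs):
    """Return (AIC, BIC) computed from the standard definitions."""
    aic = 2.0 * n_params - 2.0 * log_likelihood
    bic = n_params * math.log(n_obs) - 2.0 * log_likelihood
    return aic, bic


# Transformed sine row of Table 1 (SCOP.1); k = 7 is an assumed parameter count.
aic, bic = information_criteria(log_likelihood=-15558.98, n_params=7, n_obs=10188)
print(f"AIC = {aic:.2f}, BIC = {bic:.2f}")
```

Under these assumptions the computed values agree with the tabulated 31,131.97 and 31,182.56 to within the rounding of the reported log-likelihood. Lower values of either criterion indicate a better trade-off between fit and model complexity, which is how the rows of the table are compared.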

Table 2. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for SCOP.3 ($n = 4607$).

Model	$\hat{ρ}$	${\hat{κ}}_{1}$	${\hat{κ}}_{2}$	${\hat{κ}}_{3}$	${\hat{r}}_{1}$	${\hat{r}}_{2}$	${\hat{μ}}_{1}$	${\hat{μ}}_{2}$	${\hat{λ}}_{1}$	${\hat{λ}}_{2}$	$\hat{p}$	Log-Likelihood	AIC	BIC
Sine [13]	–	27.0312	0.3243	8.0789	–	–	1.8810	2.4618	–	–	–	−6970.09	13,950.18	13,982.36
Sine-skewed sine [24]	–	26.8304	0.3224	8.0732	–	–	1.8960	2.4724	−0.4124	−0.0159	–	−6941.31	13,896.62	13,941.67
Mixture of sine	–	7.3842	2.0013	−6.3567	–	–	2.0918	−1.4321	–	–	0.6632	−6893.41	13,901.15	13,879.61
	–	2.8774	0.0347	−1.7125	–	–	1.9306	−1.1124	–	–	0.3368
Cosine [14]	–	11.5883	$3.8 \times 10^{-16}$	0.6404	–	–	1.8537	−0.9851	–	–	–	−9028.72	18,067.45	18,099.62
Sine-skewed cosine [24]	–	11.5883	$5.0 \times 10^{-9}$	0.6404	–	–	1.8537	−0.9850	−0.0168	−0.6387	–	−9028.72	18,071.45	18,116.49
Mixture of cosine [14]	–	29.9375	1.9210	0.0213	–	–	1.6840	0.8043	–	–	0.5648	−6959.76	13,941.52	14,012.31
	–	17.3302	0.0211	1.9456	–	–	2.0575	0.8866	–	–	0.4352
Mixture of bivariate wrapped Cauchy [33]	−0.2347	–	–	–	0.9169	0.5546	1.5969	1.1037	–	–	0.4712	−7137.52	14,297.04	14,367.83
	−0.1279	–	–	–	0.8388	0.5100	1.9792	−2.0869	–	–	0.5288
Transformed sine	–	3.8755	0.3414	3.6786	0.4950	$1.3 \times 10^{-9}$	1.8589	2.4490	–	–	–	−6905.08	13,824.17	13,869.22
Sine-skewed transformed sine	–	3.8764	0.3415	3.7066	0.4883	$2.6 \times 10^{-8}$	1.8591	2.4491	−0.1544	0.0796	–	−6905.08	13,828.17	13,886.08
Transformed cosine	–	4.1351	$2.4 \times 10^{-16}$	2.8283	0.2884	0.4183	1.8604	−0.6560	–	–	–	−7567.28	15,148.56	15,193.61
Sine-skewed transformed cosine	–	4.1350	$6.4 \times 10^{-10}$	2.8283	0.2884	0.4183	1.8604	−0.6560	0.6868	−0.1567	–	−7567.27	15,152.56	15,210.46

Table 3. Maximum likelihood estimates and corresponding log-likelihood, AIC, and BIC for TCBIG.VAL.right ($n = 2673$).

Model	$\hat{ρ}$	${\hat{κ}}_{1}$	${\hat{κ}}_{2}$	${\hat{κ}}_{3}$	${\hat{r}}_{1}$	${\hat{r}}_{2}$	${\hat{μ}}_{1}$	${\hat{μ}}_{2}$	$\hat{p}$	Log-Likelihood	AIC	BIC
Mixture of sine	–	4.4364	7.6222	−1.2187	–	–	−1.7736	2.3336	0.6239	−4901.12	9824.25	9889.04
	–	5.4606	7.7290	−3.1154	–	–	−1.4197	−0.4111	0.3761
Mixture of cosine [14]	–	4.2849	6.1824	$4.6 \times 10^{-6}$	–	–	−1.4003	−0.4191	0.3787	−5005.01	10,032.03	10,096.82
	–	4.6268	7.8256	$9.1 \times 10^{-6}$	–	–	−1.7785	2.3358	0.6213
Mixture of bivariate wrapped Cauchy [33]	−0.3805	–	–	–	0.8545	0.8118	−1.1194	−0.4710	0.3108	−5283.42	10,588.85	10,653.64
	−0.0294	–	–	–	0.7038	0.7778	−1.8772	2.3037	0.6892
Mixture of transformed sine	–	2.8274	7.3718	−3.0328	0.2930	0.0872	−1.3860	−0.4040	0.3515	−4826.80	9683.61	9771.96
	–	4.0949	1.6545	−0.7133	0.02450	0.4387	−1.7802	2.3495	0.6485
Mixture of transformed cosine	–	2.3349	9.3385	0.0063	0.4909	0.1287	−1.1835	−0.5358	0.2645	−4882.14	9794.28	9882.64
	–	3.9496	0.0117	0.8886	0.0553	0.8668	−1.8501	2.3841	0.7355

Table 4. Maximum likelihood estimates of parameters and bias, and the MSE of the estimates for the simulated data obtained from each method.

Method	Distribution	n	Statistic	$κ_{1}$	$κ_{2}$	$κ_{3}$	$r_{1}$	$r_{2}$	$μ_{1}$	$μ_{2}$
MCMCmetrop1R	Transformed sine	100	MLE	1.8450	0.2876	2.8360	0.6001	0.0063	1.7744	2.6177
MCMCmetrop1R	Transformed sine	100	Bias	−0.0542	−0.1932	−0.1531	−0.0089	−0.0521	−0.0815	0.1185
MCMCmetrop1R	Transformed sine	100	MSE	0.0879	0.0921	0.1864	0.0083	0.0044	0.0087	0.0344
MCMCmetrop1R	Transformed sine	1000	MLE	2.1311	0.2649	3.1546	0.6243	0.0395	1.8491	2.4263
MCMCmetrop1R	Transformed sine	1000	Bias	−0.0298	−0.0900	−0.0135	0.0142	0.0130	−0.0018	−0.0046
MCMCmetrop1R	Transformed sine	1000	MSE	0.0527	0.0588	0.0002	0.0018	0.0008	0.0005	$2.5 \times 10^{-5}$
MCMCmetrop1R	Transformed cosine	100	MLE	3.7753	0.5985	1.5693	0.2564	0.4660	−1.4684	1.0937
MCMCmetrop1R	Transformed cosine	100	Bias	−0.1137	−0.0572	−0.1022	0.0143	−0.0376	0.0957	0.0421
MCMCmetrop1R	Transformed cosine	100	MSE	0.0998	0.0932	0.1007	0.0028	0.0116	0.0135	0.0275
MCMCmetrop1R	Transformed cosine	1000	MLE	4.0693	0.6305	1.8210	0.2546	0.5557	−1.5613	1.0116
MCMCmetrop1R	Transformed cosine	1000	Bias	0.0758	−0.0319	0.0305	0.0241	0.0329	0.0037	0.0238
MCMCmetrop1R	Transformed cosine	1000	MSE	0.0065	0.0426	0.0085	0.0007	0.0014	0.0007	0.0011
rwmetrop	Transformed sine	100	MLE	1.8186	0.2260	3.2619	0.6444	0.0359	1.8815	2.6027
rwmetrop	Transformed sine	100	Bias	−0.2337	−0.0867	0.3837	0.0398	0.0205	0.0370	0.1006
rwmetrop	Transformed sine	100	MSE	0.1094	0.0350	0.0726	0.0016	0.0406	0.0022	0.0291
rwmetrop	Transformed sine	1000	MLE	1.9735	0.3887	3.1546	0.6204	0.0159	1.8701	2.4270
rwmetrop	Transformed sine	1000	Bias	0.0979	0.0360	−0.0165	0.0131	0.0046	0.0129	−0.0046
rwmetrop	Transformed sine	1000	MSE	0.0340	0.0482	0.0003	0.0004	0.0001	0.0005	$2.1 \times 10^{-5}$
rwmetrop	Transformed cosine	100	MLE	3.8086	0.5397	1.4956	0.1726	0.5929	−1.5746	1.0417
rwmetrop	Transformed cosine	100	Bias	−0.0984	−0.0940	−0.2954	−0.0512	0.0883	−0.0115	0.0524
rwmetrop	Transformed cosine	100	MSE	0.0902	0.0893	0.1003	0.0033	0.0088	0.0009	0.0037
rwmetrop	Transformed cosine	1000	MLE	3.9139	0.7320	1.8188	0.2546	0.4667	−1.5766	0.9962
rwmetrop	Transformed cosine	1000	Bias	−0.0758	0.0703	0.0166	0.0241	−0.0386	−0.0095	0.0084
rwmetrop	Transformed cosine	1000	MSE	0.0056	0.0674	0.0007	0.0013	0.0014	0.0007	0.0018
met_gaussian	Transformed sine	100	MLE	2.5400	0.4480	3.1322	0.6963	0.1087	1.8321	2.3133
met_gaussian	Transformed sine	100	Bias	0.3469	0.0904	−0.0328	0.0837	0.0916	−0.0294	−0.0908
met_gaussian	Transformed sine	100	MSE	0.1773	0.0583	0.0117	0.0092	0.0091	0.0009	0.0141
met_gaussian	Transformed sine	1000	MLE	1.9022	0.2894	3.3403	0.6649	0.0016	1.8619	2.3822
met_gaussian	Transformed sine	1000	Bias	−0.2223	−0.0658	0.1137	0.0607	−0.0123	0.0042	−0.0481
met_gaussian	Transformed sine	1000	MSE	0.1268	0.0042	0.0286	0.0043	0.0025	$2.1 \times 10^{-5}$	0.0024
met_gaussian	Transformed cosine	100	MLE	3.4389	0.5409	1.4373	0.2530	0.5214	−1.5411	1.0526
met_gaussian	Transformed cosine	100	Bias	−0.3642	−0.1187	−0.2970	0.0143	0.0113	0.0210	0.0623
met_gaussian	Transformed cosine	100	MSE	0.1927	0.0993	0.1251	0.0853	0.0082	0.0005	0.0041
met_gaussian	Transformed cosine	1000	MLE	3.6491	0.5341	1.6260	0.2420	0.5134	−1.5720	1.0158
met_gaussian	Transformed cosine	1000	Bias	−0.2978	−0.1049	−0.0999	0.0195	0.0027	−0.0105	0.0126
met_gaussian	Transformed cosine	1000	MSE	0.1214	0.0915	0.0875	0.0019	0.0032	0.0005	0.0024
Metro_Hastings	Transformed sine	100	MLE	2.1465	0.2728	2.6713	0.6912	0.0010	1.8107	2.2024
Metro_Hastings	Transformed sine	100	Bias	−0.0197	−0.0784	−0.4598	0.0856	0.0770	−0.0603	−0.1029
Metro_Hastings	Transformed sine	100	MSE	0.0434	0.0883	0.2498	0.0073	0.0059	0.0046	0.0527
Metro_Hastings	Transformed sine	1000	MLE	2.1657	0.2743	3.2124	0.5813	0.0762	1.8487	2.4826
Metro_Hastings	Transformed sine	1000	Bias	0.0362	−0.0456	0.0433	−0.0262	0.0531	−0.0081	0.0404
Metro_Hastings	Transformed sine	1000	MSE	0.0246	0.0556	0.0027	0.0011	0.0039	$7.3 \times 10^{-5}$	0.0025
Metro_Hastings	Transformed cosine	100	MLE	3.8290	0.5903	2.0419	0.2857	0.5753	−1.5434	0.8373
Metro_Hastings	Transformed cosine	100	Bias	−0.1600	−0.0582	0.2407	0.0577	0.0709	0.0172	−0.1202
Metro_Hastings	Transformed cosine	100	MSE	0.1998	0.0944	0.1208	0.0061	0.0059	0.0006	0.0266
Metro_Hastings	Transformed cosine	1000	MLE	3.8936	0.6747	1.5822	0.2775	0.4859	−1.5676	0.9935
Metro_Hastings	Transformed cosine	1000	Bias	−0.0961	0.0298	−0.2317	0.0470	−0.0196	−0.0025	0.0057
Metro_Hastings	Transformed cosine	1000	MSE	0.0091	0.0847	0.0829	0.0030	0.0003	0.0002	0.0051
gibbs_met	Transformed sine	100	MLE	2.2340	0.2728	2.8277	0.6198	0.1398	1.8437	2.1688
gibbs_met	Transformed sine	100	Bias	0.0712	−0.0784	−0.3884	0.0143	0.1220	−0.0129	−0.2598
gibbs_met	Transformed sine	100	MSE	0.1171	0.0583	0.1179	0.0092	0.0160	0.0003	0.0692
gibbs_met	Transformed sine	1000	MLE	2.1901	0.3555	3.1760	0.6016	0.0156	1.8573	2.4342
gibbs_met	Transformed sine	1000	Bias	0.0315	0.0066	0.0048	−0.0020	0.0024	$6.1 \times 10^{-5}$	0.0021
gibbs_met	Transformed sine	1000	MSE	0.0963	0.0221	0.0464	0.0008	0.0003	$1.7 \times 10^{-5}$	0.0018
gibbs_met	Transformed cosine	100	MLE	3.6123	0.5732	1.5193	0.2006	0.5852	−1.5886	0.8808
gibbs_met	Transformed cosine	100	Bias	−0.2842	−0.0853	−0.2717	−0.0276	0.0869	−0.0238	−0.0758
gibbs_met	Transformed cosine	100	MSE	0.1419	0.0893	0.1067	0.0082	0.0087	0.0005	0.0187
gibbs_met	Transformed cosine	1000	MLE	3.6322	0.5978	1.7613	0.2626	0.5793	−1.5685	0.9668
gibbs_met	Transformed cosine	1000	Bias	−0.2568	−0.0653	−0.0232	0.0221	0.0648	−0.0034	−0.0209
gibbs_met	Transformed cosine	1000	MSE	0.0915	0.0726	0.0008	0.0031	0.0074	0.0002	0.0046

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
