A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model

Sun, Bo; Ye, Zeyi; Li, Mohan; Hong, Weiyi; Ruan, Weidong; Meng, Lingxin

doi:10.3390/buildings16102015

Open AccessArticle

A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model

by

Bo Sun

^*,

Zeyi Ye

,

Mohan Li

,

Weiyi Hong

,

Weidong Ruan

and

Lingxin Meng

Department of Civil Engineering, Zhejiang University of Technology, Hangzhou 310023, China

^*

Author to whom correspondence should be addressed.

Buildings 2026, 16(10), 2015; https://doi.org/10.3390/buildings16102015

Submission received: 17 April 2026 / Revised: 8 May 2026 / Accepted: 12 May 2026 / Published: 20 May 2026

(This article belongs to the Section Building Structures)

Download

Browse Figures

Versions Notes

Abstract

The structural safety and performance of long-span bridges in coastal areas are significantly influenced by the wind field. Traditional univariate or simplified multivariate probabilistic models often fail to capture the multimodal and nonlinear dependencies among wind parameters, leading to inaccuracies in wind-induced risk assessment. This study proposes a novel joint probability density function framework for wind speed, wind direction, and wind attack angle, integrating a Dirichlet process mixture model (DPMM) for marginal distributions and a Regular Vine (R-Vine) copula for dependency modeling. The DPMM adaptively identifies the number of mixture components without presetting, effectively capturing multimodal and periodic characteristics of the wind field. The R-Vine copula flexibly models complex nonlinear dependencies among the three variables. A case study using field data from the Beikou Bridge demonstrates the capability to reveal seasonal wind patterns of the proposed model. By explicitly parametrizing the underlying wind regimes, the DPMM-based framework enables physically interpretable wind field decomposition. This provides direct support for probabilistic wind field modeling, supporting enhanced wind-resistant design and operational management of coastal long-span bridges.

Keywords:

long-span bridge; dirichlet process mixture model; copula; joint probability density function; wind field

1. Introduction

The rapid expansion of coastal transportation infrastructure has led to a growing number of long-span bridges in recent years [1]. The structural integrity and flexible performance inherent in the long-span bridges are sensitive to wind-induced hazards and the statistical characteristics of the wind field [2,3]. As span lengths increase, nonlinear structural behavior becomes more outstanding, and the interaction between the wind field and the structure grows significantly more influential [4]. Therefore, a comprehensive probability density function (PDF) model of wind field forms the basis for a robust probabilistic assessment, enables accurate evaluation of structural safety, and directly supports the long-term management strategies.

Owing to the highly fluctuating and nonlinear nature of measured wind field data, the accurate characterization of the wind field in various terrains and climate regions necessitates the application of multiple statistical probability models for fitting analysis [5]. Ozay and Celiktas applied the maximum likelihood (ML) to estimate the two parameters of the Weibull distribution for modeling the wind speed distribution in the Alaçatı region [6]. Wang et al. utilized several classic parametric probability distributions, including the Lognormal, Gamma, and Loglogistic, to fit the wind field [7]. Ding et al. adopted three parametric distribution functions for modeling wind speed on the long-span bridge: the Gumbel distribution, Weibull distribution, and Rayleigh distribution [8]. Despite advantages such as computational efficiency and parameter transparency, the single parametric models are limited by their predefined forms, which often result in systematic errors and an inability to resolve the multimodal structures of the wind field. Hence, for complex wind fields or high-precision applications, these simple models should be superseded by more flexible mixture models or nonparametric models [9]. Kollu et al. conducted a comparative study of three mixture distributions (Weibull-GEV, Weibull-Lognormal, GEV-Lognormal) against single parametric models and other mixture models for characterizing the wind field [10]. Mazzeo et al. proposed the Mixture of Two Truncated Normal Distributions (MTTNDs) based on a linear combination of two normal distributions for modeling bimodality or asymmetry of the wind speed distribution [11]. Wang et al. assessed the applicability of the mixture Weibull distribution (2W2W) in modeling the probability distribution of wind fields in engineering and compared it with the traditional single-component Weibull distribution [12]. Khamees et al. pointed out that the two-component mixed distribution is not the best choice in wind field modeling, and then introduced the three-component mixture Weibull distribution and the meta-heuristic optimization method to construct a more optimal wind speed probability model [13]. Wang et al. extended the limited-components of mixture model for estimating wind field PDF to the Bayesian infinite Gaussian mixture model, but still needed to set an upper limit for the number of components [14]. Wu et al. developed a truncated Gaussian mixture model for wind speed probability distribution by using the continuous ranked probability score (CRPS) loss function and the False Discovery Rate (FDR) algorithm [15]. Furthermore, the nonparametric density estimation methods (such as the maximum entropy theory and kernel density estimation) have the ability to adaptively fit complex wind speed distributions without any prior assumptions about the parameter model; thus, the advantages have been extensively studied by researchers [16,17,18].

A comprehensive analysis of statistical properties of the wind field requires moving beyond a single-parameter approach. The joint probability density function (JPDF) is a key statistical characteristic model for characterizing the dependence between random variables, with broad applications in wind engineering [19,20]. Carta et al. employed an Angular-linear (AL) model to construct a joint probability distribution for wind speed and direction by combining a Normal–Weibull mixture for wind speed with a finite mixture of von Mises distributions for wind direction [21]. Han et al. adopted the classic Johnson-Wehrly (JW) model structure to estimate the JPDF of wind speed and wind direction for characterizing their dependence in wind field analyses [22]. While the JW model can overcome the symmetry restriction that limits the AL method in complex scenarios, its reliance on subjective judgment for model selection and its limited capacity for explaining complex dependencies present significant drawbacks [23].

In the wind-resistant design of large-span bridges, the wind attack angle is a critical parameter influencing structural behavior [24]. The Copula method offers both flexible marginal fitting and accurate simulation of the complex interdependencies among all wind field parameters. Chen et al. proposed two Copula-based strategies to estimate the JPDF of wind speed, wind direction and wind attack angle at typical mountain bridge sites: Strategy I employed the Vine Copula framework in combination with the modified binary Bernstein Copula for modeling; Strategy II directly utilized the modified ternary Bernstein Copula for estimation [25]. Ding et al. characterized the marginal distribution (univariate PDF) of each variable by the finite mixture (FM) model, and used the copula model to integrate wind speed, direction, and attack angle into a unified trivariate JPDF at the bridge site [26]. Zhang et al. (2022, 2024) developed trivariate joint probability models for wind speed, direction, and angle of attack using on-site measurement data from a deep-cut gorge bridge site, employing distinct Copula structures in each study [27,28]. The 2022 model utilized a D-Vine structure with a Frank Copula, while the 2024 model was based on a C-Vine structure with Gumbel, mixed von Mises, and Logistic marginal distributions [27,28].

In general, the above studies have revealed different advantages of frameworks for JPDF of wind field, such as “more suitable”, “better fit”, “better performance”, or “is proposed”. However, for large-span bridges located in coastal areas with distinct seasonal climates, there are relatively few joint probability modeling systems for wind speed, wind direction, and wind attack angle. Most of the research pays more attention to the JPDF of wind speed with other factors in the coastal areas of bridges [29,30,31], but the wind horizontal and vertical components should be further considered. Consequently, the joint probability modeling of wind fields for large-span bridges in coastal areas demands a framework capable of both elucidating the intrinsic physical structure of the wind and fulfilling the requirements for computational efficiency in engineering. The development of methods specifically towards real engineering applications has been a shared emphasis in other structural engineering fields [32,33].

In this study, a JPDF modeling framework based on Copula theory is proposed for statistical characteristics analysis of wind fields. For marginal distribution modeling, the Dirichlet process mixture model (DPMM) is adopted: a Gaussian mixture model is used for wind speed and wind attack angle to capture their continuous multimodal characteristics, while a von Mises mixture distribution is employed for wind direction to strictly preserve the periodic nature of circular variables. In terms of dependency structure modeling, a parametric Vine Copula method is applied to effectively capture complex nonlinear dependencies among multiple variables. The framework achieves a balance between data-driven physical mechanism representation and computational efficiency optimization, avoiding both the oversimplified assumptions of purely physical models and the high computational costs of fully nonparametric methods, while strictly respecting the physical nature of each variable, which includes the periodicity of wind direction and the non-negativity of wind speed. Compared with existing Copula-based models that predominantly adopt single parametric distributions or finite mixtures with prespecified component counts, the DPMM adopted here adaptively infers the number of wind regimes directly from the data. Each resulting component corresponds to a physically interpretable meteorological state, enabling data-driven regime discovery without prior assumptions on the underlying wind climate structure. The research outcome provides a comprehensive solution for wind field characteristic modeling in coastal bridge engineering, combining physical rigor with computational feasibility.

2. Dirichlet Process Mixture Model

2.1. The DPMM in the Stick-Breaking Construction

The Dirichlet process (DP) was introduced by Ferguson in 1973 [34], which is widely used in Bayesian nonparametric inference. As a prior on the space of probability measures, the DP addresses both uncertainty in the number of clusters and data-adaptive density estimation [35].

Let

X = (x_{1}, x_{2}, \dots, x_{n})

be a sample space,

G_{0}

a base distribution on

X

,

α > 0

a positive scaling parameter. A random measure

G

follows a DP, denoted:

G | \{α, G_{0}\} \sim DP (α, G_{0})

(1)

Thus, for any finite partition

(x_{1}, x_{2}, \dots, x_{n})

, the random vector

(G (x_{1}), G (x_{2}), \dots, G (x_{n}))

follows a Dirichlet distribution:

(G (x_{1}), G (x_{2}), \dots, G (x_{N})) \sim D i r i c h l e t (α G_{0} (x_{1}), α G_{0} (x_{2}), \dots, α G_{0} (x_{N}))

(2)

where

D i r i c h l e t (•)

is the Dirichlet distribution. Moreover, the DPMM under stick-breaking construction admits an infinite number of components with random weights [36], the random measure

G

is expressed as follows:

G = \sum_{k = 1}^{\infty} π_{k} δ_{θ_{k}}

(3)

where

δ_{θ_{k}}

is the indicator function located at the atom

θ_{k}

and

θ_{k}

is drawn independently from the base distribution

G_{0}

;

π_{k} = β_{k} \prod_{j = 1}^{k - 1} (1 - β_{j})

is the weight constructed by the stick-breaking process:

β_{k} | α \sim Beta (1, α)

and

\sum_{k = 1}^{\infty} π_{k} = 1

. Each observation

x_{i}

is associated with a latent variable

z_{i}

that indicates its component assignment. The DPMM is defined by the following generative process: (1) the collection of components

{β_{k}}_{k = 1}^{\infty}

independently drawn from a Beta distribution with parameters 1 and

α

. (2) the collection of components

{θ_{k}}_{k = 1}^{\infty}

corresponding to each mixture component are independently drawn from

G_{0}

. (3) For each

x_{i}

, the latent variables

z_{i}

is drawn from the prior distribution of weights. Conditioned on

z_{i}

and the component parameters, the

x_{i}

is generated from an observation model

p (x_{i}| θ_{z_{i}})

.

2.2. Accelerated Variational Bayesian Inference Algorithm

The accelerated variational Bayesian algorithm extends standard variational inference by introducing a soft truncation strategy for the DP model. While conventional methods select a simple parametric family for the variational distribution, the soft truncation allows the variational posterior to theoretically possess infinitely many components. However, for components beyond a prespecified level

T

(i.e.,

i > T

), the variational distributions are fixed to their priors and excluded from optimization. This ensures the variational family remains nested as

T

increases, enabling adaptive model complexity selection. Accordingly, the intractable posterior of the DP mixture is approximated by a factorized variational distribution of the form:

q (θ, β, z; ϕ) = [\prod_{i = 1}^{L} q_{β_{i}} (β_{i}; ϕ_{i}^{β}) q_{θ_{i}} (θ_{i}; ϕ_{i}^{θ})] [\prod_{i = 1}^{n} q_{z_{i}} (z_{i})]

(4)

where

q_{β_{i}} (β_{i}; ϕ_{i}^{β})

and

q_{θ_{i}} (θ_{i}; ϕ_{i}^{θ})

are parametric models with parameters

ϕ_{i}^{β}

and

ϕ_{i}^{θ}

, and

q_{z_{i}} (z_{i})

are discrete distributions over the component labels. To handle large-scale data efficiently, the dataset is divided among the leaf nodes of a KD-tree. The same responsibility distribution is assigned to all samples that belong to the same leaf, thereby avoiding point-wise updates. Complexity is further reduced by caching sufficient statistics at the node level.

The objective of the variational inference process would be to minimize the Kullback–Leibler divergence

K L [q (θ, β, z; ϕ)‖ q (θ, β, z| x, λ, α)]

, or equivalently, that minimize the free energy [37] in this paper, the form of free energy can be written as follows:

F = \sum_{i = 1}^{K} \{E_{q_{β_{i}}} [\log \frac{q_{β_{i}} (β_{i}; ϕ_{i}^{β})}{p_{β} (β_{i}| α)}] + E_{q_{β_{i}}} [\log \frac{q_{θ_{i}} (θ_{i}; ϕ_{i}^{θ})}{p (θ_{i}| λ)}]\} - \sum_{A} |n_{A}| \log \sum_{i = 1}^{\infty} \exp (S_{A, i})

(5)

where

n_{A}

is the number of data in the node outer node

A

of KD-tree. Based on the stick-breaking construction,

p_{β_{i}} (β_{i}| α)

and

q_{β_{i}} (β_{i}; ϕ_{i}^{β})

can be assumed as follows:

p_{β} (β_{i}| α) \sim Beta (α_{1}, α_{2})

(6)

q_{β_{i}} (β_{i}; ϕ_{i}^{β}) \sim Beta (ϕ_{i, 1}^{β}, ϕ_{i, 2}^{β})

(7)

To obtain an analytical solution for the variational update equations, this algorithm assumes conjugate exponential family distributions for both the prior and the variational posterior. Thus, the analytical solution for prior

p (θ_{i}| λ)

and variational posterior

q_{θ_{i}} (θ_{i}; ϕ_{i}^{θ})

in the exponential family are given by:

p (θ_{i}| λ) = h (θ_{i}) \exp \{λ_{1} θ_{i} + λ_{2} (- a (θ_{i}) - a (λ))\}

(8)

q_{θ_{i}} (θ_{i}; ϕ_{i}^{θ}) = h (θ_{i}) \exp \{ϕ_{i, 1}^{θ} θ_{i} + ϕ_{i, 2}^{θ} (- a (θ_{i}) - a (ϕ_{i}^{θ}))\}

(9)

where

h (•)

is the base measure;

a (•)

is the log-partition function. The form of

S_{A, i}

can be written as follows:

S_{A, i} = E_{q_{θ_{i}}} {[θ_{i}]}^{T} {〈x〉}_{A} - E_{q_{θ_{i}}} [a (θ_{i})]

(10)

where

{〈x〉}_{A}

denotes the average over all data contained in the node

A

. The optimal rules for different parameters with the KD-tree are as follows:

ϕ_{i, 1}^{β} = α_{1} + \sum_{A} |n_{A}| q_{z_{A}} (z_{A} = i)

(11)

ϕ_{i, 2}^{β} = α_{2} + \sum_{A} |n_{A}| \sum_{j = i + 1}^{\infty} q_{z_{A}} (z_{A} = j)

(12)

ϕ_{i, 1}^{θ} = λ_{1} + \sum_{A} |n_{A}| q_{z_{A}} (z_{A} = i) {〈x〉}_{A}

(13)

ϕ_{i, 2}^{θ} = λ_{2} + \sum_{A} |n_{A}| q_{z_{A}} (z_{A} = i)

(14)

where

q_{z_{A}} (z_{A})

is given by:

q_{z_{A}} (z_{A} = i) = \frac{\exp (S_{A, i})}{\sum_{j = 1}^{\infty} \exp (S_{A, j})}

(15)

3. Copula Framework

3.1. Pair-Copula Constructions

A Copula

C (•)

:

{[0, 1]}^{N} \to [0, 1]

is a multivariate cumulative distribution function (CDF) with uniform marginal distributions on

[0, 1]

. According to Sklar’s theorem [38], for any joint CDF

F

with marginal distribution functions

F_{1}, F_{2}, \dots, F_{N}

can be expressed as follows:

F (X_{1}, X_{2}, \dots, X_{N}) = C \{F_{1} (X_{1}), F_{2} (X_{2}), \dots, F_{N} (X_{N})\}

(16)

If the marginal distributions are continuous, the copula

C (•)

is unique. The joint probability density function

f

can be expressed as the product of the copula density

c (•)

and the marginal densities

f (X_{1}, X_{2}, \dots, X_{N}) = c \{F (X_{1}), F (X_{2}), \dots, F (X_{N})\} \prod_{i = 1}^{N} f_{i} (X_{i})

(17)

where

c (•)

is the partial derivative of

C (•)

. Sklar’s theorem decouples marginal modeling from dependence modeling, which allows the

c (•)

and

C (•)

to be modeled independently. To address the issue of dimension caused by high-dimensional joint distributions, Joe (1997) [39] introduced the Pair-Copula Construction (PCC) framework, which decomposes a high-dimensional joint distribution into a product of conditional bivariate copulas. The general formula of conditional marginal density can be written as follows:

f (X ∣ v) = c_{X v_{j} ∣ v_{- j}} \{F (X ∣ v_{- j}), F (v_{j} ∣ v_{- j})\} \cdot f (X ∣ v_{- j})

(18)

where

v

is the n-dimensional vector;

v_{j}

is the

j

component of

v

;

v_{- j}

is vector

v

without its

j

component. for every

j

, the formula of

F (X ∣ v)

can be written as follows:

F (X ∣ v) = \frac{\partial C_{X v_{j} ∣ v_{- j}} \{F (x ∣ v_{- j}), F (v_{j} ∣ v_{- j})\}}{\partial F (v_{j} ∣ v_{- j})},

(19)

For the special case where

v

is univariate and

X

,

v

are uniform, the conditional distribution of

F (X ∣ v)

is obtained by the h-function. That is

h (X, v) = F (X ∣ v) = \frac{\partial C_{X, v} (X, v)}{\partial v}

(20)

where the second parameter of

h (•)

always denotes the conditioning variable. While the

h (•)

provides the analytical mechanism to evaluate conditional distributions in a pair-copula construction, the sequential order in which these pair copulas are cascaded is not predetermined.

3.2. Vine Copula

Bedford & Cooke (2001, 2002) [40,41] introduced the Regular Vine (R-Vine) as a systematic framework for organizing all valid pair-copula decompositions. A

N

dimensional R-Vine is defined by an ordered sequence of (

N - 1

) trees. Each tree

T_{i}

consists of a set of nodes

N_{i}

and a set of edges

E_{i}

. The sequence of trees forms a valid vine when [42]:

(1): Tree $T_{1}$ has a node set $N_{1}$ representing the $N$ random variables.
(2): Tree $T_{i}$ (for $i \geq 2$ ) has a node set $N_{i}$ formed by the edges $E_{i - 1}$ of the previous tree $T_{i - 1}$ .
(3): Proximity Condition: For any edge $e = \{a, b\} \in E_{i}$ (where $i \geq 2$ ), it must hold that $|a \cap b| = 1$ .

As the most general vine copula structure, R-vine subsumes C-vine and D-vine as special cases. Its distribution-free properties and configurable architecture enable flexible adaptation to complex dependence patterns without parametric constraints [43]. For a model specified by the R-Vine structure, a set of copula families, and a set of parameters, the joint density function is given by the product of marginal densities and all pair-copula densities [44]:

f (X_{1}, X_{2}, \dots, X_{N}) = \prod_{n = 1}^{N} f_{n} (X_{n}) \prod_{i = 1}^{N - 1} \prod_{e \in E_{i}} c_{C_{e, a}, C_{e, b} | D_{e}} \{F_{C_{e, a} | D_{e}} (X_{C_{e, a}} | X_{D_{e}}), F_{C_{e, b} | D_{e}} (X_{C_{e, b}} | X_{D_{e}})\}

(21)

where

e = \{a, b\} \in E_{i}

is an edge in tree

T_{i}

, connecting the random variables

X_{C_{e, a}}

and

X_{C_{e, b}}

conditioned on the set

X_{D_{e}}

.

C_{e, a}

,

C_{e, b}

is the conditioned set of

e

, defined as follows:

C_{e, a} = a \ D_{e}

(22)

C_{e, b} = b \ D_{e}

(23)

where

D_{e}

is the conditioning set of

e

, defined as follows:

D_{e} = a \cap b

(24)

Building an R-Vine copula model requires solving three intertwined tasks: (1) selecting the vine structure; (2) selecting a suitable bivariate copula family for each pair-copula in the structure; (3) estimating the parameters for all selected copulas. As the number of possible R-Vine structures grows super-exponentially with dimension, a manual approach is infeasible. Therefore, the Dißmann algorithm is adapted in this paper. Figure 1 shows the structure of an exemplary three-dimensional R-vine copula.

3.3. Regular Vine Model Selection and Estimation

The construction of an R-Vine copula model involves three steps: structure selection, copula family selection, and parameter estimation. As the number of possible R-Vine structures grows exponentially with dimension, the efficient sequential algorithm of Dißmann et al. (2013) [45] is adopted in this paper. This algorithm proceeds by capturing the strongest conditional dependencies at each level of the vine hierarchy through the following steps:

(1): Starting from uniform pseudo-observations, the empirical Kendall’s τ matrix is computed for all variable pairs to serve as a nonparametric weight for dependence strength.
(2): For each tree level ( $T_{1}$ to $T_{N - 1}$ ), select a Maximum Spanning Tree using absolute τ values as weights. For each edge, choose the best-fitting bivariate copula family by Bayesian Information Criterion ( $B I C$ ) and estimate its parameters by maximum likelihood. The preliminary independence test based on τ is performed for each candidate pair. The independence copula is selected if independence is not rejected. The h-function is then applied to obtain the transformed variables for the next tree.
(3): The algorithm outputs the full model specification in three lower triangular arrays: vine structure, copula families, and parameters.

This methodology yields a flexible characterization of high-dimensional dependence by automating the identification of prominent associations and the individual tailoring of pair-copula tail properties.

4. Goodness-of-Fit Test

This study employs a two-stage modeling framework comprising marginal PDF fitting for wind field variables, followed by the construction of the JPDF to capture their interdependencies [46]. To ensure rigorous evaluation across distinct modeling objectives and error sources, a hierarchical assessment scheme aligned with the DPMM-Copula structure is adopted, spanning three layers: univariate margins, bivariate copula dependence, and the JPDF.

At the univariate marginal distribution layer, Root Mean Square Error (

R M S E

), Coefficient of Determination (

R^{2}

), and Mean Absolute Error (

M A E

) are adopted to evaluate the fitting accuracy of the PDF and CDF. Their mathematical formulations are given below:

R M S E = \sqrt{\frac{1}{L} \sum_{i = 1}^{L} {(x_{i} - {\hat{x}}_{i})}^{2}}

(25)

R^{2} = 1 - \frac{\sum_{i = 1}^{L} {(x_{i} - {\hat{x}}_{i})}^{2}}{\sum_{i = 1}^{L} {(x_{i} - \bar{x})}^{2}}

(26)

M A E = \frac{1}{L} \sum_{i = 1}^{L} |x_{i} - {\hat{x}}_{i}|

(27)

where

L

is the number of data samples;

x_{i}

is the

i

-th measured value of the data sample;

{\hat{x}}_{i}

is the

i

-th output value of the model;

\bar{x}

is the sample mean of the measured value.

At the bivariate copula dependence structure layer, the

B I C

is employed to evaluate the statistical validity and parsimony of the selected copula structure. The

B I C

is defined as follows:

B I C = - 2 \cdot \ln (L) + k \cdot \ln (L)

(28)

where

L

is the model likelihood, and

k

is the number of model parameters.

At the JPDF layer,

R M S E

,

M A E

, and the Index of Agreement (

I A

) are utilized to quantify Copula model accuracy. The

I A

formula is as follows:

I A = 1 - \frac{\sum_{i = 1}^{L} {(x_{i} - {\hat{x}}_{i})}^{2}}{\sum_{i = 1}^{L} {(|x_{i} - \bar{x}| + |x_{i} - \bar{x}|)}^{2}}

(29)

5. Case Study

5.1. Instrument and Data

This study is bases on the Beikou Bridge at the Oujiang River in Wenzhou, Zhejiang, China—a three-tower four-span double-layer continuous steel truss girder suspension bridge with a 2178 m main cable span (230 + 800 + 800 + 348). Its main cable is 1/10, with north and south spans measuring 213.6 m and 273.6 m, respectively. Wind speed, direction, and attack angle were recorded by sensors installed at four positions on the main girder. Refer to Figure 2 for further details.

It should be noted that this bridge is located in a coastal area with predominantly flat estuarine terrain and gentle hills to the north and south, resulting in relatively unobstructed wind flow from seaward directions. The region experiences strong seasonal influences, with wind field variations significantly affecting the structural behavior of the bridge.

In this study, the JPDF is constructed using 2023 wind data collected at 10 Hz from four 3-axis ultrasonic anemometers (F-06, F-08, F-14, and F-16). The main technical parameters of anemometers are presented in Table 1.

The wind speed, direction, and angle of attack are analyzed based on 10 min mean values, a standard averaging interval in engineering applications. Prior to JPDF modeling, the raw data (sample size 10 Hz) is preprocessed to address missing values and outliers. Outliers are identified and removed using the Pauta criterion (3σ rule), which assumes that residual errors follow a normal distribution and defines a safety interval of ±3σ, beyond which the probability of occurrence is less than 0.3% [47]. Following outlier elimination, missing and excised data points are imputed by cubic spline interpolation [48], a method selected for its high smoothness and continuity, which are particularly well suited to the fluctuating components of wind field data. While this preprocessing procedure enhances the reliability and temporal consistency of the dataset, it does not distort the tails or the dependence structure.

5.2. Marginal PDFs for Wind Field

In this study, the DPMM is adopted for marginal density estimation of each variable (wind speed, wind direction, and wind attack angle). This step constitutes the first stage in the construction of the JPDF.

Specifically, a Gaussian mixture model is employed for wind speed and wind attack angle to capture their continuous multimodal characteristics, while a von Mises mixture model is applied to wind direction to strictly preserve the periodicity inherent in circular variables. This mixture model within the DPMM framework can adaptively determine the number of mixture components without a predefined structure. This advantage can avoid the functional form constraints and selection biases typical of conventional parametric models. As a benchmark for comparison, kernel density estimation (KDE) is also implemented and evaluated.

For the quantitative assessment of goodness-of-fit metrics, the continuous probability density estimates are compared against the empirical probabilities derived from the discretized bins. Accordingly, the domain of each variable is partitioned into intervals (bins), a procedure that balances statistical resolution with computational efficiency. The wind speed at the observation points is discretized in 0.5 m/s bins between site-specific minimum and maximum values. The wind direction is discretized into 36 bins (0° to 360°, 10° interval, measured clockwise from true north) [49]. For wind attack angle, the theoretical range spans from −90° to 90°; however, practical observations are typically more concentrated. To preserve statistical fidelity while maintaining valid bin coverage, a 2° bin spacing is adopted throughout this interval.

As wind direction belongs to circular data, the application of linear mixture models ignores periodicity. Therefore, the wind direction observations should be transformed onto the two-dimensional Euclidean plane, and a DP mixture of von Mises models (DPMM-vM) is adopted as the marginal model. For wind speed and wind attack angle, DP Gaussian mixture models (DPGMMs) are employed. The fitted PDFs and CDFs are compared with the actual bin values, enabling a quantitative assessment of the goodness-of-fit between the DPMM-based marginals (DPGMM and DPMvMM) and the KDE benchmark.

5.2.1. Marginal Model for Wind Speed

The marginal model of wind speed at the four positions (F-06, F-08, F-14, and F-16) is fitted respectively using DPGMM and KDE. The corresponding PDF and CDF estimates are displayed in Figure 3 and Figure 4. Table 2 and Table 3 show the goodness-of-fit results for marginal models of wind speed.

As indicated by the goodness-of-fit metrics presented in Table 2 and Table 3, both methods yield low

R M S E

and

M A E

values, with

R^{2}

values close to 1, confirming satisfactory overall fit. It is worth mentioning that while KDE achieves slightly lower

R M S E

and

M A E

in some PDF fitting cases, the DPMM is still retained as the marginal model. Unlike KDE, which yields a nonparametric estimate without structural interpretability, the DPMM provides explicit parametric components that correspond to physically meaningful wind regimes and form the basis for the analysis of seasonal variation.

Table 4 shows the model parameters for the marginal PDFs of wind speed. The variation in the number of DPGMM components along the bridge span, seven at F-14 versus four or five at the remaining positions, reflects pronounced spatial heterogeneity in the wind speed distribution. F-14 is located at the mid-span of the main girder (Figure 2). While the wind field at F-14 is subject to aerodynamic interference from the bridge towers and cables, it is largely unaffected by the surrounding terrain due to its distance from the shoreline. This observation confirms the effectiveness of the DPGMM in adaptively determining the required model complexity and providing high-quality marginal foundations for the subsequent Vine copula modeling.

5.2.2. Marginal Model for Wind Direction

Since the wind direction is circular data, the application of linear mixture models would ignore its inherent periodicity. Therefore, the DPMM-vM is adopted as the marginal model, with the number of components adaptively determined from the data. Each von Mises component is parameterized by a mean direction and a concentration parameter, naturally satisfying the periodic boundary condition

F (0) = F (2 π)

. For the integration purpose in the copula framework, the fitted DPMM-vM cumulative distribution function is used to transform wind direction to uniform pseudo-observations. Since the CDF satisfies

F (θ + 2 π) = F (θ) + 1

, observations near 0° and 360° map consistently to values near 0 and 1. The corresponding PDF and CDF estimates are displayed in Figure 5 and Figure 6. Table 5 and Table 6 show the goodness-of-fit results for marginal models of wind direction.

As summarized in Table 5 and Table 6, the goodness-of-fit metrics indicate that the DPMM-vM preserves the inherent periodicity of wind direction while maintaining a robust characterization of the global distributional form.

Table 7 provides the estimated mean direction, concentration parameter

κ

, and mixture weight for each component of the DPMM-vM. The prevailing wind direction intervals and their corresponding directional concentration are distinctly observable at each measurement site within the bridge area. For example, the dominant directional components at sites F-06 and F-08 are centered at approximately 0.84 rad and 0.79 rad, respectively, and are associated with notably high

κ

. These estimates indicate the presence of stable, well-defined prevailing wind directions within the study region.

5.2.3. Marginal Model for Wind Attack Angle

The marginal models of wind attack angle are modeled using the DPGMM. The corresponding PDF and CDF estimates are displayed in Figure 7 and Figure 8. Table 8 and Table 9 show the goodness-of-fit results for marginal models of wind attack angle.

Figure 8 illustrates the pronounced central tendency evident in the measured wind attack angle data.

As shown in Table 10, the relatively small estimated means and standard deviations of the individual mixture components indicate that the observed wind attack angle exhibits a limited range of fluctuation, concentrated within a considerably narrower interval than the theoretical bounds of −90° to 90°.

5.3. Seasonal Variation in the Wind Field

Bayesian inference in the DPMM framework provides the posterior component-assignment probabilities for each observation. These quantities serve as the basis for a seasonal analysis of the wind field. The monthly occurrence probabilities of the mixture components for wind speed, wind direction, and wind attack angle are shown in Figure 9, Figure 10 and Figure 11, respectively. This probabilistic decomposition reveals the seasonal variability of the wind climate at the bridge site and illustrates the annual progression of the dominant regional weather regimes.

As shown in Figure 3a and Figure 10a, at position F-06, the fourth Gaussian component of wind speed (characterized by the highest mean value of approximately 5.22 m/s) reaches its peak monthly probability of 39.75% in July, indicating that winds of this intensity dominate during this month.

Located within the subtropical monsoon climate of southeastern China, the bridge site exhibits a pronounced seasonal shift in wind direction components, as shown in Figure 10a. Specifically, the third wind direction component (mean direction: 194.01°, concentration parameter: 7.95) emerges as the dominant regime in July, accounting for a notably high occurrence probability of 60.35%. Concurrently, the third Gaussian component of the wind attack angle (mean: 2.06°, standard deviation: 1.74°) attains its maximum probability of 65.11% during the same month.

It is noteworthy that the highest mean wind speeds also have markedly elevated probabilities at the other locations in July: the fifth component at F-08 (mean: 5.79 m/s, 42.95%), the sixth at F-14 (mean: 7.72 m/s, 49.81%), and the fourth at F-16 (mean: 6.13 m/s, 30.48%). During the same month, the prevailing wind direction and wind attack angle components exhibit strong concentration, each dominated by a single regime.

The probabilistic decomposition identifies seasonal wind regimes and their annual progression, which are meaningful for the vulnerability assessment in the construction phase during the dominant wind period, maintenance scheduling during low-wind months, reliability assessment with regime-conditional limit states, and wind-traffic control decision-making.

5.4. Construction of JPDF Based on Vine Copula

Based on Sklar’s theorem, the JPDF can be decomposed into the marginal distributions of each variable and a Copula function that describes the dependency structure among the variables. Section 5.2 has already obtained the precise marginal models of wind speed, wind direction, and wind attack angle using a mixture model. Thus, the JPDF of the three variables is constructed by selection and fitting of the optimal R-vine structure and bivariate copulas, following the sequential algorithm of Dißman. The bivariate copula families employed in this study are listed in Table 11.

Table 12 presents the optimal R-vine structure, the selected bivariate copula families, their corresponding parameter estimates, and the associated

B I C

values for the JPDF of wind parameters at the four positions (F-06, F-08, F-14, and F-16).

The bivariate copula selection results show that there are differences in the optimal Copula types at each position, which reflects pronounced spatial heterogeneity in the wind field dependence structure at the bridge site. All selected optimal bivariate copulas return low

B I C

values, which indicates a favorable balance between goodness-of-fit and model complexity. These models provide a reliable dependence skeleton for JPDF construction.

Table 13 and Table 14 report the goodness-of-fit test for the JPDF and the joint CDF derived from the fitted R-vine copula models.

6. Conclusions

This study develops a comprehensive JPDF modeling framework for wind speed, wind direction, and wind attack angle. The proposed approach integrates the flexibility of the DPMM for adaptive marginal model estimation with the structural adaptability of R-vine copulas for dependence modeling. The framework is established to simultaneously accommodate the multimodal and non-Gaussian features of the wind parameters and the complex and nonlinear interdependencies among them, and is subsequently validated against measured data to confirm its availability and robustness. The following findings from discussions are noteworthy:

(1): The DPMM offers great advantages in marginal distribution fitting. It autonomously identifies the intrinsic clustering structure of the wind field data and exhibits strong performance and stability in capturing multimodal, non-Gaussian characteristics as well as the tail behavior of the distribution.
(2): The posterior component-assignment probabilities from the Bayesian framework enable a monthly decomposition of the wind field. This analysis reveals a distinct seasonal pattern, with a dominant summer regime of elevated wind speeds, stable direction, and narrow attack-angle range. These results provide direct support for season-specific wind-resistant design and reliability assessment.
(3): The diversity of adaptively selected bivariate copula families reflects pronounced spatial heterogeneity in the wind field characteristics along the bridge span, thereby affirming the framework’s flexibility in capturing complex nonlinear dependence.
(4): The proposed DPMM-Copula framework yields excellent goodness-of-fit performance, demonstrating high-fidelity characterization of the marginal distributions, the JPDF, and the JCDF.
(5): It is worth mentioning that the framework is validated using one year of measurements from a single coastal bridge site. While it is sufficient for establishing seasonal wind regime characteristics, the record length may not fully represent interannual variability. Moreover, typhoon events are not separated from the overall dataset, and their distinct wind structures may need specific further study in future work.

Author Contributions

Conceptualization, B.S.; Methodology, B.S.; Validation, Z.Y. and W.R.; Formal analysis, Z.Y. and W.R.; Investigation, M.L. and W.H.; Resources, L.M.; Data curation, L.M.; Writing—original draft, Z.Y.; Writing—review and editing, M.L. and W.H.; Visualization, M.L. and W.H.; Funding acquisition, B.S. and W.R. All authors have read and agreed to the published version of the manuscript.

Funding

The research work was supported in part by the National Nature Science Foundation of China (Grant Nos. 52278226 and 52571314) and the Natural Science Foundation of Zhejiang Province (Grant No. Y24E080011). Opinions and findings presented are those of the authors and do not necessarily reflect the views of the sponsors.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Chen, X.L.; Xiang, H.Y.; Li, Y.L. Mechanism of wake-induced vibration and mitigation in parallel box girders of coastal long-span cable-stayed bridges. J. Wind Eng. Ind. Aerodyn. 2025, 266, 106206. [Google Scholar] [CrossRef]
Zhao, L.; Cui, W.; Fang, G.; Cao, S.; Zhu, L.; Song, L.; Ge, Y. State-of-the-art review on typhoon wind environments and their effects on long-span bridges. Adv. Wind Eng. 2024, 1, 100007. [Google Scholar] [CrossRef]
Tao, T.; Wang, H.; Wen, X.; Fenerci, A. Flutter analysis of a long-span triple-tower suspension bridge under typhoon winds with non-uniform spanwise profile. Structures 2024, 68, 107156. [Google Scholar] [CrossRef]
Tao, T.; Xu, Y.L.; Huang, Z. Buffeting Analysis of Long-Span Bridges under Typhoon Winds with Time-Varying Spectra and Coherences. J. Struct. Eng. 2020, 146, 04020255. [Google Scholar] [CrossRef]
Carta, J.A.; Ramirez, P.; Velazquez, S. A review of wind speed probability distributions used in wind energy analysis: Case studies in the Canary Islands. Renew. Sustain. Energy Rev. 2009, 13, 933–955. [Google Scholar] [CrossRef]
Ozay, C.; Celiktas, M.S. Statistical analysis of wind speed using two-parameter Weibull distribution in Alaçatı region. Energy Convers. Manag. 2016, 121, 49–54. [Google Scholar] [CrossRef]
Wang, J.; Huang, X.; Li, Q.; Ma, X. Comparison of seven methods for determining the optimal statistical distribution parameters: A case study of wind energy assessment in the large-scale wind farms of China. Energy 2018, 164, 432–448. [Google Scholar] [CrossRef]
Ding, Y.; Ye, X.W.; Guo, Y.; Zhang, R.; Ma, Z. Probabilistic method for wind speed prediction and statistics distribution inference based on SHM data-driven. Probabilistic Eng. Mech. 2023, 73, 103475. [Google Scholar] [CrossRef]
Jung, C.; Schindler, D. Wind speed distribution selection–A review of recent development and progress. Renew. Sustain. Energy Rev. 2019, 11, 109290. [Google Scholar] [CrossRef]
Kollu, R.; Rayapudi, S.R.; Narasimham, S.V.L.; Pakkurthi, K.M. Mixture probability distribution functions to model wind speed distributions. Int. J. Energy Environ. Eng. 2012, 3, 27. [Google Scholar] [CrossRef]
Mazzeo, D.; Oliveti, G.; Labonia, E. Estimation of wind speed probability density function using a mixture of two truncated normal distributions. Renew. Energy 2018, 115, 1260–1280. [Google Scholar] [CrossRef]
Wang, W.; Gao, Y.; Ikegaya, N. Approximating wind speed probability distributions around a building by mixture weibull distribution with the methods of moments and L-moments. J. Wind Eng. Ind. Aerodyn. 2025, 257, 106001. [Google Scholar] [CrossRef]
Khamees, A.K.; Abdelaziz, A.Y.; Ali, Z.M.; Alharthi, M.M.; Ghoneim, S.S.; Eskaros, M.R.; Attia, M.A. Mixture probability distribution functions using novel metaheuristic method in wind speed modeling. Ain Shams Eng. J. 2022, 13, 101613. [Google Scholar] [CrossRef]
Wang, Y.; Li, Y.; Zou, R.; Song, D. Bayesian infinite mixture models for wind speed distribution estimation. Energy Convers. Manag. 2021, 236, 113946. [Google Scholar] [CrossRef]
Wu, J.; Li, N. Impact of components number selection in truncated Gaussian mixture model and interval partition on wind speed probability distribution estimation. Sci. Total Environ. 2023, 883, 163709. [Google Scholar] [CrossRef]
Miao, S.; Xie, K.; Yang, H.; Karki, R.; Tai, H.M.; Chen, T. A mixture kernel density model for wind speed probability distribution estimation. Energy Convers. Manag. 2016, 126, 1066–1083. [Google Scholar] [CrossRef]
Xie, W.; Huang, P. Extreme estimation of wind pressure with unimodal and bimodal probability density function characteristics: A maximum entropy model based on fractional moments. J. Wind Eng. Ind. Aerodyn. 2021, 214, 104663. [Google Scholar] [CrossRef]
Houndekindo, F.; Ouarda, T.B. A non-parametric approach for wind speed distribution mapping. Energy Convers. Manag. 2023, 296, 117672. [Google Scholar] [CrossRef]
Li, J.; Hong, X. Typhoon hazard analysis based on the probability density evolution theory. J. Wind Eng. Ind. Aerodyn. 2021, 219, 104796. [Google Scholar] [CrossRef]
Ye, T.; Li, L. Statistical Analysis and Study on Joint Distribution of the Extreme Value of Wind Speed and Wind Direction. IOP Conf. Ser. Earth Environ. Sci. 2021, 634, 012019. [Google Scholar] [CrossRef]
Carta, J.A.; Ramírez, P.; Bueno, C. A joint probability density function of wind speed and direction for wind energy analysis. Energy Convers. Manag. 2008, 49, 1309–1320. [Google Scholar] [CrossRef]
Han, Q.; Hao, Z.; Hu, T.; Chu, F. Non-parametric models for joint probabilistic distributions of wind speed and direction data. Renew. Energy 2018, 126, 1032–1042. [Google Scholar] [CrossRef]
Soukissian, T.H.; Karathanasi, F.E. On the selection of bivariate parametric models for wind data. Appl. Energy 2017, 188, 280–304. [Google Scholar] [CrossRef]
Wen, C.; Zhang, Z.; Nie, Y. Field Measurements and Time-Domain Buffeting Analysis of a Long-Span Suspension Bridge in a Mountainous Area. Int. J. Struct. Stab. Dyn. 2025, 2750077. [Google Scholar] [CrossRef]
Chen, Q.; Yu, C.; Li, Y. General strategies for modeling joint probability density function of wind speed, wind direction and wind attack angle. J. Wind Eng. Ind. Aerodyn. 2022, 225, 104985. [Google Scholar] [CrossRef]
Ding, Y.; Ye, X.-W.; Guo, Y. Copula-based JPDF of wind speed, wind direction, wind angle, and temperature with SHM data. Probabilistic Eng. Mech. 2023, 73, 103483. [Google Scholar] [CrossRef]
Zhang, J.; Zhang, M.; Jiang, X.; Wu, L.; Qin, J.; Li, Y. Pair-Copula-based trivariate joint probability model of wind speed, wind direction and angle of attack. J. Wind Eng. Ind. Aerodyn. 2022, 225, 105010. [Google Scholar] [CrossRef]
Zhang, J.; Jiang, F.; Zhang, M.; Zheng, H.; Li, Y.; Liang, J. Study on joint design method of multiple wind parameters for long-span bridges in deep-cutting gorge areas based on field measurement. J. Wind Eng. Ind. Aerodyn. 2024, 254, 105930. [Google Scholar] [CrossRef]
Meng, S.; Ding, Y.; Zhu, H. Stochastic response of a coastal cable-stayed bridge subjected to correlated wind and waves. J. Bridge Eng. 2018, 23, 04018091. [Google Scholar] [CrossRef]
Fang, C.; Xu, C.; Li, Y.; Li, X. Directional effects on the nonlinear response of vehicle-bridge system under correlated wind and waves. Ocean Eng. 2024, 310, 118718. [Google Scholar] [CrossRef]
Yang, R.; Li, Y.; Xu, C.; Yang, Y.; Fang, C. Directional effects of correlated wind and waves on the dynamic response of long-span sea-crossing bridges. Appl. Ocean Res. 2023, 132, 103483. [Google Scholar] [CrossRef]
Jiang, Y.; Li, L.; Zhu, L.; Xu, W.; Wang, Y.; Chen, B.; Wu, G. Design provision assessment for localized-loading resistance of welded stainless steel I-beams with web openings. Case Stud. Constr. Mater. 2025, 22, e04854. [Google Scholar] [CrossRef]
Ye, Q.; Nie, H.; Chen, B.; Wang, Y.; Lu, P.; Dai, P.; Hai, L. Local buckling behaviour of welded stainless steel I-beams with perforated webs. Thin-Walled Struct. 2025, 212, 113201. [Google Scholar] [CrossRef]
Ferguson, T. A Bayesian analysis of some nonparametric problems. Ann. Stat. 1973, 1, 209–230. [Google Scholar] [CrossRef]
Sethuraman, J. A Constructive Definition of the Dirichlet Prior. Stat. Sin. 1994, 4, 639–650. [Google Scholar] [CrossRef]
Bishop, C.; Nasrabadi, N. Pattern Recognition and Machine Learning; Springer: New York, NY, USA, 2006. [Google Scholar]
Kurihara, K.; Welling, M.; Vlassis, N. Accelerated variational dirichlet process mixtures. In Advances in Neural Information Processing Systems; MIT Press: Cambridge, MA, USA, 2006; Available online: http://hdl.handle.net/10993/11033 (accessed on 10 January 2026).
Sklar, M. Fonctions de Répartition à n Dimensions et Leurs Marges. Ann. De L’isup 1959, 8, 229–231. Available online: https://hal.science/hal-04094463v1/document (accessed on 10 May 2026).
Joe, H. Multivariate Models and Dependence Concepts; Chapman & Hall: London, UK, 1997. [Google Scholar]
Bedford, T.; Cooke, R.M. Probability Density Decomposition for Conditionally Dependent Random Variables Modeled by Vines. Ann. Math. Artif. Intell. 2001, 32, 245–268. [Google Scholar] [CrossRef]
Bedford, T.; Cooke, R.M. Vines: A new graphical model for dependent random variables. Ann. Stat. 2002, 30, 1031–1068. [Google Scholar] [CrossRef]
Czado, C.; Nagler, T. Vine copula based modeling. Annu. Rev. Stat. Its Appl. 2022, 9, 453–477. [Google Scholar] [CrossRef]
Kurowicka, D.; Cooke, R. Uncertainty Analysis with High Dimensional Dependence Modelling; Wiley: New York, NY, USA, 2006. [Google Scholar] [CrossRef]
Aas, K.; Czado, C.; Frigessi, A.; Bakken, H. Pair-copula constructions of multiple dependence. Insur. Math. Econ. 2009, 44, 182–198. [Google Scholar] [CrossRef]
Dißmann, J.; Brechmann, E.; Czado, C.; Kurowicka, D. Selecting and estimating regular vine copulae and application to financial returns. Comput. Stat. Data Anal. 2013, 59, 52–69. [Google Scholar] [CrossRef]
Huang, S.; Li, Q.; Shu, Z.; Chan, P.W. Copula-based estimation of directional extreme wind speeds: Application for wind-resistant structural design. Structures 2024, 60, 105845. [Google Scholar] [CrossRef]
Tang, Z.; Shi, X.; Zou, H.; Zhu, Y.; Yang, Y.; Zhang, Y.; He, J. Fault diagnosis of wind turbine generators based on stacking integration algorithm and adaptive threshold. Sensors 2023, 23, 6198. [Google Scholar] [CrossRef]
He, Y.; Li, H.; Wang, S.; Yao, X. Uncertainty analysis of wind power probability density forecasting based on cubic spline interpolation and support vector quantile regression. Neurocomputing 2021, 430, 121–137. [Google Scholar] [CrossRef]
Erdem, E.; Shi, J. Comparison of bivariate distribution construction approaches for analysing wind speed and direction data. Wind Energy 2011, 14, 27–41. [Google Scholar] [CrossRef]

Figure 1. The R-vine structure in three dimensions.

Figure 2. Sensor layout of the bridge.

Figure 3. The PDF of wind speed at different positions.

Figure 4. The CDF of wind speed at different positions.

Figure 5. The PDF of wind direction at different positions.

Figure 6. The CDF of wind direction at different positions.

Figure 7. The PDF of wind attack angle at different positions.

Figure 8. The CDF of wind attack angle at different positions.

Figure 9. The monthly occurrence probabilities of the components for wind speed.

Figure 10. The monthly occurrence probabilities of the components for wind direction.

Figure 11. The monthly occurrence probabilities of the components for wind attack angle.

Table 1. The main technical parameters of the anemometer.

Equipment	Technical Parameters	Value
HD2003	Wind speed	0~65 m/s ± 1
	Wind direction	0~360° ± 1
	Wind attack angle	−90°~90° ± 1

Table 2. The goodness-of-fit results for wind speed PDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPGMM	0.0015	0.0009	0.9998
F-06	KDE	0.0011	0.0005	0.9993
F-08	DPGMM	0.0023	0.0012	0.9975
F-08	KDE	0.0005	0.0003	0.9999
F-14	DPGMM	0.0016	0.0008	0.9978
F-14	KDE	0.0011	0.0004	0.9990
F-16	DPGMM	0.0023	0.0013	0.9970
F-16	KDE	0.0011	0.0005	0.9994

Table 3. The goodness-of-fit results for wind speed CDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPGMM	0.0024	0.0022	0.9999
F-06	KDE	0.0012	0.0005	0.9999
F-08	DPGMM	0.0026	0.0022	0.9999
F-08	KDE	0.0015	0.0015	0.9999
F-14	DPGMM	0.0082	0.0080	0.9991
F-14	KDE	0.0011	0.0005	0.9999
F-16	DPGMM	0.0017	0.0015	0.9999
F-16	KDE	0.0012	0.0010	0.9999

Table 4. The model parameters for marginal PDFs of wind speed.

Item	Position	Parameter	Value
Wind speed	F-06	μ_F-06	$[\begin{matrix} 3.77 & 2.36 & 1.31 & 5.22 & 0.75 \end{matrix}]$
		σ_F-06	$[\begin{matrix} 1.10 & 0.71 & 0.42 & 1.86 & 0.22 \end{matrix}]$
		π_F-06	$[\begin{matrix} 0.4068 & 0.2899 & 0.1275 & 0.1191 & 0.0567 \end{matrix}]$
	F-08	μ_F-08	$[\begin{matrix} 3.88 & 2.78 & 1.74 & 1.04 & 5.79 \end{matrix}]$
		σ_F-08	$[\begin{matrix} 1.10 & 0.73 & 0.46 & 0.24 & 2.07 \end{matrix}]$
		π_F-08	$[\begin{matrix} 0.3396 & 0.3143 & 0.1828 & 0.0946 & 0.0687 \end{matrix}]$
	F-14	μ_F-14	$[\begin{matrix} 5.56 & 2.29 & 3.43 & 1.47 & 3.69 & 7.72 & 0.99 \end{matrix}]$
		σ_F-14	$[\begin{matrix} 1.56 & 0.64 & 0.95 & 0.36 & 1.01 & 2.92 & 0.19 \end{matrix}]$
		π_F-14	$\begin{array}{l} [\begin{matrix} 0.2689 & 0.2195 & 0.1526 & \dots \end{matrix} \\ \begin{matrix} \dots & 0.1337 & 0.0813 & 0.0805 & 0.0635 \end{matrix}] \end{array}$
	F-16	μ_F-16	$[\begin{matrix} 2.68 & 4.04 & 1.50 & 6.13 & 0.78 \end{matrix}]$
		σ_F-16	$[\begin{matrix} 0.81 & 1.24 & 0.49 & 2.21 & 0.24 \end{matrix}]$
		π_F-16	$[\begin{matrix} 0.3403 & 0.3400 & 0.1697 & 0.0790 & 0.0710 \end{matrix}]$

Table 5. The goodness-of-fit results for wind direction PDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPMM-vM	0.0079	0.0048	0.8912
F-06	von Mises KDE	0.0047	0.0023	0.9610
F-08	DPMM-vM	0.0040	0.0031	0.9656
F-08	von Mises KDE	0.0024	0.0014	0.9874
F-14	DPMM-vM	0.0035	0.0029	0.9640
F-14	von Mises KDE	0.0016	0.0011	0.9929
F-16	DPMM-vM	0.0067	0.0051	0.9113
F-16	von Mises KDE	0.0026	0.0017	0.9870

Table 6. The goodness-of-fit results for wind direction CDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPMM-vM	0.0220	0.0197	0.9942
F-06	von Mises KDE	0.0221	0.0203	0.9942
F-08	DPMM-vM	0.0261	0.0232	0.9924
F-08	von Mises KDE	0.0262	0.0236	0.9923
F-14	DPMM-vM	0.0329	0.0249	0.9899
F-14	von Mises KDE	0.0326	0.0244	0.9901
F-16	DPMM-vM	0.0213	0.0180	0.9941
F-16	von Mises KDE	0.0229	0.0211	0.9931

Table 7. The model parameters for marginal PDFs of wind direction.

Item	Position	Parameters	Value
Wind direction	F-06	μ_F-06	$[\begin{matrix} 0.84 & 2.15 & 3.37 & 5.22 & 1.22 \end{matrix}]$
		κ_F-06	$[\begin{matrix} 20.65 & 6.22 & 7.59 & 5.55 & 6.34 \end{matrix}]$
		π_F-06	$[\begin{matrix} 0.2908 & 0.1818 & 0.1695 & 0.2247 & 0.1332 \end{matrix}]$
	F-08	μ_F-08	$[\begin{matrix} 0.79 & 1.81 & 2.90 & 4.94 & 1.25 \end{matrix}]$
		κ_F-08	$[\begin{matrix} 29.42 & 6.42 & 5.99 & 5.39 & 6.54 \end{matrix}]$
		π_F-08	$[\begin{matrix} 0.2504 & 0.1832 & 0.1991 & 0.2523 & 0.1150 \end{matrix}]$
	F-14	μ_F-14	$[\begin{matrix} 1.06 & 2.00 & 3.44 & 4.96 & 1.50 \end{matrix}]$
		κ_F-14	$[\begin{matrix} 21.99 & 5.67 & 4.40 & 11.50 & 4.78 \end{matrix}]$
		π_F-14	$[\begin{matrix} 0.2018 & 0.2263 & 0.2638 & 0.1879 & 0.1202 \end{matrix}]$
	F-16	μ_F-16	$[\begin{matrix} 0.70 & 3.06 & 5.15 & 1.37 \end{matrix}]$
		κ_F-16	$[\begin{matrix} 20.82 & 4.02 & 6.55 & 3.14 \end{matrix}]$
		π_F-16	$[\begin{matrix} 0.2553 & 0.2016 & 0.2454 & 0.2977 \end{matrix}]$

Table 8. The goodness-of-fit results for wind attack angle PDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPGMM	0.0018	0.0006	0.9982
F-06	KDE	0.0029	0.0007	0.9950
F-08	DPGMM	0.0020	0.0007	0.9956
F-08	KDE	0.0004	0.0001	0.9998
F-14	DPGMM	0.0026	0.0009	0.9939
F-14	KDE	0.0009	0.0002	0.9992
F-16	DPGMM	0.0044	0.0012	0.9875
F-16	KDE	0.0038	0.0005	0.9906

Table 9. The goodness-of-fit results for wind attack angle CDFs.

Positions	Method	RMSE	MAE	R²
F-06	DPGMM	0.0014	0.0006	0.9999
F-06	KDE	0.0123	0.0090	0.9993
F-08	DPGMM	0.0041	0.0029	0.9999
F-08	KDE	0.0014	0.0009	0.9999
F-14	DPGMM	0.0030	0.0020	0.9999
F-14	KDE	0.0007	0.0002	0.9999
F-16	DPGMM	0.0109	0.0079	0.9995
F-16	KDE	0.0237	0.0170	0.9975

Table 10. The model parameters for marginal PDFs of wind attack angle.

Item	Position	Parameters	Value
Wind attack angle	F-06	μ_F-06	$[\begin{matrix} - 2.89 & 11.91 & 2.06 & 12.15 \end{matrix}]$
		σ_F-06	$[\begin{matrix} 1.28 & 6.70 & 1.74 & 3.01 \end{matrix}]$
		π_F-06	$[\begin{matrix} 0.5536 & 0.1895 & 0.1522 & 0.1047 \end{matrix}]$
	F-08	μ_F-08	$[\begin{matrix} 12.27 & - 0.96 & 5.76 & 7.91 & 3.96 \end{matrix}]$
		σ_F-08	$[\begin{matrix} 5.14 & 3.43 & 1.95 & 1.11 & 0.87 \end{matrix}]$
		π_F-08	$[\begin{matrix} 0.4346 & 0.2167 & 0.1261 & 0.1163 & 0.1063 \end{matrix}]$
	F-14	μ_F-14	$[\begin{matrix} 8.56 & 1.57 & 3.69 & 0.59 \end{matrix}]$
		σ_F-14	$[\begin{matrix} 5.37 & 0.98 & 1.68 & 0.60 \end{matrix}]$
		π_F-14	$[\begin{matrix} 0.7594 & 0.1367 & 0.0669 & 0.0370 \end{matrix}]$
	F-16	μ_F-16	$[\begin{matrix} - 1.05 & 0.26 & 15.57 & - 1.27 \end{matrix}]$
		σ_F-16	$[\begin{matrix} 1.80 & 4.84 & 3.78 & 0.73 \end{matrix}]$
		π_F-16	$[\begin{matrix} 0.3185 & 0.2853 & 0.2446 & 0.1516 \end{matrix}]$

Table 11. The bivariate copula families.

Copula	Formulation	Parameter Range
Gauss	$C_{ρ} (u, v) = Φ_{2} (Φ^{- 1} (u), Φ^{- 1} (v); ρ)$	$ρ \in (- 1, 1)$
t	$C_{ρ, ν} (u, v) = t_{2, ν} (t_{ν}^{- 1} (u), t_{ν}^{- 1} (v); ρ, ν)$	$\begin{array}{l} ρ \in (- 1, 1), \\ v \in [1, 1000000] \end{array}$
Clayton	$C_{θ} (u, v) = {(u^{- θ} + v^{- θ} - 1)}^{- \frac{1}{θ}},$	$ρ \in [0.00001, 150]$
Gumbel	$C_{θ} (u, v) = \exp \{- {[{(- \log (u))}^{θ} + {(- \log (v))}^{θ}]}^{\frac{1}{θ}}\},$	$θ \in [1, 120]$
Frank	$C_{θ} (u, v) = - \frac{1}{θ} \log (\frac{1 - e^{- θ} - (1 - e^{- θ u}) (1 - e^{- θ v})}{1 - e^{- θ}})$	$θ \in [- 700, 700]$
Ali–Mikhail–Haq	$C_{θ} (u, v) = \frac{u v}{1 - θ (1 - u) (1 - v)}$	$θ \in [- 1, 1]$
Farlie–Gumbel–Morgenstern	$C_{θ} (u, v) = u v (1 + θ (1 - u) (1 - v))$	$θ \in [- 1, 1]$
Plackett	$\begin{array}{l} C_{θ} (u, v) = \frac{1}{2 (θ - 1)} \times \dots \\ \dots (1 + (θ - 1) (u + v) - {({(1 + (θ - 1) (u + v))}^{2} - 4 θ (θ - 1) u v)}^{\frac{1}{2}}) \end{array}$	$θ \in [0, 1000000]$
Joe	$C_{θ} (u, v) = 1 - {({(1 - u)}^{θ} + {(1 - v)}^{θ} - {(1 - u)}^{θ} {(1 - v)}^{θ})}^{\frac{1}{θ}}$	$θ \in [1, 150]$
Survival Clayton	$C_{θ} (u, v) = {(u^{- θ} + v^{- θ} - 1)}^{- \frac{1}{θ}},$	$θ \in [0.00001, 150]$
Survival Gumbel	$C_{θ} (u, v) = \exp \{- {[{(- \log (u))}^{θ} + {(- \log (v))}^{θ}]}^{\frac{1}{θ}}\},$	$θ \in [1, 120]$
Survival Joe	$C_{θ} (u, v) = 1 - {({(1 - u)}^{θ} + {(1 - v)}^{θ} - {(1 - u)}^{θ} {(1 - v)}^{θ})}^{\frac{1}{θ}}$	$θ \in [1, 150]$

Table 12. The optimal Copula model and R-Vine structure.

Position	Tree	Edge	Copula	Parameter	BIC
F-06	T1	23	Plackett	27.136	−5.8561 × 10⁴
	T1	13	Plackett	0.447	−3.6962 × 10³
	T2	12\|3	Ali–Mikhail–Haq	0.533	−1.9048 × 10³
F-08	T1	23	Joe	1.632	−1.3913 × 10⁴
	T1	12	Farlie–Gumbel–Morgenstern	−0.363	−7.6686 × 10²
	T2	13\|2	t	[−0.131, 28.538]	−1.0302 × 10³
F-14	T1	23	Ali–Mikhail–Haq	−1.000	−8.8872 × 10³
	T1	13	Frank	1.9616	−5.3369 × 10³
	T2	12\|3	Farlie–Gumbel–Morgenstern	−0.8206	−3.2166 × 10³
F-16	T1	23	Survival Clayton	1.0189	−2.1613 × 10⁴
	T1	13	Survival Gumbel	1.1427	−2.9634 × 10³
	T2	12\|3	Farlie–Gumbel–Morgenstern	−0.4763	−1.2521 × 10³

Note: Variables are coded as 1 = wind speed, 2 = wind direction, and 3 = wind attack angle; the edge label “ab|c” indicates that variables a and b are conditioned on variable c.

Table 13. The goodness-of-fit test for JPDF of wind field.

Position	BIC	RMSE	MAE	IA
F-06	−6.4136 × 10⁴	1.2299 × 10⁻⁶	8.2202 × 10⁻⁸	0.9999
F-08	−1.5675 × 10⁴	3.2344 × 10⁻⁶	4.0045 × 10⁻⁷	0.9994
F-14	−1.7514 × 10⁴	9.8721 × 10⁻⁸	1.1222 × 10⁻⁸	0.9999
F-16	−2.5802 × 10⁴	1.8430 × 10⁻⁶	2.2527 × 10⁻⁷	0.9998

Table 14. The goodness-of-fit test for JCDF of wind field.

Position	RMSE	MAE	IA
F-06	1.8112 × 10⁻⁴	8.3158 × 10⁻⁵	0.9999
F-08	5.6988 × 10⁻⁴	1.6102 × 10⁻⁴	0.9999
F-14	4.2763 × 10⁻⁵	1.4163 × 10⁻⁵	0.9999
F-16	5.4428 × 10⁻⁴	2.1227 × 10⁻⁴	0.9999

The lower

B I C

value indicates that the model achieves the balance between goodness-of-fit and model parsimony. For the JPDF,

R M S E

and

M A E

values are on the order of 10⁻⁸ to 10⁻⁶; for the JCDF, both metrics are on the order of 10⁻⁵ to 10⁻⁴. The

I A

consistently exceeds 0.999 across all positions. These results collectively confirm the high accuracy of the proposed framework.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, B.; Ye, Z.; Li, M.; Hong, W.; Ruan, W.; Meng, L. A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model. Buildings 2026, 16, 2015. https://doi.org/10.3390/buildings16102015

AMA Style

Sun B, Ye Z, Li M, Hong W, Ruan W, Meng L. A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model. Buildings. 2026; 16(10):2015. https://doi.org/10.3390/buildings16102015

Chicago/Turabian Style

Sun, Bo, Zeyi Ye, Mohan Li, Weiyi Hong, Weidong Ruan, and Lingxin Meng. 2026. "A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model" Buildings 16, no. 10: 2015. https://doi.org/10.3390/buildings16102015

APA Style

Sun, B., Ye, Z., Li, M., Hong, W., Ruan, W., & Meng, L. (2026). A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model. Buildings, 16(10), 2015. https://doi.org/10.3390/buildings16102015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Copula Framework for Joint Probability Density of Wind Speed, Wind Direction, and Wind Attack Angle Based on Dirichlet Process Mixture Model

Abstract

1. Introduction

2. Dirichlet Process Mixture Model

2.1. The DPMM in the Stick-Breaking Construction

2.2. Accelerated Variational Bayesian Inference Algorithm

3. Copula Framework

3.1. Pair-Copula Constructions

3.2. Vine Copula

3.3. Regular Vine Model Selection and Estimation

4. Goodness-of-Fit Test

5. Case Study

5.1. Instrument and Data

5.2. Marginal PDFs for Wind Field

5.2.1. Marginal Model for Wind Speed

5.2.2. Marginal Model for Wind Direction

5.2.3. Marginal Model for Wind Attack Angle

5.3. Seasonal Variation in the Wind Field

5.4. Construction of JPDF Based on Vine Copula

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI