Article

Bayesian Analysis of Nonlinear Quantile Structural Equation Model with Possible Non-Ignorable Missingness

by Lu Zhang and Mulati Tuerde *
College of Mathematics and System Sciences, Xinjiang University, Urumqi 830017, China
* Author to whom correspondence should be addressed.
Mathematics 2025, 13(19), 3094; https://doi.org/10.3390/math13193094
Submission received: 24 August 2025 / Revised: 14 September 2025 / Accepted: 17 September 2025 / Published: 26 September 2025
(This article belongs to the Special Issue Research on Dynamical Systems and Differential Equations, 2nd Edition)

Abstract

This paper develops a nonlinear quantile structural equation model via the Bayesian approach, aiming to more accurately analyze the relationships between latent variables, with special attention paid to the issue of non-ignorable missing data in the model. The model not only incorporates quantile regression to examine the relationships between latent variables at different quantile levels but also features a specially designed mechanism for handling missing data. The non-ignorable missing mechanism is specified through a logistic regression model, and a combined method of Gibbs sampling and Metropolis–Hastings sampling is adopted for missing value imputation, while simultaneously estimating unknown parameters, latent variables, and parameters in the missing data model. To verify the effectiveness of the proposed method, simulation studies are conducted under different sample sizes and missing rates. The results indicate that the developed method performs well in handling complex data structures and missing data. Furthermore, this paper demonstrates the practical application value of the nonlinear quantile structural equation model through a case study on the growth of listed companies, providing researchers in related fields with a new analytical tool.

1. Introduction

Structural equation modeling (SEM) is a statistical method used to analyze relationships among latent variables, consisting of two components: measurement equations composed of multiple related observed indicators, and structural equations that assess the mutual influences between latent variables. SEM has found extensive utilization across diverse domains, including behavioral science, education, medicine, psychology, and social sciences. Despite its many advantages in data processing, SEM, like traditional regression models, is sensitive to outliers and typically focuses on analyzing conditional means. To address these limitations, researchers have begun integrating quantile regression techniques into SEM to provide a more comprehensive analysis of relationships between variables.
Since its introduction by Koenker and Bassett [1], quantile regression has emerged as an effective tool to overcome the limitations of traditional regression models. By estimating at different quantile levels, it more comprehensively reflects data characteristics, exhibiting strong robustness, especially when dealing with non-normal distributions and outliers, without requiring normality assumptions for error distributions. The wide application of quantile regression across various fields has significantly expanded the scope of traditional regression models. Against this backdrop, Wang et al. [2] were the first to integrate quantile regression into SEM, proposing the quantile structural equation model (QSEM). This model effectively addresses the sensitivity of traditional models to non-normal distributions and outliers by introducing the Asymmetric Laplace Distribution (ALD) and Markov Chain Monte Carlo (MCMC) algorithms, and has played an important role in research areas such as chronic kidney disease.
In the extended research on QSEM, Feng et al. [3] proposed a Bayesian regularized quantile structural equation model, introducing Bayesian Lasso and adaptive Lasso methods for variable selection and parameter estimation to enhance model flexibility and interpretability. Wang Zhiqiang [4] put forward a composite quantile structural equation model, which combines Gibbs sampling and Metropolis–Hastings algorithms for parameter and latent variable estimation. Xue Jiao et al. [5] proposed a variational approximation inference for Bayesian quantile structural equation models based on Mean Field Variational Bayes (MFVB) and adopted a non-parametric bootstrap method to improve the performance of MFVB. Cheng [6] developed a quantile varying-coefficient structural equation model and a new estimation method based on local polynomials (LPs) for flexible model estimation. However, relevant studies have mostly focused on linear frameworks and not involved nonlinear relationships between latent variables. Therefore, we propose a new nonlinear quantile structural equation model that can capture nonlinear interactions between variables and enhance the interpretability and prediction accuracy for complex phenomena.
In practice, missing data are prevalent in various research scenarios. Regarding missing data issues in SEM, previous studies have covered both linear and nonlinear frameworks, involving two mechanisms: Missing At Random (MAR) and Missing Not At Random (MNAR). In early research, Lee and Tang [7] developed a Bayesian method to analyze nonlinear structural equation models with non-ignorable missing data, which specifies the non-ignorable missing mechanism through a logistic regression model and combines Gibbs sampling and Metropolis–Hastings algorithms for parameter estimation. Cai et al. [8] proposed a nonlinear structural equation model capable of handling mixed data types (continuous, ordered, and unordered categorical) and non-ignorable missing data, exploring its Bayesian estimation and model comparison methods. The integrated SEM proposed by Lee and Song [9] realizes estimation and model comparison of mixed data under MAR through Bayesian methods combined with MCMC and path sampling. Cai and Song [10] focused on the situation where response variables have non-ignorable missing data in mixed structural equation models, also adopting Bayesian methods for analysis. Cai et al. [11] addressed the problem of both response variables and covariates having non-ignorable missing data in mixed structural equation models, achieving parameter estimation and model selection through Bayesian methods. Existing studies have mostly focused on traditional mean-structured SEM, with insufficient attention to quantile structural equation models, especially lacking systematic research on non-ignorable missing data in nonlinear quantile structural equation models. Therefore, this paper considers a nonlinear quantile structural equation model with non-ignorable missing data.
This paper focuses on the influencing factors of the growth of listed companies in Xinjiang. Corporate growth refers to a company’s ability to achieve sustained development and expansion in the future, and this study mainly examines it from two dimensions: profitability and solvency. Since these influencing factors cannot be directly measured by a single observed variable and there are complex nonlinear relationships between them, this study aims to construct a nonlinear quantile structural equation model to conduct an in-depth analysis of the influencing factors and their interaction mechanisms of corporate growth among 61 listed companies in Xinjiang, providing a scientific decision-making basis for corporate management and contributing to the sustainable development of the regional economy.
The structure of this paper is arranged as follows: Section 2 introduces the nonlinear quantile structural equation model with non-ignorable missing data. Section 3 presents a Bayesian analysis of the above model. Section 4 conducts simulation studies with different sample sizes and missing rates to evaluate the performance of the proposed method. Section 5 illustrates the proposed method through a practical case study. Section 6 includes a discussion. Appendix A presents some technical details.

2. Model and Notation

2.1. Nonlinear Quantile Structural Equation Model

The quantile structural equation model (QSEM) typically consists of a measurement equation in the form of median regression and a structural equation in the form of quantile regression. The measurement equation characterizes the features of latent variables through multiple observed variables, while the structural equation evaluates the associations between latent variables and reveals the mechanism by which explanatory latent variables influence outcome latent variables. In previous QSEM studies, both the measurement equation and the structural equation were mostly assumed to have linear relationships. Based on this, we extend the “double-linear” specification of traditional QSEM and propose a new definition of the nonlinear quantile structural equation model.
This model takes “linear measurement equation + nonlinear structural equation” as its core feature. It not only inherits the classical framework of nonlinear structural equation models defined by Lee [12] and Song and Lee [13], where the linear measurement equation ensures the effective reflection of latent variables by observed variables and the nonlinear form characterizes the associations between latent variables, but also introduces innovations within the quantile regression framework. Specifically, the structural equation allows nonlinear relationships in the form of differentiable functions between latent variables. Meanwhile, by means of the quantile regression-type equation, it overcomes the limitation that traditional models can only characterize the average associations between variables, realizes the accurate characterization of variable relationships at different quantiles, and thus better conforms to the actual characteristics of variable associations in complex data scenarios.
First, to investigate the association between observed and latent variables, we begin by assuming that the observed variables $y_1, \ldots, y_n$ are independently distributed, where each $y_i = (y_{i1}, \ldots, y_{ip})^T$ represents a $p$-dimensional vector of measurements. Correspondingly, the latent variables $\omega_1, \ldots, \omega_n$ are also independent, with each $\omega_i = (\omega_{i1}, \ldots, \omega_{iq})^T$ being a $q$-dimensional random vector ($q < p$).
The linear relationship between y i and ω i can then be defined by the following measurement equation:
$$y_i = A c_i + \Lambda \omega_i + \varepsilon_i, \quad i = 1, \ldots, n, \qquad (1)$$
where $A$ ($p \times r_1$) is an unknown coefficient matrix; $\Lambda$ ($p \times q$) is an unknown factor loading matrix; $c_i$ ($r_1 \times 1$) is a vector of fixed covariates; and $\varepsilon_i$ ($p \times 1$) is a random error vector. Within this model, a linear regression model connects the conditional median of $y_i$ with $c_i$ and $\omega_i$.
Next, to study the relationships between latent variables, we decompose $\omega_i$ into $(\eta_i^T, \xi_i^T)^T$, where $\eta_i$ ($q_1 \times 1$) denotes the outcome latent variable, $\xi_i$ ($q_2 \times 1$) denotes the explanatory latent variable, and $q_1 + q_2 = q$. For simplicity, we assume $q_1 = 1$. The nonlinear relationship between $\eta_i$ and $\xi_i$ can then be defined by the following structural equation:
$$\eta_i = B_\tau d_i + \Gamma_\tau H(\xi_i) + \delta_i, \quad i = 1, \ldots, n, \qquad (2)$$
where $B_\tau$ ($q_1 \times r_2$) and $\Gamma_\tau$ ($q_1 \times q_3$) are unknown coefficient matrices to be estimated; the subscript $\tau$ indicates that these matrices may differ across quantiles; $d_i$ ($r_2 \times 1$) is a vector of fixed covariates; $H(\xi_i) = (h_1(\xi_i), \ldots, h_{q_3}(\xi_i))^T$ is a $q_3 \times 1$ vector of nonlinear functions of $\xi_i$, whose components $h_1, \ldots, h_{q_3}$ are differentiable; and $\delta_i$ ($q_1 \times 1$) is a random error vector. In the present model, a nonlinear regression model connects the conditional quantile of $\eta_i$ with $d_i$ and $H(\xi_i)$.
Equations (1) and (2) together constitute the nonlinear quantile structural equation model (NQSEM).
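To make the NQSEM concrete, the following minimal sketch simulates one dataset from Equations (1) and (2). All dimensions, the nonlinear transform $H(\cdot)$, and the parameter values (`A`, `Lam`, `B`, `Gam`) are illustrative assumptions, and Gaussian errors stand in for the ALD errors of Section 2.2 purely for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

n, p, q2 = 5, 6, 2             # sample size, observed dim, explanatory latent dim
A   = np.ones((p, 1))          # covariate coefficients (r1 = 1)
Lam = rng.normal(size=(p, 3))  # factor loadings, q = q1 + q2 = 3 with q1 = 1
B   = np.array([[0.5]])        # B_tau (r2 = 1)
Gam = np.array([[0.8, -0.3, 0.4]])  # Gamma_tau, with q3 = 3

def H(xi):
    """Differentiable nonlinear transform of xi: linear terms plus an interaction."""
    return np.array([xi[0], xi[1], xi[0] * xi[1]])

Y = np.empty((n, p))
for i in range(n):
    c_i, d_i = np.ones(1), np.ones(1)
    xi  = rng.normal(size=q2)                                    # explanatory latents
    eta = B @ d_i + Gam @ H(xi) + rng.normal(scale=0.3, size=1)  # structural eq. (2)
    omega = np.concatenate([eta, xi])                            # omega_i = (eta_i, xi_i^T)^T
    Y[i] = A @ c_i + Lam @ omega + rng.normal(scale=0.3, size=p) # measurement eq. (1)

print(Y.shape)  # (5, 6)
```

The structural draw of $\eta_i$ feeds into the measurement equation through $\omega_i$, which is the defining feature of the "linear measurement + nonlinear structural" specification.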

2.2. Asymmetric Laplace Distribution

In the Bayesian quantile regression method based on the Asymmetric Laplace Distribution (ALD) proposed by Yu and Moyeed [14], the “true underlying distribution” of the error term is unknown. This is not only an objective constraint in real-world data scenarios but also the starting point of the core design logic of this method. In practical research, data often exhibit complex characteristics such as non-normality, heavy tails, and heteroscedasticity, which makes it difficult for researchers to pre-determine or describe the true underlying distribution of the error term using a single fixed distribution.
To address this issue, the key innovation of Yu and Moyeed [14] is to avoid direct modeling of the “unknown true underlying distribution”. Instead, they revealed the intrinsic theoretical connection between quantile regression and ALD: the quantiles of ALD correspond exactly to the conditional quantiles of the regression model. On this basis, they designated ALD as a “working distribution”. By assuming that the error term follows ALD, they constructed the likelihood function. This assumption is not intended to reconstruct the true distribution of the error term; instead, it uses ALD as a bridge to enable Bayesian estimation of quantile regression parameters without relying on the true underlying distribution of the error term.
The validity of this approach rests on the deep compatibility between the check loss function of quantile regression and the probability density function of the ALD. The check loss function of quantile regression is defined as $\rho_\tau(x) = x\{\tau - I(x < 0)\}$, where $x$ denotes the residual (the difference between the observed and predicted values), $\tau$ is the preset quantile level, and $I(\cdot)$ is an indicator function equal to 1 when $x < 0$ and 0 otherwise. This function assigns asymmetric weights to positive and negative residuals: when $x > 0$, the loss is $\tau \cdot x$; when $x \le 0$, the loss is $(1 - \tau) \cdot |x|$. This asymmetry is exactly what quantile regression needs to characterize the relationship between variables at different quantiles. The probability density function of the ALD is $f(y \mid \mu, \sigma, \tau) = \frac{\tau(1-\tau)}{\sigma} \exp\left\{-\rho_\tau\left(\frac{y-\mu}{\sigma}\right)\right\}$, where $y$ represents a random variable following the ALD, $\mu$ is the location parameter, $\sigma$ is the scale parameter, and $\tau$ is the skewness parameter. This density directly embeds the check loss function, and the equivalence between the two means that maximizing the ALD likelihood is essentially equivalent to minimizing the quantile regression loss, providing a theoretically consistent quantitative basis for Bayesian inference. At the same time, this modeling approach based on the check loss and the ALD can effectively handle complex data characteristics such as non-normality and heteroscedasticity, and ultimately yields robust parameter estimates.
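The equivalence just described can be checked numerically: implementing $\rho_\tau$ and the ALD log-density directly, the negative ALD log-likelihood and the total check loss differ only by a constant in $\mu$, so they share the same minimizer. The grid and data values below are arbitrary illustrations.

```python
import numpy as np

def check_loss(x, tau):
    """Quantile-regression check loss rho_tau(x) = x * (tau - I(x < 0))."""
    return x * (tau - (x < 0))

def ald_logpdf(y, mu, sigma, tau):
    """Log-density of ALD(mu, sigma, tau), which embeds the check loss."""
    return np.log(tau * (1 - tau) / sigma) - check_loss((y - mu) / sigma, tau)

tau, sigma = 0.25, 1.0
y = np.linspace(-3, 3, 7)        # toy "data"
mus = np.linspace(-2, 2, 201)    # candidate location parameters

# For each candidate mu: total negative log-likelihood vs. total check loss.
nll  = np.array([-ald_logpdf(y, m, sigma, tau).sum() for m in mus])
loss = np.array([check_loss(y - m, tau).sum() for m in mus])

assert np.allclose(nll - loss, nll[0] - loss[0])  # differ only by a constant
assert nll.argmin() == loss.argmin()              # hence the same minimizer
```

The common minimizer is the empirical $\tau$-quantile of the data, which is precisely the point estimate quantile regression targets.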
Building on this core finding of Yu and Moyeed [14], Wang et al. [2] further extended the application of ALD to quantile structural equation models (SEMs). They continued the approach of “constructing the likelihood function using ALD as the working distribution” and derived a form of likelihood function suitable for Bayesian analysis of quantile SEM, which provides effective support for Bayesian inference in such models.
Assume that the probability density function of the $k$-th component $\varepsilon_{ik}$ ($k = 1, \ldots, p$) of the error term $\varepsilon_i$ follows an Asymmetric Laplace Distribution, with the form
$$f(\varepsilon_{ik} \mid \mu, \sigma, \tau) = \frac{\tau(1-\tau)}{\sigma} \exp\left\{-\rho_\tau\left(\frac{\varepsilon_{ik} - \mu}{\sigma}\right)\right\}, \quad \varepsilon_{ik} \in (-\infty, +\infty), \qquad (3)$$
where $\mu \in \mathbb{R}$ is the location parameter, $\sigma > 0$ is the scale parameter, $\tau$ ($0 < \tau < 1$) is the skewness parameter, and $\rho_\tau(x) = x\{\tau - I(x < 0)\}$ is the check function. Thus, we denote $\varepsilon_{ik} \sim \mathrm{ALD}(\mu, \sigma, \tau)$. Similarly, this paper assumes $q_1 = 1$, so that $\delta_i$ is a scalar, and its probability density function is given by
$$f(\delta_i \mid \mu, \sigma, \tau) = \frac{\tau(1-\tau)}{\sigma} \exp\left\{-\rho_\tau\left(\frac{\delta_i - \mu}{\sigma}\right)\right\}, \quad \delta_i \in (-\infty, +\infty). \qquad (4)$$
Thus, we denote $\delta_i \sim \mathrm{ALD}(\mu, \sigma, \tau)$.
In statistics, “standard distributions” generally refer to distribution types (such as the normal and gamma distributions) that are widely used in theory and practice, have explicit density forms, and come with mature analysis tools (e.g., tractable conjugate priors). By this definition, the ALD is not a standard distribution: it is not a commonly used distribution type and, more importantly, lacks a tractable conjugate prior. Coupled with the inherent complexity of its likelihood function, this makes the posterior density of the model analytically intractable, thereby increasing the computational burden of parameter estimation. Nevertheless, according to Reed and Yu [15] and Kozumi and Kobayashi [16], the ALD can be expressed as a mixture of exponential and normal distributions. Specifically, if the random variables $\varepsilon_{ik}$ and $\delta_i$ follow $\mathrm{ALD}(0, \sigma, \tau)$, they can be expressed as
$$\varepsilon_{ik} = k_1(\tau)\, e_{yik} + \sqrt{k_2(\tau)\, \sigma_{yk}\, e_{yik}}\; \varsigma_i, \qquad (5)$$
$$\delta_i = k_1(\tau)\, e_{\eta i} + \sqrt{k_2(\tau)\, \sigma_\eta\, e_{\eta i}}\; \varsigma_i, \qquad (6)$$
where $k_1(\tau) = \frac{1 - 2\tau}{\tau(1-\tau)}$, $k_2(\tau) = \frac{2}{\tau(1-\tau)}$, $e_{yik} \sim \mathrm{Exp}(1/\sigma_{yk})$, $e_{\eta i} \sim \mathrm{Exp}(1/\sigma_\eta)$, and $\varsigma_i \sim N(0, 1)$; here $e_{yik}$ is the $k$-th component of $e_{yi}$, $\sigma_{yk}$ is the $k$-th component of $\sigma_y$, and $\varsigma_i$ is independent of $e_{yik}$ and $e_{\eta i}$, respectively.
Given the expressions for $\varepsilon_{ik}$ and $\delta_i$, and noting that $\varepsilon_i = (\varepsilon_{i1}, \varepsilon_{i2}, \ldots, \varepsilon_{ip})^T$, we can derive the forms of both $y_i$ and $\eta_i$ in the nonlinear quantile structural equation model as follows:
$$y_i = A c_i + \Lambda \omega_i + k_1(\tau)\, e_{yi} + \sqrt{k_2(\tau)\, \sigma_y\, e_{yi}}\; \varsigma_i, \qquad (7)$$
$$\eta_i = B_\tau d_i + \Gamma_\tau H(\xi_i) + k_1(\tau)\, e_{\eta i} + \sqrt{k_2(\tau)\, \sigma_\eta\, e_{\eta i}}\; \varsigma_i, \qquad (8)$$
where $y_i$ follows $\mathrm{ALD}(A c_i + \Lambda \omega_i, \sigma, \tau)$, and $\eta_i$ follows $\mathrm{ALD}(B_\tau d_i + \Gamma_\tau H(\xi_i), \sigma, \tau)$. By introducing the latent variables $e_{yi}$ and $e_{\eta i}$ to augment $y_i$ and $\eta_i$, the conditional distributions of $y_i$ and $\eta_i$ become normal. Their conditional means are $A c_i + \Lambda \omega_i + k_1(\tau) e_{yi}$ and $B_\tau d_i + \Gamma_\tau H(\xi_i) + k_1(\tau) e_{\eta i}$, and their variances are $k_2(\tau) \sigma_y e_{yi}$ and $k_2(\tau) \sigma_\eta e_{\eta i}$, respectively. This data augmentation facilitates subsequent Bayesian analysis, allowing normal distributions to serve as prior distributions for the unknown coefficients.
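The mixture representation in Equations (5) and (6) can be verified by simulation: draws built from an exponential and a normal component should reproduce known $\mathrm{ALD}(0, \sigma, \tau)$ properties, namely $P(\varepsilon \le 0) = \tau$ and $E[\varepsilon] = \sigma\, k_1(\tau)$. The values of $\tau$, $\sigma$, and the sample size below are arbitrary choices for the check.

```python
import numpy as np

rng = np.random.default_rng(1)
tau, sigma, n = 0.3, 1.5, 200_000

# k1, k2 exactly as defined in the text.
k1 = (1 - 2 * tau) / (tau * (1 - tau))
k2 = 2 / (tau * (1 - tau))

# Mixture draw: e ~ Exp with rate 1/sigma (mean sigma), zeta ~ N(0, 1).
e = rng.exponential(scale=sigma, size=n)
zeta = rng.normal(size=n)
eps = k1 * e + np.sqrt(k2 * sigma * e) * zeta

# Known ALD(0, sigma, tau) facts: P(eps <= 0) = tau and E[eps] = sigma * k1.
print("empirical P(eps <= 0):", round((eps <= 0).mean(), 3), " target:", tau)
print("empirical mean:", round(eps.mean(), 3), " target:", round(sigma * k1, 3))
```

Both empirical quantities land within Monte Carlo error of their theoretical values, supporting the augmentation used in Equations (7) and (8).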

2.3. Non-Ignorable Missing Data

Wang et al. [2] performed Bayesian statistical analyses on linear QSEM using completely observed data. In the present study, we extend this framework to account for incomplete observations of the vector $y_i$, where missingness arises from a non-ignorable mechanism. Specifically, we decompose $y_i$ into two components: $y_i = (y_{oi}^T, y_{mi}^T)^T$, where $y_{oi}$ ($p_{1i} \times 1$) represents the observed portion of the manifest variables, and $y_{mi}$ ($p_{2i} \times 1$) denotes the missing portion, with the dimension constraint $p_{1i} + p_{2i} = p$.
We assume that the missingness in $y_i$ occurs in an arbitrary pattern. Thus, $y_i = (y_{oi}^T, y_{mi}^T)^T$ can be seen as a reordering of the elements of the original $y_i$. No matter how the elements are reordered, the observed vector $y_{oi}$ and the unobserved vector $y_{mi}$ together form the complete manifest variable vector $y_i$ with a non-ignorable missing mechanism.
In the study of missing data problems, if the missing pattern is correlated with the missing data itself, the mechanism is referred to as a non-ignorable missing mechanism. In this case, $p(r_i \mid y_i, \omega_i, \varphi)$ is a key conditional probability distribution, describing the distribution of the missing indicator $r_i$ given the observed data $y_i$ and latent variable $\omega_i$. It is crucial to select an appropriate model for this distribution, as the model must balance complexity and identifiability to avoid ineffective parameter estimation or computational difficulties caused by excessive complexity.
Before introducing the model, it is necessary to clarify the definition of the missing indicator: for each component $y_{ij}$ of $y_i$,
$$r_{ij} = \begin{cases} 1, & \text{if } y_{ij} \text{ is missing}, \\ 0, & \text{if } y_{ij} \text{ is observed}, \end{cases} \qquad (9)$$
so that $r_i = (r_{i1}, \ldots, r_{ip})^T$ is a binary vector whose components follow 0–1 distributions with probabilities determined by $p(r_i \mid y_i, \omega_i, \varphi)$.
In the nonlinear quantile structural equation model, given that the observation vectors $y_1, y_2, \ldots, y_n$ are mutually independent, it is reasonable to assume that the missing indicator vectors $r_1, r_2, \ldots, r_n$ are also mutually independent, and their joint probability is the product of the individual probabilities. Since the components of $y_i$ are independent given $\omega_i$, for $j \ne l$, $r_{ij}$ and $r_{il}$ are conditionally independent given $y_i$ and $\omega_i$, so the joint probability further factorizes into the product of componentwise probabilities.
Drawing on the non-ignorable missingness mechanism model proposed by Ibrahim et al. [17], we model the binary missingness indicator $r_{ij}$ by specifying the probability $p(r_{ij} = 1 \mid y_i, \omega_i, \varphi)$. We employ a logistic regression model for this purpose, as it reduces the number of parameters in the missing data mechanism and facilitates efficient sampling from the conditional distribution given the observed data. The link function for this model is denoted by $G(\cdot)$; specifically, $G(x) = \mathrm{logit}^{-1}(x) = (1 + e^{-x})^{-1}$. The linear predictor is given by
$$\mathrm{logit}\, p(r_{ij} = 1 \mid y_i, \omega_i, \varphi) = \varphi_0 + \varphi_1 y_{i1} + \cdots + \varphi_p y_{ip} + \varphi_{p+1} \omega_{i1} + \cdots + \varphi_{p+q} \omega_{iq} = \varphi^T F_i. \qquad (10)$$
Consequently, the distribution of the missingness indicator is
$$r_{ij} \mid y_i, \omega_i, \varphi \sim \mathrm{Bernoulli}\big(G(\varphi^T F_i)\big), \qquad (11)$$
where $\mathrm{logit}(p) = \log\frac{p}{1-p}$, $F_i = (1, y_{i1}, \ldots, y_{ip}, \omega_{i1}, \ldots, \omega_{iq})^T$ is the design vector, and $\varphi = (\varphi_0, \varphi_1, \ldots, \varphi_{p+q})^T$ is the coefficient vector; both are $(p + q + 1)$-dimensional.
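As a small illustration of this logistic mechanism, the sketch below evaluates $p(r_{ij} = 1 \mid y_i, \omega_i, \varphi)$ from the design vector $F_i$. The dimensions and the value of $\varphi$ are assumptions for the demo, not estimates from the paper.

```python
import numpy as np

def missing_prob(y_i, omega_i, phi):
    """P(r_ij = 1 | y_i, omega_i, phi) = logit^{-1}(phi^T F_i)."""
    F_i = np.concatenate([[1.0], y_i, omega_i])  # (1, y_i1..y_ip, w_i1..w_iq)
    return 1.0 / (1.0 + np.exp(-(phi @ F_i)))

p, q = 3, 2
phi = np.array([-1.0, 0.5, 0.0, -0.2, 0.3, 0.1])  # length p + q + 1 (assumed)
y_i     = np.array([0.4, -1.2, 2.0])
omega_i = np.array([0.7, -0.5])

pr = missing_prob(y_i, omega_i, phi)
# Since phi_1 > 0, increasing y_i1 raises the missingness probability.
pr_shifted = missing_prob(y_i + np.array([1.0, 0.0, 0.0]), omega_i, phi)
assert 0.0 < pr < 1.0 and pr_shifted > pr
```

Because the probability depends on components of $y_i$ that may themselves be missing, the mechanism is non-ignorable: it cannot be factored out of the likelihood.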
In addressing nonlinear structural equation models (NSEMs) with non-ignorable missing data, Lee and Tang [7] made pioneering contributions. After in-depth analysis of the complex model structure and data characteristics, they adopted a specific form, as shown in Equation (12):
$$\mathrm{logit}\, p(r_{ij} = 1 \mid y_i, \omega_i, \varphi) = \varphi_0 + \varphi_1 y_{i1} + \cdots + \varphi_p y_{ip} = \varphi^T F_i^*, \qquad (12)$$
where $F_i^* = (1, y_{i1}, \ldots, y_{ip})^T$ (i.e., the latent variables $\omega_i$ are excluded from the design vector); this definition of $F_i^*$ is used consistently in the remainder of the manuscript.
A prominent feature of this form is that it no longer depends on the latent variables ω i . They argued that the characteristics of latent variables ω can be reflected by observed variables y. This insight offers dual advantages: from a computational perspective, it significantly reduces the dimension of parameter estimation and complex matrix operations during model computation, effectively alleviating the computational burden; from the perspective of model application, it simplifies the originally obscure and complex model structure, enhancing the model’s operability and interpretability in practical scenarios.
In subsequent studies, they further conducted large-scale simulation studies to comprehensively verify the effectiveness of this simplified model. During the simulation process, multiple sets of different data generation mechanisms and model parameter combinations were set up, covering common complex data scenarios. The results of the simulation studies verified the superiority of the simplified model from multiple dimensions: in terms of parameter estimation accuracy, its estimation bias (Bias) and root mean square error (RMS) were both at a level similar to those of the full model, indicating that this simplification did not lead to a significant loss of estimation precision; more importantly, the computational efficiency of the simplified model was significantly improved—it not only shortened the time required for parameter iteration convergence but also reduced the computational complexity in the high-dimensional sampling process.
Given that the research findings of Lee and Tang [7] possess both reliability and practicality, considering that the research scenario and data characteristics of this paper are highly consistent with those of their study, and for the further purposes of ensuring model identifiability and improving computational tractability, this paper decides to adopt Equation (12) for modeling. It is expected that this will also enable efficient and accurate model construction and analysis when dealing with nonlinear quantile structural equation models with non-ignorable missing data.
It can be seen from the above missing data mechanism model that the missingness of $y_i$ is non-ignorable. Thus, $p(r_{ij} \mid y_i, \omega_i, \varphi)$ describes a mechanism in which the response variable has non-ignorable missingness.

3. Bayesian Inference for the Proposed Model

Let $Y = (y_1, \ldots, y_n)$; let $Y_m = \{y_{m1}, \ldots, y_{mn}\}$ denote the set of missing values of the observed variables; let $Y_o = \{y_{o1}, \ldots, y_{on}\}$ denote the set of observed response variables; let $\Omega = (\omega_1, \ldots, \omega_n)$; let $r = (r_1, \ldots, r_n)$ denote the vector of missing indicators; and let $\theta = \{A, \Lambda, B_\tau, \Gamma_\tau, \Phi, \sigma_{yk}, e_{yik}, \sigma_\eta, e_{\eta i}\}$ represent all unknown parameters in Equations (1) and (2). Our primary focus is to perform posterior inference on the unknown parameters of interest $\theta$ and the missing mechanism parameter $\varphi$ by utilizing the missing data indicators $r$ and the observed dataset $Y_o$.
The joint posterior distribution of the unknown parameters θ and φ conditioned on Y o and r is derived as follows:
$$\begin{aligned} p(\theta, \varphi \mid Y_o, r) &\propto p(Y_o, r \mid \theta, \varphi)\, p(\theta, \varphi) \\ &\propto \left[ \int p(Y_o, Y_m, r, \Omega \mid \theta, \varphi)\, d\Omega\, dY_m \right] p(\theta, \varphi) \\ &\propto \left[ \prod_{i=1}^{n} \int p(y_i, r_i, \omega_i \mid \theta, \varphi)\, d\omega_i\, dy_{mi} \right] p(\theta, \varphi) \\ &\propto \left[ \prod_{i=1}^{n} \int p(y_i \mid \omega_i, \theta)\, p(r_i \mid y_i, \omega_i, \varphi)\, p(\omega_i \mid \theta)\, d\omega_i\, dy_{mi} \right] p(\theta, \varphi). \end{aligned} \qquad (13)$$
where $p(\cdot \mid \cdot)$ denotes a conditional probability density, and $p(\theta, \varphi)$ denotes the joint prior distribution assigned to $\theta$ and $\varphi$. It should be noted that the integral in Equation (13) is high-dimensional, with dimension equal to the sum of the dimensions of $\omega_i$ and $y_{mi}$. Since such high-dimensional integrals are difficult to evaluate directly and have no closed-form expression, direct posterior inference is challenging. Therefore, numerical methods (such as Markov chain Monte Carlo) are needed for approximate calculation. In this study, we adopt a hybrid algorithm combining Gibbs sampling and the Metropolis–Hastings algorithm to implement posterior Bayesian analysis of the model.
By drawing on the data augmentation method proposed by Tanner and Wong [18], we extend the original joint posterior distribution $p(\theta, \varphi \mid Y_o, r)$ to $p(\Omega, Y_m, \theta, \varphi \mid Y_o, r)$. Subsequently, based on this extended joint posterior distribution, we use the Gibbs sampling and Metropolis–Hastings algorithms to iteratively sample from the following conditional distributions in sequence, thereby obtaining a random observation sequence of $\{\Omega, Y_m, \theta, \varphi\}$. The specific implementation steps can be divided into two main parts.
1. Generating Observed Data $\{Y_o, r\}$
This part constructs the observed data and missing indicator matrix required for subsequent sampling by simulating complete data and the missing mechanism. The specific steps are as follows:
Step 1: Generate the Complete Dataset Y
Generate the complete dataset $\{y_{ij} : i = 1, \ldots, n;\ j = 1, \ldots, p\}$ according to Equations (1) and (2). This dataset serves as the "ground truth dataset" for simulating missing status, which is used to determine whether data are missing and to define the scope of the observed data in subsequent steps, acting as a reference benchmark.
Step 2: Determine the Missing Status of Observations and Construct { Y o , r }
Based on the missing mechanism (12) and a pre-specified value of $\varphi$, determine whether each observation $y_{ij}$ is missing. Specifically, generate a random number $u$ from a uniform distribution on $(0, 1)$. If $u \le p(r_{ij} = 1 \mid y_i, \omega_i, \varphi)$, mark $y_{ij}$ as missing ($r_{ij} = 1$); otherwise, mark it as non-missing ($r_{ij} = 0$). Finally, extract the non-missing values from the complete dataset $Y$ to form the observed data $Y_o$, and assemble the missing indicator matrix $r$ from all $r_{ij}$. Thus, the pair $\{Y_o, r\}$ required for subsequent sampling is obtained.
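Steps 1 and 2 can be sketched as follows. The complete-data generator of Step 1 is replaced here by a placeholder (standard normal draws) and $\varphi$ is an assumed value; the design vector follows the simplified form $F_i^*$ of Equation (12).

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 8, 4

# Step 1 (placeholder): a complete dataset Y standing in for draws from (1)-(2).
Y = rng.normal(size=(n, p))
phi = np.concatenate([[-1.5], 0.4 * np.ones(p)])  # assumed mechanism coefficients

# Step 2: u <= P(r_ij = 1 | y_i, phi) marks y_ij as missing (r_ij = 1).
F = np.hstack([np.ones((n, 1)), Y])               # rows are F_i* = (1, y_i1..y_ip)
prob = 1.0 / (1.0 + np.exp(-(F @ phi)))           # same probability for each j given y_i
u = rng.uniform(size=(n, p))
r = (u <= prob[:, None]).astype(int)

# Observed data Y_o: complete data with the masked entries removed (NaN holes).
Y_obs = np.where(r == 1, np.nan, Y)
print(r.sum(), "entries missing out of", n * p)
```

Tuning the intercept of $\varphi$ controls the overall missing rate, which is how the simulation studies vary missingness levels.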
2. Posterior Sampling Based on $\{Y_o, r\}$ to Obtain $\{\Omega, Y_m, \theta, \varphi\}$
With the generated $\{Y_o, r\}$ as input, iterative sampling is performed to estimate the target parameters and missing data. The specific steps are as follows:
Step 1: Specify Initial Values
Specify the initial values as $\Omega^{(0)}, Y_m^{(0)}, \theta^{(0)}, \varphi^{(0)}$.
Step 2: Iterative Sampling by Traversing Each Component
Traverse each component of $\Omega$, $Y_m$, $\theta$, and $\varphi$ in sequence, sampling each from its full conditional given the current values of all the others. Denote the values at the $t$-th iteration by $\Omega^{(t)}, Y_m^{(t)}, \theta^{(t)}, \varphi^{(t)}$; the hybrid sampling for the $(t+1)$-th iteration proceeds as follows:
(a)
Sample $\Omega^{(t+1)}$ from the conditional distribution $p(\Omega \mid Y_o, Y_m^{(t)}, r, \theta^{(t)}, \varphi^{(t)})$;
(b)
Sample $Y_m^{(t+1)}$ from the conditional distribution $p(Y_m \mid Y_o, \Omega^{(t+1)}, r, \theta^{(t)}, \varphi^{(t)})$;
(c)
Sample $\theta^{(t+1)}$ from the conditional distribution $p(\theta \mid Y_o, Y_m^{(t+1)}, \Omega^{(t+1)})$;
(d)
Sample $\varphi^{(t+1)}$ from the conditional distribution $p(\varphi \mid Y_o, Y_m^{(t+1)}, \Omega^{(t+1)}, r, \theta^{(t+1)})$.
Repeat the iterative process in Step 2, traversing the components to sample and update $\Omega, Y_m, \theta, \varphi$, until the algorithm converges. The full conditionals in (a), (b), and (d) are non-standard and relatively complex, so the Metropolis–Hastings algorithm is required to sample from them. In contrast, the conditional in (c) is a standard distribution whose sampling is straightforward, so the Gibbs sampler can draw from it directly.
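The iteration scheme above can be sketched as a skeleton in which each `update_*` stub stands in for a draw from the corresponding full conditional: (a), (b), and (d) would use Metropolis–Hastings internally, while (c) is a direct conjugate draw. The stubs below are placeholders (simple perturbations), not the paper's actual sampling kernels.

```python
import numpy as np

rng = np.random.default_rng(3)

def update_Omega(state):  # (a) p(Omega | Y_o, Y_m, r, theta, phi): MH step
    return state["Omega"] + 0.1 * rng.normal(size=state["Omega"].shape)

def update_Ym(state):     # (b) p(Y_m | Y_o, Omega, r, theta, phi): MH step
    return state["Ym"] + 0.1 * rng.normal(size=state["Ym"].shape)

def update_theta(state):  # (c) p(theta | Y_o, Y_m, Omega): conjugate Gibbs draw
    return state["theta"] + 0.1 * rng.normal(size=state["theta"].shape)

def update_phi(state):    # (d) p(phi | Y_o, Y_m, Omega, r, theta): MH step
    return state["phi"] + 0.1 * rng.normal(size=state["phi"].shape)

# Initial values (Step 1 of the posterior-sampling part); shapes are illustrative.
state = {"Omega": np.zeros((5, 3)), "Ym": np.zeros(4),
         "theta": np.zeros(6), "phi": np.zeros(3)}

draws = []
for t in range(100):                     # iterate (a)-(d); run until convergence
    state["Omega"] = update_Omega(state)
    state["Ym"]    = update_Ym(state)
    state["theta"] = update_theta(state)
    state["phi"]   = update_phi(state)
    draws.append(state["theta"].copy())

print(len(draws), "sweeps completed")
```

In a real implementation, post-burn-in draws of $\{\Omega, Y_m, \theta, \varphi\}$ would be averaged to form Bayesian estimates of parameters, latent variables, and imputed missing values.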

3.1. Posterior Distributions

To perform hybrid sampling, we need to specify the four posterior distributions involved in the sampling process. Drawing on relevant derivations in Lee and Tang [7], we can obtain the following conclusions:
First, consider the posterior distribution $p(\Omega \mid Y_o, Y_m, r, \theta, \varphi)$ of the latent variables $\Omega$. When both the missing data $Y_m$ and the observed data $Y_o$ are known, this distribution simplifies to $p(\Omega \mid Y, r, \theta, \varphi)$, which depends on the complete dataset $Y$.
Since the $\omega_i$ are mutually independent, and the $y_i$ are mutually independent given $\omega_i$, it follows that
$$p(\Omega \mid Y, r, \theta, \varphi) = \prod_{i=1}^{n} p(\omega_i \mid y_i, r_i, \theta, \varphi) \propto \prod_{i=1}^{n} p(y_i \mid \omega_i, \theta)\, p(\eta_i \mid \xi_i, \theta)\, p(\xi_i \mid \theta)\, p(r_i \mid y_i, \omega_i, \varphi).$$
For the reasons stated earlier, the ALD is used to model the error terms regardless of their true underlying distribution. Specifically, let the $k$-th component $\varepsilon_{ik}$ of the error term $\varepsilon_i$ follow $\mathrm{ALD}(0, \sigma_{yk}, 0.5)$ to model the median regression specified in Equation (1), and let the error term $\delta_i$ follow $\mathrm{ALD}(0, \sigma_\eta, \tau)$ to model the $\tau$-th quantile regression in Equation (2). This is denoted as $y_{ik} \sim \mathrm{ALD}(\mu, \sigma_{yk}, 0.5)$ and $\eta_i \sim \mathrm{ALD}(\mu, \sigma_\eta, \tau)$.
Building on this ALD specification, we first define the covariance matrices of the conditional distributions: the covariance matrix of $y_i$ is $\Psi_{\varepsilon i} = \mathrm{diag}\big(k_2(\tau) \sigma_{y1} e_{yi1}, \ldots, k_2(\tau) \sigma_{yp} e_{yip}\big)$, and the variance of $\eta_i$ is $\Psi_{\delta i} = k_2(\tau) \sigma_\eta e_{\eta i}$. As $k_2(\tau) = \frac{2}{\tau(1-\tau)}$ was defined previously, substituting $\tau = 0.5$ (the median case, consistent with the ALD setting for $\varepsilon_i$) yields $k_2(0.5) = \frac{2}{0.5 \times (1 - 0.5)} = 8$; therefore, $\Psi_{\varepsilon i}$ can also be written simply as $\mathrm{diag}\big(8 \sigma_{y1} e_{yi1}, \ldots, 8 \sigma_{yp} e_{yip}\big)$.
Based on these settings, we can further derive the following conditional distributions:
$$y_i \mid \omega_i, \theta \sim N_p\big(Ac_i + \Lambda\omega_i,\ \Psi_{\varepsilon i}\big),$$
$$\eta_i \mid \xi_i, \theta \sim N\big(B_\tau d_i + \Gamma_\tau H(\xi_i) + k_1(\tau)e_{\eta i},\ \Psi_{\delta i}\big),$$
$$\xi_i \mid \theta \sim N_{q_2}(0, \Phi).$$
Thus, it follows that
$$p\big(\omega_i \mid y_i, r_i, \theta, \varphi\big) \propto \exp\Big\{-\tfrac{1}{2}\xi_i^T\Phi^{-1}\xi_i - \tfrac{1}{2}\big(y_i - Ac_i - \Lambda\omega_i\big)^T\Psi_{\varepsilon i}^{-1}\big(y_i - Ac_i - \Lambda\omega_i\big) - \tfrac{1}{2}\big(\eta_i - B_\tau d_i - \Gamma_\tau H(\xi_i) - k_1(\tau)e_{\eta i}\big)^T\Psi_{\delta i}^{-1}\big(\eta_i - B_\tau d_i - \Gamma_\tau H(\xi_i) - k_1(\tau)e_{\eta i}\big) + \sum_{j=1}^{p} r_{ij}\varphi^T F_i^* - p\log\big[1 + \exp\big(\varphi^T F_i^*\big)\big]\Big\}.$$
Second, consider the posterior distribution $p(Y_m \mid Y_o, \Omega, r, \theta, \varphi)$ of the missing observations $Y_m$. It follows that
$$p\big(Y_m \mid Y_o, \Omega, r, \theta, \varphi\big) = \prod_{i=1}^{n} p\big(y_{mi} \mid y_{oi}, \omega_i, r_i, \theta, \varphi\big) \propto \prod_{i=1}^{n} p\big(y_{mi} \mid \omega_i, \theta\big)\, p\big(r_i \mid y_i, \omega_i, \varphi\big).$$
Moreover,
$$p\big(y_{mi} \mid y_{oi}, \omega_i, r_i, \theta, \varphi\big) \propto \exp\Big\{-\tfrac{1}{2}\big(y_{mi} - A_{mi}c_i - \Lambda_{mi}\omega_i\big)^T\Psi_{m\varepsilon i}^{-1}\big(y_{mi} - A_{mi}c_i - \Lambda_{mi}\omega_i\big) + \sum_{j=1}^{p} r_{ij}\varphi^T F_i^* - p\log\big[1+\exp\big(\varphi^T F_i^*\big)\big]\Big\},$$
where $A_{mi}$ denotes the $p_{2i} \times 1$ subvector of $A$ whose entries correspond to the missing values in $y_i$; $\Lambda_{mi}$ is the $p_{2i} \times q$ submatrix of $\Lambda$ whose rows map to the missing entries in $y_i$; and $\Psi_{m\varepsilon i}$ is the $p_{2i} \times p_{2i}$ submatrix of $\Psi_{\varepsilon i}$ whose rows and columns align with the missing components of $y_i$.
Third, consider the posterior distribution $p(\theta \mid Y_o, Y_m, \Omega)$ of the unknown parameter θ. Similarly, this posterior simplifies to $p(\theta \mid Y, \Omega)$.
Let $\Lambda_y = (A, \Lambda)$, where $\Lambda_{yk}^T$ is the k-th row of $\Lambda_y$ for $k = 1, \ldots, p$; and let $\Lambda_{\omega\tau} = (B_\tau, \Gamma_\tau)$. Given Y and Ω, the posterior distribution of θ can be obtained using the following conjugate prior distributions:
$$\Lambda_{yk} \sim N_{r_1+q}\big(\Lambda_{0yk}, H_{0yk}\big), \qquad \sigma_{yk}^{-1} \sim \operatorname{Gamma}\big(\alpha_{0yk}, \beta_{0yk}\big), \qquad \Phi^{-1} \sim \operatorname{Wishart}\big(R_0, \rho_0\big),$$
$$\Lambda_{\omega\tau} \sim N_{r_2+q_3}\big(\Lambda_{0\omega}, H_{0\omega}\big), \qquad \sigma_{\eta}^{-1} \sim \operatorname{Gamma}\big(\alpha_{0\sigma}, \beta_{0\sigma}\big),$$
where α 0 y k , β 0 y k , α 0 σ , β 0 σ , ρ 0 , Λ 0 y k , Λ 0 ω , and the positive definite matrices H 0 y k , H 0 ω , R 0 are hyperparameters, whose specific values need to be set based on prior information or expert knowledge.
Finally, consider the posterior distribution $p(\varphi \mid Y_o, Y_m, \Omega, r, \theta)$ of the missingness-mechanism parameter φ, which simplifies to $p(\varphi \mid Y, \Omega, r, \theta)$. Assume that $p(\varphi)$ denotes the prior probability density function of φ, taken to be the $(p+1)$-dimensional multivariate normal distribution $N_{p+1}(\varphi_0, V)$, where $\varphi_0$ and $V$ are hyperparameters whose values are predetermined from prior information. Since the distribution of r depends only on Y, Ω, and φ, and the prior for φ is assumed statistically independent of that for θ, we have
$$p\big(\varphi \mid Y, \Omega, r, \theta\big) \propto p\big(r \mid Y, \Omega, \varphi\big)\, p(\varphi).$$
From Equations (11) and (12), we have
$$p\big(r_{ij}=1 \mid y_i, \omega_i, \varphi\big) = \frac{\exp\big(\varphi^T F_i^*\big)}{1+\exp\big(\varphi^T F_i^*\big)},$$
and substituting the value of p r i j = 1 y i , ω i , φ into p r Y , Ω , φ , we get
$$
\begin{aligned}
p\big(r \mid Y, \Omega, \varphi\big) &= \prod_{i=1}^{n}\prod_{j=1}^{p} p\big(r_{ij}=1 \mid y_i, \omega_i, \varphi\big)^{r_{ij}}\big[1 - p\big(r_{ij}=1 \mid y_i, \omega_i, \varphi\big)\big]^{1-r_{ij}} \\
&= \prod_{i=1}^{n}\prod_{j=1}^{p}\left[\frac{\exp\big(\varphi^T F_i^*\big)}{1+\exp\big(\varphi^T F_i^*\big)}\right]^{r_{ij}}\left[1 - \frac{\exp\big(\varphi^T F_i^*\big)}{1+\exp\big(\varphi^T F_i^*\big)}\right]^{1-r_{ij}} \\
&= \exp\Big\{\sum_{i=1}^{n}\sum_{j=1}^{p} r_{ij}\varphi^T F_i^* - \sum_{i=1}^{n} p\log\big[1+\exp\big(\varphi^T F_i^*\big)\big]\Big\}.
\end{aligned}
$$
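As a quick numerical check of this identity, the following Python sketch (illustrative only; the data are simulated, and $F_i^*$ is taken as the intercept-augmented vector $(1, y_{i1}, \ldots, y_{ip})^T$ as in the logistic missingness model) evaluates the Bernoulli product on the left in log form and the closed-form exponential on the right, and confirms they agree.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, d = 50, 9, 10                    # d = p + 1: intercept plus the p indicators
phi = rng.normal(0, 0.3, d)
F = np.column_stack([np.ones(n), rng.standard_normal((n, p))])  # rows are F_i*
eta = F @ phi                          # phi^T F_i* for each i
prob = np.exp(eta) / (1 + np.exp(eta))
r = rng.binomial(1, prob[:, None], (n, p))   # r_ij, sharing prob across j as in the model

# Left side: log of the product of Bernoulli probabilities.
left = np.sum(r * np.log(prob)[:, None] + (1 - r) * np.log(1 - prob)[:, None])
# Right side: sum_ij r_ij phi^T F_i*  -  p * sum_i log(1 + exp(phi^T F_i*)).
right = np.sum(r * eta[:, None]) - p * np.sum(np.log1p(np.exp(eta)))

print(np.isclose(left, right))         # True
```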
The posterior distribution of φ can then be further derived as follows:
$$p\big(\varphi \mid Y, \Omega, r, \theta\big) \propto \exp\Big\{\sum_{i=1}^{n}\sum_{j=1}^{p} r_{ij}\varphi^T F_i^* - \tfrac{1}{2}\big(\varphi - \varphi_0\big)^T V^{-1}\big(\varphi - \varphi_0\big) - \sum_{i=1}^{n} p\log\big[1+\exp\big(\varphi^T F_i^*\big)\big]\Big\}.$$
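A random-walk Metropolis–Hastings update targeting this log posterior can be sketched as follows (Python, for illustration; a spherical normal proposal stands in for the tuned proposal of Appendix A, and the data, step size, and prior are simulated placeholders).

```python
import numpy as np

rng = np.random.default_rng(2)
n, p, d = 100, 9, 10
F = np.column_stack([np.ones(n), rng.standard_normal((n, p))])   # F_i* rows
r = rng.binomial(1, 0.3, (n, p))                                  # missing indicators
phi0, Vinv = np.zeros(d), np.eye(d)                               # prior N(phi0, V)

def log_post(phi):
    eta = F @ phi
    # sum_ij r_ij phi^T F_i*  -  (1/2)(phi-phi0)^T V^{-1}(phi-phi0)  -  p sum_i log(1+e^eta)
    return (np.sum(r * eta[:, None])
            - 0.5 * (phi - phi0) @ Vinv @ (phi - phi0)
            - p * np.sum(np.log1p(np.exp(eta))))

phi, step, accepted = np.zeros(d), 0.05, 0
for t in range(2000):
    cand = phi + step * rng.standard_normal(d)        # random-walk proposal
    if np.log(rng.uniform()) < log_post(cand) - log_post(phi):
        phi, accepted = cand, accepted + 1

print(accepted / 2000)    # acceptance rate, tuned via `step` in practice
```

In practice the step size would be adjusted, as the paper does, until the mean acceptance rate falls in the desired range.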
The specific implementation steps of the aforementioned hybrid sampling algorithm, as well as the complete derivation of the relevant posterior distributions, are all presented in Appendix A.

3.2. Bayesian Estimation

By applying the hybrid algorithm described in Section 3.1, we draw random samples from the posterior distribution $p(\Omega, Y_m, \theta, \varphi \mid Y_o, r)$, denoted $\{\Omega^{(t)}, \theta^{(t)}, \varphi^{(t)}, Y_m^{(t)} : t = 1, \ldots, T\}$. Based on these samples, the joint Bayesian estimates of Ω, $Y_m$, θ, and φ are
$$\hat\Omega = \frac{1}{T}\sum_{t=1}^{T}\Omega^{(t)}, \qquad \hat Y_m = \frac{1}{T}\sum_{t=1}^{T}Y_m^{(t)}, \qquad \hat\theta = \frac{1}{T}\sum_{t=1}^{T}\theta^{(t)}, \qquad \hat\varphi = \frac{1}{T}\sum_{t=1}^{T}\varphi^{(t)}.$$
As shown by Geyer [19], these joint Bayesian estimates are consistent approximations of the posterior means to which they correspond.
In Bayesian structural equation modeling, evaluating overall model fit requires examining both the measurement and the structural components. To ensure reliable statistical inference, it is not enough to focus on the accuracy of parameter estimation; a systematic assessment of how well the model fits the data is also required. The literature offers several posterior-based diagnostics: Gelman et al. [20] proposed posterior predictive p-values, and, to address the anomalous p-values that can arise from repeated use of the observed data, Bayarri and Berger [21] further proposed partial posterior predictive p-values. The most intuitive approach is residual analysis. The residual estimates of the measurement equations are obtained as $\hat\varepsilon_i = y_i - \hat A c_i - \hat\Lambda\hat\omega_i$, where $\hat A$, $\hat\Lambda$, and $\hat\omega_i$ are the Bayesian estimates of A, Λ, and $\omega_i$, respectively. Residual plots are generated from the values of $\hat\varepsilon_i$, together with scatter plots of $\hat\varepsilon_i$ against $\hat\omega_i$. If the plots of $\hat\varepsilon_i$ and the scatter plots of $\hat\varepsilon_i$ versus $\hat\omega_i$ all lie between two narrowly spaced parallel horizontal lines centered at zero, the model fits well. Similarly, the residual estimate of the structural equation is $\hat\delta_i = \hat\eta_i - \hat B_\tau d_i - \hat\Gamma_\tau H(\hat\xi_i)$, where $\hat\eta_i$, $\hat B_\tau$, $\hat\Gamma_\tau$, and $\hat\xi_i$ are all posterior estimates. The plotting and interpretation of $\hat\delta_i$ are the same as those for $\hat\varepsilon_i$.
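The residual estimates described above reduce to simple matrix arithmetic once the posterior estimates are in hand; a minimal Python sketch (with simulated stand-ins for the Bayesian estimates $\hat A$, $\hat\Lambda$, and $\hat\omega_i$) is:

```python
import numpy as np

rng = np.random.default_rng(3)
n, p, q, r1 = 200, 9, 3, 1
A_hat = np.ones((p, r1))                     # estimated covariate coefficients
Lam_hat = rng.uniform(0.5, 1.0, (p, q))      # estimated factor loadings
C = rng.standard_normal((n, r1))             # fixed covariates c_i
Omega_hat = rng.standard_normal((n, q))      # posterior estimates of omega_i
Y = C @ A_hat.T + Omega_hat @ Lam_hat.T + 0.3 * rng.standard_normal((n, p))

# Measurement-equation residuals: eps_i = y_i - A c_i - Lambda omega_i.
eps_hat = Y - C @ A_hat.T - Omega_hat @ Lam_hat.T

print(eps_hat.shape)                         # (200, 9)
print(abs(eps_hat.mean()) < 0.05)            # True: residuals centered near zero
```

The rows of `eps_hat` would then be plotted against the observation index and against the latent-variable estimates, as in Figures 2–4.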

4. Simulation

In this section, we verify the validity of the estimation methods for the Bayesian quantile structural equation model in handling complete and missing data through two simulation experiments. Simulated data are generated using the following nonlinear quantile structural equation model:
$$y_i = Ac_i + \Lambda\omega_i + \varepsilon_i,$$
$$\eta_i = b_{1\tau}d_i + \gamma_{1\tau}\xi_{i1} + \gamma_{2\tau}\xi_{i2} + \gamma_{3\tau}\xi_{i1}\xi_{i2} + \gamma_{4\tau}\xi_{i1}^2 + \gamma_{5\tau}\xi_{i2}^2 + \delta_i.$$
Let p = 9 , q = 3 , q 1 = 1 , q 2 = 2 , q 3 = 5 , and r 1 = r 2 = 1 . We consider two cases of sample sizes, n = 100 and n = 300 , with quantiles τ set to 0.1, 0.5, and 0.9, respectively. The covariate coefficient A = ( 1 , , 1 ) T . Fixed covariates c i k and d i are both sampled from the standard normal distribution N ( 0 , 1 ) . The non-overlapping factor loading matrix Λ is shown as follows:
$$\Lambda^T = \begin{pmatrix} 1^* & \lambda_{21} & \lambda_{31} & 0^* & 0^* & 0^* & 0^* & 0^* & 0^* \\ 0^* & 0^* & 0^* & 1^* & \lambda_{52} & \lambda_{62} & 0^* & 0^* & 0^* \\ 0^* & 0^* & 0^* & 0^* & 0^* & 0^* & 1^* & \lambda_{83} & \lambda_{93} \end{pmatrix},$$
where the zeros and ones marked with ∗ are fixed in advance to identify the latent variables and the model, while the remaining $\lambda_{jk}$ are unknown parameters to be estimated. Let $\lambda_{21} = \lambda_{31} = \lambda_{52} = \lambda_{62} = \lambda_{83} = \lambda_{93} = 0.8$, $b_{1\tau} = 1$, and $\gamma_{1\tau} = \gamma_{2\tau} = \gamma_{3\tau} = \gamma_{4\tau} = \gamma_{5\tau} = 0.6$. The explanatory latent variables $\xi_i \sim N(0, \Phi)$, where $\phi_{11} = \phi_{22} = 1$ and $\phi_{12} = 0.2$.
In practical research, the true underlying distribution of error terms is often difficult to know in advance. To fully verify the adaptability of the proposed model under different data distribution characteristics, we refer to the setting of error scenarios in quantile structural equation models by Wang et al. [2] and the simulation idea of complex distributions in composite quantile regression by Wang Zhiqiang [4]. We select the following four representative distributions as the generation distributions for the error terms ε i k and δ i , thereby constructing simulated data y i and η i with different characteristics. The specific distribution settings are as follows:
Case 1: Normal distribution N ( 0 , 0.3 ) .
Case 2: Skewed log-normal distribution ln N ( 0 , 0.25 ) .
Case 3: U-shaped distribution Beta ( 0.5 , 0.5 ) .
Case 4: Heavy-tailed skewed distribution 0.3 χ 2 ( 3 ) .
To simplify the simulation design and avoid cumbersome permutations and combinations, this study only considers the scenario where $\varepsilon_{ik}$ and $\delta_i$ follow the same distribution. In the subsequent Bayesian analysis, given that the true underlying distribution of the error terms is unknown, we select the asymmetric Laplace distribution (ALD) as a "working distribution" in place of the true distribution. By comparing the parameter estimation performance (bias and root mean squared error) under the ALD working assumption across the above four distribution scenarios, we verify the applicability of the ALD when the true distribution is unknown, providing a reliable basis for the practical application of the model.
Based on the given true values and values sampled from specific distributions, η i and y i are calculated sequentially to obtain the complete dataset y i j : i = 1 , , n , j = 1 , , p . Missing data are then generated according to the pre-specified values of φ and the missing mechanism Equation (12). In this process, we investigate three scenarios with missing rates of 20%, 30%, and 40%. Specifically, different values of φ are adopted for each scenario, and these varying φ values ultimately lead to the corresponding differences in missing rates. The true values of φ applied in these scenarios are presented in Table A1 and Table A2 of Appendix A.
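Generating the missing indicators from the logistic mechanism can be sketched in Python as follows. The value of φ here is illustrative only (an intercept-dominated vector chosen so the expected missing rate is roughly sigmoid(−1.4) ≈ 20%); the actual φ values used in the paper are those in Tables A1 and A2.

```python
import numpy as np

rng = np.random.default_rng(4)
n, p = 300, 9
Y = rng.standard_normal((n, p))                      # complete data (simulated)
phi = np.concatenate([[-1.4], np.full(p, 0.02)])     # illustrative, intercept-dominated phi
F = np.column_stack([np.ones(n), Y])                 # F_i* = (1, y_i1, ..., y_ip)
prob = 1 / (1 + np.exp(-(F @ phi)))                  # P(r_ij = 1 | y_i, phi)
r = rng.binomial(1, prob[:, None], (n, p))           # r_ij = 1 marks a missing entry
Y_obs = np.where(r == 1, np.nan, Y)                  # mask the missing components

print(round(r.mean(), 2))                            # empirical missing rate, near 0.20
```

Raising or lowering the intercept of φ moves the missing rate toward the 30% and 40% scenarios.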
Hyperparameters are set as follows: free elements in $\Lambda_{0yk}$ are set to their true values, and the covariance matrix $H_{0yk}$ is taken as a diagonal matrix with diagonal elements $10^3$; $\Lambda_{0\omega} = (1, 0.6, 0.6, 0.6, 0.6, 0.6)$, and the covariance matrix $H_{0\omega}$ is set the same as $H_{0yk}$; $\alpha_{0yk} = \alpha_{0\sigma} = 9$, $\beta_{0yk} = \beta_{0\sigma} = 4$; $\rho_0 = 1$, $R_0 = 0.2I_2$; $\varphi_0$ is set to its corresponding true values; and $V = I_{10}$.
Accurate computation of Bayesian estimates requires determining how many iterations are required to achieve convergence. Therefore, we set three different sets of initial values, ran three chains separately, and calculated the EPSR values. To intuitively show the variation process of each parameter’s EPSR value with the number of iterations, we plotted the EPSR convergence trend graph for all parameters (see Figure 1). The results indicate that after around 3000 iterations, the EPSR values for all parameters fall below 1.2. Thus, we performed 5000 iterations, discarded the first 2000 iterations as the burn-in period, and conducted Bayesian parameter estimation based on the subsequent 3000 iterations. Residual plots and scatter plots were generated using the obtained parameter estimates to evaluate the model fit.
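The EPSR (estimated potential scale reduction) diagnostic used here is the Gelman–Rubin statistic computed across parallel chains; a compact Python version for a single scalar parameter might look like this (illustrative; the paper's computation is done in R).

```python
import numpy as np

def epsr(chains):
    """Estimated potential scale reduction for one parameter.

    chains: (m, T) array of m parallel chains of length T.
    """
    m, T = chains.shape
    W = chains.var(axis=1, ddof=1).mean()          # mean within-chain variance
    B = T * chains.mean(axis=1).var(ddof=1)        # between-chain variance
    V = (T - 1) / T * W + B / T                    # pooled posterior-variance estimate
    return np.sqrt(V / W)

rng = np.random.default_rng(5)
mixed = rng.standard_normal((3, 2000))             # three chains, same target
stuck = mixed + np.array([[0.0], [2.0], [4.0]])    # chains stuck at different modes

print(epsr(mixed) < 1.2)                           # True: well-mixed chains
print(epsr(stuck) > 1.2)                           # True: diagnostic flags non-convergence
```

In the simulation, this quantity is tracked per parameter over the iterations, and sampling continues until every EPSR value drops below 1.2.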
Figure 2 presents partial estimated residual plots of the model under the scenario with non-ignorable missing data, showing the changes in ε ^ i 1 , ε ^ i 2 , ε ^ i 3 , and δ ^ i with observation indices. Figure 3 displays scatter plots of the measurement equation residuals ε ^ i 1 against the latent variables ξ ^ i 1 , ξ ^ i 2 , and η ^ i . Figure 4 shows scatter plots of the structural equation residuals δ ^ i against the latent variables ξ ^ i 1 and ξ ^ i 2 . From the overall performance of the residual plots and scatter plots, all residuals are uniformly distributed around zero, and most fall between two parallel horizontal lines with a narrow spacing, without showing obvious systematic bias or abnormal fluctuations. This indicates that the constructed model has a good fit to the data and can effectively capture the underlying relationships between variables. The residual plots and scatter plots for the model without missing data are similar, and thus are not repeated here.
We repeated the sampling 100 times. To evaluate the accuracy of the model estimates, we used bias and root mean square error (RMSE), with the specific formulas as follows:
$$\operatorname{Bias}\big(\hat\theta\big) = E\big(\hat\theta\big) - \theta, \qquad \operatorname{RMSE}\big(\hat\theta\big) = \left\{\frac{1}{n}\sum_{i=1}^{n} E\big(\hat\theta_i - \theta\big)^2\right\}^{1/2}.$$
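Computed over the repeated samples, these two criteria amount to a few lines of arithmetic; a Python sketch with toy replication data is:

```python
import numpy as np

def bias_rmse(estimates, truth):
    """estimates: array of one parameter's estimates over the replications."""
    est = np.asarray(estimates, dtype=float)
    bias = est.mean() - truth
    rmse = np.sqrt(np.mean((est - truth) ** 2))
    return bias, rmse

# Toy check: 100 replications scattered around a true value of 0.6.
rng = np.random.default_rng(6)
b, r = bias_rmse(0.6 + 0.05 * rng.standard_normal(100), truth=0.6)

print(abs(b) < 0.02)        # True: bias near zero
print(0.02 < r < 0.08)      # True: RMSE near the 0.05 noise scale
```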
Table 1 presents the Bayesian estimation results of the core regression coefficients ($b_{1\tau}$, $\gamma_{1\tau}$–$\gamma_{5\tau}$) in the structural equation under the missing-data scenario with a sample size of $n = 100$. It covers three error distributions, namely $N(0, 0.3)$, $\operatorname{Beta}(0.5, 0.5)$, and $0.3\chi^2(3)$, as well as three missing rates (M1, M2, M3). The table reports the bias and root mean square error (RMSE) of each coefficient at the quantiles $\tau = 0.1$, 0.5, and 0.9.
As the table shows, regardless of whether the error distribution is normal and of how the missing rate changes, the estimation of the core regression coefficients remains stable: the absolute biases are generally low, all below 0.1, and the RMSE stays within a low range without notable outliers or obvious fluctuation across quantiles. This demonstrates the model's reliable estimation of the core parameters under small samples and missing data.
It is worth noting that the estimation results of other parameters in the measurement equation (such as a 1 a 9 , λ 21 , λ 31 , λ 52 etc., and ϕ 11 ϕ 22 ) are similar to or even better than those of the core regression coefficients. To save space, the detailed estimation results of the above-mentioned other parameters, as well as those under different sample sizes, complete data, and more error distribution combinations (Table A3, Table A4, Table A5, Table A6, Table A7, Table A8 and Table A9), have been uniformly organized in Appendix A, which can be further referred to by readers to verify the estimation performance of the model in a wider range of scenarios.
All simulation experiments in this study were run on a machine with an Intel(R) Core(TM) i5-10210U CPU (1.60 GHz, 2112 MHz, 4 cores, 8 logical processors) and implemented in R (version 4.4.2). The total runtime for Simulations 1 and 2 was approximately 364 h, and the relevant code is provided in Appendix A.

5. A Real Example

In this section, we illustrate the proposed model by analyzing the factors influencing the growth of Chinese listed companies. We selected annual financial report data of Xinjiang listed companies from the Shanghai, Shenzhen, and Beijing Stock Exchanges during the period 2020–2024 from CNINFO (China Securities Information Co., Ltd., Beijing, China). These data include nine observed variables ( y 1 to y 9 ). After data preprocessing, we obtained continuous 5-year observation records of 61 companies, with an overall data missing rate of 8.6%.
Unlike the simulation study, in which four error distributions (such as the normal and the skewed log-normal) had to be preset and the missing data simulated, in practical applications the real data already carry their own distributional characteristics and naturally occurring missingness. There is therefore no need to preset the error distribution or simulate missing data: it suffices to construct the missing indicator matrix r from the preprocessed real data, substitute the data into the model, and conduct Bayesian inference using the asymmetric Laplace distribution (ALD) as the working distribution (for the specific procedure, see the posterior sampling based on $\{Y_o, r\}$ to obtain $\{\Omega, Y_m, \theta, \varphi\}$ in Section 3).
We consider an NQSEM with parameters n = 305 , p = 9 , q = 3 , q 1 = 1 , q 2 = 2 , q 3 = 5 , r 1 = r 2 = 0 . Its measurement equation is as follows:
$$y_i = \Lambda\omega_i + \varepsilon_i,$$
where ω i = ( η i , ξ i 1 , ξ i 2 , ξ i 1 ξ i 2 , ξ i 1 2 , ξ i 2 2 ) T . The non-overlapping factor loading matrix Λ has the following form:
$$\Lambda^T = \begin{pmatrix} 1 & \lambda_{21} & \lambda_{31} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & \lambda_{52} & \lambda_{62} & \lambda_{72} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & \lambda_{93} \end{pmatrix},$$
The measurement equation expresses the explicit relationship between each latent variable and its corresponding indicators. Based on this, we propose the following structural equation to evaluate the impact of potential determinants on growth:
$$\eta_i = \gamma_{1\tau}\xi_{i1} + \gamma_{2\tau}\xi_{i2} + \gamma_{3\tau}\xi_{i1}\xi_{i2} + \gamma_{4\tau}\xi_{i1}^2 + \gamma_{5\tau}\xi_{i2}^2 + \delta_i,$$
Inspired by the existing literature such as Zhi Zheng [22], we set “profitability ( ξ 1 )” and “solvency ( ξ 2 )” as latent variables, and selected five quantiles (0.1, 0.3, 0.5, 0.7, 0.9) to more comprehensively characterize the impact of various factors on development capability under different quantiles. Neither of the above equations considers covariates c i and d i . The observation indicators of the two types of latent variables are calculated according to the formulas listed in Table 2, and the specific path relationships are shown in Figure 5.
The hyperparameter values of the model are set as follows: the elements to be estimated in $\Lambda_{0yk}$ are set to 0.4, and the covariance matrix $H_{0yk}$ is a diagonal matrix with diagonal elements $10^2$; $\Lambda_{0\omega} = (0.5, 0.5, 0.5, 0.5, 0.5)$, and $H_{0\omega}$ is a diagonal matrix with diagonal elements $10^4$; $\alpha_{0yk} = \alpha_{0\sigma} = 9$, $\beta_{0yk} = \beta_{0\sigma} = 4$; $\rho_0 = 1$ and $R_0 = I_2$; $\varphi_0 = (1, 0.3, \ldots, 0.3)$; and $V = I_{10}$.
To examine convergence, we calculated the EPSR values for all parameters based on two parallel Markov chains. The results indicate that the algorithm converges within 5000 iterations. Therefore, we generated a total of 7000 iterations, discarded the results of the first 5000 iterations, and performed Bayesian estimation of the parameters based on the last 2000 iterations. The specific estimation results are shown in Table 3.
First, we set the quantile τ = 0.5 to obtain the estimation results of the factor loading matrix Λ . Since factor loadings may differ across quantiles, which would lead to inconsistent measurement scales of latent variables across quantiles, we fixed the factor loadings across all quantiles to the above-estimated values Λ ^ . Subsequent analyses under other quantile scenarios were conducted based on this setting to enhance the robustness of the model.
As shown in Table 3, these factors all affect corporate growth, and the magnitudes of the coefficients vary across quantiles, indicating that their impacts on the growth of Xinjiang-listed companies change with quantiles.
  • Profitability has a significant positive impact on corporate growth, which is consistent with the findings of Wang and He [23] and Kou [24]. Quantile regression analysis reveals that the profitability coefficient γ 1 τ exhibits a clear monotonically increasing characteristic: for low-growth companies, the marginal contribution of profitability is relatively limited because their growth is mainly constrained by factors such as market environment or management efficiency; for high-growth companies, the driving effect of profitability is significantly enhanced, which is consistent with the research conclusion of Du [25]. Specifically, as a company’s weighted return on equity, operating profit margin, net profit margin, and gross profit margin improve, its comprehensive growth gradually strengthens, with high-growth companies significantly outperforming low-growth ones.
  • Solvency also has a positive effect on corporate growth—the stronger a company’s solvency, the higher its growth. This is consistent with the findings of Shen and Wu [26], but its impact is slightly weaker than that of profitability, which aligns with the conclusion drawn by Xu and Guo [27]. The coefficient γ 2 τ shows an overall downward trend and stabilizes in the high quantile range: solvency has a relatively stronger impact on low-growth companies; its impact on high-growth companies is relatively weaker. This may be because low-growth companies often face issues such as tight capital chains and financing constraints, and strong solvency (e.g., low debt ratio, sufficient cash flow) can effectively reduce financial risks, provide a foundation for their basic growth, and serve as an important support to break through growth bottlenecks. In contrast, the growth of high-growth companies relies more on endogenous funds generated by profits or efficient external financing to support expansion. Although solvency still provides necessary financial stability for growth, its marginal driving effect on growth decreases.
  • The impact coefficient of the interaction term between profitability and solvency on growth ( γ 3 τ ) is significantly positive at all quantiles and shows an overall upward trend, indicating that their synergistic effect positively drives growth, with such effect strengthening as corporate growth improves. Specifically, the coefficient of the interaction term for low-growth companies, though positive, is relatively small, suggesting that the synergistic pull on growth is weak in this case. This may be because the core constraints on the growth of such companies lie in unresolved fundamental issues such as insufficient business expansion capabilities and market competitiveness, rather than inadequate coordination between profitability and solvency, making it difficult for synergistic effects to take hold. For high-growth companies, the coefficient of the interaction term increases significantly, and the synergistic effect is notably enhanced. This is because high-growth companies not only need profitability to support endogenous expansion but also require strong solvency to ensure smooth external financing, forming a positive cycle of “profit laying the foundation for expansion + solvency alleviating financing constraints” and thus providing stronger impetus for high growth.
  • The impact coefficient of the squared term of profitability on growth ( γ 4 τ ) gradually shifts from negative to positive with increasing values: for low-growth companies, the negative coefficient indicates that increased profitability may inhibit growth, i.e., an “excessive pursuit of profits” can have adverse effects. This is likely because such companies have limited profit scales, with profits mostly used to cover operational gaps; an excessive focus on profitability would squeeze investments in growth-oriented activities such as research and development and market development, leading to a diminishing marginal positive effect of profitability on growth. For high-growth companies, the positive and increasing coefficient reflects an increasing marginal positive effect of profitability on growth. At this stage, companies have a solid profit foundation, and profit growth can be more efficiently converted into growth momentum by supporting large-scale expansion, enhancing risk resistance, and improving financing advantages.
  • The impact coefficient of the squared term of solvency on growth ( γ 5 τ ) changes from a slightly negative value to a significantly positive one with a monotonic increasing trend: for low-growth companies, the coefficient of the squared term is close to zero, indicating that marginal changes in solvency have little differential impact on growth. This may be because the growth bottleneck for these companies lies in business fundamentals (e.g., insufficient market demand and low efficiency), and improved solvency can only maintain financial stability without generating additional impetus for growth. For high-growth companies, the significantly positive and increasing coefficient of the squared term indicates an increasing marginal positive effect of improved solvency. This is probably because their expansion requires substantial capital, and enhanced solvency can reduce financing costs, broaden financing channels, and boost investor confidence, supporting aggressive growth strategies. This forms a cycle of “strengthened solvency → expanded financing advantages → accelerated growth,” where the leverage effect of marginal improvements on growth increases with the enhancement of solvency.
In summary, this study identifies profitability, solvency, their squared terms, and their interaction term as key factors influencing corporate growth, and quantifies their specific impact intensities at different quantile levels. This provides more detailed empirical evidence for unraveling the driving logic behind enterprise growth.

6. Discussion

In this paper, we constructed a nonlinear quantile structural equation model with non-ignorable missing data. This model can not only comprehensively analyze the latent variable relationships between corporate financial variables and growth but also account for the non-ignorable missing mechanism through a linear logistic regression model. Meanwhile, we propose a Bayesian method based on ALD theory to carry out posterior inference. Simulation studies show that the model performs well in both computational efficiency and parameter estimation accuracy. After applying it to the data analysis of growth and its determinants of listed companies in Xinjiang, the results not only confirm previous research conclusions on the impact of factors such as profitability and solvency on growth but also provide new insights into the driving mechanisms of corporate growth.
Future research can proceed in the following four directions: First, in nonlinear quantile structural equation models, variable selection is also of great significance, especially when latent variables in the model exhibit significant impacts only at certain quantiles but not at others. Regularization methods show obvious advantages in this model framework because they can simultaneously realize parameter estimation and variable selection. A feasible research idea is to introduce the method proposed by Feng et al. [3] into nonlinear quantile structural equation models and conduct further research on its adaptability and extensibility under nonlinear settings. Second, the current model only incorporates continuous indicators in the measurement equations, while real-world data often include various types such as ordinal data (e.g., satisfaction scores, rating assessments), count data (e.g., number of event occurrences, quantity statistics), and unordered categorical data (e.g., attribute classification, group division). Therefore, we can further construct a generalized nonlinear quantile structural equation model compatible with multiple data types. By extending the distributional assumptions of measurement equations (e.g., introducing discrete response models within the quantile regression framework), the model can handle continuous, ordinal, count, and unordered categorical indicators simultaneously, thus better fitting the complexity of actual data and enhancing its applicability and explanatory power. Furthermore, regarding missing data, we can explore the Bayesian analysis of nonlinear quantile structural equation models under scenarios where covariates have non-ignorable missingness, or where both response variables and covariates have non-ignorable missingness. 
Finally, similar to Dang and Maestrini’s [28] exploration of variational approximate inference in conventional structural equation models (CSEMs), introducing variational methods into nonlinear quantile structural equation models with non-ignorable missing data is expected to significantly improve computational efficiency while providing reliable inference results, offering a more efficient solution for the practical application of such complex models.

Author Contributions

Conceptualization, L.Z.; methodology, M.T.; software, L.Z.; validation, M.T.; formal analysis, L.Z.; investigation, M.T.; resources, M.T.; data curation, L.Z.; writing—original draft preparation, L.Z.; writing—review and editing, M.T.; visualization, L.Z.; supervision, M.T.; project administration, M.T.; funding acquisition, M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by 2025 Xinjiang University Faculty Excellence Program under the Autonomous Region’s Double First-Class Initiative (grant no. 51172500101).

Data Availability Statement

These data were derived from the following resources available in the public domain: annual financial report data of Xinjiang listed companies from the Shanghai, Shenzhen, and Beijing Stock Exchanges during the period 2020–2024, obtained from CNINFO (https://www.cninfo.com.cn, accessed on 18 June 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

When performing Metropolis–Hastings sampling, the required proposal distributions and their variances are as follows, with reference to the methods proposed by Lee and Song [13] and Lee and Tang [7].
First, samples $\omega_i$ are drawn from $p(\omega_i \mid y_i, r_i, \theta, \varphi)$. Specifically, $N(0, \sigma_\omega^2\Omega_{\omega i})$ is chosen as its proposal distribution, where $\Omega_{\omega i} = \big(\Sigma_{\omega i}^{-1} + \Lambda^T\Psi_{\varepsilon i}^{-1}\Lambda\big)^{-1}$, and $\Sigma_{\omega i}^{-1}$ is given by
$$\Sigma_{\omega i}^{-1} = \begin{pmatrix} \Psi_{\delta i}^{-1} & -\Psi_{\delta i}^{-1}\Gamma_\tau\Delta_H \\ -\Delta_H^T\Gamma_\tau^T\Psi_{\delta i}^{-1} & \Phi^{-1} + \Delta_H^T\Gamma_\tau^T\Psi_{\delta i}^{-1}\Gamma_\tau\Delta_H \end{pmatrix},$$
with $\Delta_H = \partial H(\xi_i)/\partial\xi_i^T\big|_{\xi_i=0}$. The implementation of the MH algorithm is as follows:
In the $(t+1)$-th iteration, based on the current value $\omega_i^{(t)}$, a new candidate $\omega_i^*$ is generated from the proposal distribution $N\big(\omega_i^{(t)}, \sigma_\omega^2\Omega_{\omega i}\big)$. The probability of accepting this candidate is
$$\min\left\{1,\ \frac{p\big(\omega_i^* \mid y_i, r_i, \theta, \varphi\big)}{p\big(\omega_i^{(t)} \mid y_i, r_i, \theta, \varphi\big)}\right\},$$
where the variance σ ω 2 is adjusted to ensure the mean acceptance rate reaches roughly 0.25 or greater (Gelman et al. [29]).
Second, samples $y_{mi}$ are drawn from $p(y_{mi} \mid y_{oi}, \omega_i, r_i, \theta, \varphi)$. $N(0, \sigma_y^2\Omega_{y_{mi}})$ is selected as its proposal distribution, with variance
$$\Omega_{y_{mi}} = \left(\Psi_{m\varepsilon i}^{-1} + \varphi_{mi}\,\frac{p\exp\big(\varphi_0 + \sum_{l\in\bar D}\varphi_l y_{il}\big)}{\big[1+\exp\big(\varphi_0 + \sum_{l\in\bar D}\varphi_l y_{il}\big)\big]^2}\,\varphi_{mi}^T\right)^{-1},$$
where $\Psi_{m\varepsilon i}$ is a submatrix of $\Psi_{\varepsilon i}$ and $\varphi_{mi}$ is a subvector of φ, both containing the rows and columns corresponding to $y_{mi}$, and $\bar D = \{t_1, \ldots, t_k\}$ is the set of indices corresponding to the observed data $y_{oi}$. The selection criterion for $\sigma_y^2$ and the sampling process are the same as above.
Finally, samples φ are drawn from $p(\varphi \mid Y, \Omega, r, \theta)$. $N(0, \sigma_\varphi^2\Omega_\varphi)$ is chosen as its proposal distribution, with variance
$$\Omega_\varphi = \left(\frac{p}{4}\sum_{i=1}^{n} F_i^* F_i^{*T} + V^{-1}\right)^{-1}.$$
The selection criterion for $\sigma_\varphi^2$ and the sampling process are the same as above.
The selection of proposal distributions for the above three parameters and the calculation of their covariances both refer to the article by Lee and Tang [7], and the average acceptance rates of the three parameters are all maintained between 0.25 and 0.4 by adjusting σ ω 2 , σ y 2 , and σ φ 2 .
The Gibbs sampling process and the posterior distribution $p(\theta \mid Y, \Omega)$ of the unknown parameter θ are presented below, where the notation follows that in Lee and Song [13].
Let $\Omega_2 = (\xi_1, \ldots, \xi_n)$, $Y = (y_1, \ldots, y_n)$, $C = (c_1, \ldots, c_n)$, $D = (d_1, \ldots, d_n)$, $\sigma_y = (\sigma_{y1}, \ldots, \sigma_{yp})^T$, $e_\eta = (e_{\eta 1}, \ldots, e_{\eta n})^T$, $e_{yi} = (e_{yi1}, \ldots, e_{yip})^T$, $e_y = (e_{y1}, \ldots, e_{yn})$, and $v_i = (d_i^T, H(\xi_i)^T)^T$; $\Psi_{\varepsilon ik}$ denotes the k-th diagonal element of the matrix $\Psi_{\varepsilon i}$.
Given $\Lambda_y = (A, \Lambda)$, let $\lambda_{ykj}$ denote the corresponding element of the matrix, i.e., $\Lambda_y = (\lambda_{ykj})$, where $j = 1, \ldots, r_1+q$ and $k = 1, \ldots, p$. The positions of fixed elements in $\Lambda_y$ are identified by the index matrix $L_y = (l_{ykj})$, whose elements are defined as follows:
$$l_{ykj} = \begin{cases} 0, & \text{if } \lambda_{ykj} \text{ is fixed}, \\ 1, & \text{if } \lambda_{ykj} \text{ is free}. \end{cases}$$
Let $u_i = (c_i^T, \omega_i^T)^T$, $U = (u_1, \ldots, u_n)$, and let $U_k = (u_{k1}^*, \ldots, u_{kn}^*)$ be the submatrix of U in which the rows corresponding to $l_{ykj} = 0$ are set to zero vectors, while the rows with $l_{ykj} = 1$ are retained. Meanwhile, let $Y_k^* = (y_{1k}^*, \ldots, y_{nk}^*)^T$, where $y_{ik}^* = y_{ik} - \sum_{j=1}^{r_1+q}\lambda_{ykj}u_{ij}(1 - l_{ykj})$, and $u_{ij}$ is the j-th element of $u_i$. The Gibbs sampling process is as follows:
1. Updating σ y k :
$$
\begin{aligned}
p\big(\sigma_{yk}^{-1} \mid Y, U, \Lambda_{yk}\big) &\propto p\big(\sigma_{yk}^{-1}\big)\, p\big(Y \mid \sigma_{yk}^{-1}, \Lambda_{yk}, U\big) \\
&\propto \big(\sigma_{yk}^{-1}\big)^{\alpha_{0yk}-1}\exp\big(-\beta_{0yk}\sigma_{yk}^{-1}\big)\cdot\prod_{i=1}^{n}\frac{0.5^2}{\sigma_{yk}}\exp\big\{-\sigma_{yk}^{-1}\rho_\tau\big(y_{ik}-\Lambda_{yk}^T u_i\big)\big\} \\
&= \big(\sigma_{yk}^{-1}\big)^{\alpha_{0yk}-1}\exp\big(-\beta_{0yk}\sigma_{yk}^{-1}\big)\cdot\prod_{i=1}^{n}\frac{\sigma_{yk}^{-1}}{4}\exp\big\{-\tfrac{1}{2}\sigma_{yk}^{-1}\big|y_{ik}-\Lambda_{yk}^T u_i\big|\big\} \\
&\propto \big(\sigma_{yk}^{-1}\big)^{\alpha_{0yk}+n-1}\exp\Big\{-\Big(\beta_{0yk}+\tfrac{1}{2}\sum_{i=1}^{n}\big|y_{ik}-\Lambda_{yk}^T u_i\big|\Big)\sigma_{yk}^{-1}\Big\}.
\end{aligned}
$$
For $k = 1, \ldots, p$, it holds that
$$\sigma_{yk}^{-1} \mid Y, U, \Lambda_{yk} \sim \operatorname{Gamma}\Big(\alpha_{0yk}+n,\ \beta_{0yk}+\tfrac{1}{2}\sum_{i=1}^{n}\big|y_{ik}-\Lambda_{yk}^T u_i\big|\Big).$$
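This Gamma full conditional is straightforward to sample. A Python sketch of the update follows (with simulated residuals standing in for $y_{ik} - \Lambda_{yk}^T u_i$; note that numpy's gamma sampler is parameterized by shape and scale, so the rate is inverted).

```python
import numpy as np

rng = np.random.default_rng(7)
n, alpha0, beta0 = 300, 9.0, 4.0
resid = 0.3 * rng.standard_normal(n)               # y_ik - Lambda_yk^T u_i (simulated)

shape = alpha0 + n                                 # alpha_0yk + n
rate = beta0 + 0.5 * np.sum(np.abs(resid))         # beta_0yk + (1/2) sum |residual|
draws = rng.gamma(shape, 1.0 / rate, 10_000)       # draws of sigma_yk^{-1}

# The sample mean should sit near the Gamma mean shape/rate.
print(abs(draws.mean() - shape / rate) / (shape / rate) < 0.05)   # True
```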
2. Updating $e_{yik}$:
$$
p(e_{yik} \mid y_{ik}, u_i, \Lambda_{yk}, \sigma_{yk}) \propto p(e_{yik})\, p(y_{ik} \mid \Lambda_{yk}, u_i, \sigma_{yk}, e_{yik}) \propto \sigma_{yk}^{-1}\exp\left(-\sigma_{yk}^{-1}e_{yik}\right) \cdot \frac{1}{\sqrt{2\pi \Psi_{\varepsilon ik}}}\exp\left\{-\frac{\left(y_{ik}-\Lambda_{yk}u_i\right)^2}{2\Psi_{\varepsilon ik}}\right\},
$$
Let $e_{yik}^{*} = e_{yik}^{-1}$, with $\Psi_{\varepsilon ik} = 8\sigma_{yk}e_{yik}$. Then
$$
\begin{aligned}
p(e_{yik}^{*} \mid y_{ik}, u_i, \Lambda_{yk}, \sigma_{yk}) &\propto \sigma_{yk}^{-1}\exp\left(-\sigma_{yk}^{-1}e_{yik}^{*-1}\right) \cdot \frac{1}{\sqrt{2\pi \cdot 8\sigma_{yk}e_{yik}^{*-1}}}\exp\left\{-\frac{\left(y_{ik}-\Lambda_{yk}u_i\right)^2 e_{yik}^{*}}{16\sigma_{yk}}\right\} e_{yik}^{*-2} \\
&\propto \sqrt{\frac{2\sigma_{yk}^{-1}}{2\pi e_{yik}^{*3}}}\exp\left\{-\frac{\left(y_{ik}-\Lambda_{yk}u_i\right)^2 e_{yik}^{*}}{16\sigma_{yk}} - \sigma_{yk}^{-1}e_{yik}^{*-1}\right\},
\end{aligned}
$$
For $i = 1, \ldots, n$ and $k = 1, \ldots, p$, it holds that
$$
p(e_{yik}^{-1} \mid y_{ik}, u_i, \Lambda_{yk}, \sigma_{yk}) \sim \text{Inverse Gaussian}(\mu, \lambda),
$$
where $\mu = 4\left| y_{ik}-\Lambda_{yk}u_i\right|^{-1}$ and $\lambda = 2\sigma_{yk}^{-1}$.
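numpy's `wald` sampler uses exactly this (mean, scale) = $(\mu, \lambda)$ parameterization of the inverse Gaussian, so the update can be sketched as follows (illustrative names):

```python
import numpy as np

rng = np.random.default_rng(2)

def update_e_inv(resid, sigma_yk):
    """Elementwise draw of e_yik^{-1} ~ Inverse Gaussian(mu, lam)
    with mu = 4 / |y_ik - Lambda_yk u_i| and lam = 2 / sigma_yk."""
    mu = 4.0 / np.abs(resid)
    lam = 2.0 / sigma_yk
    return rng.wald(mu, lam)   # wald(mean, scale) is IG(mu, lambda)

resid = rng.normal(size=500)               # stand-in residuals
draws = update_e_inv(resid, sigma_yk=0.5)  # one latent scale per observation
```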
3. Updating $\Lambda_y$:
$$
p(\Lambda_{yk} \mid Y, e_{yik}, \sigma_{yk}) \propto p(\Lambda_{yk})\, p(Y \mid \Lambda_{yk}, e_{yik}, \sigma_{yk}),
$$
To derive $p(\Lambda_{yk} \mid Y, e_{yik}, \sigma_{yk})$, we can refer to the method proposed by Lindley and Smith [30] for deriving posterior distributions. Specifically, let $y \sim N(\mu, D)$ denote that the column vector $y$ follows a multivariate normal distribution with mean vector $\mu$ and positive semi-definite variance matrix $D$. Given $\theta_1$,
$$
y \sim N(A_1\theta_1, C_1),
$$
and given $\theta_2$,
$$
\theta_1 \sim N(A_2\theta_2, C_2),
$$
then the distribution of $\theta_1$ given $y$ is $N(Bb, B)$, where $B^{-1} = A_1^T C_1^{-1} A_1 + C_2^{-1}$ and $b = A_1^T C_1^{-1} y + C_2^{-1} A_2 \theta_2$. Similarly, we can derive the posterior distribution of $\Lambda_{yk}$. Specifically, given $\Lambda_{yk}$, $e_{yik}$, and $\sigma_{yk}$, we have
$$
y_i \mid \Lambda_{yk}, e_{yik}, \sigma_{yk} \sim N_p(\Lambda_y u_i, \Psi_{\varepsilon i}),
$$
$$
\Lambda_{yk} \sim N_{r_1+q}(\Lambda_{0yk}, H_{0yk}),
$$
Thus, for $k = 1, \ldots, p$, it holds that
$$
p(\Lambda_{yk} \mid Y, e_{yik}, \sigma_{yk}) \sim N_{r_1+q}(B_{\Lambda k} b_{\Lambda k}, B_{\Lambda k}),
$$
where $A_1\theta_1 = \Lambda_y u_i$, $C_1 = \Psi_{\varepsilon i}$, $A_2\theta_2 = \Lambda_{0yk}$, and $C_2 = H_{0yk}$; the variance of the multivariate normal distribution is $B_{\Lambda k} = \left(\sum_{i=1}^{n}\dfrac{u_{ki}^{*}u_{ki}^{*T}}{8\sigma_{yk}e_{yik}} + H_{0yk}^{-1}\right)^{-1}$, and the mean is $B_{\Lambda k} b_{\Lambda k} = B_{\Lambda k}\left(\sum_{i=1}^{n}\dfrac{y_{ik}^{*}u_{ki}^{*}}{8\sigma_{yk}e_{yik}} + H_{0yk}^{-1}\Lambda_{0yk}\right)$.
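Computationally, this normal-normal update is just two weighted accumulations followed by one matrix inverse. A minimal sketch (hypothetical names; `w` stands in for the observation weights $8\sigma_{yk}e_{yik}$):

```python
import numpy as np

rng = np.random.default_rng(3)

def update_lambda(Ustar, ystar, w, Lam0, H0inv):
    """Conjugate normal update: precision B^{-1} = sum u_i u_i^T / w_i + H0^{-1},
    mean Bb = B (sum y*_i u_i / w_i + H0^{-1} Lam0)."""
    prec = H0inv + sum(np.outer(u, u) / wi for u, wi in zip(Ustar, w))
    B = np.linalg.inv(prec)
    b = H0inv @ Lam0 + sum(yi * u / wi for yi, u, wi in zip(ystar, Ustar, w))
    return rng.multivariate_normal(B @ b, B)

d = 3                                   # dimension r1 + q of the free row
Ustar = rng.normal(size=(50, d))        # rows u*_ki
ystar = rng.normal(size=50)             # adjusted responses y*_ik
w = rng.gamma(2.0, size=50)             # positive weights 8*sigma_yk*e_yik
draw = update_lambda(Ustar, ystar, w, Lam0=np.zeros(d), H0inv=np.eye(d))
```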
4. Updating $\Phi$:
$$
\begin{aligned}
p(\Phi \mid \Omega_2) &\propto p(\Phi)\prod_{i=1}^{n} p(\xi_i \mid \Phi) \\
&\propto |\Phi|^{-\frac{\rho_0+q+1}{2}}\exp\left\{-\frac{1}{2}\operatorname{tr}\left(R_0^{-1}\Phi^{-1}\right)\right\} \cdot \prod_{i=1}^{n}|\Phi|^{-\frac{1}{2}}\exp\left\{-\frac{1}{2}\xi_i^T\Phi^{-1}\xi_i\right\} \\
&= |\Phi|^{-\frac{(\rho_0+n)+q+1}{2}}\exp\left\{-\frac{1}{2}\left[\operatorname{tr}\left(R_0^{-1}\Phi^{-1}\right) + \sum_{i=1}^{n}\xi_i^T\Phi^{-1}\xi_i\right]\right\} \\
&= |\Phi|^{-\frac{(\rho_0+n)+q+1}{2}}\exp\left\{-\frac{1}{2}\left[\operatorname{tr}\left(R_0^{-1}\Phi^{-1}\right) + \operatorname{tr}\left(\Omega_2^T\Phi^{-1}\Omega_2\right)\right]\right\} \\
&= |\Phi|^{-\frac{(\rho_0+n)+q+1}{2}}\exp\left\{-\frac{1}{2}\operatorname{tr}\left[\Phi^{-1}\left(R_0^{-1} + \Omega_2\Omega_2^T\right)\right]\right\},
\end{aligned}
$$
Thus,
$$
p(\Phi \mid \Omega_2) \sim \text{Inverse Wishart}\left(R_0^{-1} + \Omega_2\Omega_2^T,\; \rho_0 + n\right).
$$
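With scipy this conjugate draw is a single call (illustrative names and dimensions; scipy's `invwishart` takes the degrees of freedom `df` and the `scale` matrix):

```python
import numpy as np
from scipy.stats import invwishart

rng = np.random.default_rng(4)

q, n, rho0 = 2, 100, 8
R0inv = np.eye(q)                      # prior scale R0^{-1}
Omega2 = rng.normal(size=(q, n))       # latent xi_i stacked as columns

# posterior Phi | Omega2 ~ IW(R0^{-1} + Omega2 Omega2^T, rho0 + n)
scale = R0inv + Omega2 @ Omega2.T
Phi = invwishart.rvs(df=rho0 + n, scale=scale, random_state=rng)
```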
5. Updating $\sigma_\eta$:
$$
\begin{aligned}
p(\sigma_\eta^{-1} \mid \eta, \Lambda_{\omega\tau}, V) &\propto p(\sigma_\eta^{-1})\prod_{i=1}^{n} p(\eta_i \mid \sigma_\eta^{-1}, \Lambda_{\omega\tau}, V) \\
&\propto (\sigma_\eta^{-1})^{\alpha_{0\sigma}-1}\exp\left(-\beta_{0\sigma}\sigma_\eta^{-1}\right) \cdot \prod_{i=1}^{n}\frac{\tau(1-\tau)}{\sigma_\eta}\exp\left\{-\sigma_\eta^{-1}\rho_\tau\left(\eta_i - \Lambda_{\omega\tau}v_i\right)\right\} \\
&\propto (\sigma_\eta^{-1})^{\alpha_{0\sigma}+n-1}\exp\left\{-\left(\beta_{0\sigma} + \sum_{i=1}^{n}\rho_\tau\left(\eta_i - \Lambda_{\omega\tau}v_i\right)\right)\sigma_\eta^{-1}\right\},
\end{aligned}
$$
Thus,
$$
p(\sigma_\eta^{-1} \mid \eta, \Lambda_{\omega\tau}, V) \sim \text{Gamma}\left(\alpha_{0\sigma}+n,\; \beta_{0\sigma} + \sum_{i=1}^{n}\rho_\tau\left(\eta_i - \Lambda_{\omega\tau}v_i\right)\right).
$$
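This is the same conjugate pattern as step 1, but at a general quantile $\tau$ the rate involves the check loss $\rho_\tau(u) = u(\tau - I\{u < 0\})$. A sketch (hypothetical names):

```python
import numpy as np

rng = np.random.default_rng(5)

def rho(u, tau):
    """Check loss rho_tau(u) = u * (tau - I(u < 0)); always nonnegative."""
    return u * (tau - (u < 0))

def update_sigma_eta_inv(resid, tau, alpha0, beta0):
    """Draw sigma_eta^{-1} ~ Gamma(alpha0 + n, beta0 + sum rho_tau(resid_i))."""
    n = resid.shape[0]
    rate = beta0 + rho(resid, tau).sum()
    return rng.gamma(alpha0 + n, scale=1.0 / rate)

resid = rng.normal(size=300)   # stand-in structural residuals eta_i - Lambda v_i
draw = update_sigma_eta_inv(resid, tau=0.9, alpha0=9.0, beta0=4.0)
```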
6. Updating $e_{\eta i}$:
$$
p(e_{\eta i} \mid \eta_i, v_i, \Lambda_{\omega\tau}, \sigma_\eta) \propto p(e_{\eta i})\, p(\eta_i \mid v_i, \Lambda_{\omega\tau}, \sigma_\eta, e_{\eta i}) \propto \sigma_\eta^{-1}e^{-\sigma_\eta^{-1}e_{\eta i}} \cdot \frac{1}{\sqrt{2\pi k_2(\tau)\sigma_\eta e_{\eta i}}}\exp\left\{-\frac{\left(\eta_i - \Lambda_{\omega\tau}v_i - k_1(\tau)e_{\eta i}\right)^2}{2k_2(\tau)\sigma_\eta e_{\eta i}}\right\},
$$
Let $e_{\eta i}^{*} = e_{\eta i}^{-1}$. Then
$$
\begin{aligned}
p(e_{\eta i}^{*} \mid \eta_i, v_i, \Lambda_{\omega\tau}, \sigma_\eta) &\propto \sigma_\eta^{-1}\exp\left(-\sigma_\eta^{-1}e_{\eta i}^{*-1}\right) \cdot \frac{1}{\sqrt{2\pi k_2(\tau)\sigma_\eta e_{\eta i}^{*-1}}}\exp\left\{-\frac{\left(\eta_i - \Lambda_{\omega\tau}v_i - k_1(\tau)e_{\eta i}^{*-1}\right)^2 e_{\eta i}^{*}}{2k_2(\tau)\sigma_\eta}\right\} e_{\eta i}^{*-2} \\
&\propto \sqrt{\frac{1}{2\pi k_2(\tau)\sigma_\eta e_{\eta i}^{*3}}}\exp\left\{-\frac{\left(\eta_i - \Lambda_{\omega\tau}v_i\right)^2 e_{\eta i}^{*} - 2\left(\eta_i - \Lambda_{\omega\tau}v_i\right)k_1(\tau) + \left[k_1(\tau)^2 + 2k_2(\tau)\right]e_{\eta i}^{*-1}}{2k_2(\tau)\sigma_\eta}\right\},
\end{aligned}
$$
For $i = 1, \ldots, n$, it holds that
$$
p(e_{\eta i}^{-1} \mid \eta_i, v_i, \Lambda_{\omega\tau}, \sigma_\eta) \sim \text{Inverse Gaussian}\left(\frac{\sqrt{k_1(\tau)^2 + 2k_2(\tau)}}{\left|\eta_i - \Lambda_{\omega\tau}v_i\right|},\; \frac{k_1(\tau)^2 + 2k_2(\tau)}{k_2(\tau)\sigma_\eta}\right).
$$
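In the standard asymmetric-Laplace mixture representation of Kozumi and Kobayashi [16], the constants are $k_1(\tau) = \frac{1-2\tau}{\tau(1-\tau)}$ and $k_2(\tau) = \frac{2}{\tau(1-\tau)}$ (an assumption here, since this chunk does not restate them). A sketch of the resulting draw, with illustrative names:

```python
import numpy as np

rng = np.random.default_rng(6)

def k1(tau):
    return (1 - 2 * tau) / (tau * (1 - tau))

def k2(tau):
    return 2.0 / (tau * (1 - tau))

def update_e_eta_inv(resid, tau, sigma_eta):
    """Draw e_eta_i^{-1} ~ IG(sqrt(k1^2 + 2*k2)/|resid_i|,
    (k1^2 + 2*k2)/(k2 * sigma_eta)), elementwise."""
    c = k1(tau) ** 2 + 2 * k2(tau)
    mu = np.sqrt(c) / np.abs(resid)
    lam = c / (k2(tau) * sigma_eta)
    return rng.wald(mu, lam)

resid = rng.normal(size=100) + 0.05   # stand-in residuals eta_i - Lambda v_i
draws = update_e_eta_inv(resid, tau=0.3, sigma_eta=1.2)
```

At $\tau = 0.5$, $k_1 = 0$ and $k_2 = 8$, which recovers the measurement-equation update of step 2.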
7. Updating $\Lambda_{\omega\tau}$:
$$
p(\Lambda_{\omega\tau} \mid \Omega, e_\eta, \sigma_\eta) \propto p(\Lambda_{\omega\tau})\prod_{i=1}^{n} p(\eta_i \mid \xi_i, e_{\eta i}, \sigma_\eta, \Lambda_{\omega\tau}),
$$
Similar to the posterior derivation of $\Lambda_{yk}$, given $\xi_i$, $e_{\eta i}$, and $\sigma_\eta$, we have
$$
\eta_i - k_1(\tau)e_{\eta i} \mid \Lambda_{\omega\tau}, e_{\eta i}, \sigma_\eta \sim N(\Lambda_{\omega\tau}v_i, \Psi_{\delta i}),
$$
$$
\Lambda_{\omega\tau} \sim N_{r_2+q_3}(\Lambda_{0\omega}, H_{0\omega}),
$$
Thus, we can conclude that
$$
p(\Lambda_{\omega\tau} \mid \Omega, e_\eta, \sigma_\eta) \sim N_{r_2+q_3}(B_{\omega\tau}b_{\omega\tau}, B_{\omega\tau}),
$$
where $A_1\theta_1 = \Lambda_{\omega\tau}v_i$, $C_1 = \Psi_{\delta i}$, $A_2\theta_2 = \Lambda_{0\omega}$, and $C_2 = H_{0\omega}$; the variance of the multivariate normal distribution is $B_{\omega\tau} = \left(\sum_{i=1}^{n}\dfrac{v_i v_i^T}{k_2(\tau)\sigma_\eta e_{\eta i}} + H_{0\omega}^{-1}\right)^{-1}$, and the mean is $B_{\omega\tau}b_{\omega\tau} = B_{\omega\tau}\left(\sum_{i=1}^{n}\dfrac{\left(\eta_i - k_1(\tau)e_{\eta i}\right)v_i}{k_2(\tau)\sigma_\eta e_{\eta i}} + H_{0\omega}^{-1}\Lambda_{0\omega}\right)$.
Table A1. True values of parameter vector φ under different missing rates and cases ( n = 100 ).
| Missing Rate | Case | $\varphi_0$ | $\varphi_1$ | $\varphi_2$ | $\varphi_3$ | $\varphi_4$ | $\varphi_5$ | $\varphi_6$ | $\varphi_7$ | $\varphi_8$ | $\varphi_9$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 20% | 1 | −4.00 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 |
| | 3 | −3.80 | 0.19 | 0.19 | 0.19 | 0.19 | 0.19 | 0.19 | 0.19 | 0.19 | 0.19 |
| | 4 | −4.00 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 |
| 30% | 1 | −4.00 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 |
| | 3 | −4.00 | 0.27 | 0.27 | 0.27 | 0.27 | 0.27 | 0.27 | 0.27 | 0.27 | 0.27 |
| | 4 | −4.00 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 |
| 40% | 1 | −4.00 | 0.60 | 0.60 | 0.60 | 0.60 | 0.60 | 0.60 | 0.60 | 0.60 | 0.60 |
| | 3 | −4.00 | 0.36 | 0.36 | 0.36 | 0.36 | 0.36 | 0.36 | 0.36 | 0.36 | 0.36 |
| | 4 | −4.00 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 |
Table A2. True values of parameter vector φ under different missing rates and cases ( n = 300 ).
| Missing Rate | Case | $\varphi_0$ | $\varphi_1$ | $\varphi_2$ | $\varphi_3$ | $\varphi_4$ | $\varphi_5$ | $\varphi_6$ | $\varphi_7$ | $\varphi_8$ | $\varphi_9$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 20% | 1 | −4.00 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 | 0.28 |
| | 3 | −3.80 | 0.18 | 0.18 | 0.18 | 0.18 | 0.18 | 0.18 | 0.18 | 0.18 | 0.18 |
| | 4 | −4.00 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 |
| 30% | 1 | −4.00 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 | 0.40 |
| | 3 | −3.80 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 |
| | 4 | −4.00 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 | 0.20 |
| 40% | 1 | −4.00 | 0.65 | 0.65 | 0.65 | 0.65 | 0.65 | 0.65 | 0.65 | 0.65 | 0.65 |
| | 3 | −3.80 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 | 0.34 |
| | 4 | −4.00 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 |
Table A3. Bayesian estimates of regression coefficients in the structural equation with complete data.
| Par | τ | N(0, 0.3) Bias | RMS | lnN(0, 0.25) Bias | RMS | Beta(0.5, 0.5) Bias | RMS | 0.3χ²(3) Bias | RMS |
|---|---|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0090 | 0.0145 | 0.0011 | 0.0030 | 0.0046 | 0.0076 | −0.0006 | 0.0033 |
| | 0.5 | 0.0002 | 0.0093 | 0.0000 | 0.0023 | 0.0010 | 0.0043 | 0.0031 | 0.0042 |
| | 0.9 | −0.0048 | 0.0113 | 0.0002 | 0.0037 | −0.0035 | 0.0068 | 0.0090 | 0.0101 |
| $\gamma_{1\tau}$ | 0.1 | 0.0033 | 0.0082 | −0.0191 | 0.0193 | −0.0345 | 0.0348 | −0.0217 | 0.0219 |
| | 0.5 | −0.0014 | 0.0070 | −0.0034 | 0.0041 | −0.0203 | 0.0206 | −0.0087 | 0.0090 |
| | 0.9 | −0.0008 | 0.0082 | 0.0222 | 0.0228 | −0.0085 | 0.0108 | 0.0104 | 0.0121 |
| $\gamma_{2\tau}$ | 0.1 | 0.0165 | 0.0182 | −0.0193 | 0.0195 | −0.0375 | 0.0378 | −0.0242 | 0.0244 |
| | 0.5 | 0.0012 | 0.0071 | −0.0035 | 0.0041 | −0.0226 | 0.0229 | −0.0105 | 0.0108 |
| | 0.9 | −0.0072 | 0.0110 | 0.0219 | 0.0222 | −0.0160 | 0.0173 | 0.0052 | 0.0087 |
| $\gamma_{3\tau}$ | 0.1 | 0.0017 | 0.0058 | −0.0261 | 0.0263 | −0.0238 | 0.0241 | −0.0211 | 0.0213 |
| | 0.5 | −0.0004 | 0.0053 | −0.0186 | 0.0188 | −0.0253 | 0.0255 | −0.0165 | 0.0166 |
| | 0.9 | −0.0010 | 0.0059 | −0.0027 | 0.0062 | −0.0316 | 0.0323 | −0.0063 | 0.0080 |
| $\gamma_{4\tau}$ | 0.1 | −0.0401 | 0.0410 | −0.0278 | 0.0282 | −0.0290 | 0.0297 | −0.0247 | 0.0251 |
| | 0.5 | 0.0036 | 0.0088 | −0.0067 | 0.0075 | 0.0019 | 0.0058 | −0.0050 | 0.0060 |
| | 0.9 | 0.0502 | 0.0509 | 0.0378 | 0.0389 | 0.0782 | 0.0788 | 0.0486 | 0.0496 |
| $\gamma_{5\tau}$ | 0.1 | −0.0512 | 0.0519 | −0.0295 | 0.0299 | −0.0360 | 0.0366 | −0.0252 | 0.0255 |
| | 0.5 | −0.0027 | 0.0080 | −0.0086 | 0.0092 | −0.0057 | 0.0080 | −0.0045 | 0.0063 |
| | 0.9 | 0.0484 | 0.0493 | 0.0317 | 0.0325 | 0.0596 | 0.0606 | 0.0462 | 0.0477 |

Note: Sample size n = 100.
Table A4. Bayesian estimates of regression coefficients in the structural equation with complete data.
| Par | τ | N(0, 0.3) Bias | RMS | lnN(0, 0.25) Bias | RMS | Beta(0.5, 0.5) Bias | RMS | 0.3χ²(3) Bias | RMS |
|---|---|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0131 | 0.0196 | 0.0090 | 0.0099 | 0.0245 | 0.0261 | 0.0050 | 0.0076 |
| | 0.5 | 0.0021 | 0.0128 | 0.0076 | 0.0082 | 0.0191 | 0.0199 | 0.0080 | 0.0090 |
| | 0.9 | 0.0056 | 0.0173 | 0.0126 | 0.0145 | 0.0162 | 0.0186 | 0.0101 | 0.0147 |
| $\gamma_{1\tau}$ | 0.1 | 0.0490 | 0.0510 | −0.0471 | 0.0473 | −0.0306 | 0.0314 | 0.0229 | 0.0237 |
| | 0.5 | 0.0060 | 0.0144 | −0.0114 | 0.0119 | −0.0643 | 0.0645 | −0.0297 | 0.0300 |
| | 0.9 | −0.0211 | 0.0263 | 0.0470 | 0.0489 | 0.0572 | 0.0620 | 0.0074 | 0.0176 |
| $\gamma_{2\tau}$ | 0.1 | 0.0368 | 0.0396 | −0.0486 | 0.0489 | −0.0360 | 0.0368 | 0.0218 | 0.0225 |
| | 0.5 | 0.0004 | 0.0131 | −0.0143 | 0.0147 | −0.0717 | 0.0719 | −0.0221 | 0.0224 |
| | 0.9 | −0.0192 | 0.0246 | 0.0397 | 0.0418 | 0.0318 | 0.0450 | 0.0228 | 0.0278 |
| $\gamma_{3\tau}$ | 0.1 | 0.0058 | 0.0141 | 0.0350 | 0.0354 | 0.0166 | 0.0182 | 0.0300 | 0.0304 |
| | 0.5 | 0.0013 | 0.0111 | −0.0509 | 0.0511 | −0.0591 | 0.0593 | −0.0539 | 0.0541 |
| | 0.9 | −0.0016 | 0.0146 | 0.0024 | 0.0122 | −0.0280 | 0.0311 | −0.0354 | 0.0389 |
| $\gamma_{4\tau}$ | 0.1 | −0.0297 | 0.0332 | 0.0301 | 0.0312 | −0.0165 | 0.0196 | 0.0266 | 0.0280 |
| | 0.5 | 0.0026 | 0.0105 | −0.0258 | 0.0265 | −0.0171 | 0.0190 | −0.0100 | 0.0116 |
| | 0.9 | 0.0481 | 0.0501 | −0.0027 | 0.0208 | −0.0434 | 0.0528 | 0.0582 | 0.0630 |
| $\gamma_{5\tau}$ | 0.1 | −0.0363 | 0.0390 | 0.0347 | 0.0357 | 0.0011 | 0.0109 | 0.0166 | 0.0188 |
| | 0.5 | 0.0016 | 0.0114 | −0.0217 | 0.0224 | −0.0007 | 0.0079 | −0.0081 | 0.0099 |
| | 0.9 | 0.0447 | 0.0468 | 0.0083 | 0.0191 | −0.0082 | 0.0459 | 0.0549 | 0.0605 |

Note: Sample size n = 300.
Table A5. Bayesian estimates of other parameters with complete data when ε i k , δ i Beta ( 0.5 , 0.5 ) .
| Par | τ = 0.1 Bias | RMS | τ = 0.5 Bias | RMS | τ = 0.9 Bias | RMS |
|---|---|---|---|---|---|---|
| $a_1$ | 0.0028 | 0.0037 | 0.0028 | 0.0037 | 0.0029 | 0.0037 |
| $a_2$ | 0.0013 | 0.0119 | −0.0007 | 0.0112 | −0.0024 | 0.0103 |
| $a_3$ | 0.0000 | 0.0120 | −0.0020 | 0.0111 | −0.0038 | 0.0111 |
| $a_4$ | 0.0285 | 0.0295 | 0.0285 | 0.0294 | 0.0281 | 0.0290 |
| $a_5$ | 0.0158 | 0.0207 | 0.0156 | 0.0210 | 0.0142 | 0.0205 |
| $a_6$ | 0.0169 | 0.0219 | 0.0169 | 0.0219 | 0.0151 | 0.0211 |
| $a_7$ | −0.0193 | 0.0211 | −0.0191 | 0.0209 | −0.0191 | 0.0209 |
| $a_8$ | −0.0063 | 0.0159 | −0.0055 | 0.0156 | −0.0010 | 0.0161 |
| $a_9$ | −0.0028 | 0.0140 | −0.0022 | 0.0142 | 0.0021 | 0.0145 |
| $\lambda_{21}$ | −0.0596 | 0.0609 | −0.0445 | 0.0458 | 0.0393 | 0.0448 |
| $\lambda_{31}$ | −0.0603 | 0.0616 | −0.0452 | 0.0461 | 0.0384 | 0.0438 |
| $\lambda_{52}$ | 0.0571 | 0.0584 | 0.0179 | 0.0224 | 0.0192 | 0.0254 |
| $\lambda_{62}$ | 0.0594 | 0.0608 | 0.0192 | 0.0237 | 0.0210 | 0.0268 |
| $\lambda_{83}$ | 0.0369 | 0.0395 | 0.0022 | 0.0140 | −0.0034 | 0.0173 |
| $\lambda_{93}$ | 0.0386 | 0.0410 | 0.0030 | 0.0145 | −0.0030 | 0.0173 |
| $\phi_{11}$ | −0.0384 | 0.0496 | −0.0113 | 0.0203 | 0.0396 | 0.0603 |
| $\phi_{12}$ | 0.0731 | 0.0751 | 0.0512 | 0.0527 | 0.0281 | 0.0451 |
| $\phi_{22}$ | 0.0140 | 0.0299 | −0.0350 | 0.0397 | 0.0147 | 0.0388 |

Note: Sample size n = 300.
Table A6. Bayesian estimates of other parameters with missing data when ε i k , δ i N ( 0 , 0.3 ) .
| Par | τ = 0.1 Bias | RMS | τ = 0.5 Bias | RMS | τ = 0.9 Bias | RMS |
|---|---|---|---|---|---|---|
| $a_1$ | 0.0014 | 0.0028 | 0.0011 | 0.0023 | 0.0010 | 0.0025 |
| $a_2$ | −0.0047 | 0.0121 | −0.0017 | 0.0107 | 0.0015 | 0.0106 |
| $a_3$ | −0.0010 | 0.0108 | 0.0025 | 0.0109 | 0.0038 | 0.0112 |
| $a_4$ | −0.0106 | 0.0119 | −0.0110 | 0.0124 | −0.0102 | 0.0113 |
| $a_5$ | −0.0042 | 0.0114 | −0.0034 | 0.0116 | −0.0016 | 0.0115 |
| $a_6$ | −0.0061 | 0.0115 | −0.0052 | 0.0113 | −0.0054 | 0.0111 |
| $a_7$ | −0.0055 | 0.0077 | −0.0044 | 0.0072 | −0.0048 | 0.0074 |
| $a_8$ | −0.0049 | 0.0119 | −0.0031 | 0.0111 | −0.0009 | 0.0106 |
| $a_9$ | −0.0039 | 0.0119 | −0.0032 | 0.0125 | −0.0008 | 0.0116 |
| $\lambda_{21}$ | −0.0248 | 0.0268 | −0.0031 | 0.0109 | 0.0045 | 0.0117 |
| $\lambda_{31}$ | −0.0262 | 0.0287 | −0.0047 | 0.0121 | 0.0034 | 0.0119 |
| $\lambda_{52}$ | 0.0155 | 0.0180 | 0.0000 | 0.0104 | −0.0144 | 0.0179 |
| $\lambda_{62}$ | 0.0172 | 0.0200 | −0.0020 | 0.0098 | −0.0128 | 0.0167 |
| $\lambda_{83}$ | 0.0146 | 0.0174 | −0.0006 | 0.0103 | −0.0141 | 0.0182 |
| $\lambda_{93}$ | 0.0115 | 0.0142 | −0.0020 | 0.0101 | −0.0186 | 0.0215 |
| $\phi_{11}$ | 0.0190 | 0.0587 | 0.0767 | 0.0947 | −0.0039 | 0.0591 |
| $\phi_{12}$ | −0.0314 | 0.0445 | −0.0418 | 0.0563 | −0.0385 | 0.0546 |
| $\phi_{22}$ | −0.0422 | 0.0610 | −0.0221 | 0.0561 | −0.0456 | 0.0655 |

Note: Sample size n = 100; missing rate = 30%.
Table A7. Bayesian estimates of regression coefficients in the structural equation with missing data.
Distribution N(0, 0.3):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0098 | 0.0176 | 0.0099 | 0.0156 | 0.0095 | 0.0165 |
| | 0.5 | −0.0023 | 0.0128 | −0.0028 | 0.0122 | −0.0035 | 0.0105 |
| | 0.9 | −0.0007 | 0.0151 | −0.0061 | 0.0154 | −0.0064 | 0.0158 |
| $\gamma_{1\tau}$ | 0.1 | 0.0488 | 0.0507 | 0.0469 | 0.0488 | 0.0456 | 0.0472 |
| | 0.5 | 0.0063 | 0.0128 | 0.0046 | 0.0119 | 0.0047 | 0.0117 |
| | 0.9 | −0.0130 | 0.0189 | −0.0230 | 0.0274 | −0.0215 | 0.0250 |
| $\gamma_{2\tau}$ | 0.1 | 0.0389 | 0.0411 | 0.0364 | 0.0387 | 0.0357 | 0.0378 |
| | 0.5 | 0.0009 | 0.0103 | 0.0012 | 0.0107 | 0.0011 | 0.0091 |
| | 0.9 | −0.0121 | 0.0161 | −0.0203 | 0.0241 | −0.0204 | 0.0239 |
| $\gamma_{3\tau}$ | 0.1 | 0.0072 | 0.0135 | 0.0067 | 0.0121 | 0.0048 | 0.0101 |
| | 0.5 | 0.0020 | 0.0100 | 0.0022 | 0.0084 | 0.0020 | 0.0092 |
| | 0.9 | 0.0039 | 0.0129 | −0.0026 | 0.0116 | −0.0022 | 0.0109 |
| $\gamma_{4\tau}$ | 0.1 | −0.0197 | 0.0241 | −0.0083 | 0.0158 | −0.0047 | 0.0151 |
| | 0.5 | 0.0032 | 0.0115 | 0.0042 | 0.0117 | 0.0036 | 0.0111 |
| | 0.9 | 0.0931 | 0.0939 | 0.0320 | 0.0352 | 0.0290 | 0.0318 |
| $\gamma_{5\tau}$ | 0.1 | −0.0155 | 0.0213 | −0.0132 | 0.0199 | −0.0074 | 0.0155 |
| | 0.5 | 0.0014 | 0.0121 | 0.0005 | 0.0106 | 0.0005 | 0.0105 |
| | 0.9 | 0.0907 | 0.0917 | 0.0254 | 0.0286 | 0.0226 | 0.0263 |
Table A8. Bayesian estimates of regression coefficients in the structural equation with missing data.
Distribution Beta(0.5, 0.5):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0170 | 0.0190 | 0.0160 | 0.0179 | 0.0137 | 0.0157 |
| | 0.5 | 0.0151 | 0.0162 | 0.0133 | 0.0146 | 0.0136 | 0.0146 |
| | 0.9 | 0.0100 | 0.0145 | 0.0075 | 0.0129 | 0.0069 | 0.0137 |
| $\gamma_{1\tau}$ | 0.1 | −0.0111 | 0.0132 | −0.0096 | 0.0134 | −0.0044 | 0.0109 |
| | 0.5 | −0.0605 | 0.0607 | −0.0595 | 0.0597 | 0.0398 | 0.0402 |
| | 0.9 | 0.0512 | 0.0537 | 0.0555 | 0.0585 | 0.0584 | 0.0608 |
| $\gamma_{2\tau}$ | 0.1 | −0.0154 | 0.0178 | −0.0109 | 0.0146 | −0.0060 | 0.0119 |
| | 0.5 | −0.0669 | 0.0671 | −0.0646 | 0.0650 | 0.0360 | 0.0364 |
| | 0.9 | 0.0281 | 0.0328 | 0.0404 | 0.0439 | 0.0501 | 0.0525 |
| $\gamma_{3\tau}$ | 0.1 | 0.0401 | 0.0407 | 0.0410 | 0.0417 | 0.0437 | 0.0448 |
| | 0.5 | −0.0488 | 0.0490 | −0.0467 | 0.0470 | −0.0448 | 0.0453 |
| | 0.9 | −0.0305 | 0.0336 | −0.0274 | 0.0307 | −0.0236 | 0.0275 |
| $\gamma_{4\tau}$ | 0.1 | 0.0124 | 0.0173 | 0.0138 | 0.0193 | 0.0187 | 0.0243 |
| | 0.5 | −0.0051 | 0.0099 | −0.0009 | 0.0089 | −0.0048 | 0.0112 |
| | 0.9 | −0.0236 | 0.0343 | −0.0354 | 0.0427 | −0.0422 | 0.0509 |
| $\gamma_{5\tau}$ | 0.1 | 0.0278 | 0.0302 | 0.0324 | 0.0345 | 0.0329 | 0.0353 |
| | 0.5 | 0.0123 | 0.0150 | 0.0116 | 0.0148 | 0.0066 | 0.0122 |
| | 0.9 | 0.0038 | 0.0291 | −0.0148 | 0.0298 | −0.0322 | 0.0402 |

Distribution 0.3χ²(3):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0031 | 0.0061 | 0.0023 | 0.0062 | 0.0011 | 0.0052 |
| | 0.5 | 0.0064 | 0.0076 | 0.0061 | 0.0071 | 0.0046 | 0.0059 |
| | 0.9 | 0.0126 | 0.0155 | 0.0099 | 0.0130 | 0.0093 | 0.0135 |
| $\gamma_{1\tau}$ | 0.1 | 0.0391 | 0.0396 | 0.0416 | 0.0421 | 0.0457 | 0.0462 |
| | 0.5 | −0.0213 | 0.0218 | −0.0211 | 0.0216 | −0.0190 | 0.0194 |
| | 0.9 | 0.0236 | 0.0280 | 0.0233 | 0.0263 | 0.0245 | 0.0270 |
| $\gamma_{2\tau}$ | 0.1 | 0.0391 | 0.0395 | 0.0411 | 0.0416 | 0.0448 | 0.0452 |
| | 0.5 | −0.0156 | 0.0161 | −0.0164 | 0.0169 | −0.0155 | 0.0162 |
| | 0.9 | 0.0372 | 0.0393 | 0.0333 | 0.0350 | 0.0359 | 0.0383 |
| $\gamma_{3\tau}$ | 0.1 | 0.0510 | 0.0512 | 0.0521 | 0.0524 | 0.0555 | 0.0558 |
| | 0.5 | −0.0423 | 0.0425 | −0.0400 | 0.0403 | −0.0377 | 0.0381 |
| | 0.9 | −0.0154 | 0.0205 | −0.0064 | 0.0131 | −0.0005 | 0.0124 |
| $\gamma_{4\tau}$ | 0.1 | 0.0520 | 0.0527 | 0.0524 | 0.0533 | 0.0590 | 0.0601 |
| | 0.5 | 0.0086 | 0.0110 | 0.0101 | 0.0126 | 0.0162 | 0.0178 |
| | 0.9 | 0.0449 | 0.0498 | 0.0275 | 0.0340 | 0.0234 | 0.0316 |
| $\gamma_{5\tau}$ | 0.1 | 0.0456 | 0.0464 | 0.0474 | 0.0485 | 0.0508 | 0.0517 |
| | 0.5 | 0.0076 | 0.0106 | 0.0083 | 0.0115 | 0.0118 | 0.0140 |
| | 0.9 | 0.0465 | 0.0510 | 0.0311 | 0.0379 | 0.0256 | 0.0321 |

Note: Sample size n = 300.
Table A9. Bayesian estimates of other parameters with missing data when ε i k , δ i 0.3 χ 2 ( 3 ) .
| Par | τ = 0.1 Bias | RMS | τ = 0.5 Bias | RMS | τ = 0.9 Bias | RMS |
|---|---|---|---|---|---|---|
| $a_1$ | −0.0048 | 0.0051 | −0.0051 | 0.0054 | −0.0055 | 0.0058 |
| $a_2$ | −0.0070 | 0.0119 | −0.0086 | 0.0123 | −0.0146 | 0.0162 |
| $a_3$ | −0.0060 | 0.0109 | −0.0067 | 0.0111 | −0.0129 | 0.0151 |
| $a_4$ | −0.0004 | 0.0076 | −0.0007 | 0.0066 | 0.0007 | 0.0070 |
| $a_5$ | −0.0023 | 0.0093 | −0.0021 | 0.0099 | 0.0021 | 0.0104 |
| $a_6$ | −0.0012 | 0.0079 | −0.0008 | 0.0078 | 0.0020 | 0.0098 |
| $a_7$ | 0.0042 | 0.0077 | 0.0043 | 0.0075 | 0.0050 | 0.0082 |
| $a_8$ | 0.0015 | 0.0093 | 0.0016 | 0.0107 | 0.0051 | 0.0126 |
| $a_9$ | 0.0015 | 0.0087 | 0.0012 | 0.0094 | 0.0046 | 0.0103 |
| $\lambda_{21}$ | 0.0236 | 0.0287 | 0.0071 | 0.0148 | −0.0343 | 0.0380 |
| $\lambda_{31}$ | 0.0224 | 0.0260 | 0.0036 | 0.0128 | −0.0370 | 0.0399 |
| $\lambda_{52}$ | −0.0285 | 0.0315 | −0.0310 | 0.0337 | 0.0340 | 0.0365 |
| $\lambda_{62}$ | −0.0285 | 0.0316 | −0.0305 | 0.0337 | 0.0341 | 0.0371 |
| $\lambda_{83}$ | −0.0208 | 0.0245 | −0.0258 | 0.0288 | 0.0358 | 0.0382 |
| $\lambda_{93}$ | −0.0214 | 0.0248 | −0.0232 | 0.0276 | 0.0342 | 0.0359 |
| $\phi_{11}$ | −0.0238 | 0.0691 | −0.0307 | 0.0545 | −0.0597 | 0.0828 |
| $\phi_{12}$ | −0.0538 | 0.0633 | −0.0435 | 0.0491 | 0.0682 | 0.0748 |
| $\phi_{22}$ | 0.0083 | 0.0646 | 0.0174 | 0.0457 | 0.0020 | 0.0517 |

Note: Sample size n = 300; missing rate = 40%.

References

  1. Koenker, R.; Bassett, G., Jr. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar] [CrossRef]
  2. Wang, Y.; Feng, X.N.; Song, X.Y. Bayesian quantile structural equation models. Struct. Equ. Model. 2016, 23, 246–258. [Google Scholar] [CrossRef]
  3. Feng, X.N.; Wang, Y.; Lu, B.; Song, X.Y. Bayesian regularized quantile structural equation models. J. Multivar. Anal. 2017, 154, 234–248. [Google Scholar] [CrossRef]
  4. Wang, Z.Q. Bayesian Statistical Inference for Quantile Regression Models. Ph.D. Thesis, Yunnan University, Kunming, China, 2019. [Google Scholar]
  5. Xue, J. Bayesian Quantile Factor Models and Their Extensions. Ph.D. Thesis, Lanzhou University of Finance and Economics, Lanzhou, China, 2023. [Google Scholar]
  6. Cheng, H. Quantile varying-coefficient structural equation model. Stat. Methods Appl. 2023, 32, 1439–1475. [Google Scholar] [CrossRef]
  7. Lee, S.Y.; Tang, N.S. Bayesian analysis of nonlinear structural equation models with nonignorable missing data. Psychometrika 2006, 71, 541–564. [Google Scholar] [CrossRef]
  8. Cai, J.H.; Lee, S.Y.; Song, X.Y. Bayesian analysis of nonlinear structural equation models with mixed continuous, ordered and unordered categorical, and nonignorable missing data. Stat. Interface 2008, 1, 99–114. [Google Scholar] [CrossRef]
  9. Lee, S.Y.; Song, X.Y. On Bayesian estimation and model comparison of an integrated structural equation model. Comput. Stat. Data Anal. 2008, 52, 4814–4827. [Google Scholar] [CrossRef]
  10. Cai, J.H.; Song, X.Y. Bayesian analysis of mixtures in structural equation models with non-ignorable missing data. Br. J. Math. Stat. Psychol. 2010, 63, 491–508. [Google Scholar] [CrossRef]
  11. Cai, J.H.; Song, X.Y.; Hser, Y.I. A Bayesian analysis of mixture structural equation models with non-ignorable missing responses and covariates. Stat. Med. 2010, 29, 1861–1874. [Google Scholar] [CrossRef] [PubMed]
  12. Lee, S.Y. Structural Equation Modeling: A Bayesian Approach; John Wiley & Sons: Hoboken, NJ, USA, 2007. [Google Scholar]
  13. Lee, S.Y.; Song, X.Y. Basic and Advanced Bayesian Structural Equation Modeling: With Applications in the Medical and Behavioral Sciences; John Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar]
  14. Yu, K.; Moyeed, R.A. Bayesian quantile regression. Stat. Probab. Lett. 2001, 54, 437–447. [Google Scholar] [CrossRef]
  15. Reed, C.; Yu, K. A Partially Collapsed Gibbs Sampler for Bayesian Quantile Regression; Technical report; Department of Mathematical Sciences, Brunel University: Uxbridge, UK, 2009. [Google Scholar]
  16. Kozumi, H.; Kobayashi, G. Gibbs sampling methods for Bayesian quantile regression. J. Stat. Comput. Simul. 2011, 81, 1565–1578. [Google Scholar] [CrossRef]
  17. Ibrahim, J.G.; Chen, M.H.; Lipsitz, S.R. Missing responses in generalised linear mixed models when the missing data mechanism is nonignorable. Biometrika 2001, 88, 551–564. [Google Scholar] [CrossRef]
  18. Tanner, M.A.; Wong, W.H. The calculation of posterior distributions by data augmentation. J. Am. Stat. Assoc. 1987, 82, 528–540. [Google Scholar] [CrossRef]
  19. Geyer, C.J. Practical Markov chain Monte Carlo. Stat. Sci. 1992, 7, 473–483. [Google Scholar] [CrossRef]
  20. Gelman, A.; Meng, X.L.; Stern, H. Posterior predictive assessment of model fitness via realized discrepancies. Stat. Sin. 1996, 6, 733–760. [Google Scholar]
  21. Bayarri, M.; Berger, J.O. P values for composite null models. J. Am. Stat. Assoc. 2000, 95, 1127–1142. [Google Scholar]
  22. Zhi, Z. Research on the Growth of Chinese Agricultural Listed Companies. Master’s Thesis, Nanjing Agricultural University, Nanjing, China, 2023. [Google Scholar]
  23. Wang, Q.; He, Y. Analysis of main factors affecting the growth of Chinese listed companies. Stat. Decis. 2005, 61–63. [Google Scholar]
  24. Kou, P. Analysis of Corporate Growth Based on Nonlinear Structural Equation Modeling. Master’s Thesis, Kunming University of Science and Technology, Kunming, China, 2013. [Google Scholar]
  25. Du, L. Research on Enterprise Growth Evaluation Model and Its Application: An Empirical Test Based on 66 Listed Companies in Henan Province. Financ. Manag. Res. 2025, 97–102. [Google Scholar]
  26. Shen, H.; Wu, Q. Analysis of Factors Affecting the Growth of Small and Medium-Sized Enterprises: An Empirical Study Based on Panel Data of Small and Medium-Sized Board Listed Companies. J. Financ. Dev. Res. 2010, 66–70. [Google Scholar]
  27. Xu, Y.; Guo, H. Analysis on the Growth of Listed Companies on China’s GEM. Coop. Econ. Sci. 2016, 60–62. [Google Scholar]
  28. Dang, K.D.; Maestrini, L. Fitting structural equation models via variational approximations. Struct. Equ. Model. 2022, 29, 839–853. [Google Scholar] [CrossRef]
  29. Gelman, A.; Roberts, G.O.; Gilks, W.R. Efficient Metropolis jumping rules. Bayesian Stat. 5 1996, 5, 599–608. [Google Scholar]
  30. Lindley, D.V.; Smith, A.F. Bayes estimates for the linear model. J. R. Stat. Soc. B Stat. Methodol. 1972, 34, 1–18. [Google Scholar] [CrossRef]
Figure 1. EPSR convergence trend graph for all parameters.
Figure 2. Estimated residual plots: (a) ε ^ i 1 , (b) ε ^ i 2 , (c) ε ^ i 3 , and (d) δ ^ i .
Figure 3. Plots of estimated residuals ε ^ i 1 versus (a) ξ ^ i 1 , (b) ξ ^ i 2 , and (c) η ^ i .
Figure 4. Plots of estimated residuals δ ^ i versus (a) ξ ^ i 1 and (b) ξ ^ i 2 .
Figure 5. Path diagram of the NQSEM model in financial data. 1 * indicates that this variable is fixed.
Table 1. Bayesian estimates of regression coefficients in the structural equation with missing data.
Distribution N(0, 0.3):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0091 | 0.0138 | 0.0085 | 0.0124 | 0.0074 | 0.0114 |
| | 0.5 | 0.0007 | 0.0081 | 0.0004 | 0.0075 | −0.0003 | 0.0071 |
| | 0.9 | −0.0041 | 0.0102 | −0.0041 | 0.0101 | −0.0045 | 0.0084 |
| $\gamma_{1\tau}$ | 0.1 | 0.0052 | 0.0082 | 0.0035 | 0.0068 | 0.0043 | 0.0071 |
| | 0.5 | −0.0002 | 0.0059 | −0.0014 | 0.0062 | −0.0012 | 0.0059 |
| | 0.9 | −0.0011 | 0.0069 | −0.0021 | 0.0069 | −0.0020 | 0.0070 |
| $\gamma_{2\tau}$ | 0.1 | 0.0137 | 0.0153 | 0.0130 | 0.0146 | 0.0126 | 0.0137 |
| | 0.5 | −0.0003 | 0.0064 | −0.0004 | 0.0056 | 0.0009 | 0.0058 |
| | 0.9 | −0.0084 | 0.0109 | −0.0080 | 0.0100 | −0.0065 | 0.0086 |
| $\gamma_{3\tau}$ | 0.1 | 0.0012 | 0.0053 | 0.0026 | 0.0049 | 0.0019 | 0.0049 |
| | 0.5 | −0.0004 | 0.0050 | −0.0003 | 0.0042 | 0.0002 | 0.0045 |
| | 0.9 | −0.0016 | 0.0065 | −0.0017 | 0.0055 | −0.0001 | 0.0049 |
| $\gamma_{4\tau}$ | 0.1 | −0.0367 | 0.0376 | −0.0347 | 0.0355 | −0.0316 | 0.0326 |
| | 0.5 | 0.0021 | 0.0083 | 0.0025 | 0.0079 | 0.0046 | 0.0077 |
| | 0.9 | 0.0441 | 0.0451 | 0.0449 | 0.0456 | 0.0444 | 0.0450 |
| $\gamma_{5\tau}$ | 0.1 | −0.0432 | 0.0437 | −0.0414 | 0.0423 | −0.0382 | 0.0390 |
| | 0.5 | −0.0019 | 0.0083 | −0.0018 | 0.0077 | −0.0006 | 0.0069 |
| | 0.9 | 0.0428 | 0.0435 | 0.0421 | 0.0429 | 0.0417 | 0.0422 |

Distribution Beta(0.5, 0.5):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | 0.0019 | 0.0060 | 0.0021 | 0.0051 | 0.0018 | 0.0052 |
| | 0.5 | −0.0008 | 0.0041 | −0.0008 | 0.0038 | −0.0008 | 0.0038 |
| | 0.9 | −0.0048 | 0.0073 | −0.0041 | 0.0064 | −0.0039 | 0.0063 |
| $\gamma_{1\tau}$ | 0.1 | −0.0287 | 0.0291 | −0.0277 | 0.0281 | −0.0270 | 0.0275 |
| | 0.5 | −0.0189 | 0.0193 | −0.0178 | 0.0182 | −0.0170 | 0.0174 |
| | 0.9 | −0.0076 | 0.0099 | −0.0058 | 0.0089 | −0.0051 | 0.0085 |
| $\gamma_{2\tau}$ | 0.1 | −0.0298 | 0.0301 | −0.0293 | 0.0298 | −0.0282 | 0.0287 |
| | 0.5 | −0.0189 | 0.0193 | −0.0187 | 0.0191 | −0.0185 | 0.0188 |
| | 0.9 | −0.0108 | 0.0126 | −0.0102 | 0.0124 | −0.0078 | 0.0099 |
| $\gamma_{3\tau}$ | 0.1 | −0.0179 | 0.0182 | −0.0184 | 0.0189 | −0.0172 | 0.0177 |
| | 0.5 | −0.0208 | 0.0211 | −0.0207 | 0.0209 | −0.0199 | 0.0202 |
| | 0.9 | −0.0240 | 0.0250 | −0.0220 | 0.0232 | −0.0212 | 0.0226 |
| $\gamma_{4\tau}$ | 0.1 | −0.0208 | 0.0215 | −0.0209 | 0.0219 | −0.0209 | 0.0219 |
| | 0.5 | 0.0066 | 0.0082 | 0.0065 | 0.0080 | 0.0068 | 0.0087 |
| | 0.9 | 0.0662 | 0.0670 | 0.0581 | 0.0592 | 0.0536 | 0.0548 |
| $\gamma_{5\tau}$ | 0.1 | −0.0254 | 0.0261 | −0.0253 | 0.0262 | −0.0252 | 0.0261 |
| | 0.5 | 0.0002 | 0.0052 | 0.0004 | 0.0054 | 0.0009 | 0.0050 |
| | 0.9 | 0.0535 | 0.0545 | 0.0485 | 0.0495 | 0.0453 | 0.0463 |

Distribution 0.3χ²(3):

| Par | τ | M1 Bias | RMS | M2 Bias | RMS | M3 Bias | RMS |
|---|---|---|---|---|---|---|---|
| $b_{1\tau}$ | 0.1 | −0.0011 | 0.0033 | −0.0005 | 0.0035 | −0.0008 | 0.0034 |
| | 0.5 | 0.0028 | 0.0039 | 0.0032 | 0.0041 | 0.0029 | 0.0040 |
| | 0.9 | 0.0070 | 0.0084 | 0.0065 | 0.0076 | 0.0062 | 0.0077 |
| $\gamma_{1\tau}$ | 0.1 | −0.0172 | 0.0175 | −0.0160 | 0.0163 | −0.0138 | 0.0142 |
| | 0.5 | −0.0062 | 0.0068 | −0.0061 | 0.0066 | −0.0051 | 0.0059 |
| | 0.9 | 0.0103 | 0.0121 | 0.0095 | 0.0109 | 0.0104 | 0.0119 |
| $\gamma_{2\tau}$ | 0.1 | −0.0188 | 0.0190 | −0.0178 | 0.0180 | −0.0161 | 0.0164 |
| | 0.5 | −0.0078 | 0.0082 | −0.0076 | 0.0081 | −0.0060 | 0.0066 |
| | 0.9 | 0.0070 | 0.0094 | 0.0075 | 0.0094 | 0.0072 | 0.0090 |
| $\gamma_{3\tau}$ | 0.1 | −0.0156 | 0.0157 | −0.0145 | 0.0148 | −0.0133 | 0.0136 |
| | 0.5 | −0.0130 | 0.0132 | −0.0127 | 0.0129 | −0.0105 | 0.0107 |
| | 0.9 | −0.0039 | 0.0059 | −0.0026 | 0.0055 | −0.0029 | 0.0052 |
| $\gamma_{4\tau}$ | 0.1 | −0.0176 | 0.0182 | −0.0155 | 0.0163 | −0.0127 | 0.0136 |
| | 0.5 | 0.0016 | 0.0046 | 0.0013 | 0.0045 | 0.0035 | 0.0054 |
| | 0.9 | 0.0436 | 0.0447 | 0.0381 | 0.0390 | 0.0371 | 0.0380 |
| $\gamma_{5\tau}$ | 0.1 | −0.0179 | 0.0185 | −0.0166 | 0.0170 | −0.0143 | 0.0150 |
| | 0.5 | 0.0004 | 0.0043 | 0.0013 | 0.0040 | 0.0038 | 0.0054 |
| | 0.9 | 0.0405 | 0.0415 | 0.0360 | 0.0370 | 0.0339 | 0.0346 |

Note: Sample size n = 100.
Table 2. Variables in the NQSEM for analyzing determinants of company growth.
| Variables | Indicators | Details |
|---|---|---|
| Growth ($\eta$) | Operating income growth rate ($y_1$) | (Current turnover − Previous turnover)/Previous turnover |
| | Operating profit growth rate ($y_2$) | (Current operating profit − Previous operating profit)/Previous operating profit |
| | Net profit growth rate ($y_3$) | (Current net profit − Previous net profit)/Previous net profit |
| Profitability ($\xi_{i1}$) | Weighted ROE ($y_4$) | Net profit/Weighted average net assets |
| | Main business profit margin ($y_5$) | Main business profit/Main business income |
| | Net profit margin ($y_6$) | Net profit/Operating income |
| | Gross profit margin ($y_7$) | (Operating income − Operating cost)/Operating income |
| Solvency ($\xi_{i2}$) | Quick ratio ($y_8$) | (Current assets − Inventories)/Current liabilities |
| | Asset–liability ratio ($y_9$) | Total liabilities/Total assets |
Table 3. Estimation results of parameters under different quantiles in the case study.
Factor loadings:

| Parameter | Estimate |
|---|---|
| $\lambda_{21}$ | 0.5131 |
| $\lambda_{31}$ | 0.4750 |
| $\lambda_{52}$ | 0.4960 |
| $\lambda_{62}$ | 0.6187 |
| $\lambda_{72}$ | 0.3723 |
| $\lambda_{93}$ | 0.3738 |

Structural equation coefficients:

| Parameter | τ = 0.1 | τ = 0.3 | τ = 0.5 | τ = 0.7 | τ = 0.9 |
|---|---|---|---|---|---|
| $\gamma_{1\tau}$ | 0.3939 | 0.4816 | 0.6134 | 0.7985 | 0.9144 |
| $\gamma_{2\tau}$ | 0.4271 | 0.3587 | 0.3298 | 0.3139 | 0.3154 |
| $\gamma_{3\tau}$ | 0.2435 | 0.2395 | 0.2801 | 0.3062 | 0.4348 |
| $\gamma_{4\tau}$ | −0.5631 | −0.2124 | 0.0835 | 0.3630 | 0.8079 |
| $\gamma_{5\tau}$ | −0.0396 | 0.0473 | 0.3065 | 0.5137 | 0.7218 |