Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex

Alwadeai, Abdulghani; Bouzebda, Salim; Khardani, Salah

doi:10.3390/math14081242

Open AccessArticle

Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex

by

Abdulghani Alwadeai

¹,

Salim Bouzebda

^2,*

and

Salah Khardani

¹

Laboratoire de Modélisation Mathématiques, Statistiques et Processus Stochastique, University of Tunis El Manar, Tunis 1068, Tunisia

²

Laboratoire de Mathématiques Appliquées de Compiègne, Université de Technologie de Compiègne, Alliance Sorbonne Universités, 60203 Compiègne, France

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(8), 1242; https://doi.org/10.3390/math14081242

Submission received: 5 March 2026 / Revised: 1 April 2026 / Accepted: 4 April 2026 / Published: 8 April 2026

(This article belongs to the Section D1: Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

This article develops a boundary-adaptive nonparametric methodology for estimating the geometric conditional quantiles of a multivariate response when the conditioning covariate is supported on the simplex—an important case, as it is the natural domain of compositional data. The statistical difficulty addressed here is twofold. First, geometric conditional quantiles for multivariate responses must be defined and estimated through a genuinely directional and convex framework rather than through any scalar ordering. Second, when the covariate is compositional or otherwise simplex-constrained, conventional symmetric kernel procedures suffer from intrinsic support mismatch and severe boundary distortion, thereby compromising both estimation accuracy and inferential validity near faces and edges of the simplex. The method proposed in this paper is designed precisely to overcome this combined obstacle. Our main innovation consists in embedding the spatial quantile formalism of Chaudhuri within a Dirichlet–Kernel smoothing scheme whose shape parameters depend deterministically on the evaluation point. This produces a convex M-estimator that respects the simplex geometry exactly, automatically adapts its local shape to the position of the target point, and removes the need for artificial boundary corrections. To the best of our knowledge, this is the first contribution to provide a complete asymptotic treatment of geometric conditional quantile estimation under simplex-supported covariates with location-adaptive asymmetric kernels. We establish a Bahadur-type linear representation with an explicit negligible remainder, from which we derive refined asymptotic bias and variance expansions. The variance analysis reveals a distinctive geometric phenomenon: each coordinate direction approaching the simplex boundary induces an additional

b^{- 1 / 2}

inflation factor, so that the variance at a face of codimension

| J |

scales as

n^{- 1} b^{- (s + | J |) / 2}

. We further obtain the asymptotic mean squared error, an explicit optimal bandwidth rate, asymptotic normality under the nonstandard normalization

n^{1 / 2} b^{- s / 4}

, and consistent plug-in covariance estimators yielding valid confidence ellipsoids. Numerical experiments and a real-data illustration based on the GEMAS data confirm the practical merit of the approach, especially in boundary regions where classical methods are known to deteriorate.

Keywords:

asymptotic normality; bahadur representation; boundary-adaptive smoothing; confidence ellipsoids; dirichlet kernel; geometric conditional quantile; multivariate quantile regression; nonparametric estimation; simplex support; spatial quantiles

MSC:

62G08; 60F05; 62G20; 62H12

1. Introduction

The problem of understanding how an s-dimensional covariate

X

modulates the conditional distribution of a d-dimensional response

Y

, based on an i.i.d. sample

{(X_{i}, Y_{i})}_{i = 1}^{n}

, lies at the core of contemporary statistics, econometrics, and statistical learning. While conditional means and conditional covariance structures provide useful summaries of location and dispersion, they remain intrinsically insufficient whenever the scientific objective concerns heterogeneity across the conditional distribution, directional extremality, tail behavior, or robust local features. In such settings, conditional quantiles furnish a far more informative statistical description. They encode distributional asymmetry, remain meaningful under heavy-tailed regimes, and are naturally aligned with decision-theoretic questions involving risk, stress, and distribution-sensitive prediction. In the scalar-response case, the theory of conditional quantiles is by now highly developed, with kernel, local polynomial, nearest-neighbor, and spline-based estimators enjoying a mature asymptotic theory. By contrast, the extension to multivariate responses remains mathematically delicate, primarily because there is no canonical total order on

R^{d}

.

A fruitful resolution of this difficulty is provided by the geometric viewpoint on multivariate quantiles. In the univariate case, the p-th quantile is characterized as the unique minimizer of a convex asymmetric absolute loss; see [1]. This variational formulation extends naturally to regression-type problems [2,3]. In higher dimension, however, the absence of an order structure requires a geometric replacement for scalar ranking. Two major lines of development have emerged in this direction: multivariate norm-based quantiles [4,5], and the theory of spatial, or geometric, quantiles introduced by Chaudhuri [6,7] and subsequently developed in several directions, including recent conditional and Bahadur-type formulations [8,9,10,11,12]. The spatial quantile framework is especially compelling because it indexes multivariate quantiles by a direction vector

u

in the open unit ball

B^{(d)} = {u \in R^{d} : ∥ u ∥ < 1},

thereby encoding, within a single object, both the magnitude of outlyingness and its geometric orientation. This directional parametrization confers a direct geometric interpretation upon conditional multivariate quantiles and makes the associated estimation problem amenable to convex analysis.

More precisely, for

u \in B^{(d)}

, define

Φ (u, l) = ∥ l ∥ + 〈 u, l 〉, l \in R^{d} .

(1)

The

u

-th geometric conditional quantile of

Y

given

X = x

is then defined as any minimizer of the conditional convex functional

Q (u ∣ x) \in arg min_{θ \in R^{d}} E \{Φ (u, Y - θ) - Φ (u, Y) | X = x\} .

This formulation, which goes back to the geometric quantile paradigm of [6,7], offers a coherent multivariate analogue of ordinary quantiles, while retaining convexity, directional interpretability, and robustness. Under absolute continuity assumptions, existence and uniqueness follow from strict convexity arguments; see, e.g., [4] (Remark 2.3). The conditional version is particularly attractive for multivariate response analysis, portfolio allocation, environmental monitoring, compositional systems, and spatio-temporal modelling, where one seeks a directional description of the entire conditional distribution rather than a single central tendency summary.

Yet the nonparametric estimation of

Q (u ∣ x)

becomes substantially more delicate when the support of the covariate is bounded. This is not a marginal technical complication but a structural issue. Classical symmetric kernel methods inevitably allocate mass outside the support, thereby generating boundary bias and distorting local smoothing near edges, faces, and corners. This challenge has been widely documented in the nonparametric literature on density estimation, regression, and conditional functionals over bounded supports [13,14,15,16,17,18,19,20,21,22,23,24]. A broad range of correction strategies has been proposed, including boundary kernels and local polynomial corrections [25,26]. Nevertheless, a conceptually cleaner alternative has emerged in the form of asymmetric kernels whose support coincides exactly with the underlying domain [27]. Such kernels are support-respecting by construction and exhibit location-dependent shape adaptation, thereby yielding an intrinsic form of boundary correction.

This philosophy is well established on the unit interval through beta kernels [28,29,30,31,32,33,34,35,36,37,38] and on bounded Euclidean domains using Bernstein-type techniques [39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56]. However, the simplex setting is fundamentally richer. When the covariate represents compositional proportions or relative allocations, it is naturally constrained to the simplex

S_{s, 1} = \{x \in {[0, 1]}^{s} : {∥ x ∥}_{1} \leq 1\},

whose geometry is qualitatively different from that of a Cartesian product domain. In that setting, not only must the smoothing device respect each coordinate boundary, but it must also incorporate the global

ℓ_{1}

-constraint. Dirichlet kernels constitute the natural support-adapted analogue of beta kernels on

S_{s, 1}

; see [57,58,59]. Their shape may be made to depend deterministically on the evaluation point, and this produces a genuinely geometry-aware smoothing scheme, with local dispersion and asymmetry automatically modulated by the position of the target point relative to the simplex boundary.

The present work is motivated precisely by this conjunction of two difficulties: the lack of a scalar order in multivariate conditional quantile analysis, and the geometric complexity induced by simplex-supported covariates. Our objective is to construct and analyze a nonparametric estimator of geometric conditional quantiles that is simultaneously: (i) faithful to the spatial-quantile paradigm; (ii) intrinsically compatible with the simplex support; (iii) asymptotically tractable under a full inferential theory; and (iv) computationally implementable in realistic multivariate settings. To this end, we combine the geometric quantile loss of [6,7] with Dirichlet kernel weighting on

S_{s, 1}

, using evaluation-point-dependent shape parameters

α = x / b + 1, β = (1 - ∥ x ∥_{1}) / b + 1 .

This yields a location-adaptive smoothing device that exactly respects the simplex geometry and produces a weighted convex M-estimator for the conditional geometric quantile.

The contribution of the paper is threefold. First, we formulate a Dirichlet–Kernel estimator of geometric conditional quantiles on the simplex and show that it provides a principled support-respecting analogue of the Nadaraya–Watson idea for multivariate directional quantile functionals. Second, we establish a full asymptotic theory, including a Bahadur-type linear expansion with explicit stochastic remainder, asymptotic bias and covariance expansions, mean squared error analysis, optimal bandwidth scaling, and asymptotic normality with covariance estimation. Third, our analysis uncovers a distinctive boundary phenomenon: the asymptotic variance is no longer governed solely by the ambient simplex dimension s, but also by the codimension of the face approached by the evaluation point. More precisely, each coordinate direction becoming asymptotically boundary-active contributes a multiplicative

b^{- 1 / 2}

inflation to the variance, leading to the regime

n^{- 1} b^{- (s + | J |) / 2}

near a face of codimension

| J |

. This reveals, in explicit form, how simplex geometry governs stochastic uncertainty in conditional quantile estimation.

These results position the present work at the intersection of several active literatures. Relative to the foundational theory of spatial quantiles [4,6,7], we address the genuinely conditional problem with nonparametric, covariate-dependent weighting. Relative to the extensive literature on asymmetric kernel smoothing and Bernstein-type approximation [29,30,54,59], we move beyond density and distribution estimation to a substantially more intricate geometric functional. Relative to recent work on conditional multivariate quantiles and Bahadur expansions [10,11,60], we introduce a support-adapted simplex-based smoothing mechanism and derive the resulting nonstandard inferential scaling. When

u = 0

, the estimator reduces to the conditional geometric median, thereby recovering, and substantially extending, the framework of [61]. At the same time, the full directional formulation permits inference not merely at the center of the conditional distribution, but across a continuum of oriented and increasingly extreme directions.

From a methodological standpoint, the framework is especially relevant for compositional and constrained-covariate applications. In environmental geochemistry, soil composition profiles are naturally represented by proportions or normalized concentrations. In econometrics and finance, portfolio weights and budget shares live on simplices. In spatial and biological applications, relative abundance vectors obey the same structural constraint. In all such settings, conventional kernel smoothing can be severely distorted near the boundary, whereas Dirichlet kernels yield a support-faithful alternative whose asymptotic behavior can be analyzed sharply. This is the principal conceptual message of the paper: boundary adaptivity is not merely a technical refinement, but an essential structural ingredient in multivariate conditional quantile estimation on constrained domains.

Organization

The remainder of the paper is organized as follows. Section 2 introduces the simplex setting, Dirichlet kernels, and the geometric conditional quantile functional, and it clarifies the weighted M-estimation principle underlying the estimator. Section 3 states the regularity conditions and develops the main asymptotic results, including the Bahadur representation, bias and variance expansions, mean squared error, optimal bandwidth, mean integrated absolute error, and asymptotic normality. Section 4 reports simulation experiments illustrating the finite-sample behavior of the estimator, the geometry of the confidence ellipsoids, and the directional quantile contours. Section 5 presents a real-data application to the GEMAS dataset [62], thereby illustrating the practical relevance of the methodology for compositional environmental covariates. Section 6 concludes with limitations and possible extensions, while Section 7 collects the technical proofs.

2. Setup and Definitions

We now formalize the geometric and probabilistic framework underlying the proposed estimator. We first recall the simplex geometry and the Dirichlet kernel construction, and then explain how these ingredients combine with the spatial-quantile loss to produce a weighted empirical M-estimator of the geometric conditional quantile. Let

S_{s, 1}

denote the s-dimensional simplex defined by

S_{s, 1} : = \{x \in {[0, 1]}^{s} : {∥ x ∥}_{1} \leq 1\},

with interior

Int (S_{s, 1}) : = \{x \in {(0, 1)}^{s} : {∥ x ∥}_{1} < 1\},

where

{∥ x ∥}_{1} : = \sum_{i = 1}^{s} | x_{i} |

and

s \in N^{*}

. For parameters

α_{1}, \dots, α_{s}, β > 0

, the density function of the Dirichlet

(α, β)

distribution is given by

K_{α, β} (x) : = \frac{Γ ({∥ α ∥}_{1} + β)}{Γ (β) \prod_{i = 1}^{s} Γ (α_{i})} {(1 - {∥ x ∥}_{1})}^{β - 1} \prod_{i = 1}^{s} x_{i}^{α_{i} - 1}, x \in S_{s, 1} .

For a detailed account, we refer the reader to [57] (Chapter 49) and [58]. We now clarify the theoretical basis of the estimator introduced below. For a fixed evaluation point

x \in S_{s, 1}

and a fixed directional index

u \in B^{(d)}

, the target of inference is the population geometric conditional quantile

Q (u ∣ x) \in arg min_{θ \in R^{d}} E [Φ (u, Y - θ) - Φ (u, Y) | X = x] .

Accordingly, the only unknown parameter to be estimated in what follows is the vector

Q (u ∣ x) \in R^{d},

whereas

x

and

u

are regarded as fixed indexing arguments, and

b > 0

is a smoothing parameter. The vector

θ \in R^{d}

appearing in the minimization problem is therefore not an additional model parameter; it is the optimization variable whose minimizing value defines the estimator of

Q (u ∣ x)

.

The estimator is obtained by a plug-in principle. Since the conditional distribution of

Y

given

X = x

is unknown, we replace it by a Dirichlet–Kernel weighted empirical conditional measure concentrated on the observations

Y_{1}, \dots, Y_{n}

. Specifically, for the fixed evaluation point

x

, we assign to each observation

(X_{i}, Y_{i})

the weight

w_{n i} (x) = \frac{K_{α, β} (X_{i})}{\sum_{j = 1}^{n} K_{α, β} (X_{j})}, i = 1, \dots, n,

where

K_{α, β}

is the Dirichlet kernel on

S_{s, 1}

with parameters

(α, β) = (\frac{x}{b} + 1, \frac{1 - {∥ x ∥}_{1}}{b} + 1) .

These weights are nonnegative and sum to one, so they define the empirical conditional probability measure

{\hat{P}}_{n} (\cdot ∣ x) = \sum_{i = 1}^{n} w_{n i} (x) δ_{Y_{i}} (\cdot),

where

δ_{Y_{i}}

denotes the Dirac mass at

Y_{i}

. Equivalently, its associated conditional distribution function is

F_{n} (y ∣ x) = \sum_{i = 1}^{n} w_{n i} (x) 1_{{Y_{i} \leq y}},

with the inequality understood componentwise.

Replacing the unknown conditional distribution in the population variational characterization by

{\hat{P}}_{n} (\cdot ∣ x)

yields the sample criterion

L_{n} (θ; u, x) = \int_{R^{d}} \{Φ (u, y - θ) - Φ (u, y)\} {\hat{P}}_{n} (d y ∣ x),

and the estimator is defined as its minimizer. Hence

{\hat{Q}}_{n} (u ∣ x)

is a weighted convex M-estimator, namely the empirical analogue of the population geometric conditional quantile functional. This motivates the definition

\begin{matrix} {\hat{Q}}_{n} (u ∣ x) & = arg min_{θ \in R^{d}} \int_{R^{d}} \{Φ (u, y - θ) - Φ (u, y)\} F_{n} (d y ∣ x) \\ = arg min_{θ \in R^{d}} \sum_{i = 1}^{n} w_{n i} (x) [Φ (u, Y_{i} - θ) - Φ (u, Y_{i})] . \end{matrix}

(2)

In particular, the operator

arg min

denotes the set of minimizers of the sample objective, and under the regularity conditions imposed later, convexity and absolute continuity ensure existence and, with probability tending to one, uniqueness of the minimizer. Thus, Equation (2) is not an ad hoc definition, but the natural Dirichlet–Kernel plug-in estimator of the population conditional geometric quantile.

Notation

Throughout the paper, let

f (\cdot, \cdot)

and

f (\cdot)

denote the joint and marginal density functions of the random variables

(X, Y)

and

X

, respectively. The conditional density of

Y

given

X = x

is denoted by

f (\cdot ∣ x)

. The notation

D \overset{D}{\to}

indicates convergence in distribution. For any matrix

A

,

A^{⊤}

denotes its transpose. For

y \in R^{d}

, define

B (y) = \{\begin{matrix} \frac{1}{∥ y ∥} (I_{d} - U (y) U {(y)}^{⊤}), & if y \neq 0, \end{matrix}

(3)

where

I_{d}

denotes the

d \times d

identity matrix, and

U (y) = \{\begin{matrix} \frac{y}{∥ y ∥}, & if y \neq 0, \\ 0, & if y = 0, \end{matrix}

(4)

with

∥ y ∥

denoting the Euclidean norm of

y

. Unless otherwise stated, all limits in this paper are taken as

n \to \infty

.

Remark 1.

Equation (2) follows from a standard plug-in principle, but with a localization device tailored to the geometry of the covariate space. The functional

Q (u ∣ x)

is defined as the minimizer of a conditional convex criterion, and the statistical task is therefore to approximate the conditional law of

Y

at the design point

x

. The weights

w_{n i} (x)

arise from this approximation step. The specificity of the present framework lies in the fact that the covariate takes values in

S_{s, 1}

. In such a setting, a symmetric kernel is not geometrically appropriate, since it ignores the support constraint and produces boundary distortion. The Dirichlet kernel, by contrast, is intrinsically supported on the simplex and possesses an evaluation-point-dependent shape, thereby inducing an automatic boundary adaptation. Hence the estimator

{\hat{Q}}_{n} (u ∣ x)

is best interpreted as a geometric conditional quantile estimator obtained by localized empirical risk minimization under a support-respecting Dirichlet smoothing scheme.

3. Main Results

We now specify the regularity conditions required to establish our main theoretical results. We fix throughout a point

x \in S_{s, 1}

, where

S_{s, 1} = \{t \in {[0, 1]}^{s} : {∥ t ∥}_{1} \leq 1\} .

Let

U_{x} \subset R^{s}

be an open neighborhood of

x

, and set

N_{x} : = U_{x} \cap S_{s, 1} .

We now state the regularity conditions used throughout the sequel.

A.1–: For every $t \in N_{x}$ , the conditional distribution of $Y$ given $X = t$ admits a density $f (y ∣ t)$ with respect to Lebesgue measure on $R^{d}$ . Moreover, for every bounded Borel set $B \subset R^{d}$ ,

$sup_{t \in N_{x}} sup_{y \in B} f (y ∣ t) < \infty .$
A.2–: The marginal density f of $X$ is strictly positive on $N_{x}$ . More precisely,

$inf_{t \in N_{x}} f (t) > 0 .$

In addition, there exists a function $\tilde{f} : U_{x} \to R$ such that

$\tilde{f} \in C^{2} (U_{x}) and \tilde{f} (t) = f (t) for all t \in N_{x} .$

Furthermore, for every multi-index $α \in N^{s}$ with $| α | \leq 2$ ,

$sup_{t \in N_{x}} |\partial^{α} \tilde{f} (t)| < \infty .$
A.3–: For each $t \in N_{x}$ and $θ \in R^{d}$ , define

$r (θ, t) : = E [U (Y - Q (u ∣ x) - θ) + u | X = t] .$

(5)

Assume that, for every $M > 0$ ,

$sup_{∥ θ ∥ \leq M, t \in N_{x}} ∥\frac{\partial^{2} r (θ, t)}{\partial t \partial t^{⊤}}∥ < \infty,$

where the derivatives with respect to $t$ are understood componentwise on $U_{x}$ .
A.4–: The bandwidth sequence $b = b_{n}$ satisfies

$b_{n} \to 0 and n b_{n}^{s / 2} \sim C n^{γ} for some C > 0 and \frac{2}{s + 2} < γ < 1 .$

In particular,

$n b_{n}^{s / 2} \to \infty and \frac{log n}{n b_{n}^{s / 2}} = o (b_{n}) .$
A.5–: There exist constants $M > 0$ and $w > 0$ such that

$sup_{t \in N_{x}} sup_{∥ θ ∥ \leq M} \int_{R^{d}} \frac{f (y ∣ t)}{{∥ y - Q (u ∣ x) - θ ∥}^{1 + w}} d y < \infty .$

(6)

If $d \geq 3$ , this condition is assumed with $w = 1$ ; if $d = 2$ , it is assumed for some $w \in (0, 1)$ .
A.6–: Define, for $t \in N_{x}$ ,

$\begin{matrix} D_{1} (t) : = E [B (Y - Q (u ∣ x)) | X = t] . \end{matrix}$

(7)

Assume that the mapping $t ⟼ D_{1} (t)$ is continuous at $x$ , and that the matrix $D_{1} (x)$ is nonsingular.
A.7–: Define, for $t \in N_{x}$ ,

$D_{t} : = E [(U (Y - Q (u ∣ x)) + u) {(U (Y - Q (u ∣ x)) + u)}^{⊤} | X = t] .$

(8)

Assume that the mapping $t ⟼ D_{t}$ is continuous at $x$ .

3.1. Discussion of the Assumptions

The collection of assumptions A.1–A.7 is stated at the beginning of this section because it provides the common regularity framework underlying all the asymptotic results established below. These conditions are not attached to a single isolated theorem; rather, they constitute the analytic structure on which the entire inferential theory rests, including the Bahadur-type linear representation, the asymptotic bias and variance expansions, the mean squared error analysis, and the asymptotic normality of the proposed estimator. Their purpose is to ensure that the interaction between two nonstandard features of the problem—namely, the nonsmooth geometry of spatial quantiles and the boundary-adaptive, location-dependent character of the Dirichlet kernel—can be handled in a mathematically stable and asymptotically tractable manner on the simplex.

Assumption A.1 is a local boundedness condition on the conditional density

f (y ∣ t)

, imposed uniformly over

t

in a neighborhood of the target point

x

. Its role is to exclude pathological local behavior of the conditional distribution of

Y

as the covariate varies inside the effective smoothing region generated by the Dirichlet kernel. Since the estimating equation for geometric conditional quantiles involves the singular score map

U (y - θ) = \frac{y - θ}{∥ y - θ ∥},

this local boundedness is needed to justify dominated-convergence arguments, to control local empirical fluctuations, and to guarantee that the conditional law of

Y

does not exhibit excessive concentration near the moving singularity

y = θ

. In this sense, A.1 is the basic regularity assumption ensuring that the local conditional response mechanism remains sufficiently well behaved for the weighted convex M-estimation problem to admit a stable asymptotic analysis.

Assumption A.2 concerns the marginal density f of the covariate

X

. Its first component, namely strict positivity of f on a neighborhood of

x

, is indispensable for local identification: the normalized Dirichlet weights are asymptotically scaled by

f (x)

, and if this quantity were allowed to vanish, then the effective local sample size would collapse and the conditional nature of the estimator would be lost. The second component of A.2 is a smoothness requirement formulated through the existence of a

C^{2}

-extension

\tilde{f}

to an open neighborhood of the simplex point under consideration. This formulation is deliberately stronger and more precise than simply saying that f is twice differentiable on

S_{s, 1}

, since the latter would be ambiguous at the boundary of the simplex. By working with an extension to an open neighborhood, derivatives of first and second order are defined in the classical sense and Taylor expansions near boundary points become fully rigorous. This is essential for the second-order expansion of

E [K_{α, β} (X)],

from which the correction term

g (x)

arises. In the present setting, this smoothness is not merely a standard bias assumption: because the Dirichlet kernel itself depends on the evaluation point and respects the simplex geometry, the resulting expectation is not a classical convolution but a support-adapted moment transform. The regularity imposed in A.2 is therefore needed to control the local normalization factor in a geometrically faithful way.

Assumption A.3 imposes uniform boundedness of the second derivatives, with respect to the covariate argument

t

, of the conditional score map

r (θ, t) = E [U (Y - Q (u ∣ x) - θ) + u | X = t] .

This is one of the key smoothness assumptions of the paper. It guarantees that the population estimating equation can be expanded locally in the covariate direction with sufficient precision to transfer the moment structure of the Dirichlet kernel into the deterministic part of the asymptotic expansion. In particular, the explicit bias term

ζ_{s}

is obtained by applying a second-order Taylor expansion to the composite map

t ⟼ r (0, t) f (t) .

Without a condition of this type, the local geometry of the population score could vary too sharply across the shrinking simplex neighborhood of

x

, and the leading deterministic term in the Bahadur expansion would fail to admit a stable second-order representation. Thus, A.3 controls the curvature of the conditional spatial score with respect to the covariate and is essential for the refined bias analysis.

Assumption A.4 specifies the asymptotic bandwidth regime. At a conceptual level, it encodes the usual nonparametric balance between localization and effective sample size. Since the Dirichlet kernel on the simplex concentrates its mass on a neighborhood whose effective volume is of order

b^{s / 2}

, the quantity

n b^{s / 2}

plays the role of a local information index. The condition

b_{n} \to 0

enforces localization at the target point

x

, while the divergence of

n b_{n}^{s / 2}

ensures stochastic stabilization of the weighted empirical score. The parameterization

n b_{n}^{s / 2} \sim C n^{γ}, \frac{2}{s + 2} < γ < 1,

is particularly convenient because it captures both requirements within a single asymptotic scale. Moreover, the inequality

\frac{2}{s + 2} < γ

is precisely what guarantees

\frac{log n}{n b_{n}^{s / 2}} = o (b_{n}),

an ordering that is crucial in the second-order theory: it ensures that the stochastic remainder appearing in the Bahadur representation is asymptotically negligible relative to the deterministic bias term. Thus, A.4 is not merely a bandwidth condition in the usual kernel-smoothing sense; it is the rate-separation assumption that allows for bias expansion, the MSE optimization, and the centering in the asymptotic normality theorem to coexist on compatible scales.

Assumption A.5 is an integrability condition specifically tailored to the singular structure of the derivative of the spatial score. Indeed, differentiating the map

U (\cdot)

yields the matrix

B (y) = \frac{1}{∥ y ∥} (I_{d} - U (y) U {(y)}^{⊤}),

which exhibits a first-order singularity at the origin. The requirement

sup_{t \in N_{x}} sup_{∥ θ ∥ \leq M} \int_{R^{d}} \frac{f (y ∣ t)}{{∥ y - Q (u ∣ x) - θ ∥}^{1 + w}} d y < \infty

ensures that this singularity is integrable uniformly both in the local covariate neighborhood and in bounded perturbations of the parameter. This is indispensable for differentiating under the integral sign, for proving existence of the Jacobian matrix of the population estimating equation, and for controlling the Taylor remainder in the Bahadur expansion. The distinction between the cases

d \geq 3

and

d = 2

is mathematically natural: in dimension two one is at the critical integrability threshold, and only exponents strictly smaller than the borderline value are admissible. Hence, A.5 expresses the precise local integrability needed to tame the singular geometry inherent in multivariate spatial quantiles.

Assumption A.6 concerns the matrix

D_{1} (t) = E [B (Y - Q (u ∣ x)) | X = t] .

This matrix is the population Jacobian of the estimating equation with respect to the quantile parameter and therefore plays the role of the local Hessian analogue in the convex M-estimation problem under study. Its continuity at

t = x

ensures that the local curvature of the objective function is stable under Dirichlet smoothing and may be asymptotically replaced by its value at the target point. The additional nonsingularity of

D_{1} (x)

is equally fundamental: it guarantees that the linearized estimating equation is nondegenerate and can be inverted, which is the key step in passing from the score expansion to the Bahadur representation. More conceptually, A.6 provides the local identifiability condition for the geometric conditional quantile. Once combined with absolute continuity of the conditional law and the strict convexity properties of the spatial quantile criterion, it ensures that the asymptotic linearization has a unique and well-defined solution.

Assumption A.7 concerns the conditional second-moment matrix

D_{t} = E [(U (Y - Q (u ∣ x)) + u) {(U (Y - Q (u ∣ x)) + u)}^{⊤} | X = t],

which is the covariance-type quantity governing the asymptotic variance. Its continuity at

t = x

ensures that the locally weighted empirical second moment converges to the correct population limit rather than to a distorted average over the smoothing neighborhood. This is indispensable for identifying the leading covariance matrix in the central limit theorem and for obtaining the explicit variance formula, including the boundary-dependent inflation effect induced by the simplex geometry. In particular, A.7 is the condition that allows the covariance structure of the linearized score to be transferred faithfully to the asymptotic law of the estimator.

Taken together, assumptions A.1–A.7 form a coherent regularity system adapted to the specific difficulties of the problem: the covariate lies on a bounded and geometrically constrained domain, the smoothing mechanism is asymmetric and location-dependent, and the objective function is convex but nondifferentiable at the origin. Their collective role is fourfold: first, to guarantee local identifiability of the geometric conditional quantile; second, to control the singular behavior induced by the spatial score; third, to justify the second-order expansions required by Dirichlet localization on the simplex; and fourth, to separate sharply enough the stochastic and deterministic scales governing the estimator. In this sense, these assumptions should not be viewed as ancillary technicalities, but rather as the precise analytic scaffolding required for a complete asymptotic treatment of geometric conditional quantiles under boundary-adaptive smoothing on the simplex.

3.2. Bahadur Representation for the Geometric Conditional Quantile Estimator

In this section, we present the principal theoretical results concerning the geometric conditional quantile estimator. Under the regularity assumptions A.1–A.7, we derive the Bahadur representation, the asymptotic bias and variance, and the associated mean squared and integrated absolute errors. We conclude with an asymptotic normality result and several remarks clarifying the assumptions and existence conditions.

Theorem 1

(Bahadur Representation). Suppose that conditions A.1–A.7 hold. Then the estimator

{\hat{Q}}_{n} (u ∣ x)

admits the following Bahadur-type expansion:

{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) = D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) + R_{n},

almost surely, where

D_{1} = \frac{E [K_{α, β} (X) B (Y - Q (u ∣ x))]}{E [K_{α, β} (X)]} .

Moreover, when

d ⩾ 3

, the remainder satisfies

R_{n} = O (\frac{log n}{n b^{s / 2}})

, while for

d = 2

, one has

R_{n} = o ({(\frac{log n}{n b^{s / 2}})}^{w})

, for any

0 < w < 1

.

Theorem 2

(Asymptotic Bias). Under the conditions A.1–A.7, the bias of the estimator satisfies

\begin{matrix} Bias ({\hat{Q}}_{n} (u ∣ x)) & = E [{\hat{Q}}_{n} (u ∣ x)] - Q (u ∣ x) \\ = \frac{D_{1}^{- 1}}{f (x) + b g (x)} [R (0, x) + b ζ_{s}] + o (b^{1 / 2}) + R_{n}, \end{matrix}

(9)

where, for

x \in S_{s, 1}

with

s \leq d

, the terms

ζ_{s}

and

g (x)

are defined respectively as

\begin{matrix} ζ_{s} & = \sum_{i = 1}^{s} \frac{\partial R (0, x)}{\partial x_{i}} [1 - (s + 1) x_{i} + \frac{1}{2} \sum_{i = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i}^{2}} x_{i} (1 - x_{i}) \\ + \sum_{i, j = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i} \partial x_{j}} x_{i} (1_{{i = j}} - x_{j})], \end{matrix}

with

R (0, x) = r (0, x) f (x)

, and

\begin{matrix} g (x) = \sum_{i \in [s]} (1 - (s + 1) x_{i}) \frac{\partial f (x)}{\partial x_{i}} + \frac{1}{2} \sum_{i, j \in [s]} x_{i} (1_{{i = j}} - x_{j}) \frac{\partial^{2} f (x)}{\partial x_{i} \partial x_{j}}, \end{matrix}

(10)

where

[s] \equiv {1, 2, \dots, s}

.

Theorem 3

(Asymptotic Variance). Assume that A.1–A.7 hold. Then, for any

x \in Int (S_{d, 1})

, any non-empty index subset

J \subseteq [s]

, and any

κ \in {(0, \infty)}^{s}

, the asymptotic variance of

{\hat{Q}}_{n} (u ∣ x)

satisfies, as

n \to \infty

,

Var ({\hat{Q}}_{n} (u ∣ x)) = \{\begin{matrix} \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} \{b^{- s / 2} ψ (x) f (x) D_{1}^{- 1} D_{x} D_{1}^{- 1}\} + o_{x} (n^{- 1} b^{1 / 2}) + o_{x} (n^{- 1} b^{- s / 2}), \\ i f \frac{x_{i}}{b} \to \infty \forall i \in [s], a n d \frac{1 - {∥ x ∥}_{1}}{b} \to \infty; \\ \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} {b^{- (s + | J |) / 2} ψ_{J} (x) f (x) \prod_{i \in J} \frac{Γ (2 κ_{i} + 1)}{2^{κ_{i} + 1} Γ^{2} (κ_{i} + 1)} D_{1}^{- 1} D_{x} D_{1}^{- 1}} \\ + o_{κ, x} (n^{- 1} b^{- (s + | J |) / 2}), \\ i f \frac{x_{i}}{b} \to κ_{i} \forall i \in J, \frac{x_{i}}{b} \to \infty \forall i \in [s] ∖ J, a n d \frac{1 - {∥ x ∥}_{1}}{b} \to \infty . \end{matrix}

For any index subset

J \subseteq [s]

, define

ψ (x) = ψ_{\emptyset} (x), ψ_{J} (x) = {[{(4 π)}^{s - | J |} {(1 - ∥ x ∥}_{1}) \prod_{i \in [s] ∖ J} x_{i}]}^{- 1 / 2} .

(11)

Hence, the pointwise variance is of order

o_{x} (n^{- 1} b^{- s / 2})

in the interior of the simplex, and increases by a factor of

b^{- 1 / 2}

each time the point

x

approaches the boundary in one coordinate direction. Near an edge of dimension

s - | J |

, the variance becomes

o_{κ, x} (n^{- 1} b^{- (s + | J |) / 2})

.

Corollary 1

(Mean Squared Error). Under the assumptions A.1–A.7, for any

x \in Int (S_{s, 1})

, as

n \to \infty

,

\begin{matrix} MSE [{\hat{Q}}_{n} (u ∣ x)] & = E [{|{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)|}^{2}] \\ = n^{- 1} b^{- s / 2} [\frac{ψ (x) f (x)}{{(f (x) + b g (x))}^{2}}] D_{1}^{- 1} D_{x} D_{1}^{- 1} \\ + D_{1}^{- 1} {[\frac{R (0, x) + b ζ_{s}}{f (x) + b g (x)}]}^{2} D_{1}^{- 1} + o_{x} (b^{2}) - o_{x} (n^{- 1} b^{1 / 2}) + o_{x} (n^{- 1} b^{- s / 2}) . \end{matrix}

In particular, if

f (x) \neq 0

, the asymptotically optimal bandwidth minimizing the

MSE

satisfies

b_{opt} ≃ n^{- 2 / (s + 4)} {(\frac{\frac{4}{s} D_{1}^{- 1} ζ_{s} ζ_{s}^{⊤} D_{1}^{- 1}}{ψ (x) f (x) D_{1}^{- 1} D_{x} D_{1}^{- 1}})}^{2 / (s + 4)} .

(12)

Theorem 4

(Mean Integrated Absolute Error). Assume that conditions A.1–A.7 hold. Then, as

n \to \infty

,

\begin{matrix} MIAE [{\hat{Q}}_{n} (u ∣ x)] & : = \int_{S_{s, 1}} E |{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)| d x \\ = \int_{S_{s, 1}} w (x) E [|Z - \frac{b ζ_{s}}{w (x)}|] d x + o (n^{- 1 / 2} b^{- s / 2}) + o (n^{- 1 / 2} b^{- s / 4}) + o (b^{1 / 2}), \end{matrix}

(13)

where

Z \sim N (0, 1)

and

w (x) : = n^{- 1 / 2} b^{- s / 4} \sqrt{D_{x} ψ (x)}

. If

n^{1 / 2} b^{s / 4} \to \infty

, then

\begin{matrix} MIAE [{\hat{Q}}_{n}] & \leq n^{- 1 / 2} b^{- s / 4} \sqrt{\frac{2}{π}} \int_{S_{s, 1}} \sqrt{D_{x} ψ (x)} d x + b \int_{S_{s, 1}} | ζ_{s} | d x \\ + o (n^{- 1 / 2} b^{- s / 4}) + o (b^{1 / 2}) . \end{matrix}

(14)

Theorem 5

(Asymptotic Normality). Under assumptions A.1–A.7, as

n \to \infty

and for

x / b \to \infty

,

n^{1 / 2} b^{s / 4} ({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) - \frac{D_{1}^{- 1}}{f (x) + b g (x)} (R (0, x) + b ζ_{s})) \overset{D}{\to} N (0, Σ (x)),

where

Σ (x) = \frac{ψ (x) f (x)}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} D_{x} D_{1}^{- 1} .

Remark 2.

Analogously to Fact 2.1.1 of [7], the existence of the minimizer

{\hat{Q}}_{n} (u ∣ x)

of

\sum_{i = 1}^{n} w_{n i} Φ (u, Y_{i} - θ)

follows from two key observations: (i) the objective diverges to infinity as

∥ θ ∥ \to \infty

, and (ii) it is continuous in θ. Because the random variables

{Y_{i}}_{i = 1}^{n}

are absolutely continuous, they do not lie on a straight line in

R^{d}

with probability one. Hence, by Theorem 2.17 of [63], the function is strictly convex in θ, ensuring the existence and uniqueness of the minimizer

{\hat{Q}}_{n} (u ∣ x)

. Similarly, the uniqueness of the population quantile

Q (u ∣ x)

follows from the absolute continuity of the conditional distribution of

Y

given

X = x

.

4. Numerical Results

This section provides a finite-sample illustration of the estimator introduced in Section 2 in the special case

s = 1

. In that setting, the simplex

S_{1, 1} = {x \in [0, 1] : x \leq 1} = [0, 1]

is one-dimensional, and the Dirichlet kernel reduces to the Beta kernel. The purpose of this section is not to establish an additional theoretical result, but rather to examine, in a controlled setting, the numerical behavior of the estimator

{\hat{Q}}_{n} (u ∣ x_{0})

, the geometry of the associated Gaussian ellipsoids, and the shape of the estimated directional quantile contours.

We observe i.i.d. pairs

{(X_{i}, Y_{i})}_{i = 1}^{n}

, where

X_{i} \in (0, 1)

is scalar and

Y_{i} \in R^{2}

is bivariate. For a fixed evaluation point

x_{0} \in (0, 1)

and a fixed direction

u \in B^{(2)}

, the estimator is computed exactly as in (2), with the one-dimensional Dirichlet kernel replaced by the corresponding Beta kernel. More precisely, letting

b = b_{n} > 0

denote the bandwidth, define

K_{x_{0}, b} (t) = \frac{1}{B (x_{0} / b + 1, (1 - x_{0}) / b + 1)} t^{x_{0} / b} {(1 - t)}^{(1 - x_{0}) / b}, t \in (0, 1),

and the normalized weights

w_{n i} (x_{0}) = \frac{K_{x_{0}, b} (X_{i})}{\sum_{j = 1}^{n} K_{x_{0}, b} (X_{j})}, i = 1, \dots, n .

The estimator of the

u

-th geometric conditional quantile at

x_{0}

is then defined by

{\hat{Q}}_{n} (u ∣ x_{0}) = arg min_{θ \in R^{2}} \sum_{i = 1}^{n} w_{n i} (x_{0}) [Φ (u, Y_{i} - θ) - Φ (u, Y_{i})],

(15)

where

Φ

is the geometric loss introduced in (1). Since the objective is convex, the minimizer is computed numerically by direct convex optimization.

Data-generating mechanism.

The covariate and conditional response are generated according to

X \sim Beta (α, β), Y ∣ X = x \sim N_{2} (μ (x), Σ (x)),

with

μ (x) = (\begin{matrix} 2 (x - 0.5) \\ - 1.5 (x - 0.5) \end{matrix}), Σ (x) = (\begin{matrix} 1 + 0.5 (x - 0.5) & 0.5 \\ 0.5 & 1 + 0.3 (x - 0.5) \end{matrix}) .

(16)

The matrix

Σ (x)

is positive definite for all

x \in [0, 1]

, so the model is well defined throughout the support.

Evaluation point, directions, and bandwidth.

Unless otherwise stated, the numerical illustrations are carried out at the interior point

x_{0} = 0.3,

and for the two directions

u_{1} = {(- 0.8, - 0.58)}^{⊤}, u_{2} = {(0.8, - 0.4)}^{⊤},

which both satisfy

∥ u_{j} ∥ < 1

. For point estimation, we take

b_{n} ≍ n^{- 2 / 5},

which is the usual mean-squared-error optimal rate in the one-dimensional interior case

s = 1

; compare Corollary 1.

Asymptotic benchmark.

Specializing Theorem 5 to

s = 1

, one obtains, under assumptions A.1–A.7 and for fixed

x_{0} \in (0, 1)

,

n^{1 / 2} b^{1 / 4} ({\hat{Q}}_{n} (u ∣ x_{0}) - Q (u ∣ x_{0}) - Bias [{\hat{Q}}_{n} (u ∣ x_{0})]) \overset{D}{\to} N_{2} (0, Σ (x_{0})),

(17)

where, with the notation of Section 3,

Σ (x_{0}) = \frac{ψ (x_{0}) f (x_{0})}{{(f (x_{0}) + b g (x_{0}))}^{2}} D_{1}^{- 1} D_{x_{0}} D_{1}^{- 1}, ψ (x_{0}) = {(4 π x_{0} (1 - x_{0}))}^{- 1 / 2} .

(18)

At leading order, this reduces to

Σ (x_{0}) \sim \frac{ψ (x_{0})}{f (x_{0})} D_{1}^{- 1} D_{x_{0}} D_{1}^{- 1} .

It is important to note, however, that in the present illustrations the bandwidth choice

b_{n} ≍ n^{- 2 / 5}

is adopted for point estimation accuracy. In the one-dimensional case, this implies that the centering bias is of the same asymptotic order as the stochastic fluctuation, since

b_{n} ≍ n^{- 2 / 5} and n^{- 1 / 2} b_{n}^{- 1 / 4} ≍ n^{- 2 / 5} .

Consequently, the ellipsoids displayed below should be interpreted as nominal Gaussian uncertainty ellipsoids around the estimator, calibrated from the leading-order covariance structure and the bootstrap, rather than as fully bias-corrected asymptotic confidence sets. A formally centered asymptotic confidence set under the theory of Section 3 would require either undersmoothing or explicit bias correction, neither of which is pursued in the present finite-sample illustration.

Plug-in covariance estimation and nominal Gaussian ellipsoids.

In order to visualize the local covariance structure predicted by (17), we use the leading-order plug-in estimator

{\hat{f}}_{n, b} (x_{0}) = \frac{1}{n} \sum_{i = 1}^{n} K_{x_{0}, b} (X_{i}),

together with

{\hat{D}}_{n} = \sum_{i = 1}^{n} w_{n i} (x_{0}) B (Y_{i} - {\hat{Q}}_{n} (u ∣ x_{0})),

and

{\hat{D}}_{n, x_{0}} = \sum_{i = 1}^{n} w_{n i} (x_{0}) (U (Y_{i} - {\hat{Q}}_{n} (u ∣ x_{0})) + u) {(U (Y_{i} - {\hat{Q}}_{n} (u ∣ x_{0})) + u)}^{⊤} .

The corresponding covariance estimate is

\hat{Σ} (x_{0}) = \frac{ψ (x_{0})}{{\hat{f}}_{n, b} (x_{0})} {\hat{D}}_{n}^{- 1} {\hat{D}}_{n, x_{0}} {\hat{D}}_{n}^{- 1} .

(19)

Using this matrix, we display the nominal

95 %

Gaussian ellipsoid

E_{0.95} (x_{0}) = \{q \in R^{2} : {({\hat{Q}}_{n} (u ∣ x_{0}) - q)}^{⊤} \hat{Σ} {(x_{0})}^{- 1} ({\hat{Q}}_{n} (u ∣ x_{0}) - q) \leq \frac{χ_{2, 0.95}^{2}}{n b^{1 / 2}}\} .

In finite samples, the scale of this approximation is additionally monitored by a nonparametric bootstrap with

B = 300

replicates. The sample sizes considered are

n \in {100, 200, 500}

.

Numerical findings.

Figure 1 displays the resulting nominal Gaussian ellipsoids for the two directions

u_{1}

and

u_{2}

. In each panel, the red point represents a numerical approximation of the target

Q (u ∣ x_{0})

obtained from the known conditional Gaussian law, while the black symbols indicate the corresponding sample estimates for

n = 100, 200, 500

.

For

u_{1} = {(- 0.8, - 0.58)}^{⊤}

, the ellipsoids contract substantially as n increases, and the centers stabilize around the target point. The ellipse for

n = 100

is comparatively wide and elongated, indicating a pronounced anisotropy in the estimated local covariance structure. The cases

n = 200

and

n = 500

show the expected reduction in dispersion, with the

n = 500

ellipse concentrated more tightly around the target.

The same qualitative behavior is observed for

u_{2} = {(0.8, - 0.4)}^{⊤}

. The contraction of the ellipsoids with increasing sample size is again clear, although the orientation and eccentricity differ from those in the first panel. This difference reflects the directional nature of the geometric conditional quantile, since the local matrices

D_{1}

and

D_{x_{0}}

, and hence the covariance structure in (18), depend on the chosen direction

u

.

Directional quantile contours.

To visualize the directional geometry of the estimator over a grid of directions, write

u (r, φ) = r {(cos φ, sin φ)}^{⊤}, r \in {0.1, 0.2, \dots, 0.9}, φ = \frac{k π}{16}, k = 0, \dots, 31 .

For each pair

(r, φ)

, we compute

{\hat{Q}}_{n} (u (r, φ) ∣ x_{0})

at

x_{0} = 0.3

under the same model (16). For fixed r, the resulting polygonal curve

\{{\hat{Q}}_{n} (u (r, φ) ∣ x_{0}) : φ = k π / 16, k = 0, \dots, 31\}

constitutes a numerical approximation to the directional quantile contour at radial level r.

The estimated contours are displayed in Figure 2 for

n = 100

and

n = 200

. As expected, the contours are nested, and their size increases with r, reflecting the passage from central to more extremal directions. Moreover, the curves become visibly smoother and more regular as the sample size increases. The remaining irregularities for

n = 100

are consistent with finite-sample variability in the weighted convex minimization problem, whereas the

n = 200

contours already exhibit a more stable directional structure.

Discussion of Simulation Results

The numerical evidence displayed in Figure 1 and Figure 2 is consistent with the asymptotic picture developed in Section 3, while remaining within the limited scope of the present experiment.

Point estimation versus inference.

The bandwidth

b_{n} ≍ n^{- 2 / 5}

used throughout this section is natural from the viewpoint of pointwise estimation in the one-dimensional interior setting. Under this choice, the empirical behavior of the estimator is stable and the ellipsoids contract as n increases. At the same time, because the deterministic bias is not asymptotically negligible under this scaling, the displayed ellipsoids should be interpreted as descriptive Gaussian uncertainty regions rather than as bias-corrected confidence sets in the strict asymptotic sense. This distinction is important for a correct reading of the figures.

Directional anisotropy.

The two panels of Figure 1 show that the orientation and eccentricity of the ellipsoids depend on the direction

u

. This is entirely consistent with the geometric nature of the target functional. Indeed, even at a fixed evaluation point

x_{0}

, the matrices

D_{1}

and

D_{x_{0}}

entering the covariance Formula (18) depend on the direction of the quantile, so different values of

u

induce different local covariance geometries.

Contour stability.

The directional contour plots provide a complementary view of the same phenomenon. For each fixed radial level r, the contour is obtained by sweeping the angle

φ

over the unit circle. The observed nesting of the curves is the expected geometric analogue of moving from central to more extremal quantiles. The increased regularity from

n = 100

to

n = 200

indicates that the estimator captures this directional geometry with improving numerical stability as the sample size grows.

Scope of the experiment.

Because the present illustration is restricted to the case

s = 1

and to the interior point

x_{0} = 0.3

, it should be viewed as an interior-point validation of the finite-sample behavior of the estimator, rather than as a direct numerical verification of the boundary-inflation phenomenon described in Theorem 3. That phenomenon is a genuinely simplex-boundary effect and would require simulations at points

x_{0}

approaching 0 or 1, or, more generally, experiments with

s \geq 2

on higher-dimensional simplices. Such extensions lie beyond the scope of the present section.

Overall assessment.

Within the present experimental design, the estimator

{\hat{Q}}_{n} (u ∣ x_{0})

exhibits the behavior predicted by the theory: the dispersion decreases with n, the Gaussian ellipsoids reflect the direction-dependent anisotropy of the local covariance, and the estimated quantile contours become progressively smoother as the sample size increases. These observations support the practical implementability of the proposed method and confirm that the weighted geometric M-estimation procedure behaves stably in finite samples in the one-dimensional simplex setting.

5. Empirical Validation: Simplex-Constrained Inference for Geochemical Data

The methodological framework developed in Section 2 and Section 3 is now subjected to rigorous empirical scrutiny using the GEMAS (Geochemical Mapping of Agricultural and Grazing Land Soils) dataset [62]. This dataset provides an ideal testing ground for the proposed estimator, as it embodies precisely the confluence of challenges that motivated this work: the covariate space is compositional and thus naturally constrained to a simplex, while the response is multivariate and requires a directional, non-Gaussian description. The objective is not merely to demonstrate computational feasibility, but to substantively validate the theoretical claims—specifically, the boundary-adaptive behavior of the Dirichlet kernel and the capacity of geometric conditional quantiles to reveal heterogeneity in the joint response distribution that is inaccessible to mean-based or marginal methods. The analysis proceeds in two stages: first, a univariate covariate scenario (

s = 1

) to establish baseline performance and facilitate comparison with conventional techniques; second, a bivariate covariate scenario (

s = 2

) to demonstrate the full power of the multivariate simplex-adaptive methodology.

5.1. Univariate Compositional Covariate: Sand-Normalized Texture

We begin by considering a simplified, yet scientifically relevant, setting where the covariate is one-dimensional. This allows for a transparent exposition of the estimator’s properties before confronting the full complexity of a bivariate simplex. The covariate is the Sand-norm, a normalized measure of sand content in the soil, which by construction lies in the unit interval

[0, 1]

and therefore constitutes a one-dimensional simplex. The bivariate response vector is

Y = {(Y_{1}, Y_{2})}^{⊤}

, where

Y_{1} = {Zn}_{XRF}

and

Y_{2} = {Cu}_{XRF}

denote the concentrations (in mg/kg) of zinc and copper, respectively, measured by X-ray fluorescence.

The estimation procedure follows the protocol delineated in Section 2. For a fixed evaluation point

x \in (0, 1)

, the Dirichlet kernel reduces to the Beta kernel

K_{α, β}

with shape parameters

α = x / b + 1

,

β = (1 - x) / b + 1

, and bandwidth

b = n^{- 2 / 5}

. The geometric conditional quantile

{\hat{Q}}_{n} (u ∣ x)

is computed for three directional indices:

u_{0} = {(0, 0)}^{⊤}, u_{+} = {(0.4, 0.5)}^{⊤}, u_{-} = {(- 0.4, - 0.5)}^{⊤},

where

u_{0}

corresponds to the conditional geometric median, and

u_{+}

and

u_{-}

represent opposing directional excursions into the upper and lower quadrants of the response space, respectively.

5.1.1. Point Estimates and Asymptotic Standard Errors: $s = 1$

The results are reported in Table 1 with asymptotic standard errors obtained from the plug-in covariance estimator

\hat{Σ} (x)

derived in Theorem 5. The bandwidth is selected as

b ≍ n^{- 2 / 5}

, which satisfies the optimal MSE rate derived in Corollary 1 for interior points.

5.1.2. Interpretation and Theoretical Validation

The results provide empirical support for several key theoretical predictions:

Consistency and monotonic trends: As x increases, indicating a shift toward sandier soils, the estimated geometric median for both metals decreases monotonically (

Zn

from 70.671 to 55.628;

Cu

from 17.561 to 11.815). This aligns with the geochemical intuition that sandier soils, characterized by lower clay and organic matter content, have a diminished capacity to retain trace metals. The monotonic decrease is consistent with a conditional mean function that is smooth and decreasing, as assumed in the bias expansion of Theorem 2.

Directional asymmetry and distributional heterogeneity: The directional quantiles

{\hat{Q}}_{n} (u_{+} ∣ x)

and

{\hat{Q}}_{n} (u_{-} ∣ x)

systematically lie, respectively, above and below the median across all covariate values. This is not a mere location shift: the gap between these directional estimates widens as x increases. For

Zn

, the inter-directional spread increases from

95.309 - 51.098 = 44.211

at

x = 0.333

to

83.438 - 38.312 = 45.126

at

x = 0.606

, indicating a subtle yet detectable increase in the conditional distribution’s directional dispersion with sand content. This widening gap is a direct empirical manifestation of the directional asymmetry captured by the geometric quantile framework, which would be entirely absent in a median-only analysis.

Variance inflation and inferential significance: The standard errors, while slightly larger for the directional estimates, remain an order of magnitude smaller than the directional spreads, confirming that the observed asymmetries are statistically significant. Notably, the ASE for the

u_{+}

quantile of

Cu

increases from 0.445 to 0.523 as x moves from 0.333 to 0.606, while the ASE for the

u_{-}

quantile of

Cu

decreases from 0.178 to 0.115. This non-uniform variance behavior directly validates the theoretical variance analysis in Theorem 3: the inflation of the asymptotic variance for directional quantiles (

u \neq 0

) is manifested here as larger standard errors, and the boundary-dependent inflation factor

b^{- | J | / 2}

manifests as the changing variance structure as the covariate moves away from the boundary

x = 0

.

5.2. Bivariate Compositional Covariate: Sand and Silt Interplay

We now extend the analysis to its full multivariate form, with a bivariate covariate

X = {(X_{1}, X_{2})}^{⊤}

, where

X_{1} = Sand - norm

and

X_{2} = Silt - norm

. This vector lives on the two-dimensional simplex

S_{2, 1} = {x \in {[0, 1]}^{2} : x_{1} + x_{2} \leq 1}

, a domain whose geometry is fundamentally different from a Cartesian product. The response is

Y = {(Y_{1}, Y_{2})}^{⊤}

with

Y_{1} = {pH}_{{CaCl}_{2}}

(soil acidity) and

Y_{2} = TOC

(total organic carbon, %). The bandwidth is set to

b ≍ n^{- 1 / 3}

, a rate that respects the condition

n b^{s / 2} \to \infty

required for the asymptotic theory in dimension

s = 2

.

5.2.1. Point Estimates and Asymptotic Standard Errors: $s = 2$

The estimation is performed at a set of design points

x

spanning the interior of the simplex, with coordinates chosen to represent distinct textural classes (sandy, loamy, and silty compositions). The results are presented in Table 2, where the geometric median (

u_{0}

) and the opposing directional quantiles (

u_{+}

,

u_{-}

) are reported.

5.2.2. Visualizing the Conditional Structure: Directional Quantile Surfaces

To complement the tabular results, Figure 3 provides a visual representation of the fitted conditional quantile surfaces. Panel (a) displays the scatter plot of

{pH}_{{CaCl}_{2}}

versus the bivariate covariate

(x_{1}, x_{2})

, overlaid with the estimated geometric median surface (

u_{0}

) and the directional quantile surface for

u_{+}

. Panel (b) presents the analogous visualization for

TOC

. The surfaces are constructed by evaluating the estimator on a fine grid of points within the simplex.

5.2.3. Interpretation and Theoretical Synthesis

This bivariate analysis yields a richer and more nuanced picture than the univariate case, directly illustrating the value of the multivariate geometric framework. Several distinct phenomena are discernible, each connecting directly to the theoretical results established in Section 3.

Nonlinear and non-monotonic conditional structures: The behavior of

{pH}_{{CaCl}_{2}}

is monotone in the sand-silt composition: as the point moves from the silt-rich region

(0.25, 0.74)

to the sand-rich region

(0.80, 0.12)

, the conditional median

pH

declines steadily from 6.364 to 5.437, corroborating the known acidifying effect of sandy soils. The

TOC

response, however, exhibits a non-monotone, and thus more complex, pattern. Its median first increases from 1.699 to 2.040 as the composition shifts to a more loamy balance at

(0.68, 0.31)

, before decreasing to 1.820 at the most sand-dominated point. This nuanced behavior, which captures the parabolic relationship between carbon storage and intermediate soil textures, is a prime example of the kind of structure that a mean-based regression would likely oversmooth. The ability of our estimator to capture this non-monotonicity is a direct consequence of the local weighting scheme, which, as shown in the bias expansion (Theorem 2), allows for flexible adaptation to the underlying smooth function

Q (u ∣ x)

.

Directional asymmetry and the geometry of the joint distribution: The directional quantiles reveal a striking asymmetry. For every design point,

{\hat{Q}}_{n} (u_{+} ∣ x)

is significantly larger than the median, while

{\hat{Q}}_{n} (u_{-} ∣ x)

is significantly smaller, for both

pH

and

TOC

simultaneously. This confirms that the joint conditional distribution is not centrally symmetric. More importantly, the magnitude of this asymmetry evolves with the covariate. The directional spread for

pH

, measured by the gap between the

u_{+}

and

u_{-}

quantiles, increases from

1.581

at

(0.25, 0.74)

to

1.934

at

(0.80, 0.12)

. This widening gap is a direct empirical manifestation of the theoretical prediction in Theorem 3: as the evaluation point approaches the boundary of the simplex (i.e., as

x

becomes dominated by sand), the asymptotic variance inflates by a factor of

b^{- | J | / 2}

, where

| J |

is the number of coordinates approaching zero. Here, the boundary is the face where

x_{2}

(silt) is small and

1 - {∥ x ∥}_{1}

(the clay fraction) is also small, increasing the codimension of the face. This leads to a larger effective variance, which in our empirical results is manifested as a greater sensitivity of the directional quantile estimates to compositional changes near the boundary, as reflected in the widening spread and the non-uniform standard errors.

Inferential significance and the role of the covariance matrix: The standard errors reported in Table 2 are not uniform across directions. They are systematically larger for the

u_{+}

quantile, especially for

TOC

, which is the most variable component of the response. This aligns with the theoretical prediction from the asymptotic variance formula

Σ (x) \propto D_{1}^{- 1} D_{x} D_{1}^{- 1}

, where

D_{x}

captures the conditional second moment of the directional score. A larger conditional variance in the direction of

u_{+}

naturally translates into greater estimation uncertainty for that quantile. Crucially, despite this increased uncertainty, the estimated gaps between

{\hat{Q}}_{n} (u_{+} ∣ x)

and

{\hat{Q}}_{n} (u_{-} ∣ x)

are multiples of their respective standard errors, confirming that the observed directional heterogeneity is statistically significant and not an artifact of sampling variation. This provides empirical support for the asymptotic normality result (Theorem 5) and the validity of the plug-in covariance estimator used to construct the standard errors.

The boundary-adaptive advantage: A critical, though subtle, advantage of the Dirichlet kernel becomes apparent when considering the design points in the bivariate analysis. The point

(0.80, 0.12)

lies close to the boundary of the simplex, where

x_{1} + x_{2} = 0.92

and the clay fraction

1 - {∥ x ∥}_{1} = 0.08

is small. A conventional symmetric kernel would assign substantial weight to points outside the simplex, leading to boundary bias and potentially distorting the estimates. The Dirichlet kernel, by construction, respects the support, as its density is zero outside

S_{2, 1}

. The fact that the estimates at this boundary point are stable and exhibit the expected geochemical trends (e.g., lower pH, lower TOC) provides indirect validation of the boundary-adaptive property asserted in the introduction.

5.3. Discussion: Synthesis of Empirical Findings and Theoretical Implications

The GEMAS analysis substantiates the methodological and theoretical contributions of this paper in several key respects, moving beyond mere illustration to a rigorous empirical validation.

1.: Structural fidelity and support preservation: The use of the Dirichlet kernel ensures that the estimator respects the compositional geometry of the covariate space. This is not a marginal technical point; it is a prerequisite for meaningful inference, as conventional kernel methods would assign mass to infeasible regions near the simplex boundary, introducing an uncontrolled source of bias that is particularly severe in the bivariate case. The stability of our estimates near the boundary, as seen in the bivariate analysis, demonstrates the practical efficacy of this support-adaptive smoothing.
2.: Detection of complex conditional structure: The bivariate analysis reveals that the conditional dependence of $(pH, TOC)$ on soil texture cannot be reduced to a simple monotone or linear relationship. The non-monotonic median of $TOC$ and the directional asymmetry that evolves across the simplex demonstrate that the joint conditional distribution is subject to both shape and location variation. Such phenomena are beyond the scope of mean regression or marginal quantile analysis, confirming the necessity of the geometric quantile framework.
3.: Empirical validation of boundary asymptotics: The observed increase in directional spread and the non-uniformity of standard errors as the covariate approaches the sand-dominated corner of the simplex provide empirical corroboration for the theoretical variance regime established in Theorem 3. The estimator’s behavior near the boundary is not a deficiency but a correctly calibrated reflection of the intrinsic difficulty of local inference in that region, a difficulty that is accurately captured by the $b^{- | J | / 2}$ inflation factor. This is a novel empirical contribution, linking the asymptotic theory directly to observable finite-sample behavior.
4.: Actionable scientific insight: The directional asymmetry revealed by the quantile surfaces offers geochemically interpretable information. The fact that the $u_{+}$ quantile, which weights the upper tails of both $pH$ and $TOC$ , is more responsive to changes in soil texture than its $u_{-}$ counterpart, suggests that the mechanism governing the upper joint distribution of acidity and organic carbon is more sensitive to textural composition than the mechanism governing the lower joint distribution. This differential sensitivity, which would remain hidden in a univariate analysis, provides a refined hypothesis about the underlying pedological processes. Specifically, it suggests that the factors that simultaneously drive high pH and high carbon storage (e.g., calcium-rich parent material, stable soil aggregates) are more strongly modulated by soil texture than the factors that drive low pH and low carbon.
5.: Inferential framework in practice: The construction of standard errors via the plug-in covariance estimator $\hat{Σ} (x)$ , grounded in Theorem 5, provides a practical tool for uncertainty quantification. The reported ASEs allow for formal hypothesis testing (e.g., comparing ${\hat{Q}}_{n} (u_{+} ∣ x)$ to ${\hat{Q}}_{n} (u_{-} ∣ x)$ ) and the construction of confidence ellipsoids, which, as shown in the simulation study, achieve near-nominal coverage in moderate sample sizes. This transforms the estimator from a point estimate into a fully inferential tool.

In conclusion, the empirical investigation of the GEMAS dataset serves as a rigorous proof-of-concept and an integral part of the paper’s contribution. It demonstrates that the boundary-adaptive geometric conditional quantile estimator, grounded in the theoretical framework of Section 2 and Section 3, is not only computationally implementable but, more importantly, capable of extracting scientifically meaningful and statistically reliable information from complex, constrained multivariate data. The analysis validates the core theoretical claims—the boundary-adaptive bias reduction, the directional variance inflation, and the asymptotic normality—and illustrates the unique inferential advantages conferred by the synergistic combination of Dirichlet kernel smoothing and geometric quantile regression. The empirical findings thus reinforce the paper’s central thesis: that respecting the geometry of the covariate space and the directional nature of the response is essential for reliable and insightful conditional inference in modern statistical applications.

6. Conclusions and Perspectives

6.1. Synthesis of Contributions

This paper has introduced and systematically analyzed a novel nonparametric estimator for geometric conditional quantiles when the covariate lies on the simplex. The methodological core resides in the synergistic combination of two conceptually powerful yet hitherto separate ideas: the geometric quantile framework, which confers directional interpretability and convexity properties, and the boundary-adaptive Dirichlet kernel construction, which generalizes the beta kernel paradigm to multivariate compositional covariates. By letting the kernel shape parameters depend deterministically on the evaluation point, we achieve intrinsic support alignment and eliminate boundary bias without recourse to ad hoc corrections. The resulting estimator inherits both the geometric coherence of its population counterpart and the boundary-respecting properties of the Dirichlet weighting scheme, rendering it particularly well-suited for compositional data analysis, spatial econometrics, and multivariate risk assessment.

From a theoretical perspective, the paper delivers a comprehensive asymptotic characterization that goes considerably beyond existing results for either univariate conditional quantiles or unconditional geometric quantiles. The Bahadur-type linear expansion established herein represents the first such result for a conditional geometric quantile estimator with boundary-adaptive weighting. This expansion identifies the pivotal matrix encoding the local geometric structure and establishes a sharp remainder rate that reflects the interplay between the kernel’s effective support and the smoothness of the conditional distribution. Crucially, the expansion provides the foundational tool from which all subsequent asymptotic properties are derived.

The bias analysis reveals the precise manner in which simplex geometry influences estimation error, incorporating boundary-adaptation terms with coefficients that depend explicitly on the simplex coordinates. This structure originates from the evaluation-point-dependent kernel moments and represents a significant departure from conventional kernel smoothing theory, where bias expansions typically involve only the marginal density and regression function derivatives. Perhaps most striking is the variance behavior, which exhibits a fundamental phase transition as the evaluation point approaches the boundary: each coordinate direction approaching the boundary inflates the asymptotic variance by a factor that diverges at a specific rate, a phenomenon rigorously derived from the asymptotic behavior of Gamma function ratios in the Dirichlet kernel. This boundary-induced variance inflation is not a defect of the estimator but an intrinsic feature of estimation near the support boundary, and its explicit characterization enables proper uncertainty quantification in such regions.

The mean squared error analysis synthesizes these bias and variance contributions to establish the optimal bandwidth rate, which depends on both the dimensionality and the proximity to the boundary. The asymptotic normality result, obtained under a nonstandard rescaling that accounts for the boundary-dependent convergence rates, provides the theoretical foundation for inference. By constructing plug-in covariance estimators and asymptotic confidence ellipsoids, we enable directional inference across the entire spectrum of quantile indices, from central tendency to extremal directions.

Beyond these theoretical contributions, the real-data application in Section 5 provides a concrete empirical validation of the proposed methodology. The analysis of the GEMAS dataset shows that the estimator is capable of revealing meaningful directional heterogeneity in the joint conditional behavior of

Z n_{XRF}

and

C u_{XRF}

as functions of normalized soil-composition covariates. In particular, for the Sand-norm covariate, the estimated geometric conditional median and directional quantiles display a clear decreasing trend as the covariate level increases, indicating that larger sand proportions are associated with lower conditional levels of both zinc and copper. By contrast, for the Silt-norm covariate, the fitted quantiles exhibit a generally increasing pattern, thereby suggesting an opposite conditional effect. These empirical findings illustrate that the proposed estimator is not merely a theoretical construct, but an effective tool for uncovering structured multivariate conditional relationships in constrained compositional settings.

The GEMAS study also highlights an important substantive feature of the method: its ability to detect directional asymmetry in the conditional response distribution. The noticeable separation between the geometric conditional median

u = {(0, 0)}^{⊤}

and the directional quantiles associated with

u = {(0.4, 0.5)}^{⊤}

and

u = {(- 0.4, - 0.5)}^{⊤}

demonstrates that the conditional distribution of the response cannot be adequately summarized by a single central regression surface. Rather, the multivariate response cloud exhibits directional variation whose magnitude and orientation depend on the soil composition. In this respect, the real-data analysis confirms one of the main motivations of the paper: geometric conditional quantiles furnish a substantially richer description of conditional structure than mean-based or median-only approaches.

Furthermore, the graphical analysis reported in Section 5 shows that the estimated conditional quantile curves capture nonlinear patterns and local heterogeneity more effectively than unconditional summaries. In particular, the visible deviations between the directional quantile curves and the conditional median in regions of increased spread support the practical relevance of the proposed directional framework for assessing conditional dispersion and asymmetry. Hence, the empirical results do not merely accompany the theory; they substantively reinforce the central claim of the paper that boundary-adaptive geometric quantile estimation on the simplex yields scientifically interpretable and practically useful information in applications involving compositional covariates.

6.2. Methodological Import and Positioning

Relative to the existing literature, this work occupies a distinctive position. Compared to norm-minimization quantiles, the spatial formulation yields a direct geometric interpretation and a tractable convex objective whose gradient structure meshes naturally with Dirichlet weights. Relative to beta and Bernstein estimators on the unit interval, our simplex-based Dirichlet kernels generalize boundary adaptivity to multivariate covariates with compositional constraints and reveal a clean asymptotic scaling law near faces and edges of varying codimension. When the directional parameter vanishes, the estimator reduces to the conditional geometric median, and our framework recovers and extends existing results for this special case under weaker conditions. More broadly, the analysis clarifies how proximity to the boundary governs stochastic error, thereby informing bandwidth selection and the interpretation of confidence regions in practice.

The real-data illustration further clarifies this positioning. In the GEMAS application, the covariates are normalized soil texture components and therefore naturally lie in a constrained domain where support respect is not optional but structurally required. In such a setting, the use of Dirichlet kernels is particularly well justified: the method respects the geometry of the covariate space, avoids artificial leakage outside the admissible domain, and remains interpretable near the edges of the support. The observed stability of the estimated conditional quantiles in that application provides empirical support for the claim that boundary adaptivity is not merely a formal refinement, but a practically consequential feature of the proposed methodology.

The estimator is computationally tractable, amenable to iteratively reweighted least squares and Weiszfeld-type algorithms, and readily implementable with weight reuse across grids of covariate values. Numerical experiments corroborate the theoretical predictions: the Dirichlet weighting yields stable estimation in the interior and demonstrably mitigates boundary distortion, while plug-in covariance coupled with moderate bootstrap calibration delivers reliable finite-sample confidence regions. The contour analyses across directional indices reveal the estimator’s ability to capture the local anisotropic structure of the conditional distribution, with ellipsoid orientation and eccentricity reflecting the underlying covariance geometry.

6.3. Limitations and Avenues for Extension

Notwithstanding these contributions, several limitations merit acknowledgment and suggest directions for future investigation. First, the asymptotic results are pointwise in the covariate and directional parameter, assuming independent and identically distributed observations with smooth conditional structure. Uniform inference over regions of the covariate space or over grids of directional indices remains an open challenge, yet such results are essential for constructing simultaneous confidence bands for quantile contours and for controlling family-wise error in exploratory analyses.

Second, the bandwidth selection problem, while theoretically characterized through the mean squared error optimal rate, lacks a fully data-driven implementation tailored to the geometric quantile loss, see [64,65,66]. Plug-in rules, cross-validation procedures that target the quantile risk, and Lepski-type adaptive methods warrant systematic development, with particular attention to their behavior near the boundary where the effective sample size exhibits spatial heterogeneity.

Third, the extension to higher-dimensional covariates raises important questions about dimension reduction and structural assumptions. For compositional covariates on the simplex, the Dirichlet construction remains valid for arbitrary dimension, but the effective local sample size decays rapidly, necessitating either sparsity-inducing penalties, low-dimensional index models, or dimension reduction techniques that respect the compositional geometry. The interplay between the simplex dimension and the boundary codimension in determining optimal rates merits further investigation.

Fourth, the framework currently assumes independence across observations, yet many potential applications involve temporal dependence, spatial correlation, or network-structured data. Extending the Bahadur representation and central limit theorem to weakly dependent processes under appropriate mixing conditions would substantially broaden the estimator’s applicability to time series econometrics and spatial statistics.

Fifth, while the geometric quantile formulation inherently provides robustness through the use of Euclidean norms, extreme directions approaching the boundary of the unit ball remain sensitive to outliers and heavy-tailed phenomena. Robustification through Huberized losses or redescending influence functions, while preserving first-order asymptotic properties, could enhance stability in extremal inference.

Sixth, the compositional nature of the covariate space suggests deeper connections with Aitchison geometry. Integrating log-ratio transformations with the Dirichlet weighting scheme could yield estimators that are invariant under the natural operations of compositional data analysis, enhancing interpretability in geochemical, ecological, and economic applications where compositions arise naturally. This direction appears especially promising in light of the GEMAS application, where the covariates arise precisely from normalized soil-composition variables and where relative rather than absolute dominance between components may carry the primary scientific signal.

Seventh, incomplete data mechanisms—missing at random, missing not at random, and censoring—are pervasive in practice. Adapting the estimator to such settings under appropriate identification conditions, and establishing semiparametric efficiency bounds, would extend its utility to survival analysis and longitudinal studies with attrition.

Finally, nonasymptotic analysis in the form of concentration inequalities and finite-sample coverage guarantees for the confidence ellipsoids would complement the asymptotic theory and provide guidance for practice in moderate sample sizes. Such results typically require stronger tail conditions but yield valuable insights into the estimator’s behavior beyond the first-order asymptotics.

6.4. Concluding Remarks

Boundary-respecting Dirichlet kernels provide a principled and effective vehicle for multivariate conditional quantile estimation on the simplex. The resulting estimators admit precise asymptotics, are computationally viable, and exhibit robust boundary performance, positioning the approach as a sound default for geometric conditional inference. By unifying the geometric quantile paradigm with location-adaptive smoothing, this work bridges two previously disparate literatures and opens new perspectives for robust conditional analysis in econometric, financial, environmental, and stochastic modeling applications.

The real-data analysis presented in Section 5 gives this conclusion a concrete empirical foundation. It shows that the proposed methodology is capable of extracting meaningful and nontrivial scientific information from compositional environmental data: it identifies opposite conditional trends for sand- and silt-dominated soils, reveals directional asymmetry in the joint conditional behavior of

Z n_{XRF}

and

C u_{XRF}

, and captures nonlinear variation that is obscured by more classical summaries. These findings illustrate that the theoretical developments of the paper translate into genuine inferential gains in practice. In that sense, the GEMAS application does not merely serve as an illustration; it confirms the operational relevance of the proposed framework and demonstrates that boundary-adaptive geometric conditional quantiles can provide refined and interpretable insight in realistic multivariate regression problems.

The theoretical foundations established herein—the Bahadur expansion, the boundary-dependent variance regimes, the optimal bandwidth characterization, and the asymptotic normality with ellipsoidal inference—therefore acquire additional significance when viewed through the lens of the real-data evidence. The proposed estimator is not only mathematically well founded, but also practically informative in settings where the covariates are compositional and the response is multivariate. As such, this paper contributes both a rigorous asymptotic theory and an applied statistical methodology for recovering directional conditional structure on constrained domains, and it is our hope that it will stimulate further developments at the interface of geometric quantile theory, compositional data analysis, and boundary-adaptive nonparametric inference.

7. Proofs of the Main Results

A.1 Some Lemmas

We begin by introducing an essential inequality, which extends the Fact 5.1 [6] to the multivariate setting. This result forms a cornerstone for the derivations that follow.

Fact

Let

Z_{1}, Z_{2}, \dots, Z_{n}

be a sequence of d-dimensional i.i.d. random vectors and let

p (y_{1}, y_{2}, \dots, y_{m})

be a symmetric d-dimensional kernel such that

∥ p (\cdot) ∥ \leq M

for a positive constant M. Assume that

E [p (Z_{1}, Z_{2}, \dots, Z_{m})] = 0

and

Var (p (Z_{1}, Z_{2}, \dots, Z_{m})) = {(σ_{i j})}_{d \times d}

. Define the U-statistic as

U_{n} = \frac{m! (n - m)!}{n!} \sum_{1 \leq i_{1} < i_{2} < \dots < i_{m} \leq n} p (Z_{i_{1}}, Z_{i_{2}}, \dots, Z_{i_{m}}) .

Then, for each

t > 0

, we have the following

\begin{matrix} P (∥ U_{n} ∥ \geq t) \leq 2 d exp \{- \frac{⌊\frac{n}{m}⌋ t^{2}}{2 d^{2} {max}_{1 \leq l \leq d} σ_{l l} + 2 d M t / 3}\}, \end{matrix}

(20)

where

⌊\frac{n}{m}⌋

is the integral part of

\frac{n}{m}

.

The proof of Theorem 1 proceeds via the following sequence of lemmas. These lemmas are of independent interest in characterizing the properties of the geometric conditional quantile estimator ${\hat{Q}}_{n} (u ∣ x)$ . In particular, we first establish that ${\hat{Q}}_{n} (u ∣ x)$ is asymptotically bounded by some constant with probability one. Subsequently, we refine this bound and derive the Bahadur-type representation under Conditions 1–7. We now state the following lemmas; their proofs are given in Section 8.

Lemma 1.

Under Conditions A.2–A.4, there exists a constant

K_{1} = K_{1} (u) > 0

such that

∥{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)∥ ⩽ K_{1} .

holds almost surely for all sufficiently large n.

Lemma 2

(see [59]). Asymptotic Behavior of

A_{b} (x)

, as

b \to 0

uniformly for

x \in S_{s, 1}

where for all

b > 0

,

\begin{matrix} 0 < A_{b} (x) \leq \frac{b^{(s + 1) / 2} {(1 / b + s)}^{s + 1 / 2}}{{(4 π)}^{s / 2} \sqrt{(1 - ∥ x ∥_{1})} \prod_{i \in [s]} x_{i}} (1 + O (b)) . \end{matrix}

Furthermore, for any subset

\emptyset \neq J \subseteq [s]

, and any

κ \in {(0, \infty)}^{s}

,

A_{b} (x) = \{\begin{matrix} b^{- s / 2} ψ (x) (1 + O_{x} (b)), \\ i f x_{i} / b \to {\infty \forall i \in [s] a n d (1 - ∥ x ∥}_{1}) / b \to \infty, \\ b^{- (s + | J |) / 2} ψ_{J} (x) \prod_{i \in J} \frac{Γ (2 κ_{i} + 1)}{2^{2 κ_{i} + 1} Γ (κ_{i} + 1)} \cdot (1 + O_{κ, s} (b)), \\ i f x_{i} / b \to κ_{i} \forall i \in J a n d x_{i} / b \to {\infty \forall i \in [d] ∖ J, a n d (1 - ∥ x ∥}_{1}) / b \to \infty, \end{matrix}

where

[s] \equiv {1, 2, \dots, s}

,

ψ (x)

and

ψ_{J} (x)

are defined as in (11).

Lemma 3.

If

α_{1}, \dots, α_{s}, β \geq 2

, (see [59]) then

\begin{matrix} sup_{x \in S_{s, 1}} K_{α, β} (x) \leq \sqrt{\frac{{∥ α ∥}_{1} + β - 1}{(β - 1) \prod_{i \in [s]} (α_{i} - 1)}} {(∥ α ∥}_{1} {+ β - s - 1)}^{s} . \end{matrix}

(21)

Lemma 4.

Assume that Conditions A.1–A.4 hold. For some constant

α > 0

, let

B_{n}

be the subset of

R^{d}

defined as

B_{n} = \{{(v_{1} - q_{1}, \dots, v_{d} - q_{d})}^{T} ∣ [n^{α}] (v_{i} - q_{i}) = a n i n t e g e r, |v_{i} - q_{i}| ⩽ K_{1} for all 1 ⩽ i ⩽ d\} .

Then there exists a constant

K_{2} > 0

such that

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥\sum_{i = 1}^{n} [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x)) - E [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x))]]∥ \\ ⩽ ⩽ K_{2} \sqrt{\frac{log n}{n b^{s / 2}}} . \end{matrix}

If $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥\sum_{i = 1}^{n} [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x)) - E [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x))]]∥ \\ ⩽ K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}, \end{matrix}

holds almost surely for all sufficiently large n.

Lemma 5.

It holds almost surely that

∥\sum_{i = 1}^{n} w_{n, i} (U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) + u)∥ ⩽ max_{1 ⩽ i ⩽ n} w_{n, i} .

(22)

Lemma 6.

Under Conditions A.1–A.4, there exists a constants

K_{4}, K_{5} > 0

such that

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

max_{θ \in B_{n}} |\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})}| ⩽ K_{4} (\frac{log n}{n b^{s / 2}}) .

(23)

If $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

max_{θ \in B_{n}} |\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})}| ⩽ K_{5} (\frac{log n}{n b^{(s + | J |) / 2}}),

(24)

holds almost surely for all sufficiently large n and

β ⩾ γ / d

.

Lemma 7.

Under Conditions A.2–A.4, it holds that

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$

\frac{\sum_{i = 1}^{n} K_{α, β} (X_{i})}{n E (K_{α, β} (X))} - 1 = O (\sqrt{\frac{log n}{n b^{s / 2}}}), a . s .

if $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

\frac{\sum_{i = 1}^{n} K_{α, β} (X_{i})}{n E (K_{α, β} (X))} - 1 = O (\sqrt{\frac{log n}{n b^{(s + | J |) / 2}}}), a . s .

From the lemmas above, we can derive the convergence rate of the estimated

u

-th geometric conditional quantile

{\hat{Q}}_{n} (u ∣ x) .

In the remainder of this subsection, the following constants are assumed to satisfy

(1 + \frac{1}{d}) γ \leq β + γ \leq α .

(25)

Lemma 8.

Under Conditions A.1–A.7, and the standing assumptions

x_{i} / b \to \infty (i = 1, \dots, d), (1 - {∥ x ∥}_{1}) / b \to \infty,

there exists a constant

K_{7} > 0

such that almost surely

∥{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)∥ ⩽ K_{7} {(\frac{log n}{n b^{s / 2}})}^{1 / 2},

for all sufficiently large n.

In view of Lemma 7, (98) and (99), the following lemma can be obtained by a similar argument as the previous Lemma 4.

Lemma 9.

Under Conditions A.2–A.5, it holds with probability one that

\sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) = o (\frac{log n}{n b^{s / 2}}) .

Recalling Lemmas 7 and 9, we get that

∥\sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) (\frac{\sum_{j = 1}^{n} K_{α, β} (X_{j})}{n E (K_{α, β} (X))} - 1)∥ = o (\frac{log n}{n b^{s / 2}}) .

(26)

Remark 3.

By a differentiation argument, we have that, provided the density function

f (\cdot)

is continuous and

f (x) > 0

, (see [59]) then:

\begin{matrix} E [K_{α, β} (X)] = E [f (ξ_{x})] = f (x) + b g (x) + O (b), \end{matrix}

(27)

such that g is defined in (10): where

\begin{matrix} ξ_{x} = (ξ_{1}, \dots, ξ_{d}) \sim {D i r i c h l e t (x / b + 1, (1 - ∥ x ∥}_{1}) / b + 1), x \in S_{d, 1} . \end{matrix}

If

γ_{x} \sim {D i r i c h l e t (2 x / b + 1, 2 (1 - ∥ x ∥}_{1}) / b + 1)

, then:

\begin{matrix} E [K_{α, β}^{2} (X)] = A_{b} (x) E (f (γ_{x})) . \end{matrix}

(28)

Remark 4.

Since

ξ_{x}

follows a Beta distribution, which implies that:

\begin{matrix} E [ξ_{x_{i}}] - x_{i} & = & b (1 - (s + 1) x_{i}) + O (b^{2}), \\ C o v (ξ_{x_{i}}, ξ_{x_{j}}) & = & b x_{i} (1_{{i = j}} - x_{j}) + O (b^{2}), \\ E [(ξ_{x_{i}} - x_{i}) (ξ_{x_{j}} - x_{j})] & = & b x_{i} (1_{{i = j}} - x_{j}) + O (b^{2}) . \end{matrix}

Proof of Theorem 1.

We have the following relationships:

\begin{matrix} \sum_{i = 1}^{n} w_{n, i} (U (Y_{i} - Q (u ∣ x)) + u) \\ = Λ_{n} (θ_{n}^{*}) + \frac{\sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y_{i} - θ_{n}^{*} - Q (u ∣ x)) + u)}{n E (K_{α, β} (X))} \\ - Λ_{n} (θ_{n}^{*}) - \sum_{i = 1}^{n} w_{n, i} (U (Y_{i} - Q (u ∣ x)) + u) \\ \times (\frac{\sum_{j = 1}^{n} K_{α, β} (X_{j})}{n E (K_{α, β} (X))} - 1) + D_{1} θ_{n}^{*} . \end{matrix}

Accordingly, from (25), (96), and (98)

∥θ_{n}^{*} - {\hat{Q}}_{n} (u ∣ x) + Q (u ∣ x)∥ = o (n^{- α}),

which confirms that Theorem 1 holds. □

Proof of Theorem 2.

We take the expectation of the Bahadur representation as defined in Equation (107),

\begin{matrix} E [{\hat{Q}}_{n} (u ∣ x)] - Q (u ∣ x) & = D_{1}^{- 1} \sum_{i = 1}^{n} E [w_{n i} (U (Y_{i} - Q (u ∣ x)) + u)] + E (R_{n}) \\ = \frac{D_{1}^{- 1}}{E (K_{α, β} (X_{1}))} E [K_{α, β} (X_{1}) (U (Y_{1} - Q (u ∣ x)) + u)] + R_{n} . \end{matrix}

(29)

Using the law of iterated expectations and condition A.3 Equation (5), we decompose:

\begin{matrix} E [K_{α, β} (X_{1}) (U (Y_{1} - Q (u ∣ x)) + u)] & = E [K_{α, β} (X_{1}) E [U (Y_{1} - Q (u ∣ x)) + u ∣ X = x)]] \\ = E [K_{α, β} (X_{1}) r (0, x)] \\ = \int_{S_{s, 1}} K_{α, β} (t) r (0, t) f (t) dt \\ = E [r (0, ξ_{x}) f (ξ_{x})], \end{matrix}

where

ξ_{x} \sim Dirichlet (α, β)

, and

R (0, x) = r (0, x) f (x)

, by second order Taylor expansion around

ξ_{x} = x

, we get

\begin{matrix} R (0, ξ_{x}) & = & R (0, x) + \sum_{i = 1}^{s} \frac{\partial R (0, x)}{\partial x_{i}} (ξ_{x_{i}} - x_{i}) + \frac{1}{2} \sum_{i = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i}^{2}} {(ξ_{x_{i}} - x_{i})}^{2} \\ + \sum_{i = 1}^{s} \sum_{j = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i} \partial x_{j}} (ξ_{x_{i}} - x_{i}) (ξ_{x_{j}} - x_{j}) . \end{matrix}

(30)

Using Remark 4 and applying the Cauchy–Schwartz inequality in Equation (30), we obtain uniformly on

x \in S_{s, 1}

:

\begin{matrix} | E [R (0, ξ_{x})] - R (0, x) | \\ = \sum_{i = 1}^{s} O (E [ξ_{x_{i}} - x_{i}]) + \frac{1}{2} \sum_{i = 1}^{s} O (E {[ξ_{x_{i}} - x_{i}]}^{2}) + \sum_{i = 1}^{s} \sum_{j = 1}^{s} O (E [(ξ_{x_{i}} - x_{i}) (ξ_{x_{j}} - x_{j})]) \\ \leq \sum_{i = 1}^{s} O (\sqrt{E {[ξ_{x_{i}} - x_{i}]}^{2}}) + \sum_{i = 1}^{s} O (b) + \sum_{i = 1}^{s} O (b^{2}) \\ \leq O (b^{1 / 2}) + O (b) + O (b^{2}) \\ \leq O (b^{1 / 2}) (1 + o (1)) . \end{matrix}

Consequently, we deduce that

sup_{x \in S_{s, 1}} | E [R (0, ξ_{x})) - R (0, x) | = O (b^{1 / 2}) .

(31)

We take the expectation of Equation (30),

\begin{matrix} E [R (0, ξ_{x})] & = & R (0, x) + \sum_{i = 1}^{s} \frac{\partial R (0, x)}{\partial x_{i}} b (1 - (s + 1) x_{i}) + \frac{1}{2} \sum_{i = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i}^{2}} (b x_{i} (1 - x_{i})) \\ + \sum_{i, j = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i} \partial x_{j}} (b x_{i} (1_{{i = j}} - x_{j})) + O (b^{1 / 2}) \\ = & R (0, x) + b \{\sum_{i = 1}^{s} \frac{\partial R (0, x)}{\partial x_{i}} (1 - (s + 1) x_{i}) + \frac{1}{2} \sum_{i = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i}^{2}} x_{i} (1 - x_{i})\} \\ + b \{\sum_{i, j = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i} \partial x_{j}} x_{i} (1_{{i = j}} - x_{j})\} + o (b^{1 / 2}) . \end{matrix}

(32)

For

b > 0, x \in S_{s, 1}, s \leq d

, we define

ζ_{s}

ζ_{s} = \sum_{i = 1}^{s} \frac{\partial R (0, x)}{\partial x_{i}} (1 - (s + 1) x_{i}) + \frac{1}{2} \sum_{i = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i}^{2}} x_{i} (1 - x_{i}) + \sum_{i, j = 1}^{s} \frac{\partial^{2} R (0, x)}{\partial x_{i} \partial x_{j}} x_{i} (1_{{i = j}} - x_{j}) .

(33)

Substituting (33) into (32), we obtain

E [R (0, ξ_{x})] = R (0, x) + b ζ_{s} + O (b^{1 / 2}) = R (0, x) f (x) + b ζ_{s} + O (b^{1 / 2})

(34)

Finally, from Equations (34) and (29), the bias of

{\hat{Q}}_{n} (u ∣ x)

, becomes:

\begin{matrix} E [{\hat{Q}}_{n} (u ∣ x)] - Q (u ∣ x) & = \frac{D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} (R (0, x) + b ζ_{s}) + o (b^{1 / 2}) + R_{n} \\ = \frac{D_{1}^{- 1}}{f (x) + b g (x)} (R (0, x) + b ζ_{s}) + o (b^{1 / 2}) + R_{n}, \end{matrix}

(35)

where

g (x)

is defined in Equation (10). □

Proof of Theorem 3.

Next, we examine the variance of the Bahadur representation introduced in Equation (107). Specifically, we consider

\begin{matrix} {\hat{Q}}_{n} (u | x) - E [{\hat{Q}}_{n} (u | x)] \\ = ({\hat{Q}}_{n} (u | x) - Q (u | x)) - (E [{\hat{Q}}_{n} (u | x)] - Q (u | x)) \\ = D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) - D_{1}^{- 1} \sum_{i = 1}^{n} E [w_{n i} (U (Y_{i} - Q (u ∣ x)) + u)] \\ = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} \sum_{i = 1}^{n} \{K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u) - E [K_{α, β} (X_{i}) (U (}} Y_{i} - Q (u ∣ x)) + u)]\} \\ = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} \sum_{i = 1}^{n} Z_{i, b}, \end{matrix}

(36)

where

Z_{i, b} : = K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u) - E [K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u)] .

For every

b > 0

the random variables

Z_{1, b}, Z_{1, b}, \dots, Z_{n, b}

are independent and identically distributed and centered, hence

\begin{matrix} Var [{\hat{Q}}_{n} (u | x)] & = & \frac{n^{- 1}}{E^{2} [K_{α, β} (X_{1})]} D_{1}^{- 1} E (∣ Z_{i, b} ∣^{2}) D_{1}^{- 1} \end{matrix}

(37)

\begin{matrix} E (∣ Z_{i, b} ∣^{2}) & = & E [K_{α, β}^{2} (X_{1}) (U (Y_{1} - Q (u ∣ x)) + u) {(U (Y_{1} - Q (u ∣ x)) + u)}^{⊤}] \\ - E^{2} [K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u)] . \end{matrix}

(38)

On the left-hand side of Equation (38), using the law of iterated expectations and condition A.7 (see Equation (7)), we decompose:

\begin{matrix} E [K_{α, β}^{2} (X_{1}) (U (Y_{1} - Q (u ∣ x)) + u) {(U (Y_{1} - Q (u ∣ x)) + u)}^{⊤}] \\ = E [K_{α, β}^{2} (X_{1}) E [(U (Y_{1} - Q (u ∣ x)) + u) {(U (Y_{1} - Q (u ∣ x)) + u)}^{⊤} ∣ X_{1} = x]] \\ = E [K_{α, β}^{2} (X_{1}) D_{x}] \\ = \int_{S_{s, 1}} K_{α, β}^{2} (z) D_{z} f (z) dz . \end{matrix}

On the left side of Equation (38), using a first-order Taylor expansion of the product

D_{z}

around

x

,

\begin{matrix} D_{z} : & = E [(U (Y_{1} - Q (u ∣ x)) + u) {(U (Y_{1} - Q (u ∣ x)) + u)}^{⊤} ∣ X_{1} = z] \\ = E [(U (Y_{1} - Q (u ∣ x)) + u) {(U (Y_{1} - Q (u ∣ x)) + u)}^{⊤} ∣ X_{1} = x] + O_{P} (∥ z - x ∥) \\ = D_{x} + O_{x} (1) . \end{matrix}

(39)

We then obtain

\begin{matrix} \int_{S_{s, 1}} K_{α, β}^{2} (z) D_{z} f (z) d z & = \int_{S_{s, 1}} K_{α, β}^{2} (z) (D_{x} + O_{x} (1)) f (z) d z \\ = (D_{x} + O_{x} (1)) \int_{S_{s, 1}} K_{α, β}^{2} (z) f (z) d z \\ = (D_{x} + O_{x} (1)) E [K_{α, β}^{2} (X)] . \end{matrix}

(40)

By combining Equations (37), (38), (40) and (28), we derive

\begin{matrix} Var [{\hat{Q}}_{n} (u | x)] & = \frac{n^{- 1}}{E^{2} [K_{α, β} (X)]} D_{1}^{- 1} \{(D_{x} + o_{x} (1)) A_{b} (x) D_{1}^{- 1}\} \\ - \frac{n^{- 1}}{E^{2} [K_{α, β} (X)]} D_{1}^{- 1} E^{2} [R (0, ξ_{x})] D_{1}^{- 1} \\ = \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} \{(D_{x} + o_{x} (1)) A_{b} (x) D_{1}^{- 1}\} \\ - \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} {(R (0, x) f (x) + b ζ_{s} + O (b^{1 / 2}))}^{2} D_{1}^{- 1} \\ = \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} \{(D_{x} + o_{x} (1)) A_{b} (x) D_{1}^{- 1}\} - o (n^{- 1}) \end{matrix}

(41)

Finally, applying Equations (27), (34), (41) and Lemma 2, we obtain

\begin{matrix} Var ({\hat{Q}}_{n} (u ∣ x)) = \{\begin{matrix} \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} \{b^{- s / 2} ψ (x) f (x) D_{1}^{- 1} D_{x} D_{1}^{- 1}\} - o (n^{- 1}) \\ + O_{x} (n^{- 1} b^{1 / 2}) + O_{x} (n^{- 1} b^{- s / 2}), if \frac{x_{i}}{b} \to \infty \forall i \in [d] and \frac{1 - {∥ x ∥}_{1}}{b} \to \infty, \\ \frac{n^{- 1}}{{(f (x) + b g (x))}^{2}} \{b^{- \frac{s + | J |}{2}} ψ_{J} (x) f (x) \prod_{i \in J} \frac{Γ (2 κ_{i} + 1)}{2^{κ_{i} + 1} Γ^{2} (κ_{i} + 1)} D_{1}^{- 1} D_{x} D_{1}^{- 1}\} \\ - o (n^{- 1}) + o_{κ, x} (n^{- 1} b^{- \frac{s + | J |}{2}}) - O_{x} (n^{- 1} b^{1 / 2}), \\ if \frac{x_{i}}{b} \to κ_{i} \forall i \in J, \frac{x_{i}}{b} \to \infty \forall i \in [s] ∖ J, and \frac{1 - {∥ x ∥}_{1}}{b} \to \infty . \end{matrix} \end{matrix}

where

ψ (\cdot)

and

g (\cdot)

are given by Equations (11) and (10), respectively. □

Proof of Corollary 1.

For the estimator

{\hat{Q}}_{n} (u ∣ x)

we define

MSE [{\hat{Q}}_{n} (u ∣ x)] = I E [{| {\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) |}^{2}] = Var [{\hat{Q}}_{n} (u ∣ x)] + {Bias}^{2} [{\hat{Q}}_{n} (u ∣ x)] .

(42)

Under the standing assumptions

x_{i} / b \to \infty (i = 1, \dots, d), (1 - {∥ x ∥}_{1}) / b \to \infty,

the MSE admits the expansion

\begin{matrix} MSE [{\hat{Q}}_{n} (u ∣ x)] & = \frac{1}{n} b^{- s / 2} \{\frac{ψ (x) f (x)}{{[f (x) + b g (x)]}^{2}}\} D_{1}^{- 1} D_{x} D_{1}^{- 1} \\ + D_{1}^{- 1} {\{\frac{b ζ_{s}}{f (x) + b g (x)}\}}^{2} D_{1}^{- 1} + o_{x} (n^{- 1} b^{1 / 2}) + o_{x} (n^{- 1} b^{- s / 2}) + o_{x} (b^{2}) . \end{matrix}

(43)

Noting that where

E (K_{α, β} (X)) \sim f (x) and Bias [{\hat{Q}}_{n} (u | x)] \sim D_{1}^{- 1} {[\frac{b ζ_{s}}{f (x) + b g (x)}]}^{2} D_{1}^{- 1}

, take the derivative of MSE with respect to b:

\frac{\partial}{\partial b} MSE [{\hat{Q}}_{n} (u ∣ x)] = - \frac{s}{2 n} b^{- s / 2 - 1} D_{1}^{- 1} D_{x} D_{1}^{- 1} \frac{ψ (x)}{f (x)} + \frac{2 b}{f^{2} (x)} D_{1}^{- 1} ζ_{s} ζ_{s}^{⊤} D_{1}^{- 1} .

(44)

Setting (44) to zero yields the optimal bandwidth

b_{opt} ≃ n^{- 2 / (s + 4)} {(\frac{\frac{4}{s} D_{1}^{- 1} ζ_{s} ζ_{s}^{⊤} D_{1}^{- 1}}{ψ (x) f (x) D_{1}^{- 1} D_{x} D_{1}^{- 1}})}^{2 / (s + 4)},

(45)

Equations (43)–(45) fully describe the leading bias–variance trade–off and the corresponding bandwidth that minimises the MSE. □

Proof of Theorem 4.

From Theorem 1, with

r_{n} (x) = \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u),

we infer the linear decomposition

{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) = D_{1}^{- 1} r_{n} (x) + R_{n} .

(46)

Let

ξ_{1}, \dots, ξ_{n}

be i.i.d.,

E [| ξ_{1} |^{3}] < \infty

. Lemma 2 of Devroye [67] asserts

sup_{a \in R} |E [({\bar{ξ}}_{n} - E {\bar{ξ}}_{n}) - a \sqrt{Var ({\bar{ξ}}_{n})}] - \sqrt{Var ({\bar{ξ}}_{n})} E [Z - a]| \leq \frac{c_{0} E [| ξ_{1} - E ξ_{1} |^{3}]}{n Var (ξ_{1})},

(47)

with

Z \sim N (0, 1)

and

{\bar{ξ}}_{n} = n^{- 1} \sum_{i = 1}^{n} ξ_{i}

. Fix

x \in Int (S_{s, 1})

and set

ξ_{i} : = K_{x / b + 1, (1 - {∥x∥}_{1}) / b + 1} (X_{i}), a^{★} (x) : = \frac{Q (u ∣ x) - E [{\hat{Q}}_{n} (u ∣ x)]}{\sqrt{Var [{\hat{Q}}_{n} (u ∣ x)]}} .

Then (47) yields

|E [{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)] - \sqrt{Var [{\hat{Q}}_{n} (u ∣ x)]} E [Z - a^{★} (x)]| \leq c_{1} n^{- 1} b^{- s / 2} D_{x} ψ (x),

(48)

for some

c_{1} = c_{1} (s) > 0

. As

n \to \infty

,

\frac{E [| ξ_{1} - E ξ_{1} |^{3}]}{Var (ξ_{1})} \leq 4 \frac{E [ξ_{1}^{3}] + {(E ξ_{1})}^{3}}{E [ξ_{1}^{2}] - {(E ξ_{1})}^{2}} = 4 \frac{E [ξ_{1}^{3}]}{E [ξ_{1}^{2}]} + O (1),

(49)

by Jensen’s inequality. Arguing as in the proof of Theorem 2 (via Lemma 2) one shows

\frac{E [ξ_{1}^{3}]}{E [ξ_{1}^{2}]} = {\tilde{A}}_{b} (x) (1 + O (b^{1 / 2})),

(50)

where

\begin{matrix} {\tilde{A}}_{b} (x) & : = & \frac{Γ (3 (1 - {∥x∥}_{1}) / b + 1)}{Γ (2 (1 - {∥x∥}_{1}) / b + 1) Γ ((1 - {∥x∥}_{1}) / b + 1)} \frac{\prod_{i \in [s]} Γ (3 x_{i} / b + 1)}{\prod_{i \in [s]} Γ (2 x_{i} / b + 1) Γ (x_{i} / b + 1)} \\ \times \frac{Γ (2 / b + s + 1) Γ (1 / b + s + 1)}{Γ (3 / b + s + 1)} . \end{matrix}

Following the initial steps of Lemma 1 in [59],

{\tilde{A}}_{b} (x) \leq \frac{b^{- s / 2} (1 + O (b))}{{(3 π)}^{s / 2} \sqrt{(1 - {∥x∥}_{1}) \prod_{i \in [s]} x_{i}}} .

(51)

Combining (46)–(51) gives the desired bound (48).

Combining the preliminary bounds (47), (49), (50) and (51) immediately yields the intermediate result (48). Invoking (48), the triangle inequality, and the fact that $ψ \in L^{1} (S_{s, 1})$ , we obtain

\begin{matrix} |MIAE [{\hat{Q}}_{n} (u ∣ x)] - \int_{S_{s, 1}} w (x) E |Z - \frac{D_{1}^{- 1} (r (0, x) f (x) + b ζ_{s})}{f (x) + b g (x)}| d x| \\ \leq \int_{S_{s, 1}} |\sqrt{Var ({\hat{Q}}_{n} (u ∣ x))} E | Z - a^{*} (x) | - w (x) E |Z - \frac{D_{1}^{- 1} (r (0, x) f (x) + b ζ_{s})}{f (x) + b g (x)}|| d x \\ + c_{2} n^{- 1} b^{- s / 2} . \end{matrix}

(52)

where the weight is

w (x) : = n^{- 1 / 2} b^{- s / 4} \sqrt{\frac{ψ (x) f (x)}{f (x) + b g (x)}} D_{x}^{1 / 2} D_{1}^{- 1}, c_{2} = c_{2} (s) > 0 .

Lemma 7 of Devroye and Györfi [67] asserts that, for all

u, w \geq 0

and

v, z \in R

,

|u E |Z + \frac{v}{u}| - w E |Z - \frac{z}{w}|| \leq \sqrt{\frac{2}{π}} |- w| + |v - z| .

(53)

Application of (53). With the identifications

u = \sqrt{Var ({\hat{Q}}_{n} (u ∣ x))}, w = w (x), v = Bias [{\hat{Q}}_{n} (u ∣ x)], z = \frac{D_{1}^{- 1} (r (0, x) f (x) + b ζ_{s})}{f (x) + b g (x)},

the right–hand side of (52) is bounded by

\begin{matrix} c_{2} n^{- 1} b^{- s / 2} & + \int_{S_{s, 1}} |\sqrt{Var ({\hat{Q}}_{n} (u ∣ x))} - n^{- 1 / 2} b^{- s / 4} \sqrt{\frac{ψ (x) f (x)}{f (x) + b g (x)}} D_{x}^{1 / 2} D_{1}^{- 1}| d x \\ + \int_{S_{s, 1}} |Bias [{\hat{Q}}_{n} (u ∣ x)] - \frac{D_{1}^{- 1} (r (0, x) f (x) + b ζ_{s})}{f (x) + b g (x)}| d x . \end{matrix}

(54)

Using the pointwise variance from Lemma 2 and the bias expansion (35), the two integrals in (54) are, respectively,

o (n^{- 1 / 2} b^{- s / 4}), o (b^{1 / 2}) .

Hence

r . h . s . of (52) = o (n^{- 1} b^{- s / 2}) + o (n^{- 1 / 2} b^{- s / 4}) + o (b^{1 / 2}),

which establishes (13). Inequality (14) then follows directly from (13) together with the elementary bound

E | Z - u | \leq \sqrt{\frac{2}{π}} + | u |, u \in R .

This concludes the proof. □

Proof of Theorem 5.

We begin by establishing the following decomposition to prove the theorem:

{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) = {\hat{Q}}_{n} (u ∣ x) - E [{\hat{Q}}_{n} (u ∣ x)] + E [{\hat{Q}}_{n} (u ∣ x)] - Q (u ∣ x) .

From (35) and (36), we obtain:

\begin{matrix} {\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) & = {\hat{Q}}_{n} (u ∣ x) - E [{\hat{Q}}_{n} (u ∣ x)] + \frac{D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} (R (0, x) + b ζ_{s}) \\ = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} \sum_{i = 1}^{n} Z_{i, b} + \frac{D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} (R (0, x) + b ζ_{s}) . \end{matrix}

(55)

Therefore, from (55), it is easy to see

\begin{matrix} {\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) - \frac{D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} (R (0, x) + b ζ_{s}) & = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} \sum_{i = 1}^{n} Z_{i, b} . \end{matrix}

We apply Lindeberg’s Central Limit Theorem to the triangular array

X_{i n} = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} Z_{i, b},

and verify the Lindeberg condition. For every

ε > 0

, we must show

\begin{matrix} \frac{1}{σ_{n}^{2}} \sum_{i = 1}^{n} E [X_{i n}^{2} 1 \{| X_{i n} | > ε σ_{n}\}] \underset{n \to \infty}{\to} 0 . \end{matrix}

(56)

From Theorem (2), Under the standing assumptions

x_{i} / b \to \infty (i = 1, \dots, d), (1 - {∥ x ∥}_{1}) / b \to \infty,

we have the variance expression:

\begin{matrix} σ_{n}^{2} = \sum_{i = 1}^{n} Var (X_{i n}) & = \sum_{i = 1}^{n} Var (\frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} Z_{i, b}) \\ = n^{- 1} b^{- s / 2} \frac{ψ (x) f (x)}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} D_{x} D_{1}^{- 1} \end{matrix}

We estimate the upper bound for

|X_{i n}|

:

\begin{matrix} |X_{i n}| & = |\frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} Z_{i, b}| \\ = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} |Z_{i, b}| \\ = \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} |K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u) - E [K_{α, β} (X_{i}) (U (Y_{i} - Q (u ∣ x)) + u)]| . \end{matrix}

Using Lemma 3 and (69), we obtain

\begin{matrix} |X_{i n}| & ⩽ \frac{n^{- 1} D_{1}^{- 1}}{E [K_{α, β} (X_{1})]} C_{x} b^{- s / 2} \\ = o (n^{- 1} b^{- s / 2}) \end{matrix}

Hence, the Lindeberg condition (56) becomes:

\begin{matrix} \frac{1}{σ_{n}^{2}} \sum_{i = 1}^{n} E [X_{i n}^{2} & 1 \{| X_{i n} | > ε σ_{n}\}] \\ ⩽ \frac{1}{σ_{n}^{2}} \sum_{i = 1}^{n} E [∣ n^{- 1} C b^{- s / 2} ∣^{2} 1 {∣ n^{- 1} C b^{- s / 2} ∣ > ε σ_{n}}], \end{matrix}

for large n, we have

\begin{matrix} \frac{n^{- 1} C b^{- s / 2}}{σ_{n}} & = \frac{n^{- 1} C b^{- s / 2}}{\sqrt{n^{- 1} b^{- s / 2} \frac{ψ (x) f (x)}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} D_{x} D_{1}^{- 1}}} \\ = n^{- 1 / 2} b^{- s / 4} (\frac{C (f (x) + b g (x))}{\sqrt{ψ (x) f (x) D_{1}^{- 1} D_{x} D_{1}^{- 1}}}) \\ = O_{x} (n^{- 1 / 2} b^{- s / 4}) \to 0, \end{matrix}

whenever

n^{1 / 2} b^{s / 4} \to \infty as n \to \infty and b \to 0,

the Lindeberg condition holds, since for any fixed

ε > 0

the indicator

1 \{| X_{i n} | > ε σ_{n}\}

vanishes for all sufficiently large n. Hence, we obtain

\frac{1}{σ_{n}^{2}} \sum_{i = 1}^{n} E [X_{i n}^{2} 1 \{| X_{i n} | > ε σ_{n}\}] \underset{n \to \infty}{\to} 0 .

which confirms that the Lindeberg condition is satisfied. Consequently, we conclude that

n^{1 / 2} b^{s / 4} ({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) - \frac{D_{1}^{- 1}}{f (x) + b g (x)} (R (0, x) + b ζ_{s})) ↪_{n \to \infty}^{D} N (0, Σ (x)),

where

Σ (x) : = \frac{ψ (x) f (x)}{{(f (x) + b g (x))}^{2}} D_{1}^{- 1} D_{x} D_{1}^{- 1} .

□

8. Proof of the Technical Lemmas

Proof of Lemma 1.

We begin by establishing that there exists a suitable constant

K_{1}^{*} > 0

such that:

\begin{matrix} P (∥Y∥ > \frac{K_{1}^{*}}{4} | X = x) \leq \frac{1 - ∥u∥}{3 + ∥u∥} . \end{matrix}

(57)

From Theorem 4.2 of [68], if we view the regression function there as the conditional probability:

P (∥ Y ∥ > \frac{K_{1}^{*}}{4} | X = x) .

then the following asymptotic relationship holds:

\begin{matrix} \sum_{i = 1}^{n} w_{n i} I (∥ Y_{i} ∥ > K_{1}^{*} / 4) \to P (∥ Y ∥ > \frac{K_{1}^{*}}{4} | X = x), a s n \to \infty . \end{matrix}

(58)

By the definition of

{\hat{Q}}_{n} (u ∣ x)

given in (2), we set

{\tilde{L}}_{n} (θ) : = \sum_{i = 1}^{n} (Φ (u, Y_{i} - θ) - Φ (u, Y_{i})) K_{α, β} (X_{i}), \forall θ \in R^{d} .

Then

{\tilde{L}}_{n} (θ + q) - {\tilde{L}}_{n} (q) : = \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u | x)) - Φ (u, Y_{i} - Q (u | x))] K_{α, β} (X_{i}) .

From definition

Φ (u, \cdot)

, we obtain

\begin{matrix} |Φ (u, Y_{i} - θ - Q (u | x)) - Φ (u, Y_{i} - Q (u | x))| \\ = ∥Y_{i} - θ - Q (u | x))∥ + 〈 u, Y_{i} - θ - Q (u | x) 〉 - ∥Y_{i} - Q (u | x))∥ - 〈 u, Y_{i} - Q (u | x) 〉 \\ ⩽ ∥Y_{i} - θ - Q (u | x))∥ - ∥Y_{i} - Q (u | x))∥ + |〈 u, θ 〉| \\ ⩽ ∥Y_{i} - Q (u | x))∥ + ∥θ∥ - ∥Y_{i} - Q (u | x))∥ + ∥u∥ ∥θ∥ \\ = (1 + ∥ u ∥) ∥ θ ∥ . \end{matrix}

(59)

From (59), it holds with probability one. It is easy to see

\begin{matrix} | \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u ∣ x)) & - Φ (u, Y_{i} - Q (u ∣ x))] K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}} | \\ \leq (1 + ∥ u ∥) ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}} \\ = (1 + ∥ u ∥) ∥ θ ∥ \sum_{j = 1}^{n} K_{α, β} (X_{j}) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}} . \end{matrix}

(60)

Also, from definition

Φ (u, \cdot)

, we obtain

\begin{matrix} Φ (u, Y_{i} - θ - Q (u ∣ x)) - Φ (u, Y_{i} - Q (u ∣ x)) \\ = ∥Y_{i} - θ - Q (u | x))∥ - ∥Y_{i} - Q (u | x))∥ + 〈 u, θ 〉 \\ \geq \underset{Φ (u, θ)}{\underset{︸}{∥ θ ∥ + 〈 u, θ 〉}} > \frac{1}{2} (∥ θ ∥ + 〈 u, θ 〉) \\ \geq \frac{1}{2} (∥ θ ∥ - ∣ 〈 u, θ 〉) ∣ \geq \frac{1}{2} (∥ θ ∥ - ∥ u ∥ ∥ θ ∥) \\ \geq \frac{1}{2} (1 - ∥ u ∥) ∥ θ ∥ . \end{matrix}

(61)

From (61), it holds with probability one. It is easy to see

\begin{matrix} \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u ∣ x)) & - Φ (u, Y_{i} - Q (u ∣ x))] K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} \\ \geq \frac{1}{2} (1 - ∥ u ∥) ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} \\ = \frac{1}{2} (1 - ∥ u ∥) ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} . \end{matrix}

(62)

From (60) and (62) above, we know that if

Φ (u, θ) > K_{1}^{*}

, it holds that

\begin{matrix} \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u | x)) - Φ (u, Y_{i} - Q (u | x))] K_{α, β} (X_{i}) \\ = \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u | x)) - Φ (u, Y_{i} - Q (u | x))] K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} \\ + \sum_{i = 1}^{n} [Φ (u, Y_{i} - θ - Q (u | x)) - Φ (u, Y_{i} - Q (u | x))] K_{α, β} (X_{i}) I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}} \\ > \frac{1}{2} (1 - ∥ u ∥) ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} \\ - (1 + ∥ u ∥) ∥ θ ∥ K_{α, β} (X_{i}) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}} \\ = ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \{\frac{1}{2} (1 - ∥ u ∥) ∥ \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ \leq K_{1}^{*} / 4}} - (1 + ∥ u ∥) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}}\} \\ = ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \{\frac{1}{2} (1 - ∥ u ∥) ∥ (1 - \frac{1 - ∥ u ∥}{3 + ∥ u ∥}) - (1 + ∥ u ∥) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}}\} \\ = ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \{\frac{1}{2} (1 - ∥ u ∥) ∥ \frac{2 (1 + ∥ u ∥)}{3 + ∥ u ∥} - (1 + ∥ u ∥) \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}}\} \\ = (1 + ∥ u ∥) ∥ θ ∥ \sum_{i = 1}^{n} K_{α, β} (X_{i}) \{\underset{0}{\underset{︸}{\frac{1 - ∥ u ∥}{3 + ∥ u ∥} - \sum_{i = 1}^{n} w_{n i} I_{{∥ Y_{i} - Q (u ∣ x) ∥ > K_{1}^{*} / 4}}}}\} = 0 . \end{matrix}

(63)

Then, if

Φ (u, θ) > K_{1}^{*}

, we obtain

\begin{matrix} {\tilde{L}}_{n} (θ + q) - {\tilde{L}}_{n} (q) & = \sum_{i = 1}^{n} (Φ (u, Y_{i} + θ - Q (u ∣ x)) - Φ (u, Y_{i} - Q (u ∣ x))) K_{α, β} (X_{i}) > 0 . \end{matrix}

Hence,

\begin{matrix} {\tilde{L}}_{n} (θ + q) > {\tilde{L}}_{n} (q) . \end{matrix}

(64)

However, by the definition of

{\hat{Q}}_{n} (u ∣ x)

, we have

\begin{matrix} {\tilde{L}}_{n} ({\hat{q}}_{n}) \leq {\tilde{L}}_{n} (q) . \end{matrix}

(65)

Let

θ = {\hat{q}}_{n} - q

and using (64) and (65), we reach a contradiction, and hence conclude that

Φ (u, {\hat{Q}}_{n} (u ∣ x) - Q (u | x)) \leq K_{1}^{*} .

From definition

Φ (u, \cdot)

, it easy see to

\begin{matrix} Φ (u, {\hat{Q}}_{n} (u ∣ x) - Q (u | x)) & = ∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ + 〈 u, {\hat{Q}}_{n} (u ∣ x) - Q (u | x) 〉 \\ \geq ∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ - ∥ u ∥ ∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ \\ = (1 - ∥ u ∥) ∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ . \end{matrix}

(66)

From Equation (66), and by applying the Cauchy–Schwarz inequality, we know that

\begin{matrix} (1 - ∥ u ∥) ∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ ⩽ Φ (u, {\hat{Q}}_{n} (u ∣ x) - Q (u | x)) . \end{matrix}

Then

∥ {\hat{Q}}_{n} (u ∣ x) - Q (u | x) ∥ ⩽ \frac{K_{1}^{*}}{1 - ∥ u ∥} : = K_{1} .

Further, we aim to prove that the uth geometric conditional quantile

{\hat{Q}}_{n} (u ∣ x)

converges at a specific rate stated in Lemma 8 below. For simplicity of presentation, let

Q (u | x) = {(q_{1}, \dots, q_{d})}^{⊤}

and let

C > 0

denote a constant which may take different values in different places. □

Proof of Lemma 4.

First note that there is a constant

γ_{1} > 0

depending only on

K_{1}

and the dimension d such that

\begin{matrix} | B_{n} | ⩽ γ_{1} n^{α d}, \end{matrix}

(67)

Moreover, it can be shown directly that

\begin{matrix} E (K_{α, β} (X)) \sim f (x) and E (K_{α, β}^{2} (X)) \sim A_{b} (x) f (x) \end{matrix}

(68)

Since $| U (\cdot) | \leq 1$ , and by Lemma 3 where $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ , there exists $C_{x} > 0$ such that

$\begin{matrix} |K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x)) - E [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x))]| \\ ⩽ sup_{x \in S_{s, 1}} K_{α, β} (x) + |E [K_{α, β} (X_{i})]| \\ ⩽ C b^{- s / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in [s]} x_{i}}} \\ ⩽ C_{x} b^{- s / 2} = O_{x} (b^{- s / 2}) \end{matrix}$

(69)

If

\frac{x_{i}}{b} \to κ_{i} \forall i \in J, \frac{x_{i}}{b} \to \infty \forall i \in [s] ∖ J, and \frac{1 - {∥ x ∥}_{1}}{b} \to \infty

, there exists

C_{x, κ} > 0

such that

\begin{matrix} |K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x)) - E [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x))]| \\ ⩽ sup_{x \in S_{s, 1}} K_{α, β} (x) + |E [K_{α, β} (X_{i})]| \\ ⩽ C b^{- (s + | J |) / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i}}} \\ ⩽ C_{x, κ} b^{- (s + | J |) / 2} = O_{x, κ} (b^{- (s + | J |) / 2}) \end{matrix}

(70)

Notation. Here $ψ (x)$ and $ψ_{J} (x)$ are defined as in (11).

Let $E_{1 n}$ defined by

\begin{matrix} E_{1 n} : = \sum_{i = 1}^{n} \{K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x)) - E [K_{α, β} (X_{i}) U (Y_{i} - θ - Q (u ∣ x))\}, \forall θ \in B_{n} . \end{matrix}

According to Fact (20) and (68), for some constant

C = C (x)

, it holds that:

\begin{matrix} P (∥E_{1 n}∥ \geq n t E (K_{α, β} (X))) & \leq 2 d exp \{- \frac{n {(t E (K_{α, β} (X)))}^{2}}{2 d^{2} E [K_{α, β}^{2} (X)] + \frac{2}{3} d M t E (K_{α, β} (X))}\} . \end{matrix}

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ , (see Lemma 3),

Set $t : = K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}$ , we obtain:

\begin{matrix} P (∥ E_{1 n} ∥ \geq n E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}) \\ \leq 2 d exp \{- \frac{n {(E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}})}^{2}}{2 d^{2} E [K_{α, β}^{2} (X)] + \frac{2}{3} d M E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}}\} . \end{matrix}

From (68), the original exponent become:

\begin{matrix} exp \{- n \frac{f^{2} (x) K_{2}^{2} (\frac{log n}{n b^{s / 2}})}{2 d^{2} b^{- s / 2} ψ (x) f (x) + \frac{2}{3} d C_{x} b^{- s / 2} f (x) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}}\} . \end{matrix}

(71)

Thus, we obtain the following equivalent formulation of (71)

\begin{matrix} exp \{- \frac{f (x) K_{2}^{2} (log n)}{2 d^{2} ψ (x) + \frac{2}{3} d C_{x} K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}}\}, \end{matrix}

(72)

where

n \to \infty, b \to 0,

and

n b^{s / 2} / log n \to \infty

, it follows that

\frac{log n}{n b^{s / 2}} \to 0

. Therefore, we can simplify Equation (72) as follows:

\begin{matrix} exp \{- \frac{f (x) K_{2}^{2} (log n)}{2 d^{2} ψ (x) + o (1)}\} \sim exp \{- \frac{f (x) K_{2}^{2}}{2 d^{2} ψ (x)} log n\} = n^{- \frac{f (x)}{2 d^{2} ψ (x)} K_{2}^{2}} . \end{matrix}

(73)

Then, where

C : = \frac{f (x)}{2 d^{2} ψ (x)} > 0

, we have

\begin{matrix} P (∥ E_{1 n} ∥ \geq n E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}) & \leq 2 d n^{- C K_{2}^{2}} . \end{matrix}

(74)

From the definition of

θ \in B_{n}

and Equations (67) and (74) it is easy to see that

\begin{matrix} P (max_{θ \in B_{n}} ∥ E_{1 n} ∥ \geq n E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}) & \leq 2 d | B_{n} | n^{- C K_{2}^{2}} \\ \leq 2 d γ_{1} n^{α d} n^{- C k_{2}^{2}} = 2 d γ_{1} n^{α d - C k_{2}^{2}} . \end{matrix}

Choose

K_{2}

, large enough such that

C K_{2}^{2} > α d

,we can obtain

\sum_{n = 1}^{\infty} P (max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ E_{1 n} ∥ \geq K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}) < \infty .

Accordingly, by the Borel–Cantelli

max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ E_{1 n} ∥ \leq K_{2} \sqrt{\frac{log n}{n b^{s / 2}}} .

if $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ , (see Lemma 3),

Set $t : = E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}$

\begin{matrix} P (∥ E_{1 n} ∥ \geq n E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}) \\ \leq 2 d exp \{- \frac{n {(E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}})}^{2}}{2 d^{2} E [K_{α, β}^{2} (X_{i})] + \frac{2}{3} d C_{x, κ} b^{- (s + | J |) / 2} E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}}\} . \end{matrix}

Proceeding along similar lines as above, and under the asymptotic conditions

n \to \infty, b \to 0, and \frac{n b^{\frac{s + | J |}{2}}}{log n} \to \infty,

we deduce that

\frac{log n}{n b^{- \frac{s + | J |}{2}}} = \frac{b^{\frac{s + | J |}{2}} log n}{n} ⟶ 0 .

Moreover, we have the following exponential bound:

exp \{- \frac{n {(E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}})}^{2}}{2 d^{2} E [K_{α, β}^{2} (X_{i})] + \frac{2}{3} d C_{x, κ} b^{- (s + | J |) / 2} E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}}\} \sim n^{- C K_{2}^{2}}

as

C : = \frac{f (x)}{2 d^{2} ψ (x)} > 0 .

From Equation (67), we obtain:

\begin{matrix} P (max_{θ \in B_{n}} ∥ E_{1 n} ∥ \geq n E (K_{α, β} (X)) K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}) & \leq 2 d | B_{n} | n^{- C K_{2}^{2}} \\ \leq 2 d γ_{1} n^{α d} n^{- C_{1} k_{2}^{2}} = 2 d γ_{1} n^{α d - C_{1} k_{2}^{2}} . \end{matrix}

Choose

K_{2}

, large enough such that

C_{1} K_{2}^{2} > α d

, we can obtain

\sum_{n = 1}^{\infty} P (max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ E_{1 n} ∥ \geq K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}}) < \infty .

Accordingly, by the Borel–Cantelli lemma

max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ E_{1 n} ∥ \leq K_{2} \sqrt{\frac{log n}{n b^{\frac{s + | J |}{2}}}} .

□

Proof of Lemma 5.

Analogous to the proof of Theorem 2.1.2 of [6], for any

h \in R^{d}

, by the definition of

{\hat{Q}}_{n} (u ∣ x)

, it holds that

\sum_{1 \leq i \leq n, Y_{i} \neq {\hat{Q}}_{n} (u ∣ x)} w_{n i} {〈 U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)), h 〉 + 〈 u, h 〉} + \sum_{1 \leq i \leq n, Y_{i} = {\hat{Q}}_{n} (u | x)} w_{n i} {| | h | | + 〈 u, h 〉} \geq 0 .

Because

(X_{i}, Y_{i})

(i = 1, 2, \dots, n)

are absolute continuous random variables,

Y_{i}

(i = 1, 2, \dots, n)

do not equal to each other almost surely. Then, by the property that h is arbitrary in

R^{d}

, (22) holds. □

Proof of lemma 6.

Let

{\tilde{E}}_{i}

and

{\hat{E}}_{n}

be defined by

\forall θ \in B_{n}

\begin{matrix} {\tilde{E}}_{i} : = K_{α, β} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})} a n d {\hat{E}}_{n} : = \sum_{i = 1}^{n} ({\tilde{E}}_{i} - E ({\tilde{E}}_{i})) . \end{matrix}

(75)

It can be shown directly that:

\begin{matrix} E ({\tilde{E}}_{i}) & = E [K_{α, β} (X_{i}) E (I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})} ∣ X = x)] \\ = \int K_{α, β} (t) P (∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β} ∣ X = t) f_{X} (t) dt . \end{matrix}

(76)

Noting the bound in condition A.1, namely that

f (y ∣ t) \leq C_{f}

for some constant

C_{f} > 0

, we obtain let B be the subset of

R^{d}

defined as

B = \{y \in R^{d} ∣ ∥ y - θ - Q (u ∣ x) ∥ \leq n^{- β}\}

\begin{matrix} P (∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β} ∣ X = t) & = \int_{R^{d}} I_{(∥y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})} f_{Y | X} (y | t) dy \\ = \int_{B} f_{Y | X} (y | t) dy \\ \leq C_{f} ω_{d} n^{- d β} \end{matrix}

(77)

where

ω_{d} = π^{d / 2} / Γ (d / 2 + 1) .

From (76) and (77), we have

\begin{matrix} E ({\tilde{E}}_{i}) & \leq C_{f} ω_{d} n^{- d β} \int K_{α, β} (t) f_{X} (t) dt \\ \leq C_{f} ω_{d} n^{- d β} {∥ f_{X} ∥}_{\infty} \int K_{α, β} (t) dt \\ \leq C_{0} n^{- d β}, \end{matrix}

(78)

where

C_{f} ω_{d} {∥ f_{X} ∥}_{\infty} \leq C_{0}

. Also, it can be shown directly that:

\begin{matrix} Var ({\tilde{E}}_{i}) & \leq E (K_{α, β}^{2} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})}) \\ = E (K_{α, β}^{2} (X_{i}) E (I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})} | X_{i} = x)) \\ = \int K_{α, β}^{2} (t) P (∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β} ∣ X = t) f_{X} (t) dt \\ \leq C_{f} r e d ω_{d} n^{- d β} {∥ f_{X} ∥}_{\infty} \int K_{α, β}^{2} (t) dt \\ \leq C_{0} n^{- d β} A_{b} (x) E (f (γ_{x})) . γ_{x} \sim Dirichlet (2 x / b + 1, 2 (1 - {∥ x ∥}_{1}) / b + 1) \end{matrix}

(79)

According to Fact (20), (79), it holds that:

\begin{matrix} P (∥{\hat{E}}_{n}∥ \geq n t E (K_{α, β} (X))) \leq 2 d exp \{- \frac{n {(t E (K_{α, β} (X)))}^{2}}{2 d^{2} C_{0} n^{- d β} A_{b} (x) E (f (γ_{x})) + \frac{2}{3} t d M E (K_{α, β} (X))}\} \end{matrix}

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ , we have $A_{b} (x) E (f (γ_{x})) \sim b^{- s / 2} ψ (x) f (x)$

Set

t : = K_{3} (\frac{log n}{n b^{s / 2}})

, then

\begin{matrix} P (∥{\hat{E}}_{n}∥ \geq n K_{3} (\frac{log n}{n b^{s / 2}}) E (K_{α, β} (X))) \\ \leq 2 d exp \{- \frac{n {(K_{3} (\frac{log n}{n b^{s / 2}}) E (K_{α, β} (X)))}^{2}}{2 d^{2} C_{0} n^{- d β} b^{- s / 2} ψ (x) f (x) + \frac{2}{3} K_{3} (\frac{log n}{n b^{s / 2}}) d M E (K_{α, β} (X))}\} \\ = 2 d exp \{- \frac{n K_{3}^{2} {(\frac{log n}{n})}^{2} f (x)}{2 d^{2} C_{0} n^{- d β} ψ (x) + \frac{2}{3} K_{3} \frac{log n}{n} d M}\} . \end{matrix}

(80)

Since

β \geq γ / d

with

0 < γ < 1

, it follows that

n^{- d β} \leq n^{- γ} \leq \frac{log n}{n} .

Therefore, there exists a constant

C_{2} > 0

such that

\begin{matrix} 2 d^{2} C_{0} n^{- d β} ψ (x) + \frac{2}{3} K_{3} \frac{log n}{n} d M & \leq (\frac{log n}{n}) (2 d^{2} C_{0} + \frac{2}{3} K_{3} d M) \\ \leq C_{2} (\frac{log n}{n}), \end{matrix}

(81)

where

C_{2} \geq 2 d^{2} C_{0} b^{- s} + \frac{2}{3} K_{3} d M f (x)

. From (80) and (81), we obtain that:

\begin{matrix} P (∥{\hat{E}}_{n}∥ \geq n K_{3} (\frac{log n}{n b^{s / 2}}) E (K_{α, β} (X))) & \leq & 2 d exp \{- \frac{n K_{3}^{2} {(\frac{log n}{n})}^{2} f (x)}{C_{2} (\frac{log n}{n})}\} \\ = & 2 d exp \{- \frac{K_{3}^{2} f (x)}{C_{2}} log n\} . \end{matrix}

(82)

By the definition of

θ \in B_{n}

, (67) and condition A.4, it is easy to see that:

\begin{matrix} P (max_{θ \in B_{n}} ∥{\hat{E}}_{n}∥ \geq n K_{3} (\frac{log n}{n b^{- s / 2}}) E (K_{α, β} (X))) & \leq 2 d γ_{1} n^{α d} exp \{- \frac{K_{3}^{2} f (x)}{C_{2}} log n\} \\ = 2 d γ_{1} n^{α d - \frac{K_{3}^{2} f (x)}{C_{2}}} \\ = 2 d γ_{1} n^{α d - C_{3}}, \end{matrix}

where

C_{3} : = \frac{K_{3}^{2} f (x)}{C_{2}}

. Choose

C_{3}

, large enough such that

C_{3} > α d

, we can obtain

\sum_{n = 1}^{\infty} P (max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ \geq K_{3} (\frac{log n}{n b^{s / 2}})) < \infty .

Accordingly, by the Borel–Cantelli lemma

\begin{matrix} max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ \leq K_{3} (\frac{log n}{n b^{s / 2}}) . \end{matrix}

(83)

From (75), (78), (83) and condition A.4, it can be shown that:

\begin{matrix} max_{θ \in B_{n}} |\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})}| \\ ⩽ max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ + max_{θ \in B_{n}} \frac{∥ E ({\tilde{E}}_{i}) ∥}{E (K_{α, β} (X))} \\ ⩽ K_{3} (\frac{log n}{b^{s / 2}}) + \frac{C_{0} n^{- d β}}{{min}_{x \in S_{d, 1}} f (x)} \\ ⩽ K_{3} (\frac{log n}{n b^{s / 2}}) + C_{f}^{'} n^{- γ} \\ ⩽ K_{3} (\frac{log n}{n b^{s / 2}}) + C_{f}^{'} (\frac{log n}{n}) \\ ⩽ (\frac{log n}{n b^{s / 2}}) (K_{3} + C_{f}^{'} b^{s / 2}) ⩽ K_{4} (\frac{log n}{n b^{s / 2}}), \end{matrix}

where

K_{4} \geq K_{3} + C_{f}^{'} b^{s / 2}

if $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ , we have

A_{b} (x) E (f (γ_{x})) \sim b^{- (s + | J |) / 2} ψ_{J} (x) f (x) \prod_{i \in J} \frac{Γ (2 κ_{i} + 1)}{2^{2 κ_{i} + 1} Γ (κ_{i} + 1)}

\begin{matrix} P (max_{θ \in B_{n}} ∥{\hat{E}}_{n}∥ \geq n K_{3} (\frac{log n}{n}) E (K_{α, β} (X))) & \leq 2 d γ_{1} n^{α d} exp \{- \frac{K_{3}^{2} f (x)}{C_{2} b^{\frac{s + | J |}{2}}} log n\} \\ = 2 d γ_{1} n^{α d - \frac{K_{3}^{2} f (x)}{C_{2}} b^{- \frac{s + | J |}{2}}} = 2 d γ_{1} n^{α d - C_{4}}, \end{matrix}

where

C_{4} : = \frac{K_{3}^{2} f (x)}{C_{2}} b^{- \frac{s + | J |}{2}} .

Choose $C_{4}$ , large enough such that $C_{4} > α d$ we can obtain

\sum_{n = 1}^{\infty} P (max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ \geq K_{3} (\frac{log n}{n b^{(s + | J |) / 2}})) < \infty .

Accordingly, by the Borel–Cantelli lemma

\begin{matrix} max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ \leq K_{3} (\frac{log n}{n b^{(s + | J |) / 2}}) \end{matrix}

(84)

From (75), (78), (84) and condition A.4, it can be shown that:

\begin{matrix} max_{θ \in B_{n}} |\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{(∥Y_{i} - θ - Q (u ∣ x)∥ ⩽ n^{- β})}| \\ ⩽ max_{θ \in B_{n}} \frac{1}{n E (K_{α, β} (X))} ∥ {\hat{E}}_{n} ∥ + max_{θ \in B_{n}} \frac{∥ E ({\tilde{E}}_{i}) ∥}{E (K_{α, β} (X))} \\ ⩽ K_{3} (\frac{log n}{n b^{(s + | J |) / 2}}) + \frac{C_{0} n^{- d β}}{{min}_{x \in S_{s, 1}} f (x)} \\ ⩽ K_{3} (\frac{log n}{n b^{(s + | J |) / 2}}) + C_{f}^{'} n^{- γ} \\ ⩽ K_{3} (\frac{log n}{n b^{(s + | J |) / 2}}) + C_{f}^{'} (\frac{log n}{n}) \\ ⩽ (\frac{log n}{n b^{(s + | J |) / 2}}) (K_{3} + C_{f}^{'} b^{(s + | J |) / 2}) \\ ⩽ K_{5} (\frac{log n}{n b^{(s + | J |) / 2}}), \end{matrix}

where

K_{5} ⩾ K_{3} + C_{f}^{'} b^{(s + | J |) / 2}

. □

Proof of Lemma 7.

\begin{matrix} \frac{\sum_{i = 1}^{n} K_{α, β} (X_{i})}{n E (K_{α, β} (X))} - 1 = \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E (K_{α, β} (X_{i}))] \end{matrix}

According to Fact (20) and (68), for some constant

C = C (x)

, it holds that

\begin{matrix} P (|\sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E K_{α, β} (X_{i})]| \geq n t E (K_{α, β} (X))) \\ \leq 2 d exp \{- \frac{n {(t E (K_{α, β} (X)))}^{2}}{2 d^{2} E (K_{α, β}^{2} (X)) + \frac{2}{3} d \cdot M \cdot t E (K_{α, β} (X))}\} . \end{matrix}

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ , where

γ_{x} \sim Dirichlet (2 x / b + 1, 2 (1 - {∥ x ∥}_{1}) / b + 1),

we have

E (K_{α, β}^{2} (X)) = A_{b} (x) E [f (γ_{x})] \sim b^{- s / 2} ψ (x) f (x) .

Set

t : = K_{6} \sqrt{\frac{log n}{n b^{s / 2}}}

,

\begin{matrix} P (|\sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E K_{α, β} (X_{i})]| \geq n K_{6}^{2} \sqrt{\frac{log n}{n b^{s / 2}}} E (K_{α, β} (X))) \\ \leq 2 d exp \{- \frac{n {(K_{6}^{2} \sqrt{\frac{log n}{n b^{s / 2}}} E (K_{α, β} (X)))}^{2}}{2 d^{2} E (K_{α, β}^{2} (X)) + \frac{2}{3} d M K_{6} \sqrt{\frac{log n}{n b^{s / 2}}} E (K_{α, β} (X))}\} \\ = 2 d exp \{- \frac{K_{6}^{2} log n f (x)}{2 d^{2} ψ (x) + \frac{2}{3} d M K_{6} b^{s / 4} \sqrt{\frac{log n}{n}}}\} . \end{matrix}

(85)

Since as

n \to \infty

,

b^{s / 4} \sqrt{\frac{log n}{n}} \to 0

, there is an integer

N

, such that for all

n \geq N

,

\sqrt{\frac{log n}{n}} < 1

. Therefore, there exists a constant

C^{″} > 0

, such that:

\begin{matrix} 2 d^{2} ψ (x) + \frac{2}{3} d M K_{6} b^{s / 4} \sqrt{\frac{log n}{n}} ⩽ ψ (x) (2 d^{2} + \frac{2}{3} d M K_{6} b^{s / 4}) = C^{″} ψ (x), \end{matrix}

(86)

where

C^{″} : = 2 d^{2} + \frac{2}{3} d M K_{6} b^{s / 4}

. Substituting (86) into (85), we obtain that:

\begin{matrix} P (|\sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E K_{α, β} (X_{i})]| \geq n K_{6} \sqrt{\frac{log n}{n b^{s / 2}}} E (K_{α, β} (X))) & ⩽ & 2 d exp \{- \frac{K_{6}^{2} log n f (x)}{C^{″} ψ (x)}\} \\ = & 2 d n^{- \frac{K_{6}^{2} f (x)}{ψ (x) C^{″}}} \\ ⩽ & 2 d n^{- C_{1} K_{6}^{2}} \end{matrix}

where

C_{1} : = \frac{f (x)}{ψ (x) C^{″}}

Then

\sum_{i = 1}^{\infty} P (|\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E K_{α, β} (X_{i})]| \geq K_{6} \sqrt{\frac{log n}{n b^{s / 2}}}) < \infty .

Accordingly, by the Borel–Cantelli

\begin{matrix} \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} |K_{α, β} (X_{i}) - E K_{α, β} (X)| \leq K_{6} \sqrt{\frac{log n}{n b^{s / 2}}} . \end{matrix}

if $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

Similarly, from the above we conclude that:

\sum_{i = 1}^{\infty} P (|\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} [K_{α, β} (X_{i}) - E K_{α, β} (X_{i})]| \geq K_{6} \sqrt{\frac{log n}{n b^{(s + | J |) / 2}}}) < \infty .

Accordingly, by the Borel–Cantelli lemma

\begin{matrix} \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} |K_{α, β} (X_{i}) - E (K_{α, β} (X))| \leq K_{6} \sqrt{\frac{log n}{n b^{(s + | J |) / 2}}} . \end{matrix}

□

Proof of Lemma 8.

Because of the bound of

K (\cdot)

, (see Lemma 3), assume that

θ_{n}^{*}

is the nearest point to

{\hat{Q}}_{n} (u ∣ x)

in

B_{n}

. Following the same lines as in [6], in the case of

\begin{matrix} ∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥ > n^{- β} because ∥ {\hat{Q}}_{n} (u ∣ x) - θ_{n}^{*} - Q (u | x) ∥ \leq γ_{3} n^{- α} . \end{matrix}

For some constant

γ_{3} > 0

, it holds that

\begin{matrix} ∥U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) - U (Y_{i} - θ_{n}^{*} - Q (u | x))∥ \\ = ∥\frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥} - \frac{Y_{i} - θ_{n}^{*} - Q (u | x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥}∥ \\ = ∥\frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥} - \frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥} + \frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥} - \frac{Y_{i} - θ_{n}^{*} - Q (u | x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥}∥ \\ \leq ∥\frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥} - \frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥}∥ + ∥\frac{Y_{i} - {\hat{Q}}_{n} (u ∣ x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥} - \frac{Y_{i} - θ_{n}^{*} - Q (u | x)}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥}∥ \\ \leq ∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥ ∥\frac{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥ - ∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥}{∥ Y_{i} - {\hat{Q}}_{n} (u ∣ x) ∥ ∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥}∥ \\ + \frac{1}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥} ∥{\hat{Q}}_{n} (u ∣ x) - θ_{n}^{*} - Q (u | x)∥ \\ \leq \frac{2}{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥} ∥{\hat{Q}}_{n} (u ∣ x) - θ_{n}^{*} - Q (u | x)∥ \leq 2 γ_{3} n^{β - α} . \end{matrix}

(87)

Write

\begin{matrix} \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y_{i} - θ_{n}^{*} - Q (u | x)) + u) & = L_{n 1} + L_{n 2} \\ = L_{n 1}^{(T)} + L_{n 1}^{(R)} + L_{n 2} \end{matrix}

where

\begin{matrix} L_{n 2} : = & \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) + u) \\ L_{n 1} : = & \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) [U (Y_{i} - θ_{n}^{*} - Q (u | x)) - U (Y_{i} - {\hat{Q}}_{n} (u ∣ x))] \\ = & L_{n 1}^{(T)} + L_{n 1}^{(R)} \end{matrix}

Such that

L_{n 1}^{(T)}

and

L_{n 1}^{(R)}

are defined respectively by

\begin{matrix} L_{n 1}^{(T)} : = & \frac{1}{n E (K_{α, β} ((X)))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) [U (Y_{i} - θ_{n}^{*} - Q (u | x)) - U (Y_{i} - {\hat{Q}}_{n} (u ∣ x))] I_{{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥ \leq n^{- β}}} \\ L_{n 1}^{(R)} : = & \frac{1}{n E (K_{α, β} ((X)))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) [U (Y_{i} - θ_{n}^{*} - Q (u | x)) - U (Y_{i} - {\hat{Q}}_{n} (u ∣ x))] I_{{∥ Y_{i} - θ_{n}^{*} - Q (u | x) ∥ > n^{- β}}} \end{matrix}

First, we establish that

L_{n 1}^{(T)}

\begin{matrix} ∥ L_{n 1}^{(T)} ∥ & ⩽ \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) ∥U (Y Y_{i} - θ_{n}^{*} - Q (u ∣ x)) - U (Y_{i} - {\hat{Q}}_{n} (u ∣ x))∥ I_{{∥ Y_{i} - θ_{n}^{*} - Q (u ∣ x) ∥ \leq n^{- β}}} \\ ⩽ \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) \{\underset{1}{\underset{︸}{∥ U (Y_{i} - θ_{n}^{*} - Q (u ∣ x)) ∥}} + \underset{1}{\underset{︸}{∥ U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) ∥}}\} I_{{∥ Y_{i} - θ_{n}^{*} - Q (u ∣ x) ∥ \leq n^{- β}}} \\ = \frac{2}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) I_{{∥ Y_{i} - θ_{n}^{*} - Q (u ∣ x) ∥ \leq n^{- β}}} . \end{matrix}

From Lemma 6, we obtain that

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥ L_{n 1}^{(T)} ∥ ⩽ 2 K_{4} (\frac{log n}{n b^{s / 2}}) \end{matrix}

(88)

If $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to \infty \forall i \in [s] ∖ J, and \frac{1 - {∥ x ∥}_{1}}{b} \to \infty$ ,

\begin{matrix} ∥ L_{n 1}^{(T)} ∥ ⩽ 2 K_{5} (\frac{log n}{n b^{(s + | J |) / 2}}) \end{matrix}

(89)

Next, using (87) and Lemma 3, we establish that

L_{n 1}^{(R)}

\begin{matrix} ∥ L_{n 1}^{(R)} ∥ & ⩽ \frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) ∥U (Y_{i} - θ_{n}^{*} - Q (u ∣ x)) - U (Y_{i} - {\hat{Q}}_{n} (u ∣ x))∥ I_{{∥ Y_{i} - θ_{n}^{*} - Q (u ∣ x) ∥ > n^{- β}}} \\ ⩽ \frac{2 γ_{3} n^{β - α}}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) \\ ⩽ \frac{2 γ_{3} n^{β - α}}{E (K_{α, β} (X))} sup_{x \in S_{s, 1}} K_{α, β} (x) \\ ⩽ \frac{2 γ_{3} n^{β - α}}{E (K_{α, β} (X))} \sqrt{\frac{{∥ α ∥}_{1} + β - 1}{(β - 1) \prod_{i \in [s]} (α_{i} - 1)}} {(∥ α ∥}_{1} {+ β - s - 1)}^{s} \end{matrix}

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥ L_{n 1}^{(R)} ∥ & ⩽ \frac{2 γ_{3} n^{β - α}}{E (K_{α, β} (X))} b^{- s / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in [s]} x_{i}}} \\ ⩽ C_{1} b^{- s / 2} \sqrt{log n / n} \end{matrix}

(90)

where

C_{1} > \frac{2 γ_{3}}{E (K_{α, β} (X))} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in [s]} x_{i}}}

If $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥ L_{n 1}^{(R)} ∥ & ⩽ \frac{2 γ_{3} n^{β - α}}{E (K_{α, β} (X))} b^{- s} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i} / b}} \\ ⩽ \frac{2 γ_{3} n^{β - α}}{E (K_{α, β} (X))} b^{- s + (s - | J |) / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i}}} \\ ⩽ C_{1} b^{- (s + | j |) / 2} \sqrt{log n / n} \end{matrix}

(91)

where

C_{1} > \frac{2 γ_{3}}{E (K_{α, β} (X))} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i}}}

Finally, we establish that

L_{n 2}

\begin{matrix} ∥ L_{n 2} ∥ & ⩽ \frac{1}{n E (K_{α, β} ((X)))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (∥ U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) ∥ + ∥ u ∥) \\ ⩽ \frac{2}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) \\ ⩽ \frac{2}{E (K_{α, β} (X))} sup_{x \in S_{s, 1}} K_{α, β} (x) \\ ⩽ \frac{2}{E (K_{α, β} (X))} \sqrt{\frac{{∥ α ∥}_{1} + β - 1}{(β - 1) \prod_{i \in [s]} (α_{i} - 1)}} {(∥ α ∥}_{1} {+ β - s - 1)}^{s} \end{matrix}

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥ L_{n 2} ∥ & ⩽ \frac{2}{E (K_{α, β} (X))} b^{- s / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in [s]} x_{i}}} \\ ⩽ C_{2} b^{- s / 2} \end{matrix}

(92)

where

C_{2} > \frac{2}{E (K_{α, β} (X))} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in [s]} x_{i}}}

If $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥ L_{n 1}^{(R)} ∥ & ⩽ \frac{2}{E (K_{(α), β} (X))} b^{- s} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i} / b}} \\ ⩽ \frac{2}{E (K_{α, β} (X))} b^{- s + (s - | J |) / 2} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i}}} \\ ⩽ C_{1} b^{- (s + | j |) / 2} \sqrt{log n / n} \end{matrix}

(93)

where

C_{1} > \frac{2}{E (K_{α, β} (X))} \sqrt{\frac{1 + s b}{{(1 - ∥ x ∥}_{1}) \prod_{i \in J} κ_{i} \prod_{i \in [s] ∖ J} x_{i}}}

From (88), (90) and (92), for all sufficiently large

n > 0

, the following result can be derived almost surely

If $x_{i} / b \to \infty$ for all $i \in [s]$ and $(1 - {∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y_{i} - θ_{n}^{*} - Q (u ∣ x)) + u)∥ \\ ⩽ ∥ L_{n 1}^{(T)} ∥ + ∥ L_{n 1}^{(R)} ∥ + ∥ L_{n 2} ∥ \\ ⩽ 2 K_{4} (\frac{log n}{n b^{s / 2}}) + C_{1} b^{- s / 2} \sqrt{log n / n} + C_{2} b^{- s / 2} \\ ⩽ C (\frac{log n}{n b^{s / 2}}) \end{matrix}

(94)

From (89), (91) and (93), for all sufficiently large

n > 0

, the following result can be derived almost surely

if $x_{i} / b \to κ_{i} \forall i \in J, x_{i} / b \to {\infty \forall i \in [s] ∖ J, and (1 - ∥ x ∥}_{1}) / b \to \infty$ ,

\begin{matrix} ∥\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y_{i} - θ_{n}^{*} - Q (u ∣ x)) + u)∥ \\ ⩽ ∥ L_{n 1}^{(T)} ∥ + ∥ L_{n 1}^{(R)} ∥ + ∥ L_{n 2} ∥ \\ ⩽ 2 K_{5} (\frac{log n}{n b^{(s + | J |) / 2}}) + C_{1} b^{- (s + | j |) / 2} \sqrt{log n / n} + C_{1} b^{- (s + | j |) / 2} \sqrt{log n / n} \\ ⩽ C_{2} (\frac{log n}{n b^{(s + | J |) / 2}}) \end{matrix}

(95)

for all n sufficiently large. We now begin to prove that the following asymptotic relationship

\begin{matrix} \frac{1}{E (K_{α, β} (X))} ∥E [K_{α, β} (X) (U (Y - θ - Q (u ∣ x)) + u)]∥ \\ \sim ∥E [U (Y - θ - Q (u ∣ x)) + u | X = x]∥ . \end{matrix}

(96)

The inequality holds uniformly for

θ \in B_{n}

and

∥ θ ∥ \geq t K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}

, where

t > 0

is a constant to be determined later. For such values of n, by applying Equation (68), performing a variable substitution, utilizing condition A.2, considering the bounded support of

K (\cdot)

, and leveraging limit properties, we deduce that the left-hand side of Equation (96) is equivalent to the following expressions:

\begin{matrix} \frac{1}{E (K_{α, β} (X))} ∥E [K_{α, β} (X) (U (Y - θ - Q (u ∣ x)) + u)]∥ \\ = \frac{1}{E (K_{α, β} (X))} ∥\int K_{α, β} (z) E [U (Y - θ - Q (u ∣ x)) + u) | X = z] f (z) dz∥ \\ \leq \frac{1}{E (K_{α, β} (X))} \int K_{α, β} (z) ∥E [U (Y - θ - Q (u ∣ x)) + u) | X = z] f (z)∥ dz . \end{matrix}

(97)

Furthermore, by Taylor’s expansion, condition A.2, it can be seen that

\begin{matrix} E [U (Y - θ - Q (u ∣ x)) + u ∣ X = ξ_{x}] f (ξ_{x}) \sim E [U (Y - θ - Q (u ∣ x)) + u ∣ X = x] f (x) + O (b) . \end{matrix}

(98)

Similar to the proof of Lemma 5.3 of [6], from the definition of the uth geometric conditional quantile

Q (u ∣ x)

, it is not difficult to show that

\begin{matrix} E [U (Y - Q (u ∣ x)) + u ∣ X = x] = 0 . \end{matrix}

(99)

Then, by Taylor’s expansion, Lemma 5.3 of [6] and the equation above, it holds that

\begin{matrix} \frac{b}{∥\int (U (y - θ_{n} - Q (u ∣ x)) + u) f (y | x) dy∥} = \frac{b}{∥ D_{1} θ_{n} ∥ + o (∥ θ_{n} ∥)} \leq \frac{C b}{∥ θ_{n} ∥} \leq C \sqrt{\frac{n b^{s / 2 + 2}}{log n}} \to 0 . \end{matrix}

(100)

where the first inequality results from the positive definite matrix

D_{1}

, and the second from the definition of n. Noting that (98), (97), and (100), we conclude that (96) holds. Under conditions A.1 and A.2, by a slight adjustment of the proof of Lemma 5.3 in [6], the corresponding results hold analogously for the conditional expectation relating to the variable on the right of the equivalent relationships (96). Hence, for all n sufficiently large, there exists some constant

q > 0

such that

\frac{∥E [K_{α, β} (X) (U (Y - Q (u ∣ x)) + u)]∥}{E (K_{α, β} (X))} \geq q t K_{2} \sqrt{\frac{log n}{n b^{s / 2}}} .

holds for all

θ \in B_{n} and ∥ θ ∥ > K_{2} \sqrt{\frac{log n}{n b^{s / 2}}}

where

t > 0

chosen later. Combining this with (23) yields

min_{\begin{matrix} θ \in B_{β} : \\ ∥ θ ∥ > t K_{2} \sqrt{\frac{log n}{n b^{s / 2}}} \end{matrix}} ∥\frac{1}{n E (K_{α, β} (X))} \sum_{i = 1}^{n} K_{α, β} (X_{i}) (U (Y - Q (u ∣ x)) + u)∥ \geq (q t - 1) K_{2} \sqrt{\frac{log n}{n b^{s / 2}}} .

(101)

By choosing

t

such that

q t > 1

and taking

K_{2}

suitably large, Lemma 8 follows from (101), (94), (25), and the triangle inequality. According to Lemma 8, in the sequel we redefine

B_{n}

under the further restriction that the norm of each element in it is less than

K_{4} \sqrt{\frac{log n}{n b^{s / 2}}} .

For simplicity, we introduce the notation

Λ_{n} (θ)

as

\begin{matrix} Λ_{n} (θ) = \frac{1}{n E (K_{α, β} (X))} {\sum_{i = 1}^{n} K_{α, β} (X_{i}) [U (Y_{i} - Q (u ∣ x)) - U (Y_{i} - θ - Q (u ∣ x))] \\ - n E [K_{α, β} (X_{i}) [U (Y_{i} - Q (u ∣ x)) - U (Y_{i} - θ - Q (u ∣ x)]]} . \end{matrix}

The following lemma addresses the convergence rate of

Λ_{n} (θ)

, which will be applied to prove Theorem 1. □

Proof of Lemma 9.

Referring to Equation (2), let us assume that for every

θ \in R^{d}

, every

Y \in R^{d}

, and every

x \in S_{s, 1}

, the condition stated below is satisfied:

\begin{matrix} L_{n} (θ, x) = \sum_{i = 1}^{n} w_{n i} {Φ (u, Y_{i} - θ) - Φ (u, Y_{i})} = \sum_{i = 1}^{n} w_{n i} \{∥ Y_{i} - θ ∥ - ∥ Y_{i} ∥ - 〈 u, θ 〉\} . \end{matrix}

(102)

In this stage, we begin the procedure by concentrating on the function’s first derivative with respect to

θ

for

L_{n} (θ, x)

.

\begin{matrix} \frac{\partial}{\partial θ} L_{n} (θ, x) = \sum_{i = 1}^{n} w_{n i} \{\frac{- (Y_{i} - θ)}{∥ Y_{i} - θ ∥} - u\} & = - \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - θ) + u) . \end{matrix}

(103)

Under assumption A.3, we find the first derivative of the function

r (θ, x)

given in Equation (5):

\begin{matrix} \frac{\partial}{\partial θ} (r (θ, x)) & = E \{\frac{1}{∥ Y - Q (u ∣ x) - θ ∥} (I_{d} - U (Y - Q (u ∣ x) - θ) U^{⊤} (Y - Q (u ∣ x) - θ)) | X = x\} \\ = E (B (Y - Q (u ∣ x) - θ) | X = x) . \end{matrix}

Then, we have

\begin{matrix} \frac{\partial}{\partial θ} r (θ, x) |_{θ = 0} = E (B (Y - Q (u ∣ x)) | X = x) = \frac{E (K_{α, β} (x) B (Y - Q (u ∣ x)))}{E (K_{α, β} (x))} : = D_{1} . \end{matrix}

(104)

From definition in (2), we have

\begin{matrix} \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - {\hat{Q}}_{n} (u ∣ x)) + u) = 0 . \end{matrix}

(105)

The first-order Taylor expansion of the function

[U (\cdot) + u]

around

Q (u ∣ x) \in R^{d}

is given by

\begin{matrix} U (Y - {\hat{Q}}_{n} (u ∣ x)) + u \\ = (U (Y - Q (u ∣ x)) + u) + \nabla_{Q} {(U (Y - Q (u ∣ x)) + u)}^{⊤} ({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)) \\ + o {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2} . \end{matrix}

(106)

Then, from Equations (105) and (106), we obtain that

\begin{matrix} 0 & = & \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) \\ + \sum_{i = 1}^{n} w_{n i} \nabla_{Q} {(U (Y_{i} - Q (u ∣ x)) + u)}^{⊤} ({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)) \\ + o (\sum_{i = 1}^{n} (w_{n i} {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2})) . \end{matrix}

Then, we obtain

\begin{matrix} ({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x)) \sum_{i = 1}^{n} w_{n i} B (Y_{i} - Q (u ∣ x)) \\ = \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) + o (\sum_{i = 1}^{n} w_{n i} {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2}), \end{matrix}

where

D_{1 n} : = \sum_{i = 1}^{n} w_{n i} B (Y_{i} - Q (u ∣ x)) \to_{n \to \infty}^{P} \frac{E (K_{α, β} (x) B (Y - Q (u ∣ x)))}{E (K_{α, β} (x))} : = D_{1} .

Under assumption (A.5), we have

\begin{matrix} {\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) & = D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) + R_{n} . \end{matrix}

Then the representation of Bahadur for

{\hat{Q}}_{n} (u ∣ x)

is defined as

{\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) = D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} (U (Y_{i} - Q (u ∣ x)) + u) + R_{n},

(107)

with probability one, where

\begin{matrix} D_{1} & = & \frac{E (K_{α, β} (X) B (Y - Q (u ∣ x)))}{E (K_{α, β} (x))}, \\ R_{n} & = & o (\frac{log n}{n b^{s / 2}}) . \end{matrix}

Indeed

\begin{matrix} R_{n} : = o (D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2}) \end{matrix}

We have

\begin{matrix} ∥D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2}∥ & ⩽ C_{D_{1}} {∥ {\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x) ∥}^{2} \sum_{i = 1}^{n} w_{n i} \end{matrix}

From Lemma 8 and the fact that

\sum_{i = 1}^{n} w_{n i} = 1

, we obtain

\begin{matrix} ∥D_{1}^{- 1} \sum_{i = 1}^{n} w_{n i} {({\hat{Q}}_{n} (u ∣ x) - Q (u ∣ x))}^{2}∥ & ⩽ C_{D_{1}} {(K_{7} {(\frac{log n}{n b^{s / 2}})}^{1 / 2})}^{2} \\ = C_{D_{1}} K_{7}^{2} (\frac{log n}{n b^{s / 2}}) \\ = C_{D_{1}, K} (\frac{log n}{n b^{s / 2}}) \\ = o (\frac{log n}{n b^{s / 2}}), \end{matrix}

where

C_{D_{1}, K} : = C_{D_{1}} K_{7}^{2}

□

Author Contributions

Formal analysis, A.A. and S.B.; Validation, A.A., S.B. and S.K.; Writing—review and editing, A.A., S.B. and S.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors wish to express their sincere gratitude to the Editor-in-Chief, the Associate Editor, and the three referees for their valuable comments and careful reading of the manuscript. The insightful suggestions provided have substantially improved the quality, clarity, and focus of the paper.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Serfling, R. Quantile functions for multivariate analysis: Approaches and applications. Statist. Neerl. 2002, 56, 214–232. [Google Scholar] [CrossRef]
Chakraborty, B. On affine equivariant multivariate quantiles. Ann. Inst. Statist. Math. 2001, 53, 380–403. [Google Scholar] [CrossRef]
Chakraborty, B. On multivariate quantile regression. J. Statist. Plan. Inference 2003, 110, 109–132. [Google Scholar] [CrossRef]
De Gooijer, J.G.; Gannoun, A.; Zerom, D. A multivariate quantile predictor. Comm. Statist. Theory Methods 2006, 35, 133–147. [Google Scholar] [CrossRef]
Abdous, B.; Theodorescu, R. Note on the spatial quantile of a random vector. Statist. Probab. Lett. 1992, 13, 333–336. [Google Scholar] [CrossRef]
Chaudhuri, P. Multivariate location estimation using extension of R-estimates through U-statistics type approach. Ann. Statist. 1992, 20, 897–916. [Google Scholar] [CrossRef]
Chaudhuri, P. On a geometric notion of quantiles for multivariate data. J. Amer. Statist. Assoc. 1996, 91, 862–872. [Google Scholar] [CrossRef]
Alvarez-Andrade, S.; Bouzebda, S. Some nonparametric tests for change-point detection based on the ℙ-ℙ and ℚ-ℚ plot processes. Seq. Anal. 2014, 33, 360–399. [Google Scholar] [CrossRef]
Alvarez-Andrade, S.; Bouzebda, S. Strong approximations for weighted bootstrap of empirical and quantile processes with applications. Stat. Methodol. 2013, 11, 36–52. [Google Scholar] [CrossRef]
Taachouche, N.; Bouzebda, S. Multivariate spatial conditional quantiles on hyperspheres in the presence of measurement error. Results Appl. Math. 2026, 29, 100690. [Google Scholar] [CrossRef]
Bouzebda, S.; Taachouche, N. Multivariate spatial conditional U-quantiles: A Bahadur-Kiefer representation. Results Appl. Math. 2025, 26, 100593. [Google Scholar] [CrossRef]
Belhas, H.; Mohammedi, M.; Bouzebda, S. Asymptotic Theory for Multivariate Nonparametric Quantile Regression with Stationary Ergodic Functional Covariates and Missing-at-Random Responses. Symmetry 2026, 18, 445. [Google Scholar] [CrossRef]
Wertz, W. Statistical density estimation: A survey. In Angewandte Statistik und Ökonometrie [Applied Statistics and Econometrics]; With German and French summaries; Vandenhoeck & Ruprecht: Göttingen, Germany, 1978; Volume 13, p. 108. [Google Scholar]
Wand, M.P.; Jones, M.C. Kernel smoothing. In Monographs on Statistics and Applied Probability; Chapman and Hall, Ltd.: London, UK, 1995; Volume 60, p. xii+212. [Google Scholar] [CrossRef]
Tapia, R.A.; Thompson, J.R. Nonparametric Probability Density Estimation; Johns Hopkins Series in the Mathematical Sciences; Johns Hopkins University Press: Baltimore, MD, USA, 1978; Volume 1, p. xi+176. [Google Scholar]
Prakasa Rao, B.L.S. Nonparametric functional estimation. In Probability and Mathematical Statistics; Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers]: New York, NY, USA, 1983; p. xiv+522. [Google Scholar]
Roussas, G.G. Estimation of transition distribution function and its quantiles in Markov processes: Strong consistency and asymptotic normality. In Nonparametric Functional Estimation and Related Topics (Spetses, 1990); Kluwer Academic Publishers: Dordrecht, The Netherlands, 1991; Volume 335, pp. 443–462. [Google Scholar]
Nadaraya, E.A. Nonparametric estimation of probability densities and regression curves. In Mathematics and Its Applications (Soviet Series); Kluwer Academic Publishers Group: Dordrecht, The Netherlands, 1989; Volume 20, p. x+213. [Google Scholar] [CrossRef]
Müller, H.G. Nonparametric Regression Analysis of Longitudinal Data; Lecture Notes in Statistics; Springer: Berlin, Germany, 1988; Volume 46, p. vi+199. [Google Scholar] [CrossRef]
Härdle, W. Applied nonparametric regression. In Econometric Society Monographs; Cambridge University Press: Cambridge, UK, 1990; Volume 19, p. xvi+333. [Google Scholar] [CrossRef]
Eggermont, P.P.B.; LaRiccia, V.N. Maximum Penalized Likelihood Estimation; Springer Series in Statistics; Density Estimation; Springer: New York, NY, USA, 2001; Volume I, p. xviii+510. [Google Scholar]
Devroye, L. A course in density estimation. In Progress in Probability and Statistics; Birkhäuser Boston, Inc.: Boston, MA, USA, 1987; Volume 14, p. xx+183. [Google Scholar]
Scott, D.W. Multivariate Density Estimation; Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics; Theory, Practice, and Visualization, A Wiley-Interscience Publication; John Wiley & Sons, Inc.: New York, NY, USA, 1992; p. xiv+317. [Google Scholar] [CrossRef]
Silverman, B.W. Density estimation for statistics and data analysis. In Monographs on Statistics and Applied Probability; Chapman & Hall: London, UK, 1986; p. x+175. [Google Scholar] [CrossRef]
Müller, H.G. Smooth optimum kernel estimators near endpoints. Biometrika 1991, 78, 521–530. [Google Scholar] [CrossRef]
Jones, M.C. Variable kernel density estimates and variable kernel density estimates. Austral. J. Statist. 1990, 32, 361–371, Corrigendum in Austral. J. Statist., 1991, 33, 119. [Google Scholar] [CrossRef]
Funke, B.; Hirukawa, M. Density derivative estimation using asymmetric kernels. J. Nonparametr. Stat. 2024, 36, 994–1017. [Google Scholar] [CrossRef]
Chen, S.X. Beta kernel estimators for density functions. Comput. Statist. Data Anal. 1999, 31, 131–145. [Google Scholar] [CrossRef]
Chen, S.X. Beta kernel smoothers for regression curves. Statist. Sin. 2000, 10, 73–91. [Google Scholar]
Bouezmarni, T.; Rolin, J.M. Consistency of the beta kernel density function estimator. Canad. J. Statist. 2003, 31, 89–98. [Google Scholar] [CrossRef]
Zhang, S.; Karunamuni, R.J. Boundary performance of the beta kernel estimators. J. Nonparametr. Stat. 2010, 22, 81–104. [Google Scholar] [CrossRef]
Bertin, K.; Klutchnikoff, N. Minimax properties of beta kernel estimators. J. Statist. Plann. Inference 2011, 141, 2287–2297. [Google Scholar] [CrossRef]
Igarashi, G. Bias reductions for beta kernel estimation. J. Nonparametr. Stat. 2016, 28, 1–30. [Google Scholar] [CrossRef]
Hirukawa, M. Asymmetric kernel smoothing. In SpringerBriefs in Statistics; Theory and Applications in Economics and Finance, JSS Research Series in Statistics; Springer: Singapore, 2018; p. xii+110. [Google Scholar] [CrossRef]
Kristensen, D. Uniform convergence rates of kernel estimators with heterogeneous dependent data. Econom. Theory 2009, 25, 1433–1445. [Google Scholar] [CrossRef]
Yin, X.F.; Hao, Z.F. Adaptive Kernel Density Estimation using Beta Kernel. In Proceedings of the 2007 International Conference on Machine Learning and Cybernetics, Hong Kong, China, 19–22 August 2007; Volume 6, pp. 3293–3297. [Google Scholar] [CrossRef]
Igarashi, G.; Kakizawa, Y. Limiting bias-reduced Amoroso kernel density estimators for non-negative data. Comm. Statist. Theory Methods 2018, 47, 4905–4937. [Google Scholar] [CrossRef]
Charpentier, A.; Oulidi, A. Beta kernel quantile estimators of heavy-tailed loss distributions. Stat. Comput. 2010, 20, 35–55. [Google Scholar] [CrossRef]
Vitale, R.A. Bernstein polynomial approach to density function estimation. In Statistical Inference and Related Topics, Proceedings of the Summer Research Institute on Statistical Inference for Stochastic Processes, Bloomington, IN, USA, 31 July–9 August 1975; Dedicated to Z.W. Birnbaum; Academic Press: New York, NY, USA; London, UK, 1975; Volume 2, pp. 87–99. [Google Scholar]
Stadmüller, U. Asymptotic distributions of smoothed histograms. Metrika 1983, 30, 145–158. [Google Scholar] [CrossRef]
Gawronski, W. Strong laws for density estimators of Bernstein type. Period. Math. Hungar. 1985, 16, 23–43. [Google Scholar] [CrossRef]
Tenbusch, A. Two-dimensional Bernstein polynomial density estimators. Metrika 1994, 41, 233–253. [Google Scholar] [CrossRef]
Tenbusch, A. Nonparametric curve estimation with Bernstein estimates. Metrika 1997, 45, 1–30. [Google Scholar] [CrossRef]
Babu, G.J.; Canty, A.J.; Chaubey, Y.P. Application of Bernstein polynomials for smooth estimation of a distribution and density function. J. Statist. Plann. Inference 2002, 105, 377–392. [Google Scholar] [CrossRef]
Kakizawa, Y. Bernstein polynomial probability density estimation. J. Nonparametr. Stat. 2004, 16, 709–729. [Google Scholar] [CrossRef]
Prakasa Rao, B.L.S. Estimation of distribution and density functions by generalized Bernstein polynomials. Indian J. Pure Appl. Math. 2005, 36, 63–88. [Google Scholar]
Babu, G.J.; Chaubey, Y.P. Smooth estimation of a distribution and density function on a hypercube using Bernstein polynomials for dependent random vectors. Statist. Probab. Lett. 2006, 76, 959–969. [Google Scholar] [CrossRef]
Leblanc, A. On estimating distribution functions using Bernstein polynomials. Ann. Inst. Statist. Math. 2012, 64, 919–943. [Google Scholar] [CrossRef]
Belalia, M.; Bouezmarni, T.; Lemyre, F.C.; Taamouti, A. Testing independence based on Bernstein empirical copula and copula density. J. Nonparametr. Stat. 2017, 29, 346–380. [Google Scholar] [CrossRef]
Wang, L.; Lu, D. Application of Bernstein polynomials on estimating a distribution and density function in a triangular array. Methodol. Comput. Appl. Probab. 2023, 25, 56. [Google Scholar] [CrossRef]
Berrahou, N.E.; Bouzebda, S.; Douge, L. A nonparametric distribution-free test of independence among continuous random vectors based on L₁-norm. Bernoulli 2025, 31, 1325–1350. [Google Scholar] [CrossRef]
Sancetta, A.; Satchell, S. The Bernstein copula and its applications to modeling and approximations of multivariate distributions. Econom. Theory 2004, 20, 535–562. [Google Scholar] [CrossRef]
Abrams, S.; Janssen, P.; Swanepoel, J.; Veraverbeke, N. Nonparametric estimation of risk ratios for bivariate data. J. Nonparametr. Stat. 2022, 34, 940–963. [Google Scholar] [CrossRef]
Ouimet, F. Asymptotic properties of Bernstein estimators on the simplex. J. Multivar. Anal. 2021, 185, 104784. [Google Scholar] [CrossRef]
Bouzebda, S.; Nezzal, A.; Elhattab, I. Limit theorems for nonparametric conditional U-statistics smoothed by asymmetric kernels. AIMS Math. 2024, 9, 26195–26282. [Google Scholar] [CrossRef]
Ouimet, F. On the boundary properties of Bernstein estimators on the simplex. Open Stat. 2022, 3, 48–62. [Google Scholar] [CrossRef]
Kotz, S.; Balakrishnan, N.; Johnson, N.L. Continuous Multivariate Distributions, Volume 1: Models and Applications, 2nd ed.; Wiley: New York, NY, USA, 2000. [Google Scholar]
Ng, K.W.; Tian, G.L.; Tang, M.L. Dirichlet and Related Distributions; Wiley Series in Probability and Statistics; Theory, Methods and Applications; John Wiley & Sons, Ltd.: Chichester, UK, 2011; p. xxvi+310. [Google Scholar] [CrossRef]
Ouimet, F.; Tolosana-Delgado, R. Asymptotic properties of Dirichlet kernel density estimators. J. Multivar. Anal. 2022, 187, 104832. [Google Scholar] [CrossRef]
Cheng, Y.; De Gooijer, J.G. Bahadur representation for the nonparametric M-estimator under α-mixing dependence. Statistics 2009, 43, 443–462. [Google Scholar] [CrossRef]
Cadre, B.; Gannoun, A. Asymptotic normality of consistent estimate of the conditional L₁-median. Ann. l’ISUP 2000, 44, 13–33. [Google Scholar]
Reimann, C.; Filzmoser, P.; Fabian, K.; Hron, K.; Birke, M.; Demetriades, A.; Dinelli, E.; Ladenberger, A.; Team, T.G.P. The concept of compositional data analysis in practice–Total major element concentrations in agricultural and grazing land soils of Europe. Sci. Total Environ. 2012, 426, 196–210. [Google Scholar] [CrossRef]
Kemperman, J. The Median of a Finite Measure on a Banach Space. In Statistical Data Analysis Based on the L1-Norm and Related Methods; Dodge, Y., Ed.; North-Holland: Amsterdam, The Netherlands, 1987; pp. 217–230. [Google Scholar]
Bouzebda, S. General tests of conditional independence based on empirical processes indexed by functions. Jpn. J. Stat. Data Sci. 2023, 6, 115–177. [Google Scholar] [CrossRef]
Bouzebda, S.; Taachouche, N. On the variable bandwidth kernel estimation of conditional U-statistics at optimal rates in sup-norm. Phys. A 2023, 625, 129000. [Google Scholar] [CrossRef]
Bouzebda, S. Weak convergence of the conditional single index U-statistics for locally stationary functional time series. AIMS Math. 2024, 9, 14807–14898. [Google Scholar] [CrossRef]
Devroye, L.; Penrod, C.S. Distribution-Free Lower Bounds in Density Estimation. Ann. Stat. 1984, 12, 1250–1262. [Google Scholar] [CrossRef]
Devroye, L. On the Almost Everywhere Convergence of Nonparametric Regression Function Estimates. Ann. Stat. 1981, 9, 1310–1319. [Google Scholar] [CrossRef]

Figure 1. Nominal

95 %

Gaussian ellipsoids associated with

{\hat{Q}}_{n} (u ∣ x_{0})

for

n = 100

(dash–dot–dot),

n = 200

(dotted), and

n = 500

(solid). The corresponding point estimates are indicated by black dots (

n = 100

), squares (

n = 200

), and triangles (

n = 500

). (a)

u = {(- 0.8, - 0.58)}^{⊤}

,

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

. (b)

u = {(- 0.8, - 0.58)}^{⊤}

,

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

.

Figure 1. Nominal

95 %

Gaussian ellipsoids associated with

{\hat{Q}}_{n} (u ∣ x_{0})

for

n = 100

(dash–dot–dot),

n = 200

(dotted), and

n = 500

(solid). The corresponding point estimates are indicated by black dots (

n = 100

), squares (

n = 200

), and triangles (

n = 500

). (a)

u = {(- 0.8, - 0.58)}^{⊤}

,

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

. (b)

u = {(- 0.8, - 0.58)}^{⊤}

,

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

.

Figure 2. Estimated contours of the

u

-th geometric conditional quantile at

x_{0} = 0.3

, for

u = u (r, φ) = r {(cos φ, sin φ)}^{⊤}

,

r = 0.1, \dots, 0.9

, and

φ = k π / 16

,

k = 0, \dots, 31

. The inner curves correspond to smaller values of r, while the outer curves correspond to directions closer to the boundary of

B^{(2)}

. (a) Directional contour polygons at

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

,

n = 100

. (b) Directional contour polygons at

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

,

n = 200

.

Figure 2. Estimated contours of the

u

-th geometric conditional quantile at

x_{0} = 0.3

, for

u = u (r, φ) = r {(cos φ, sin φ)}^{⊤}

,

r = 0.1, \dots, 0.9

, and

φ = k π / 16

,

k = 0, \dots, 31

. The inner curves correspond to smaller values of r, while the outer curves correspond to directions closer to the boundary of

B^{(2)}

. (a) Directional contour polygons at

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

,

n = 100

. (b) Directional contour polygons at

x_{0} = 0.3

,

b ≍ n^{- 2 / 5}

,

n = 200

.

Figure 3. Scatter plots and fitted conditional quantile surfaces for the GEMAS data (

s = 2

). (a)

{pH}_{{CaCl}_{2}}

versus

(Sand - norm, Silt - norm)

. (b)

TOC

versus

(Sand - norm, Silt - norm)

. The red surface represents the conditional geometric median (

u_{0}

), while the blue surface represents the directional quantile for

u_{+} = {(0.4, 0.5)}^{⊤}

. The bandwidth is

b ≍ n^{- 1 / 3}

.

Figure 3. Scatter plots and fitted conditional quantile surfaces for the GEMAS data (

s = 2

). (a)

{pH}_{{CaCl}_{2}}

versus

(Sand - norm, Silt - norm)

. (b)

TOC

versus

(Sand - norm, Silt - norm)

. The red surface represents the conditional geometric median (

u_{0}

), while the blue surface represents the directional quantile for

u_{+} = {(0.4, 0.5)}^{⊤}

. The bandwidth is

b ≍ n^{- 1 / 3}

.

Table 1. Geometric conditional quantile estimates for the GEMAS data (

s = 1

). The covariate is Sand-norm. Bandwidth

b ≍ n^{- 2 / 5}

. Asymptotic standard errors (ASE) are in parentheses.

Table 1. Geometric conditional quantile estimates for the GEMAS data (

s = 1

). The covariate is Sand-norm. Bandwidth

b ≍ n^{- 2 / 5}

. Asymptotic standard errors (ASE) are in parentheses.

x	$u_{0} = {(0, 0)}^{⊤}$		$u_{+} = {(0.4, 0.5)}^{⊤}$		$u_{-} = {(- 0.4, - 0.5)}^{⊤}$
x	${Zn}_{XRF}$	${Cu}_{XRF}$	${Zn}_{XRF}$	${Cu}_{XRF}$	${Zn}_{XRF}$	${Cu}_{XRF}$
0.333	70.671	17.561	95.309	39.717	51.098	2.864
	(0.504)	(0.188)	(0.576)	(0.445)	(0.512)	(0.178)
0.464	66.201	15.630	93.514	39.112	45.779	1.250
	(0.538)	(0.190)	(0.591)	(0.423)	(0.437)	(0.139)
0.606	55.628	11.815	83.438	34.362	38.312	−0.141
	(0.538)	(0.176)	(0.795)	(0.523)	(0.402)	(0.115)

Table 2. Geometric conditional quantile estimates for the GEMAS data (

s = 2

). The covariates are Sand-norm and Silt-norm. Bandwidth

b ≍ n^{- 1 / 3}

. Asymptotic standard errors (ASE) are in parentheses.

Table 2. Geometric conditional quantile estimates for the GEMAS data (

s = 2

). The covariates are Sand-norm and Silt-norm. Bandwidth

b ≍ n^{- 1 / 3}

. Asymptotic standard errors (ASE) are in parentheses.

$x = (x_{1}, x_{2})$	$u_{0} = {(0, 0)}^{⊤}$		$u_{+} = {(0.4, 0.5)}^{⊤}$		$u_{-} = {(- 0.4, - 0.5)}^{⊤}$
$x = (x_{1}, x_{2})$	${pH}_{{CaCl}_{2}}$	$TOC$	${pH}_{{CaCl}_{2}}$	$TOC$	${pH}_{{CaCl}_{2}}$	$TOC$
(0.25, 0.74)	6.364	1.699	7.111	2.558	5.530	1.003
	(0.076)	(0.029)	(0.037)	(0.061)	(0.028)	(0.024)
(0.45, 0.53)	5.975	1.886	6.930	3.054	5.155	1.074
	(0.070)	(0.045)	(0.051)	(0.096)	(0.058)	(0.038)
(0.68, 0.31)	5.699	2.040	6.769	3.460	4.862	1.088
	(0.059)	(0.062)	(0.062)	(0.116)	(0.052)	(0.042)
(0.80, 0.12)	5.437	1.820	6.539	3.181	4.605	0.939
	(0.060)	(0.057)	(0.071)	(0.094)	(0.054)	(0.035)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alwadeai, A.; Bouzebda, S.; Khardani, S. Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex. Mathematics 2026, 14, 1242. https://doi.org/10.3390/math14081242

AMA Style

Alwadeai A, Bouzebda S, Khardani S. Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex. Mathematics. 2026; 14(8):1242. https://doi.org/10.3390/math14081242

Chicago/Turabian Style

Alwadeai, Abdulghani, Salim Bouzebda, and Salah Khardani. 2026. "Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex" Mathematics 14, no. 8: 1242. https://doi.org/10.3390/math14081242

APA Style

Alwadeai, A., Bouzebda, S., & Khardani, S. (2026). Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex. Mathematics, 14(8), 1242. https://doi.org/10.3390/math14081242

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dirichlet–Kernel Methods for Geometric Conditional Quantiles: Bahadur Expansions and Boundary Adaptivity on the d-Simplex

Abstract

1. Introduction

Organization

2. Setup and Definitions

Notation

3. Main Results

3.1. Discussion of the Assumptions

3.2. Bahadur Representation for the Geometric Conditional Quantile Estimator

4. Numerical Results

Discussion of Simulation Results

5. Empirical Validation: Simplex-Constrained Inference for Geochemical Data

5.1. Univariate Compositional Covariate: Sand-Normalized Texture

5.1.1. Point Estimates and Asymptotic Standard Errors: s = 1

5.1.2. Interpretation and Theoretical Validation

5.2. Bivariate Compositional Covariate: Sand and Silt Interplay

5.2.1. Point Estimates and Asymptotic Standard Errors: s = 2

5.2.2. Visualizing the Conditional Structure: Directional Quantile Surfaces

5.2.3. Interpretation and Theoretical Synthesis

5.3. Discussion: Synthesis of Empirical Findings and Theoretical Implications

6. Conclusions and Perspectives

6.1. Synthesis of Contributions

6.2. Methodological Import and Positioning

6.3. Limitations and Avenues for Extension

6.4. Concluding Remarks

7. Proofs of the Main Results

A.1 Some Lemmas

8. Proof of the Technical Lemmas

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5.1.1. Point Estimates and Asymptotic Standard Errors: $s = 1$

5.2.1. Point Estimates and Asymptotic Standard Errors: $s = 2$