1. Introduction and Geometric Preliminaries
Quantization theory concerns the approximation of a probability distribution by a finite set of representative points (or codepoints) in such a way that the expected distortion is minimized. The classical foundations of Euclidean quantization were established through pioneering work of Zador [
1], the extensive development by Gersho and Gray [
2], and the authoritative survey of Gray and Neuhoff [
3]. A rigorous measure-theoretic and probabilistic framework for quantization of probability distributions was subsequently formulated by Graf and Luschgy in their monograph [
4]. Statistical aspects such as consistency of the
k-means method were studied by Pollard [
5], while learning-theoretic methods in vector quantization were developed by Linder [
6]. These works together form the basis of modern Euclidean quantization theory.
In recent years, there has been growing interest in extending quantization to non-Euclidean and curved spaces, particularly to probability measures supported on manifolds. On the sphere, quantization is relevant in directional statistics [
7], geometric data analysis, and manifold-based applications arising in computer vision, shape analysis, and machine learning. In this setting, classical Euclidean notions such as straight lines, centroids, and Voronoi regions must be replaced by their intrinsic geometric counterparts: geodesic arcs, Fréchet means [
8], Karcher means [
9], and spherical Voronoi tessellations.
While the general problem of optimal quantization on spherical surfaces is inherently high-dimensional and analytically challenging, the focus of this paper is deliberately more specific and pedagogical. We concentrate on intrinsic quantization problems for probability measures supported on one-dimensional spherical curves, namely great circles, small circles, and geodesic arcs. In these settings, the sphere induces a one-dimensional Riemannian structure via arc-length, allowing the intrinsic quantization problem to reduce to a transparent and fully explicit one-dimensional model.
The central theme of the paper is that, once the correct intrinsic metric is adopted, quantization on such spherical curves reduces to classical one-dimensional quantization with respect to arc-length. This reduction explains why the optimal Voronoi cells are contiguous intervals and why, under uniform density, the optimal codepoints are the midpoints of these intervals.
A brief familiarity with differential geometry and intrinsic statistics on manifolds is helpful for understanding quantization beyond Euclidean settings. For an accessible introduction to the geometry of curves and surfaces, we refer the reader to Tapp [
10], while the foundational framework of intrinsic statistics on Riemannian manifolds and geometric measurements was developed in the influential work of Pennec [
11]. These references provide essential background for readers wishing to deepen their understanding of the geometric and statistical structures underlying quantization on curved spaces.
The aim of this introductory section is twofold. First, we provide a concise and pedagogical overview of the geometric and analytical tools required for quantization on the sphere—geodesic distance, spherical coordinates, Slerp interpolation, Fréchet means, Voronoi partitions, and centroid conditions. Second, we establish the conceptual foundations that allow the reader to follow the developments in the later sections smoothly, without requiring prior background in differential geometry. The exposition is intentionally intuitive and example-driven, with the goal of making the subject accessible to beginning graduate students and researchers in analysis, probability, or applied mathematics who wish to learn quantization on manifolds for the first time.
Before entering the detailed geometric and analytic developments, we summarize the guiding principle underlying the main results of this paper. While optimal quantization on spherical surfaces is, in general, a high-dimensional and analytically challenging problem, the focus of this paper is deliberately more specific. We concentrate on intrinsic quantization problems for probability measures supported on one-dimensional spherical curves, namely great circles, small circles, and geodesic arcs. Although the ambient space is the two-dimensional sphere, once the support of the probability measure is constrained to such a spherical curve, the intrinsic quantization problem effectively reduces to a one-dimensional Riemannian model with arc-length parameterization. This reduction principle allows for a transparent and explicit analysis.
1.1. Nature and Contribution of the Paper
The results presented here recover, in the intrinsic spherical setting, the classical one-dimensional uniform quantization formula
where
L denotes the intrinsic length of the support curve. Accordingly, the primary contribution of this paper is not the discovery of new quantization asymptotics, but rather a geometric re-framing of known one-dimensional results within the intrinsic geometry of the sphere. The paper is intended as a tutorial and pedagogical note, highlighting how intrinsic metrics, geodesic Voronoi structures, and arc-length parameterization lead naturally to transparent and explicit solutions on spherical curves.
The reduction principle described above is summarized in the following conceptual theorem.
Theorem 1 (Conceptual reduction to the one-dimensional uniform case). Let be a smooth closed geodesic curve of total intrinsic length L (e.g., a great circle or a small circle), and let P be the uniform probability measure on C with respect to arc-length. Then, the intrinsic quantization problem on with squared geodesic distortion is equivalent to the classical one-dimensional uniform quantization problem on a circle of length L.
In particular, for each , the optimal set of n-means consists of n equally spaced points along C, each Voronoi region has length , each codepoint is the geodesic midpoint of its cell, and the quantization error satisfies The present article is intended as a beginner-friendly expository companion to the author’s forthcoming comprehensive monograph, which will provide a deeper and more systematic treatment of quantization on spherical surfaces [
12]. Accordingly, we emphasize geometric intuition, explicit derivations, and pedagogical clarity throughout. This section provides the essential background for the study of quantization on spherical surfaces, summarizing the geometric and analytical components that will be used throughout the paper.
1.2. Great Circle, Small Circle, and Spherical Triangle
A great circle on a sphere is the intersection of the sphere with a plane that passes through the center of the sphere. Equivalently, it is a circle on the sphere whose center coincides with the center of the sphere. A small circle on a sphere is the intersection of the sphere with a plane that does not pass through the center of the sphere. The center of the small circle lies on the line connecting the center of the sphere and the point on the plane nearest to the center, but it does not coincide with the sphere’s center. A spherical triangle on the surface of a sphere is the region bounded by three arcs of great circles, each pair of which intersects at a vertex. The three vertices lie on the sphere, and the sides are segments of great circles connecting these vertices. The angles of a spherical triangle are the dihedral angles between the planes of the great circles at their intersections.
1.3. Equator and the Prime Meridian, Latitude and Longitude
The Equator is an imaginary closed curve on the surface of the Earth that lies equidistant from the North and South Poles. Geometrically, it is a great circle and it divides the Earth into the Northern Hemisphere and the Southern Hemisphere. Latitude measures how far north or south a point is from the Equator. It ranges from at the Equator to North (the North Pole) and South (the South Pole). The Prime Meridian is an imaginary semicircular great circle on the surface of the Earth that passes through the North Pole and South Pole and the Royal Observatory in Greenwich, England. It divides the Earth into the Eastern Hemisphere and the Western Hemisphere. Longitude measures how far east or west a point is from the Prime Meridian. It ranges from at the Prime Meridian to East and West. The Equator is the line of latitude, and the Prime Meridian is the line of Longitude. Latitude and Longitude form a coordinate pair: . For example: New York City , Rio de Janeiro , London .
1.4. Relationship Between Three Different Coordinates: Cartesian, Spherical, and Geographical
The relationship between Cartesian coordinates
and spherical coordinates
is given by the conversion formulas:
where
ρ: the radial distance from the origin to the point,
θ: the azimuthal angle, measured in the xy-plane from the positive x-axis (longitude),
ϕ: the polar angle, measured from the positive z-axis (colatitude).
If the radius
is fixed, the spherical coordinates of the point can be identified as
. Notice that if the radius
of the sphere is fixed the same point in latitude–longitude coordinates is represented by
. The latitude–longitude coordinates of a point on a sphere are called the
geographical coordinates of the point. Let the geographical coordinates of a point on the sphere be given by
, where
Then, by (
1), we have the embedding of
into
as the vector
1.5. Geodesic Distance via Geographical Coordinates
The geodesic distance between two points on a surface (like a sphere) is the shortest possible distance along the surface that connects them. It is the analog of a “straight line distance” in flat Euclidean space, but restricted to move on the surface. On a sphere, the geodesics are great circle arcs, so the geodesic distance between two points on the sphere equals the length of the shorter great circle arc joining them. Consider two points on a sphere of radius
with geographical coordinates
For two points
and
, their corresponding vectors are
Therefore, the
central angle between
and
satisfies
Then,
which is known as the
geodesic distance between and via geographical coordinates on a sphere of radius
.
1.6. Arc Length on a Spherical Surface
Let
be a sphere of radius
. Let
be a smooth curve lying on the spherical surface. Suppose that
admits a smooth parametrization
such that
and
for all
. The
arc length element along the curve is
is also known as the differential of the arc length. The total length of the curve is given by
1.7. The Two-Argument Arctangent Function
To represent angles in the full range
without ambiguity, we use the two-argument arctangent function
. For any
, this function returns the unique angle
such that
Unlike the single-variable function , the function automatically accounts for the correct quadrant of the point and is standard in geometry, navigation, and numerical implementations.
In this paper, atan2 is used only to define longitude coordinates consistently and plays no further conceptual role in the quantization analysis.
1.8. Spherical Linear Interpolation (Slerp)
It is a smooth parametrization of the shortest geodesic (great-circle) arc connecting two points on a sphere.
Let
be a sphere of radius
, and let
be two distinct points. Denote by
the central angle (in radians) subtended by
and
at the center of the sphere, where
is the Euclidean inner product (dot product) between the 3D vectors
and
. Then, the
Slerp curve between
and
is defined by
Obviously, the curve satisfies , . Moreover, and , which is a constant (see Proposition 1) for all . Thus, the curve lies entirely on the sphere with length the geodesic distance between and . traces the unique great-circle arc connecting and . As increases uniformly from 0 to 1, the central angle from to increases linearly from 0 to s; hence, the motion along the arc has constant angular speed and covers equal arc lengths for equal increments of .
Proposition 1. For the Slerp curvewhere and , we have Proof. Because
, we have
implying
Substituting the values of
a and
b, we have
Using the product-to-sum identity
and after simplification, we obtain
To show , we proceed as follows:
Write the associated unit vectors:
Their central angle
is independent of
, i.e.,
. Build an orthonormal basis of the plane span
:
This is the component of
orthogonal to
. We compute its norm:
Hence, for
, we have
Differentiating with respect to
, we have
Since
and
are orthonormal, we have
. Thus, the proof is complete. □
Remark 1. In Proposition 1, s is the angle between the vectors . If , then , and hence we can take the Slerp curve as a constant curve for . Then, for all . Hence, the length of the Slerp curve isLikewise, the speed is . On the other hand, if , then the vectors and are antiparallel, i.e., . The short geodesic is any great semicircle joining to ; it is not unique because there are infinitely many planes through the origin containing the line . Choose any unit vector . Then, the great semicircle can be parameterized asThis curve starts at and ends at . Differentiating with respect to τ, we obtainBecause and are orthonormal, the speed is constant:Hence, the total length of the semicircular geodesic is 1.9. Fréchet Mean
Let
be a metric space and let
P be a Borel probability measure on
M. The
Fréchet mean (also called the
intrinsic mean or
Riemannian center of mass) of
P is defined as the point or set of points in
M minimizing the expected squared distance to
P. Formally,
When the minimizer is unique,
is called
the Fréchet mean of
P; otherwise, the set of all minimizers is referred to as the
Fréchet mean set of
P. The function
is known as the
Fréchet functional. A point
satisfying (
4) is the unique global minimizer of
F whenever
F is strictly convex.
Interpretation. The Fréchet mean generalizes the classical Euclidean mean to arbitrary metric or Riemannian spaces. In Euclidean space, the minimizer of the mean squared distance coincides with the ordinary arithmetic mean. On a curved manifold, the distance d is replaced by the geodesic distance , so that the Fréchet mean provides the natural notion of “average” consistent with the intrinsic geometry of M.
Example 1 (Euclidean space)
. Let with the standard Euclidean distance . For a random variable X with distribution P, the Fréchet functional becomesDifferentiating with respect to q and setting yieldsHence, the Fréchet mean coincides with the classical Euclidean (arithmetic) mean. Example 2 (Spherical space)
. Let be the unit sphere in equipped with the geodesic distancewhere denotes the standard inner product in . For a probability distribution P on , the Fréchet mean minimizesIf P is the uniform distribution on a geodesic arc of the sphere, the unique minimizer is the midpoint of the arc in geodesic distance. Similarly, for a uniform distribution on a symmetric closed curve such as the boundary of a spherical triangle, the Fréchet mean lies on the axis of symmetry of that curve.
1.10. Quantization Error on the Sphere
Let
P be a Borel probability measure on a sphere
of radius
equipped with the geodesic metric
. Let
be a finite set of points. Such a set
is also referred to as codebook and the elements as codepoints. Let
with
. Then, the
distortion error for
P, of order
associated with
, denoted by
, is defined by
is called the nth quantization error of order r for the probability measure P. When , the problem corresponds to minimizing the mean squared geodesic distance between a random vector X with distribution P and its nearest codepoint. Then, is called the nth quantization error for P with respect to the squared geodesic distance.
1.11. Spherical Voronoi Regions
Given a codebook
, the sphere can be partitioned into
spherical Voronoi regionsEach region
consists of all points on the sphere closer to
than to any other codepoint (with respect to the geodesic metric). The distortion error
can then be written as
1.12. Optimal Quantizers on the Sphere
A codebook
is called an
optimal codebook, also called an
optimal set of n-means, if
Each element of an optimal codebook is called an optimal codepoint or optimal quantizer. In the case , each optimal codepoint satisfies a spherical centroid condition analogous to the Euclidean case.
1.13. Extrinsic Centroid Condition and Its Relation to Intrinsic (Fr Échet) Means
Let
P be a Borel probability measure supported on the sphere
equipped with the geodesic distance
. Let
be an optimal set of
n-means for
P (with respect to squared geodesic distortion), and let
denote the associated spherical Voronoi regions.
Extrinsic centroid condition. In many applied and algorithmic treatments of quantization on spheres (notably spherical
k-means), one considers an
extrinsic formulation obtained by viewing
as embedded in
and minimizing Euclidean squared distance subject to the spherical constraint. More precisely, for each Voronoi region
, one minimizes
Define the (unnormalized) Euclidean conditional expectation over the Voronoi region by
Since typically
, the constrained minimizer is obtained by radial projection of
onto
, yielding the
extrinsic centroidWe emphasize that (
5) is an
extrinsic (embedded/chordal) centroid rule: it arises from Euclidean squared loss in
together with the constraint
.
Relation to intrinsic (Fréchet) means. The intrinsic (Fréchet) mean on
is defined as a minimizer of the geodesic Fréchet functional
In general, the extrinsic centroid (
5) does
not coincide with the intrinsic Fréchet mean, since the intrinsic problem depends nonlinearly on the Riemannian geometry through the exponential and logarithm maps.
For a Voronoi region
, the correct intrinsic first-order optimality condition for a minimizer
of
is the vanishing of the Riemannian gradient, which can be expressed (when
R lies in a domain where the logarithm map is well-defined) as
where
denotes the Riemannian logarithm map at
a on the sphere. In general, (
6) is an implicit condition and does not yield a closed-form centroid formula.
1.14. Quantization Dimension and Quantization Coefficient
Let
P be a Borel probability measure on a sphere
of radius
, equipped with the geodesic metric
. Let
denote the
nth quantization error of order
for
P. If the following limit exists,
then
is called the
quantization dimension of order r of the measure
P. It measures the asymptotic rate at which the optimal quantization error
decreases as
n increases. In particular, if
then
. Assuming that
exists, the
upper and
lower quantization coefficients of order r are defined, respectively, by
If both limits coincide, i.e.,
then
is called the
s-dimensional quantization coefficient of order r for the measure
P.
Interpretation. The quantization coefficient provides the asymptotic constant in the rate of decay of the quantization error:
Hence, characterizes the scaling exponent, while gives the precise asymptotic constant depending on the geometry of the support of P on .
1.15. Applications
Quantization on spheres has applications in various areas such as the following:
Directional statistics, and earth and planetary sciences (e.g., wind directions, orientations).
Quantization of probability measures on compact manifolds.
Spherical coding and communication systems.
Computer graphics and spherical data compression.
8. Worked Examples
We now calculate spherical optimal sets of n-means for different discrete probability measures supported on finitely many points of .
Example 3 (Equator
with
)
. On , take codepoints at angles 0, , . Cells are arcs of length centered at these points. The error is Example 4 (Small circle with
,
)
. Here, . Then, Example 5 (Great circular arc of length with ). Two equal sub-arcs of length ; representatives are midpoints at distances from the ends. The error is .
Example 6 (Discrete-uniform arc: points, clusters). With equally spaced points on an arc of length L, take three contiguous blocks of size 3; each block center is the middle point (by Lemma 1). The exact discrete is the average (over all points) of squared geodesic distances to their block centers; as m grows, this converges to .
Example 7 (Antipodal pair: intrinsic vs. extrinsic centroids)
. Letwith equal weights , and letbe a discrete probability measure on . Intrinsic (Fréchet) formulation. For , we minimize the Fréchet functional
Since and are antipodal, for every , By symmetry, is constant along the great circle orthogonal to the axis through and . Hence, the intrinsic Fréchet mean is not unique: the Fréchet mean set is precisely this equatorial great circle.
For example, choosingwe haveand therefore Extrinsic centroid. The extrinsic centroid (see Section 1.13) is obtained by first computing the Euclidean mean
Since , normalization onto the sphere is not possible, and hence the extrinsic centroid is undefined in this case.
Two-means case. For , choosing the codebook yields zero distortion, and thus Remark. This example illustrates that for antipodally symmetric distributions on the sphere, intrinsic Fréchet means may be non-unique, while extrinsic centroids may fail to exist. Such behavior has no Euclidean analog and highlights the geometric nature of quantization on spherical manifolds.
Example 8 (Three equally spaced equatorial points)
. Let , with equal weights. Then, Example 9 (Two-point nonuniform distribution). For , with , , minimizing yields and .
Example 10 (Regular tetrahedron)
. Let be the vertices of a regular tetrahedron on . Then, for , so . For , any vertex can serve as , giving Example 11 (Uniform discrete set on a small circle)
. Fix latitude and m equally spaced points . Then, for any , the optimal configuration preserves longitudes and the quantization error scales as Example 12 (Spherical triangle)
. Let , , with equal weights. The Euclidean centroid projects to , giving