Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series

Lamothe-Fernández, Prosper; Rojas, Eduardo; Bayuk, Andriy

doi:10.3390/math14071098

Open AccessArticle

Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series

by

Prosper Lamothe-Fernández

^1,*

,

Eduardo Rojas

² and

Andriy Bayuk

³

¹

Department of Financing and Commercial Research, UDI Financing, Universidad Autónoma de Madrid, Calle Francisco Tomas y Valiente 5, 28049 Madrid, Spain

²

Doctoral School, Universidad Autónoma de Madrid, Calle Francisco Tomas y Valiente 5, 28049 Madrid, Spain

³

Equity Derivatives Structuring, Quantum Leap Equity Holding GmbH, 1010 Vienna, Austria

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(7), 1098; https://doi.org/10.3390/math14071098

Submission received: 23 February 2026 / Revised: 18 March 2026 / Accepted: 19 March 2026 / Published: 24 March 2026

(This article belongs to the Section E: Applied Mathematics)

Download

Browse Figures

Versions Notes

Abstract

Classic machine learning and regime identification methods applied to financial time series lack theoretical guarantees and exhibit systematic failure modes: heavy-tails invalidate moment-based geometry, rendering distances and centroids dominated by extremes or unstable; jumps violate smoothness, destabilizing local regressions, kernel methods, and gradient-based learning; and non-stationarity disrupts neighborhood relations, so distances in classical feature spaces no longer reflect meaningful proximity. To address these challenges, we propose a topology-based machine-learning framework grounded on probabilistic reconstruction of state-space geometry, which replaces moment- and smoothness-dependent representations with deformation-stable summaries of state-space geometry, preserving neighborhoods, adjacency, and topology. The finite-sample validity of homeomorphic state-space reconstruction, required for topology-based machine learning, is assessed through numerical studies on synthetic data with heavy tails, jumps, and known ground-truth regimes. Further diagnostics of local invertibility and bounded geometric distortion quantify when embedding windows are consistent with local diffeomorphic behavior, enabling metric-sensitive, geometry-aware learning. Clustering of Hilbert-space summaries accurately recovers underlying market tail-risk regimes with robust results across selected filtrations. Temporal, feature-space, and cluster-label null tests confirm that topology-based clustering captures genuine topological structure rather than noise or artifacts, and encodes temporal dependencies at local, mesoscopic, and network levels associated with market regimes.

Keywords:

persistence homology; L²-norms; heavy-tailed time series; machine learning; unsupervised clustering; market regimes; tail risk

MSC:

62R40; 68T05; 62H30; 62G10; 62G05

1. Introduction

Machine learning methods perform well when data are naturally geometric and assumptions about distances, neighborhood relations, and smoothness hold, as in images, speech, or sensor data. In financial time series, these geometric assumptions often fail: distances may not reflect true proximity, nearby points may not share task-relevant properties, and feature representations may distort or destroy the underlying geometry. Hidden Markov Models (HMMs) and Gaussian Mixture Models (GMMs) [1,2] are widely used for market regime detection but rely on summary statistics (mean, variances, correlations, quantiles) that obscure the underlying geometry and fail under heavy tails, jumps, and non-stationarity as means do not converge, covariances fluctuate erratically, and distances become dominated by extremes. Wasserstein-based K-means methods preserve distributional geometry [3,4] but still lack guarantees that clustering boundaries correspond to true regime dynamics when finite-moment assumptions fail, and inherit sensitivity to cluster compactness and centroid placement in heavy-tailed environments [5,6]. Consequently, distances in conventional feature spaces are algebraic rather than geometric, and downstream learning may reflect representation artifacts rather than true system dynamics.

Persistence homology offers an alternative to classic machine learning methods, providing multiscale, deformation-stable summaries of geometry that are robust to noise [7,8], free of parametric assumptions, and mitigate the curse of dimensionality [9]. Applied to delay-embedded time series when diffeomorphic conditions hold, it preserves neighborhoods and relative distances in the reconstructed state space, making distances between topological summaries meaningful for machine learning applications. However, financial time series are stochastically forced systems exhibiting heavy tails, power-law decay, and volatility clustering [10,11,12,13,14], which violate Takens’ delay embedding theorem [15], compromising its guarantees.

As a result, many applications of topological data analysis in finance bypass Takens’ delay embedding and heuristically apply persistence homology to point clouds constructed on rolling windows of multivariate returns [16], correlation matrices [17,18,19], or k-nearest-neighbor networks [20]. The resulting topological summaries are interpreted as indicators of evolving market states rather than reconstructions of an underlying geometry [16]. Variations include heuristic calibration of Takens’ delay parameters [21] and fitting price bubble dynamics to quasi-periodic processes with a finite-time critical point [22]. Figure 1 illustrates this approach for the four US indices between 1981 and 2025. Spikes of

L^{2}

-norms

H_{1}

are observed before the dot-com, Lehmann, and COVID-19 crashes.

Empirical applications of this heuristic approach show that changes in topological features—such as

L^{1}

or

L^{2}

-norms of

H_{1}

—can provide early warning signals of financial market crises [16,23,24,25], identify critical market transitions [26,27,28], characterize tail-risk geometry [29,30], and improve realized volatility forecasting using PH-based models [29]. However, because these point clouds are purely statistical objects, they do not encode the dynamical flow, and the resulting neighborhoods, distances, and clusters lack a principled geometric interpretation required for regime identification and machine learning.

In this context, Gidea et al. [31] introduced the notion of a slowly varying instantaneous attractor, so that financial dynamics over short windows can be approximated as deterministic. Under this assumption, Takens’ delay embeddings are expected to yield a locally stable and geometrically meaningful reconstruction of state-space geometry, so that neighborhoods, distances, and clustering outcomes are directly interpretable and useful for machine learning tasks. In practice, the assumption of local deterministic dynamics often fails in financial time series. Delay embeddings may still yield empirically meaningful geometric structure, but without theoretical justification from local determinism, direct machine learning interpretability is compromised: clusters may reflect noise, finite-sample effects, or arbitrary projections rather than true dynamical regimes.

Building on extensions of Takens’ theorem to stochastically forced systems due to Stark et al. [32,33], we develop a framework under which calibrated delay embeddings can be both theoretically and empirically justified for clustering, regime detection, and machine learning of heavy-tailed, stochastic financial time series. Under appropriate embedding dimension and smoothness conditions on the stochastic forcing, the stochastic embedding theorems of Stark et al. [32] guarantee that the delay-embedding defines a

C^{r}

homeomorphism: it preserves neighborhood relations, proximity structure, and topological invariants, providing a theoretical reference framework for topology-based clustering and regime identification under which delay embeddings preserve topological structure along stochastic trajectories. Under stronger smoothness assumptions, the delay map is locally diffeomorphic for almost every noise realization, preserving local metric and differential structure up to bounded distortion, thus justifying metric-sensitive machine-learning techniques.

However, these guarantees are asymptotic and probabilistic; therefore, they may not hold in finite samples of noisy financial time series. Because neither the true system state nor the underlying map is observable, this work conducts a numerical study to systematically evaluate through quantitative diagnostics whether the delay embeddings from such samples exhibit the observable consequences of

C^{r}

homeomorphism or local diffeomorphic reconstruction. Accordingly, we do not interpret these results as establishing the validity of the embedding, but rather as motivating diagnostics targeting their observable consequences. To enable rigorous testing in a setting where the true system dynamics are known, we generate synthetic price trajectories exhibiting jumps, heavy tails, and known ground-truth regimes, construct their corresponding delay-embedded point clouds, and apply these diagnostics.

The numerical experiments further assess whether clustering of topological features derived from these embeddings captures genuine temporal dependence and regime structure. To this end, we compute compact, Lipschitz-stable functional summaries of persistent homology—L²-norms of

H_{0}

and

H_{1}

[34,35]—and apply a heavy-tailed K-means algorithm [6]. To distinguish genuine topological and temporal structure from noise and finite-sample artifacts, we compare the resulting filtration and clustering outcomes against pointwise, temporal, and cluster-label null models extending the universal topological randomness results of Bobrowski and Skraba [36]. Significant separation relative to the temporal nulls—in metrics such as Total Accuracy or Silhouette score—provides evidence that preserved neighborhood structure is captured in the embedding and topological features, while feature-space and economic validation evaluate whether clusters capture genuine topological and distributional differences.

The proposed framework extends recent methods based on network-based representations of attractors for change-point detection [37] and zigzag persistence on temporal networks for identifying periodic or chaotic transitions [38] to regime detection and clustering of stochastically forced, heavy-tailed time series. Unlike these methods, which rely on discretized phase-space states or network snapshots relative to a baseline—well suited for deterministic, low-noise systems with well-defined trajectories but scaling poorly—our temporal topological network tracks the probabilistic temporal evolution of local geometry directly across temporal scales, using stable and computationally efficient persistence-based functional summaries, grounded on rigorous probabilistic guarantees of homeomorphic and diffeomorphic stochastic reconstruction.

2. Theoretical Framework

This section presents the mathematical and methodological foundations of a topology-based machine-learning framework for identifying market regimes in heavy-tailed financial time series. In this context, we address the fundamental challenges of applying topology-based methods to financial time series, formulate a heuristic topological model of temporal dependence for regime evaluation, and introduce a hierarchy of geometric and temporal topological null models designed to discriminate genuine temporal topological organization (market regimes) from noise.

2.1. Delay Embedding of Stochastically Forced Time Series

Time series of asset prices

S_{t}

and log-returns

r_{t}

are realistically modeled as stochastically forced dynamical systems

d S_{t} = μ S_{t} d t + σ S_{t} d W_{t} + S_{t -} d Y_{t}

(1)

where

μ S_{t} d t

is a deterministic drift and

σ S_{t} d W_{t} + S_{t -} d Y_{t}

introduces stochastic forcing through Brownian fluctuations and discontinuous jump dynamics. Here

d W_{t}

denote the increments of a standard Brownian Motion and

d Y_{t}

represents the jump increment at time

t

. The corresponding log-returns process satisfies

r_{t} = (μ - \frac{1}{2} σ^{2}) d t + σ Δ W_{t} + {Δ Y}_{t}

(2)

Following Stark et al. [32], delay-coordinate embeddings of stochastically forced dynamical systems can recover the underlying state geometry provided the stochastic forcing enters through a finite-dimensional, smooth family of maps.

Formally, let

M

be a compact manifold of dimension

m \geq 1

, and let

N

be a compact manifold of dimension

n

. Let

f : M \times N \to M

be a

C^{r}

map with

r \geq 1 .

Let

ω = (η_{0}, η_{1}, \dots .,) \in N^{Z}

(3)

denote a bi-infinite sequence of forcing parameters, and define the associated skew-product system

F (x, ω) = (f_{η_{0}} (x), σ ω)

(4)

where

σ : N^{Z} \to N^{Z}

denotes the left shift on sequences. Given a scalar observable

φ \in C^{r} (M \times N, R)

, the fiber-wise delay-coordinate map along a fixed noise realization

ω

is defined by

Φ_{f, φ . ω} (x) = (φ (x), φ (f_{η_{0}} (x)), φ (f_{η_{1}} ° f_{η_{0}} (x)), \dots . ., φ (f_{η_{d - 1}} ° \dots . . ° f_{η_{0}} (x)))

(5)

where

d

is the embedding dimension and

f_{k} (x) = f (x, ω_{k})

.

Theorem 5 of Stark–Broomhead–Davies–Huke [32]: if

d \geq 2 m + 1

and

r \geq 1

, where

m = d i m (M

), then there exists a residual set of pairs

(φ, f) \in C^{r} (M \times N, R) \times C^{r} (M \times N, M)

such that, for an open dense set of noise realizations

ω \in N^{Z}

, the fiberwise delay-coordinate map

Φ_{f, φ, ϖ} : M \to R^{d}

is a

C^{r}

embedding, i.e., an injective

C^{r}

map that is a homeomorphism onto its image.

Crucially, Theorem 5 imposes no invertibility assumptions on the fiber maps

f_{η}

; contractions, expansions, and folds are all permitted. The embedding dimension

d

is independent of the forcing dimension

n

, reflecting that the infinite-dimensional forcing sequence

ω

enter only through a finite-dimensional smooth family of maps

\{f_{η} : η \in N\}

. As a result, the classical Takens dimension bound remains valid, and the underlying state manifold is reconstructed almost surely along stochastic trajectories. The reconstruction is therefore fiberwise and probabilistic rather than a deterministic recovery of a single global attractor.

Under these conditions, delay embeddings preserve neighborhood relations and topological invariants of the state-space, providing a rigorous foundation for topological summaries and adjacency-based representations that rely on local connectivity rather than smoothness or metric information, including clustering and manifold-based learning. Temporal variations in these topological features therefore encode dependence structure and multiscale temporal dynamics.

While Theorem 5 guarantees the existence of topological embedding almost surely, it does not ensure that such a structure is detectable, or statistically stable in finite-sample delay embeddings constructed from noisy, heavy-tailed data. Since neither the true state

x_{t} \in M

nor the stochastic map

f_{η} (x_{t})

are observable from finite samples, empirical validation must be performed on the observable consequences of

C^{r}

embeddings. Accordingly, we assess consistency with Theorem 5 through diagnostics of local injectivity, noise regularity, smoothness, finite-dimensional forcing, and Lipschitz stability of reconstructed topological summaries derived from using the scalar observations

y_{t} = φ (x_{t})

without claiming to verify the regularity assumptions of the theorem [32]. The formal definition and detailed formulation of the diagnostics are provided in Appendix A, with implementation described in Algorithm A1.

Theorem 6 of Stark–Broomhead–Davies–Huke [32]: Suppose that the fiber maps are parameterized by a smooth family

p \in C^{r} (N, {D i f f}^{r} (M))

so that each realization

f_{η} = p (η) : M \to M

is a

C^{r}

diffeomorphism. Let the observable be state-dependent only,

φ \in C^{r} (M, R)

, and assume the forcing variables

ω_{k}

are i.i.d. with a common distribution

μ

, where

μ

is absolutely continuous with respect to Lebesgue measure on

N .

Then, for a generic pair

(p, φ)

, the delay-coordinate map

Φ_{f, φ, ω}

is a

C^{r}

embedding for a full-measure set of noise realizations

ω \in N^{Z}

. Moreover, the embedding varies smoothly with respect to the noise realization, yielding a family of delay maps that is locally diffeomorphic almost surely along the stochastic trajectory.

Consequently, the reconstructed dynamics are locally one-to-one and smoothly invertible along almost every finite observation window. Local geometric and differential structure—such as distances, angles, and Jacobians—is preserved up to bounded distortion. These guarantees justify the use of metric-sensitive and geometry-aware machine learning techniques, while remaining inherently local and probabilistic. Since neither the true state

x_{t} \in M

nor the underlying stochastic map

f_{η} (x_{t})

is directly observable in finite samples, empirical verification is naturally formulated in terms of the observable consequences of local

C^{r}

diffeomorphism through Jacobian-based diagnostics of local invertibility, bounded local distortion, and smooth variation across time. The formal formulation of the diagnostics is provided in Appendix B, with implementation described in Algorithm A2.

Figure 2 illustrates both the feasibility of delay-coordinate reconstruction from scalar data under stochastic forcing and how local diffeomorphic structure can be empirically assessed in practice through Jacobian-based diagnostics. The delay-coordinate reconstruction uses an embedding dimension d = 7 satisfying the sufficient conditions for almost-sure local

C^{r}

diffeomorphic embedding under smooth stochastic forcing. Window-wise Jacobian diagnostics of local injectivity, bounded distortion, and smooth variation over time indicate that the observable geometric consequences of local

C^{r}

diffeomorphism hold in approximately 90% of windows, indicating that despite stochastic forcing and finite sampling, the reconstructed dynamics preserve local geometric structure along most stochastic trajectories.

2.2. Temporal Topological Networks

Building on Theorem 5 [32], which guarantees almost-sure topological embedding under stochastic forcing, we summarize the evolving structure of the reconstructed state-space using stable topological functionals. For each sliding window

W_{t}

we define a scalar functional:

H_{t} = F (D_{t})

(6)

where

D_{t}

denotes the persistence diagram computed from the delay-embedded point cloud in the window

t

and

F

is a stable map on persistence diagrams. Two natural instantiations of

F

are considered. The first is the diagram-space total persistence

H_{t}^{P D} = \sum_{(b, d) \in D_{t}^{(0)}} (d - b) + \sum_{(b, d) \in D_{t}^{(1)}} (d - b)

(7)

that corresponds to the sum of the lifetimes of all features in the

H_{0}

and

H_{1}

persistence diagrams. Alternatively, following Bubenik [34], persistence diagrams can be mapped to persistence landscapes, forming the Hilbert space

L^{2}

-norm instantiation

H_{t}^{L_{2}} = {‖λ^{0} (D_{t})‖}_{2} + {‖λ^{1} (D_{t})‖}_{2}

(8)

which supports standard geometric distances and similarity kernels while retaining multiscale topological information. The

L^{2}

-norm induces a detrended Hilbert-space measure of topological change analog of Wasserstein distances

ϕ_{t}^{w d} = |H_{t} - {\bar{H}}_{t - w}|

, where

{\bar{H}}_{t - w}

is the mean over a rolling window of length

w

.

Both functionals are Lipschitz continuous with respect to Wasserstein distances [34], stable under homeomorphic delay embeddings [32], yielding compact time series

{\{H_{t}\}}_{t = 1}^{T}

that captures the evolution of the reconstructed geometry, preserving adjacency, similarity, and neighborhood relations inherited from the underlying state space. Under the stronger local diffeomorphic reconstruction conditions of Theorem 6 [32], the Hilbert-space

L^{2}

formulation further admits metric-based comparisons and geometry-aware learning.

Each functional summary

H_{t}

provides a compact descriptor of the instantaneous topological state of the reconstructed point cloud at time t. Collectively, the sequence

{\{H_{t}\}}_{t = 1}^{T}

encodes the temporal evolution of the system’s topology. To capture temporal organization, we construct a temporal topological network

V = \{v_{1}, v_{2}, \dots v_{T}\}

(9)

where each node

v_{i}

represents one such instantaneous topological state at time t. Edges

e_{t, s}

between nodes encode similarity between pairs of topological states at rime

t

and

s

:

e_{t, s} = f (D_{t}, D_{s})

(10)

where f is a diagram-level and geometry-aware distance such as

L^{2}

distances in the Hilbert space formulation. In the diagram-space formulation, we use Wasserstein distances:

d_{t, s}^{W a s s} = W (D_{t}, D_{s})

(11)

measuring the magnitude of topological change between times t and s. Collecting these distances yields the matrix

D^{W a s s} = {[d_{t, s}^{W a s s}]}_{t, s = 1}^{T}

(12)

which provides a visual map of similarity over time. Figure 3 shows this matrix for

H_{0}

and for

H_{1}

, directly representing the edges of the temporal topological network. The dark diagonal reflects minimal Wasserstein distances between consecutive windows, indicating topological similarity and temporal continuity, whereas off-diagonal light regions reveal significant topological changes between windows.

Distances are converted into topological similarities via an exponential kernel

{s i m}_{d i a g} (t, s) = e x p (- \frac{d_{t, s}^{W a s s}}{{m a x}_{i, j} d_{i, j}^{W a s s}}) \in (0, 1]

(13)

which preserves ordering and ensures boundedness in

⌈0, 1⌉

. To incorporate geometric scale information of the reconstructed local geometry, we compute the minimum spanning tree (MST) length

L_{t}

for each point cloud and define a scale-based similarity:

{s i m}_{M S T} (t, s) = e x p (- \frac{|L_{t} - L_{s}|}{σ (L)}), σ (L) = s t d ({\{L_{t}\}}_{t = 1}^{T})

(14)

This prevents geometrically rescaled but topologically similar states from being treated as temporally coherent, while

σ (L)

ensures the kernel is scale invariant. Edges

e_{t, s}

exist for

s > t

, with

s - t \leq T_{m a x}

if the resulting similarity is non-negligible. Thus, edge weights

w_{t, s}

are defined as follows:

w_{t, s} = g (d_{t, s}, L_{t}, L_{s}) \in (0, 1]

(15)

where g is a similarity functional acting on both topological and geometric information. In diagram-space,

w_{t, s} = s i m (t, s) = e x p (- \frac{W (D_{t}, D_{s})}{\max W}) e x p (- \frac{|L_{t} - L_{s}|}{σ (L)})

(16)

while in Hilbert-space,

w_{t, s} = s i m (t, s) = e x p (- \frac{d_{t, s}^{P L}}{σ_{P L}}) e x p (- \frac{|L_{t} - L_{s}|}{σ (L)})

(17)

where

d_{t, s}^{P L}

is the instantaneous pairwise

L^{2}

distance between persistence landscapes embeddings. Large weights

w_{t, s}

indicate topological similarity and geometric coherence, while small weights signal regime transitions, structural changes, or temporal deformation.

Thus, Equations (9)–(17) specify the quantitative metrics that operationalize the temporal topological network. Figure 4 illustrates the metrics for heavy-tailed financial time series. Geometric scale

L_{t}

is derived from the minimum spanning tree (MST) of a point cloud, while the similarity matrix

w_{t, s}

combines diagram-level Wasserstein distances with the geometric MST scales. High values indicate windows that are topologically and geometrically coherent, whereas low values indicate windows with greater dissimilarity in topological and geometric structure, implying regime shifts.

2.3. Clustering as a Statistical Operator on Temporal Topology

The temporal topological network provides the foundation for clustering and regime detection under the probabilistic guarantees of Theorems 5 and 6. Let

M

denote the underlying state-space manifold, and

R^{d}

the delay-embedding space induced by

Φ : M \to R^{d}

. By Theorem 5 [32],

Φ

is a

C^{r}

embedding (i.e., homeomorphism onto its image) along almost every stochastic trajectory, preserving the topological structure of the underlying state space. In particular,

Φ

preserves neighborhoods and adjacency relations:

x \in n_{M} (y) \Leftrightarrow Φ (x) \in n_{R^{d}} (Φ (y))

(18)

where

n_{M} (y)

denotes the neighborhood of

y

on the manifold and

n_{R^{d}} (Φ (y))

denotes the corresponding neighborhood in the embedding space.

Let

V = \{v_{1}, v_{2}, \dots v_{T}\}

denote time-indexed nodes associated with topological summaries

H_{t}

, each computed on a window

W_{t}

. A regime is defined as a temporally contiguous subset

C_{k} \subset V

for which the reconstructed local geometry exhibits consistent adjacency relations along the observed stochastic trajectory. Nodes

v_{t} \in C_{k}

are mutually adjacent, in the sense of strong topological similarity, while nodes in different regimes show weaker adjacency. This is illustrated in Figure 5, which show how spectral embedding of the k-NN similarity graph and the thresholded temporal connectivity reveal underlying regimes and transitions in the temporal geometric structure of a heavy-tailed process.

The coherence within a regime can be quantified via adjacency-weighted averages:

C o h e r e n c e (C_{k}) = \frac{1}{|C_{k}| (|C_{k}| - 1)} \sum_{\begin{matrix} t, s \in C_{k}, \\ t \neq s \end{matrix}} w_{t, s}

(19)

where

w_{t, s} \in [0, 1]

encodes combined topological and geometric similarity (e.g., Wasserstein distances between persistence diagrams and MST-based scaling). Similarly, the separation between the two regimes

C_{k}

and

C_{l}

is as follows:

S e p a r a t i o n (C_{k}, C_{l}) = 1 - \frac{1}{|C_{k}| |C_{l}|} \sum_{t \in C_{k}, s \in C_{l}} w_{t, s}

(20)

Therefore, clustering acts as a statistical operator on the temporal topological network, partitioning the trajectory into subsets that maximize internal adjacency and coherence while maintaining separation between regimes. Crucially, this construction relies solely on the preservation of neighborhood structure, adjacency relations, and topological invariants guaranteed by Theorem 5 [32].

Within this framework, two complementary clustering approaches are admissible: (i) geometry- and topology-aware clustering of similarities

w_{t, s}

, using network-based methods such as spectral clustering [39,40] or weighted edge clustering [41]; and (ii) node-level clustering of functional topological summaries

H_{t}

, treating distances between summaries as statistical dissimilarities between stable topological states and operating solely on the topological summaries, including vector-based methods (K-means) and clustering using Wasserstein (or bottleneck) distances [42,43,44], interpreted as intrinsic dissimilarities between node-associated persistence diagrams.

For sequences of persistence diagrams, Wasserstein or bottleneck distances provide a principled measure of topological deformation between reconstructed neighborhoods, forming natural similarity metrics for clustering and regime detection. These distances are defined intrinsically on the space of persistence diagrams, independent of the ambient state-space metric, and thus compare node-level functional topological summaries. For homology dimension

k

, the

p

-Wasserstein distance between persistence diagrams

D_{t}

and

D_{s}

is

W_{p}^{k} (D_{t}, D_{s}) = {({i n f}_{γ : D_{t} \to D_{s}} \sum_{x \in D_{t}} {‖x - γ (x)‖}_{\infty}^{p})}^{\frac{1}{p}}, p \geq 1

(21)

with

p = \infty

corresponding to the bottleneck distance. To enable clustering in a linear space, persistence diagrams may be embedded into a Hilbert-space via persistence landscapes. Following Bubenik [34], persistence landscapes admit a natural

L^{2}

metric

d_{t, s}^{k} = {‖λ^{k} (D_{t}) - λ^{k} (D_{s})‖}_{2}

(22)

which quantifies the dissimilarity of topological features in homology dimension

k

. For multiples homology dimensions

k = 0, 1, \dots, K

, the combined

L^{2}

distance is defined as follows:

d_{t, s} = \sqrt{\sum_{k = 0}^{K} {‖λ^{k} (D_{t}) - λ^{k} (D_{s})‖}_{2}^{2}}

(23)

Each node

v_{t}

encodes the topological state at time

t

as a vector of Hilbert-space functionals:

v_{t} = (H_{0} (t), H_{1} (t), ϕ_{t}^{w d} (t))

(24)

summarizing the local topology of the reconstructed trajectory.

Thus, clustering in the Hilbert space of topological summaries naturally groups nodes with similar topological neighborhoods, while regime transitions correspond to sustained increases in distances

d_{t, s}

, reflecting topological deformation rather than geometric displacement. In this setting, clustering outcomes are directly interpretable in terms of stable adjacency relations and topological change, consistent with the probabilistic guarantees of Theorem 5 [32].

If the observable consequences of local

C^{r}

diffeomorphism hold for a sufficiently large proportion of embeddings, intrinsic and embedded distances satisfy

d_{M} (x, y) \approx d_{R^{d}} (Φ (x), Φ (y))

(25)

up to smooth, bounded distortion. Consequently, local neighborhood geometry and tangent structure are preserved to first order, elevating the temporal topological network from a pure topological graph to a geometry-aware approximation of the manifold’s intrinsic temporal organization, thereby expanding the class of statistical operators that can legitimately act on it.

In this setting, similarity weights

w_{t, s}

admit metric interpretability as approximations of intrinsic distances between reconstructed neighborhoods. This enables geometry-aware clustering techniques, including diffusion-based methods, spectral clustering with meaningful kernels, tangent-space approximations, and kernel-based methods, complementing the topology-based approaches.

2.4. Temporal Scales of Topological Organization

Financial time series exhibit persistent temporal organization at multiple scales. Short-range dependence reflects significant correlations at small lags ℓ, whereas long-range dependence decays slowly according to a power law [12,45]:

ρ (l) ~ l^{- γ}, 0 < γ < 1

implying long memory [46,47] and the absence of a characteristic correlation horizon [47].

Under conditions of homeomorphic reconstruction [32], financial time series multiscale temporal dependence is encoded probabilistically in the evolving stochastic trajectory of node-level topological functionals

H_{t}

, pairwise distances

d_{t, s}

, similarity measures

e_{t, s}

, and geometry-aware edge weights

w_{t, s}

, which integrates topological similarity (via persistence distances) with geometric coherence (via MST lengths

L_{t}

).

Figure 6 illustrates this framework for a stochastically forced, heavy-tailed process with regime shifts. MST lengths

L_{t}

summarizes local geometric evolution, while similarities

w_{t, s}

combine diagram-based Wasserstein distances and geometric rescaling. High similarity indicates regime persistence, sharp drops signal topological deformation or geometric rescaling, and sustained plateaus reveal mesoscopic temporal organization.

Moreover, the topological and geometrical quantities associated with nodes, edges, and weights encode distinct scales of temporal organization, thereby defining the structure of a multiscale topological temporal network:

Local scale: Short-range dependence appears as fine-scale fluctuations in $H_{t}$ , and low-lag autocorrelation of deviation signals as $ϕ_{t, s}^{w d}$ for small $|t - s|$ . Although computed on short windows, $H_{t}$ aggregates topological information across landscapes, providing mesoscopic resolution at a local time scale.
Mesoscopic scale: Intermediate- and long-range dependence governing regime persistence appears as a sustained elevation in $H_{t}$ , persistent bursts in $ϕ_{t}^{w d}$ , and densely connected sub-graphs in $w_{t, s}$ linking windows with similar reconstructed geometry.
Global scale is captured by network-level statistics derived from $w_{t, s}$ , including cluster coherence, cross-regime separation, tail dependence, and graph-based metrics.

This framework provides a principled, probabilistic basis for regime detection at different scales of the temporal organization in stochastic, heavy-tailed financial time series. By explicitly linking topological and geometric quantities to distinct temporal scales, it guides the selection of appropriate statistics and null models for inference, enabling rigorous tests of whether observed temporal structure reflects genuine dynamics rather than noise. In this way, the multiscale topological temporal network becomes a falsifiable, heuristic construct grounded in the embedding guarantees of Stark et al. [32], suitable for interpretable machine learning applications.

2.5. Topological Null-Models

As highlighted in Section 2.1, delay-coordinate reconstruction guarantees provided by Theorems 5 and 6 of Stark et al. [32] are inherently probabilistic, holding almost surely or window-wise along the observed trajectory. This implies that reconstructed topological features from stochastically forced systems should be interpreted as probabilistic manifestations of structure. Thus, they cannot be distinguished from artifacts induced by random fluctuations, finite sampling, or noise without a principled benchmark since neither the true system state nor the underlying map is observable.

To assess topological randomness rigorously, explicit topological Mull models are required. In this context, Bobrowski and Skraba [36] established that persistence diagrams arising from random point clouds converge to a universal distribution, independent of sampling distribution, geometry, or curvature. This result provides a principled benchmark against which observed topological features can be tested.

Let

{d g m}_{k}

denote a persistence diagram in homological dimension k, decomposed conceptually as follows:

{d g m}_{k} = {d g m}_{k}^{S} \cup {d g m}_{k}^{N}

(26)

where

{d g m}_{k}^{S}

captures latent topological features inherent to the data and

{d g m}_{k}^{N}

consists of cycles arising from random point-cloud geometry. For a persistence point p = (birthday (p), death (p)), the persistence ratio of

p \in {d g m}_{k}

is defined as follows:

π (p) = \frac{d e a t h (p)}{b i r t h (p)}

(27)

Asymptotically, persistence ratios arising from noise grow polylogarithmically with the sample size n,

π (p) \approx ο ({(\log n)}^{\frac{1}{k}})

(28)

whereas ratios associated with genuine geometric features scale polynomially,

π (p) \approx Θ (n^{\frac{1}{d}})

(29)

with d the intrinsic dimension of the underlying manifold. This separation provides a theoretical basis for distinguishing noise-induced from structure-induced topological features.

To obtain a universal limiting law, a double-logarithmic transformation is applied:

l (p) = A \log \log (π (p)) + B

(30)

Here, A depends on the filtration and homology dimension (e.g., for Čech complexes, A = 1, for Rips complexes, A = 1/2), and B is a location constant chosen to center the transformed data. Under the geometric null, the ℓ-values of noise-induced cycles converge to a left-skewed Gumbel distribution, providing a universal reference for a rigorous per-cycle hypothesis testing of structure.

This geometric null model is point-wise and spatial: it characterizes randomness in point-cloud geometry within individual windows but is agnostic to temporal ordering and therefore cannot assess whether topological features are organized in time. Such tests are well-suited to classification tasks in domains where objects of interest are spatially indexed, including cellular organization, biology, morphology, or cancer genomics. However, financial time series are high-dimensional and noisy, so that meaningful topological structure does not manifest primarily at the level of isolated persistence diagrams but in the temporal organization, dependence, and persistence of topological features along the observed trajectory. In this setting, pointwise geometric tests are insufficient, motivating the introduction of temporal topological null models to assess for genuine temporal structure.

2.6. Temporal Topological Null-Models

To extend topological null-models from individual diagrams to time-evolving features, we consider sequences of diagrams

{\{D_{t}\}}_{t = 1}^{T}

together with diagram-level functional

H_{t} = h (D_{t})

(e.g.,

L^{2}

-norms of persistence landscapes, persistence weighted distances, etc.).

The temporal topological null model is defined by a random permutation

π \in S_{T}

of the functional sequence:

{\{H_{t}^{τ}\}}_{t = 1}^{T} = {\{H_{π (t)}\}}_{t = 1}^{T} π \in S_{T}

(31)

where

H_{t}^{π}

denotes the permuted sequence induced by

π

. By construction, this null model preserves the full geometry of each persistence diagram

D_{t}

, the diagram-level functional values

H_{t} = h (D_{t})

derived from each diagram, and the exact empirical marginal distribution of all topological and geometric features. At the same time, it destroys all temporal organization, including the temporal order of the functionals, serial correlations and regime structures, temporal memory encoded in the ordering of diagrams, and cross-cycle dependencies among functionals.

Significance is evaluated empirically via

N_{s u r r}

permuted sequences:

p_{e m p} = \frac{1 + \sum_{j = 1}^{N_{s u r r}} 1 (D_{j} \geq D_{o b s})}{1 + N_{s u r r}}

(32)

where

D_{j}

denotes the statistic computed on the j-th permuted sequence and

1 (\cdot)

is the indicator function. This framework applies to any arbitrary statistic

D (\{H_{t}\})

and requires no assumptions on Gaussianity, stationarity, or finite moments, making it suitable for heavy-tailed, heteroskedastic, and non-stationary financial series. Implementation details and executable pseudocode are provided in Algorithm S7 (Supplementary Materials).

Figure 7 illustrates how the temporal topological model provides an appropriate benchmark for randomness in time-indexed topological features. For a Gaussian i.i.d., the z-scored deviation functional

ϕ_{t}^{w d}

exhibits no temporal organization; autocorrelations at all lags remain within the null envelope, consistent with the absence of memory or regime structure. In contrast, for a regime-shifting, fat-tailed process, autocorrelations of both signed and absolute deviations lie well outside the null bounds across a wide range of lags, revealing persistent temporal dependence and regime structure that cannot be attributed to marginal geometric variability alone.

By removing dependence at all temporal scales (local, mesoscopic, global) while preserving within-window geometry, the temporal null provides a principled benchmark for detecting meaningful multiscale temporal organization. However, statistical coherence alone does not guarantee economic interpretability or relevance. We therefore introduce a cluster label null to test whether regime assignments correspond to persistent market regimes rather than arbitrary labeling, with implementation details and pseudocode provided in Algorithm S12 (Supplementary Materials).

Together, this framework defines three nested layers of inference:

(1): Geometric structure of individual windows;
(2): Temporal organization of evolving topological features;
(3): Economic significance of clusters.

This hierarchy ensures that candidate regimes are both statistically robust and economically interpretable, linking the conceptual distinction between local, mesoscopic, and global temporal scales to actionable market-relevant structures. This approach complements existing inference methods in topological data analysis. While Robinson and Turner [48] use permutation-based null hypothesis tests on pair-wise distances between persistence, our approach targets temporal dependences and regime structures within a single evolving sequence of persistence diagrams.

3. Materials and Methods

We conduct a numerical study, formulated as a controlled machine-learning experiment on a synthetic dataset, to evaluate whether delay-embedded topological functional features recover genuine regime structure in stochastic, fat-tailed, and non-stationary financial time series with known ground-truth regimes. Specifically, using this controlled testbed, we validate whether the delay embeddings exhibit the observable consequences of probabilistic homeomorphic or diffeomorphic reconstruction, as well as the validity of clustering based on topological functionals.

Apparent structure in clustered topological summaries can arise from three distinct and independent sources of randomness: (i) geometric randomness within windows, arising from finite-sample point-cloud effects; (ii) temporal randomness across windows, arising from loss of adjacency and neighborhood structure; and (iii) regime assignment randomness, reflecting artifacts of the clustering procedure. To distinguish genuine structure and dynamical regimes from artifacts of finite sampling, reconstruction, or clustering, we explicitly test all three through the hierarchy of null hypotheses:

(1): Geometric null: Topological features arise from random point-cloud geometry within windows.
(2): Temporal null: Observed structure is indistinguishable from random temporal ordering of features.
(3): Clustering label null: Regime assignments arise from random labeling rather than persistent structure.

Rejection of both the temporal null hypothesis and the cluster-label null hypothesis is required to establish that clustering reflects genuine market regime structure. This hierarchy grounds machine-learning inference in interpretable changes in reconstructed geometry.

3.1. Synthetic Data Generation

Synthetic price paths simulate alternating bull and bear regimes of deterministic duration. Prices follow a regime-dependent geometric Brownian motion (GBM) with fat-tailed innovations and jumps, aggregated into open–high–low–close (OHLC) bars preserving regime structure. The synthetic price process is instantiated once under a fixed random seed to ensure reproducibility. Full executable pseudocode is provided in Algorithm S1 (Supplementary Materials).

3.2. Tests of Local Homeomorphic Delay-Embeddings

Log-returns are partitioned into overlapping windows and embedded using delay-coordinates. Embeddings are assessed by empirical diagnostics designed to test the observable consequences of local homeomorphic embedding guarantees consistent with Theorem 5 of Stark et al. [32]. Specifically, we assess the following:

Injectivity: Stability of persistence diagrams, landscapes, and Fill-factor criteria.
Noise regularity: Smooth variation in delay vectors, autocorrelation, and Mutual Information.
Finite-dimensional forcing: Inter-window autocorrelation differences.
Lipschitz stability of topological features: Bottleneck and Hilbert-space distances.

Appendix A and Algorithm A1 provide full methodological and implementation details of the diagnostics. Algorithm S2 in Supplementary Materials offers an executable-style pseudocode.

3.3. Jacobian-Based Diagnostics of Local Diffeomorphic Embedding

We assess the window-wise, local diffeomorphic validity of the reconstructed delay-embeddings using Jacobian-based diagnostics derived from the observable consequences of Theorem 6 of Stark et al. [32]. Specifically, we evaluate the following:

Local invertibility: Non-singularity of the embedding Jacobians.
Bounded distortion: Local bi-Lipschitz behavior quantified via Jacobian condition-number bounds.
Smooth variation: Temporal stability of differential quantities across windows, ensuring that the local linearization does not fluctuate erratically across the window.

Windows satisfying all three criteria are considered locally invertible (hence locally one-to-one), geometrically non-degenerate, and differentially stable, consistent with

C^{r}

diffeomorphic reconstruction. Full implementation details are provided in Appendix B and Algorithm A2. Algorithm S3 in Supplementary Materials offers an executable-style pseudocode.

3.4. Computation of Topological Functionals

Topological features are extracted from OHLC price representations, using two filtrations:

(1): Vietoris–Rips (VR), yields $H_{0}$ and $H_{1}$ persistence; computed using the giotto-tda library version 0.6.2 in Python version 3.10.15;
(2): One-Pass K-Clusters (1PKF) [49]: A graph-based filtration, which by construction yields $H_{0}$ persistence only.

VR captures fine-scale geometric deformation of the reconstructed trajectory, while the 1PFK emphasizes the emergence and persistence of statistically meaningful clusters by suppressing small or noisy components. Consistency across both pipelines supports robustness to filtration choice.

Persistence diagrams are mapped to Hilbert-space functionals, with local fluctuations summarized by rolling deviation functionals capturing bursts of topological activity, heteroskedasticity, and departures from i.i.d. assumptions. Algorithm S4 in Supplementary Materials provides an executable-style pseudocode of 1PFK-filtration.

3.5. Unsupervised Learning

Heavy-tailed K-means [6] is applied to time-indexed feature vectors. Centroids minimize an α-powered dispersion functional relative to an isotropic α-stable reference distribution, ensuring stable centroid updates under heavy tails (Gaussian mean for α = 2; Cauchy for α = 1). To ensure deterministic and reproducible clustering, centroids are initialized using evenly spaced feature percentiles rather than random seeding. Tail-risk-aware reweighting assigns more pull to extreme points in centroids updates, which emphasizes time windows where topological features are most extreme. Full executable pseudocode of heavy-tailed K-means is provided in Algorithm S5 (Supplementary Materials).

3.6. Geometric Null Hypothesis

To rigorously assess topological structure, we apply Bobrowski and Skraba’s [36] point-wise geometric null test to the persistence diagrams obtained from both VR and 1PFK-filtrations. For each persistence point, the persistence ratio is transformed into a

l

-value and compared against the universal left-skewed Gumbel distribution to compute a p-value. Points with p-values below a significant threshold (with and without Bonferroni correction) are classified as signal, forming signal diagrams. To reduce computational cost, a 10% random subset of diagrams may be used. The fraction of significant points and their mean p-value are used to classify the dataset as either consistent with random topology or exhibiting genuine topological structure. Executable-style pseudocode is provided in Algorithm S6 (Supplementary Materials).

3.7. Temporal Topological Null Hypothesis Test

We test whether the observed clustering reflects statistically significant temporal topological structure rather than random temporal fluctuations. Let

A_{i j}

denote the observed adjacency-level statistics computed from the time-indexed topological features:

A_{i j} = \{\begin{matrix} ρ_{i j} s h o r t - r a n g e d e p e n d e n c i e s (a u t o c o r r e l a t i o n) \\ κ_{i j} i n t e r c l u s t e r s e p a r a t i o n a n d i n t r a - c l u s t e r c o h e r e n c e \\ λ_{L, i j} c r o s s - c l u s t e r u p p e r a n d l o w e r t a i l d e p e n d e n c e \end{matrix}

For each metric, a temporal null distribution is generated by applying random permutations

π

to the time indices of the feature sequence:

{\{A_{i j}^{π}\}}_{π = 1}^{N_{s u r}}

where

N_{s u r}

is the number of surrogate sequences. This temporal null preserves the marginal geometric structure while destroying temporal ordering, serial correlations, and regime-dependent structure.

The clustering algorithm

C (\cdot)

is applied to each surrogate realization, yielding surrogate datasets

{\{X^{π}\}}_{π}^{N_{s u r r}}

. Significance is assessed by comparing the observed adjacency metric of observed clusters

A_{i j} (C (X))

to its null distribution

A_{i j} (C (X^{π}))

, yielding empirical p-values:

p_{e m p} = \frac{1 + \sum_{j = 1}^{N_{s u r}} 1 (A_{i j}^{π} (C (X^{π})) \geq A_{i j} (C (X)))}{1 + N_{s u r}}

(33)

Metrics used for

κ_{i j}

include the following:

Ground-truth clustering metrics: Total Accuracy (TA), Bear-Recall (BR), and Adjusted Rand Index (ARI) [50,51].
Unsupervised cluster separation and coherence metrics: Silhouette [52], Calinski-Harabasz (CH) [53], Davies Bouldin Index (DB) [54], Maximum Mean Discrepancy (MMD) [55] measure intra-cluster coherence and inter-cluster separation; computed using Python’s scikit-learn library version 1.3.2 [56].

The temporal null construction and validation are detailed in Algorithm S7, while metric-specific implementations are provided in Algorithm S8 (autocorrelation tests

ρ_{i j}

), Algorithms S9 and S10 (clustering metrics tests

κ_{i j}

), and Algorithm S11 (tail dependence tests

λ_{L, i j}

). Monte Carlo experiments confirm correct Type 1 error control under i.i.d. heavy-tailed nulls and non-trivial power against structured temporal alternatives.

3.8. Topological Cluster-Label Null Hypothesis Tests

To test whether clusters encode genuine topological and economically meaningful structure, we construct a label-permutation null model. Let

C_{t}

denote the cluster label associated with window-level features

w_{t}

. Under the null, labels are randomly permuted:

(w_{t}, C_{t}) \to (w_{t}, π (C_{t})), π \in S_{k}

(34)

This procedure preserves all window-level features

w_{t}

while destroying any information contained in the original cluster assignments. Test statistics are computed on both

C_{t}

and

π (C_{t}) .

Let

T_{o b s}

denote the observed statistic computed on

C_{t}

and

{\{T_{j}^{π}\}}_{π = 1}^{N_{p e r m}}

the statistic computed on

N_{p e r m}

permuted labels sequences. The empirical p-value is given by the following:

p = \frac{1 + \sum_{j = 1}^{N_{p e r m}} 1 (T_{j}^{π} > T_{o b s})}{1 + N_{p e r m}}

(35)

Statistics T include Kruskal–Wallis, Mann–Whitney, Levene, Kolmogorov–Smirnov, Anderson–Darling, and a distribution-free permutation test. Two complementary null hypotheses are evaluated:

(1): Feature-Space or Topological Separability: Rejection of the null indicates that clusters arise from genuine changes in Hilbert-space-embedded topological functionals.
(2): Economic Distinguishability: Rejection of the null indicates that clusters correspond to distinct market regimes and tail-risk patterns, rather than arbitrary temporal partitions.

The cluster-label null construction and validation are detailed in Algorithm S12. Executable-style pseudocode of the feature-space and economics distinguishability test is provided in Algorithm S13 in Supplementary Materials. Monte Carlo experiments using Algorithm S12 confirm correct Type 1 error control under i.i.d. heavy-tailed nulls and non-trivial power against structured temporal alternatives.

3.9. Left-Tail Power-Law Fitting and Distributional Analysis of Cluster Tails

For each cluster, left-tail behavior is analyzed using power-law fitting via the Clauset–Shalizi–Newman [57] framework, adapted to include the Anderson–Darling test for increased sensitivity to tail behavior. For each cluster, we estimate the left-tail scaling exponent

α

and the tail cut-off

x_{m i n}

, goodness-of-fit p-values using Kolmogorov–Smirnov and Anderson–Darling tests, and likelihood-ratio tests comparing power-law fits to alternative distributions (log-normal, exponential). Executable-style pseudocode of power-law fitting tests is provided in Algorithm S14 (Supplementary Materials).

4. Results of the Numerical Studies

Synthetic price trajectories with known regime shifts and heavy-tailed dynamics were generated to evaluate homeomorphic and diffeomorphic delay-coordinate reconstruction. We also assess whether Hilbert-space summaries encode geometric or temporal dependence, and whether topology-based clustering genuinely recovers the underlying regimes compared to temporal and cluster-label nulls.

4.1. Ground-Truth Synthetic Price Trajectories

Ground-truth bull and bear regimes were simulated with heavy-tailed dynamics and jumps, producing synthetic price trajectories at 5 min resolution over 2 years (39,212 observations). The simulation parameters are summarized in Table 1. Figure 8 shows the resulting price trajectory and log-returns corresponding for a single realization generated under a fixed random seed to ensure reproducibility. Periods corresponding to the ground-truth bear regime are highlighted in light blue and marked with the letter “B”.

The provided dataset for reproducibility includes latent

S_{t}

price, time stamps, OHLC prices, log-returns, and ground-truth bull/bear regime labels for each timestamp.

4.2. Local Homeomorphism Tests for Delay Embeddings

We evaluated whether the reconstructed delay-coordinate maps satisfy the observable consequences of a homeomorphic embedding of Theorem 5 of Stark et al. [32]. Since neither the true state space nor the embedding map is observable, we test necessary local conditions across 39,291 sliding windows using four diagnostics: (i) injectivity across embedding dimensions, (ii) temporal coherence and noise stability, (iii) local stationarity of the stochastic flow, and (iv) Lipschitz stability of persistence-based summaries.

These diagnostics are evaluated across a range of candidate embedding parameters (embedding dimension

d

, delay

τ

). These parameters are not treated as tunable hyperparameters; rather, they are selected based on reconstruction diagnostics. Evaluating them over a range of candidate values serves a dual purpose: (i) identifying admissible parameter regions consistent with valid delay reconstruction, and (ii) verifying that the observed topological and statistical properties are not confined to a single calibration.

Accordingly, the results provide an implicit sensitivity analysis, distinguishing parameter regions where the observable consequences of embedding theory are satisfied from those where reconstruction degrades.

4.2.1. Local Injectivity

Following Appendix A.3, injectivity was assessed via stabilization of topological summaries across candidate embedding dimensions

d = 3, \dots, 10

, using Wasserstein distances between persistence diagrams,

L^{2}

distances between persistence landscapes, and the Fill-factor across rolling windows and homology degrees

k = 0,1

(Table 2 and Table 3). Wasserstein and

L^{2}

distances remain uniformly small

(O (10^{- 3}))

, well below the tolerance level

ε = 0.1

, with no systematic drift as

d

increases. No material changes in

H_{0}

or

H_{1}

topology occur as additional coordinates are added. Importantly, the absence of systematic drift across embedding dimensions

d = 3, \dots, 10

indicates that the reconstructed topology is stable across a range of admissible dimensions, rather than being specific to a single choice.

Fill-factors for

d = 3

exhibit a mean normalized volume

\bar{V} = e x p (- 5.3055) \approx 0.0055

(std. ~0.0041), indicating non-degenerate local neighborhood volumes and absence of geometric collapsing. These results, illustrated in Figure 9, support injective reconstruction and preservation of neighborhood structure.

4.2.2. Noise Regularity and Smoothness

Temporal smoothness was assessed via window-to-window variation

‖X_{t} - X_{t - 1}‖

for the candidate delays

τ = 1, \dots 10

. Mean variations remain near 3.6 × 10⁻³ with stable upper quantiles across all delays (Table 4), indicating that the delay-embeddings evolve smoothly, consistent with Lipschitz-continuous behavior.

To operationalize

‖X_{t} - X_{t - 1}‖ < δ

, the smoothness scale

δ

was determined empirically using both autocorrelation decay and Mutual Information estimates (histogram-based and Kraskov estimators). Autocorrelation results (Figure 10a) are near zero for all

τ = 1 - 10

, with no clear

1 / e

monotone decay, no clear first minimum, or a meaningful first root, indicating

τ = 1

, is the minimal non-redundant delay. Kraskov Mutual Information (Figure 10b) remains tightly between −0.2407 and −0.2486 across delays, with the normalized Mutual Information fluctuating closely around 1.0.

The stability of smoothness diagnostics across delays

τ = 1, \dots, 10

, together with the absence of a pronounced minimum in Mutual Information, indicates that the embedding is not sensitive to variations in delays within this range.

4.2.3. Finite Dimensional Forcing, Low Dimensional Flow

Low-dimensional flow and local stationarity were assessed by monitoring the short-lag autocorrelation function (ACF) across consecutive sliding windows. A window was classified as locally stationary if the change in ACF satisfied

|Δ A C F (l)| \leq δ_{A C F} = 0.15, l \in [1, 5]

. Approximately 94–95% of the windows satisfy this criterion (Table 5), indicating that the stochastic dynamics are locally stable and effectively low-dimensional over short intervals. These results support finite-dimensional forcing and smooth local evolution of delay coordinates along typical trajectories.

4.2.4. Lipschitz Stability of Topological Summaries

Persistence-based summaries were monitored using rolling bottleneck distances

W_{\infty} (D_{t}, D_{t - 1})

and

L^{2}

distances in Hilbert space. Following Cohen–Steiner et al. [58], rolling bottleneck distances

W_{\infty} (D_{t}, D_{t - 1})

for

H_{0}

and

H_{1}

remain uniformly small across windows (means:

4.45 \times 10^{- 4}

,

1.22 \times 10^{- 4}

and maxima:

7.8 \times 10^{- 3}

,

1 \times 10^{- 3}

) with occasional spikes coinciding with regime transitions. These excursions indicate bounded perturbations rather than structural instability (Figure 11a).

L^{2}

distances between persistence landscapes exhibit similar pattern of results while being computationally more economical, solving a linear problem rather than an optimal transport problem (Figure 11b).

Similarly,

L^{2}

distances between persistence landscapes

λ (D_{t})

and

λ (D_{t - 1})

remain low (mean:

7.32 \times 10^{- 4}

,

1.22 \times 10^{- 4}

and maxima:

2.98 \times 10^{- 2}

,

1.81 \times 10^{- 3}

), indicating smooth temporal evolution at the functional level (Figure 11b). Together, these results provide empirical evidence of Lipschitz continuous evolution of persistence-based summaries under time-local perturbations of the delay embeddings.

4.2.5. Summary of Local Validity

Overall, injectivity, noise regularity, finite-dimensional forcing, and Lipschitz stability diagnostics support the local homeomorphic validity of the delay embeddings. Neighborhood structure, adjacency relations, topological invariants, and smooth local evolution are preserved over finite windows, consistent with Theorem 5 [32]. This provides empirical evidence consistent with conditions under which persistence-based clustering and regime identification are expected to be meaningful under jumps, heavy tails, and regime shifts.

4.3. Jacobian-Based Test of Local Diffeomorphism

We next evaluate whether the reconstructed delay-coordinate maps satisfy the observable differential consequences of diffeomorphic embeddings implied by Theorem 6 of Stark et al. [32], for the synthetic stochastic, heavy-tailed, and regime-shifting price dynamics. Since the true dynamics are unobservable, testing is conducted through window-wise diagnostics evaluating (i) local invertibility, (ii) bounded geometric distortion, and (iii) smooth temporal variation in the Jacobian field associated with the reconstructed dynamics. Table 6 summarizes the results of the probabilistic, window-wise diagnostics.

Local invertibility, assessed via the minimum singular value

σ_{m i n} (J_{t}) > ε_{J}

, is satisfied essentially by all windows, with a pass rate of 1.000 for the probabilistic threshold

p > η

(

η = 0.90)

. Lipschitz bounded distortion, quantified by the Jacobian condition number

κ (J_{t}) < K_{m a x} = 500

, holds for 99.7% of windows. Temporal

C^{1}

smoothness, assessed through the temporal variation in the minimum singular value, is satisfied by all windows. Joint enforcement of all three conditions yields a pass rate of 99.7%, with violations of invertibility or bounded distortion well below 1%, indicating that the probabilistic thresholds required by Theorem 6 are satisfied almost surely.

Overall, the Jacobian-based diagnostics provide numerical evidence that the delay-coordinate behaves as a locally diffeomorphic family: it preserves local invertibility, exhibits uniformly bounded local geometric distortion, and varies smoothly over time. Consequently, local metric structure—distances, angles, and neighborhood geometry—is not excessively distorted, supporting the applicability of metric-sensitive machine learning methods, including distance-based clustering and kernel methods.

4.4. Computation of Topological Functionals

Persistence diagrams were computed for all rolling windows using both the Vietoris–Rips (VR) and One-Pass K-Cluster Filtration (1PKF) pipelines, yielding 39,291 persistence diagrams per filtration. Functional summaries in

L^{2} (R)

were constructed for VR (

H_{0}

,

H_{1}

) and 1PFK (

H_{0}

). To quantify temporal variation in topological structure, the rolling deviation functional

ϕ_{t}^{w d (k)}

was computed on

L_{2} (H_{1})

time series, instantiated at k = 1. Figure 12 reports the resulting trajectories of topological functional summaries for the VR-filtration.

4.5. Geometric Null Hypothesis Test

Geometric significance was assessed using Bobrowski and Skraba’s [36] topological null model on the full ensemble of persistence diagrams (39,291 diagrams) and a 10% random subsample (4000 diagrams). This test evaluates whether individual birth–death pairs arise from genuine geometric structure or are statistically indistinguishable from noise, using the universal distribution of the persistence ratio

l

as a benchmark.

For the VR-filtration, mean p-values across all settings are well above the rejection threshold

p \leq α = 0.1

, with average values of approximately 0.67 for

H_{0}

and 0.56 for

H_{1}

(Table 7). After Bonferroni correction, no persistence points are identified as significant in either homology dimension. Uncorrected tests yield at most ~0.1% signal points for

H_{1}

. Results for

H_{0}

persistence diagrams from 1PFK-filtration are qualitatively identical, with a mean p-value ~0.57, well above the rejection threshold

p \leq α = 0.1

. Figure 13 shows that the empirical distribution of

l

-values closely match the left-Gumbel distribution predicted under the null. Overall, these findings indicate that individual persistence points are largely indistinguishable from geometric noise.

4.6. Temporal Null: Global Autocorrelation Test

Temporal organization was tested via autocorrelation tests applied to the rolling deviation functional

ϕ_{t}^{w d (k)}

, using

N_{s u r r} = 500

surrogate realizations generated under the temporal null model. Tests were conducted for both VR-filtration using

L^{2} (H_{1})

and 1PFK-filtration using

L^{2} (H_{0})

. For both filtrations, absolute deviations

{a ϕ}_{t}^{w d (k)}

are consistent with the null

(p = 1.0)

, while signed deviation functional

{s ϕ}_{t}^{w d (k)}

shows a small but statistically significant mean bias

(p = 0.002)

, large

T_{m a x}

and

T_{s q}

, and the large number of FDR-significant lags (Table 8 and Table 9).

This combination of results indicates that, although topological deviations are modest in size, their temporal ordering is highly non-random. In particular, the persistence of significant autocorrelation across extended lag ranges reveals coherent temporal organization in the evolution of topological features. These findings provide strong evidence for structures, regime-dependent topological dynamics that cannot be explained by random temporal reshuffling alone.

4.7. Unsupervised Learning: Heavy-Tailed K-Means Clustering

Heavy-tailed K-means was applied to time series of topological functionals to assess whether regime structure can be recovered in a fully unsupervised manner. For the VR-filtration, clustering was applied to

L^{2} (H_{0}), L^{2} (H_{1})

and the rolling deviation functional

ϕ_{t}^{w d (k)}

. For the 1PFK-filtration, clustering was applied to

L^{2} (H_{0})

. Figure 14 and Figure 15 display the synthetic price trajectory with ground truth regimes, together with the resulting cluster assignments obtained from heavy-tailed K-means. Despite operating solely on topological summaries, the clustering recovers a clear segmentation of the time series that aligns closely with the underlying regime structure. In particular, Cluster 1 corresponds predominantly to the ground-truth bear-market periods, while Cluster 0 aligns with the ground-truth bull-market regimes.

Descriptive statistics of the clusters (Table 10 and Table 11) show clear regime differentiation. For both filtrations, Cluster 1—aligned with bear regime—exhibits higher volatility and substantially higher kurtosis, while Cluster 0 displays near-Gaussian tails. These characteristics are consistent with the known construction of the synthetic data, which embeds regime-dependent changes in tail thickness and volatility.

4.8. Temporal Null Hypothesis Tests

We evaluate the statistical significance of the extracted clusters using three temporal null tests: (i) ground-truth alignment, (ii) unsupervised cluster-quality measures independent of labels, and (iii) tail dependence structure.

4.8.1. Ground-Truth Alignment

Cluster assignments were mapped to price points and compared with known ground-truth regimes using Total Accuracy (TA), Bear Recall (BR), and the Adjusted Rand Index (ARI). For both VR- and 1PFK-filtrations, all metrics significantly exceed their null expectations (p = 0.000, Table 12). In particular, ARI values are far above the null mean

(~ 0.001)

, indicating strong, non-random alignment between topological clusters and true regime labels. Overall, these results indicate that the unsupervised, topology-based clustering reliably recovers the latent regime structure of the synthetic process, well beyond what is expected under temporal randomization.

4.8.2. Unsupervised Clustering Metrics

Independent of ground truth, cluster quality was assessed using Silhouette (S), Calinski–Harabasz (CH), Davies–Bouldin (DB), and Maximum Mean Discrepancy (MMD). Inter-cluster separation metrics (S, CH, DB, between-cluster MMD) are all highly significant relative to the temporal null (p = 0.000, Table 13), while within-cluster MMD values are small and statistically indistinguishable from the null. Together, these results indicate strong regime separation combined with internally coherent cluster structure, implying that clustering reflects genuine temporal organization rather than random partitioning.

4.8.3. Upper- and Lower-Tail Dependence

Upper- and lower-tail dependence coefficients were compared against temporal null surrogate distributions. For the VR-filtration, clusters exhibit reduced lower-tail dependence relative to the null, indicating strong separation in downside risk, while upper-tail differences remain statistically indistinguishable from the null (Table 14). The 1PFK-filtration shows a qualitatively similar but weaker pattern. These results indicate that cluster separation is primarily driven by extreme left-tail behavior. Moreover, the reduced left-tail dependence relative to the null is incompatible with random temporal partitioning and therefore provides evidence that the clustering recovers genuine structure.

4.9. Cluster-Label Null Hypothesis Tests

We next assess whether the clusters are intrinsically meaningful (i.e., encode genuine regime information) rather than artifacts of the clustering or incidental temporal partitioning using a cluster-label null model. The tests focus on three complementary effects: feature-space separability, economic relevance, and empirical calibration.

4.9.1. Feature-Space Validation

Across both filtrations, non-parametric and distributional tests (Kruskal–Wallis, Mann–Whitney U, Kolmogorov–Smirnov, Anderson–Darling) strongly reject the cluster-label null (

p = 0.001,

Table 15). Thus, the clusters correspond to statistically distinct distributions of persistence-based summaries, indicating that they occupy disjoint regions in the reconstructed topological feature-space. This separation is robust across selected filtrations, persists across homology degrees, and is not driven solely by higher-order topological features, implying that the extracted clusters reflect genuine differences in the underlying stochastic dynamics rather than artifacts of label assignment or clustering geometry.

4.9.2. Economic Meaning Validation

We assessed whether the extracted clusters encode economically meaningful regimes using returns and volatility statistics against the cluster-label null. Location-based tests for returns (Kruskal–Wallis, Mann–Whitney) do not reject the null, indicating no systematic mean-return differences between clusters (Table 16).

In contrast, distribution-sensitive tests (Kolmogorov–Smirnov, Anderson–Darling) strongly reject the null

(p = 0.001)

, indicating substantial differences in distributional shape and tail behavior. Volatility tests (Levene) also reject the null for both rolling and annualized volatility, indicating the clusters correspond to distinct volatility and tail-risk regimes.

Taken together, these results show that the clustering isolates risk regimes characterized by changes in dispersion and tail behavior rather than average returns, consistent with economically meaningful regime differentiation in heavy-tailed financial systems.

4.9.3. Empirical Calibration of Cluster-Label Tests

Empirical size and power analysis confirm that feature-space and distribution-sensitive economic tests achieve the correct size

(~ 0)

and maximal power

(~ 1)

, whereas location-based tests exhibit zero power, consistent with the absence of mean effects (Table 17). These results demonstrate that the clustering is robust across selected filtrations, and encodes genuine temporal–topological and economic structure, rather than reflecting random assignment or artifacts of the clustering procedure.

4.10. Left-Tail Power-Law Fitting and Distributional Analysis of Cluster Tails

Left-tail behavior of clusters returns was analyzed using the Clauset–Shalizi–Newman framework [57]. For the VR-filtration, Cluster 1 exhibits a substantially heavier left-tail

(α = 4.58)

than Cluster 0

(α = 7.44)

indicating a much higher frequency of extreme negative returns (Table 18). Goodness-of-fit tests support a power-law model for both clusters, though likelihood-ratio tests suggest Cluster 0 marginally favors a lognormal or exponential alternative, while Cluster 1 is well described by a power law.

For the 1PFK-filtration, results are consistent: Cluster 1 again displays a much heavier left-tail

(α = 4.95)

compared to Cluster 0

(α = 12.65)

(Table 19). Poer-law fits are validated by the KS and AD tests, with likelihood-ratio comparisons favoring power-law behavior. Figure 16 illustrates the left-tail CCDFs and fitted models.

5. Discussion

In deterministic dynamical systems, Takens’ theorem guarantees that delay embeddings faithfully reconstruct the global geometry of the underlying attractor. However, for stochastic time series, such guarantees do not hold a priori: stochastic forcing, heavy tails, and regime shifts violate the assumptions required by classical embedding theory. Stark et al. [32,33] extended embedding theory to stochastically forced systems, demonstrating that delay embeddings can reconstruct the underlying state geometry almost surely along stochastic trajectories, provided the stochastic forcing enters through a finite-dimensional, smooth family of maps, the embedding dimension satisfies

d \geq 2 d i m (M) + 1

and the smoothness order satisfies

r \geq 1

.

Under these conditions, the delay-coordinate map is a

C^{r}

embedding, i.e., an injective

C^{r}

map that is a homeomorphism onto its image, yielding trajectory-wise probabilistic reconstruction of state-space geometry that preserves topological invariants, neighborhood structure, and adjacency relations. Since

C^{r}

regularity cannot be directly verified from finite observations, we assessed through numerical studies whether synthetically generated financial time series—designed to mirror stylized facts of financial markets such as heavy tails, jumps, regime shifts—exhibit the observable consequences of

C^{r}

embeddings in reconstruction space, rather than the unknown true flow.

These diagnostics indicate that delay-coordinates extracted from stochastic, heavy-tailed time series exhibit observable consequences consistent with

C^{r}

embeddings, thus neighborhood structure, adjacency relations, and topological invariants are preserved, providing justification for the use of adjacency- and topology-based clustering, as well as other machine-learning applications that rely on neighborhood structure. Jacobian-based window-wise tests further show stable local invertibility, bounded geometric distortion, and smooth temporal variation, consistent with the trajectory-wise guarantees of Stark et al. [32] (Theorem 6). This confirms that distances, angles, and local metric structures are preserved sufficiently for the use of metric-sensitive machine learning approaches, including topology-based clustering and regime detection with theoretical guarantees.

At the same time, windows that fail Theorem 5 or 6 diagnostics reveal the framework’s natural failure modes. In such cases, delay embedding does not reliably preserve topology or neighborhood relations, and local geometric distortions become significant. Consequently, clustering or other machine learning procedures relying on adjacency, distances, or topological invariants should not be applied to these windows. This provides an explicit and falsifiable criterion for determining when the methods are valid, and when extreme jumps, short-lived regimes, infinite variance innovations, or abrupt structural changes compromise reconstruction.

Topological features derived from these embeddings provide a principled summary of the evolving stochastic flow. Locally, persistence diagrams are largely indistinguishable from noise, but low-lag autocorrelations reveal short-term temporal structure that propagates to mesoscopic and network scales, so that Hilbert-space-embedded functionals display persistence temporal dependencies, cross-cluster coherence, and clear separation in extreme left-tail behavior. Thus, local temporal ordering gives rise to higher-level mesoscopic and network structure, independent of higher-order homology dimensions.

Cluster-label null tests of feature-space separation and economic distinguishability confirm that the detected clusters encode intrinsic temporal and topological structure, rather than random assignments or artifacts of a particular clustering algorithm. Specifically, the observed clusters correspond to statistically distinct distributions of persistence-based summaries, occupying disjoint regions in the reconstructed topological feature-space. Distribution-sensitive economic tests further reveal distinct tail behavior and volatility regimes, while location-based tests confirm the absence of systematic mean-return differences consistent with the stochastic nature of the price process. Clustering outcomes are robust across both VR- and 1PFK-filtrations for recovery of genuine regime structure.

Taken together, these findings indicate that topological functionals computed on locally homeomorphic delay-embeddings from stochastic, heavy-tailed time series provide valid, dynamically interpretable features for adjacency- and topology-based machine-learning applications. They capture intrinsic temporal organization, encode economically meaningful distinctions, and provide a computationally efficient framework that bridges geometric interpretability and practical machine learning.

However, this numerical analysis is conducted on a single, carefully controlled synthetic environment. While this allows precise evaluation against known ground truth, it does not establish robustness across alternative stochastic specifications. Extending the framework to multiple data-generating processes, including variations in jump intensity, tail behavior, and regime persistence, is an important direction for future work.

Moreover, the present framework does not normalize the topological summaries as scale itself carries meaningful information: periods of elevated market activity simultaneously amplify connected components and loop structures, contributing to the temporal signal. Normalization would partially remove this effect, potentially obscuring the relevant dynamics. However, distinguishing scale-driven and structure-driven components—e.g., through normalization or orthogonalization—remains an open and relevant research question and a promising direction for further research.

A related direction for future research is to assess the role of operational thresholds and parameter choices, particularly for the Jacobian-based diagnostics. While the current framework identifies admissible regions for valid reconstruction through diagnostic pass criteria, a systematic mapping of parameter regions to diagnostic outcomes would provide a stronger characterization of robustness and boundary behavior. Developing such explicit sensitivity mappings would further strengthen the interpretability and practical implementation of the methodology.

This topology-based methodology serves as a blueprint for applications to empirical financial data and extensions to supervised learning, metric-sensitive tools (e.g., kernel methods, gradient flows), and geometry-aware approaches (e.g., manifold learning, geometric deep learning) using labeled regimes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math14071098/s1, Algorithm S1. Synthetic price data generation; Algorithm S2. Tests of local homeomorphic delay-embeddings; Algorithm S3. Jacobian-based window-wise diagnostics of homeomorphic delay-embeddings; Algorithm S4. One-Pass k-Clusters filtration; Algorithm S5. Heavy-tailed K-means; Algorithm S6. Point-wise test of topological randomness; Algorithm S7. Construction and validation of the temporal topological null model; Algorithm S8. Autocorrelation test of temporal topological randomness; Algorithm S9. Ground-truth tests of temporal topological randomness; Algorithm S10. Unsupervised clustering tests of temporal topological randomness; Algorithm S11. Upper and lower tail dependence tests of temporal topological randomness; Algorithm S12. Construction and validation of the topological cluster-label null model; Algorithm S13. Cluster-label null tests of feature-space separation and economics distinguishability; Algorithm S14. Cluster-based power-law fit left-tail.

Author Contributions

Conceptualization, P.L.-F., E.R. and A.B.; methodology, P.L.-F., E.R. and A.B.; software, A.B. and E.R.; validation, P.L.-F., E.R. and A.B.; formal analysis, E.R. and A.B.; investigation, E.R. and A.B.; resources, A.B.; data curation, A.B. and E.R.; writing—original draft preparation, E.R.; writing—review and editing, P.L.-F. and A.B.; visualization, E.R. and A.B.; supervision, P.L.-F.; project administration, E.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original data presented in the study are openly available in OSF at https://osf.io/tjpvf/overview?view_only=051e8bb4910b4a8ebb6608439f8cbb94 (accessed on 18 March 2026).

Conflicts of Interest

Author Andriy Bayuk was employed by the Equity Derivatives Structuring, Quantum Leap Equity Holding GmbH. This employment had no influence in the design, analysis, or interpretation of the study, and the research was conducted independently in the author’s personal capacity. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Tests of Local Homeomorphic Delay Embeddings

This Appendix describes the theoretical foundation and the numerical construction of the numerical diagnostics used to assess whether delay-embedded point clouds derived from stochastically forced, heavy-tailed time series satisfy the observable consequences of Theorem 5 in Stark et al. [32], i.e., the delay map defines a topological embedding (homeomorphism onto its image) for almost every noise realization under generic conditions.

Because the structural assumptions of stochastic embeddings concern the (unobservable) data generation mechanism, they cannot be verified directly from finite-sample observations. Therefore, we assess the observable empirical consequences of Theorem 5 in Stark et al. [32]:

Injectivity across embedding dimensions: The embedding dimension is sufficient to avoid geometric self-intersections, so stable reconstruction of topological features remains stable as additional delay coordinates are added.
Noise regularity and smoothness of delay vectors: Small stochastic perturbations or sliding-window shifts do not produce large changes in the embedded point clouds, ensuring smooth temporal evolution along the observed trajectory.
Finite-dimensional forcing: The stochastic input acts through an effectively low-dimensional structure over finite windows, preserving the local geometry so that topological summaries accurately reflect an underlying manifold rather than high-dimensional noise.
Lipschitz stability of topological summaries: Persistence diagrams and landscapes evolve continuously along the trajectory, with small changes in delay vectors inducing small changes in topological features.

Probabilistic satisfaction of these four conditions implies that the reconstructed geometry behaves as a locally valid homeomorphic reconstruction of the underlying dynamics along typical noise realizations (i.e., a locally valid topological embedding).

Appendix A.1. Theoretical Conditions (Stark et al. [32])

Let

M

be a compact manifold of dimension

m \geq 1

, and let

N

be a compact manifold of dimension

n

. Let

f : M \times N \to M

be a

C^{r}

map with

r \geq 1 .

Let

ω = (η_{0}, η_{1}, \dots,) \in N^{Z}

(A1)

be a bi-infinite sequence of forcing parameters, and define the associated skew-product system

F (x, ω) = (f_{η_{0}} (x), σ ω)

(A2)

where

σ : C \to C

denotes the left shift on

N^{Z}

. Given a scalar observable

φ \in C^{r} (M \times N, R)

, the fiber-wise delay-coordinate map along a fixed noise realization

ω

is defined by

Φ_{p, φ . ω} (x) = (φ (x), φ (f_{η_{0}} (x)), φ (f_{η_{1}} ° f_{η_{0}} (x)), \dots . ., φ (f_{η_{d - 1}} ° \dots . . ° f_{η_{0}} (x)))

(A3)

where

d

denotes the number of delays and

f_{k} (x) = f (x, ω_{k})

.

Stark–Broomhead–Davies–Huke Theorem 5 [32] states that if

(i): the forcing family is finite-dimensional and $C^{r}$

the maps

f_{η}

are drawn from a finite-dimensional,

C^{r}

-smooth parameter family

\{f_{η} : η \in N\}

where N is a finite-dimensional manifold;

(ii): the noise distribution is absolutely continuous,

the law of the noise sequence

η_{t}

is absolutely continuous with respect to Lebesgue measure on N, ensuring almost every noise realization is well-defined;

(iii): the observation function is generic (residual),

the scalar observable

φ \in C^{r} (M)

belongs to a residual subset of observation functions, analogous to the genericity condition in classical Takens’ embedding theory;

(iv): the parameterization of maps is generic,

the parameterization

p \in C^{r} (N, {D i f f}^{r} (M))

, defined by

f_{η} = p (η)

, is generic in the sense of belonging to a residual subset, ensuring that coincidences in delay coordinates occur only on nongeneric sets;

(v): the embedding dimension is sufficient (injectivity),

d \geq 2 m + 1

is satisfied, where

m = \dim (M)

is the manifold dimension, and smoothness is

τ \geq 1

.

For a residual set of

(p, φ),

there exists an open dense (or full-measure) set of noise realizations

ω

such that the delay map

Φ_{p, φ, ω}

is an embedding of

M

, i.e., injective and a homeomorphism onto its image.

These conditions guarantee that the delay map preserves the geometry of locally

M

along almost every noise realization. Neighborhoods in the reconstructed space maintain the structure of

M

, enabling reliable application of clustering, regime identification, and state-dependent risk measures.

The diagnostics integrated in this Appendix A test the observable consequences of these conditions in finite samples.

Appendix A.2. Construction of Delay-Embedded Point Clouds

Given a univariate return series

{\{r_{t}\}}_{t = 1}^{T}

, we construct delay vectors

X_{t} = (r_{t}, r_{t - τ}, \dots . r_{t - (d - 1) τ})

(A4)

For embedding dimension

d

and delay

τ

. Sliding a window of width W across time yields point clouds

X_{t} = \{X_{t - W + 1}, \dots \dots X_{t}\}

(A5)

From each

X_{t}

, we compute Vietoris–Rips persistence diagrams

D_{t}^{(k)}

for k = 0, 1.

Appendix A.3. Embedding Dimension Condition (Injectivity)

Theorem 5 guarantees injectivity when

d \geq 2 \dim (M) + 1

, where

d i m (M)

is the manifold dimension of the state space M. Since dim (M) is unknown and may vary across regimes, we operationalize injectivity via topological stabilization across dimensions.

For candidate dimensions

d = 2, 3, \dots, 10

, we test

{‖D_{t}^{(k)} (d) - D_{t}^{(k)} (d - 1)‖}_{W} \leq ϵ, k = 0, 1

(A6)

where

{‖\cdot‖}_{W}

denotes the Wasserstein distance between diagrams. When this condition holds uniformly across representative windows, the 0- and 1-dimensional topological features no longer change materially with the additions of further coordinates, indicating that the reconstructed geometry has stabilized.

Alternatively, because Wasserstein distances scale at a rate

O (N^{3})

with the number of persistence pairs and becoming expensive for large datasets, we implement a functional–analytic alternative using persistence landscapes

λ_{t}^{(k)} (d)

, and evaluate their

L^{2}

distances across consecutive dimensions:

{‖λ_{t}^{(k)} (d) - λ_{t}^{(k)} (d - 1)‖}_{2} \leq ϵ, k = 0,1

(A7)

The smallest d satisfying Equation (A7) uniformly (or on average) over windows is taken as the operational embedding dimension

d^{*}

. Because persistence landscapes are 1-Lipschitz with respect to the bottleneck distance, this functional criterion is a computationally efficient surrogate for (A6).

The Fill-factor method proposed by Buzug et al. [59] is optionally applied as a geometric proxy for assessing static injectivity by measuring the average volume of parallelepipeds formed from local neighborhoods. However, this method is computationally expensive and escalates as

O (K d^{3})

per window (for k-sampled neighborhoods), so we apply it only for selected values of d.

Appendix A.4. Noise Regularity and Smoothness: Choice of Delay $τ$

Theorem 5 assumes smooth dependence on stochastic forcing. Empirically, this requires delay vectors to evolve smoothly over time. To enforce this smoothness in practice, we choose the delay τ so that the sliding window

X_{t} = (x_{t}, x_{t - τ}, \dots ., x_{t - (d - 1) τ})

(A8)

represents a quasi-instantaneous geometric state of the system. This is enforced via a local smoothness constraint:

‖X_{t} - X_{t - 1}‖ \leq δ

(A9)

where the smoothness scale δ is obtained from empirical autocorrelation and Mutual Information analysis as discussed in Tan et al. [37].

Accordingly, we choose

τ

as the first minimum of the autocorrelation function, or the decay to 1/e, or the first root of the local autocovariance function. These choices correspond to the smallest lag at which the components of

X_{t}

contain sufficiently new (i.e., non-redundant) information. In the Mutual Information method, the first minimum of Average Mutual Information (AMI) identifies the lag that minimizes redundancy while preserving dynamical dependence.

These criteria prevent the stochastic forcing from decorrelating the delay window too rapidly and ensure the persistence diagrams vary Lipschitz-continuously in time so that

|H_{t} - H_{t - 1}| \leq L ‖X_{t} - X_{t - 1}‖

(A10)

for some Lipschitz constant L. Hence, the Hilbert-space functional

H_{t}

empirically exhibits the smooth dependence on noise predicted by stochastic embedding theory.

This ensures that (i) the noise does not decorrelate the embedded window too rapidly; (ii) successive delay vectors correspond to smoothly varying states of the stochastic dynamics; and (iii) persistence diagrams vary smoothly over time.

Appendix A.5. Finite-Dimensional Forcing and Embedding Fidelity

Theorem 5 [32] assumes stochastic forcing acts through a finite-dimensional smooth family. Financial time series violate global stationarity due to jumps, heavy tails, and regime shifts, undermining the smoothness assumption. Therefore, we enforce local stationarity within sliding windows; thus, even if the global forcing is high-dimensional, the effective dynamics within each window can behave as if finite-dimensional.

Satisfaction of the local stationarity condition is assessed by monitoring the short-range autocorrelation function (ACF) across consecutive overlapping windows. For lags k = 1, 2, 3, we require

|{A C F}_{l a g k}^{(t)} - {A C F}_{l a g k}^{(t - 1)}| \leq δ_{A C F}

(A11)

where

δ_{A C F}

is a small tolerance parameter (typically 0.1–0.2, consistent with standard practice in time-series diagnostics).

A time window is classified as locally stationary if the above condition holds simultaneously for all monitored lags. The series is deemed locally stationary over a given period if this condition is satisfied for a large proportion of sliding windows. In practice, we require approximately 80–90% of windows to satisfy Equation (A11); results are robust to reasonable variations in this tolerance parameter.

Stable ACF values imply approximate preservation of finite-dimensional effective dynamics within each window, consistent with the theorem’s structural assumptions.

Appendix A.6. Lipschitz Stability of Persistence Diagrams, Landscapes and Lp-Norm Functionals

The choice of the delay parameter

τ

and the noise regularity diagnostics in Appendix A.4 assess local geometric stability, ensuring that delay vectors evolve smoothly under stochastic forcing. In contrast, the diagnostics in this section assess global topological stability, verifying that persistence diagrams and landscapes evolve Lipschitz-continuously over time. The former operates at the level of the embedded trajectory, while the latter concerns the stability of topological functionals derived from it.

Cohen-Steiner [58] proved that the bottleneck distance between diagrams is stable under sup-norm perturbations of the filtration function:

W_{\infty} (D (f), D (g)) \leq {‖f - g‖}_{\infty}

(A12)

where f and g are tame filtration functions defined on a simplicial complex or topological space K and

{‖f - g‖}_{\infty} = {s u p}_{x \subset K} |f (x) - g (x)|

(A13)

measures the maximum pointwise perturbation of the filtration values. Since filtrations

f_{t}

are induced implicitly by data-driven constructions (e.g., sliding-window embeddings and Vietoris–Rips filtrations) and not directly observable, we use rolling bottleneck distances

W_{\infty} (D_{t}, D_{t - 1})

which provides an empirical diagnosis of Lipschitz stability.

Small rolling bottleneck distances indicate robust topological structure, while large values signal abrupt geometric changes due to noise, regime shifts, or structural transitions. Stability is therefore interpreted relative to the empirical distribution of these distances.

Bubenik [34] proved that persistence landscapes

λ (D_{t})

provide a Lipschitz-stable functional representation:

{‖λ (D_{t}) - λ (D_{t - 1})‖}_{\infty} \leq W_{\infty} (D_{t}, D_{t - 1})

(A14)

This implies that landscape-level stability is automatically bounded by diagram-level bottleneck stability. To quantify stability in Hilbert space, we compute rolling distances between consecutive persistence landscapes:

L_{t}^{2} = {‖λ (D_{t}) - λ (D_{t - 1})‖}_{2}

(A15)

Small

L^{2}

values indicate stable topological features, while spikes indicate abrupt changes in the reconstructed attractor, e.g., due to non-stationarity or regime shifts.

Because persistence landscapes are Lipschitz-stable functional summaries, they provide robustness in heavy-tailed environments where moment-based statistics (variance, kurtosis, PCA) become unstable.

Appendix A.7. Algorithmic Implementation

The following algorithm describes a diagnostic and validation pipeline of stochastic local delay embeddings; failure of individual tests does not invalidate the method but flags regimes where local reconstruction assumptions may be violated.

Algorithm A1. Tests of Local Homeomorphic Delay-Embeddings

Input:

Time-series $x_{t}, t = 1, 2, \dots, T$ ,
Sliding window size W,
$Candidate embedding dimensions d \in \{d_{m i n}, \dots \dots, d_{m a x}\}$ ,
$Candidate delays τ \in T$ ,
Tolerance parameters:
- $ϵ$ is the stability threshold for Wasserstein distances, $γ$ is the Fill-factor threshold, $δ$ is noise regularity bound on delay vectors, and $δ_{A C F}$ is the tolerance threshold for local stationarity.

Step 1: Select embedding dimension

d

(Injectivity Test)

For each d

and window length W,

1.

Form delay vectors X_{t}^{(d, τ)} = (τ_{t}, τ_{t - τ}, \dots \dots, τ_{t - (d - 1) τ})

,

2.

Build point clouds X_{t}

,

3.

Compute the homology-

k persistence diagrams D_{t}^{(k)} (d), k = 0, 1

,

4.

Compute persistence landscapes λ_{t}^{(k)} (d) = P L (D_{t}^{(k)} (d))

,

5.

Compute rolling Hilbert-space distances:

{‖λ_{t}^{(k)} (d) - λ_{t}^{(k)} (d - 1)‖}_{2}

,

6.

Compute Fill-factor

{F F}_{t} (d)

from the normalized parallelepiped volumes of the delay clouds,

7.

The selected embedding dimension

d *

is the smallest d satisfying the following:

Diagram-level $stability W_{\infty} (D_{t}^{(k)} (d), D_{t}^{(k)} (d - 1)) \leq ϵ f o r k = 0, 1$ ;
$Persistence landscape stability : {‖λ_{t}^{(k)} (d) - λ_{t}^{(k)} (d - 1)‖}_{2} \leq ϵ f o r k = 0, 1$ ;
Fill-factor $criterion \bar{F F} (d) \geq γ a n d \bar{F F} (d) - \bar{F F} (d . 1) > 0$ .

Step 2: Select delay

τ

: Noise Regularity (Local Smoothness Test)

1.: $For each candidate delay τ \in T$ , evaluate the noise-regularity condition:

‖X_{t} - X_{t - 1}‖ \leq δ

ensuring that the delay-embedded trajectory varies smoothly in time and the stochastic forcing does not decorrelate the delay window too rapidly.

2.

Candidate delays τ

assessed with two complementary diagnostics

Autocorrelation test: Compute the lag- $τ$ $autocorrelation function ρ (τ)$ $and select the first local minimum, or the first τ$ $such that |ρ (τ)| < 1 / e$ .
$Average Mutual Information (AMI) : Estimate A M I (τ)$ using both non- $adaptive histograms and kNN estimators (Kraskov MI - KSG); select the first minimum of A M I (τ)$ providing an information-rich but minimally redundant delay.

3.

The selected

τ

is the value satisfying both smoothness and information criteria.

Step 3: Validate local stationarity (local smoothness and finite-dimensional forcing assumption)

1.: For each sliding window $[t - W + 1, t]$ , compute the sample autocorrelation

{A C F}_{t} (l), l \in L

2.: For each lag $l$ , compute the absolute inter-window differences

Δ_{A C F} (t, l) = |{A C F}_{t} (l) - {A C F}_{t - 1} (l)|

3.: A window satisfies the condition if

Δ_{A C F} (t, l) \leq δ_{A C F} f o r m o s t l \in L

The proportion of windows satisfying this inequality provides a quantitative measure of local stationarity.

Step 4: Lipschitz-stability of persistence diagrams and landscapes (Global Smoothness Test)

1.

For each window and homology degree k = 0, 1, compute persistence diagrams D_{t}^{(k)}

and corresponding persistence landscapes λ_{t}^{(k)} = P L (D_{t}^{(k)})

.

2.

Compute rolling Hilbert - space distances {‖λ_{t}^{(k)} (d) - λ_{t}^{(k)} (d - 1)‖}_{2}

.

3.

Compute

$Bottleneck distances W_{\infty} (D_{t}^{(k)}, D_{t - 1}^{(k)})$ ;
$L 2 - distances of landscapes {‖λ_{t}^{(k)} - λ_{t - 1}^{(k)}‖}_{2}$ .

4.

A window t is considered locally Lipschitz-stable if both

W_{\infty} (D_{t}^{(k)}, D_{t - 1}^{(k)}) \leq ϵ_{W}, ‖λ_{t}^{k} - λ_{t - 1}^{(k)}‖ \leq ϵ_{L} k = 0, 1

5.: Windows violating either condition are flagged for local stationarity failure, excessive noise, jumps, or transitions between dynamical regimes.

Output:

Stable, interpretable topological features

λ (D_{t})

, their

L^{p}

-norm functionals, and window-wise state-space geometry.

The pseudocode in executable style is provided in Supplementary Materials S2.

Appendix B. Jacobian-Based Window-Wise Diagnostics of Diffeomorphic Delay-Embeddings

This appendix describes the theoretical motivation, numerical construction, and interpretation of the local Jacobian diagnostics implemented to assess whether a delay-coordinate embedding satisfies the observable consequences of the diffeomorphic embedding result of Theorem 6 in Stark et al. [32].

Appendix B.1. Motivation and Theoretical Context

Stark et al. [32] establish two distinct embedding results for stochastically forced dynamical systems:

(1): Theorem 5 (Embedding/Homeomorphism)

For sufficiently large delay dimension

d = 2 m + 1

with

m = d i m (M)

and for

r \geq 1

, the delay coordinate map

Φ_{f, φ, ω} : M \to R^{d}

is generically as

C^{r}

embedding (i.e., an immersion and a homeomorphism onto its image), even under stochastic forcing.

(2): Theorem 6 (Diffeomorphic Embedding)

Suppose that the fiber maps are parameterized by a smooth family

p \in C^{r} (N, {D i f f}^{r} (M))

so that each realization

f_{η} = p (η) : M \to M

is a

C^{r}

diffeomorphism. Let the observable be state-dependent only,

φ \in C^{r} (M, R)

, and assume the forcing variables

ω_{k}

are i.i.d. with a common distribution

μ

, where

μ

is absolutely continuous with respect to a Lebesgue measure on

N .

Then, for a generic pair

(p, ω)

, the delay coordinate map

Φ_{f, φ, ω}

is a

C^{r}

embedding for a full-measure set of noise realizations

ω \in N^{Z}

. Moreover, the resulting embedding varies smoothly with respect to the noise realization, yielding a family of delay maps that is locally diffeomorphic.

In summary, if the underlying system dynamics form a smooth family of diffeomorphisms and the forcing is smooth and i.i.d., then for a generic choice of parameters and almost every noise realization, the delay-coordinate map is a

C^{r}

diffeomorphism onto its image. Thus, beyond topological invariants, the reconstruction preserves local differential and metric structure up to smooth coordinate change.

Appendix B.2. Empirical Interpretation and Diagnostic Approach

Although Theorem 6 [32] establishes an almost-sure

C^{r}

diffeomorphic embedding of the underlying dynamical system, nor the true state

x_{t} \in M

nor the delay-coordinate map

Φ

is directly observable in empirical applications. Therefore, we validate the observable differential consequences implied by the diffeomorphism result:

Full-rank differential: The Jacobian of the delay map is non-singular, implying local invertibility via the Inverse Function Theorem.
Local bi-Lipschitz behavior: The map exhibits bounded geometric distortion.
Regularity of the Jacobian field: The Jacobian varies smoothly along trajectories, reflecting $C^{1}$ -regularity.

Accordingly, we implement a window-wise Jacobian analysis that evaluates whether the delay reconstruction satisfies these observable consequences with high empirical frequency and whether deviations occur sporadically or systematically across time. The Jacobian-based diagnostics are applied to the induced delay dynamics in reconstruction space, not the latent system dynamics. The objective is not to prove diffeomorphism, but to test consistency with its necessary local differential implications.

Appendix B.3. Window-Wise Local Diffeomorphism Diagnostics

For each reconstructed state

x_{t}

within a sliding window, the delay coordinate reconstruction is evaluated according to the following criteria:

Locally invertibility: By the Inverse Function Theorem, if a $C^{1}$ map $F : R^{d} \to R^{d}$ has a non-singular Jacobian $J_{t} = D F (x_{t})$ at point $x_{t}$ , i.e., $d e t (J_{t}) \neq 0$ , equivalently if $σ_{m i n} (J_{t}) > 0$ then $F$ is locally invertible in the neighborhood of $x_{t}$ and hence locally one-to-one. Empirically we require $σ_{m i n} (J_{t}) > ε_{J}$ where $σ_{m i n} (J_{t})$ is the smallest singular value of the estimated Jacobian and $ε_{J} > 0$ is a numerical tolerance accounting for estimation error.
Bounded local distortion: Local bi-Lipschitz behavior is enforced by bounding the Jacobian condition number $κ (J_{t}) = σ_{m a x} (J_{t}) / σ_{m i n} (J_{t})$ . Empirically, this is evaluated as $κ (J_{t}) < K_{m a x}$ which prevents excessive anisotropic stretching or folding and ensures geometrically non-degenerate local behavior.
Smooth variation in the Jacobian field: Smoothness of the embedding concerns the temporal regularity of the Jacobian field along trajectories rather than pointwise properties. As a practical, robust proxy of $C^{1}$ -regularity, we evaluate the window-wise variance of the smallest singular value: $V a r (σ_{m i n} (J_{t})) < δ_{t}$ . This provides a robust empirical measure of the temporal regularity of the differential structure.

If a sliding window satisfies all three criteria for a sufficient proportion of admissible Jacobians

p > η

, where

η

is a sufficiently large threshold (e.g., >90%), the delay-coordinate reconstruction over the interval is locally invertible, geometrically non-degenerate, and differentially stable.

The fraction of windows satisfying all conditions provides an empirical analog for the “almost sure” qualification in stochastic delay-embedding theory: isolated local failures are permitted, whereas systematic violations indicate a breakdown of the diffeomorphic embedding assumptions.

Appendix B.4. Jacobian of the Delay-Coordinate Map

Let

x_{t} \in R^{d}

denote a point in the delay-embedded reconstruction obtained from a scalar observable

φ

. The (unknown) delay-coordinate map

Φ

induces a Jacobian

J_{t} = D Φ (x_{t}) \in R^{d \times d}

which governs the linearized local behavior of the embedding near

x_{t}

.

Because neither the underlying manifold

M

nor the map

Φ

is known explicitly, the Jacobian

J_{t}

must be estimated numerically from the reconstructed trajectory in delay space. The estimated Jacobian therefore does not approximate the Jacobian of the true underlying dynamical system, but rather the Jacobian of the induced dynamics in delay-coordinate space, i.e., the local linearization of the shift map acting on the reconstructed trajectory.

This distinction is essential: the diagnostics implemented here assess the local differential properties of the delay-coordinate reconstruction itself, which are the observable consequences implied by the diffeomorphic embedding result of Theorem 6, rather than properties of the unknown latent flow.

Appendix B.5. Local Jacobian Estimation via kNN Linearization

Local Jacobians are estimated using k-nearest-neighbor (kNN) local linear regression, a standard approach in non-linear time-series analysis.

For each interior point

x_{t}

in a delay-embedded point cloud

1.: Identify the set of k-nearest neighbors

$\{x_{j 1}, \dots, x_{j i}\}$

of $x_{t}$ in Euclidean distance.
2.: Construct local displacement matrices

$X = [\begin{matrix} x_{j i} - x_{t} \\ ⋮ \\ x_{j k} - x_{t} \end{matrix}], Y = [\begin{matrix} x_{j i + 1} - x_{t + 1} \\ ⋮ \\ x_{j k + 1} - x_{t + 1} \end{matrix}]$
3.: Jacobian Estimation
Estimate the local Jacobian $J_{t}$ via least-squares regression:

$Y \approx X J_{t}^{T}, J_{t} = \arg {m i n}_{j} {‖Y - X J^{T}‖}_{F}$

This procedure provides a consistent approximation to the Jacobian of the induced delay-coordinate dynamics when the embedding is $C^{1}$ and the neighborhood is sufficiently small. Boundary points lacking valid successors are excluded to avoid finite-sample artifacts.

Appendix B.6. Singular Value Diagnostics

For each estimated Jacobian

J_{t}

, we compute its singular values

σ_{1} (J_{t}) \geq \dots \geq σ_{d} (J_{t}) > 0

via singular value decomposition (SVD). These quantities encode the local geometric action of the embedding:

$σ_{m i n} (J_{t})$ controls local invertibility;
$(J_{t}) = \frac{σ_{m a x} (J_{t})}{σ_{m i n} (J_{t})}$ quantifies local geometric distortion (bi-Lipschitz behavior).

Appendix B.7. Diffeomorphism Criteria (Theorem 6 Tests)

We evaluate three necessary conditions for diffeomorphic embedding:

1.: Full Rank/Local Invertibility

$σ_{m i n} (J_{t}) > ε_{J}$

ensuring that the delay map is locally one-to-one.
2.: Lipschitz Bounded Distortion

$κ (J_{t}) < K_{m a x}$

preventing excessive local stretching or folding.
3.: Smooth Family Condition

${V a r}_{t} (σ_{i} (J_{t})) < δ_{J}$

Ensuring that the family of local Jacobians varies smoothly across time, as required for a $C^{1}$ family of diffeomorphisms.

Appendix B.8. Probabilistic vs. Mean-Based Window-Level Criterion

Within each sliding window

W_{t}

, we define two approaches to assess local diffeomorphism:

1.: Probabilistic (Exact) Test:

p_{i} = \frac{1}{|W_{i}|} \sum_{t \in W_{i}} 1 \{σ_{m i n} (J_{t}) \geq ε_{J}\}, p_{i} = \frac{1}{|W_{i}|} \sum_{t \in W_{i}} 1 \{κ (J_{t}) \geq K_{m a x}\}

A sliding window is declared consistent with the diffeomorphic local consequences of Theorem 6 if for the two conditions

p_{i} \geq η

hold simultaneously for a

η

that is a sufficiently large (e.g., >90%) and

V a r (σ_{m i n} (J_{t})) < δ_{J}

.

2.: Mean-based approximation (Optional)

Instead of computing the fraction of admissible Jacobians per window, which is computationally expensive, optionally one can compare the window-mean Jacobian value against the threshold:

{\bar{σ}}_{m i n} (W_{i}) \geq ε_{J}, \bar{κ} (W_{i}) \leq K_{m a x}

This provides a computationally efficient approximate test, useful for large datasets. Thus, if these conditions hold together with

V a r (σ_{m i n} (J_{t})) < δ_{J}

, then a sliding window is declared consistent with the diffeomorphic local consequences of Theorem 6.

Algorithm A2. Jacobian-Based Window-Wise Diagnostics of Diffeomorphic Delay-Embeddings

Input:

Scalar time- $series {\{x_{t}\}}_{t = 1}^{T}$ ,
Sliding window size W,
$Embedding dimensions (d^{*}, τ^{*})$ selected in Algorithm A1,
Neighborhood size k parameters for local linearization (k-nearest neighbor),
Tolerance parameters:
- $ε_{J}$ is the minimum singular value threshold, $K_{m a x}$ is the maximum allowable Jacobian condition number, $δ_{J}$ $is the temporal Jacobian variation threshold, η \in (0, 1]$ is the minimum proportion of admissible Jacobians per window.

Step 1: Delay-Coordinate Reconstruction

Construct the delay-embedding using the selected parameters

(d^{*}, τ^{*})

. For each time index

t

, define the delay vector

X_{t} = (x_{t}, x_{t - τ}, \dots . ., x_{t - (d^{*} - 1) τ^{*}}) \in R^{d^{*}}

Partition the reconstructed trajectory into overlapping sliding windows

W_{i} = \{z_{t} : t = i - W + 1, \dots ., i\}, i = W, \dots, T

Each window

W_{i}

defines a local delay-embedded point cloud

Step 2: Local Jacobian Estimation

For each delay vector

z_{t} \in W_{i}

with a valid temporal successor:

1.: Identify a local neighborhood

N_{t} = \{z_{j i}, \dots ., z_{j k}\}

Using k-nearest neighbors in

R^{d^{*}}

,

2.: Form displacement matrices

X = [\begin{matrix} x_{j i} - x_{t} \\ ⋮ \\ x_{j k} - x_{t} \end{matrix}], Y = [\begin{matrix} x_{j i + 1} - x_{t + 1} \\ ⋮ \\ x_{j k + 1} - x_{t + 1} \end{matrix}]

3.: If rank $(X_{t}) = d^{*}$ , estimate the local Jacobian $J_{t}$ via ordinary least squares:

Y_{t} \approx X_{t} J_{t,} J_{t} = \arg {m i n}_{J} {‖Y - X J‖}_{F}

Points with insufficient neighbors or rank-deficient

X_{t}

are excluded.

The Jacobian

J_{t}

estimates the local linearization of the induced delay-coordinate time-shift map on reconstruction space.

Step 3: Singular Value Diagnostics and Local Invertibility

For each estimated Jacobian

J_{t}

,

1.: Compute the singular value decomposition:

J_{t} = U_{t} \sum_{t} V_{t}^{T}, \sum_{t} = d i a g (σ_{1}, \dots ., σ_{d})

2.: Extract the singular values:

σ_{m i n} (J_{t}) = σ_{d^{*}}, σ_{m a x} (J_{t}) = σ_{1} . κ (J_{t}) = \frac{σ_{m a x} (J_{t})}{σ_{m i n} (J_{t})}

3.

Evaluate local diffeomorphic conditions:

Local injectivity condition

$σ_{m i n} (J_{t}) \geq ε_{J}$
Bounded distortion condition

κ (J_{t}) = \frac{σ_{m a x} (J_{t})}{σ_{m i n} (J_{t})} < K_{m a x}

Jacobians failing either condition are flagged as locally non-admissible.

Step 4: Temporal Regularity of the Jacobian Field

Within each window

W_{i}

, assess smoothness of the Jacobian family via the empirical variability of singular values.

Define

{V a r}_{i} (σ_{m i n}) = V a r \{σ_{m i n} (J_{t}) : z_{t} \in W_{i}\}

Temporal regularity is deemed satisfied if

{V a r}_{i} (σ_{m i n}) < δ_{J}

Indicating a smoothly varying family of local diffeomorphisms consistent with

C^{1}

regularity.

Step 5: Window-Level Aggregation and Stability Criterion

For each sliding window

W_{t}

, compute the admissible proportion:

p_{i} = \frac{1}{|W_{i}|} \sum_{t \in W_{i}} 1 \{σ_{m i n} (J_{t}) \geq ε_{J}\}, p_{i} = \frac{1}{|W_{i}|} \sum_{t \in W_{i}} 1 \{κ (J_{t}) \geq K_{m a x}\}

A window is declared locally diffeomorphic if

$p_{i} \geq η$ , and
$V a r (σ_{m i n}) < δ_{J}$ .

Optionally compute the approximation based on window-mean Jacobian values:

{\bar{σ}}_{m i n} (W_{i}) \geq ε_{J}, \bar{κ} (W_{i}) \leq K_{m a x}

A window is declared locally diffeomorphic if

${\bar{σ}}_{m i n} (W_{i}) \geq ε_{J}$ , and $\bar{κ} (W_{i}) \leq K_{m a x}$ , and $V a r (σ_{m i n}) < δ_{J}$

Output:

Window-wise indicators of local diffeomorphism validity.
Empirical distributions of
- $σ_{m i n} (J_{t})$ ;
- $κ (J_{t})$ .
Time intervals where the delay embedding satisfies the hypothesis of Theorem 6.
Numerical validations that the reconstructed dynamics belong to a $C^{1}$ $(or C^{r})$ family of diffeomorphisms, conditional on Algorithm A1.

References

Messina, E.; Toscani, D. Hidden Markov models for scenario generation. IMA J. Manag. Math. 2008, 19, 379–401. [Google Scholar] [CrossRef]
Palma, G.R.; Skoczen, M.; Maguire, P. Asset price movement prediction using empirical mode decomposition and Gaussian mixture models. arXiv 2025, arXiv:2503.20678. [Google Scholar] [CrossRef]
Horvath, B.; Zacharia, I.; Aitor, M. Clustering Market Regimes Using the Wasserstein Distance. arXiv 2021, arXiv:2110.11848. [Google Scholar] [CrossRef]
Luan, Q.; Hamp, J. Automated Regime Detection in Multidimensional Time Series Data Using Sliced Wasserstein K-Means Clustering. arXiv 2023, arXiv:2310.01285. [Google Scholar] [CrossRef]
Cao, Y.; Leung, P.; Monod, A. K-Means Clustering for Persistence Homology. Adv. Data Anal. Classif. 2025, 19, 95–119. [Google Scholar] [CrossRef]
Sayde, M.; Fahs, J.; Abou-Faycal, I. Heavy-Tailed Linear Regression and K-Means. Information 2025, 16, 184. [Google Scholar] [CrossRef]
Edelsbrunner, H.; Letscher, D.; Zomorodian, A. Topological persistence and simplification. Discret. Comput. Geom. 2002, 28, 511–533. [Google Scholar] [CrossRef]
Zomorodian, A.; Carlson, G. Computing Persistent Homology. Discret. Comput. Geom. 2005, 33, 249–274. [Google Scholar] [CrossRef]
Leykam, D.; Angelakis, D. Topological data analysis and machine learning. Adv. Phys. X 2023, 8. [Google Scholar] [CrossRef]
Fama, E.F. Mandelbrot and the Stable Paretian Hypothesis. J. Bus. 1963, 36, 420–429. [Google Scholar] [CrossRef]
Longin, F. The Asymptotic Distribution of Extreme Stock Market Returns. J. Bus. 1996, 69, 383–408. [Google Scholar] [CrossRef]
Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Financ. 2001, 1, 223–236. [Google Scholar] [CrossRef]
Gabaix, X.; Gopikrishnan, P.; Plerou, V.; Stanley, H.E. A Theory of Limited Liquidity and Large Investors Causing Spikes in Stock Market Volatility and Trading Volume. J. Eur. Econ. Assoc. 2007, 5, 564–573. [Google Scholar] [CrossRef][Green Version]
Gabaix, X. Power Laws in Economics: An Introduction. J. Econ. Perspect. 2016, 30, 185–206. [Google Scholar] [CrossRef]
Takens, F. Detecting Strange Attractors in Turbulence. In Dynamical Systems and Turbulence, Warwick 1980; Rand, D.A., Young, L.S., Eds.; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany, 2006; Volume 898, pp. 366–381. [Google Scholar] [CrossRef]
Gidea, M.; Katz, Y. Topological Data Analysis of Financial Time Series: Landscapes of Crashes. Phys. A Stat. Mech. Its Appl. 2018, 491, 820–834. [Google Scholar] [CrossRef]
Gidea, M. Topology Data Analysis of Critical Transitions in Financial Networks. arXiv 2017, arXiv:1701.06081. [Google Scholar] [CrossRef]
Yen, P.T.W.; Cheong, S.A. Using topological data analysis (TDA) and persistent homology to analyze the stock markets in Singapore and Taiwan. Front. Phys. 2021, 9, 572216. [Google Scholar] [CrossRef]
Yen, P.T.W.; Xia, K.; Cheong, S.A. Understanding Changes in the Topology and Geometry of Financial Market Correlations during a Market Crash. Entropy 2021, 23, 1211. [Google Scholar] [CrossRef]
Nie, C.X. Unveiling complex nonlinear dynamics in stock markets through topological data analysis. Phys. A Stat. Mech. Its Appl. 2025, 680, 131025. [Google Scholar] [CrossRef]
Katz, Y.A.; Biem, A. Time-resolved topological data analysis of market instabilities. Phys. A Stat. Mech. Its Appl. 2021, 571, 125816. [Google Scholar] [CrossRef]
Akingbade, S.W.; Gidea, M.; Manzi, M.; Nateghi, V. Why topological analysis detects financial bubbles? arXiv 2023, arXiv:2304.06877. [Google Scholar] [CrossRef]
Ismail, M.R.; Noorani, M.S.M.; Ismail, M.; Razak, F.A.; Alias, M.S. Early warning signals of financial crises using persistent homology. Phys. Stat. Mech. Appl. 2022, 586, 126459. [Google Scholar] [CrossRef]
Rudkin, S.; Qiu, W.; Dlotko, P. Uncertainty, volatility and the persistence norms of financial time series. Expert Syst. Appl. 2023, 223, 119894. [Google Scholar] [CrossRef]
Valdivia, A.D. Topological variability in financial markets. Quant. Financ. Econ. 2023, 7, 391–402. [Google Scholar] [CrossRef]
Ruiz-Ortiz, M.A.; Gómez-Larrañaga, J.C.; Rodríguez-Viorato, J. A persistent-homology-based turbulence index & some applications of TDA on financial markets. arXiv 2022, arXiv:2203.05603. [Google Scholar] [CrossRef]
Aguilar, A.; Ensor, K. Topology Data Analysis Using Mean Persistence Landscapes in Financial Crashes. J. Math. Financ. 2020, 10, 648–678. [Google Scholar] [CrossRef]
Aromi, L.L.; Katz, Y.; Vives, J. Topological features of multivariate distributions: Dependency on the covariance matrix. Commun. Nonlinear Sci. Numer. Simul. 2021, 103, 105996. [Google Scholar] [CrossRef]
Souto, H.G. Topological tail dependence: Evidence from forecasting realized volatility. J. Financ. Data Sci. 2023, 9, 100107. [Google Scholar] [CrossRef]
Souto, H.G.; Moradi, A. A generalization of the Topological Tail Dependence theory: From indices to individual stocks. Decis. Anal. J. 2024, 12, 100512. [Google Scholar] [CrossRef]
Gidea, M.; Goldsmith, D.; Katz, Y.; Roldan, P.; Shmalo, Y. Topological recognition of critical transitions in time series of cryptocurrencies. Phys. A Stat. Mech. Its Appl. 2020, 548, 123843. [Google Scholar] [CrossRef]
Stark, J.; Broomhead, D.S.; Davies, M.E.; Huke, J. Takens embedding theorems for forced and stochastic systems. Nonlinear Anal. Theory Methods Appl. 1997, 30, 5303–5314. [Google Scholar] [CrossRef]
Stark, J.; Broomhead, D.; Davies, M.; Huke, J. Delay Embeddings for Forced Systems. II. Stochastic Forcing. J. Nonlinear Sci. 2003, 13, 519–577. [Google Scholar] [CrossRef]
Bubenik, P. Statistical Topological Data Analysis using Persistence Landscapes. J. Mach. Learn. Res. 2015, 16, 77–102. [Google Scholar] [CrossRef]
Adams, H.; Emerson, T.; Kirby, M.; Neville, R.; Peterson, C.; Shipman, P.; Chepushtanova, S.; Hanson, E.; Motta, F.; Ziegelmeier, L. Persistence images: A stable vector representation of persistent homology. J. Mach. Learn. Res. 2017, 18, 1–35. [Google Scholar] [CrossRef]
Bobrowski, O.; Skraba, P. A universal null-distribution for topological data analysis. Sci. Rep. 2023, 13, 12274. [Google Scholar] [CrossRef]
Tan, E.; Algar, S.D.; Corrêa, D.; Stemler, T.; Small, M. Network representations of attractors for change point detection. Commun. Phys. 2023, 6, 340. [Google Scholar] [CrossRef]
Myers, A.; Muñoz, D.; Khasawneh, F.A.; Munch, E. Temporal network analysis using zigzag persistence. EPJ Data Sci. 2023, 12, 6. [Google Scholar] [CrossRef]
Von Luxburg, U. A tutorial on spectral clustering. Stat. Comput. 2007, 17, 395–416. [Google Scholar] [CrossRef]
Jia, H.; Ding, S.; Xu, X.; Nie, R. The latest research progress on spectral clustering. Neural Comput. Appl. 2014, 24, 1477–1486. [Google Scholar] [CrossRef]
Li, H.; Sewell, D.K. Model-based edge clustering for weighted networks with a noise component. Comput. Stat. Data Anal. 2025, 209, 108172. [Google Scholar] [CrossRef]
Kontak, M.; Vidal, J.; Tierny, J. Statistical Parameter Selection for Clustering Persistence Diagrams. arXiv 2019, arXiv:1910.08398. [Google Scholar] [CrossRef]
Carriere, M.; Cuturi, M.; Oudot, S. Sliced Wasserstein Kernel for Persistence Diagrams. In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, Australia, 6–11 August 2017; pp. 664–673. [Google Scholar] [CrossRef]
Davies, T.; Aspinall, J.; Wilder, B.; Tran-Thanh, L. Fuzzy c-Means Clustering for Persistence Diagrams. arXiv 2020, arXiv:2006.02796. [Google Scholar] [CrossRef]
Cont, R.; Potters, M.; Bouchaud, J.-P. Scaling in Stock Market Data: Stable Laws and Beyond. arXiv 1997, arXiv:cond-mat/9705087. [Google Scholar] [CrossRef]
Bouchaud, J.P.; Potters, M. Apparent Multifractality in Financial Time Series. arXiv 1999, arXiv:cond-mat/9906347. [Google Scholar] [CrossRef]
Ding, Z.; Clive, W.J.; Granger, C.W.J.; Engle, R.F. A long memory property of stock market returns and a new model. J. Empir. Financ. 1993, 1, 83–106. [Google Scholar] [CrossRef]
Robinson, A.; Turner, K. Hypothesis testing for topological data analysis. J. Appl. Comput. Topol. 2017, 1, 241–261. [Google Scholar] [CrossRef]
Bobrowski, O.; Skraba, P. Cluster Persistence for Weighted Graphs. Entropy 2023, 25, 1587. [Google Scholar] [CrossRef]
Rand, W.M. Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc. 1971, 66, 846–850. [Google Scholar] [CrossRef]
Hubert, L.; Arabie, P. Comparing partitions. J. Classif. 1985, 2, 193–218. [Google Scholar] [CrossRef]
Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
Caliński, T.; Harabasz, J. A dendrite method for cluster analysis. Commun. Stat. 1974, 3, 1–27. [Google Scholar] [CrossRef]
Davies, D.L.; Bouldin, D. A Cluster Separation Measure. IEEE Trans. Pattern Anal. Mach. Intell. 1979, 1, 224–227. [Google Scholar] [CrossRef] [PubMed]
Gretton, A.; Borgwardt, K.M.; Rasch, M.J.; Schölkopf, B.; Smola, A. A Kernel Two-Sample Test. J. Mach. Learn. Res. 2012, 13, 723–773. [Google Scholar]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar] [CrossRef]
Clauset, A.; Shalizi, C.R.; Newman, M.E.J. Power-law distributions in empirical data. SIAM Rev. 2009, 51, 661–703. [Google Scholar] [CrossRef]
Cohen-Steiner, D.; Edelsbrunner, H.; Harer, J. Stability of Persistence Diagrams. Discret. Comput. Geom. 2007, 37, 103–120. [Google Scholar] [CrossRef]
Buzug, T.; Pfister, G. Comparison of algorithms calculating optimal embedding parameters for delay time coordinates. Phys. D 1992, 58, 127–137. [Google Scholar] [CrossRef]

Figure 1. Topological summaries computed from multivariate daily return vectors of S&P 500, Dow Jones Index, Nasdaq 100, and Russell 2000 for the period of 1981–2025 using a window size w = 100 days: (a) Sample Persistence Diagram of homology dimensions

H_{0}

,

H_{1},

and

H_{2}

. (b) Time evolution of

L^{2}

-norms of persistence landscapes computed from point clouds extracted from multivariate returns of the 4 indices overlaying the S&P 500 index for the period of 1992–2025. Spikes of

L^{2}

-norms

H_{1}

are observed before the dot-com, Lehmann, and COVID-19 crashes.

Figure 1. Topological summaries computed from multivariate daily return vectors of S&P 500, Dow Jones Index, Nasdaq 100, and Russell 2000 for the period of 1981–2025 using a window size w = 100 days: (a) Sample Persistence Diagram of homology dimensions

H_{0}

,

H_{1},

and

H_{2}

. (b) Time evolution of

L^{2}

-norms of persistence landscapes computed from point clouds extracted from multivariate returns of the 4 indices overlaying the S&P 500 index for the period of 1992–2025. Spikes of

L^{2}

-norms

H_{1}

are observed before the dot-com, Lehmann, and COVID-19 crashes.

Figure 2. Diffeomorphic reconstruction of a stochastically forced Lorenz system from scalar observation. (a) Trajectory of the stochastically forced Lorenz system in

R^{3}

, showing the characteristic “butterfly” dynamics. (b) Scalar time series

x_{1} (t)

extracted from the observed stochastic trajectory. (c) Delay-coordinate reconstruction of the scalar series using an embedding dimension

d = 7

satisfying the sufficient conditions for a local diffeomorphism. Window-wise Jacobian diagnostics confirm that the observable geometric consequences of

C^{r}

diffeomorphism hold for approximately 90% of finite windows.

Figure 2. Diffeomorphic reconstruction of a stochastically forced Lorenz system from scalar observation. (a) Trajectory of the stochastically forced Lorenz system in

R^{3}

, showing the characteristic “butterfly” dynamics. (b) Scalar time series

x_{1} (t)

extracted from the observed stochastic trajectory. (c) Delay-coordinate reconstruction of the scalar series using an embedding dimension

d = 7

satisfying the sufficient conditions for a local diffeomorphism. Window-wise Jacobian diagnostics confirm that the observable geometric consequences of

C^{r}

diffeomorphism hold for approximately 90% of finite windows.

Figure 3. Matrix of pairwise Wasserstein distances of delay-embedded point clouds across homology dimensions: (a) a matrix for

H_{0}

, where dark diagonal elements indicate minimal distance, while off-diagonal regions with high distance reveal topological deformation; (b) a matrix for

H_{1}

, where light-colored blocks indicate the emergence or disappearance of loop structure, reflecting regime shifts; and (c) a combined matrix of

H_{0}

and

H_{1}

, which emphasizes windows topologically similar across both homology dimensions.

Figure 3. Matrix of pairwise Wasserstein distances of delay-embedded point clouds across homology dimensions: (a) a matrix for

H_{0}

, where dark diagonal elements indicate minimal distance, while off-diagonal regions with high distance reveal topological deformation; (b) a matrix for

H_{1}

, where light-colored blocks indicate the emergence or disappearance of loop structure, reflecting regime shifts; and (c) a combined matrix of

H_{0}

and

H_{1}

, which emphasizes windows topologically similar across both homology dimensions.

Figure 4. Combined topological and geometrical similarity measures across time used to define temporal edge weights

w_{t, s}

in the temporal topological network. (a) Minimum spanning tree (MST) of delay-embedded heavy-tailed log-returns. The point cloud corresponds to a single time window. MST edges colored by their Euclidean lengths reflecting intra-window geometric weights. The characteristic geometric scale

L_{t}

is computed as the minimum distance to connect all the points in the point cloud. (b) Similarity matrix representing temporal edge weights

w_{t, s}

between time windows. Each entry combines topological coherence via diagram-level Wasserstein distances and geometric scale via MST lengths. High values indicate strongly similar geometrical states, while low values reveal regime transitions or structural deformation.

Figure 4. Combined topological and geometrical similarity measures across time used to define temporal edge weights

w_{t, s}

in the temporal topological network. (a) Minimum spanning tree (MST) of delay-embedded heavy-tailed log-returns. The point cloud corresponds to a single time window. MST edges colored by their Euclidean lengths reflecting intra-window geometric weights. The characteristic geometric scale

L_{t}

is computed as the minimum distance to connect all the points in the point cloud. (b) Similarity matrix representing temporal edge weights

w_{t, s}

between time windows. Each entry combines topological coherence via diagram-level Wasserstein distances and geometric scale via MST lengths. High values indicate strongly similar geometrical states, while low values reveal regime transitions or structural deformation.

Figure 5. Temporal geometric structure of delay-embedded stochastic, heavy-tailed time series. (a) Spectral embedding of a sparse k-NN temporal similarity graph (k = 6) reveals compact regions of time windows with coherent reconstructed geometry and weakly connected bridges corresponding to regime transitions. Node color indicates total MST length. The embedding coordinates correspond to the leading eigenvectors of the graph Laplacian and have no direct physical interpretation; they quantify similarity in the k-NN graph, so only relative distances and cluster structure are meaningful. (b) Timeline view of the thresholded top-k similarity graph showing persistent connectivity during stable regimes and transient connections during transitions, with node color reflecting local geometric complexity.

Figure 6. Temporal evolution of the local geometry of fat-tailed stochastic time series exhibiting regime shift. (a) Log-returns time series highlighting persistent regimes with varying volatility. (b) Total MST length

L_{t}

computed on sliding windows of a delay-embedded point cloud.

L_{t}

provides a geometrically faithful summary of the local reconstructed attractor at each time step. (c) Temporal evolution of edge weights

w_{t, s}

capturing joint topological similarity—via diagram-based Wasserstein distances—and geometric coherence—via MST length differences between consecutive delay embedded windows.

Figure 6. Temporal evolution of the local geometry of fat-tailed stochastic time series exhibiting regime shift. (a) Log-returns time series highlighting persistent regimes with varying volatility. (b) Total MST length

L_{t}

computed on sliding windows of a delay-embedded point cloud.

L_{t}

provides a geometrically faithful summary of the local reconstructed attractor at each time step. (c) Temporal evolution of edge weights

w_{t, s}

capturing joint topological similarity—via diagram-based Wasserstein distances—and geometric coherence—via MST length differences between consecutive delay embedded windows.

Figure 7. Temporal topological null-model applied to a Gaussian i.i.d. and a regime-switching fat-tailed process. (a) z-scores of

ϕ_{t}^{w d}

computed from L²-norms of

H_{1}

relative to the null model (dashed blue line) across window index, Gaussian case in the top panel, fat-tailed case in the lower panel. (b) Autocorrelations (lags = 1–50, x-axis) of the z-score series, compared to the null envelope (±2 standard deviations, gray band). Significant deviations from the null model highlight genuine temporal organization in the dynamics of

ϕ_{t}^{w d}

, beyond marginal geometric variability. Gaussian case in the top panel, fat-tailed in the lower panel.

Figure 7. Temporal topological null-model applied to a Gaussian i.i.d. and a regime-switching fat-tailed process. (a) z-scores of

ϕ_{t}^{w d}

computed from L²-norms of

H_{1}

relative to the null model (dashed blue line) across window index, Gaussian case in the top panel, fat-tailed case in the lower panel. (b) Autocorrelations (lags = 1–50, x-axis) of the z-score series, compared to the null envelope (±2 standard deviations, gray band). Significant deviations from the null model highlight genuine temporal organization in the dynamics of

ϕ_{t}^{w d}

, beyond marginal geometric variability. Gaussian case in the top panel, fat-tailed in the lower panel.

Figure 8. Synthetic price process at 5 min resolution with ground-truth bull and bear market regimes, heavy-tails, and jumps: (a) Synthetic price trajectory over a period of 2 years (39,212 observations). Ground-truth bear regimes highlighted in light-blue and letter “B”: (b) log-returns over the same period. Ground-truth bear regimes highlighted in light-blue and letter “B”.

Figure 9. Fill-factor test of local injectivity for

d = 3

: (a) log-volume normalized over window index t; (b) histogram with distribution of Fill-factor values. Mean normalized volume

\bar{V} = e x p (- 5.055)

\approx 0.0055

with std. = 0.0041 indicates non-degenerate local neighborhood volumes and absence of geometric collapsing.

Figure 9. Fill-factor test of local injectivity for

d = 3

: (a) log-volume normalized over window index t; (b) histogram with distribution of Fill-factor values. Mean normalized volume

\bar{V} = e x p (- 5.055)

\approx 0.0055

with std. = 0.0041 indicates non-degenerate local neighborhood volumes and absence of geometric collapsing.

Figure 10. Temporal smoothness diagnostics over delays

τ = 1, \dots, 10

: (a) autocorrelation of log-return windows (red line indicates zero auto-correlation); (b) Kraskov Mutual Information estimator. Minimal autocorrelation and nearly constant MI indicate temporal coherence and low redundancy.

Figure 10. Temporal smoothness diagnostics over delays

τ = 1, \dots, 10

: (a) autocorrelation of log-return windows (red line indicates zero auto-correlation); (b) Kraskov Mutual Information estimator. Minimal autocorrelation and nearly constant MI indicate temporal coherence and low redundancy.

Figure 11. Lipschitz stability test (a) Method 1: Lipschitz stability measured as bottleneck distance between persistence diagrams; (b) Method 2: Lipschitz stability measured in Hilbert space as

L^{2}

distance between persistence landscapes. Despite differences in absolute scale, both Methods produce consistent stability patterns: small values across time indicate bounded perturbations, with occasional excursions at regime transitions.

Figure 11. Lipschitz stability test (a) Method 1: Lipschitz stability measured as bottleneck distance between persistence diagrams; (b) Method 2: Lipschitz stability measured in Hilbert space as

L^{2}

distance between persistence landscapes. Despite differences in absolute scale, both Methods produce consistent stability patterns: small values across time indicate bounded perturbations, with occasional excursions at regime transitions.

Figure 12. Trajectories of topological functional summaries over time computed using Vietoris-Rips filtration: (a)

L^{2}

norms of persistence landscapes for homology

H_{0}

. Ground-truth bear regime shaded in gray. Sharp excursions exhibit clear temporal alignment with bear regime; (b)

L^{2}

norms of persistence landscapes for homology

H_{1}

. Ground-truth bear regime shaded in gray. Fluctuations exhibit a more gradual pattern, with increases preceding regime shifts and moderate excursions during bear regimes.

Figure 12. Trajectories of topological functional summaries over time computed using Vietoris-Rips filtration: (a)

L^{2}

norms of persistence landscapes for homology

H_{0}

. Ground-truth bear regime shaded in gray. Sharp excursions exhibit clear temporal alignment with bear regime; (b)

L^{2}

norms of persistence landscapes for homology

H_{1}

. Ground-truth bear regime shaded in gray. Fluctuations exhibit a more gradual pattern, with increases preceding regime shifts and moderate excursions during bear regimes.

Figure 13.

l

-value distribution vs. left-Gumbel for VR-filtration: (a)

L^{2} (H_{0})

persistence landscapes for homology

H_{0}

; (b)

L^{2} (H_{1})

persistence landscapes.

Figure 13.

l

-value distribution vs. left-Gumbel for VR-filtration: (a)

L^{2} (H_{0})

persistence landscapes for homology

H_{0}

; (b)

L^{2} (H_{1})

persistence landscapes.

Figure 14. Synthetic price trajectory of ground-truth process with True Bull (blue) and True Bear (red) market regimes. Ground-truth bear regimes are shaded in gray.

Figure 15. Synthetic price trajectory over time, with segments colored according to cluster assignments obtained via topology-based clustering of

L^{2}

-norms (

H_{0}

,

H_{1}

) and the deviation functional

ϕ_{t}^{w d}

computed on delay-embedded log-returns point-clouds. Clusters inferred from log-returns topology are mapped to the price trajectory. Ground-truth bear regimes are shaded in gray.

Figure 15. Synthetic price trajectory over time, with segments colored according to cluster assignments obtained via topology-based clustering of

L^{2}

-norms (

H_{0}

,

H_{1}

) and the deviation functional

ϕ_{t}^{w d}

computed on delay-embedded log-returns point-clouds. Clusters inferred from log-returns topology are mapped to the price trajectory. Ground-truth bear regimes are shaded in gray.

Figure 16. Left-tail complementary cumulative distribution function (CCDF) and power-law fit by cluster versus exponential and lognormal fit: (a) Cluster 0 Vietoris–Rips (VR) filtration; (b) Cluster 1 Vietoris–Rips (VR) filtration.

Table 1. Parameters for synthetic price trajectories with ground-truth regimes.

Regime	Process	Parameter	Value ¹
Bull	Fat-tail	True or False	True = Student t
	Drift	$μ_{b u l l}$	0.07
	Volatility	$σ_{b u l l}$	0.20
	Deg. Freedom	$ν_{b u l l}$	20
	Jump Intensity	$λ_{b u l l}$	0.5
	Jump Mean, std.	$μ_{Y}^{b u l l}, σ_{Y}^{b u l l}$	0.01,0.01
Bear	Fat-tail	True or False	True = Student t
	Drift	$μ_{b e a r}$	−0.15
	Volatility	$σ_{b e a r}$	0.30
	Deg. Freedom	$ν_{b e a r}$	5
	Jump Intensity	$λ_{b e a r}$	2.5
	Jump Mean, std.	$μ_{Y}^{b u l l}, σ_{Y}^{b u l l}$	−0.02,0.01

¹ Annualized value.

Table 2. Local injectivity test: Wasserstein distances consecutive persistence diagrams across embedding dimensions

d = 3, \dots, 10

.

Table 2. Local injectivity test: Wasserstein distances consecutive persistence diagrams across embedding dimensions

d = 3, \dots, 10

.

	Wasserstein Distances Across Embedding Dimensions
Homology	d = 3	d = 4	d = 5	d = 6	d = 7	d = 8	d = 9	d = 10
$H_{0}$	0.002788	0.002493	0.002318	0.00228	0.002355	0.002528	0.002754	0.003023
$H_{1}$	0.000411	0.000505	0.000563	0.000588	0.0006	0.000588	0.00058	0.000563

Table 3. Local injectivity test:

L^{2}

distances consecutive persistence landscapes across embedding dimensions

d = 3, \dots, 10

.

Table 3. Local injectivity test:

L^{2}

distances consecutive persistence landscapes across embedding dimensions

d = 3, \dots, 10

.

	$L^{2}$ Distances Across Embedding Dimensions
Homology	d = 3	d = 4	d = 5	d = 6	d = 7	d = 8	d = 9	d = 10
$H_{0}$	0.002601	0.00282	0.002361	0.00174	0.002152	0.002265	0.001906	0.001577
$H_{1}$	0.000248	0.000305	0.000328	0.000409	0.00031	0.000259	0.000334	0.000287

Table 4. Smoothness measured as window-to-window variation

‖X_{t} - X_{t - 1}‖

.

Table 4. Smoothness measured as window-to-window variation

‖X_{t} - X_{t - 1}‖

.

	Window-to-Window Variation
$Delay τ$	Mean	Max	90th perc.	95th perc.	99th perc.
1	0.003566	0.028987	0.006046	0.007162	0.010146
2	0.003651	0.021807	0.005936	0.006948	0.009584
3	0.003653	0.022596	0.00593	0.006932	0.009563
4	0.003653	0.024183	0.00594	0.006932	0.009482
5	0.003652	0.021622	0.005939	0.006912	0.0096
6	0.003647	0.022339	0.005927	0.006961	0.009628
7	0.003652	0.022091	0.005932	0.006928	0.009602
8	0.003653	0.024198	0.005936	0.006911	0.009646
9	0.003653	0.023394	0.005919	0.006961	0.009653
10	0.003651	0.022713	0.005934	0.006938	0.00968

Table 5. Autocorrelation function (ACF) test results for monitored lag

l \in [1, 5]

: Percentage of windows satisfying

|Δ A C F (l)| \leq δ_{A C F} = 0.15

.

Table 5. Autocorrelation function (ACF) test results for monitored lag

l \in [1, 5]

: Percentage of windows satisfying

|Δ A C F (l)| \leq δ_{A C F} = 0.15

.

	Percentage of Windows \|ΔACF\| ≤ 0.15
Criteria	Lag 1	Lag 2	Lag 3	Lag 4	Lag 5
% Windows \|ΔACF\| ≤ 0.15	94.45	94.45	94.67	94.73	94.81

Table 6. Window-wise Jacobian diagnostics of observables Theorem 6.

Diagnostic	Windows $p > η$	$Mean p$ /Window	Window $p < η$	Rate $p < η$	Mean Per Window
$Local invertibility (full rank) σ_{m i n} (J_{t}) > ε_{J}$	1.000	0.9982	13	0.0003	0.3439
$Bounded distortion κ (J_{t}) < K_{m a x}$ $, K_{m a x} = 500$	0.997	0.9952	125	0.003	31.7410
$C^{1}$ $smoothness V a r (σ_{m i n} (J_{t})) < δ_{J}$ $, δ_{J} = 1.0$	1.000	-	-	-	0.0496
All conditions jointly	0.997	-	-	-

Table 7. Geometric Null Test for VR-filtration

H_{0}

and

H_{1}

homology dimensions (full data ensemble and 10% sample).

Table 7. Geometric Null Test for VR-filtration

H_{0}

and

H_{1}

homology dimensions (full data ensemble and 10% sample).

	$H_{0}$	$H_{1}$	$H_{0}$	$H_{1}$
Ensemble	Full	Full	Sample	Sample
Number diagrams tested	39,291	39,291	4000	4000
Total ℓ-values computed	667,947	114,785	68,000	11,772
Mean p-value	0.6704	0.5557	0.5704	0.5555
Signal points α = 0.1 (uncorrected)	0	133	0	9
% Signal points (uncorrected)	0%	0.1%	0%	0.1%
Signal points α = 0.1 (Bonferroni)	0	0	0	0
% Signal points (Bonferroni)	0%	0%	0%	0%

Table 8. Temporal topology null test: autocorrelation test VR-filtration.

Functional	Statistic	Observed	Surrogate Mean *	Surrogate Std. *	z-Score	p-Value	Significant Lags	Statistical Significance
$Signed {s ϕ}_{t}^{w d (k)}$	Mean	1.36 × 10⁻⁷	−1.38 × 10⁻⁹	±2.07 × 10⁻⁸	6.64	0.0020		Yes
	$T_{m a x}$	0.7950	-	-	-	0.0020	$[1, \dots, 44]$	Yes
	$T_{s q}$	2.1823	-	-	-	0.0020		Yes
$Abs . {a ϕ}_{t}^{w d (k)}$	Mean	1.55 × 10⁻⁴	1.69 × 10⁻⁴	±1.64 × 10⁻⁷	−80.6	1.000		No
	$T_{m a x}$	0.7361	-	-	-	0.0020	$[1, \dots, 50]$	Yes
	$T_{s q}$	1.6291	-	-	-	0.0020		Yes

* Surrogate distributions obtained via

N_{s u r r} = 500 .

Table 9. Temporal topology null test: autocorrelation test 1PFK-filtration.

Functional	Statistic	Observed	Surrogate Mean *	Surrogate Std. *	z-Score	p-Value	Significant Lags	Statistical Significance
$Signed {s ϕ}_{t}^{w d (k)}$	Mean	1.19 × 10⁻⁷	−3.090 × 10⁻⁹	3.27 × 10⁻⁸	3.74	0.002		Yes
	$T_{m a x}$	0.9035	-	-	-	0.002	$[1, \dots, 47]$	Yes
	$T_{s q}$	4.4984	-	-	-	0.002		Yes
$Abs . {a ϕ}_{t}^{w d (k)}$	Mean	1.53 × 10⁻⁴	2.251 × 10⁻⁴	2.41 × 10⁻⁷	299.79	1.000		No
	$T_{m a x}$	0.9031	-	-	-	0.002	$[1, \dots, 50]$	Yes
	$T_{s q}$	6.0798	-	-	-	0.002		Yes

* Surrogate distributions obtained via

N_{s u r r} = 500 .

Table 10. VR-filtration: Descriptive statistics of clusters extracted using heavy-tailed K-means.

	Windows	Mean Return 5 min	Vol. /Std.	Ann. Mean Return	Ann. Vol	Skew	Kurtosis
Cluster 0	26,543	5.093 × 10⁻⁷	0.001445	0.01	0.201096	−0.004068	0.341958
Cluster 1	12,728	−1.711 × 10⁻⁵	0.00195	−0.3362	0.274687	−0.003642	4.538515

Table 11. 1PFK-filtration: Descriptive statistics of clusters extracted using heavy-tailed K-means.

	Windows	Mean Return 5 min	Vol. /Std.	Ann. Mean Return	Ann. Vol	Skew	Kurtosis
Cluster 0	25,047	−0.000004	0.001400	−0.0786	0.196630	0.004704	2.416410
Cluster 1	14,245	−0.000008	0.001952	−0.1573	0.273702	−0.015169	4.981497

Table 12. Temporal topology null test: ground-truth metrics.

	VR-Filtration			1PFK-Filtration
Metrics	Observed	Mean Null	p-Value	Observed	Mean Null	p-Value
Total Accuracy (TA)	0.8026	0.5708	0.0000	0.7545	0.5549	0.0000
Bear-Recall (BR)	0.7149	0.3231	0.0000	0.6951	0.3630	0.0000
Adjusted Rand Index (ARI)	0.3593	0.0001	0.0000	0.2501	0.0000	0.0000

Table 13. Temporal topology null hypothesis test: unsupervised clustering metrics.

	VR-Filtration			1PFK-Filtration
Metrics	Observed	Mean Null	p-Value	Observed	Mean Null	p-Value
Silhouette (S)	0.5348	−0.0003	0.000	0.5472	−0.0005	0.000
Calinski–Harabasz (CH)	24,187.0	0.9391	0.000	28,823.3	0.9412	0.000
Davies–Bouldin Index (DB)	0.7553	570.32	0.000	0.7186	643.32	0.002
MMD between cluster separation	0.70709	0.0093	0.000	0.7596	0.0092	0.000
MMD within-cluster 0 self-similarity	0.00148	0.0093	0.5374	0.00121	0.0092	0.5391
MMD within-cluster 1 self-similarity	0.00160	0.0093	0.4609	0.00156	0.0092	0.4534

Table 14. Temporal topology null hypothesis test: lower tail-dependence.

	VR-Filtration			1PFK-Filtration
Metrics	Observed	Mean Null	p-Value	Observed	Mean Null	p-Value
Upper Tail-Dependence	0.034	0.0249	0.7850	0.0183	0.0249	0.4930
Lower Tail-Dependence	0.0127	0.025	0.1730	0.0230	0.0256	0.3300

Table 15. Feature-space statistical tests compared to the temporal null values.

		VR-Filtration			1PFK-Filtration
Tests	Metric	Observed	Mean Null	p-Value	Observed	Mean Null	p-Value
Kruskal–Wallis	L2_H0	25,007	0.9950	0.0010	27,241	0.9847	0.0010
Kruskal–Wallis	L2_H1	223.62	0.9636	0.0010	-	-	-
Kruskal–Wallis	$ϕ_{t}^{w d} (1)$	504.01	1.0003	0.0010	-	-	-
Mann–Whitney U	L2_H0	1.69 × 10⁸	8.39 × 10⁵	0.0010	1.78 × 10⁸	0.86 × 10¹⁰	0.0010
Mann–Whitney U	L2_H1	1.57 × 10⁷	8.45 × 10⁵	0.0010	-	-	-
Mann–Whitney U	$ϕ_{t}^{w d} (1)$	2.36 × 10⁷	8.68 × 10⁵	0.0010	-	-	-
Kolmogorov–Smirnov	L2_H0	0.9990	0.0094	0.0010	1.000	0.0090	0.0010
Kolmogorov–Smirnov	L2_H1	0.1046	0.0093	0.0010	-	-	-
Kolmogorov–Smirnov	$ϕ_{t}^{w d} (1)$	0.1196	0.0094	0.0010	-	-	-
Anderson–Darling	L2_H0	18,421.4	−0.0182	0.0010	19,029	−0.0212	0.0010
Anderson–Darling	L2_H1	313.89	0.0170	0.0010	-	-	-
Anderson–Darling	$ϕ_{t}^{w d} (1)$	408.87	−0.0153	0.0010	-	-	-

Table 16. Economic meaning statistical tests compared to the temporal null values.

		VR-Filtration			1PFK-Filtration
Tests	Metric	Observed	Mean Null	p-Value	Observed	Mean Null	p-Value
Kruskal–Wallis	Return	0.3691	1.0037	0.5774	0.3449	1.0173	0.8362
Mann–Whitney U	Return	638,142	837,865	0.5495	201,857	875,962	0.8691
Kolmogorov–Smirnov	Return	0.0490	0.0091	0.0010	0.0422	0.0090	0.0010
Anderson–Darling	Return	122.15	0.0163	0.0010	118.47	−0.0099	0.0010
Levene	Vol roll.	5646.48	1.1666	0.0010	4,008,94	1.0378	0.0010
Levene	Vol ann.	5648.48	0.9749	0.0010	4,008.94	1.0041	0.0010

Table 17. Empirical size and power summary across filtrations.

Test Class	Metric Type	VR-Filtration	1PFK-Filtration	Interpretation
Feature-Space	$L^{2} (H_{0}), L^{2} (H_{1}),$	Size = 0	Size = 0	Strong, filtration robust
Feature-Space		Power = 1	Power = 1	Geometric separation
Feature-Space	$ϕ_{t}^{w d} (1)$	Size = 0	Size = 0	Persistent topological
Feature-Space		Power = 1	Power = 1	discrimination
Economic (Returns)	Kruskal	Size controlled	Size controlled	No systematic mean return
Economic (Returns)	Mann–Whitney	Power = 0	Power = 0	differences
Economic (Returns)	KS, AD	Size = 0	Size = 0	Regime-dependent distributional
Economic (Returns)		Power = 1	Power = 1	and tail behavior
Economic (Volatility)	Levene	Size = 0	Size = 0	Distinct volatility regimes
Economic (Volatility)		Power = 1	Power = 1

Table 18. Power-law fit results for the left tails of VR-filtration clusters.

Test Class	$α$	$x_{m i n}$	KS Statistic	KS p-Value	AD Statistic	Ad p-Value	Power-Law vs. Exp (R²)	Power-Law vs. Logn (R²)
Cluster 0	7.4438	0.034	0.0430	0.577	1.146	0.290	−4.1133	−3.9712
Cluster 0							p = 0.016	p = 0.060
Cluster 1	4.5813	0.039	0.0269	0.953	0.441	0.803	5.9846	−0.9089
Cluster 1							p = 0.223	p = 0.406

Table 19. Power-law fit results for the left tails of 1PFK-filtration clusters.

Test Class	$α$	$x_{m i n}$	KS Statistic	KS p-Value	AD Statistic	AD p-Value	Power-Law vs. Exp (R²)	Power-Law vs. Logn (R²)
Cluster 0	12.6486	0.035	0.0568	0.647	1.1284	0.257	−1.738	−2.741
Cluster 0							p = 0.018	p = 0.183
Cluster 1	4.9498	0.040	0.0234	0.963	0.3322	0.890	8.924	−0.422
Cluster 1							p = 0.098	p = 0.583

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lamothe-Fernández, P.; Rojas, E.; Bayuk, A. Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series. Mathematics 2026, 14, 1098. https://doi.org/10.3390/math14071098

AMA Style

Lamothe-Fernández P, Rojas E, Bayuk A. Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series. Mathematics. 2026; 14(7):1098. https://doi.org/10.3390/math14071098

Chicago/Turabian Style

Lamothe-Fernández, Prosper, Eduardo Rojas, and Andriy Bayuk. 2026. "Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series" Mathematics 14, no. 7: 1098. https://doi.org/10.3390/math14071098

APA Style

Lamothe-Fernández, P., Rojas, E., & Bayuk, A. (2026). Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series. Mathematics, 14(7), 1098. https://doi.org/10.3390/math14071098

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Topology-Based Machine Learning and Regime Identification in Stochastic, Heavy-Tailed Financial Time Series

Abstract

1. Introduction

2. Theoretical Framework

2.1. Delay Embedding of Stochastically Forced Time Series

2.2. Temporal Topological Networks

2.3. Clustering as a Statistical Operator on Temporal Topology

2.4. Temporal Scales of Topological Organization

2.5. Topological Null-Models

2.6. Temporal Topological Null-Models

3. Materials and Methods

3.1. Synthetic Data Generation

3.2. Tests of Local Homeomorphic Delay-Embeddings

3.3. Jacobian-Based Diagnostics of Local Diffeomorphic Embedding

3.4. Computation of Topological Functionals

3.5. Unsupervised Learning

3.6. Geometric Null Hypothesis

3.7. Temporal Topological Null Hypothesis Test

3.8. Topological Cluster-Label Null Hypothesis Tests

3.9. Left-Tail Power-Law Fitting and Distributional Analysis of Cluster Tails

4. Results of the Numerical Studies

4.1. Ground-Truth Synthetic Price Trajectories

4.2. Local Homeomorphism Tests for Delay Embeddings

4.2.1. Local Injectivity

4.2.2. Noise Regularity and Smoothness

4.2.3. Finite Dimensional Forcing, Low Dimensional Flow

4.2.4. Lipschitz Stability of Topological Summaries

4.2.5. Summary of Local Validity

4.3. Jacobian-Based Test of Local Diffeomorphism

4.4. Computation of Topological Functionals

4.5. Geometric Null Hypothesis Test

4.6. Temporal Null: Global Autocorrelation Test

4.7. Unsupervised Learning: Heavy-Tailed K-Means Clustering

4.8. Temporal Null Hypothesis Tests

4.8.1. Ground-Truth Alignment

4.8.2. Unsupervised Clustering Metrics

4.8.3. Upper- and Lower-Tail Dependence

4.9. Cluster-Label Null Hypothesis Tests

4.9.1. Feature-Space Validation

4.9.2. Economic Meaning Validation

4.9.3. Empirical Calibration of Cluster-Label Tests

4.10. Left-Tail Power-Law Fitting and Distributional Analysis of Cluster Tails

5. Discussion

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Tests of Local Homeomorphic Delay Embeddings

Appendix A.1. Theoretical Conditions (Stark et al. [32])

Appendix A.2. Construction of Delay-Embedded Point Clouds

Appendix A.3. Embedding Dimension Condition (Injectivity)

Appendix A.4. Noise Regularity and Smoothness: Choice of Delay τ

Appendix A.5. Finite-Dimensional Forcing and Embedding Fidelity

Appendix A.6. Lipschitz Stability of Persistence Diagrams, Landscapes and Lp-Norm Functionals

Appendix A.7. Algorithmic Implementation

Appendix B. Jacobian-Based Window-Wise Diagnostics of Diffeomorphic Delay-Embeddings

Appendix B.1. Motivation and Theoretical Context

Appendix B.2. Empirical Interpretation and Diagnostic Approach

Appendix B.3. Window-Wise Local Diffeomorphism Diagnostics

Appendix B.4. Jacobian of the Delay-Coordinate Map

Appendix B.5. Local Jacobian Estimation via kNN Linearization

Appendix B.6. Singular Value Diagnostics

Appendix B.7. Diffeomorphism Criteria (Theorem 6 Tests)

Appendix B.8. Probabilistic vs. Mean-Based Window-Level Criterion

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Appendix A.4. Noise Regularity and Smoothness: Choice of Delay $τ$