1. Introduction
Robust and reliable root-finding remains a central component of scientific computing, underpinning nonlinear optimization, inverse problems, parameter estimation, and the numerical solution of discretized PDE models [
1,
2,
3]. In many applications, the practical performance of an iterative solver depends not only on the underlying function
f and the initialization, but also on algorithmic hyperparameters (e.g., damping, step control, restart policies) and on the sensitivity of the iteration map to perturbations. This sensitivity is especially pronounced in high-order and parameterized iterative schemes, such as multipoint methods, where the convergence dynamics may change significantly with the choice of internal parameters [
4]. As a consequence, there is growing interest in methods for assessing solver reliability across parameter domains and in identifying whether useful information about convergence behavior may already be available during the early stages of the iteration process.
Nonlinear solvers are frequently embedded within broader computational workflows, for instance in parameter estimation, inverse problems, multiphysics simulations, and nonlinear model predictive control. Nonlinear models of practical relevance arise in a wide range of applications, including biomedical and physiological modeling, bioheat transfer in therapeutic contexts, and pharmacological therapy optimization [
5,
6,
7,
8]. In such settings, as well as in large-scale steady-state computations in dynamical systems models [
9], unreliable solver behavior may affect the robustness and reliability of downstream computations, as recently discussed in the context of inference for differential equation models [
10]. While the present work focuses on controlled parameterized root-finding benchmarks, these broader settings motivate the study of whether early solver-derived information can provide useful indications about future convergence behavior before the maximum iteration count is reached.
Recent work has explored the integration of data-driven components into numerical algorithms, with the aim of improving robustness, efficiency, and adaptivity [
11,
12,
13]. In the context of nonlinear solvers, these approaches include learned initialization strategies, surrogate-assisted solvers, and neural network-based root-finding methods [
14,
15,
16]. At the same time, recent studies have also emphasized the importance of carefully assessing the scope and validity of data-driven approaches in scientific computing, particularly in relation to benchmark design, baseline comparisons, and the interpretation of predictive performance [
17].
A related line of research has focused on trajectory-derived quantities motivated by dynamical systems theory. Since iterative solvers generate sequences of approximations that can be interpreted as discrete dynamical systems, quantities such as Lyapunov exponents and related finite-time indicators provide one possible way to characterize transient sensitivity and local divergence behavior [
18,
19,
20]. These quantities can provide useful qualitative information about solver dynamics, but they are typically constructed from full trajectories and therefore are less naturally suited to early-stage prediction settings.
In earlier work [
21], we introduced a contractivity profiling approach based on a kNN-derived proxy of a largest Lyapunov exponent (LLE), enabling the construction of stability landscapes over parameter domains. While that study focused on trajectory-derived stability profiles computed from complete solver histories, it also motivated a related question: to what extent can information extracted from only the early stages of the iteration process be used to anticipate convergence within prescribed iteration horizons?
The present work revisits this limitation and reformulates the underlying question in a more restricted predictive setting. Rather than asking whether a trajectory-derived quantity can itself be reconstructed from partial information, we investigate what information about later convergence outcomes may already be contained in the early stages of the iteration process. In particular, we study whether short prefixes of solver trajectories contain predictive information about convergence behavior at different iteration horizons.
To examine this question, we consider solver-level targets that measure convergence within prescribed iteration horizons. Specifically, for each parameter configuration, we define quantities such as
which measure the fraction of trajectories that converge within a prescribed number of iterations. These targets provide a hierarchy of prediction tasks, ranging from short-horizon to longer-horizon convergence behavior. Compared with our earlier profile-based setting, this formulation avoids predicting a trajectory-derived quantity from its own partial evolution and instead focuses on convergence outcomes that are directly tied to solver behavior.
Within this setting, we analyze several families of early solver-derived features extracted from the initial iterations, including residual-based, step-based, and trajectory-derived quantities. The objective is not to propose a new machine learning methodology, but rather to examine which types of early-time information appear most predictive of convergence outcomes, and how this depends on the feature representation and on the prediction horizon. To further clarify the positioning of the present study with respect to related approaches,
Table 1 summarizes the main differences in terms of targets, feature representations, and methodological objectives. In particular, the present study examines how early solver-derived features relate to solver-level convergence outcomes at multiple finite iteration horizons.
Within the considered benchmark setting, the numerical results indicate several recurring trends across the analyzed datasets. Predictive performance is not uniform across iteration horizons, but depends on both the target horizon and the structure of the retained features. In the present experiments, short-horizon targets (e.g., ) are more strongly associated with level-type quantities reflecting the current state of the solver trajectory, whereas longer-horizon targets (e.g., and ) appear to depend more strongly on information related to the evolution of the trajectory, including trends and variability measures. These observations suggest that the predictive content of early solver-derived features is both horizon-dependent and problem-dependent within the considered benchmark problems.
A second observation concerns the relative behavior of different feature families. In the reported experiments, residual-based features generally provide the strongest predictive performance, which is consistent with their direct relation to the convergence quantities used as targets. Step-based features also remain competitive across several settings, while the Lyapunov-based proxy, although theoretically motivated, plays a more limited role as a standalone predictor in the present setting. Taken together, these comparisons help clarify which types of early solver-derived quantities appear most strongly associated with the targets in the considered experiments.
The experiments also show substantial variability across problem instances and parameter regimes. Some benchmark settings exhibit near-saturated predictability already at very short prefixes, whereas others remain comparatively challenging even at longer horizons. In addition, cross-problem transfer is consistently more difficult than within-problem prediction, suggesting that predictive relationships learned in one setting may not generalize uniformly across different solver configurations and benchmark problems. These observations highlight both the potential usefulness and the current limitations of early convergence outcome prediction in controlled parameterized settings.
Taken together, the present study should be viewed primarily as a controlled investigation of how early solver-derived features relate to convergence outcomes within prescribed iteration horizons, rather than as a general diagnostic methodology for nonlinear solvers. The goal is to examine, in a simplified benchmark setting, which types of early-time information appear most predictive, how this depends on the prediction horizon, and to what extent compact feature representations retain useful predictive content. Our study is intentionally restricted to controlled scalar benchmark problems, with the aim of isolating and analyzing the structure of early-time predictive information in a simplified setting. Extension to higher-dimensional nonlinear systems, broader solver classes, and application-driven computational settings remains future work.
The main contributions of this work can be summarized as follows:
A reformulation of the prediction setting based on solver-level convergence targets (), designed to study convergence outcomes rather than proxy-derived trajectory scores.
A multi-horizon analysis of early solver-derived information, examining how predictive behavior changes across short-, intermediate-, and longer-iteration convergence targets.
A comparative study of several feature families, including residual-based, step-based, and trajectory-derived quantities, in order to assess which types of early-time information appear most predictive in the considered benchmark problems.
An empirical evaluation of both within-problem prediction and cross-problem transfer, highlighting the variability of predictive performance across different parameter regimes and benchmark settings.
A comparative predictive analysis based on a simple fixed regression model used as a feature-analysis probe, with the goal of examining the relationship between compact early-time feature representations and convergence outcomes.
The remainder of the paper is organized as follows.
Section 2 introduces the considered prediction setting, including the iterative scheme, feature construction, and multi-horizon prediction protocol.
Section 3 describes the experimental design and benchmark configuration.
Section 4 presents the main numerical results, including horizon-dependent behavior and feature comparisons.
Section 5 discusses the interpretation and limitations of the observed predictive trends. Finally,
Section 6 concludes the paper and outlines directions for future work.
2. Methodology
This section describes the prediction setting and feature construction procedure used to study early solver-derived information in parameterized root-finding problems. In contrast to the earlier profile-based formulation, the present study focuses on solver-level targets that quantify convergence behavior within fixed iteration counts. The goal is to examine how information contained in short prefixes of solver trajectories relates to convergence outcomes at different iteration horizons.
Figure 1 provides a conceptual overview of the workflow. Solver ensembles are generated over a parameter grid, iteration histories are recorded, and several feature families are constructed from residual, step, and trajectory information. These feature families are then restricted to early prefixes and either used directly as prefix vectors or compressed into single scalar summaries. In the present study, the direct-prefix analyses are summarized at
, whereas the scalar summary analyses are evaluated at the fixed prefix lengths
and
. A fixed kNN regression model is then used to estimate solver-level targets, and the resulting performance is analyzed as a function of both the prediction horizon and the type of feature used.
2.1. Problem Setting and Iterative Scheme
We consider nonlinear root-finding problems of the form
where
in the benchmark problems considered in this study. More generally, the same type of question may arise for systems
, although such extensions are not investigated here. The computational backbone is a derivative-free, parallel, two-parameter iterative scheme controlled by parameters
. In the present study, the scheme is used as a fixed generator of solver trajectories over the parameter grid, and no modification of the underlying numerical method is introduced. For each fixed
, the solver generates a sequence of iterates
starting from an initial guess
.
The objective of this work is not to modify or improve the underlying numerical scheme, but rather to study how quantities extracted from the observed iteration dynamics can be used to predict convergence outcomes at prescribed iteration horizons. Auxiliary validation of the solver itself, including sensitivity to initialization and comparison with existing methods, is reported in
Appendix A.1.
2.2. Solver Trajectories and Observable Quantities
Each solver run produces a trajectory , from which we extract observable quantities that summarize the evolution of the iteration. In this study, we consider the following in particular:
Residual-based quantities, when available;
Step-based quantities, such as ;
Derived scalar time series constructed from the evolution of these quantities.
One quantity used throughout the analysis is the log step norm
which provides a compact scalar representation of the local motion of the iterates. Step-based quantities are natural solver observables because they provide a direct measure of the local motion of the iterates and complement residual-based information.
2.3. Solver-Level Targets and Prediction Horizons
To quantify convergence behavior at selected iteration horizons, we define solver-level targets based on fixed iteration counts. For a given parameter configuration
and a set of initializations, we define
where
in the present study.
These quantities provide a hierarchy of prediction tasks:
describes short-horizon convergence behavior;
describes intermediate-horizon convergence behavior;
describes longer-budget convergence behavior.
This formulation avoids the self-referential aspect of predicting a profile-derived quantity from its own partial trajectory, and instead focuses on quantities that are more directly tied to solver performance. It also makes it possible to examine how predictive behavior changes with the iteration horizon.
2.4. Early-Time Feature Construction
To extract information from the initial phase of the solver dynamics, we construct features from short prefixes of the trajectory. For a given prefix length
N, we define the feature vector
where
denotes a scalar observable derived from the solver trajectory, such as a residual-based or step-based quantity. Thus,
N is the number of early iterations retained from a given feature family. In the main within-base analysis, the dependence on prefix length is visualized through curves in
N; for compact reporting, the direct-prefix summaries are reported at
, while the scalar summary analyses are reported at
and
.
We consider multiple families of features:
Residual-based features, which describe the evolution of the residual;
Step-based features, which describe the motion of the iterates;
Trajectory-derived features, including quantities constructed from the sequence ;
Lyapunov-based proxy features, obtained via a kNN-based approximation of local divergence rates.
The kNN–LLE proxy is constructed by applying a delay embedding to the micro-series and estimating local divergence through multi-horizon prediction errors. This yields a profile , which is used here as a trajectory-derived descriptor related to local stability. In the present study, this proxy is treated as one feature family among several, rather than as the central object of the analysis.
2.5. Feature Families and Scalar Summaries
To make the feature comparison more transparent and to isolate the contribution of different types of early-time information, we additionally compress each prefix into a single scalar summary . These summaries are used to represent distinct aspects of the first N iterations:
Level: the last value, prefix mean, or prefix median;
Trend: the linear slope, last–first difference, or early–late mean difference;
Variability: the prefix standard deviation or mean absolute first difference;
Curvature: the mean second difference or related quantities.
This decomposition makes it possible to compare two complementary representations: (i) the full-prefix vector , and (ii) a compressed representation in which only one scalar descriptor is retained. The latter viewpoint helps assess which type of early information—level, trend, variability, or curvature—is most strongly associated with predictive performance in the considered benchmark problems.
2.6. Multi-Horizon Prediction Setting
For each prefix length
N and each horizon
, we define two related prediction tasks:
The first formulation corresponds to the feature-family curves shown later in the paper, whereas the second corresponds to the single-summary predictors.
This multi-horizon setting makes it possible to examine how predictive performance changes as more early trajectory information is retained and how it depends on the temporal scale of the target. In particular, it allows us to compare short-horizon prediction, which may be more strongly associated with the current level of the trajectory, with longer-horizon prediction, where trend and variability information may become more relevant. It also makes the role of the selected early-prefix lengths explicit: and for the scalar summary analyses, and , , and for the aggregate direct-prefix summaries.
2.7. Benchmark Design, Validation Settings, and Evaluation Metrics
The experimental setup is based on a controlled benchmark including multiple datasets, parameter grids, and feature configurations. The main text reports representative within-base results for Dataset 1 and Dataset 2A, while additional multi-dataset summaries are used to assess transfer behavior across the considered benchmark problems.
Within each dataset, performance is evaluated by a random train/test split, providing an estimate of in-distribution predictive accuracy. In addition, we consider cross-base transfer, in which a model trained on one dataset is tested on another. This setting provides a stricter comparison than within-base prediction and is used to examine how strongly the observed predictive relationships depend on the specific benchmark problem.
All predictive experiments use the same regression model family, namely k-nearest neighbors (kNN), combined with standardization and a small grid search for the neighborhood size. Model performance is evaluated by the coefficient of determination (), mean absolute error (MAE), and root mean squared error (RMSE) for within-base prediction, and by the weighted absolute percentage error (WAPE) for cross-base transfer. The use of a simple regression model is intentional: the goal is not to optimize machine learning performance or to propose a new learning algorithm, but to use a fixed predictive probe for comparing feature representations and assessing the information contained in early solver trajectories.
3. Experimental Setup
This section describes the experimental design used to evaluate the early prediction of convergence outcomes within prescribed iteration horizons. The setup is designed to examine how predictive performance depends on the prefix length, the prediction horizon, and the type of feature used.
3.1. Benchmark Equations, Parameter Grid, and Computational Budget
The controlled benchmark used in the present study is built from two nonlinear equation classes, considered here as controlled root-finding test problems. Dataset 1 uses the polynomial test problem
treated with the vector-valued modified two-step scheme used in the original solver setting. Dataset 2A uses a scalar oscillatory benchmark of the form
with the representative configuration
and center parameter
. In both cases, the solver is controlled by the same method parameters
.
All experiments are performed over a two-parameter control grid
defined by
sampled on a uniform
grid over the parameter domain.
For each grid point, we generate an ensemble of independent solver trajectories corresponding to distinct initializations. Each trajectory is computed for a maximum of iterations, and a fixed random seed is used to ensure reproducibility across all experiments.
3.2. Trajectory Construction and Observables
From each trajectory
, we extract scalar time series that summarize the evolution of the solver. In particular, we consider the log step norm
which provides a compact scalar representation of the local motion of the iterates.
To ensure numerical robustness, a log-domain tail floor is applied to prevent instability in extremely small step regimes. In addition, a lightweight stabilization mechanism is used to avoid numerical overflow in strongly divergent cases, while preserving the natural dynamics in non-explosive regimes.
3.3. Definition of Solver-Level Targets
To quantify convergence behavior at selected iteration horizons, we define solver-level targets based on fixed iteration counts. For each parameter configuration
, we define
where
.
These targets provide a hierarchy of prediction tasks:
describes short-horizon convergence behavior;
describes intermediate-horizon convergence behavior;
describes longer-budget convergence behavior.
This formulation allows us to assess whether early-time information is associated with convergence outcomes at different temporal scales.
3.4. Feature Construction and Prefix Definition
For each trajectory, features are constructed from early prefixes of the observed time series. For a given prefix length
N, we define
where
denotes a scalar observable derived from the trajectory, such as a residual-based or step-based quantity. Thus,
N is the number of early iterations retained from a given feature family. In the main within-base analysis, the dependence on prefix length is visualized through curves in
N; for compact reporting, the direct-prefix summaries use
, whereas the scalar summary analyses use
and
.
Two complementary representations are studied. First, the prefix vector itself is used as the predictor. Second, the same prefix is compressed into a single scalar summary . The latter representation is used to assess which aspect of the first N iterations is most informative when only one scalar descriptor is retained. The candidate summaries include level-type descriptors (e.g., last value, prefix mean, prefix median), trend-type descriptors (e.g., linear slope, last–first difference, early–late mean difference), and variability descriptors (e.g., prefix standard deviation, mean absolute first difference, mean second difference).
As an additional feature source, we also consider a kNN-based proxy for local divergence rates derived from the micro-series . This proxy is computed using a delay embedding with look-back length , forecast horizons , and neighborhood size . The resulting profile is used here as a trajectory-derived descriptor related to local stability, and is treated as one feature family among several.
3.5. Multi-Horizon Prediction Protocol
For each prefix length
N and each target horizon
, we define a prediction problem of the form
The first formulation corresponds to the feature-family curves shown later in the paper, whereas the second corresponds to the single-summary predictors. Prefix lengths are scanned as
up to the available trajectory length, allowing us to evaluate how prediction accuracy changes as longer early prefixes are retained. In addition to the full curves, the manuscript explicitly discusses scalar summary analyses at
and
, together with a direct-prefix reference at
, in order to make the notion of “early” information concrete.
3.6. Train/Test Splits and Transfer Settings
Within each dataset, grid points are randomly partitioned into training and test sets with a fixed test fraction of , providing an estimate of in-distribution predictive performance. In addition, we consider cross-base transfer, where a model trained on one dataset is tested on another. This setting provides a stricter comparison than within-dataset prediction and is used to examine how strongly the observed predictive relationships depend on the specific benchmark problem.
3.7. Prediction Model and Evaluation Metrics
For each pair , a k-nearest neighbors (kNN) regressor is trained to predict from either the prefix vector or the scalar summary , depending on the representation being evaluated. Standardization is applied, and a small grid search is used to select the neighborhood size. The use of kNN is not intended as a machine learning contribution; rather, it provides a fixed and simple nonparametric probe for comparing feature representations under the same predictive model.
Performance is evaluated on held-out test data using the mean absolute error (MAE), root mean squared error (RMSE), and coefficient of determination () for within-base prediction. For cross-base transfer, we use the weighted absolute percentage error (WAPE), which provides a transparent error measure for comparing transfer behavior across datasets. The analysis focuses not only on aggregate performance, but also on how predictive accuracy depends on the prefix length, the target horizon, and the feature family.
5. Discussion
The revised results provide a more focused view of how early solver-derived information relates to convergence outcomes at prescribed iteration horizons in the considered parameterized root-finding benchmarks. By replacing the original proxy-based target with solver-level success fractions , , and , the analysis shifts from predicting a trajectory-derived quantity to examining which types of early information are associated with convergence outcomes at different iteration horizons.
Across the considered datasets and targets, the strongest predictors are generally simple scalar summaries of the residual and step sequences, particularly mean and median log residual and log step features (
Table 2). The fixed-prefix scalar summary analyses (
Table 3,
Table 4,
Table 5 and
Table 6, and
Figure 4) make this observation more explicit: a single scalar extracted from the first
or
iterations can already provide substantial predictive information, and, in the strongest cases, the continuous curves of
Figure 3 show only limited additional improvement when extending the direct prefix to
.
Within the considered experiments, the relative importance of feature families appears to depend on the target horizon: shorter horizons are more closely associated with early trajectory level, whereas longer horizons appear to benefit from information about the evolution of the trajectory. These observations suggest that early prediction in this benchmark setting is strongly influenced by level information, namely the magnitude and initial decay behavior of the residual. By contrast, variability measures and Lyapunov-based features provide a weaker standalone predictive performance in the present experiments. This indicates that, for the targets considered here, much of the useful predictive signal is captured by relatively simple early trajectory summaries rather than by more elaborate trajectory-derived descriptors.
The results also show a visible distinction between easier and more challenging prediction regimes within the considered benchmarks. In Dataset 1, the prediction problem is close to saturated: leading features achieve near-perfect
values at very short prefixes (
Figure 3, left panel). By contrast, Dataset 2A provides a more challenging case, where prediction performance remains moderate and feature rankings become more discriminative (
Figure 3, right panel). The contrast between Dataset 1 and Dataset 2A indicates that predictive difficulty is strongly problem-dependent, even within controlled scalar benchmarks. This supports the importance of including sufficiently heterogeneous benchmark settings when evaluating feature-based early prediction approaches.
A recurring observation in the reported experiments is the strong performance of median-based summaries, especially for shorter horizons such as
(
Table 2 and
Table 7). The fixed-prefix tables further show that, in the more irregular Dataset 2A, simple level-type summaries remain competitive already at
, whereas increasing the prefix length to
stabilizes the ranking and allows a variability-based step summary to slightly outperform the direct-prefix baseline. Compared with mean-based features, median statistics appear to provide a more stable characterization of early dynamics in the presence of oscillatory or heterogeneous trajectories. As the target horizon increases, a gradual shift toward mean-based residual summaries is observed, suggesting that longer-horizon prediction is more closely associated with the overall trajectory level than with very early transient behavior.
The cross-base transfer results (
Table 8 and
Table 9) provide a stricter comparison than within-base prediction. In this setting, residual and step summaries yield the lowest transfer errors among the tested feature families, with the median residual and median step features achieving the lowest WAPE values in the representative two-dataset screen for
. In contrast, the Lyapunov-based feature results in larger and less stable transfer errors, suggesting that it is less effective as a standalone transferable descriptor in the present benchmark setting. The two-dataset transfer matrices also reveal an asymmetry between source and target datasets: some configurations are more difficult as targets than as sources, indicating that the observed predictive relationships do not transfer uniformly across the considered parameter regimes.
From a practical perspective, the results suggest that, in controlled parameterized settings of the type considered here, useful early convergence outcome predictors can be built from relatively simple solver-derived quantities. In particular, residual- and step-based summaries are inexpensive to compute and provide transparent information about the early state and evolution of the iteration. This supports the use of simple feature representations as a first screening tool for parameter configurations, without requiring more complex trajectory-derived indicators as standalone predictors.
A natural scenario is repeated parameter exploration, where many related nonlinear solves must be executed over a grid or during calibration loops. In such settings, an early prediction layer could be used to flag parameter configurations that appear likely to converge within a prescribed number of iterations, or conversely configurations that may require further attention before the maximum iteration count is reached. Similar ideas may also be relevant in repeated steady-state computations or inverse-problem workflows, where the same solver is called many times under changing parameters. These examples should be interpreted as plausible application scenarios rather than validated application claims for the present benchmark, and direct domain-specific validation remains an important topic for future work.
The present study is intentionally limited to controlled scalar benchmark problems and to a fixed parameterized root-finding setting. This restriction makes it possible to isolate feature-level effects, but it also means that the results should not be interpreted as establishing a general diagnostic methodology for nonlinear solvers. In particular, the present analysis does not identify root causes of solver failure, such as ill-conditioning, stiffness, bifurcation effects, or basin boundary phenomena; rather, it studies the early prediction of convergence within prescribed iteration horizons from solver-derived features.
The study also focuses on single-feature predictors and on a simple fixed regression model used as a predictive probe. This provides a controlled setting for comparing feature representations, but it does not address the potential benefits of feature combinations, alternative learning models, or adaptive decision rules. In addition, although the cross-base experiments provide a first indication of transfer behavior within the considered benchmarks, broader validation across different equation classes, higher-dimensional nonlinear systems, PDE-based problems, and solver families remains an important direction for future work. Extending the analysis to more heterogeneous problem settings may further clarify when early solver-derived information can support practical solver selection or parameter screening.
In the considered benchmarks, early prediction is strongly associated with simple level-based features, especially mean and median residual and step summaries.
The fixed-prefix scalar summary analyses at and indicate that substantial predictive information can already be present at very short prefixes, with only modest additional gains by for the leading direct-prefix families.
Dataset structure has a strong effect on predictive difficulty, with the oscillatory benchmark providing a more discriminative test case than the smoother polynomial benchmark.
Median-based summaries are particularly competitive at shorter horizons, while mean residual summaries become more relevant for longer-horizon targets.
Residual- and step-based summaries show lower transfer errors than the Lyapunov-based feature in the considered cross-base experiments.
In the more irregular oscillatory benchmark, compact scalar summaries can occasionally match or slightly outperform the full-prefix representation, suggesting that scalar compression may suppress part of the transient variability present in the full-prefix.
In controlled parameterized settings of the type studied here, simple and computationally inexpensive solver-derived features can provide useful early predictors of convergence outcomes.
6. Conclusions
This work studied the early prediction of convergence outcomes in a controlled parameterized root-finding setting. The analysis was based on solver-level targets defined as success fractions at prescribed iteration horizons, namely , , and . This formulation shifts the focus from predicting a profile-derived score to examining how early solver-derived features relate to convergence outcomes at different iteration horizons.
The results show that, within the considered benchmarks, simple scalar summaries of the residual and step sequences are among the most informative predictors. In particular, mean and median log residual and log step features provide strong within-base performance in the reported experiments. The fixed-prefix analysis at and , together with the continuous curves extending to , indicates that substantial predictive information can already be present in very short prefixes. In the more irregular oscillatory benchmark, compact single-summary predictors can occasionally match or slightly outperform the corresponding full-prefix models, suggesting that scalar compression may suppress part of the transient variability present in the full-prefix representation. By contrast, more elaborate trajectory-derived descriptors, including Lyapunov-based features, provide weaker standalone predictive performance in the present experiments. Overall, these findings suggest that, for the targets considered here, much of the useful predictive signal is captured by relatively simple early trajectory summaries.
The analysis also highlights the importance of benchmark structure. While some configurations lead to nearly saturated prediction performance, the more irregular oscillatory benchmark exposes clearer differences between feature families and provides a more discriminative evaluation setting. In addition, the cross-base transfer experiments show that out-of-dataset prediction is more difficult than within-dataset prediction. Residual- and step-based summaries achieve the lowest transfer errors among the tested feature families, whereas the Lyapunov-based feature degrades more substantially. This underscores the gap between within-base predictability and transfer across the considered benchmark problems.
From a practical standpoint, these findings suggest that, in controlled parameterized settings of the type studied here, simple solver-derived features can provide useful early predictors of convergence outcomes. Such features are computationally inexpensive and may support the preliminary screening of parameter configurations before the maximum iteration count is reached. This possible use should be interpreted as a direction for further investigation rather than as a validated deployment claim.
Future work will extend the present analysis in several directions, including the study of multi-feature models, alternative predictive probes, broader cross-problem validation, and adaptive decision rules. A key next step is to assess whether the observed feature rankings and early prediction behavior persist for higher-dimensional nonlinear systems, PDE-based problems, and broader solver families. Another important direction is to move beyond convergence outcome prediction toward diagnostic analyses that can identify the possible causes of solver failure, such as ill-conditioning, stiffness, bifurcation effects, or basin boundary phenomena.