1. Introduction
The control of robotic systems has traditionally relied on accurate and comprehensive mathematical models that describe a robot’s behavior under a wide range of operating conditions. In their seminal work, Slotine and Li [
1] laid the foundation for adaptive control, which assumes complete knowledge of system parameters, including mass distribution, frictional forces, actuator dynamics, and other critical characteristics. However, in real-world applications, acquiring such complete and precise models is challenging due to the inherent complexity of robot–environment interactions. Factors such as external disturbances, sensor noise, and unmodeled dynamics (e.g., frictional nonlinearities, structural flexibilities, and edge effects) can significantly degrade the performance of traditional controllers as noted by Siciliano et al. [
2].
Computed torque control (CTC), in particular, has emerged as one of the most effective model-based control strategies. Under the assumption of complete and precise modeling, this method demonstrates superior tracking accuracy and disturbance rejection. However, despite their effectiveness under ideal conditions, even minor inaccuracies or unmodeled dynamics can lead to performance degradation in model-based control. This has spurred research into data-driven approaches that aim to overcome these limitations by learning from real-time sensor data. Data-driven methods for CTC have gained traction as powerful alternatives to traditional model-based approaches in complex environments or modeling uncertainties. Unlike classical CTC, which relies on precise knowledge of the dynamic model of a robot (involving inertia, Coriolis, and gravity terms), data-driven techniques leverage machine learning to approximate these dynamics directly from sensor data, allowing robust control even with incomplete or inaccurate models. In particular, Gaussian Process Regression (GPR) [
3] and deep neural networks [
4] were employed to learn inverse dynamics models that replace or augment analytical formulations in CTC, thus enabling robust control even with incomplete or inaccurate models. Hybrid methods that combine physics-based models with data-driven corrections (e.g., residual learning) have shown improved generalization and safety. Overall, data-driven CTC methods offer increased flexibility, adaptability, and performance in modern robotic systems. Recent work by Kiumarsi et al. [
5] and Chen et al. [
6] has demonstrated that data-driven control strategies can adapt to changing environments and unmodeled dynamics, offering improved performance when the full model is not available.
Other recent approaches for robot control include deep reinforcement learning-based control strategies [
7] and comprehensive surveys of data-driven robotic control methods [
8]. Feedforward networks approximate nonlinear mappings from state control to state derivatives, serving as black-box dynamics models for model-based control or planning [
9,
10]. These approaches require training on a significant amount of data to generalize. Recurrent neural networks (RNN, LSTM, and GRU) capture temporal dependencies and partial observability, useful when dynamics depend on unmeasured states or previous history of control actions [
11]. Non-parametric Gaussian processes (GPs) provide efficient probabilistic models that quantify uncertainty, supporting safe exploration and data-efficient learning [
12,
13]. These studies explore model-based policy search methods that incorporate a probabilistic dynamics model to express uncertainty. However, GP models are computationally expensive and scale poorly with data. Their application in robotics is limited to smooth and non-switching dynamics.
Physics-informed neural networks (PINNs) embed physical laws (expressed by partial differential equations, e.g., Euler–Lagrange equations) into the loss function or the network architecture, intending to improve data efficiency and generalization. PINNs have found significant applications in fluid dynamics, where they enable the accurate simulation of Navier–Stokes equations without requiring traditional mesh-based methods [
14]. In biomedical engineering, PINNs assist in estimating parameters in models of blood flow and tissue mechanics, offering non-invasive diagnostic tools [
15]. PINNs obviate the need for acquiring extensive experimental data for model training when the governing equations are known [
16]. In the field of robotics, PINNs have been used for dynamic model prediction and parameter identification for collaborative robots [
17]. In another study, PINN were employed to handle nonconservative effects for dynamic modeling and control of complex robotic systems [
18]. By combining PINN with model-based controllers, precise control performance close to theoretical stability bounds was achieved. Recently, PINNs were used to build an efficient surrogate model that, when coupled with a nonlinear model predictive controller, enabled real-time optimization in legged locomotion [
19].
Lagrangian and Hamiltonian neural networks enforce energy conservation constraints in physical systems. As a result, the Hamiltonian network trains faster and generalizes better than a regular neural network [
11]. At the same time, learning physics models for model-based control requires robust extrapolation from fewer samples. A deep Lagrangian network can learn the equations of motion of a mechanical system efficiently while ensuring physical plausibility, and performs very well in robot tracking control [
4]. However, Lagrangian networks only model conservative forces that do not include friction, damping, and contact effects.
This study presents a novel framework for the modeling and control of robotic systems based on data from real-time sensors to account for unmodeled dynamics. We describe how the parameters of the robot manipulator can be estimated online, followed by a CTC controller design based on the constructed model. The proposed approach eschews the need for complete a priori knowledge of system parameters, providing a viable solution for unknown environments where traditional control methods are not adequate.
Although numerous model-based and data-driven approaches have been proposed for robotic control, existing studies still face key limitations. Classical CTC assumes complete or partial knowledge of the robot dynamics, while purely data-driven models often require large offline datasets and lack physical consistency. Hybrid methods combining analytical and learning-based models partially address these issues but remain task-specific or limited in scope. To overcome these challenges, this paper introduces a unified data-driven framework for online estimation and control of robotic manipulators that merges the interpretability of physics-based models with the adaptability of modern learning. Our main contributions are as follows:
Online estimation of gravity, Coriolis, frictional, and inertial effects directly from trajectory control data with no prior parameter knowledge.
Use of PINNs for model learning to ensure physical consistency and stability.
Integration of these estimates into a CTC structure for adaptive and robust tracking.
The rest of the paper is organized as follows. Control methodologies for robot manipulators are discussed in
Section 2, followed by online estimation of the robot parameters in
Section 3.
Section 4 presents the results for the online estimation of the gravity, Coriolis, and inertial matrices, followed by a conclusion in
Section 5.
3. Online Estimation of Model Parameters
In this section, we describe how the various parameters in the robot model, that is, the inertial, Coriolis, friction, and gravity parameters, can be estimated by recording robot trajectories in real-time.
3.1. Estimation of Gravity
To estimate the gravitational torques
in the robot dynamics, we exploit the fact that when the robot is in a steady state, the measured joint velocity
and joint acceleration
are zero. Under these conditions, the dynamic model simplifies considerably. In particular, if the robot is held in a fixed position by a controller, the inertial term
and the Coriolis/centrifugal term
vanish, and friction effects are minimal or can be assumed negligible. Thus, in steady state, the applied control torque is primarily used to balance the gravitational torque, i.e.,
To generate the required data for the estimation of gravity, we use a simple PD control law. In this setup, the desired velocity and acceleration are set to zero. The PD control law for each joint is given by
where
and
are positive definite gain matrices,
is the desired final position.
During the experiment, the robot is controlled using this PD law. Once it reaches a steady state (that is, when and ), the applied torque compensates primarily for gravity. At this point, the joint positions q and the corresponding control torque are recorded. Although the final position may not exactly match , it still provides an acceptable data point. Repeating this process in various static positions in the workspace yields a set of data points .
The gravity vector
is estimated using a Multilayer Perceptron (MLP) Regressor. This model learns the nonlinear mapping from the joint position to the steady-state torque. The MLP Regressor utilizes fully connected layers. To optimize performance and ensure robustness, the model’s architecture and learning parameters were selected via Randomized Search Cross-Validation (
) [
21]. The search space included varying the number of layers/neurons (e.g.,
,
), activation functions (‘relu’, ‘tanh’), L2 regularization (
), and initial learning rate. The best model parameters found are summarized below:
Model Type: MLP Regressor.
Optimization: Adam solver, with a maximum of 5000 iterations.
Best Architecture: hidden layer size = (100, 100), with relu activation.
Regularization: = 0.001
Before training, both the input positions and the target torques were standardized using a StandardScaler. This crucial preprocessing step ensures that all features have zero mean and unit variance, accelerating the convergence and improving the numerical stability during gradient descent optimization.
Stability Considerations for Gravity Estimation
For the robot manipulator with PD control with no gravity compensation, the Lyapunov function is typically defined as
This Lyapunov function
V is always positive (except at equilibrium when it is zero), and its time derivative is given by
which is non-positive, indicating that the total “energy” of the system decreases over time. Hence, the closed-loop system is stable under PD control without gravity compensation. Although the PD controller ensures stability, it exhibits a steady-state offset due to uncompensated gravitational torques. This behavior is consistent with the data generation objective, which focuses on the static equilibrium states of the manipulator.
3.2. Estimation of Combined Coriolis and Friction Effects (C + f)
The combined effects of Coriolis, centrifugal, and friction forces are denoted as
To estimate these forces, we take advantage of the data collected during robot motion at a constant speed. Under such conditions, the acceleration is approximately zero (
), so the dynamic model in (
1) simplifies to
Assuming that external disturbances
are negligible or have been filtered out, and using our previously estimated gravity model
, we can isolate the combined term by subtracting the estimated gravitational component from the measured torque:
To generate a constant-speed motion trajectory, we consider implementing the control law given by a PID Position Controller with Estimated Gravity Compensation:
where
is a positive definite gain matrix for the integral action. For constant-speed motion, the desired trajectory is defined as
, and
is its integral. This control law ensures that the torque
compensates for gravity while the integral term generates the necessary steady-state torque to overcome unmodeled Coriolis and friction effects, driving the position and velocity errors to zero. Once the control action is applied at a constant speed, the term
effectively captures the combined dynamics:
where
denotes the residual gravity estimation error. By collecting data during these stable, constant-speed movements, we record the joint positions
q, the joint velocities
, and compute the total residual torque
, which is dominated by the unmodeled dynamic effects. This yields a dataset
, which is used to train a regression model to predict the combined Coriolis and friction terms based on
q and
.
Stability Considerations for Coriolis and Friction Effects
The controller incorporates the learned gravity term and is expressed as
Defining the position error
, velocity error
, and integral error
, the closed-loop dynamics can be written as
where
contains all uncompensated terms, including Coriolis, friction, and residual gravity. Consider the Lyapunov function candidate
Differentiating
V and substituting the closed-loop error dynamics together with the skew-symmetry property
being skew-symmetric (so that
), yields
If
on the operating set, then This establishes Uniform Ultimate Boundedness [
20] of the tracking errors, with the ultimate bound tightened by larger
or smaller
. Hence, the closed-loop system remains stable, and the joint velocities converge to
, enabling reliable collection of torque data.
3.3. Estimation of the Inertia Matrix
Once the gravitational, Coriolis, and frictional torques have been factored out, the inertial torques can be estimated during the normal operation of the robot, i.e., sensor data obtained during joint acceleration and deceleration.
The training data for the physics-informed neural network (PINN) was generated through a high-fidelity simulation of a two-link robotic manipulator under closed-loop control. This section details the key components of the data generation pipeline.
3.3.1. Closed-Loop Trajectory Control Design
During the inertia estimation phase, the torque applied to the manipulator is generated by the controller, which combines the learned dynamic model with feedback regulation. The control law is expressed as
Here, the feedforward terms , , and compensate for the Coriolis, gravitational, and frictional effects, while the PID feedback terms ensure a stable and sufficiently excited motion. At each sampling instant, the applied control torque and the joint states are recorded.
To isolate the inertial component of the dynamics, the predicted gravity, Coriolis, and friction terms are subtracted from the total applied torque, yielding the residual torque
This residual term corresponds to the torque portion responsible for generating joint acceleration, i.e., the inertial contribution , up to modeling and estimation errors. Physically, represents the effective torque that drives acceleration once all other dynamic effects have been compensated by the learned models.
Consequently, each recorded sample
captures the instantaneous relationship between joint accelerations and the corresponding inertial torque. The complete dataset is defined as
where
N denotes the total number of recorded data samples collected during all simulation runs. This dataset serves as the basis for learning the inertia matrix
through the physics-informed neural network. The proposed approach ensures that the collected data directly represent the inertial dynamics of the manipulator, while the feedback controller maintains stability and provides sufficient excitation for accurate parameter identification.
3.3.2. Physics-Informed Inertia Matrix Estimation
The inertia matrix is a fundamental component of robotic dynamics that encodes the mass distribution and coupling effects between joints. Traditional methods of estimating require precise knowledge of the robot’s physical parameters, which are often unavailable in practice. We present a data-driven approach using neural networks to learn the inertia matrix directly from operational data.
This study presents a PINN approach for learning the inertia matrix of a robotic manipulator. The method combines data-driven learning with fundamental physics constraints to ensure physically consistent predictions. By embedding the structure of the Euler–Lagrange equations into the neural network architecture and training process, we achieve accurate parameter estimation while maintaining physical plausibility. We implement a feedforward neural network
that maps joint configurations to the unique elements of the symmetric inertia matrix:
The inertia matrix is then constructed to ensure physical consistency:
This parameterization guarantees the following:
Positive definiteness through exponential terms on the diagonal.
Symmetry by construction.
Continuous and differentiable dependence on the joint configuration q
The PINN architecture predicts the unique elements of the inertia matrix. The PINN training incorporates two loss terms:
- 1.
- 2.
Physics consistency loss:
where
is computed via automatic differentiation. The Jacobian computation is achieved via TensorFlow’s automatic differentiation [
22]. The total loss is
where
is the tuning scalar hyperparameter coefficient that balances how much emphasis is placed on the physics-consistency term versus the purely data-fitting term in the total loss.
3.3.3. Stability Considerations for Estimation of Inertial Dynamics
Using the robot dynamics and defining the position error
, the velocity error
, and the integral of the position error
, The closed-loop error dynamics become
where
represents the bounded modeling error, with
. Using the Lyapunov candidate function
and the skew-symmetry property
, the time derivative of
V satisfies
where
denotes the smallest eigenvalue of the positive-definite gain matrix
, representing the minimum damping level in the control system. Hence, the tracking error is UUB [
20]. Increasing the damping gain
or improving the accuracy of the learned models (reducing
) tightens this bound. This controller ensures stable tracking performance without requiring knowledge of the inertia matrix, while still benefiting from learned compensation of gravitational, Coriolis, and frictional effects.
4. Results and Discussion
All simulations were conducted using a planar two-link robotic manipulator modeled with standard rigid-body dynamics. The mechanical and inertial parameters of each link are listed in
Table 1. These values are consistent with benchmark models widely used in robotic control studies.
The controllers used during data generation are summarized in
Table 2. The proportional–derivative (PD) controller was employed for static equilibrium tests to estimate gravitational torques, whereas the velocity–integral controller was used for constant-velocity experiments to isolate Coriolis and frictional effects. The computed-torque controller, incorporating the learned dynamics, was used for trajectory-tracking experiments.
Simulation results for online parameter estimation in the case of a two-link robotic manipulator are presented below, starting with online estimation of gravitational torques, followed by the estimation of Coriolis and inertia matrices.
4.1. Estimation of Gravitational Torques
We conducted 20,000 simulations to construct the dataset
. Eighty percent of the dataset was used to train the optimized model, and a dedicated 20% held-out test set was used to assess its generalization capability across the operational space.
Figure 1 illustrates the PD control true robot joint torques
versus the predicted joint torques
when the manipulator reached various static configurations. The trained multi-layer perceptron (MLP) model predicts the static torques required to hold a two-link robotic arm at desired joint angles with very high accuracy, as evidenced by a mean squared error (MSE) of approximately 0.01048 (Nm)
2 and an
value of about 0.9997 on the test set. This near-perfect alignment is also evident in the scatter plots of predicted versus true torque components, where the data points lie almost exactly on the ideal
line. The simulations rely on a PD controller to drive the arm to various final positions within
for each joint, effectively sampling a wide range of configurations. Because the final torques in this setup largely compensate for gravity (with a small PD offset), the collected dataset captures the essential information for learning the torque–angle relationship.
The sample size of 20,000 simulated configurations was determined based on the model’s complexity and empirical convergence analysis. Following standard neural network heuristics (
, where
W denotes the number of trainable parameters), the adopted two-hidden-layer MLP with approximately 12,000 parameters would nominally require on the order of
samples for exhaustive coverage of the configuration space. However, due to the smoothness and low stochasticity of the simulated torque–angle mapping, 20,000 samples were experimentally found to ensure stable generalization with no significant accuracy gain beyond this threshold. Model hyperparameters were tuned using 5-fold cross-validation within the
RandomizedSearchCV [
21] framework to guarantee robust parameter selection. All simulations were performed in Python 3.10 on Google Colab using a fourth-order Runge–Kutta integrator for trajectory computation.
4.2. Estimation of Coriolis and Friction Effects
A total of 44,548 samples were generated through numerical simulations of a two-link robotic manipulator operating under constant-velocity conditions. Each simulation lasted 10 s with a sampling period of 0.01 s, producing approximately 1000 time steps per run. A total of 3000 simulations were executed in parallel, each with randomly initialized joint angles and velocities, and distinct desired joint velocity vectors
drawn uniformly within
rad/s. At each simulation step where the velocity tracking error satisfied
, a data record was saved. The collected dataset contains six columns representing both the robot’s configuration and control signals:
Here,
are the joint positions,
are the corresponding joint velocities, and
denotes the total control torques computed by the velocity-integral controller that includes proportional and integral feedback terms, as well as gravity compensation through the pre-trained model
. The resulting dataset, consisting of 44,548 samples and 6 features, provides a rich and diverse collection of robot motion and control data suitable for learning dynamic models such as the Coriolis and friction components. To isolate the Coriolis and friction term, the exact combined term,
, was obtained by subtracting the predicted gravitational torque,
, from the total measured torque
as expressed by
The optimized Random Forest (RF) Regressor model [
23,
24], which predicts the combined torque contribution of the Coriolis and friction effects,
was trained to approximate
using the collected dataset containing configuration and control signals. The model was optimized through
RandomizedSearchCV with 5-fold cross-validation and achieved a test mean squared error (MSE) of
(N·m)
2 and an
score of
. These results demonstrate high accuracy in capturing the nonlinear coupling between joint velocities and torques. The model learns the mapping from
to
, effectively modeling the difference between the total measured torque and the predicted gravity torque. Scatter plots in
Figure 2 of exact versus predicted values further confirm the strong alignment between the model’s predictions and the ground truth, showing that the learned models for
and
track the true dynamics closely, even in the presence of small unmodeled effects.
To evaluate robustness, Gaussian noise of varying intensity was added to the input data. As the noise intensity doubled, the estimation error increased significantly: the MSE rose from (N·m)2 with no noise () to (N·m)2 at , (N·m)2 at , and (N·m)2 at , corresponding to roughly a increase in error when the noise intensity was doubled. This behavior highlights a nonlinear correlation between noise level and estimation accuracy. Despite these promising results, the study has several limitations, including a relatively small dataset, a single-joint model that does not capture multi-DOF coupling, and fixed load conditions that overlook the effect of payload variations. Future work should therefore adopt a joint-wise estimation with a global fusion framework to extend scalability to multi-joint systems, integrate data augmentation through simulated trajectories under different load and speed conditions, implement noise-aware training to enhance robustness, and explore hybrid modeling approaches that combine analytical dynamic equations with learned residuals to improve both accuracy and interpretability.
4.3. Estimation of Inertial Dynamics
The parameters used for the simulation and data collection for estimation of inertial dynamics are summarized in
Table 3. These settings ensure a realistic operating range for the two-link manipulator and provide a sufficient signal-to-noise ratio for accurate estimation of inertial effects.
To estimate the inertial dynamics, a persistently exciting reference trajectory was generated as
with excitation frequencies
rad/s and amplitudes
. This multi-frequency design ensures excitation of all dynamic modes, while the phase shift between joints prevents correlated motion. Control inputs were saturated at
N·m to simulate realistic actuator limits.
To mimic sensor noise, Gaussian perturbations were added to all measured signals:
Samples were retained only when the joint acceleration satisfied rad/s2, as low-acceleration regions contribute little information to inertia estimation. After filtering, approximately samples remained, providing a diverse and informative dataset for learning the inertia matrix.
Figure 3 compares the exact and predicted elements of the inertia matrix, while
Table 4 summarizes the inertia-matrix prediction performance across three representative robot configurations. The
home position corresponds to the nominal posture
, the
extended configuration represents the fully stretched pose
, and the
singularity occurs when the two links are vertically aligned
. The exponential parameterization guarantees positive definiteness, with the MSE between
and
(N·m)
2 and a maximum relative error below
. These results confirm that the inertia model’s prediction accuracy lies within the standard
tolerance commonly accepted in robot dynamics identification.
4.4. Sensitivity and Robustness Analysis
To assess stability, the proposed PINN was trained under various physical-consistency weights
. As summarized in
Table 5, the mean inertia-matrix reconstruction error decreased from
at
to
at
, confirming that moderate physical regularization improves both convergence and generalization. A higher value
offered no further benefit, indicating an optimal balance around
. Five repeated runs yielded a standard deviation
, demonstrating result repeatability. Boundary-condition experiments showed limited degradation: the MSE increased by approximately
under doubled noise intensity,
for higher-frequency trajectories, and less than
for a
gravity bias. These results confirm the robustness of the proposed PINN estimator to measurement noise, excitation variations, and parametric uncertainty.
Overall, the experiments demonstrate that incorporating a physical-consistency term significantly improves inertia-matrix learning stability and robustness. Moreover, moderate weighting () yields the best accuracy while maintaining low sensitivity to measurement noise and modeling uncertainties.
4.5. Manipulator Control with Estimated Dynamics
In this section, we present the outcome of the two-link robot’s movement when controlled using only the data-driven estimates of the inertia matrix , the Coriolis/centrifugal matrix , and the gravity vector .
Figure 4 illustrates the performance of the manipulator in following a desired trajectory. It is observed that despite relying solely on the learned model, the robot’s joints (solid) closely follow the desired paths (dashed) over the entire 10 s maneuver. This confirms that
,
, and
provide a sufficiently accurate estimate of the manipulator dynamics to achieve effective trajectory tracking.
5. Conclusions
Traditional control of robotic systems has often been predicated on the assumption of complete knowledge of system parameters. In reality, unmodeled dynamics, friction, and external disturbances usually degrade performance. This study presented a data-driven framework that learns key components of robot dynamics from operational data. Using real-time sensor information, the proposed method provides an effective alternative to purely model-based control, particularly in scenarios where obtaining an accurate model is impractical or impossible. Future research could focus on extending these techniques to high-dimensional systems, incorporating uncertainty quantification, and exploring advanced machine learning architectures for improved generalization.
Future work will focus on extending the proposed framework toward scalable, uncertainty-aware, and experimentally validated robotic control. For high-dimensional manipulators, we plan to employ a distributed PINN structure in which joint-level estimators are fused through a global coordination network to ensure scalability. To quantify model reliability, techniques such as Bayesian dropout and Gaussian-Process residual modeling will be explored for estimating both epistemic and aleatoric uncertainties, enabling probabilistic confidence bounds on predicted torques. The incorporation of attention-based and transformer-enhanced PINNs is expected to capture long-range coupling and improve generalization to multi-joint systems. Further extensions include transfer learning and domain adaptation to facilitate the reuse of learned models across different robot platforms and environments, thereby reducing retraining costs. The integration of hybrid gray-box modeling, combining analytical rigid-body dynamics with data-driven residual estimators, will improve interpretability and data efficiency. Experimental implementation on a physical manipulator will provide validation under variable payload, friction, and noise conditions, while energy-based and Hamiltonian constraints will enforce physical consistency.
Finally, the deployment of uncertainty-aware computed-torque controllers on embedded or edge hardware will be investigated to achieve real-time operation in industrial settings. Collectively, these developments will advance the proposed framework toward a generalizable, robust, and physically grounded solution for adaptive control of complex robotic systems.