Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle

Takahashi, Kazuhiko; Imamura, Yuta; Hashimoto, Masafumi

doi:10.3390/engproc2026134096

Open AccessProceeding Paper

Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle^†

by

Kazuhiko Takahashi

^1,*,

Yuta Imamura

¹ and

Masafumi Hashimoto

²

¹

Information Systems Design, Faculty of Science and Engineering, Doshisha University, Kyoto 610-0394, Japan

²

Intelligent Information Engineering, Faculty of Science and Engineering, Doshisha University, Kyoto 610-0394, Japan

^*

Author to whom correspondence should be addressed.

^†

Presented at the 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025), Yunlin, Taiwan, 14–16 November 2025.

Eng. Proc. 2026, 134(1), 96; https://doi.org/10.3390/engproc2026134096 (registering DOI)

Published: 13 May 2026

(This article belongs to the Proceedings of The 7th Eurasia Conference on IoT, Communication and Engineering 2025 (ECICE 2025))

Download

Browse Figures

Versions Notes

Abstract

We investigated the potential of multi-layer extreme learning machines (MLELMs) for trajectory tracking in autonomous unmanned vehicles, focusing on an unmanned surface vehicle (USV). MLELMs were used to design position and attitude controllers within a two-loop control architecture, ensuring that the USV accurately follows a reference trajectory. The performance of an MLELM-based controller in a tracking task was evaluated via numerical simulations of a USV dynamic model governed by nonlinear equations. The computational results confirmed the feasibility of the MLELM to accomplish this task with appropriate accuracy, demonstrating its potential applicability in control systems.

Keywords:

extreme learning machine; tracking control; two-loop control; unmanned surface vehicle

1. Introduction

Deep neural networks (DNNs) have become the standard for addressing complex real-world problems across various domains, achieving near state-of-the-art performance in various applications [1,2], such as image processing, speech recognition, and natural language processing, owing to their high capability to learn from large datasets. However, increasing the size of datasets to attain improved accuracy increases the complexity of DNN model parameters, resulting in increased computational, memory, and energy consumption requirements. To address these issues, lightweight network architectures such as extreme learning machines (ELMs) [3] and echo state networks (ESNs) [4] have gained considerable attention as alternatives to DNNs. ELMs are data-driven learning algorithms for single-layer feedforward neural networks (NNs) with random hidden neurons, while ESNs are recurrent NN models within reservoir computing frameworks. The lightweight structure and high training speeds of these networks [5] make them suitable for resource-constrained environments. Consequently, they have been successfully applied to nonlinear function approximation and temporal data processing in many applications [6,7].

In control engineering, NNs are popular and powerful tools for nonlinear signal processing and system identification and control owing to their nonlinear mapping, learning, and generalisation capabilities. Although DNNs have been applied to address control problems in nonlinear dynamic systems, they occasionally encounter challenges due to the aforementioned constraints. Considering their success in addressing nonlinear systems, ELMs are considered a promising alternative for effectively handling the dynamic relationships in data related to the control of nonlinear systems. In particular, multi-layer ELMs (MLELMs) [8], which comprise stacked hidden layers tuned via autoencoders, enable propagation of the hierarchical representation of external inputs to the last layer and combine the expressiveness of deep architectures for the inputs with learning efficiency. The widespread adoption of MLELMs has demonstrated their effectiveness in various real-world applications [9].

This study investigated the potential of MLELMs for control system applications. As a practical example, an MLELM-based controller was designed for an autonomous unmanned vehicle, particularly an unmanned surface vehicle (USV). The control objectives were accurate reference trajectory tracking and attitude control. Although numerous control techniques for USVs, including feedback, adaptive, nonlinear, and intelligent control, have been reported in the literature [10,11], we employed a control architecture comprising two parallel loops. In this architecture, position control is implemented in the first loop to generate the desired translational forces based on the reference trajectory, while attitude control is executed in the second loop to produce desired torques for ensuring that the USV’s orientation aligns with the reference attitude computed by a line-of-sight guidance law. The control functions for each loop were approximated by separate MLELMs. The performance of the MLELM-based controller was then evaluated through computational experiments.

2. Multi-Layer Extreme Learning Machine

Currently, three approaches are available for creating a multi-layered ELM structure: random mapping, kernel correntropy strategy, and conditional probability strategy. This study adopts the random mapping approach to design an MLELM.

The forward calculation of the MLELM between the

(i - 1)

-th and

i

-th layers (

1 \leq i \leq M

) can be expressed as follows:

\begin{array}{r} x^{(i)} (k) = f (Γ^{(i)} x^{(i - 1)} (k)), \end{array}

(1)

where

x^{(i)} (k) \in R^{N^{(i)}}

is the state vector in the

i

-th layer at time

k

,

N^{(i)}

indicates the number of neuron units in the

i

-th layer,

Γ^{(i)} \in R^{N^{(i)} \times N^{(i - 1)}}

represents the connection weight matrix between the (

i - 1

) and

i

-th layers and

f (\cdot)

is a component-wise activation function of a neuron. When an external input with

L

components is provided to the MLELM by

s (k) \in R^{L}

, it is assigned to the vector

x^{(0)}

wherein

N^{(0)}

equals to

L

. After random initialisation, an autoencoder is applied to tune the connection weight matrix. Subsequently, ridge regression is applied to minimise the objective function, defined as

J_{i} = \sum_{k = 1}^{K} ∥ Γ^{(i)^{T}} x^{(i)} (k) - x^{(i - 1)} (k) ∥_{2}^{2} + β_{i} ∥ Γ^{(i)^{T}} ∥_{2}^{2}

, which yields Equation (2).

\begin{array}{r} Γ^{(i)^{T}} = X^{(i - 1)} X^{(i)^{T}} (X^{(i)} X^{(i)^{T}} + β_{i} I)^{- 1}, \end{array}

(2)

where

X^{(i)} = [\begin{matrix} x^{(i)} (1) & \dots & x^{(i)} (K) \end{matrix}] \in R^{N^{(i)} \times K}

,

K

indicates the data length,

I

is an identity matrix and

β_{i} \in R_{\geq 0}

is a regularisation parameter.

The output of the MLELM

σ (k) \in R^{N}

can be expressed as follows:

\begin{array}{r} σ (k) = W x^{(M)} (k), \end{array}

(3)

where

W \in R^{N \times N^{(M)}}

represents the connection weight matrix between the

M

-th and output layers. The matrix

W

is trained offline using a precollected dataset of the desired input–output sets to minimise the objective function

J = \sum_{k = 1}^{K} ∥ d (k) - σ (k) ∥_{2}^{2} + β ∥ W ∥_{2}^{2}

via ridge regression as follows:

\begin{array}{r} W = D X^{(M)^{T}} (X^{(M)} X^{(M)^{T}} + β I)^{- 1}, \end{array}

(4)

where

D = [\begin{matrix} d (1) & \dots & d (K) \end{matrix}] \in R^{N \times K}

,

d (k)

is the desired output vector and

β \in R_{\geq 0}

is a regularisation parameter.

3. Computational Experiments for Tracking Control of USV

Considering a six-degree-of-freedom mathematical model of a USV as the control target, the equations of motion can be given as follows [12]:

\begin{array}{r} M_{o} \dot{ν} + C_{o} (ν) ν + D_{o} (ν) ν + g (η) = F, \end{array}

(5)

where

ν \in R^{6}

comprises the linear (surge, sway, heave) and angular (roll, pitch, yaw) velocities in the body frame,

η \in R^{6}

comprises the position

p = [\begin{matrix} x & y & z \end{matrix}]^{T}

and orientation

ϑ = [\begin{matrix} ϕ & θ & ψ \end{matrix}]^{T}

in the inertial frame,

M_{o} \in R^{6 \times 6}

is the inertia matrix including added mass,

C_{o} (ν) \in R^{6 \times 6}

is the Coriolis and centripetal matrix,

D_{o} (ν) \in R^{6 \times 6}

is the hydrodynamic damping matrix comprising linear and quadratic terms,

g (η) \in R^{6}

represents the restoring forces owing to gravity and buoyancy and

F \in R^{6}

is the vector of forces generated by the thrusters. The relationship between the parameters in both frames is given by

\dot{η} = [\begin{matrix} R (ϑ) & 0 \\ 0 & T (ϑ) \end{matrix}] ν,

where

R (ϑ) \in S O (3)

is the rotation matrix from the body to the inertial frame and

T (ϑ) \in R^{3 \times 3}

is the transfer matrix, defined as follows:

T (ϑ) = [\begin{matrix} 1 & s i n ϕ t a n θ & c o s ϕ t a n θ \\ 0 & c o s ϕ & - s i n ϕ \\ 0 & \frac{s i n ϕ}{c o s θ} & \frac{c o s ϕ}{c o s θ} \end{matrix}]

(6)

The heave, roll, and pitch of surface vehicles can often be neglected or passively controlled as these vehicles usually operate with small roll and pitch angles. Consequently, the USV assumes linearised static stability for heave, roll, and pitch based on the steady-state equilibrium of gravity and buoyancy, resulting in

F_{i} = 0

(

i = 3, 4, 5

) and

z \approx 0

. The USV employs two stern thrusters and one bow thruster to control its position and attitude. The thrust vector

u \in R^{3}

is related to the force vector through the following allocation matrix:

F = [\begin{matrix} 1 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \\ - b_{1} & b_{1} & - b_{2} \end{matrix}] u

(7)

where

b_{1}

and

b_{2}

denote the offset distances from the body centre to the stern and bow thrusters, respectively. Herein, a reversible thrust with upper and lower limits is assumed.

A two-loop control architecture is adopted for USV trajectory tracking. The first loop performs position control and generates the desired surge and sway forces according to the reference trajectory. The second loop conducts attitude control, generating a desired yaw torque to ensure that the vehicle orientation aligns with the desired yaw angle designed by the line-of-sight guidance law. In the control system, separate MLELMs are used for the position and attitude controllers. The MLELM constituting the position controller, hereafter referred to as MLELM1, uses the external input

s = {[\begin{matrix} e_{p_{x}} & e_{p_{y}} & {\dot{e}}_{p_{x}} & {\dot{e}}_{p_{y}} & \int e_{p_{x}} d t & \int e_{p_{y}} d t \end{matrix}]}^{T}

, where

e_{p_{x}}

and

e_{p_{y}}

are the

x

- and

y

-direction position errors in the inertia frame, respectively, which are defined using the difference between the reference position

p_{d}

and the USV’s position

p

. Subsequently, the outputs of the MLELM1 are assigned as the desired forces, i.e.,

F_{i} = σ_{i}

(

i = 1, 2

). The other MLELM constituting the attitude controller, hereafter referred to as MLELM2, uses the external input

s = {[\begin{matrix} e_{ϑ_{ψ}} & {\dot{e}}_{ϑ_{ψ}} & \int e_{ϑ_{ψ}} d t \end{matrix}]}^{T}

, where

e_{ϑ_{ψ}}

is the attitude error in the inertia frame, which is defined using the difference between the reference angle

ϑ_{d}

and the USV’s orientation

ϑ

. The output of MLELM2 is assigned as the desired torque, i.e.,

F_{6} = σ_{1}

.

Both MLELMs are pre-trained offline in advance using pre-collected datasets obtained through point-to-point (PTP) control of the USV, wherein proportional–integral–derivative controllers with empirically tuned gains are employed for both position and attitude control. In addition, the MLELMs undergo online training to compensate for errors during the control process. The MLELMs are implemented as discrete-time models with a sampling interval of

Δ t

, and online training is conducted for each sampling instance to minimise the cost functions

E_{k} = \frac{1}{2} (e_{p_{x}}^{2} + e_{p_{y}}^{2} + {\dot{e}}_{p_{x}}^{2} + {\dot{e}}_{p_{y}}^{2})

and

E_{k} = \frac{1}{2} (e_{ϑ_{ψ}}^{2} + {\dot{e}}_{ϑ_{ψ}}^{2})

for MLELM1 and MLELM2, respectively.

In the numerical experiments, the mass of the USV was set to 30 kg, and the inertia tensor was set to 1.2, 1.3, and 1.5

{k g m}^{2}

around each axis of the body frame. A disturbance force vector with a uniform distribution in the range

[- 1, 1]

was added to Equation (5). In the controller, MLELM1 comprised seven input and two output units, while MLELM2 comprised four input units and one output unit. The number of hidden layers was identical for both networks, and the number of neurons in MLELM2 was set to half that of MLELM1 by considering the input and output numbers of both networks. To account for the neuron threshold, a constant value of one was added to the external input component of each network. The training datasets for the MLELMs were collected by conducting PTP control experiments with an operational area defined by

- 6 \leq x \leq 6

and

- 6 \leq y \leq 6

. This area was divided into a grid dividing the

x

and

y

axes into six equal segments. Experiments were conducted from the origin on every grid vertex. This process, involving 48 target positions with

100

samples per trial collected over a fixed duration of 5 s, yielded

4.8 \times 10^{3}

datasets. In the control experiments, the sampling interval for the MLELMs was 50 ms. The regularisation parameters

β_{i}

and

β

used for offline training of both networks were set to an identical value of

{1 \times 10}^{- 4}

. The learning factors for online training of both networks were set to

1 \times 10^{- 2}

. Tracking control experiments were conducted under the following two conditions. In the first condition, the reference trajectory comprised a poly line and squared sin wave, and the initial condition of the USV was set to

p (0) = [\begin{matrix} - 7 & 5 & 0 \end{matrix}]^{T}

and

ϑ (0) = [\begin{matrix} 0 & 0 & \frac{π}{2} \end{matrix}]^{T}

. In the second condition, the reference trajectory was a figure-eight-shaped curve defined using the Bernoulli lemniscate, and the initial condition of the USV was set to

p (0) = [\begin{matrix} 0 & - 3 & 0 \end{matrix}]^{T}

and

ϑ (0) = [\begin{matrix} 0 & 0 & \frac{π}{2} \end{matrix}]^{T}

. In both conditions,

T_{0}

denotes the terminal time of simulation, which was set to 20 s.

Figure 1 shows the simulation results for tracking control using the MLELM-based controller under the first condition. The controller configuration used three hidden layers, with 200 and 100 neurons per layer for MLELM1 and MLELM2, respectively. Despite attitude errors at the point where the reference trajectory changed, small position errors, and thrust saturation during large control errors, the controller ensured that the USV followed the reference trajectory. This result confirms the feasibility of using MLELMs for USV tracking control.

Figure 2 shows an evaluation of the effect of hidden layer configurations on controller performance using the mean squared error (MSE) of the position and attitude of the USV. Although increasing the number of neurons enhances control performance, the number of layers does not affect control performance when sufficient neurons are provided. This indicates that when using MLELMs, deep architectures are not necessarily useful for accomplishing this task.

Figure 3 and Figure 4 present the simulation results for tracking control under the second condition and the relationship between the hidden layer configurations and control performance, respectively. As shown in Figure 3, the MLELMs used a network topology identical to that of the first condition (Figure 1). Although the trajectory’s continuously changing curvature induced attitude errors and resulting position error fluctuations, the MLELM-based controller completed the tracking task. This result demonstrates the adaptability of the MLELM-based controller. The effect of the hidden layer configurations on control performance exhibits a similar trend to that observed under the first condition (Figure 4).

4. Conclusions

We explored the practical application of MLELMs in control systems, focusing on USV trajectory tracking. The MLELMs, created using random mapping, were implemented as the position and attitude controllers within a two-loop control architecture. The performance of the MLELM-based controller was evaluated through computational experiments in which the USV was tasked with tracking a reference trajectory. The simulation results confirmed the feasibility of employing MLELMs to realise the control system and demonstrated that the MLELM-based controller effectively achieved its objectives with acceptable performance. Future research is necessary to explore hardware implementation and testing of the proposed control system in real USVs.

Author Contributions

Conceptualization, K.T.; methodology, K.T.; software, K.T. and Y.I.; validation, K.T. and Y.I.; formal analysis, K.T. and Y.I.; investigation, K.T. and Y.I.; resources, K.T.; data curation, K.T.; writing—original draft preparation, K.T.; writing—review and editing, K.T.; visualization, K.T.; supervision, K.T.; project administration, K.T. and M.H.; funding acquisition, K.T. and M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Martín, A.; Camacho, D. Recent advances on effective and efficient deep learning–based solutions. Neural Comput. Appl. 2022, 34, 10205–10210. [Google Scholar] [CrossRef] [PubMed]
Mienye, I.D.; Swart, T.G. A comprehensive review of deep learning: Architectures, recent advances, and applications. Information 2024, 15, 755. [Google Scholar] [CrossRef]
Huang, G.B.; Zhu, Q.Y. Extreme learning machine: Theory and applications. Neurocomputing 2006, 70, 489–501. [Google Scholar] [CrossRef]
Lukoševičius, M.; Jaeger, H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 2009, 3, 127–149. [Google Scholar] [CrossRef]
Stulp, F. Many regression algorithms, one unified model: A review. Neural Netw. 2015, 69, 60–79. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Lu, S.; Wang, S.H.; Zhang, Y.D. A review on extreme learning machine. Multimed. Tools Appl. 2022, 81, 41611–41660. [Google Scholar] [CrossRef]
Sun, C.; Song, M.; Cai, D.; Zhang, B.; Hong, S.; Li, H. A systematic review of echo state networks from design to application. IEEE Trans. Artif. Intell. 2024, 5, 23–37. [Google Scholar] [CrossRef]
Kasun, L.L.C.; Zhou, H.; Huang, G.-B.; Vong, C.M. Representational learning with extreme learning machine for big data. IEEE Intell. Syst. 2013, 28, 31–34. [Google Scholar]
Vásquez-Coronel, J.A.; Mora, M.; Vilches, K. A review of multilayer extreme learning machine neural networks. Artif. Intell. Rev. 2023, 56, 13691–13742. [Google Scholar] [CrossRef]
Xu, H.; Soares, G. Review of path–following control systems for maritime autonomous surface ships. J. Mar. Sci. Appl. 2023, 22, 153–171. [Google Scholar] [CrossRef]
Qiao, Y.; Yin, J.; Wang, W.; Duarte, F.; Yang, J.; Ratti, C. Survey of deep learning for autonomous surface vehicles in marine environments. IEEE Trans. Intell. Transp. Syst. 2023, 24, 3678–3701. [Google Scholar] [CrossRef]
Fossen, T.I.; Fjellstad, O.-E. Nonlinear modelling of marine vehicles in 6 degrees of freedom. Math. Model. Syst. 1995, 1, 17–27. [Google Scholar] [CrossRef]

Figure 1. Simulation results of tracking trajectory for the USV using the MLELM-based controller under the first condition. In (a), the reference and actual positions are denoted using black dotted and purple lines, respectively, and the triangle illustrates the attitude of the USV at 1 s intervals. In (b), the top, middle and bottom figures show the position errors, attitude errors and thrusts, respectively.

Figure 2. Relationship between the number of units in hidden layers and MSE under the first condition, where the (left) and (right) figures show the MSE for position and attitude, respectively.

Figure 3. Simulation results of tracking trajectory for the USV using the MLELM-based controller under the second condition. In (a), the reference and actual positions are denoted using black dotted and purple lines, respectively, and the triangle illustrates the attitude of the USV at 1 s intervals. In (b), the top, middle and bottom figures show the position errors, attitude errors and thrusts, respectively.

Figure 4. Relationship between the number of units in hidden layers and MSE under the second condition, where the (left) and (right) figures show the MSE for position and attitude, respectively.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Takahashi, K.; Imamura, Y.; Hashimoto, M. Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle. Eng. Proc. 2026, 134, 96. https://doi.org/10.3390/engproc2026134096

AMA Style

Takahashi K, Imamura Y, Hashimoto M. Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle. Engineering Proceedings. 2026; 134(1):96. https://doi.org/10.3390/engproc2026134096

Chicago/Turabian Style

Takahashi, Kazuhiko, Yuta Imamura, and Masafumi Hashimoto. 2026. "Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle" Engineering Proceedings 134, no. 1: 96. https://doi.org/10.3390/engproc2026134096

APA Style

Takahashi, K., Imamura, Y., & Hashimoto, M. (2026). Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle. Engineering Proceedings, 134(1), 96. https://doi.org/10.3390/engproc2026134096

Article Menu

Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle^†

Abstract

1. Introduction

2. Multi-Layer Extreme Learning Machine

3. Computational Experiments for Tracking Control of USV

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle †

Abstract

1. Introduction

2. Multi-Layer Extreme Learning Machine

3. Computational Experiments for Tracking Control of USV

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Remarks on the Application of Multi-Layer Extreme Learning Machines for Controlling an Autonomous Unmanned Vehicle^†