Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

Özbaltan, Mete; Özbaltan, Nihan; Bıçakcı Yeşilkaya, Hazal Su; Demir, Murat; Şeker, Cihat; Yıldırım, Merve

doi:10.3390/biomimetics10060346

Open AccessArticle

Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

by

Mete Özbaltan

^1,*,†

,

Nihan Özbaltan

^2,†

,

Hazal Su Bıçakcı Yeşilkaya

^1,†

,

Murat Demir

^1,†

,

Cihat Şeker

^1,†

and

Merve Yıldırım

^3,†

¹

Department of Electrical and Electronics Engineering, Faculty of Engineering and Architecture, İzmir Bakırçay University, 35665 İzmir, Türkiye

²

Department of Computer Engineering, Faculty of Engineering and Architecture, İzmir Bakırçay University, 35665 İzmir, Türkiye

³

Department of Software Engineering, Faculty of Engineering, Karadeniz Technical University, 61080 Trabzon, Türkiye

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Biomimetics 2025, 10(6), 346; https://doi.org/10.3390/biomimetics10060346

Submission received: 23 April 2025 / Revised: 15 May 2025 / Accepted: 17 May 2025 / Published: 24 May 2025

Download

Browse Figures

Versions Notes

Abstract

Task scheduling for multiple humanoid robot manipulators in industrial and collaborative settings remains a significant challenge due to the complexity of coordination, resource sharing, and real-time decision-making. In this study, we propose a framework for modeling task scheduling for multiple humanoid robot manipulators by using the symbolic discrete controller synthesis technique. We encode the task scheduling problem as discrete events using parallel synchronous dataflow equations and apply our synthesis algorithms to manage the task scheduling of multiple humanoid robots via the resulting controller. The control objectives encompass the fundamental behaviors of the system, strict rules, and mutual exclusions over shared resources, categorized as the safety property, whereas the optimization objectives are directed toward maximizing the throughput of robot-processed products with optimal efficiency. The humanoid robots considered in this study consist of two pairs of six-degree-of-freedom (6-DOF) robot manipulators, and the inverse kinematics problem of the 6-DOF arms is addressed using metaheuristic approaches inspired by biomimetic principles. Our approach is experimentally validated, and the results demonstrate high accuracy and performance compared to other approaches reported in the literature. Our approach achieved an average efficiency improvement of 40% in 70-robot systems, 20% in 30-robot systems, and 10% in 10-robot systems in terms of production throughput compared to systems without a controller.

Keywords:

humanoid robot manipulators; task scheduling; multiple degrees of freedom; inverse kinematics; ANN; optimization algorithms; symbolic discrete controller synthesis

1. Introduction

Humanoid robots are programmed to mimic the actions and movements of humans, making them eligible for any kind of application, from industrial automation to personal assistance. They have sensors, actuators, and control units that enable them to perform complicated motions and act in response to their surroundings much like a human being.

Humanoid robots face several challenges that are frequently emphasized in the literature, including achieving stable locomotion and balance to operate effectively across diverse environments (e.g., walking on various surfaces), ensuring effective human–robot interaction for performing human-friendly tasks, demonstrating robustness and adaptability to execute complex movements such as crab walk and possessing cognitive capabilities for decision-making, object recognition, and responding to given commands.

To address these challenges, the humanoid robot design simulation is usually performed prior to physically creating the robot. Task scheduling is one of the most important issues in humanoid robot design. Task scheduling is the method of assigning and coordinating tasks between human and robot operators to realize maximum productivity while ensuring seamless collaboration. In the literature, task scheduling solutions have been predominantly explored through heuristic approaches, metaheuristic (MH) techniques, exact algorithms, simulations, and hybrid methodologies. For instance, Pereira et al. [1] present an algorithm using the GRASP MH for task scheduling and assignment in human–robot collaborative settings to demonstrate its effectiveness in productivity and flexibility improvement. Similarly, Pupa et al. [2] present a resilient online task scheduling framework that adapts to deviation and error in real time during execution to facilitate efficient human–robot collaboration.

Task scheduling in multiple humanoid robots to improve operational efficiency has significantly high practical benefits in real-world applications. By optimizing task allocation and coordination, industries such as manufacturing, healthcare, and services can achieve enhanced security and precision. For example, in the healthcare sector, humanoid robots can assist with patient mobility or provide surgical assistance, enabling them to be utilized in critical tasks with high precision and reliability.

Our research idea is derived from a comprehensive literature review and aims to address the challenges and problems outlined above. In this context, our research question is the following: How can task scheduling be optimized to enhance the overall performance and efficiency of collaborative robots?

In the literature, such problems are typically addressed either using classical control methods, which often suffer from high computational complexity, or metaheuristic approaches, which may introduce inaccuracies. In contrast, we observed that the task scheduling problem for multiple humanoid robots can be modeled as a feedback control problem. Accordingly, we propose a symbolic discrete controller synthesis approach that offers significantly better computational performance compared to classical control methods. Furthermore, unlike metaheuristic techniques, our method ensures the synthesis of a controller with formally verified correctness, thereby achieving high accuracy in task scheduling.

Inverse kinematics (IK) is another highly significant area of humanoid robot design that deals with determining the joint angles required to achieve a desired robot end-effector position. Optimal solutions to the IK problem are vital to guarantee correct and smooth motion of the robots. O’Flaherty et al. [3] provide thorough forward and IK derivation for the HUBO2+ humanoid robot in the context of being able to control a 27 degree-of-freedom (DOF) robot. Moreover, Ali et al. [4] introduce a closed-form IK joint solution for humanoids, presenting a generalized and efficient method which can be applied to various humanoid platforms. Said et al. [5] obtain an explicit, omnidirectional, analytical, and decoupled closed-form solution for lower limb kinematics of humanoid robot NAO. Their solution separates position and orientation analysis from the total Denavit–Hartenberg (DH) transformation matrices, thereby allowing efficient computation of joint angles.

The 6-DOF robotic arm consists of six serial links, making it capable of operating at any angle or along any trajectory. With computer technology developing at a high rate, the control of 6-DOF robotic arms is increasingly becoming challenging. Current control methods feature sophisticated algorithms to effectively address both inherent errors of the robotic arm and external disturbances. MH approaches to 6-DOF robotic arm control and optimization involve the utilization of sophisticated optimization algorithms to enhance the performance and precision of such robotic arms. Such approaches are very effective in addressing complex problems such as trajectory planning, error correction, and compensation for external disturbances. To minimize errors and improve stability, a fractional-order PID (Proportional Integral Derivative) controller [6], computer vision-based calibration framework, quasi-physical model, frequency response function, deep reinforcement learning methods, and gray wolf optimization algorithm (GWO) were utilized on 6-DOF robotic arms.

The use of control algorithms for robotic manipulators through simulation programs is a common topic studied by researchers. A neural network solver for kinematic control systems has been developed on the V-REP simulation platform [7]. In [8]; the focus is on the implementation of PID controllers in the MATLAB Simscape environment, and in [6] the MATLAB Simulink environment has been used. In this paper, the discrete controller synthesis technique has been used in simulation platforms such as MATLAB, Python, and Reax and in physical environments. In addition, artificial neural networks have been used to solve inverse kinematics and task planning problems in this study. In the proposed model, the artificial neural network has been optimized with the gray wolf optimization algorithm and superior results have been obtained.

The DCS technique was developed by Ramadge and Wonham in 1989 and was asserted as a theoretical framework for formal language models [9]. In this framework, synthesized controllers were proposed for specific control purposes. This technique has subsequently been used in many modeling methods, such as automata, Petri nets, and finite-state machines. In this context, Balemi et al. [10] have adopted an input-—output viewpoint, representing systems as finite-state machines or automata. An alternative approach was introduced by Maraninchi et al. [11], who proposed a synchronous language framework grounded in automata theory. However, these methodologies have faced significant limitations, particularly due to the state explosion problem, which poses challenges to scalability and practical implementation. In response to these challenges, a symbolic methodology based on labeled transition systems was proposed in [12]. This marked the emergence of symbolic discrete control techniques, which initially focused exclusively on Boolean variables. Subsequent studies, including References [13,14], offered solutions grounded in the synthesis algorithm. Building upon this foundation, a symbolic framework capable of addressing infinite-state systems was later introduced in the study by Berthier et al. [15] using the ReaX environment.

The literature comprises a broad spectrum of DCS work. Some of the notables among them are the use of current-state and initial-state opacity [16], the use of current-state opacity verification over time-labeled Petri nets [17], the integration of reinforcement learning and supervised control theories with DCS for the control of train traffic [18], its use in dynamic timed automata [19], and automata-based DCS for reconfigurable systems [20].

In the literature concerning the application of DCS in humanoid robots, Ishimizu et al. [21] have employed this method for the control of drones, specifically for takeoff, landing, and battery management. Additionally, Li et al. [22] utilized the DCS approach to regulate the movements of the robot. Zudaire et al. [23] explored how controller synthesis can facilitate the transition from old mission objectives to new ones, as well as the architectural restructuring necessary to incorporate new software actuators and sensors when required. Ref. [24] has identified a significant gap in the existing literature concerning the application of task scheduling in robotic studies. In the study, Gueye et al. [25] employed a discrete control approach based on automata to manage unmanned vehicles. In their methodology, they developed a model by implementing the DCS technique using an FPGA and integrating a task scheduling layer. Ref. [26] used DCS and task scheduling methods together to ensure that no tasks can be scheduled on a faulty processor.

Our study tackles the IK problem by framing it as an optimization challenge. We employ ANNs to develop the IK model and use MH algorithms to optimize the network’s weights and biases. This hybrid ANN-MH approach significantly enhances both accuracy and generalizability, effectively addressing the limitations of traditional IK solvers. Moreover, we extend this approach to address the task scheduling problem for multiple humanoid robots operating within a closed-loop system that integrates pipeline and parallel processes. By treating the task scheduling problem as a feedback control issue using symbolic DCS, we encode uncontrolled system behaviors as parallel dataflow equations in synchronous programming languages and incorporate desired specifications as control objectives. These objectives include safety algorithms and optimization goals, ensuring optimal token planning in pipeline channels during task scheduling. The synthesized controller, validated in both physical and simulation environments, guarantees desired system behaviors. The motivation for our study arises from the identified gap in the literature regarding the application of task planning in robotic studies, as noted in [24,27]. Addressing this gap is essential for advancing the field of robotics, particularly in improving the efficiency and reliability of humanoid robots in complex operational environments.

Contributions: Our integrated methodology not only closes gaps in task scheduling and IK for humanoid robot-identified gaps but also provides considerable improvements in performance, reliability, and efficiency.

Symbolic DCS for Task Scheduling: We employ symbolic DCS to manage task scheduling for humanoid robots, ensuring strict rules and mutual exclusions through safety algorithms and maximizing energy efficiency in the shortest time possible.
Synchronous Language Modeling: Synchronous programming languages simulate humanoid robot systems as a task scheduler manager through generating a controller using DCS and implementing it in the robot system.
IK with MH Algorithms: The IK of robot joints is addressed with metaheuristic algorithms for enhancing the performance of multiple-degree-freedom robot arm manipulators.
ANN for IK Model Development: We develop the inverse kinematics model using ANN and couple it with MH algorithms to train the network weights and biases for achieving maximum accuracy and generalizability.
Experimental Validation: Our method is experimentally validated with high accuracy and performance compared to other methods in the literature.

The remainder of the paper is organized as follows. The next subsection provides a comprehensive literature review. Section 2 provides an overview of the general technical background of humanoid robots. Section 3 elaborates on how the inverse kinematics problem is addressed and presents detailed models of task scheduling among humanoid robots. Section 4 reports the results of our experimental evaluations. At last, Section 5 concludes the paper and suggests directions for future work.

1.1. Related Work

1.1.1. Task Scheduling

Numerous studies in the literature have explored diverse applications of robotics across a variety of domains, as demonstrated in works such as [28,29,30]. Building on the historical evolution of humanoid robots and the fundamental challenges in their development, recent research has increasingly focused on equipping these systems with intelligent control strategies and learning-based approaches. These efforts are primarily driven by the growing need to cope with the complexity of tasks performed in dynamic and unstructured environments [31,32,33]. The overall goal is to enhance humanoid robots with greater flexibility, robustness, and adaptability. Various control models have been explored in the literature to improve motion and task execution in robotic systems. Among them, two-degree-of-freedom (2-DOF) systems have gained popularity due to their simplicity and suitability for real-world applications such as helicopter dynamics. In particular, 2-DOF helicopter platforms provide a valuable benchmark for evaluating novel control strategies in nonlinear and coupled dynamics, especially in discrete-time environments [34].

Ref. [35] presents LoRaStat, a low-power, portable potentiostat integrated with LoRaWAN for wireless communication, optimized for long-term monitoring of ammonium, nitrate, and calcium levels in aqueous environments. Ref. [36] presents scheduling algorithms that optimize job scheduling to maximize weight for batteryless IoT devices, addressing energy constraints with polynomial-time solutions and NP-Hardness for general cases. Ref. [37] demonstrates that with proper MOCAP calibration, wearable sensors can accurately monitor knee range of motion in TKA patients, offering a reliable alternative to traditional methods. Ref. [38] proposes a robust two-layer distributed adaptive learning control strategy for multi-agent robotic manipulators, enabling trajectory tracking and learning of nonlinear uncertain dynamics in a leader-follower framework.

1.1.2. Inverse Kinematics Problem

In parallel with traditional control schemes, significant progress has been made in the field of estimating and tracking 6-DoF poses. However, many existing methods rely heavily on pre-defined CAD models or prior category knowledge, which limits their applicability to previously unseen objects. To overcome this limitation, BundleSDF is proposed as a co-designed method that integrates pose tracking with neural reconstruction. This model-free framework enables 3D reconstruction and causal 6-DoF tracking from monocular RGB-D sequences, even under challenging conditions such as occlusion, specular surfaces, or lack of texture [39]. Complementary to pose estimation, grasp generation in cluttered scenes has also received considerable attention. Contact-GraspNet addresses the limitations of previous approaches by providing an end-to-end network that predicts stable and diverse 6-DoF grasp poses directly from raw depth images. In particular, it avoids reliance on object labels or instance segmentation, which enhances its ability to generalize in unstructured environments [40]. To support these learning-based approaches, the accessibility and quality of the datasets play a crucial role. Existing 6-DoF pose estimation benchmarks often suffer from limited accessibility or outdated object models, reducing their utility in real-world robotic manipulation tasks. The HOPE dataset fills this gap by providing high-quality textured 3D models of 28 household objects, together with challenging and cluttered scene configurations, to serve as a robust benchmark for pose estimation tasks [41].

A major challenge that remains is the inverse kinematics problem in redundant systems, where the number of joints exceeds the number of degrees of freedom needed to control the end effector. This redundancy results in multiple feasible joint configurations for a single target pose. While this flexibility has clear advantages, including improved obstacle avoidance, fault tolerance, and configuration flexibility, it also complicates the learning process. Traditional multi-solution IK approaches address this by partitioning the joint space into sub-regions, solving for each separately and then merging the results [42,43,44]. In this study, metaheuristic optimization algorithms are used to efficiently explore the high-dimensional joint space and determine optimal configurations for the desired end-effector objectives.

1.1.3. Symbolic Controller Synthesis

In terms of control-level improvements, the integration of advanced control strategies, intelligent optimization techniques, and accurate kinematic modeling has significantly enhanced the performance of humanoid robot manipulators. Three core research areas stand out: discrete controller synthesis (DCS), inverse kinematics (IK), and artificial neural network (ANN)-based optimization. DCS ensures the safe and deterministic coordination of robot actions [14,45], while solving the IK problem is essential for achieving accurate end-effector positioning [44], especially in systems with redundant degrees of freedom [42]. In addition, the combination of metaheuristic optimization techniques with ANN models has proved effective in dealing with the nonlinearities and real-time constraints commonly encountered in complex robotic systems. Recent developments in the synthesis of discrete-event controllers highlight the growing interest in combining formal verification techniques with data-driven learning. For example, the DEEPDECS framework proposed by Calinescu et al. enables correct-by-construction controllers for autonomous systems by integrating deep neural networks (DNNs) for perception [46]. Similarly, symbolic controller synthesis techniques have been applied to nonlinear and underactuated systems such as quadcopters, providing reliable path planning and control without the need for additional hardware [24,47,48,49,50]. These methods have shown superior performance compared to traditional fractional order PID (FOPID) controllers in both dynamic behavior control and trajectory generation tasks.

2. Technical Background of the Common Problems Encountered in the Task Scheduling of Humanoid Robots

Humanoid robots are systems that have a human-like physical structure and are designed to perform specific tasks. They are characterized by their ability to perform a given system task using their limbs. They are also capable of interacting with the environment and with each other and have a degree of autonomy. Accordingly, their interactions, both with the external environment and with other humanoid agents, and their task-planning processes must be managed in accordance with desired control objectives.

Humanoid robots usually consist of several robot manipulators with a high degree of freedom working in a coordinated manner. To enable controlled movement from one coordinate to another, the inverse kinematics (IK) problem must first be solved as a fundamental step in motion planning.

2.1. Six-Degree-of-Freedom Robot Manipulators

Humanoid robots are articulated manipulators operating with multiple degrees of freedom. One of their primary functions is to mimic the dexterity and flexibility of typical human limbs. In this study, a humanoid robot consisting of two 6-DOF manipulators working in a coordinated manner to perform complex tasks was used.

Six-degree-of-freedom humanoid robots can perform displacement motions such as sliding left/right on the X-axis, descending/ascending on the Y-axis, and moving forward/backward on the Z-axis. It can also perform rotational motions such as roll (

ϕ

) around the X-axis (arm rotation around itself), pitch (

θ

) around the Y-axis (tilting up and down) and yaw (

ψ

) around the Z-axis (left/right rotation). All movements of the robot are presented in Figure 1.

In this paper, an inverse kinematics (IK) model is proposed for the arms of a 6-DOF humanoid robot using artificial neural networks. Metaheuristic algorithms are used to optimize the weights and biases of the artificial neural network. This results in lower error rates, higher accuracy rates, and better solutions without getting stuck in local minima.

2.2. Inverse Kinematics Problem

There are two basic problems that restrict the movements of 6-DOF humanoid robots. The first of these is the forward kinematics problem, where the position of the end effector is determined according to the given joint angles. The second is the inverse kinematics problem, where all the joint angles required to bring the end effector to the desired position are calculated.

In the inverse kinematics problem, which is also the subject of this study, the joint angles

θ = [θ_{1}, θ_{2}, \dots, θ_{n}]

need to be calculated in order to bring the end effector to the desired position

P_{target}

. Mathematically, the joint angles are calculated as follows:

f (θ) = P_{target}

(1)

where

f (θ)

represents the forward kinematics function. Forward kinematics problems are solved using trigonometric or algebraic methods and have a single solution. However, inverse kinematics problems are solved using closed-form or iterative methods. They have more than one solution or no solution at all.

In order to reduce the complexity of traditional methods and produce faster solutions, the inverse kinematics problem is solved using artificial neural networks. The inputs of the network are the positions of the end effector of the 6-DOF humanoid robot in free space, and the outputs are the joint angles corresponding to these positions. The positions of the end effector corresponding to random joint angles are calculated using forward kinematics formulas. The artificial neural network is trained with the obtained data. The weights and biases of the network are optimized using metaheuristic algorithms. Metaheuristic algorithms increase both the accuracy rate of artificial neural networks and improve error metrics (such as MSE and

R^{2}

).

2.3. Artificial Neural Network (ANN) Model

The artificial neural network model developed within the scope of this study consists of three components. The input layer contains information about the position of the end effector of the 6-DOF humanoid robot. In the hidden layer, the input data are processed and inferences are made. The learning process takes place in this layer. The output layer contains the joint angles estimated by the network. The three-layer artificial neural network structure is given in Figure 2. The outputs of the network are calculated using a standard feedforward structure whose equation is given below.

y = f (W x + b)

(2)

Here, W is the weight matrix, x is the input vector (i.e., the position of the end effector), b is the bias term, and

f (\cdot)

is the nonlinear activation function. The weights and biases of the network are optimized using metaheuristic algorithms. In this way, the optimum artificial neural network structure in terms of the number of layers and learning rate is created.

2.4. Metaheuristic Approaches

The metaheuristic algorithms utilized in this study are as follows.

Particle Swarm Optimization (PSO): It has been observed that the movements of birds and fish moving in flocks to find food affect other individuals in the flock, and the flock reaches its goal more easily. Inspired by this observation, it is an optimization algorithm developed in 1995 [51]. The best solution is achieved by changing the speeds and positions of the particles.

Ant Colony Optimization (ACO): This algorithm is inspired by mimicking the behaviors and interactions of individual ants in an ant colony [52]. Ants leave trails in their environment to reach food sources. These trails help other ants find the right route, while the trailing ants adjust their movements according to the density of the trails. The ant colony algorithm similarly creates a roadmap by searching for possible solutions and using a series of signs and trials to reach better solutions. Probabilistic decision-making mechanisms work while trying to reach the optimum solution.

Artificial Bee Colony (ABC) Algorithm: This algorithm models the foraging behavior of honey bees [53]. Bees tend to explore new food sources based on the information shared by other bees in order to find the most efficient source. When all known food sources are exhausted, the bees continue their search randomly in different locations.

Gray Wolf Optimization (GWO): This algorithm is based on the hunting model of a gray wolf pack [54]. There are four types of wolves, alpha, beta, delta, and omega, according to their role in the pack. The alpha holds the greatest ’authority’ in decision-making and leading the pack. Following the alpha are the beta and delta wolves, which are subordinate to the alpha yet dominant over other members. The omega wolf is always subordinate to the more dominant wolves in the hierarchy.

Coyote Optimization Algorithm (COA): It is a population-based algorithm classified both as a swarm intelligence and an evolutionary heuristic approach [55]. The algorithm is inspired by behaviors of coyotes. Unlike the GWO algorithm, the COA is based on a different algorithmic structure. The COA does not focus on social hierarchy and dominance rules. Instead of merely emphasizing hunting behavior, it also incorporates social structures and experience sharing among individuals.

All of the aforementioned metaheuristic algorithms were employed to optimize the weight and bias parameters of the ANN model. The obtained results were evaluated using the Mean Squared Error (MSE) method:

MSE = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}

(3)

where

y_{i}

represents the actual joint angles, while

{\hat{y}}_{i}

denotes the predicted joint angles.

In addition to MSE, the performance of each model was also evaluated using the coefficient of determination (

R^{2}

). This enabled a better approximation of the predicted outputs to the actual values. In the MSE method, the squared differences between the actual and predicted joint angles are summed and divided by the number of samples. When calculating

R^{2}

, the ratio of the residual sum of squares to the total sum of squares is computed and then subtracted from 1. A lower MSE value and an

R^{2}

value closer to 1 indicate the accuracy and generalizability of the proposed inverse kinematics model. These evaluation metrics provide a comprehensive understanding of the predictive performance of ANN models optimized using different metaheuristic algorithms.

3. Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

This study addresses task scheduling for multiple humanoid robots through the use of the symbolic discrete controller synthesis technique. The workflow of our study is presented in Figure 3. As illustrated in the figure, two distinct problems (inverse kinematics and task scheduling) commonly encountered in robot design are addressed separately, and solutions developed for these two problems are then integrated. Our study is structured into a five-phase modeling framework, as shown in the figure.

In the first phase, the system behaviors that have not yet been controlled are modeled both as a discrete controller synthesis problem and within a virtual robotic environment. Additionally, a database is created for the joint angles and target positions of the robots.

The system specifications (control objectives) are then associated with our plant model, and through the integration of the developed scheduling algorithms, a controller is generated using our safety and optimization synthesis algorithms. This controller is subsequently implemented within the virtual robotic environment for task scheduling operations. Furthermore, the inverse kinematics problem in the robots is addressed by utilizing the database, and the degrees of the joints are determined using the developed MH algorithms to achieve the desired position. This process is also integrated into the virtual robotic environment. The results obtained from the simulation environment are validated in a physical setting.

The modeling framework presented below is systematically detailed, providing an in-depth overview of the approach.

3.1. Overview

This study focuses on optimizing task scheduling for multiple humanoid robots operating within a closed-loop system as part of a pipeline. In this context, the inverse kinematics problem for each limb of the humanoid robots is addressed as the first step.

In the designs we considered in this work, the inverse kinematics problem is addressed using 6-DOF robot manipulators as an example for the limbs of humanoid robots. Initially, a database is created for the joint angles and target positions of the 6-DOF robot manipulators. Subsequently, various metaheuristic approaches and hybrid models commonly used in the literature are employed to complete the training process for our database. Finally, during the implementation phase in both physical and simulation environments, the trained models are used to generate output angles, ensuring that the humanoid robots can optimally achieve the necessary joint angles to move from one position to another.

In this study, humanoid robots are considered as two-pair 6-DOF manipulators, and the task scheduling problem for multiple humanoid robots operating within a closed-loop system that combines pipeline and parallel processes is addressed. The inverse kinematics problem for each limb of each humanoid robot, as discussed in the previous context, is solved using metaheuristic approaches. The task scheduling problem of multiple humanoid robots, on the other hand, is treated as a feedback control problem using the symbolic discrete controller synthesis technique. Initially, the uncontrolled system behaviors for the humanoid robots are encoded as parallel dataflow equations in synchronous programming languages. Subsequently, the desired specifications are incorporated into the model as control objectives. These control objectives fall into two categories: the first includes safety algorithms, such as strict rules and mutual exclusions, while the second focuses on optimization objectives, which are used to plan the tokens in the channels of the pipeline in the most optimal manner during task scheduling. Once the model is complete, during the compilation process, the controller, which always ensures the desired system behaviors, is synthesized by applying the safety and optimization algorithms. Finally, this synthesized controller is integrated into the physical or simulation environment to be validated.

3.2. Inverse Kinematics Model of 6-DOF Robot Manipulators

Within the scope of this study, the humanoid robots are considered as two pairs of 6-DOF manipulators. The general view of the 6-DOF manipulator and humanoid robot is shown in Figure 4. In this context, the kinematic problem in humanoid robots is also addressed through 6-DOF manipulators.

The mathematical model of the 6-DOF manipulator is expressed as follows. The homogeneous transformation matrix given in Equation (10) is obtained using all transformation matrices given in Equations (4) to (9).

R_{k}

and

T_{k}

represent the rotation matrix and the translation matrix of joint k, respectively.

R_{x} = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & cos α & - sin α & 0 \\ 0 & sin α & cos α & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(4)

T_{x} = [\begin{matrix} 1 & 0 & 0 & a \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(5)

R_{y} = [\begin{matrix} cos β & 0 & sin β & 0 \\ 0 & 1 & 0 & 0 \\ - sin β & 0 & cos β & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(6)

T_{y} = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & b \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(7)

R_{z} = [\begin{matrix} cos γ & - sin γ & 0 & 0 \\ sin γ & cos γ & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

(8)

T_{z} = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & c \\ 0 & 0 & 0 & 1 \end{matrix}]

(9)

T = \prod_{k = 1}^{n} R_{k} T_{k}

(10)

The physical 6-DOF manipulator used in this study, along with its kinematic scheme when all joint angles are set to zero, link lengths, and joint angle limits, is depicted in Figure 5.

In this study, various optimization algorithms are employed to solve the inverse kinematics of 6-DOF manipulators. A multilayer artificial neural network (ANN) model is used, with artificial neurons positioned between the input and output layers, and a single hidden layer containing 10 neurons. Additionally, optimization processes involve evolutionary algorithms such as Particle Swarm Optimization (PSO), which is based on social interactions between particles; Ant Colony Optimization (ACO), which uses pheromone-based communication among artificial ants; Artificial Bee Colony (ABC), inspired by the behavior of honeybees searching for nectar; Gray Wolf Optimization (GWO), based on the hunting behavior of wolves; and the Coyote Optimization Algorithm (COA), which mimics the social structure and hunting methods of coyotes. These algorithms are effectively applied to solve the inverse kinematics problem of 6-DOF robot manipulators for finding optimal solutions.

Swarm intelligence algorithms are highly effective in controlling the behavior of robotic systems, particularly for avoiding local minima during complex optimization tasks. In the literature, algorithms such as PSO, ACO, ABC, GWO, and COA have demonstrated strong performance in solving inverse kinematics problems by providing robust and accurate solutions. In this study, optimization was performed using these algorithms, where the joint angles generated through inverse kinematics were validated by comparing the corresponding end-effector positions obtained via forward kinematics. This approach ensures both the accuracy and reliability of the proposed kinematic solutions.

3.3. Symbolic Model of Humanoid Robot Manipulators

Task scheduling optimization in humanoid robots has been modeled as a symbolic discrete controller synthesis problem. The system design, as illustrated in Figure 6, considers robots arranged in both pipeline and parallel configurations, where the workflow proceeds through two separate sub-lines. The final robot in the sequence processes tokens arriving from both lines. However, to ensure optimal performance, the tokens arriving from both branches must be processed in a balanced manner, requiring synchronization between the execution times of the robots along each line. This necessitates a cost-efficient optimization procedure. In line with this objective, the workflow scenarios as depicted in the figure are systematically modeled, and discrete controller synthesis is carried out in detail, as presented below.

3.3.1. Activation Model

The activation model is designed to control whether each robot in a closed system is actively performing its assigned task. This model is formulated as a parallel synchronous dataflow equation, as shown below:

R_{i} = if c_{i} then A_{a} else A_{p}, \forall i \in {1, 2, \dots, | R |},

(11)

where

R

denotes the set of robots within the system, and

| R |

represents the total number of robots. The variable c is a controllable input parameter, where each robot is associated with an individual control input

c_{i}

. The domain

A

is a custom-defined Boolean-like domain with two discrete values,

a, p

, where

A_{a}

indicates that a robot is in an active state, and

A_{p}

indicates that the robot is in a passive state.

3.3.2. Mutual Exclusion Constraints

The grammar that can be used to formalize the constraints related to the interactions between collaborative robots working on shared resources, as well as their environmental relations, is formulated as follows:

(12)

where the grammar represented by

G

illustrates the formalization of the system; the symbol

: =

denotes the definition of the corresponding grammar;

T

represents any transition defined within the system model, such as transitioning a robot into an active state.

3.3.3. Token Model

The tokens exchanged between humanoid robots operating on a pipeline are formalized in the following dataflow equation:

\begin{matrix} P_{i} = & (R_{i} = A_{a}) \land e_{i}; \end{matrix}

(13)

\begin{matrix} D_{i} = & (R_{i j} = A_{a}) \land e_{j}, \end{matrix}

(14)

where

P_{i}

represents the tokens processed by robot i;

D_{i}

denotes the delivered tokens of robot i;

R_{i j}

represents the robot j that delivers to robot i; and e indicates the completion of a process on a token, after which it is fired.

The tokens processed by humanoid robots exist in an infinite domain. However, for higher-performance coding, the tokens are abstracted in a finite domain, as illustrated in Figure 7.

To provide a more detailed explanation of our task scheduling algorithm, the symbolic discrete controller synthesis approach begins by modeling the uncontrolled system behaviors and the desired specifications as parallel dataflow equations. This modeling captures strict rules governing robot behaviors, including conditions such as mutual exclusions, as well as optimization objectives within the system.

3.3.4. Control

After modeling the system behaviors and desired specifications, controller synthesis is performed through the application of our safety and optimization synthesis algorithms.

In this context, our safety objectives such as the activation model and mutual exclusion constraints are formulated as follows:

S_{g l o b .} = ⋀ S_{i},

(15)

where

S_{i}

represents all the safety objectives in our desist model.

S_{g l o b .}

, on the other hand, ensures that evaluation is performed on controllable variables while remaining at a constant value of 1, acting as an invariant.

With the cost function formulated as shown below, the robots are coded to consume the maximum number of tokens with the optimal cumulative cost through our optimization algorithm:

O = \sum_{i \in | R |} R_{i} ⊥ {L, M, H},

(16)

where ⊥ represents the quantity of tokens present in the channels between the humanoid robots.

3.3.5. Integration

Finally, after the controller is synthesized, it is translated into languages such as C and Verilog using our software tools like SDCS2C and SDCS2HDL. This translation enables integration into environments such as Python, MATLAB, and VREP or into physical microcontrollers. The overall workflow of the study is outlined in Algorithm 1.

Algorithm 1 Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

1:: Stage 1: Inverse Kinematics Dataset Generation
2:: for each sample $j = 1 \dots N$ do
3:: Generate random joint angles: $θ^{(j)} \sim U ([θ_{i, min}, θ_{i, max}])$
4:: Compute end-effector position: $x^{(j)} = f (θ^{(j)})$
5:: Add to dataset: $D \leftarrow D \cup {(x^{(j)}, θ^{(j)})}$
6:: end for
7:
8:: Stage 2: Neural Network Training (via GWO)
9:: Initialize weights: $ϕ \sim random$
10:: while termination criterion not met do
11:: Update weights using Grey Wolf Optimizer:
12:: $ϕ \leftarrow ϕ - η \cdot \nabla_{ϕ} L (ϕ)$
13:: end while
14:
15:: Stage 3: Task Definition and Scheduling
16:: for each task $T_{i}$ do
17:: Determine activity: $A_{i} (t) \in {0, 1}$
18:: Update task queue: $δ_{i} (t + 1) = δ_{i} (t) + A_{i} (t) - C_{i} (t)$
19:: Enforce mutual exclusion: $A_{i} (t) \cdot A_{j} (t) = 0, \forall i \neq j$
20:: end for
21:
22:: Stage 4: Controller Synthesis
23:: Objective: $min \sum_{i, t} w_{i} \cdot δ_{i} (t)$
24:: Subject to: $\sum A_{i} (t) \leq k$
25:: Generate control policy: $u (t) = π (δ (t), A (t))$
26:
27:: Stage 5: Simulation and Performance Comparison
28:: Run simulations for both uncontrolled and controlled systems
29:: Compare metrics: task completion time, conflicts, regulation, queue length

4. Experimental Evaluation

The experimental work of this study is addressed in two stages. The dataset of 1,700,000 data points was generated by simulating various motion scenarios for a 6-DOF robot manipulator, with joint angles ranging from −50 to +50 degrees. The data were collected by systematically varying joint angles and recording corresponding end-effector positions, which were then used to train the inverse kinematics model. For each configuration, the corresponding PSO, ACO, ABC, GWO, and COA were calculated using forward kinematics. The dataset was then created by pairing the generated joint angles with their respective end-effector positions, ensuring a comprehensive representation of the manipulator’s kinematic behavior. The results were then validated in both MATLAB Simulink and a physical setup. The second stage of the study, which involves the task scheduling problem of multiple humanoid robots, was encoded in the discrete controller synthesis environment, ReaX, through its own synchronous programming language. For validation of the experimental phase, system behaviors modeled in both Python (version 3.8.5) and MATLAB (version 2021a) Simulink environments were integrated with the controller synthesized in the ReaX environment, and the results are presented below.

Swarm algorithms are crucial for avoiding local minima in robot motion control. According to a comprehensive analysis in the literature, algorithms such as PSO, ACO, ABC, GWO, and COA have emerged as particularly effective in this regard. The performance evaluation of different optimization algorithms in modeling joint angles (

α_{1}

,

α_{2}

,

α_{3}

,

β_{1}

,

β_{2}

, and

β_{3}

) using an ANN for the inverse kinematics problem of a 6-DOF robot manipulator is presented in Table 1. The weights and biases of a single hidden layer comprising 10 neurons were designated as search parameters. The experiments were conducted using MATLAB, with the number of iterations and the population size for the optimization algorithms set to 100 and 1000. According to the table, the performance of the models is evaluated based on the proximity of the MSE value to zero and the coefficient of determination (

R^{2}

) value to one. The GWO algorithm achieved the lowest MSE value, surpassing other algorithms by 470%, 25,133%, 26,737%, and 8912% in terms of the MSE criterion. Similarly, GWO demonstrated superior performance by reducing the MSE by 25,780%, 30,395%, 5741%, and 35,011% compared to alternative methods. Furthermore, GWO attained the lowest MSE value, outperforming competing algorithms by 13,842%, 3383%, 179%, and 2460%. The algorithm also exhibited the lowest MSE value, exceeding the performance of other methods by 5233%, 4913%, 4067%, and 9476%. Additionally, GWO outperformed other algorithms by 465%, 450%, 462%, and 218% in terms of MSE reduction. Lastly, GWO achieved the lowest MSE value, surpassing other methods by 560%, 552%, 1196%, and 576%.

Overall, GWO demonstrated the highest efficacy in optimizing all ANN models presented in this study. Nevertheless, several strategies can be explored to further enhance modeling accuracy. Future research could focus on experimenting with novel algorithms, adjusting population sizes, and integrating advanced metaheuristic approaches into ANN models for the inverse kinematics solution of robotic manipulators. The mean performance of the GWO results was considered, and a comparative analysis with similar studies based on Taguchi Optimization, the Sequential Mutation Genetic Algorithm (SMGA), and Adaptive Particle Swarm Optimization (APSO) is provided in Table 2.

Figure 8 presents a comparative performance analysis of our task scheduling algorithm. The colored bars in the figure represent designs containing varying numbers of humanoid robots. Specifically, the designs labeled

D_{10}

,

D_{30}

, and

D_{70}

correspond to configurations with 10, 30, and 70 humanoid robots, respectively, each performing an activity with both sequential and parallel processes. This task scheduling comparative performance analysis has been implemented in a Python environment, with 100 trials conducted for each design, and the results are displayed in the bars. The lower and upper whiskers indicate the outlier points in the experiments, while the mean is represented by the median. The other segments are denoted by the lower and upper quartiles. On the x-axis, the k-step value of our optimization algorithm is represented, while the performance criterion is expressed as a percentage on the y-axis.

In the conducted experiments, our symbolic discrete controller synthesis approach was evaluated in terms of task scheduling performance on multiple humanoid robots. Within the scope of the experiments, when the designs are evaluated using dataflow-based modeling formalisms, each humanoid robot is represented as an actor (node), while the edges (arcs) correspond to the interactions between the robots. The objective of performance improvement in this context is to maximize the number of tokens fired by the final actor (robot), which serves as an indicator of overall system efficiency.

The same experimental environment was utilized under two conditions: with and without the integration of the SDCS-based controller. A value of k-step

= 0

indicates that no optimization was applied. The 0% point on the y-axis represents the case in which the controller was not integrated into the system. As expected, when input values were assigned randomly, approximately 50% of the cases demonstrated a performance improvement, while the remaining 50% showed a decrease. However, with the application of the optimization algorithm, the average performance consistently improved as the k-step value increased. After k-step

= 4

, a plateau in performance gain was observed, suggesting that further increases in k-step no longer contributed to significant improvements. This indicates that the effect of future-oriented optimization had reached its limit or become negligible. Finally, the observed increase in bar height with a higher number of robots in the design is attributed to the growing number of combinations resulting from the randomness in the experimental outcomes.

The computational tool performance is presented in Table 3, which details both the computational time and the maximum memory usage as a function of the number of robots within a closed system. The table demonstrates the application of cumulative sum optimization over a time window of k-steps. As the value of k increases, signifying an extended lookahead within the time window, the computational time also increases. However, this rise in computational time is accompanied by a corresponding increase in the reliability of the optimization results. Notably, this increase in the time domain is scalable. As seen in the table, the computational time increases in proportion to the number of robots in the system. It is crucial to note, however, that the compilation process is carried out only once. Following the initial compilation, the controller operates in real time on the supervisory system.

Our proposed approach is compiled once after modeling and then operates dynamically in real time. However, when changes need to be made to the system, the model must be updated, and in such cases, the compilation process needs to be repeated. After this update, the system continues to operate in real time.

The results obtained through the experimental evaluation have been thoroughly reported and validated both in simulation and in a physical environment. The symbolic discrete control synthesis technique we employed functions as a model checking tool, thereby ensuring formal verification [15].

In this study, energy consumption is not directly addressed; however, energy efficiency has been indirectly improved as a result of faster and more coordinated task execution. In the literature, the task scheduling problem in robotic systems has been addressed in [59,60,61] using classical control methods or heuristic approaches. Ref. [62] introduces ISC-QL, a novel edge server placement algorithm for Intelligent Transportation Systems (ITS) that integrates Improved Spectral Clustering and Q-Learning. The method first clusters base stations by location and user density, then refines server placement using reinforcement learning. Experimental results using real-world data show that ISC-QL significantly enhances load balancing, reduces energy consumption, and lowers communication delay. However, our approach demonstrates significantly higher performance and a level of formal accuracy compared to the results of these works. Our method not only tackles the task scheduling problem in robotic systems with high performance and formal precision but also presents a novel integration with the inverse kinematics problem. Thus, our pioneering work presented in this paper would be for future research in this area.

5. Conclusions and Future Work

In this paper, we propose a modeling framework for task scheduling in multiple humanoid robot manipulators using the symbolic discrete controller synthesis technique. Our models have been systematically encoded in the ReaX environment by means of the ReaX’s parallel synchronous language. By incorporating control objectives into the model, the safety and optimization algorithms we applied effectively manage task scheduling operations, and our approach successfully achieves satisfying the desired control objectives. Additionally, the inverse kinematics problem for robots consisting of two pairs of 6-DOF manipulators has been trained using metaheuristic methods to enable movement from one position to another. Our approach, addressing two distinct key topics, has been integrated within the framework and the results obtained in the experimental evaluation environment are reported in detail. The results demonstrate that our approach outperforms those found in the literature, and the experimental evaluations have successfully validated our method.

In real-world applications, limitations in task scheduling often stem from scalability issues, particularly when deployed in large-scale or distributed systems. To address these challenges, future work will explore the integration of our proposed method with advanced machine learning techniques, such as reinforcement learning or neural architecture search, which can enable adaptive decision-making and improved generalization across diverse environments. Moreover, the inverse kinematics problem, as approached with metaheuristic algorithms in this study, may lead to approximate or suboptimal solutions. In safety-critical applications, such as robotics in healthcare or aerospace systems, ensuring correctness and reliability is essential. Therefore, future research will also focus on combining metaheuristic approaches with formal verification techniques or constraint-based solvers to ensure both flexibility and correctness. Additionally, we plan to investigate the real-time performance and hardware deployment of the proposed framework to assess its practical viability in industrial settings.

The modeling approach we propose can be easily applied to similar systems with different control objectives in future work. The optimization control algorithms we have used are pessimistic, and new approaches can be developed with the implementation of novel control algorithms. The metaheuristic algorithms used for inverse kinematics within the scope of this study can be hybridized with other optimization algorithms to achieve higher accuracy results. Alternatively, the reliability of results obtained through advanced machine learning algorithms, such as deep reinforcement learning, can be tested. To enable dynamic task scheduling in unstructured environments, future work may explore the integration of deep reinforcement learning algorithms into the proposed framework. Such methods could enhance adaptability and decision-making in complex, real-time multi-robot scenarios. Finally, our work can be extended to effectively operate in dynamic and unstructured environments that require real-time adaptation.

Author Contributions

Conceptualization, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Data curation, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Formal analysis, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Funding acquisition, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Investigation, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Methodology, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Project administration, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Resources, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Software, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Supervision, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Validation, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Visualization, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Writing—original draft, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y.; Writing—review and editing, M.Ö., N.Ö., H.S.B.Y., M.D., C.Ş. and M.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

ANN	Artificial Neural Network
DCS	Discrete Controller Synthesis
GWO	Gray Wolf Optimization
ACO	Ant Colony Optimization
ABC	Artificial Bee Colony
PSO	Particle Swarm Optimization
COA	Coyote Optimization Algorithm
IK	Inverse Kinematic
MH	Inverse Kinematic
SDCS	Symbolic Discrete Controller Synthesis
DOF	Degree-of-freedom
DH	Denavit–Hartenberg
PID	Proportional Integral Derivative
FOPID	Fractional Order PID
CAD	Computer-Aided Design
MSE	Mean Square Error
APSO	Adaptive Particle Swarm Optimization

References

Pereira, J.; Pimentel, C.; Santos, V. An Algorithm for the Assignment and Scheduling of Tasks in Human-Robot Collaboration. In Proceedings of the Flexible Automation and Intelligent Manufacturing: Establishing Bridges for More Sustainable Manufacturing Systems, Taichung, Taiwan, 23–26 June 2024; Silva, F.J.G., Ferreira, L.P., Sá, J.C., Pereira, M.T., Pinto, C.M.A., Eds.; Springer: Cham, Switzerland, 2024; pp. 208–215. [Google Scholar]
Pupa, A.; Van Dijk, W.; Brekelmans, C.; Secchi, C. A Resilient and Effective Task Scheduling Approach for Industrial Human-Robot Collaboration. Sensors 2022, 22, 4901. [Google Scholar] [CrossRef] [PubMed]
O’Flaherty, R.; Vieira, P.; Grey, M.; Oh, P.; Bobick, A.F.; Egerstedt, M.B.; Stilman, M. Kinematics and Inverse Kinematics for the Humanoid Robot HUBO2+; Georgia Tech Library: Atlanta, GA, USA, 2013. [Google Scholar]
Ali, M.A.; Park, H.A.; Lee, C.G. Closed-form inverse kinematic joint solution for humanoid robots. In Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, Taipei, Taiwan, 18–22 October 2010; pp. 704–709. [Google Scholar]
Said, A.; Rodriguez-Leal, E.; Soto, R.; Gordillo, J.; Garrido, L. Decoupled Closed-Form Solution for Humanoid Lower Limb Kinematics. Math. Probl. Eng. 2015, 2015, 437979. [Google Scholar] [CrossRef]
Jiang, Z.; Zhang, X.; Liu, G. Trajectory tracking control of a 6-DOF robotic arm based on improved FOPID. Int. J. Dyn. Control 2025, 13, 137. [Google Scholar] [CrossRef]
Liu, M.; Zhang, J.; Shang, M. Real-time cooperative kinematic control for multiple robots in distributed scenarios with dynamic neural networks. Neurocomputing 2022, 491, 621–632. [Google Scholar] [CrossRef]
Ahmed, K.; Roshdy, A.A.; Ata, W.; Salem, A. PID Control of Dual Axis Inertially Stabilized Platform Simscape Multibody Model. In Proceedings of the 2022 18th International Computer Engineering Conference (ICENCO), Giza, Egypt, 29–30December 2022; Volume 1, pp. 66–73. [Google Scholar]
Ramadge, P.; Wonham, W. The control of discrete event systems. Proc. IEEE 1989, 77, 81–98. [Google Scholar] [CrossRef]
Balemi, S.; Hoffmann, G.J.; Gyugyi, P.; Wong-Toi, H.; Franklin, G.F. Supervisory control of a rapid thermal multiprocessor. IEEE Trans. Autom. Control 1993, 38, 1040–1059. [Google Scholar] [CrossRef]
Maraninchi, F.; Rémond, Y. Argos: An automaton-based synchronous language. Comput. Lang. 2001, 27, 61–92. [Google Scholar] [CrossRef]
Altisen, K.; Clodic, A.; Maraninchi, F.; Rutten, É. Using controller-synthesis techniques to build property-enforcing layers. In Proceedings of the European Symposium on Programming, Warsaw, Poland, 7–11 April 2003; Springer: Berlin/Heidelberg, Germany, 2003; pp. 174–188. [Google Scholar]
Marchand, H.; Le Borgne, M. Partial order control of discrete event systems modelled as polynomial dynamical systems. In Proceedings of the 1998 IEEE International Conference on Control Applications (Cat. No. 98CH36104), Trieste, Italy, 4 September 1998; Volume 2, pp. 817–821. [Google Scholar]
Marchand, H.; Bournai, P.; Borgne, M.L.; Guernic, P.L. Synthesis of discrete-event controllers based on the signal environment. Discret. Event Dyn. Syst. 2000, 10, 325–346. [Google Scholar] [CrossRef]
Berthier, N.; Marchand, H. Discrete controller synthesis for infinite state systems with reax. IFAC Proc. Vol. 2014, 47, 46–53. [Google Scholar] [CrossRef]
Zhang, J.; Dong, Y.; Yin, L.; Mostafa, A.M.; Li, Z. Opacity enforcement in discrete event systems using differential privacy. Inf. Sci. 2025, 688, 121284. [Google Scholar] [CrossRef]
Qin, T.; Yin, L.; Liu, G.; Wu, N.; Li, Z. Strong Current-State Opacity Verification of Discrete-Event Systems Modeled with Time Labeled Petri Nets. IEEE/CAA J. Autom. Sin. 2025, 12, 54–68. [Google Scholar] [CrossRef]
Hu, Y.; Wang, D.; Yang, M.; He, J. Integrating reinforcement learning and supervisory control theory for optimal directed control of discrete-event systems. Neurocomputing 2025, 613, 128720. [Google Scholar] [CrossRef]
Tigane, S.; Guerrouf, F.; Hamani, N.; Kahloul, L.; Khalgui, M.; Ali, M.A. Dynamic Timed Automata for Reconfigurable System Modeling and Verification. Axioms 2023, 12, 230. [Google Scholar] [CrossRef]
An, X.; Rutten, E.; Diguet, J.P.; Le Griguer, N.; Gamatié, A. Discrete control for reconfigurable fpga-based embedded systems. IFAC Proc. Vol. 2013, 46, 151–156. [Google Scholar] [CrossRef]
Ishimizu, Y.; Li, J.; Yamauchi, T.; Chen, S.; Cai, J.; Hirano, T.; Tei, K. Towards Efficient Discrete Controller Synthesis: Semantics-Aware Stepwise Policy Design via LLM. In Proceedings of the 2024 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), Danang, Vietnam, 3–6 November 2024; pp. 1–4. [Google Scholar]
Li, J.; Yamauchi, T.; Li, N.; Chen, Z.; Zhang, M.; Hirano, T.; Tei, K. Demonstration of a Real-world Self-adaptive Robot Path-finding using Discrete Controller Synthesis. In Proceedings of the 2023 IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion (ACSOS-C), Toronto, ON, Canada, 25–29 September 2023; pp. 27–28. [Google Scholar] [CrossRef]
Zudaire, S.A.; Nahabedian, L.; Uchitel, S. Assured mission adaptation of UAVs. ACM Trans. Auton. Adapt. Syst. (TAAS) 2022, 16, 1–27. [Google Scholar] [CrossRef]
Özbaltan, M.; Çaşka, S. Incorporating Symbolic Discrete Controller Synthesis into a Virtual Robot Experimental Platform: An Implementation with Collaborative Unmanned Aerial Vehicle Robots. Drones 2024, 8, 206. [Google Scholar] [CrossRef]
Gueye, S.M.K.; Rutten, E.; Diguet, J.P. Autonomic management of missions and reconfigurations in FPGA-based embedded system. In Proceedings of the 2017 NASA/ESA Conference on Adaptive Hardware and Systems (AHS), Pasadena, CA, USA, 24–27 July 2017; pp. 48–55. [Google Scholar] [CrossRef]
Dumitrescu, E.; Girault, A.; Marchand, H.; Rutten, E. Optimal discrete controller synthesis for modeling fault-tolerant distributed systems. IFAC Proc. Vol. 2007, 40, 169–174. [Google Scholar] [CrossRef]
Çaşka, S. The Performance of Symbolic Limited Optimal Discrete Controller Synthesis in the Control and Path Planning of the Quadcopter. Appl. Sci. 2024, 14, 7168. [Google Scholar] [CrossRef]
Jiang, X.; Wu, B.; Li, S.; Zhu, Y.; Liang, G.; Yuan, Y.; Li, Q.; Zhang, J. Multi-Humanoid Robot Arm Motion Imitation and Collaboration Based on Improved Retargeting. Biomimetics 2025, 10, 190. [Google Scholar] [CrossRef]
Minh Trieu, N.; Thinh, N.T. Advanced Design and Implementation of a Biomimetic Humanoid Robotic Head Based on Vietnamese Anthropometry. Biomimetics 2024, 9, 554. [Google Scholar] [CrossRef]
Tian, T.; Liang, Z.; Wei, Y.; Luo, Q.; Zhou, Y. Hybrid whale optimization with a firefly algorithm for function optimization and mobile robot path planning. Biomimetics 2024, 9, 39. [Google Scholar] [CrossRef] [PubMed]
Mohammed, M.Q.; Kwek, L.C.; Chua, S.C.; Al-Dhaqm, A.; Nahavandi, S.; Eisa, T.A.E.; Miskon, M.F.; Al-Mhiqani, M.N.; Ali, A.; Abaker, M.; et al. Review of learning-based robotic manipulation in cluttered environments. Sensors 2022, 22, 7938. [Google Scholar] [CrossRef] [PubMed]
Kroemer, O.; Niekum, S.; Konidaris, G. A review of robot learning for manipulation: Challenges, representations, and algorithms. J. Mach. Learn. Res. 2021, 22, 1–82. [Google Scholar]
Liu, R.; Nageotte, F.; Zanne, P.; de Mathelin, M.; Dresp-Langley, B. Deep reinforcement learning for the control of robotic manipulation: A focussed mini-review. Robotics 2021, 10, 22. [Google Scholar] [CrossRef]
Pandey, V.; Kamal, S.; Ghosh, S. Finite-time discrete control for two-DOF helicopter system. IEEE Trans. Circuits Syst. I Express Briefs 2024, 71, 3800–3804. [Google Scholar] [CrossRef]
Amngostar, P.; Dehabadi, M.; Hagh, S.F.; Badireddy, A.R.; Huston, D.; Xia, T. LoRaStat: A Low-cost Portable LoRaWAN Potentiostat for Wireless and Real-Time Monitoring of Water Quality and Pollution. IEEE Sens. J. 2024, 25, 1561–1570. [Google Scholar] [CrossRef]
Schieber, B.; Samineni, B.; Vahidi, S. Interweaving real-time jobs with energy harvesting to maximize throughput. In Proceedings of the International Conference and Workshops on Algorithms and Computation, Hsinchu, Taiwan, 22–24 March 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 305–316. [Google Scholar]
Chapman, R.M.; Taylor, K.B.; Kaczynski, E.; Khodabakhsh, S.; Richards, S.; Hutchinson, J.B.; Marchand, R.C. Accuracy amidst errors: Evaluating a commercially available wearable sensor system and its associated calibration procedures for monitoring sagittal knee motion in patients undergoing total knee arthroplasty. Knee 2025, 54, 316–328. [Google Scholar] [CrossRef]
Jandaghi, E.; Stein, D.L.; Hoburg, A.; Stegagno, P.; Zhou, M.; Yuan, C. Composite Distributed Learning and Synchronization of Nonlinear Multi-Agent Systems with Complete Uncertain Dynamics. In Proceedings of the 2024 IEEE International Conference on Advanced Intelligent Mechatronics (AIM), Boston, MA, USA, 15–18 July 2024; pp. 1367–1372. [Google Scholar]
Wen, B.; Tremblay, J.; Blukis, V.; Tyree, S.; Müller, T.; Evans, A.; Fox, D.; Kautz, J.; Birchfield, S. Bundlesdf: Neural 6-dof tracking and 3d reconstruction of unknown objects. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 606–617. [Google Scholar]
Sundermeyer, M.; Mousavian, A.; Triebel, R.; Fox, D. Contact-graspnet: Efficient 6-dof grasp generation in cluttered scenes. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 13438–13444. [Google Scholar]
Tyree, S.; Tremblay, J.; To, T.; Cheng, J.; Mosier, T.; Smith, J.; Birchfield, S. 6-DoF pose estimation of household objects for robotic manipulation: An accessible dataset and benchmark. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; pp. 13081–13088. [Google Scholar]
Lu, B.L.; Ito, K. Computation of Multiple Inverse Kinematic Solutions for Redundant Manipulators by Inverting Modular Neural Networks. IEEE J. Trans. Electron. Inf. Syst. 1995, 116, 49–56. [Google Scholar] [CrossRef]
Oyama, E.; Chong, N.Y.; Agah, A.; Maeda, T. Inverse kinematics learning by modular architecture neural networks with performance prediction networks. In Proceedings of the 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164), Seoul, Republic of Korea, 21–26 May 2001; Volume 1, pp. 1006–1012. [Google Scholar] [CrossRef]
D’Souza, A.; Vijayakumar, S.; Schaal, S. Learning inverse kinematics. In Proceedings of the 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180), Maui, HI, USA, 29 October–3 November 2001; Volume 1, pp. 298–303. [Google Scholar] [CrossRef]
Cassandras, C.G.; Lafortune, S. Introduction to Discrete Event Systems; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Calinescu, R.; Imrie, C.; Mangal, R.; Pasareanu, C.S.; Santana, M.A.; Vázquez, G. Discrete-Event Controller Synthesis for Autonomous Systems with Deep-Learning Perception Components. arXiv 2022, arXiv:abs/2202.03360. [Google Scholar]
Özbaltan, M.; Berthier, N. Power-aware scheduling of data-flow hardware circuits with symbolic control. Arch. Control Sci. 2021, 31, 431–446. [Google Scholar]
Özbaltan, M. Control of Discrete Event Systems by Using Symbolic Transition Model: An Application to Power Grids. Arab. J. Sci. Eng. 2025, 50, 937–949. [Google Scholar] [CrossRef]
Özbaltan, M.; Çaşka, S. Altitude control of quadcopter with symbolic limited optimal discrete control. Int. J. Dyn. Control 2024, 12, 1533–1540. [Google Scholar] [CrossRef]
Çaşka, S.; Özbaltan, M. Adaptation of Symbolic Discrete Control Synthesis for Energy-Efficient Multi-Pocket Milling. Processes 2024, 12, 584. [Google Scholar] [CrossRef]
Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the ICNN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar] [CrossRef]
Dorigo, M.; Socha, K. An introduction to ant colony optimization. In Handbook of Approximation Algorithms and Metaheuristics; Chapman and Hall/CRC: Boca Raton, FL, USA, 2018; pp. 395–408. [Google Scholar]
Karaboga, D. Artificial bee colony algorithm. Scholarpedia 2010, 5, 6915. [Google Scholar] [CrossRef]
Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef]
Pierezan, J.; Coelho, L.D.S. Coyote optimization algorithm: A new metaheuristic for global optimization problems. In Proceedings of the 2018 IEEE Congress on Evolutionary Computation (CEC), Rio de Janeiro, Brazil, 8–13 July 2018; pp. 1–8. [Google Scholar]
Ibarra-Pérez, T.; Ortiz-Rodríguez, J.M.; Olivera-Domingo, F.; Guerrero-Osuna, H.A.; Gamboa-Rosales, H.; Martínez-Blanco, M.d.R. A novel inverse kinematic solution of a six-DOF robot using neural networks based on the taguchi optimization technique. Appl. Sci. 2022, 12, 9512. [Google Scholar] [CrossRef]
Zhou, Z.; Guo, H.; Wang, Y.; Zhu, Z.; Wu, J.; Liu, X. Inverse kinematics solution for robotic manipulator based on extreme learning machine and sequential mutation genetic algorithm. Int. J. Adv. Robot. Syst. 2018, 15, 1729881418792992. [Google Scholar] [CrossRef]
Deng, H.; Xie, C. An improved particle swarm optimization algorithm for inverse kinematics solution of multi-DOF serial robotic manipulators. Soft Comput. 2021, 25, 13695–13708. [Google Scholar] [CrossRef]
Song, M.; Tarn, T.J.; Xi, N. Integration of task scheduling, action planning, and control in robotic manufacturing systems. Proc. IEEE 2000, 88, 1097–1107. [Google Scholar] [CrossRef]
Zacharia, P.T.; Aspragathos, N. Optimal robot task scheduling based on genetic algorithms. Robot. Comput.-Integr. Manuf. 2005, 21, 67–79. [Google Scholar] [CrossRef]
Ho, T.M.; Nguyen, K.K.; Cheriet, M. Federated deep reinforcement learning for task scheduling in heterogeneous autonomous robotic system. IEEE Trans. Autom. Sci. Eng. 2022, 21, 528–540. [Google Scholar] [CrossRef]
Zhou, Z.; Abawajy, J. Reinforcement Learning-Based Edge Server Placement in the Intelligent Internet of Vehicles Environment. IEEE Trans. Intell. Transp. Syst. 2025. [CrossRef]

Figure 1. The joint configuration of the right arm of the 6-DOF humanoid robot and the roll, pitch, and yaw rotations of its end effector.

Figure 2. General artificial neural network (ANN) architecture.

Figure 3. Workflow of our approach for task scheduling of multiple humanoid robot manipulators.

Figure 4. General view of the 6-DOF manipulator and humanoid robot.

Figure 5. Kinematic scheme, link lengths, and joint angle limits of the 6-DOF manipulator.

Figure 6. An illustrative task scheduling scenario in multiple humanoid robots.

Figure 7. Token model transition between humanoid robots.

Figure 8. Comparison of task scheduling performance results.

Table 1. Performance evaluation of different optimization algorithms in modeling joint angles using ANN.

		Joint Angles
Method	Error	$α_{1}$	$α_{2}$	$α_{3}$	$β_{1}$	$β_{2}$	$β_{3}$
PSO [51]	MSE	0.0003	0.0359	0.0140	0.0229	0.0135	0.0284
PSO [51]	$R^{2}$	0.9992	0.8819	0.9536	0.9235	0.9547	0.9063
ACO [52]	MSE	0.0141	0.0337	0.0138	0.0270	0.0033	0.0275
ACO [52]	$R^{2}$	0.9561	0.8889	0.9569	0.9118	0.9882	0.9097
ABC [53]	MSE	0.0150	0.0279	0.0299	0.0051	0.0002	0.0282
ABC [53]	$R^{2}$	0.9536	0.9082	0.9000	0.9825	0.9994	0.9073
GWO [54]	MSE	0.0001	0.0007	0.0025	0.0001	0.0001	0.0061
GWO [54]	$R^{2}$	0.9998	0.9976	0.9914	0.9998	0.9996	0.9805
COA [55]	MSE	0.0050	0.0650	0.0144	0.0311	0.0024	0.0133
COA [55]	$R^{2}$	0.9831	0.7878	0.9541	0.9086	0.9914	0.9553

Table 2. Performance comparison of optimization methods for the inverse kinematics problem.

Method	Error		References
GWO	0.0016	(MSE)	Proposed Method
Taguchi	0.0670	(MSE)	[56]
SMGA	1.7220	(MSE)	[57]
APSO	0.0010	(Position)	[58]

Table 3. Computational tool performance.

Number of Robots	k-Step	Time (s)	Max Memory (MB)
10	1	0.11	246,438
	2	0.17	250,492
	3	0.23	261,342
	4	0.47	267,998
	5	0.51	271,122
30	1	0.11	342,120
	2	0.23	358,224
	3	0.43	370,412
	4	1.21	381,102
	5	1.35	403,444
70	1	0.13	560,178
	2	0.22	592,174
	3	0.47	611,526
	4	1.32	624,104
	5	2.44	652,666

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Özbaltan, M.; Özbaltan, N.; Bıçakcı Yeşilkaya, H.S.; Demir, M.; Şeker, C.; Yıldırım, M. Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control. Biomimetics 2025, 10, 346. https://doi.org/10.3390/biomimetics10060346

AMA Style

Özbaltan M, Özbaltan N, Bıçakcı Yeşilkaya HS, Demir M, Şeker C, Yıldırım M. Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control. Biomimetics. 2025; 10(6):346. https://doi.org/10.3390/biomimetics10060346

Chicago/Turabian Style

Özbaltan, Mete, Nihan Özbaltan, Hazal Su Bıçakcı Yeşilkaya, Murat Demir, Cihat Şeker, and Merve Yıldırım. 2025. "Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control" Biomimetics 10, no. 6: 346. https://doi.org/10.3390/biomimetics10060346

APA Style

Özbaltan, M., Özbaltan, N., Bıçakcı Yeşilkaya, H. S., Demir, M., Şeker, C., & Yıldırım, M. (2025). Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control. Biomimetics, 10(6), 346. https://doi.org/10.3390/biomimetics10060346

Article Menu

Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

Abstract

1. Introduction

1.1. Related Work

1.1.1. Task Scheduling

1.1.2. Inverse Kinematics Problem

1.1.3. Symbolic Controller Synthesis

2. Technical Background of the Common Problems Encountered in the Task Scheduling of Humanoid Robots

2.1. Six-Degree-of-Freedom Robot Manipulators

2.2. Inverse Kinematics Problem

2.3. Artificial Neural Network (ANN) Model

2.4. Metaheuristic Approaches

3. Task Scheduling of Multiple Humanoid Robot Manipulators by Using Symbolic Control

3.1. Overview

3.2. Inverse Kinematics Model of 6-DOF Robot Manipulators

3.3. Symbolic Model of Humanoid Robot Manipulators

3.3.1. Activation Model

3.3.2. Mutual Exclusion Constraints

3.3.3. Token Model

3.3.4. Control

3.3.5. Integration

4. Experimental Evaluation

5. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI