Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery

Guerreiro, Bruno J.; Azevedo, Francisco; Oliveira, Paulo; Cunha, Rita

doi:10.3390/s26020653

Open AccessArticle

Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery

¹

DEEC/CTS/LASI, NOVA School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal

²

ISR/LARSYS, Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisboa, Portugal

³

LAETA and DEM, Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisboa, Portugal

^*

Author to whom correspondence should be addressed.

Sensors 2026, 26(2), 653; https://doi.org/10.3390/s26020653

Submission received: 18 November 2025 / Revised: 12 January 2026 / Accepted: 15 January 2026 / Published: 18 January 2026

(This article belongs to the Special Issue Advanced Sensors for Intelligent Robotic Systems: Vision, Touch, and Dexterous Manipulation)

Download

Browse Figures

Versions Notes

Abstract

This paper presents the development of an autonomous grasping mechanism for drone-based parcel delivery systems towards developing capabilities for in-flight package transfer. The approach integrates a mechanical gripper fitted with sensors and a pose estimation method for parcels, all coordinated through a hybrid Model Predictive Control (MPC) architecture. The gripper’s mechanical structure and prototype are developed using 3D printing technology for both the main framework and gear components. A hybrid dynamical model is formulated that integrates the gripper mechanics with simplified drone dynamics, capturing distinct operational phases including package acquisition, transport, and release. The hybrid MPC framework computes reference trajectories for both the gripper arm configuration and the drone’s spatial path toward designated target positions. Experimental validation is conducted using the operational gripper prototype and pose estimation system, while drone behavior is represented through simulation.

Keywords:

hybrid MPC; pose estimation; drone delivery; gripper; 3D printing

1. Introduction

In the early stages of unmanned aerial vehicle (UAV) adoption, commonly known as drones, the dominant applications centered on observation and surveillance tasks. However, contemporary technological advances and growing demands for automation and remote operation have expanded the scope of UAV capabilities. Equipping these platforms with manipulation functionality enables not merely environmental observation but active physical interaction with surroundings as well. The inherent mobility of UAVs can increase the potential for ubiquitous object grasping and transportation, eliminating requirements for elaborate ground-based infrastructure. Addressing these opportunities, the REPLACE project [1] pursues the development of a rapid parcel delivery system for urban settings using drone platforms, acknowledging that relay operations between multiple drones become essential when individual vehicle range and endurance prove insufficient for complete delivery missions.

This paper focuses on developing a lightweight, fast, and autonomous grasping system for drone-based parcel exchange operations. The primary objectives are as follows: (i) a gripper design capable of handling parcels with known characteristics under constraints of mass, velocity, and energy consumption; (ii) establishing a perception and control architecture for the grasping device that enables package pose detection and responsive actuation. This foundation work can afterwords support the creation of basic grasping agents suitable for executing parcel transfers in operational contexts.

The task of aerial package manipulation and transport presents challenges that span a spectrum of scenarios with varying complexity levels. At the simplest level, a vehicle retrieves a package from one stationary ground location and deposits it at another fixed position. Upon approaching the pickup zone, the gripper system engages and computes an optimal approach maneuver and path based on the detected package position. Once the payload is secured, the vehicle navigates to the delivery site for package release. A more demanding scenario involves acquiring objects from mobile platforms (such as another aerial vehicle or ground vehicle), where the cargo remains visible and accessible from above. Advanced scenarios might integrate the motion of both pickup and delivery platforms within a supervisory control strategy to optimize transfer speed and efficiency, such as the work published in [2,3].

Concerning gripper system design, ref. [4] provides an extensive kinematic analysis of 64 linkage-based gripper configurations, establishing a foundation for subsequent investigations. The work in [5] documents established industrial practices and design methodologies for gripping mechanisms, providing foundational principles for the present study. Additionally, ref. [6] addresses the integration of grasping mechanisms with aerial platforms, emphasizing vehicle dynamics and control architecture, whereas [7,8] explore the deployment of both impactive and ingressive gripper types on UAV systems. Comprehensive reviews of aerial manipulation technologies and methodologies are presented in [9,10,11], which discuss various gripper designs, control strategies, and application scenarios.

For grasping operations in uncertain or incompletely characterized environments, such as aerial load transport, perception and sensing capabilities become critically important. The research presented in [12] addresses this topic by examining UAV modeling and control with explicit consideration of environmental interactions. The work in [13] develops a vision-based control approach for quadrotor perching maneuvers on cables, though the challenge intensifies when accurate pose estimation of target objects becomes necessary, for which the methodology presented in [14] provides an effective and widely adopted solution. Recent developments include sophisticated rigid designs such as the dual-arm aerial manipulator with anthropomorphic grippers [15] or lightweight adaptive gripper for parcel delivery [16]. While rigid grippers offer high load capacity and predictable behavior, soft and soft–rigid hybrid designs provide compliant grasping with improved adaptability to geometric uncertainty, as demonstrated in [17] for food handling and in [18] with variable stiffness grippers. Specifically for the aerial parcel delivery application, ref. [19] provides an thorough review of the state-of-the-art technologies and challenges, highlighting the trade-offs between different gripper designs. The approach presented here emphasizes a streamlined simple grasping device combined with coordinated autonomous control of both vehicle and manipulation subsystems for package exchange operations.

Problems featuring multiple operational phases with both continuous and discrete state dynamics may require hybrid dynamical modeling approaches. The foundational concept of hybrid continuous-time dynamical systems was established in [20], which examined how discrete state variables can capture transitions among continuous dynamics modes and their stability properties. The hybrid automaton framework, introduced in [21] and elaborated in [22], provides a formal representation for hybrid systems, though numerous alternative formulations appear throughout the literature, such as [23]. The work in [24] advances toward more tractable frameworks for describing, analyzing, implementing, and controlling such systems through a Mixed Logical Dynamical (MLD) formulation.

The main contributions of this paper are the design, implementation and experimental validation of a new automatic gripping system towards multi-drone parcel delivery comprising the mechanical and electronics systems, a pose estimation method, and a hybrid MPC strategy to achieve automatic planning and control for grasping parcels. The proposed hybrid MPC strategy is capable of coordinating both the drone motion and the gripper actuation through different discrete operation phases, including approach, grasping, and release maneuvers, alowing for the operation design to focus on goals and respective combination of cost functionals and contrainst, rather than using predefined trajectories that might not be able to cope with a moving parcel. The control algorithm is validated in simulation and an experimental validation trials for the gripper system is also presented.

The paper is organized as follows. Section 2 outlines the gripper mechanism design procedure, while Section 3 presents a package pose estimation method based on ArUco fiducial markers. Section 4 develops the dynamical models for both drone and gripper subsystems, subsequently integrating them into a unified hybrid model with an associated hybrid MPC controller. Section 5 describes the implemented prototype along with validation experiments that integrate the various system components, and Section 6 provides concluding observations and directions for future investigation.

2. Gripper Design

Gripping mechanisms represent well-established devices that exhibit diverse configurations depending on their intended application, operational constraints, and available fabrication materials. The objective here is not to present a universal gripper design methodology, but rather to document the development process for a mechanism tailored to the specific requirements of this application.

2.1. Motion Constraints and Prehension

The transported item (or its enclosure) is assumed to possess a rectangular prismatic geometry with uniform mass distribution, which given the vehicle’s payload capacity limitations, is constrained to remain below 500 g. For the fundamental scenario where the vehicle acquires and releases cargo at fixed ground locations, the gripper’s motion envelope is bounded only by the vehicle frame and ground surface. The gripper arm dimensions must respect both the vertical clearance when in the closed configuration and the lateral clearance required for the fully opened state. A slow prehension sequence would necessitate reducing vehicle speed such that arrival at the target position coincides with jaw closure completion. Following package acquisition, the gripper must retain secure hold throughout the transport trajectory to the destination.

To handle packages of varying dimensions and aspect ratios, the gripper jaw surfaces should maintain parallel orientation throughout their motion.Unlike angular jaw configurations, parallel jaw motion enables grasping from any accessible package surface, providing versatility across multiple acquisition scenarios while ensuring greater tolerance for positioning errors. Gripper mechanisms can be categorized into two primary types: ingressive and impactive. The present work focuses exclusively on the latter category, which operates through a straightforward principle where grasping is accomplished and sustained by normal forces applied by the jaw surfaces against opposing faces of the target object. Package retention results from the friction force

F_{f}

generated at these contact interfaces. Selecting appropriate jaw surface materials that yield suitable friction coefficients

μ_{s}

when paired with the package surface is essential. For cardboard–rubber material pairs,

μ_{s}

typically ranges from 0.5 to 0.8.

2.2. Typical Forces During Operation

Developing an appropriate gripper mechanism requires identifying and quantifying the operational forces involved. Given the assumed cuboid package geometry, the contact region between the package and jaw surfaces forms a rectangular area. Larger contact areas enhance retention stability while simultaneously reducing required gripping forces. The adopted configuration employs symmetric bilateral grasping as illustrated in Figure 1.

Each jaw applies a normal force

F_{G}

normal to the grasped package surface, whereas the friction force

F_{f}

required to prevent the package descent is expressed as

F_{f} = \frac{m g}{n}

, where n denotes the number of contact points (fingers and jaws, in this case, 2), m represents the package mass, and g is the gravitational acceleration. Given the relationship

F_{f} = μ_{s} N

, where N represents the normal contact force, the friction force

F_{f}

and gripping force

F_{G}

are related through

F_{G} = \frac{F_{f}}{μ_{s}}

.

During package transport, the vehicle may undergo accelerations beyond gravitational effects in the vertical direction. The most demanding condition arises during rapid ascent maneuvers, when the vehicle experiences maximum acceleration

a_{m a x}

, which compounds the gravitational acceleration. Under these circumstances, considering a gripper arm length l, the forces and moments required to maintain secure package retention are, respectively,

F_{G}^{*} = \frac{m \cdot (g + a_{m a x})}{μ_{s}},

(1)

M_{G}^{*} = F_{G}^{*} l .

(2)

2.3. Power Drive Chain

Selecting appropriate power transmission components is critical to meeting the system’s minimum velocity, force, and torque specifications. As illustrated in Figure 2, an actuator, specifically an electric motor, supplies torque

M_{m o t o r}

to the system.

Power transmission from the motor to the gripper finger rotation axis occurs through gear mechanisms, for which spur gears represent the most prevalent and elementary gear type. For a spur gear featuring N teeth, several fundamental geometric parameters can be established. The pitch circle defines a theoretical reference circle forming the basis for geometric calculations. The circular pitch, p, represents the arc length between corresponding points on adjacent teeth, measured along the pitch circle. The pressure angle

α

characterizes the inclination of the gear tooth profile. The module m serves as the standard ISO parameter for gear tooth sizing in gear nomenclature, defined as

m = p / π

, whereas the pitch diameter

d_{p}

relates to the module through

d_{p} = N m

.

For successful meshing between two spur gears, three principal requirements must be satisfied:

1.: The gears must be mounted on parallel shafts;
2.: Both gears must share identical module values m;
3.: The shaft separation, the center distance, must equal half the sum of the two pitch diameters.

For two properly meshed gears X and Y, their gear ratio is expressed as

S_{X - Y} = \frac{N_{Y}}{N_{X}} = \frac{d_{Y}}{d_{X}} .

(3)

During meshing operation, the pitch circles of both gears roll without slip, and the velocity at contact point c remains constant for gears with pitch radii

r_{X}

,

r_{Y}

, as well as angular velocities

ω_{X}

,

ω_{Y}

. Consequently, the angular velocities satisfy the relationship

\frac{ω_{X}}{ω_{Y}} = \frac{r_{Y}}{r_{X}} = \frac{d_{Y}}{d_{X}} = S_{X - Y} .

(4)

Given that properly designed gear meshes exhibit high efficiency with approximately 2% losses, power transmission through the mesh is treated as constant.

The torques on each gear can be determined by equating the mechanical power transmitted, resulting in

\frac{T_{X}}{T_{Y}} = \frac{ω_{Y}}{ω_{X}} = \frac{1}{S_{X - Y}} .

(5)

As such, the relationship between motor torque and the torque at the finger rotation axis is expressed as

M_{G} = \frac{M_{m o t o r} S_{t o t a l}}{n},

(6)

where

S_{t o t a l}

represents the overall gear ratio of the complete transmission train (accounting for multiple gear stages if present).

The grasping system comprises two pairs of gripper arms, with all elements designed to operate symmetrically and in synchronization, as depicted in Figure 3 for a single arm pair. Gears B and C maintain a 1:1 ratio,

S_{B - C}

, to preserve symmetry within each arm pair, while the ratios

S_{A - B}

and

S_{D - C}

are identical, and gear B functions as an idler, reversing the rotational direction of gear A.

Power transfer from the motor to the gear drive shaft employs a worm gear mechanism, which incorporates two components: a worm screw and a worm wheel (or worm gear). The transmission ratio for a worm drive is given by

S_{w o r m} = \frac{N_{G}}{N_{W}},

(7)

where

N_{G}

denotes the tooth count on the worm wheel and

N_{W}

represents the number of thread starts on the worm screw. A key advantage of worm drives is their potential for self-locking behavior in certain configurations, where the worm wheel cannot back-drive the worm screw. Finally, combining Equations (1), (2) and (6) yields the minimum permissible total gear train ratio threshold, expressed as

S^{*} = \frac{M_{g}^{*}}{M_{m o t o r}} .

(8)

2.4. Final Prototype and Experimental Assessment

The final gripper design was subject to several tests to validate its performance against the specified requirements. Regarding the use of the worm gear, from (4) and (5), it is possible to calculate the expected values of the gripper jaw’s maximum angular velocity and torque provided by the chosen motor and gear set combination, which are 5.39 Nm and 0.94 rad/s, respectively. However, empirical tests that account for losses and force transmission inefficiencies were performed for the angular velocity, as depicted in Figure 4.

The arm angular velocity was measured in 30 trials with the same step input reference, and the mean value was computed for each time step. From this data, it can be inferred that the maximum measured angular speed is

{\bar{ω}}_{a r m} \approx 0.6

rad/s, which is about 64% of the expected values. Assuming the same losses for the available torque,

M_{m o t o r} = 3.45

Nm. From (1) and (2), the minimum torque necessary to hold a parcel with

m = 200

g, considering

μ_{s} = 0.8

and a combined maximum allowable acceleration of

12 {ms}^{- 2}

, is

M_{G}^{*} = 3

Nm, which is within the estimated gripper capabilities, as

M_{m o t o r} \geq M_{G}^{*}

.

To determine the gripper’s current angular position, a potentiometer is attached to one of the gripper arms. Experimental trials were performed to test the gripper going from fully open to fully closed, as shown in Figure 5, where the gripper arm angle

θ_{a r m}

is plotted against time.

In addition to position tracking, the system must also verify whether the package has been successfully secured. This is accomplished by identifying instances when the servo motor stalls or experiences exceptional loading, indicating insufficient power to overcome mechanical resistance from an obstacle. A current detection circuit interfaced with the microcontroller was developed to acquire this data, as illustrated in Figure 6.

When stalling is detected during testing, the motor temporarily halts before resuming motion. Current exceeding the calibrated threshold (red dashed line) triggers a motor stop and the system proceeds with the following steps defined for regular operation. Considering the trial depicted in Figure 5, corresponding the current measurement results are shown in Figure 7.

Current measurements demonstrate proper system operation, with the microcontroller accurately detecting full gripper closure through characteristic current peaks. When measured current surpasses the predefined threshold, the gripper arms are confirmed to be in contact with either the target parcel or the opposing jaw.

The final gripper prototype depicted in Figure 8, both standalone and integrated in the drone holding a parcel, was also evaluated for its load-bearing capacity during static conditions.

The bulk of its structure and all of the spur gears were 3D printed using Fused Deposition Modeling with a PLA material, weighing 250 g without the camera and 290 g with the used camera. Experimental tests were performed using a spring-scale dynamometer attached to the bottom surface of a test parcel box to assess the maximum weight that the gripper could hold without electric current being supplied to the motor. It was experimentally observed that the gripper mechanism equipped with the worm gear and end parts fitted with rubber mats could hold cargo of up to 1 kg.

In Table 1, a comparison between the proposed gripper and some recent works found in the literature is presented.

Although its purpose is more specific to the application and usage detailed above, it can be seen that it is lighter than most of the rigid grippers presented in [10,11], which usually weigh more than 300 g, while still being able to handle parcels up to 1 kg, which is above the required payload for the intended application and for the payload capacity for which most small drones are designed to carry. Another design option related to the autonomy of the drone+gripper system is the use of the worm gear, which provides self-locking capabilities, avoiding the need for continuous power supply to the motor to keep the parcel grasped during transportation. This type of discussion is seldomly found in the literature, but it is an important aspect to consider when designing aerial manipulation systems, as it can significantly impact the overall energy consumption and flight endurance of the drone.

3. Parcel Pose Estimation

Accurate determination of the target package position can be achieved through various approaches. External positioning systems, such as GPS tracking, represent one possibility, but their positional uncertainties and limited update frequencies render them unsuitable for close-range operations and rapid maneuvers. Conversely, onboard sensing methods, including computer vision or proximity sensors, typically deliver improved accuracy at shorter ranges with fewer environmental obstructions. The adopted approach combines both methodologies: initial acquisition at larger relative distances with relaxed accuracy requirements, transitioning to precise measurement as separation decreases, corresponding to scenarios (a) and (b) depicted in Figure 9, respectively.

The present work concentrates exclusively on the onboard sensing phase (b), where a camera establishes correspondences between environmental features and their image plane projections. Incorporating passive markers on the package surface significantly enhances both image capture and processing performance by supplying the detection algorithm with predefined reference points. This approach, termed a fiducial marker system, enables parcel pose estimation relative to a monocular camera with low computational cost, substantial robustness, and rapid processing.

3.1. ArUco Marker System

Fiducial marker systems operate using predefined marker patterns and algorithms that execute detection, error correction, and pose estimation. Among the various implementations available, refs. [14,28] present a computationally efficient and robust square fiducial marker approach employing binary encoding, with capabilities for detecting and estimating poses of individual markers or marker arrays. The ArUco library provides an open-source implementation of this methodology, where the marker generation occurs offline through an optimization algorithm that maximizes inter-marker distance and bit transition count, with each marker assigned a unique identifier and stored in a dictionary. Marker detection within images is executed through the ArUco library function detectMarkers().

ArUco tags can be deployed either individually or in collective arrangements, where the latter may be distributed across planar surfaces (termed boards) or three-dimensional structures. Three-dimensional marker arrangements prove particularly suitable for package applications, as they provide redundancy to compensate for marker occlusion or partial visibility, enabling detection from arbitrary viewing angles. Constructing a 3D marker structure requires specifying the spatial coordinates of each marker corner, assigning individual marker identifiers, and selecting the appropriate dictionary. Once these parameters are defined, the ArUco library function Board_create() generates the corresponding 3D structure object.

The package reference frame

P

is established based on the marker configuration geometry, as illustrated in Figure 10, with its origin located at the package centroid and axes

x_{P}

,

y_{P}

and

z_{P}

aligned with the box length, width, and height, respectively.

Determining the pose of an ArUco marker structure requires the camera’s intrinsic matrix

C_{I}

and distortion coefficient vector

D_{c f}

, which are camera-specific and obtained through calibration procedures. Providing these camera parameters together with the corner coordinates of each of the

N_{d}

detected markers, their corresponding ids, and the predefined 3D marker geometry to the ArUco library function estimatePoseBoard() yields the package pose relative to the camera parameterized by a vector

r_{v} \in R^{3}

representing the rotation and a vector

t_{v} \in R^{3}

the translation, where the latter corresponds to the position of

P

expressed in

C

, denoted by

{}^{C}p_{P}

.

The vector

r_{v}

representing the rotation can be converted to the rotation matrix representing the relative attitude of

P

as seen by

C

, via the Rodrigues’ rotation formula

{}_{P}^{C}R = I + S ({\bar{r}}_{v}) sin (α) + S^{2} ({\bar{r}}_{v}) (1 - cos (α))

(9)

where the rotation angle is

α = ∥ r_{v} ∥

, the rotation axis is

{\bar{r}}_{v} = r_{v} / ∥ r_{v} ∥

, I represents the

3 \times 3

identity matrix, and

S (a)

denotes the skew-symmetric matrix that defines the cross product as

S (a) b = a \times b

, where

a, b \in R^{3}

.

3.2. Drone-with-Gripper Perception of Package Pose

The camera frame

C

is mounted beneath the vehicle frame

B

to maximize parcel visibility while minimizing obstructions during grasping operations. Specifically,

C

is offset from the

B

origin by a fixed displacement

l_{o} = [l_{o}^{x}, l_{o}^{y}, 0]

and rotated about the

y_{B}

axis by angle

θ_{o}

. The transformation relating the camera frame

C

to the vehicle body frame

B

is then expressed as

{}_{C}^{B}R = [\begin{matrix} cos (θ_{o}) & 0 & - sin (θ_{o}) \\ 0 & 1 & 0 \\ sin (θ_{o}) & 0 & - cos (θ_{o}) \end{matrix}],

(10)

with

{}^{B}p_{C} = - l_{o}

. As such, the complete transformation from

P

to

W

is given by

{}^{W}p_{P} = {}^{W}p_{B} - {}_{B}^{W}R ({}^{B}p_{C} + {}_{C}^{B}R {}^{C}p_{P}),

(11)

considering the parcel orientation in frame

W

given by

{}_{P}^{W}R = {}_{B}^{W}R {}_{C}^{B}R {}_{P}^{C}R

.

3.3. ArUco Pose Estimation Evaluation

To evaluate the quality of the discussed method and its applicability in our proposed scenario, a group of tests were made, resembling the expected working conditions. The camera used for these tests was a C290 (by Logitech International S.A., Lausanne, Switzerland) with a stated image resolution of 800 × 600. The sample rate of the pose estimation algorithm is strongly dependent on the camera frame rate and on the computer processing capabilities, which in this case were both able to properly function at 30 Hz.

To better assess algorithm performance and interpret results, outputs are presented as camera pose relative to the parcel frame,

{}^{P}p_{C}

. Assuming horizontal parcel placement,

P

coincides with

W

, allowing direct interpretation as ground distances. A first test involved horizontal approach along the x axis, a second examined z axis motion, where non-perpendicular observation increases motion blur susceptibility, and a third test the rotation about the z axis was tested. These tests are depicted in Figure 11, along with a picture of the testing environment.

While no groundthrough was available, qualitative assessment confirms adequate accuracy of the pose estimation strategy with minimal noise, attributed to the perpendicular observation angle reducing motion blur. It is also noticeable that z axis motion reveals increased noise, though remaining acceptable at approximately 1 cm magnitude, which can be mitigated throught simple filtering techniques.

4. Hybrid Grasping Model Predictive Control

This section develops dynamic models for both the vehicle and gripper subsystems, subsequently integrating them into a unified hybrid model capable of representing multiple operational scenarios and modes. The objective is to formulate a Hybrid Model Predictive Controller (HMPC) that enforces all critical constraints with minimal deviation. Figure 12 illustrates the control architecture employed in this work, wherein the HMPC computes reference signals for both the gripper and vehicle controllers.

4.1. Drone Dynamics

Developing a fully autonomous grasping system for aerial parcel delivery necessitates establishing appropriate quadrotor dynamics and control models, for which the approach followed in [29] is adopted. The rotation matrix of the vehicle body frame

B

relative to the world frame

W

, denoted as

{}_{B}^{W}R

or simply as R, can be parameterized using, for instance, the ZYX Euler angles

ϕ

(roll),

θ

(pitch), and

ψ

(yaw), respectively. This rotation matrix can also be recovered by composing three simple rotations based on the Euler angles, according to the ZYX sequence. The angular velocity of

B

relative to

W

expressed in

B

is denoted as

ω \in R^{3}

, which can also be related to the time derivatives of Euler angles through an appropriate transformation. Additionally, the position of the origin of

B

relative to

W

is denoted by

p \in R^{3}

, whereas its linear velocity is

v \in R^{3}

. Thus, the kinematics and dynamic differential equations that describe the motion of the vehicle can be written as

\begin{matrix} \dot{p} & = v \end{matrix}

(12)

\begin{matrix} \dot{v} & = - g z_{W} + \frac{u_{1}}{m} z_{B} \end{matrix}

(13)

\begin{matrix} \dot{R} & = R S {(ω)}_{B} \end{matrix}

(14)

\begin{matrix} \dot{ω} & = J^{- 1} (- S (ω) J ω + u_{τ}) \end{matrix}

(15)

where m is the mass of the vehicle, g is the gravitational acceleration, J is the inertia matrix of the vehicle,

u_{1}

is the total thrust generated by the rotors, and

u_{τ} = {[u_{2} u_{3} u_{4}]}^{T}

is the vector of moments applied to the vehicle in its body frame. Also, vectors

z_{w}

and

z_{B}

are the z axis in

W

and

B

, respectively. These two vectors are related by

z_{B} = R z_{w}

. An appropriate control law, capable of providing the motor input vector u that is able to follow a desired trajectory based on position, velocity, and orientation references is thoroughly described in [29].

Considering a hierarchical control strategy, a drone might use several control loops that account, progressively, local control laws for angular velocity, attitude, linear velocity, and position. As typical autopilots provide such inner-loop control laws, a high-level controller such as the one considered here can assume that, if a velocity reference is computed, the autopilot inner loops can easily follow that reference. Thus, the high-level drone model used for integrated guidance and control can be greatly simplified, simply considering

\dot{p} = v

and

\dot{ψ} = ω_{z}

, where the inputs are now considered to be the drone linear velocity,

v

, and the angular rate about the z axis,

ω_{z}

. A discrete-time version of this model can also be defined, considering normalized velocity and yaw-rate inputs,

u_{v} \in {[- 1, 1]}^{3}

and

u_{ψ} \in [- 1, 1]

, respectively, as well as one sample time delay,

T_{s}

, on the velocity input, yielding

\begin{matrix} p (k + 1) & = p (k) + T_{s} C_{v} u_{v} (k - 1) \end{matrix}

(16)

\begin{matrix} ψ (k + 1) & = ψ (k) + T_{s} C_{ψ} u_{ψ} (k) \end{matrix}

(17)

where

C_{v}

and

C_{ψ}

are constant parameters. Considering the drone state vector

x_{d} (k) = {[p (k) u_{v} (k - 1) ψ (k)]}^{T}

and respective input vector

u_{d} (k) = {[u_{v} (k) u_{ψ} (k)]}^{T}

, this model can be rewritten as

\begin{matrix} x_{d} (k + 1) & = A_{d} x_{d} (k) + B_{d} u_{d} (k) \end{matrix}

(18)

where

\begin{matrix} A_{d} & = [\begin{matrix} I & T_{s} C_{v} I & 0_{3 \times 1} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 1} \\ 0_{1 \times 3} & 0_{1 \times 3} & 1 \end{matrix}] & B_{d} & = [\begin{matrix} 0_{3 \times 3} & 0_{3 \times 1} \\ I & 0_{3 \times 1} \\ 0_{1 \times 3} & T_{s} C_{ψ} \end{matrix}] \end{matrix}

(19)

4.2. Dynamic Model of the Gripper

The only actuator in the gripper system is a servo motor modified to be able to have an infinite rotation span, controlled by an Arduino microcontroller. This modification removes the original position feedback capability of a servo motor, but enables its position control.

To model the motor dynamics, simple identification tools where used and a 2nd-order discrete-time system is found to be sufficiently accurate, relating the motor angular velocity

ω_{m} \in R

with a normalized input

u_{m} \in [- 1, 1]

, yielding

ω_{m} (k + 1) = - a_{1} ω_{m} (k) - a_{2} ω_{m} (k - 1) + b_{1} u_{m} (k) + b_{2} u_{m} (k - 1)

(20)

where

a_{1}

,

a_{2}

,

b_{1}

, and

b_{2}

are constant model coefficients. The gripper arm angular velocity is defined as

ω = S_{t o t a l} ω_{m}

, where

S_{t o t a l}

is the combined gear ratio from the motor to the gripper arm, and the angular position of the arm can also be defined as

θ = S_{t o t a l} θ_{m}

, where

θ_{m}

is the angular position of the motor. Thus, considering a sample time

T_{s}

,

A_{i} = S_{t o t a l} a_{i}

, and

B_{i} = S_{t o t a l} b_{i}

, the discrete-time dynamics of the gripper can be defined as

\begin{matrix} θ (k + 1) & = θ (k) + T_{s} ω (k) \end{matrix}

(21)

\begin{matrix} ω (k + 1) & = - A_{1} ω (k) - A_{2} ω (k - 1) + B_{1} u_{m} (k) + B_{2} u_{m} (k - 1) \end{matrix}

(22)

Considering the gripper state vector

x_{g} (k) = {[θ (k) ω (k) ω (k - 1) u_{m} (k - 1)]}^{T}

, this model can be rewritten as

\begin{matrix} x_{g} (k + 1) & = A_{g} x_{g} (k) + B_{g} u_{m} (k) \end{matrix}

(23)

where

\begin{matrix} A_{g} & = [\begin{matrix} 1 & T_{s} & 0 & 0 \\ 0 & - A_{1} & - A_{2} & B_{2} \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}] & B_{g} & = [\begin{matrix} 0 \\ B_{1} \\ 0 \\ 1 \end{matrix}] \end{matrix}

(24)

Based on the geometry illustrated in Figure 13, the jaw separation resulting from motor rotation is given by

d = 2 (a + l cos (θ) - b)

, where l represents the gripper arm length, while a and b denote geometric parameters defining the spacing between gripper arm rotation axes.

Consequently, for a parcel of width

d_{r e f}

, the required gripper arm angle is

θ_{r e f} = arccos (\frac{\frac{d_{r e f}}{2} - a + b}{l}) .

(25)

The actual limitations of the arms angle is constrained by the geometry of the parts, which considering a zero parcel dimension, yields

θ_{m a x} = arccos ((- a + b) / l)

, whereas

θ_{m i n} = 0

due to mechanical limitations.

4.3. Hybrid Model

Given that the system does not consist only of state and input variables representing physical quantities, but also on parts described by logic and discrete evolution, a hybrid model can be formulated. To this end, additional variables are defined in order to better model the system, using a notation where binary variables are represented by a

δ_{v a r} \in {0, 1}^{n_{v a r}}

and continuous variables by a

γ_{v a r} \in R n_{v a r}

. The hybrid model of the drone and gripper system, based on the models introduced above, is described by a set of continuous state variables:

θ, ω \in R

representing the angle and angular velocity of the gripper arm;

p \in R^{3}

and

v \in R^{3}

denoting the drone’s position and velocity in

W

, and

ψ \in R

denoting the drone’s yaw angle. Another important state will be the phase in which the hybrid model of the gripper is, which we can enumerate as

{A, B, C}

, characterizing the current mode of operation. To represent this, a binary vector can be used, such as

δ_{p h a s e} = {[δ_{A} δ_{B} δ_{C}]}^{T} \in {0, 1}^{3}

.

Considering first the continuous state variables and their respective equations defined in (18) and (23), defining the continuous state vector

x_{c} (k) = {[x_{g} {(k)}^{T} x_{d} {(k)}^{T}]}^{T}

and respective input vector

u (k) = {[u_{m} (k) u_{d} {(k)}^{T}]}^{T}

, the following state equation defines the discrete-time drone and gripper continuous dynamics:

\begin{matrix} x_{c} (k + 1) & = A_{c} x_{c} (k) + B_{c} u (k) \end{matrix}

(26)

where

\begin{matrix} A_{c} & = [\begin{matrix} A_{g} & 0_{4 \times 7} \\ 0_{7 \times 3} & A_{d} \end{matrix}] & B_{c} & = [\begin{matrix} B_{g} & 0_{4 \times 4} \\ 0_{4 \times 1} & B_{d} \end{matrix}] \end{matrix}

(27)

The evolution of the variable

δ_{p h a s e}

can be described by the diagram in Figure 14.

Three operational phases are defined to capture both discrete state transitions and continuous dynamics variations within (26). Phase A represents conditions where only the drone motion is affected by the controller while the gripper remains inactive, either fully closed or fully opened, which encompasses approach and transport operations, during which the vehicle navigates toward a designated target location. Phase B spans the interval from initiation of the grasping maneuver until secure package capture is achieved at the pickup location

p_{g r a b} \in R^{3}

, where both the drone motion and the gripper are actively controlled. Phase C governs package release operations at the delivery location

p_{d r o p} \in R^{3}

, which also implies the control of both gripper and drone motions. Transitions from phase A into phases B and C occurs upon entering proximity zones around the respective target locations, characterized by

∥ p_{a} - p ∥ \leq d_{z o n e}

, where

p_{a}

is either

p_{g r a b}

or

p_{d r o p}

and

d_{z o n e}

is a constant parameter. To each stage corresponds a binary variable,

δ_{A}

,

δ_{B}

, or

δ_{C}

, constrained by

δ_{A} + δ_{B} + δ_{C} = 1

, meaning that at any point in time, the system can only be in one of the phases. Concerning the gripper arm rotation span, an auxiliary variable

δ_{c l o s e} \in {0, 1}

, specifying when the gripper jaws are fully closed, is created. It is defined as

δ_{c l o s e} = \{\begin{matrix} 0, & if θ < θ_{c l o s e} \\ 1, & if θ \geq θ_{c l o s e} \end{matrix}

(28)

where

θ_{c l o s e}

is a predefined angle that varies according to the box’s dimensions and can be calculated from (25). An additional binary variable

δ_{f o r c e}

conveys information about reaction forces applied to the gripper arms, indicating whether cargo is actively being held in these operational phases. The gripper state

δ_{g r i p p e r}

, indicating successful parcel acquisition, is determined through logical combinations of these variables and their complements (denoted by the ¬ operator), expressed as

δ_{g r i p p e r} = \{\begin{matrix} 0, & if \neg δ_{f o r c e} \land δ_{o p e n} \\ 1, & if δ_{f o r c e} \land δ_{c l o s e} \end{matrix} .

(29)

Upon successful object capture, the system must immediately transition back to phase A, whereas an identical transition occurs following gripper opening. These constraints are expressed as

δ_{g r i p p e r} (k) \land δ_{B} (k) \Rightarrow δ_{A} (k + 1),

(30)

\neg δ_{g r i p p e r} (k) \land δ_{C} (k) \Rightarrow δ_{A} (k + 1) .

(31)

A final binary auxiliary variable is necessary to indicate if the drone has reached passed its target location, denoted as

δ_{p a s s e d}

.

With state and auxiliary variables established, the hybrid model can be formulated as a Discrete Hybrid Automaton (DHA). Expressing the model as a DHA enhances comprehension and provides rigorous formalization of the relationships among dynamics and constraints. This formulation further enables systematic conversion to alternative representations, such as Mixed Logical Dynamical (MLD) systems, which characterize the system through linear difference equations incorporating both continuous and binary variables alongside linear inequality constraints, making it well-suited for hybrid model representation and optimization-based control implementations. The HYbrid System DEscription Language (HYSDEL), introduced in [30], provides a modeling framework for specifying DHA models, whereas the methodology for converting to MLD systems and associated language constructs is detailed in [24].

4.4. Hybrid Model Predictive Controller

Satisfying the hybrid model’s requirements, constraints, and objectives is most effectively accomplished through Model Predictive Control (MPC), wherein finite-horizon optimal control problems are solved iteratively at each time step. Selecting the prediction horizon,

N_{p}

, necessitates understanding the system’s dominant dynamical behavior, as the controller must anticipate critical operational events with sufficient foresight to enable appropriate corrective actions. Idealy,

N_{p}

should be greater than the number of time steps necessary to fully close the gripper mechanism to its gripping angle

θ_{c l o s e}

. To prevent the need to use exceedingly large values for

N_{p}

, the gripping strategy consists of first moving the gripper to an intermediate closing angle

θ_{p r e}

, after it reaches a predefined safety zone. This way, the final gripping maneuver requires a considerably smaller prediction horizon and, depending on the choice of parameters, for

T_{s} = 0.1

s, the most efficient prediction horizon is between 5 and 7 time intervals.

The optimization problem to be solved in each iteration of the MPC is formulated as the Mixed Integer Quadratic Programing (MIQP) problem:

\begin{matrix} min_{q_{0}} & J_{0} (x (k), q_{0}) \\ s . t . & x (k + 1) & = A x (k) + B_{1} u (k) + B_{2} δ (k) + B_{3} z (k) + B_{5}, \\ y (k) & = C x (k) + D_{1} u (k) + D_{2} δ (k) + D_{3} z (k) + D_{5}, \\ E_{2} δ (k) + E_{3} z (k) & \leq E_{1} u (k) + E_{4} x (k) + E_{5} \end{matrix}

(32)

where the cost function

J_{0}

is given by

J_{0} (x (k), q_{0}) = {∥ x (N) ∥}_{Q_{x_{N}}}^{2} + \sum_{k = 0}^{N - 1} {[∥ y (k) ∥}_{Q_{y}}^{2} + {∥ u (k) ∥}_{Q_{u}}^{2} + {∥ z (k) ∥}_{Q_{z}}^{2} + {∥ x (k) ∥}_{Q_{x}}^{2}]

(33)

with

{∥ x ∥}_{Q}^{2} = x^{T} Q x

,

q_{0}

denoting the vector consisting of the optimization slack variables, the sequence of future values of the inputs u, and the auxiliary and output variables

δ

, z, and y. The cost functional matrix weights,

Q_{x}

,

Q_{y}

,

Q_{u}

,

Q_{z}

, and

Q_{x_{N}}

, can be chosen according to the relative weights of the current state of the system x, the outputs y, the auxiliary variables z, and the state vector at the prediction horizon

x_{N}

, respectively. The Hybrid Toolbox for MATLAB, presented in [31], provides a mechanism to convert the formulation expressed before into a more desirable compact form, accepted by the most common solvers, such as IBM’s CPLEX or MATLAB’s optimization toolbox.

The

Q_{y}

matrix takes advantage of a combination of auxiliary variables to compute a more reliable functional cost. First, it is necessary to define the

y (k)

vector containing the auxiliary variables. Since there are different objectives for each stage, some auxiliary variables are only relevant when the system is at a specific phase. The notation

γ_{x}^{p h} \ δ_{x}^{p h}

is hereby equivalent to

γ_{x} \ δ_{x} \land p h a s e = p h

. The auxiliary variables used to computed the cost of the optimization problem are

\begin{matrix} y (k) = {[\begin{matrix} δ_{\neg c l o s e & p a s s e d}^{B} (k) & γ_{c l o s e}^{B} (k) & γ_{z o n e}^{B} (k) & δ_{m o t i o n}^{B} (k) & γ_{d r o p}^{C} (k) \end{matrix}]}^{T} . \end{matrix}

(34)

Several auxiliary variables contribute to the cost function formulation. The variable

γ_{c l o s e}

yields the remaining distance to the target when the gripper has achieved full closure, while

δ_{\neg c l o s e & p a s s e d}

equals unity when the predicted trajectory includes instances where the vehicle has overshot the target without completing gripper closure. The servo motor command

u_{m}

is provided by

γ_{z o n e}

when the vehicle remains beyond a designated safety radius from the target location. Additionally,

δ_{m o t i o n}

assumes a value of one when the gripper occupies an intermediate configuration rather than a limit position, and

γ_{d r o p}

supplies the servo motor command only prior to the vehicle reaching

p_{g r a b}

.

5. Experimental Results

This section presents the results obtained from both simulations using the developed gripper prototype integrated with the hybrid MPC framework described in Section 4, as well as experimental trials using the parcel perception system outlined in Section 3.

5.1. Hybric MPC Simulation Results

Simulation results were obtained using the hybrid MPC framework described in Section 4, implemented in MATLAB R2018a and using the HYSDEL 3.0 to generate the MIQP problem, which was then solved using IBM CPLEX solver 12.8 (in particular, the solver “cplexmiqp”).

The control parameters were determined through a systematic tuning procedure, initially selecting these based on the system’s physical properties and time constants, the prediction horizon was chosen to balance computational load and performance, whereas the cost function matrices were adjusted iteratively to achieve desired tracking accuracy and control effort trade-offs. The final parameter values are summarized in Table 2.

After defining the model parameters and calibrating the optimization cost weight matrices, some simulation tests were used to assess the overall performance of the system. To provide the hybrid model and MPC with the required information to simulate a scenario with the 3 different phases (altough phase A repeats after phases B and C), it is necessary to define the state references for each phase. Assuming the initial conditions to be

x_{0} = {[0, 0_{1 \times 3}, p_{i n i}^{'}, 0_{1 \times 3}, 0]}^{'}

, a scenario could be devised where the state references given in each phase are as follows:

-: Phase $A_{1}$ : $x_{r e f} = {[0, 0_{1 \times 3}, p_{g r a b}^{'}, 0_{1 \times 3}, ψ_{g r a b}]}^{'}$
-: Phase B: $x_{r e f} = {[θ_{c l o s e}, 0_{1 \times 3}, p_{g r a b}, 0_{1 \times 3}, ψ_{g r a b}]}^{'}$
-: Phase $A_{2}$ : $x_{r e f} = {[θ_{c l o s e}, 0_{1 \times 3}, p_{d r o p}, 0_{1 \times 3}, ψ_{g r a b}]}^{'}$
-: Phase C: $x_{r e f} = {[0, 0_{1 \times 3}, p_{d r o p}, 0_{1 \times 3}, 0]}^{'}$
-: Phase $A_{3}$ : $x_{r e f} = {[0, 0_{1 \times 3}, p_{e n d}, 0_{1 \times 3}, 0]}^{'}$

where

ψ_{g r a b}

is the yaw angle of the parcel at the grasping position to be acquired by the camera.

Figure 15 shows the results of a simulation using a reference structure as the one described above.

The hybrid MPC outer-loop controller will relay the reference values of

ω

to the gripper mechanism as well as v and

ψ

as desired references to the drone inner-loop controllers.

It is possible to observe that the gripper arm fully closes at the exact moment (ii) then it reaches the desired position

p_{g r a b}

. It accomplishes this through the strategy described in Section 4.4, where the gripper arm first rotates to

θ_{p r e}

before fully rotating to

θ_{c l o s e}

. The MPC is able to compute a trajectory that passes through the specified target locations according to the gripper arm angle

θ_{a} r m

. The dropping of the parcel, represented by the opening of the gripper arm, also occurs at the exact expected moment (iii). These gripper arm movements are enforced by the motor inputs

u_{m o t o r}

computed with the MPC. It is also confirmed that the hybrid model phases evolve as expected. The system phases transition according to Figure 14. Phase B, or the grasping motion sequence, happens between moments (i) and (ii) and is enabled when the drone enter the neighborhood of

p_{g r a b}

. Phase C, or the parcel dropping motion sequence, occurs almost instantaneously after moment (iii), when the drone is at

p_{d r o p}

. The simulation results indicate that the hybrid MPC is capable of effectively managing the gripper mechanism’s operation in conjunction with the drone’s movement, ensuring precise timing for grasping and releasing the parcel.

5.2. Grasping Experimental Results

For the experimental trials described hereafter, only the gripper mechanism is actuated, considering the simulated drone dynamics using the model described in Section 4.1 while manually moving the gripper system. The experimental setup that includes the gripper hardware and the HMPC is depicted in Figure 16.

The data acquired from the angular position sensor and force threshold sensor was relayed to the main processing unit by an Arduino microcontroller through serial communication at 100 Hz, whereas the camera module provided the parcel position and yaw angle at 30 Hz, relayed to the HMPC solver through a UDP socket. With this information, the HMPC computes at 10 Hz the motor control input

u_{m}

as well as the desired drone velocities,

v_{r} e f = u_{v}

, and yaw rate,

ω_{z, r e f} = u_{ψ}

, to provide the inner loops for drone motion control and the gripper arm actuation.

Both the yaw angle and position of the parcel are shown in Figure 17 and Figure 18. The drone’s position is given by the distance to the parcel where

p_{g r a b, x} = 1 m

. For this scenario, the gripper arm angle references are defined by

θ_{p r e} = 18^{\circ}

and

θ_{c l o s e} = 28^{\circ}

. This trial is also illustrated in a video (Video S1 in Supplementary Materials).

It is observable that the gripper arm rotates as expected, performing the pre-closing maneuver in order to fully prehend the object in the correct time. The confirmation of a secure parcel prehension comes from the distance computed from the camera in conjunction with the binary variable

δ_{f o r c e}

obtained from the electric current sensor described above.

Figure 18 shows the yaw angle alignment between the gripper and the parcel during the experimental trial, which is drived towards an acceptable bound during the maneuver, ensuring a successful grasp.

Figure 19 presents snapshots taken by the camera module at different moments during the prehension maneuver, also identified as (A, B, C) in Figure 17.

The presented experimental results validate the proposed hybrid MPC framework’s capability to coordinate the gripper mechanism’s operation with the drone’s positioning, ensuring accurate timing for grasping maneuvers based on real-time parcel pose estimation. Nonetheless, it is important to acknowledge that these trials were conducted under controlled conditions, with the drone’s motion simulated and the parcel remaining stationary, and further experimental validation is necessary to assess the system’s performance in dynamic scenarios involving actual drone flight and moving parcels.

6. Conclusions

This paper described the development and experimental assessment of an autonomous grasping system designed for integration with aerial vehicles in parcel delivery and exchange operations. The implementation involved constructing an operational gripper prototype equipped with angular position sensors and force detection capabilities, alongside a pose estimation method for packages, with all components coordinated through a Hybrid Model Predictive Controller that determines optimal vehicle trajectories and gripper actuation commands. Individual subsystems underwent successful testing, and the complete integrated system was validated through both simulated and physical experiments, where package pose was determined via the vision-based estimation algorithm using the onboard camera.

Extensions to agile maneuvers and multi-vehicle scenarios within the hybrid modeling framework represent a potential avenue for further research, as do alternative approaches for hybrid MPC formulation and implementation, given that the HYSDEL formalism has limitations in representing complex nonlinear hybrid dynamics with sufficient fidelity. Additionally, future research could focus on establishing theoretical stability foundations to ensure robust performance across a wider range of operating conditions as well as to use robust MPC techniques for disturbance rejection.

While the presented results provide an initial validation of the proposed approach, further research is necessary to advance toward a fully operational aerial grasping solution. This line of work could conduct extensive experimental trials in real-world scenarios with onboard computation, evaluating the system’s performance under varying environmental conditions and with different parcel types to validate its robustness and adaptability, particularly to moving parcels of increasing agile motions.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/s26020653/s1. Video S1: experimental validation of the proposed techniques.

Author Contributions

B.J.G.: Conceptualization, Supervision, Investigation, Writing—Original draft; F.A.: Software, Validation, Investigation, Writing—Original draft; P.O.: Supervision, Writing—Reviewing and Editing; R.C.: Supervision, Writing—Reviewing and Editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially funded by FCT project REPLACE (PTDC/EEIAUT/32107/2017) which includes Lisboa 2020 and PIDDAC funds, project CAPTURE (PTDC/EEI-AUT/1732/2020), as well as projects CTS (UIDB/00066/2020), LARSYS (UIDB/50009/2020), and LAETA (UIDB/EMS/50022/2020).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Guerreiro, B. REPLACE Project: Fast Delivery in Urban Environments Using Drone RElays: PLAnning, Control, and Estimation. 2022. Available online: http://replace.isr.tecnico.ulisboa.pt/ (accessed on 12 January 2026).
Pinto, J.; Guerreiro, B.J.; Cunha, R. Planning parcel relay manoeuvres for quadrotors. In Proceedings of the 2021 International Conference on Unmanned Aircraft Systems (ICUAS), Athens, Greece, 15–18 June 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 137–145. [Google Scholar] [CrossRef]
Pinto, J.; Guerreiro, B.J.; Cunha, R. Planning Aggressive Drone Manoeuvres: A Geometric Backwards Integration Approach. J. Intell. Robot. Syst. 2025, 111, 16. [Google Scholar] [CrossRef]
Pio Belfiore, N.; Pennestrì, E. An atlas of linkage-type robotic grippers. Mech. Mach. Theory 1997, 32, 811–833. [Google Scholar] [CrossRef]
Monkman, G.J. Robot Grippers; Wiley Online Library: Hoboken, NJ, USA, 2007. [Google Scholar]
Pounds, P.E.; Bersak, D.R.; Dollar, A.M. Grasping from the air: Hovering capture and load stability. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 2491–2498. [Google Scholar] [CrossRef]
Mellinger, D.; Lindsey, Q.; Shomin, M.; Kumar, V. Design, modeling, estimation and control for aerial grasping and manipulation. In Proceedings of the IEEE International Conference on Intelligent Robots and Systems, San Francisco, CA, USA, 25–30 September 2011. [Google Scholar] [CrossRef]
Mellinger, D.; Shomin, M.; Kumar, V. Control of quadrotors for robust perching and landing. In Proceedings of the International Powered Lift Conference, Philadelphia, PA, USA, 5–7 October 2010; pp. 205–225. [Google Scholar]
Ruggiero, F.; Lippiello, V.; Ollero, A. Aerial manipulation: A literature review. IEEE Robot. Autom. Lett. 2018, 3, 1957–1964. [Google Scholar] [CrossRef]
Ollero, A.; Tognon, M.; Suarez, A.; Lee, D.; Franchi, A. Past, Present, and Future of Aerial Robotic Manipulators. IEEE Trans. Robot. 2022, 38, 626–645. [Google Scholar] [CrossRef]
Meng, J.; Buzzatto, J.; Liu, Y.; Liarokapis, M. On Aerial Robots with Grasping and Perching Capabilities: A Comprehensive Review. Front. Robot. AI 2022, 8, 739173. [Google Scholar] [CrossRef] [PubMed]
Gentili, L.; Naldi, R.; Marconi, L. Modeling and control of VTOL UAVs interacting with the environment. In Proceedings of the IEEE Conference on Decision and Control, Cancun, Mexico, 9–11 December 2008; pp. 1231–1236. [Google Scholar] [CrossRef]
Mohta, K.; Kumar, V.; Daniilidis, K. Vision-based control of a quadrotor for perching on lines. In Proceedings of the IEEE International Conference on Robotics and Automation, Hong Kong, China, 31 May–7 June 2014; pp. 3130–3136. [Google Scholar] [CrossRef]
Garrido-Jurado, S.; Muñoz-Salinas, R.; Madrid-Cuevas, F.J.; Marín-Jiménez, M.J. Automatic generation and detection of highly reliable fiducial markers under occlusion. Pattern Recognit. 2014, 47, 2280–2292. [Google Scholar] [CrossRef]
Suarez, A.; Jimenez-Cano, A.E.; Vega, V.M.; Heredia, G.; Rodriguez-Castaño, A.; Ollero, A. Design of a lightweight dual arm system for aerial manipulation. Mechatronics 2018, 50, 30–44. [Google Scholar] [CrossRef]
Hingston, L.; Mace, J.; Buzzatto, J.; Liarokapis, M. Reconfigurable, adaptive, lightweight grasping mechanisms for aerial robotic platforms. In Proceedings of the 2020 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Abu Dhabi, United Arab Emirates, 4–6 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 169–175. [Google Scholar] [CrossRef]
Bo, V.; Franco, L.; Turco, E.; Pozzi, M.; Malvezzi, M.; Prattichizzo, D.; Salvietti, G. Design and Control of Soft-Rigid Grippers for Food Handling. In Proceedings of the ICRA2024 Workshop on Cooking Robotics: Perception and Motion Planning, Yokohama, Japan, 13–17 May 2024; Available online: https://openreview.net/forum?id=EVJbAtEUSf (accessed on 12 January 2026).
Hu, H.; Xie, Z.; Liu, Y.; Liu, Y.; Yang, G.; Xia, J.; Liu, H. A Variable Stiffness Actuation Based Robotic Hand Designed for Interactions. IEEE/ASME Trans. Mechatronics 2024, 29, 249–259. [Google Scholar] [CrossRef]
Saunders, J.; Saeedi, S.; Li, W. Autonomous aerial robotics for package delivery: A technical review. J. Field Robot. 2024, 41, 3–49. [Google Scholar] [CrossRef]
Witsenhausen, H.S. A Class of Hybrid-State Continuous-Time Dynamic Systems. IEEE Trans. Autom. Control 1966, 11, 161–167. [Google Scholar] [CrossRef]
Alur, R.; Courcoubetis, C.; Henzinger, T.A.; Ho, P.H. Hybrid automata: An algorithmic approach to the specification and verification of hybrid systems. In Proceedings of the International Hybrid Systems Workshop; Springer: Berlin/Heidelberg, Germany, 1991; pp. 209–229. [Google Scholar] [CrossRef]
Henzinger, T.A. The Theory of Hybrid Automata. In Verification of Digital and Hybrid Systems; Springer: Berlin/Heidelberg, Germany, 2000; Volume 300, pp. 265–292. [Google Scholar] [CrossRef]
Chai, J.; Casau, P.; Sanfelice, R.G. Analysis and design of event-triggered control algorithms using hybrid systems tools. Int. J. Robust Nonlinear Control 2020, 30, 5936–5965. [Google Scholar] [CrossRef]
Bemporad, A.; Morari, M. Control of systems integrating logic, dynamics, and constraints. Automatica 1999, 35, 407–427. [Google Scholar] [CrossRef]
Ma, R.R.; Odhner, L.U.; Dollar, A.M. A modular, open-source 3D printed underactuated hand. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 2737–2743. [Google Scholar] [CrossRef]
Kruse, L.; Bradley, J. A hybrid, actively compliant manipulator/gripper for aerial manipulation with a multicopter. In Proceedings of the 2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Philadelphia, PA, USA, 6–8 August 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–8. [Google Scholar] [CrossRef]
Zhang, H.; Sun, J.; Zhao, J. Compliant Bistable Gripper for Aerial Perching and Grasping. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; pp. 1248–1253. [Google Scholar] [CrossRef]
Romero-Ramirez, F.J.; Muñoz-Salinas, R.; Medina-Carnicer, R. Speeded up detection of squared fiducial markers. Image Vis. Comput. 2018, 76, 38–47. [Google Scholar] [CrossRef]
Mellinger, D. Trajectory Generation and Control for Quadrotors. Ph.D. Thesis, University of Pennsylvania, Philadelphia, PA, USA, 2012. [Google Scholar]
Torrisi, F.D.; Bemporad, A. HYSDEL—A tool for generating computational hybrid models. IEEE Trans. Control Syst. Technol. 2004, 12, 235–249. [Google Scholar] [CrossRef]
Bemporad, A. Hybrid Toolbox-User’s Guide. 2004. Available online: http://cse.lab.imtlucca.it/~bemporad/hybrid/toolbox (accessed on 12 January 2026).

Figure 1. Forces acting on the object.

Figure 2. Structure of the gripper forces drive chain.

Figure 3. Gear train in one pair of gripper arms.

Figure 4. Mean gripper arm angular velocity measured from 30 different trials with the same step input reference.

Figure 5. Evolution of the gripper arm angle

θ_{a r m}

, from fully open (A), closing (B), fully closed (C), and back to fully open (D).

Figure 5. Evolution of the gripper arm angle

θ_{a r m}

, from fully open (A), closing (B), fully closed (C), and back to fully open (D).

Figure 6. Current sensor circuit diagram. The black dashed line indicates the median current value, whereas the red dashed line represents the threshold established through a calibration procedure.

Figure 7. Current measurements and threshold (red dashed line) for the test showed in Figure 5.

Figure 8. Final gripper prototype as well as the integration with the drone and grasping a parcel.

Figure 9. Combination of offboard (a) and onboard (b) position acquisition.

Figure 10. ArUco 3D structure with

P

coordinate frame.

Figure 10. ArUco 3D structure with

P

coordinate frame.

Figure 11. Pose estimation tests: (a) x axis approach; (b) z axis approach; (c) yaw rotation; (d) testing environment.

Figure 12. Control loop including the hybrid MPC combining the drone and gripper dynamics.

Figure 13. Closed gripper diagram and 3D rendering.

Figure 14. Diagram of the different stages of the hybrid model.

Figure 15. Hybrid MPC simulation. (i) Gripper closes to an intermediate angle

θ_{p r e}

when the drone reaches the safety zone around

p_{g r a b}

; (ii) Gripper fully closes when the drone reaches

p_{g r a b}

; (iii) Gripper opens when the drone reaches

p_{d r o p}

, (green marker).

Figure 15. Hybrid MPC simulation. (i) Gripper closes to an intermediate angle

θ_{p r e}

when the drone reaches the safety zone around

p_{g r a b}

; (ii) Gripper fully closes when the drone reaches

p_{g r a b}

; (iii) Gripper opens when the drone reaches

p_{d r o p}

, (green marker).

Figure 16. Experimental setup of gripper prototype integrated with the HMPC framework.

Figure 17. Hybrid MPC experiment: x axis position and gripper arm angle, considering the three moments, also depicted below in Figure 19: (A) Approaching the parcel; (B) Pre-closing the gripper arm; (C) Grasping the parcel.

Figure 18. Hybrid MPC experiment: drone–parcel alignment measurements.

Figure 19. Hybrid MPC experiment: Camera snapshots from different moments during the prehension maneuver: (A) Approaching the parcel; (B) Pre-closing the gripper arm; (C) Grasping the parcel.

Table 1. Comparison of the proposed gripper with recent works found in the literature.

Gripper	Weight (kg)	Max. Payload (kg)
T Model [25]	0.4	1.5
Dual Arm [15]	1.8	0.75
Adaptive [26]	0.3	0.06
Compliant [27]	0.009	0.06
Proposed	0.25	1.0

Table 2. Hybrid Model simulation parameters.

Parameter	Value
$θ_{c l o s e}$	57.3 deg
$θ_{p r e}$	17.2 deg
$k_{p}$	0.9
$T_{s}$	0.1 s
$N_{p}$	6
$Q_{x}$	$diag ([30, 1, 20, 1, 20, 1, 10, 1, 10, 0, 0])$
$Q_{x_{N}}$	$diag ([30, 1, 20, 1, 50, 1, 10, 1, 10, 0, 0])$
$Q_{y}$	$diag ([10, 100, 1, 50, 10])$
$Q_{u}$	$diag ([10, 1, 1, 1, 1])$
$p_{i n i}$	${[0, 0, 0]}^{'}$
$p_{g r a b}$	${[5, 10, 1]}^{'}$
$p_{d r o p}$	${[8, 11, 1]}^{'}$
$p_{e n d}$	${[13, 16, 2]}^{'}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guerreiro, B.J.; Azevedo, F.; Oliveira, P.; Cunha, R. Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery. Sensors 2026, 26, 653. https://doi.org/10.3390/s26020653

AMA Style

Guerreiro BJ, Azevedo F, Oliveira P, Cunha R. Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery. Sensors. 2026; 26(2):653. https://doi.org/10.3390/s26020653

Chicago/Turabian Style

Guerreiro, Bruno J., Francisco Azevedo, Paulo Oliveira, and Rita Cunha. 2026. "Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery" Sensors 26, no. 2: 653. https://doi.org/10.3390/s26020653

APA Style

Guerreiro, B. J., Azevedo, F., Oliveira, P., & Cunha, R. (2026). Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery. Sensors, 26(2), 653. https://doi.org/10.3390/s26020653

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automatic Grasping System and Hybrid Controller Towards Multi-Drone Parcel Delivery

Abstract

1. Introduction

2. Gripper Design

2.1. Motion Constraints and Prehension

2.2. Typical Forces During Operation

2.3. Power Drive Chain

2.4. Final Prototype and Experimental Assessment

3. Parcel Pose Estimation

3.1. ArUco Marker System

3.2. Drone-with-Gripper Perception of Package Pose

3.3. ArUco Pose Estimation Evaluation

4. Hybrid Grasping Model Predictive Control

4.1. Drone Dynamics

4.2. Dynamic Model of the Gripper

4.3. Hybrid Model

4.4. Hybrid Model Predictive Controller

5. Experimental Results

5.1. Hybric MPC Simulation Results

5.2. Grasping Experimental Results

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI