Advances in Deep Learning for Drones and Its Applications: 3rd Edition

A special issue of Drones (ISSN 2504-446X).

Deadline for manuscript submissions: 26 October 2026 | Viewed by 775

Editors

Chef Robotics, San Francisco, CA 94103, USA
Interests: UAV; VLA; robot vision; state estimation; deep learning in agriculture (horticulture); reinforcement/imitation learning; agricultural robotics; visual inertial navigation; generative AI
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
Faculty of Aerospace Engineering, Delft University of Technology, 2600 AA Delft, The Netherlands
Interests: learning; planning; active sensing; environmental mapping; informative path planning; robotic decision-making; agricultural robotics
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
Centre for Automation and Robotic Engineering Science, Department of Electrical and Computer Engineering, University of Auckland, Auckland, New Zealand
Interests: agricultural robots; IPT; smart farm; human-robot interaction
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
RovifyLab, Brisbane, QLD 4070, Australia
Interests: visual SLAM; optimization; scene reconstruction

Special Issue Information

Dear Colleagues,

Following the outstanding success and high visibility of the first and second editions of our Special Issue, it is with great pleasure that we announce the launch of the third edition.

Unmanned aerial vehicles (UAVs), particularly vertical takeoff and landing (VTOL) platforms, have firmly transitioned from research laboratories to indispensable tools in real-world aerial operations. Today, they are deployed extensively for infrastructure inspection, precision agriculture, and environmental monitoring. However, the next frontier in field robotics lies in heterogeneous collaboration. Integrating the high-altitude vantage points and agility of UAVs with the heavy payload capacity and endurance of unmanned ground vehicles (UGVs) unlocks unprecedented potential for complex, long-term, and multi-domain missions.

Simultaneously, the landscape of machine learning and computer vision is undergoing a massive transformation. Beyond traditional deep neural networks, we are witnessing the rapid emergence of vision–language–action (VLA) models and foundation models that bridge the gap between high-level semantic reasoning, scene understanding, and low-level robotic control. Furthermore, sophisticated advancements in reinforcement learning (RL) and imitation learning (IL) are empowering aerial and ground robots to dynamically adapt to unstructured environments and learn complex behaviors with unparalleled efficiency.

Based on these cutting-edge technological intersections, there is a rapidly growing interest in leveraging next-generation machine learning techniques to push the boundaries of robotic autonomy, perception, and multi-agent collaboration.

Within this context, we invite researchers to submit original papers to this third edition of our Special Issue, focusing on the latest advances in deep learning, computer vision, and machine learning for drones and general field robotics.

Papers are solicited on all areas directly related to these topics, including but not limited to, the following areas of interest:

  • UAV and UGV Collaborations: Heterogeneous multi-agent coordination, joint perception, and task allocation;
  • Vision–Language–Action (VLA) Models: Applications of foundation models and LLMs/VLMs for aerial and field robotics;
  • Advanced Scene Understanding: 3D semantic mapping, spatial reasoning, and modern computer vision for robotic navigation;
  • Imitation Learning (IL) and Reinforcement Learning (RL): Continuous/discrete control, sim-to-real transfer, and dynamic environment adaptation;
  • Large-Scale Datasets and Benchmarks: Standardized evaluation tools for training and testing deep learning solutions in robotics;
  • Deep Neural Networks (DNNs) for Perception: Object detection, instance segmentation, and semantic classification for autonomous navigation;
  • State Estimation and Dynamic Identification: Utilizing recurrent networks and novel architectures for robust localization;
  • Learning-based Planning and Manipulation: Decision-making, task planning, and physical interaction in cluttered environments;
  • Real-Time Data Analytics: Aerial and ground robots-in-the-loop for time-critical decision-making;
  • Domain-Specific Applications: Deep learning-powered robotics in precision agriculture, industrial inspection, and search-and-rescue;
  • Innovative Design: Novel mechanical and electrical designs optimized for AI-driven aerial and ground vehicles.

Dr. Inkyu Sa
Dr. Marija Popovic
Dr. Ho Seok Ahn
Dr. Chanoh Park
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 250 words) can be sent to the Editorial Office for assessment.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-anonymized peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Drones is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • UAVs
  • aerial robots
  • drones
  • remote sensing
  • deep learning
  • deep neural networks
  • computer vision
  • robotic perception

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Related Special Issue

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Review

59 pages, 1676 KB  
Review
Vision–Language–Action (VLA) Models for Unmanned Aerial Robotics and Bimanual Manipulation: A Review
by Inkyu Sa, Chanoh Park, Hea-Min Lee, Donghee Noh and Ho Seok Ahn
Drones 2026, 10(6), 412; https://doi.org/10.3390/drones10060412 - 26 May 2026
Viewed by 462
Abstract
Vision–Language–Action (VLA) models unify visual perception, natural-language understanding, and action generation within a single foundation model, allowing a robot to follow instructions such as “fold the towel” or “fly to the red building” directly from camera images. Because VLAs inherit world knowledge from [...] Read more.
Vision–Language–Action (VLA) models unify visual perception, natural-language understanding, and action generation within a single foundation model, allowing a robot to follow instructions such as “fold the towel” or “fly to the red building” directly from camera images. Because VLAs inherit world knowledge from internet-scale pre-training, they have become the dominant framework for learning-based manipulation, with bimanual coordination serving as the most demanding testbed: two arms with 7+ degrees of freedom each must move in concert to fold, assemble, and reorient objects. Unmanned aerial robotics faces a structurally similar challenge: a drone must coordinate thrust, attitude, and increasingly gripper commands from visual observations under strict latency and payload constraints. This review covers 183 contributions spanning 2017–2026 and organized along seven dimensions: VLA architectures, training recipes, action representations, bimanual coordination (2022–2026), unmanned aerial vehicle (UAV) navigation and control (2017–2026), language grounding, and cross-cutting concerns including memory and world models. We show that the coordination strategies, training recipes, and action representations developed for bimanual VLAs transfer to unmanned aerial systems and identify fourteen research directions across both domains. Full article
Show Figures

Graphical abstract

Back to TopTop