Next Article in Journal
Deterring Street Crimes Using Aerial Police: Data-Driven Joint Station Deployment and Patrol Path Planning for Policing UAVs
Previous Article in Journal
Test Methodologies for Collision Tolerance, Navigation, and Trajectory-Following Capabilities of Small Unmanned Aerial Systems
Previous Article in Special Issue
Community Drones: A Concept Study on Shared Drone Services
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

UAV-Enabled Video Streaming Architecture for Urban Air Mobility: A 6G-Based Approach Toward Low-Altitude 3D Transportation

1
Department of Information Management, National Taipei University of Nursing and Health Sciences, Taipei 108306, Taiwan
2
Department of Computer Science & Information Engineering, National Dong Hwa University, Hualien 974301, Taiwan
3
Lookout, Inc., Taipei 110207, Taiwan
*
Author to whom correspondence should be addressed.
Drones 2025, 9(6), 448; https://doi.org/10.3390/drones9060448
Submission received: 2 May 2025 / Revised: 10 June 2025 / Accepted: 16 June 2025 / Published: 18 June 2025

Abstract

:
As urban populations expand and congestion intensifies, traditional ground transportation struggles to satisfy escalating mobility demands. Unmanned Electric Vertical Take-Off and Landing (eVTOL) aircraft, as a key enabler of Urban Air Mobility (UAM), leverage low-altitude airspace to alleviate ground traffic while offering environmentally sustainable solutions. However, supporting high bandwidth, real-time video applications, such as Virtual Reality (VR), Augmented Reality (AR), and 360° streaming, remains a major challenge, particularly within bandwidth-constrained metropolitan regions. This study proposes a novel Unmanned Aerial Vehicle (UAV)-enabled video streaming architecture that integrates 6G wireless technologies with intelligent routing strategies across cooperative airborne nodes, including unmanned eVTOLs and High-Altitude Platform Systems (HAPS). By relaying video data from low-congestion ground base stations to high-demand urban zones via autonomous aerial relays, the proposed system enhances spectrum utilization and improves streaming stability. Simulation results validate the framework’s capability to support immersive media applications in next-generation autonomous air mobility systems, aligning with the vision of scalable, resilient 3D transportation infrastructure.

1. Introduction

Urban Air Mobility (UAM) is a networked system that enables the operation of aerial vehicles, such as drones, air taxis, and autonomous aircraft, to provide safe and efficient transportation within urban environments. With growing populations and increasing congestion in major cities, traditional ground transportation infrastructure often struggles to keep pace with rising mobility demands. Electric Vertical Take-Off and Landing (eVTOL), a key component of UAM, offers a promising solution by utilizing urban airspace to reduce traffic congestion and deliver environmentally friendly transportation alternatives [1,2].
eVTOL aircraft, typically operating at altitudes below 1000 m, are increasingly utilized for applications beyond passenger transport, including emergency medical services, cargo delivery, and surveillance [3]. Regulatory bodies such as the European Union Aviation Safety Agency (EASA) anticipate that these vehicles will operate primarily within the 150–300 m altitude range [4], where line-of-sight communication with ground infrastructure remains highly reliable. This environment supports real-time video streaming with low latency and sufficient resolution [5,6], enabling a wide range of ground-oriented multimedia services. Such capabilities are particularly beneficial for urban missions involving passenger infotainment, medical support, and regional monitoring. For instance, medical eVTOLs can serve as airborne communication hubs, allowing onboard personnel to livestream medical procedures to remote specialists. This facilitates real-time expert consultation and critical decision making during transit, thereby significantly enhancing patient care while in flight.
In the context of infotainment, eVTOL passengers are expected to engage in a variety of in-flight activities, such as checking emails, participating in remote meetings, streaming high-resolution live content (e.g., sports events), and using on-demand video platforms like TikTok and YouTube. These applications demand stable, real-time video streaming. Similarly, urban monitoring tasks depend on multi-angle, real-time video feeds to support situational awareness and facilitate rapid response. For passenger entertainment, high-quality content delivery is essential—for instance, 360-degree video [7] offers an immersive, panoramic experience with interactive features, enriching the overall in-flight experience.
Recognizing this potential, many countries are actively exploring how eVTOL technology can help alleviate growing urban transportation challenges. Recent studies highlight increasing interest in the technical and operational dimensions of UAM, particularly the integration of communication, navigation, and control systems. These advancements aim to address critical challenges, such as traffic management, connectivity, aerial infotainment and emergency video delivery, and airspace coordination, in densely populated areas.
However, the growing demand for high-quality video streaming, immersive entertainment, and real-time interactive applications has significantly increased bandwidth requirements. For example, the panoramic nature of 360-degree videos imposes high data transmission demands on UAM networks. Additionally, due to the close viewing distance—often just a few centimeters between the viewer’s eyes and the display—higher spatial resolution is essential to ensure a satisfactory visual experience [8]. Consequently, the proliferation of eVTOLs and video-based applications and services is expected to generate massive amounts of data and network traffic. Future UAM networks must accommodate the need for rapid-response, delay-sensitive, and bandwidth-intensive multimedia transmission, especially in densely populated urban areas.
Cellular networks currently serve as the primary infrastructure for wireless connectivity, largely due to the extensive deployment of terrestrial base stations [9]. However, conventional ground-based cellular systems are not well suited for aerial platforms operating in three-dimensional airspace. Specifically, the downward-facing antennas and earth-bound propagation models of these networks often lead to unreliable coverage and unstable links at higher altitudes [10]. To overcome these challenges, future 6G networks are being designed with several key enhancements, including dynamic beamforming, support for aerial user equipment, and integration with non-terrestrial networks such as satellites and HAPS, thereby offering complementary connectivity for low-altitude UAM applications [11].
In parallel with these mobile network advancements, a range of high-capacity wireless communication technologies is being explored to satisfy the demanding requirements of UAM. Millimeter-wave (mmWave) communication [12], operating between 30 and 300 GHz, offers high data throughput over short distances, making it suitable for localized UAM links. However, it is highly susceptible to blockage and atmospheric absorption. Terahertz (THz) communication (>0.1 THz) extends this concept by providing even greater bandwidth and potential data rates [13], though it faces considerable challenges, including severe propagation loss, hardware complexity, and sensitivity to environmental factors [14].
Free-space optical (FSO) communication is also emerging as a viable solution for aerial connectivity. Utilizing highly directional laser beams, FSO enables secure, high-bandwidth links that are inherently resistant to radio frequency interference. Its advantages include abundant unlicensed spectrum, energy efficiency, and low electromagnetic interference [15]. Nevertheless, FSO performance is contingent on maintaining line of sight and is particularly vulnerable to adverse weather conditions such as fog, snow, and atmospheric turbulence, though it is less affected by rain compared to traditional RF systems [16]. To address these limitations, hybrid FSO/RF systems are under development, employing adaptive switching mechanisms to maintain link robustness in varying atmospheric conditions.
Together, advanced mobile networks, mmWave, THz, and FSO technologies are expected to form a heterogeneous and resilient communication infrastructure tailored for UAM. By leveraging the complementary strengths of these approaches, future wireless systems can enable the high reliability, low latency, and massive connectivity required for safe and efficient UAM operations, particularly in beyond-line-of-sight and dense urban environments [14].
One limitation of the mmWave and THz frequency bands is their high path loss, resulting in relatively shorter transmission distances [17]. To address the limitations of ground-based base stations in providing reliable coverage in three-dimensional spaces, High-Altitude Platform Systems (HAPSs) have emerged as a promising communication infrastructure and key research focus. Advances in solar panel efficiency, lightweight composite materials, autonomous avionics, and antenna technologies have significantly improved the feasibility of HAPS as aerial network nodes [18]. HAPSs are typically deployed in the stratosphere at an altitude of around 20 km, offering several advantages, including wide coverage areas, line-of-sight links, and a fixed position relative to the Earth. Moreover, they have shorter communication delays compared to low-Earth-orbit (LEO) satellites [19,20], making them better suited for supporting multimedia transmission services for eVTOL aircraft passengers.
While advanced communication technologies facilitate video transmission in UAM systems, the encoding process fundamentally governs transmission efficiency for eVTOL passenger applications. Scalable High-Efficiency Video Coding (SHVC) [21,22] encodes video into a multi-layer structure spanning temporal, spatial, and quality dimensions. The base layer provides a basic reconstruction of tiles at the lowest usable quality, while enhancement layers include additional information to incrementally improve the quality of the tiles. To reconstruct a tile at a specific quality level, all preceding enhancement layers, including the base layer, must be decoded. This layered structure allows clients to adaptively select and decode varying numbers of layers based on network conditions and device capabilities, effectively addressing the instability of IP networks and the diversity of devices.
In summary, achieving efficient multimedia transmission in UAM scenarios requires overcoming the limitations of conventional communication technologies by designing a multi-layered communication architecture that adapts to the dynamic characteristics of three-dimensional spaces. Effective transmission strategies must be integrated to optimize performance. However, ground cellular infrastructures, originally designed for terrestrial users, face significant limitations in providing reliable aerial coverage. During network congestion periods such as morning and evening rush hours, high-priority demands may be preempted by lower-priority traffic, exacerbating resource contention.
To address these challenges, this study proposes a novel video streaming architecture that integrates HAPS, eVTOLs, and 6G communication technologies to enhance spatial coverage and resource availability in aerial networks. The research specifically focuses on bandwidth allocation strategies for video streaming services in unmanned eVTOL systems, with the goal of ensuring service continuity and efficient resource utilization under dynamic network load conditions. This work directly tackles these challenges through the following key innovations:
  • We propose a network architecture tailored for video streaming services in unmanned eVTOL aircraft, integrating advanced communication technologies such as mmWave, THz, and FSO. High-Altitude Platform Stations (HAPSs) are incorporated to overcome the coverage limitations of ground base stations, thereby enhancing link reliability and service availability.
  • We design a priority-aware bandwidth allocation strategy that differentiates between critical and non-critical video applications. A cooperative offloading mechanism is introduced, whereby HAPSs and neighboring unmanned eVTOLs assist in relaying video traffic from underutilized ground base stations to improve bandwidth availability in congested regions. Simulations show that this method achieves 100% satisfaction for emergency video tasks and maintains over 60% satisfaction for real-time applications under peak load.
  • We develop an aerial relay and multicast scheduling mechanism based on a service-provider-coordinated multicast tree structure. This scheme significantly improves bandwidth utilization—by 45% compared to the state-of-the-art I-SAC algorithm [23]—by leveraging underutilized eVTOLs and HAPSs to support overloaded links.
  • To enhance video playback quality for non-critical applications, we propose a buffer-aware preloading strategy that caches upcoming video segments as aircraft pass over base stations with available capacity. This approach reduces playback interruptions in bandwidth-constrained areas and improves overall service continuity.
Following this introduction, Section 2 reviews the existing literature related to communication technologies and video streaming in dynamic aerial scenarios. Section 3 details the novel network architecture and adaptive streaming algorithm developed for unmanned eVTOL in-flight video delivery. Section 4 presents the simulation results and their analysis. Finally, Section 5 concludes this study and summarizes key findings.

2. Related Work

Existing research, predominantly centered on Unmanned Aerial Vehicle (UAV) networks, offers foundational insights applicable to UAM, as both domains operate within highly dynamic, three-dimensional environments. For instance, Shon and Lee [24] proposed an optimized decision-support framework for strategic and operational planning in UAM systems. Their model integrates high-level system design, including optimal eVTOL fleet sizing, vertiport infrastructure allocation, and charger deployment, with low-level operational control of individual eVTOL units, managing transitions between service, charging, idling, and repositioning states.
Similarly, Sun et al. [25] introduced a hierarchical architecture that combines terrestrial and aerial computing resources, leveraging UAV mobility to enhance performance. Their joint optimization framework manages resource allocation, computation offloading, and flight trajectory control to maximize overall system utility. Li et al. [26] advanced this approach by incorporating Integrated Sensing and Communication (ISAC) into UAV networks, enabling multiple UAVs to perform radar-based ground user detection while simultaneously supporting communication services.
THz communication technologies have also been explored for their applicability in UAM environments. Liang et al. [27] addressed co-channel interference in UAV-based THz communication systems by formulating a multidimensional resource optimization problem and solving it through a sequential iteration algorithm. Saber et al. [28] developed an analytical model for directional THz links, using the Meijer G-function to derive closed-form expressions for the channel gain’s probability density and cumulative distribution functions. Mustari et al. [29] evaluated a cooperative MAC protocol for THz-based flying ad hoc networks, demonstrating superior performance over direct communication methods. Tun et al. [30] extended this by proposing a joint optimization framework for THz-assisted mobile edge computing, improving both computation offloading and communication efficiency.
Hybrid transmission schemes that integrate RF and FSO links have also received attention. In [31], fog-induced signal attenuation was empirically modeled using real-time visibility data to assess its impact on FSO/RF link performance. The study involved monitoring operational FSO and RF links at 1 GHz, 10 GHz, and 25 GHz under varying atmospheric conditions. Xu et al. [32] proposed an adaptive combining scheme for cooperative FSO/RF communications within Space–Air–Ground Integrated Networks (SAGINs). Their approach dynamically selects the communication mode based on the real-time signal-to-noise ratio (SNR) of the FSO relay. If the SNR surpasses a threshold, only the FSO link operates with an amplify-and-forward protocol; otherwise, both FSO and RF links are used and combined at the receiver via maximal ratio combining to improve reliability. Likewise, Mashiko et al. [33] examined a hybrid FSO/RF SAGIN architecture, where high-capacity but weather-sensitive FSO links are backed by more stable, lower-capacity RF links across satellite–HAPS–ground segments. Their joint control strategy optimized HAPS deployment and coverage using a solution space reduction technique to efficiently determine optimal parameters.
In terms of video transmission, several studies have focused on optimizing streaming performance in UAV networks. Ghazikor et al. [34] developed a decentralized control strategy for video streaming in unlicensed bands, balancing packet loss against video quality. Zhan et al. [35] highlighted how 6G technologies can support adaptive aerial video streaming by responding to environmental and channel variations. Yaqoob et al. [23] introduced an enhanced off-policy soft actor–critic (I-SAC) reinforcement learning algorithm, which optimizes Quality of Experience by jointly considering video playback conditions, UAV flight dynamics, and historical network data.
Despite ongoing advancements, current studies predominantly focus on UAV-based monitoring and fall short of addressing the specific operational and service requirements of passenger-oriented UAM systems. These works typically consider single-use cases in isolation, overlooking the diversity of applications within UAM, each demanding different levels of bandwidth and quality of service. Managing bandwidth across concurrent UAM services, such as real-time flight control, system telemetry, passenger connectivity, and multimedia streaming, remains an unresolved and pressing issue. With the anticipated growth in eVTOL ridership, maintaining sufficient network capacity during periods of high demand is critical. Moreover, although HAPSs hold promise as stable, long-duration communication relays, their role within integrated UAM network architectures has yet to be fully investigated. This review identifies a key gap: the absence of unified, application-aware communication frameworks capable of supporting the complex and bandwidth-intensive service ecosystem of future UAM operations.

3. Materials and Methods

This study proposes a robust transmission architecture for UAM environments, specifically designed to meet the high-bandwidth, low-latency demands of multimedia video services for passengers aboard unmanned eVTOLs. As illustrated in Figure 1, the architecture integrates HAPSs and eVTOLs with 6G-enabled base stations to ensure resilient air-to-ground communication. This integration enables high-speed data transmission, wide-area coverage, and seamless handovers across heterogeneous aerial nodes, ensuring uninterrupted video delivery in dynamic, mobile conditions.
Video streaming services in the system originate from both onboard and external sources. Passenger entertainment, such as YouTube, TikTok, and immersive 360-degree video, is typically downlinked from ground servers. In contrast, mission-critical applications, including aerial monitoring, emergency reporting, and real-time conferencing, require uplink transmission to ground control centers or cloud infrastructure.
The system supports these diverse transmission needs using 6G base stations equipped with mmWave and THz technologies as the primary communication interfaces. In dense urban environments, where signal attenuation is common due to physical obstructions, HAPSs operating at stratospheric altitudes provide additional coverage. These platforms utilize FSO for high-throughput communication and a hybrid FSO/RF downlink to maintain reliability under adverse weather conditions. This dual-layer architecture ensures resilient, low-latency connectivity for next-generation aerial networks.
Regional management servers are distributed across predefined geographic zones to handle mobility and dynamic network conditions. These servers continuously track the positions of eVTOLs and monitor bandwidth availability across connected nodes. Based on real-time metrics, bandwidth is dynamically allocated to optimize spectral efficiency. During periods of congestion, a cooperative relay mechanism is activated, enabling data forwarding through HAPS or neighboring eVTOLs connected to underutilized base stations, as illustrated by the dashed lines in Figure 1.
Efficiency in video transmission is further enhanced by integrating SHVC, which allows for adaptive delivery of video layers based on current network conditions. A multicast tree-based content distribution strategy helps minimize redundancy and ease congestion in both core and edge networks. Additionally, the system selects the edge node nearest to the requesting eVTOL to reduce delivery latency and improve the user experience.
The system supports both planar and 360-degree video formats to meet the growing demand for immersive media. For 360-degree content, a tile-based streaming strategy is employed, where independently encoded tiles are transmitted with higher-quality layers prioritized for the predicted field of view (FoV), while only base layers are delivered to peripheral regions. This selective delivery conserves bandwidth and ensures smooth playback, even when FoV predictions are not fully accurate.
To maintain service quality and bandwidth efficiency, video content is classified into three priority levels: emergency, real time, and non-real time. Emergency applications, such as video feeds from air ambulances or disaster response operations, receive the highest priority, ensuring minimal transmission latency. Real-time services, including live streaming and video conferencing, are assigned secondary priority, with strict but slightly more flexible latency constraints. Non-real-time content, such as video-on-demand (VoD), is scheduled opportunistically during off-peak periods.
As outlined in Table 1, the system comprises five core modules. Upon initiation of a video application by an eVTOL passenger, the “Video Bandwidth Allocation for Unmanned eVTOL Aircraft Passengers” module processes the initial bandwidth request. If the content is time sensitive, the “Critical Video Resource Allocation” module is activated; otherwise, the “Asynchronous Video Preloader” module handles non-urgent content. Lower-priority applications may be required to release bandwidth when higher-priority requests are made. If the available bandwidth remains insufficient, the system accesses underutilized base stations in uncongested areas. Additional resources are then relayed via HAPSs or other unmanned eVTOLs through either the “Critical Video Bandwidth Assistance Unit” module or the “Non-Critical Video Bandwidth Assistance Unit” module, depending on the urgency and bandwidth needs of the application.
The detailed functionality of each module is described below.

3.1. Video Bandwidth Allocation for Unmanned eVTOL Aircraft Passengers

This module is designed to request the required bandwidth for video applications from regional management servers along the flight path of an eVTOL aircraft. To alleviate potential congestion during peak periods, it dynamically adjusts the bandwidth allocated to non-critical video applications as needed, thereby improving overall bandwidth utilization.
Figure 2 presents a flowchart of this module. Prior to departure, the eVTOL must provide its origin, destination, and scheduled departure time. Based on these inputs and current flight conditions, the optimal flight route is assumed to be determined using a representative flight navigation algorithm from the recent literature [36]:
R σ = s 1 σ , s 2 σ , , s j σ , , s F σ 1 σ , s F σ σ ,   1 j F σ
where the route R σ can be divided into multiple sub-routes. s 1 σ and s F σ σ denote the origin and destination of eVTOL σ , respectively, while s j σ denotes the j-th waypoint along the route.
After the eVTOL aircraft passenger launches a video streaming application, the system retrieves the video specifications and multicast tree node information from the video application service provider. The module then determines the waypoints that the eVTOL can reach within the predefined time interval ϰ as follows:
a r g ϖ   M a x r t c ϖ σ r t c 1 σ ,  
subject to:
R σ = c 1 σ , c 2 σ , , c i σ , , c η σ 1 σ , c η σ σ ,   1 i η σ ,
r t c ϖ σ r t c 1 σ M i n ϰ , r t c η σ σ r t c 1 σ ,
where R σ is defined as the complete trajectory between the aircraft’s present position and its final destination, ϖ denotes a waypoint reachable within the time interval ϰ , c i σ refers to the i-th waypoint on R σ and r t c ϖ σ represents the arrival time at waypoint c ϖ σ .
The constraint in Equation (2) governs the selection of the furthest waypoint that the eVTOL can reach along its planned trajectory R σ within the specified time interval ϰ . The objective is to maximize the time span from the aircraft’s current position to the most distant reachable waypoint, as constrained by Equations (3) and (4). Specifically, Equation (4) ensures that the selected waypoint lies within the permissible time window starting from the current time r t c 1 σ , without exceeding the time required to reach the final destination. The use of the minimum function in Equation (4) serves a dual purpose: it prevents the selection of a waypoint beyond the planned trajectory and ensures that only those waypoints reachable within the time interval ϰ are considered.
The module transmits the calculated arrival time, video specifications, and multicast tree node information to the regional management server overseeing the waypoint. For time-sensitive video applications, the system activates the “Critical Video Resource Allocation” module; otherwise, it initiates the “Asynchronous Video Preloader” module. In scenarios where available bandwidth is insufficient to support video application demands, this module dynamically improves the handling of non-critical applications by reducing frame rates or resolutions, thereby lowering bandwidth consumption. The frame rate adjustment is performed as follows:
u f t j v = v f f v 1 i f   u r s u t j v > 0 ,   f v > 1 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v , 1 i < η σ f _ v o t h e r w i s e , ,
d f t j v = v f f v 1 i f   d r s u t j v > 0 ,   f v > 1 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v , 1 i < η σ f _ v o t h e r w i s e , ,
where u f t j v and d f t j v represent the upload and download frame rates for video v , respectively. Similarly, u r s u t j v and d d r s u t j v denote upload and download bandwidth shortages within timeslot t j v between waypoints c i σ and c i + 1 σ , respectively. The variables t 1 v and t E v v indicate the current time and video end time, respectively. The minimum frame rate allowed by the system is denoted as f _ v , while v f f v represents the frame rate corresponding to the quality level f v specified by the aircraft passenger.
The adjustment for resolution is implemented as follows:
u v r t j v = v r q v 1 i f   u r s u t j v > 0 ,   q v > 1 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v , 1 i < η σ r _ v o t h e r w i s e ,
d v r t j v = v r q v 1 i f   d r s u t j v > 0 ,   q v > 1 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v , 1 i < η σ r _ v o t h e r w i s e , ,
where u v r t j v and d v r t j v represent the upload and download resolutions for video v , respectively. The variable r _ v denotes the minimum resolution set by the system, and v r q v is the resolution for quality level q v specified by the aircraft passenger.
The system dynamically modifies video frame rates and resolution to meet bandwidth requirements or until further adjustments become unfeasible. If the demand cannot be satisfied, the system alerts the aircraft passenger that the video application cannot be executed and deactivates the module.

3.2. Critical Video Resource Allocation

This module ensures that sufficient bandwidth is allocated to support emergency and real-time video services for eVTOL aircraft passengers, thereby maintaining a high level of service quality under varying network conditions. During periods of peak traffic and limited bandwidth availability, the module first reallocates bandwidth from non-critical video streams. If additional bandwidth is required, it then engages the “Critical Video Bandwidth Support” module to ensure critical video transmission. Further adjustments involve redistributing bandwidth from other real-time video applications to prioritize critical operations.
The flowchart of this module is illustrated in Figure 3. Upon receiving a bandwidth request from an eVTOL, the system calculates the bandwidth required for video applications during each time slot of the flight, based on the arrival times at each waypoint. It then performs real-time validation of base station capacity to ensure minimum bandwidth thresholds are met for both emergency video streams and real-time operational feeds.
The calculation of upload and download bandwidth surpluses is as follows:
d u b t j v = γ v · θ U t j v θ u f t j v · u r t j v ,   1 j E v ,     r t c i σ t j v r t c i + 1 σ ,
d d b t j v = θ D t j v θ d f t j v · d r t j v ,     1 j E v ,     r t c i σ t j v r t c i + 1 σ ,    
u r t j v = d r t j v = v r q v ,   1 j < E v ,   1 q v q ¯ v ,
u f t j v = d f t j v = v f f v ,   1 j < E v ,   1 f v f ¯ v ,
where the binary flag γ v indicates whether video v is a bidirectional real-time video. Specifically, γ v = 1 for bidirectional video, and γ v = 0 for unidirectional video. U t j v θ represents the upload bandwidth that base station θ can allocate to bidirectional video v when the eVTOL passes through θ ’s transmission range during time slot t j v . Similarly, D t j v θ represents the download bandwidth that θ can allocate to video v . The variables u r t j v and d r t j v denote the upload and download video resolutions for v in time slot t j v , while u f t j v and d f t j v stand for the upload and download frame rates. Additionally, q ¯ v and f ¯ v denote the upper limits for the video v ’s resolution and frame rate, respectively.
The time slot is updated as follows:
t j + 1 v = t j v + ,   1 j < E v ,
where Δ is the length of a time slot. The start and end times of the video application must satisfy:
r t c 1 σ t 1 v t E v v r t c η σ σ ,
which ensures that the video playback duration falls within the eVTOL’s flight time range.
If either d u b t j v or d d b t j v becomes negative, the available bandwidth is insufficient to maintain video v ’s minimum required rate. In this case, the following conditions are applied:
u r s u t j v = r s u c i σ i f   d u b t j v < 0 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v ,   1 i < η σ , 0 o t h e r w i s e ,  
d r s u t j v = r s u c i σ i f   d d b t j v < 0 ,   r t c i σ t j v r t c i + 1 σ ,   1 j E v , 1 i < η σ 0 o t h e r w i s e , ,
where r s u t j v represents the regional management server index value of the sub-route c i σ during time slot t j v . The variables u r s u t j v and d r s u t j v are used to identify areas with insufficient upload or download bandwidth.
At this stage, bandwidth initially allocated to non-critical video applications is reallocated to preserve critical video service quality. This involves assessing whether base stations along the eVTOL’s flight path can free up additional bandwidth from non-essential video streams.
The process determines the total bandwidth available for reallocation as below:
b u b t j v = μ φ μ U t j v φ μ   i f   d u b t j v < 0 0 o t h e r w i s e , ,         1 j E v ,
d u b t j v = d u b t j v + b u b t j v ,
b d b t j σ , v = μ φ μ D t j v φ μ   i f   d d b t j v < 0 , 0 o t h e r w i s e ,   ,         1 j E v ,
d d b t j v = d d b t j v + b d b t j v ,
where μ serves as the index for non-critical video applications, and the base stations tasked with managing bandwidth allocation to critical video μ are represented by φ μ . During time slot t j v , base station φ μ can provide upload bandwidth U t j v φ μ and download bandwidth D t j v φ μ for bidirectional video v . Additionally, b u b t j v and b d b t j v correspond to the upload and download bandwidth reallocated from non-critical video to emergency or real-time video, respectively.
When b u b t j v or b d b t j v is non-zero, it indicates that the base station can reallocate part of its bandwidth initially assigned to non-critical video, directing it to critical video needs. This module will then notify the base station to execute the resource reallocation accordingly. If the reallocated bandwidth is still inadequate to fulfill the demands of critical video, this module queries other regional management servers along the video multicast tree nodes to determine whether the base stations in these regions can provide additional support. The priority for scheduling bandwidth is then determined based on the distance between the nodes and the eVTOL aircraft’s current waypoint.
The upload and download bandwidth scheduling priorities are defined as follows:
U M T v , j = u 1 v , j , u 2 v , j , , u k v , j , , u T v , j 1 v , j , u T v , j v , j ,   1 k < T v , j , 1 j E v , i f   d u b t j v < 0 ,
D M T v , j = d 1 v , j , d 2 v , j , , d k ^ v , j , , d T ^ v , j 1 v , j , d T ^ v , j v , j ,   1 k ^ < T ^ v , j , 1 j E v ,   i f   d d b t j v < 0 ,
where U M T v , j and D M T v , j represent the sequences of multicast tree nodes that can allocate upload and download bandwidth to video v at time t j v , respectively. These sequences are sorted in ascending order of distance between each node and the waypoint c i σ :
u k v , j c i σ u k + 1 v , j c i σ ,   1 k T v , j , 1 j E v , 1 i < η σ ,   i f   d u b t j v < 0 ,
d k ^ v , j c i σ d k ^ + 1 v , j c i σ ,   1 k ^ T ^ v , j , 1 j E v , 1 i < η σ ,   i f   d d b t j v < 0 ,
Based on the sorted sequences, the number of supporting multicast nodes is calculated as follows:
d u b t j v = d u b t j v + a r g   ξ v , j M i n k = 1 ξ v , j ψ u k v , j U t j v ψ u k v , j , d u b t j v , 1 k T v , j , 1 j E v , i f   d u b t j v < 0 ,
d d b t j v = d d b t j v + a r g   ξ ^ v , j M i n k ^ = 1 ξ ^ v , j ψ d k ^ v , j D t j v ψ d k ^ v , j , d d b t j v ,   1 k ^ T ^ v , j , 1 j E v , i f   d d b t j v < 0 ,
where ξ v , j and ξ ^ v , j represent the number of supporting multicast nodes for upload and download bandwidth, respectively. ψ u k v , j denotes the base station in the coverage of node u k v , j , which is obtained from U M T v , j , and U t j v ψ u k v , j represents the upload bandwidth that ψ u k v , j can allocate to bidirectional video v at time t j v . Similarly, ψ d k ^ v , j denotes the base station in the coverage of node d k ^ v , j , which is obtained from D M T v , j , and D t j v ψ d k ^ v , j represents the download bandwidth available.
This module sequentially activates the regional management servers in the specified sequence to trigger the “Critical Video Bandwidth Support” module. The sequences U M T v , j and D M T v , j are updated iteratively as follows, until the bandwidth demand is satisfied or the sequences are empty:
u k v , j = u k + l v , j ,   1 k T v , j ξ v , j , 1 j E v , i f   d u b t j v < 0 ,
d k ^ v , j = d k ^ + l v , j ,   1 k ^ T ^ v , j ξ ^ v , j , 1 j E v ,   i f   d d b t j v < 0 ,
T v , j = T v , j ξ v , j ,   1 j E v , i f   d u b t j v < 0 ,
T ^ v , j = T ^ v , j ξ ^ v , j ,   1 j E v , i f   d d b t j v < 0 .
If the bandwidth demand for the emergency video application still cannot be satisfied, the system reallocates bandwidth from other real-time video applications as follows:
d u b t j v = d u b t j v + M i n μ φ μ U t j v φ μ , d u b t j v , i f   d u b t j v < 0 ,
d d b t j v = d d b t j v + M i n μ φ μ D t j v φ μ , d d b t j v , i f   d d b t j v < 0 .

3.3. Asynchronous Video Preloader

This module ensures the smooth playback of non-critical video while enhancing bandwidth usage efficiency. By employing a pre-download mechanism and a bandwidth scheduling strategy, it effectively facilitates the transmission of video segments. A flowchart of this module is illustrated in Figure 4.
This module begins by setting the index of the most recent un-played video segment ( l p v ) to zero. The remaining pre-download playback time length ( S L v ) is set as S L ¯ v , which represents the upper limit of the pre-download video playback time defined by the system. This module then allocates base station bandwidth to download non-critical video segments until all segments are downloaded or the buffer reaches its capacity. The number of video segments ( p s v ) to be downloaded is dynamically adjusted based on the current state of the playback buffer:
a r g p s v   M i n ϑ D τ s l p v + 1 v + δ ϑ l = l p v + 1 l p v + p s v S S c f r s l v , c b r s l v , c d s l v ,
where s l p v + 1 v represents the video segment v starting from index l p v + 1 ; τ s l p v + 1 v is the time when the segment s l p v + 1 v starts downloading. The parameter δ specifies the duration, in terms of allocated time slots, for uninterrupted downloading. The bandwidth provided by base station ϑ at time τ s l p v + 1 v is given by D τ s l p v + 1 v ϑ , which is used to download the segment s l p v + 1 v . The function S S · calculates the segment size, which is determined by its frame rate c f r s l v , resolution c b r s l v and playback duration c d s l v .
The process must satisfy several constraints. First, the available bandwidth must cover the size of the segments to be downloaded:
ϑ D τ s l p v + 1 v + δ ϑ l = l p v + 1 l p v + p s v S S c f r s l v , c b r s l v , c d s l v ,      
Second, the playback duration of the downloaded segments must meet flight requirements for uninterrupted playback before reaching the destination, ensuring proper playback order and buffer constraints:
l = f p v l p v + p s v c d s l v r t c η σ σ C u r ,     l p v + p s v 1 t s v ,
l = l p v + 1 l p v + p s v c d s l v S L v ,     l p v + p s v 1 t s v ,
c p t s l 1 v v c p t s l v v ,         l p v + 1 l l p v + p s v 1 ,
b u f σ + l = l p v + 1 l p v + p s v S S c f r s l v , c b r s l v , c d s l v b u f ¯ σ ,  
where C u r and r t c η σ σ represent the current time and the time when the eVTOL aircraft reaches its destination, respectively. t s v is the total number of video segments, b u f σ is the remaining buffer size before pre-downloading, and b u f ¯ σ is the total buffer size of the eVTOL aircraft.
After allocating bandwidth, this module updates the following variables:
S L v = S L v l = l p v + 1 l p v + p s v c d s l v
b u f σ = b u f σ + l = l p v + 1 l p v + p s v S S c f r s l v , c b r s l v , c d s l v
l p v = l p v + p s v .
If the video segments are not fully downloaded, the number of additional segments to be downloaded ( p s ~ v ) is calculated:
a r g p s ~ v   M i n l = l p v + 1 l p v + p s ~ v 1 c d s l v S L v ,   l = l p v + 1 l p v + p s ~ v c d s l v > S L v
If the base station cannot provide sufficient bandwidth, the module queries other regional management servers along the video multicast tree nodes to determine if any node can forward at least one segment. The smallest video segment ( l ~ ) is selected:
a r g l ~   M i n S S c f r s l ~ v , c b r s l ~ v , c d s l ~ v ,     l p v + 1 l ~ l p v + p s ~ v 1 ,
The system prioritizes downloading based on node distance and available bandwidth, forming sequences for node bandwidth ( N M T v ) , segment indices ( S M T v ) , and node segment counts ( C M T v ) , ensuring nodes providing the maximum bandwidth are prioritized. The sorting condition is as follows:
ψ d ~ k ~ 1 v D ~ τ s ~ k ~ 1 v + δ ~ d ~ k ~ 1 v ψ d ~ k ~ 1 v ψ d ~ k ~ v D ~ τ s ~ k ~ v + δ ~ d ~ k ~ v ψ d ~ k ~ v ,   1 < k ~ < T ~ v , l p v + 1 s ~ k ~ v l p v + p s ~ v 1 ,  
The total available bandwidth must also satisfy the following condition:
1 k ~ T ~ v δ ~ d ~ k ~ v ψ d ~ k ~ v D ~ τ s ~ k ~ v + δ ~ d ~ k ~ v ψ d ~ k ~ v l = l p v + 1 l p v + p s ~ v S S c f r s l v , c b r s l v , c d s l v ,      
where d ~ k ~ v and s ~ k ~ v represent the k ~ -th node in the sets N M T v and S M T v , respectively. T ~ v is the total number of nodes in the sets N M T v , S M T v and C M T v . ψ d ~ k ~ v refers to the base station within the coverage of node d ~ k ~ v , and D ~ τ s ~ k ~ v + δ ~ d ~ k ~ v ψ d ~ k ~ v represents the bandwidth provided by base station ψ d ~ k ~ v at time τ s ~ k ~ v + δ ~ d ~ k ~ v .
The download process starts from the first segment index l p v + 1 , and the total number of segments to be downloaded ( p s ~ v ) must equal the sum of the segments allocated by all participating nodes:
p s ~ v = 1 k ~ T ~ v n ~ k ~ v ,
where n ~ k ~ v represents the number of video segments that node k ~ in C M T v can provide for download.
Once the multicast node prioritization and bandwidth allocation are complete, this module notifies the regional management servers controlling the nodes to activate the “Non-Critical Video Bandwidth Support” module. This module then allocates bandwidth from its managed base stations and forwards the required bandwidth to the requesting eVTOL aircraft passengers via other eVTOL aircraft and HAPS.
This module terminates when all video segments have been downloaded or when the eVTOL reaches its destination. If neither condition is met, the process is repeated at the next waypoint to ensure continuous playback, after updating the relevant variables accordingly:
p s v = p s ~ v ,
l p v = l p v + p s v 1 .

3.4. Critical Video Bandwidth Assistance Unit

This module dynamically utilizes available bandwidth in non-congested regions to meet the real-time video bandwidth demands of eVTOL aircraft passengers, thereby enhancing overall network performance. As illustrated in Figure 5, upon receiving a bandwidth support notification, the regional management server activates this module to assist in forwarding bandwidth. This module first uses navigation software [37] to plan a route from the base stations within its management area to the eVTOL aircraft’s current position and queries other regional management servers along the route to obtain relevant information about HAPSs and other unmanned VTOL aircraft within their coverage areas. Based on the integrated path and available bandwidth, the server establishes a bandwidth transmission route and selects the upload and download paths using the following optimization criteria:
a r g M v   M i n m = 1 M v ψ r m , 1 v U t j v ψ r m , 1 v + d u b t j v ,     i f   d u b t j v < 0 ,
a r g M ~ v   M i n m = 1 M ~ v ψ r ~ m , 1 v D t j v ψ r ~ m , 1 v + d d b t j v ,     i f   d d b t j v < 0
where M v and M ~ v represent the number of upload and download connections for video v , respectively. r m , i v and r ~ m , i v denote the index positions of the i-th node on the m -th upload and download connection. Other parameters are defined similarly to those used in the previous modules.
To ensure bandwidth requirements are satisfied, the upload and download paths must meet the following constraints:
m = 1 M v ψ r m , 1 v U t j v ψ r m , 1 v + d u b t j v 0 ,         1 j E v , i f   d u b t j v < 0 ,
m = 1 M ~ v ψ r ~ m , 1 v D t j v ψ r ~ m , 1 v + d d b t j v 0 ,   1 j E v , i f   d d b t j v < 0 ,
Additionally, adjacent nodes on each path must have sufficient residual bandwidth to satisfy the demand. For upload and download nodes, the constraints are as follows:
u r b t j v r m , i v , r m , i + 1 v u v b t j v r m , i v , r m , i + 1 v ,       1 i < χ m v ,   1 m M v , 1 j E v ,       i f   d u b t j v < 0 ,  
d r b t j v r ~ m , i v , r ~ m , i + 1 v d v b t j v r ~ m , i v , r ~ m , i + 1 v ,       1 i < χ ~ m v ,   1 m M ~ v , 1 j E v ,   i f   d d b t j v < 0 ,  
where χ m v and χ ~ m v represent the number of upload and download nodes in the m -th ground-to-air connection, respectively. u v b t j v r m , i v , r m , i + 1 v and d v b t j v r ~ m , i v , r ~ m , i + 1 v represent the total bandwidth required for uploading and downloading from r m , i v to r m , i + 1 v at time t j v .
After completing the bandwidth path calculation and bandwidth allocation, this module sends the results back to the regional management server that initiated the request.

3.5. Non-Critical Video Bandwidth Assistance Unit

Similar to the preceding module, this module is designed to facilitate the pre-download of non-critical video segments by establishing enhanced support paths that satisfy the required pre-download bandwidth. As illustrated in Figure 6, upon receiving a bandwidth support request, the regional management server activates this module to compute the path from the base stations to the requesting eVTOL and allocate available bandwidth to complete the pre-download of video segments. This module then queries other regional management servers along the planned route to gather information about HAPSs and other unmanned eVTOL aircraft within their coverage areas, enabling collaborative resource allocation across network regions.
The key distinction between this module and the preceding one lies in its design focus. This module aims to fulfill the size requirements for pre-downloaded video segments by selecting paths that enhance performance based on the following objective function:
a r g M ^ v   M i n m = 1 M ^ v , k ~ ψ r ^ m , 1 v , k ~ D τ s ~ k ~ v + δ ~ d ~ k ~ v ψ r ^ m , 1 v , k ~ l = s ~ k ~ v n ~ k ~ v + s ~ k ~ v 1 S S c f r l , c b r l , c d l ,    
where M ~ v represents the number of connections used to forward video segments, and r ~ m , i v denotes the index of the i -th node in the m -th connection. Other parameters are defined as described in Section 3.3.
To ensure that the selected paths meet the pre-download requirements, the total bandwidth of the paths must be greater than or equal to the total size of the video segments:
m = 1 M ^ v , k ~ ψ r ^ m , 1 v , k ~ D τ s ~ k ~ v + δ ~ d ~ k ~ v ψ r ^ m , 1 v , k ~ l = s ~ k ~ v n ~ k ~ v + s ~ k ~ v 1 S S c f r l , c b r l , c d l
Additionally, the residual bandwidth at each node along the path must be greater than or equal to the bandwidth demand for pre-downloading the video segments:
d r b τ s ~ k ~ v r ^ m , i v , k ~ , r ^ m , i + 1 v , k ~ d v b τ s ~ k ~ v r ^ m , i v , k ~ , r ^ m , i + 1 v , k ~ ,       1 i < χ ^ m v ,   1 m M ~ v , k ~ ,      
where d r b τ s ~ k ~ v r ^ m , i v , k ~ , r ^ m , i + 1 v , k ~ represents the residual bandwidth at time τ s ~ k ~ v from node r ^ m , i v , k ~ to node r ^ m , i + 1 v , k ~ , while d v b t j v r ^ m , i v , k ~ , r ^ m , i + 1 v , k ~ represents the total bandwidth demand for downloading video segments between these nodes. χ ^ m v indicates the number of nodes in the m -th connection.

4. Results and Discussion

This section presents the simulation results and analysis to validate the effectiveness of the proposed strategy. Since 6G and UAM technologies are still in development, real-world data are not yet available. Consequently, an ad hoc simulator was developed for this study to assess the performance of the proposed bandwidth allocation and scheduling approach under varying traffic conditions. The simulator operates at the system level rather than the physical or link layer, abstracting low-level communication protocols and focusing on mission-level traffic modeling, frequency resource limitations, and service prioritization across heterogeneous frequency bands. In the simulation, mmWave and THz ground base stations are modeled as the primary bandwidth sources for eVTOLs, while FSO/RF-based HAPS act as relays. Table 2 summarizes the maximum bandwidth and transmission range associated with each communication technology, including the effective transmission range for THz communications. These parameters are based on established experimental and theoretical studies [38,39,40], providing a realistic basis for simulation. Communication between eVTOLs is also facilitated via mmWave and THz links.
To simulate air traffic, vehicular traffic data provided in [41] were adapted to represent aerial traffic flow. The origins and destinations of eVTOLs were randomly generated, and their video bandwidth requirements are summarized in Table 3 [42,43,44,45]. Notably, video conferencing applications involve bidirectional data transmission, requiring both uplink and downlink bandwidth. And emergency video applications may involve multi-angle video transmission. For rigorous evaluation, this study configures emergency video applications as 360-degree video conferencing, where video quality cannot be degraded to avoid blurriness, which could critically impact the intended outcome.
The simulation assumes optimal wireless channels for eVTOLs, employing 10 s time slots and 1000-millisecond video packets. Every aircraft has storage resources that dwarf the size of cached video material. Using traffic density data from [41], the variation in eVTOL aircraft numbers was simulated over a 24 h period. Figure 7 demonstrates that the eVTOL aircraft numbers follow a bimodal distribution, aligning with the time-dependent characteristics of urban traffic demand patterns.
The number of eVTOLs is lowest during the early-morning hours, rising sharply to a peak between 6:00 AM and 8:00 AM, driven by increased morning commuting and logistics demands. Following this peak, the numbers gradually decline, reflecting off-peak traffic characteristics. In the afternoon, eVTOL activity surges again, reaching the highest daily peak between 6:00 PM and 7:00 PM, corresponding to the evening commuting rush. As nighttime traffic demand diminishes, eVTOL operations decrease significantly, stabilizing below 3000 after midnight.
Figure 8 illustrates the daily variation trends in demand for different types of video content. The proportions and frequencies of video usage are estimated based on website data from [46,47,48,49,50,51,52]. The simulation results reveal distinct time-based distribution patterns for video demand, including morning and evening peaks that align closely with aerial traffic activity. As shown in the figure, demand for 2D video is higher than that for 360-degree video. This preference stems from 2D video’s significantly lower bandwidth requirements and broader compatibility with existing devices and technologies. As a result, under current network bandwidth limitations, users are more likely to choose 2D video. Additionally, the higher demand for VoD can be attributed to its flexibility in playback, enabling users to watch content at their convenience. In contrast, live streaming and video conferencing require participants to be online simultaneously, leading to comparatively lower demand.
Figure 9 shows the bandwidth requirements of an unmanned eVTOL aircraft when performing different types of video tasks in a day. These demands are estimated based on the usage frequency of video applications at various time intervals, with resolution and frame rates randomly selected from Table 3. Emergency video and 360-degree video conferencing consistently exhibit high bandwidth demand due to their bidirectional nature, requiring users to simultaneously upload and download video streams. Additionally, 360-degree video demands substantial bandwidth to deliver an immersive, panoramic viewing experience, a trend also observed in live streaming scenarios. In contrast, 2D VoD ranks third in bandwidth demand, driven by its widespread usage. For clarity, subsequent figures will categorize the data by video application types, including emergency video applications, real-time video applications (2D/360-degree video streaming, interactive video session), and non-critical applications (2D/360-degree VoD).
To evaluate the effectiveness of the proposed framework, a state-of-the-art aerial video streaming algorithm from the recent literature [23], referred to as I-SAC, was selected for comparison. Figure 10a,b illustrate the bandwidth allocation and corresponding satisfaction ratios for various application types over a 24 h period using the I-SAC algorithm. As shown in Figure 11b, the satisfaction ratio reaches 100% during nighttime (11:00 PM to 6:00 AM), indicating that demand remains within manageable limits. However, during daytime hours, when bandwidth is limited, competition among video applications becomes evident. During both morning and evening peaks, Figure 10a reveals that the bandwidth allocation for emergency video applications is notably insufficient. In some instances, allocation for emergency applications even falls below that for non-real-time applications, with satisfaction ratios dropping to below 20%. This shortfall could hinder critical tasks, potentially leading to severe consequences.
Although real-time video applications are allocated a relatively larger share of bandwidth, they still suffer from competition with non-real-time applications, leading to suboptimal resource allocation. This issue is highlighted in Figure 10b, where non-real-time applications achieve higher satisfaction ratios than both emergency and real-time categories. These findings indicate that the I-SAC algorithm does not effectively prioritize video applications based on their importance and timeliness.
In comparison, as illustrated in Figure 11a,b, the proposed mechanism focuses on allocating bandwidth to critical video applications under identical conditions. Throughout the day, the allocation for emergency video applications shows significant improvement throughout the day. Notably, during peak hours, the system fully satisfies the bandwidth demands of emergency applications. This demonstrates that the proposed method effectively ensures the reliability, timeliness, and stability of critical services, mitigating the impact of resource competition.
Furthermore, the satisfaction ratio for real-time video applications also improves, demonstrating that the proposed method allocates bandwidth more efficiently and mitigates the impact of competition during high-load periods. While the satisfaction ratio for non-critical video applications decreases during peak hours, this reflects a deliberate trade-off to prioritize critical tasks. However, during off-peak periods, the demands for non-critical video applications are consistently met.
Figure 12 compares the performance of the I-SAC algorithm and the proposed method in meeting video service demands over a 24 h period. A notable performance gap is evident, especially during periods of network congestion. Under the I-SAC algorithm, the video application satisfaction ratio falls below 60% during peak hours of user activity. In contrast, the proposed method employs a video bandwidth support module alongside a non-critical video pre-download module, which effectively capitalizes on available bandwidth during off-peak periods. As a result, the system achieves a 45% enhancement in overall resource utilization.
Additionally, the video bandwidth allocation module dynamically adjusts eVTOL aircraft passengers’ video demands, further reducing bandwidth requirements. Consequently, the proposed method achieves a higher satisfaction ratio, surpassing traditional approaches and ensuring more reliable service delivery.

5. Conclusions

This study presents a network architecture for video streaming transmission in UAM, leveraging advanced communication technologies such as mmWave, THz, and FSO communications to achieve high transmission rates and efficient video delivery. To address the coverage limitations of ground base stations, HAPSs are integrated, offering wide-area coverage and enhancing the communication system’s reliability and availability.
In terms of bandwidth allocation, a priority-oriented mechanism is proposed based on the significance of video application scenarios for eVTOL aircraft passengers. This mechanism ensures that high-priority demands, such as emergency and real-time video applications, are met effectively. During peak traffic times, including morning and evening rush hours, the I-SAC algorithm performs poorly in meeting high-priority demands, as evidenced by satisfaction levels plummeting below 20% in simulations.
In contrast, the proposed method introduces a relay mechanism and a pre-download strategy, improving the utilization of remaining bandwidth in non-congested regions. As a result, the system is able to meet the bandwidth requirements of emergency video applications even during peak hours, significantly enhancing their satisfaction ratios. Bandwidth allocation for real-time video applications also increases substantially, providing strong support for high-priority scenarios. However, due to overall bandwidth limitations, a small portion of non-critical video demands are moderately compromised to ensure the delivery of emergency and real-time applications. These results demonstrate the effectiveness of the proposed mechanism and underscore the rationality of its resource allocation strategy.
Low latency is essential for real-time video services; however, the current model does not impose strict delay constraints, as the primary objective is to explore architectural design and bandwidth scheduling strategies. Latency-related factors will be addressed in future work. Additionally, this study focuses exclusively on application-layer video streaming services, which are functionally decoupled from flight-critical Command and Control (C2) communications. C2 traffic—vital to the safe operation of eVTOL aircraft—demands far stricter latency and reliability guarantees and is, therefore, beyond the scope of the current simulation. Nonetheless, due to its critical importance in unmanned aerial operations, future work will incorporate C2 communications into the simulation framework to enable a more holistic evaluation of bandwidth allocation strategies in UAM networks. Further research may also investigate the use of machine learning techniques to enhance predictive bandwidth management and support adaptive resource allocation in highly dynamic aerial environments.

Author Contributions

Methodology and conceptualization, L.-C.C.; conceptualization, funding acquisition and project administration, C.-J.H.; data curation and software, K.-W.H.; data curation, visualization, Y.-S.C.; and writing, M.-E.J. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge partial financial support from the National Science and Technology Council, Taiwan (Contract No. NSTC 112-2221-E-259-006).

Data Availability Statement

Simulation data may be shared with qualified academic researchers for collaborative purposes, subject to approval by the corresponding author and compliance with institutional data sharing agreements.

Conflicts of Interest

Author Ken-Wen Hu was employed by the company Lookout, Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Xiong, B.K.; Jiang, R.; Wang, K.; Tang, X.; Shang, Y.; Qu, X. Utilizing eVTOL aircraft to alleviate traffic congestion on an arterial road. IEEE Trans. Intell. Transp. Syst. 2025. Early Access. [Google Scholar] [CrossRef]
  2. Sarkar, M.; Yan, X.; Gebru, B.; Nuhu, A.R.; Gupta, K.D.; Vamvoudakis, K.G.; Homaifar, A. A data-driven approach for performance evaluation of autonomous eVTOLs. IEEE Trans. Aerosp. Electron. Syst. 2025, 61, 3626–3641. [Google Scholar] [CrossRef]
  3. Zewde, L.; Raptis, I.A. Conceptualizing UAM: Technologies and methods for safe and efficient urban air transportation. Green Energy Intel. Transp. 2025, in press. [Google Scholar] [CrossRef]
  4. van Egmond, P.; Panchal, P. Integration of UAM Services in Sustainable Mobility and Transport Planning Processes and Other Relevant Policies Including Co-Modality. AiRMOUR Project Deliverable D4.5. 2023. Available online: https://airmour.eu/deliverables/ (accessed on 29 May 2025).
  5. Bertizzolo, L.; Tran, T.X.; Buczek, J.; Balasubramanian, B.; Jana, R.; Zhou, Y.; Melodia, T. Streaming from the air: Enabling drone-sourced video streaming applications on 5G open-RAN architectures. IEEE Trans. Mob. Comput. 2021, 22, 3004–3016. [Google Scholar] [CrossRef]
  6. Fotouhi, A.; Ding, M.; Hassan, M.; Giordano, L.G.; Garcia-Rodriguez, A.; Lin, X. 5G-enabled drone and eVTOL communications for real-time video streaming in urban environments. IEEE Commun. Mag. 2022, 60, 62–68. [Google Scholar]
  7. Li, Z.; Wang, Y.; Liu, Y.; Li, J.; Zhu, P. JUST360: Optimizing 360-degree video streaming systems with joint utility. IEEE Trans. Broadcast. 2024, 70, 468–481. [Google Scholar] [CrossRef]
  8. Zhang, L.; Zhou, H.; Shen, L.; Liu, J.; Cui, L. Towards attention-aware interactive 360-degree video streaming on smartphones. IEEE Netw. 2025. Early Access. [Google Scholar] [CrossRef]
  9. Sinha, P.; Chowdhury, M.M.U.; Guvenc, I.; Matolak, D.W.; Namuduri, K. Wireless connectivity and localization for advanced air mobility services. IEEE Aerosp. Electron. Syst. Mag. 2024, 39, 4–14. [Google Scholar] [CrossRef]
  10. Sengupta, R.; Bulusu, V.; Mballo, C.E.; Onat, E.B.; Cao, S. Urban air mobility research challenges and opportunities. Annu. Rev. Control Robot. Auton. Syst. 2025, 8, 407–431. [Google Scholar] [CrossRef]
  11. Zhang, Y.; Liu, W.; Wang, J. 6G-enabled integrated non-terrestrial and terrestrial networks: A comprehensive overview. IEEE Commun. Mag. 2024, 62, 50–57. [Google Scholar]
  12. Pham, D.A.; Lee, M.; Sarkar, A.; Lim, S. High-gain elevational-scanning multibeam planar SSPP antenna array with large ground-clearance for millimeter-wave UAM applications. IEEE Trans. Veh. Technol. 2023, 73, 5139–5149. [Google Scholar] [CrossRef]
  13. Saeed, A.; Erdem, M.; Gurbuz, O.; Akkas, M.A. THz band drone communications with practical antennas: Performance under realistic mobility and misalignment scenarios. Ad Hoc Networks 2025, 166, 103644. [Google Scholar] [CrossRef]
  14. Xu, F.; Dong, M. Unmanned-aerial-vehicle-assisted secure free space optical transmission in Internet of Things: Intelligent strategy for optimal fairness. Sensors 2024, 24, 8070. [Google Scholar] [CrossRef]
  15. Sun, Q.; Hu, Q.; Wu, Y.; Chen, X.; Zhang, J.; López-Benítez, M. Performance analysis of mixed FSO/RF system for satellite-terrestrial relay network. IEEE Trans. Veh. Technol. 2024, 73, 11378–11393. [Google Scholar] [CrossRef]
  16. Elfikky, A.; Singh, M.; Boghdady, A.I.; Abd El-Mottaleb, S.A.; Mohsan, S.A.H.; Aly, M.H. Spatial diversity-based FSO links under adverse weather conditions: Performance analysis. Opt. Quantum Electron. 2024, 56, 826. [Google Scholar] [CrossRef]
  17. Jain, P.; Gupta, A.; Kumar, N.; Guizani, M. Dynamic and efficient spectrum utilization for 6G with THz, mmWave, and RF band. IEEE Trans. Veh. Technol. 2023, 72, 3264–3273. [Google Scholar] [CrossRef]
  18. Abbasi, O.; Yadav, A.; Yanikomeroglu, H.; Đào, N.D.; Senarath, G.; Zhu, P. HAPS for 6G networks: Potential use cases, open challenges, and possible solutions. IEEE Wirel. Commun. 2024, 31, 324–331. [Google Scholar] [CrossRef]
  19. Jang, G.; Choi, J.P. HAPS altitude optimization for downlink communications: Alleviating the effect of channel elevation angles. IEEE Trans. Aerosp. Electron. Syst. 2025. Early Access. [Google Scholar] [CrossRef]
  20. Bithas, P.S. Distributed HAPS-assisted communications in FSO/RF space-air-ground integrated networks. Opt. Quantum Electron. 2024, 56, 1020. [Google Scholar] [CrossRef]
  21. Wali, I.; Kessentini, A.; Masmoudi, N. CNN-based intra partitioning process for spatial and SNR scalability for SHVC. Multimed. Tools Appl. 2024, 84, 6203–6217. [Google Scholar] [CrossRef]
  22. Wang, S.; Shen, L.; Liu, J. Spatial-temporal inter-layer reference frame generation network for spatial SHVC. IEEE Trans. Multimed. 2023, 25, 3235–3250. [Google Scholar] [CrossRef]
  23. Yaqoob, A.; Yuan, Z.; Muntean, G.M. A UAV-Centric Improved Soft Actor-Critic Algorithm for QoE-Focused Aerial Video Streaming. IEEE Trans. Veh. Technol. 2024, 73, 13498–13512. [Google Scholar] [CrossRef]
  24. Shon, H.; Lee, J. An optimization framework for urban air mobility (UAM) planning and operations. J. Air Transp. Manag. 2025, 124, 102720. [Google Scholar] [CrossRef]
  25. Sun, Z.; Sun, G.; Wu, Q.; He, L.; Liang, S.; Pan, H.; Niyato, D.; Yuen, C.; Leung, V.C. TJCCT: A two-timescale approach for UAV-assisted mobile edge computing. IEEE Trans. Mob. Comput. 2025, 24, 3130–3147. [Google Scholar] [CrossRef]
  26. Li, H.; Xiao, M.; Wang, K.; Kim, D.I.; Debbah, M. Large language model based multi-objective optimization for integrated sensing and communications in UAV networks. IEEE Wirel. Commun. Lett. 2025, 14, 979–983. [Google Scholar] [CrossRef]
  27. Liang, L.; Hui, Y.; Wen, W.; Jia, Y. Interference coordination with multi-dimensional resource optimization for terahertz UAV communications. IEEE Wirel. Commun. Lett. 2024, 13, 10–14. [Google Scholar] [CrossRef]
  28. Saber, M.J.; Hasna, M.; Badarneh, O.S. THz-enabled UAV communications under pointing errors: Tractable statistical channel modeling and security analysis. IEEE Open J. Veh. Technol. 2025, 6, 801–811. [Google Scholar] [CrossRef]
  29. Mustari, N.; Karabulut, M.A.; Shah, A.S.; Tureli, U. Cooperative THz communication for UAVs in 6G and beyond. Green Energy Intell. Transp. 2024, 3, 100131. [Google Scholar] [CrossRef]
  30. Tun, Y.K.; Dán, G.; Park, Y.M.; Hong, C.S. Joint UAV Deployment and Resource Allocation in THz-Assisted MEC-Enabled Integrated Space-Air-Ground Networks. IEEE Trans. Mob. Comput. 2025, 24, 3794–3808. [Google Scholar] [CrossRef]
  31. Hamoudi, Z.A.; Ali, M.A.A. Analysis of hybrid RF/FSO wireless communications links under foggy weather conditions. J. Opt. Commun. 2024, 45 (Suppl. S1), s1359–s1367. [Google Scholar] [CrossRef]
  32. Xu, G.; Xu, M.; Zhang, Q.; Song, Z. Cooperative FSO/RF space-air-ground integrated network system with adaptive combining: A performance analysis. IEEE Trans. Wirel. Commun. 2024, 23, 17279–17293. [Google Scholar] [CrossRef]
  33. Mashiko, K.; Kawamoto, Y.; Kato, N.; Yoshida, K.; Ariyoshi, M. Combined control of coverage area and HAPS deployment in hybrid FSO/RF SAGIN. IEEE Trans. Veh. Technol. 2025. Early Access. [Google Scholar] [CrossRef]
  34. Ghazikor, M.; Roach, K.; Cheung, K.; Hashemi, M. Channel-aware distributed transmission control and video streaming in UAV networks. arXiv 2024, arXiv:2408.01885. [Google Scholar]
  35. Zhan, C.; Hu, H.; Liu, Z.; Wang, J.; Cheng, N.; Mao, S. Aerial video streaming over 3D cellular networks: An environment and channel knowledge map approach. IEEE Trans. Wirel. Commun. 2023, 23, 1432–1446. [Google Scholar] [CrossRef]
  36. Wei, H.; Lou, B.; Zhang, Z.; Liang, B.; Wang, F.Y.; Lv, C. Autonomous navigation for eVTOL: Review and future perspectives. IEEE Trans. Intell. Veh. 2024, 9, 4145–4171. [Google Scholar] [CrossRef]
  37. Wei, Y.; Ye, Q.; Qu, K.; Zhuang, W.; Shen, X. Customized transmission protocol for tile-based 360° VR video streaming over core network slices. IEEE/ACM Trans. Netw. 2025, 33, 340–354. [Google Scholar] [CrossRef]
  38. Akyildiz, I.F.; Han, C.; Hu, Z.; Nie, S.; Jornet, J.M. Terahertz band communication: An old problem revisited and research directions for the next decade. IEEE Trans. Commun. 2022, 70, 4250–4285. [Google Scholar] [CrossRef]
  39. Mustari, N.; Karabulut, M.A.; Shah, A.S.; Ilhan, H.; Tureli, U. RF, mmWave, and THz communication: Applications, opportunities, and challenges. Authorea Prepr. 2024, 35, 40. [Google Scholar]
  40. Samy, R.; Yang, H.C.; Rakia, T.; Alouini, M.S. Reliable terabits feeder link for very high-throughput satellite systems with SAG-FSO transmission. IEEE Wirel. Commun. 2023, 31, 112–116. [Google Scholar] [CrossRef]
  41. New York City Vehicle Flow Statistics. Available online: https://www.nyc.gov/html/dot/html/about/datafeeds.shtml#trafficcounts (accessed on 3 April 2025).
  42. Video Bitrate: How to Choose the Ideal Bitrate for Your Video. Available online: https://riverside.fm/blog/video-bitrate (accessed on 5 April 2025).
  43. Kim, Y.H.; Huh, J.; Jeong, J. Distributed video transcoding system for 8K 360° VR tiled streaming service. In Proceedings of the IEEE 2018 International Conference on Information and Communication Technology Convergence (ICTC), Jeju, Republic of Korea, 17–19 October 2018; pp. 592–595. [Google Scholar]
  44. Lee, J.Y.; Park, S.I.; Kwon, S.; Lim, B.M.; Kim, H.M.; Hur, N.; Pesin, A.; Chevet, J.-C.; Llach, J.; Stein, A.J.; et al. Efficient transmission of multiple broadcasting services using LDM and SHVC. IEEE Trans. Broadcast. 2017, 64, 177–187. [Google Scholar] [CrossRef]
  45. Nguyen, D.; Hung, N.V.; Phong, N.T.; Huong, T.T.; Thang, T.C. Scalable multicast for live 360-degree video streaming over mobile networks. IEEE Access 2022, 10, 38802–38812. [Google Scholar] [CrossRef]
  46. Essential YouTube Statistics You Need to Know in 2024. Available online: https://thesocialshepherd-com.translate.goog/blog/youtube-statistics?_x_tr_sl=en&_x_tr_tl=zh-TW&_x_tr_hl=zh-TW&_x_tr_pto=sc (accessed on 5 April 2025).
  47. TikTok by the Numbers: Stats, Demographics & Fun Facts. Available online: https://www.omnicoreagency.com/tiktok-statistics/ (accessed on 5 April 2025).
  48. Zoom User Statistics: How Many People Use Zoom in 2024? Available online: https://www.searchlogistics.com/learn/statistics/zoom-user-statistics/ (accessed on 5 April 2025).
  49. Virtual Reality Statistics: The Ultimate List in 2024. Available online: https://academyofanimatedart.com/virtual-reality-statistics/ (accessed on 5 April 2025).
  50. Number of Daily Active Users (DAU) of Microsoft Teams Worldwide as of 2023. Available online: https://www.statista.com/statistics/1033742/worldwide-microsoft-teams-daily-and-monthly-users/ (accessed on 5 April 2025).
  51. National Health Service England Ambulance Quality Indicators. Available online: https://www.england.nhs.uk/statistics/statistical-work-areas/ambulance-quality-indicators/ (accessed on 5 April 2025).
  52. Live Streaming—Statistics & Facts. Available online: https://www.statista.com/topics/8906/live-streaming/#topicOverview (accessed on 5 April 2025).
Figure 1. A scenario of video streaming for unmanned eVTOL aircraft passengers.
Figure 1. A scenario of video streaming for unmanned eVTOL aircraft passengers.
Drones 09 00448 g001
Figure 2. Flowchart of the video bandwidth allocation module for unmanned eVTOL aircraft passengers.
Figure 2. Flowchart of the video bandwidth allocation module for unmanned eVTOL aircraft passengers.
Drones 09 00448 g002
Figure 3. Critical video resource allocation flowchart.
Figure 3. Critical video resource allocation flowchart.
Drones 09 00448 g003
Figure 4. Asynchronous video preloader flowchart.
Figure 4. Asynchronous video preloader flowchart.
Drones 09 00448 g004
Figure 5. Flowchart of critical video bandwidth assistance unit.
Figure 5. Flowchart of critical video bandwidth assistance unit.
Drones 09 00448 g005
Figure 6. Flowchart of non-critical video bandwidth assistance unit.
Figure 6. Flowchart of non-critical video bandwidth assistance unit.
Drones 09 00448 g006
Figure 7. Variation in the number of eVTOL aircraft over a 24 h period.
Figure 7. Variation in the number of eVTOL aircraft over a 24 h period.
Drones 09 00448 g007
Figure 8. Distribution of demand for different video types over a 24 h period.
Figure 8. Distribution of demand for different video types over a 24 h period.
Drones 09 00448 g008
Figure 9. Bandwidth demand for different video types over a 24 h period.
Figure 9. Bandwidth demand for different video types over a 24 h period.
Drones 09 00448 g009
Figure 10. Bandwidth allocation results using the I-SAC algorithm. (a) Bandwidth Allocation. (b) Demand Satisfaction Ratio.
Figure 10. Bandwidth allocation results using the I-SAC algorithm. (a) Bandwidth Allocation. (b) Demand Satisfaction Ratio.
Drones 09 00448 g010
Figure 11. Bandwidth allocation results using the proposed mechanism. (a) Bandwidth Allocation. (b) Demand Satisfaction Ratio.
Figure 11. Bandwidth allocation results using the proposed mechanism. (a) Bandwidth Allocation. (b) Demand Satisfaction Ratio.
Drones 09 00448 g011
Figure 12. Satisfaction ratio comparison between I-SAC and the proposed method.
Figure 12. Satisfaction ratio comparison between I-SAC and the proposed method.
Drones 09 00448 g012
Table 1. The role of each module in the proposed system.
Table 1. The role of each module in the proposed system.
Module NameRole
Video Bandwidth Allocation for Unmanned eVTOL Aircraft PassengersInitiates the bandwidth allocation process when a video application request is made.
Critical Video Resource AllocationHandles bandwidth allocation for time-sensitive video applications.
Asynchronous Video PreloaderManages non-urgent video content by pre-downloading data during low network utilization.
Critical Video Bandwidth Assistance UnitRelays additional bandwidth via HAPS or eVTOLs for critical applications facing shortages.
Non-Critical Video Bandwidth Assistance UnitRelays supplementary bandwidth for non-critical applications from underutilized sources.
Table 2. Maximum bandwidth and transmission range for wireless networking solutions.
Table 2. Maximum bandwidth and transmission range for wireless networking solutions.
Wireless Networking
Solutions
Effective Transmission RangePeak Capacity
mmWave100 m20 Gbps
THz500 m240 Gbps
FSOHAPS 20 km height335 Gbps
Table 3. Bandwidth requirements of 2D and 360-degree videos.
Table 3. Bandwidth requirements of 2D and 360-degree videos.
Video TypeVideo Resolution
Frame Rate
Expected
Bandwidth
2D video7680 × 4320 60 fps135–156 Mbps
3840 × 2160 60 fps38–44 Mbps
2560 × 1440 60 fps17 Mbps
1280 × 720 60 fps5 Mbps
360-degree video7680 × 3840 60fps125–309 Mbps
4090 × 4090 60fps70–174 Mbps
2890 × 1920 60fps33–82 Mbps
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, L.-C.; Huang, C.-J.; Cheng, Y.-S.; Hu, K.-W.; Jian, M.-E. UAV-Enabled Video Streaming Architecture for Urban Air Mobility: A 6G-Based Approach Toward Low-Altitude 3D Transportation. Drones 2025, 9, 448. https://doi.org/10.3390/drones9060448

AMA Style

Chen L-C, Huang C-J, Cheng Y-S, Hu K-W, Jian M-E. UAV-Enabled Video Streaming Architecture for Urban Air Mobility: A 6G-Based Approach Toward Low-Altitude 3D Transportation. Drones. 2025; 9(6):448. https://doi.org/10.3390/drones9060448

Chicago/Turabian Style

Chen, Liang-Chun, Chenn-Jung Huang, Yu-Sen Cheng, Ken-Wen Hu, and Mei-En Jian. 2025. "UAV-Enabled Video Streaming Architecture for Urban Air Mobility: A 6G-Based Approach Toward Low-Altitude 3D Transportation" Drones 9, no. 6: 448. https://doi.org/10.3390/drones9060448

APA Style

Chen, L.-C., Huang, C.-J., Cheng, Y.-S., Hu, K.-W., & Jian, M.-E. (2025). UAV-Enabled Video Streaming Architecture for Urban Air Mobility: A 6G-Based Approach Toward Low-Altitude 3D Transportation. Drones, 9(6), 448. https://doi.org/10.3390/drones9060448

Article Metrics

Back to TopTop