Entropy
  • Review
  • Open Access

20 April 2025

A Survey on Semantic Communications in Internet of Vehicles

1
School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
2
State Key Laboratory of Space Network and Communications, Department of Electronic Engineering, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China
3
Qualcomm, San Jose, CA 95110, USA
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Semantic Information Theory

Abstract

The Internet of Vehicles (IoV), as the core of intelligent transportation systems, enables comprehensive interconnection between vehicles and their surroundings through multiple communication modes, which is significant for autonomous driving and intelligent traffic management. However, with the emergence of new applications, traditional communication technologies face the problems of scarce spectrum resources and high latency. Semantic communication, which focuses on extracting, transmitting, and recovering useful semantic information from messages, can reduce redundant data transmission, improve spectrum utilization, and provide innovative solutions to communication challenges in the IoV. This paper systematically reviews state-of-the-art semantic communications in the IoV, elaborates the technical background of the IoV and semantic communications, and discusses in depth the key technologies of semantic communications in the IoV, including semantic information extraction, semantic communication architecture, and resource allocation and management. Through specific case studies, it demonstrates that semantic communications can be effectively employed in the scenarios of traffic environment perception and understanding, intelligent driving decision support, IoV service optimization, and intelligent traffic management. Additionally, it analyzes the current challenges and future research directions. This survey reveals that semantic communications have broad application prospects in the IoV, but existing practical problems must be solved by combining advanced technologies in order to promote their wide application in the IoV and contribute to the development of intelligent transportation systems.

1. Introduction

In today’s digital era, the Internet of Vehicles (IoV) [1] and semantic communications [2,3,4], as cutting-edge technologies in the field of intelligent transportation and communication, are gradually changing the way we travel and are transforming traffic management. The IoV realizes real-time sharing and interaction of traffic information through vehicle-to-vehicle (V2V), vehicle-to-infrastructure (V2I), and vehicle-to-pedestrian (V2P) communications. It greatly improves traffic efficiency and safety. Semantic communications, on the other hand, are an emerging communication paradigm that significantly improves the efficiency and reliability of communications by directly transmitting the semantic content of information instead of the traditional sequence of bits.
However, with the continuous emergence of new applications in the IoV, such as autonomous driving [5], vehicle remote monitoring and diagnostics [6], and IoV entertainment services [7], the mobile data traffic between vehicles and roadside units (RSUs), as well as between vehicles, has shown an explosive growth trend. This surge in data volume has extremely stringent demands on the communication infrastructure of the IoV, particularly highlighting the issue of scarce spectrum resources. When facing massive amounts of data, traditional IoV communication technologies often reveal limitations such as limited spectrum resources, high latency, and insufficient capabilities for processing semantic information. These issues significantly restrict the further enhancement of IoV performance and hinder the development of intelligent transportation systems.
Against this background, semantic communication becomes a promising communication technology to address the IoV communication bottleneck. Unlike traditional communication that primarily focuses on the accurate transmission of data, semantic communication concentrates on extracting, transmitting, and recovering the semantic information contained within messages, enabling the receiver to understand the intent and content of the sender. The integration of the IoV and semantic communication can effectively improve the efficiency of data transmission and ensure the accuracy and reliability of information transfer in complex traffic environments. The concealment of semantic communication provides a new approach for privacy protection in the Internet of Vehicles, making information transmission more secure. At the same time, it can flexibly adjust information content and services according to the specific needs and scenarios of users, providing more precise support for applications such as autonomous driving and intelligent traffic management.
This combination also faces some challenges. For example, in dynamic and changeable traffic scenarios, how to accurately understand and process contextual information and how to ensure the efficient decoding and comprehension of information while maintaining its concealment are all issues that require further research. Despite these challenges, the combination of the IoV and semantic communication undoubtedly paves a new way for the development of future intelligent transportation, and its potential and value are worthy of in-depth exploration.
In recent years, researchers have paid attention to the application of semantic communication in the field of the IoV. A substantial amount of research has been conducted on how to integrate the advantages of semantic communication into the IoV. In light of this, this survey provides a comprehensive and systematic review of the current research related to semantic communications in the IoV. We summarize these studies from two perspectives: key technologies and specific applications, as shown in Table 1. To the best of our knowledge, this is the first comprehensive study discussing the IoV based on semantic communication.
The rest of the paper is organized as follows: Section 2 begins with an overview of the architecture and key technologies of the IoV, as well as the fundamental principles, system architecture, and classification of semantic communication, providing the theoretical background for subsequent research. Section 3 delves into the key technologies of semantic communication in the IoV, covering the core technical areas of semantic information extraction, semantic communication architecture, resource allocation and management, and details regarding current research progress. Section 4 demonstrates the significant achievements of semantic communication in practical application scenarios such as traffic environment perception and understanding, intelligent driving decision support, IoV service optimization, and intelligent traffic management through specific case studies, further validating the potential of semantic communication in the IoV. Section 5 provides an in-depth analysis of the current challenges and a prospective outlook on future research directions. Finally, in Section 6, we present the conclusions of this survey. Additionally, the organization and structure of the survey are shown in Figure 1.
Table 1. Studies related to semantic communication in IoV.
Figure 1. Structure of this survey.

3. Key Technologies of Semantic Communication in IoV

Based on the technical background discussed above, the following section will delve into the key technologies of semantic communication in the IoV, including semantic information extraction, communication architecture, and resource allocation, which are critical for realizing efficient and reliable semantic communication.
The IoV aims to build an intelligent transportation ecosystem where vehicles, infrastructure, and people are interconnected. This imposes extremely high demands on communication technologies. In the pursuit of higher transmission efficiency, stronger information comprehension capabilities, and more reliable collaborative interactions, semantic communication has emerged as a breakthrough. It goes beyond the traditional signal transmission level of conventional communication, delving into the dimensions of semantic understanding and knowledge sharing. Compared to the traditional three-layer architecture of the IoV, the semantic communication-based IoV introduces an additional semantic layer to facilitate the acquisition and processing of semantic information. For instance, ref. [106] proposed a four-layer semantic V2X architecture comprising the device layer, network layer, semantic layer, and application layer. The newly added semantic layer extracts key semantic features and compresses redundant information with a semantic encoder and then restores the semantic content via a semantic decoder, ensuring that the receiver accurately understands the sender’s intent. The semantic layer also relies on a shared knowledge base to provide a unified foundation for semantic understanding and achieves collaborative optimization between semantic encoding and channel encoding, as well as semantic decoding and channel decoding. This enhances transmission efficiency, reduces data redundancy, overcomes the limitations of traditional communication, and meets the requirements for efficient communication in complex scenarios [107]. Figure 4 shows the system architecture combining the IoV and semantic communication.
Figure 4. The system architecture combining the IoV and semantic communication.
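As a rough illustration of the role of the semantic layer and its shared knowledge base, the following Python sketch models the knowledge base as a small lookup table held by both sender and receiver, so that only compact concept indices need to cross the channel. The vocabulary, the message format, and the names `KNOWLEDGE_BASE`, `semantic_encode`, and `semantic_decode` are assumptions made for illustration; they are not taken from ref. [106], and a practical semantic encoder would be a learned model rather than a table lookup.

```python
# Toy semantic layer: sender and receiver share the same knowledge base,
# so only compact concept indices cross the channel instead of raw data.
# Hypothetical example; the vocabulary and message format are assumptions.

KNOWLEDGE_BASE = ["pedestrian", "vehicle", "traffic_sign", "obstacle", "clear_road"]
CONCEPT_TO_ID = {concept: i for i, concept in enumerate(KNOWLEDGE_BASE)}


def semantic_encode(observations):
    """Keep only concepts known to the shared knowledge base and drop duplicates."""
    ids = []
    for concept in observations:
        idx = CONCEPT_TO_ID.get(concept)
        if idx is not None and idx not in ids:
            ids.append(idx)
    return ids


def semantic_decode(ids):
    """Map received indices back to concepts using the same knowledge base."""
    return [KNOWLEDGE_BASE[i] for i in ids if 0 <= i < len(KNOWLEDGE_BASE)]


raw = ["vehicle", "vehicle", "tree", "pedestrian", "vehicle"]
sent = semantic_encode(raw)        # redundant and unknown items are removed
received = semantic_decode(sent)   # receiver recovers the intended concepts
```

In this sketch the five raw observations are reduced to two indices, which mirrors, in a very simplified way, how redundancy is removed before channel encoding.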
To comprehensively evaluate the performance of integrated IoV and semantic communication technologies, we propose a multidimensional unified evaluation framework. This framework covers critical metrics such as semantic accuracy and real-time capability, enabling systematic quantitative comparisons across methodologies. These metrics are described as follows:
  • Semantic accuracy: This measures the precision of semantic information extraction and reconstruction. It includes general metrics (e.g., BLEU for text, PSNR/SSIM for images, etc.) and task-specific metrics (e.g., IoU for object detection, F1-score for classification, etc.).
  • End-to-end latency: This is the total time from data generation to application reception (encoding + transmission + decoding). It directly impacts real-time-sensitive scenarios such as V2V collision warnings.
  • Resource efficiency: This includes computational overhead (FLOPs/memory usage) and bandwidth efficiency (data compression ratio), determining feasibility for edge device deployment.
Table 2 summarizes the complete metric definitions and typical application scenarios. Subsequent sections will analyze the strengths and limitations of each technology based on this framework.
Table 2. A summary of three major categories of evaluation metrics.
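To make the metric definitions above concrete, the following Python sketch computes IoU, F1-score, PSNR, and a simple compression ratio using their standard textbook definitions. The function names and the flat-list data layout are assumptions chosen for brevity; they do not correspond to any specific evaluation toolkit used in the surveyed works.

```python
import math


def iou(pred, target):
    """Intersection over Union of two binary masks given as flat 0/1 lists."""
    inter = sum(1 for p, t in zip(pred, target) if p and t)
    union = sum(1 for p, t in zip(pred, target) if p or t)
    return inter / union if union else 1.0


def f1_score(pred, target):
    """F1-score for binary labels: harmonic mean of precision and recall."""
    tp = sum(1 for p, t in zip(pred, target) if p and t)
    fp = sum(1 for p, t in zip(pred, target) if p and not t)
    fn = sum(1 for p, t in zip(pred, target) if not p and t)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 1.0


def psnr(original, reconstructed, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    mse = sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)
    return math.inf if mse == 0 else 10.0 * math.log10(max_value ** 2 / mse)


def compression_ratio(sent_bits, raw_bits):
    """Fraction of the raw payload that is actually transmitted."""
    return sent_bits / raw_bits


pred = [1, 1, 0, 0]
target = [1, 0, 1, 0]
example_iou = iou(pred, target)      # one shared pixel out of three marked pixels
example_f1 = f1_score(pred, target)  # one true positive, one false positive, one miss
```

End-to-end latency is not computed here because it depends on measured encoding, transmission, and decoding times rather than on a closed-form expression.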

3.1. Semantic Information Extraction

Semantic information extraction is the foundation of semantic communication and a critical component for efficient information interaction in the IoV. Vehicles need to extract key semantic information from massive sensor data to support real-time perception and intelligent decision making. The development of semantic information extraction technology has evolved from processing single-sensor data to multimodal data fusion. With the rise of deep learning techniques in recent years, the efficiency and accuracy of semantic information extraction have been significantly improved.
In the IoV domain, research on semantic information extraction predates semantic communication. Early studies focused on extracting useful semantic information from single-sensor data or multimodal data (e.g., vision, radar, LiDAR, etc.) to support vehicle perception and decision making. Initial approaches relied on computer vision or multisensor fusion techniques to extract semantic information about the vehicle’s surrounding environment (e.g., object detection, road conditions, etc.), thereby enhancing the performance of autonomous driving systems.

3.1.1. Single-Modal Extraction

In the semantic communication system of the IoV, computer vision-based methods dominate the processing of single-modal data. Semantic segmentation techniques are widely used to extract key features, and different methods exhibit distinct advantages. Semantic segmentation is a pixel-level image classification task, aiming to assign each pixel in an image to a predefined category. This process usually relies on deep learning models. The process from image input to semantic information extraction mainly includes the following steps. Firstly, the input image needs to be pre-processed, such as resizing the image and normalizing pixel values, so that the model can process it better. Secondly, feature extraction is performed on the image through convolutional neural networks (such as UNet, DeepLab, etc.). These networks extract high-level semantic features of the image step by step through multiple convolutional operations. For example, the UNet network extracts and fuses image features through the downsampling and upsampling processes. Then, based on the extracted features, the network classifies each pixel to determine its category. This step is usually achieved through a classification layer, such as a fully connected layer or a convolutional layer. Finally, after obtaining the preliminary segmentation result, post-processing is required, such as noise reduction and boundary smoothing, to improve the accuracy and consistency of the segmentation result. The process of semantic information extraction for other data, such as text data, is similar to that of image data, generally involving steps such as text pre-processing, lexical analysis, syntactic analysis, and semantic understanding.
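The four steps described above (pre-processing, feature extraction, per-pixel classification, and post-processing) can be sketched in a few lines of Python. This is only a structural illustration: the function `class_scores` is a hand-written stand-in for a trained network such as U-Net or DeepLab, and the three classes, their intensity anchors, and the neighbour-agreement smoothing rule are all assumptions made for the example.

```python
def normalize(image, max_value=255.0):
    """Pre-processing: scale pixel intensities to [0, 1]."""
    return [[px / max_value for px in row] for row in image]


def class_scores(px):
    """Stand-in for CNN features: scores for (background, road, vehicle).
    A real system would use a trained segmentation network here."""
    return [1.0 - abs(px - 0.1), 1.0 - abs(px - 0.5), 1.0 - abs(px - 0.9)]


def classify(image):
    """Per-pixel classification: pick the class with the highest score."""
    labels = []
    for row in image:
        out = []
        for px in row:
            scores = class_scores(px)
            out.append(scores.index(max(scores)))
        labels.append(out)
    return labels


def smooth(labels):
    """Post-processing: relabel a pixel when all of its 4-connected
    neighbours agree on a different label (removes isolated noise)."""
    h, w = len(labels), len(labels[0])
    result = [row[:] for row in labels]
    for y in range(h):
        for x in range(w):
            neigh = [labels[ny][nx]
                     for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1))
                     if 0 <= ny < h and 0 <= nx < w]
            if neigh and all(n == neigh[0] for n in neigh) and neigh[0] != labels[y][x]:
                result[y][x] = neigh[0]
    return result


def segment(image):
    """Full pipeline: pre-process, classify each pixel, then post-process."""
    return smooth(classify(normalize(image)))


image = [
    [128, 128, 128],
    [128, 230, 128],  # one isolated bright pixel, treated as noise
    [128, 128, 128],
]
raw_labels = classify(normalize(image))
mask = segment(image)
```

Before smoothing, the centre pixel is assigned to the "vehicle" class; after smoothing, it is absorbed into the surrounding "road" region, which is the kind of consistency improvement that post-processing is meant to provide.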
In the IoV scenario, semantic information extraction technology can be used to extract key information in the road environment, such as vehicles, pedestrians, and traffic signs. For example, ref. [8] employed a deep convolutional model for the semantic segmentation of aerial images and achieved vehicle detection and classification through pixel-level semantic mapping. Their architecture based on SegNet achieved an Intersection over Union (IoU) of 82.4% and an F1-score of 95.7% in their vehicle segmentation task. However, the segmentation accuracy of its pyramid pooling module for small targets (such as traffic signs) decreased by approximately 15%, indicating that this type of method is more suitable for extracting macroscopic objects. Ref. [9] conducted a comparative study to evaluate the effects of U-Net, Seg-Net, and Fully Convolutional Networks (FCNs) for road segmentation. U-Net achieved a mean Intersection over Union (mIoU) of 94% on the CamVid dataset, while Seg-Net prioritized computational efficiency in real-time applications. Ref. [10] combined semantic segmentation with learned image compression, focusing on the preservation of regions of interest (ROIs). At 0.25 bits per pixel (bpp), their method achieved a peak signal-to-noise ratio (PSNR) of 24.25 decibels (dB) in the ROIs, which is 1.19 dB higher than traditional compression methods. More recently, ref. [12] introduced a multiscale feature extractor based on Swin Transformer, which increased the mIoU by 75% on the Cityscapes dataset compared to traditional methods. Ref. [13] used computer vision algorithms such as RealtimeSeg to extract semantic information from images. Their experimental results show that semantic communication (SemCom) can significantly increase the load supported by the network, reduce latency, and also perform well in terms of energy consumption.
From the above methods, it can be seen that different semantic information extraction methods have their own characteristics and advantages. Semantic segmentation methods based on deep convolutional models perform well in vehicle detection and classification tasks and can accurately extract vehicle semantic information. Road detection methods based on U-Net, Seg-Net, and FCNs have their own advantages in distinguishing between road and non-road areas and are suitable for different application scenarios. Methods based on learned image compression can not only extract semantic information but also effectively reduce the amount of data and improve transmission efficiency, especially in enhancing the image quality within the region of interest. The method based on the Swin Transformer has significant advantages in multiscale feature capture and target recognition accuracy in complex traffic scenarios. The semantic segmentation technology in semantic communication performs well in solving the bandwidth bottleneck problem of IoV image transmission and can significantly improve network performance and energy consumption efficiency.

3.1.2. Multimodal Fusion

Multimodal fusion integrates heterogeneous sensor data (such as LiDAR, cameras, and Global Positioning System—GPS) to enhance the environmental perception ability. The semantic communication framework further optimizes this process through task-aware feature extraction and transmission.
Ref. [11] used PSPNet for semantic segmentation and the Random Sample Consensus (RANSAC) algorithm for geometric feature alignment, fusing visual, GPS, and map data. Their Bird’s-Eye-View (BEV) registration achieved a feature matching accuracy of over 90%, effectively eliminating blind spots. Ref. [14] designed a hybrid Convolutional Neural Network–Graph Neural Network–Long Short-Term Memory Network (CNN-GNN-LSTM) encoder to unify multimodal features (images, point clouds, and radar) into a semantic space. This method maintained an F1-score of 0.93 at an SNR of 0 dB, demonstrating strong noise resistance. Ref. [15] combined Bird’s-Eye-View (BEV) fusion with a Diffusion Model (DM) for noise suppression. At an SNR of 0 dB, the DM-enhanced decoding increased the IoU from 0.65 to 0.80, with further improvements at higher SNR levels.
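To illustrate how RANSAC can align matched feature points in the presence of wrong matches, the following Python sketch estimates a pure 2D translation between two point sets. The translation-only motion model, the threshold, the iteration count, and the function name `ransac_translation` are assumptions made for this example; ref. [11] may use a richer geometric model and different parameters.

```python
import random


def ransac_translation(src, dst, threshold=1.0, iterations=50, seed=0):
    """Estimate a 2D translation mapping src points onto dst points.

    src and dst are equal-length lists of matched (x, y) pairs, some of which
    may be wrong matches. Returns (dx, dy) and the indices of the inliers.
    """
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # Minimal sample: one correspondence fully determines a translation.
        i = rng.randrange(len(src))
        dx, dy = dst[i][0] - src[i][0], dst[i][1] - src[i][1]
        inliers = [k for k, (s, d) in enumerate(zip(src, dst))
                   if abs(s[0] + dx - d[0]) <= threshold
                   and abs(s[1] + dy - d[1]) <= threshold]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    # Refit on all inliers by averaging their offsets.
    n = len(best_inliers)
    dx = sum(dst[k][0] - src[k][0] for k in best_inliers) / n
    dy = sum(dst[k][1] - src[k][1] for k in best_inliers) / n
    return (dx, dy), best_inliers


src = [(0, 0), (1, 0), (0, 1), (2, 2), (3, 1), (5, 5)]
dst = [(3, -2), (4, -2), (3, -1), (5, 0), (6, -1), (40, 40)]  # last match is wrong
offset, inliers = ransac_translation(src, dst)
```

The wrong match is excluded from the final fit because its hypothesis is supported by only one point, whereas the hypothesis from any correct match is supported by five.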
In the field of multimodal data fusion, different methods have their unique advantages and applicable scenarios. Ref. [11] significantly expanded the vehicle’s perception range by integrating multisource data such as visual, GPS, and digital maps, especially excelling in eliminating blind spots with a feature point matching accuracy of over 90%. The semantic communication in [14] focused on efficiently extracting key semantic information from multimodal data through a semantic encoder. It used multiple neural network architectures to process different modal data, not only significantly reducing the amount of data transmission but also maintaining an F1-score as high as 0.93 under low-SNR conditions. The work of [15] integrates multisensor data through technologies such as BEV fusion and uses generative AI technology for noise reduction and enhancement. Its IoU has been significantly improved under different SNR conditions, especially under low-SNR conditions, where the IoU value increased from 0.65 to 0.80, and it can effectively predict future scenarios. Overall, these methods have different emphases in expanding the perception range, reducing data redundancy, enhancing robustness, and improving prediction capabilities, providing diverse solutions for autonomous driving systems.
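The results above are reported at specific SNR values such as 0 dB. As a reminder of what these operating points mean, the following Python sketch converts an SNR in decibels into a noise power and adds white Gaussian noise to a feature vector. The real-valued signal model and the function names are assumptions for illustration; the channel models actually used in refs. [14,15] are not reproduced here.

```python
import random


def noise_variance(snr_db, signal_power=1.0):
    """Noise power that yields the requested SNR (in dB) for a given signal power."""
    return signal_power / (10.0 ** (snr_db / 10.0))


def awgn(signal, snr_db, seed=0):
    """Add white Gaussian noise to a real-valued signal at the given SNR."""
    rng = random.Random(seed)
    power = sum(s * s for s in signal) / len(signal)
    sigma = noise_variance(snr_db, power) ** 0.5
    return [s + rng.gauss(0.0, sigma) for s in signal]


features = [1.0, -1.0, 1.0, 1.0, -1.0, -1.0, 1.0, -1.0]
noisy_0db = awgn(features, snr_db=0.0)    # noise power equals signal power
noisy_20db = awgn(features, snr_db=20.0)  # noise power is 1% of signal power

# Total squared error introduced by the channel at each operating point.
err_0db = sum((a - b) ** 2 for a, b in zip(features, noisy_0db))
err_20db = sum((a - b) ** 2 for a, b in zip(features, noisy_20db))
```

At 0 dB the noise is as strong as the signal itself, which is why maintaining a high F1-score or IoU at this operating point is regarded as evidence of robustness.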
From the current research, semantic information extraction technology shows a trend toward more efficient and accurate models. With the increasing requirements for real-time performance and accuracy in the Internet of Vehicles, future research will pay more attention to the lightweight and fast-processing capabilities of models to meet the real-time processing requirements of a large amount of data during high-speed vehicle driving. At the same time, the accuracy and real-time performance of multimodal data fusion will also be a key research direction. Through more advanced fusion algorithms and technologies, the quality and efficiency of semantic information extraction from multimodal data will be further improved.

3.2. Semantic Communication Architecture

In practical application scenarios of the IoV, such as autonomous driving and intelligent traffic coordination management, the realization of complex functionalities not only relies on accurate semantic information extraction but also imposes stringent requirements on the subsequent transmission and interaction modes of information. The design of semantic communication architectures must meet the demands of the IoV for real-time performance, reliability, and efficiency. This section categorizes and summarizes existing semantic communication architectures in the IoV, highlighting their applications and optimization strategies in different scenarios. Table 3 summarizes the semantic communication architecture in the IoV.
Table 3. Classification and characteristics of semantic communication architectures in IoV.

3.2.1. Multiuser Collaboration and Multitask Driving

In the IoV, multiuser collaboration and multitask driving are two closely related semantic communication scenarios. Multiuser collaboration focuses on semantic information sharing between vehicles, while multitask driving focuses on optimizing the processing and transmission of semantic information for different tasks.
Ref. [16] proposed a multiuser collaborative semantic communication architecture (Co-SC), which comprises components such as semantic codecs (Sem-Codec), joint source–channel codecs (JSC-Codec), and task-related modules. The Co-SC architecture significantly enhances semantic reconstruction performance and recognition accuracy under low-SNR conditions. By leveraging multiuser collaboration, Co-SC effectively exploits the semantic correlation among users, reducing the transmission of redundant information and thereby significantly improving communication efficiency. However, the performance improvement of this architecture is limited under high-SNR conditions, and it has a high dependence on channel state information (CSI), requiring accurate channel estimation. Moreover, the performance of the architecture is highly reliant on the pre-trained knowledge base and offline-trained models, and its adaptability to the dynamically changing vehicular ad hoc network (VANET) environment needs further investigation. Overall, the Co-SC architecture provides an effective solution for semantic communication in VANETs, but a trade-off between computational resources and real-time requirements is necessary in practical deployment.
Ref. [14] presented a unified multiuser semantic communication system that integrates semantic information from multiple users to achieve collaborative processing. This framework significantly reduces the amount of data transmission through semantic encoding. For instance, in the task of object classification, it achieved performance comparable to or even better than traditional methods with a compression rate of only about 8.4%. The F1-score reached 0.9880 under high-SNR conditions and remained above 0.93 under low-SNR conditions, demonstrating good robustness. The system not only supports multitask processing but also ensures semantic understanding and vehicle privacy by constructing shared and private knowledge bases. However, the system complexity is relatively high, requiring substantial computational resources to support joint training of multiple tasks, which may pose a challenge to the real-time processing capabilities of vehicles in VANETs. Moreover, the adaptability of the system under dynamic channel conditions needs further improvement.
Ref. [17] proposed a multitask semantic communication framework for autonomous vehicles. It employs convolutional autoencoders for semantic encoding of traffic sign images and utilizes satellite communication for information sharing among vehicles. The image reconstruction quality is assessed using the structural similarity index measure (SSIM), and the results show that it outperformed the traditional 16-QAM scheme under low-SNR conditions, indicating good performance in semantic similarity. Additionally, the framework significantly improved data transmission efficiency by reducing the amount of transmitted data by up to 89%, indirectly reducing transmission latency. However, the paper does not provide a detailed discussion on computational overhead. Considering its implementation based on deep learning, it may face certain computational resource demands in real-time deployment.
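For readers unfamiliar with SSIM, the following Python sketch evaluates the standard SSIM expression over a single window covering the whole image, together with a simple data-reduction figure. Practical SSIM implementations average the index over local sliding windows, so this global variant is a simplification; the function names and the flat-list image layout are assumptions for this example and do not reflect the evaluation code of ref. [17].

```python
def ssim_global(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equal-length lists of pixel values."""
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    c1 = (0.01 * dynamic_range) ** 2  # stabilizers from the standard SSIM definition
    c2 = (0.03 * dynamic_range) ** 2
    return ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2))


def data_reduction(sent_bytes, raw_bytes):
    """Fraction of the raw payload saved by sending semantic features instead."""
    return 1.0 - sent_bytes / raw_bytes


sign = [0, 255, 0, 255, 128, 128]
inverted = [255, 0, 255, 0, 128, 128]
same = ssim_global(sign, sign)          # identical images
different = ssim_global(sign, inverted) # structure is reversed
saving = data_reduction(11, 100)        # 11 bytes sent per 100 raw bytes
```

An SSIM close to 1 indicates that the reconstruction preserves the luminance, contrast, and structure of the original image, while a reduction of 0.89 corresponds to the "up to 89%" figure quoted above.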
Ref. [18] introduced a semantic-aware hybrid automatic repeat request (SemHARQ) framework for multitask semantic communication in VANETs. This framework significantly enhances the efficiency and robustness of semantic feature transmission through feature importance ranking (FIR) and feature distortion evaluation (FDE) networks. The study demonstrates excellent performance under low-SNR conditions. For example, in the task of vehicle reidentification, the rank-1 accuracy was improved by more than 20% compared to existing methods, and the vehicle color classification accuracy was increased by 10%. However, despite its superior performance in multitask scenarios, the computational overhead may limit its real-time deployment in VANETs, especially on resource-constrained edge devices. Moreover, the study mainly focuses on the transmission efficiency and accuracy of semantic features, with less discussion on key indicators for practical deployment such as transmission delay and computational overhead. Therefore, although SemHARQ has significant theoretical advantages, further optimization is needed in practical applications to reduce computational complexity and meet real-time requirements.
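The general idea of importance-ranked transmission with selective retransmission can be sketched as follows. This is not the SemHARQ algorithm itself: in ref. [18] the importance scores and distortion estimates are produced by the FIR and FDE networks, whereas here they are supplied by hand, distortion is measured as a plain absolute error, and the channel is a hypothetical deterministic function. All names and parameters are assumptions made for illustration.

```python
def rank_by_importance(importance):
    """Return feature indices sorted from most to least important."""
    return sorted(range(len(importance)), key=lambda i: importance[i], reverse=True)


def sem_harq_round(features, importance, channel, budget, distortion_limit, max_retx=2):
    """Send the `budget` most important features and retransmit distorted ones.

    `channel(value, attempt)` is a caller-supplied function modelling the link.
    Distortion is measured as absolute error, which assumes the receiver can
    estimate it (a role played by a learned network in the cited work).
    """
    selected = rank_by_importance(importance)[:budget]
    received = {}
    transmissions = 0
    for idx in selected:
        for attempt in range(max_retx + 1):
            value = channel(features[idx], attempt)
            transmissions += 1
            if abs(value - features[idx]) <= distortion_limit:
                break
        received[idx] = value  # keep the last attempt even if still distorted
    return received, transmissions


def toy_channel(value, attempt):
    """Hypothetical link: the first attempt is heavily distorted for large values."""
    return value + (0.5 if attempt == 0 and value > 0.8 else 0.01)


features = [0.9, 0.2, 0.7, 0.1]
importance = [0.95, 0.10, 0.60, 0.05]
received, transmissions = sem_harq_round(
    features, importance, toy_channel, budget=2, distortion_limit=0.05)
```

Only the two most important features are sent, and only the one that arrives distorted is retransmitted, so three transmissions are used instead of retransmitting the whole feature vector.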
In the research on multiuser collaboration and multitask-driven semantic communication frameworks, these studies have proposed various distinctive solutions. The Co-SC architecture significantly improves semantic reconstruction performance through multiuser collaboration under low-SNR conditions but has a high dependence on CSI and limited performance improvement under high-SNR conditions. The system in [14] significantly reduces data transmission through semantic encoding and supports multitask processing, but it has high system complexity and a large demand for computational resources. The framework in [17] improves image reconstruction quality and transmission efficiency through semantic encoding in the autonomous driving scenario, but the issue of computational overhead needs to be clarified. The SemHARQ framework significantly enhances the efficiency and accuracy of semantic feature transmission through feature importance ranking under low SNR conditions, but its computational overhead may limit real-time deployment. These studies each have their own advantages, but in practical VANET applications, it is necessary to comprehensively consider factors such as computational resources, real-time performance, and adaptability to channel conditions to achieve more efficient and practical semantic communication solutions. Future work may explore the integration of the advantages of these methods to optimize computational overhead and real-time performance, thereby further enhancing the overall performance of semantic communication in VANETs.

3.2.2. Oriented to Image Transmission

Image transmission is one of the core tasks in the IoV for environmental perception and decision making. Efficient image transmission requires not only reducing data volume but also ensuring that image quality meets the real-time demands of autonomous driving and intelligent transportation systems. For specific tasks such as image segmentation and object detection, researchers have proposed various optimized semantic communication architectures to address the needs of different application scenarios.
Ref. [12] proposed an Image Segmentation Semantic Communication (ISSC) system for the IoV. This system utilizes a multiscale semantic feature extractor and a semantic feature aggregator based on the Swin Transformer to achieve an efficient transformation from the input image to semantic features. Additionally, it accurately reconstructs the image segmentation at the receiving end through a semantic feature decoder and a reconstructor. Experimental results show that the ISSC system performs excellently in a low-SNR environment, with its mean Intersection over Union (mIoU) increased by 75% compared to traditional coding methods, indicating that the system has a significant advantage in semantic similarity. Furthermore, although the paper does not explicitly mention the specific values of transmission delay and computational overhead, considering its deep learning-based architecture, the computational overhead may be high, especially in real-time deployment scenarios.
Ref. [19] proposed a semantic communication system for autonomous driving based on importance-aware image segmentation (VIS-SemCom). Through multiscale semantic feature extraction and an importance-aware loss function, it significantly improves the accuracy of image segmentation and communication efficiency. Experiments show that under the requirement of an average mIoU of 60%, this system can achieve a coding gain of nearly 6 dB compared to traditional solutions and performs well under low-SNR conditions, with the maximum reduction of the transmitted data volume reaching 70%. The system improved the segmentation accuracy of key objects (such as vehicles and pedestrians) by approximately 4%, but its generalization ability in complex environments still needs to be further verified.
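One common way to make a segmentation loss importance-aware is to weight the per-pixel cross-entropy by class, so that errors on critical objects cost more than errors on background. The Python sketch below shows this generic weighted cross-entropy; it is not the specific loss function of VIS-SemCom, and the class list and weight values are assumed purely for illustration.

```python
import math


def importance_weighted_ce(probs, labels, class_weights):
    """Weighted cross-entropy averaged over pixels.

    probs: per-pixel lists of class probabilities (each summing to 1).
    labels: ground-truth class index per pixel.
    class_weights: larger values make errors on that class cost more.
    """
    total = 0.0
    weight_sum = 0.0
    for p, y in zip(probs, labels):
        w = class_weights[y]
        total += -w * math.log(max(p[y], 1e-12))  # clamp to avoid log(0)
        weight_sum += w
    return total / weight_sum


# Classes: 0 = sky, 1 = vehicle, 2 = pedestrian (weights are assumed values).
weights = [0.2, 1.0, 2.0]
probs = [[0.5, 0.3, 0.2], [0.5, 0.3, 0.2]]
loss_sky_pixels = importance_weighted_ce(probs, [0, 0], weights)
loss_with_pedestrian = importance_weighted_ce(probs, [0, 2], weights)
```

With identical predictions, the loss is noticeably larger when one of the two pixels is actually a pedestrian, which pushes training toward better accuracy on the classes that matter most for driving safety.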
Ref. [20] proposed a semantic end-to-end communication system named SEECAD, which is specifically designed for image transmission in autonomous driving scenarios. By constructing a semantic encoder and decoder through a deep learning architecture and combining them with Low-Density Parity-Check (LDPC) codes, it efficiently transmits the semantic information of images instead of the original pixel data, thus significantly reducing the amount of transmitted data and improving the transmission efficiency. At the receiving end, the system reconstructs the image segmentation mask using a shared knowledge base, enhancing its robustness in noisy environments. Experimental results show that under 4-QAM modulation and LDPC (16, 32) coding, it achieved a segmentation accuracy of 97.4% and an IoU value of 0.75 at an SNR of 12 dB, demonstrating high semantic similarity and anti-noise ability. However, the training and real-time inference of the deep learning model require high computational resources, which may limit its real-time deployment in the IoV.
Ref. [21] proposed a semantic communication method for multiscene object detection in intelligent vehicle networks (MTDSC), aiming to improve the processing efficiency of image data through semantic encoding and transmission. This method uses a convolutional neural network and a region proposal network for object detection, combines Spatial Pyramid Pooling (SPP) with a long short-term memory network (LSTM) for semantic label assignment, and finally achieves reliable transmission through a variational autoencoder. In terms of model performance evaluation, MTDSC performs well in semantic similarity. Experiments have verified its high mIoU values in different road scenes (such as 83.276% in highway scenes), indicating that it can accurately understand and transmit key semantic information. In terms of transmission delay, this method reduces the amount of data through efficient semantic encoding, thus reducing the communication delay. However, its limitation lies in the high computational overhead, especially in the training and inference stages of the deep learning model, which may pose a challenge to the real-time deployment in the IoV.
Ref. [22] combined low-latency routing and semantic communication to optimize the transmission of image data in the IoV. In this method, the low-latency routing algorithm takes into account factors such as the distance between vehicle nodes, the packet loss probability of candidate nodes, remaining energy, and load, and it selects the optimal routing node to ensure the rapid transmission of image data. At the same time, semantic communication technology uses a convolutional neural network to extract image features, obtains a semantic sequence through a feature-semantic transformation function, compresses the feature map according to the task importance, and then transmits it. The receiving end performs the inverse operation to restore the image for classification and recognition. This method mainly addresses the problem of high transmission latency of image data in the IoV caused by the high-speed movement of vehicles and the dynamic changes of the network topology. Experiments have proven that compared with traditional methods, this method can significantly reduce the transmission latency and has little impact on the accuracy of image classification. Its advantage lies in significantly reducing the transmission latency and maintaining a high image classification accuracy. However, the high computational overhead and the lack of detailed evaluation limit its feasibility for real-time deployment in resource-constrained environments.
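The next-hop selection step described above can be sketched as a weighted score over the four factors named in the text (distance, packet loss probability, remaining energy, and load). The linear scoring form, the weight values, the normalization of each factor to [0, 1], and the assumption that shorter links are preferred are all hypothetical choices made for this example; the actual routing metric of ref. [22] is not reproduced here.

```python
def next_hop_score(node, weights, max_range):
    """Higher is better. Combines the four factors named in the text."""
    return (weights["distance"] * (1.0 - node["distance"] / max_range)
            + weights["loss"] * (1.0 - node["loss_prob"])
            + weights["energy"] * node["energy"]
            + weights["load"] * (1.0 - node["load"]))


def select_next_hop(candidates, weights, max_range=300.0):
    """Pick the in-range candidate with the best score, or None if none is in range."""
    in_range = [c for c in candidates if c["distance"] <= max_range]
    if not in_range:
        return None
    return max(in_range, key=lambda c: next_hop_score(c, weights, max_range))


# Assumed weights; energy and load are expressed as fractions in [0, 1].
weights = {"distance": 0.2, "loss": 0.4, "energy": 0.2, "load": 0.2}
candidates = [
    {"id": "A", "distance": 100.0, "loss_prob": 0.40, "energy": 0.9, "load": 0.2},
    {"id": "B", "distance": 50.0, "loss_prob": 0.05, "energy": 0.6, "load": 0.5},
    {"id": "C", "distance": 400.0, "loss_prob": 0.01, "energy": 1.0, "load": 0.0},
]
best = select_next_hop(candidates, weights)
```

Node C has the best link quality but is out of range, and node B is preferred over node A mainly because of its much lower packet loss probability, which carries the largest weight in this assumed configuration.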
In the research on semantic communication for image transmission in the IoV, different methods have shown their unique advantages and limitations. The ISSC system demonstrates excellent semantic similarity in a low-SNR environment through efficient semantic feature extraction and transformation, with a significant increase in the average mIoU. However, the high computational overhead brought by the deep learning architecture may restrict real-time deployment. The VIS-SemCom system performs excellently in terms of image segmentation accuracy and communication efficiency, with obvious coding gain and a significant reduction in the transmitted data volume. However, its generalization ability in complex environments, as well as the detailed information about computational overhead and transmission delay, remains unclear. The SEECAD system has excellent semantic similarity and anti-noise ability at high SNRs, but it has a certain dependence on the SNR, and the training and inference of the deep learning model require high computational resources. The MTDSC method performs well in semantic similarity for multiscene object detection and reduces the consumption of computational resources through reinforcement learning optimization, but the computational overhead is still high. Combining low-latency routing and semantic communication to optimize image data transmission significantly reduces the transmission latency and maintains a high image classification accuracy, but it also faces the problems of high computational overhead and lack of detailed evaluation. These methods have their own focuses in improving semantic similarity, reducing the transmitted data volume, and decreasing the transmission latency, but there are still challenges to varying degrees in terms of computational overhead, real-time performance, generalization ability, and adaptability to the SNR. 
Further research is needed in the future to balance these factors to better meet the actual needs of image transmission in the IoV.

3.2.3. Generative AI-Based Semantic Communication Architectures

Generative AI technologies, such as generative adversarial networks (GANs) and large language models (LLMs), have shown significant potential in optimizing data processing and transmission efficiency in semantic communication, particularly in the context of the IoV. By transforming complex data (e.g., images, text, etc.) into concise semantic representations and leveraging multimodal fusion and reconstruction techniques, generative AI substantially reduces data volume while enhancing the intelligence and accuracy of communication. Current generative AI-based semantic communication architectures in the IoV primarily focus on image transmission. Unlike traditional image transmission architectures, these approaches emphasize the use of generative AI to intelligently process complex data. By converting images, text, and other data types into compact semantic representations and integrating multimodal fusion and reconstruction, these architectures not only reduce data transmission but also significantly improve communication intelligence, thus offering new perspectives and methods for semantic communication in the IoV. Below are several generative AI-based semantic communication architectures and their applications.
Ref. [23] proposed a scalable Artificial Intelligence Generated Content (AIGC) encoder–decoder architecture. The encoder leverages large language models to convert images into concise text prompts. Before transmission, the text information is optimized by a reinforcement learning module. The decoder then converts the optimized text back into images and decides whether to transmit semantically important image regions based on bandwidth conditions. The system demonstrates excellent performance in semantic similarity. Experiments have validated its advantages in image reconstruction quality while reducing the amount of transmitted data severalfold. This is highly beneficial for bandwidth-constrained VANET environments.
Ref. [24] proposed a multimodal semantic-aware framework based on generative AI. It achieves efficient data transmission and reconstruction by extracting semantic text information and image skeletons. The framework first extracts semantic text information and image skeletons from real-time road images and then combines them into small data packets for V2V communication. The receiving end uses a generative AI model to generate road condition images that match reality and reconstructs them for display to intelligent assistants. This significantly reduces data volume and effectively reduces transmission delay. However, although the framework emphasizes data privacy protection, it does not provide detailed explanations on how to ensure data security in practical deployment. Therefore, while the framework excels in improving semantic similarity and transmission efficiency, it still needs further optimization in terms of computational overhead and privacy security.
Ref. [15] proposed a Generative AI-Enhanced Multimodal Semantic Communication (G-MSC) framework. It reduces data redundancy through multimodal alignment and fusion and performs denoising and semantic inference on noisy data at the receiver. In the G-MSC framework, generative AI technology enhances the capabilities of the semantic encoder, enabling it to better handle multimodal data. The optimization of channel transmission and the semantic decoder also improves the reliability and stability of communication. Experimental results show that diffusion models significantly improve image clarity and IoU and, particularly under low-SNR conditions, effectively mitigate the impact of channel noise on performance. The framework’s strengths lie in its efficient processing of multimodal data and adaptability to dynamic environments. However, its limitation is the high demand for computational resources, which may restrict its real-time deployment in resource-constrained VANET environments. Future research directions include hybrid digital–analog transmission, multivehicle semantic information scheduling, and cross-task coordination, which are expected to further optimize the performance and application scope of the G-MSC framework.
Ref. [25] proposed a GAI-driven semantic communication framework for next-generation wireless networks (such as 6G). The framework extracts key semantic information from images using the lightweight Mobile Segment Anything Model (MSAM) and reconstructs and denoises images at the receiver using a generative adversarial network (GAN), achieving high-quality image reconstruction under different SNR conditions. Experimental results show that models trained under specific SNR conditions can achieve high PSNRs. Moreover, the framework further reduces data transmission by periodically updating background information. Overall, it reduced the data transmission volume by up to 93.45% while maintaining the integrity of the original content. However, the framework is sensitive to the SNR used during training, which limits its robustness. The real-time performance and feasibility of its deployment in VANETs still need further investigation.
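The two figures of merit mentioned above, PSNR and the relative reduction in transmitted data, follow standard definitions. The sketch below computes both on plain Python lists; the pixel values and byte counts are hypothetical and are not results from Ref. [25].

```python
import math


def psnr(original, reconstructed, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences."""
    if len(original) != len(reconstructed) or not original:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


def reduction_percent(raw_bytes, sent_bytes):
    """Percentage of data saved relative to sending the raw payload."""
    return 100.0 * (1.0 - sent_bytes / raw_bytes)


# Hypothetical 8-bit pixel values and payload sizes.
print(f"PSNR: {psnr([52, 60, 61, 70], [50, 62, 61, 69]):.2f} dB")
print(f"Saved: {reduction_percent(10_000, 655):.2f}%")
```

A higher PSNR means the reconstruction is closer to the original pixel values, whereas the reduction percentage only measures payload size, so the two should be reported together.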
Ref. [26] proposed an Agent-Driven Generative Semantic Communication (A-GSC) framework based on reinforcement learning. It seamlessly integrates semantic extraction and semantic sampling using generative AI. By transmitting the semantic information of a scene in text form, it reduces data volume while improving information interpretability. Moreover, the semantic sampling agent based on reinforcement learning can dynamically adjust sampling strategies according to channel conditions and changes in source data, ensuring optimal semantic information transmission under limited energy consumption. The framework performs well in terms of semantic similarity, transmission delay, and computational overhead. Its strengths are high semantic similarity, low transmission delay, and small computational overhead, making it suitable for real-time applications in VANETs. However, its limitations include insufficient adaptability to complex communication environments, and the feasibility of real-time deployment on resource-constrained devices still needs further verification.
In summary, these generative AI-based semantic communication frameworks each have their own characteristics. The AIGC architecture excels in data compression but lacks sufficient evaluation of real-time performance and computational overhead, with its deployment feasibility remaining to be verified. The multimodal framework has high data transmission efficiency but demands substantial computational resources and lacks clear privacy security guarantees. The G-MSC framework has strong capabilities in processing multimodal data and adapting to dynamic environments, but its high demand for computational resources restricts real-time deployment. The framework in [25] achieves high semantic similarity and a significant reduction in data transmission under specific SNR conditions, but it is sensitive to the SNR, with limited robustness, and the real-time performance and feasibility of its deployment in VANETs need further investigation. The A-GSC framework performs well in semantic similarity, transmission delay, and computational overhead, making it suitable for real-time applications in VANETs, but it has insufficient adaptability to complex communication environments, and the feasibility of real-time deployment on resource-constrained devices remains to be verified.
Overall, these methods have their own advantages in terms of data transmission efficiency and semantic similarity, but they also have varying degrees of shortcomings in computational overhead, real-time performance, robustness, privacy security, and deployment feasibility. Future research needs to balance and optimize these aspects to better meet the practical needs of integrating VANETs and semantic communication.

3.2.4. Other Architectures

In addition to the mainstream semantic communication architectures mentioned above, several innovative architectures have demonstrated unique advantages in the IoV. These architectures expand the application scenarios of semantic communication and enhance communication efficiency and intelligence by integrating technologies such as blockchain, knowledge graphs, and neural network sharing.
The fusion of blockchain and edge computing technologies brings multiple benefits to semantic communication, including data security, privacy protection, and improved system efficiency. Ref. [27] proposed a blockchain-based edge-assisted knowledge base management system for semantic communication in the IoV. This system extracts key information through semantic segmentation and utilizes blockchain sharding technology to enhance the efficiency and security of knowledge base management. Experiments show that the proposed semantic communication method has lower transmission latency compared with traditional communication methods under low-SNR conditions and performs better in terms of transmission efficiency. Its advantages lie in effectively reducing communication costs, improving data transmission efficiency, and ensuring the consistency and security of the knowledge base with the help of blockchain technology. However, the limitation of this study is that it does not discuss in detail the computational overhead in the real-time deployment of the IoV, as well as the scalability issues in large-scale networks, which may affect its widespread application in practical scenarios.
Ref. [28] proposed a task-driven semantic-aware green cooperative transmission strategy (PreCMTS) that is suitable for intermittently connected IoV scenarios. This strategy employs a weighted directed graph to achieve semantic-aware transmission by analyzing parameters such as the remaining dwell time of vehicles at RSUs, encounter time with target vehicles, and lifetime of V2V links. It derives an expression for achievable throughput that meets delay requirements and formulates the coupled problem of semantic unit allocation and predictive relay selection as a combinatorial optimization problem. A low-complexity algorithm based on Markov approximation is designed to solve this problem. PreCMTS primarily addresses the intermittent connectivity caused by the high deployment costs and energy consumption of the infrastructure in vehicular networks, as well as the limited applicability of existing semantic communication research in dynamic and complex vehicle networks. The strategy aims to achieve both semantic and green communication. Experimental results show that PreCMTS effectively reduces energy consumption, improves the reliability of semantic transmission, and enhances semantic energy efficiency, thus outperforming baseline methods under various conditions. However, this strategy has deficiencies in terms of the analysis of computational overhead and feasibility for real-time deployment, and it fails to thoroughly explore the computational complexity and resource consumption of the deep learning model in the IoV environment.
In the integration of the IoV and semantic communication, besides transmitting IoV data through semantic communication systems, neural network models can also be shared. Ref. [29] proposed a deep semantic communication framework, SCKS, to address the efficiency issues of knowledge sharing of neural network models in the IoV. This architecture extracts semantic features of neural network models using a dataset distillation algorithm and designs a semantic decoding algorithm based on GANs. By leveraging the deep semantic communication system, RSUs extract semantic information from neural network models, encode it into semantic feature vectors, and transmit it to vehicles. Upon receiving the vectors, vehicles decode them into synthetic datasets using a semantic decoder and then update their neural network models. This enables efficient knowledge sharing from RSUs to multiple vehicles. The experimental results demonstrate that the framework exhibits remarkable performance in semantic similarity, effectively extracting the semantic features of neural network (NN) models. Under low-SNR conditions, the framework outperformed traditional methods in terms of transmission delay. However, the framework incurs a high computational overhead, particularly during the training phase, which limits the feasibility of real-time deployment. Despite this limitation, SCKS performs well on high-resolution datasets, although further investigation is required to assess its performance when the number of categories increases.
These innovative architectures, through task-driven approaches, knowledge graph optimization, and neural network sharing, further expand the application scenarios of semantic communication in the IoV. However, they also demonstrate their respective advantages and limitations. The blockchain-based edge-assisted knowledge base management system performs excellently in terms of data security and transmission efficiency. However, the issues of computational overhead in the real-time deployment of the IoV and scalability in large-scale networks have not been resolved yet. The PreCMTS effectively reduces energy consumption and improves the reliability of semantic transmission, but it lacks a detailed analysis of the computational complexity and resource consumption of the deep learning model. The deep semantic communication framework SCKS enhances semantic similarity and transmission efficiency through the knowledge sharing of the neural network model. However, the relatively high computational overhead, especially during the training phase, limits its feasibility for real-time deployment. Although these architectures are innovative in their own ways, in practical applications, various factors such as computational overhead, scalability, and energy consumption need to be comprehensively considered to better adapt to the complex and changeable environmental requirements of the IoV and achieve efficient and reliable semantic communication.

3.3. Resource Allocation and Management

In intelligent networking scenarios, semantic-oriented resource allocation methods have significant advantages [108]. Similarly, in the context of the IoV, the high mobility of vehicles and the diverse semantic communication tasks make resource allocation and management critically important. Different tasks have time-varying demands for resources such as spectrum, power, and computing. For instance, road condition information transmission requires low latency and high bandwidth, while vehicle diagnostics emphasize data accuracy and completeness. This subsection reviews relevant research progress and categorizes resource allocation and management strategies based on different technical approaches. Table 4 summarizes existing research on resource allocation and management in the IoV based on semantic communication.
Table 4. Resource allocation and management methods in IoV based on semantic communication.

3.3.1. Reinforcement Learning-Based Resource Allocation Methods

In the context of integrating VANETs with semantic communication, reinforcement learning (RL)-based resource allocation methods have garnered significant attention due to their strong dynamic adaptability and optimization capabilities. These methods can dynamically adjust resource allocation strategies in real time according to the dynamic environment and task requirements in VANETs, thereby significantly enhancing system performance.
Ref. [30] proposed a Semantic-Aware Resource Allocation and Decision-making Framework (SARADC) specifically for the complex scenario of spectrum sharing between vehicles and Wi-Fi users in 5G-V2X heterogeneous networks. This research introduces innovative metrics such as high-speed semantic transmission rate (HSR) and high-speed semantic spectrum efficiency (HSSE). The proximal policy optimization (PPO) algorithm is employed to optimize parameters, including channel allocation, power allocation, duty cycle, and semantic symbol length. Experimental results demonstrate that the framework significantly improves the HSSE and semantic throughput (ST) under high-resolution and low-SNR conditions, effectively enhancing resource utilization efficiency and system performance. However, the discussion on computational overhead and transmission delay is insufficient, which may limit its real-time deployment in VANETs.
Similarly, ref. [31] proposed a Semantic-Aware Spectrum Sharing (SSS) algorithm based on Deep Reinforcement Learning (DRL) to address the spectrum sharing problem in the IoV. This algorithm redefines spectrum sharing metrics such as HSSE and the HSR by incorporating semantic information and utilizes the soft actor-critic (SAC) method to optimize decisions to maximize HSSE and improve the success rate of effective semantic information transmission (SRS). Simulation results show that the SSS algorithm outperforms traditional bit-based spectrum sharing methods in terms of HSSE and SRS. However, the limitation of this algorithm is its potentially high computational overhead, especially considering the computational capabilities and resource constraints of vehicles in real-time deployment.
Ref. [24] started from the perspective of generative AI and used a deep reinforcement learning (DRL) algorithm to design a resource allocation strategy for V2V communication in the IoV. The study uses the double deep Q-network (DDQN) method to optimize parameters such as channel selection, transmission power, and diffusion steps, ensuring efficient semantic information transmission in VANETs. Additionally, the method designs action space, state space, and reward functions adapted to semantic communication to accommodate the dynamically changing channel conditions and transmission requirements in VANETs. Its advantage lies in dynamically adjusting resource allocation using DRL to adapt to the dynamics of vehicular networks, effectively reducing transmission delay and improving semantic similarity. However, the study lacks in-depth discussion on the quantification of computational overhead and lacks discussion on the weight allocation of multimodal data fusion, which may affect its robustness in complex scenarios and scalability of practical applications.
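The distinguishing step of the double deep Q-network (DDQN) method is how the learning target is formed: the online network selects the next action and the target network evaluates it, which reduces the overestimation of plain DQN. The sketch below shows only this generic target computation; the Q-value vectors, the discount factor, and the mapping of action indices to (channel, power, diffusion steps) choices are hypothetical and are not taken from Ref. [24].

```python
def ddqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target for one transition.

    q_online_next / q_target_next hold the Q-values of every action in the
    next state, as estimated by the online and the target network.
    """
    if done:
        return reward
    # Action selection uses the online network ...
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # ... while action evaluation uses the target network.
    return reward + gamma * q_target_next[best_action]


def dqn_target(reward, done, q_target_next, gamma=0.99):
    """Plain DQN target, shown for comparison only."""
    return reward if done else reward + gamma * max(q_target_next)


# Each index stands for one hypothetical (channel, power, diffusion steps) choice.
q_online_next = [0.2, 0.9, 0.5]
q_target_next = [1.0, 0.4, 2.0]
print(ddqn_target(1.0, False, q_online_next, q_target_next, gamma=0.5))
print(dqn_target(1.0, False, q_target_next, gamma=0.5))
```

With these toy values the plain DQN target takes the largest target-network estimate, while the DDQN target evaluates the action preferred by the online network and is therefore smaller.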
Ref. [32] aimed at the difficult problem of C-V2X platoon communication resource management and proposed a Semantic-Aware Multimodal Resource Allocation (SAMRA) algorithm using Multiagent Reinforcement Learning (MARL). The study defines metrics and quality of experience (QoE) concepts suitable for semantic and multimodal data in the system model, with maximizing the QoE and the success rate of V2V semantic information transmission as joint optimization objectives. By optimizing channel allocation, power allocation, and semantic symbol length using MARL, experimental results show that the algorithm outperformed baseline methods in multiple scenarios. Its advantage is the enhanced scalability and adaptability of the system through distributed decision making, effectively improving the QoE and the success rate of semantic information transmission (SRS). However, similar to the previous studies, this research lacks in-depth discussion on computational overhead in real-time deployment and lacks quantitative comparison among different methods, making it difficult to comprehensively assess its feasibility in practical VANET scenarios.
Finally, ref. [33] addressed the problem of in-vehicle video semantic resource allocation and built an optimization model based on the multiagent deep Q-network (MADQN) according to the tasks between vehicles and base stations and among vehicles. Experimental results show that, compared to traditional algorithms, the proposed algorithm (VSRAA-SM) achieved higher accuracy in video semantic understanding under different vehicle transmission power and bandwidth conditions and also performed better in terms of the CUE outage probability and V2V transmission rate, making it more suitable for spectrum reuse scenarios. However, the limitation of this study is the lack of detailed analysis on computational overhead, with the feasibility of real-time deployment remaining unclear. Moreover, the paper does not construct a unified evaluation framework, posing certain difficulties for quantitative comparison with other methods.
In summary, these RL-based resource allocation methods have demonstrated significant advantages in enhancing semantic communication performance. However, they still have shortcomings in terms of computational overhead, the feasibility of real-time deployment, and multimodal data fusion. Future research needs to further optimize these methods to reduce computational overhead and improve their adaptability and scalability in practical VANET scenarios. Additionally, constructing a unified evaluation framework to quantitatively compare the performance of different methods will be crucial for advancing this field.

3.3.2. Optimization Theory-Based Resource Allocation Methods

Optimization algorithms are also widely used in resource allocation to enable efficient resource distribution through mathematical modeling and optimization techniques. Compared to reinforcement learning methods, optimization algorithms often provide more precise solutions for resource allocation problems under specific constraints, demonstrating superior performance in system stability and robustness. For example, ref. [109] proposed an RSU caching incentive scheme based on the Stackelberg game and optimized the content caching strategy and pricing mechanism by establishing a game model between the base station (BS) and RSU. For the integration of the IoV and semantic communication, researchers have also proposed resource allocation schemes based on optimization theory, significantly improving the efficiency and performance of semantic communication in scenarios such as Device-to-Device (D2D) communication and vehicle platooning in the IoV.
Ref. [34] proposed a long-term robust resource allocation scheme for D2D in-vehicle networks. The scheme jointly considers the semantic access control of the application layer and the power control of the physical layer. The Lyapunov optimization method is used to transform the long-term constraints into queue stability conditions. Then, successive convex approximation and the Karush–Kuhn–Tucker (KKT) condition are used to solve the subproblem, while Bernstein approximation is used to deal with uncertain probability constraints. Simulation results show that the algorithm enables rapid convergence of metrics such as the semantic access rate and power. Under different interruption probability thresholds, the algorithm demonstrates the trade-off between the transmission rate and delay. Compared to traditional methods, it exhibited superior performance in system robustness and power consumption. However, there is relatively little discussion on the computational overhead and real-time performance in practical deployment, which limits the feasibility of its application in an IoV environment with limited resources.
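The idea of turning a long-term constraint into a queue stability condition can be illustrated with a virtual queue and the generic drift-plus-penalty rule. The sketch below is a simplified illustration, not the algorithm of Ref. [34]: it assumes a long-term average power budget `p_avg`, a log-rate utility, a small set of discrete power levels, and a trade-off parameter `V`, and it omits the successive convex approximation, KKT, and Bernstein approximation steps.

```python
import math


def drift_plus_penalty(channel_gains, power_levels, p_avg, V=10.0):
    """Per-slot power selection under a long-term average power budget.

    The virtual queue z accumulates budget violations; keeping z stable
    enforces the time-average constraint mean(p) <= p_avg.
    """
    z = 0.0
    chosen = []
    for g in channel_gains:
        # Maximize V * utility minus the queue-weighted power cost in this slot.
        p = max(power_levels, key=lambda p: V * math.log2(1.0 + g * p) - z * p)
        chosen.append(p)
        # Virtual queue update: grows when the slot exceeds the budget.
        z = max(z + p - p_avg, 0.0)
    return chosen, z


gains = [1.0] * 200                # hypothetical static channel
levels = [0.0, 0.5, 1.0, 2.0]      # hypothetical discrete power levels
powers, backlog = drift_plus_penalty(gains, levels, p_avg=0.5)
print(f"average power = {sum(powers) / len(powers):.4f}, final backlog = {backlog:.2f}")
```

A larger `V` puts more weight on the utility and lets the backlog grow further before the power is cut back, which is the rate-versus-constraint trade-off that Lyapunov optimization exposes through a single parameter.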
Beyond resource allocation in D2D vehicular networks, optimization theory has also been extensively applied in vehicle platooning scenarios. Ref. [35] proposed an innovative resource allocation scheme based on semantic communication for the collaborative data processing scenario between autonomous vehicle platoons and base stations in the IoV. The scheme aims to maximize semantic detection accuracy and minimize wireless resource costs while incorporating constraints such as bandwidth, transmission power, and SNR. Given the NP-hard nature of the optimization problem, a two-stage suboptimal solution is adopted. In the first stage, a bipartite graph is used to transform the communication mode selection problem into a maximum weight matching problem, and the Hungarian algorithm is applied to determine the matching relationship between vehicles and task processing units, ensuring communication link quality and reliability. In the second stage, based on the results of the first stage, the resource allocation problem is transformed into a convex optimization problem, which is solved using MATLAB’s fmincon function to achieve reasonable allocation of resources such as bandwidth and transmission power. Simulation results show that, compared to the scheme that only used V2I communication, this method significantly improved the accuracy of video semantic detection, with an especially clear advantage when the distance between vehicles is large. However, the paper does not provide specific data on key indicators such as semantic similarity, transmission delay, and computational overhead, which limits the comprehensive evaluation of the performance of the scheme.
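The first-stage matching can be sketched with a compact implementation of the Hungarian algorithm. The code below solves a generic maximum weight assignment between vehicles (rows) and task processing units (columns); the integer weight matrix is a hypothetical link-quality score, and the sketch does not reproduce the weights, constraints, or second-stage convex program of Ref. [35].

```python
def hungarian_min(cost):
    """Minimum-cost assignment of each row to a distinct column (rows <= columns).

    Classic O(n^2 * m) Hungarian algorithm with row/column potentials.
    Returns a list where entry i is the column assigned to row i.
    """
    n, m = len(cost), len(cost[0])
    INF = float("inf")
    u = [0] * (n + 1)      # row potentials
    v = [0] * (m + 1)      # column potentials
    p = [0] * (m + 1)      # p[j] = row matched to column j (1-indexed, 0 = free)
    way = [0] * (m + 1)    # previous column on the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [INF] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], INF, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        while j0:  # augment along the stored path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    assignment = [-1] * n
    for j in range(1, m + 1):
        if p[j]:
            assignment[p[j] - 1] = j - 1
    return assignment


def max_weight_matching(weights):
    """Maximum weight matching, obtained by negating the weights."""
    assignment = hungarian_min([[-w for w in row] for row in weights])
    total = sum(weights[i][j] for i, j in enumerate(assignment))
    return assignment, total


# Hypothetical link-quality scores: rows are vehicles, columns are processing units.
weights = [[90, 60, 30],
           [80, 70, 40],
           [50, 90, 85]]
print(max_weight_matching(weights))
```

Negating the weights converts the maximization into the minimization form that the algorithm solves directly, so the same routine serves both problem types.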
Additionally, ref. [36] proposed a Stochastic Semantic Transmission Scheme (SSTS) based on two-stage Stochastic Integer Programming (SIP) to address resource allocation problems in virtual traffic networks within the metaverse. By introducing semantic communication technology, the scheme leverages the sensing capabilities of edge devices to transmit data from the physical world to the virtual world, providing immersive experiences for Virtual Service Providers (VSPs). The SSTS scheme considers the demand uncertainty of VSPs and optimizes resource allocation through a combination of reservation and on-demand plans, reducing transmission costs. Experiments have verified the advantages of the SSTS in energy consumption optimization: the energy consumption of semantic data transmission is significantly lower than that of non-semantic data transmission. However, the limitation of this study is that it mainly focuses on the transmission cost of semantic data and the subscription strategy, and the analysis of the computational overhead and complexity in real-time deployment is not in-depth enough.
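The combination of reservation and on-demand plans has the structure of a two-stage stochastic integer program: an integer amount is reserved before the demand is known, and any shortfall is bought on demand afterwards at a higher unit price. The toy sketch below enumerates the first-stage decision over a handful of demand scenarios; the prices, scenarios, and probabilities are hypothetical, and the model is far simpler than the SSTS formulation in Ref. [36].

```python
def expected_cost(reserved, scenarios, reserve_price, on_demand_price):
    """First-stage reservation cost plus expected second-stage on-demand cost."""
    recourse = sum(prob * on_demand_price * max(demand - reserved, 0)
                   for demand, prob in scenarios)
    return reserve_price * reserved + recourse


def best_reservation(scenarios, reserve_price, on_demand_price):
    """Enumerate integer reservations and return (best amount, expected cost)."""
    max_demand = max(demand for demand, _ in scenarios)
    best = min(range(max_demand + 1),
               key=lambda x: expected_cost(x, scenarios, reserve_price, on_demand_price))
    return best, expected_cost(best, scenarios, reserve_price, on_demand_price)


# Hypothetical demand scenarios as (units of semantic data, probability).
scenarios = [(2, 0.5), (6, 0.5)]
print(best_reservation(scenarios, reserve_price=1.0, on_demand_price=3.0))
print(best_reservation(scenarios, reserve_price=1.0, on_demand_price=1.5))
```

When on-demand units are expensive relative to reserved ones, the minimizer covers the high-demand scenario in advance; when they are only slightly more expensive, it reserves just the amount needed in every scenario and relies on recourse for the rest.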
In these studies on resource allocation for semantic communication in the IoV, methods based on optimization theory exhibit diverse characteristics and advantages, but they also have their own limitations. The D2D in-vehicle network resource allocation scheme proposed in reference [34] uses methods such as Lyapunov optimization, which significantly enhances the system’s robustness in high-speed mobile scenarios. However, it lacks sufficient discussion on computational overhead and real-time performance. In reference [35], for the scenario of vehicle platoon collaboration, techniques such as bipartite graph matching are employed, effectively improving the accuracy of semantic detection. Nevertheless, there is a lack of comprehensive evaluation of key indicators such as semantic similarity and transmission delay. The SSTS scheme, on the other hand, focuses on energy consumption optimization and demonstrates the advantage of semantic data transmission in reducing energy consumption. However, its analysis of computational overhead and complexity in real-time deployment is not in-depth enough.
These methods have different focuses in terms of objectives, technical means, and application scenarios. However, in practical applications, various factors such as system stability, resource utilization efficiency, real-time performance, and computational overhead need to be comprehensively considered to achieve a more optimal resource allocation strategy.

3.3.3. Federated Learning-Based Resource Allocation Methods

Federated learning, as a distributed machine learning paradigm, has demonstrated unique advantages in semantic communication for the IoV. By enabling distributed training and knowledge sharing, federated learning optimizes resource allocation and enhances system efficiency while preserving data privacy.
Ref. [37] proposed a Mobility-Aware Split Federated Transfer Learning (MSFTL) framework, offering a novel approach to resource optimization in IoV semantic communication. The framework divides model training into four parts, leveraging split federated learning to reduce computational costs for vehicles. By incorporating a Stackelberg game-based resource optimization mechanism, it considers factors such as vehicle dwell time, computational load, and communication overhead to fairly select the most suitable training data volume for each vehicle and the entire network. This approach effectively reduces training costs and improves resource utilization efficiency. Experiments show that MSFTL outperformed traditional Federated Learning (FL) in terms of convergence speed and final accuracy, with lower computational overhead. In the scenario of few-shot learning, it also has an advantage in terms of communication cost. However, its communication efficiency during large-scale data transmission still needs to be improved.
Ref. [38] introduced the Federated Vehicular Semantic Communication (FVSCom) framework, which also focuses on resource optimization. This framework employs federated learning for semantic extraction and proposes a semantic utility metric to evaluate performance. The problem of maximizing semantic utility is transformed into a stochastic optimization problem, which is solved using a deep reinforcement learning-driven dynamic semantic-aware algorithm. This approach achieves efficient semantic extraction and resource allocation, improving computational efficiency and semantic extraction accuracy while enhancing robustness to scenarios where vehicles leave or drop out. However, the limitation of this study is that, although a semantic utility index is proposed, in the actual IoV scenarios, key performance indicators such as the specific transmission delay and computational overhead under different channel conditions are not taken into account.
A comparison shows that although [37,38] both address resource optimization in semantic communication for the IoV, they have different emphases. The MSFTL framework, through split federated learning and a Stackelberg game-based resource optimization mechanism, excels at reducing the computational cost of vehicles and improving resource utilization. It has a clear advantage in communication cost in few-shot learning scenarios, but its communication efficiency is insufficient during large-scale data transmission. The FVSCom framework, by introducing a semantic utility metric and a dynamic semantic-aware algorithm driven by deep reinforcement learning, achieves efficient semantic extraction and resource allocation, improves computational efficiency and semantic extraction accuracy, and is more robust when vehicles leave or drop out. However, it does not adequately consider key performance indicators such as transmission delay and computational overhead under different channel conditions. Each method therefore has its own strengths and limitations in resource allocation. Future research could combine the strengths of the two while addressing their respective deficiencies so as to further improve the efficiency and performance of resource allocation in semantic communication for the IoV.

3.4. Data Security and Privacy Protection

The application of semantic communication in the IoV has significantly improved communication efficiency and intelligence levels. However, it has also introduced new security and privacy challenges. The sensitivity of semantic information (such as vehicle trajectories, driving intentions, passenger preferences, etc.) requires the system to ensure confidentiality, integrity, and reliability during the transmission process. This section systematically analyzes the security risks faced by semantic communication in the IoV and summarizes the existing countermeasures.

3.4.1. Security Risk Analysis

The security threats to semantic communication systems mainly manifest as adversarial attacks, privacy leakage, man-in-the-middle attacks, model poisoning attacks, etc. Their risk characteristics and attack mechanisms show significant differences:
  • Adversarial attacks: Existing research shows that semantic communication systems are significantly vulnerable to adversarial attacks. Tiny perturbations generated by algorithms such as Auto-PGD, FGSM, and DeepFool can lead to a substantial decline in the accuracy of semantic segmentation [110]. Of particular concern is a new type of covert attack mechanism. For example, the Covert Semantic Backdoor Attack (CSBA) can selectively eliminate target semantics (such as traffic signs) without explicit triggers by analyzing the self-contained semantic features of the transmitted images [111]. Experiments show that even under high SNRs, the CSBA can still successfully remove the target semantics, and the restored image is visually indistinguishable from the original image. In addition, the Semantic Noise Attack (SNA) can inject semantic-level interference into the transmitted data, causing cascading error propagation in the encoding and decoding stages and leading to the failure of system decision making [112].
  • Privacy leakage risk: The deep correlation characteristics of semantic information enable attackers to reverse-derive users’ sensitive data through multidimensional semantic analysis. For example, by analyzing the spatio-temporal patterns of vehicle trajectory semantics, users’ resident areas and travel patterns can be inferred. Continuous monitoring of driving intention semantics may expose confidential information such as commercial transportation routes.
  • Man-in-the-middle attack threat: In V2V/V2I communication links, attackers can exploit vulnerabilities in semantic protocols to eavesdrop on and tamper with data. Typical attack scenarios include forging emergency braking commands, tampering with the semantic state of traffic lights, and hijacking path planning semantic data to induce vehicles to enter a preset area. Since such attacks operate directly on semantic layer information, traditional encryption mechanisms alone struggle to detect them.
  • Model poisoning attack: During the construction of a distributed semantic knowledge base, malicious vehicles can carry out covert poisoning by uploading contaminated data (such as distorted semantic features of traffic signs). More seriously, the poisoning attack may trigger systematic deviations in the semantic rule system, resulting in the failure of the Vehicle-to-Everything (V2X) collaborative decision-making mechanism.
The current defense mechanisms mainly focus on single-point protection, and a security system covering the entire chain of semantic generation, transmission, and processing has not been formed yet. In particular, there are still significant technical gaps in detection algorithms for semantic-level covert attacks and dynamic privacy protection solutions, which urgently require systematic breakthroughs through the design of cross-layer defense architectures and lightweight cryptographic primitives.
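To illustrate how small perturbations of the kind mentioned under adversarial attacks arise, the following is a minimal sketch of the Fast Gradient Sign Method (FGSM) applied to a toy logistic classifier. It is not the segmentation setup evaluated in [110]; the two-feature model, its weights, and the step size `epsilon` are assumptions chosen so that the effect is easy to see.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of class 1 under a logistic model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def fgsm(weights, bias, x, label, epsilon):
    """FGSM step: x_adv = x + epsilon * sign(dL/dx) for binary cross-entropy."""
    p = predict(weights, bias, x)
    # For logistic regression with cross-entropy loss, dL/dx = (p - y) * w.
    grad = [(p - label) * w for w in weights]
    sign = [(g > 0) - (g < 0) for g in grad]
    return [xi + epsilon * s for xi, s in zip(x, sign)]


# Hypothetical classifier and input (true label 1).
w, b = [2.0, -1.0], 0.0
x_clean = [0.3, 0.2]
x_adv = fgsm(w, b, x_clean, label=1, epsilon=0.3)

clean_prob = predict(w, b, x_clean)  # above 0.5: classified as class 1
adv_prob = predict(w, b, x_adv)      # below 0.5: the decision flips
```

Each feature moves by at most `epsilon`, yet the decision changes, which is the basic mechanism that stronger attacks such as Auto-PGD refine iteratively.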

3.4.2. Countermeasures

To effectively address the numerous security risks faced by semantic communication in the IoV, researchers have proposed a series of countermeasures and key technologies. These include semantic information encryption, federated learning with differential privacy, blockchain and edge intelligence, and adversarial sample detection, each of which is described below:
  • Semantic information encryption: Semantic information encryption is an important means to protect the confidentiality of semantic data in the Internet of Vehicles. By designing lightweight semantic-aware encryption algorithms, such as semantic feature obfuscation technology based on lattice cryptography, end-to-end confidentiality can be achieved while ensuring semantic decodability. This encryption method can effectively prevent data from being illegally stolen and tampered with during data transmission and storage, ensuring the security of semantic information.
  • Federated learning and differential privacy: Federated learning is a distributed machine learning framework that enables distributed training of semantic models without sharing the original data. Combined with differential privacy technology, by adding controllable noise to semantic features, the leakage of original data can be further prevented. This combined approach can not only protect data privacy but also improve the robustness and generalization ability of semantic models. For example, refs. [37,38] have elaborated on the application of federated learning in semantic communication of the Internet of Vehicles, demonstrating its remarkable effects in privacy protection and model performance improvement.
  • Blockchain and edge intelligence: The introduction of blockchain technology provides new ideas for data security and privacy protection in semantic communication of the Internet of Vehicles. Using blockchain to record the update operations of the semantic knowledge base can ensure the consistency and immutability of semantic rules. For example, the blockchain sharding technology proposed in [27] reduces the verification delay of the knowledge base by dividing the knowledge base into multiple small pieces for verification while effectively resisting tampering attacks. In addition, the application of edge intelligence also provides strong support for privacy protection. Ref. [14] achieved a balance between semantic understanding and vehicle privacy by building shared and private knowledge bases on edge servers. The shared library stores the background knowledge of autonomous driving on the edge server, while the private library stores the unique information of vehicles. The private library can be transmitted to the edge server according to the travel plan and is updated by the vehicle itself, while the edge server aggregates multisource information to update the shared library. This hierarchical architecture allows vehicles to update and maintain only their own private knowledge bases without uploading all data to the shared knowledge base, thus greatly reducing the risk of data leakage.
  • Adversarial sample detection: Adversarial sample detection is an important technology to deal with potential attacks in semantic communication of the Internet of Vehicles. The semantic anomaly detection module constructed based on the generative adversarial network (GAN) can identify adversarial semantic features in real time. Ref. [112] proposed a defense mechanism based on Semantic Distance Minimization (SDM). SDM generates adversarial samples during the training process and optimizes the model to enable it to extract correct semantic information from adversarial samples. This method not only improves the model’s robustness against adversarial attacks but also enhances the model’s semantic understanding ability to a certain extent, ensuring the accuracy and reliability of semantic communication.
In conclusion, through the comprehensive application of key technologies such as semantic information encryption, federated learning and differential privacy, blockchain and edge intelligence, and adversarial sample detection, the issues of data security and privacy protection in the combination of the Internet of Vehicles and semantic communication can be effectively addressed. These methods are not only innovative in theory but also show good effects in practical applications, providing a solid technical guarantee for the further development of semantic communication in the Internet of Vehicles.
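The idea of adding controllable noise to semantic features, mentioned above for differential privacy, can be sketched as norm clipping followed by Gaussian noise. This is a generic illustration rather than the mechanism of any cited work; `clip_norm` and `noise_std` are free parameters here, and calibrating them to a formal privacy budget is outside the scope of the sketch.

```python
import math
import random


def clip_l2(features, clip_norm):
    """Scale the vector down so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(f * f for f in features))
    if norm <= clip_norm or norm == 0.0:
        return list(features)
    scale = clip_norm / norm
    return [f * scale for f in features]


def privatize(features, clip_norm, noise_std, rng):
    """Clip a semantic feature vector, then add zero-mean Gaussian noise."""
    clipped = clip_l2(features, clip_norm)
    return [f + rng.gauss(0.0, noise_std) for f in clipped]


# Hypothetical semantic feature vector extracted on a vehicle.
rng = random.Random(0)
semantic_features = [3.0, 4.0]
noisy_features = privatize(semantic_features, clip_norm=1.0, noise_std=0.1, rng=rng)
```

Clipping bounds the influence of any single vector, and a larger `noise_std` trades semantic accuracy for stronger protection.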

3.5. Lessons Learned

Semantic information extraction and communication architectures must balance computational efficiency with accuracy. Deep learning-based methods (e.g., Swin Transformer, GANs, etc.) excel in multimodal fusion and noise resistance but face deployment challenges due to high resource demands. Reinforcement learning and federated learning show promise in dynamic resource allocation but require optimization for real-time IoV constraints. Blockchain integration enhances security but introduces scalability issues. Future work should prioritize lightweight models, edge-compatible algorithms, and hybrid optimization strategies to address these trade-offs.

4. Applications of Semantic Communication in IoV

In Section 3, we examined the key technologies of semantic communication in the IoV, including semantic information extraction, communication architecture design, and resource allocation and management. These technologies form the foundation for achieving efficient and reliable semantic communication. They involve how to extract crucial semantic information from raw data, how to design the communication architecture to support the transmission of semantic information, and how to manage and allocate resources to optimize communication performance. The design of the semantic communication architecture needs to take into account the requirements of different application scenarios. For example, intelligent driving decision support may require communication with low latency and high reliability, while traffic management may place more emphasis on the comprehensiveness and accuracy of data. In this section, we demonstrate how these key technologies function in practical application scenarios and how they support key applications in the IoV, such as intelligent traffic management, driving decision support, and service optimization. Through these application cases, we further verify the potential of semantic communication in the IoV and explore its advantages and challenges in actual deployment. Table 5 summarizes several application scenarios of semantic communication in the IoV.
Table 5. Application scenarios of semantic communication in IoV.

4.1. Traffic Environment Perception and Understanding

In the IoV, vehicles need to perceive and understand the surrounding traffic environment in real time, including road conditions, traffic signs, and the dynamics of other vehicles and pedestrians. This is crucial for enhancing traffic safety, optimizing traffic flow, and enabling advanced applications such as autonomous driving. Traditional data transmission methods struggle to meet the demands of massive, real-time, and dynamically changing traffic information. Semantic communication, with its deep mining and precise transmission of data semantics, offers a novel solution for traffic environment perception and understanding in IoV. The following research cases demonstrate the effectiveness of semantic communication in this field.
The deployment of distributed sensing nodes in the IoV can significantly enhance the coverage and accuracy of environmental perception. Ref. [39] proposed an environmental semantic communication method to support distributed sensing-assisted networks, particularly in millimeter-wave (mmWave) and terahertz (THz) communication systems. By deploying multiple distributed sensing nodes, the study extracts environmental semantic information (e.g., bounding boxes and masks of targets) using RGB cameras and transmits this information to base stations to predict optimal beams. This approach significantly reduces the storage and transmission requirements of raw image data while improving the system’s adaptability to dynamic environments. The experiment was conducted based on the DeepSense 6G dataset, and the results show that the proposed solution can accurately predict the optimal beam in the real communication environment while reducing the transmission overhead of sensing data.
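The scale of the saving from sending bounding boxes instead of raw images can be illustrated with simple arithmetic. The numbers below are assumptions, not values from [39]: an uncompressed 960 × 540 RGB frame with 8 bits per channel, and ten detected objects described by four 32-bit coordinates each.

```python
def raw_image_bytes(width, height, channels=3, bytes_per_sample=1):
    """Size of an uncompressed image in bytes."""
    return width * height * channels * bytes_per_sample


def bbox_payload_bytes(num_objects, coords_per_box=4, bytes_per_coord=4):
    """Size of a bounding-box-only semantic payload in bytes."""
    return num_objects * coords_per_box * bytes_per_coord


raw_bytes = raw_image_bytes(960, 540)     # 1,555,200 bytes
semantic_bytes = bbox_payload_bytes(10)   # 160 bytes
compression_ratio = raw_bytes / semantic_bytes  # 9720.0
```

In practice the raw image would usually be compressed before transmission and masks are larger than boxes, so the real gap is smaller, but the order-of-magnitude difference explains why semantic payloads ease the load on the sensing links.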
In autonomous driving scenarios, collaborative perception among vehicles can provide more comprehensive environmental information, thereby enhancing driving safety. Ref. [40] proposed a semantic communication framework that combines Joint Source–Channel Coding (JSCC) and Hybrid Automatic Repeat Request (HARQ) technologies. The framework extracts critical semantic features through importance maps and optimizes transmission. By employing intermediate fusion, sensor data (e.g., LiDAR point clouds) from vehicles are semantically processed and transmitted using Orthogonal Frequency Division Multiplexing (OFDM) in time-varying multipath fading channels. The OPV2V dataset was used in the study. Simulation results show that, compared to the traditional separate source–channel coding method, the proposed model significantly improved sensing performance and throughput. In addition, this study introduced a new semantic error detection method, SimCRC, and combined it with HARQ to enhance transmission reliability in low-SNR scenarios.
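The interplay between error detection and retransmission can be sketched with a conventional scheme. In the example below, an ordinary bit-exact CRC over quantized features stands in for SimCRC, whose design is not described here, and retransmissions are combined by simple averaging (in the spirit of chase combining). The channel model, the quantization step, and all function names are assumptions.

```python
import random
import zlib


def quantize(features, step=0.5):
    """Map each feature to an integer quantization level."""
    return [round(f / step) for f in features]


def checksum(features, step=0.5):
    """CRC-32 over the quantized features (a stand-in, not SimCRC)."""
    return zlib.crc32(repr(quantize(features, step)).encode())


def harq_receive(features, noise_std, max_rounds, rng, step=0.5):
    """Retransmit over a noisy channel until the combined estimate passes the CRC."""
    if max_rounds < 1:
        raise ValueError("max_rounds must be at least 1")
    target = checksum(features, step)  # assumed to arrive error-free
    sums = [0.0] * len(features)
    estimate = list(sums)
    for rounds in range(1, max_rounds + 1):
        received = [f + rng.gauss(0.0, noise_std) for f in features]
        sums = [s + r for s, r in zip(sums, received)]
        estimate = [s / rounds for s in sums]  # averaging reduces noise variance
        if checksum(estimate, step) == target:
            return estimate, rounds, True
    return estimate, max_rounds, False


# Hypothetical semantic features sent over a noisy link.
sent = [1.0, -2.0, 0.5]
decoded, used_rounds, ok = harq_receive(sent, noise_std=0.2, max_rounds=8,
                                        rng=random.Random(1))
```

A semantic check would presumably tolerate small deviations that leave the meaning intact, whereas the bit-exact CRC used here rejects any change in the quantized values.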
Both studies verify the effectiveness of the proposed methods through specific experimental designs. The work of Ref. [39] performed outstandingly in reducing data transmission overhead and in beam prediction, making it more suitable for optimizing communication resources. However, as the amount of data increases, it demands a higher processing capacity from the base station. The work of Ref. [40] has clear advantages in sensing performance, with indicators such as AP@0.5 and AP@0.7 superior to those of traditional methods, making it more suitable for scenarios requiring precise target recognition. However, as the amount of data increases, it may face the problem of insufficient computing resources.
Through the above discussion on traffic environment perception and understanding, we can see the importance of the semantic information extraction techniques proposed in Section 3 in practical applications. These techniques not only improve the accuracy of environmental perception but also optimize communication efficiency by reducing the amount of data transmitted. In addition, the design of the semantic communication architecture ensures the reliable transmission of information in dynamic and complex traffic environments. The resource allocation and management strategies further optimize the transmission of semantic information, ensuring that critical information can be preferentially processed and transmitted under limited bandwidth and computing resources. The integrated application of these techniques demonstrates the potential of semantic communication in enhancing the performance of the IoV.
To achieve traffic environment perception and understanding, vehicles must be capable of processing and integrating information from different sensors and data sources in real time. The challenge lies in the fact that the data formats and accuracies of different sensors vary, and the traffic environment itself is dynamic, with constantly changing weather conditions and traffic flows. These factors not only increase the complexity of data fusion but also require the perception system to be highly adaptable and accurate so that it can still provide reliable environmental understanding in a changing environment.

4.2. Intelligent Driving Decision Support

In intelligent driving scenarios, vehicles need to perceive the surrounding environment in real time and make rational decisions, which relies on accurate and efficient information transmission and processing. Semantic communication technology can precisely extract key semantic information from massive traffic data, avoiding the transmission of redundant data in traditional communication methods, thereby significantly improving data transmission efficiency and decision-making speed. Below are several research cases that utilize semantic communication to enhance intelligent driving decisions.
In autonomous driving scenarios, vehicles need to perceive dynamic changes in the environment, such as sudden road obstacles (e.g., road maintenance, accidents, or vehicle breakdowns). In such cases, vehicles must quickly decide whether to change lanes or maintain their current lane to ensure safe and efficient driving. Ref. [41] proposed a deep learning-based semantic traffic control system that assigns semantic encoding tasks to vehicles themselves rather than relying on server processing, thereby alleviating resource constraints. Specifically, autonomous vehicles (AVs) collect critical driving dynamics (e.g., speed, acceleration, and position) through sensors and use the deep Q-network (DQN) algorithm for decision making. The system converts complex driving environment information into compact semantic representations through semantic encoding and transmits them to the traffic monitoring module. The traffic monitoring module receives the encoded information and processes it using the DQN algorithm to derive appropriate driving decisions. In the experiment, the researchers used the KUL traffic sign classification dataset from Belgium, the MASTIF dataset from Croatia, and the German Traffic Sign Recognition Benchmark dataset to train the object detection model through a custom framework. The experimental results show that, compared to directly transmitting images, the semantic system significantly shortened the file transmission time and greatly reduced the file size in all three test cases: a single image, 900 images (equivalent to a 30 s video), and 1800 images (equivalent to a 60 s video). In addition, the accuracy of the traffic optimization model of this system reached 92%. Although affected by the noise added in the simulation (such as changes in vehicle speed), this result still indicates that the system is highly reliable in intelligent driving decision making.
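The value update that underlies DQN can be shown in its simplest, tabular form. The sketch below uses a lookup table instead of the neural network a DQN would use, and it is not the model of [41]; the state and action names, the rewards, and the learning parameters are all hypothetical.

```python
def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """One Q-learning step: move Q(s, a) toward reward + gamma * max_a' Q(s', a')."""
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


def greedy_action(q, state, actions):
    """Pick the action with the highest estimated value in this state."""
    return max(actions, key=lambda a: q.get((state, a), 0.0))


# Hypothetical lane-change scenario: an obstacle blocks the current lane.
ACTIONS = ("keep_lane", "change_lane")
q_table = {}
for _ in range(20):
    q_update(q_table, "obstacle_ahead", "keep_lane", -1.0, "blocked", ACTIONS)
    q_update(q_table, "obstacle_ahead", "change_lane", 1.0, "clear", ACTIONS)

decision = greedy_action(q_table, "obstacle_ahead", ACTIONS)  # "change_lane"
```

A DQN replaces `q_table` with a network that maps the (semantic) state representation to action values, which is what allows it to generalize to states it has not seen.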
In more complex traffic scenarios, such as fully connected autonomous vehicle networks involving High-Altitude Platforms (HAPs), semantic communication also plays a crucial role. Ref. [42] proposed an AI-driven semantic communication framework for this scenario. In this framework, the traffic infrastructure (TI) extracts semantic concepts from traffic signs using a convolutional autoencoder (CAE) and transmits these concepts to macro base stations (MBSs). Upon receiving the semantic concepts, the MBS uses the Proximal Policy Optimization (PPO) algorithm to make decisions for Connected and Automated Vehicles (CAVs). For example, when the TI observes a “left turn ahead” sign, it extracts the corresponding semantic concept and transmits it to the MBS. The MBS analyzes the concept using the PPO algorithm and generates appropriate driving instructions, such as “turn left”, for the CAVs. In the experiment, 12 traffic signs from the TSRD dataset were used, and the proposed framework was compared with the Augmented Random Search (ARS) baseline. The results show that the proposed framework outperformed ARS in terms of training convergence and received rewards, with the total reward increased by 37.11%. In addition, as the size of the Resource Block (RB) increased, the total reward received by the CAVs also increased, which indicates that more complete semantic information helps to improve the accuracy of decision making. This framework was able to reduce the communication cost by up to 90.81%, demonstrating great potential in improving communication efficiency.
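For readers unfamiliar with PPO, its core is the clipped surrogate objective, sketched below in its standard form. This illustrates the general algorithm rather than the specific configuration of [42]; the clipping range `clip_eps = 0.2` is a commonly used default and the sample values are made up.

```python
def ppo_clip_objective(ratios, advantages, clip_eps=0.2):
    """Mean of min(r * A, clip(r, 1 - eps, 1 + eps) * A) over a batch.

    ratios:     new-policy / old-policy probability of each sampled action
    advantages: estimated advantage of each sampled action
    """
    if not ratios or len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must be non-empty and aligned")
    terms = []
    for r, a in zip(ratios, advantages):
        clipped = max(1.0 - clip_eps, min(r, 1.0 + clip_eps))
        # Taking the minimum removes the incentive for overly large policy updates.
        terms.append(min(r * a, clipped * a))
    return sum(terms) / len(terms)


# Two hypothetical samples: one good action made more likely,
# one bad action made less likely.
objective = ppo_clip_objective([1.5, 0.5], [1.0, -1.0])  # approximately 0.2
```

The first sample is clipped from 1.5 to 1.2 and the second from 0.5 to 0.8, so neither can push the policy further than the trust region implied by `clip_eps`.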
Overall, both semantic communication methods have achieved positive results in the experiments, but some key factors still need to be considered in actual deployment. The method in [41] shows high real-time performance and decision-making accuracy when dealing with sudden roadblock scenarios, making it suitable for autonomous driving scenarios with high requirements for response speed. However, its stability may be affected by the quality of model training data and the network environment. The framework in [42] has significant advantages in reducing communication costs and improving resource utilization, and it is suitable for large-scale data transmission and decision support in the Internet of Vehicles. However, in practical applications, it is necessary to address the resource requirements of reinforcement learning training and improve the system’s adaptability to complex traffic environments.
In the context of exploring decision support for intelligent driving, the semantic communication architecture discussed in Section 3 demonstrates its core role in handling real-time decision-making information. Semantic information extraction technology can filter out key information from the massive data generated by vehicles and their environments, such as the positions and speed changes of potential obstacles, which are crucial for the immediate response of vehicles. In addition, through optimized resource allocation and management strategies, it is ensured that these key pieces of information can be transmitted and processed in the shortest possible time, thereby supporting vehicles in making quick and accurate driving decisions. The combination of this technology and application not only improves the reliability of intelligent driving systems but also provides strong technical support for the development of future autonomous driving technologies.
In the scenario of intelligent driving decision support, vehicles need to quickly and accurately extract key semantic information from complex traffic data and make timely driving decisions based on it. The challenge in this process is how to ensure that in a dynamic and unpredictable traffic environment the decision-making system can not only accurately understand environmental information but also respond quickly to avoid potential collisions and improve driving safety. In addition, how to effectively utilize vehicle data while protecting personal privacy is also an issue that needs to be addressed.

4.3. IoV Service Optimization

In the field of the IoV, leveraging semantic communication technology to enhance service performance has become a critical research direction. Semantic communication, with its advantages of reducing data volume and improving real-time performance and accuracy, offers users more efficient and personalized service experiences. Below, specific research cases are analyzed to delve into its applications in IoV service optimization.
The demand for next-generation ultra-reliable low-latency communication (xURLLC) in the IoV is growing, necessitating the optimization of semantic data packet transmission to reduce queuing delays and enhance user experience. Ref. [43] addressed the challenges posed by resource scarcity and xURLLC requirements by proposing a SemCom-empowered Service Supplying Solution (S4). Using queuing theory, the study derives a formula for the queuing delay of semantic data packets under knowledge base matching. An optimization problem is formulated to minimize queuing delays, subject to reliability constraints in knowledge base construction (KBC) and vehicle service pairing (VSP). Using the Lagrangian dual method, the original problem is decomposed into two subproblems: in the first stage, a heuristic algorithm is applied to solve the KBC subproblem for potential vehicle pairs while identifying the optimal knowledge base construction strategy; in the second stage, based on the results of the first stage, a greedy algorithm is used to determine the optimal pairing strategy for the VSP subproblem. This solution significantly reduces queuing delays, improves the throughput of semantic data packets, and enhances user satisfaction. While the theoretical advantages of this approach are evident through precise mathematical derivations and algorithm design, its practical application may incur high computational costs and require advanced hardware performance, limiting its use in low-configuration vehicles.
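The two ingredients of this approach, a queuing-delay expression and a greedy pairing step, can be sketched under simplifying assumptions. The example below uses the textbook M/M/1 mean waiting time rather than the knowledge-base-dependent formula derived in [43], and a plain lowest-delay-first pairing rule; the requester and provider names and the delay values are hypothetical.

```python
def mm1_queuing_delay(arrival_rate, service_rate):
    """Mean waiting time in queue for an M/M/1 system: lambda / (mu * (mu - lambda))."""
    if arrival_rate < 0 or service_rate <= 0:
        raise ValueError("rates must be non-negative (arrival) and positive (service)")
    if arrival_rate >= service_rate:
        return float("inf")  # unstable queue: the backlog grows without bound
    return arrival_rate / (service_rate * (service_rate - arrival_rate))


def greedy_pairing(delays):
    """Pair requesters with providers, lowest delay first, each used at most once.

    delays: dict mapping (requester, provider) -> expected queuing delay
    """
    used_requesters, used_providers, pairs = set(), set(), []
    for (req, prov), _ in sorted(delays.items(), key=lambda kv: kv[1]):
        if req not in used_requesters and prov not in used_providers:
            pairs.append((req, prov))
            used_requesters.add(req)
            used_providers.add(prov)
    return pairs


example_delay = mm1_queuing_delay(2.0, 4.0)  # 0.25 time units
candidate_delays = {
    ("v1", "p1"): 0.25, ("v1", "p2"): 0.50,
    ("v2", "p1"): 0.30, ("v2", "p2"): 0.90,
}
pairing = greedy_pairing(candidate_delays)  # [("v1", "p1"), ("v2", "p2")]
```

In this toy instance the greedy result has a total delay of 1.15, whereas the alternative pairing would give 0.80, which shows that a greedy rule is fast but not guaranteed to be globally optimal in general.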
Traditional semantic communication often fails to meet users’ personalized needs because it overlooks critical information about the receiver. Ref. [44] proposed a receiver-centric semantic communication system in which the receiver can request specific semantic information from senders. The sender utilizes the natural language processing capabilities of large language models (e.g., GPT-4) to understand the request, combined with tools such as object detection and license plate detection, to analyze video or image data. Through an attention mechanism-based semantic extraction algorithm, relevant semantic information is accurately extracted and fed back in text form. If task planning is unreasonable, a task reflection module replans or selects the most relevant video frames for transmission. This system emphasizes user-centric personalized demands, leveraging advanced language models and intelligent algorithms to address the shortcomings of traditional semantic communication. However, reliance on large language models introduces challenges, such as high training costs and potential delays in real-time scenarios due to model processing speed limitations, affecting service immediacy.
In in-vehicle scenarios, efficient perception and interaction are essential to enhance user experience. Ref. [45] proposed a 6G semantic communication solution integrated with smart fabrics. Smart fabrics perceive the position, movements, and environmental conditions (e.g., temperature and humidity) of individuals inside the vehicle through principles such as electromagnetic induction, collecting raw data. The semantic sensing terminal uses a deep network based on recurrent neural networks (RNNs) and their variants (e.g., LSTM, GRU, etc.) to extract environmental data from time-series features of the data stream, reducing the volume of transmitted data through dimensionality reduction techniques. Semantic information is transmitted via an integrated space–air–ground network to a remote AI platform, where technologies like GANs enable semantic recovery and advanced deep learning services. This solution achieves high-quality signal reconstruction and advanced services under different SNR conditions, effectively improving communication efficiency and service quality in in-vehicle scenarios. However, because the solution is specifically designed for in-vehicle scenarios, where 6G technologies and smart fabrics are leveraged to provide high-quality communication and service experiences, its applicability to other IoV scenarios, such as vehicle-to-vehicle or vehicle-to-infrastructure communication, is limited.
A comparison of the experimental results of the three studies shows that the S4 scheme in [43] performs excellently in the service optimization of the Internet of Vehicles. It has significant advantages, especially in reducing latency and increasing throughput, and is suitable for Internet of Vehicles scenarios with high real-time requirements. The system in [44] performs well in reducing the amount of data transmission and meeting the semantic information needs of the receiving end, and it is suitable for scenarios that require efficient transmission of key information. However, there is still room for improvement in its performance when dealing with complex requests. The DL-SCMT model in [45] performs outstandingly in terms of signal reconstruction and classification accuracy and has good robustness, especially in environments with a low signal-to-noise ratio. It is suitable for 6G application scenarios that require high reliability and intelligent interaction.
The discussion on the service optimization of the IoV further confirms the potential of semantic communication in enhancing the service quality and user experience. Through identifying the true intentions behind user requests, semantic information extraction technology enables service providers to more accurately meet user needs. The optimized strategies for resource allocation and management ensure the efficient transmission of semantic information between users and service providers, reducing unnecessary data transmission and improving the service response speed. This user-centered service optimization approach not only enhances user satisfaction but also provides a new direction for the innovation and development of IoV services.
In the scenario of IoV service optimization, the challenge lies in how to meet users’ demands for personalized services while maintaining the efficiency and consistency of services. As users’ expectations for IoV services continue to rise, service providers need to optimize resource allocation under limited network resources to support services with different quality of service requirements. In addition, integrating diverse services into a unified platform while ensuring service interoperability and an excellent user experience is also a technical challenge.

4.4. Intelligent Traffic Management

In the field of intelligent traffic management, semantic communication has demonstrated significant potential and value. Numerous research papers have focused on this area, providing rich insights and practical outcomes for the application of semantic communication. Below, we explore the specific applications of semantic communication in intelligent traffic management, including vehicle count prediction, multitask communication, and emergency vehicle scheduling.
Vehicle count prediction is a critical component of intelligent traffic management, providing decision-making support for traffic signal control and congestion mitigation. Ref. [46] proposed a vehicle count prediction model based on semantic communication. This model employs a joint convolutional neural network (CNN) and long short-term memory network (LSTM) to construct a semantic encoder–decoder architecture. Raw images captured by cameras are first processed by the semantic encoder to extract key semantic information, such as vehicle density maps, which are then converted into symbols and transmitted to the central traffic controller (CTC). The CTC utilizes the received semantic information, combined with the LSTM’s ability to handle temporal correlations in image sequences, to achieve accurate vehicle count predictions for various road segments. Based on these predictions, effective traffic management strategies can be formulated. The experiment utilized the TRANCOS dataset, which consists of 1244 images with a total of 46,796 labeled cars. The results show that, compared to the traditional source encoder/decoder method, this model reduced the communication overhead by 54.42%. In terms of the Mean Absolute Error (MAE) and Mean Squared Error (MSE), this model outperformed the existing methods based on GRU, LSTM, and FCN-rLSTM. The MAE was reduced by 90.71%, 73.03%, and 19.1%, respectively, and the MSE was reduced by 103.91%, 77.74%, and 13.45%, respectively. This result indicates that semantic communication technology can significantly improve the accuracy of vehicle count prediction while reducing the amount of data transmission, providing strong support for real-time traffic management.
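For reference, the two error metrics and one common way of expressing a relative reduction can be written out as follows. The counts in the example are made up and are unrelated to the TRANCOS results; `relative_reduction` is one possible convention and is an assumption about how such percentages are typically computed.

```python
def mae(predicted, actual):
    """Mean Absolute Error between predicted and true vehicle counts."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


def mse(predicted, actual):
    """Mean Squared Error between predicted and true vehicle counts."""
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


def relative_reduction(baseline, proposed):
    """Percentage by which `proposed` lowers an error relative to `baseline`."""
    return 100.0 * (baseline - proposed) / baseline


# Hypothetical per-image vehicle counts.
true_counts = [8, 12]
model_counts = [10, 12]
example_mae = mae(model_counts, true_counts)      # 1.0
example_mse = mse(model_counts, true_counts)      # 2.0
example_reduction = relative_reduction(4.0, 1.0)  # 75.0
```

With this definition, the reduction of a non-negative error metric is at most 100%, so any larger figure would have to rest on a different convention.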
As the complexity of intelligent transportation systems increases, multitask processing capabilities have become increasingly important. Ref. [47] discussed a scalable multitask semantic communication system (SMSC-FIR) that focuses on tasks such as vehicle reidentification (ReID), vehicle color classification, and vehicle type classification. The system prioritizes the transmission of critical semantic features using a Feature Importance Ranking (FIR) method. The experiment used the VeRi-776 dataset, which contains more than 50,000 images of 776 vehicles. The results demonstrate that under low-SNR conditions, SMSC-FIR outperformed existing state-of-the-art methods across tasks. For instance, at 0 dB, compared to the sequential selection and random selection methods, ReID performance improved by 40.0% and 212.2%, respectively, color classification performance by 11.0% and 9.8%, respectively, and type classification performance by 6.9% and 7.0%, respectively. These results indicate that, through feature importance ranking and dynamic coding rate adjustment, SMSC-FIR can improve task performance in complex multitask scenarios, especially when channel conditions are poor.
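The idea of importance-ranked transmission can be illustrated with a short Python sketch. It is not the SMSC-FIR implementation from ref. [47]: the importance scores, the fixed transmission budget, and the function names are assumptions made for illustration, and the sequential baseline simply sends the first features in index order.

```python
from typing import List, Sequence


def select_by_importance(importance: Sequence[float], budget: int) -> List[int]:
    """Return the indices of the `budget` highest-scoring features, best first."""
    order = sorted(range(len(importance)), key=lambda i: importance[i], reverse=True)
    return order[:budget]


def select_sequential(num_features: int, budget: int) -> List[int]:
    """Baseline: transmit the first `budget` features regardless of importance."""
    return list(range(min(budget, num_features)))


def retained_importance(importance: Sequence[float], chosen: Sequence[int]) -> float:
    """Fraction of the total importance mass covered by the chosen features."""
    return sum(importance[i] for i in chosen) / sum(importance)


# Hypothetical importance scores for six semantic features.
scores = [0.05, 0.40, 0.10, 0.30, 0.05, 0.10]
ranked = select_by_importance(scores, budget=2)
sequential = select_sequential(len(scores), budget=2)
print(ranked, retained_importance(scores, ranked))
print(sequential, retained_importance(scores, sequential))
```

With the same budget of two features, the ranked selection covers most of the importance mass in this toy example, whereas the sequential selection covers far less, which is the intuition behind prioritizing critical features when the channel can carry only part of the representation.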
In multiuser semantic communication systems, channel interference remains a critical issue affecting communication quality. Ref. [48] proposed a Diffusion Model-based Channel Enhancer (DMCE) to address channel interference in multiuser semantic communication systems for intelligent traffic management. The DMCE improves channel equalization performance by learning the specific data distribution of channel effects on transmitted semantic features, thereby suppressing noise in channel state information (CSI) estimation. In the system model, multiple users transmit semantic features of multisource images (e.g., RGB and infrared images) of the same traffic scene through a MIMO channel to a centralized receiver. The receiver then recovers and fuses the semantic features using channel equalization and DMCE-enhanced CSI estimation, ultimately generating a semantic segmentation image of the traffic scene. The experiment used a multisource image dataset containing 1569 pairs of RGB-IR urban traffic scene images. The results show that under low-SNR conditions, the DMCE could increase the mean Intersection over Union (mIoU) by more than 25%. In addition, the DMCE reduced the Normalized Mean Squared Error (NMSE) of the CSI estimation by 14 dB on average. These results indicate that the DMCE can improve the accuracy of semantic feature recovery in complex wireless channel environments, providing strong support for the practical application of multiuser semantic communication systems.
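The two metrics quoted above can be sketched in a few lines of Python. The code below assumes the usual definitions, namely NMSE as the ratio of estimation error energy to true channel energy expressed in dB, and mIoU as the per-class intersection over union averaged over the classes that occur; the channel coefficients and label maps are hypothetical and unrelated to the experiments in ref. [48].

```python
import math
from typing import Sequence


def nmse_db(h_true: Sequence[complex], h_est: Sequence[complex]) -> float:
    """NMSE of a channel estimate in dB: 10*log10(||h - h_est||^2 / ||h||^2)."""
    err = sum(abs(a - b) ** 2 for a, b in zip(h_true, h_est))
    ref = sum(abs(a) ** 2 for a in h_true)
    return 10.0 * math.log10(err / ref)


def mean_iou(pred: Sequence[int], truth: Sequence[int], num_classes: int) -> float:
    """Mean intersection over union across classes present in pred or truth."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, truth) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, truth) if p == c or t == c)
        if union:  # skip classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious)


# Hypothetical two-tap channel and a slightly noisy estimate.
h = [1 + 0j, 0 + 1j]
h_hat = [1.1 + 0j, 0 + 0.9j]
print(round(nmse_db(h, h_hat), 2))

# Hypothetical flattened segmentation labels with two classes.
print(round(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2), 4))
```

Because the NMSE is expressed on a logarithmic scale, an average reduction of 14 dB corresponds to the linear NMSE shrinking by a factor of 10^1.4, that is, roughly 25.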
Efficient scheduling of emergency vehicles is crucial for improving emergency response capabilities. Ref. [49] explored the application of semantic communication in 6G networks, particularly for resource optimization in emergency vehicle scheduling. Through semantic encoding and decoding, data from emergency vehicles (e.g., GPS location, speed, and medical equipment information) are converted into semantic information and sent to the traffic infrastructure, which dynamically adjusts traffic signal durations to prioritize emergency vehicles. This semantic communication-based scheduling method not only improves the efficiency of emergency vehicles but also minimizes disruption to other traffic flows, thus enhancing emergency response capabilities. However, the scalability of this solution in large-scale urban traffic networks remains a challenge: ensuring efficient operation and rapid response in increasingly complex, high-traffic environments is a key focus for future research.
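As a rough illustration of how decoded semantic information could drive signal timing, the following Python sketch extends a green phase just long enough for an approaching emergency vehicle to clear the junction. It is not the scheme of ref. [49]: the message fields, the arrival-time rule, and the cap on the extension are all hypothetical assumptions chosen to keep the example small.

```python
from dataclasses import dataclass


@dataclass
class EmergencyMessage:
    """Hypothetical decoded semantic message from an emergency vehicle."""
    distance_to_junction_m: float
    speed_mps: float


def eta_seconds(msg: EmergencyMessage) -> float:
    """Estimated time until the vehicle reaches the junction."""
    return msg.distance_to_junction_m / max(msg.speed_mps, 0.1)


def adjusted_green(base_green_s: float, remaining_green_s: float,
                   msg: EmergencyMessage, max_extension_s: float = 30.0) -> float:
    """Return the green duration after applying emergency priority.

    The extension is capped (assumed 30 s) to limit disruption to cross traffic.
    """
    eta = eta_seconds(msg)
    if eta <= remaining_green_s:
        return base_green_s  # vehicle clears within the current phase
    extension = min(eta - remaining_green_s, max_extension_s)
    return base_green_s + extension


ambulance = EmergencyMessage(distance_to_junction_m=300.0, speed_mps=20.0)
print(adjusted_green(base_green_s=40.0, remaining_green_s=10.0, msg=ambulance))
```

The cap on the extension reflects the trade-off noted above between prioritizing the emergency vehicle and limiting disruption to other traffic flows.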
Among these four studies, ref. [46] verified the advantages of the CNN-LSTM-based semantic communication model in reducing overhead and improving prediction accuracy by training and testing the model on the TRANCOS dataset. Ref. [47] proposed the SMSC-FIR system and carried out multitask learning experiments on the VeRi-776 dataset; through comparison with existing methods, it demonstrated the performance improvement of the system under low-SNR conditions. Ref. [48] proposed the DMCE scheme, which introduces a diffusion model into the multiuser semantic communication system to enhance the estimation of channel state information and thereby improves the accuracy of semantic feature recovery. Ref. [49] applied semantic encoding and decoding to emergency vehicle scheduling in 6G networks, although its scalability in large-scale urban traffic networks remains to be addressed.
In the application scenarios of intelligent traffic management, the key technologies of semantic communication introduced in Section 3 have demonstrated their value in improving the overall efficiency and safety of the traffic system. Semantic information extraction enables the traffic management system to quickly identify key events, such as traffic accidents or congestion, from complex traffic data. The semantic communication architecture supports the rapid and reliable transmission of this information, while the resource allocation and management strategies ensure the timeliness and accuracy of traffic management decisions. The integrated application of these technologies not only raises the level of intelligence of traffic management but also provides technical support for the sustainable development of urban traffic.
In the scenario of intelligent traffic management, the system needs to handle large-scale data from an entire city or region, which poses challenges to data storage, processing, and analysis capabilities. Moreover, effective traffic control depends on collaboration among different traffic management systems, which requires effective coordination across their management platforms. At the same time, the implementation of intelligent traffic management may be restricted by existing policies and regulations. It is therefore necessary to cooperate with governments and regulatory authorities to update regulations so that they can accommodate the development and application of new technologies.

4.5. Lessons Learned

Semantic communication significantly improves IoV applications like traffic perception and emergency response by reducing data redundancy and enhancing decision accuracy. Case studies demonstrate that task-oriented semantic frameworks (e.g., HARQ-enhanced perception and generative AI-driven scheduling) outperform traditional methods in low-SNR scenarios. However, generalization to complex environments (e.g., extreme weather) remains a challenge. Collaborative systems leveraging edge computing and distributed knowledge sharing are critical for scalability, while privacy-preserving mechanisms (e.g., federated learning) must be prioritized in sensitive applications like remote diagnostics.

5. Challenges and Future Research Directions

5.1. Knowledge Base Creation and Updating

In the semantic communication-based IoV, the knowledge base plays an indispensable role as the foundation for achieving accurate semantic understanding and effective communication. However, the construction and updating of knowledge bases still face significant challenges. In terms of construction, the complexity of data sources is a primary obstacle. In the IoV, vehicles generate massive amounts of data through various onboard sensors, while external data sources such as traffic management centers and map service providers continuously contribute additional data. These data are often heterogeneous in format and lack unified standards, making integration highly challenging. Moreover, the inherent complexity and ambiguity of traffic-related knowledge pose difficulties in knowledge representation. For instance, describing traffic congestion is not a simple binary state of “congested” or “clear”; it involves multiple degrees of congestion influenced by factors such as time, location, and traffic flow. Accurately representing such complex knowledge in a machine-understandable form remains a major challenge. Selecting appropriate knowledge representation methods, such as semantic networks, ontologies, or production rules, is crucial for transforming complex traffic knowledge into a usable format. In semantic communication, the knowledge base must be dynamically updated based on data from sources, channels, and tasks. The dynamic nature of IoV also imposes stringent real-time requirements on knowledge base updates. However, current update mechanisms often suffer from delays and cannot meet the real-time demands of the IoV.
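The two points above, graded rather than binary knowledge and the need for timely updates, can be illustrated with a minimal Python sketch. The congestion thresholds, the class names, and the staleness limit are hypothetical assumptions for illustration; they do not correspond to any specific knowledge base design in the surveyed literature.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Dict, Optional


class Congestion(IntEnum):
    """Graded congestion levels instead of a binary congested/clear state."""
    FREE = 0
    LIGHT = 1
    MODERATE = 2
    HEAVY = 3


def congestion_level(speed_kmh: float, free_flow_kmh: float) -> Congestion:
    """Map the ratio of current to free-flow speed to a level (assumed thresholds)."""
    ratio = speed_kmh / free_flow_kmh
    if ratio >= 0.8:
        return Congestion.FREE
    if ratio >= 0.6:
        return Congestion.LIGHT
    if ratio >= 0.4:
        return Congestion.MODERATE
    return Congestion.HEAVY


@dataclass
class Entry:
    level: Congestion
    updated_at: float  # timestamp in seconds


class KnowledgeBase:
    """Toy knowledge base that refuses to answer from stale entries."""

    def __init__(self, max_age_s: float) -> None:
        self.max_age_s = max_age_s
        self._entries: Dict[str, Entry] = {}

    def update(self, segment_id: str, speed_kmh: float,
               free_flow_kmh: float, now: float) -> None:
        level = congestion_level(speed_kmh, free_flow_kmh)
        self._entries[segment_id] = Entry(level, now)

    def query(self, segment_id: str, now: float) -> Optional[Congestion]:
        entry = self._entries.get(segment_id)
        if entry is None or now - entry.updated_at > self.max_age_s:
            return None  # unknown or too old to be trusted
        return entry.level


kb = KnowledgeBase(max_age_s=5.0)
kb.update("segment-1", speed_kmh=10.0, free_flow_kmh=50.0, now=100.0)
print(kb.query("segment-1", now=103.0))  # still fresh
print(kb.query("segment-1", now=106.0))  # stale, treated as unknown
```

Returning no answer for a stale entry, rather than an outdated level, is one simple way to make update delays visible to the consumer of the knowledge base.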
To address these challenges, future research can explore multidimensional approaches by integrating cutting-edge technologies. On the one hand, large AI models can contribute their natural language processing and knowledge understanding capabilities to analyze and integrate heterogeneous traffic data and to construct knowledge graphs efficiently. The continual learning capabilities of large models can also facilitate real-time updates to the knowledge base, enabling it to promptly capture changes in the traffic environment. On the other hand, federated learning can be employed to enable collaborative training on distributed data while preserving data privacy, thereby enriching the knowledge base and enhancing its generalizability. Additionally, blockchain technology can be introduced to ensure the security and consistency of the updating process. By leveraging its tamper-resistant and decentralized characteristics, blockchain can record update operations, ensuring reliable knowledge transmission. These combined efforts can drive the advancement of knowledge base technologies in the IoV. Finally, the uncertainty of user behavior brings additional challenges to knowledge base updates. Users’ personalized needs and behavior patterns change over time, so the system must continuously learn and adapt to these changes to ensure that the knowledge base reflects users’ latest preferences and demands.

5.2. Semantic Understanding and Ambiguity

The IoV involves multisource heterogeneous data from various vehicles, sensors, and infrastructure, which differ in format, semantics, and precision. For example, sensors from different manufacturers may use varying measurement units and data formats, while data exchange between roadside infrastructure and vehicles often encounters semantic mismatches. Integrating and accurately understanding the semantics of such heterogeneous data is a significant challenge, requiring the development of universal data models and semantic transformation methods to achieve seamless integration and effective utilization. Additionally, natural language descriptions of traffic scenarios and instructions often contain ambiguities. For instance, the phrase “slow down at the upcoming intersection” lacks a clear definition of what constitutes “slow”, leading to potential differences in interpretation among drivers or vehicle systems. In semantic communication, resolving such ambiguities is critical. This can be achieved by establishing precise semantic models, incorporating contextual information, and leveraging knowledge base reasoning to ensure accurate transmission and understanding of semantic information. In the process of multisource data fusion, the heterogeneity of data and the dynamic changes of the environment may lead to ambiguities and misunderstandings of semantic information. Therefore, the development of advanced data fusion technologies capable of handling these complexities is the key to achieving accurate semantic understanding.
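A semantic transformation step of the kind described above can be sketched as a small normalization layer that maps vendor-specific readings into one common schema. The field names, the set of supported units, and the target schema in the following Python example are hypothetical assumptions; a real deployment would follow whatever data model the stakeholders agree on.

```python
from typing import Dict

# Conversion factors from supported speed units to metres per second.
TO_MPS: Dict[str, float] = {
    "m/s": 1.0,
    "km/h": 1000.0 / 3600.0,
    "mph": 1609.344 / 3600.0,
}


def normalize_speed(value: float, unit: str) -> float:
    """Convert a speed reading to metres per second; reject unknown units."""
    if unit not in TO_MPS:
        raise ValueError(f"unsupported speed unit: {unit!r}")
    return value * TO_MPS[unit]


def to_common_schema(raw: dict) -> dict:
    """Map a hypothetical vendor reading {'id', 'speed', 'unit'} to a common schema."""
    return {
        "vehicle_id": str(raw["id"]),
        "speed_mps": normalize_speed(raw["speed"], raw["unit"]),
    }


# Two hypothetical sensors reporting the same speed in different units.
reading_a = to_common_schema({"id": 17, "speed": 36.0, "unit": "km/h"})
reading_b = to_common_schema({"id": "17", "speed": 10.0, "unit": "m/s"})
print(reading_a, reading_b)
```

Rejecting unknown units explicitly, instead of passing the raw value through, avoids silently mixing incompatible measurements, which is one source of the semantic mismatches discussed above.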
To address these challenges, future research can focus on integrating advanced technologies to achieve seamless data integration and accurate semantic understanding. On the one hand, efforts should be directed toward developing universal data models and semantic transformation methods, including cross-domain unified models, dynamic semantic conversion, and international standardization. On the other hand, research should emphasize ambiguity resolution through context-aware semantic understanding, multimodal semantic fusion, and human–machine collaborative semantic interaction. Advanced technologies such as deep learning, reinforcement learning, and generative AI can be utilized for feature extraction, autonomous decision making, and data augmentation of heterogeneous data. Knowledge graph construction and fusion can provide background knowledge and reasoning support for semantic understanding. Additionally, edge computing can be employed to decentralize semantic processing, enabling intelligent decision making at the edge and enhancing the real-time performance and efficiency of semantic communication.

5.3. Real-Time and Reliability Requirements

In the IoV, vehicles are in constant high-speed motion, which leads to rapidly changing network topologies and unstable communication channels. Semantic communication must therefore deliver real-time performance in highly dynamic environments so that semantic information is transmitted and processed in time. For example, in scenarios such as emergency braking or lane changes, relevant semantic information (e.g., braking intent or lane change direction) must be quickly transmitted to surrounding vehicles to prevent collisions. Vehicles thus need to rapidly encode and decode large amounts of road condition and driving intent information during operation. When multiple vehicles communicate simultaneously, encoding and decoding delays may occur, affecting the timeliness and accuracy of information transmission. The dynamically changing traffic environment places further demands on real-time performance: the system must quickly adapt to environmental changes, such as fluctuations in traffic flow and sudden changes in road conditions, to ensure the continuity of communication and the timeliness of decision making.
To meet the real-time and reliability requirements, it is necessary to optimize the communication protocol, adopt efficient channel access technology and resource allocation algorithms, and reduce the communication delay. For example, ref. [113] proposed the TDMA-based Capture-Aware MAC protocol (CT-MAC), which improves broadcast reliability and channel utilization efficiency by optimizing the frame length and exploiting the capture effect. This suggests that MAC-layer optimization can better cope with the high vehicle dynamics of vehicular networks, improving real-time performance and reliability and providing a useful reference for combining vehicular networking with semantic communication. Meanwhile, semantic information in the IoV is critical for traffic safety and must be transmitted reliably, yet interference, fading, and noise in wireless channels can lead to data loss or errors. To enhance reliability, error control techniques (e.g., ARQ and FEC), multipath transmission, and redundancy mechanisms should be employed to deliver semantic information accurately under adverse channel conditions. Future research can focus on low-latency, high-accuracy semantic encoding and decoding algorithms, such as those based on the Transformer architecture, to improve information processing speed and meet the real-time and reliability requirements of the IoV.
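The two error control techniques mentioned above can be illustrated with a minimal Python sketch: a rate-1/3 repetition code with majority voting stands in for FEC, and a CRC-checked retransmission loop stands in for ARQ. Both are deliberately simple textbook examples rather than schemes used in the surveyed works, and the channel is modeled as a hypothetical function supplied by the caller.

```python
import zlib
from typing import Callable, List, Optional


def fec_encode(bits: List[int], n: int = 3) -> List[int]:
    """Repetition code: repeat every bit n times."""
    return [b for b in bits for _ in range(n)]


def fec_decode(coded: List[int], n: int = 3) -> List[int]:
    """Majority vote over each group of n received bits."""
    return [1 if 2 * sum(coded[i:i + n]) > n else 0
            for i in range(0, len(coded), n)]


def send_with_arq(payload: bytes, channel: Callable[[bytes], bytes],
                  max_tries: int = 4) -> Optional[bytes]:
    """Retransmit until the CRC-32 check passes or the retry budget is spent."""
    frame = payload + zlib.crc32(payload).to_bytes(4, "big")
    for _ in range(max_tries):
        received = channel(frame)
        body, crc = received[:-4], received[-4:]
        if zlib.crc32(body).to_bytes(4, "big") == crc:
            return body  # receiver would acknowledge here
    return None  # delivery failed within the retry budget


# FEC: one flipped bit per group of three is corrected by the majority vote.
coded = fec_encode([1, 0, 1])
coded[0] ^= 1
coded[4] ^= 1
print(fec_decode(coded))

# ARQ over an ideal channel (identity function) succeeds on the first try.
print(send_with_arq(b"braking intent", lambda frame: frame))
```

The retry budget matters in the IoV setting because each retransmission adds delay; a message such as braking intent that cannot be delivered within a few attempts may already be outdated, which is why ARQ is typically combined with FEC rather than used alone.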

5.4. Security and Privacy Protection

Semantic communication involves sensitive semantic information such as vehicle trajectories, driving behaviors, and passenger details, the leakage of which could pose significant threats to user privacy. For instance, attackers could analyze semantic communication data to infer a vehicle’s travel routes and stop locations, thereby violating user privacy. If semantic information is intercepted or tampered with, this could lead to erroneous vehicle decisions that may jeopardize driving safety. Additionally, the IoV encompasses numerous communication links, including vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) connections. These links are vulnerable to security threats such as hacking, interference, semantic message forgery, tampering, and man-in-the-middle attacks. Malicious actors could send false traffic condition information to mislead vehicles or alter semantic communication commands between vehicles and infrastructure, causing traffic disruptions or even accidents. While intelligent connected vehicles can alleviate spectrum pressure through semantic communication, deep learning-based semantic communication is susceptible to adversarial attacks [110], backdoor attacks [111], and other security threats. The IoV based on semantic communication also faces risks such as eavesdropping attacks, adversarial attacks, and poisoning attacks [112].
To address these challenges, it is essential to develop encryption algorithms specifically designed for semantic information, integrating digital signatures and identity authentication technologies to ensure the confidentiality, integrity, and authenticity of semantic communication. Dedicated security communication protocols for the IoV should be established, leveraging technologies such as blockchain and federated learning [114] to build distributed trust mechanisms that prevent communication link attacks and data theft. Existing encryption algorithms, such as homomorphic encryption, often introduce high latency and struggle to meet the millisecond-level response requirements of the IoV. It is therefore necessary to design lightweight security protocols, such as dynamic key distribution mechanisms based on edge computing. Moreover, existing detection methods mostly target specific attack patterns (such as FGSM or PGD) and generalize poorly to unknown attack types. Combining meta-learning with online adaptive mechanisms may be a potential solution. Users’ acceptance of new technologies also affects the effectiveness of privacy protection. Making data processing procedures transparent and strengthening data security measures can enhance users’ trust in the system and boost their confidence in privacy protection.
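As a lightweight illustration of integrity and authenticity protection for semantic messages, the following Python sketch attaches an HMAC-SHA256 tag to a message and verifies it on reception. HMAC is a symmetric message authentication code and is used here only as a simple stand-in for the digital signatures mentioned above; it does not provide confidentiality, and the shared key, the message fields, and the function names are hypothetical assumptions.

```python
import hashlib
import hmac
import json


def _canonical(message: dict) -> bytes:
    """Serialize a message deterministically so both sides hash the same bytes."""
    return json.dumps(message, sort_keys=True, separators=(",", ":")).encode("utf-8")


def sign_message(message: dict, key: bytes) -> dict:
    """Wrap a semantic message with an HMAC-SHA256 authentication tag."""
    tag = hmac.new(key, _canonical(message), hashlib.sha256).hexdigest()
    return {"body": message, "tag": tag}


def verify_message(envelope: dict, key: bytes) -> bool:
    """Recompute the tag and compare it in constant time."""
    expected = hmac.new(key, _canonical(envelope["body"]), hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, envelope["tag"])


# Hypothetical shared key; in practice it would come from a key distribution scheme.
shared_key = b"example-key-for-illustration-only"
envelope = sign_message({"intent": "brake", "speed_mps": 12.5}, shared_key)
print(verify_message(envelope, shared_key))  # authentic message

envelope["body"]["intent"] = "accelerate"     # simulated tampering in transit
print(verify_message(envelope, shared_key))  # tag no longer matches
```

A keyed hash of this kind is inexpensive to compute, which makes it a plausible building block for the lightweight protocols discussed above, although it requires that the communicating parties already share a key.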

5.5. Standardization and Regulation

Currently, different enterprises and institutions in the IoV industry may define and encode semantics differently, making seamless semantic communication between vehicles and infrastructure challenging. Moreover, the IoV involves multiple industries, including automotive, telecommunications, and transportation, each of which has its own semantic standards and business requirements, which complicates the harmonization of semantic standards.
Therefore, stakeholders such as automotive manufacturers, communication companies, and research institutions should collaborate to develop unified semantic standards for the IoV, clarifying semantic definitions, encoding rules, and data formats to ensure compatibility across different devices and systems. Active participation in international standardization efforts is also crucial to align domestic standards with global ones, fostering the coordinated development of the worldwide IoV industry.

5.6. Lessons Learned

The semantic IoV ecosystem faces unresolved challenges in security (e.g., adversarial attacks), real-time knowledge base updates, and standardization. Dynamic environments demand adaptive semantic models with low-latency processing, while cross-domain semantic alignment requires unified ontologies. Future research should focus on AI-driven solutions (e.g., large language models for knowledge fusion), lightweight encryption for resource-constrained devices, and global standardization efforts to ensure interoperability. Collaborative frameworks integrating academia, industry, and policymakers will accelerate practical adoption.

6. Conclusions

This paper provides a comprehensive and in-depth review of semantic communication technologies in the IoV that covers key aspects ranging from foundational technical backgrounds to specific applications, challenges, and future directions. As the core of intelligent transportation systems, the IoV integrates multiple communication modes to achieve comprehensive connectivity between vehicles and their environment. However, traditional communication technologies face numerous bottlenecks in IoV applications. Semantic communication, with its unique advantages such as reducing redundant transmissions, enhancing communication efficiency and accuracy, and ensuring information security, offers innovative solutions to IoV communication challenges while demonstrating broad application prospects in the field. By enabling efficient extraction, transmission, and understanding of semantic information, semantic communication can significantly improve the communication efficiency, data processing capabilities, and intelligent decision-making support of the IoV, providing robust technical support for safer, more efficient, and smarter transportation systems.
Nevertheless, the application of semantic communication in the IoV still faces many challenges. Future research needs to further explore these critical issues and propose more effective solutions by integrating advanced technologies such as artificial intelligence, edge computing, and blockchain. Additionally, strengthening interdisciplinary collaboration and fostering close ties between academia, industry, and application will accelerate the practical implementation of semantic communication technologies in the IoV and promote the comprehensive development of intelligent transportation systems.

Author Contributions

Conceptualization, S.Y. and Q.W.; methodology, S.Y.; investigation, S.Y.; data curation, S.Y.; writing—original draft preparation, S.Y. and Q.W.; writing—review and editing, P.F. and Q.F.; visualization, S.Y.; supervision, Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant No. 61701197, in part by the National Key Research and Development Program of China under Grant No. 2021YFA1000500(4), and in part by the 111 project under Grant No. B23008.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The author Qiang Fan was employed by the company Qualcomm. The remaining authors declare that this research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Kaiwartya, O.; Abdullah, A.H.; Cao, Y.; Altameem, A.; Prasad, M.; Lin, C.T.; Liu, X. Internet of vehicles: Motivation, layered architecture, network model, challenges, and future aspects. IEEE Access 2016, 4, 5356–5373.
  2. Luo, X.; Chen, H.H.; Guo, Q. Semantic communications: Overview, open issues, and future research directions. IEEE Wirel. Commun. 2022, 29, 210–219.
  3. Gündüz, D.; Qin, Z.; Aguerri, I.E.; Dhillon, H.S.; Yang, Z.; Yener, A.; Wong, K.K.; Chae, C.B. Beyond transmitting bits: Context, semantics, and task-oriented communications. IEEE J. Sel. Areas Commun. 2022, 41, 5–41.
  4. Shi, G.; Xiao, Y.; Li, Y.; Gao, D.; Xie, X. Semantic communication networking for the intelligence of everything. Chin. J. Internet Things 2021, 5, 26–36.
  5. Duan, W.; Gu, J.; Wen, M.; Zhang, G.; Ji, Y.; Mumtaz, S. Emerging technologies for 5G-IoV networks: Applications, trends and opportunities. IEEE Netw. 2020, 34, 283–289.
  6. Maksimychev, O.; Matiukhina, E.; Ostroukh, A.; Vasiliev, Y. Connected Vehicle Remote Diagnostic System. In Proceedings of the 2021 Systems of Signals Generating and Processing in the Field of on Board Communications, Moscow, Russia, 16–18 March 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–5.
  7. Ang, L.M.; Seng, K.P.; Ijemaru, G.K.; Zungeru, A.M. Deployment of IoV for smart cities: Applications, architecture, and challenges. IEEE Access 2018, 7, 6473–6492.
  8. Audebert, N.; Le Saux, B.; Lefèvre, S. Segment-before-detect: Vehicle detection and classification through semantic segmentation of aerial images. Remote Sens. 2017, 9, 368.
  9. Dewangan, D.K.; Sahu, S.P. Road Detection Using Semantic Segmentation-Based Convolutional Neural Network for Intelligent Vehicle System. In Data Engineering and Communication Technology; Reddy, K.A., Devi, B.R., George, B., Raju, K.S., Eds.; Springer: Singapore, 2021; pp. 629–637.
  10. Löhdefink, J.; Bär, A.; Schmidt, N.M.; Hüger, F.; Schlicht, P.; Fingscheidt, T. Focussing Learned Image Compression to Semantic Classes for V2X Applications. In Proceedings of the 2020 IEEE Intelligent Vehicles Symposium (IV), Las Vegas, NV, USA, 19 October–13 November 2020; pp. 1641–1648.
  11. Xiao, Z.; Mo, Z.; Jiang, K.; Yang, D. Multimedia Fusion at Semantic Level in Vehicle Cooperactive Perception. In Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), San Diego, CA, USA, 23–27 July 2018; pp. 1–6.
  12. Pan, Q.; Tong, H.; Lv, J.; Luo, T.; Zhang, Z.; Yin, C.; Li, J. Image Segmentation Semantic Communication over Internet of Vehicles. In Proceedings of the 2023 IEEE Wireless Communications and Networking Conference (WCNC), Glasgow, UK, 26–29 March 2023; pp. 1–6.
  13. Gimenez-Guzman, J.M.; Leyva-Mayorga, I.; Popovski, P. Semantic V2X Communications for Image Transmission in 6G Systems. IEEE Netw. 2024, 38, 48–54.
  14. Feng, Y.; Shen, H.; Shan, Z.; Yang, Q.; Shi, X. Semantic Communication for Edge Intelligence Enabled Autonomous Driving System. arXiv 2024, arXiv:2406.10606.
  15. Lu, J.; Yang, W.; Xiong, Z.; Xing, C.; Tafazolli, R.; Quek, T.Q.; Debbah, M. Generative AI-Enhanced Multi-Modal Semantic Communication in Internet of Vehicles: System Design and Methodologies. arXiv 2024, arXiv:2409.15642.
  16. Xu, W.; Zhang, Y.; Wang, F.; Qin, Z.; Liu, C.; Zhang, P. Semantic Communication for the Internet of Vehicles: A Multiuser Cooperative Approach. IEEE Veh. Technol. Mag. 2023, 18, 100–109.
  17. Eldeeb, E.; Shehab, M.; Alves, H. A Multi-Task Oriented Semantic Communication Framework for Autonomous Vehicles. arXiv 2024, arXiv:2403.12997.
  18. Hu, J.; Wang, F.; Xu, W.; Gao, H.; Zhang, P. SemHARQ: Semantic-Aware HARQ for Multi-task Semantic Communications. arXiv 2024, arXiv:2404.08490.
  19. Lv, J.; Tong, H.; Pan, Q.; Zhang, Z.; He, X.; Luo, T.; Yin, C. Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving. arXiv 2024, arXiv:2401.10153.
  20. Ribouh, S.; Hadid, A. SEECAD: Semantic End-to-End Communication for Autonomous Driving. In Proceedings of the 2024 IEEE Intelligent Vehicles Symposium (IV), Jeju Island, Republic of Korea, 2–5 June 2024; pp. 1808–1813.
  21. Wan, Z.; Liu, S.; Xu, Z.; Ni, W.; Ye, S.; Wang, F. A Semantic Communication Approach for Multiscene Target Detection in Intelligent Vehicle Networks. IEEE Internet Things J. 2024, 11, 35877–35890.
  22. Wan, Z.; Liu, S.; Xu, Z.; Zou, J. A Low Latency Routing for Telematics Combined with Image Semantic Communication. In Proceedings of the 2023 IEEE 3rd International Conference on Software Engineering and Artificial Intelligence (SEAI), Xiamen, China, 16–18 June 2023; pp. 229–234.
  23. Feng, H.; Yang, Y.; Han, Z. Scalable AI Generative Content for Vehicular Network Semantic Communication. arXiv 2023, arXiv:2311.13782.
  24. Zhang, R.; Xiong, K.; Du, H.; Niyato, D.; Kang, J.; Shen, X.; Poor, H.V. Generative AI-Enabled Vehicular Networks: Fundamentals, Framework, and Case Study. IEEE Netw. 2024, 38, 259–267.
  25. Raha, A.D.; Munir, M.S.; Adhikary, A.; Qiao, Y.; Hong, C.S. Generative ai-driven semantic communication framework for nextg wireless network. arXiv 2023, arXiv:2310.09021.
  26. Yang, W.; Xiong, Z.; Yuan, Y.; Jiang, W.; Quek, T.Q.; Debbah, M. Agent-driven Generative Semantic Communication with Cross-Modality and Prediction. IEEE Trans. Wirel. Commun. 2024, 24, 2233–2248.
  27. Lin, Y.; Murase, T.; Ji, Y.; Bao, W.; Zhong, L.; Li, J. Blockchain-based knowledge-aware semantic communications for remote driving image transmission. Digit. Commun. Netw. 2024, in press.
  28. Yang, W.; Chi, X.; Zhao, L.; Xiong, Z.; Jiang, W. Task-Driven Semantic-Aware Green Cooperative Transmission Strategy for Vehicular Networks. IEEE Trans. Commun. 2023, 71, 5783–5798.
  29. Wang, Z.; Leng, S.; Zhang, H.; Yuen, C. Deep Semantic Communication for Knowledge Sharing in Internet of Vehicles. IEEE Internet Things J. 2025.
  30. Shao, Z.; Wu, Q.; Fan, P.; Cheng, N.; Fan, Q.; Wang, J. Semantic-Aware Resource Allocation Based on Deep Reinforcement Learning for 5G-V2X HetNets. IEEE Commun. Lett. 2024, 28, 2452–2456.
  31. Shao, Z.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Wang, J.; Ben Letaief, K. Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning. IEEE Internet Things J. 2024, 11, 38521–38536.
  32. Shao, Z.; Wu, Q.; Fan, P.; Wang, K.; Fan, Q.; Chen, W.; Letaief, K.B. Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning. arXiv 2024, arXiv:2411.04672.
  33. Zhu, M.; Feng, C.; Chen, J.; Guo, C.; Gao, X. Video Semantics based Resource Allocation Algorithm for Spectrum Multiplexing Scenarios in Vehicular Networks. In Proceedings of the 2021 IEEE/CIC International Conference on Communications in China (ICCC Workshops), Xiamen, China, 28–30 July 2021; pp. 31–36.
  34. Su, J.; Liu, Z.; Xie, Y.a.; Ma, K.; Du, H.; Kang, J.; Niyato, D. Semantic Communication-Based Dynamic Resource Allocation in D2D Vehicular Networks. IEEE Trans. Veh. Technol. 2023, 72, 10784–10796.
  35. Ma, X.; Yang, X. V2X Wireless Resource Optimization for Vehicle Road Collaboration in Vehicle Platoons. In Proceedings of the 2024 IEEE/CIC International Conference on Communications in China (ICCC), Hangzhou, China, 7–9 August 2024; pp. 203–207.
  36. Ng, W.C.; Du, H.; Lim, W.Y.B.; Xiong, Z.; Niyato, D.; Miao, C. Stochastic Resource Allocation for Semantic Communication-Aided Virtual Transportation Networks in the Metaverse. In Proceedings of the 2024 IEEE Wireless Communications and Networking Conference (WCNC), Dubai, United Arab Emirates, 21–24 April 2024; pp. 1–6.
  37. Zheng, G.; Ni, Q.; Navaie, K.; Pervaiz, H.; Min, G.; Kaushik, A.; Zarakovitis, C. Mobility-Aware Split-Federated with Transfer Learning for Vehicular Semantic Communication Networks. IEEE Internet Things J. 2024, 11, 17237–17248.
  38. Liu, J.; Lu, Y.; Wu, H.; Dai, Y. Efficient Resource Allocation and Semantic Extraction for Federated Learning Empowered Vehicular Semantic Communication. In Proceedings of the 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall), Hong Kong, China, 10–13 October 2023; pp. 1–5.
  39. Imran, S.; Charan, G.; Alkhateeb, A. Environment Semantic Communication: Enabling Distributed Sensing Aided Networks. arXiv 2024, arXiv:2402.14766.
  40. Sheng, Y.; Liang, L.; Ye, H.; Jin, S.; Li, G.Y. Semantic Communication for Cooperative Perception using HARQ. arXiv 2024, arXiv:2409.09042.
  41. Figetakis, E.; Bello, Y.; Refaey, A.; Shami, A. Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks. In Proceedings of the ICC 2024—IEEE International Conference on Communications, Denver, CO, USA, 9–13 June 2024; pp. 5449–5454.
  42. Deb Raha, A.; Shirajum Munir, M.; Adhikary, A.; Qiao, Y.; Park, S.B.; Seon Hong, C. An Artificial Intelligent-Driven Semantic Communication Framework for Connected Autonomous Vehicular Network. In Proceedings of the 2023 International Conference on Information Networking (ICOIN), Bangkok, Thailand, 11–14 January 2023; pp. 352–357.
  43. Xia, L.; Sun, Y.; Niyato, D.; Feng, D.; Feng, L.; Imran, M.A. xURLLC-Aware Service Provisioning in Vehicular Networks: A Semantic Communication Perspective. IEEE Trans. Wirel. Commun. 2024, 23, 4475–4488.
  44. Liu, X.; Sun, Y.; Wang, Z.; You, L.; Pan, H.; Wang, F.; Cui, S. Receiver-Centric Generative Semantic Communications. arXiv 2024, arXiv:2411.03127. Available online: http://arxiv.org/abs/2411.03127 (accessed on 25 February 2025).
  45. Tang, Y.; Zhou, N.; Yu, Q.; Wu, D.; Hou, C.; Tao, G.; Chen, M. Intelligent Fabric Enabled 6G Semantic Communication System for In-Cabin Scenarios. IEEE Trans. Intell. Transp. Syst. 2023, 24, 1153–1162.
  46. Kadam, S.; Kim, D.I. Semantic Communication-Empowered Vehicle Count Prediction for Traffic Management. In Proceedings of the 2024 IEEE Wireless Communications and Networking Conference (WCNC), Dubai, United Arab Emirates, 21–24 April 2024; pp. 1–6.
  47. Hu, J.; Wang, F.; Xu, W.; Gao, H.; Zhang, P. Scalable Multi-Task Semantic Communication System with Feature Importance Ranking. In Proceedings of the ICASSP 2023—2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 4–10 June 2023; pp. 1–5. [Google Scholar] [CrossRef]
  48. Zeng, Y.; He, X.; Chen, X.; Tong, H.; Yang, Z.; Guo, Y.; Hao, J. DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems. arXiv 2024, arXiv:2401.16017. [Google Scholar]
  49. Sharif, S.; Khandaker, F.; Ejaz, W. Semantic Communication: Implication for Resource Optimization in 6G Networks. In Proceedings of the 2024 IEEE International Conference on Advanced Telecommunication and Networking Technologies (ATNT), Johor Bahru, Malaysia, 9–10 September 2024; Volume 1, pp. 1–4. [Google Scholar] [CrossRef]
  50. Bhardwaj, I.; Khara, S. Research trends in Architecture, Security, Services and Applications of Internet of Vehicles (IOV). In Proceedings of the 2018 International Conference on Computing, Power and Communication Technologies (GUCON), Greater Noida, India, 28–29 September 2018; pp. 91–95. [Google Scholar] [CrossRef]
  51. Sharma, S.; Kaushik, B. A survey on internet of vehicles: Applications, security issues & solutions. Veh. Commun. 2019, 20, 100182. [Google Scholar]
  52. Wu, Q.; Wang, W.; Fan, P.; Fan, Q.; Zhu, H.; Letaief, K.B. Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation Networks. IEEE Trans. Netw. Serv. Manag. 2024, 21, 4179–4196. [Google Scholar] [CrossRef]
  53. Zhang, C.; Zhang, W.; Wu, Q.; Fan, P.; Fan, Q.; Wang, J.; Letaief, K.B. Distributed Deep Reinforcement Learning Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing. IEEE Internet Things J. 2025, 12, 4899–4913. [Google Scholar] [CrossRef]
  54. Al-Sultan, S.; Al-Doori, M.M.; Al-Bayatti, A.H.; Zedan, H. A comprehensive survey on vehicular Ad Hoc network. J. Netw. Comput. Appl. 2014, 37, 380–392. [Google Scholar] [CrossRef]
  55. Tahmasbi-Sarvestani, A.; Mahjoub, H.N.; Fallah, Y.P.; Moradi-Pari, E.; Abuchaar, O. Implementation and evaluation of a cooperative vehicle-to-pedestrian safety application. IEEE Intell. Transp. Syst. Mag. 2017, 9, 62–75. [Google Scholar] [CrossRef]
  56. Wu, C.; Yoshinaga, T.; Ji, Y.; Zhang, Y. Computational intelligence inspired data delivery for vehicle-to-roadside communications. IEEE Trans. Veh. Technol. 2018, 67, 12038–12048. [Google Scholar] [CrossRef]
  57. Jomaa, D.; Yella, S.; Dougherty, M. A comparative study between vehicle activated signs and speed indicator devices. Transp. Res. Procedia 2017, 22, 115–123. [Google Scholar] [CrossRef]
  58. Endo, M.; Tanaka, K. Evaluation of Storage Capacity of Electric Vehicles for Vehicle to Grid Considering Driver’s Perspective. In Proceedings of the 2018 IEEE International Conference on Environment and Electrical Engineering and 2018 IEEE Industrial and Commercial Power Systems Europe (EEEIC/I&CPS Europe), Palermo, Italy, 12–15 June 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–5. [Google Scholar]
  59. Sadiku, M.N.; Tembely, M.; Musa, S.M. Internet of vehicles: An introduction. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2018, 8, 11. [Google Scholar] [CrossRef]
  60. Agbaje, P.; Anjum, A.; Mitra, A.; Oseghale, E.; Bloom, G.; Olufowobi, H. Survey of Interoperability Challenges in the Internet of Vehicles. IEEE Trans. Intell. Transp. Syst. 2022, 23, 22838–22861. [Google Scholar] [CrossRef]
  61. Panigrahy, S.K.; Emany, H. A survey and tutorial on network optimization for intelligent transport system using the internet of vehicles. Sensors 2023, 23, 555. [Google Scholar] [CrossRef] [PubMed]
  62. Yang, F.; Li, J.; Lei, T.; Wang, S. Architecture and Key Technologies for Internet of Vehicles: A Survey. J. Commun. Inf. Netw. 2017, 2, 1–17. [Google Scholar] [CrossRef]
  63. Liu, K.; Xu, X.; Chen, M.; Liu, B.; Wu, L.; Lee, V.C.S. A Hierarchical Architecture for the Future Internet of Vehicles. IEEE Commun. Mag. 2019, 57, 41–47. [Google Scholar] [CrossRef]
  64. Contreras-Castillo, J.; Zeadally, S.; Guerrero Ibáñez, J.A. A seven-layered model architecture for Internet of Vehicles. J. Inf. Telecommun. 2017, 1, 4–22. [Google Scholar] [CrossRef]
  65. Contreras-Castillo, J.; Zeadally, S.; Guerrero-Ibañez, J.A. Internet of Vehicles: Architecture, Protocols, and Security. IEEE Internet Things J. 2018, 5, 3701–3709. [Google Scholar] [CrossRef]
  66. Yang, F.; Wang, S.; Li, J.; Liu, Z.; Sun, Q. An overview of Internet of Vehicles. China Commun. 2014, 11, 1–15. [Google Scholar] [CrossRef]
  67. Alnasser, A.; Sun, H.; Jiang, J. Cyber security challenges and solutions for V2X communications: A survey. Comput. Netw. 2019, 151, 52–67. [Google Scholar] [CrossRef]
  68. Taslimasa, H.; Dadkhah, S.; Neto, E.C.P.; Xiong, P.; Ray, S.; Ghorbani, A.A. Security issues in Internet of Vehicles (IoV): A comprehensive survey. Internet Things 2023, 22, 100809. [Google Scholar] [CrossRef]
  69. Guo, J.; Bilal, M.; Qiu, Y.; Qian, C.; Xu, X.; Raymond Choo, K.K. Survey on digital twins for Internet of Vehicles: Fundamentals, challenges, and opportunities. Digit. Commun. Netw. 2024, 10, 237–247. [Google Scholar] [CrossRef]
  70. Wan, J.; Zhang, D.; Zhao, S.; Yang, L.T.; Lloret, J. Context-aware vehicular cyber-physical systems with cloud support: Architecture, challenges, and solutions. IEEE Commun. Mag. 2014, 52, 106–113. [Google Scholar] [CrossRef]
  71. Ji, B.; Zhang, X.; Mumtaz, S.; Han, C.; Li, C.; Wen, H.; Wang, D. Survey on the Internet of Vehicles: Network Architectures and Applications. IEEE Commun. Stand. Mag. 2020, 4, 34–41. [Google Scholar] [CrossRef]
  72. Ji, M.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Wang, J.; Letaief, K.B. Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications. IEEE Internet Things J. 2025, 12, 3613–3628. [Google Scholar] [CrossRef]
  73. Qi, K.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Letaief, K.B. Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-Shift Optimization and Multi-User Power Allocation. IEEE Internet Things J. 2025, 12, 764–778. [Google Scholar] [CrossRef]
  74. Zhou, H.; Xu, W.; Chen, J.; Wang, W. Evolutionary V2X Technologies Toward the Internet of Vehicles: Challenges and Opportunities. Proc. IEEE 2020, 108, 308–323. [Google Scholar] [CrossRef]
  75. Wang, X.; Wu, Q.; Fan, P.; Fan, Q.; Zhu, H.; Wang, J. Vehicle Selection for C-V2X Mode 4-Based Federated Edge Learning Systems. IEEE Syst. J. 2024, 18, 1927–1938. [Google Scholar] [CrossRef]
  76. Zhang, Z.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Letaief, K.B. DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoV. IEEE Trans. Green Commun. Netw. 2025. [Google Scholar] [CrossRef]
  77. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  78. Jiang, R.; Xiong, K.; Fan, P.; Zhou, L.; Zhong, Z. Outage probability and throughput of multirelay SWIPT-WPCN networks with nonlinear EH model and imperfect CSI. IEEE Syst. J. 2019, 14, 1206–1217. [Google Scholar] [CrossRef]
  79. Di, X.; Xiong, K.; Fan, P.; Yang, H.C.; Letaief, K.B. Optimal Resource Allocation in Wireless Powered Communication Networks With User Cooperation. IEEE Trans. Wirel. Commun. 2017, 16, 7936–7949. [Google Scholar] [CrossRef]
  80. Li, T.; Fan, P.; Chen, Z.; Letaief, K.B. Optimum Transmission Policies for Energy Harvesting Sensor Networks Powered by a Mobile Control Center. IEEE Trans. Wirel. Commun. 2016, 15, 6132–6145. [Google Scholar] [CrossRef]
  81. Zhou, H.; Fan, P.; Li, J. Global Proportional Fair Scheduling for Networks With Multiple Base Stations. IEEE Trans. Veh. Technol. 2011, 60, 1867–1879. [Google Scholar] [CrossRef]
  82. Lu, Z.; Li, R.; Lu, K.; Chen, X.; Hossain, E.; Zhao, Z.; Zhang, H. Semantics-Empowered Communications: A Tutorial-Cum-Survey. IEEE Commun. Surv. Tutor. 2024, 26, 41–79. [Google Scholar] [CrossRef]
  83. Xin, G.; Fan, P.; Letaief, K.B. Semantic Communication: A Survey of Its Theoretical Development. Entropy 2024, 26, 102. [Google Scholar] [CrossRef]
  84. Zhang, P.; Liu, Y.; Song, Y.; Zhang, J. Advances and challenges in semantic communications: A systematic review. Natl. Sci. Open 2024, 3, 20230029. [Google Scholar] [CrossRef]
  85. Bourtsoulatze, E.; Burth Kurka, D.; Gündüz, D. Deep Joint Source-Channel Coding for Wireless Image Transmission. IEEE Trans. Cogn. Commun. Netw. 2019, 5, 567–579. [Google Scholar] [CrossRef]
  86. Kurka, D.B.; Gündüz, D. DeepJSCC-f: Deep Joint Source-Channel Coding of Images with Feedback. IEEE J. Sel. Areas Inf. Theory 2020, 1, 178–193. [Google Scholar] [CrossRef]
  87. Xie, H.; Qin, Z.; Li, G.Y.; Juang, B.H. Deep Learning Enabled Semantic Communication Systems. IEEE Trans. Signal Process. 2021, 69, 2663–2675. [Google Scholar] [CrossRef]
  88. Weng, Z.; Qin, Z. Semantic Communication Systems for Speech Transmission. IEEE J. Sel. Areas Commun. 2021, 39, 2434–2444. [Google Scholar] [CrossRef]
  89. Xie, H.; Qin, Z.; Li, G.Y. Task-Oriented Multi-User Semantic Communications for VQA. IEEE Wirel. Commun. Lett. 2022, 11, 553–557. [Google Scholar] [CrossRef]
  90. Yang, W.; Du, H.; Liew, Z.Q.; Lim, W.Y.B.; Xiong, Z.; Niyato, D.; Chi, X.; Shen, X.; Miao, C. Semantic Communications for Future Internet: Fundamentals, Applications, and Challenges. IEEE Commun. Surv. Tutor. 2023, 25, 213–250. [Google Scholar] [CrossRef]
  91. Guler, B.; Yener, A. Semantic index assignment. In Proceedings of the 2014 IEEE International Conference on Pervasive Computing and Communication Workshops (PERCOM WORKSHOPS), Budapest, Hungary, 24–28 March 2014; pp. 431–436. [Google Scholar] [CrossRef]
  92. Papineni, K.; Roukos, S.; Ward, T.; Zhu, W.J. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, USA, 6–12 July 2002; pp. 311–318. [Google Scholar]
  93. Sara, U.; Akter, M.; Uddin, M.S. Image quality assessment through FSIM, SSIM, MSE and PSNR—A comparative study. J. Comput. Commun. 2019, 7, 8–18. [Google Scholar] [CrossRef]
  94. Vincent, E.; Gribonval, R.; Févotte, C. Performance measurement in blind audio source separation. IEEE Trans. Audio Speech Lang. Process. 2006, 14, 1462–1469. [Google Scholar] [CrossRef]
  95. Rix, A.; Beerends, J.; Hollier, M.; Hekstra, A. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs. In Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, UT, USA, 7–11 May 2001; Volume 2, pp. 749–752. [Google Scholar] [CrossRef]
  96. Wang, Y.; Chen, M.; Luo, T.; Saad, W.; Niyato, D.; Poor, H.V.; Cui, S. Performance Optimization for Semantic Communications: An Attention-Based Reinforcement Learning Approach. IEEE J. Sel. Areas Commun. 2022, 40, 2598–2613. [Google Scholar] [CrossRef]
  97. Jiang, P.; Wen, C.K.; Jin, S.; Li, G.Y. Deep Source-Channel Coding for Sentence Semantic Transmission with HARQ. IEEE Trans. Commun. 2022, 70, 5225–5240. [Google Scholar] [CrossRef]
  98. Yates, R.D.; Sun, Y.; Brown, D.R.; Kaul, S.K.; Modiano, E.; Ulukus, S. Age of Information: An Introduction and Survey. IEEE J. Sel. Areas Commun. 2021, 39, 1183–1210. [Google Scholar] [CrossRef]
  99. Molin, A.; Esen, H.; Johansson, K.H. Scheduling networked state estimators based on value of information. Automatica 2019, 110, 108578. [Google Scholar] [CrossRef]
  100. Qi, K.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Wang, J.; Letaief, K.B. Deep-Reinforcement-Learning-Based AoI-Aware Resource Allocation for RIS-Aided IoV Networks. IEEE Trans. Veh. Technol. 2024, 74, 1365–1378. [Google Scholar] [CrossRef]
  101. Qi, K.; Wu, Q.; Fan, P.; Cheng, N.; Fan, Q.; Wang, J. Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning. IEEE Commun. Lett. 2024, 28, 2427–2431. [Google Scholar] [CrossRef]
  102. Gu, X.; Wu, Q.; Fan, P.; Fan, Q.; Cheng, N.; Chen, W.; Letaief, K.B. DRL-Based Resource Allocation for Motion Blur Resistant Federated Self-Supervised Learning in IoV. IEEE Internet Things J. 2024, 12, 7067–7085. [Google Scholar] [CrossRef]
  103. Gu, X.; Wu, Q.; Fan, P.; Cheng, N.; Chen, W.; Letaief, K.B. DRL-based federated self-supervised learning for task offloading and resource allocation in ISAC-enabled vehicle edge computing. Digit. Commun. Netw. 2024, in press. [CrossRef]
  104. Sun, P. Semantic Communication Technologies. In A Guidebook for 5GtoB and 6G Vision for Deep Convergence; Springer Nature: Singapore, 2023; pp. 301–313. [Google Scholar] [CrossRef]
  105. ITU-T. Requirements and Reference Architecture of Semantic Aware Networking in Future Networks; Technical Report TR.Reqts SAN; ITU-T: Geneva, Switzerland, 2023. [Google Scholar]
  106. Lan, Q.; Wen, D.; Zhang, Z.; Zeng, Q.; Chen, X.; Popovski, P.; Huang, K. What is Semantic Communication? A View on Conveying Meaning in the Era of Machine Intelligence. J. Commun. Inf. Netw. 2021, 6, 336–371. [Google Scholar] [CrossRef]
  107. Liu, Y.; Wang, X.; Ning, Z.; Zhou, M.; Guo, L.; Jedari, B. A survey on semantic communications: Technologies, solutions, applications and challenges. Digit. Commun. Netw. 2024, 10, 528–545. [Google Scholar] [CrossRef]
  108. Chen, J.; Guo, C.; Feng, C.; Liu, C. Resource allocation for the semantic communication in the intelligent networked environment. Chin. J. Internet Things 2022, 6, 47–57. [Google Scholar] [CrossRef]
  109. Wang, Y.; Lin, Y.; Chen, L.; Shi, J. A Stackelberg game-based caching incentive scheme for roadside units in VANETs. Sensors 2020, 20, 6625. [Google Scholar] [CrossRef]
  110. Ribouh, S.; Hadid, A. Is Semantic Communication for Autonomous Driving Secured against Adversarial Attacks? In Proceedings of the 2024 IEEE 6th International Conference on AI Circuits and Systems (AICAS), Abu Dhabi, United Arab Emirates, 22–25 April 2024; pp. 139–143. [Google Scholar] [CrossRef]
  111. Xu, X.; Chen, Y.; Wang, B.; Bian, Z.; Han, S.; Dong, C.; Sun, C.; Zhang, W.; Xu, L.; Zhang, P. CSBA: Covert Semantic Backdoor Attack Against Intelligent Connected Vehicles. IEEE Trans. Veh. Technol. 2024, 73, 17923–17928. [Google Scholar] [CrossRef]
  112. Kang, J.; He, J.; Du, H.; Xiong, Z.; Yang, Z.; Huang, X.; Xie, S. Adversarial Attacks and Defenses for Semantic Communication in Vehicular Metaverses. IEEE Wirel. Commun. 2023, 30, 48–55. [Google Scholar] [CrossRef]
  113. Wang, Y.; Shi, J.; Chen, L.; Lu, B.; Yang, Q. A novel capture-aware TDMA-based MAC protocol for safety messages broadcast in vehicular ad hoc networks. IEEE Access 2019, 7, 116542–116554. [Google Scholar] [CrossRef]
  114. Zheng, G.; Ni, Q.; Navaie, K.; Pervaiz, H.; Zarakovitis, C. A Distributed Learning Architecture for Semantic Communication in Autonomous Driving Networks for Task Offloading. IEEE Commun. Mag. 2023, 61, 64–68. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
