Integrating Large Language Models into Traffic Systems: Integration Levels, Capability Boundaries, and an Information-Theoretic Perspective
Abstract
1. Introduction
- First, this paper analyzes the limitations of classical deep learning models in traffic applications from the perspectives of uncertainty modeling and information processing and distills the core modeling considerations and capability requirements at the intersection of LLMs and traffic optimization.
- Second, this paper conducts a narrative synthesis of LLM-based research in traffic representation, prediction and reasoning, planning and control, and autonomous agents under the proposed four-level integration perspective, revealing capability dependencies, recurring technical patterns, and differences in integration pathways across studies.
- Third, through comparative analysis, this paper discusses the applicability boundaries, failure modes, and appropriate modeling roles of LLMs in traffic tasks, and proposes corresponding directions for future research toward bridging entropy-based semantic reasoning and physical system modeling.
2. Methodological Framework of the Narrative Review
2.1. Identification and Screening of Core Applied Literature
- Search Strategy and Sources (December 2025):
- Exact Search Query:
- Screening Process and Inclusion Criteria:
- Final Composition of the Core Applied Corpus:
2.2. Selection of Foundational and Contextual References
2.3. Synthesis via the Four-Level Integration Framework
2.4. Methodological Transparency and Limitations
3. Fundamental Theories and Modeling Analysis of LLM Applications in Traffic
3.1. Limitations of Classical Deep Learning Models in Traffic Applications
3.2. From Transformer Architectures to Multimodal and Agent-Based Systems
3.3. Core Capabilities and Architectural Adaptation of LLM-Enabled Traffic Systems
3.4. LLM-Based Traffic Task Modeling
3.5. Entropy Structures in Traffic Tasks and Implications for LLM Integration
3.5.1. Three Hierarchical Levels of Entropy
- Representation-Level Entropy
- Inference- and Prediction-Level Entropy
- Multi-Agent and System-Level Entropy
3.5.2. Implications for LLM Integration
4. Review of LLM-Based Intelligent Transportation Applications
4.1. LLM Integration Classification Perspective in Traffic Systems
4.2. Research on Traffic Application of Representation Integrated
4.2.1. Traffic Application Research Based on Pure Text Analysis
4.2.2. Traffic Application Research Based on Text–Visual Fusion
| Classification | Ref. | Traffic Task | Visual/Language Input | Fusion Strategy/LLM’s Role | Problem Addressed |
|---|---|---|---|---|---|
| Semantic Alignment & Verification | [2] | Scene dataset construction | Scene images. VQA text | VQA-driven alignment/Semantic representation support | Lack of traffic-domain semantics in general MLLMs |
| [46] | Driving scene segmentation | Scene images. Language-supervised embeddings | Language-assisted alignment/Semantic constraint module | Purely visual segmentation lacks semantic constraints | |
| [47] | Efficient 3D data annotation | Point clouds/images. Consistency prompts | Language-guided verification/Semantic consistency verifier | Semantic drift in 3D annotations | |
| Deep Cross-Modal Fusion | [20] | EV charging demand prediction | Satellite imagery. Structured prompts | Spatial semantic alignment/Urban functional reasoner | Remote sensing lacks usage semantics |
| [4] | 3D environment perception for AD | Camera & LiDAR. Cross-modal features | Feature-level attention (MoE)/Semantic-space fusion enhancer | Heterogeneous sensor alignment | |
| [48] | Efficient ITS data management | Multimodal sensor data. Compression prompts | Knowledge-driven reconstruction/Data compression enhancer | Low efficiency in multimodal data storage/transmission | |
| Interactive Task & QA | [49] | Vehicle monitoring & interaction | Vehicle images. Detection-driven QA prompts | Task pipeline coordination/Interactive query controller | Single-function models lack semantic interaction |
| Deep Semantic Understanding | [38] | Holistic scene understanding | Scene images. Scene descriptions | Shared embedding space/Scene reasoning module | Vision models lack holistic & relational semantics |
| [39] | Scene generation, QA, explanation | Traffic sign images. Traffic instructions | Domain-adaptive MLLM/Generation & QA agent | General MLLMs lack traffic-specific knowledge | |
| [40] | Traffic accident analysis | Traffic videos. Explanation queries | Video–language reasoning/Causal explanation generator | Fragmented “detect-then-analyze” pipelines | |
| Real-Time Dynamic Analysis | [41] | Accident prediction, AD support | Real-time scene visuals. Contextual instructions | Real-time VLM fusion/Context enhancer & decision assistant | Traditional models ignore real-time visual/text context |
| [19] | Human–machine interaction in AD | Driving scene images. NL instructions | Cross-modal attention/Instruction grounding executor | Grounding complex language in dynamic scenes | |
| Unified Multimodal Model | [3] | Multi-sensor fusion, planning | Images, videos. (Implicit) time-series | LLM architecture/Explanation & decision analyzer | Inefficiency of multiple specialized models |
4.2.3. Traffic Application Research Based on Text–Spatiotemporal Fusion
| Classification | Ref. | Spatiotemporal Data | Fusion Strategy/LLM’s Role | Key Advantage |
|---|---|---|---|---|
| Semantic Feature Enhancement | [50] | Bicycle flow sequences | Text descriptions, LLM embeddings; as semantic enhancer | Improves prediction under special events by integrating contextual text. |
| Architectural Improvement & Fusion | [51] | Road network sequences | CNN + GCN embeddings; as few-shot predictor | Mitigates the problem of historical data scarcity. |
| [52] | Spatiotemporal flow data | Fusion layer embeddings; as spatiotemporal interaction enhancer | Strengthens the learning of temporal-spatial relationships. | |
| [53] | Flow with exogenous factors | Multi-source attention; as fluctuation modeler | (Metro passenger flow prediction)Integrates complex exogenous influencing factors. | |
| [54] | Road network sequences | Condensed Spatial Prompting; as frozen predictor | Highly efficient; compresses graph info into prompts. | |
| [55] | Spatiotemporal sequences | GAT + unified embeddings; as cooperative modeler | Decoupled design leverages graph networks (space) & LLMs (time). | |
| [56] | Large-scale road network flow | Lightweight generative model; as edge predictor | Reduces central cloud pressure via edge deployment. | |
| spacetime tokenizer, partial frozen attention | [57] | Urban flow data | Spatiotemporal tokenizer; as reasoning module | Enables strong zero-/few-shot transfer capability. |
| [58] | Location-based features | Partially frozen attention; as dependency capturer | Robust in low-data regimes; captures global dependencies. | |
| Robust & Probabilistic Modeling | [59] | Multimodal system data | Denoising diffusion; as structure restorer | Robust to input noise and missing data. |
| [60] | Road network sequences | Probabilistic modeling; as adaptive predictor | Adaptable and interpretable under data distribution shifts. | |
| Global Unified Modeling | [61] | Charging series & spatial data | Multi-source embeddings; as global modeler | (EV charging prediction) Unifies heterogeneous spatiotemporal and contextual data. |
4.2.4. Traffic Application Research Based on Text–Graph and Knowledge Integration
| Ref. | Graph/Knowledge | LLM Access Method | LLM’s Core Role | Key Problem Addressed | Methodological Edge | Task |
|---|---|---|---|---|---|---|
| [62] | Traffic accident KG | RAG + KG construction | KG builder & QA enhancer | Manual KG construction; poor interactivity | Semi-auto extraction, RAG reduces hallucination | Accident Q&A & causal analysis |
| [63] | Cross-domain EV KG | Not directly used | Target for LLM integration | Scattered EV ecosystem knowledge | Provides structured KB for LLM apps | EV decision support |
| [64] | Traffic accident KG | Not directly used | Classic KG framework | Complex multi-dimensional data | auto KG construction | Accident visualization |
| [65] | Ship collision KG | NLP + ontology | Baseline for LLM methods | Lengthy reports; inefficient extraction | Ontology-based semi-auto extraction | Maritime accident analysis |
| [66] | Regional demand graph | Geo-semantic embedding | Cross-city encoder | Poor graph model generalization | Transferable semantic priors | Delivery demand prediction |
| [67] | Road behavior KG | RAG + KG reasoning | Retriever & explainer | Black-box, uninterpretable predictions | Explainable prediction | Pedestrian & lane-change prediction |
| [68] | Trajectory semantics | Multi-source encoding | Travel reasoner | DL struggles with travel semantics | Explicit semantic fusion | Pedestrian mode identification |
| [69] | Traffic element hierarchy | Hierarchical CoT | Scene analyzer & generator | Uncontrollable simulation | CoT + Frenet for control | Controllable AD scenario simulation |
4.2.5. Summary of Research on Representation Integration
4.3. Research on Traffic Applications at the Reasoning and Prediction Integration
4.3.1. Research on LLM-Based Traffic Prediction
| Classification | Ref. | Task | LLM Role | Input Form/Prediction Output | Technical Feature | Main Contribution |
|---|---|---|---|---|---|---|
| Zero-shot/Few-shot Prompt Reasoning | [70] | mode choice | Zero-shot explainer | Task & attribute prompts/Mode + explanation | Zero-shot; CoT reasoning | Matches classic models without training data. |
| [71] | Next location | Zero-shot predictor | Textualized history/Location + reasoning | Zero-shot; geo-knowledge | proof of zero-shot mobility prediction; strong in cold-start. | |
| Multimodal and Hybrid Enhancement | [72] | EV charging demand | Direct predictor | Textualized spatiotemporal data/Demand sequence | End-to-end text-to-text | Reduces feature engineering; good generalization. |
| [28] | Lane-change prediction | Explainable predictor | NL scene prompts/Intent & trajectory | Fine-tuning; CoT | LLM for lane-change prediction; explainable. | |
| [73] | Traffic flow prediction | Explainable predictor | NL data descriptions/Future flow | Instruction tuning | Builds explainable flow prediction model. | |
| [20] | EV charging demand | Multimodal predictor | Image + text prompts/Station demand | Vision-semantic learning | Robust cross-scene prediction via visual semantics. | |
| [74] | Transit demand | Hybrid system | Aligned flow/OD data/Value + explanation | Modular; prompt tuning | Flexible prediction with event-aware explanations. | |
| [75] | CAV misbehavior detection | Adaptive detector | Textualized V2X messages/Authenticity (class) | Hybrid fine-tuning | Unified detection of forged CAV signals/motions. | |
| Dedicated Arch. | [59] | Multimodal sys. | Robust predictor | Heterogeneous sequences/Flow & demand values | Diffusion; ST-LLM | Unified framework for noisy multimodal data. |
4.3.2. Research on LLM-Enhanced Traffic Prediction
| Ref. | Traffic Task | Role of LLM | Enhancement Mechanism | Main Contribution |
|---|---|---|---|---|
| [11] | Traffic flow prediction | Responsibility-aware predictor | Multimodal textification and causal reasoning | Introduces reliability- and responsibility-oriented prediction |
| [76] | Trajectory prediction | Semantic interaction enhancer | Implicit semantic modeling via pretrained LLM | Demonstrates robustness in few-shot settings |
| [12] | Accident severity inference | Context modeling and explanation | Table-to-text conversion and interpretable reasoning | Achieves accurate and explainable severity prediction |
| [13] | Flow and demand prediction | Reasoning-guided feature reordering | Prompt engineering and multi-step reasoning | Improves generalization under small samples |
| [2] | Traffic scene understanding | Semantic infrastructure | Multimodal QA dataset construction | Strengthens LLM reasoning foundation for prediction |
4.3.3. Research on RAG-Enhanced Traffic Reasoning
| Ref. | Scenario | Retrieved Knowledge | Reasoning Role of LLM | Main Contribution |
|---|---|---|---|---|
| [67] | Behavior prediction | KG, behavioral relations | Causal reasoning with retrieval | Achieves explainable participant behavior prediction |
| [77] | Public transit services | Operational DB and policies | Constraint-aware reasoning hub | Extends RAG to rule-intensive service reasoning |
| [79] | Autonomous disengagement analysis | Historical report corpus | Pattern discovery and diagnosis | Enables large-scale root cause analysis |
| [80] | Risk assessment | Scenario-specific knowledge base | Causal and progressive reasoning | Reduces hallucination via external constraints |
| [78] | Fault diagnosis | Parameterized expert knowledge | Expert-level reasoning | Demonstrates parameterized RAG via LoRA |
| [26] | Traffic signal control | Historical decision memory | Experience-based coordination | Integrates RAG with Actor–Critic control |
4.3.4. Comprehensive Analysis of the Reasoning and Prediction Integration
4.4. Research on Traffic Applications at Planning and Control Integration
4.4.1. Research on LLM-Guided Reinforcement Learning (RL) Traffic Applications
- LLM-guided RL (Reward-centric)
- High-level Planning in Hierarchical RL
- Priors and Constraints for RL Exploration
| Classification | Subclass | Ref. | Scenario | LLM Role | Guidance Mechanism | Key Contribution |
|---|---|---|---|---|---|---|
| LLM-guided RL (Reward-centric) | Semantic reward generation from language goals | [81] | AD | Reward generator (with VLM) | Language goals, semantic rewards | Contrastive language rewards reduce collisions & improve generalization. |
| Automatic reward construction and evolution | [82] | Bus holding control | Reward function evolver | LLM generates/optimizes reward functions | Automatic reward evolution enhances control stability & robustness. | |
| High-level Planning in Hierarchical RL | LLM-based High-level Planners | [22] | AD | High-level planner | Long-term goals & meta-action guidance | Improves generalization in complex scenes & decision explainability. |
| [83] | Traffic signal control | Scene interpreter & decision generator | Perception, semantic reasoning, action generation | Vision-LLM joint control: LLM reasons for direct action, RL as fallback. | ||
| [24] | Complex urban signal control | Reasoning decision center | Tool-augmented LLM, zero-shot reasoning | Hybrid framework for zero-shot adaptation & robust control in complex scenarios. | ||
| LLM-enhanced Decision Refinement Modules | [84] | On-ramp merging control | Decision optimizer & enhancer | RL decisions to LLM CoT refinement | LLM refines/generalizes RL decisions for efficiency & safety in congestion. | |
| [85] | Highway AD | Explainable trajectory predictor | RL meta-action + state, trajectory generation | Cascaded framework: RL, meta-actions, LLM, safe trajectories, controller, execution. | ||
| [26] | Displaced left-turn control | AC optimizer | Dual-agent: LLM Actor (decisions), Critic (RAG/memory) | GPTTC: LLM-based AC with RAG for adaptive control, reducing delay/stops. | ||
| LLM-centric Decision Agents | [25] | Traffic signal control | Core decision agent | State → LLM (LightGPT), direct action | LLMLight: Specialized LLM agent for direct, efficient, explainable control. | |
| [86] | Adaptive signal control | Knowledge-accumulating agent | Zero-shot CoT & GCA with interactive learning | Generalist LLM agent (GCA) learns interactively, generates adaptive phases. | ||
| Priors and Constraints for RL Exploration | Semantic Action or State Space Re-mapping | [87] | Traffic signal control (Sim-to-Real) | Dynamics interpreter & action transformer | State, text, LLM reasoning, action remapping | PromptGAT: Uses LLM to understand dynamics & bridge sim-to-real gap via prompt-driven action transformation. |
| Knowledge-guided Exploration and Initialization | [88] | Vehicle powertrain/energy | Prior knowledge coordinator | Initialization & exploration constraints | Reduces sample complexity, speeds convergence, improves energy efficiency. | |
| Coordination and Policy Search Guidance | [89] | Network traffic optimization | Analysis & prediction module | Data insights, RL action guidance | LLM-RL co-optimization: LLM predicts bottlenecks for RL signal/route guidance. | |
| [90] | Multi-intersection signal control | Multi-agent collaborator | LLM as reasoning component in MARL | LLM-enhanced MARL integrates Transformer, improves multi-agent collaboration. |
4.4.2. Research on Traffic Applications of LLM-Based Rule Induction and Constraint Reasoning
| Ref. | Application Scenario | LLM Role | Rule & Constraint Approach | Key Contribution |
|---|---|---|---|---|
| [91] | Urban delivery optimization | Implicit rule inducer | Behavior to language; learns human constraints + TSP | Shows LLMs learn complex rules to enhance classical optimization. |
| [5] | Urban traffic management | Task/rule coordinator | NL rule understanding; multi-model reasoning | TrafficGPT: LLM-foundation model collaboration for interactive decision. |
| [92] | AD rule formalization | Rule-to-logic translator | NL rules to CoT to MTL generation | TR2MTL: Auto-translates rules to formal MTL, enabling scalable verification. |
| [93] | Green wave control | Strategy generator/analyzer | NL-driven interactive strategy generation | Explores LLM for interactive green wave strategy design. |
| [94] | Intersection conflict mgmt. | Real-time conflict resolver | Zero-shot reasoning based on rules | Validates LLMs for real-time rule-based conflict prediction & resolution. |
| [12] | Traffic safety analysis | Domain-expert rule inducer | Structured narratives; fine-tuning; interpretability | CrashSage: LLM as explainable safety engine for causal rule induction. |
| [1] | General ITS | Unified reasoning core | Dual (physical + semantic) state-space theory | Theoretical framework unifying physical dynamics & semantic rules. |
| [21] | End-to-end AD | Behavior semanticizer | Intent-based control + NL explanation (VICS) | DriveLLM-V: Translates control to NL (VICS) to explain driving rules. |
| [95] | Signal control optimization | Feasible config generator | LLM generates constraint-satisfying phase timings | Validates off-the-shelf LLMs for auto-generating high-quality signal configs. |
4.4.3. Research on Traffic Applications of LLM-Based Uncertainty-Aware Planning
| Ref. | Scenario | Role of LLM | Uncertainty Modeling | Key Contribution |
|---|---|---|---|---|
| [96] | Urban signal control (complex scenarios) | Strategy generator & HMI coordinator | ACP strategy library; LLM generates novel strategies with human feedback | Proposes LLM-driven control paradigm with autonomous/feedback/manual modes for uncertainty. |
| [97] | AD scenario testing | Scenario & environment generator | Text, LLM, dynamic synthesis of virtual environments | Explores LLMs for generating diverse, uncertain driving scenarios for edge-case testing. |
| [98] | Trajectory prediction for AD & IoT | Semantic encoder & pattern extractor | LLM + spatiotemporal encoding + Normalizing Flows | First LLM integration into probabilistic trajectory prediction for multimodal uncertainty. |
| [99] | AD (high-risk & long-tail) | Risk reasoning & decision optimizer | Risk quantification + memory retrieval + reflective learning | SafeDrive: Modular system for context-aware safe decisions in uncertain, high-risk scenarios. |
| [42] | Traffic simulation & policy testing | Interactive planning agent | NL translation of policy goals; agent handles uncertainty | AgentSUMO: Agentic framework for interactive scenario generation & policy experimentation. |
| [100] | AD testing (pedestrian) | Pedestrian behavior generator | LLM configures diverse, context-sensitive behaviors via prompting | LLM-enhanced traffic editing to inject complex pedestrian behaviors for realistic AV testing. |
| [101] | AD scenario generation | Scenario augmentation agent | Language-guided, fine-grained scene augmentation | AGENTS-LLM: Generates OOD, challenging scenarios for planner evaluation. |
| [27] | Urban intersection control | Real-time traffic controller | LLM reasoning integrates data, resolves conflicts via CoT | Proposes LLMs as direct real-time controllers for dynamic traffic uncertainty. |
4.4.4. Summary and Analysis of Planning and Control Integration
4.5. Research on Traffic Applications at the Autonomous Agent
4.5.1. Research on Traffic Applications of Single-Task Autonomous Agents
| Classification | Ref. | Agent Task | Role of LLM | Key Technical Design | Main Contribution | Limitations |
|---|---|---|---|---|---|---|
| Zero-shot/Few-shot Reasoning | [102] | Accident monitoring | Info extraction agent | Social media data; multi-task learning | First LLM multi-task learning on accident tweets | Data quality dependent; not full system view |
| [103] | Cycling info support | Service agent | Geospatial data; prompting; orchestration | Reproducible method for personalized safety info | Needs real-world validation | |
| Task Reformulation & Instruction Tuning | [104] | Traffic safety consultation | Expert agent | LLaMA fine-tuning; safety standards alignment | First domain-specific LLM for safety; professional responses | Text-only; no real-time system integration |
| [15] | Real-time traffic monitoring | Analysis agent | GPT-4 + DB; auto SQL; CoT; multi-agent | NL to complex query mapping; lowers analysis barrier | No direct physical control | |
| [105] | Parking search | Behavior sim agent | Persona; uncertain decision contexts | Simulates risk preferences & utility trade-offs | Lacks real behavioral data validation | |
| Hybrid Enhancement & Modular Architecture | [7] | Traffic task scheduling | General agent | NL instruction parsing; tool invocation | Open-TI: end-to-end autonomous task execution | High complexity; toolchain dependent; costly |
| [43] | Daily route choice | Traveler agent | Memory; persona; retrieval; LLM reasoning | Human-like route switching with explanations | Less stable than equilibrium models; high compute | |
| [106] | Parking planning | Planning agent | Structured prompts; modular chains | Flexible planning tool for AV/HDV transition | Manual workflow; non-real-time | |
| [42] | Simulation & policy test | Simulation agent | Goal understanding; task planning; SUMO tools | NL-driven simulation setup; lowers barrier | SUMO dependent; scenario validity unverified | |
| Dedicated Architectures & Physical Control | [23] | Traveler behavior sim | Conceptual agent | Structured modules; activity-based alignment | LLM as rich agent in ABM; new demand modeling path | Conceptual; scalability/efficiency unproven |
| [27] | Intersection control | Control agent | CoT; fine-tuning; conflict resolution | LLM as real-time controller under uncertainty | Preliminary; needs safety/scalability validation | |
| [25] | Signal control | Core controller | Custom LightGPT; state, reasoning, action | Efficient, generalizable, interpretable control | Specialized model; safety in edge cases unclear |
4.5.2. Research on Traffic Applications of Multi-Agent Collaborative Systems
| Multi-Agent Type | Ref. | Role of LLM | Collaboration Mechanism Design | Main Contributions | Limitations |
|---|---|---|---|---|---|
| Interactive Vehicle/Entity Systems | [107] | Regional & global collaborative reasoning engine | Hybrid: Individual RL + LLM coordination + RAG | Enhances safety and human-likeness in multi-vehicle merging. | Complex architecture; high cost. |
| [90] | Multi-agent collaborative reasoning enhancer | LLM reasoning embedded within MAPPO framework. | Improves coordination efficiency of signal control. | Increases training complexity. | |
| Functionally Divided Dual-Agent Systems | [108] | Trajectory generator & constraint evaluator | Generator–discriminator dual-channel collaboration. | Improves trajectory safety and controllability. | Relies on fixed architecture; lacks adaptive learning. |
| [26] | Action optimizer (Actor) & policy evaluator (Critic) | Dual-agent Actor-Critic framework with LLMs. | Achieves adaptive control under near-saturated traffic. | Limited to single intersections; stability unverified. | |
| Hierarchical & Conceptual Systems | [14] | Core component & potential coordinator of future ITS. | Conceptual framework based on multimodal learning. | Provides visionary insights into LLM-centered ITS. | Purely conceptual; lacks empirical validation. |
| [101] | Scenario augmentation task agents. | Language-guided multi-agent collaboration. | Enables generation of challenging traffic scenarios. | Scenario physical validity depends on LLM. | |
| Collaborative Task-Handling Systems | [15] | Traffic management & analysis agent. | Multi-agent collaboration for complex query handling. | Improves task completion in traffic monitoring. | Internal collaboration, not true multi-agent cooperation. |
4.5.3. Research on Traffic Applications of Human–Machine Collaborative Agents
| Ref. | Scenario | Human Role | LLM’s Role | Collaboration Mechanism | Contributions | Limitations |
|---|---|---|---|---|---|---|
| [109] | Aviation communication training | Pilot (Trainee) | Professional language trainer | Keyword-driven scenario & dialogue generation | Enables low-cost communication training | Language-only; lacks control integration |
| [77] | Public transport services | Passenger, Dispatcher | Conversational agent & data interpreter | Natural language interaction with data query/feedback | Enhances information accessibility | Dependent on high-quality structured data |
| [110] | Travel recommendation | Traveler | Interactive recommender & explainer | Feedback-driven iterative recommendation | Facilitates serendipitous travel discovery | Limited to small-scale empirical tests |
| [34] | Human–machine co-driving | Driver | Empathetic cognitive partner | Multimodal emotion recognition & adaptive interaction | ethical–emotional governance framework | Conceptual; lacks quantitative validation |
4.5.4. Research on Traffic Applications of Ethically Aligned Social Agents
| Ref. | Scenario | Role of LLM | Ethical Modeling Approach | Contributions | Limitations |
|---|---|---|---|---|---|
| [35] | AD ethical dilemmas | Core ethical decision-maker | Choice experiments with implicit value modeling; Logit and decision tree interpretation | Empirically decoded LLM moral preferences | Offline scenarios only |
| [34] | Human–machine symbiotic driving | Ethical and emotional alignment core | Emotion computing + value alignment + governance | Unified ethics, emotion, and governance | Theoretical framework |
| [111] | Traffic policy analysis | Social value analyzer | Legislative text analysis with LLM + XAI | Extended ethical analysis to policy formation | Single-country data |
| [112] | AD | Social norm–aware decision enhancer | Social behavior modeling + LLM reasoning | Enabled context-aware norm compliance | Simulation-based validation |
4.5.5. Summary and Analysis of Autonomous Agent Integration
5. Fundamental Limitations and Failure Modes of LLM-Enabled Traffic Systems
5.1. Limitations and Failure Mode Analysis
5.1.1. Entropy Mismatch Between Discrete Semantic Spaces and Continuous Physical States
5.1.2. From Correlation Entropy to Causal Uncertainty: Limits of Probabilistic Inference
5.1.3. Generative Planning vs. Entropy-Constrained Execution
5.1.4. Individual Rationality, Social Entropy, and System-Level Instability
5.2. Boundaries of LLM Irreplaceability and Hybrid Intelligence Architectures
6. Research on Future Agendas for LLM-Based Traffic Applications
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ABM | Agent-Based Modeling |
| AC | Actor-Critic |
| ACP | Artificial societies, Computational experiments, Parallel execution |
| AD | Autonomous Driving |
| AV | Autonomous Vehicle |
| BERT | Bidirectional Encoder Representations from Transformers |
| CAV | Connected and Autonomous Vehicle |
| CNNs | Convolutional Neural Networks |
| CoT | Chain-of-Thought |
| DB | Database |
| DL | Deep Learning |
| EIVM | Externally Integrated Vision Modality Model |
| EV | Electric Vehicle |
| GAT | Graph Attention Network |
| GCA | Generally Capable Agent |
| GCN | Graph Convolutional Network |
| GNNs | Graph Neural Networks |
| GPS | Global Positioning System |
| GPT | Generative Pre-trained Transformer |
| HDV | Human-Driven Vehicle |
| HMI | Human–Machine Interaction |
| IoT | Internet of Things |
| KG | Knowledge Graph |
| LiDAR | Light Detection and Ranging |
| LLaMA | Large Language Model Meta AI |
| LLMs | Large Language Models |
| LoRA | Low-Rank Adaptation |
| MAE | Mean Absolute Error |
| MAPPO | Multi-Agent Proximal Policy Optimization |
| MARL | Multi-Agent Reinforcement Learning |
| MoE | Mixture-of-Experts |
| MPC | Model Predictive Control |
| MLM | Masked Language Model |
| MTD | Multimodal Traffic Dataset |
| MTL | Metric Temporal Logic |
| NL | Natural Language |
| NLP | Natural Language Processing |
| OD | Origin-Destination |
| OOD | Out-Of-Distribution |
| PCA | Principal Component Analysis |
| PID | Proportional-Integral-Derivative |
| QA | Question Answering |
| RAG | Retrieval-Augmented Generation |
| Rep. | Representation |
| RL | Reinforcement Learning |
| RMSE | Root Mean Square Error |
| RNNs | Recurrent Neural Networks |
| SQL | Structured Query Language |
| ST-LLM | Spatio-Temporal Large Language Model |
| SUMO | Simulation of Urban Mobility |
| TSP | Traveling Salesman Problem |
| V2X | Vehicle-to-Everything |
| VAEs | Variational Autoencoders |
| VICS | Vehicle Intention-Based Control Signals |
| VLMs | Vision-Language Models |
| VQA | Visual Question Answering |
| XAI | Explainable Artificial Intelligence |
References
- Liu, Z.; Zhou, Z.; Gu, Z.; Liu, S.; Liu, P.; Zhang, Y.; He, Y.; Zhang, K. TRIP: Transport reasoning with intelligence progression —A foundation framework. Transp. Res. Part C Emerg. Technol. 2025, 179, 105260. [Google Scholar] [CrossRef]
- Kuang, S.; Liu, Y.; Qu, X.; Wei, Y. Traffic-IT: Enhancing traffic scene understanding for multimodal large language models. Transp. Res. Part C Emerg. Technol. 2025, 180, 105325. [Google Scholar] [CrossRef]
- Le, D.; Yunusoglu, A.; Tiwari, K.; Isik, M.; Dikmen, I. Multimodal LLM for intelligent transportation systems. arXiv 2024, arXiv:2412.11683. [Google Scholar] [CrossRef]
- Jiang, J.; Li, Y.; Nie, J.; Li, J.; Wen, B.; Gadekallu, T.R. Integrating large language models with cross-modal data fusion for advanced intelligent transportation systems in sustainable cities development. Appl. Soft Comput. 2025, 177, 113278. [Google Scholar] [CrossRef]
- Zhang, S.; Fu, D.; Liang, W.; Zhang, Z.; Yu, B.; Cai, P.; Yao, B. TrafficGPT: Viewing, processing and interacting with traffic foundation models. Transp. Policy 2024, 150, 95–105. [Google Scholar] [CrossRef]
- Li, D.; Jiang, B.; Huang, L.; Beigi, A.; Zhao, C.; Tan, Z.; Bhattacharjee, A.; Jiang, Y.; Chen, C.; Wu, T.; et al. From Generation to Judgment: Opportunities and Challenges of LLM-as-a-Judge. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Suzhou, China, 4–9 November 2025. [Google Scholar]
- Nam, D.; Macvean, A.; Hellendoorn, V.; Vasilescu, B.; Myers, B. Using an LLM to Help with Code Understanding. In Proceedings of the IEEE/ACM 46th International Conference on Software Engineering, Lisbon, Portugal, 14–20 April 2024. [Google Scholar]
- Yao, Y.; Duan, J.; Xu, K.; Cai, Y.; Sun, Z.; Zhang, Y. A survey on large language model (LLM) security and privacy: The Good, The Bad, and The Ugly. High-Confid. Comput. 2024, 4, 100211. [Google Scholar] [CrossRef]
- Ge, Y.; Hua, W.; Mei, K.; Tan, J.; Xu, S.; Li, Z.; Zhang, Y. OpenAGI: When LLM meets domain experts. Adv. Neural Inf. Process. Syst. 2023, 36, 5539–5568. [Google Scholar]
- Wu, S.; Fei, H.; Qu, L.; Ji, W.; Chua, T.-S. Next-GPT: Any-To-Any Multimodal LLM. In Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria, 21–27 July 2024. [Google Scholar]
- Guo, X.; Zhang, Q.; Jiang, J.; Peng, M.; Yang, H.F.; Zhu, M. Towards Responsible and Reliable Traffic Flow Prediction with Large Language Models. Available online: https://ssrn.com/abstract=4805901 (accessed on 6 February 2026).
- Zhen, H.; Yang, J.J. CrashSage: A large language model-centered framework for contextual and interpretable traffic crash analysis. Artif. Intell. Transp. 2025, 3–4, 100030. [Google Scholar] [CrossRef]
- Alsahfi, T.; Qasim, K.U. TraffiCoT-R: A framework for advanced spatio-temporal reasoning in large language models. Alex. Eng. J. 2025, 128, 464–475. [Google Scholar] [CrossRef]
- Zheng, O.; Abdel-Aty, M.; Wang, D.; Wang, Z.; Ding, S. ChatGPT is on the horizon: Could a large language model be suitable for intelligent traffic safety research and applications? arXiv 2023, arXiv:2303.05382. [Google Scholar] [CrossRef]
- Wang, B.; Cai, Z.; Karim, M.M.; Liu, C.; Wang, Y. Traffic Performance GPT (TP-GPT): Real-Time Data Informed Intelligent Chatbot for Transportation Surveillance and Management. In Proceedings of the 2024 IEEE 27th International Conference on Intelligent Transportation Systems (ITSC), Edmonton, AB, Canada, 24–27 September 2024. [Google Scholar]
- Long, Q.; Liu, S.; Cao, N.; Ren, Z.; Ju, W.; Fang, C.; Zhu, Z.; Zhu, H.; Zhou, Y. A survey of large language models for traffic forecasting: Methods and applications. arXiv 2025. [Google Scholar] [CrossRef]
- Mahmud, D.; Hajmohamed, H.; Almentheri, S.; Alqaydi, S.; Aldhaheri, L.; Khalil, R.A.; Saeed, N. Integrating LLMs With ITS: Recent Advances, Potentials, Challenges, and Future Directions. IEEE Trans. Intell. Transp. Syst. 2025, 26, 5674–5709. [Google Scholar] [CrossRef]
- Zhang, D.; Zheng, H.; Yue, W.; Wang, X. Advancing Its Applications with LLMs: A Survey on Traffic Management, Transportation Safety, and Autonomous Driving. In International Joint Conference on Rough Sets; Springer: Berlin/Heidelberg, Germany, 2024. [Google Scholar]
- Liao, H.; Shen, H.; Li, Z.; Wang, C.; Li, G.; Bie, Y.; Xu, C. GPT-4 enhanced multimodal grounding for autonomous driving: Leveraging cross-modal attention with large language models. Commun. Transp. Res. 2024, 4, 100116. [Google Scholar] [CrossRef]
- Song, Y.; Hu, X. VisionEV: Multimodal large language models for spatially aware electric vehicle charging demand prediction using satellite imagery. Transp. Res. Part D Transp. Environ. 2025, 150, 105069. [Google Scholar] [CrossRef]
- Hou, J.; Wang, H.; Li, T.; Dong, E. An explainable end-to-end autonomous driving framework based on large language model and vision modality fusion: Design and application of DriveLLM-V. Transp. Res. Part C Emerg. Technol. 2025, 181, 105368. [Google Scholar] [CrossRef]
- Li, L.; Tan, R.; Fang, J.; Xue, J.; Lv, C. LLM-augmented hierarchical reinforcement learning for human-like decision-making of autonomous driving. Expert Syst. Appl. 2025, 294, 128736. [Google Scholar] [CrossRef]
- Liu, T.; Yang, J.; Yin, Y. Toward LLM-agent-based modeling of transportation systems: A conceptual framework. Artif. Intell. Transp. 2025, 1, 100001. [Google Scholar] [CrossRef]
- Wang, M.; Pang, A.; Kan, Y.; Pun, M.-O.; Chen, C.S.; Huang, B. LLM-assisted light: Leveraging large language model capabilities for human-mimetic traffic signal control in complex urban environments. arXiv 2024, arXiv:2403.08337. [Google Scholar]
- Lai, S.; Xu, Z.; Zhang, W.; Liu, H.; Xiong, H. LLMLight: Large language Models as Traffic Signal Control Agents. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V 1, Toronto, ON, Canada, 3–7 August 2025. [Google Scholar]
- Jovanović, A.; Uzelac, A.; Teodorović, D.; Kukić, K. Traffic signal control at displaced left-turn intersections using an LLM-based actor-critic algorithm. Expert Syst. Appl. 2025, 301, 130555. [Google Scholar] [CrossRef]
- Masri, S.; Ashqar, H.I.; Elhenawy, M. Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm. Vehicles 2025, 7, 11. [Google Scholar] [CrossRef]
- Peng, M.; Guo, X.; Chen, X.; Chen, K.; Zhu, M.; Chen, L.; Wang, F.-Y. LC-LLM: Explainable lane-change intention and trajectory predictions with Large Language Models. Commun. Transp. Res. 2025, 5, 100170. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 6000–6010. [Google Scholar]
- Li, Z.; Liu, F.; Yang, W.; Peng, S.; Zhou, J. A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6999–7019. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4–24. [Google Scholar] [CrossRef]
- Schuster, M.; Paliwal, K.K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 1997, 45, 2673–2681. [Google Scholar] [CrossRef]
- Yenduri, G.; Ramalingam, M.; Selvi, G.C.; Supriya, Y.; Srivastava, G.; Maddikunta, P.K.R.; Raj, G.D.; Jhaveri, R.H.; Prabadevi, B.; Wang, W.; et al. GPT (Generative Pre-Trained Transformer)—A Comprehensive Review on Enabling Technologies, Potential Applications, Emerging Challenges, and Future Directions. IEEE Access 2024, 12, 54608–54649. [Google Scholar] [CrossRef]
- Dong, Z.; Chen, C.; Liao, C.; Chen, X.M. Integrating large language models and affective computing for human-machine symbiosis in intelligent driving. Innovation 2025, 6, 101014. [Google Scholar] [CrossRef]
- Xu, Z.; Sengar, N.; Chen, T.; Chung, H.; Oviedo-Trespalacios, O. Where is morality on wheels? Decoding large language model (LLM)-driven decision in the ethical dilemmas of autonomous vehicles. Travel Behav. Soc. 2025, 40, 101039. [Google Scholar] [CrossRef]
- Yang, Y.; Zhang, W.; Lin, H.; Liu, Y.; Qu, X. Applying masked language model for transport mode choice behavior prediction. Transp. Res. Part A Policy Pract. 2024, 184, 104074. [Google Scholar] [CrossRef]
- Liu, Y.; Ott, M.; Goyal, N.; Du, J.; Joshi, M.; Chen, D.; Levy, O.; Lewis, M.; Zettlemoyer, L.; Stoyanov, V. Roberta: A robustly optimized bert pretraining approach. arXiv 2019, arXiv:1907.11692. [Google Scholar]
- Liu, S.; Shi, L.; Shi, Y.; Gao, Y.; Sun, X. Traffic scene perception via multimodal large language model with data augmentation and efficient training strategy. Appl. Soft Comput. 2025, 177, 113210. [Google Scholar] [CrossRef]
- Wang, P.; Wei, X.; Hu, F.; Han, W. TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation. In Proceedings of the 2024 International Conference on Computational Linguistics and Natural Language Processing (CLNLP), Yinchuan, China, 19–21 July 2024. [Google Scholar]
- Zhang, R.; Wang, B.; Zhang, J.; Bian, Z.; Feng, C.; Ozbay, K. When language and vision meet road safety: Leveraging multimodal large language models for video-based traffic accident analysis. Accid. Anal. Prev. 2025, 219, 108077. [Google Scholar] [CrossRef] [PubMed]
- de Zarzà, I.; de Curtò, J.; Roig, G.; Calafate, C.T. LLM multimodal traffic accident forecasting. Sensors 2023, 23, 9225. [Google Scholar] [CrossRef] [PubMed]
- Jeong, M.; Chang, J.; Yoon, Y. Speak to Simulate: An LLM-Guided Agentic Framework for Traffic Simulation in SUMO. In Proceedings of the GeoSIM ‘25: 8th ACM SIGSPATIAL International Workshop on Geospatial Simulation, Minneapolis, MN, USA, 3 November 2025. [Google Scholar]
- Wang, L.; Duan, P.; He, Z.; Lyu, C.; Chen, X.; Zheng, N.; Yao, L.; Ma, Z. Agentic Large Language Models for day-to-day route choices. Transp. Res. Part C Emerg. Technol. 2025, 180, 105307. [Google Scholar] [CrossRef]
- Arteaga, C.; Park, J. Deep Learning and Clustering-Based Analysis of Text Narratives for Identification of Traffic Crash Severity Contributors. Eng. Proc. 2023, 36, 31. [Google Scholar]
- Pullanikkat, R.; Poddar, S.; Das, A.; Jaiswal, T.; Singh, V.K.; Basu, M.; Ghosh, S. Utilizing the Twitter social media to identify transportation-related grievances in Indian cities. Soc. Netw. Anal. Min. 2024, 14, 118. [Google Scholar] [CrossRef]
- Li, Y.; Hu, C.; Ren, K.; Xi, H.; Fan, J. UASeg: Uncertainty aware weakly supervised semantic segmentation for autonomous driving. Expert Syst. Appl. 2025, 292, 128609. [Google Scholar] [CrossRef]
- Ma, Y.; Yao, Z.; Liu, X.; Xiong, Z.; He, X.; Wu, X. Efficient 3D object annotation via vision-derived pseudo-LiDAR and Vision Language Model (VLM) validation. Transp. Res. Part C Emerg. Technol. 2025, 182, 105429. [Google Scholar] [CrossRef]
- Yang, H.; Wu, R.; Xu, W. TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation. In Proceedings of the ACM MobiCom ‘24: 30th Annual International Conference on Mobile Computing and Networking, Washington, DC, USA, 18–22 November 2024. [Google Scholar]
- Liu, W.; Zhang, S.; Zhou, L.; Luo, N.; Xu, M. A semi-supervised mixture model of visual language multitask for vehicle recognition. Appl. Soft Comput. 2024, 159, 111619. [Google Scholar] [CrossRef]
- Huang, X. Enhancing traffic prediction with textual data using large language models. arXiv 2024, arXiv:2405.06719. [Google Scholar] [CrossRef]
- Ma, T.; Zhao, Y.; Li, M.; Chen, Y.; Lei, F.; Zhao, Y.; Alsabaan, M. TPLLM: A traffic prediction framework based on pretrained Large Language Models. Appl. Soft Comput. 2025, 184, 113840. [Google Scholar] [CrossRef]
- Xu, Y.; Liu, M. GPT4TFP: Spatio-temporal fusion large language model for traffic flow prediction. Neurocomputing 2025, 625, 129562. [Google Scholar] [CrossRef]
- Ma, C.; Zhao, M. Urban rail transit passenger flow prediction using large language model under multi-source spatiotemporal data fusion. Phys. A Stat. Mech. Its Appl. 2025, 675, 130823. [Google Scholar] [CrossRef]
- Lee, X.; Gu, Y. Step-LLM: Spatial-Temporal-Enriched Prompting with Large Language Models for Traffic Prediction. In International Conference of Electrical, Electronic and Networked Energy Systems; Springer: Berlin/Heidelberg, Germany, 2024. [Google Scholar]
- Tabassuma, S.; Ahmeda, S.; Nowera, N. GAT-LLM: A Graph Attention-Based Framework with Large Language Models for Spatio-Temporal Traffic Forecasting. In Proceedings of the ECAI 2025 Workshop ReLiT on Reasoning and Learning for Intelligent Transport, Bologna, Italy, 25 October 2025. [Google Scholar]
- Rong, Y.; Mao, Y.; He, X.; Chen, M. Large-Scale Traffic Flow Forecast with Lightweight LLM in Edge Intelligence. IEEE Internet Things Mag. 2024, 8, 12–18. [Google Scholar] [CrossRef]
- Cheng, H.; Gong, Z.; Wang, C. LLM-TFP: Integrating large language models with spatio-temporal features for urban traffic flow prediction. Appl. Soft Comput. 2025, 177, 113174. [Google Scholar] [CrossRef]
- Liu, C.; Yang, S.; Xu, Q.; Li, Z.; Long, C.; Li, Z.; Zhao, R. Spatial-Temporal Large Language Model for Traffic Prediction. In Proceedings of the 2024 25th IEEE International Conference on Mobile Data Management (MDM), Brussels, Belgium, 24–27 July 2024. [Google Scholar]
- Shao, Z.; Xi, H.; Lu, H.; Wang, Z.; Bell, M.G.; Gao, J. A spatial–Temporal Large Language Model with Denoising Diffusion Implicit for predictions in centralized multimodal transport systems. Transp. Res. Part C Emerg. Technol. 2025, 179, 105249. [Google Scholar] [CrossRef]
- Moghadas, S.M.; Cornelis, B.; Alahi, A.; Munteanu, A. Strada-LLM: Graph LLM for traffic prediction. arXiv 2024, arXiv:2410.20856. [Google Scholar] [CrossRef]
- Shang, Y.; Shang, W.-L.; Cui, D.; Liu, P.; Chen, H.; Zhang, D.; Zhang, R.; Xu, C.; Liu, Y.; Wang, C.; et al. Spatio-temporal data fusion framework based on large language model for enhanced prediction of electric vehicle charging demand in smart grid management. Inf. Fusion 2025, 126, 103692. [Google Scholar] [CrossRef]
- Hou, Y.; Shao, Y.; Han, Z.; Ye, Z. Construction and Application of Traffic Accident Knowledge Graph Based on LLM. In Proceedings of the 2024 International Conference on Smart Transportation Interdisciplinary Studies, Nanjing, China, 14–15 December 2024. [Google Scholar]
- Qi, Y.; Mai, G.; Zhu, R.; Zhang, M. EVKG: An interlinked and interoperable electric vehicle knowledge graph for smart transportation system. Trans. GIS 2023, 27, 949–974. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, M.; Tang, J.; Ma, J.; Duan, X.; Sun, J.; Hu, X.; Xu, S. Analysis of Traffic Accident Based on Knowledge Graph. J. Adv. Transp. 2022, 2022, 3915467. [Google Scholar] [CrossRef]
- Gan, L.; Ye, B.; Huang, Z.; Xu, Y.; Chen, Q.; Shu, Y. Knowledge graph construction based on ship collision accident reports to improve maritime traffic safety. Ocean Coast. Manag. 2023, 240, 106660. [Google Scholar] [CrossRef]
- Nie, T.; He, J.; Mei, Y.; Qin, G.; Li, G.; Sun, J.; Ma, W. Joint estimation and prediction of city-wide delivery demand: A large language model empowered graph-based learning approach. Transp. Res. Part E Logist. Transp. Rev. 2025, 197, 104075. [Google Scholar] [CrossRef]
- Hussien, M.M.; Melo, A.N.; Ballardini, A.L.; Maldonado, C.S.; Izquierdo, R.; Sotelo, M.Á. RAG-based explainable prediction of road users behaviors for automated driving using knowledge graphs and large language models. Expert Syst. Appl. 2024, 265, 125914. [Google Scholar] [CrossRef]
- Li, Y.; Zhan, Y.; Liang, M.; Zhang, Y.; Liang, J. UPTM-LLM: Large language models-powered urban pedestrian travel modes recognition for intelligent transportation system. Appl. Soft Comput. 2025, 186, 113999. [Google Scholar] [CrossRef]
- Liu, Z.; Li, L.; Wang, Y.; Lin, H.; Cheng, H.; Liu, Z.; He, L.; Wang, J. Controllable traffic simulation through LLM-guided hierarchical chain-of-thought reasoning. arXiv 2024, arXiv:2409.15135. [Google Scholar]
- Mo, B.; Xu, H.; Zhuang, D.; Ma, R.; Guo, X.; Zhao, J. Large language models for travel behavior prediction. arXiv 2023, arXiv:2312.00819. [Google Scholar] [CrossRef]
- Beneduce, C.; Lepri, B.; Luca, M. Large Language Models are Zero-Shot Next Location Predictors. IEEE Access 2025, 13, 77456–77467. [Google Scholar] [CrossRef]
- Qu, H.; Li, H.; You, L.; Zhu, R.; Yan, J.; Santi, P.; Ratti, C.; Yuen, C. ChatEV: Predicting electric vehicle charging demand as natural language processing. Transp. Res. Part D Transp. Environ. 2024, 136, 104470. [Google Scholar] [CrossRef]
- Guo, X.; Zhang, Q.; Jiang, J.; Peng, M.; Zhu, M.; Yang, H.F. Towards explainable traffic flow prediction with large language models. Commun. Transp. Res. 2024, 4, 100150. [Google Scholar] [CrossRef]
- Wu, F.; Zheng, C.; Du, M.; Zheng, S.; Ma, J.; Lu, Y. LLM4PT: A large language model-based system for flexible and explainable public transit demand prediction. Comput. Ind. Eng. 2025, 209, 111449. [Google Scholar] [CrossRef]
- Hu, Y.; Wang, F.; Ye, D.; Wu, M.; Kang, J.; Yu, R. LLM-Based Misbehavior Detection Architecture for Enhanced Traffic Safety in Connected Autonomous Vehicles. IEEE Trans. Veh. Technol. 2025, 74, 12829–12841. [Google Scholar] [CrossRef]
- Lan, Z.; Liu, L.; Fan, B.; Lv, Y.; Ren, Y.; Cui, Z. Traj-LLM: A New Exploration for Empowering Trajectory Prediction With Pre-Trained Large Language Models. IEEE Trans. Intell. Veh. 2024, 10, 794–807. [Google Scholar] [CrossRef]
- Wang, J.; Shalaby, A. TransitTalk: Large language model-based digital assistants for enhancing transit customer experience and staff performance. J. Intell. Transp. Syst. 2025, 9, 1–18. [Google Scholar] [CrossRef]
- Peng, C.; Peng, J.; Wang, Z.; Wang, Z.; Chen, J.; Xuan, J.; Shi, T. Adaptive fault diagnosis of railway vehicle on-board controller with large language models. Appl. Soft Comput. 2025, 185, 113919. [Google Scholar] [CrossRef]
- Ansarinejad, M.; Gaweesh, S.M.; Ahmed, M.M. Assessing the efficacy of pre-trained large language models in analyzing autonomous vehicle field test disengagements. Accid. Anal. Prev. 2025, 220, 108178. [Google Scholar] [CrossRef]
- Zhong, W.; Huang, J.; Wu, M.; Luo, W.; Yu, R. Large language model based system with causal inference and Chain-of-Thoughts reasoning for traffic scene risk assessment. Knowl. Based Syst. 2025, 319, 113630. [Google Scholar] [CrossRef]
- Huang, Z.; Sheng, Z.; Qu, Y.; You, J.; Chen, S. VLM-RL: A unified vision language models and reinforcement learning framework for safe autonomous driving. Transp. Res. Part C Emerg. Technol. 2025, 180, 105321. [Google Scholar] [CrossRef]
- Yu, J.; Wang, Y.; Ma, W. Large language model-enhanced reinforcement learning for generic bus holding control strategies. Transp. Res. Part E Logist. Transp. Rev. 2025, 200, 104142. [Google Scholar] [CrossRef]
- Yao, J.; Li, J.; Xu, X.; Tan, C.; Yap, K.H.; Su, R. Incorporating vision-based artificial intelligence and large language model for smart traffic light control. Appl. Soft Comput. 2025, 179, 113333. [Google Scholar] [CrossRef]
- Yin, Q.; Xiong, Z.; Liu, Y. LLM-Enhanced Reinforcement Learning for Traffic Control at On-Ramp Merging Areas. In International Conference on Artificial Intelligence and Autonomous Transportation; Springer: Berlin/Heidelberg, Germany, 2024. [Google Scholar]
- Yildirim, M.; Dagda, B.; Asodia, V.; Fallah, S. HighwayLLM: Decision-making and navigation in highway driving with RL-informed language model. Robot. Auton. Syst. 2025, 193, 105114. [Google Scholar] [CrossRef]
- Movahedi, M.; Choi, J. The Crossroads of LLM and Traffic Control: A Study on Large Language Models in Adaptive Traffic Signal Control. IEEE Trans. Intell. Transp. Syst. 2024, 26, 1701–1716. [Google Scholar] [CrossRef]
- Da, L.; Gao, M.; Mei, H.; Wei, H. Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning. Proc. AAAI Conf. Artif. Intell. 2024, 38, 82–90. [Google Scholar] [CrossRef]
- Zhang, H.; Dong, J.; Lei, N.; Qin, Y.; Li, B.; Chen, C.; Chen, B. Optimal vehicle dynamics and powertrain control of carbon-free autonomous vehicles: Large language model assisted heterogeneous-agent learning. Energy 2025, 338, 138786. [Google Scholar] [CrossRef]
- Singh, A.R.; Ashraf, M.W.A.; Rathore, R.S.; Li, B.; Sujatha, M. Real-time traffic flow optimization using large language models and reinforcement learning for smart urban mobility. Appl. Soft Comput. 2025, 185, 113917. [Google Scholar] [CrossRef]
- Chen, X.; Meng, W. LLM-Enhanced MARL for Smarter Traffic Control. In Proceedings of the 2025 IEEE 19th International Conference on Control & Automation (ICCA), Tallinn, Estonia, 30 June–3 July 2025. [Google Scholar]
- Liu, Y.; Wu, F.; Liu, Z.; Wang, K.; Wang, F.; Qu, X. Can language models be used for real-world urban-delivery route optimization? Innovation 2023, 4, 100520. [Google Scholar] [CrossRef]
- Manas, K.; Zwicklbauer, S.; Paschke, A. TR2MTL: LLM based framework for Metric Temporal Logic Formalization of Traffic Rules. In Proceedings of the 2024 IEEE Intelligent Vehicle Symposium (IV), Jeju Island, Republic of Korea, 2–5 June 2024. [Google Scholar]
- Tang, Y.; Dai, X.; Lv, Y. Large Language Model-Assisted Arterial Traffic Signal Control. IEEE J. Radio Freq. Identif. 2024, 8, 322–326. [Google Scholar] [CrossRef]
- Masri, S.; Ashqar, H.I.; Elhenawy, M. Leveraging large language models (LLMs) for traffic management at urban intersections: The case of mixed traffic scenarios. arXiv 2024, arXiv:2408.00948. [Google Scholar] [CrossRef]
- Percassi, F.; Vallati, M. Automated Planning for Urban Traffic Control with LLM-Generated Configurations. In Proceedings of the International Florida Artificial Intelligence Research Society Conference, FLAIRS, Daytona Beach, FL, USA, 20–23 May 2025. [Google Scholar]
- Tang, Y.; Dai, X.; Zhao, C.; Cheng, Q.; Lv, Y. Large Language Model-Driven Urban Traffic Signal Control. In Proceedings of the 2024 Australian & New Zealand Control Conference (ANZCC), Gold Coast, Australia, 1–2 February 2024. [Google Scholar]
- Adekanye, O.A.M. LLM-Powered Synthetic Environments for Self-Driving Scenarios. Proc. AAAI Conf. Artif. Intell. 2024, 38, 23721–23723. [Google Scholar] [CrossRef]
- Wang, X.; Xu, L.; Zhou, L.; Liu, Y.; Xiong, N.; Li, K.-C. Large language model-driven probabilistic trajectory prediction in the Internet of Things using spatio-temporal encoding and normalizing flows. Digit. Commun. Netw. 2025. [Google Scholar] [CrossRef]
- Zhou, Z.; Huang, H.; Li, B.; Zhao, S.; Mu, Y.; Wang, J. SafeDrive: Knowledge- and data-driven risk-sensitive decision-making for autonomous vehicles with Large Language Models. Accid. Anal. Prev. 2025, 224, 108299. [Google Scholar] [CrossRef] [PubMed]
- Aiersilan, A.; Liu, M. LLM-enhanced traffic editor for accelerated testing of autonomous vehicles under various pedestrian behaviors. In Proceedings of the International Conference on Smart Transportation and City Engineering (STCE 2024), Chongqing, China, 6–8 December 2024. [Google Scholar]
- Yao, Y.; Bhatnagar, S.; Mazzola, M.; Belagiannis, V.; Gilitschenski, I.; Palmieri, L.; Razniewski, S.; Hallgarten, M. AGENTS-LLM: Augmentative Generation of Challenging Traffic Scenarios with an Agentic LLM Framework. arXiv 2025, arXiv:2507.13729. [Google Scholar] [CrossRef]
- Jaradat, S.; Nayak, R.; Paz, A.; Ashqar, H.I.; Elhenawy, M. Multitask Learning for Crash Analysis: A Fine-Tuned LLM Framework Using Twitter Data. Smart Cities 2024, 7, 2422–2465. [Google Scholar] [CrossRef]
- Costa, D.G.; Silva, I.; Medeiros, M.; Bittencourt, J.C.N.; Andrade, M. A method to promote safe cycling powered by large language models and AI agents. MethodsX 2024, 13, 102880. [Google Scholar] [CrossRef] [PubMed]
- Zheng, O.; Abdel-Aty, M.; Wang, D.; Wang, C.; Ding, S. TrafficSafetyGPT: Tuning a pre-trained large language model to a domain-specific expert in transportation safety. arXiv 2023, arXiv:2307.15311. [Google Scholar]
- Fulman, N.; Memduhoğlu, A.; Zipf, A. Utilizing large language models to simulate parking search. Transp. Res. Part A Policy Pract. 2025, 199, 104542. [Google Scholar] [CrossRef]
- Jin, Y.; Ma, J. Large language model as parking planning agent in the context of mixed period of autonomous vehicles and Human-Driven vehicles. Sustain. Cities Soc. 2024, 117, 105940. [Google Scholar] [CrossRef]
- Zhang, M.; Fang, Z.; Wang, T.; Lu, S.; Wang, X.; Shi, T. CCMA: A framework for cascading cooperative multi-agent in autonomous driving merging using Large Language Models. Expert Syst. Appl. 2025, 282, 127717. [Google Scholar] [CrossRef]
- Zhang, P.; Lin, K.; Li, D.; Fu, Z.; Cai, Y.; Li, B.; Yu, H.; Li, M. DAPlanner: Dual-agent framework with multi-modal large language model for autonomous driving motion planning. Appl. Soft Comput. 2025, 183, 113625. [Google Scholar] [CrossRef]
- Liu, X.; Zou, B.; Aw, A. An NLP-Focused Pilot Training Agent for Safe and Efficient Aviation Communication. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, 17–19 June 2024; Volume 6. [Google Scholar]
- Afridi, A.H.; Yasar, A.U.H.; Sheltami, T.R. Serendipity and LLM-based Recommender System for Smart Transportation. Transp. Res. Procedia 2025, 84, 251–258. [Google Scholar] [CrossRef]
- Yun, H.; Lee, E.H. Party politics in transport policy with a large language model. Transp. Policy 2025, 171, 487–496. [Google Scholar] [CrossRef]
- Lamichhane, B.R.; Aueawatthanaphisut, A.; Srijuntongsiri, G.; Horanont, T. Context-aware decision making in autonomous vehicles: Integrating social behavior modeling with large language models. Array 2025, 27, 100420. [Google Scholar] [CrossRef]














| Ref. | Traffic Task | LLM’s Core Role | Text Processing & Output | Key Contribution |
|---|---|---|---|---|
| [44] | Crash severity analysis | Embedding generator & latent pattern discoverer | Process: Crash narratives, deep semantic representations. Output: Risk clusters & high-frequency themes. | Automatically identifies latent crash factors from text without predefined variables. |
| [45] | Grievance identification | Representation learner & classifier | Process: Social media text, Transformer embeddings. Output: Multi-label grievance vectors & categories. | Converts public text into structured semantics for governance, no handcrafted rules needed. |
| [36] | Travel mode prediction | Personalized feature extractor | Process: Travel records (as NL), MLM embeddings. Output: Semantic vectors of personal preferences & context. | Models travel via semantic abstraction, capturing individual preferences. |
| Integration Layer | Core Entropy Mismatch | Failure Mode | Practical Consequence |
|---|---|---|---|
| Representation | Semantic entropy vs. physical entropy | Lossy physical information compression | Reduced precision and safety margin |
| Reasoning & Prediction | Correlation entropy vs. causal uncertainty | Poor intervention generalization | Fragile predictions under policy change |
| Planning & Control | Generative entropy vs. executable certainty | Constraint violations, unverifiable plans | Safety and real-time risks |
| Autonomous Agents | Individual entropy minimization vs. social entropy | Coordination failure, instability | System-level inefficiency |
| System Layer | Entropy/Uncertainty Source | Core Research Question (Entropy View) | LLM Role | Required Methodological Advances |
|---|---|---|---|---|
| Representation Integration | High-dimensional multimodal noise; semantic ambiguity | How can semantic entropy be minimized without discarding physically relevant information? | Semantic compression and mutual information preservation | Information bottleneck methods; entropy-regularized representation learning; uncertainty-aware multimodal fusion |
| Reasoning & Prediction | Stochastic demand, human behavior, network interactions | How can probabilistic inference distinguish correlation entropy from causal uncertainty? | Probabilistic reasoning and hypothesis generation | Causal entropy modeling; Bayesian LLM hybrids; intervention-aware uncertainty estimation |
| Planning & Control | Execution uncertainty; physical constraints; safety risks | How can generative uncertainty be constrained to meet low-entropy execution requirements? | High-level plan proposal under uncertainty | Entropy-constrained planning; formal verification interfaces; probabilistic-to-deterministic projection mechanisms |
| Autonomous Agents | Strategic uncertainty; multi-agent interaction entropy | How does individual uncertainty aggregation affect system-level entropy and stability? | Strategy modeling and negotiation | Game-theoretic entropy analysis; equilibrium uncertainty modeling; social welfare–entropy tradeoff mechanisms |
| Human–Machine Interaction | Cognitive uncertainty; trust and interpretability gaps | How can uncertainty be communicated and calibrated between humans and LLM agents? | Decision explanation and risk communication | Uncertainty calibration; information-theoretic interpretability; entropy-aware human-in-the-loop design |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tu, W.; Li, J.; Xiao, F.; Wang, X.; Lu, Y. Integrating Large Language Models into Traffic Systems: Integration Levels, Capability Boundaries, and an Information-Theoretic Perspective. Entropy 2026, 28, 211. https://doi.org/10.3390/e28020211
Tu W, Li J, Xiao F, Wang X, Lu Y. Integrating Large Language Models into Traffic Systems: Integration Levels, Capability Boundaries, and an Information-Theoretic Perspective. Entropy. 2026; 28(2):211. https://doi.org/10.3390/e28020211
Chicago/Turabian StyleTu, Wenwen, Junfan Li, Feng Xiao, Xiaosa Wang, and Yong Lu. 2026. "Integrating Large Language Models into Traffic Systems: Integration Levels, Capability Boundaries, and an Information-Theoretic Perspective" Entropy 28, no. 2: 211. https://doi.org/10.3390/e28020211
APA StyleTu, W., Li, J., Xiao, F., Wang, X., & Lu, Y. (2026). Integrating Large Language Models into Traffic Systems: Integration Levels, Capability Boundaries, and an Information-Theoretic Perspective. Entropy, 28(2), 211. https://doi.org/10.3390/e28020211
