A Proposal-Aware Proactive Encoding Framework for Trajectory Prediction in Autonomous Driving
Abstract
1. Introduction
2. Related Work
2.1. Trajectory Prediction
2.2. Attention Mechanism in Trajectory Prediction
3. Method
3.1. Spatio-Temporal Focal Attention
3.2. Potential Proposals Generator
3.2.1. Scene Encoder
3.2.2. Potential Proposals Decoder
3.3. Proposal-Aware Trajectory Refiner
3.3.1. Proposal-Aware Scene Encoder
3.3.2. Proposal-Aware Trajectory Decoder
3.4. Training Objective
3.4.1. Regression Loss
3.4.2. Classification Loss
4. Experiments
4.1. Experimental Setup
4.1.1. Dataset
4.1.2. Metrics
4.1.3. Implement Details
4.2. Main Result
4.2.1. Argoverse 1
4.2.2. INTERACTION
4.3. Ablation Study
4.3.1. Component Study
4.3.2. Proposal-Aware Encoder
4.3.3. Spatio-Temporal Focal Attention
4.3.4. Spatio-Temporal Focal Attention in Encoder
4.3.5. Spatio-Temporal Focal Attention in Decoder
4.3.6. Analysis of Proposal-Relevant Attention
4.3.7. Analysis of Robustness to Proposal Errors
4.3.8. The Number of Generated Proposals
4.3.9. Visualization
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| Indices and sets | |
| Symbol | Definition |
| Index for query and key tokens | |
| k | Index for attention module type or trajectory mode |
| Best-matching trajectory mode selected by proposal error | |
| r | Prediction branch indicator, where |
| t | Index of time steps |
| c | Index of historical trajectory coordinates |
| S | Number of decoder layers |
| T | Historical observation horizon |
| F | Future prediction horizon |
| K | Number of predicted trajectory modes |
| d | Feature embedding dimension |
| Temporal, agent, and mode attention dimensions |
| Parameters | |
| Symbol | Definition |
| X | Input feature for generating query representation |
| Y | Input feature for generating key and value representations |
| Q | Query representation in attention calculation |
| K | Key representation in attention calculation |
| V | Value representation in attention calculation |
| Learnable projection matrix for query representation | |
| Learnable projection matrix for key representation | |
| Learnable projection matrix for value representation | |
| Spatio-temporal relation controlled attention modulation parameter | |
| Unbounded modulation value before clipping | |
| Lower bound of the parameter | |
| Upper bound of the parameter | |
| Spatio-temporal relationship between key token and query token | |
| Encoded relative spatio-temporal relation vector | |
| Relative position between query and key tokens | |
| Relative orientation between query and key tokens | |
| Relative heading angle between query and key tokens | |
| Temporal displacement between query and key tokens | |
| Lane segment position attribute | |
| Lane segment heading orientation attribute | |
| Lane segment physical length attribute | |
| Lane segment feature representation | |
| Final lane representation in the scene encoder | |
| Displacement vector of historical trajectory at time step t | |
| Concatenated historical displacement representation | |
| Historical trajectory embedding | |
| Proposal embedding in the potential proposal decoder | |
| History-aware proposal representation | |
| Map-aware proposal representation | |
| Fused proposal representation | |
| U | Output of the mode attention module in the proposal generator |
| Point of potential proposal trajectory at time step t | |
| Relative displacement vector of proposal trajectory at time step t | |
| Concatenated proposal displacement representation | |
| Potential proposal embedding | |
| Structure-aware lane embedding | |
| Proposal-aware map representation | |
| History trajectory embedding in the proposal-aware encoder | |
| Proposal-aware history trajectory representation | |
| Query proposal embedding in the proposal-aware decoder | |
| Agent-history interaction representation in the refiner | |
| Map interaction representation in the refiner | |
| Fused representation in the proposal-aware refinement module | |
| Z | Refined proposal representation before prediction |
| Predicted trajectory from branch r | |
| Ground-truth trajectory | |
| Predicted confidence distribution from branch r | |
| s | One-hot mode label for the ground truth |
| Total training objective for branch r | |
| Regression loss for branch r | |
| Classification loss for branch r | |
| 2D Smooth L1 loss for trajectory regression |
References
- Gao, J.; Sun, C.; Zhao, H.; Shen, Y.; Anguelov, D.; Li, C.; Schmid, C. VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 11522–11530. [Google Scholar] [CrossRef]
- Liang, M.; Yang, B.; Hu, R.; Chen, Y.; Liao, R.; Feng, S.; Urtasun, R. Learning Lane Graph Representations for Motion Forecasting. In Proceedings of the Computer Vision–ECCV 2020; Springer International Publishing: Berlin/Heidelberg, Germany, 2020; pp. 541–556. [Google Scholar] [CrossRef]
- Zeng, W.; Liang, M.; Liao, R.; Urtasun, R. LaneRCNN: Distributed Representations for Graph-Centric Motion Forecasting. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2021; pp. 532–539. [Google Scholar] [CrossRef]
- Salzmann, T.; Ivanovic, B.; Chakravarty, P.; Pavone, M. Trajectron++: Dynamically-Feasible Trajectory Forecasting with Heterogeneous Data. In Proceedings of the Computer Vision–ECCV 2020; Springer International Publishing: Berlin/Heidelberg, Germany, 2020; pp. 683–700. [Google Scholar] [CrossRef]
- Zhou, Z.; Ye, L.; Wang, J.; Wu, K.; Lu, K. HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2022; pp. 8813–8823. [Google Scholar] [CrossRef]
- Ngiam, J.; Caine, B.; Vasudevan, V.; Zhang, Z.; Chiang, H.T.L.; Ling, J.; Roelofs, R.; Bewley, A.; Liu, C.; Venugopal, A.; et al. Scene Transformer: A Unified Architecture for Predicting Future Trajectories of Multiple Agents. In Proceedings of the International Conference on Learning Representations (ICLR 2022), Virtual, 25–29 April 2022; Available online: https://openreview.net/forum?id=Wm3EA5OlHsG (accessed on 25 May 2026).
- Nayakanti, N.; Al-Rfou, R.; Zhou, A.; Goel, K.; Refaat, K.S.; Sapp, B. Wayformer: Motion Forecasting via Simple and Efficient Attention Networks. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2023; pp. 2980–2987. [Google Scholar] [CrossRef]
- Shi, S.; Jiang, L.; Dai, D.; Schiele, B. Motion Transformer with Global Intention Localization and Local Movement Refinement. In Proceedings of the Advances in Neural Information Processing Systems 35, New Orleans, LA, USA, 28 November–9 December 2022; Neural Information Processing Systems Foundation; pp. 6531–6543. [Google Scholar] [CrossRef]
- Cheng, J.; Mei, X.; Liu, M. Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2023; pp. 8645–8655. [Google Scholar] [CrossRef]
- Liu, H.; Liu, H.; Fan, B.; Xu, J. DMotion: Diverse Modalities Alignment Enhanced Motion Prediction for Autonomous Driving. IEEE Trans. Comput. Soc. Syst. 2025, 12, 3349–3364. [Google Scholar] [CrossRef]
- Afshar, S.; Deo, N.; Bhagat, A.; Chakraborty, T.; Shao, Y.; Buddharaju, B.R.; Deshpande, A.; Cui, H. PBP: Path-based Trajectory Prediction for Autonomous Driving. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2024; pp. 12927–12934. [Google Scholar] [CrossRef]
- Li, S.; Liu, C.; Xu, X.; Yeo, S.Y.; Yang, X. Future-Aware Interaction Network for Motion Forecasting. In Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2025; pp. 7505–7515. [Google Scholar] [CrossRef]
- Choi, D.; Yim, J.; Baek, M.; Lee, S. Machine Learning-Based Vehicle Trajectory Prediction Using V2V Communications and On-Board Sensors. Electronics 2021, 10, 420. [Google Scholar] [CrossRef]
- Sighencea, B.I.; Stanciu, I.R.; Căleanu, C.D. D-STGCN: Dynamic Pedestrian Trajectory Prediction Using Spatio-Temporal Graph Convolutional Networks. Electronics 2023, 12, 611. [Google Scholar] [CrossRef]
- Hortelano, J.L.; Trentin, V.; Artuñedo, A.; Villagra, J. GPU-Accelerated Interaction-Aware Motion Prediction. Electronics 2023, 12, 3751. [Google Scholar] [CrossRef]
- Yuan, Y.; Weng, X.; Ou, Y.; Kitani, K. AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 9793–9803. [Google Scholar] [CrossRef]
- Zhou, Z.; Wang, J.; Li, Y.H.; Huang, Y.K. Query-Centric Trajectory Prediction. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2023; pp. 17863–17873. [Google Scholar] [CrossRef]
- Da, C.; Qian, Y.; Zeng, J.; Wei, X.; Zhang, F. ADSAP: An Adaptive Speed-Aware Trajectory Prediction Framework with Adversarial Knowledge Transfer. Electronics 2025, 14, 2448. [Google Scholar] [CrossRef]
- Su, H.; Wang, N.; Wang, X. Collision Risk Assessment of Lane-Changing Vehicles Based on Spatio-Temporal Feature Fusion Trajectory Prediction. Electronics 2025, 14, 3388. [Google Scholar] [CrossRef]
- Li, A.; Xu, Z.; Pan, Y.; Zhang, J.; Chen, N.; Yu, H.; Chen, Y.; Li, Y. A Convolutional Transformer Network for Vehicle Trajectory Prediction in Urban Traffic Scenarios. J. Transp. Eng. Part A Syst. 2026, 152, 04025132. [Google Scholar] [CrossRef]
- Bhat, M.; Francis, J.; Oh, J. Trajformer: Trajectory Prediction with Local Self-Attentive Contexts for Autonomous Driving. In Proceedings of the Machine Learning for Autonomous Driving Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, BC, Canada; Neural Information Processing Systems Foundation, Inc.: San Diego, CA, USA, 2020; Available online: https://ml4ad.github.io/files/papers2020/Trajformer:%20Trajectory%20Prediction%20with%20Local%20Self-Attentive%20Contexts%20for%20Autonomous%20Driving.pdf (accessed on 25 May 2026).
- Liu, M.; Cheng, H.; Chen, L.; Broszio, H.; Li, J.; Zhao, R.; Sester, M.; Yang, M.Y. LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); IEEE: New York, NY, USA, 2024; pp. 2039–2049. [Google Scholar] [CrossRef]
- Chang, M.F.; Lambert, J.; Sangkloy, P.; Singh, J.; Bak, S.; Hartnett, A.; Wang, D.; Carr, P.; Lucey, S.; Ramanan, D.; et al. Argoverse: 3D Tracking and Forecasting With Rich Maps. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2019; pp. 8740–8749. [Google Scholar] [CrossRef]
- Zhan, W.; Sun, L.; Wang, D.; Shi, H.; Clausse, A.; Naumann, M.; Kümmerle, J.; Königshof, H.; Stiller, C.; de La Fortelle, A.; et al. INTERACTION Dataset: An International, Adversarial and Cooperative Motion Dataset in Interactive Driving Scenarios with Semantic Maps. arXiv 2019, arXiv:1910.03088. [Google Scholar] [CrossRef]
- Gu, J.; Sun, C.; Zhao, H. DenseTNT: End-to-End Trajectory Prediction from Dense Goal Sets. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 15283–15292. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, J.; Fang, L.; Jiang, Q.; Zhou, B. Multimodal Motion Prediction with Stacked Transformers. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2021; pp. 7573–7582. [Google Scholar] [CrossRef]
- Da, F.; Zhang, Y. Path-Aware Graph Attention for HD Maps in Motion Prediction. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2022; pp. 6430–6436. [Google Scholar] [CrossRef]
- Zhang, L.; Li, P.; Chen, J.; Shen, S. Trajectory Prediction with Graph-based Dual-scale Context Fusion. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); IEEE: New York, NY, USA, 2022; pp. 11374–11381. [Google Scholar] [CrossRef]
- Bhattacharyya, P.; Huang, C.; Czarnecki, K. SSL-Lanes: Self-Supervised Learning for Motion Forecasting in Autonomous Driving. In Proceedings of the 6th Conference on Robot Learning (CoRL), Auckland, New Zealand, 14–18 December 2022; PMLR: Cambridge, MA, USA, 2023; pp. 1793–1805. Available online: https://proceedings.mlr.press/v205/bhattacharyya23a.html (accessed on 25 May 2026).
- Park, D.; Ryu, H.; Yang, Y.; Cho, J.; Kim, J.; Yoon, K.J. Leveraging Future Relationship Reasoning for Vehicle Trajectory Prediction. In Proceedings of the International Conference on Learning Representations (ICLR 2023), Kigali, Rwanda, 1–5 May 2023; Available online: https://openreview.net/forum?id=CGBCTp2M6lA (accessed on 25 May 2026).
- Aydemir, G.; Akan, A.K.; Güney, F. ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2023; pp. 8261–8271. [Google Scholar] [CrossRef]
- Choi, S.; Kim, J.; Yun, J.; Choi, J.W. R-Pred: Two-Stage Motion Prediction Via Tube-Query Attention-Based Trajectory Refinement. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2023; pp. 8491–8501. [Google Scholar] [CrossRef]
- Zhang, L.; Li, P.; Liu, S.; Shen, S. SIMPL: A Simple and Efficient Multi-Agent Motion Prediction Baseline for Autonomous Driving. IEEE Robot. Autom. Lett. 2024, 9, 3767–3774. [Google Scholar] [CrossRef]
- Huang, Z.; Li, Y.; Li, D.; Mu, Y.; Qin, H.; Zheng, N. Post-interactive Multimodal Trajectory Prediction for Autonomous Driving. Transp. Res. Part C Emerg. Technol. 2025, 179, 105271. [Google Scholar] [CrossRef]
- Casas, S.; Gulino, C.; Suo, S.; Luo, K.; Liao, R.; Urtasun, R. Implicit Latent Variable Model for Scene-Consistent Motion Forecasting. In Proceedings of the Computer Vision–ECCV 2020; Springer International Publishing: Berlin/Heidelberg, Germany, 2020; pp. 624–641. [Google Scholar] [CrossRef]
- Zhao, H.; Gao, J.; Lan, T.; Sun, C.; Sapp, B.; Varadarajan, B.; Shen, Y.; Shen, Y.; Chai, Y.; Schmid, C.; et al. TNT: Target-driveN Trajectory Prediction. In Proceedings of the 2020 Conference on Robot Learning (CoRL 2020), Virtual, 16–18 November 2020; PMLR: Cambridge, MA, USA, 2021; pp. 895–904. Available online: https://proceedings.mlr.press/v155/zhao21b.html (accessed on 25 May 2026).
- Gilles, T.; Sabatini, S.; Tsishkou, D.; Stanciulescu, B.; Moutarde, F. THOMAS: Trajectory Heatmap Output with Learned Multi-Agent Sampling. In Proceedings of the International Conference on Learning Representations (ICLR 2022), Virtual, 25–29 April 2022; Available online: https://openreview.net/forum?id=QDdJhACYrlX (accessed on 25 May 2026).
- Rowe, L.; Ethier, M.; Dykhne, E.H.; Czarnecki, K. FJMP: Factorized Joint Multi-Agent Motion Prediction over Learned Directed Acyclic Interaction Graphs. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2023; pp. 13745–13755. [Google Scholar] [CrossRef]






| Dataset | Agents | Lanes | Preprocess | Candidate Generation | Batched Inference |
|---|---|---|---|---|---|
| Argoverse 1 | 22.71 | 176.23 | 1.36 ms | 10.24 ms | 16.41 ms |
| INTERACTION | 11.90 | 62.47 | 0.08 ms | 7.99 ms | 10.49 ms |
| Method | MR ↓ | minFDE ↓ | minADE ↓ | Stages | Input Format | Map Encoder | Output Modes |
|---|---|---|---|---|---|---|---|
| DenseTNT [25] (ICCV 2021) | 0.10 | 1.05 | 0.73 | 1 | Graph | GNN | 6 |
| mmTransformer [26] (CVPR 2021) | 0.11 | 1.15 | 0.71 | 1 | Sequence | MLP | 6 |
| PAGA [27] (ICRA 2022) | 0.09 | 1.02 | 0.69 | 1 | Graph | GNN | 6 |
| HiVT [5] (CVPR 2022) | 0.09 | 0.96 | 0.66 | 1 | Graph | GNN | 6 |
| DSP [28] (IROS 2022) | 0.09 | 0.98 | 0.69 | 1 | Graph | GNN | 6 |
| SSL-Lanes [29] (CoRL 2022) | 0.09 | 1.01 | 0.70 | 1 | Graph | GNN | 6 |
| FRM [30] (ICLR 2023) | – | 0.99 | 0.68 | 1 | Graph | GNN | 6 |
| ADAPT [31] (ICCV 2023) | 0.08 | 0.95 | 0.67 | 1 | Sequence | MLP | 6 |
| R-Pred [32] (ICCV 2023) | 0.09 | 0.95 | 0.66 | 2 | Sequence | MLP | 6 |
| PBP [11] (ICRA 2024) | 0.10 | 1.01 | – | 2 | Graph | GNN | 6 |
| SIMPL [33] (RAL 2024) | 0.08 | 0.95 | 0.66 | 1 | Sequence | MLP | 6 |
| LAformer [22] (CVPRW 2024) | – | 0.92 | 0.64 | 2 | Sequence | MLP | 6 |
| Pioformer [34] (Transp. Res. Part C 2025) | 0.09 | 0.95 | 0.66 | 3 | Graph | GNN | 6 |
| FINet [12] (ICCV 2025) | 0.09 | 0.95 | 0.59 | 2 | Sequence | Mamba | 6 |
| ProFocus (Ours) | 0.07 | 0.88 | 0.64 | 2 | Graph | GNN | 6 |
| Method | Params (M)↓ | Latency (ms) ↓ | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|---|---|
| HiVT-128 [5] | 2.56 | 45.60 | 0.09 | 0.96 | 0.66 |
| SIMPL [33] | 1.80 | 12.69 | 0.08 | 0.95 | 0.66 |
| ProFocus (Ours) | 4.53 | 28.01 | 0.07 | 0.88 | 0.64 |
| Method | MR ↓ | minFDE ↓ | minADE ↓ | Stages | Input Format | Map Encoder | Output Modes |
|---|---|---|---|---|---|---|---|
| ILVM [35] (ECCV 2020) | 0.1980 | 0.8400 | – | 1 | Graph | GNN | 6 |
| TNT [36] (PMLR 2021) | – | 0.6700 | 0.2100 | 3 | Graph | GNN | 6 |
| SceneTransformer [6] (ICLR 2022) | 0.1180 | 0.8400 | – | 1 | Sequence | Transformer | 6 |
| THOMAS [37] (ICLR 2022) | 0.1180 | 0.7600 | – | 2 | Raster | CNN | 6 |
| FJMP [38] (CVPR 2023) | 0.0810 | 0.6230 | 0.1930 | 2 | Graph | GNN | 6 |
| ProFocus (Ours) | 0.0659 | 0.5682 | 0.1801 | 2 | Graph | GNN | 6 |
| Baseline | Proposal-Aware Encoder | Spatio-Temperal Aware Focal Attention | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|---|---|
| ✓ | 0.0833 | 0.9994 | 0.7039 | ||
| ✓ | ✓ | 0.0820 | 0.9865 | 0.6966 | |
| ✓ | ✓ | 0.0829 | 0.9910 | 0.7031 | |
| ✓ | ✓ | ✓ | 0.0810 | 0.9742 | 0.6940 |
| Baseline | Proposal-Aware Map Encoder | Proposal-Aware History Encoder | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|---|---|
| ✓ | 0.0833 | 0.9994 | 0.7039 | ||
| ✓ | ✓ | 0.0815 | 0.9833 | 0.6966 | |
| ✓ | ✓ | 0.0821 | 0.9801 | 0.6979 | |
| ✓ | ✓ | ✓ | 0.0810 | 0.9742 | 0.6940 |
| Fusion Pattern | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|
| Baseline | 0.0833 | 0.9994 | 0.7039 |
| Concatenate | 0.0925 | 0.9930 | 0.7046 |
| Multi-head attention | 0.0815 | 0.9893 | 0.7032 |
| Temperature-based attention | 0.08399 | 0.9923 | 0.6994 |
| Spatio-temporal focal attention | 0.0811 | 0.9821 | 0.6945 |
| Encoder | MR ↓ | minFDE ↓ | minADE ↓ | |
|---|---|---|---|---|
| 0.9 | 1.2 | 0.0829 | 0.9867 | 0.6989 |
| 0.5 | 1.6 | 0.0812 | 0.9904 | 0.6965 |
| 0.1 | 2.0 | 0.0811 | 0.9821 | 0.6945 |
| Encoder | Decoder | MR↓ | minFDE ↓ | minADE ↓ | ||
|---|---|---|---|---|---|---|
| 0.1 | 2.0 | 0.1 | 2.0 | 0.0832 | 0.9756 | 0.6982 |
| 0.1 | 2.0 | 0.4 | 1.65 | 0.0823 | 0.9834 | 0.6976 |
| 0.1 | 2.0 | 0.7 | 1.3 | 0.0810 | 0.9742 | 0.6940 |
| Context Type | Method | SAR@5 m↑ | SAR@10 m ↑ |
|---|---|---|---|
| Agent | Baseline | 0.2053 | 0.3523 |
| Agent | ProFocus | 0.2143 | 0.3651 |
| Lane | Baseline | 0.6559 | 0.8087 |
| Lane | ProFocus | 0.7717 | 0.9057 |
| Intervention Strategy | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|
| No masking | 0.0810 | 0.9742 | 0.6940 |
| Random token masking | 0.0823 | 0.9822 | 0.6978 |
| Low-attention token masking | 0.0818 | 0.9741 | 0.6947 |
| High-attention token masking | 0.1327 | 1.2605 | 0.7998 |
| Method | Noise Std. | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|---|
| Baseline | – | 0.0833 | 0.9994 | 0.7039 |
| ProFocus | 0 | 0.0810 | 0.9742 | 0.6940 |
| ProFocus | 0.1 | 0.0820 | 0.9747 | 0.6948 |
| ProFocus | 0.3 | 0.0818 | 0.9776 | 0.6965 |
| ProFocus | 0.5 | 0.0825 | 0.9820 | 0.6989 |
| ProFocus | 1.0 | 0.0846 | 0.9954 | 0.7061 |
| ProFocus | 2.0 | 0.0875 | 1.0157 | 0.7194 |
| K | MR ↓ | minFDE ↓ | minADE ↓ |
|---|---|---|---|
| 6 | 0.0810 | 0.9742 | 0.6940 |
| 9 | 0.0828 | 0.9824 | 0.7004 |
| 12 | 0.0830 | 0.9845 | 0.7016 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, H.; Liu, X.; Liu, Z. A Proposal-Aware Proactive Encoding Framework for Trajectory Prediction in Autonomous Driving. Electronics 2026, 15, 2435. https://doi.org/10.3390/electronics15112435
Liu H, Liu X, Liu Z. A Proposal-Aware Proactive Encoding Framework for Trajectory Prediction in Autonomous Driving. Electronics. 2026; 15(11):2435. https://doi.org/10.3390/electronics15112435
Chicago/Turabian StyleLiu, Hongkun, Xuetao Liu, and Ziyi Liu. 2026. "A Proposal-Aware Proactive Encoding Framework for Trajectory Prediction in Autonomous Driving" Electronics 15, no. 11: 2435. https://doi.org/10.3390/electronics15112435
APA StyleLiu, H., Liu, X., & Liu, Z. (2026). A Proposal-Aware Proactive Encoding Framework for Trajectory Prediction in Autonomous Driving. Electronics, 15(11), 2435. https://doi.org/10.3390/electronics15112435

