Multisource Port Inspection Sensor Fusion with Causal Representation Learning for Cross-Border Anomaly Monitoring
Abstract
1. Introduction
- 1.
- We construct a comprehensive multisource framework that integrates document semantics, structured fields, and entity relationships for cross-border anomaly monitoring.
- 2.
- We propose a cross-modal alignment mechanism that dynamically fuses diverse signals, establishing explicit interactions among texts, logistical attributes, and enterprise chains.
- 3.
- We design an engineering-constraint-guided causal learning module that eliminates spurious environmental correlations and isolates true anomaly-driving representations.
- 4.
- We implement a counterfactual response strategy that intervenes on key variables, providing regulators with transparent and traceable evidence for each anomaly decision.
2. Related Work
2.1. Cross-Border Data Anomaly Monitoring and Risk Identification
2.2. Multisource Sensing Signal Fusion and Multimodal Deep Learning
2.3. Causal Inference and Explainable Anomaly Identification
3. Materials and Method
3.1. Data Preprocessing and Augmentation Strategy
3.2. Proposed Method
3.2.1. Overall Architecture
3.2.2. Multisource Sensing Representation and Cross-Modal Alignment Module
3.2.3. Engineering-Constraint-Guided Causal Risk Representation Debiasing Module
3.2.4. Counterfactual Anomaly Response Enhancement Module
4. Results and Discussion
4.1. Experimental Configuration
4.1.1. Hardware and Software Platform
4.1.2. Baseline Models and Evaluation Metrics
4.2. Performance Comparison with Baseline Models
4.3. Generalization and Robustness Evaluation
4.4. Ablation Study
4.5. Hyperparameter Sensitivity Analysis
4.6. Discussion
4.7. Limitations and Future Work
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Fabozzi, F.J.; Markowitz, H.M.; Gupta, F. Portfolio selection. Handb. Financ. 2008, 2, 3–13. [Google Scholar] [CrossRef]
- Sharpe, W.F. Capital asset prices: A theory of market equilibrium under conditions of risk. J. Financ. 1964, 19, 425–442. [Google Scholar] [CrossRef]
- Chen, A.; Wei, Y.; Le, H.; Zhang, Y. Learning by teaching with ChatGPT: The effect of teachable ChatGPT agent on programming education. Br. J. Educ. Technol. 2024, 57, 163–184. [Google Scholar]
- Jorion, P. Value at Risk: The New Benchmark for Managing Financial Risk; McGraw-Hill: New York, NY, USA, 1997; Volume 2. [Google Scholar]
- Rockafellar, R.T.; Uryasev, S. Optimization of conditional value-at-risk. J. Risk 2000, 2, 21–42. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
- Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A simple framework for contrastive learning of visual representations. In Proceedings of the International Conference on Machine Learning, Virtual, 13–18 July 2020; PMLR: New York, NY, USA, 2020; pp. 1597–1607. [Google Scholar]
- Grill, J.B.; Strub, F.; Altché, F.; Tallec, C.; Richemond, P.; Buchatskaya, E.; Doersch, C.; Avila Pires, B.; Guo, Z.; Gheshlaghi Azar, M.; et al. Bootstrap your own latent-a new approach to self-supervised learning. Adv. Neural Inf. Process. Syst. 2020, 33, 21271–21284. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998–6008. [Google Scholar]
- Dangsawang, B.; Nuchitprasitchai, S. A machine learning approach for detecting customs fraud through unstructured data analysis in social media. Decis. Anal. J. 2024, 10, 100408. [Google Scholar] [CrossRef]
- He, D. A multimodal deep neural network-based financial fraud detection model via collaborative awareness of semantic analysis and behavioral modeling. J. Circuits Syst. Comput. 2025, 34, 2550054. [Google Scholar]
- Kim, D. AI-Driven Cross-Modal Feature Alignment for Multimodal Fraud Detection in Financial Transaction Systems. J. Glob. Eng. Rev. 2026, 4, 21–37. [Google Scholar]
- Bello, H.O.; Ige, A.B.; Ameyaw, M.N. Deep learning in high-frequency trading: Conceptual challenges and solutions for real-time fraud detection. World J. Adv. Eng. Technol. Sci. 2024, 12, 35–46. [Google Scholar]
- Chowdhury, R.; Talukder, K.A. Assessing The Role of Statistical Modeling Techniques in Fraud Detection Across Procurement And International Trade Systems. Am. J. Interdiscip. Stud. 2022, 3, 91–125. [Google Scholar] [CrossRef]
- Ma, S.; Xie, W.; Li, D.; Li, H.; Li, Y. Reducing spurious correlation for federated domain generalization. arXiv 2024, arXiv:2407.19174. [Google Scholar]
- Will, I. Explainability vs. Performance Trade-offs: Governance Challenges in Deploying Black-Box Models for Generative Fraud Detection. 2024. Available online: https://www.researchgate.net/profile/Isreal-Will/publication/403447005_Explainability_vs_Performance_Trade-offs_Governance_Challenges_in_Deploying_Black-Box_Models_for_Generative_Fraud_Detection/links/69ce686b60c0371a60f56dc3/Explainability-vs-Performance-Trade-offs-Governance-Challenges-in-Deploying-Black-Box-Models-for-Generative-Fraud-Detection.pdf (accessed on 1 June 2024).
- Mishra, L.N.; Senapati, B. FraudFusion: A Multi-Modal Ensemble Framework for Anomaly-Based Fraud Detection in Retail POS Systems. IEEE Access 2026, 14, 10436–10456. [Google Scholar]
- Zitis, P.I.; Potirakis, S.M.; Alexandridis, A. Forecasting forex market volatility using deep learning models and complexity measures. J. Risk Financ. Manag. 2024, 17, 557. [Google Scholar] [CrossRef]
- Huang, W.R.; Perez, M.A. Accurate, data-efficient learning from noisy, choice-based labels for inherent risk scoring. arXiv 2018, arXiv:1811.10791. [Google Scholar]
- Gu, S.; Kelly, B.; Xiu, D. Empirical asset pricing via machine learning. Rev. Financ. Stud. 2020, 33, 2223–2273. [Google Scholar] [CrossRef]
- Chen, L.; Pelger, M.; Zhu, J. Deep learning in asset pricing. Manag. Sci. 2024, 70, 714–750. [Google Scholar] [CrossRef]
- Bucci, A. Realized volatility forecasting with neural networks. J. Financ. Econom. 2020, 18, 502–531. [Google Scholar] [CrossRef]
- Zhou, X.; Chen, S.; Ren, Y.; Zhang, Y.; Fu, J.; Fan, D.; Lin, J.; Wang, Q. Atrous Pyramid GAN Segmentation Network for Fish Images with High Performance. Electronics 2022, 11, 911. [Google Scholar] [CrossRef]
- Wang, C.; Chen, Y.; Zhang, S.; Zhang, Q. Stock market index prediction using deep Transformer model. Expert Syst. Appl. 2022, 208, 118128. [Google Scholar] [CrossRef]
- Yañez, C.; Kristjanpoller, W.; Minutolo, M.C. Stock market index prediction using transformer neural network models and frequency decomposition. Neural Comput. Appl. 2024, 36, 15777–15797. [Google Scholar] [CrossRef]
- Ruiru, D.; Jouandeau, N.; Owuor, D. Recurrent Neural Networks with Transformers to Trade Financial Instruments. SN Comput. Sci. 2025, 6, 934. [Google Scholar] [CrossRef]
- Sezer, O.B.; Gudelek, M.U.; Ozbayoglu, A.M. Financial time series forecasting with deep learning: A systematic literature review: 2005–2019. Appl. Soft Comput. 2020, 90, 106181. [Google Scholar] [CrossRef]
- Olorunnimbe, K.; Viktor, H. Deep learning in the stock market—A systematic survey of practice, backtesting, and applications. Artif. Intell. Rev. 2023, 56, 2057–2109. [Google Scholar] [PubMed]
- Ma, Y.; Ventre, C.; Polukarov, M. Denoised labels for financial time series data via self-supervised learning. In Proceedings of the Third ACM International Conference on AI in Finance, New York, NY, USA, 2–4 November 2022; pp. 471–479. [Google Scholar]
- Song, C.H.; Liu, S.; Chen, L. The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting. arXiv 2026, arXiv:2602.03395. [Google Scholar]
- Tobing, V.; Hamzah, M.; Chew, J.; Hamdan, M. Towards Digital Twin Integrated Border Control. In Proceedings of the 2024 IEEE International Conference on Computing (ICOCO), Kuala Lumpur, Malaysia, 12–14 December 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 114–121. [Google Scholar]
- Oord, A.V.D.; Li, Y.; Vinyals, O. Representation learning with contrastive predictive coding. arXiv 2018, arXiv:1807.03748. [Google Scholar]
- Sun, J.; Qing, Y.; Liu, C.; Lin, J. Self-fts: A self-supervised learning method for financial time series representation in stock intraday trading. In Proceedings of the 2022 IEEE 20th International Conference on Industrial Informatics (INDIN), Perth, Australia, 25–28 July 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 501–506. [Google Scholar]
- Hwang, Y.; Zohren, S.; Lee, Y. Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management. arXiv 2024, arXiv:2407.13751. [Google Scholar]
- Abdulsahib, H.M.; Ghaderi, F. Cross-domain disentanglement: A novel approach to financial market prediction. IEEE Access 2024, 12, 16255–16265. [Google Scholar] [CrossRef]
- Hosmer, D.W., Jr.; Lemeshow, S.; Sturdivant, R.X. Applied Logistic Regression; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Huang, X.; Khetan, A.; Cvitkovic, M.; Karnin, Z. Tabtransformer: Tabular data modeling using contextual embeddings. arXiv 2020, arXiv:2012.06678. [Google Scholar]
- Kim, Y. Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; pp. 1746–1751. [Google Scholar]
- Lample, G.; Ballesteros, M.; Subramanian, S.; Kawakami, K.; Dyer, C. Neural architectures for named entity recognition. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA, 12–17 June 2016; pp. 260–270. [Google Scholar]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA, 2–7 June 2019; pp. 4171–4186. [Google Scholar]
- Zhao, Z.; Xiao, Z.; Tao, J. MSDG: Multi-scale dynamic graph neural network for industrial time series anomaly detection. Sensors 2024, 24, 7218. [Google Scholar] [CrossRef] [PubMed]
- Qu, X.; Liu, Z.; Wu, C.Q.; Hou, A.; Yin, X.; Chen, Z. Mfgan: Multimodal fusion for industrial anomaly detection using attention-based autoencoder and generative adversarial network. Sensors 2024, 24, 637. [Google Scholar] [CrossRef] [PubMed]
- Shao, Y.; Wang, S.; Zhao, W. A causality-aware perspective on domain generalization via domain intervention. Electronics 2024, 13, 1891. [Google Scholar] [CrossRef]
- Papadakis, A.; Spyrou, E. A multi-modal egocentric activity recognition approach towards video domain generalization. Sensors 2024, 24, 2491. [Google Scholar] [CrossRef] [PubMed]
- Chen, H. Optimization of Real-time Logistics Tracking and Anomaly Warning System for Cross-border E-commerce Based on Edge Computing and Long Short-Term Memory (LSTM). In Proceedings of the 2025 5th International Conference on Computer, Internet of Things and Control Engineering (CITCE), Guangzhou, China, 19–21 December 2025; IEEE: Piscataway, NJ, USA, 2025; pp. 147–153. [Google Scholar]
- Xi, Y.; Jia, X.; Zhang, H. Real-time multimodal route optimization and anomaly detection for cross-border logistics using deep reinforcement learning. Acad. Nexus J. 2024, 3, 133–142. [Google Scholar]
- Shi, J. An Intelligent Logistics Scheduling System Based on Big Data and IoT for Cross-Border Trade: A Case Study of China-Armenia Trade Corridor. In Proceedings of the 2026 3rd International Conference on Autonomous Driving and Intelligent Sensing Technology, Wuhan, China, 6–8 February 2026; pp. 226–230. [Google Scholar]
- Ceylan, B.O.; Elidolu, G.; Sezer, S.I.; Akyuz, E.; Yang, Z. Probabilistic risk assessment for inert gas system on oil tanker ships using system theoretic accident model and process (STAMP) and Bayesian belief network (BBN). Reliab. Eng. Syst. Saf. 2025, 266, 111669. [Google Scholar] [CrossRef]
- Wu, X.; Feng, S.; Li, J.; Du, Z.; Tang, R.; Wang, J.; Leng, P. Financial Disclosure Identification and Risk Assessment Model for Listed Companies Based on BERT and Graph Neural Networks. In Proceedings of the 2025 International Conference on Economic Management and Big Data Application, Shenzhen, China, 25–27 July 2025; pp. 383–388. [Google Scholar]







| Data Name | Data Volume | Sensor Type |
|---|---|---|
| Electronic documents and commodity texts | 128,460 | Scanner/OCR device |
| Structured declaration fields | 86,320 | Port terminal/Declaration terminal |
| GPS land trajectories (Total) | 42,680 | GPS sensor |
| Proprietary operational GPS data | 34,144 | GPS sensor |
| Public GPS data | 8536 | GPS sensor |
| AIS maritime trajectories (Total) | 31,900 | AIS device |
| Proprietary operational AIS data | 19,140 | AIS device |
| Public AIS data | 12,760 | AIS device |
| Port weighing data | 39,760 | Weighbridge/Dynamic weighing sensor |
| X-ray inspection images | 18,240 | X-ray inspection device |
| RFID identification data | 28,450 | RFID reader/RFID tag |
| Electronic seal status data | 24,630 | Electronic seal/Door sensor |
| Cold-chain temperature and humidity data | 35,720 | Temperature/Humidity sensor |
| Transportation vibration data | 16,850 | Acceleration/Vibration sensor |
| Port video and image data | 21,360 | HD camera/LPR camera |
| Anomaly risk labels (Total) | 86,320 | Inspection/Review terminal |
| Positive anomaly samples | 7520 | Inspection/Review terminal |
| Negative normal samples | 78,800 | Inspection/Review terminal |
| Model | Accuracy | Precision | Recall | F1-Score | AUC | PR-AUC |
|---|---|---|---|---|---|---|
| Logistic Regression | ||||||
| Random Forest | ||||||
| XGBoost | ||||||
| MLP | ||||||
| TabTransformer | ||||||
| TextCNN | ||||||
| BiLSTM | ||||||
| BERT | ||||||
| BERT+MLP | ||||||
| Multimodal Transformer | ||||||
| MSDG | ||||||
| MFGAN | ||||||
| CSRDN | ||||||
| MDGAR | ||||||
| Proposed method | ||||||
| p-value (vs. best baseline) |
| Evaluation Scenario | Accuracy | F1-Score | AUC | Top-K Hit Rate | Early Warning Gain |
|---|---|---|---|---|---|
| Random test split | |||||
| Cross-time testing | |||||
| Cross-region testing | |||||
| Cross-port testing | |||||
| Cross-entity testing | |||||
| Text noise perturbation | |||||
| Missing modality simulation | |||||
| Logistics trajectory perturbation | |||||
| Sensor record missing |
| Model Variant | Accuracy | Precision | Recall | F1-Score | AUC | PR-AUC |
|---|---|---|---|---|---|---|
| Full model | ||||||
| w/o document semantic modality | ||||||
| w/o structured declaration modality | ||||||
| w/o entity relationship modality | ||||||
| w/o cross-modal attention | ||||||
| w/o contrastive alignment loss | ||||||
| w/o causal debiasing module | ||||||
| w/o counterfactual response module | ||||||
| w/o engineering constraints | ||||||
| Only document + structured fields | ||||||
| Only structured fields | ||||||
| p-value (vs. best variant) |
| Hyperparameter | Evaluated Value | F1-Score | AUC |
|---|---|---|---|
| Batch size | 16 | ||
| Batch size | 32 | ||
| Batch size | 64 | ||
| Learning rate | |||
| Learning rate | |||
| Learning rate | |||
| Causal loss weight | |||
| Causal loss weight | |||
| Causal loss weight | |||
| Counterfactual loss weight | |||
| Counterfactual loss weight | |||
| Counterfactual loss weight |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yin, J.; Lu, Z.; Xiong, B.; Sun, K.; Liu, R.; Liu, Y.; Li, M. Multisource Port Inspection Sensor Fusion with Causal Representation Learning for Cross-Border Anomaly Monitoring. Sensors 2026, 26, 4142. https://doi.org/10.3390/s26134142
Yin J, Lu Z, Xiong B, Sun K, Liu R, Liu Y, Li M. Multisource Port Inspection Sensor Fusion with Causal Representation Learning for Cross-Border Anomaly Monitoring. Sensors. 2026; 26(13):4142. https://doi.org/10.3390/s26134142
Chicago/Turabian StyleYin, Jiaxin, Zhengjia Lu, Baodi Xiong, Kai Sun, Ruijia Liu, Yachi Liu, and Manzhou Li. 2026. "Multisource Port Inspection Sensor Fusion with Causal Representation Learning for Cross-Border Anomaly Monitoring" Sensors 26, no. 13: 4142. https://doi.org/10.3390/s26134142
APA StyleYin, J., Lu, Z., Xiong, B., Sun, K., Liu, R., Liu, Y., & Li, M. (2026). Multisource Port Inspection Sensor Fusion with Causal Representation Learning for Cross-Border Anomaly Monitoring. Sensors, 26(13), 4142. https://doi.org/10.3390/s26134142
