Leveraging Multimodal Large Language Models (MLLMs) for Enhanced Object Detection and Scene Understanding in Thermal Images for Autonomous Driving Systems
Huthaifa I. Ashqar, Taqwa I. Alhadidi, Mohammed Elhenawy and Nour O. Khanfar
Integrating thermal imaging data with multimodal large language models (MLLMs) could improve the safety and functionality of autonomous driving systems (ADS) and intelligent transportation systems (ITS). This stud...

