- Article
SADAMB: Advancing Spatially-Aware Vision-Language Modeling Through Datasets, Metrics, and Benchmarks
- Giorgos Papadopoulos,
- Petros Drakoulis,
- Athanasios Ntovas,
- Alexandros Doumanoglou and
- Dimitris Zarpalas
Understanding spatial relationships between objects in images is crucial for robotic navigation, augmented reality systems, and autonomous driving applications, among others. However, existing vision-language benchmarks often overlook explicit spatia...