Next Article in Journal
Archaeometric Investigation of Longshan-Period Pottery from Western Shangqiu, Henan: Raw Material Selection, Technological Choices, and Regional Variation
Previous Article in Journal
From Digital Transition to Low-Impact Museums: A Strategic Planning Framework for Sustainable Museum Transformation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Closed-Set vs. Open-Vocabulary Object Detectors for Urban Architectural Typology Classification: A Comparative Study on Athenian Heritage Buildings

by
Konstantinos Filippatos
1,
Konstantina Siountri
2 and
Christos-Nikolaos Anagnostopoulos
1,*
1
Intelligent Systems Lab, Cultural Technology and Communication Department, University of the Aegean, GR81100 Mytilene, Greece
2
Service of Modern Monuments and Technical Projects of Attica, Eastern Central Greece and the Cyclades, Ministry of Culture, GR10555 Athens, Greece
*
Author to whom correspondence should be addressed.
Heritage 2026, 9(5), 206; https://doi.org/10.3390/heritage9050206
Submission received: 9 April 2026 / Revised: 19 May 2026 / Accepted: 19 May 2026 / Published: 21 May 2026
(This article belongs to the Section Architectural Heritage)

Abstract

Architectural typology classification plays an important role in large-scale documentation and analysis of urban cultural heritage. Recent advances in computer vision enable automated approaches for detecting and categorizing buildings from street-level imagery, yet the suitability of different detection paradigms for architectural typology analysis remains insufficiently explored. Despite recent advances in computer vision for architectural analysis, no systematic comparative study has evaluated closed-set CNN-based detectors against open-vocabulary vision–language grounding models for urban architectural typology classification. This study presents a comparative evaluation of closed-set convolutional object detectors and open-vocabulary vision–language grounding models for the classification of Athenian architectural typologies. A dataset of 3349 street-view images containing 11,111 annotated building instances was compiled and organized into five typological categories: Neoclassical, Neoclassical-Eclectic, Interwar-Eclectic, Interwar, and Postwar. The experiments compare several YOLO-based detection configurations with Grounding DINO under zero-shot inference, parameter-efficient adaptation (e.g., Kiw Rank Adaptation—LoRA), and full fine-tuning. Results show that supervised YOLO-based models achieve robust detection and classification performance with high localization accuracy and consistent typology discrimination in dense urban scenes. In contrast, open-vocabulary grounding models demonstrate limited reliability in zero-shot settings and require substantial adaptation to approach comparable performance levels. Analysis of confusion patterns further reveals that most classification errors originate from intrinsic architectural similarities between transitional styles rather than from model instability. The findings highlight the advantages of supervised object detection frameworks for scalable urban heritage documentation and provide insights into the current limitations of vision–language models for fine-grained architectural typology classification.
Keywords: architectural typology; heritage buildings; pattern recognition; YOLO; grounding DINO architectural typology; heritage buildings; pattern recognition; YOLO; grounding DINO

Share and Cite

MDPI and ACS Style

Filippatos, K.; Siountri, K.; Anagnostopoulos, C.-N. Closed-Set vs. Open-Vocabulary Object Detectors for Urban Architectural Typology Classification: A Comparative Study on Athenian Heritage Buildings. Heritage 2026, 9, 206. https://doi.org/10.3390/heritage9050206

AMA Style

Filippatos K, Siountri K, Anagnostopoulos C-N. Closed-Set vs. Open-Vocabulary Object Detectors for Urban Architectural Typology Classification: A Comparative Study on Athenian Heritage Buildings. Heritage. 2026; 9(5):206. https://doi.org/10.3390/heritage9050206

Chicago/Turabian Style

Filippatos, Konstantinos, Konstantina Siountri, and Christos-Nikolaos Anagnostopoulos. 2026. "Closed-Set vs. Open-Vocabulary Object Detectors for Urban Architectural Typology Classification: A Comparative Study on Athenian Heritage Buildings" Heritage 9, no. 5: 206. https://doi.org/10.3390/heritage9050206

APA Style

Filippatos, K., Siountri, K., & Anagnostopoulos, C.-N. (2026). Closed-Set vs. Open-Vocabulary Object Detectors for Urban Architectural Typology Classification: A Comparative Study on Athenian Heritage Buildings. Heritage, 9(5), 206. https://doi.org/10.3390/heritage9050206

Article Metrics

Back to TopTop