Abstract
Architectural discourse is a specialised language whose key terms shift with context, which complicates empirical claims about meaning. This study addresses this problem by testing whether a rigorously audited, reproducible NLP framework can recover a core theoretical distinction in architectural language, specifically the conceptual versus physical split, using Japanese terms as a focused case. The objective is to evaluate contextual embeddings against static baselines under controlled conditions and to release an end-to-end pipeline that others can rerun exactly. We assemble a ~1.98-million-word corpus spanning architecture, history, philosophy, and theology; train Word2Vec (CBOW, Skip-gram) and a fine-tuned BERT on the same sentences; derive embeddings; and cluster terms with k-means and Agglomerative methods. Internal validity is assessed using the Adjusted Rand Index against a phenomenological gold standard split; external validity is assessed by correlation with WordSim-353; robustness is examined through a negative-control relabelling and a definitional audit comparing FULL and CLEAN corpora; seeds, versions, and artefacts are pinned for exact reruns in the archived environment; and identity across different hardware is not claimed. The study finds that BERT cleanly recovers the split with ARI 0.852 (FULL) and 0.718 (CLEAN). BERT and CBOW show no seed variation. Both Word2Vec models hover near chance, but Skip-gram shows instability across seeds. We provide a transparent, reusable methodology, with released assets, that enables falsifiable and scalable claims about architectural semantics.
1. Introduction
Architectural discourse forms a distinct linguistic domain where technical vocabulary, abstract reasoning, metaphor, and interpretations coexist to define how the built environment is conceived and described. The meaning of architectural terms is rarely fixed, often depending on context, philosophy, and cultural lineage. Words such as the Japanese “Ma” (間) or “Yohaku” (余白) illustrate how simple definitions cannot capture such spatial or aesthetic notions, since their significance lies in relational and interpretative structures.
Architectural history does not focus solely on buildings; a substantial body of writing shapes the field by arguing with and defining itself, from ancient treatises to modern critiques [1,2]. For a long time, scholars have employed interpretive methods, such as hermeneutics and semiotics, to uncover meaning in these texts, as well as in buildings themselves [3,4,5,6]. Some approaches, drawing on phenomenology, discuss architecture in terms of lived experience, employing a language of “dwelling” that technical descriptions cannot adequately represent [5,6]. Interpretive approaches to architectural meaning have followed different paths; some treat buildings as a language, using semiotics to decipher them as a system of signs communicating through cultural codes [3]. Drawing on phenomenology, others have instead explored architecture’s poetic and psychological meaning, analysing intimate spaces as vessels for memory, imagination, and lived experience [7]. Thinkers such as Venturi, who examined the communicative power of buildings, and Rossi, who linked city forms to collective memory, have treated architecture as something to be read [8,9]. This linguistic richness has long challenged systematic study, leading architectural theory to traditionally rely on interpretative frameworks, such as close reading, hermeneutics, and phenomenology, to make sense of how architecture speaks about itself [10,11,12].
The field supports this approach as a fundamental practice. It provides a method for designers to interpret the “layered and complex” meanings of place and to “read” the semiotic meaning of a design as a “virtual building” before it is ever constructed [10,11]. Conversely, other researchers see this as a critical weakness. Cash, for example, argues that the field’s reliance on such non-systematic, interpretative approaches has led to a “scarcity” of robust theory and a lack of “scientific, theoretical, and methodological rigour”, ultimately weakening design research and limiting its impact when compared to more theory-driven disciplines [12]. These methods have produced deep insights, emphasising context, intention, and experience, but they are not easily scalable. They rely on individual judgement and are difficult to verify or reproduce. As a result, architectural scholarship remains rich in interpretation but poor in formal, transparent methods for examining how its own language operates [13].
Recent developments in computational research have changed how disciplines analyse text. The rise of Natural Language Processing (NLP) and large-scale text modelling has made it possible to trace conceptual structures and semantic shifts across extensive corpora [14,15]. Often described as part of a broader computational turn, these methods have started to influence the humanities, including art history and architectural studies. In architecture, they have been used to examine how computation is discussed in design discourse, to reveal thematic constellations in journals, and to analyse shifts in terminology linked to design technologies [16,17,18,19]. These studies demonstrate that computational models can extract empirical regularities across vast bodies of writing, suggesting patterns that complement close reading and analysis. Yet they also raise significant methodological and philosophical concerns. The difficulty in verifying computational results often stems not from the models themselves being opaque, but from the method used to validate their outputs. Many human evaluation studies in NLP are effectively non-repeatable, with an estimated ~5% reporting enough publicly available design detail for accurate reruns, rising to ~20% with author assistance [20].
Furthermore, the field lacks standardised methods for quantified reproducibility assessment, meaning that comparisons between original and repeated experiments often rely on subjective judgments, leading to contradictory conclusions about a system’s performance [20,21]. Reproducibility is a known problem in NLP; models depend on changing environments, unshared code, and unlogged parameters, making replication almost impossible [22,23]. In the humanities, this problem becomes more acute because evidence often takes the form of interpretive arguments, rather than numerical performance. Without transparency and validation, computational approaches risk replacing interpretation with unexamined automation rather than extending critical understanding [24].
Architectural writing adds another layer of complexity. Its key terms, ma, engawa, tokonoma, or aware, carry multiple, context-dependent meanings. They often fuse spatial, temporal, and moral dimensions that resist literal translation [25]. A computational approach must therefore be capable of representing polysemy, the coexistence of multiple related senses within a single word. Static word embeddings, such as word2vec, assign each word type a single learned vector in a shared embedding space [26,27]. This approach assumes that a word’s meaning is stable, which is unsuitable for theoretical language where terms shift according to context or philosophical orientation [28,29].
On the other hand, contextual models, such as Bidirectional Encoder Representations from Transformers (BERT), produce token representations that depend on surrounding context, so the same word’s vector changes with its usage [30,31]. A sentence about ma in a discussion of emptiness produces a different embedding from one where ma describes rhythm or pause. This dynamic quality aligns with hermeneutic principles; meaning is not stored, but made through relation and use [10,11,12]. Choosing such a model is not merely a technical decision but a methodological alignment with architecture’s interpretative logic.
To make computational analysis credible for architectural theory, we must adhere to two key principles: reproducibility and validation. Reproducibility means that others can reconstruct the dataset, pipeline, and environment exactly as they were used, allowing for the verification of results [20,21,22,23]. Validation ensures that model behaviour can be tested and justified. This study introduces a reproducible computational framework that meets both standards while respecting the specific nature of architectural language. The framework operates on three validation layers. Firstly, internal validation: comparing clustering outputs against a theory-driven “gold standard” derived from established phenomenological classifications, using a metric such as the Adjusted Rand Index [32]. Secondly, external validation: to verify that the models learn coherent relationships beyond the corpus, the results are tested against a general semantic benchmark (WordSim-353) [33,34]. Thirdly, robustness testing: as recent research demonstrates, large language models may learn to exploit superficial patterns or dataset biases as “shortcuts” to make predictions, rather than engaging in deep semantic reasoning [35]; applying such a robustness diagnostic mitigates the risk of shortcut learning.
To test for this vulnerability, our framework includes a definitional audit that filters out dictionary-like sentences. The rationale is that a model could learn to associate a term with its explicit definition as a simple shortcut, bypassing the more complex task of inferring meaning from implicit contextual use. By evaluating the model on a corpus stripped of these explicit definitions, we can confirm that it has developed a robust, generalisable understanding. These procedures make the framework transparent and reproducible, providing an empirical ground for theoretical interpretation.
This paper tests this framework by comparing a fine-tuned multilingual BERT model and conventional Word2Vec baselines. The corpus includes theoretical and historical architectural writings across multiple languages, curated to reflect diverse interpretative traditions. Logging all code, parameters, and random seeds ensures exact replication for the entire process [20,21,22,23]. The process has three steps to evaluate the results: alignment with phenomenological categories, general semantic coherence, and sensitivity to definitional filtering. The expectation is that BERT, with its contextual embeddings, will more accurately capture the nuanced semantics of architectural concepts than static models. Thus, the framework serves as a methodological tool and a philosophical experiment, testing whether computational models embody the contextual reasoning central to architectural thought. The ambition is not to mechanise interpretation but to build a reproducible bridge between quantitative analysis and humanistic reading.
The research aim and scope of this paper is to test whether a contextual language model, in an audited and fully reproducible pipeline, recovers theoretically grounded distinctions in architectural discourse more reliably than static baselines. The focus is on Japanese architectural terms. Evaluation uses three validation layers: (i) internal clustering against a phenomenological gold split with the Adjusted Rand Index (ARI); (ii) an external semantic similarity benchmark, WordSim-353; and (iii) a definitional audit comparing a FULL corpus with a CLEAN version that removes glossary-like sentences. The target is methodological, not ethnographic. The contribution is a validated, reusable framework for architectural semantics.
The central question of this study is whether contextual embeddings, under audited reproducible conditions, better capture the conceptual structure of architectural language than static embeddings.
The study aims to achieve the following objectives: (i) test whether contextual embeddings outperform static embeddings on the conceptual–physical split, measured by ARI; (ii) assess alignment between ARI and WordSim-353 to check whether a general benchmark predicts domain success; (iii) probe sensitivity via a negative-control relabelling of one class; (iv) audit definitional bias by comparing FULL and CLEAN corpora; (v) release an end-to-end reproducible pipeline.
The research gaps are as follows:
- Lack of audited, fully reproducible NLP pipelines in architectural semantics; most prior work uses ad hoc code with limited validation.
- Assumption that general similarity benchmarks stand in for domain adequacy without checking task alignment in this setting.
- Little auditing of definitional sentences that may inflate results; negative-control tests are typically absent.
- Limited reuse due to missing end-to-end assets and incomplete reporting.
The contributions of this work are outlined as follows:
- An audited, fully reproducible pipeline with released assets.
- Three validation layers: internal clustering against a phenomenological gold split using ARI; external semantic similarity via WordSim-353; a definitional audit comparing FULL and CLEAN corpora.
- An explicit negative control relabelling to test sensitivity to theoretical inconsistency.
- Evidence on task alignment between ARI and WordSim-353, clarifying when general benchmarks fail as proxies.
This study addresses the following research questions:
- RQ1: Do contextual embeddings (BERT) outperform static embeddings (Word2Vec) in distinguishing conceptual from physical terms?
- RQ2: Does performance on the general similarity benchmark, WordSim-353, predict performance on the theory-driven clustering objective, ARI, in this domain?
- RQ3: Can the framework detect a deliberate theoretical inconsistency introduced as a negative control?
- RQ4: How does removing definitional sentences, comparing FULL with CLEAN, affect model behaviour and stability?
These questions are associated with the following hypotheses:
- H1: BERT aligns more closely with the phenomenological taxonomy than Word2Vec on ARI.
- H2: The model rank ordering on ARI will not match the ordering on WordSim-353; the contextual model will lead on ARI, while a static baseline may appear competitive on WordSim-353.
- H3: After removing definitional sentences, performance remains within a pre-specified tolerance, ∆ARI ≤ 0.20, indicating reliance on contextual rather than definitional cues. Note: H1 is confirmatory; we treat RQ2 and H2 as theory-motivated, exploratory analyses of task alignment.
This paper is presented in five sections. Section 2 outlines the corpus, preprocessing, and models, and defines the three validation layers, including negative control relabelling and the definitional audit. Section 3 reports clustering alignment, benchmark results, and robustness checks. Section 4 interprets findings and outlines limitations and implications for architectural theory. Section 5 concludes and outlines extensions to broader corpora.
2. Materials and Methods
2.1. Corpus Creation and Composition
A corpus of ~1.98 million words in the English language was compiled from 845 websites to provide diverse interpretative contexts for polysemous architectural terms, drawing on architecture, history, philosophy, and theology.
The process of data mining began with the search term “architecture” in major online encyclopaedias (Britannica and Wikipedia). From each entry, the text collection continued by following hyperlinks recursively to related topics, including philosophy, religion, aesthetics, and history. The hyperlink chase excluded proper nouns (for example, cities and countries) and off-topic technical subfields (for example, naval or computational architecture). For each page, we kept only the main body prose and any technical figure captions, discarding navigation text, tables, and purely illustrative captions. For every page, we logged the URL, title, site, access date (July 2025), and a content checksum.
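To make the logging step concrete, the sketch below appends one manifest record per page with a SHA-256 checksum of the retained prose; the field names and file layout are illustrative assumptions rather than the released crawler’s exact schema.

```python
# Illustrative per-page manifest logging; field names and manifest path are
# assumptions, not the released crawler's exact schema.
import datetime
import hashlib
import json

def log_page(url, title, site, body_text, manifest_path="corpus_manifest.jsonl"):
    """Append one manifest record with a SHA-256 checksum of the retained prose."""
    record = {
        "url": url,
        "title": title,
        "site": site,
        "access_date": datetime.date.today().isoformat(),
        "sha256": hashlib.sha256(body_text.encode("utf-8")).hexdigest(),
        "n_words": len(body_text.split()),
    }
    with open(manifest_path, "a", encoding="utf-8") as fh:
        fh.write(json.dumps(record, ensure_ascii=False) + "\n")
```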
This crawl-and-expand strategy was designed to create a general corpus spanning architecture, history, philosophy, and theology, aiming to create a rich, multi-layered “interpretative context” that mimics the hermeneutic interplay in architectural discourse. This approach allows polysemous Japanese terms (such as “Mu” or “Ma”) to emerge relationally, testing whether models like BERT can dynamically infer conceptual versus physical nuances from surrounding themes (e.g., phenomenology in theology or spatial philosophy in history), rather than relying on rote domain-specific training.
The corpus was designed to establish a temporal benchmark that captures representations of Japanese architectural terms as of July 2025. The prioritised sources, dynamic online encyclopaedias, evolve through ongoing editorial updates and community contributions. This approach intentionally introduces variability in the exact corpus content across reproductions, as web sources are subject to change over time. However, reproducibility is preserved through the method detailed in the public repository, which includes scripted crawls, seed queries, recursion depth, and the logging of timestamps, checksums, and configurations. Future studies can leverage this snapshot to track longitudinal shifts in linguistic and informational portrayals of these terms, assessing how cultural interpretations adapt in digital discourse.
Ten peer-reviewed papers were manually added to introduce technical vocabulary and formal prose, broadening stylistic and disciplinary coverage. The search was run in J-STAGE and ProQuest (Scholarly Journals) using the query “Japanese/space/Engawa”. Inclusion criteria were peer-reviewed journal articles containing at least one target term (for example, engawa, ma, mu, shakkei) in the title, abstract, or main text. From the eligible set, we drew a random sample of ten articles to minimise selection bias while diversifying venues and registers. These papers were used solely as corpus text and did not influence the determination of labels or evaluation metrics.
The resulting corpus encompasses a range of registers, including descriptive, theoretical, and functional registers, such as encyclopaedic exposition, historical narrative, philosophical commentary, and architectural description. Deliberately retaining this variety ensures that the models have enough diverse materials to capture the dual nature of the Japanese architectural lexicon, encompassing both physical and aesthetic aspects, through emergent semantic relations within the general corpus. The study focuses on 28 traditional Japanese architectural terms: ma (間), mu (無), wabi (侘), sabi (寂), wabi_sabi (わびさび/侘寂), en (縁), engawa (縁側), shōji (障子), fusuma (襖), tokonoma (床の間), chigaidana (違い棚), doma (土間), genkan (玄関), tomeishi (止石), ikezuishi (いけず石/行けず石), shakkei (借景), roji (路地), torii (鳥居), shimenawa (注連縄), tsuboniwa (坪庭), byōbu (屏風), hisashi (庇/廂), haiden (拝殿), sandō (参道), aware (哀れ), honden (本殿), chashitsu (茶室), and karesansui (枯山水). A complete glossary of all 28 terms, with definitions, is provided in Appendix A.
The full corpus manifest and analysis scripts are available in the public repository referenced in the Data Availability Statement. Each execution logs corpus checksums and configuration hashes to guarantee reproducibility.
2.2. Two-Corpus Design: FULL vs. CLEAN
We created two parallel corpora that serve as an explicit experimental variable. Each corpus has tailored preprocessing based on the model’s specific computational needs. The FULL corpus comprised 1,975,868 words for BERT processing and 1,890,739 words for Word2Vec CBOW training; after definitional cleaning, the CLEAN corpus was reduced to 1,968,370 words for BERT (a 0.38% decrease) and 1,887,234 words for CBOW (a 0.19% decrease). FULL retains all sentences after basic normalisation (e.g., tokenisation and lower-casing). CLEAN was produced by a dedicated cleaning script that implements the full preprocessing pipeline: sentence segmentation, tokenisation, lower-casing and punctuation removal (for distributional training), lemmatisation and part-of-speech tagging, hybrid stop-word removal, and retention of Japanese tokens. Crucially, the script detects and removes definitional and glossary-like sentences using regular expressions over copular and appositive frames (e.g., “X is a Y,” “X refers to Y,” “X is defined as …,” “X means …,” “X, a Y”).
The optional patterns stored in filters.yaml function only as a safety layer: a secondary pass that catches residual definitional sentences missed by the cleaning script. They do not generate the CLEAN corpus themselves. Both corpora were used to train each model under identical hyperparameters, allowing for a direct paired comparison (Δ = CLEAN − FULL).
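A minimal sketch of the definitional filter is shown below. The regular expressions approximate the copular and appositive frames listed above; the authoritative patterns are those in the released cleaning script and filters.yaml.

```python
# Hedged approximation of the definitional filter (FULL -> CLEAN). The released
# cleaning script and filters.yaml hold the authoritative patterns.
import re

DEFINITIONAL_PATTERNS = [
    re.compile(r"\b\w+ is an? \w+", re.IGNORECASE),        # "X is a Y"
    re.compile(r"\b\w+ refers to\b", re.IGNORECASE),        # "X refers to Y"
    re.compile(r"\b\w+ is defined as\b", re.IGNORECASE),    # "X is defined as ..."
    re.compile(r"\b\w+ means\b", re.IGNORECASE),            # "X means ..."
    re.compile(r"\b\w+, an? \w+", re.IGNORECASE),           # "X, a Y" (appositive)
]

def is_definitional(sentence: str) -> bool:
    """Return True if the sentence matches any glossary-like frame."""
    return any(p.search(sentence) for p in DEFINITIONAL_PATTERNS)

def make_clean(sentences):
    """Drop definitional sentences; everything else passes through to CLEAN."""
    return [s for s in sentences if not is_definitional(s)]
```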
2.3. Model-Specific Preprocessing
BERT (bert-base-multilingual-cased): Preprocessing was conservative to preserve grammatical structure and long-range dependencies. Case and punctuation were retained, while Japanese and English terminology was standardised via a normalisation map (e.g., “縁側” → “engawa”; “障子” → “shōji”). The Hugging Face model revision was explicitly pinned through the hf_revision argument; runs terminate if this tag is missing or set to main/latest. These procedures generated the BERT_FULL and BERT_CLEAN datasets.
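A sketch of this step is given below: the normalisation map excerpt and the pinned-revision guard mirror the description above, while the exact mapping table and the pinned tag are assumptions standing in for the repository configuration.

```python
# Sketch of BERT-side normalisation and revision pinning. The mapping is an
# excerpt and "<pinned-commit-or-tag>" is a placeholder; the repository
# configuration holds the authoritative values.
from transformers import AutoModelForMaskedLM, AutoTokenizer

NORMALISATION_MAP = {"縁側": "engawa", "障子": "shōji"}  # excerpt only

def normalise(text: str) -> str:
    """Standardise Japanese surface forms to their romanised target terms."""
    for source, target in NORMALISATION_MAP.items():
        text = text.replace(source, target)
    return text

def load_pinned(model_name="bert-base-multilingual-cased",
                hf_revision="<pinned-commit-or-tag>"):
    """Refuse to run unless an explicit model revision is pinned."""
    if hf_revision in (None, "", "main", "latest"):
        raise ValueError("hf_revision must be pinned to an explicit tag or commit")
    tokenizer = AutoTokenizer.from_pretrained(model_name, revision=hf_revision)
    model = AutoModelForMaskedLM.from_pretrained(model_name, revision=hf_revision)
    return tokenizer, model
```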
Word2Vec (CBOW and Skip-gram): Distributional models receive more aggressive filtering to enhance local lexical density. The pipeline applies sentence segmentation, lower-casing, punctuation removal, lemmatisation, and part-of-speech tagging to retain {NOUN, PROPN, ADJ, VERB, ADV} while passing through Japanese tokens unaltered. Stop-word removal combines spaCy defaults with a custom list (for example, “e.g.” and “i.e.”). These filtered texts form W2V_FULL and W2V_CLEAN.
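The sketch below illustrates this filtering with spaCy; the pipeline name (en_core_web_sm), the excerpted pass-through list, and the custom stop additions are assumptions standing in for the released preprocessing script.

```python
# Illustrative Word2Vec preprocessing: sentence segmentation, lemmatisation,
# POS filtering, stop-word removal, and Japanese-term pass-through. Pipeline
# name, stop list, and pass-through set are assumptions.
import spacy

KEEP_POS = {"NOUN", "PROPN", "ADJ", "VERB", "ADV"}
EXTRA_STOPS = {"e.g.", "i.e."}                       # custom additions to spaCy defaults
JAPANESE_PASSTHROUGH = {"ma", "engawa", "tokonoma"}  # excerpt of the target-term list

nlp = spacy.load("en_core_web_sm")

def preprocess_for_w2v(text):
    """Yield one lower-cased, lemmatised token list per sentence."""
    doc = nlp(text)
    for sent in doc.sents:
        tokens = []
        for tok in sent:
            lower = tok.text.lower()
            if lower in JAPANESE_PASSTHROUGH:        # keep target terms unaltered
                tokens.append(lower)
            elif (tok.pos_ in KEEP_POS and tok.is_alpha
                  and not tok.is_stop and lower not in EXTRA_STOPS):
                tokens.append(tok.lemma_.lower())
        if tokens:
            yield tokens
```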
2.4. Model Training and Configuration
2.4.1. Final Model Configurations
Word2Vec Models: We trained two models, Continuous Bag-of-Words (CBOW) and Skip-gram, from scratch using the gensim library. All runs employ deterministic, single-thread execution. Key parameters are drawn from config.yaml: vector size = 100; window = 9; epochs = 50; min_count = 3; learning rate decays linearly (0.01 → 0.0001); hierarchical softmax enabled, negative sampling disabled (hs = 1, negative = 0).
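A minimal gensim call matching these values is sketched below; corpus loading and logging are assumed to happen elsewhere in the pipeline.

```python
# Deterministic Word2Vec training with the reported config.yaml values.
from gensim.models import Word2Vec

def train_word2vec(sentences, sg, seed=42):
    """sg = 0 trains CBOW, sg = 1 trains Skip-gram; a single worker keeps runs deterministic."""
    return Word2Vec(
        sentences=sentences,
        sg=sg,
        vector_size=100,
        window=9,
        epochs=50,
        min_count=3,
        alpha=0.01,
        min_alpha=0.0001,   # linear decay 0.01 -> 0.0001
        hs=1,
        negative=0,         # hierarchical softmax on, negative sampling off
        workers=1,
        seed=seed,
    )
```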
BERT Fine-Tuning: The bert-base-multilingual-cased model is fine-tuned for masked language modelling (MLM) using LoRA adapters applied to query, key, value, and output projections. Training hyperparameters are as follows: learning rate = 3 × 10−5; batch size = 32; epochs = 3; maximum sequence length = 128; warm-up ratio = 0.06; weight decay = 0.01; gradient accumulation = 2. The optimal checkpoint is selected automatically based on the lowest validation loss. All training is deterministic, and the exact model revision is pinned for full reproducibility.
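The sketch below shows how such a run can be assembled with the Hugging Face transformers and peft stacks. The target_modules list is an approximation of “query, key, value, and output projections”, LoRA rank and scaling are left at library defaults, and truncation to 128 tokens is assumed to happen at dataset tokenisation; the repository configuration is authoritative.

```python
# Hedged sketch of LoRA-adapted MLM fine-tuning with the reported hyperparameters.
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForMaskedLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

def build_trainer(train_ds, eval_ds, hf_revision):
    tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased",
                                              revision=hf_revision)
    model = AutoModelForMaskedLM.from_pretrained("bert-base-multilingual-cased",
                                                 revision=hf_revision)
    # Adapt attention query/key/value and the attention output projection.
    lora = LoraConfig(target_modules=["query", "key", "value", "attention.output.dense"])
    model = get_peft_model(model, lora)

    args = TrainingArguments(
        output_dir="bert_mlm_lora",
        learning_rate=3e-5,
        per_device_train_batch_size=32,
        num_train_epochs=3,
        warmup_ratio=0.06,
        weight_decay=0.01,
        gradient_accumulation_steps=2,
        eval_strategy="epoch",
        save_strategy="epoch",
        load_best_model_at_end=True,        # keep the lowest-validation-loss checkpoint
        metric_for_best_model="eval_loss",
        greater_is_better=False,
        seed=42,
    )
    collator = DataCollatorForLanguageModeling(tokenizer, mlm=True, mlm_probability=0.15)
    return Trainer(model=model, args=args, train_dataset=train_ds,
                   eval_dataset=eval_ds, data_collator=collator)
```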
2.4.2. Hyperparameter Selection and Justification
The final hyperparameters were not chosen arbitrarily. They were selected following a systematic grid search, which was combined with a qualitative validation to ensure model stability and analytic utility. The full search logs are documented in the project repository.
For BERT: The fine-tuning grid search included learning_rate (tested values: 5 × 10−5, 4 × 10−5, 3 × 10−5, 2 × 10−5, 1 × 10−5), per_device_train_batch_size (tested values: 16, 32), num_train_epochs (tested values: 1–3), and gradient_accumulation_steps (tested values: 1, 2). The search identified that the configuration with (‘learning_rate’: 3 × 10−5, ‘per_device_train_batch_size’: 32, ‘num_train_epochs’: 3, ‘warmup_ratio’: 0.06, ‘weight_decay’: 0.01, ‘max_length’: 128, ‘mlm_probability’: 0.15, ‘gradient_accumulation_steps’: 1) achieved the lowest validation loss (1.6598). However, a manual inspection of the training logs revealed that this model began to overfit after the second epoch. We therefore selected the gradient_accumulation_steps = 2 configuration. Under this setting, the model did not exhibit overfitting, resulting in a more stable and generalisable final model.
For Word2Vec: The grid search included vector_size (tested values: 75, 100, 125, 150), window (tested values: 3–20), alpha (tested values: 0.01, 0.015), and sg (tested values: 0, 1). The search for the lowest training loss identified vector_size = 75 as optimal, but with two very different windows: CBOW (‘vector_size’: 75, ‘window’: 20, ‘alpha’: 0.01, ‘epochs’: 50, ‘sg’: 0, ‘min_count’: 5, ‘hs’: 1, training loss: 59,588,864) and Skip-gram (‘vector_size’: 75, ‘window’: 3, ‘alpha’: 0.01, ‘epochs’: 50, ‘sg’: 1, ‘min_count’: 5, ‘hs’: 1, training loss: 67,559,304). However, a qualitative inspection of the top-performing CBOW model revealed a potentially critical flaw; nearest-neighbour checks surfaced a single token, “kami”, as the top neighbour for every target word except “Mu”. In this corpus, that pattern could be meaningful, reflecting a broad cultural regularity, yet it could also be a hubness artefact amplified by large context windows and token frequency. Because the goal here is a robust, reproducible baseline rather than an investigation of that phenomenon, we standardised the final models on a moderate configuration (‘min_count’: 3, ‘vector_size’: 100, ‘window’: 9) for both CBOW and Skip-gram. More importantly, this decision suppresses the “kami” concentration and ensures a consistent and fair comparison between the CBOW and Skip-gram architectures.
2.5. Embedding Derivation and Clustering
BERT representations: For each target term, the script collects all sentences containing the token and computes the mean of non-special-token embeddings for each sentence. These sentence-level means are averaged across 100 randomly sampled contexts, producing one contextual vector per term.
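A sketch of this derivation is shown below, following the mean-pooling description in this subsection; it assumes the fine-tuned encoder is loaded as a plain transformers AutoModel, and the function name and sampling details are illustrative rather than the released script’s API.

```python
# Hedged sketch: one contextual vector per term = mean over non-special tokens
# per sentence, averaged across up to 100 randomly sampled contexts.
import random
import torch

def term_vector(term, sentences, tokenizer, model, n_contexts=100, seed=777):
    contexts = [s for s in sentences if term in s]
    random.Random(seed).shuffle(contexts)
    sentence_means = []
    model.eval()
    with torch.no_grad():
        for sent in contexts[:n_contexts]:
            enc = tokenizer(sent, return_tensors="pt", truncation=True, max_length=128)
            hidden = model(**enc).last_hidden_state[0]            # (seq_len, dim)
            special = tokenizer.get_special_tokens_mask(
                enc["input_ids"][0].tolist(), already_has_special_tokens=True)
            keep = torch.tensor(special) == 0                     # drop [CLS]/[SEP]
            sentence_means.append(hidden[keep].mean(dim=0))
    return torch.stack(sentence_means).mean(dim=0)
```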
Word2Vec representations: Static embeddings correspond to the word-type vectors learned during training.
Clustering: Embeddings are grouped into clusters (K = 2) using k-means and Agglomerative clustering. K-means runs with n_init = 50 and max_iter = 1000; Agglomerative clustering operates on cosine distance (pre-computed) with average linkage. Optional z-score scaling and L2 normalisation are enabled by default. Because ARI is label-invariant, no manual label alignment is required.
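The clustering step can be sketched as follows, mirroring the parameters listed above; variable names are illustrative.

```python
# Clustering sketch: K = 2 with k-means and Agglomerative clustering on
# pre-computed cosine distances, with optional z-scoring and L2 normalisation.
import numpy as np
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.metrics.pairwise import cosine_distances
from sklearn.preprocessing import StandardScaler, normalize

def cluster_terms(X, seed=42, zscore=True, l2=True):
    """Return (k-means labels, Agglomerative labels) for a term-by-dimension matrix X."""
    X = np.asarray(X, dtype=float)
    if zscore:
        X = StandardScaler().fit_transform(X)
    if l2:
        X = normalize(X)  # row-wise L2 normalisation
    km_labels = KMeans(n_clusters=2, n_init=50, max_iter=1000,
                       random_state=seed).fit_predict(X)
    agg_labels = AgglomerativeClustering(n_clusters=2, metric="precomputed",
                                         linkage="average").fit_predict(cosine_distances(X))
    return km_labels, agg_labels
```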
2.6. Evaluation Metrics and Analysis Plan
Primary metric: Adjusted Rand Index (ARI) between model clusters and the gold conceptual/physical classification.
Secondary metric: Spearman rank correlation ρ (rho) between model cosine similarities and human ratings on the WordSim-353 benchmark.
Paired analysis: Each metric is computed for FULL and CLEAN, and paired differences are reported as Δ = CLEAN − FULL.
Uncertainty and significance: For ARI, bootstrap confidence intervals utilise 2000 resamples, and permutation tests employ 5000 label shuffles. For WordSim-353, bootstrap confidence intervals are based on 10,000 resamples, and permutation tests utilise 1000 label shuffles.
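An illustrative implementation of these metrics follows; the resample counts match the values above, while the exact resampling scheme of the released scripts may differ in detail.

```python
# Evaluation sketch: ARI against the gold split with a bootstrap CI and a
# permutation test, plus Spearman's rho for WordSim-353.
import numpy as np
from scipy.stats import spearmanr
from sklearn.metrics import adjusted_rand_score

def ari_with_uncertainty(gold, pred, n_boot=2000, n_perm=5000, seed=42):
    rng = np.random.default_rng(seed)
    gold, pred = np.asarray(gold), np.asarray(pred)
    observed = adjusted_rand_score(gold, pred)
    n = len(gold)
    # Bootstrap CI: resample terms with replacement.
    boots = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, n)
        boots.append(adjusted_rand_score(gold[idx], pred[idx]))
    ci = (float(np.percentile(boots, 2.5)), float(np.percentile(boots, 97.5)))
    # Permutation test: shuffle gold labels.
    null = [adjusted_rand_score(rng.permutation(gold), pred) for _ in range(n_perm)]
    p_value = (1 + sum(v >= observed for v in null)) / (n_perm + 1)
    return observed, ci, p_value

def wordsim_spearman(model_cosines, human_ratings):
    """Spearman rank correlation between model similarities and human judgements."""
    return spearmanr(model_cosines, human_ratings)
```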
2.7. Reproducibility and Artefact Management
Determinism is enforced throughout. The framework sets the seeds for Python 3.11.9, NumPy 1.26.4, and PyTorch 2.5.1 + cu121; forces single-threaded BLAS and single-threaded PyTorch CPU operations while leaving GPU execution enabled; trains Word2Vec with a single worker for determinism; disables TF32 fast paths; and logs all package versions, including CUDA driver metadata. Each run writes a machine snapshot, including the following:
- Code, configuration, and corpus SHA-256 hashes;
- Model and dataset manifests;
- Run-time environment details.
Artefacts are cached and reused when their hashes match, ensuring byte-identical results on identical hardware and software stacks. Cross-hardware numerical equivalence is not guaranteed due to differences in low-level floating-point operations.
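A condensed sketch of the determinism setup is shown below; the released pipeline sets these values from its configuration before any heavy imports or training runs, so the snippet is indicative rather than the exact entry point.

```python
# Determinism sketch: fixed seeds, single-threaded BLAS and CPU ops, TF32 disabled.
import os
import random

import numpy as np
import torch

def enforce_determinism(seed=42):
    os.environ["OMP_NUM_THREADS"] = "1"    # single-threaded BLAS (set before heavy imports in practice)
    os.environ["MKL_NUM_THREADS"] = "1"
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    torch.set_num_threads(1)               # single-threaded PyTorch CPU operations
    torch.backends.cuda.matmul.allow_tf32 = False   # disable TF32 fast paths
    torch.backends.cudnn.allow_tf32 = False
```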
Independent cross-environment check: In addition to deterministic settings and artefact logging, we verified output stability across three archived runs under two distinct Python stacks. The archived machine snapshots document materially different libraries, for example, Torch 2.5.1 with Transformers 4.56.0 versus Torch 2.2.2 with Transformers 4.51.3; yet, the pipeline reproduced the same ARI values and cluster assignments for BERT and CBOW on both FULL and CLEAN. Skip-gram likewise produced identical outputs across environments for any fixed seed; its variability arises only across seeds. The snapshots include configuration and code hashes, permitting a byte-level audit of all differences. This constitutes an environment-level reproducibility check beyond a single software setup. Byte-identical results across heterogeneous hardware are not claimed.
All code, corpora, YAML configuration files, and multi-seed run outputs used in this study are available in the public repository referenced in the Data Availability Statement.
2.8. Use of Generative AI
During the preparation of this study, the author used generative AI tools, including Google’s Gemini 2.5 Pro, OpenAI’s GPT-5, and Grok 4.0, to develop the Python script for the computational analysis pipeline described in this methodology. The author reviewed and edited all AI-generated output and takes full responsibility for the content of this publication.
3. Results
We report paired contrasts for each model under the two corpus conditions (Δ = CLEAN − FULL). Fine-tuned BERT aligns closely with FULL’s conceptual vs. physical gold split and remains strong on CLEAN. Removing definitional sentences reduces BERT’s separability by a modest margin while slightly improving Skip-gram on physical terms. External similarity shows corpus-dependent parity between BERT and Skip-gram. On CLEAN, WordSim-353 marginally favours Skip-gram; on FULL, fine-tuned BERT is slightly higher.
3.1. Pipeline Diagnostics and Validation
Before evaluating the primary hypotheses, we performed a series of diagnostic checks to validate the technical integrity of the experimental pipeline. These checks confirmed that the corpora were well-formed and that the training of all models was successful.
First, we examined the statistical properties of the corpora. The FULL and CLEAN corpora display typical Zipf-like frequency distributions (α ≈ 1.36; R2 ≈ 0.97), confirming that the text reflects natural language patterns and that no over-filtering occurred. These results verify that lexical frequency scaling remains intact after cleaning and that the corpora are statistically comparable in form (Figure 1).
Figure 1.
Zipf’s law for token frequencies in the FULL and CLEAN corpora plotted on log–log axes. Curves overlap closely, α ≈ 1.36 and R2 ≈ 0.97, indicating no preprocessing artefacts.
During training, both Word2Vec variants (CBOW and Skip-gram) showed stable optimisation curves with monotonic decreases in loss across all 50 epochs. The curves display no divergence or plateauing, indicating efficient convergence and balanced sampling. For instance, Skip-gram loss dropped dramatically from 18.51 M to 37.3 K on the FULL data, indicating successful learning (Figure 2).
Figure 2.
Word2Vec training loss across 50 epochs for CBOW and Skip-gram on FULL and CLEAN. Loss decreases monotonically without divergence, e.g., Skip-gram drops from ~18.51 M to ~37.3 K on FULL, confirming stable convergence (curves overlap).
The BERT fine-tuning model also demonstrated successful learning. The fine-tuning process reached its minimum validation loss at epoch 3 in both corpora. For instance, in the CLEAN corpus, the validation loss reached its minimum of 2.06 in the third epoch, showing that the model effectively adapted to the domain-specific data. The validation loss curve shows that the model generalised well without overfitting (Figure 3).
Figure 3.
BERT fine-tuning validation loss curve for (a) FULL and (b) CLEAN corpus.
These diagnostics confirm that both corpora and all models behaved as expected during training. The data preserved natural lexical structure, and the models converged stably, ensuring that subsequent evaluation results are based on well-formed and properly trained embeddings.
3.2. Model Stability and Seed Control
3.2.1. Model Stability
All experiments were conducted with fixed random seeds (42, 101, 387, 527, 777, 1234) to ensure reproducibility. These seeds govern model training (e.g., Word2Vec epochs, BERT fine-tuning), clustering (k-means/Agglomerative with n_init = 50, max_iter = 1000), and resampling (2000 bootstraps for ARI CIs; optional permutation tests). Evaluation emphasises the confirmatory physical versus conceptual hypothesis (active in config; binary split: conceptual = 0, physical = 1, for terms like ma = 0 vs. engawa = 1), as it benchmarks the recovery of grounded versus abstract distinctions. Figure 4 depicts ARI variability across seeds for baseline (plain) model k-means on the FULL and CLEAN corpora. Agglomerative and L2-scaled variants are not shown.
Figure 4.
ARI variation across six random seeds for the physical_vs_conceptual hypothesis, using baseline k-means, illustrates seed invariance for BERT and CBOW and instability for Skip-gram.
We quantify cross-seed stability using the mean across seeds, the sample standard deviation (SD), the 95% t-interval for the mean, and the observed range (minimum–maximum), with n = 6 seeds and df = 5. BERT was seed-invariant on both corpora: FULL ARI = 0.8520998174 (SD 0.0000; 95% CI [0.8520998174, 0.8520998174]; min–max: 0.8520998174 to 0.8520998174); CLEAN ARI = 0.7182341510 (SD 0.0000; 95% CI [0.7182341510, 0.7182341510]; min–max: 0.7182341510 to 0.7182341510). Under an independently versioned Torch and Transformers stack, BERT and CBOW reproduced the same ARI values and cluster assignments on both FULL and CLEAN. Skip-gram was identical across environments for a fixed seed, with variability confined to cross-seed differences (permutation p-values may differ slightly due to resampling).
CBOW was likewise seed-invariant and low: FULL ARI = 0.1325854156 and CLEAN ARI = 0.1325854156 for every seed (SD = 0.0000, min–max equal to the mean).
Skip-gram was materially seed-sensitive: FULL mean ARI = −0.1002356276 (SD 0.0225555476; 95% CI [−0.1239062232, −0.0765650321]; min–max: −0.10964219 to −0.0541947541); CLEAN mean ARI = −0.1078751940 (SD 0.0039660823; 95% CI [−0.1120367446, −0.1037136434]; min–max: −0.10964219 to −0.0997830566).
Cross-seed stability is summarised in Table 1 in Section 3.2.2. Complete per-seed results for all models and variants are in Appendix B.
Implementation note: BERT uses CLS-pooled last-hidden-layer embeddings from the multilingual-cased base model, selecting the model by validation loss. Differences for BERT and CBOW, therefore, reflect corpus filtering, not stochasticity.
3.2.2. Aggregates and Model Comparisons
Table 1 reports cross-seed aggregates for the baseline models on the physical vs. conceptual hypothesis. Formal model comparisons use paired tests across seeds, with one paired observation per seed. Because BERT and CBOW are seed-invariant, the paired differences are constant and strictly positive in every seed. Exact paired sign tests give p = 0.031 (two-sided, n = 6) for BERT > CBOW on both corpora. For BERT > Skip-gram, the per-seed differences are positive for every seed on both corpora, with a sign test p-value of 0.031. The probability of superiority is maximal in all three contrasts, with Cliff’s δ = 1.00. Mean paired differences are large: BERT–CBOW = 0.7195 (FULL) and 0.5856 (CLEAN); BERT–Skip-gram = 0.9523 (FULL) and 0.8261 (CLEAN). For completeness, CBOW > Skip-gram in every seed on both corpora (sign test p = 0.031, δ = 1.00). Repeated-measures ANOVA is inappropriate here because two groups have zero within-group variance; non-parametric paired tests are the appropriate analysis with the available per-seed replicates. A minimal sketch of these tests is given after Table 1.
Table 1.
Baseline plain aggregates (n = 6).
| Model | Corpus | Mean ARI | SD ARI | 95% CI | Min–Max | CV |
|---|---|---|---|---|---|---|
| BERT | Full | 0.8521 | 0.000 | [0.8521, 0.8521] | 0.8521, 0.8521 | 0.000 |
| BERT | Clean | 0.7182 | 0.000 | [0.7182, 0.7182] | 0.7182, 0.7182 | 0.000 |
| CBOW | Full | 0.1326 | 0.000 | [0.1326, 0.1326] | 0.1326, 0.1326 | 0.000 |
| CBOW | Clean | 0.1326 | 0.000 | [0.1326, 0.1326] | 0.1326, 0.1326 | 0.000 |
| SG | Full | −0.1002 | 0.023 | [−0.1239, −0.0766] | −0.1096, −0.0542 | 0.225 |
| SG | Clean | −0.1079 | 0.004 | [−0.1120, −0.1037] | −0.1096, −0.0998 | 0.037 |
SD = sample standard deviation across seeds; CI = 95% t-interval; Min–Max = observed range across seeds; CV = SD/|mean|.
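The paired tests reported above can be reproduced from the per-seed ARI values with a short script such as the following; the input arrays are placeholders for the archived per-seed results.

```python
# Post hoc paired comparisons across seeds: exact two-sided sign test (n = 6)
# and Cliff's delta. Input arrays are placeholders for the archived per-seed ARIs.
from math import comb

def exact_sign_test(diffs):
    """Two-sided exact sign test on paired differences (zeros are ignored)."""
    nonzero = [d for d in diffs if d != 0]
    n = len(nonzero)
    positives = sum(d > 0 for d in nonzero)
    tail = sum(comb(n, k) for k in range(min(positives, n - positives) + 1)) / 2 ** n
    return min(1.0, 2 * tail)

def cliffs_delta(x, y):
    """Dominance effect size: +1 when every value in x exceeds every value in y."""
    greater = sum(a > b for a in x for b in y)
    lesser = sum(a < b for a in x for b in y)
    return (greater - lesser) / (len(x) * len(y))

# With six strictly positive paired differences, the sign test gives
# 2 * (1/64) = 0.03125, matching the reported p = 0.031, and Cliff's delta is 1.00.
```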
3.3. Corpus Effect: FULL vs. CLEAN, a Definitional Audit (Physical vs. Conceptual)
This subsection uses a single pre-specified seed (777) for the figures only. Cross-seed stability is reported in Section 3.2. For BERT and CBOW, the ARI is identical across the six seeds within each corpus (per-corpus seed invariance), so the choice of seed does not affect the reported FULL values or the reported CLEAN values. The difference between corpora for BERT (0.8521 on FULL vs. 0.7182 on CLEAN) is a corpus effect, not a seed effect. Skip-gram is not seed-invariant; therefore, we include seed 777 for comparability and provide cross-seed statistics in Section 3.2.
BERT: On FULL, k-means and k-means + L2 both reach ARI = 0.8521 (bootstrap 95% CI [0.5723, 1.0000], permutation p < 0.001, N = 2000). On CLEAN, the corresponding value is 0.7182 (bootstrap 95% CI [0.3701, 1.0000], p < 0.001). Agglomerative + L2 behaves differently at k = 2; it yields ARI ≈ 0.3945 on FULL, but rises to ≈ 0.7182 on CLEAN, matching k-means. All three BERT rows are invariant across the six seeds; on FULL, k-means = k-means + L2 = 0.8521 for every seed and Agglo + L2 = 0.3945 for every seed, while on CLEAN, k-means = k-means + L2 = Agglo + L2 = 0.7182 for every seed.
Why does the FULL gap close on CLEAN? K-means partitions by proximity to two centroids, which align closely with the gold classes in these embeddings, hence 0.8521. Agglomerative linkage merges by pairwise structure; with definitional sentences present, within-class sub-modes are merged before the class boundary at k = 2, depressing ARI to ≈ 0.3945. Removing definitional sentences reduces that within-class structure, so the first split aligns with the class boundary and Agglo + L2 converges to ≈ 0.7182, the same level as k-means. This jump in score is a clustering criterion effect, not seed noise.
For the static baselines, CBOW maintains an ARI of approximately 0.133 on both corpora, showing minimal response to corpus cleaning. Skip-gram (base) remains below zero (−0.109 on FULL, −0.109 on CLEAN). Its normalised configuration (k-means + L2) notably improves to 0.476 on CLEAN, matching the Agglomerative + L2 variant. These behaviours reflect the interaction between metric normalisation and cluster structure, rather than changes in the underlying embeddings. The relative ordering between models remains unchanged (Figure 5).
Figure 5.
Definitional Audit: Bar plots of ARI scores for static baselines (CBOW, Skip-gram base/normalised) on FULL corpus (blue) and CLEAN corpus (orange), showing corpus cleaning effects while preserving model ordering.
Interpretation: The definitional audit lowers contextual scores relative to FULL yet preserves model ordering. FULL inflates alignment because of definitional redundancy; CLEAN provides a more conservative, still positive, estimate of the same separation. The paired bars in Figure 5 summarise these corpus-level adjustments.
Definitional-bias check (same hypothesis, corpus filtering): Under the valid physical vs. conceptual hypothesis, BERT with Agglomerative + L2 rises from ARI ≈ 0.3945 on FULL to ≈0.7182 on CLEAN, an absolute +0.3240 and a +82.07% relative increase, identical across all six seeds. This shows that cleaning removes definitional noise and that performance moves in the expected direction independently of any label tampering.
3.4. Seed Comparison (Normal ARI Only)
A qualitative inspection of the cluster assignments reveals the practical impact of the corpus audit and highlights the differing sensitivities of the clustering algorithms. With BERT under k-means, assignments on the FULL corpus align closely with the gold partition across all terms. On the CLEAN corpus, the same configuration yields only one “error”, with ikezuishi assigned to the conceptual cluster (Figure 6).
Figure 6.
BERT word-cluster assignments for FULL and CLEAN. FULL matches the gold split; on CLEAN, there is one mismatch, with ikezuishi assigned to the conceptual cluster.
In contrast, BERT paired with Agglomerative clustering demonstrated a dramatic improvement following the corpus audit. On the FULL corpus, the algorithm misclassified six physical terms as conceptual—engawa, shakkei, tomeishi, ikezuishi, shimenawa, and karesansui—resulting in a visibly poorer match to the gold labels (Figure 7), which explains the low ARI score of 0.3945.
Figure 7.
BERT clustering on the FULL corpus: k-means versus Agglomerative. K-means aligns closely with the gold labels; Agglomerative misclassifies engawa, shakkei, tomeishi, ikezuishi, shimenawa, and karesansui, explaining the lower ARI for that variant.
However, after cleaning, assignments converge on the k-means solution, with only minor residual disagreement (Figure 8). This marked improvement confirms the audit’s effectiveness, demonstrating that removing definitional sentences enabled the model to distinguish between categories more effectively based on contextual usage. This improvement is consistent with the rise in ARI (0.3945 → 0.7182) for this variant on the CLEAN dataset.
Figure 8.
BERT with Agglomerative clustering before and after the definitional audit. On CLEAN, assignments converge to the k-means solution and ARI rises from ~0.395 to ~0.718, confirming the audit’s effect.
In contrast, the static Word2Vec models do not recover a stable conceptual–physical split. They were largely unable to produce meaningful clusters. On FULL, the CBOW model, for example, failed to distinguish between the categories and collapsed toward a single dominant group under both clustering methods (Figure 9).
Figure 9.
Word2Vec CBOW clustering on the FULL corpus. Both base and Agglomerative configurations collapse towards a single group, failing to separate conceptual from physical terms.
Skip-gram, on the other hand, shows partial structure only under normalised variants: Agglomerative + L2 on FULL and k-means + L2 on CLEAN display visible two-way partitions (Figure 10). Still, these remain weaker than the contextual model and are configuration-dependent. In these specific conditions, each of the clustering models achieved a degree of separation that, while significantly inferior to BERT, was better than chance.
Figure 10.
Word2Vec Skip-gram with normalisation. Partial two-way separation is observed in Agglomerative + L2 on FULL and in k-means + L2 on CLEAN, yet its performance remains weaker and configuration-dependent compared to BERT.
These figures document the cluster memberships used in the quantitative results: BERT reproduces the reference partition with near-perfect accuracy under k-means and improved agreement under Agglomerative after cleaning, while static baselines fail on the full corpus and show only limited, variant-specific separation on the clean corpus. One detail worth noting is that “wabi_sabi” was excluded from all clustering runs in both corpora. The immediate cause was a preprocessing mismatch: the script expects a normalization_mapping key in filters.yaml, but the filters file defines a normalisation key, so no rule ever collapsed “wabi-sabi” or “wabi sabi” to “wabi_sabi”. In addition, the token whitelist allows hyphens but not underscores, so underscores are stripped during token cleaning. As a result, Word2Vec never saw a single “wabi_sabi” token, and the clustering step had nothing to place for that key. By contrast, the BERT pathway still produced a vector because it gathers any sentence that literally contains the target string, encodes those sentences with WordPiece, and averages final-layer token vectors across contexts; this survives out-of-vocabulary surface forms and brittle normalisation, a small but telling case where the contextual model carries the analysis.
3.5. Hypothesis Sensitivity: A Negative Control Test (Tokonoma as Conceptual)
We conducted a negative control test to confirm that the models were not simply fitting noise or overfitting to corpus regularities. In this test, tokonoma, a clearly physical architectural feature, was intentionally reclassified as conceptual, introducing a controlled contradiction in the gold labels. The objective was to evaluate whether the models’ internal representations would respond appropriately to this falsified hypothesis. Across the full corpus, the BERT configurations consistently declined in performance compared to the valid hypothesis. The base and k-means variants dropped from 0.852 to 0.719, while the Agglomerative variant decreased to 0.304. The same trend was visible on the CLEAN corpus, where scores fell from 0.718 to 0.597 across BERT variants.
In contrast, static models, such as Word2Vec, showed no coherent reaction to the relabelling. CBOW maintained a low ARI of 0.100, while Skip-gram fluctuated modestly but remained far below the contextual model’s sensitivity range. These results confirm that only the contextual embeddings respond predictably to conceptual mislabelling, while static embeddings lack such semantic discrimination.
The controlled drop across BERT variants demonstrates that their strong performance in earlier tasks was not a statistical artefact but reflected a learned semantic organisation aligned with the valid physical–conceptual distinction (Figure 11).
Figure 11.
Negative control: relabelling tokonoma as conceptual. ARI decreases for BERT on both FULL and CLEAN relative to the valid hypothesis, demonstrating sensitivity to theoretical inconsistency; static models show no coherent response.
3.6. Baseline Context: Co-Occurrence Analysis
Due to the large amount of data, this section presents a sample of the results of this analysis. The full dataset is available in the online repository. All reporting is from the seed 777 pipeline and the 10-token window. Across targets, CLEAN retains the same high-frequency neighbours as FULL but with altered weights once definitional strings are removed. For ma, the top neighbours on FULL are “space” (26), “ishi” (12), “experience” (12), “ai” (11), “mean” (10), “concept” (10), “japanese” (9), “room” (8), “moment” (8), and “dance” (8). On CLEAN, “space” rises to 29, “japanese” and “concept” both register 15, followed by “ishi” (12), “ai” (12), and “experience” (12), with the tail comprising “literally” (11), “room” (10), “mean” (9), and “design” (8).
For wabi, FULL shows “aesthetic” (44), “japanese” (35), “beauty” (20), “design” (18), “tea” (16), and “style” (15), then “world” (12), “art” (12), “ideal” (11), and “sense” (11). CLEAN increases the lead terms “aesthetic” (46) and “japanese” (36), then “design” (20), “beauty” (18), “tea” (17), and “style” (15), with the same four items at 12–11 counts in the tail. Sabi shows the same pairwise pattern: FULL lists “aesthetic” (44), “japanese” (35), “beauty” (20), and “design” (18), then “style/world/ideal” (10–11). CLEAN retains those items but admits “imperfection” (12) and “object” (10) into the top ten.
Physical terms remain materially anchored. sandō on FULL surfaces “rear” (11), “entrance” (11), “omote” (9), “main” (9), “approach” (8), “shrine” (8), “ura” (8), “road” (5), and “komainu” (5); CLEAN reproduces the same set with near-identical counts. Engawa keeps “space” (13) and “house” (9) at the top on FULL, followed by “outside/japanese/traditional” (7) and “veranda” (6); CLEAN shifts weight toward use and setting with “use”, “building” and “yard” appearing in the top ten at 5–6 counts. For byōbu, FULL lists “panel” (8), “screen” (6), “painting” (5), “produce” (5), and “period” (5); CLEAN keeps these and changes only lower-rank items by one or two counts.
Tokonoma remains consistent across corpora, with the top items being “room” (14), “alcove” (13), “display” (7), “recess” (7), and “tea” (7). CLEAN retains that order and adds construction and furnishing terms among the lower ranks (“build”, “tatami”, and “scroll”, each 4–5). shōji is strongly material in both corpora: FULL shows “paper” (43), “slide” (40), “use” (33), “door” (29), “glass” (22), and “screen/panel” (18/18), with CLEAN preserving the same sequence and small count adjustments. roji is sparse but stable; the CLEAN file records “yard” (4), “tea” (3), “development” (3), “house” (2), and a small set of alternants at 2, while FULL presents the same items with the same or adjacent counts.
The CLEAN outputs preserve each target’s nearest lexical neighbourhood while modestly reweighting items typical of definitional prose. Figure 12 below visualises the FULL and CLEAN distribution bar plots. The rest of the per-seed plots, together with extended visualisations and summary statistics, are available in the public repository referenced in the Data Availability Statement.
Figure 12.
Comparison of top 10 lexical neighbours for representative terms in FULL versus CLEAN. Conceptual items (e.g., ma, wabi, sabi) show stable neighbourhoods with reweighted counts after removing definitional sentences; physical items (e.g., sandō, engawa, byōbu) remain materially anchored.
3.7. External Validation on WordSim353
Each model was evaluated on the WordSim-353 benchmark to test whether the observed structural behaviour extended to general semantic understanding. This dataset comprises 353 English word pairs, each rated by human participants for semantic similarity. For every model, we compared the cosine similarities between the embeddings of each word pair with the human scores using Spearman’s rank correlation coefficient (ρ).
Across all runs, contextual models aligned better with human judgements than static ones. BERT (base) achieved ρ = 0.294 (p < 0.001), while BERT (fine-tuned) improved to ρ = 0.326 on the CLEAN corpus and ρ = 0.343 on the FULL corpus. These gains confirm that fine-tuning contributed to slightly stronger human-model agreement (Figure 13). It also suggests the possibility of training these models for improved alignment.
Figure 13.
WordSim-353 external validation for BERT. Spearman correlations for base versus fine-tuned BERT, with fine-tuning improving results to ρ ≈ 0.326 on CLEAN and ρ ≈ 0.343 on FULL.
Among the static models, Word2Vec Skip-gram reached the highest correlation (ρ ≈ 0.336 FULL, ρ ≈ 0.334 CLEAN), while Word2Vec CBOW produced weaker results (ρ ≈ 0.170 FULL, ρ ≈ 0.189 CLEAN). Static models thus remain generally inferior to contextual embeddings in semantic discrimination, although the Skip-gram over CBOW ordering on this benchmark reverses their ordering on the clustering task (Figure 14).
Figure 14.
WordSim-353 human similarity (normalised) versus Word2Vec Skip-Gram cosine similarity for the FULL and CLEAN corpora. Each point is a word pair and the dashed diagonal line marks ideal agreement: (a) Skip-Gram; (b) CBOW.
On WordSim-353, BERT (base) achieves ρ = 0.294 (p < 0.001); BERT (fine-tuned) reaches ρ = 0.326 on CLEAN and ρ = 0.343 on FULL. Skip-gram is comparable at ρ ≈ 0.334 on CLEAN and ρ ≈ 0.336 on FULL, while CBOW is weaker at ρ ≈ 0.170 on CLEAN and ρ ≈ 0.189 on FULL. These results indicate moderate agreement for contextual models and only weak agreement for CBOW on this benchmark. On the primary clustering endpoint, BERT remains more accurate and seed-invariant, while CBOW is invariant but low, and Skip-gram is seed-sensitive.
4. Discussion
This study set out to evaluate the efficacy and reproducibility of a computational framework designed to analyse architectural semantics, using Japanese architectural concepts as a test case. By comparing contextual language models (BERT) against static baselines (Word2Vec) under different corpus conditions (FULL vs. CLEAN), the research assessed how well these models could recover a known theoretical distinction between conceptual and physical terms. Ultimately, the findings validate the framework’s utility for humanistic inquiry, underscore the necessity of contextual models for nuanced semantic tasks, and, perhaps most importantly, highlight the critical impact of data preprocessing through a ‘definitional audit’.
4.1. Contextual Embeddings Surpass Static Models in Capturing Architectural Semantics
The results make it clear that contextual embeddings are superior for this specific task, supporting Hypothesis 1. Pipeline diagnostics confirmed that both corpora exhibited natural language properties, including Zipf-like distributions (α ≈ 1.36, R2 ≈ 0.97), and that all models achieved stable convergence during training. These diagnoses ensure that any performance differences seen later on truly originated from model architecture and data, rather than mere training artefacts. Across six fixed random seeds, the fine-tuned BERT model produced statistically significant Adjusted Rand Index (ARI) scores of 0.8521 on FULL (95% CI [0.5723, 1.0000]; permutation p ≈ 6.53 × 10−14) and 0.7182 on CLEAN (95% CI [0.3701, 1.0000]; permutation p ≈ 6.98 × 10−6). Using ARI, a chance-corrected metric, provides a robust measure of clustering quality; scores well above zero indicate genuine alignment with the theoretical classes, not incidental overlap. Crucially, there was zero variation across seeds (BERT ΔARI = 0), confirming the reproducibility of BERT’s advantage.
We ran paired sign tests across six seeds, treating each seed as a paired replicate; BERT > CBOW and BERT > Skip-gram were positive for every seed on both corpora (exact sign test p = 0.031; Cliff’s δ = 1.00). We report these as post hoc tests that confirm the descriptive results. Using the six seeds as paired replicates and restricting all models to the same clustering setup (KMeans + L2), BERT’s ARIs exceeded those of CBOW and Skip-gram in six of six seedwise comparisons on both corpora, as read directly from the per-seed ARI tables in the CAL.xlsx master sheet, which aggregates the complete ARI scores from all six seed runs; the main pipeline log did not compute these cross-seed comparisons. The “BERT–Skip-gram = 0.95 on FULL” figure is the mean of the six seedwise differences. These statistics were computed directly from the six seedwise differences and are presented as post hoc confirmatory checks rather than as outputs of the main pipeline. This pattern aligns with the view that contextual models better capture polysemy in specialised discourse. Models shared the exact source text and evaluation labels (Word2Vec pipeline: 52,661 sentences, 623,058 tokens; BERT consumed the same sentence corpus), so the observed gaps are unlikely to reflect corpus or labelling artefacts. The BERT versus Word2Vec ranking is unchanged from FULL to CLEAN and persists under the negative control.
In stark contrast, the static Word2Vec models proved utterly inadequate. CBOW, for instance, achieved near-chance levels with non-significant results on both corpora: ARI = 0.133 on FULL (95% CI [0.0000, 0.5088]; permutation p-value: 0.2598) and CLEAN (95% CI [0.0000, 0.5088], p-value: 0.2598). Skip-gram performed worse still, yielding negative ARIs on average (−0.100 on FULL, −0.108 on CLEAN; 95% CI on FULL [−0.1254, 0.0321]; permutation p-value: 0.2796), suggesting that its clustering was functionally random with occasional inversions relative to the gold standard. These models assign a single, fixed vector to each word, a severe limitation on their ability to distinguish context-dependent meanings. While specific configurations involving normalisation and alternative clustering algorithms slightly improved Skip-gram’s performance on the CLEAN corpus (ARI ≈ 0.4761), these gains were wildly inconsistent and never truly challenged BERT’s apparent superiority. Our findings highlight the task-dependent limitations of static embeddings, a finding that diverges from the approach seen in Hengchen et al. (2021) [36]. While Hengchen et al. successfully employed static Word2Vec models to explore broad diachronic shifts in vocabulary associated with the concept of ‘nation’ across large, general historical newspaper corpora, evaluating plausibility through historical interpretation, our study demanded fine-grained synchronic classification within a specialised architectural domain, evaluated quantitatively against a theoretical gold standard. In our context, which demanded high semantic precision, static models (CBOW and Skip-gram) decisively failed, performing near or below chance. This failure strongly suggests that while static embeddings may be pragmatically sufficient for identifying large-scale associative trends in general historical texts, they lack the necessary sensitivity to accurately replicate nuanced, theoretically defined conceptual distinctions within specialised discourse. Put simply, the failure of CBOW and Skip-gram here reinforces the absolute need for context-sensitive models like BERT when the analytical task demands precise semantic differentiation.
However, the seed-level analysis reveals a further finding regarding algorithmic and seed-level stability that reinforces BERT’s suitability. BERT was consistent across all six random seeds (primary ARI: 0.7182, negative control: 0.5973 on CLEAN), and this stability extended to the clustering algorithms after cleaning. The log’s ‘Definitional Bias Check’ illustrates this: an alternative BERT [Agglo + L2] variant scored only 0.3945 on the FULL corpus, but after cleaning its score rose by 82.07% to 0.7182 on the CLEAN corpus, matching the primary model. This confirms that removing definitional sentences was critical for algorithmic robustness. The stability contrasts sharply with the static models: despite their poor averages, the normalised Skip-gram variants produced two high-scoring seed-level outliers. Skip-gram [Agglo + L2] with seed 387 reached an ARI of 0.8477 on the primary task and 0.7099 on the negative control, outperforming BERT, while Skip-gram [KMeans + L2] with seed 1234 reached 0.7106 (primary) and 0.5855 (negative control), closely matching BERT. This erratic behaviour highlights Skip-gram’s fundamental lack of stability; the spikes are not reproducible across seeds or variants and do not overturn the paired comparison with BERT. These “successes” are best understood as stochastic artefacts of initialisation and clustering choice rather than as reliable semantic modelling. The seed-level results underscore the methodological importance of the multi-seed evaluations emphasised in Section 2.5: a researcher encountering only a high-performing seed might erroneously conclude that static models are viable. Our multi-seed protocol identifies these as unrepresentative outliers, affirming that consistent, reproducible performance, not occasional peaks, is the critical metric for a scientifically sound validation framework. We do not dismiss the need for further investigation into these spikes, but BERT’s stability establishes it as the more trustworthy tool for this analysis.
4.2. The Definitional Audit: Mitigating Shortcut Learning and Revealing Latent Structure
One of our central methodological investigations was the ‘definitional audit’, which compared models trained on the FULL corpus with models trained on the CLEAN corpus (dictionary-like sentences removed). The comparison showed that explicit definitions, although they inflate performance metrics, can produce brittle models that rely on superficial cues, a phenomenon known as ‘shortcut learning’. Removing definitions caused a modest drop in BERT’s ARI (from 0.8521 to 0.718), but the fundamental conceptual–physical distinction persisted, satisfying Hypothesis 3 (H3) (ΔARI ≤ 0.20). The drop indicates that BERT was learning latent semantic relationships from contextual usage rather than relying solely on explicit definitions.
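As an illustration of the audit logic only, the sketch below pairs a hypothetical surface-cue heuristic for flagging dictionary-like sentences with the H3 acceptance check (ΔARI ≤ 0.20). The study’s actual CLEAN filter is defined in the released pipeline; the cue list and example sentences here are assumptions.

```python
# Illustrative sketch only: a simple surface-cue heuristic for flagging
# glossary-style sentences (hypothetical; the paper's real CLEAN filter lives
# in the released pipeline), followed by the H3 check (ΔARI ≤ 0.20).
import re

DEFINITIONAL_CUES = re.compile(
    r"\b(is defined as|refers to|means|is a term for|can be defined)\b",
    re.IGNORECASE,
)

def is_definitional(sentence: str) -> bool:
    """Flag dictionary-like sentences using surface cues (hypothetical heuristic)."""
    return bool(DEFINITIONAL_CUES.search(sentence))

sentences = [
    "Ma is defined as the interval between structural elements.",        # flagged
    "The engawa wrapped the south side of the house, catching the sun.",  # kept
]
clean_corpus = [s for s in sentences if not is_definitional(s)]

# H3: the conceptual-physical split should survive cleaning (ΔARI ≤ 0.20).
ari_full, ari_clean = 0.852, 0.718  # values reported in the text
assert ari_full - ari_clean <= 0.20, "H3 violated: cleaning erased the distinction"
print(f"kept {len(clean_corpus)} of {len(sentences)} sentences; ΔARI = {ari_full - ari_clean:.3f}")
```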
On the FULL corpus, Agglomerative clustering underperforms (ARI 0.394) while k-means achieves a high ARI (0.852); on the CLEAN corpus, both methods converge at 0.718. This pattern suggests linkage-based fragility under the FULL distribution, likely caused by densification from definitional sentences that benefits centroid-based k-means but hampers hierarchical linkage, rather than by a wholesale distortion of the BERT embedding space. The behaviour aligns with concerns raised by Rogers et al. (2021) regarding BERT’s reliance on surface patterns and potential brittleness [37]. Once the definitional anchors were removed, performance on the CLEAN corpus, though slightly lower (ARI 0.718), became robust across both k-means and Agglomerative clustering. This stability indicates that BERT learned the latent conceptual distinction from the remaining contextual patterns in the CLEAN discourse. While the drop confirms some reliance on surface patterns, the results show that BERT’s pattern-learning capabilities can capture genuine, stable semantic structures when strong, misleading cues are absent, reinforcing the CLEAN corpus as the more trustworthy baseline. Cleaning also slightly improved the normalised Skip-gram variant (ARI ≈ 0.476), confirming that noise reduction benefits static models, but it did not overcome their fundamental inadequacy for this task.
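The two clustering variants discussed above can be compared with a few lines of scikit-learn. The sketch below assumes L2 normalisation of the term embeddings before clustering and Ward linkage for the Agglomerative variant (the linkage choice is an assumption); the embeddings and labels are random stand-ins, not the study’s vectors.

```python
# Minimal sketch of the two clustering variants: k-means and Agglomerative
# clustering run on L2-normalised term embeddings, each scored with ARI.
# `embeddings` and `gold` are stand-ins for the 28 term vectors and gold labels.
import numpy as np
from sklearn.preprocessing import normalize
from sklearn.cluster import KMeans, AgglomerativeClustering
from sklearn.metrics import adjusted_rand_score

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(28, 768))   # stand-in for BERT term embeddings
gold = np.array([0] * 10 + [1] * 18)      # stand-in conceptual/physical labels

X = normalize(embeddings)                 # L2 normalisation before clustering

km = KMeans(n_clusters=2, n_init=10, random_state=777).fit_predict(X)
ag = AgglomerativeClustering(n_clusters=2, linkage="ward").fit_predict(X)

print("KMeans + L2 ARI:", round(adjusted_rand_score(gold, km), 3))
print("Agglo  + L2 ARI:", round(adjusted_rand_score(gold, ag), 3))
```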
4.3. Framework Validation: Sensitivity, Reproducibility, and External Benchmarking
We then further validated the framework’s ability to serve as a tool for humanistic inquiry through sensitivity testing and external benchmarking. The negative control test, which intentionally mislabelled ‘tokonoma’ as ‘conceptual’, demonstrated BERT’s semantic sensitivity: ARI declined as expected (e.g., from 0.718 to 0.597 on CLEAN), indicating that the model registered the inconsistency. Static models showed no coherent response, confirming their inability to discriminate semantics. This capacity to computationally probe and react to theoretical manipulations is crucial for using embeddings in fields such as digital conceptual history [38]. Furthermore, reporting ARI, the chance-corrected metric recommended by Hubert and Arabie (1985), alongside confidence intervals and permutation tests strengthens the statistical validity and reproducibility of the findings [32].
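A minimal sketch of the negative-control probe follows: one physical term (tokonoma) is deliberately relabelled as conceptual and ARI is recomputed against the same cluster assignments. The term list and cluster outputs are illustrative, not the study’s data.

```python
# Minimal sketch of the negative-control probe: relabel 'tokonoma' as conceptual
# in the gold standard and recompute ARI against unchanged cluster assignments.
# A semantically sensitive model should lose ARI under the contradiction.
import numpy as np
from sklearn.metrics import adjusted_rand_score

terms = ["ma", "mu", "wabi", "sabi", "tokonoma", "engawa", "torii", "shoji"]
gold = np.array([0, 0, 0, 0, 1, 1, 1, 1])      # toy labels: 0 conceptual, 1 physical
clusters = np.array([0, 0, 0, 0, 1, 1, 1, 1])  # hypothetical model clustering

negative_control = gold.copy()
negative_control[terms.index("tokonoma")] = 0  # mislabel tokonoma as conceptual

print("primary ARI:         ", adjusted_rand_score(gold, clusters))
print("negative-control ARI:", adjusted_rand_score(negative_control, clusters))
```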
External validation on WordSim-353 placed the model’s performance in a broader semantic context. Fine-tuned BERT is moderately correlated with human judgements (ρ ≈ 0.343 on FULL), suggesting that its learned space generalises to some extent beyond the architectural domain. Interpreting this result nevertheless requires caution: as Hill et al. (2015) [39] argue, WordSim-353 conflates genuine similarity with mere association. Skip-gram’s surprisingly competitive performance on this benchmark (ρ ≈ 0.336), despite its poor performance on the primary clustering task, most likely reflects WordSim-353’s tendency to reward co-occurrence-based associations, which static models capture relatively well [39]. The WordSim results do not contradict the main in-domain finding that contextual models handle conceptual distinctions more effectively. Because WordSim-353 mixes association with similarity, it is only a partial probe of the semantic competence we evaluate; future work may prefer similarity-focused datasets such as SimLex-999.
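The external benchmark step reduces to a Spearman correlation between model cosine similarities and the human ratings in WordSim-353. The sketch below, with placeholder word pairs, ratings, and random vectors, illustrates the computation.

```python
# Minimal sketch of the external benchmark: Spearman's rho between model cosine
# similarities and WordSim-353 human ratings. Pairs, scores, and vectors are
# placeholders; the real benchmark provides 353 scored word pairs.
import numpy as np
from scipy.stats import spearmanr

def cosine(u: np.ndarray, v: np.ndarray) -> float:
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

rng = np.random.default_rng(1)
# word -> embedding lookup (random vectors standing in for model output)
vocab = {w: rng.normal(size=300)
         for w in ["car", "automobile", "coast", "shore", "cup", "article"]}

pairs = [("car", "automobile", 9.0), ("coast", "shore", 9.1), ("cup", "article", 2.4)]
model_sims = [cosine(vocab[a], vocab[b]) for a, b, _ in pairs]
human_sims = [score for _, _, score in pairs]

rho, p = spearmanr(model_sims, human_sims)
print(f"Spearman rho = {rho:.3f} (p = {p:.3f})")
```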
Finally, the co-occurrence analysis provided qualitative support, showing that removing definitions reweighted local lexical neighbourhoods but preserved the core semantic associations for conceptual (ma, wabi) and physical (sandō, engawa) terms. The analysis also yielded clear and distinct contexts for the key aesthetic principles. The term “Ma” (間), for example, showed strong co-occurrence with words such as “space (29 times)”, “concept (10)”, and “experience (12)”, confirming its role as a fundamental Japanese concept of spatiality, discussed in both physical and experiential terms.
Likewise, the concept of wabi (侘) was strongly associated with “aesthetic (46 times)”, “Japanese (37)”, “beauty (20)”, and “design (20)”. Sabi (寂) showed similar results in the CLEAN corpus—“aesthetic (45)”, “Japanese (36)”, “design (20)”, and “beauty (18)”. The analysis also highlighted their distinct nuances: wabi co-occurred with “tea (17)” and “sense (11)”, linking it to the rustic simplicity of the Japanese tea ceremony, while sabi was uniquely associated with “imperfection (12)” and “object (10)”, pointing to the beauty found in age and the passage of time.
In sharp contrast, the top co-occurring words for physical terms relate to concrete objects and spatial arrangements. The term sandō (a formal path) most frequently appeared with “rear (11 times)”, “entrance (11)”, “omote (main facade, 9)”, and “main (9)”. These distinct lexical associations are observed consistently across the terms in each category.
Furthermore, the analysis clarified the meanings of more concrete architectural terms. Engawa (縁側) was linked to “space (13)” and “house (9)”, identifying it as a transitional space, while byōbu (屏風) was associated with “panel (8)” and “screen (6)”, defining its function as a room divider. These results reinforce the idea that the CLEAN corpus retains the essential contextual information needed for semantic differentiation, consistent with usage-based accounts in which meaning derives from patterns of use. This baseline analysis provides a data-grounded foundation for more in-depth explorations in future studies.
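The co-occurrence counts reported above can be reproduced with a simple sentence-level counter. The sketch below assumes whole-word tokenisation and a small illustrative stop-word list; the sentences are invented examples, not corpus extracts.

```python
# Minimal sketch of sentence-level co-occurrence counting: count content words
# appearing in the same sentence as a target term. Sentences and the stop-word
# list are illustrative only.
from collections import Counter
import re

STOPWORDS = {"the", "a", "an", "of", "in", "is", "and", "as", "to"}

def cooccurrences(target: str, sentences: list[str]) -> Counter:
    """Count words co-occurring with `target` within each sentence."""
    counts = Counter()
    for sent in sentences:
        tokens = re.findall(r"[a-z]+", sent.lower())
        if target in tokens:
            counts.update(t for t in tokens if t != target and t not in STOPWORDS)
    return counts

sentences = [
    "Ma is the space between elements and shapes the experience of a room.",
    "The concept of ma structures the experience of traditional space.",
]
print(cooccurrences("ma", sentences).most_common(5))
```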
4.4. Validating the Framework: Reproducibility, Sensitivity, and External Coherence
From the beginning, the framework’s design emphasised reproducibility and validation. Across seeds, BERT and CBOW show zero variance in ARI, while Skip-gram varies slightly (SD ≈ 0.023 on FULL, ≈0.004 on CLEAN). We therefore report seed variance as zero for BERT and CBOW and as small but non-zero for Skip-gram, rather than as a uniform ‘±0.02’.
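The cross-seed stability report amounts to the per-model standard deviation of ARI over the six seeds. The sketch below uses placeholder per-seed values chosen only to mimic the qualitative pattern described above.

```python
# Minimal sketch of the cross-seed stability report: per-model mean and standard
# deviation of ARI over the six seeds. The per-seed values are placeholders.
import numpy as np

seed_aris = {
    "BERT":      [0.718] * 6,                               # identical across seeds
    "CBOW":      [0.133] * 6,                               # identical across seeds
    "Skip-gram": [0.02, -0.11, 0.05, -0.10, 0.01, -0.08],   # varies by seed
}

for model, aris in seed_aris.items():
    print(f"{model:10s} mean ARI = {np.mean(aris):+.3f}, SD = {np.std(aris, ddof=1):.3f}")
```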
The negative control test (reclassifying tokonoma as conceptual) further validated the framework’s utility for probing theoretical claims. BERT responded sensitively and predictably to this controlled contradiction, with ARI scores decreasing but remaining significantly above chance, whereas static models showed no coherent reaction. This differential response confirms that BERT’s strong performance reflects a learned semantic organisation aligned with the valid conceptual distinction rather than a statistical artefact. The ability to computationally test, and potentially falsify, hypotheses aligns with the goals of digital conceptual history (Begriffsgeschichte) and cultural analytics, where embeddings are used to trace and compare conceptual structures.
External validation on WordSim-353 is only weakly consistent with H2: correlations are modest (BERT ρ ≈ 0.34 on FULL and ≈0.33 on CLEAN; Skip-gram ≈ 0.34 on both), and because WordSim-353 conflates similarity with association, it should be treated as a limited external check rather than a decisive test of conceptual understanding. WordSim-353 has been widely criticised for conflating genuine similarity (e.g., car/automobile) with mere association or relatedness; SimLex-999 was explicitly designed to address this by focusing solely on similarity. Given this distinction, BERT’s moderate correlation with WordSim-353 may reflect the benchmark’s mixed nature rather than a limitation of BERT itself; studies using SimLex-999 show that models often struggle precisely with highly associated but dissimilar pairs. Skip-gram’s relatively strong WordSim-353 score (ρ ≈ 0.336), set against its failure on the domain-specific clustering task, further highlights that success on general benchmarks does not guarantee suitability for specialised theoretical inquiries. The co-occurrence analysis confirmed that the core lexical neighbourhoods remained stable after cleaning, providing a qualitative baseline and indicating that the CLEAN model operates on relevant contextual information.
4.5. Limitations and Future Directions
While this study establishes a robust methodology, several limitations should be noted. The 1.98-million-word corpus, although carefully curated, is small compared with corpora typically used for pre-training, which may limit the richness of the learned embeddings. The study also did not explore other forms of robustness, such as sensitivity to paraphrasing or minor variations in input wording. Finally, although extensive measures were taken to ensure reproducibility within the same hardware-software stack, achieving byte-identical results across different machines remains a recognised challenge.
Future work will apply this validated CLEAN model framework to its intended purpose. The next immediate step is to use the framework on larger, multilingual architectural corpora, moving beyond the validation of known concepts. The goal is to discover novel, emergent semantic structures and conceptual shifts within architectural discourse over time, thereby contributing to a computationally informed Begriffsgeschichte for architecture.
5. Conclusions
This study designed, implemented, and rigorously validated a reproducible computational framework for analysing architectural semantics using Japanese architectural concepts. The research yielded several findings tied to the revised research questions and hypotheses, comparing contextual and static language models under controlled corpus conditions.
Regarding model differences (RQ1), the results demonstrated a stark contrast; fine-tuned BERT consistently and accurately captured the theoretical distinction between conceptual and physical terms. In contrast, static Word2Vec models (CBOW and Skip-gram) failed to produce meaningful or stable classifications.
Regarding task alignment (RQ2), performance on the general similarity benchmark (WordSim-353) did not reliably predict success on the theory-driven clustering objective (ARI). Fine-tuned BERT achieved a strong ARI against the gold standard (mean ARI ≈ 0.718 on the audited CLEAN corpus) but only a modest WordSim-353 correlation (ρ ≈ 0.343), while Skip-gram reached a similar WordSim-353 value (ρ ≈ 0.336) yet failed the primary ARI task. General lexical similarity appears to be a weak proxy for the domain objective in this setting.
The framework proved effective in testing a pre-existing theoretical claim (RQ3). BERT responded sensitively and predictably to a controlled mislabelling (tokonoma as conceptual), with performance decreasing but remaining statistically significant, while static models showed no coherent reaction, confirming the framework’s ability to probe theoretical validity.
Regarding data hygiene (RQ4), explicit definitional sentences distorted metrics and increased instability. On the full corpus, they inflated centroid-based scores (BERT k-means ≈ 0.852) but deflated hierarchical ones (Agglomerative + L2 ≈ 0.394). After removal, all BERT clustering variants converged near ≈ 0.718 on the CLEAN corpus, indicating that the dispersion across algorithms was an artefact of definitional leakage rather than model semantics.
These findings strongly support the study’s hypotheses:
- H1 was supported: BERT’s contextual logic aligned significantly better with the phenomenological classification than static Word2Vec models.
- H2 was mixed: Fine-tuned BERT led on the primary ARI objective, but WordSim-353 correlations were modest and not consistently higher than Word2Vec’s, indicating weak alignment between the benchmark and the domain task.
- H3 was qualified: After removing definitional sentences, BERT k-means and k-means + L2 dropped slightly (≈0.852 → ≈0.718), while Agglomerative + L2 rose sharply (≈0.394 → ≈0.718). Stability holds in the sense of post-clean convergence across algorithms, not in per-algorithm deltas.
This paper presents a transparent, reproducible, and validated methodology showing how carefully audited contextual embeddings can model nuanced semantic distinctions in specialised discourse. It provides a groundwork for integrating computational methods into architectural theory and history, and offers tools that augment, rather than replace, humanistic interpretation by enabling the quantitative analysis of conceptual structure in texts.
Author Contributions
Conceptualisation, G.G. and S.Y.; methodology, G.G.; software, G.G.; validation, G.G. and S.Y.; formal analysis, G.G.; investigation, G.G.; resources, G.G.; data curation, G.G.; writing—original draft preparation, G.G.; writing—review and editing, G.G.; visualisation, G.G.; supervision, S.Y.; project administration, G.G.; funding acquisition, S.Y. All authors have read and agreed to the published version of the manuscript.
Funding
The APC was funded by JSPS KAKENHI Grant Number JP23K04201 (“Co-Creation Between Deep Learning-Based Generative AI and Humans”) and the Institute of Disaster Mitigation for Urban Cultural Heritage, Ritsumeikan University. The funders had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Data Availability Statement
All materials necessary for replication, including the corpora (FULL and CLEAN), model configurations, YAML files, and complete results from multi-seed runs, are publicly available in the project’s repository at https://github.com/ArchiScrub/Reproducibility-and-Validation-of-a-Computational-Framework-for-Architectural-Semantics (accessed on 23 October 2025). The repository also includes scripts, logs, and documentation supporting the analyses and figures presented in this paper. Three machine-snapshot JSONs (S1–S3) documenting cross-environment runs are deposited with the paper and support independent verification of the claims above.
Acknowledgments
The computational analysis in this study was facilitated by the use of generative AI. Tools such as Google’s Gemini Pro, OpenAI’s GPT-4, and Grok were used in developing the Python pipeline and also served as an essential resource for understanding the underlying code. The authors reviewed and edited all outputs and take full responsibility for the published content.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| Adj | Adjective |
| adv | Adverb |
| Agglo | Agglomerative clustering |
| Ag + L2 | Agglomerative clustering with L2 normalisation |
| α (alpha) | Exponent parameter in Zipf’s law |
| ARI | Adjusted Rand Index |
| BERT | Bidirectional encoder representations from transformers |
| BLAS | Basic linear algebra subprograms |
| CBOW | Continuous bag of words |
| CI | Confidence interval |
| CLEAN | Filtered corpus version with definitional or glossary-like sentences removed |
| CPU | Central processing unit |
| CUDA | Compute unified device architecture |
| cv | Cross-validation folds |
| Δ (delta) | Difference or change between values (e.g., ΔARI) |
| FULL | Unfiltered corpus version including definitional sentences |
| GPU | Graphics processing unit (appears in methods and hardware setup) |
| H | Hypothesis |
| hs | Hierarchical softmax (Word2Vec training parameter) |
| K | Number of clusters (in k-means) |
| KM + L2 | K-means with L2 normalisation |
| LLM | Large language model |
| log-log | Logarithmic–logarithmic (used in Zipf distribution plotting) |
| max_iter | Maximum number of iterations (algorithm parameter) |
| min_count | Minimum frequency threshold for word inclusion |
| negative | Number of negative samples (Word2Vec parameter) |
| NLP | Natural language processing |
| p_perm | Permutation test probability |
| p-value | Probability value |
| propn | Proper noun (part-of-speech tag) |
| ρ (rho) | Spearman’s rank correlation coefficient |
| R2 | Coefficient of determination |
| RQ | Research question |
| SD | Standard deviation |
| SG | Skip-gram |
| SHA-256 | Secure hash algorithm 256-bit (used for artefact hashing) |
| TF32 | TensorFloat-32 (reduced-precision floating-point format used in NVIDIA GPUs) |
| W2V | Word2Vec |
| Word2Vec | Word to vector (static word-embedding model) |
| WordSim | WordSim-353 (semantic similarity benchmark dataset) |
| YAML | Human-readable data serialisation language |
| Zipf | Zipf’s law (used when describing corpus distributions) |
Appendix A
Appendix A provides a complete glossary of the 28 target terms examined in this study. Each entry offers a concise dictionary definition to support conceptual clarity and precision. No standard dictionary definitions were available for ‘ikezuishi’ and ‘tomeishi’.
- Aware (哀れ): (n) pity; sorrow; grief; misery; compassion; pathos; (adj-na) pitiable; pitiful; pathetic; miserable [40].
- Byōbu (屏風): Folding screen [40].
- Chashitsu (茶室): Tea arbour; tearoom [40].
- Chigaidana (違い棚): Set of staggered shelves [40].
- Doma (土間): Dirt floor; pit; parterre [40].
- En (縁): (n) fate; destiny (esp. as a mysterious force that binds two people together); relationship (e.g., between two people); bond; link; connection; family ties; affinity; opportunity; chance (to meet someone and start a relationship) [40].
- Engawa (縁側): (n) veranda; porch; balcony; open corridor [40].
- Fusuma (襖): (n) fusuma (Japanese sliding screen) [40].
- Genkan (玄関): (n, adj-no) entranceway; entry hall; vestibule; foyer; entryway; mud room [40].
- Haiden (拝殿): (n) front shrine; hall of worship [40].
- Hisashi (廂): Eaves of a roof; a “pent roof” that extends from the main roof over a gallery [40].
- Honden (本殿): (n) main shrine; inner sanctuary [40].
- Ikezuishi (いけず石/行けず石): “Ikezu” means “malicious” in the Kyoto dialect, and “ishi” means “stone” in Japanese [41].
- Karesansui (枯山水): (n) traditional (Chinese or Japanese) dry landscape garden [40].
- Ma (間): (n-adv, n) space (between); gap; interval; distance; (n-adv, n-t) time (between); pause; break; (n-adv, n, n-t) span (temporal or spatial); stretch; period (while); (n-adv, n) relationship (between, among); (n) space; room; time; pause; (n, suf) interval; period of time; among; between; inter- [40].
- Mu (無): (n) nothing; naught; nought; nil; zero [40].
- Roji (路地): (n) alley; alleyway; lane; bare earth (i.e., ground not covered by a roof); teahouse garden; path through a gate (or through a garden, etc.) [40].
- Sabi (寂): (n) patina; antique look; elegant simplicity [40].
- Sandō (参道): (n) road approaching a shrine [40].
- Shakkei (借景): (n) borrowed scenery (incorporation of a landscape element outside a garden in its design) [40].
- Shimenawa (注連縄): (n) (Shinto) rope used to cordon off consecrated areas or as a talisman against evil [40].
- Shōji (障子): (n) shōji (paper sliding door) [40].
- Tomeishi (止石): (n) stone [40]; “…tome ishi (stop stone) or, to use its more descriptive name, a sekimori ishi (boundary-guard stone)” [42]; “Tomeishi, also known as sekimori ishi ‘boundary-guard stone’, … gently signify that a particular path or entryway within a garden is closed.” [43].
- Tokonoma (床の間): (n) tokonoma (alcove where art or flowers are displayed) [40].
- Torii (鳥居): (n) torii (Shinto shrine archway) [40].
- Tsuboniwa (坪庭): (n) inner garden (esp. small, traditional); courtyard [40].
- Wabi (侘): (n) the beauty to be found in poverty and simplicity; subdued taste; quiet refinement; sober refinement; wabi [40].
- Wabi-Sabi (わびさび/侘寂): (n) aesthetic sense in Japanese art emphasising quiet simplicity and subdued refinement [40].
Appendix B
Appendix B contains supplementary material necessary for a transparency and reproducibility check. It presents the full results table for Seed 777, showing ARI scores together with permutation test probabilities (P_Perm), 95% confidence interval bounds (CI_Lower, CI_Upper), and corresponding p-values for each model and corpus condition. These data expand upon the summary statistics discussed in the main text, allowing for a detailed comparison of model performance under identical experimental settings.
Table A1.
Complete clustering results for Seed 777. Columns report the corpus type (FULL or CLEAN), model (BERT, CBOW, Skip-gram), Adjusted Rand Index (ARI), permutation test probability (P_Perm), lower and upper bounds of the 95% confidence interval (CI_Lower, CI_Upper), and the associated p-value (p_Value).
| Seed 777 | |||||||
|---|---|---|---|---|---|---|---|
| Corpus | Model | Hypothesis | ARI | P_Perm | CI_Lower | CI_Upper | p_Value |
| FULL | W2V CBOW | con._vs_phy. | 0.132585416 | 0.2598 | 0 | 0.508840661 | 0.307060332 |
| FULL | W2V Skip-gram | con._vs_phy. | −0.109394205 | 0.2796 | −0.125424368 | 0.032116788 | 0.006489031 |
| FULL | BERT (Final) | con._vs_phy. | 0.852099817 | 0 | 0.57232376 | 1 | 5.71 × 10−15 |
| FULL | W2V CBOW KM + L2 | con._vs_phy. | −0.018699911 | 0.7098 | −0.114285714 | 0.285594288 | 0.854551556 |
| FULL | W2V Skip-gram KM + L2 | con._vs_phy. | 0.376453488 | 0.0004 | 0.073033158 | 0.834421946 | 0.052603003 |
| FULL | BERT (Final) KM + L2 | con._vs_phy. | 0.852099817 | 0 | 0.57232376 | 1 | 5.71 × 10−15 |
| FULL | W2V CBOW [Ag + L2] | con._vs_phy. | −0.081279621 | 0.326 | −0.120836055 | 0.138372093 | 0.219000504 |
| FULL | W2V Skip-gram [Ag + L2] | con._vs_phy. | 0.476085405 | 0.0002 | 0.142857143 | 0.851841029 | 0.008481017 |
| FULL | BERT (Final) [Ag + L2] | con._vs_phy. | 0.394483911 | 0.0006 | 0.093766649 | 0.855024291 | 0.042220006 |
| FULL | W2V CBOW | tok._as_con. | 0.099571224 | 0.2984 | 0 | 0.404777704 | 0.334905373 |
| FULL | W2V Skip-gram | tok._as_con. | −0.091589488 | 0.2812 | −0.122379693 | 0.083550914 | 0.081254129 |
| FULL | BERT (Final) | tok._as_con. | 0.718918919 | 0 | 0.361617764 | 1 | 1.01 × 10−5 |
| FULL | W2V CBOW KM + L2 | tok._as_con. | −0.036315323 | 0.6646 | −0.109394205 | 0.206215876 | 0.651953193 |
| FULL | W2V Skip-gram KM + L2 | tok._as_con. | 0.284507863 | 0.0034 | 0.010784652 | 0.714819861 | 0.11316817 |
| FULL | BERT (Final) KM + L2 | tok._as_con. | 0.718918919 | 0 | 0.361617764 | 1 | 1.01 × 10−5 |
| FULL | W2V CBOW [Ag + L2] | tok._as_con. | −0.065908029 | 0.3374 | −0.120836055 | 0.131810193 | 0.306490605 |
| FULL | W2V Skip-gram [Ag + L2] | tok._as_con. | 0.373458393 | 0.0028 | 0.07685564 | 0.843009114 | 0.056031824 |
| FULL | BERT (Final) [Ag + L2] | tok._as_con. | 0.30369254 | 0.0052 | 0.045454545 | 0.724836912 | 0.079723998 |
| CLEAN | W2V CBOW | con._vs_phy. | 0.132585416 | 0.2598 | 0 | 0.508840661 | 0.307060332 |
| CLEAN | W2V Skip-gram | con._vs_phy. | −0.109394205 | 0.2796 | −0.125424368 | 0.032116788 | 0.006489031 |
| CLEAN | BERT (Final) | con._vs_phy. | 0.718233945 | 0 | 0.370054778 | 1 | 7.84 × 10−6 |
| CLEAN | W2V CBOW KM + L2 | con._vs_phy. | −0.036439743 | 0.3468 | −0.077740662 | 0.140406368 | 0.512593232 |
| CLEAN | W2V Skip-gram KM + L2 | con._vs_phy. | 0.476085405 | 0.0002 | 0.142857143 | 0.851711027 | 0.008468973 |
| CLEAN | BERT (Final) KM + L2 | con._vs_phy. | 0.718233945 | 0 | 0.370054778 | 1 | 7.84 × 10−6 |
| CLEAN | W2V CBOW [Ag + L2] | con._vs_phy. | −0.081279621 | 0.326 | −0.120836055 | 0.138372093 | 0.219000504 |
| CLEAN | W2V Skip-gram [Ag + L2] | con._vs_phy. | 0.025159288 | 0.6802 | −0.109394205 | 0.373722628 | 0.838242581 |
| CLEAN | BERT (Final) [Ag + L2] | con._vs_phy. | 0.718233945 | 0 | 0.382510867 | 1 | 5.13 × 10−6 |
| CLEAN | W2V CBOW | tok._as_con. | 0.099571224 | 0.2984 | 0 | 0.404777704 | 0.334905373 |
| CLEAN | W2V Skip-gram | tok._as_con. | −0.091589488 | 0.2812 | −0.122379693 | 0.083550914 | 0.081254129 |
| CLEAN | BERT (Final) | tok._as_con. | 0.597289696 | 0.0002 | 0.219732075 | 1 | 0.002693307 |
| CLEAN | W2V CBOW KM + L2 | tok._as_con. | −0.036641568 | 0.4808 | −0.065908029 | 0.140343527 | 0.486174224 |
| CLEAN | W2V Skip-gram KM + L2 | tok._as_con. | 0.373458393 | 0.0024 | 0.076308285 | 0.842888131 | 0.056168596 |
| CLEAN | BERT (Final) KM + L2 | tok._as_con. | 0.597289696 | 0.0002 | 0.219732075 | 1 | 0.002693307 |
| CLEAN | W2V CBOW [Ag + L2] | tok._as_con. | −0.065908029 | 0.3374 | −0.120836055 | 0.131810193 | 0.306490605 |
| CLEAN | W2V Skip-gram [Ag + L2] | tok._as_con. | −0.007075087 | 1 | −0.095429641 | 0.282115378 | 0.941440316 |
| CLEAN | BERT (Final) [Ag + L2] | tok._as_con. | 0.597289696 | 0.0002 | 0.223396226 | 1 | 0.002570718 |
References
- Kruft, H.-W. A History of Architectural Theory. Available online: https://www.scribd.com/document/376641663/KRUFT-Hanno-Walter-A-History-of-Architectural-Theory (accessed on 13 October 2025).
- Nesbitt, K. Theorizing a New Agenda for Architecture: An Anthology of Architectural Theory 1965–1995. 1996. Available online: https://www.scribd.com/document/728222156/Nesbitt-Kate-1996-Theorizing-a-New-Agenda-for-Architecture-an-Anthology-of-Architectural-Theory-1965–1995 (accessed on 13 October 2025).
- Eco, U. 2 Function and Sign: Semiotics of Architecture. In The City and the Sign; Gottdiener, M., Lagopoulos, A.P., Eds.; Columbia University Press: New York, NY, USA, 1986; pp. 55–86. ISBN 978-0-231-89254-4. [Google Scholar]
- Panofsky, E. Meaning in the Visual Arts; Doubleday: New York, NY, USA, 1955. [Google Scholar]
- Norberg-Schulz, C. Genius Loci Towards a Phenomenology of Architecture; Rizzoli: New York, NY, USA, 1980; ISBN 978-0-8478-0287-6. [Google Scholar]
- Pallasmaa, J. The Eyes of the Skin: Architecture and the Senses, 1st ed.; Wiley: Hoboken, NJ, USA, 2024; ISBN 978-1-394-20067-2. [Google Scholar]
- Bachelard, G. The Poetics of Space, 1994th ed.; Beacon Press: Boston, MA, USA, 1994; ISBN 0-8070-6473-4. [Google Scholar]
- Venturi, R. Complexity and Contradiction in Architecture. With an Introd. by Vincent Scully; Doubleday: New York, NY, USA, 1966. [Google Scholar]
- Rossi, A. The Architecture of the City; MIT Press Mass: Cambridge, MA, USA, 1982; ISBN 978-0-262-18101-3. [Google Scholar]
- Armstrong, H. Interpreting Landscapes/Places/Architecture: The Place for Hermeneutics in Design Theory and Practice. Archit. Theory Rev. 2003, 8, 63–79. [Google Scholar] [CrossRef]
- Medway, P. Virtual and Material Buildings: Construction and Constructivism in Architecture and Writing. Writ. Commun. 1996, 13, 473–514. [Google Scholar] [CrossRef]
- Cash, P.J. Developing Theory-Driven Design Research. Des. Stud. 2018, 56, 84–119. [Google Scholar] [CrossRef]
- Da, N.Z. The Computational Case against Computational Literary Studies. Crit. Inq. 2019, 45, 601–639. [Google Scholar] [CrossRef]
- Michel, J.-B.; Shen, Y.K.; Aiden, A.P.; Veres, A.; Gray, M.K.; The Google Books Team; Pickett, J.P.; Hoiberg, D.; Clancy, D.; Norvig, P.; et al. Quantitative Analysis of Culture Using Millions of Digitized Books. Science 2011, 331, 176–182. [Google Scholar] [CrossRef]
- Goldstone, A.; Underwood, T. The Quiet Transformations of Literary Studies: What Thirteen Thousand Scholars Could Tell Us. New Lit. Hist. 2014, 45, 359–384. [Google Scholar] [CrossRef]
- Horvath, A.-S. How We Talk(Ed) about It: Ways of Speaking about Computational Architecture. Int. J. Archit. Comput. 2022, 20, 150–175. [Google Scholar] [CrossRef]
- Yazici, M.; Durmus Ozturk, S. An Analysis of Rem Koolhaas’s Discourses on Architecture and Urban Design Using a Corpus-Based Model. Front. Archit. Res. 2023, 12, 222–241. [Google Scholar] [CrossRef]
- Caetano, I.; Santos, L.; Leitão, A. Computational Design in Architecture: Defining Parametric, Generative, and Algorithmic Design. Front. Archit. Res. 2020, 9, 287–300. [Google Scholar] [CrossRef]
- Yan, H.; Ma, M.; Wu, Y.; Fan, H.; Dong, C. Overview and Analysis of the Text Mining Applications in the Construction Industry. Heliyon 2022, 8, e12088. [Google Scholar] [CrossRef] [PubMed]
- Belz, A.; Thomson, C.; Reiter, E.; Mille, S. Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP. In Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, Toronto, ON, Canada, 9–14 July 2023; Rogers, A., Boyd-Graber, J., Okazaki, N., Eds.; Association for Computational Linguistics: Toronto, ON, Canada, 2023; pp. 3676–3687. [Google Scholar]
- Belz, A.; Popović, M.; Mille, S. Quantified Reproducibility Assessment of NLP Results; Association for Computational Linguistics: Stroudsburg, PA, USA, 2022. [Google Scholar]
- Wieling, M.; Rawee, J.; van Noord, G. Reproducibility in Computational Linguistics: Are We Willing to Share? Comput. Linguist. 2018, 44, 641–649. [Google Scholar] [CrossRef]
- Arvan, M.; Pina, L.; Parde, N. Reproducibility in Computational Linguistics Is Source Code Enough? In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Abu Dhabi, United Arab Emirates, 7–11 December 2022; Goldberg, Y., Kozareva, Z., Zhang, Y., Eds.; Association for Computational Linguistics: Abu Dhabi, United Arab Emirates, 2022; pp. 2350–2361. [Google Scholar]
- Chrzanowska, O. Exploring the Drawbacks of Using Artificial Intelligence in Studies on Architecture: NLP and Highly Contextual Postmodern Architectural Discourse. Builder 2024, 324, 4–8. [Google Scholar] [CrossRef]
- Ek, F.İ. The Narration of Architectural Space as a Way of Constructing the Spatial Atmosphere: Two Readings of Contemporary Japanese Architecture. Philos. East. West. 2022, 72, 99–117. [Google Scholar] [CrossRef]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient Estimation of Word Representations in Vector Space. arXiv 2013, arXiv:1301.3781. [Google Scholar] [CrossRef]
- Goldberg, Y.; Levy, O. Word2vec Explained: Deriving Mikolov et al.’s Negative-Sampling Word-Embedding Method. arXiv 2014, arXiv:1402.3722. [Google Scholar]
- Haber, J.; Poesio, M. Polysemy—Evidence from Linguistics, Behavioral Science, and Contextualized Language Models. Comput. Linguist. 2024, 50, 351–417. [Google Scholar] [CrossRef]
- Pylkkänen, L.; Llinás, R.; Murphy, G.L. The Representation of Polysemy: MEG Evidence. J. Cogn. Neurosci. 2006, 18, 97–109. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies, Minneapolis, MN, USA, 2–7 June 2019; Burstein, J., Doran, C., Solorio, T., Eds.; Association for Computational Linguistics: Minneapolis, MN, USA, 2019; Volume 1 (Long and Short Papers), pp. 4171–4186. [Google Scholar]
- Nayak, A.; Timmapathini, H.; Ponnalagu, K.; Gopalan Venkoparao, V. Domain Adaptation Challenges of BERT in Tokenization and Sub-Word Representations of Out-of-Vocabulary Words. In Proceedings of the First Workshop on Insights from Negative Results in NLP, Online, 19 November 2019; Rogers, A., Sedoc, J., Rumshisky, A., Eds.; Association for Computational Linguistics: Stroudsburg, PA, USA, 2020; pp. 1–5. [Google Scholar]
- Hubert, L.; Arabie, P. Comparing Partitions. J. Classif. 1985, 2, 193–218. [Google Scholar] [CrossRef]
- Kozlowski, A.C.; Taddy, M.; Evans, J.A. The Geometry of Culture: Analyzing the Meanings of Class through Word Embeddings. Am. Sociol. Rev. 2019, 84, 905–949. [Google Scholar] [CrossRef]
- Garg, N.; Schiebinger, L.; Jurafsky, D.; Zou, J. Word Embeddings Quantify 100 Years of Gender and Ethnic Stereotypes. Proc. Natl. Acad. Sci. USA 2018, 115, E3635–E3644. [Google Scholar] [CrossRef] [PubMed]
- Yuan, Y.; Zhao, L.; Zhang, K.; Zheng, G.; Liu, Q. Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, USA, 12–16 November 2024; Al-Onaizan, Y., Bansal, M., Chen, Y.-N., Eds.; Association for Computational Linguistics: Miami, FL, USA, 2024; pp. 12188–12200. [Google Scholar]
- Hengchen, S.; Ros, R.; Marjanen, J.; Tolonen, M. A Data-Driven Approach to Studying Changing Vocabularies in Historical Newspaper Collections. Digit. Scholarsh. Humanit. 2021, 36, ii109–ii126. [Google Scholar] [CrossRef]
- Rogers, A.; Kovaleva, O.; Rumshisky, A. A Primer in BERTology: What We Know About How BERT Works. Trans. Assoc. Comput. Linguist. 2021, 8, 842–866. [Google Scholar] [CrossRef]
- Wevers, M.; Koolen, M. Digital Begriffsgeschichte: Tracing Semantic Change Using Word Embeddings. Hist. Methods A J. Quant. Interdiscip. Hist. 2020, 53, 226–243. [Google Scholar] [CrossRef]
- Hill, F.; Reichart, R.; Korhonen, A. SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation. Comput. Linguist. 2015, 41, 665–695. [Google Scholar] [CrossRef]
- Hai, B.H. English-Japanese Dictionary, RomajiDesu. Available online: http://www.romajidesu.com/dictionary/ (accessed on 7 November 2025).
- Ikezu-Ishi-Kyoto Guide. Available online: https://scrapbox.io/kyotoguide/Ikezu-ishi (accessed on 7 November 2025).
- Tome Ishi-The Japan Times. Available online: https://www.japantimes.co.jp/news/2009/02/19/reference/tome-ishi/ (accessed on 7 November 2025).
- Tomeishi: The Stopping Stone. Available online: https://www.seattlejapanesegarden.org/blog/tomeishi-stop (accessed on 7 November 2025).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).