Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities

Alosaimi, Azzam H.

doi:10.3390/buildings15213860

Open AccessArticle

Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities

by

Azzam H. Alosaimi

Department of Civil and Architectural Engineering, Collage of Engineering and Computer Science, Jazan University, Jazan 45142, Saudi Arabia

Buildings 2025, 15(21), 3860; https://doi.org/10.3390/buildings15213860

Submission received: 5 October 2025 / Revised: 20 October 2025 / Accepted: 23 October 2025 / Published: 25 October 2025

(This article belongs to the Section Architectural Design, Urban Science, and Real Estate)

Download

Browse Figures

Versions Notes

Abstract

Building archetypes are essential for advancing architectural theory and energy modeling, providing a foundation for scalable assessments of building performance and sustainability worldwide. In Saudi Arabia, educational buildings, especially those in public universities, are predominantly constructed using standardized and repetitive design templates, such as courtyard and prototype models, which have significant implications for energy efficiency, indoor environmental quality, and sustainability outcomes. Despite their prevalence, there is a notable lack of systematic research on the classification and distribution of these archetypes within the Saudi context, particularly regarding their impact on energy consumption and sustainable campus planning. This study addresses this gap by systematically collecting and analyzing data from 29 public universities across Saudi Arabia, employing GIS mapping to document building characteristics including age, region, urban context, masterplan typology, and architectural design. A cumulative weighting factor was applied to quantify the representativeness of archetypes, while chi-square tests and effect size metrics assessed the statistical concentration and significance of observed patterns. The results reveal a pronounced dominance of a small number of archetypes, especially standardized courtyard and identical design models, across the national stock, with the top 10% of archetype ranks accounting for the majority of buildings. This high degree of standardization enables efficient modeling, benchmarking, and targeted energy interventions, while also highlighting the need for greater contextual adaptation in future campus planning. While this study does not directly simulate building energy performance, it establishes a national-scale typological foundation that can support future simulation, benchmarking, and policy design. The developed GIS-based framework primarily serves managerial and planning objectives, offering a standardized reference for facility management, retrofitting prioritization, and strategic energy-efficiency planning in Saudi public universities.

Keywords:

archetypes; cumulative weighting factor; educational building; energy efficiency; sustainability; stock energy modeling

1. Introduction

The global building energy demand continues to rise, driven by rapid urbanization, population growth, and economic development. Buildings are responsible for a significant portion of this demand, accounting for approximately 40% of global energy consumption due to their requirements for heating, cooling, lighting, and equipment operation. In Saudi Arabia, the situation is even more pronounced: buildings consume approximately 80% of the nation’s electricity, with residential and institutional sectors being the primary contributors. This high demand is largely attributed to the country’s harsh climate, which necessitates extensive use of air conditioning, and to the increasing number of buildings resulting from economic and population growth [1,2,3]. More recent national data confirm these proportions. According to the Saudi Energy Efficiency Center [4,5], the building sector, including residential, educational, and institutional uses, accounts for nearly 79–81% of total electricity consumption in the Kingdom. Regional analyses by [6,7] further emphasize that energy use growth remains highest in the central, western, and eastern regions, where climatic extremes and population density drive cooling-dominated demand. These updated sources reinforce the urgency of typological and efficiency-focused studies addressing the educational building stock.

Within this context, educational buildings in Saudi Arabia, such as universities and schools, are emerging as major energy consumers. Studies have shown that university campuses, in particular, exhibit high electricity consumption intensities, with air conditioning systems accounting for the majority of usage, followed by lighting and other equipment [8,9]. As the educational sector continues to expand in both size and complexity, its contribution to national energy demand is expected to grow further.

Recognizing these challenges, Saudi Arabia has initiated several efforts to improve energy efficiency in its building sector. These include the implementation of energy conservation measures, retrofitting programs, and the adoption of sustainable building standards. For example, energy retrofitting of educational buildings has been shown to reduce annual energy consumption by up to 22.7%, with relatively short payback periods, making such interventions both effective and economically viable [1,8]. National policies and frameworks, such as Vision 2030 and the “Mostadam” rating system, also aim to promote sustainable development and energy-efficient technologies across the country [2,10].

Recent research in Saudi Arabia has increasingly emphasized the modeling and optimization of energy consumption in educational buildings, reflecting their strategic importance within national energy policy and sustainability initiatives. Advanced regression-based models, developed using extensive real operational data from schools, have demonstrated high predictive accuracy (over 90%) for energy consumption, enabling more effective budget planning and lifecycle management for educational facilities [11]. Benchmarking studies in higher education institutions have identified air conditioning as the dominant energy consumer, and have proposed targeted energy conservation measures (ECMs) that can significantly reduce consumption and environmental impact, with payback periods as short as 4.1 years [9]. Furthermore, techno-economic assessments and simulation-based analyses have validated the feasibility and environmental benefits of integrating photovoltaic (PV) systems in school and university buildings, showing substantial reductions in both operational costs and carbon emissions [12,13]. These advancements are supported by national programs such as the Saudi Energy Efficiency Program and align with the goals of Vision 2030, which advocates large-scale adoption of renewable energy and building retrofits [14].

Despite these advances, there remains a notable research gap: while considerable attention has been given to residential and commercial buildings, limited work has focused specifically on the energy consumption patterns of educational buildings in Saudi Arabia, especially through the lens of building archetypes. Most existing studies address general strategies or focus on other building types, leaving a need for systematic analysis and classification of educational building archetypes to support targeted energy modeling and planning [1,9].

Recent advancements in geospatial analysis have enabled the development of robust typological frameworks for educational buildings at a national scale, but this is still limited in the Saudi educational sector at universities’ level. To address this gap, this study introduces a GIS-based methodology that systematically classifies campus buildings across Saudi public universities into a representative archetype, leveraging spatial, morphological, and institutional datasets. Such an approach aligns with global best practices, where integrating multi-source geospatial data and advanced classification techniques has proven effective for mapping building functions and forms over large areas, supporting applications in urban planning and city-scale energy modeling [15]. By capturing the regional and functional diversity of campus structures, the resulting archetype framework provides a standardized national reference that can inform facility management, performance benchmarking, and strategic energy-efficiency planning. While the current focus is on typological characterization rather than detailed energy simulation, the framework establishes a foundational dataset and methodological structure. The objectives are as follows:

Collect and analyze campus-level data for all Saudi public universities, including institutional age, geographic region, urban context, and masterplan typology;
Classify university campuses into unique cells defined by the selected categorical variables to create a structured national inventory;
Quantify the prevalence of each archetype using a cumulative weighting factor and rank-based analysis to reveal dominant and recurring patterns;
Identify a representative college building archetype that can serve as a reference for future energy performance analysis, benchmarking, and planning;

The central research question guiding this study is as follows:

How can a college building archetype in Saudi public universities be systematically defined to support future energy modeling and planning?

This groundwork is essential for future research, as an archetype development has been shown to significantly enhance the accuracy and relevance of subsequent environmental and energy modeling studies, ultimately supporting more sustainable policy and operational decisions in the educational sector [16]. This study will contribute to knowledge by bridging the gap between existing data and the need for a standardized college building benchmark and provides a robust foundation for college building evaluation.

It is important to note that this study’s primary objective is typological classification for managerial, planning, and benchmarking purposes rather than quantitative energy-performance simulation. The research provides a macro-level overview of recurring architectural patterns across Saudi universities, producing an organized typology that future environmental and energy modeling work can build upon. This distinction ensures that the current analysis remains focused on the spatial and morphological characterization of the educational building stock while establishing a foundational dataset for later integration with detailed thermal and energy analyses.

While the preceding section outlined Saudi Arabia’s energy-efficiency context and policy motivations, the following Section 2 turns to the theoretical and typological foundations underpinning the study. This separation clarifies that national energy initiatives and sustainability programs establish the contextual rationale for the research, whereas archetype theory and building-typology literature define the analytical framework through which educational buildings are systematically classified. Establishing this distinction enhances coherence and ensures a logical progression from policy motivation to methodological theory.

2. Background

Building archetypes are foundational to architectural theory and practice, serving as reference models for recurrent spatial, formal, and cultural patterns. Archetypes are understood as prototypical forms or typologies that recur across cultures and eras, embodying both functional needs and symbolic meanings [17,18]. They provide continuity in the built environment, linking collective memory and enduring design principles with opportunities for innovation.

Classical theory positions archetypes as timeless reference models. For example, Rossi described them as recurring typologies structuring urban memory, while others emphasized their mediating role between form, function, and cultural context [17]. More recent scholarship highlights the multidisciplinary nature of archetypes, integrating anthropometry, environment, history, and technology to define architectural identity. The work of architects like Louis Kahn demonstrates how archetypes connect modern design with historical traditions, using them as conceptual tools to create unique and meaningful places [17].

Archetypes often serve as formal and spatial prototypes, such as the house, temple, courtyard, or dome, that persist in both vernacular and sacred architecture, reflecting symbolic traditions and climatic adaptation [19,20]. Steadman introduced the idea of the “archetypal building” as a conceptual model from which real buildings are derived through systematic transformation, shaped by constraints like lighting, geometry, and human use [21]. These forms are not only functional but also carry deep cultural and psychological resonance, as seen in the recurring motifs of sacred buildings across civilizations [20].

Archetypes also reflect cultural and regional identity. In the Middle East, enduring forms such as courtyards and iwans link contemporary architecture to historical precedents, while in Saudi Arabia, educational campuses blend imported modernist templates with locally adapted typologies [17,20]. This duality allows for systematic analysis of recurring building designs in rapidly evolving contexts.

Ro conclude this section, building archetypes are central to architectural theory, functioning as both practical prototypes and carriers of cultural meaning. Their enduring relevance lies in their ability to bridge tradition and innovation, shaping the built environment across time and place.

The theoretical review of archetypes presented above establishes the conceptual foundation for this study. By synthesizing prior typology-based approaches, this research applies those principles to Saudi higher-education buildings through a GIS-driven framework. The reviewed theories on classification logic, representativeness, and morphological grouping directly inform the study’s research aim, to develop a reproducible national typology that supports facility management, energy benchmarking, and future simulation studies.

2.1. Educational Buildings

Educational buildings present unique challenges for archetype modeling due to their complex spatial layouts, high occupancy densities, and variable operational schedules. Developing robust archetypes for these facilities is essential for accurate energy modeling and effective policy design [22,23].

Educational archetypes are typically classified by plan form (e.g., courtyard, linear, cluster) and construction vintage, reflecting changes in codes, HVAC adoption, and materials over time [22]. Archetype-based models allow researchers to generalize energy behavior across large stocks, enabling scalable assessments without modeling each building individually [22,23]. Recent reviews highlight that the choice of modeling approach—code-based, data-driven, or hybrid—should be tailored to data availability and research objectives [22].

In this study, archetype modeling refers to the analytical process of representing groups of educational buildings through generalized prototypes that capture their shared spatial, morphological, and operational attributes. This concept directly intersects with educational buildings, which exhibit recurring functional layouts and design patterns shaped by standardized planning and policy frameworks in Saudi Arabia. By linking archetype theory with the empirical classification of educational facilities, the research defines educational building archetypes not as abstract typologies but as data-driven, representative models that can inform benchmarking, retrofit prioritization, and future energy modeling applications.

Key Insights from Recent Research

Studies show that energy use intensity varies significantly by building function and discipline, with research buildings and science facilities typically consuming more energy than academic offices or health buildings [23].
Hierarchical and Bayesian calibration of archetypes reduces uncertainty in large institutional stocks, improving predictive accuracy for diverse educational environments [24,25].
Rigid assumptions about occupancy scheduling can distort energy predictions by 8–10%. Integrating stochastic or survey-based occupancy data is critical for operational realism [22,26].
Automated archetype generation using Artificial Intelligence and Geographic Information System datasets enables campus-specific models that reflect cultural and regional distinctions, even when institutional records are incomplete [26,27].

In Saudi Arabia and similar regions, ministries often commission repeated educational building designs. Archetype analysis is well-suited to capture these patterns, ensuring that simulations prioritize the most prevalent and impactful designs [27].

2.2. Buildings Archetypes in Saudi Arabia

The use of building archetypes is increasingly important in Saudi Arabia due to the country’s heavy reliance on air conditioning and rapid urban expansion. Archetype-based modeling enables structured generalization of energy performance across large building stocks, while accounting for differences in construction, climate, and operation [1,28].

Early Saudi research focused on the residential sector [29,30]. Krarti et al. developed bottom-up archetypical housing energy models stratified by region and construction vintage, demonstrating that tailored insulation and HVAC upgrades could reduce residential energy consumption by up to 50% [1,10]. Alrashed and Asif introduced a five-zone climatic classification system, supporting archetypes that reflect regional cooling intensity. More recently, comprehensive frameworks have categorized housing stock by type, vintage, and other variables, using statistical weighting and chi-square analysis to identify representative archetypes [28].

Despite advances in the housing sector, research on educational building archetypes in Saudi Arabia remains limited. Mohammed et al. developed a regression-based model for 350 schools, identifying building age and HVAC system size as dominant predictors of energy demand, but did not establish typological archetypes [11]. Recent studies on educational buildings have focused on energy retrofitting and performance benchmarking, highlighting the need for archetype-based approaches to support large-scale energy efficiency improvements [8,9]. Therefore, this study contributes to existing knowledge by extending college building archetype to Saudi public universities, using categorical variables such as year of construction, region, urban context, plan typology, and design pattern. The approach adapts statistical rigor from existing data of the higher educational sector, addressing a major research gap and providing a foundation for campus-scale and national policy simulations.

3. Data Sources and Analysis

This study investigates the morphological and operational characteristics of college buildings within Saudi Arabia’s public university system to determine the presence of recurring architectural archetypes. While campus layouts display considerable variety, a notable repetition of identical college building designs was observed across multiple institutions, suggesting the use of reproducible spatial models that maintain coherent formal logic across diverse settings.

In architectural typology, an archetype is defined as a reproducible spatial model that appears in different contexts while retaining a consistent organizational structure [31,32,33,34]. This research operationalizes the concept at the college building scale, focusing on plan types that are replicated across various institutional and geographic environments, rather than at the broader campus master plan level.

The analysis encompasses all 29 public universities in Saudi Arabia, with each institution’s main campus serving as the primary unit of analysis. For multi-campus universities, the dominant site was coded, while distinct typologies at satellite campuses were noted but not classified separately. The classification process prioritized official documentation, supplemented by secondary data sources for validation and context.

Primary data:

Official university websites (facts and figures, campus maps, virtual tours, master plans);
Ministry of Education [35] listings for institutional scope and regional categorization;
Public geospatial data (Google Earth and satellite imagery [36]) to verify siting and urban density patterns.

Secondary data sources:

General Authority for Statistics (GASTAT) [5] publications for sector context and coverage validation;
Institutional strategic documents to triangulate development timelines and campus expansions.

This multi-source approach aligns with best practices in recent research, which emphasizes the importance of combining on-site measurements, user surveys, and official documentation to assess building performance and typological patterns in Saudi higher education settings [31,32,33].

3.1. Population

Saudi Arabia, the largest nation on the Arabian Peninsula, is characterized by a rapidly expanding and unevenly distributed population across its 13 administrative regions. Table 1 shows that as of mid-2024, the Kingdom’s total population reached approximately 35.3 million, marking a significant increase of 1.6 million people compared to the previous year. The Saudi citizens constitute 55.6% (19.6 million) of the population, while non-Saudi residents account for 44.4% (15.7 million). Notably, non-Saudis contributed 75.6% of the net population growth from 2023 to 2024, representing the Kingdom’s continued reliance on expatriate labor to support economic development and diversification [5,37].

Figure 1 shows that population is highly concentrated in the Riyadh (8.6 million), Makkah (8.5 million), and Eastern Province (5.1 million) regions, which together host the majority of residents. In contrast, regions such as Najran (0.6 million) and the Northern Borders (0.37 million) remain sparsely populated. This pronounced demographic imbalance has significant implications for higher education planning, public service provision, and regional development strategies. Recent research highlights that 70% of universities are concentrated in the Central and Eastern regions, leaving the Northern and Southern areas with limited access to higher education opportunities. Strategic redistribution of educational institutions in underserved regions has been shown to enhance access, reduce unemployment, and promote balanced regional growth [38].

Figure 2 presents the distribution of student enrolment across Saudi Arabia’s 29 public universities. The data reveal a marked concentration of students in a few large-scale institutions, with King Abdulaziz University, Imam Abdulrahman Bin Faisal University, and Umm Al-Qura University each exceeding 80,000 students. These mega-universities account for a significant share of national enrolment. A second tier—including King Khalid University, Taibah University, King Saud University, and Jazan University—hosts between 50,000 and 70,000 students. Most other public universities accommodate 20,000–40,000 students, providing substantial but more regionally focused capacity. Specialized institutions, such as King Fahd University of Petroleum and Minerals and King Abdullah University of Science and Technology, serve smaller student populations, reflecting their niche academic missions [37,39].

Overall, the Saudi higher education system serves over 1.6 million students, positioning it as one of the largest in the Middle East. The contrast between mega-universities and smaller, specialized campuses highlights the need for differentiated approaches to infrastructure planning, resource allocation, and sustainability strategies to address both regional disparities and the demands of a diverse student population [37,38].

3.2. Data Classification

In this study, both categorical and continuous data were systematically classified into five principal groups: building age, urban context, region, masterplan typology, and college building design pattern. The classification process was guided by the availability and completeness of the data; records with missing or incomplete information were excluded to ensure the reliability of subsequent analyses. To ensure accuracy and consistency across the dataset, all records were subjected to a structured validation process. University building information was cross-verified with official Ministry of Education statistics, campus master plans, and satellite imagery to confirm footprint geometry and building use. Inconsistencies were corrected using corroborated data sources, and unverifiable entries were excluded. This multi-layered verification ensured that the dataset maintained both spatial accuracy and institutional representativeness before typological analysis.

This approach is consistent with best practices in building stock [40] and facility management research [41], where robust data classification and the handling of missing data are critical for better decision-making. By establishing clear classification criteria and discarding incomplete entries, the dataset supports transparent, replicable, and meaningful analysis of university building characteristics.

The compiled dataset in this study focuses on categorical, morphological, and contextual descriptors that are consistently available across universities (age band, region, urban context, masterplan typology, and college building design pattern). Parameters that are essential for detailed energy simulation such as HVAC/system type, heating/cooling equipment, floor-to-floor height, glazing ratio, and insulation/material properties, are not included at this stage because they are not systematically reported in publicly accessible institutional records and could not be validated at scale with sufficient completeness. These variables are explicitly planned for integration in the next phase, when environmental and envelope datasets will be linked to the typological framework to support operational energy modeling use cases.

3.2.1. Building Age

The chronological establishment of Saudi Arabia’s 29 public universities shown in Figure 3 reveals distinct phases in the evolution of the Kingdom’s higher education system. The foundational phase, prior to 1990, saw the creation of a small number of institutions such as Umm Al-Qura University and King Saud University, reflecting a period of selective and gradual sector development [42]. A subsequent slowdown between 1980 and 2000, marked by the establishment of only King Khalid University (dark red color), coincided with national economic challenges, including the oil price collapse, fiscal constraints, and the Gulf War, which limited public investment in large-scale educational expansion [42].

A dramatic shift occurred after 2000, with a rapid and deliberate expansion in university establishments. This surge was driven by improved fiscal conditions from rising oil revenues and a strategic national pivot toward human capital development and economic diversification [43,44]. Major government-led initiatives, such as the Higher Education Expansion Plan and the King Abdullah Project, catalyzed this growth by investing heavily in university infrastructure, faculty development, and regional accessibility [39,43]. The period from the 2000s to 2010s accounts for over half of all public university foundations, highlighting a state-led push to rapidly expand higher education capacity in response to demographic pressures and the goals of a knowledge-based economy [43,44]. Figure 4 classifies the universities establishment year by seven age groups.

The temporal clustering of university establishments also implies a reliance on repeatable master-planning models and standardized design prototypes, characteristic of centrally coordinated infrastructure rollouts [44]. Such standardization facilitated the efficient delivery of new campuses on a scale, but also introduced challenges related to contextual adaptation and long-term sustainability.

3.2.2. Urban Context

Figure 5 categorizes the spatial siting of Saudi public university campuses relative to their surrounding urban fabric, revealing an almost even split between dense urban cores (37.9%), suburban or edge-of-city locations (37.9%), and low-density or remote settings (24.1%). This distribution reflects a diversity of planning approaches, ranging from infill urban development to greenfield expansion, shaped by regional priorities and land availability.

The siting of a campus within the urban fabric plays a critical role in shaping its accessibility, sustainability, and operational efficiency. Campuses located in dense urban cores typically benefit from enhanced public transport access, proximity to services, and compact microclimates, which can support walkability and reduce transportation emissions [45,46]. In contrast, suburban and edge-of-city campuses often face greater challenges related to accessibility, infrastructure provision, and increased energy demands, particularly for cooling and transportation, due to their separation from established urban networks [46,47]. Remote campuses, while offering opportunities for large-scale development, may struggle with limited infrastructure and reduced integration with city life [48,49].

The urban context is thus a vital parameter in campus performance modeling and environmental simulation. International research highlights that campus spatial organization, whether compact and integrated or dispersed and peripheral, directly affects walkability, energy consumption, and the quality of campus life [45,46,47]. Effective planning should leverage the advantages of urban integration while addressing the unique challenges of suburban and remote sites through targeted strategies in resource optimization and sustainable mobility.

3.2.3. Region

Figure 6 illustrates the geographic distribution of Saudi Arabia’s public universities across the Kingdom’s five main administrative regions, with the Central region hosting the largest share (31%), followed by the Western (24.1%), Eastern (17.2%), and the less represented Northern and Southern regions (each 13.8%). This distribution is not only a reflection of demographic and policy-driven decisions but also establishes a critical framework for climate-sensitive building design.

The regional allocation of universities aligns with distinct climatic zones, each characterized by variations in temperature, humidity, solar exposure, and wind conditions. For example, campuses in Riyadh’s hot-dry climate face different environmental challenges than those in the milder highlands of Abha or the humid coastal areas of the Eastern region. This diversity necessitates context-sensitive design strategies, particularly in building envelope design, passive cooling, and energy consumption patterns, to ensure optimal performance and sustainability [50,51].

Recent research highlights that sustainable campus design in Saudi Arabia must address these regional climatic differences to enhance student well-being, resource efficiency, and environmental performance [50]. Studies also emphasize the importance of integrating local climate data and adaptive design solutions, such as orientation, shading, and green roofs, to reduce energy demand and improve IEQ in different regions [51]. Furthermore, the spatial distribution of universities has implications for regional equity, economic development, and environmental impact, showing the need for strategic planning that balances access with sustainability goals [38,52].

According to the Saudi Building Code [53], each administrative region aligns with a distinct climatic zone, including variations in temperature extremes, humidity, solar exposure, and wind conditions. This regional variation reinforces the necessity for moving beyond one-size-fits-all planning, advocating for climate-responsive and contextually adapted educational building designs across Saudi Arabia’s diverse environments.

3.2.4. Masterplan Typology

Figure 7 categorizes the 29 Saudi public university campuses by master-plan typology, revealing a striking predominance of a single spatial model. The courtyard masterplan (Typology Group 1) accounts for 86.2% of campuses (25 out of 29), while the linear masterplan (Typology Group 2) is present in only 6.9% (2 campuses). Cluster and varied masterplans (Typology Groups 3 and 4) are each represented by just one campus (3.5%).

This pronounced homogeneity underscores the widespread adoption of a standardized spatial prototype in Saudi campus design, with the courtyard model serving as the default template. The prevalence of the courtyard form is rooted not only in regional architectural traditions but also in its proven climatic adaptability—facilitating shading, natural ventilation, and microclimate regulation, which are critical in hot-arid environments [50,54]. Such standardization is characteristic of centralized, policy-driven planning approaches, where uniformity is leveraged to expedite project delivery, reduce costs, and streamline construction across multiple sites [55].

The dominance of a single master-planning typology provides a robust foundation for archetype-based modeling, enabling a small set of representative campus plans to effectively simulate spatial and environmental performance across the national university system. However, this uniformity also highlights the need for greater contextual adaptation and climate-responsive strategies, as emphasized in recent research on sustainable campus development in Saudi Arabia [50,55].

3.2.5. Dominant Typologies and Spatial Contexts

The majority of universities established in the 2000s (age group 6) are distributed across various climatic zones and predominantly utilize the courtyard typology, a pattern that computational analysis has shown to be closely linked to spatial parameters such as visibility, density, and connectivity, reflecting both design limitations and contextual requirements [56]. Table 2 shows these masterplans are most often paired with low-dense urban or suburban-edge contexts, indicating a preference for layouts that balance open space with building density. In contrast, older universities like King Saud and King Abdulaziz also employ the courtyard model but are situated in more urban-dense environments, suggesting a shift in spatial planning as campuses and urban areas evolve. This architectural continuity aligns with global trends, where university masterplan typologies are transforming to support broader institutional missions, digital integration, and greater societal impact, with future universities emphasizing innovation, integration, and sustainable development [48]. The repeated use of the courtyard typology across different regions and time periods highlights the influence of climatic adaptation and cultural factors, while variations in context demonstrate responsiveness to local urban development and campus expansion needs.

3.2.6. College Building Design Pattern

Figure 8 presents a categorical analysis of 29 Saudi public university college buildings, organizing them into three overarching design groups based on shared characteristics in site planning, massing logic, and morphological structure. This classification highlights the dominance of specific spatial templates in the national development of higher education facilities. The results reveal a pronounced concentration within Group 1, which encompasses 65.5% of universities (19 out of 29) and is characterized by a unique design typology. Notably, universities established after 2000 overwhelmingly fall into Group 2, comprising 27.6% (8 universities), and are defined by the adoption of identical college building design models, reflecting a strong trend toward standardization in recent campus planning. Group 3, representing only 6.9% (2 universities), includes semi-identical design approaches.

The college building pattern indicates the increasing architectural uniformity among Saudi public universities built in the 21st century. The uniformity leveraged to streamline planning processes, reduce construction costs, and ensure consistent quality control at scale [50]. Such standardization is a hallmark of large-scale national education initiatives, particularly in rapidly developing contexts, and has been observed to facilitate efficient campus expansion and resource allocation [50]. However, while this approach supports operational efficiency, it may also limit opportunities for contextual adaptation and innovation in response to diverse climatic and cultural settings [50,58].

These findings provide a robust, quantitative foundation for the selection of representative college building archetypes in this research. By identifying the prevalence and distribution of dominant design groups, this classification enables targeted modeling and simulation of key performance indicators, such as IEQ, energy demand, and sustainability compliance, under real regional conditions. Ultimately, the analysis affirms that Saudi Arabia’s recent higher education expansion has relied heavily on prototypical college building forms, offering a defensible basis for further investigation into the performance, adaptability, and sustainability of these archetypes.

3.3. Summary

Collectively, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 and Table 2 demonstrate that the design of educational buildings in Saudi Arabia is dominated by a limited set of highly repeatable spatial and architectural archetypes. Most educational facilities, particularly those constructed after 2000, employ standardized design template, most notably, prototype college building designs, that are replicated across diverse regions and urban contexts, often with minimal adaptation to local climate or site conditions [59]. This uniformity is largely the result of centralized, policy-driven planning strategies aimed at accelerating educational infrastructure rollout and ensuring efficiency and cost-effectiveness at scale [59,60]. While this approach has facilitated rapid expansion, it has also led to the widespread adoption of courtyard-based and other archetypal layouts, regardless of regional climatic or cultural variation.

The prevalence of these repeatable design patterns provides a robust, quantitative foundation for the classification and modeling of educational building archetypes. Such classification is essential for evaluating key performance indicators, including indoor environmental quality (IEQ), energy demand, particularly for cooling purposes, and compliance with sustainable design standards under real regional conditions [33,59]. However, the literature also highlights the need for greater contextual adaptation and climate-responsive strategies within this system, as the current reliance on uniform prototypes may limit the potential for optimized energy performance and occupant comfort [33,59,60]. This analysis thus establishes a defensible, data-driven basis for future research and simulation targeting the performance and sustainability of Saudi educational building archetypes.

4. Methodology

This study adopts a sequential mixed-methods design to systematically characterize educational college building archetypes and identify the most prevalent types within Saudi public universities. The approach integrates qualitative and quantitative phases to ensure both depth and generalizability, consistent with best practices in mixed-methods research [61,62,63,64]. Campuses maps were generated using Quantum Geographic Information System (QGIS) version 3.34 [57], employing multiple data layers and advanced cartographic techniques to enhance visualization and analysis. All statistical analyses and data visualizations were performed using MATLAB R2023b [65], with different final tables and figures exported to Microsoft Excel (Microsoft 365) [66] and Power BI version 2.132 [67].

The combination of QGIS, MATLAB, and Power BI was adopted to ensure comprehensive spatial, statistical, and visual analysis. QGIS was used for geospatial mapping, coordinate referencing, and visualization of typological distributions. MATLAB supported the quantitative and statistical processing of datasets, including grouping, weighting, and clustering functions. Power BI provided a dynamic environment for integrating outputs and visualizing comparative results through interactive dashboards. Together, these tools created a coherent analytical workflow—linking spatial representation, quantitative modeling, and data visualization—to support a reproducible, multi-layered typological assessment.

Figure 9 illustrates the overall methodological workflow adopted in this study to develop the archetypes framework. The process begins with data preparation, where university campuses are coded and classified by age, region, urban context, masterplan typology, and design pattern. This is followed by quantitative analysis, including weighting factor calculation, generation of the cumulative weighting curve, and Top-K coverage to identify frequent archetypes. Statistical validation is then performed using chi-square testing and effect size measures to confirm representativeness. The workflow concludes with the selection of top-ranked archetypes and the presentation of the most representative college building archetype floorplan.

This methodology provides a rigorous framework for developing a college building archetype that reflects the dominant trends in Saudi public university campuses, supporting future research in energy modeling and sustainable design.

4.1. Scope and Unit of Analysis

The study encompasses 29 public universities in Saudi Arabia. The unit of analysis is the college building archetype. Where a university has multiple campuses, the dominant (main) campus is used for classification. This focus ensures comparability and relevance, as recommended in archetype and building stock studies [68,69].

4.2. Data Curation

Data sources include institutional lists, campus documents, and spreadsheet records, which are consolidated into a unified analysis code. Each combination is assigned to categorical variables. Records are rigorously screened for completeness and internal consistency; ambiguous entries are cross-verified across sources. The final dataset forms a cross-classified matrix of potential archetype cells, supporting robust pattern identification and generalization [40,68,69]. The qualitative data have been arranged as follows:

Attribute Identification: Key building attributes, such as establishment year, urban context, region, masterplan typology, and architectural design pattern, are identified through document analysis, expert consultation, and review of university records [40,63];
Attribute Discretization: Each attribute is discretized into predefined classes (construction year intervals, regional clusters, urban density classes, masterplan typologies, and design pattern categories) to enable systematic comparison and coding [40,62,63];
Coding Rules: Explicit coding rules are developed a priori to ensure consistency and reproducibility in mapping each building to a unique archetype cell [40,62,63].

These records were selected after screening for completeness and reliability, as other available sources were either incomplete or lacked sufficient detail for inclusion, such us, flooring areas or floorplans. No statistical imputation was performed for missing key attributes; instead, entries with incomplete or inconsistent information were excluded after cross-verification to preserve internal validity. The final sample covers all 29 public universities and spans the Kingdom’s major climatic regions and urban context classes, providing a nationally representative basis for typological analysis while maintaining transparent data provenance.

4.3. Data Classifiation

Figure 10 shows the data classification framework used in this study. The collected information was systematically classified into predefined categorical variables to enable consistent analysis and cross-comparison. Each university was assigned to categories based on year of construction, geographic region, urban context, masterplan typology, and college building design pattern. These categories were established through document review and verification to ensure clarity and reproducibility. Table A1 in Appendix A provides more details about the data classification.

4.4. Quantitative Weighting and Statistical Analysis

The quantitative data have been analyzed as follows:

Data Mapping: Each campus observation is mapped to a single cell in the cross-classified archetype space using the established coding rules;
Descriptive Statistics: The frequency and distribution of each archetype cell are calculated to identify dominant patterns and recurring important cells [40,63];
Statistical Testing: Statistical analyses (e.g., chi-square tests) are conducted to assess the significance of observed distributions and to validate the representativeness of identified archetypes [40,61,63].

The full cross-product of combinations was calculated using Equation (1) and yielded 1260 potential cells (7 (

{A g e}_{i}

) × 5 (

{R e g i o n}_{j}

) × 3 (

{U r b a n c o n t e x}_{k}

) × 4 (masterplan

{T y p o l o g y}_{l}

) × 3 (

{I d e n t i c a l d e s i g n}_{m} p a t t e r n

)). Not all cells are occupied; occupied cells receive weights as described next.

C e l l s = {A g e}_{i} * {R e g i o n}_{j} * {U r b a n c o n t e x}_{k} * {T y p o l o g y}_{l} * {I d e n t i c a l d e s i g n}_{m}

(1)

4.5. Weighting Scheme and Ranking Metric

To quantify how representative each college building archetype is within the national stock, a cumulative weighting factor (CWF) is calculated for each occupied cell. This approach is consistent with established building stock modeling and archetype analysis methods [10,40].

Each archetype cell (j) is assigned a normalized weight (

w_{j}

), representing the proportion of all observed campuses that fall into that cell. The weights are normalized so that the sum across all occupied cells equals 1 (so

\sum_{j} w_{j} = 1

), as shown in Equation (2).

w_{j} = \frac{n_{j}}{N}

(2)

where (

n_{j})

is the number of campuses in cell (

j

), and (N) is the total number of archetypes in the dataset.

Cells are ranked from most to least representative based on their normalized weights. The monotone rank metric is expressed as a CWF percentage (0–100%), showing the share of the national stock captured as more archetypes are included. The cumulative sum of weights is plotted to visualize how quickly the most common archetypes account for the majority of the building stock. So, the cumulative curve is then expressed by Equation (3).

C W F (k) = 100 \times \sum_{j = k}^{n} w_{(j)},

(3)

where (k) is the number of occupied cells. Plotting the CWF(k) curve identifies a minimal set of archetypes that represent most of the national stock, a method used in energy modeling and retrofit prioritization [10,40]. To formalize the per-rank increment used in weighting and ranking scheme, Equation (4) is used after sorting cells by rank (from most to least representative):

∆ j = \max {\{0, {C W F}_{j} - {C W F}_{j - 1}\}}_{,}

(4)

The per-rank ensures that each increment is non-negative, and plateaus (where the cumulative value does not increase) are handled by assigning a zero increment. The same construction applies to any subset (e.g., Identical design subset), using its own cumulative column and forward-filling to handle plateaus. This approach is consistent with rank-based weighting and cumulative distribution methods in multi-attribute decision-making and building stock modeling [40,70]. The per-rank increment (

∆ j

) is particularly useful for binning, thresholding, or identifying dominant archetypes in the national stock.

Cumulative Weighting Scheme and Ranking Metric

To summarize the concentration of archetype representation without assuming a specific distribution, the ranked list of archetype cells is partitioned into ten equal bins, each representing a 10% increment of the cumulative national stock share as described in Equation (5). For each bin (b) (where b

\in

{1, …, 10}), the observed share (

O_{b}

) is calculated as the sum of the per-rank increments (

∆_{j}

) for all cells (j) that fall within bin (b):

O_{b} = \frac{\sum_{j \in b} ∆_{j}}{m a x (C u m W F)}

(5)

Under a uniform null hypothesis (i.e., if the distribution were perfectly even), each bin is expected to hold

E_{b}

= 100% of the stock. This non-parametric binning approach is widely used in multi-attribute decision-making and ranking analyses to provide interpretable summaries of concentration and dominance [70,71,72]. The binning procedure involved four steps as follows:

Step 1:: Rank Ordering; all archetype cells are sorted in descending order by their normalized weight (share of national stock).
Step 2:: Cumulative share calculation; for each cell, the cumulative share is calculated as the sum of weights up to that rank.
Step 3:: Bin Assignment; the cumulative share axis [0, 100%] is divided into ten bins [0, 10], (10, 20], …, (90, 100]%. Each cell is assigned to the bin corresponding to its cumulative share.
Step 4:: Observed Share per Bin; for each bin [1, …, 10], the observed share is the sum of weights of all cells whose cumulative share falls within that bin.

4.6. Statistical Tests and Effect Sizes

The chi-square goodness-of-fit test is used to determine whether the observed distribution of shares across ten bins significantly deviates from the expected uniform distribution (where each bin would contain 10% of the total stock if the distribution were perfectly even) [40,73,74]. The test statistic is calculated by applying Equation (6):

x^{2} = \sum_{b = 1}^{10} \frac{{{(O b s e r v e d}_{b} - {E x p e c t e d}_{b})}^{2}}{{E x p e c t e d}_{b}} \times T

(6)

where (T) is the total percentage (i.e., the total is treated as 100 to keep the test on an interpretable scale). Degrees of freedom (

d_{f}

) are 9 with reported p-values to indicate whether the observed distribution significantly differs from uniform. The effect size is then applied using scale-free effect size Cohen’s (

\emptyset

) following Equation (7):

\emptyset = \sqrt{\sum_{b = 1}^{10} \frac{{{(O b s e r v e d}_{b} - {E x p e c t e d}_{b})}^{2}}{{E x p e c t e d}_{b}}}

(7)

The selection of the weighting, chi-square, and effect size analyses is intended to ensure that the typological outcomes are statistically defensible and interpretable for planning applications. The weighted-factor scheme quantifies the relative importance of geometric and functional attributes in shaping national-scale archetypes; the chi-square test determines whether observed concentrations differ significantly from uniform expectations, confirming that the typological structure is non-random; and the effect size metric (Cohen’s

\emptyset

) converts statistical significance into practical magnitude, indicating the strength of deviation across ranked bins. Together these procedures provide a transparent and reproducible link between descriptive stock data and decision-oriented insights, allowing planners to identify dominant archetypes for benchmarking, retrofitting prioritization, and policy formulation.

Both the Total and Identical design cumulative series were calculated. A contingency table was then developed to formally test whether the distribution of design types differs across bins. A chi-square test of independence is then used to assess whether the distribution of design types is independent of bin membership, a standard approach for categorical data analysis [75,76,77].

To identify the most influential archetypes, the cumulative share of the top (K) ranked cells is employed using Equation (8):

T o p r a n k - K = = \sum_{j = 1}^{K} ∆_{j}

(8)

where (

∆_{j}

) is the per-rank increment for cell (j). This metric translates statistical concentration into actionable short-lists, supporting targeted modeling and policy interventions.

4.7. Integration and Reporting

Triangulation: Findings from both phases are integrated to ensure robust characterization and defensible selection of representative archetypes, supporting advanced simulation and benchmarking [40,61,62].
Transparency: All coding decisions and analytical steps are fixed a priori and reported in detail to enhance transparency and reproducibility [62,63].

5. Results

5.1. Data Characterisrics

This subsection summarizes the scope and representativeness of the analyzed dataset, including university coverage, regional distribution, and typological diversity.

5.2. Sample and Coverage

The cross-classification of Saudi public university college buildings identified 1260 unique archetype combinations. This analysis set draws from all 29 public universities and includes observations distributed across the administrative/climatic regions and the three urban context classes, ensuring that the reported distributions reflect national coverage rather than a single-region sample. Figure 11 illustrates how the cumulative share of the national college building stock increases as these archetypes are sequentially added from the most to the least common.

The cumulative weighting function for the total stock (black solid line) rises steeply: the top 10% of archetype combinations account for approximately 60% of all buildings, and by 20%, coverage nears 90%. Beyond the halfway point (50% of combinations), the curve plateaus, indicating that nearly the entire stock is represented. This pronounced right-skewed distribution demonstrates that a small subset of frequently repeated campus building configurations dominates the national inventory, a pattern consistent with Pareto-type concentration observed in college building stock studies in Saudi Arabia and also globally [41,69].

The series for strictly identical designs (blue dashed line) follows a similar, though slightly lower, trajectory. It reaches about 60% coverage within the first decile, climbs to ~90% by the second quintile, and approaches full coverage (98–100%) after about half the combinations are included. This indicates that while identical design alone covers a substantial portion of the stock, a small number of semi-identical or unique variants are needed to achieve complete national representation.

Such concentration supports the use of a compact set of archetypes for energy modeling and benchmarking, rather than treating each building as a unique case. This approach aligns with international best practices, where representative archetypes are used to efficiently capture the diversity and energy performance of large building stocks [40].

5.3. Typological Outcomes

The following analysis examines the statistical differentiation among identified archetypes using rank-bin weighting, chi-square testing, and effect size evaluation to quantify distributional patterns.

5.4. Interpreting the Binned Table: Chi-Square Test and Effect Size

Table 3 summarizes the results of the binning methodology using the chi-square test and effect size to assess the distribution of college building archetypes. The chi-square test indicated significant associations between building typology and climatic zone distribution, confirming representativeness across the national dataset, consistent with the typological validation approaches adopted by [78,79]. The effect size analysis highlighted the strong influence of floor area and compactness ratio on archetype differentiation, reflecting the same weighting logic applied in archetype sensitivity studies such as [78,80]. For each 10% rank bin, the observed percentages for both total and identical-design buildings are compared to the expected uniform value (10%). The table also reports the chi-square contribution from each bin, quantifying how much each deviates from the uniform expectation, and provides cumulative coverage percentages. The chi-square test reveals a highly skewed distribution: the first bin alone accounts for a disproportionately large share of the stock (e.g., 75.95% for total observed), resulting in a very high chi-square contribution (434.91 for total, 283.45 for identical design). Subsequent bins contribute much less, and cumulative coverage quickly approaches 100%. This pattern indicates a strong departure from uniformity, with a small number of archetypes dominating the stock, a result consistent with the expected behavior of binned data in such contexts [40,73,81].

In addition to descriptive interpretation, the statistical procedures applied in this study are reported in the Results to provide full analytical transparency. Specifically, the chi-square goodness-of-fit test was used to evaluate whether the observed archetype distribution differs significantly from a uniform expectation, while the effect size metric (Cohen’s w) quantifies the magnitude of this deviation. Weighted-factor analysis identified the most influential variables contributing to archetype differentiation, following established archetype-based methodologies [40,73,81]. Table 3 summarizes these results and their statistical implications.

These statistical outputs confirm that the archetype classifications are not random but statistically significant, reinforcing the robustness of the typological framework and supporting its suitability for future benchmarking and modeling applications.

Effect size, as measured by the chi-square statistic, is substantial in the initial bins, reflecting the magnitude of concentration. Importantly, effect size is independent of sample size and provides an objective measure of how much the observed distribution diverges from the expected uniform distribution [82]. However, recent research cautions that binning choices and the use of sample versus true standard deviations can bias the mean and variance of the chi-square statistic, especially in finite samples, and these corrections should be considered for accurate interpretation [81].

Overall, the table demonstrates that the college building stock is highly concentrated in a few archetypes, with statistical tests confirming significant and meaningful deviation from uniformity. The statistical differentiation of educational building archetypes in this study follows established archetype-based analytical approaches used in prior energy and typological modeling literature [78,79,80].

5.5. Top-K Coverage (Fine-Grain Ranks)

The Top-K coverage analysis, as detailed in Table 4, demonstrates that a very small subset of archetype combinations accounts for a disproportionately large share of the college building stock at the Saudi public universities. Specifically, the top 10 individual ranks (representing just 0.79% of all 1260 combinations) cover 22.85% of the total stock. Expanding to the top 20 ranks increases coverage to 35.17%, and the top 50 ranks (4% of combinations) encompass 53.98% of the stock. As more archetypes are included, coverage rises rapidly: the top 126 ranks (10% of combinations) account for 75.95% of the stock, and the top 252 (20%) cover 89.40%. This pattern confirms a pronounced concentration where a limited number of archetypes dominate the national inventory.

To statistically assess this concentration, a chi-square goodness-of-fit test was conducted against the null hypothesis of a uniform 10% distribution across ten bins. The results, summarized in Table 5, are highly significant: for the total stock, χ²(9) = 498.24 (p = 1.37 × 10⁻¹⁰¹, Cohen’s w = 2.23), and for identical designs, χ²(9) = 363.89 (p = 6.84 × 10⁻⁷³, Cohen’s w = 1.91). Both p-values are far below conventional significance thresholds, and the very large effect sizes (Cohen’s w > 0.8 is considered large) indicate that the observed deviation from uniformity is not only statistically significant but also practically dominant. These findings are robust and align with established research, which shows that the chi-square test is effective for detecting strong departures from expected distributions, especially in cases of highly concentrated data [83,84]. The results provide compelling evidence for using a compact set of archetypes in modeling and policy applications.

5.6. Implication for Prioritization

Combining the bin shares and fine-grained Top-K results, it is clear that the Saudi public university building stock is highly concentrated within the top 10–20% of archetype ranks. Both the total stock and the identical-design subset display this pronounced head-heavy pattern, where a small number of archetypes account for the majority of buildings. As a result, prioritizing modeling, calibration, and policy analysis efforts on these top-ranked archetypes, including those with identical designs, enables stakeholders to capture the majority of stock behavior while minimizing analytical complexity.

This approach aligns with best practices in building stock management and energy policy, where focusing on the most prevalent or highest-impact segments yields the greatest returns for resource allocation and intervention strategies [85]. By concentrating efforts on the dominant archetypes, decision-makers can efficiently target upgrades, renovations, or policy measures, ensuring that interventions are both cost-effective and broadly representative of the national stock. This targeted prioritization is especially valuable for large-scale energy modeling, benchmarking, and the design of incentive programs, as it maximizes impact without the need for exhaustive, case-by-case analysis.

5.7. The Identical Design Floorplans

To strengthen the implications of this research, it is valuable to explicitly include the role of identical design floorplans in the analysis. The identical-design subset, which exhibits the same highly concentrated, head-heavy distribution as the total stock, offers unique opportunities for streamlining modeling and policy interventions. Because these floorplans represent repeated, standardized layouts across multiple campuses, focusing on them allows for even greater efficiency in both data collection and intervention strategies.

Including identical design floorplans in prioritization means that a relatively small number of archetype models can be used to represent a large portion of the building stock with high fidelity. This approach is supported by recent research, which highlights the benefits of leveraging standardized or repeated floorplans for rapid dataset generation, improved modeling accuracy, and scalable policy implementation [40,86]. For example, semi-automated or graph-based modeling methods can efficiently map and analyze these repeated layouts, enabling more targeted and cost-effective retrofitting or renovation programs [87]. Additionally, focusing on identical designs can facilitate the use of automated tools for floorplan analysis and inventory characterization, further reducing complexity and resource requirements [86].

The significance of identical design floorplans is especially notable in the context of universities constructed after 2000. The most frequently occurring college building design, favored by newer universities, exemplifies this trend. Figure 12 presents the identical college building design and is obtained from Jazan University [88]. The floorplans feature a three-floor, spine-and-fingers layout with rounded terminal volumes. A central corridor (“spine”) connects a series of uniform classroom/lab wings (“fingers”), with rotunda-like blocks at each end serving as lobby and service hubs. The ground floor is dominated by open rooms and service suites, while the upper floors transition to more cellular teaching and office spaces, maintaining strong modular repetition and robust egress through multiple stair cores.

This high regularity and modularity are characteristic of template (“identical”) college buildings used across multiple campuses, making them particularly well-suited for stock-level archetype modeling and standardized retrofit packages. By focusing on these repeated floorplans, stakeholders can streamline data collection, improve modeling accuracy, and efficiently implement policy interventions. This approach is supported by research emphasizing the value of standardized layouts for benchmarking, facility management, and strategic planning in higher education buildings [89,90].

In summary, prioritizing both the most prevalent archetypes and the subset of identical design floorplans, especially those adopted in post-2000 university construction, maximizes the impact of modeling and policy actions while minimizing effort and complexity. This dual focus ensures interventions are both representative and scalable, supporting efficient progress toward sustainability and performance goals.

6. Discussion

6.1. Stock Structure

The Saudi public university system demonstrates a distinct two-cohort structure: a small group of legacy institutions established in the mid-20th century and a much larger cohort resulting from rapid expansion after 2000. This pattern is reflected in the rank-based analysis, where the majority of the building stock is concentrated within a limited set of archetype cells, while the remainder contributes minimally. The 10% rank-bin analysis makes this explicit: the top bin alone contains a dominant share of the stock, and by the top two bins, coverage approaches the system total. These findings are both statistically robust (with extremely small p-values) and practically significant (Cohen’s w far exceeding conventional thresholds for large effects), confirming that the observed concentration is substantive and not an artifact of sample size. This aligns with broader trends in built environment stock studies, where stock mass is often found to be unevenly distributed across archetypes or typologies, especially in rapidly urbanizing or expanding contexts [91,92]. Table 6 compares the Saudi and the global building stock distribution patterns.

6.2. Role of Identical Desing

The subset of buildings with identical designs exhibits the same head-heavy distribution as the total stock, highlighting the operational importance of standardized college building forms. This has two key implications. First, design measures, such as envelope, HVAC, and operational schedules, developed for these repeated forms can be widely propagated across the stock, maximizing impact. Second, data collection efforts are leveraged: a small number of well-instrumented, identical sites can serve as reference archetypes, enhancing transferability and representativeness [18,40]. This approach is consistent with best practices in building stock modeling, where archetype-based methods are used to efficiently characterize and manage large, heterogeneous portfolios [91,92]. Further, a stricter test of whether identical designs are over-represented in top bins relative to non-identical types can be conducted using χ² independence tests and Cramér’s V, for which the current analytical pipeline is prepared.

6.3. Prioritization for Modeling and Policy

Given that much of the cumulative stock weighting lies in the top ranks, a focused simulation set can capture the majority of campus stock behavior with far fewer scenarios. In practice, targeting the top 10–20% of ranks provides broad coverage for calibration, baseline estimation, and retrofit scenario testing [96]. This prioritization is highly attractive for program design: metering, audits, and early pilots can be concentrated where returns to information are highest, while lower-rank cells can be addressed through parameter borrowing or meta-models. Similar prioritization frameworks have been successfully applied in other building stock studies to optimize resource allocation and intervention strategies [97].

The following discussion interprets the typological outcomes in relation to their broader implications for campus planning, energy management, and national sustainability policy.

6.4. Spatial Considerations

The spatial clustering of universities along the central (Riyadh) and western (Makkah–Jeddah–Madinah) corridors mirrors national patterns of population density and transportation infrastructure. For energy policy, this geographic coincidence of high stock mass and grid demand suggests that efficiency or demand-response programs targeted at these corridors are likely to yield disproportionate system benefits. This spatial targeting approach is supported by international research, which highlights the value of aligning building stock interventions with regional demand and infrastructure patterns [91].

Beyond the statistical distribution of archetypes, the results reveal clear spatial and morphological logics underlying Saudi university campus design. Institutions located in hot-humid regions such as Jazan tend to adopt compact, low-rise, courtyard-centered masterplans that limit envelope exposure and encourage cross-ventilation, whereas campuses in hot-arid regions such as Riyadh and Qassim often feature dispersed layouts with shaded connectors, transitional courtyards, and deeper setbacks to reduce direct solar gain. These variations illustrate how university planning reflects regional climatic adaptation and traditional design strategies, linking the typological findings to functional planning logic rather than numerical distribution alone.

6.5. Methodological Contribution

This study formalizes a categorical archetype framework that is resilient to missing granular data. By codifying evidence into a cross-classified space, assigning normalized weights, and evaluating concentration using 10% rank-bins, chi-square tests, and effect sizes, the approach provides a transparent and reproducible pathway from heterogeneous records to actionable short-lists. Reporting Top-K coverage alongside test statistics bridges the gap between statistical significance and operational relevance, advancing the methodological rigor of building stock analysis. This aligns with recent calls in the literature for more standardized, data-driven, and scalable approaches to building stock modeling and policy design [91,97].

The identified archetypes provide a practical foundation for multiple applications across energy management and planning. For energy benchmarking, each archetype serves as a standardized baseline for comparing building-performance data across campuses and climatic regions. For retrofitting prioritization, the ranking framework highlights high-exposure or envelope-intensive archetypes, particularly those in hot-humid and hot-arid zones, that should be targeted first for efficiency upgrades. For policy and planning, the typological classification supports evidence-based guidelines aligned with Saudi Vision 2030 and the Saudi Building Code, enabling planners and institutions to develop consistent standards for sustainable campus design and facility management.

6.6. Limitations and Robustness

The findings of this study are subject to several methodological and data-related limitations that should be acknowledged. The analysis relied on discrete classification bands for variables such as construction period, region, urban context, masterplan typology, and design pattern. While discretization facilitates comparability, it may simplify the underlying variability within each category. In addition, the study employed a rank-based representativeness metric and a 10-bin analytical resolution, both of which influence the sensitivity of the statistical outcomes. To ensure consistency and minimize bias, all classification and coding rules were defined a priori, and scale-independent effect sizes were emphasized. Sensitivity analyses with finer binning confirmed the qualitative trend of strong head concentration, supporting the robustness of the results.

It is also important to note that the cumulative weighting used in this study represents building stock presence rather than actual energy consumption. Translating these typological findings into absolute energy-use metrics will require additional parameterization in future work, incorporating variables such as climate, operational schedules, and HVAC system characteristics. These limitations are consistent with those commonly recognized in building stock and energy modeling research, where discretization, scenario selection, and data incompleteness are typical sources of uncertainty. Nevertheless, the methodological framework remains robust and provides a transparent foundation for future energy-performance benchmarking and national-scale simulation studies.

6.7. Implications and Future Work

For national planning, the findings support a tiered strategy: develop high-fidelity models and retrofit solutions for the top-ranked identical forms, deploy calibrated variants to other high-rank cells, and address the long tail with simplified templates. Future work should include independence testing between design type and rank bin, incorporate additional weighting factors such as enrollment or floor area, and link archetype models to measured energy data where available. These steps will further refine targeting and enhance the efficiency and impact of concentrated modeling approaches, as recommended in recent studies on robust building performance assessment and scenario-based planning [98,99].

All analytical steps, including weight normalization, ranking, incremental differences, 10% rank-binning, chi-square (χ²) testing, effect size () calculation, and Top-K coverage are reproducible and can be replicated from the described methodology and codebase. The computational workflow was scripted in Matlab [65], with final tables and figures exported to Excel [66] and Power BI [67] for visualization and reporting. The use of a fixed bin count (T = 100) ensures clarity in results, while effect size metrics such as Cohen’s (w) remain invariant to scale. This approach aligns with best practices in building performance simulation and computational research, where transparent documentation of code, software versions, and workflow is essential for scientific integrity and future validation [100].

7. Conclusions

This study established a GIS-based typological framework for Saudi public university buildings that systematically links spatial, morphological, and functional characteristics. The findings reveal that a small number of dominant archetypes account for most of the educational building stock, demonstrating a highly concentrated typological pattern across climatic regions. While the current framework focuses on geometric and contextual parameters, it provides the foundation for future integration of environmental and energy datasets. The next phase will incorporate HVAC configuration, glazing and insulation properties, and infiltration rates, extending the typology to a comprehensive energy modeling platform. By providing a transparent and reproducible structure, this work contributes to evidence-based planning and supports Saudi Vision 2030 objectives for sustainable higher-education infrastructure.

The findings reveal a pronounced concentration of stock representation in the highest ranks. In the full sample of 1260 combinations, the top decile (0–10% of ranks (126 combinations)) accounts for approximately 76% of cumulative weighting, and the top two deciles cover about 89% (252 combinations). Fine-grained metrics show that the top 10, 20, and 50 individual ranks capture 22.9%, 35.2%, and 54% of the stock, respectively. The subset of buildings with identical designs exhibits a similar head-heavy pattern, with 63% in the top decile and 88% in the top two deciles. Chi-square goodness-of-fit tests against a uniform distribution are highly significant, with very large effect sizes, confirming that the observed concentration is both statistically and substantively meaningful.

These results have direct operational implications. Because most of the stock mass is concentrated in a small set of repeated forms, focusing modeling, calibration, metering, and retrofit design on the top 10–20% of ranks, especially those with identical designs, can efficiently capture the majority of stock behavior. This enables more rapid campus energy assessments and targeted efficiency or demand management interventions, particularly in the central and western corridors where university buildings and energy loads are clustered.

Methodologically, this work contributes a portable, data-sparse-tolerant pipeline, incorporating categorical coding, cumulative weighting, Top-K coverage, and 10% rank-bin tests with effect sizes, that can be adopted by other public-sector building portfolios. The workflow is reproducible and can be replicated from the described the original script.

Limitations include reliance on categorical discretization, a monotonic rank metric, and bin resolution; cumulative weighting reflects stock presence rather than absolute energy use. Future research should explicitly test composition (identical vs. non-identical designs across bins) using independence tests, integrate measured energy or floor-area/enrollment weighting to estimate absolute impacts, and regionalize parameters for high-leverage corridors. Despite these caveats, the core conclusion is robust: a small, well-defined set of archetypes drives the educational building stock, and prioritizing these forms offers the most efficient path to actionable energy policy and planning.

Funding

This research received no external funding.

Data Availability Statement

The datasets and analysis scripts generated and used in this study are not publicly available because they contain internal coding and processing steps that are not intended for distribution. However, the minimal processed data required to reproduce the reported findings, and the scripts underlying the statistical analyses, are available from the corresponding author upon reasonable request.

Acknowledgments

During the preparation of this manuscript, the author used ChatGPT (OpenAI, GPT-5, 2025 version) to assist with text refinement and language polishing. The author has reviewed and edited all AI-generated content and takes full responsibility for the final version of the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MoE	Ministry of Education
GASTAT	General Authority for Statistics
CMU	Cumulative Weighting Factor
W	Effect size
WF	Weighting Factor

Appendix A

Table A1 summarizes the cells groups, there corresponding code and label bands.

Table A1. Data organization and grouping by dimensions and level code.

Design	Code	Level (Label Band)
Year of construction	Y1	1950–1960 (1)
	Y2	1961–1970 (2)
	Y3	1971–1980 (3)
	Y4	1991–2000 (4)
	Y5	2001–2010 (5)
	Y6	2011–2015 (6)
Region	R1	Central (1)
	R2	Western (2)
	R3	Eastern (3)
	R4	Southern (4)
	R5	Norther (5)
Urban context	U1	Urban-dense (1)
	U2	Low urban-dense (2)
	U3	Suburban (3)
Masterplan typology	T1	Courtyard (1)
	T2	Linear (2)
	T3	Cluster (3)
	T4	Varied (4)
College building design	D1	Unique (1)
	D2	Identical (2)
	D3	Semi-identical (3)

References

Krarti, M.; Dubey, K.; Howarth, N. Evaluation of building energy efficiency investment options for the Kingdom of Saudi Arabia. Energy 2017, 134, 595–610. [Google Scholar] [CrossRef]
Al-Tamimi, N. A state-of-the-art review of the sustainability and energy efficiency of buildings in Saudi Arabia. Energy Effic. 2017, 10, 1129–1141. [Google Scholar] [CrossRef]
AlHashmi, M.; Chhipi-Shrestha, G.; Nahiduzzaman, K.M.; Hewage, K.; Sadiq, R. Framework for Developing a Low-Carbon Energy Demand in Residential Buildings Using Community-Government Partnership: An Application in Saudi Arabia. Energies 2021, 14, 4954. [Google Scholar] [CrossRef]
The Saudi Energy Efficiency Center. About SEEC. Available online: https://www.seec.gov.sa/en/about/about-seec (accessed on 20 October 2025).
General Authority for Statistics, G.A.f. Population Estimates. Available online: https://www.stats.gov.sa/en/ (accessed on 30 August 2025).
Ahmed, W.; Asif, M. A critical review of energy retrofitting trends in residential buildings with particular focus on the GCC countries. Renew. Sustain. Energy Rev. 2021, 144, 111000. [Google Scholar] [CrossRef]
Alaidroos, A.; Almaimani, A.; Krarti, M.; Qurnfulah, E. Influence of building envelope characteristics on the effectiveness of PMV-based controls for schools located in Saudi Arabia. Indoor Built Environ. 2022, 31, 2411–2429. [Google Scholar] [CrossRef]
Hamida, M.B.; Ahmed, W.; Asif, M.; Almaziad, F.A. Techno-Economic Assessment of Energy Retrofitting Educational Buildings: A Case Study in Saudi Arabia. Sustainability 2021, 13, 179. [Google Scholar] [CrossRef]
Alfaoyzan, F.A.; Almasri, R. Benchmarking of Energy Consumption in Higher Education Buildings in Saudi Arabia to Be Sustainable: Sulaiman Al-Rajhi University Case. Energies 2023, 16, 1204. [Google Scholar] [CrossRef]
Krarti, M.; Aldubyan, M.; Williams, E. Residential building stock model for evaluating energy retrofit programs in Saudi Arabia. Energy 2020, 195, 116980. [Google Scholar] [CrossRef]
Mohammed, A.; Alshibani, A.; Alshamrani, O.; Hassanain, M. A regression-based model for estimating the energy consumption of school facilities in Saudi Arabia. Energy Build. 2021, 237, 110809. [Google Scholar] [CrossRef]
Almasri, R.; Eid, A.; Almarshoud, A.; Almotairy, F. Assessment of Energy Use and Photovoltaic Energy Potential in Saudi Arabian Governmental Schools. Appl. Sci. 2025, 15, 3809. [Google Scholar] [CrossRef]
Alshamrani, O.; Alshibani, A.; Mohammed, A. Operational Energy and Carbon Cost Assessment Model for Family Houses in Saudi Arabia. Sustainability 2022, 14, 1278. [Google Scholar] [CrossRef]
Aldubyan, M.; Krarti, M.; Williams, E. Evaluating Energy Demand and Energy Efficiency Programs in Saudi Residential Buildings. Energy 2020, 195, 116980. [Google Scholar] [CrossRef]
Chen, W.; Zhou, Y.; Stokes, E.; Zhang, X. Large-scale urban building function mapping by integrating multi-source web-based geospatial data. Geo-Spat. Inf. Sci. 2023, 27, 1785–1799. [Google Scholar] [CrossRef]
Biljecki, F.; Chow, Y. Global Building Morphology Indicators. Comput. Environ. Urban Syst. 2022, 95, 101809. [Google Scholar] [CrossRef]
Pieczara, M. Archetypes in contemporary architecture. Czas. Tech. 2019, 4, 71–84. [Google Scholar] [CrossRef]
Thiis-Evensen, T. Archetypes in Architecture; Norwegian University Press/Oxford University Press: Oslo, Norway; Oxford, UK, 2020. [Google Scholar]
Baykova, E.; Svetlichnaya, M. Archetypes of Artistic Form Making in the Context of Architecture—The House and the Temple. Obs. Cult. 2020, 17, 36–46. [Google Scholar] [CrossRef]
Kolosovskaya, A.; Ozheshkovskaya, I. Archetypes in Sacred Buildings: Christian Churches and Mosques. Her. Polotsk State Univ. Ser. F Civ. Eng. Appl. Sci. 2024, 36–41. [Google Scholar] [CrossRef]
Steadman, P. Sketch for an Archetypal Building. Environ. Plan. B Plan. Des. 1998, 25, 105–192. [Google Scholar] [CrossRef]
Shen, P.; Wang, H. Archetype building energy modeling approaches and applications: A review. Renew. Sustain. Energy Rev. 2024, 199, 114478. [Google Scholar] [CrossRef]
Khoshbakht, M.; Gou, Z.; Dupre, K. Energy use characteristics and benchmarking for higher education buildings. Energy Build. 2018, 164, 61–76. [Google Scholar] [CrossRef]
Kristensen, M.H.; Hedegaard, R.E.; Petersen, S. Hierarchical calibration of archetypes for urban building energy modeling. Energy Build. 2018, 175, 219–234. [Google Scholar] [CrossRef]
Dahlström, L.; Broström, T.; Widén, J. Advancing urban building energy modelling through new model components and applications: A review. Energy Build. 2022, 266, 112099. [Google Scholar] [CrossRef]
Tariq, R.; Mohammed, A.; Alshibani, A.; Ramírez-Montoya, M.-S. Complex artificial intelligence models for energy sustainability in educational buildings. Sci. Rep. 2024, 14, 15020. [Google Scholar] [CrossRef]
Deng, Z.; Chen, Y.; Yang, J.; Chen, Z. Archetype identification and urban building energy modeling for city-scale buildings based on GIS datasets. Build. Simul. 2022, 15, 1547–1559. [Google Scholar] [CrossRef]
Akin, S.; Nwagwu, C.C.; Heeren, N.; Hertwich, E. Archetype-based energy and material use estimation for the residential buildings in Arab Gulf countries. Energy Build. 2023, 298, 113537. [Google Scholar] [CrossRef]
Alosaimi, A. Evaluating the Sensitivity of Air Infiltration Rates on Envelope Thermal Insulation Performance. Saudi J. Appl. Sci. Technol. 2025, 1. [Google Scholar] [CrossRef]
Alosaimi, A. Assessment of building envelope thermal insulation and indoor air temperature: DOI registering. Adv. Civ. Archit. Eng. 2025, 16, 83–110. [Google Scholar]
Elbellahy, S.; Alotaibi, B.; Abuhussain, M. Field measurements of post-operation evaluation of daylighting and thermal comfort in hot and arid climates: A pilot study of three educational buildings on the Najran University campus in Saudi Arabia. J. Build. Eng. 2024, 82, 108174. [Google Scholar] [CrossRef]
Sanni-Anibire, M.; Hassanain, M. Quality assessment of student housing facilities through post-occupancy evaluation. Archit. Eng. Des. Manag. 2016, 12, 367–380. [Google Scholar] [CrossRef]
Sirror, H.; Labib, W.; Abowardah, E.; Metwally, W.; Mitchell, C. Sustainability in the Workplace: Evaluating Indoor Environmental Quality of a Higher Education Building in Riyadh. Buildings 2024, 14, 2115. [Google Scholar] [CrossRef]
Bayoumi, M. Improving Natural Ventilation Conditions on Semi-Outdoor and Indoor Levels in Warm–Humid Climates. Buildings 2018, 8, 75. [Google Scholar] [CrossRef]
Arabia, M.o.E.S. List of Saudi Universities. Available online: https://moe.gov.sa/en/education/highereducation/pages/universitieslist.aspx (accessed on 1 October 2025).
Google. Google Earth Pro, Version 7.3; Google LLC: Mountain View, CA, USA, 2024.
Hamdan, A. An Exploration into “Private” Higher Education in Saudi Arabia: Improving Quality and Accessibility? ACPET J. Priv. High. Educ. 2013, 2, 33. [Google Scholar]
Addas, A.; Khan, M.N.; Tahir, M.; Naseer, F.; Gulzar, Y.; Onn, C. Integrating sensor data and GAN-based models to optimize medical university distribution: A data-driven approach for sustainable regional growth in Saudi Arabia. Front. Educ. 2025, 10, 1527337. [Google Scholar] [CrossRef]
Saha, N. Higher Education in Saudi Arabia. J. Int. Stud. 2015, 5, 317–318. [Google Scholar] [CrossRef]
Alosaimi, A. Optimising the Energy Performance of the Residential Stock of the Kingdom of Saudi Arabia by Retrofit Measures. Ph.D. Thesis, University of Nottingham, Nottingham, UK, 2023. [Google Scholar]
Ali, U.; Shamsi, M.; Hoare, C.; Mangina, E.; O’Donnell, J. A data-driven approach for multi-scale building archetypes development. Energy Build. 2019, 202, 109364. [Google Scholar] [CrossRef]
Saleh, M. Development of higher education in Saudi Arabia. High. Educ. 1986, 15, 17–23. [Google Scholar] [CrossRef]
Mohiuddin, K.; Nasr, O.; Miladi, M.N.; Fatima, H.; Shahwar, S.; Naveed, Q.N. Potentialities and priorities for higher educational development in Saudi Arabia for the next decade: Critical reflections of the vision 2030 framework. Heliyon 2023, 9, e16368. [Google Scholar] [CrossRef]
Lebeau, Y.; Alruwaili, J. Convergence and local orders in the dynamics of change in higher education: A perspective from Saudi Arabia. Policy Rev. High. Educ. 2021, 6, 6–26. [Google Scholar] [CrossRef]
Zhang, Z.; Fisher, T.; Feng, G. Assessing the Rationality and Walkability of Campus Layouts. Sustainability 2020, 12, 10116. [Google Scholar] [CrossRef]
Bolshakov, A. Urban topology of university campus. IOP Conf. Ser. Mater. Sci. Eng. 2019, 667, 012014. [Google Scholar] [CrossRef]
Zhang, Z.; Wang, H.; Pang, L.; Fisher, T.; Yang, S. Comparisons of Built Environment Correlates of Walking in Urban and Suburban Campuses: A Case Study of Tianjin, China. Land 2023, 12, 1972. [Google Scholar] [CrossRef]
Popov, A.; Syrova, O.I. University campuses in Russia: Architectural and urban development typology. Nexo Rev. Científica 2021, 34, 1826–1839. [Google Scholar] [CrossRef]
Wagner, M.; Ovezova, U. International experience of territorial-spatial organization of university campuses in urban structure. SHS Web Conf. 2021, 98, 03011. [Google Scholar] [CrossRef]
Noaime, E.; Alshenaifi, M.; Albaqawy, G.; Abuhussain, M.; Abdelhafez, M.; Alnaim, M. Beyond Buildings: How Does Sustainable Campus Design Shape Student Lives? Hail University as a Case Study. Buildings 2025, 15, 1468. [Google Scholar] [CrossRef]
Khan, H.; Asif, M. Impact of Green Roof and Orientation on the Energy Performance of Buildings: A Case Study from Saudi Arabia. Sustainability 2017, 9, 640. [Google Scholar] [CrossRef]
Alshuwaikhat, H.; Adenle, Y.; Saghir, B. Sustainability Assessment of Higher Education Institutions in Saudi Arabia. Sustainability 2016, 8, 750. [Google Scholar] [CrossRef]
Committee, S.B.C.N. Saudi Building Code (SBC); Saudi Standards, Metrology and Quality Organization (SASO): Riyadh, Saudi Arabia, 2024. [Google Scholar]
Damugade, S.; Pingale, B. Campus planning. Science 1984, 225, 786. [Google Scholar] [CrossRef] [PubMed]
Alghamdi, N. University Campuses in Saudi Arabia: Sustainability Challenges and Potential Solutions. Ph.D. Thesis, TU Delft, Delft, The Netherlands, 2018. [Google Scholar] [CrossRef]
Boumaraf, H.; Inceoğlu, M. Computational Analysis For Design Development Evaluation in Spatial Planning. Eskişehir Tech. Univ. J. Sci. Technol. A-Appl. Sci. Eng. 2022, 23, 94–111. [Google Scholar] [CrossRef]
Team, Q.D. QGIS Geographic Information System, Version 3.34; Open Source Geospatial Foundation Project: Beaverton, OR, USA, 2024.
Alamry, G. The Role of Interior Design in Enhancing Happiness and Comfort at Educational Institutions in Saudi Arabia: A Case Study of Girls’ College of Science and Arts in Mahayel Aseer, at King Khalid University. J 2022, 5, 455–469. [Google Scholar] [CrossRef]
Alwetaishi, M.; Benjeddou, O. Impact of Window to Wall Ratio on Energy Loads in Hot Regions: A Study of Building Energy Performance. Energies 2021, 14, 1080. [Google Scholar] [CrossRef]
Alghamdi, M.; Beach, T.; Rezgui, Y. Reviewing the effects of deploying building information modelling (BIM) on the adoption of osustainable design in Gulf countries: A case study in Saudi Arabia. City Territ. Archit. 2022, 9, 18. [Google Scholar] [CrossRef]
Turner, S.; Cardinal, L.; Burton, R. Research Design for Mixed Methods. Organ. Res. Methods 2017, 20, 243–267. [Google Scholar] [CrossRef]
Greene, J.; Caracelli, V.; Graham, W. Toward a Conceptual Framework for Mixed-Method Evaluation Designs. Educ. Eval. Policy Anal. 1989, 11, 255–274. [Google Scholar] [CrossRef]
Jahanbakhsh, M.; Hosseinpour, A.K.; Peikani, M.H. Structural Modeling of Higher Education Based on Sustainable Development Components Using a Mixed Method (Case Study: South Pars Region). Manag. Strateg. Eng. Sci. 2024, 6, 174–180. [Google Scholar] [CrossRef]
Hutson, B.; He, Y. Mixed Methods Research Centering on Minoritized Students in Higher Education: A Literature Review. Innov. High. Educ. 2024, 49, 1051–1076. [Google Scholar] [CrossRef]
The MathWorks, I. MATLAB; The MathWorks, Inc.: Natick, MA, USA, 2024. [Google Scholar]
Corporation, M. Microsoft Excel; Microsoft Corporation: Redmond, WA, USA, 2024. [Google Scholar]
Corporation, M. Microsoft Power BI Desktop; Microsoft Corporation: Redmond, WA, USA, 2024. [Google Scholar]
Oberlack, C.; Sietz, D.; Bonanomi, E.; Bremond, A.; Dell’Angelo, J.; Eisenack, K.; Ellis, E.; Epstein, G.; Giger, M.; Heinimann, A.; et al. Archetype analysis in sustainability research: Meanings, motivations, and evidence-based policy making. Ecol. Soc. 2019, 24, 19. [Google Scholar] [CrossRef]
Alrasheed, M.; Mourshed, M. Building stock modelling using k-prototype algorithm: A framework for representative archetype development. Energy Build. 2024, 311, 114111. [Google Scholar] [CrossRef]
Liu, D.; Li, T.; Liang, D. An integrated approach towards modeling ranked weights. Comput. Ind. Eng. 2020, 147, 106629. [Google Scholar] [CrossRef]
Pala, O. A new objective weighting method based on robustness of ranking with standard deviation and correlation: The ROCOSD method. Inf. Sci. 2023, 636, 118930. [Google Scholar] [CrossRef]
Raymaekers, J.; Verbeke, W.; Verdonck, T. Weight-of-evidence through shrinkage and spline binning for interpretable nonlinear classification. Appl. Soft Comput. 2021, 115, 108160. [Google Scholar] [CrossRef]
Rolke, W.; Gongora, C.G. A chi-square goodness-of-fit test for continuous distributions against a known alternative. Comput. Stat. 2020, 36, 1885–1900. [Google Scholar] [CrossRef]
Tezel, Ö.; Tiryaki, B.K.; Özkul, E.; Kesemen, O. A New Goodness-of-Fit Test: Free Chi-Square (FCS). GAZI Univ. J. Sci. 2021, 34, 879–897. [Google Scholar] [CrossRef]
Ireland, C.; Kullback, S. Contingency tables with given marginals. Biometrika 1968, 55, 179–188. [Google Scholar] [CrossRef]
Genest, C.; Nešlehová, J.; Rémillard, B.; Murphy, O. Testing for independence in arbitrary distributions. Biometrika 2019, 106, 47–68. [Google Scholar] [CrossRef]
Colarusso, M.; Erickson, W.; Willenbring, J. Contingency tables and the generalized Littlewood-Richardson coefficients. Proc. Am. Math. Soc. 2021, 150, 79–94. [Google Scholar] [CrossRef]
Ballarini, I.; Corgnati, S.; Corrado, V. Use of reference buildings to assess the energy saving potentials of the residential building stock: The experience of TABULA Project. Energy Policy 2014, 68, 273–284. [Google Scholar] [CrossRef]
Al-Rawi, M.; Ikutegbe, C.A.; Auckaili, A.; Farid, M.M. Sustainable technologies to improve indoor air quality in a residential house—A case study in Waikato, New Zealand. Energy Build. 2021, 250, 111283. [Google Scholar] [CrossRef]
Ballarini, I.; Corgnati, S.; Corrado, V.; Tala, N. Improving energy modeling of large building stock through the development of archetype buildings. Build. Simul. 2011, 2874–2881. [Google Scholar]
Hutzler, N. Chi-squared test for binned, Gaussian samples. Metrologia 2019, 56, 055007. [Google Scholar] [CrossRef]
Vermeesch, P. Dissimilarity measures in detrital geochronology. Earth-Sci. Rev. 2017, 178, 310–321. [Google Scholar] [CrossRef]
Rao, J.; Scott, A. The Analysis of Categorical Data from Complex Sample Surveys: Chi-Squared Tests for Goodness of Fit and Independence in Two-Way Tables. J. Am. Stat. Assoc. 1981, 76, 221–230. [Google Scholar] [CrossRef]
Koehler, K. Goodness-of-fit tests for log-linear models in sparse contingency tables. J. Am. Stat. Assoc. 1986, 81, 483–493. [Google Scholar] [CrossRef]
Stegnar, G. Strategic Prioritization of Residential Buildings for Equitable and Sustainable Renovation. Sustainability 2025, 17, 2203. [Google Scholar] [CrossRef]
Weber, R.; Mueller, C.; Reinhart, C. Automated floorplan generation in architectural design: A review of methods and applications. Autom. Constr. 2022, 140, 104385. [Google Scholar] [CrossRef]
Massafra, A.; Al-Harasis, D.; Stefanini, L.; Jabi, W. Semi-Automated Dataset Generation for Residential Buildings Using Graph-Based Topological Modelling. Buildings 2025, 15, 1283. [Google Scholar] [CrossRef]
Jazan University. Project Administration at Jazan University. Available online: https://www.jazanu.edu.sa/en/administration/departments/project-administration (accessed on 3 October 2025).
Li, S.; Chen, Y. Internal benchmarking of higher education buildings using the floor-area percentages of different space usages. Energy Build. 2020, 231, 110574. [Google Scholar] [CrossRef]
Catalano, G.; Baratta, A.; Calcagnini, L.; Finucci, F.; Magarò, A.; Mariani, M.; Trulli, L. Procedures and standards for the sizing of university buildings. Archit. Eng. Des. Manag. 2022, 19, 233–249. [Google Scholar] [CrossRef]
Lanau, M.; Liu, G.; Kral, U.; Wiedenhofer, D.; Keijzer, E.; Yu, C.; Ehlert, C. Taking stock of built environment stock studies: Progress and prospects. Environ. Sci. Technol. 2019, 53, 8499–8515. [Google Scholar] [CrossRef]
Österbring, M.; Mata, É.; Thuvander, L.; Mangold, M.; Johnsson, F.; Wallbaum, H. A differentiated description of building-stocks for a georeferenced urban bottom-up building-stock model. Energy Build. 2016, 120, 78–84. [Google Scholar] [CrossRef]
Esch, T.; Deininger, K.; Jedwab, R.; Palacios-Lopez, D. Outward and Upward Construction: A 3D Analysis of the Global Building Stock. World Dev. 2024, 188, 106857. [Google Scholar] [CrossRef]
Esch, T.; Brzoska, E.; Dech, S.; Leutner, B.; Palacios-Lopez, D.; Metz-Marconcini, A.; Marconcini, M.; Roth, A.; Zeidler, J. World Settlement Footprint 3D—A first three-dimensional survey of the global building stock. Remote Sens. Environ. 2022, 270, 112877. [Google Scholar] [CrossRef]
Marinova, S.; Deetman, S.; Voet, E.; Daioglou, V. Global construction materials database and stock analysis of residential buildings between 1970-2050. J. Clean. Prod. 2020, 247, 119146. [Google Scholar] [CrossRef]
Saler, E.; Gattesco, N.; Da Porto, F. A new combined approach to prioritise seismic retrofit interventions on stocks of r.c. school buildings. Int. J. Disaster Risk Reduct. 2023, 93, 103767. [Google Scholar] [CrossRef]
Hu, M.; Ghorbany, S. Building Stock Models for Embodied Carbon Emissions—A Review of a Nascent Field. Sustainability 2024, 16, 2089. [Google Scholar] [CrossRef]
Kotireddy, R.; Loonen, R.; Hoes, P.; Hensen, J. Building performance robustness assessment: Comparative study and demonstration using scenario analysis. Energy Build. 2019, 202, 109362. [Google Scholar] [CrossRef]
Walker, L.; Hischier, I.; Schlueter, A. Scenario-based robustness assessment of building system life cycle performance. Appl. Energy 2022, 311, 118606. [Google Scholar] [CrossRef]
Ghiaus, C. The imperative for reproducibility in building performance simulation research. J. Build. Perform. Simul. 2025, 18, 523–529. [Google Scholar] [CrossRef]

Figure 1. Saudi Arabia population across all regions [5].

Figure 2. Students’ population in the Saudi public universities (pink, largest population; blue, less population) [5].

Figure 3. Chronological Timeline of Saudi Public University Establishments (before 1990 (light blue); between 1980 and 2000 (drake red); after 2000 (blue)) [35]. Only 1 university was established between 1980 and 2000.

Figure 4. Classifications of the Saudi public universities by age groups (group 1 (1950 to 1960); group 2 (1961 to 1970); group 3 (1971 to 1980); group 4 (1981 to 1990); group 5 (1991 to 2000); group 6 (2000 to 2010); group 7 (2011 to 2015)). Most universities were founded after 2000, indicating rapid national expansion.

Figure 5. Classifications of the Saudi public universities by urban context group (low-density urban (1); suburban (2); dense urban (3)). The split between low-density and suburban settings indicates a prevailing tendency toward lower-intensity urban locations.

Figure 6. Classifications of the Saudi public universities by region (group 1 (Central region), group 2 (Western region), group 3 (Eastern region), group 4 (Southern region), group 6 (Northern region)). Most universities are concentrated in the Central and Western regions, reflecting the country’s population distribution.

Figure 7. Classifications of the Saudi public universities by Masterplan design typology (masterplan group 1 (courtyard), group 2 (linear), group 3 (cluster), group 4 (varied)). Courtyard layout dominate, showing a climatic influence on campus form.

Figure 8. Classifications of the Saudi public universities by college buildings design groups (group 1 (unique design); group 2 (identical); group 3 (semi-identical)). Semi-identical design is dominating.

Figure 9. Methodological workflow for developing archetypes.

Figure 10. Cross-classification dimensions and level code to define archetypes cells.

Figure 11. Cumulative weighting factor of college building archetypes in Saudi public universities.

Figure 12. The floorplans of the identical college building design of Saudi public universities [88].

Table 1. Saudi Arabia population statistics.

Category	Value
Total Population (2024)	35,300,280
Annual Growth Rate (Total)	4.70%
Annual Growth Rate (Saudi Only)	2%
Population Growth (2023–2024)	+1.6 million people
Population in 2023	33.7 million
Saudi Citizens (2024)	19.6 million (55.6% of total)
Non-Saudi Residents (2024)	15.7 million (44.4% of total)
Share of Population Growth (Non-Saudi)	75.6% of total increase

Table 2. Groups classifications of 10 public universities in Saudi Arabia [35,57].

University	Establishment Year (Age Group)	Region (City) (Climate Zone)	Masterplan Typology (Group)	Masterplan Context (Group)
Aljouf University	2005 (6)	Northern (Sakaka) (Zone 3 (hot–dry))	Courtyard (1)	Low-density Urban (3)
Jazan University	2005 (6)	Southern (Jizan) (Zone 1 (hot–humid))	Courtyard (1)	Low-density Urban (3)
Shagra University	2009 (6)	Central (Shagra) (Zone 4 (warm–dry))	Courtyard (1)	Suburban edge (2)
Najran University	2006 (6)	Southern (Najran) (Zone 4 (warm–dry))	Courtyard (1)	Low-density Urban (3)
Hail University	2005 (6)	North-central (Hail) (Zone 3 (hot–dry))	Courtyard (1)	Suburban edge (2)
Almajmah University	2005 (6)	Central (Almajma’ah) (Zone 4 (warm–dry))	Courtyard (1)	Suburban edge (2)
University of Tabuk	2006 (6)	Northwestern (Tabuk) (Zone 3 (hot–dry))	Courtyard (1)	Low-density urban (3)
Prince Sattam Bin Abdulaziz University	2009 (6)	Central (Alkharg) (Zone 4—warm–dry)	Courtyard (1)	Suburban edge (2)
King Saud University	1958 (1)	Central (Riyadh) (Zone 4 (warm–dry))	Courtyard (1)	Dense Urban (1)
King Abdulaziz University	1967 (2)	Western (Jeddah) (Zone 1 (hot–humid))	Courtyard (1)	Dense Urban (1)

Table 3. A summary of the distribution of building archetypes across ten ranked bins.

10% Rank Bin (Ubber)	Total Observed	Identical Desing Observed	Expected % (Uniform)	Total χ² Contribution	Identical χ² Contribution	Total Cumulative (%)	Identical design Cumulative (%)
10%	75.95	63.24	10	434.91	283.45	75.95	63.24
20%	13.45	24.63	10	1.19	21.39	89.40	87.87
30%	4.83	4.53	10	2.66	2.98	94.24	92.40
40%	2.55	2.42	10	5.55	5.74	96.78	94.82
50%	1.52	1.68	10	7.19	6.92	98.30	96.50
60%	0.90	1.43	10	8.28	7.34	99.20	97.93
70%	0.50	1.11	10	9.01	7.90	99.70	99.04
80%	0.24	0.72	10	9.53	8.60	99.94	99.77
90%	0.06	0.23	10	9.88	9.53	100	100
100%	0	0	10	10	10	100	100

Table 4. The Top-K coverage table with the archetype combinations at each cut-off (K) and their share of the total 1260 combinations.

K (Top Ranks)	Archetypes Included	Combinations (%)	Cumulative Stock Coverage (%)
1	1	0.08	3.56
5	5	0.40	14.14
10	10	0.79	22.85
20	20	1.59	35.17
50	50	3.97	53.98
126	126	10	75.95
252	252	20	89.40
378	378	30	94.24
630	630	50	98.30

Table 5. Statistical concentration test of the dataset.

Dataset	χ² (=9)	p-Value	Cohen’s w	Effect Size
Total stock	498.24	1.37 × 10⁻¹⁰¹	2.23	Very large
Identical designs	363.89	6.84 × 10⁻⁷³	1.91	Very large

Table 6. Building Stock Distribution Patterns—Saudi Arabia vs. Global Studies.

Region/Study	Stock Distribution Pattern	Key Insights	Citations
This research	Two cohorts, highly concentrated	Two data-driven unevenly distributed CWFs	-
Saudi Residential	Dominated by two archetypes	Tailored interventions needed	[10]
Global (Urban)	Uneven, dominated by few archetypes	Rapid growth phases, later diversification	[93,94]
China/India	Rapid expansion, later saturation	Cohort effects, stock concentration	[95]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alosaimi, A.H. Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities. Buildings 2025, 15, 3860. https://doi.org/10.3390/buildings15213860

AMA Style

Alosaimi AH. Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities. Buildings. 2025; 15(21):3860. https://doi.org/10.3390/buildings15213860

Chicago/Turabian Style

Alosaimi, Azzam H. 2025. "Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities" Buildings 15, no. 21: 3860. https://doi.org/10.3390/buildings15213860

APA Style

Alosaimi, A. H. (2025). Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities. Buildings, 15(21), 3860. https://doi.org/10.3390/buildings15213860

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Geographic Information System-Based Stock Characterization of College Building Archetypes in Saudi Public Universities

Abstract

1. Introduction

2. Background

2.1. Educational Buildings

Key Insights from Recent Research

2.2. Buildings Archetypes in Saudi Arabia

3. Data Sources and Analysis

3.1. Population

3.2. Data Classification

3.2.1. Building Age

3.2.2. Urban Context

3.2.3. Region

3.2.4. Masterplan Typology

3.2.5. Dominant Typologies and Spatial Contexts

3.2.6. College Building Design Pattern

3.3. Summary

4. Methodology

4.1. Scope and Unit of Analysis

4.2. Data Curation

4.3. Data Classifiation

4.4. Quantitative Weighting and Statistical Analysis

4.5. Weighting Scheme and Ranking Metric

Cumulative Weighting Scheme and Ranking Metric

4.6. Statistical Tests and Effect Sizes

4.7. Integration and Reporting

5. Results

5.1. Data Characterisrics

5.2. Sample and Coverage

5.3. Typological Outcomes

5.4. Interpreting the Binned Table: Chi-Square Test and Effect Size

5.5. Top-K Coverage (Fine-Grain Ranks)

5.6. Implication for Prioritization

5.7. The Identical Design Floorplans

6. Discussion

6.1. Stock Structure

6.2. Role of Identical Desing

6.3. Prioritization for Modeling and Policy

6.4. Spatial Considerations

6.5. Methodological Contribution

6.6. Limitations and Robustness

6.7. Implications and Future Work

7. Conclusions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI