1. Introduction
The built environment is becoming increasingly vulnerable to climate-induced disturbances, including prolonged cold spells, polar vortex events, and energy supply disruptions. According to the Intergovernmental Panel on Climate Change [
1], such events are expected to intensify throughout the 21st century, posing new challenges to building safety and habitability, especially in heating-dominated regions. While global energy policy has long emphasized decarbonization and energy efficiency [
2], recent studies highlight that efficiency alone does not ensure thermal resilience under disruptive conditions [
3,
4].
Buildings consume a significant share of energy worldwide and are expected to undergo performance shifts due to increasingly variable outdoor conditions. As Hosseini et al. [
5] demonstrated through high-resolution simulations, localized microclimatic factors—such as the urban density, vegetation, and thermal mass—can significantly influence indoor temperature decay during winter energy outages. Empirical studies have further confirmed this vulnerability. For instance, Baglivo et al. [
6] and D’Agostino et al. [
7] reported that buildings compliant with updated national energy regulations may still fail to maintain habitable conditions during extended heating interruptions.
In response, the concept of resilience has emerged as a complementary performance dimension, addressing not only how buildings operate under normal conditions but also how they perform under stress. Originally developed in ecology and materials science [
8], the notion of resilience has since been adapted across diverse fields, including engineering, sociology, and climate adaptation. In the context of the built environment, it broadly refers to the ability of a system to absorb, withstand, and recover from a disturbance while maintaining essential functions [
9,
10].
Two main theoretical frameworks currently shape the resilience discourse: engineering resilience, which emphasizes the speed of recovery and system stability [
11], and ecological resilience, which stresses the capacity to adapt, absorb shocks, and reorganize under persistent stress without losing function [
10]. Translated into building science, engineering resilience relates to robustness and the recovery of mechanical systems, whereas ecological resilience involves passive survivability, envelope behavior, and occupant adaptability [
12,
13].
Despite increasing interest, there is no consensus on how to define or operationalize thermal resilience in buildings. Peri et al. [
12] found that, while many studies invoke resilience in a general sense, few provide metrics that directly capture dynamic thermal performance during disruptions. Some authors frame resilience in terms of the envelope response and others in terms of temperature restoration or user comfort, while some include human behavior or urban infrastructure as integral components [
14,
15]. This conceptual fragmentation presents a barrier to standardization and limits comparability across studies.
Against this backdrop, thermal resilience has emerged as a more focused and measurable dimension: the ability of a building to maintain safe and habitable indoor temperatures during heating system failures or extreme weather events [
16]. It represents not only a physical response but also a public health concern, as thermal loss can lead to hypothermia, cardiovascular stress, and mortality—especially among vulnerable populations [
17,
18].
However, most current evaluations remain biased toward overheating scenarios and summer peaks. Winter resilience has received significantly less attention in both research and policy, despite potentially greater risks. Extreme cold conditions introduce unique technical challenges: intensified conductive heat loss, rapid temperature decay in low-mass buildings, and a lack of adaptive occupant responses during night-time hours [
3].
The widespread use of typical meteorological year (TMY) datasets in simulation studies exacerbates the problem by smoothing out rare but impactful weather events [
19]. Even when extreme cold years (ECY) are modeled, they often omit critical factors such as ground coupling, envelope mass, and thermal preconditioning, all of which are essential in understanding real-world resilience dynamics [
5].
Still, there is no consensus on which metrics best reflect operational resilience. For example, DoD accounts for the severity and duration of deviations from comfort but is sensitive to the observation window length and may understate performance in high-mass systems tested over long periods. Only one study [
3] has introduced and applied the WUMTP index, albeit exclusively using simulation data. To date, no research has validated WUMTP using measured indoor temperatures or adapted the metric to overheating scenarios [
3].
Additionally, the hours of safety threshold (HST), proposed by Wijesuriya et al. [
20], quantifies the duration for which a building remains within acceptable temperature bounds. While originally developed for hot climates, it has since been adapted for the assessment of winter scenarios as well.
However, most of these indicators are still primarily applied in simulation environments. As Siu et al. [
16] note, empirical validation remains rare, and most existing tools rely on standard meteorological datasets that do not reflect actual winter extremes. Moreover, no international standard currently requires performance assessment under failure conditions, creating a policy blind spot [
7,
12].
In the European context, while regulations such as the Energy Performance of Buildings [
21] promote sustainability and energy savings, they do not explicitly mandate resilience-based performance assessments. Some countries, including Italy and Finland, have introduced more stringent envelope standards [
6], yet dynamic resilience modeling is still not required. Rehman et al. [
4] show that even well-insulated houses in Nordic regions can reach unsafe indoor temperatures within hours of heating failure.
The World Health Organization [
18] recommends maintaining indoor temperatures above 18 °C in winter to reduce the risk of illness and mortality. However, these thresholds are rarely embedded in building codes or certification systems.
Taken together, the literature reveals a clear and persistent gap between the theoretical development of resilience metrics and their integration into real-world design, policy, and evaluation frameworks. While simulation methods offer a valuable starting point, they often lack standardization and robustness when applied to real failure scenarios. At the same time, regulatory frameworks continue to prioritize annual energy performance and carbon reduction targets, frequently overlooking transient risks and critical climatic extremes.
Most notably, the thermal buffering effect of ground coupling, particularly in buildings without insulated floor slabs, remains underexplored. While ground heat exchange has been modeled in some simulations, its empirical impact on resilience under real winter conditions has not been verified. Similarly, metrics such as DoD and WUMTP, although helpful, were developed for short simulation windows and may lose sensitivity in extended blackout scenarios due to penalty saturation or normalization over limited time periods. These limitations call for full-scale experimental studies that assess long-term performance and enable the more robust classification of passive survivability across structural types.
  1.1. Research Justification
Thermal resilience has become a critical concern in building science, particularly in the context of climate-induced risks such as prolonged winter cold spells, polar vortexes, and disruptions to energy infrastructure. While many buildings are optimized for steady-state energy efficiency, numerous studies have shown that this does not necessarily translate into adequate performance during extreme or disruptive conditions [
3,
7,
12].
This gap is particularly evident in heating-dominated climates, where loss of thermal services may pose severe health and safety risks. Although several simulation-based methods exist to evaluate thermal resilience, e.g., RT, DoD, and HST, they are often unvalidated against real data and tend to neglect factors such as thermal inertia, thermal history, and substructure behavior.
In particular, the literature lacks experimental investigations into how ground coupling—especially in buildings lacking floor insulation—can affect passive thermal performance during outages. Initial results suggest that preheated soil may act as a thermal reservoir, slowing the cooling process and contributing significantly to resilience. However, its interaction with envelope mass and the building typology remains empirically unverified.
Additionally, existing thermal resilience metrics such as WUMTP and DoD are typically applied in short-duration simulation studies and may fail to capture performance under extended outages. Their sensitivity to the observation window length and the potential for penalty saturation limit their comparability across different building types and durations. These limitations emphasize the need for time-based, physically interpretable indicators that can be validated against empirical data.
This disconnect underscores the urgent need for standardized approaches to resilience assessment, supported by empirically validated indicators and climate-specific thermal safety thresholds, and the stronger incorporation of resilience metrics into energy policy and retrofit practices.
Bridging this gap is essential to ensure that buildings are not only optimized for efficiency under normal conditions but are also capable of providing shelter, continuity, and protection during energy outages and climate-induced stress events. Ultimately, buildings should not be evaluated solely by how they perform on average but by how they protect when it matters most.
  1.2. Objectives and Scope of the Study
This study’s setup allowed for an isolated empirical evaluation of how thermal mass and ground contact influence indoor temperature stability during heating interruptions.
Both buildings were exposed to real weather conditions representing two seasonal scenarios: a typical winter period (TDY) and an extreme cold spell (ECY). Controlled heating outages were implemented, and indoor and outdoor temperatures were continuously monitored at a high temporal resolution. The collected data provided a basis for the evaluation of thermal decay, recovery profiles, and resilience under stress.
The analysis applies a combination of established and time-based performance metrics such as RT, DoD, and HST. Additionally, the authors propose a set of new physically grounded indicators, such as T(tx) and Tx, designed to address the limitations of existing metrics in long-duration blackout conditions. The study also examines the applicability of WUMTP to experimental datasets and benchmarks its output against the measured results.
The focus is limited to the passive thermal response of the building envelope. As such, mechanical system performance, occupant behavior, energy supply reliability, and economic or structural aspects of resilience fall outside the scope of this work. The goal is to generate empirical evidence that can inform the development of standardized resilience indicators and support their future integration into building codes, performance assessments, and retrofit strategies.
  1.3. Novelty of the Work
This study offers a novel contribution to the field of thermal resilience by providing full-scale, time-resolved experimental data on the indoor thermal behavior of residential buildings during multi-day heating outages in a winter context. While most prior studies rely on simulation-based assessments using standardized weather data, the present research captures real-time thermal responses under both typical and extreme winter conditions.
A key innovation lies in the controlled side-by-side testing of two buildings with contrasting construction systems: a lightweight timber-frame structure with an insulated foundation (B2) and a mediumweight masonry building without floor insulation (B1). This configuration enables, for the first time, the empirical isolation of ground coupling effects in cold-climate blackout scenarios. It also provides a direct comparison of passive thermal performance in buildings differing in mass and substructure behavior.
In addition to applying established indicators such as RT, DoD and HST, this study demonstrates the practical value of two new categories of resilience metrics: Tx and T(tx). These indicators are designed to complement existing metrics and better capture performance in long-duration outages.
Furthermore, this study offers the first known empirical application and validation of the full WUMTP model proposed by Homaei and Hamdy [
3]. Based on high-resolution indoor temperature data, this validation confirms the model’s overall applicability, while also revealing its limitations in long-duration scenarios due to penalty saturation. This analysis contributes to ongoing discussions about the comparability, scalability, and interpretability of penalty-based vs. time-based performance metrics.
In summary, this research introduces new methodological tools for the evaluation of thermal resilience, validates key simulation frameworks using measured data, and highlights the underexplored role of ground coupling in passive thermal survivability. These contributions offer valuable guidance for performance assessment, retrofit prioritization, and resilience-informed policy development in cold climates.
  4. Discussion of Results
The comparative analysis with Rehman et al. [
4] shows that B1 demonstrated exceptional thermal durability in both the TDY and ECY scenarios, with an extended RT, slow CS, and stable RS. Both B1 and B2 had lower CS values than any simulated cases, yet the DoD values were similar or even higher in B2.
This suggests potential methodological limitations in the DoD metric. While the DoD accounts for the cooling depth and duration, it may not scale appropriately with the duration of observation. Rehman et al. [
4] used a fixed 31 h window, whereas the present experiments extended to 80–100 h. The DoD may increase disproportionately in longer tests, even if the actual thermal resilience is superior, particularly for buildings with high inertia. This limitation was previously noted by Homaei and Hamdy [
3], who emphasized the importance of time-normalized resilience indicators.
Several factors may explain the divergence between the simulated findings of Rehman et al. [
4] and the experimental results reported here. These include differences in boundary conditions, observation windows, and modeling assumptions. In particular, the use of a fixed 31 h analysis period in simulations may underestimate the cumulative resilience in high-inertia buildings. Moreover, real-world effects such as preconditioning, ground heat exchange, and thermal bridging are difficult to replicate accurately in simulation environments. These discrepancies highlight the value of empirical data in capturing full-system thermal dynamics and assessing the practical limitations of model-based resilience predictions.
In contrast, the HST approach proposed by Wijesuriya et al. [
20] offers a more intuitive classification system, focusing on critical temperature thresholds (e.g., 15 °C). However, its binary nature (safe or unsafe) limits the resolution, especially when comparing nuanced performance aspects across buildings or climates. For operational resilience, such as flexibility under intermittent energy supplies, more detailed indicators like the time to threshold (T
18, T
15, T
12) are more informative and comparable.
The findings suggest that simplified, physically grounded metrics—such as the proposed Tx and T(tx) indicators—may offer clearer insights into resilience classification. They also enable comparisons across buildings of different masses, typologies, and observation periods without relying on simulation calibration.
An additional dimension requiring attention is the influence of ground heat exchange, particularly in the mediumweight building B1, which lacked floor insulation. From a physical perspective, the absence of insulation at the floor–ground interface facilitates one-dimensional conductive heat transfer between the building and the subsoil. During the heating phase, thermal energy is partially stored in the upper ground layers through downward conduction. When heating is interrupted, the stored energy is gradually released back into the interior, reducing the temperature gradient and slowing the cooling process. This bidirectional exchange is especially relevant in massive constructions with prolonged preconditioning and may explain the relatively stable thermal performance observed in B1 despite the colder ambient conditions.
This mechanism could explain the relatively small difference in the cooling rate between the December and February experiments despite the colder outdoor conditions. The trade-off between energy efficiency and passive resilience, especially in buildings with strong thermal mass, warrants further study.
While most of the resilience literature focuses on the overheating risk, Laouadi et al. [
27] emphasize that the indoor environment, through envelope configuration, mass, and design, can amplify or buffer thermal stress more significantly than the outdoor climate alone. Although their review centers on summer conditions in Canada, the underlying principle applies equally to cold-climate scenarios. The present results confirm that, even in the absence of active systems, a properly configured mediumweight structure can delay thermal degradation and maintain habitable temperatures beyond standard emergency response timeframes. Additionally, Laouadi et al. [
27] highlight the lack of empirical data on indoor thermal conditions during blackout events, further validating the contribution of the present full-scale experimental results.
Rui et al. [
28] also emphasize the limitations of modeling-based approaches that rely on typical meteorological years (TMY), arguing that such datasets underrepresent rare but critical events. Their analysis suggests that resilience assessment should integrate real climate extremes, envelope dynamics, and human behavior. The present findings support this view and provide empirical evidence that high-mass buildings can offer extended thermal protection during blackouts, even under severe winter conditions. Performance of this kind would likely be underestimated in simulation-based studies.
The recent study by Liyanage et al. [
29] provides additional confirmation of the severity of blackout conditions in cold climates. In their simulation-based assessment of Canadian homes, code-compliant buildings were shown to reach sub −5 °C indoor temperatures within four days of heating loss. Even with passive measures such as movable insulation, severe conditions could not be entirely avoided. In contrast, the mediumweight building (B1) considered here maintained indoor temperatures above 13 °C for more than 60 h without any dynamic mitigation strategies. This suggests that envelope mass and ground coupling can significantly enhance passive thermal survivability, sometimes more effectively than traditional retrofits. It also highlights the importance of developing resilience indicators that account for not only static efficiency measures but also dynamic thermal behavior during disruptions.
In sum, the comparison with simulated benchmarks and the existing literature underscores the value of empirical data and real-world thermal behavior tracking. The present study complements and extends previous resilience frameworks by incorporating long-duration cold scenarios, high-inertia structures, and passive soil heat buffering, offering new insights for resilience modeling, retrofit design, and cold-climate policy.
Although this study focused on the indoor air temperature as the main resilience indicator, the indoor relative humidity was also monitored. During the experiments, the RH ranged from approximately 30% (before heating shutdown) to 50% (after several days without heating). Within this range, humidity has little influence on thermal perception or comfort in cold environments. Therefore, the air temperature was considered a sufficient proxy in assessing passive thermal resilience.
The findings also have implications for regulatory and policy frameworks. Current building standards prioritize energy efficiency under normal operating conditions but often overlook thermal autonomy during service interruptions. Incorporating resilience-oriented criteria, such as passive survivability thresholds and dynamic temperature decay behavior, into building codes and performance assessment tools could enhance preparedness and safety in heating-dominated climates. Risk-informed regulations that account for thermal resilience would support more robust design strategies, especially in regions facing increasing climate variability and energy supply uncertainty.
  5. Comparative Analysis and Development of Thermal Resilience Indicators
  5.1. Application of the WUMTP Metric to Experimental Data
To benchmark our results against existing thermal resilience assessment frameworks, we applied the weighted unmet thermal performance (WUMTP) index to our full-scale experimental dataset. WUMTP, proposed by Homaei and Hamdy [
3], quantifies building resilience during and after disruptive events by aggregating penalty-weighted thermal deviations based on three dimensions: the event phase, hazard severity, and exposure time.
While this metric has been used extensively in simulation-based studies, it has not yet been widely validated using real-world measurements. Here, we apply the full WUMTP model to empirical data from two residential buildings exposed to multi-day winter blackouts.
In our calculation, we applied the full 12-segment WUMTP formulation, incorporating phase (WP), hazard level (WH), and exposure duration (WE) penalties as proposed by Homaei and Hamdy [
3]. The penalty structure distinguishes between three performance zones (acceptable, habitable, uninhabitable) and two exposure time categories (short/easy, long/difficult), resulting in a differentiated response across the temperature trajectory. The analysis covered the first 96 h of the outage, consistent with the original “four-day test framework”, and the results were normalized to the building floor area (123 m
2) to yield values in [°C·h/m
2].
Table 3 presents the calculated WUMTP values for the four test cases.
 These results confirm the superior thermal resilience of the mediumweight masonry building (B1), which maintained indoor temperatures above the critical thresholds and accumulated significantly fewer penalty-weighted degree-hours. In contrast, the lightweight structure (B2) experienced rapid and prolonged deviations from comfort conditions, resulting in more than 3.5 times higher WUMTP scores.
Importantly, these values fall within the range reported in simulation-based studies by Homaei and Hamdy [
3], where the WUMTP scores for passive designs ranged from approximately 13 to 113 °C·h/m
2. This close alignment suggests that the penalty scheme is sensitive and transferable to real-world performance and that WUMTP can be reliably applied using monitored temperature data.
Nonetheless, our results also highlight the limitations of WUMTP when used in long-duration experiments. Despite the continued indoor temperature decline in building B2 after 72 h, the WUMTP curve plateaued, due to the saturation of penalty coefficients. This dynamic may underrepresent the extended severity of exposure in lightweight structures. In contrast, our T(tx) and Tx metrics captured these differences more directly, by resolving the absolute temperature values and crossing times at key thresholds.
While the WUMTP index provides a structured and weighted approach to assessing thermal resilience, our findings suggest that it may lose sensitivity under prolonged blackout conditions—particularly in lightweight buildings, where temperatures continue to decline after penalties saturate. To address these limitations and offer a complementary perspective, we propose a time-based, physically grounded framework that captures the temporal dynamics of indoor temperature decay in a more direct and scalable manner. The following two subsections introduce the proposed indicators: the time to threshold (Tx) and temperature at X hours (T(tx)).
  5.2. Time to Threshold Indicators (Tx)
This metric captures the number of hours from the start of the heating outage until the indoor air temperature drops below a specific critical threshold. The recommended values include
T18—Adaptive comfort threshold;
T15—Passive habitability threshold;
T
12—Severe discomfort threshold, where prolonged exposure may impair thermoregulation and increase cardiovascular strain [
17];
T
9—Health risk threshold, where temperatures below 9 °C are associated with hypothermia and increased risks of acute health events in elderly or vulnerable occupants [
17,
18].
The output is a time-based vector, [T18, T15, T12, T9], which allows for the simple and meaningful comparison of thermal durability across building types, climates, and operational contexts.
  5.3. Temperature at X Hours Indicators (T(tx))
In this approach, the indoor air temperature is measured at fixed time intervals after heating has been disabled; there are typically 12 h, 24 h, 36 h, 48 h, and 60 h.
This generates a temperature vector, [T(t12), T(t24), T(t36), T(t48), T(t60)], which can be plotted as a thermal decay curve or used to construct point-based resilience scores. These values are immediately interpretable by practitioners and policymakers and can be linked to critical exposure thresholds or emergency response timeframes.
  5.4. Advantages of the Proposed Method
The proposed time-based resilience framework offers several advantages over existing approaches. First, it is independent of the observation duration, which enables valid comparisons across studies with differing test lengths and monitoring periods. Second, it maintains a high degree of physical transparency—being directly based on temperature and time data, it avoids statistical abstraction and remains closely aligned with occupant-relevant outcomes.
Another key strength of the method is its scalability and replicability. It can be applied in a wide range of contexts, including full-scale field experiments, dynamic building simulations, and post-occupancy evaluations. Its structure also supports flexible classification schemes. For example, buildings can be grouped into standardized resilience classes based on T
18 thresholds, such as
		
        offering a practical and communicable basis for policy or design guidance. In addition, the framework lends itself well to visual representation, allowing for the development of tools such as resilience maps or performance envelopes. These, in turn, can complement probabilistic modeling and long-term risk assessments under future climate scenarios, helping to bridge the gap between research and actionable insights in thermal safety and climate adaptation.
  6. Example Application of the Methodology
To demonstrate the applicability of the proposed metrics, 
Table 4 presents the recorded indoor temperatures for buildings B1 (mediumweight) and B2 (lightweight) during controlled heating outages in December and February.
The results show a clear difference in thermal performance between the two buildings. The mediumweight masonry structure with no floor insulation (B1) maintained indoor temperatures above the 15 °C habitability threshold for the entire 60 h testing period in both December and February. In contrast, the lightweight timber-frame building (B2) fell below this threshold within the first 24 h and dropped to unsafe levels below 12 °C between 48 and 60 h after heating was disabled. Notably, the contrast in resilience between the two buildings appears to be driven more by ground coupling and differences in the construction type and thermal mass than by variations in outdoor conditions between the two testing periods. To further illustrate the usefulness of the proposed T(t
x) metrics, 
Figure 5 and 
Figure 6 present the hourly indoor temperature trajectories of buildings B1 and B2 during the heating outages in February and December, respectively. Horizontal lines in each graph mark the proposed resilience thresholds: 18 °C (adaptive comfort), 15 °C (habitability), 12 °C (severe discomfort), and 9 °C (health risk).
As shown in 
Figure 5, under December conditions (T
o,avg = +0.7 °C), the lightweight building (B2) experienced a steady and rapid decline in indoor temperature, reaching the 15 °C threshold in approximately 25 h and the 12 °C threshold in about 55 h. In contrast, the mediumweight structure (B1), which had an uninsulated floor, maintained temperatures above 15 °C for approximately 60 h and recorded only a slightly lower temperature over the remaining 57 h of the blackout.
Figure 6 illustrates the hourly indoor temperature trajectories during the February outage (T
o,avg = −4.5 °C). In this case, B2 dropped below 15 °C within approximately 30 h and below 12 °C after 55 h. Starting from the fifth day of the blackout, the night-time temperatures in B2 fell to around 7 °C. B1, on the other hand, remained well above the habitability threshold for the first 60 h, and its temperature never dropped below 13 °C during the entire 7-day heating outage.
 Interestingly, the performance gap between the two buildings appears to be even more pronounced in December than in February. This may be attributed not only to the higher thermal mass of B1 but also to the absence of floor insulation, which enabled direct thermal coupling with the ground. As the buildings had not been heated from the beginning of the heating season in October until one week before the December experiment, the soil beneath the uninsulated floor remained relatively cold. Consequently, when the heating was switched off, B1 cooled more rapidly than in the February experiment. In the latter case, the indoor temperatures had been maintained at 21 °C in the intervening period, allowing the ground below the floor to gradually warm up and buffer heat loss more effectively.
These visual results clearly demonstrate the practical value of the T(tx) framework in identifying resilience classes and guiding passive design decisions. They also underscore the potential of ground-coupled mediumweight construction to passively sustain safe indoor conditions during extended periods without heating.
  7. Conclusions
This study evaluated the thermal resilience of two full-scale residential buildings with contrasting construction typologies—mediumweight masonry (B1) and a lightweight timber frame (B2)—under both typical and extreme winter conditions. Indoor temperature decay and recovery were monitored during controlled heating shutdowns. Resilience performance was assessed using standard metrics (RT, DoD, HST) and newly proposed time- and temperature-based indicators (Tx and T(tx)).
Key findings include the following:
- The mediumweight building (B1) maintained indoor temperatures above the 15 °C habitability threshold for over 60 h, despite the absence of floor insulation or additional passive strategies. This highlights the combined effect of thermal mass and ground coupling. 
- The lightweight building (B2), although compliant with modern energy standards, exhibited faster cooling and crossed critical thresholds (below 12 °C) within 55 to 60 h, emphasizing the influence of the construction type on resilience outcomes. 
- In comparison with simulation-based assessments, the results indicate that envelope mass and substructure behavior may play a more decisive role in passive survivability than steady-state efficiency ratings. 
While the DoD and HST provide valuable insights, these metrics may lack sensitivity when applied to high-inertia structures or long-duration outages. To enhance the applicability, a complementary framework based on the time to threshold and temperature after X hours is proposed.
The results support the use of simplified, physically interpretable metrics in evaluating thermal resilience and emphasize the importance of validating simulation-based approaches with empirical data. The proposed indicators may also contribute to performance classification, retrofit prioritization, and risk-based design practices in heating-dominated regions.
Further research is recommended to expand this framework to additional building typologies, climatic conditions, and occupant behaviors. Interactions between passive and low-energy active systems should also be considered. The observed ground thermal buffering effect in B1 reveals a promising but underexplored aspect of resilience that warrants further investigation through both experimental and modeling approaches. This study also includes the first known empirical application of the full WUMTP model. The calculated values were found to be consistent with those observed in prior simulation-based studies, confirming the metric’s applicability to measured indoor temperature data. However, limitations emerged in extended blackout scenarios, where penalty saturation tended to obscure ongoing performance degradation. These findings reinforce the value of complementary time-based indicators in capturing long-term resilience dynamics.
  8. Limitations of the Study
While the findings of this study provide valuable empirical insights into building thermal resilience during winter outages, several limitations must be acknowledged.
First, the sample size and typology were limited to two full-scale experimental buildings, each representing a distinct construction system: a lightweight timber frame and mediumweight masonry. While these cases capture key envelope differences, they do not reflect the full diversity of residential building typologies, especially hybrid or high-performance structures featuring advanced materials or dynamic façade elements.
Second, the climatic context of the experiments was confined to a single cold-climate location under both typical and extreme winter conditions. Although the results are relevant for similar temperate climates, their generalizability to other, warmer climate zones requires further validation.
Third, the buildings were unoccupied throughout the testing period. In real-world scenarios, factors such as occupant behavior, intermittent heating, window opening, and zonal thermal management could significantly influence resilience outcomes. The lack of human interaction may thus limit the realism of the thermal response data.
Fourth, the study focused exclusively on passive resilience, assessing the thermal behavior of buildings without the influence of active backup systems (e.g., battery-supported heat sources or localized electric heaters). As such, the results emphasize passive survivability but do not account for hybrid or adaptive strategies that might be used in emergency conditions.
Finally, while the buildings were maintained at 21 °C prior to each experiment to ensure thermal stabilization, the initial preconditioning period may not have been sufficient—particularly in the mediumweight structure with an uninsulated floor slab. In the December test, the buildings had been unheated before the five-day stabilization period began. Our assumption that five days of heating would establish equilibrium thermal conditions proved insufficient in B1, where the ground-coupled thermal mass likely required a longer period to reach a thermal steady state. As a result, the ground temperatures and their contributions to passive heat exchange may have been underestimated or unequally distributed across the test scenarios. Future studies should more precisely control and measure the thermal history of building–ground systems, especially when evaluating resilience in structures with significant subfloor interactions.