1. Introduction
Climate change necessitates a fundamental transformation in energy systems and technology, influencing multiple sectors and driving the need for innovative solutions [
1]. The building industry is responsible for approximately 27% of global energy usage and contributes around 17% of total carbon dioxide (CO
2) emissions [
2].
The high energy consumption in the building sector is mainly due to the significant dependence on heating and cooling systems, as well as the widespread use of various electrical devices [
3].
Energy efficiency plays a fundamental role in the design and operation of environmentally friendly and sustainable buildings [
4]. In the context of buildings, energy efficiency can be approached from various perspectives, with two key areas drawing significant focus from architects and engineers: the creation and development of energy-efficient structures, particularly concerning new construction projects and the retrofitting and enhancement of energy performance in existing buildings [
5].
Building energy efficiency strategies have evolved considerably over time, progressing from passive design approaches in early construction (i.e., thermal mass, natural ventilation, building orientation) through 20th-century active systems (i.e., heating, ventilation and air conditioning, heat pumps, advanced glazing) to contemporary integrated solutions combining smart building technologies, renewable energy systems, and advanced materials such as phase change materials and dynamic insulation [
6,
7,
8]. This evolution reflects growing recognition that energy efficiency must address not only technical performance but also economic viability, environmental sustainability, and occupant comfort [
9].
Improving energy efficiency in both new and existing buildings offers a rapid approach to mitigating the environmental, economic, social, and other challenges within the construction sector [
10]. Sustainable construction policies encompass a range of approaches designed to promote eco-friendly practices and optimize resource use throughout the building process [
11].
Over the past few years, energy efficiency has gained significant attention as a key strategy for mitigating carbon emissions, optimizing energy utilization, enhancing building performance, and contributing to the creation of sustainable built environments [
12].
Building energy management strategies focus on optimizing energy usage through various techniques like predictive modeling, optimization algorithms, and machine learning. Common approaches include particle swarm optimization, genetic algorithms, and demand-side management to improve efficiency while ensuring comfort. Additionally, fault detection and diagnosis techniques help identify system issues in real time. These strategies, supported by advanced software, enhance building performance by reducing energy consumption and promoting sustainability [
13].
The significance of technological transfer in attaining energy efficiency in buildings and the development of Zero Energy Buildings (ZEB) resides in its capacity to connect research innovations with practical applications, guaranteeing the effective implementation of advanced solutions in the construction industry. The Triple Helix concept, which fosters collaboration among academics, industry, and government, has produced significant outcomes, promoting sustainable practices and expediting the use of energy-efficient technologies in the built environment [
14].
Bibliometric analysis has emerged as an increasingly important approach for mapping research landscapes in building energy efficiency, offering systematic frameworks for identifying influential publications, tracking thematic shifts, revealing collaboration patterns, and mapping emerging frontiers. However, existing bibliometric investigations demonstrate significant methodological and scope limitations that constrain comprehensive understanding of the field’s evolution. The majority of studies focus on narrow research subdomains, including zero-energy buildings [
15], specific technologies such as machine learning applications [
16], or specialized topics like biomimicry [
17]. Temporal scope represents a critical limitation, with most investigations covering periods of 10–15 years, insufficient for capturing the complete transformation from foundational 1970s–1980s innovations through contemporary integrated approaches. For instance, studies by Wang et al. [
18] and Varshabi et al. [
17] examined only 2006–2018 and 2010–2021 respectively, thus not encompassing foundational research periods essential for understanding technological evolution. Previous bibliometric investigations of building energy efficiency face significant methodological limitations. Most studies suffer from one or more constraints: narrow subdomain focus (e.g., examining only zero-energy buildings or specific technologies), restricted temporal coverage (typically 10–15 years), single-database dependence, or manual keyword categorization that cannot capture semantic variants of identical concepts. Critically, no prior study has employed computational terminology normalization techniques such as TF-IDF vectorization and cosine similarity to address the field’s fundamental challenge: evolving nomenclature where identical concepts appear as multiple terminological variants (e.g., “net zero energy building,” “zero energy building,” “nearly zero energy building”). This computational approach is essential for comprehensive, long-term bibliometric analysis because building energy efficiency research has developed inconsistent terminology over four decades, making manual categorization both imprecise and non-replicable across large datasets.
To outline the methodological and conceptual distinctions of existing research,
Table 1 summarizes previous bibliometric investigations in building energy efficiency and related subdomains, focusing on their temporal scope, analytical depth, and database selection.
This research addresses these limitations by providing a comprehensive bibliometric analysis of building energy efficiency research spanning four decades (1981–2025). This extended temporal scope is essential for capturing the field’s complete maturation trajectory: from foundational research establishing core thermal physics principles in the 1980s–1990s, through the emergence of integrated design approaches following the Kyoto Protocol (1997) [
22], to the dramatic acceleration coinciding with the Paris Agreement (2015) [
23] and subsequent net-zero commitments by major economies. Understanding this four-decade evolution is critical for identifying how research priorities have responded to, and anticipated, major policy shifts, and for revealing the knowledge accumulation patterns that enabled contemporary integrated building performance frameworks. While this study does not directly implement building energy efficiency measures, it provides evidence-based guidance for research strategy and policy development, by identifying productive research clusters, emerging technologies, and underexplored areas requiring targeted investment.
The bibliometric analysis introduces two critical methodological innovations that enhance analytical rigor and reproducibility. First, this study employs Pruned Exact Linear Time (PELT) changepoint detection to objectively identify temporal transitions through statistical validation, replacing the arbitrary decade-based periodization commonly employed in bibliometric investigations. Second, automated keyword harmonization through TF-IDF vectorization (n-grams 2–3) and cosine similarity algorithms (threshold = 0.75) addresses the semantic inconsistency challenge inherent in evolving technical nomenclature, a computational approach not previously applied in building energy efficiency bibliometrics. This study aims to systematically map the temporal evolution of research priorities from foundational thermal physics through contemporary smart building integration, examine geographic distribution and international collaboration networks across countries, identify institutional leadership patterns and research productivity dynamics. It also traces the thematic evolution from component-focused investigations to holistic building performance optimization and reveals citation impact patterns that highlight influential breakthroughs and emerging research frontiers. The analysis integrates publication trend assessment, keyword co-occurrence mapping, collaboration network visualization, and citation impact evaluation to provide actionable intelligence for researchers, policymakers, and industry stakeholders navigating the transition toward net-zero building portfolios.
2. Materials and Methods
The identification of key documents in the field was facilitated by the construction of a search algorithm, which incorporated the Boolean operators OR and AND to expand the search area and select relevant documents. For minimizing the number of false positive results included in this analysis, techniques such as exact phrase matching, which involves enclosing the word or phrase in quotation marks (e.g., “keyword”), and the wildcard operator *, which allows matching all variations of a word regardless of its ending (e.g., efficient* will identify terms such as efficiency and efficient), were used. These strategies have been integrated to ensure an accurate and rigorous selection of relevant literature, minimizing interference and improving the quality of the results obtained. The initial search query employing wildcard operators yielded 77,120 results from the Web of Science Core Collection; however, preliminary analysis revealed substantial numbers of false positive results due to broad pattern matching. To enhance precision, the query was refined through exact phrase matching by enclosing key terms in quotation marks (e.g., “energy efficien*” rather than energy efficien*), reducing the dataset to 18,449 documents while eliminating irrelevant publications that contained the search terms in non-building energy contexts. Subsequently, systematic filters were applied to restrict the dataset to articles and review papers written in English, ensuring compatibility with the computational text processing framework and resulting in a final analytical dataset of 14,253 documents (
Figure 1). This multi-stage refinement process prioritized relevance and analytical precision over comprehensive coverage, focusing the investigation on peer-reviewed research directly addressing building energy efficiency. This systematic approach improved algorithm performance and ensured rigorous data selection. Data for this investigation were retrieved exclusively from the Web of Science Core Collection. The decision to rely on a single database was driven by the need for consistent data architecture and quality standards throughout the analysis. Crucially, the database provides uniformly structured metadata fields, which proved essential for the computational methods employed in this study.
The analytical pipeline integrating VOSviewer (version 1.6.20) for network visualization, Bibliometrix 4.3.0 package in R. for statistical analysis, and custom Python 3.12.3 algorithms for computational keyword harmonization depends critically on standardized data formats. Consistent field structures across records enabled accurate processing of co-authorship networks, precise temporal trend analysis, and reliable thematic evolution mapping without the data integration challenges and systematic inconsistencies that arise when merging multiple bibliographic sources. Beyond these immediate technical considerations, the single-database approach proves particularly critical for the changepoint detection methodology employed in this study. The PELT algorithm applied to identify temporal transitions in publication activity requires uninterrupted, consistently measured time series data to accurately detect statistical breakpoints. Introducing data from multiple sources would create artifacts in the temporal signal due to varying indexing delays, different inclusion criteria across databases, and inconsistent publication date recording conventions. Similarly, the thematic evolution analysis depends on uniform keyword assignment practices maintained by Web of Science indexers, whereas merging databases with disparate controlled vocabulary systems would introduce noise into the co-occurrence networks and potentially obscure genuine thematic transitions. These methodological requirements demonstrate that for advanced computational bibliometric analyses, database consistency constitutes a prerequisite for analytical validity rather than merely a convenience factor.
The choice of software programs used is of paramount importance for the execution of a bibliometric analysis. The present study primarily employed VOSviewer (version 1.6.20), a widely utilized software program for bibliometric analysis, which facilitates the generation of bibliometric maps with ease. The Bibliometrix library (version 4.3.0) was employed for detailed analysis of bibliometric data, in conjunction with the Biblioshiny web interface, which significantly facilitates the use of the library. Microsoft Excel was utilized for the cleaning of bibliometric data and the creation of graphs. The Bibliometrix library, via the Biblioshiny interface, was executed using RStudio (version 4.4.2), an Integrated Development Environment for the R language. This combination of software tools enabled a rigorous bibliometric analysis and clear visualization of the results [
24,
25].
The annual number of publications was extracted using the Bibliometrix library, and the resulting data were visualized to depict trends in scientific output over time. Graphical representations of publication trends were generated using Microsoft Excel, facilitating a clear understanding of temporal patterns. The Mean Total Citations per Year (MeanTCperYear) metric was employed as a normalized measure of citation impact. This metric was calculated for each publication by dividing the total number of citations received by the number of years since publication. These values were computed automatically using the Biblioshiny web interface, ensuring consistency and accuracy. Country-level and institution-level scientific production metrics were derived from data exported via VOSviewer (version 1.6.20), allowing for an in-depth analysis of geographical and institutional contributions. Temporal trends in scientific output by country and institution were analyzed using the Biblioshiny interface, providing valuable insights into the evolution of research activity across regions and organizations.
In the visualization of international research collaboration networks, each nation’s academic productivity is indicated by the diameter of its corresponding circle with more prolific countries represented by more expansive circular elements. The degree of academic partnership between nations manifests in the thickness of the connective elements joining their respective circles, where more substantial lines signify more extensive collaborative engagement. The network’s structural organization employs a color-coding methodology to delineate distinct collaborative communities. These chromatic groupings highlight the natural clustering of nations that maintain particularly robust research relationships, as countries within shared color designations typically demonstrate heightened levels of academic interchange and cooperative scholarly endeavors.
Temporal research mapping provides visual representations that capture the developmental patterns of scientific themes across successive periods. These graphs enable researchers to observe the transformation of research priorities and to monitor the emergence or decline of specific topics within disciplines. The Bibliometrix platform generates these graphs through comprehensive co-word analysis and clustering methodologies, thereby revealing the progressive development of scientific knowledge across temporal intervals.
To identify key transition points in publication trends objectively, the PELT algorithm was applied using the Python ruptures library (version 1.1.0) with a Radial Basis Function (RBF) model. This algorithm efficiently detects significant shifts in time series data by identifying changes in statistical patterns. The analysis examined annual publication volumes from 1981 to 2024 to detect major transitions in research activity, with data handling performed using pandas (2.0.0) and numpy (1.21.0). To ensure reliability, the algorithm was tested with five different penalty parameter values (1, 2, 3, 4, 5), which control detection sensitivity, lower penalties identify more potential breakpoints while higher penalties require stronger evidence for change detection. Only breakpoints consistently identified across multiple tests were retained, specifically, breakpoints appearing in at least three out of five analyses were classified as robust and used for temporal periodization. The detected periods were then statistically validated using Mann–Whitney U tests implemented through scipy (1.9.0) to confirm significant differences between consecutive time periods. This data-driven approach provides objective evidence for defining distinct research phases, replacing arbitrary decade-based divisions with statistically supported temporal boundaries.
Trending research topics are analyzed to identify and visualize how scientific themes change over time. This methodological framework enables researchers to observe the increasing prominence of emerging research domains and the diminishing focus on established areas. Through the Bibliometrix platform’s capability to analyze keyword frequency distributions across temporal periods, researchers can conduct robust assessments of evolving research landscapes.
The map of the journal collaboration network shows each node as a source, with the size of the node proportional to the number of documents published by that source. The color of each node shows the average year of publication of documents from that specific source, offering valuable insights into the trends over time. The lines that connect two nodes show how documents from one source are mentioned in the other, and the thickness of the lines shows how important the connection is. If a line is thick, it shows that there are more mentions of one document in another.
The utilization of a singular language in bibliometric analysis constitutes the inaugural step in the standardization process. Subsequent to this, the generation of thesaurus documents is imperative, as these facilitate the unification of terms that may be synonymous or manifest in diverse forms. To illustrate this point, the standardization of country names necessitates thesaurus generation for harmonizing disparate variants of the same name or to incorporate regions such as Wales, which is a constituent of the United Kingdom (UK). The generation of a thesaurus for keywords is a more complex undertaking, given the vast number of terms requiring analysis. The implementation of a normalization script written in Python 3.12.3 was developed, with a focus on a specialized thesaurus generator designed to enhance the analysis of keyword trends and the study of thematic evolution (
Figure 2). The generator employs character-level TF-IDF vectorization with n-grams (2–3-character sequences) and cosine similarity metrics (threshold = 0.75), implemented using scikit-learn (1.1.0) with supporting data processing through numpy (1.21.0), in combination with comprehensive text preprocessing, to identify and cluster semantically and lexically related terms. A cosine similarity threshold of 0.75 was selected based on empirical testing: lower thresholds (e.g., 0.6–0.7) produced clusters with excessive noise and unrelated terms, while higher thresholds (>0.8) fragmented semantically coherent groups. A value of 0.75 provided the best balance between precision and recall. While semantic embedding approaches using pre-trained language models (e.g., BERT) were evaluated, they proved unsuitable due to the lack of domain-specific training on building energy efficiency terminology. The character-level n-gram approach was selected for its superior ability to capture lexical variation, subtle morphological patterns, and spelling inconsistencies frequently encountered in highly specialized technical nomenclature across diverse datasets and contexts, ensuring improved consistency and overall model robustness.
4. Discussion
Over time, research in building energy efficiency has evolved from focusing on individual components and systems to adopting more integrated and holistic approaches.
The level of scientific interest displayed in the field of building energy efficiency has undergone significant evolution over time. Distinct periods of growth and development can be identified in this area of research. In recent decades, the field has undergone considerable expansion, reflecting the mounting global emphasis on sustainable construction and energy conservation in buildings.
The PELT changepoint analysis identified 2011 as a statistically significant transition point in publication activity. This breakpoint reflects the convergence of major policy initiatives that created unprecedented research demand: the European Union’s Energy Performance of Buildings Directive recast (2010/31/EU), introducing mandatory Nearly Zero-Energy Building targets [
26], China’s 12th Five-Year Plan (2011–2015) established targets for energy intensity reduction [
27] and the United States adopted ASHRAE Standard 90.1-2010, which demonstrated 18.2% energy savings over previous standards [
28].
The foundational period (1981–2011) was characterized by pioneering research primarily led by the USA, suggesting its foundational role in establishing this field of study. Within this extended formative phase, observable sub-phases emerged: the initial years (1981–2000) showed relatively modest publication output with limited international participation, while the subsequent decade (2001–2010) witnessed considerable transformation with the emergence of novel research centers, particularly within the Asian region. This period marked the advent of Hong Kong institutions into the field, with the City University of Hong Kong and the Hong Kong Polytechnic University attaining distinction as substantial contributors, signifying mounting global recognition of building energy efficiency as a pivotal research domain. The contemporary period (2012–2025) represents the most dramatic growth phase, characterized by China’s emergence as the dominant force in this research domain, with the 2011 breakpoint marking a fundamental shift from exploratory investigations to mature, application-focused research. China’s remarkable ascent is evidenced by its publication output of 2978 papers, substantially surpassing the USA (1478 papers) and the UK (1421 papers). This surge in Chinese research output coincides with the country’s increased focus on sustainable development and environmental protection measures.
The apparent paradox of China’s high publication volume (2978 papers) coupled with lower average citation impact compared to the UK (33.24 citations/paper) and USA (33.15 citations/paper) can be primarily attributed to citation lag effects rather than research quality differences. As demonstrated by the PELT analysis, China’s research productivity surge occurred predominantly after 2011, with the country transitioning from minimal output in the foundational period to dominant contributor status in the contemporary period. Consequently, the majority of Chinese publications are temporally recent and have had substantially shorter citation windows compared to UK and USA publications, which maintain more balanced temporal distribution across both foundational and contemporary periods. Citations accumulate gradually over years following publication, with peak citation rates typically occurring 3–5 years post-publication. Thus, China’s concentration of output in the 2012–2025 period inherently constrains average citation metrics when calculated across the entire corpus. Additional contributing factors may include the UK and USA’s established roles as central hubs in international collaboration networks (
Figure 7), which tend to enhance citation visibility, and the longer research tradition in these countries enabling publication in higher-impact specialized journals.
At the institutional level, the pattern of growth mirrors the country-level trends, with Chinese institutions, particularly those in Hong Kong, assuming leadership positions. The Hong Kong Polytechnic University’s output of 458 articles and the City University of Hong Kong’s contribution of 266 articles underscore the region’s emergence as a major research hub in building energy efficiency.
The co-authorship network map reveals distinct geographical clustering and collaboration patterns that illuminate the global research ecosystem in building energy efficiency. The visualization demonstrates three primary clusters that reflect both regional strengths and international partnerships. It is evident that while regional research hubs maintain their importance, successful research in this field increasingly relies on international collaboration. This global approach enables the sharing of diverse perspectives, methodologies, and solutions, which is particularly valuable given the varying climatic, technological, and regulatory contexts across different regions. The observed collaboration patterns indicate that future research endeavors should concentrate on the consolidation of cross-regional partnerships, whilst concurrently preserving the unique expertise cultivated within each distinct cluster. This balanced strategy would serve to address both the local building energy challenges and the global sustainability objectives. The European cluster’s cohesion reflects structural policy frameworks that mandate cross-border collaboration. EU Framework Programmes (FP7, Horizon) [
29,
30] require collaborative projects to include multiple organizations from different EU Member States or Associated Countries, creating policy-driven incentives for multi-country research networks. The Concerted Action for the Energy Performance of Buildings Directive (CA EPBD), launched in 2005 and continuing through multiple phases, established systematic platforms for knowledge exchange among all EU Member States and Norway specifically for implementing building energy efficiency directives. This combination of mandatory collaboration requirements and sector-specific coordination mechanisms transformed building energy efficiency research from isolated national efforts into integrated European networks.
Different research trends have been identified during the period under study, denoting the significant evolution demonstrated by this field. In the period before 2000 when interest in this field was still relatively low research during this phase focused primarily on individual components and systems, as exemplified by Athienitis et al.’s (1997) work on PCM in passive solar applications [
31]. This period also saw the emergence of innovative approaches to building envelope design, such as dynamic insulation (Taylor and Imbabi, 1998) [
32]. Notably, the economic dimensions of energy-efficient construction began to receive scholarly attention, with Bartlett and Howard (2000) providing critical analysis that challenged prevailing cost assumptions about green buildings [
33]. The early 2000s witnessed a paradigm shift toward environmentally conscious building design. Bell and Lowe’s (2000) seminal study demonstrated the feasibility of achieving 50% emissions reduction through housing modernization using existing technologies [
34]. This period also saw the emergence of comprehensive evaluation methodologies, exemplified by Swan and Ugursal’s (2009) development of sophisticated building energy performance modeling frameworks [
3]. The integration of environmental considerations into building design marked a significant departure from purely efficiency-focused approaches. Recent documents in this field highlight the maturation of the research topic, characterized by the advancement of integrated building energy solutions. Marszal et al.’s (2011) seminal work on ZEBs established a foundational framework for comprehensive energy-neutral design [
35]. More recently, Wang et al.’s (2022) critical analysis of PCM applications [
36] and Hafez et al.’s (2023) systematic review of energy efficiency strategies [
37] underscore the field’s current focus on holistic sustainability approaches.
To better situate the evolution of the field within its broader global context, a timeline of major policy and technology milestones is provided below.
Figure 12 presents a chronological timeline of key developments that have shaped building energy efficiency research and help contextualize the 2011 breakpoint.
The 2011 breakpoint coincides with a rare confluence of policy and technology shifts that reshaped building-energy research trajectories: the launch of ISO 50001 established a global energy-management standard applicable to building portfolios and facility operations, creating a shared framework for measurement and continual improvement [
38].
In Europe, the recast EPBD (2010/31/EU) had just mandated nearly zero-energy buildings for the coming decade, catalyzing methodological work and national transpositions from 2010 onward [
26].
In the United States, the DOE 2011 determination on ASHRAE 90.1-2010 ratcheted commercial-building code stringency and signaled measurable savings versus the 2007 edition, prompting code adoption and compliance research. This regulatory advancement established that newly constructed commercial buildings adhering to the updated standard would demonstrate quantifiable reductions in energy consumption when benchmarked against structures built under previous code requirements [
39].
At the same time, the IEA’s 2011 update of its 25 Energy Efficiency Policy Recommendations renewed high-level commitments on building efficiency and appliances, reinforcing a global policy push. This policy refresh documented substantial progress in cross-sectoral implementation, revealing that numerous countries had transitioned from planning stages to active deployment of stringency measures initially proposed in 2009. The evaluation highlighted a coordinated international effort toward enhanced building codes, with nations such as Canada, Korea, Luxembourg, the Netherlands, and the United Kingdom introducing strengthened requirements for new construction. Additionally, the systematic expansion of minimum energy performance standards for appliances and equipment demonstrated a maturing regulatory landscape. The IEA’s renewed emphasis on compliance monitoring, enforcement mechanisms, and standardized measurement protocols signaled a strategic shift from aspirational targets to accountable implementation, thereby catalyzing a synchronized global movement toward verifiable energy efficiency achievements in the built environment and consumer product sectors [
40].
On the technology side, the PV market inflected sharply in 2010–2011, with documented price declines and an 82% annual growth rate of cumulative installed capacity across IEA-PVPS countries and ≥1 GW annual additions in six nations, conditions that accelerated work on nZEB/BIPV and integrated building-energy systems [
41,
42].
Crucially for the post-2011 publication surge and China’s prominence, the 12th Five-Year Plan (2011–2015) set a binding −16% energy-intensity target [
43] and was followed by the 2013 Green Building Action Plan (targeting ~1 billion m
2 of new green buildings) [
44], which together expanded funding, demonstration, and regulatory activity.
Consolidation continued immediately after the breakpoint with the EU Energy Efficiency Directive (2012/27/EU), which introduced binding measures (audits, renovation obligations, public-sector leadership) [
45], and then with the Paris Agreement (2015), which anchored national commitments and finance, both amplifying the post-2011 research wave rather than creating it [
23].
It is imperative to prioritize energy efficiency in the construction of sustainable structures by employing a variety of strategies that are designed to reduce environmental footprints. These strategies include the integration of building-mounted photovoltaic systems and solar panels, the utilization of materials that conserve resources, and the exploration of cutting-edge energy technologies. These strategies also involve enhancing illumination solutions, conducting energy simulations, assessing thermal transfer properties, optimizing building envelopes, incorporating wind turbines into skyscrapers, instituting energy conservation strategies, and employing multi-objective optimization techniques [
2].
The field has evolved from the provision of focused technical solutions to the delivery of comprehensive approaches that incorporate multiple sustainability dimensions. This progression is indicative of two interrelated phenomena: firstly, the advancement of technology, and secondly, the growing awareness of the role of building energy efficiency in addressing global environmental challenges. The future direction of research will continue to integrate smart technologies and to place greater emphasis on resilient, adaptive building systems.
While this bibliometric investigation provides comprehensive insights into building energy efficiency research evolution, several methodological limitations should be acknowledged. The exclusive use of Web of Science Core Collection represents a deliberate choice that prioritizes analytical precision over comprehensive coverage. This single-database approach ensures consistent metadata structures, standardized citation linkages, and uniform field coding all necessary for reliable network analysis, temporal trends, and computational keyword normalization.
The restriction to publications written in English represents a deliberate methodological necessity for the quantitative computational analysis employed in this study. The computational terminology normalization framework utilizing TF-IDF vectorization and cosine similarity algorithms operates on character-level and word-level patterns that are fundamentally language-specific and cannot validly compare terms across different linguistic systems. Attempting multilingual analysis would require either separate processing pipelines for each language and thus fragmenting the dataset and preventing integrated thematic evolution analysis, or automated translation systems that introduce semantic drift and compromise keyword clustering precision. Notably, the dataset demonstrates substantial representation from non-English speaking regions, with China alone contributing 2978 papers and dominating the contemporary period, indicating that researchers from diverse linguistic backgrounds actively publish in English to achieve international scholarly visibility.
The PELT-based temporal periodization, while statistically robust, identifies breakpoints in publication volume rather than thematic transitions. As a result, these statistically detected periods may not perfectly align with major policy initiatives or technological breakthroughs that shaped the field conceptually. Additionally, the keyword co-occurrence analysis depends on author-assigned terms and database indexing conventions, which may not fully capture underlying research themes or interdisciplinary connections that emerge implicitly across studies. Future investigations could benefit from hybrid approaches that integrate multiple databases and languages while maintaining the analytical rigor required for reliable bibliometric network construction and computational text analysis.
5. Conclusions
Energy efficiency research in buildings has evolved through several distinct phases of growth and diversification. The rise in worldwide collaboration, coupled with an intensified focus on climate change mitigation and energy optimization, has advanced the field toward more sophisticated, integrated solutions. Consequently, the emphasis is now transitioning to innovative materials, optimization of energy systems, and the creation of ZEBs.
This bibliometric analysis establishes a computationally enhanced framework with direct implications for academic research strategy and methodological transferability. The collaboration network analysis identifies four distinct geographic clusters, providing researchers with evidence-based guidance for optimizing international partnerships and maximizing citation impact through strategic co-authorship positioning. Methodologically, the integration of PELT changepoint detection with TF-IDF-based terminology normalization addresses two persistent challenges in bibliometric research: objective temporal periodization and automated keyword harmonization across evolving technical nomenclature. This approach is directly adaptable to other rapidly evolving interdisciplinary domains. The framework demonstrates that computational text processing techniques can enhance traditional bibliometric analysis without requiring specialized language models, making it accessible for emerging research fields.
Future research in energy efficiency in buildings is anticipated to focus on artificial intelligence integration, energy storage systems, and sustainable building materials. The advent of PCM and innovative technologies such as graphene oxide, in conjunction with Building Information Modeling, is expected to significantly influence future advancements. Cross-regional cooperation will be increasingly vital as global sustainability objectives become more interrelated, necessitating solutions that reconcile local requirements with international standards. Furthermore, with a heightened emphasis on carbon neutrality, forthcoming research will probably investigate more extensive frameworks for assessing environmental effect, incorporating policy recommendations, and addressing urbanization difficulties. From a bibliometric perspective, future investigations could extend this computational framework through comparative analysis across multiple databases (e.g., Scopus, regional repositories) to validate temporal patterns, integration of natural language processing for full-text analysis to capture implicit thematic connections beyond keywords, and tracking real-world research impact through policy citations and patent references to assess practical implementation influence.