The empirical strategy adopted for bibliometric analyses was based on the integration of quantitative and qualitative methods. The quantitative dimension was operationalized through Bibliometrix, which enabled automated processing of bibliographic metadata extracted from databases, generating indicators of scientific productivity, citation patterns, temporal evolution of publications, and identification of principal authors, journals, and countries involved in scientific production on the theme. The qualitative analysis complemented this approach through systematic reading and thematic categorization of selected studies, enabling the identification of conceptual trends, methodological gaps, and research opportunities at the intersection between sustainability indicators and Business Intelligence in educational contexts.
3.1. Quantitative Analysis
This section presents a detailed quantitative analysis of the 36 selected studies, exploring essential bibliometric indicators including temporal evolution of publications, geographical distribution of authors, predominant themes, and profile of scientific vehicles. The analyzed corpus comprises documents published between 2021 and 2025, evidencing the contemporary character of research on Business Intelligence, Big Data Analytics, and sustainable practices in educational management across diverse contexts. Quantitative analysis constitutes a fundamental element for understanding the structural characteristics of scientific production in the area, enabling the identification of collaboration patterns, growth trends, and thematic concentrations that orient field development.
Specific bibliometric metrics map the volume and temporal distribution of publications, assess scientific impact, methodological diversity, and principal theoretical currents that ground research in educational BI. This quantitative approach provides objective support for subsequent qualitative analysis, establishing an empirically grounded panorama of the state of the art at the intersection between analytical technologies and educational sustainability. The investigation reveals geographical and thematic gaps that can orient future investigations in the area.
Figure 3 presents the principal information extracted from the 36 articles by Bibliometrix software.
The primary bibliometric indicators reveal structural characteristics of the analyzed scientific production. The five-year period (2021–2025) presents an expressive annual growth rate of 60.69%, indicating rapid and recent emergence of publications in this specific area. This accelerated growth reflects both increasing interest in the application of Business Intelligence and data analysis in educational contexts and the gradual consolidation of this emerging research domain. The average document age of merely 0.972 years confirms the extreme contemporaneity of the corpus, positioning it at the frontier of current scientific knowledge. This indicator reinforces the emerging nature of the investigative field and justifies the necessity of studies that systematize knowledge produced thus far.
The average index of 4.917 citations per document represents promising initial academic impact, particularly significant when contextualized by the youth of the publications. For such recent works, this average suggests that research in BI and educational data analysis is rapidly gaining attention and recognition in respective academic communities. Although the full impact of these documents can only be comprehensively evaluated over several years, initial signals evidence relevance and growth potential of this knowledge area. The involvement of 163 unique authors in 36 documents demonstrates a high proportion of active researchers in the field, with an average of 4.61 coauthors per document. This configuration evidences the highly collaborative nature of investigations, frequently conducted by multidisciplinary teams or research groups. The presence of only 4 single-authored documents (11.1% of the corpus) reinforces that thematic complexity demands integrated approaches and diversified expertise. Additionally, the percentage of 25% international coauthorships demonstrates a significant level of global collaboration and knowledge exchange, an important indicator of the relevance and international reach of the area.
The distribution of 36 articles across 26 different sources suggests that the theme is being published in a diverse range of journals, indicating both an emerging interdisciplinary field and strategic publication across various vehicles relevant to specific subthemes. The presence of 191 author keywords reveals considerable thematic richness, suggesting diversity of topics, methodological approaches, and possible exploration of terminology still in consolidation, characteristics typical of scientific domains in development that have not yet established fully consensual vocabulary.
Figure 4 presents the graph of publications by year.
The temporal distribution of publications, illustrated in
Figure 4, reveals a pattern of progressive growth with expressive acceleration in the most recent period. In 2021, three publications are registered with focus on Big Data and sustainability in higher education, principally from Malaysian and Saudi Arabian authors. The year 2022 presents slight growth with four publications, while 2023 registers a temporary retraction to two publications, possibly reflecting methodological adjustments or conceptual consolidation of the field. Subsequent years evidence significant recovery and expansion, with seven publications in 2024 and an expressive leap to twenty documents in 2025, considering that this research was conducted in May 2025 and therefore represents only the first quadrimester of the year.
This pattern of accelerated growth corresponds to the typical curve of emerging themes in scientific literature that achieve maturity and recognition by the academic community, evidencing the consolidation of Business Intelligence relevance and data analysis in the context of educational sustainability. The analysis of predominant themes by year reveals significant epistemological evolution. Studies from 2021 privilege approaches centered on Big Data and institutional sustainability. In 2022, interest emerges in participatory governance and educational management in pandemic context. The year 2023, although with more restricted production, broadens scope to evaluation of social impacts and curricular alignment. In 2024, thematic diversification is observed including digital leadership, artificial intelligence, and Business Intelligence applied to educational management. Publications from 2025 consolidate the trend of integration between smart cities, sustainability, and educational data analysis, demonstrating conceptual maturation and expansion of practical applications.
Figure 5 presents scientific production by country.
The geographical distribution of scientific production, visualized in
Figure 5, reveals significant diversity and relevant thematic concentration patterns. China leads with twelve publications, followed by Saudi Arabia (ten), Slovenia (nine), Ireland and United Kingdom (eight each), Egypt (six), and Greece (five). With four publications each, Australia, Romania, and Spain stand out, evidencing geographical amplitude in scientific production on Business Intelligence and educational sustainability. This mapping demonstrates effective internationalization of the theme, with expressive participation of both developed and emerging nations.
The strong presence of China reflects strategic investments in educational technologies and public policies for digital transformation in higher education. European representativeness, with emphasis on Slovenia, Ireland, United Kingdom, Greece, Romania, and Spain, suggests alignment with community directives on sustainability and educational digitalization. Saudi Arabian participation evidences Gulf countries’ protagonism in university institutional modernization initiatives, while Egyptian presence broadens perspective to the North African context. Notably, sub-representation of Latin America, Asia-Pacific (except China and Australia), and Sub-Saharan Africa is observed, indicating opportunity for amplification of regional research on BI and sustainability in educational contexts with specific socioeconomic, cultural, and infrastructural characteristics. This gap suggests potential for future investigations that contemplate diverse institutional realities and contribute to democratization of knowledge in data-driven educational management.
Figure 6 highlights the most globally cited documents.
The scientific impact analysis, visualized in
Figure 6 highlights the most globally cited documents and reveals significant patterns of academic recognition. The study by [
25], published in the
Journal of Business Research, occupies a prominent position with 34 total citations and an average of 8.50 citations per year, establishing itself as a fundamental reference in the field. Complementing this high-impact nucleus are the works of [
18] with 18 citations, Aloshan (2024) with 17 citations and an annual rate of 8.50 [
20] with 14 citations, and [
30] with 10 citations but an impressive average of 10 citations per year, indicating rapid adoption by the scientific community.
Particularly relevant is the analysis of normalized citations, which adjust impact by publication time and area standards. In this metric, ref. [
30] leads with 4.76, followed by [
17] with 2.90, and a set of recent 2025 publications ([
14,
37,
44]) with 2.86 each, demonstrating that contemporary works are achieving accelerated academic recognition [
49] with 2.00 in normalized citations confirms sustained impact over time. This temporal distribution of citations indicates accelerated field maturation, where recent works accumulate significant impact in relatively short periods, a characteristic typical of emerging areas with high demand for applied knowledge.
The geographical diversity of most-cited authors, including Italy, Saudi Arabia, Iran, Romania, and South Africa, evidences effective internationalization of the thematic and global convergence of interests in Business Intelligence and educational sustainability. It is also noted that approximately one-third of documents have not yet accumulated citations, reflecting the corpus contemporaneity (average age of 0.972 years) and the natural latency period for scientific recognition.
Figure 7 presents a word cloud related to the studied articles.
The lexical analysis of the corpus, represented by the word cloud in
Figure 7, reveals a clear hierarchical structure of central concepts defining the investigative field. The term “education” emerges as the nuclear concept with the highest frequency (14 occurrences), establishing itself as the principal axis around which other theoretical and practical elements gravitate. This confirms that the educational context constitutes the primordial locus of research, serving as the privileged application domain for other identified conceptual dimensions.
The second hierarchical level comprises the term “sustainability” (9 occurrences), followed by “higher education” (7 occurrences), indicating convergence between the sustainable development paradigm and formal educational contexts. Although the term “higher education” presents expressive frequency in analyzed international literature, reflecting the predominant focus of global research on university institutions, the present study turns specifically to the context of technical schools, recognizing them as equally strategic spaces for implementing sustainable practices mediated by analytical technologies. The prominence of “artificial intelligence” (5 occurrences) in this conceptual stratum evidences growing recognition of AI capabilities as a transformative tool for sustainable educational management, applicable to different modalities and teaching levels, including professional and technological education.
A third conceptual stratum is observed formed by “performance” (4 occurrences), complemented by “student,” “sustainable development,” “sustainable development goals,” and “teaching” (3 occurrences each), configuring the triad characterizing practical applications of the area: evaluation of institutional performance, focus on the student as the center of the formative process, explicit alignment with Sustainable Development Goals, and the pedagogical dimension of teaching practice. The presence of “AI in education” (2 occurrences) reinforces the specificity of artificial intelligence application in educational context as an emerging area of academic interest.
This lexical distribution confirms that research in educational data analysis is not limited to purely technical or technological aspects but constitutes a field fundamentally oriented toward higher education transformations through integration between sustainability, artificial intelligence, and performance-based management. The conceptual architecture revealed by lexical analysis demonstrates a scientific domain that positions higher education as protagonist in achieving sustainable development objectives, mediated by advanced analytical technologies and oriented by institutional and student performance metrics. The consistent articulation between educational, technological, and sustainability terms characterizes a mature interdisciplinary field where theory and practice converge toward systemic transformation of educational institutions toward more intelligent, efficient, and socially responsible models.
Figure 8 presents the co-occurrence network of abstracts with one interaction.
Based on refined analysis of the co-occurrence network (
Figure 8), a multidimensional structure is observed composed of five thematic clusters that reveal the epistemological complexity of research in educational sustainability mediated by analytical technologies. The network architecture presents a moderately dense nucleus with peripheral groupings, where the terms “education,” “sustainability,” and “higher education” emerge as central nodes.
To ensure transparency and strengthen the connection between bibliometric visualization and the reviewed corpus,
Table 4 details the representative studies and keywords associated with each thematic cluster identified in
Figure 8.
The green cluster represents the most comprehensive conceptual nucleus, congregating terms such as “education” (central node), “sustainability,” “performance,” “data analytics,” “industry,” “development goal,” “quality,” and “artificial intelligence.” The strong articulation with sustainability and development objectives indicates systemic orientation toward effective educational models aligned with socioenvironmental responsibilities, potentially preparing students for action in sustainable industrial contexts. The red cluster focuses specifically on higher education as an institutional domain and public policy space, articulating “higher education” (central), “sustainability,” “saudi arabia,” “university sector,” “education policy,” and “sustainable education.” The prominence of “saudi arabia” suggests significant regional focus or concentration of studies originating from this region, while connections with educational policies and university sector indicate concern with governance, strategy, and structural aspects of higher education institutions, again with strong emphasis on sustainability in this specific context.
The blue cluster concentrates on practical pedagogical dimension, bringing together “teaching” and “educational development,” focusing on concrete pedagogical practices and broad efforts for improving learning processes and outcomes, reflecting the applied instructional side of educational research. The orange cluster, in turn, establishes direct linkage between academic institutions and global sustainability agenda through the terms “sustainable development goals” and “universities.” The purple cluster, relatively isolated, articulates “user acceptance” and “success,” pertinent to adoption and effectiveness of new technologies, systems, or educational practices.
The centrality of the term “sustainability” as a conceptual bridge between the green cluster (general education) and red (higher education) underscores its transversal relevance, evidencing that environmental, social, and economic sustainability considerations profoundly permeate discussions about educational practices, public policies, and institutions. The significant presence of “performance” and “data analytics” in the green cluster signals a contemporary data-driven approach for assessing and improving educational effectiveness, emphasizing measurable results and use of analytical tools. The inclusion of “artificial intelligence” evidences the prospective dimension of research, exploring integration of emerging technologies in education for personalization, automation, or efficiency gains. This multidimensional configuration demonstrates a highly interdisciplinary research landscape, articulating education, sustainability, technology, and public policies in integrated fashion.
Table 5 presents the principal themes published by countries and years.
Quantitative analysis of bibliometric data, synthesized in
Table 5, enables identification of three distinct evolutionary phases in scientific production on BI, sustainability, and education. The initial phase (2021–2022) is marked by exploratory studies concentrated in three European countries and one African nation (Hungary, Romania, South Africa), addressing fundamental competencies in data usage for sustainability and university social responsibility. This foundational period established preliminary conceptual frameworks and identified initial research opportunities at the intersection between analytical technologies and educational sustainability.
The intermediate phase (2022–2023) evidences significant thematic expansion with the incorporation of Middle Eastern and Asian contexts, particularly Saudi Arabia and Malaysia, introducing concerns with participatory governance, blockchain technologies for educational transparency, digital transformation processes, and educational management challenges during the COVID-19 pandemic. This period also witnessed the emergence of smart campus concepts and the evaluation of mobile learning quality in post-pandemic contexts, suggesting a progressive sophistication of investigative approaches and a recognition of infrastructure and technological platforms’ relevance for educational sustainability.
The maturation phase (2024–2025) demonstrates substantial thematic diversification and geographical expansion, with predominant Asian, Middle Eastern, and European participation. Research themes evolve toward advanced integration of artificial intelligence, machine learning, predictive analytics, and Internet of Things in educational management contexts. The emergence of concepts such as smart cities, green campuses, circular economy, and explicit focus on Sustainable Development Goals evidences consolidation of a comprehensive and systemically articulated research agenda. The significant concentration of publications in 2025 (20 documents in the first quadrimester) suggests accelerated field growth and increasing international recognition of Business Intelligence relevance for educational sustainability, while simultaneously revealing critical absence of Latin American contributions, particularly Brazilian contexts, characterizing a significant research gap that demands urgent attention from the regional scientific community.