Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (79)

Search Parameters:
Keywords = bertopic

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
40 pages, 4303 KB  
Systematic Review
The Road to Autonomy: A Systematic Review Through AI in Autonomous Vehicles
by Adrian Domenteanu, Paul Diaconu, Margareta-Stela Florescu and Camelia Delcea
Electronics 2025, 14(21), 4174; https://doi.org/10.3390/electronics14214174 - 25 Oct 2025
Viewed by 470
Abstract
In the last decade, the incorporation of Artificial Intelligence (AI) with autonomous vehicles (AVs) has transformed transportation, mobility, and smart mobility systems. The present study provides a systematic review of global trends, applications, and challenges at the intersection of AI, including Machine Learning [...] Read more.
In the last decade, the incorporation of Artificial Intelligence (AI) with autonomous vehicles (AVs) has transformed transportation, mobility, and smart mobility systems. The present study provides a systematic review of global trends, applications, and challenges at the intersection of AI, including Machine Learning (ML), Deep Learning (DL), and autonomous vehicle technologies. Using data extracted from Clarivate Analytics’ Web of Science Core Collection and a set of specific keywords related to both AI and autonomous (electric) vehicles, this paper identifies the themes presented in the scientific literature using thematic maps and thematic map evolution analysis. Furthermore, the research topics are identified using both thematic maps, as well as Latent Dirichlet Allocation (LDA) and BERTopic, offering a more faceted insight into the research field as LDA enables the probabilistic discovery of high-level research themes, while BERTopic, based on transformer-based language models, captures deeper semantic patterns and emerging topics over time. This approach offers richer insights into the systematic review analysis, while comparison in the results obtained through the various methods considered leads to a better overview of the themes associated with the field of AI in autonomous vehicles. As a result, a strong correspondence can be observed between core topics, such as object detection, driving models, control, safety, cybersecurity and system vulnerabilities. The findings offer a roadmap for researchers and industry practitioners, by outlining critical gaps and discussing the opportunities for future exploration. Full article
Show Figures

Figure 1

27 pages, 5184 KB  
Article
Making Smart Cities Human-Centric: A Framework for Dynamic Resident Demand Identification and Forecasting
by Wen Zhang, Bin Guo, Wei Zhao, Yutong He and Xinyu Wang
Sustainability 2025, 17(21), 9423; https://doi.org/10.3390/su17219423 - 23 Oct 2025
Viewed by 328
Abstract
Smart cities offer new opportunities for urban governance and sustainable development. However, at the current stage, the construction and development of smart cities generally exhibit a technology-driven tendency, neglecting real resident demand, which contradicts the “human-centric” principle. Traditional top-down methods of demand collection [...] Read more.
Smart cities offer new opportunities for urban governance and sustainable development. However, at the current stage, the construction and development of smart cities generally exhibit a technology-driven tendency, neglecting real resident demand, which contradicts the “human-centric” principle. Traditional top-down methods of demand collection struggle to capture the dynamics and heterogeneity of public demand. At the same time, government service platforms, as one dimension of smart city construction, have accumulated massive amounts of user-generated data, providing new solutions for this challenge. This paper aims to construct a big data-driven analytical framework for dynamically identifying and accurately forecasting core resident demand. The study uses Xi’an City, Shaanxi Province, China, as a case study, utilising user messages from People.cn spanning 2011 to 2023. These messages cover various domains, including urban construction, healthcare, education, and transportation, as the data source. The People.cn message board is China’s most significant nationwide online political platform. Its institutionalised feedback mechanism ensures data content focuses on highly representative specific grievances, rather than the broad emotional expressions on social media. The study employs user messages from People.cn from 2011 to 2023 as its data source, encompassing urban construction, healthcare, education, and transportation. First, a large language model (LLM) was used to preprocess and clean the raw data. Subsequently, the BERTopic model was applied to identify ten core demand themes and construct their monthly time series, thereby overcoming the limitations of traditional methods in short-text semantic recognition. Finally, by integrating variational mode decomposition (VMD) with support vector machines (SVMs), a hybrid demand forecasting model was established to mitigate the risk of overfitting in deep learning when forecasting small-sample time series. The empirical results show that the proposed LLM-BERTopic-VMD-SVM framework exhibits excellent performance, with the goodness-of-fit (R2) on various demand themes ranging from 0.93 to 0.96. This study proposes an effective analytical framework for identifying and forecasting resident demand. It provides a decision-support tool for city managers to achieve proactive and fine-grained governance, thereby offering a viable empirical pathway to promote the transformation of smart cities from technology-centric to human-centric. Full article
Show Figures

Figure 1

24 pages, 797 KB  
Article
Towards a Sustainable Workforce in Big Data Analytics: Skill Requirements Analysis from Online Job Postings Using Neural Topic Modeling
by Fatih Gurcan, Ahmet Soylu and Akif Quddus Khan
Sustainability 2025, 17(20), 9293; https://doi.org/10.3390/su17209293 - 20 Oct 2025
Viewed by 422
Abstract
Big data analytics has become a cornerstone of modern industries, driving advancements in business intelligence, competitive intelligence, and data-driven decision-making. This study applies Neural Topic Modeling (NTM) using the BERTopic framework and N-gram-based textual content analysis to examine job postings related to big [...] Read more.
Big data analytics has become a cornerstone of modern industries, driving advancements in business intelligence, competitive intelligence, and data-driven decision-making. This study applies Neural Topic Modeling (NTM) using the BERTopic framework and N-gram-based textual content analysis to examine job postings related to big data analytics in real-world contexts. A structured analytical process was conducted to derive meaningful insights into workforce trends and skill demands in the big data analytics domain. First, expertise roles and tasks were identified by analyzing job titles and responsibilities. Next, key competencies were categorized into analytical, technical, developer, and soft skills and mapped to corresponding roles. Workforce characteristics such as job types, education levels, and experience requirements were examined to understand hiring patterns. In addition, essential tasks, tools, and frameworks in big data analytics were identified, providing insights into critical technical proficiencies. The findings show that big data analytics requires expertise in data engineering, machine learning, cloud computing, and AI-driven automation. They also emphasize the importance of continuous learning and skill development to sustain a future-ready workforce. By connecting academia and industry, this study provides valuable implications for educators, policymakers, and corporate leaders seeking to strengthen workforce sustainability in the era of big data analytics. Full article
Show Figures

Figure 1

23 pages, 1798 KB  
Article
Thematic Evolution and Transmission Mechanisms of China’s Rural Tourism Policy: A Multi-Level Governance Framework for Sustainable Development
by Haoqian Hu, Yifen Yin, Yingchong Xie, Jingwen Cai, Chunning Wang and Wenshuo Zhang
Sustainability 2025, 17(20), 9187; https://doi.org/10.3390/su17209187 - 16 Oct 2025
Viewed by 423
Abstract
Rural tourism is a key engine for sustainable development, elevated to a strategic level under China’s “Rural Revitalization Strategy”, creating a complex multi-level governance (MLG) policy system whose internal mechanisms are not fully understood. This study aims to analyze the thematic structure, spatio-temporal [...] Read more.
Rural tourism is a key engine for sustainable development, elevated to a strategic level under China’s “Rural Revitalization Strategy”, creating a complex multi-level governance (MLG) policy system whose internal mechanisms are not fully understood. This study aims to analyze the thematic structure, spatio-temporal evolution, and transmission mechanisms of China’s rural tourism policy across central, provincial, and city/county levels. We applied BERTopic topic modeling and spatio-temporal analysis to a corpus of 1174 policy documents from 2005 to 2024. The results reveal a “centrally guided Type I governance” model with a clear functional division: the central level acts as a “top-level strategic designer”, the provincial level as a “key regional hub” for adaptation, and the city/county level as the “frontline of policy implementation”. We identified a vertical transmission chain characterized by a 1–2-year lag, alongside spatial differentiation driven by regional resource endowments at the provincial level and functional needs at the city/county level. This study concludes that China’s rural tourism governance framework is an efficient synergistic system that combines strong central guidance with dynamic local adaptation, providing empirical support for MLG theory in a unitary state and offering insights for optimizing policy coordination. Full article
Show Figures

Figure 1

37 pages, 5285 KB  
Article
Assessing Student Engagement: A Machine Learning Approach to Qualitative Analysis of Institutional Effectiveness
by Abbirah Ahmed, Martin J. Hayes and Arash Joorabchi
Future Internet 2025, 17(10), 453; https://doi.org/10.3390/fi17100453 - 1 Oct 2025
Viewed by 413
Abstract
In higher education, institutional quality is traditionally assessed through metrics such as academic programs, research output, educational resources, and community services. However, it is important that their activities align with student expectations, particularly in relation to interactive learning environments, learning management system interaction, [...] Read more.
In higher education, institutional quality is traditionally assessed through metrics such as academic programs, research output, educational resources, and community services. However, it is important that their activities align with student expectations, particularly in relation to interactive learning environments, learning management system interaction, curricular and co-curricular activities, accessibility, support services and other learning resources that ensure academic success and, jointly, career readiness. The growing popularity of student engagement metrics as one of the key measures to evaluate institutional efficacy is now a feature across higher education. By monitoring student engagement, institutions assess the impact of existing resources and make necessary improvements or interventions to ensure student success. This study presents a comprehensive analysis of student feedback from the StudentSurvey.ie dataset (2016–2022), which consists of approximately 275,000 student responses, focusing on student self-perception of engagement in the learning process. By using classical topic modelling techniques such as Latent Dirichlet Allocation (LDA) and Bi-term Topic Modelling (BTM), along with the advanced transformer-based BERTopic model, we identify key themes in student responses that can impact institutional strength performance metrics. BTM proved more effective than LDA for short text analysis, whereas BERTopic offered greater semantic coherence and uncovered hidden themes using deep learning embeddings. Moreover, a custom Named Entity Recognition (NER) model successfully extracted entities such as university personnel, digital tools, and educational resources, with improved performance as the training data size increased. To enable students to offer actionable feedback, suggesting areas of improvement, an n-gram and bigram network analysis was used to focus on common modifiers such as “more” and “better” and trends across student groups. This study introduces a fully automated, scalable pipeline that integrates topic modelling, NER, and n-gram analysis to interpret student feedback, offering reportable insights and supporting structured enhancements to the student learning experience. Full article
(This article belongs to the Special Issue Machine Learning and Natural Language Processing)
Show Figures

Figure 1

28 pages, 1463 KB  
Article
Strategic Management Knowledge Map via BERTopic (1980–2025): Evolution, Integration, and Application
by Kuei-Kuei Lai, Chih-Wen Hsiao and Yu-Jin Hsu
Appl. Syst. Innov. 2025, 8(5), 142; https://doi.org/10.3390/asi8050142 - 29 Sep 2025
Cited by 1 | Viewed by 775
Abstract
Problem: Amid digital disruption and the cross-fertilization of RBV, DCV, and KBV, strategic management knowledge has grown fragmented with blurred boundaries. Conventional mapping (citation/co-word, LDA) lacks semantic and temporal resolution, obscuring overlaps, divergences, and turning points and hindering links to actionable indicators (e.g., [...] Read more.
Problem: Amid digital disruption and the cross-fertilization of RBV, DCV, and KBV, strategic management knowledge has grown fragmented with blurred boundaries. Conventional mapping (citation/co-word, LDA) lacks semantic and temporal resolution, obscuring overlaps, divergences, and turning points and hindering links to actionable indicators (e.g., the Balanced Scorecard). Hence, an integrated, semantically faithful, time-stamped map is needed to bridge research and operational metrics. Gap: Prior maps rely on citation/co-word signals, miss textual meaning, and treat RBV/DCV/KBV in isolation—lacking a theory-aligned, time-stamped, manager-oriented synthesis. Objectives: This study aims to (1) reveal how RBV, DCV, and KBV evolve and interrelate over time; (2) produce an integrated, semantically grounded map; and (3) translate selected themes into actionable managerial indicators. Method: We analyzed 25,907 WoS articles (1980–2025) with BERTopic (Sentence-BERT + UMAP + HDBSCAN + c-TF-IDF). We used an RBV/DCV/KBV lexicon to guide retrieval/interpretation (not to constrain modeling). We discovered 230 topics, retained 33 via coherence (C_V), and benchmarked them against LDA. Key findings: A concise set of 33 high-quality themes with a higher C_V than LDA on this corpus was established. A Fish-Scale view (overlapping subfields across economics, management, sociology) that clarifies RBV–DCV–KBV intersections was achieved. Era-sliced prevalence shows how themes emerge and recombine over 1980–2025. Selected themes mapped to Balanced Scorecard (BSC) indicators linking capabilities → processes → customer outcomes → financial results. Contribution: A clear, time-aware synthesis of RBV–DCV–KBV and a scalable, reproducible pipeline for structuring fragmented theory landscapes are presented in this study—bridging scholarly integration with managerial application via BSC mapping. Full article
(This article belongs to the Topic Social Sciences and Intelligence Management, 2nd Volume)
Show Figures

Figure 1

14 pages, 910 KB  
Article
Collective Dynamics in the Awakening of Sleeping Beauty Patents: A BERTopic Approach
by Hee Jin Mun and Sanghoon Lee
Appl. Sci. 2025, 15(19), 10395; https://doi.org/10.3390/app151910395 - 25 Sep 2025
Viewed by 369
Abstract
Prior research has emphasized individual patent characteristics in identifying the awakening of sleeping beauty patents (SBPs) that remain unnoticed for long periods before suddenly attracting substantial attention. However, less attention has been paid to how collective dynamics shape these awakenings. This study examines [...] Read more.
Prior research has emphasized individual patent characteristics in identifying the awakening of sleeping beauty patents (SBPs) that remain unnoticed for long periods before suddenly attracting substantial attention. However, less attention has been paid to how collective dynamics shape these awakenings. This study examines whether field-level topic patterns—observable manifestations of collective perceptions and choices—are associated with SBP awakenings. We derived two indicators from U.S. patent abstracts by using BERTopic: Jensen–Shannon Divergence (JSD), which reflects temporal shifts in topic distributions, and topic entropy, which captures the breadth of technological exploration across topics. The logistic regression results showed that JSD is negatively associated with SBP awakenings, whereas entropy is positively associated with them. These findings suggest that SBPs are more likely to reemerge when technological exploration spans a broader range of topics while the topic structure remains relatively stable. In this way, the study contributes by demonstrating how outputs of collective dynamics are linked to the delayed recognition of SBPs. Full article
Show Figures

Figure 1

46 pages, 10328 KB  
Article
European Fund Absorption and Contribution to Business Environment Development: Research Output Analysis Through Bibliometric and Topic Modeling Analysis
by Mihnea Panait, Bianca Raluca Cibu, Dana Maria Teodorescu and Camelia Delcea
Businesses 2025, 5(4), 45; https://doi.org/10.3390/businesses5040045 - 24 Sep 2025
Cited by 1 | Viewed by 473
Abstract
In recent years, the field of European funds for business development has generated significant interest in the academic literature, stimulated by European Union (EU) regulations and the implementation of business financing programs. This context has led to an increase in research on the [...] Read more.
In recent years, the field of European funds for business development has generated significant interest in the academic literature, stimulated by European Union (EU) regulations and the implementation of business financing programs. This context has led to an increase in research on the impact and use of European funds, particularly in terms of support for economic development and infrastructure. This paper presents a bibliometric analysis, using topic modeling, to examine academic publications on the use and absorption of European funds and how they influence the business environment. Using a dataset of 74 publications indexed in the Clarivate Analytics Web of Science Core Collection, covering the period 2005–2024, the present study aims to identify the main authors, institutions, journals, and collaboration networks involved. It also analyzes research trends, dominant themes, and the countries with the largest contributions in this field, using Latent Dirichlet Allocation (LDA) and BERTopic analysis as a complement to the classical bibliometric approach. The thematic analysis reveals a thematic cohesion around entrepreneurship, EU structural funds, regional development, and innovation. In addition, there has been a significant annual increase in publications in this field, and through the use of thematic maps, word clouds, and collaboration networks, this study provides an overview of the evolution of research on the absorption of European funds and its impact on the business environment. These findings contribute both to deepening academic knowledge and to formulating more effective European policies for optimizing fund absorption and supporting the sustainable development of the business environment. Full article
Show Figures

Figure 1

30 pages, 3234 KB  
Article
Analyzing the Asymmetric Effects of COVID-19 on Hotel Selection Attributes and Customer Satisfaction Through AIPA
by Jun Li, Byunghyun Lee and Jaekyeong Kim
Sustainability 2025, 17(19), 8546; https://doi.org/10.3390/su17198546 - 23 Sep 2025
Viewed by 515
Abstract
The COVID-19 pandemic reshaped travel patterns and customer expectations, generating profound challenges for the hotel industry. This study analyzes 50,000 TripAdvisor reviews of New York hotels to examine how customer satisfaction with hotel selection attributes shifted before and during the pandemic. BERTopic was [...] Read more.
The COVID-19 pandemic reshaped travel patterns and customer expectations, generating profound challenges for the hotel industry. This study analyzes 50,000 TripAdvisor reviews of New York hotels to examine how customer satisfaction with hotel selection attributes shifted before and during the pandemic. BERTopic was applied to extract eight key attributes, while VADER, PRCA, and Asymmetric Impact–Performance Analysis (AIPA) were used to capture asymmetric effects and prioritize improvements. Comparative analyses by hotel classification, travel type, and customer residence reveal significant shifts in food and beverage, location, and staff, particularly among lower-tier hotels, business travelers, and international guests. The novelty of this study lies in integrating BERTopic and AIPA to overcome survey-based limitations and provide a robust, data-driven view of COVID-19’s impact on hotel satisfaction. Theoretically, it advances asymmetric satisfaction research by linking text-derived attributes with AIPA. Practically, it offers actionable guidance for hotel managers to strengthen hygiene, expand contactless services, and reallocate resources effectively in preparation for future crises. In addition, this study contributes to sustainability by showing how data-driven analysis can enhance service resilience and support the long-term socio-economic viability of the hotel industry under global crises. Full article
(This article belongs to the Special Issue Digital Transformation for Resilient and Sustainable Businesses)
Show Figures

Figure 1

40 pages, 3284 KB  
Article
SemaTopic: A Framework for Semantic-Adaptive Probabilistic Topic Modeling
by Amani Drissi, Salma Sassi, Richard Chbeir, Anis Tissaoui and Abderrazek Jemai
Computers 2025, 14(9), 400; https://doi.org/10.3390/computers14090400 - 19 Sep 2025
Viewed by 478
Abstract
Topic modeling is a crucial technique for Natural Language Processing (NLP) which helps to automatically uncover coherent topics from large-scale text corpora. Yet, classic methods tend to suffer from poor semantic depth and topic coherence. In this regard, we present here a new [...] Read more.
Topic modeling is a crucial technique for Natural Language Processing (NLP) which helps to automatically uncover coherent topics from large-scale text corpora. Yet, classic methods tend to suffer from poor semantic depth and topic coherence. In this regard, we present here a new approach “SemaTopic” to improve the quality and interpretability of discovered topics. By exploiting semantic understanding and stronger clustering dynamics, our approach results in a more continuous, finer and more stable representation of the topics. Experimental results demonstrate that SemaTopic achieves a relative gain of +6.2% in semantic coherence compared to BERTopic on the 20 Newsgroups dataset (Cv=0.5315 vs. 0.5004), while maintaining stable performance across heterogeneous and multilingual corpora. These findings highlight “SemaTopic” as a scalable and reliable solution for practical text mining and knowledge discovery. Full article
(This article belongs to the Special Issue Advances in Semantic Multimedia and Personalized Digital Content)
Show Figures

Figure 1

31 pages, 3969 KB  
Article
From Headlines to Forecasts: Narrative Econometrics in Equity Markets
by Davit Hayrapetyan and Ruben Gevorgyan
J. Risk Financial Manag. 2025, 18(9), 524; https://doi.org/10.3390/jrfm18090524 - 18 Sep 2025
Viewed by 1618
Abstract
This study investigates whether firm-specific narratives extracted from the news add predictive content to monthly stock return models. Using bidirectional encoder representations from transformer-based topic modeling (BERTopic), we processed Microsoft (MSFT) news and constructed monthly narrative activations (binary presence and decay weighting). These [...] Read more.
This study investigates whether firm-specific narratives extracted from the news add predictive content to monthly stock return models. Using bidirectional encoder representations from transformer-based topic modeling (BERTopic), we processed Microsoft (MSFT) news and constructed monthly narrative activations (binary presence and decay weighting). These narrative activations are used in autoregressive moving-average models with exogenous regressors (ARIMA-X) to analyze MSFT monthly log returns alongside the U.S. Economic Policy Uncertainty (EPU) index from February 2021 to March 2025. Decay models using a similarity-distilled BERT embedding yielded three significant narratives: Media and Public Perception (MPP) (β = 0.0128, p = 0.002), Currency and Macro Environment (CME) (β = −0.0143, p < 0.001), and Tech and Semiconductor Ecosystem (TSE) (β = −0.0606, p = 0.014). Binary activation identifies reputational shocks: the Media and Public Perception (MPP) indicator predicts lower returns at one- and two-month lags (β = −0.0758, p = 0.043; β = −0.1048, p = 0.007). A likelihood-ratio test comparing ARIMA-X models with narrative regressors to a baseline ARIMA (no narratives) rejects the null hypothesis that narratives add no improvement in fit (p < 0.01). Firm-level narratives enhance monthly forecasts beyond conventional predictors; decay activation and similarity-distilled embeddings perform best. Demonstrated on Microsoft as a proof of concept, the ticker-agnostic design scales to multiple firms and sectors, contingent on sufficient firm-tagged news coverage for external validity. Full article
(This article belongs to the Section Financial Markets)
Show Figures

Graphical abstract

19 pages, 3302 KB  
Article
Topic Mining and Evolutionary Analysis of Urban Renewal Policy Texts in China
by Guozong Zhang, Xijing Liu and Qianmai Luo
Buildings 2025, 15(18), 3324; https://doi.org/10.3390/buildings15183324 - 14 Sep 2025
Cited by 1 | Viewed by 920
Abstract
In the context of China’s rapid urbanization and the era of stock planning, urban renewal policies play a significant role in enhancing urban quality and promoting sustainable development. To reveal the thematic structure and evolution of China’s urban renewal policy system, this study [...] Read more.
In the context of China’s rapid urbanization and the era of stock planning, urban renewal policies play a significant role in enhancing urban quality and promoting sustainable development. To reveal the thematic structure and evolution of China’s urban renewal policy system, this study applies the BERTopic model to conduct semantic mining and evolutionary analysis on 1144 policy documents issued by central and local governments. Research findings: The study identifies 34 distinct themes in urban renewal policies, grouped into five main directions: Spatial Improvement and Facility Upgrades, Project Collaboration and Approval, Land Acquisition and Compensation, Fiscal Incentives and Funding Support, and Institutional Guarantees and Governance. Each of these directions exhibits distinct evolutionary trends over time. While urban renewal policies in the Central, Western, Eastern, and Northeastern regions share common characteristics in key aspects such as land acquisition and compensation, funding assurance, and residential quality enhancement, they also reflect regional differences due to varying stages of development, economic conditions, and geographic factors. This demonstrates both the shared and distinct policy focus areas across different regions of China. By identifying underlying themes and their trajectories, this study provides critical insights into the structural characteristics of urban renewal policies and offers valuable references for government authorities to improve and optimize policy systems. At the same time, it provides the Chinese experience for urban renewal in other countries. Full article
(This article belongs to the Special Issue Large-Scale AI Models Across the Construction Lifecycle)
Show Figures

Figure 1

26 pages, 3297 KB  
Article
Exploring the Urban Heat Island Effect: A Bibliometric and Topic Modeling Analysis
by Murat Kilinc, Can Aydin, Gizem Erdogan Aydin and Damla Balci
Sustainability 2025, 17(17), 8072; https://doi.org/10.3390/su17178072 - 8 Sep 2025
Viewed by 1431
Abstract
The urban heat island (UHI) effect, intensified by urbanisation and climate change, leads to increased urban temperatures and poses a serious environmental challenge. Understanding its causes, impacts, and mitigation strategies is essential for sustainable urban planning. The aim of this study is to [...] Read more.
The urban heat island (UHI) effect, intensified by urbanisation and climate change, leads to increased urban temperatures and poses a serious environmental challenge. Understanding its causes, impacts, and mitigation strategies is essential for sustainable urban planning. The aim of this study is to systematically analyse how the Urban Heat Island (UHI) effect has been addressed in the scientific literature, to identify key research themes and their temporal evolution, and to critically highlight knowledge gaps in order to provide guidance for future research and urban planning policies. Using BERTopic, an advanced natural language processing (NLP) tool, the study extracts dominant themes from a large corpus of academic literature and tracks their evolution over time. A total of 9061 research articles from the Web of Science database were collected, pre-processed, and analysed. BERTopic clustered semantically related topics and revealed their temporal dynamics, offering insights into emerging and declining research areas. The results show that pavement materials and urban vegetation are among the most studied themes, highlighting the importance of surface materials and green infrastructure in mitigating UHI. In line with this aim, the study identifies a rising interest in urban cooling strategies, particularly reflective surfaces and ventilation corridors. Consistent with its aim, the study provides a comprehensive overview of UHI literature, critically identifies existing gaps, and proposes clear directions for future research. It provides supports for urban planners, policymakers, and researchers in developing data-driven strategies to mitigate UHI impacts and strengthen enhance urban climate resilience. Full article
(This article belongs to the Section Air, Climate Change and Sustainability)
Show Figures

Figure 1

60 pages, 12559 KB  
Article
A Decade of Studies in Smart Cities and Urban Planning Through Big Data Analytics
by Florin Dobre, Andra Sandu, George-Cristian Tătaru and Liviu-Adrian Cotfas
Systems 2025, 13(9), 780; https://doi.org/10.3390/systems13090780 - 5 Sep 2025
Cited by 1 | Viewed by 1341
Abstract
Smart cities and urban planning have succeeded in gathering the attention of researchers worldwide, especially in the last decade, as a result of a series of technological, social and economic developments that have shaped the need for evolution from the traditional way in [...] Read more.
Smart cities and urban planning have succeeded in gathering the attention of researchers worldwide, especially in the last decade, as a result of a series of technological, social and economic developments that have shaped the need for evolution from the traditional way in which the cities were viewed. Technology has been incorporated in many sectors associated with smart cities, such as communications, transportation, energy, and water, resulting in increasing people’s quality of life and satisfying the needs of a society in continuous change. Furthermore, with the rise in machine learning (ML) and artificial intelligence (AI), as well as Geographic Information Systems (GIS), the applications of big data analytics in the context of smart cities and urban planning have diversified, covering a wide range of applications starting with traffic management, environmental monitoring, public safety, and adjusting power distribution based on consumption patterns. In this context, the present paper brings to the fore the papers written in the 2015–2024 period and indexed in Clarivate Analytics’ Web of Science Core Collection and analyzes them from a bibliometric point of view. As a result, an annual growth rate of 10.72% has been observed, showing an increased interest from the scientific community in this area. Through the use of specific bibliometric analyses, key themes, trends, prominent authors and institutions, preferred journals, and collaboration networks among authors, data are extracted and discussed in depth. Thematic maps and topic discovery through Latent Dirichlet Allocation (LDA) and doubled by a BERTopic analysis, n-gram analysis, factorial analysis, and a review of the most cited papers complete the picture on the research carried on in the last decade in this area. The importance of big data analytics in the area of urban planning and smart cities is underlined, resulting in an increase in their ability to enhance urban living by providing personalized and efficient solutions to everyday life situations. Full article
Show Figures

Figure 1

25 pages, 3134 KB  
Article
Threat Intelligence Extraction Framework (TIEF) for TTP Extraction
by Anooja Joy, Madhav Chandane, Yash Nagare and Faruk Kazi
J. Cybersecur. Priv. 2025, 5(3), 63; https://doi.org/10.3390/jcp5030063 - 3 Sep 2025
Viewed by 1482
Abstract
The increasing complexity and scale of cyber threats demand advanced, automated methodologies for extracting actionable cyber threat intelligence (CTI). The automated extraction of Tactics, Techniques, and Procedures (TTPs) from unstructured threat reports remains a challenging task, constrained by the scarcity of labeled data, [...] Read more.
The increasing complexity and scale of cyber threats demand advanced, automated methodologies for extracting actionable cyber threat intelligence (CTI). The automated extraction of Tactics, Techniques, and Procedures (TTPs) from unstructured threat reports remains a challenging task, constrained by the scarcity of labeled data, severe class imbalance, semantic variability, and the complexity of multi-class, multi-label learning for fine-grained classification. To address these challenges, this work proposes the Threat Intelligence Extraction Framework (TIEF) designed to autonomously extract Indicators of Compromise (IOCs) from heterogeneous textual threat reports and represent them by the STIX 2.1 standard for standardized sharing. TIEF employs the DistilBERT Base-Uncased model as its backbone, achieving an F1 score of 0.933 for multi-label TTP classification, while operating with 40% fewer parameters than traditional BERT-base models and preserving 97% of their predictive performance. Distinguishing itself from existing methodologies such as TTPDrill, TTPHunter, and TCENet, TIEF incorporates a multi-label classification scheme capable of covering 560 MITRE ATT&CK classes comprising techniques and sub-techniques, thus facilitating a more granular and semantically precise characterization of adversarial behaviors. BERTopic modeling integration enabled the clustering of semantically similar textual segments and captured the variations in threat report narratives. By operationalizing sub-technique-level discrimination, TIEF contributes to context-aware automated threat detection. Full article
(This article belongs to the Collection Machine Learning and Data Analytics for Cyber Security)
Show Figures

Figure 1

Back to TopTop