Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (110)

Search Parameters:
Keywords = silhouette coefficient

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
14 pages, 2239 KiB  
Article
Automatic Delineation of Resistivity Contrasts in Magnetotelluric Models Using Machine Learning
by Ever Herrera Ríos, Mateo Marulanda, Hernán Arboleda, Greg Soule, Erika Lucuara, David Jaramillo, Agustín Cardona, Esteban A. Taborda, Farid B. Cortés and Camilo A. Franco
Processes 2025, 13(7), 2263; https://doi.org/10.3390/pr13072263 - 16 Jul 2025
Viewed by 300
Abstract
The precise identification of hydrocarbon-rich zones is crucial for optimizing exploration and production processes in the oil industry. Magnetotelluric (MT) surveys play a fundamental role in mapping subsurface geological structures. This study presents a novel methodology for automatically delineating resistivity contrasts in MT [...] Read more.
The precise identification of hydrocarbon-rich zones is crucial for optimizing exploration and production processes in the oil industry. Magnetotelluric (MT) surveys play a fundamental role in mapping subsurface geological structures. This study presents a novel methodology for automatically delineating resistivity contrasts in MT models by employing advanced machine learning and computer vision techniques. This approach commences with data augmentation to enhance the diversity and volume of resistivity data. Subsequently, a bilateral filter was applied to reduce noise while preserving edge details within the resistivity images. To further improve image contrast and highlight significant resistivity variations, contrast-limited adaptive histogram equalization (CLAHE) was employed. Finally, k-means clustering was utilized to segment the resistivity data into distinct groups based on resistivity values, enabling the identification of color features in different centroids. This facilitated the detection of regions with significant resistivity contrasts in the reservoir. From the clustered images, color masks were generated to visually differentiate the groups and calculate the area and proportion of each group within the pictures. Key features extracted from resistivity profiles were used to train unsupervised learning models capable of generalizing across different geological settings. The proposed methodology improves the accuracy of detecting zones with oil potential and offers scalable applicability to different datasets with minimal retraining, applicable to different subsurface environments. Ultimately, this study seeks to improve the efficiency of petroleum exploration by providing a high-precision automated framework with segmentation and contrast delineation for resistivity analysis, integrating advanced image processing and machine learning techniques. During initial analyses using only k-means, the resulting optimal value of the silhouette coefficient K was 2. After using bilateral filtering together with contrast-limited adaptive histogram equalization (CLAHE) and validation by an expert, the results were more representative, and six clusters were identified. Ultimately, this study seeks to improve the efficiency of petroleum exploration by providing a high-precision automated framework with segmentation and contrast delineation for resistivity analysis, integrating advanced image processing and machine learning techniques. Full article
(This article belongs to the Section Energy Systems)
Show Figures

Figure 1

11 pages, 404 KiB  
Proceeding Paper
Enhanced Supplier Clustering Using an Improved Arithmetic Optimizer Algorithm
by Asmaa Akiki, Kaoutar Douaioui, Achraf Touil, Mustapha Ahlaqqach and Mhammed El Bakkali
Eng. Proc. 2025, 97(1), 44; https://doi.org/10.3390/engproc2025097044 - 30 Jun 2025
Viewed by 244
Abstract
This paper presents a novel approach to supplier clustering by utilizing the Arithmetic Optimizer Algorithm (AOA), addressing the complex challenge of supplier segmentation in modern supply chain management. The AOA framework is applied to solve the multi-criteria clustering problem inherent to supplier classification. [...] Read more.
This paper presents a novel approach to supplier clustering by utilizing the Arithmetic Optimizer Algorithm (AOA), addressing the complex challenge of supplier segmentation in modern supply chain management. The AOA framework is applied to solve the multi-criteria clustering problem inherent to supplier classification. Using a real-world dataset of 500 suppliers with 12 performance criteria, including cost, quality, delivery reliability, and sustainability metrics, our method demonstrates effective clustering performance compared to conventional techniques. The AOA achieves a silhouette coefficient of 56.5% and a Davies–Bouldin index of 56.6%, outperforming several other state-of-the-art metaheuristic algorithms, including the Grey Wolf Optimizer (GWO), Whale Optimization Algorithm (WOA), Salp Swarm Algorithm (SSA), and Harris Hawks Optimization (HHO). The algorithm’s robustness is validated through extensive sensitivity analysis and statistical tests. The results indicate that the proposed approach successfully identifies distinct supplier segments with approximately 85% accuracy, enabling more effective supplier relationship management strategies. Full article
Show Figures

Figure 1

26 pages, 12167 KiB  
Article
Anomaly Detection Method for Hydropower Units Based on KSQDC-ADEAD Under Complex Operating Conditions
by Tongqiang Yi, Xiaowu Zhao, Yongjie Shi, Xiangnan Jing, Wenyang Lei, Jiang Guo, Yang Meng and Zhengyu Zhang
Sensors 2025, 25(13), 4093; https://doi.org/10.3390/s25134093 - 30 Jun 2025
Viewed by 288
Abstract
The safe and stable operation of hydropower units, as core equipment in clean energy systems, is crucial for power system security. However, anomaly detection under complex operating conditions remains a technical challenge in this field. This paper proposes a hydropower unit anomaly detection [...] Read more.
The safe and stable operation of hydropower units, as core equipment in clean energy systems, is crucial for power system security. However, anomaly detection under complex operating conditions remains a technical challenge in this field. This paper proposes a hydropower unit anomaly detection method based on K-means seeded quadratic discriminant clustering and an adaptive density-aware ensemble anomaly detection algorithm (KSQDC-ADEAD). The method first employs the KSQDC algorithm to identify different operating conditions of hydropower units. By combining K-means clustering’s initial partitioning capability with quadratic discriminant analysis’s nonlinear decision boundary construction ability, it achieves the high-precision identification of complex nonlinear condition boundaries. Then, an ADEAD algorithm is designed, which incorporates local density information and improves anomaly detection accuracy and stability through multi-model ensemble and density-adaptive strategies. Validation experiments using 14-month actual operational data from a 550 MW unit at a hydropower station in Southwest China show that the KSQDC algorithm achieves a silhouette coefficient of 0.64 in condition recognition, significantly outperforming traditional methods, and the KSQDC-ADEAD algorithm achieves comprehensive scores of 0.30, 0.34, and 0.23 for anomaly detection at three key monitoring points, effectively improving the accuracy and reliability of anomaly detection. This method provides a systematic technical solution for hydropower unit condition monitoring and predictive maintenance. Full article
Show Figures

Figure 1

20 pages, 1652 KiB  
Article
Analysis of Spatiotemporal Characteristics of Intercity Travelers Within Urban Agglomeration Based on Trip Chain and K-Prototypes Algorithm
by Shuai Yu, Yuqing Liu and Song Hu
Appl. Syst. Innov. 2025, 8(4), 88; https://doi.org/10.3390/asi8040088 - 26 Jun 2025
Viewed by 534
Abstract
In the rapid process of urbanization, urban agglomerations have become a key driving factor for regional development and spatial reorganization. The formation and development of urban agglomerations rely on communication between cities. However, the spatiotemporal characteristics of intercity travelers are not fully grasped [...] Read more.
In the rapid process of urbanization, urban agglomerations have become a key driving factor for regional development and spatial reorganization. The formation and development of urban agglomerations rely on communication between cities. However, the spatiotemporal characteristics of intercity travelers are not fully grasped throughout the entire trip chain. This study proposes a spatiotemporal analysis method for intercity travel in urban agglomerations by constructing origin-to-destination (OD) trip chains using smartphone data, with the Beijing–Tianjin–Hebei urban agglomeration as a case study. The study employed Cramer’s V and Spearman correlation coefficients for multivariate feature selection, identifying 12 key variables from an initial set of 20. Then, optimal cluster configuration was determined via silhouette analysis. Finally, the K-prototypes algorithm was applied to cluster 161,797 intercity trip chains across six transportation corridors in 2019 and 2021, facilitating a comparative spatiotemporal analysis of travel patterns. Results show the following: (1) Intercity travelers are predominantly males aged 19–35, with significantly higher weekday volumes; (2) Modal split exhibits significant spatial heterogeneity—the metro predominates in Beijing while road transport prevails elsewhere; (3) Departure hubs’ waiting times increased significantly in 2021 relative to 2019 baselines; (4) Increased metro mileage correlates positively with extended intra-city travel distances. The results substantially contribute to transportation planning, particularly in optimizing multimodal hub operations and infrastructure investment allocation. Full article
Show Figures

Figure 1

17 pages, 901 KiB  
Proceeding Paper
Enhanced Water Access Segmentation Using an Improved Salp Swarm Algorithm for Regional Development Planning
by Youness Boudrik, Achraf Touil, Rachid Hasnaoui, Mustapha Ahlaqqach and Mhammed El Bakkali
Eng. Proc. 2025, 97(1), 38; https://doi.org/10.3390/engproc2025097038 - 20 Jun 2025
Viewed by 158
Abstract
This paper presents a novel approach to water access segmentation by introducing an improved version of the Salp Swarm Algorithm (ISSA), addressing the complex challenge of household water access classification in developing regions. The proposed enhancement incorporates dynamic exploration–exploitation balancing and feature-aware mechanisms [...] Read more.
This paper presents a novel approach to water access segmentation by introducing an improved version of the Salp Swarm Algorithm (ISSA), addressing the complex challenge of household water access classification in developing regions. The proposed enhancement incorporates dynamic exploration–exploitation balancing and feature-aware mechanisms into the original SSA framework, significantly improving cluster quality and interpretability. Using a real-world dataset of 500 households from the El Hajeb region in Morocco and 12 socio-economic criteria, our method demonstrates superior clustering performance compared to conventional techniques. The ISSA achieves a 25% improvement in the silhouette coefficient (0.732 vs. 0.480) and a 22% reduction in the Davies–Bouldin index (0.421 vs. 0.645) compared to the standard SSA and other state-of-the-art metaheuristic algorithms. Five distinct water access segments are identified, enabling targeted infrastructure development strategies across different community types. The approach provides regional planners with essential insights into the spatial distribution of water access patterns and their relationship with socio-economic factors. Full article
Show Figures

Figure 1

24 pages, 6698 KiB  
Article
From Spectrum to Image: A Novel Deep Clustering Network for Lactose-Free Milk Adulteration Detection
by Chong Zhang, Shankui Ding and Ying He
Information 2025, 16(6), 498; https://doi.org/10.3390/info16060498 - 16 Jun 2025
Viewed by 439
Abstract
Traditional clustering methods are often ineffective in extracting relevant features from high-dimensional, nonlinear near-infrared (NIR) spectra, resulting in poor accuracy of detecting lactose-free milk adulteration. In this paper, we introduce a clustering model based on Gram angular field and convolutional depth manifold (GAF-ConvDuc). [...] Read more.
Traditional clustering methods are often ineffective in extracting relevant features from high-dimensional, nonlinear near-infrared (NIR) spectra, resulting in poor accuracy of detecting lactose-free milk adulteration. In this paper, we introduce a clustering model based on Gram angular field and convolutional depth manifold (GAF-ConvDuc). The Gram angular field accentuates variations in spectral absorption peaks, while convolution depth manifold clustering captures local features between adjacent wavelengths, reducing the influence of noise and enhancing clustering accuracy. Experiments were performed on samples from 2250 milk spectra using the GAF-ConvDuc model. Compared to K-means, the silhouette coefficient (SC) increased from 0.109 to 0.571, standardized mutual information index (NMI) increased from 0.696 to 0.921, the Adjusted Randindex (ARI) increased from 0.543 to 0.836, and accuracy (ACC) increased from 67.2% to 88.9%. Experimental results indicate that our method is superior to K-means, Variational Autoencoder (VAE) clustering, and other approaches. Without requiring pre-labeled data, the model achieves higher inter-cluster separation and more distinct clustering boundaries. These findings offer a robust solution for detecting lactose-free milk adulteration, crucial for food safety oversight. Full article
Show Figures

Graphical abstract

20 pages, 2848 KiB  
Article
A Dual-Branch Network for Intra-Class Diversity Extraction in Panchromatic and Multispectral Classification
by Zihan Huang, Pengyu Tian, Hao Zhu, Pute Guo and Xiaotong Li
Remote Sens. 2025, 17(12), 1998; https://doi.org/10.3390/rs17121998 - 10 Jun 2025
Viewed by 358
Abstract
With the rapid development of remote sensing technology, satellites can now capture multispectral (MS) and panchromatic (PAN) images simultaneously. MS images offer rich spectral details, while PAN images provide high spatial resolutions. Effectively leveraging their complementary strengths and addressing modality gaps are key [...] Read more.
With the rapid development of remote sensing technology, satellites can now capture multispectral (MS) and panchromatic (PAN) images simultaneously. MS images offer rich spectral details, while PAN images provide high spatial resolutions. Effectively leveraging their complementary strengths and addressing modality gaps are key challenges in improving the classification performance. From the perspective of deep learning, this paper proposes a novel dual-source remote sensing classification framework named the Diversity Extraction and Fusion Classifier (DEFC-Net). A central innovation of our method lies in introducing a modality-specific intra-class diversity modeling mechanism for the first time in dual-source classification. Specifically, the intra-class diversity identification and splitting (IDIS) module independently analyzes the intra-class variance within each modality to identify semantically broad classes, and it applies an optimized K-means method to split such classes into fine-grained sub-classes. In particular, due to the inherent representation differences between the MS and PAN modalities, the same class may be split differently in each modality, allowing modality-aware class refinement that better captures fine-grained discriminative features in dual perspectives. To handle the class imbalance introduced by both natural long-tailed distributions and class splitting, we design a long-tailed ensemble learning module (LELM) based on a multi-expert structure to reduce bias toward head classes. Furthermore, a dual-modal knowledge distillation (DKD) module is developed to align cross-modal feature spaces and reconcile the label inconsistency arising from modality-specific class splitting, thereby facilitating effective information fusion across modalities. Extensive experiments on datasets show that our method significantly improves the classification performance. The code was accessed on 11 April 2025. Full article
Show Figures

Figure 1

25 pages, 3666 KiB  
Article
Validation of Core and Whole-Genome Multi-Locus Sequence Typing Schemes for Shiga-Toxin-Producing E. coli (STEC) Outbreak Detection in a National Surveillance Network, PulseNet 2.0, USA
by Molly M. Leeper, Morgan N. Schroeder, Taylor Griswold, Mohit Thakur, Krittika Krishnan, Lee S. Katz, Kelley B. Hise, Grant M. Williams, Steven G. Stroika, Sung B. Im, Rebecca L. Lindsey, Peyton A. Smith, Jasmine Huffman, Alyssa Kelley, Sara Cleland, Alan J. Collins, Shruti Gautam, Eishita Tyagi, Subin Park, João A. Carriço, Miguel P. Machado, Hannes Pouseele, Dolf Michielsen and Heather A. Carletonadd Show full author list remove Hide full author list
Microorganisms 2025, 13(6), 1310; https://doi.org/10.3390/microorganisms13061310 - 4 Jun 2025
Viewed by 1009
Abstract
Shiga-toxin-producing E. coli (STEC) is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Whole-genome sequencing (WGS) is a powerful tool used in public health and microbiology for the detection, surveillance, and outbreak investigation of STEC. In this study, we [...] Read more.
Shiga-toxin-producing E. coli (STEC) is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Whole-genome sequencing (WGS) is a powerful tool used in public health and microbiology for the detection, surveillance, and outbreak investigation of STEC. In this study, we applied three WGS-based subtyping methods, high quality single-nucleotide polymorphism (hqSNP) analysis, whole genome multi-locus sequence typing using chromosome-associated loci [wgMLST (chrom)], and core genome multi-locus sequence typing (cgMLST), to isolate sequences from 11 STEC outbreaks. For each outbreak, we evaluated the concordance between subtyping methods using pairwise genomic differences (number of SNPs or alleles), linear regression models, and tanglegrams. Pairwise genomic differences were highly concordant between methods for all but one outbreak, which was associated with international travel. The slopes of the regressions for hqSNP vs. allele differences were 0.432 (cgMLST) and 0.966 wgMLST (chrom); the slope was 1.914 for cgMLST vs. wgMLST (chrom) differences. Tanglegrams comprised of outbreak and sporadic sequences showed moderate clustering concordance between methods, where Baker’s Gamma Indices (BGIs) ranged between 0.35 and 0.99 and Cophenetic Correlation Coefficients (CCCs) were ≥0.88 across all outbreaks. The K-means analysis using the Silhouette method showed the clear separation of outbreak groups with average silhouette widths ≥0.87 across all methods. This study validates the use of cgMLST for the national surveillance of STEC illness clusters using the PulseNet 2.0 system and demonstrates that hqSNP or wgMLST can be used for further resolution. Full article
(This article belongs to the Special Issue The Molecular Epidemiology of Infectious Diseases)
Show Figures

Figure 1

25 pages, 2934 KiB  
Article
Appraisal of Industrial Pollutants in Sewage and Biogas Production Using Multivariate Analysis and Unsupervised Machine Learning Clustering
by Wiktor Halecki, Anna Młyńska and Krzysztof Chmielowski
Appl. Sci. 2025, 15(11), 6222; https://doi.org/10.3390/app15116222 - 31 May 2025
Viewed by 469
Abstract
Sewage composition analysis is important for understanding environmental impact and ensuring effective treatment processes. In this study, we employed multivariate analysis techniques to delve into the intricate composition of sewage. Specifically, we utilized Principal Component Analysis (PCA) and Detrended Correspondence Analysis (DCA) to [...] Read more.
Sewage composition analysis is important for understanding environmental impact and ensuring effective treatment processes. In this study, we employed multivariate analysis techniques to delve into the intricate composition of sewage. Specifically, we utilized Principal Component Analysis (PCA) and Detrended Correspondence Analysis (DCA) to uncover patterns and relationships among different types of sewage pollutants. Statistical analysis revealed that treatment stages did not consistently reduce all pollutant concentrations. Mechanical treatment failed to lower chlorides and sulfates, but was effective for ether extract and phenols. Moreover, total mechanical–biological treatment provided a significant, 91% reduction of the ether extract and phenols, while only reducing chlorides by 13% and sulfates by 22%. The multivariate analysis revealed significant differences between raw sewage and mechanically treated sewage. Totally treated sewage stood out as the key factor influencing the pollutants studied, particularly chlorides and sulfates. This finding emphasizes the critical role of comprehensive treatment processes in effective sewage management. Among the analysed substances, chlorides showed the strongest clustering potential, with an average Silhouette coefficient of 0.738, the highest observed. Phenols, on the other hand, exhibited lower Within-Cluster Sum of Squares (WCSS), suggesting their potential as an alternative parameter for evaluation. Full article
(This article belongs to the Special Issue AI in Wastewater Treatment)
Show Figures

Figure 1

25 pages, 11111 KiB  
Article
Integrating Backscattered Electron Imaging and Multi-Feature-Weighted Clustering for Quantification of Hydrated C3S Microstructure
by Xin Wang and Yongjun Luo
Buildings 2025, 15(10), 1699; https://doi.org/10.3390/buildings15101699 - 17 May 2025
Viewed by 393
Abstract
The microstructure of cement paste is governed by the hydration of its major component, tricalcium silicate (C3S). Quantitative analysis of C3S microstructural images is critical for elucidating the microstructure-property correlation in cementitious systems. Existing image segmentation methods rely on [...] Read more.
The microstructure of cement paste is governed by the hydration of its major component, tricalcium silicate (C3S). Quantitative analysis of C3S microstructural images is critical for elucidating the microstructure-property correlation in cementitious systems. Existing image segmentation methods rely on image contrast, leading to a struggle with multi-phase segmentation in regions with close grayscale intensities. Therefore, this study proposes a weighted K-means clustering method that integrates intensity gradients, texture variations, and spatial coordinates for the quantitative analysis of hydrated C3S microstructure. The results indicate the following: (1) The deep convolutional neural network with guided filtering demonstrates superior performance (mean squared error: 53.52; peak signal-to-noise ratio: 26.35 dB; structural similarity index: 0.8187), enabling high-fidelity preservation of cementitious phases. In contrast, wavelet denoising is effective for pore network analysis but results in partial loss of solid phase information. (2) Unhydrated C3S reflects optimal boundary clarity at intermediate image relative resolutions (0.25–0.56), while calcium hydroxide peaks at 0.19. (3) Silhouette coefficients (0.70–0.84) validate the robustness of weighted K-means clustering, and the Clark–Evans index (0.426) indicates CH aggregation around hydration centers, contrasting with the random CH distribution observed in Portland cement systems. Full article
(This article belongs to the Section Building Materials, and Repair & Renovation)
Show Figures

Figure 1

21 pages, 3370 KiB  
Article
An Improved Density-Based Spatial Clustering of Applications with Noise Algorithm with an Adaptive Parameter Based on the Sparrow Search Algorithm
by Zicheng Huang, Zuopeng Liang, Shibo Zhou and Shuntao Zhang
Algorithms 2025, 18(5), 273; https://doi.org/10.3390/a18050273 - 6 May 2025
Viewed by 764
Abstract
The density-based spatial clustering of applications with noise (DBSCAN) is able to cluster arbitrarily structured datasets. However, the clustering result of this algorithm is exceptionally sensitive to the neighborhood radius (Eps) and noise points, and it is hard to obtain the best result [...] Read more.
The density-based spatial clustering of applications with noise (DBSCAN) is able to cluster arbitrarily structured datasets. However, the clustering result of this algorithm is exceptionally sensitive to the neighborhood radius (Eps) and noise points, and it is hard to obtain the best result quickly and accurately with it. To address this issue, a parameter-adaptive DBSCAN clustering algorithm based on the Sparrow Search Algorithm (SSA), referred to as SSA-DBSCAN, is proposed. This method leverages the local fast search ability of SSA, using the optimal number of clusters and the silhouette coefficient of the dataset as the objective functions to iteratively optimize and select the two input parameters of DBSCAN. This avoids the adverse impact of manually inputting parameters, enabling adaptive clustering with DBSCAN. Experiments on typical synthetic datasets, UCI (University of California, Irvine) real-world datasets, and image segmentation tasks have validated the effectiveness of the SSA-DBSCAN algorithm. Comparative analysis with DBSCAN and other related optimization algorithms demonstrates the clustering performance of SSA-DBSCAN. Full article
(This article belongs to the Section Algorithms for Multidisciplinary Applications)
Show Figures

Figure 1

23 pages, 8244 KiB  
Article
Analysis of Spatial Aggregation and Activity of the Urban Population of Almaty Based on Cluster Analysis
by Gulnara Bektemyssova, Artem Bykov, Aiman Moldagulova, Sayan Omarov, Galymzhan Shaikemelev, Saltanat Nuralykyzy and Dauren Umutkulov
Sustainability 2025, 17(7), 3243; https://doi.org/10.3390/su17073243 - 5 Apr 2025
Cited by 1 | Viewed by 812
Abstract
This study analyzes the spatial aggregation and activity of the urban population in Almaty using anonymized population density data provided by a telecommunications operator and geographic data from OpenStreetMap. The study focuses on identifying stable zones of high population activity, which facilitates the [...] Read more.
This study analyzes the spatial aggregation and activity of the urban population in Almaty using anonymized population density data provided by a telecommunications operator and geographic data from OpenStreetMap. The study focuses on identifying stable zones of high population activity, which facilitates the optimization of transport routes, urban infrastructure planning, and the efficient allocation of city resources. The novelty of this work lies in the integration of aggregated spatiotemporal data with advanced clustering methods, including DBSCAN, KMeans++, and agglomerative clustering. The research methodology involves dividing the city into 500 × 500 m quadrants, calculating normalized population density metrics, and identifying high-activity clusters. Based on a comparative analysis of clustering algorithms, DBSCAN exhibited the highest clustering quality according to the silhouette coefficient and the Davies–Bouldin index, allowing for the identification of key zones of urban activity. The identified clusters were utilized to assess transport load, analyze disparities in the distribution of public transport stops, and develop recommendations to improve public transport accessibility in the most congested areas. The study’s findings are applicable not only to optimizing the transport network but also to addressing a broader range of urban planning challenges, including the strategic placement of infrastructure facilities and the management of population flows. The proposed methodology is scalable and can be adapted to other cities requiring effective tools for analyzing the spatiotemporal activity of urban populations. Full article
Show Figures

Figure 1

15 pages, 4361 KiB  
Article
From 2D to 3D Urban Analysis: An Adaptive Urban Zoning Framework That Takes Building Height into Account
by Tao Shen, Fulu Kong, Shuai Yuan, Xueying Wang, Di Sun and Zongshuo Ren
Buildings 2025, 15(7), 1182; https://doi.org/10.3390/buildings15071182 - 3 Apr 2025
Viewed by 588
Abstract
The vertical heterogeneous structures formed during the evolution of urban agglomerations, driven by globalization, pose challenges to traditional two-dimensional spatial analysis methods. This study addresses the vertical heterogeneity and spatial multiscale problem in three-dimensional urban space and proposes an adaptive framework that takes [...] Read more.
The vertical heterogeneous structures formed during the evolution of urban agglomerations, driven by globalization, pose challenges to traditional two-dimensional spatial analysis methods. This study addresses the vertical heterogeneity and spatial multiscale problem in three-dimensional urban space and proposes an adaptive framework that takes into account building height for multiscale clustering in urban areas. Firstly, we established a macro-, meso- and micro-level analysis system for the characteristics of urban spatial structures. Subsequently, we developed a parameter-adaptive model through a dynamic coupling mechanism of height thresholds and average elevations. Finally, we proposed a density-based clustering method that integrates the multiscale urban analysis with parameter adaptation to distinguish urban spatial features at different scales, thereby achieving multiscale urban regional delineation. The experimental results demonstrate that the proposed clustering framework outperforms traditional density-based and hierarchical clustering algorithms in terms of both the Silhouette Coefficient and the Davies–Bouldin Index, effectively resolving the problem of vertical density variation in urban clustering. Full article
(This article belongs to the Special Issue New Challenges in Digital City Planning)
Show Figures

Figure 1

16 pages, 1263 KiB  
Article
Identifying Heart Attack Risk in Vulnerable Population: A Machine Learning Approach
by Subhagata Chattopadhyay and Amit K Chattopadhyay
Information 2025, 16(4), 265; https://doi.org/10.3390/info16040265 - 26 Mar 2025
Viewed by 771
Abstract
The COVID-19 pandemic has significantly increased the incidence of post-infection cardiovascular events, particularly myocardial infarction, in individuals over 40. While the underlying mechanisms remain elusive, this study employs a hybrid machine learning approach to analyze epidemiological data in assessing 13 key heart attack [...] Read more.
The COVID-19 pandemic has significantly increased the incidence of post-infection cardiovascular events, particularly myocardial infarction, in individuals over 40. While the underlying mechanisms remain elusive, this study employs a hybrid machine learning approach to analyze epidemiological data in assessing 13 key heart attack risk factors and their susceptibility. Based on a unique dataset that combines demographic, biochemical, ECG, and thallium stress tests, this study aims to design, develop, and deploy a clinical decision support system. Assimilating outcomes from five clustering techniques applied to the ‘Kaggle heart attack risk’ dataset, the study categorizes distinct subpopulations against varying risk profiles and then divides the population into ‘at-risk’ (AR) and ‘not-at-risk’ (NAR) groups using clustering algorithms. The GMM algorithm outperforms its competitors (with clustering accuracy and Silhouette coefficient scores of 84.24% and 0.2623, respectively). Subsequent analyses, employing Pearson correlation and linear regression as descriptors, reveal a strong association between the likelihood of experiencing a heart attack and the 13 risk factors studied, and these are statistically significant (p < 0.05). Our findings provide valuable insights into the development of targeted risk stratification and preventive strategies for high-risk individuals based on heart attack risk scores. The aggravated risk for postmenopausal patients indicates compromised individual risk factors due to estrogen depletion that may be further compromised by extraneous stress impacts, like anxiety and fear, aspects that have traditionally eluded data modeling predictions. The model can be repurposed to analyze the impact of COVID-19 on vulnerable populations. Full article
Show Figures

Graphical abstract

26 pages, 4679 KiB  
Article
Importance Classification Method for Signalized Intersections Based on the SOM-K-GMM Clustering Algorithm
by Ziyi Yang, Yang Chen, Dong Guo, Fangtong Jiao, Bin Zhou and Feng Sun
Sustainability 2025, 17(7), 2827; https://doi.org/10.3390/su17072827 - 22 Mar 2025
Viewed by 397
Abstract
Urbanization has intensified traffic loads, posing significant challenges to the efficiency and stability of urban road networks. Overloaded nodes risk congestion, thus making accurate intersection importance classification essential for resource optimization. This study proposes a hybrid clustering method that combines Self-Organizing Maps (SOMs), [...] Read more.
Urbanization has intensified traffic loads, posing significant challenges to the efficiency and stability of urban road networks. Overloaded nodes risk congestion, thus making accurate intersection importance classification essential for resource optimization. This study proposes a hybrid clustering method that combines Self-Organizing Maps (SOMs), K-Means, and the Gaussian Mixture Model (GMM), which is supported by the Traffic Flow–Network Topology–Social Economy (TNS) evaluation framework. This framework integrates three dimensions—traffic flow, road network topology, and socio-economic features—capturing six key indicators: intersection saturation, traffic flow balance, mileage coverage, capacity, betweenness efficiency, and node activity. The SOMs method determines the optimal k value and centroids for K-Means, while GMM validates the cluster membership probabilities. The proposed model achieved a silhouette coefficient of 0.737, a Davies–Bouldin index of 1.003, and a Calinski–Harabasz index of 57.688, with the silhouette coefficient improving by 78.1% over SOMs alone, 65.2% over K-Means, and 11.5% over SOM-K-Means, thus demonstrating high robustness. The intersection importance ranking was conducted using the Mahalanobis distance method, and it was validated on 40 intersections within the road network of Zibo City. By comparing the importance rankings across static, off-peak, morning peak, and evening peak periods, a dynamic ranking approach is proposed. This method provides a robust basis for optimizing resource allocation and traffic management at urban intersections. Full article
(This article belongs to the Section Sustainable Transportation)
Show Figures

Figure 1

Back to TopTop