Adaptive Multi-Scale Feature Fusion for Spectral Peak Extraction with Morphological Segmentation and Optimized Clustering

Liu, Ting; Liang, Li-Zhen; Cao, Zheng-Kun; Xu, Xing-Qin; Zou, Shang-Xuan; Hu, Guang-Nian

doi:10.3390/app16031239


by Ting Liu ^1,2, Li-Zhen Liang ^2,3,*, Zheng-Kun Cao ^3, Xing-Qin Xu ^1,2, Shang-Xuan Zou ^1,2 and Guang-Nian Hu ^1,3

^1 School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan 232001, China

^2 Institute of Energy, Hefei Comprehensive National Science Center (Anhui Energy Laboratory), Hefei 230031, China

^3 Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China

^* Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(3), 1239; https://doi.org/10.3390/app16031239

Submission received: 26 November 2025 / Revised: 24 December 2025 / Accepted: 16 January 2026 / Published: 26 January 2026


Abstract

In the diagnostics of plasmas heated by neutral beam injection (NBI), which serves as a fundamental heating technique, critical core parameters such as ion temperatures and rotational velocities can be measured through NBI’s associated diagnostic methods. However, conventional spectral analysis methods applied in NBI-based Beam Emission Spectroscopy diagnostics face a significant limitation: a relatively high false detection rate during characteristic peak detection and boundary determination. This issue stems from three primary factors: persistent noise interference, overlapping spectral peaks, and dynamic broadening effects. To address this critical issue, we propose a spectral feature extraction method based on morphological segmentation and optimized clustering, with three key innovations that work synergistically: (1) an adaptive chunking algorithm driven by gradient, Laplacian, and curvature features to dynamically partition spectral regions, laying a foundation for localized analysis; (2) a hierarchical residual iteration mechanism combining dynamic thresholding and Gaussian template subtraction to enhance weak peak signals; (3) optimized DBSCAN clustering integrated with morphological closure to refine peak boundaries accurately. Among them, the adaptive chunking technique is distinct from general adaptive methods: its chunking granularity can be dynamically adjusted according to peak structures and can accurately adapt to low signal-to-noise ratio (SNR) scenarios. Experimental results based on measured data from the EAST device demonstrate that the adaptive chunking strategy maintains a missed detection rate of 0–20% across the full signal-to-noise ratio (SNR) range, with false positive rates limited to 16.67–50.00%. Notably, it achieves effective peak detection even under extremely low SNR conditions.

Keywords: neutral beam injection; morphological segmentation; optimized clustering; adaptive chunking

1. Introduction

Neutral beam injection (NBI) is a core heating and current drive method for enhancing plasma performance. Precise interpretation of the characteristic emission spectra induced by NBI is a direct means of obtaining key plasma parameters such as ion temperatures and rotational velocities. Among these techniques, Beam Emission Spectroscopy (BES), based on the Doppler effect of ion emission spectral lines, has emerged as a key tool for obtaining plasma microdynamics information due to its high temporal and spatial resolution [1,2,3,4].

However, limited by neutral beam attenuation, background radiation noise, and complex electromagnetic interference, NBI spectral signals often have a low signal-to-noise ratio. Traditional peak detection methods, such as the sliding window threshold method and the first-order derivative method, have a false detection rate of more than 30% at signal-to-noise ratios below 3 dB, which significantly reduces diagnostic reliability [5,6]. In addition, the spectral data are high-dimensional and non-uniformly distributed, with a shape of (1, 2, 1024). It is important to note that “multi-dimensional spectral data” here is a composite data system integrating “physical dimensions + feature dimensions”, not merely the structural dimensions of the data. Specifically, in the data shape (1, 2, 1024), the first dimension corresponds to the single acquisition sample, the second dimension is the core physical dimension consisting of wavelength and intensity, and the third dimension is the sequence dimension composed of 1024 discrete sampling points. The traditional fixed chunking strategy has a single chunking granularity, that is, a uniform and preset partition size that remains unchanged throughout the entire spectral analysis process, with no adjustment based on local signal characteristics such as peak density, width, or noise distribution, which makes it difficult to adapt to localized features [7]. 
For instance, if a fixed chunk size is set to match the average width of spectral peaks, it may either over-segment narrow peaks or merge adjacent overlapping peaks in dense peak regions, while in sparse noise regions, the overly large chunk size tends to include irrelevant background signals, leading to increased false detections or missed detections of weak peaks. Recent advances in machine learning-assisted diagnostics, such as convolutional neural networks (CNNs) for automated peak identification and recurrent neural networks (RNNs) for temporal spectral analysis [8,9], have demonstrated improved robustness against noise. However, these data-driven approaches often require large labeled datasets and lack interpretability in physical constraints, limiting their applicability in low-data plasma scenarios. Meanwhile, alternative adaptive segmentation techniques like wavelet-based multi-scale decomposition and dynamic time warping (DTW) address non-uniform peak distributions but struggle with real-time processing due to computational complexity.

These limitations motivate the development of our hybrid method, which integrates physics-informed feature engineering with adaptive clustering to balance accuracy and efficiency. Therefore, the development of feature extraction methods adapted to low signal-to-noise ratio (SNR) and complex peak shape distribution is of great significance for optimizing the operation of plasma devices and studying physical mechanisms. Specifically, the target features to be extracted include two core categories: first, the intrinsic physical features of NBI-induced spectral characteristic peaks, such as peak wavelength, intensity, full width at half maximum (FWHM), and skewness; second, the multi-scale auxiliary features constructed for enhancing detection robustness, including normalized wavelength/intensity, gradient, curvature, morphological gradient, frequency-domain harmonics, and local SNR. These features are extracted targeting the characteristic emission spectra generated by neutral beam injection (NBI) in magnetic confinement plasmas, aiming to accurately identify valid spectral peaks from interference such as noise, peak overlapping, and dynamic broadening, thereby providing reliable input for deriving key plasma parameters.

To address the aforementioned challenges, we propose a spectral feature extraction method based on morphological segmentation and optimized clustering. The method innovatively combines an adaptive chunking strategy, multi-scale morphological feature extraction, and the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm [10,11], aiming to enhance the accuracy and robustness of spectral feature peak extraction. Specifically, the spectral data are first preprocessed through wavelet denoising [12] and Savitzky–Golay smoothing [13,14] to suppress noise interference while preserving spectral features. Adaptive chunking is then applied to divide the spectral curve into multiple simplified subregions based on the local gradient, Laplacian, and curvature characteristics of the data, thereby reducing complexity and facilitating subsequent analyses. During the feature extraction stage, multi-scale morphological analysis is employed to capture local structural features of the spectral data across varying scales, which strengthens the recognition capability for feature peaks with diverse shapes and widths. Subsequently, the DBSCAN clustering algorithm is utilized to analyze the extracted features: by defining appropriate density thresholds and distance parameters, points with similar characteristics are clustered to precisely identify feature peak locations and other key parameters. Furthermore, postprocessing techniques such as morphological closure operations [15] are implemented to refine clustering results and improve detection reliability.

2. Problem Formulation

Let the spectral signal be a discrete set of points $S = \{(X_i, Y_i)\}_{i=1}^{N}$, where $X_i \in \mathbb{R}^{+}$ denotes the wavelength and $Y_i \in \mathbb{R}^{+}$ denotes the intensity, whose physical meaning is the distribution of spectral energy in the wavelength dimension. The goal of spectral detection is to identify a subset $P \subseteq S$ that satisfies the following conditions.

2.1. Density Properties

The point density within the peak region $P_k$ is significantly higher than the background noise, that is:

$$\rho(P_k) \geq \rho_{th} = \frac{1}{|S|} \sum_{i=1}^{N} \rho(S_i) + 3\sigma_p \tag{1}$$

where $\rho(S_i)$ is the local density of point $S_i$ and $\sigma_p$ is the global density standard deviation.
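As a rough illustration of Equation (1), the following Python sketch computes a density threshold as the mean local density plus three standard deviations. The text does not specify how the local density is estimated; counting neighbours within a fixed radius is an assumption made here, as are the function names `local_density` and `density_threshold` and the expectation that wavelength and intensity have been scaled to comparable ranges beforehand.

```python
import numpy as np

def local_density(x, y, radius):
    """Number of other points within `radius` of each point.

    Assumption: a fixed-radius neighbour count stands in for the local
    density; the article does not state which estimator it uses.
    """
    pts = np.column_stack([x, y])
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    return (dist <= radius).sum(axis=1) - 1  # subtract 1 to exclude the point itself

def density_threshold(rho):
    """Mean local density plus three global standard deviations."""
    rho = np.asarray(rho, dtype=float)
    return rho.mean() + 3.0 * rho.std()
```

Points whose density meets or exceeds `density_threshold(rho)` would then be candidates for a peak region; the pairwise distance matrix used here is only practical for short spectra such as the 1024-point signals described earlier.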

2.2. Morphological Characteristics

The width $W_k$ of each peak $P_k$ needs to satisfy the physical constraints:

$$W_{min} \leq W_k \leq W_{max} \tag{2}$$

where $W_{min}$ and $W_{max}$ are determined by the instrument resolution and the target analytical absorption linewidth, respectively.

2.3. Physical Characteristics

The peak position $C_k$ is required to satisfy the local extreme value condition:

$$y(C_k) = \max_{x \in [C_k - \delta,\, C_k + \delta]} y(x) \tag{3}$$

where $\delta$ is the merging threshold, bounded by the instrumental resolution, and the peak shape symmetry is bounded by the skewness coefficient $S_k$: $|S_k| \leq 0.5$, $S_k = \frac{E[(y - \mu)^{3}]}{\sigma^{3}}$, where $\mu$ and $\sigma$ are the mean and standard deviation of the peak region intensities.

The merging threshold $\delta$ is derived from the instrumental resolution (0.08 nm) scaled by a safety factor of 3, ensuring robust separation of adjacent peaks while accommodating Doppler broadening and Stark broadening in EAST plasmas. The skewness bound $|\gamma| < 1.5$ is empirically set based on the observed asymmetry of $D_\alpha$ lines under NBI conditions, where beam–plasma interactions typically induce skewed line profiles.
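The width, local-maximum, and skewness conditions can be checked with a few small helpers. The sketch below is illustrative only: the helper names are hypothetical, the width bounds are left as arguments because the text does not give numerical values for them, and the skewness bound is not hard-coded because two different values (0.5 and 1.5) appear above. The merging threshold follows the stated rule of 3 × 0.08 nm = 0.24 nm.

```python
import numpy as np

RESOLUTION_NM = 0.08          # instrumental resolution quoted in the text
DELTA_NM = 3 * RESOLUTION_NM  # merging threshold: resolution times safety factor 3

def width_ok(width, w_min, w_max):
    """Width constraint: w_min <= width <= w_max (bounds supplied by the caller)."""
    return w_min <= width <= w_max

def is_local_max(x, y, idx, delta=DELTA_NM):
    """True if y[idx] is the maximum of y over [x[idx] - delta, x[idx] + delta]."""
    window = np.abs(x - x[idx]) <= delta
    return bool(y[idx] >= y[window].max())

def skewness(y):
    """Skewness E[(y - mu)^3] / sigma^3 of the intensities in a peak region."""
    y = np.asarray(y, dtype=float)
    mu, sigma = y.mean(), y.std()
    return float(((y - mu) ** 3).mean() / sigma ** 3)
```

A candidate peak would be kept only if all three checks pass, with `abs(skewness(...))` compared against whichever bound is adopted.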

3. Morphological Segmentation and Optimized Clustering Detection Method

In this study, a morphological segmentation and optimized clustering detection method is proposed. This method achieves robust spectral feature extraction via a hierarchical iterative detection mechanism by integrating adaptive chunking-based clustering with multi-scale morphological features. The core workflow is illustrated in Figure 1.

3.1. Data Preprocessing

Data preprocessing uses a four-stage linked pipeline of wavelet denoising, cubic spline interpolation, Savitzky–Golay smoothing, and dynamic baseline correction to address three core problems in the raw spectral data: high-frequency random noise, insufficient sampling rate, and baseline drift.
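To make one of these stages concrete, the sketch below implements only the Savitzky–Golay smoothing step with NumPy; wavelet denoising, spline interpolation, and the baseline stage are not shown. The window length, polynomial order, and edge padding are illustrative assumptions, not the settings used in the study, and the function names are hypothetical.

```python
import numpy as np

def savgol_coeffs(window, order):
    """Least-squares smoothing weights for an odd window length."""
    half = window // 2
    offsets = np.arange(-half, half + 1, dtype=float)
    design = np.vander(offsets, order + 1, increasing=True)  # columns: 1, t, t^2, ...
    # Row 0 of the pseudo-inverse maps the window samples to the fitted
    # polynomial's value at the window centre (t = 0).
    return np.linalg.pinv(design)[0]

def savgol_smooth(y, window=11, order=3):
    """Smooth y by fitting a local polynomial in each sliding window."""
    y = np.asarray(y, dtype=float)
    half = window // 2
    padded = np.pad(y, half, mode="edge")  # assumption: repeat edge values
    # The smoothing weights are symmetric, so convolution equals correlation.
    return np.convolve(padded, savgol_coeffs(window, order), mode="valid")
```

Because the filter fits a polynomial locally, a signal that is itself a polynomial of degree at most `order` passes through unchanged away from the padded edges, which is why narrow peaks are distorted less than with a plain moving average.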

3.2. Adaptive Chunking via Gradient–Laplacian–Curvature Fusion

The adaptive chunking process dynamically partitions spectral regions by integrating gradient, Laplacian, and curvature features through a weighted fusion strategy; the flowchart of adaptive chunking is illustrated in Figure 2.

3.2.1. Three-Feature Fusion

(1): Gradient strength

The first-order derivative approximately characterizes the rate of change in the signal:

$$G(x_i) = \left| \frac{y_{i+1} - y_{i-1}}{2\Delta x} \right| \tag{4}$$

where $\Delta x$ is the wavelength step and $y_i$ is the spectral intensity. It serves to detect rapidly changing regions of the spectral signal for initial screening of candidate chunking boundaries. This is combined with dynamic thresholding to filter the slowly changing background noise.

(2): Laplacian

The Laplacian achieves edge detection by calculating the second-order derivatives of the signal, mathematically modeled as

$$L(x_i) = \frac{y_{i+1} - 2y_i + y_{i-1}}{(\Delta x)^{2}} \tag{5}$$

Pseudo edges generated by baseline fluctuations are excluded.

(3): Curvature change

The normalized second-order derivative describes the peak-concavity property:

$$C(x_i) = \frac{|L(x_i)|}{\left(1 + G(x_i)^{2}\right)^{3/2}} \tag{6}$$

It is used to distinguish the local morphology of overlapping peaks and avoid single feature misclassification. The curvature feature has stronger sensitivity to non-featured and weak peaks.
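The three features in Equations (4)–(6) translate directly into central finite differences. The following is a minimal sketch assuming a uniformly sampled spectrum; setting the two endpoint values to zero is a choice made here for simplicity, and the function name `three_features` is hypothetical.

```python
import numpy as np

def three_features(y, dx):
    """Gradient magnitude, Laplacian, and curvature of a uniformly sampled signal.

    Endpoints are left at zero because the central differences need both
    neighbours (an assumption; the article does not discuss edge handling).
    """
    y = np.asarray(y, dtype=float)
    grad = np.zeros_like(y)
    lap = np.zeros_like(y)
    grad[1:-1] = np.abs(y[2:] - y[:-2]) / (2.0 * dx)       # Equation (4)
    lap[1:-1] = (y[2:] - 2.0 * y[1:-1] + y[:-2]) / dx**2   # Equation (5)
    curv = np.abs(lap) / (1.0 + grad**2) ** 1.5            # Equation (6)
    return grad, lap, curv
```

For a parabola y = x², the interior gradient equals |2x| and the Laplacian equals 2 exactly, which gives a quick sanity check on the indexing.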

For the discrete set $\{(x_i, y_i)\}_{i=1}^{N}$ of the spectral signal, first, the gradient $G(x_i)$, Laplacian $L(x_i)$, and curvature $C(x_i)$ are normalized to the interval [0, 1] through Equations (4)–(6), and then a weighted fusion decision index is constructed:

$$F_{fusion}(x_i) = w_G \cdot G_{norm}(x_i) + w_L \cdot |L_{norm}(x_i)| + w_C \cdot C_{norm}(x_i)$$

$w_G$, $w_L$, $w_C$ are fusion weights, satisfying $w_G + w_L + w_C = 1$, and are determined through cross-validation optimization.
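A minimal sketch of the weighted fusion is given below. It assumes min-max scaling for the normalization to [0, 1] and takes the absolute value of the Laplacian before scaling, which is one reading of the fusion formula. The equal default weights are placeholders only, since the article determines the weights by cross-validation and does not report them here; the function names are hypothetical.

```python
import numpy as np

def min_max(v):
    """Scale v to [0, 1]; a constant vector maps to all zeros."""
    v = np.asarray(v, dtype=float)
    span = v.max() - v.min()
    return np.zeros_like(v) if span == 0 else (v - v.min()) / span

def fusion_index(grad, lap, curv, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Weighted sum of the normalized gradient, |Laplacian|, and curvature.

    `weights` = (w_G, w_L, w_C) must sum to 1. The equal default is a
    placeholder, not the cross-validated values from the article.
    """
    w_g, w_l, w_c = weights
    if abs(w_g + w_l + w_c - 1.0) > 1e-9:
        raise ValueError("fusion weights must sum to 1")
    return w_g * min_max(grad) + w_l * min_max(np.abs(lap)) + w_c * min_max(curv)
```

Because each normalized feature lies in [0, 1] and the weights sum to 1, the fused index also lies in [0, 1], so a single threshold can be applied to it.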

3.2.2. Dynamic Threshold Calculation

For each feature channel, the dynamic thresholds are adjusted in real time according to changes in the signal features. The adaptive thresholds are generated at a preset percentile of the statistical distribution of the region of interest and are calculated as follows:

$$T_k = Q_p(F_k), \quad k = 1, 2, 3; \; p = 0.95 \tag{7}$$

where $F_k$ denotes the $k$th feature vector and $Q_p$ is the quantile function.

The percentile threshold p = 0.95 is optimized for the 0.08 nm resolution spectrometer, ensuring 95% of true features are retained. Chunk sizes are dynamically adjusted between 90 and 150 points to match typical peak widths in measured data from the EAST device, where 1 resolution unit = 30 data points.
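The quantile threshold of Equation (7) can be sketched with `numpy.quantile`. How points above the threshold are turned into chunk boundaries is not detailed in this section, so the `candidate_boundaries` helper below (a hypothetical name) simply returns the indices that exceed the threshold. The last helper restates the quoted conversion of 30 data points per resolution unit.

```python
import numpy as np

POINTS_PER_RESOLUTION_UNIT = 30  # conversion quoted in the text

def dynamic_threshold(feature, p=0.95):
    """Quantile threshold T = Q_p(F) for one feature channel."""
    return float(np.quantile(feature, p))

def candidate_boundaries(feature, p=0.95):
    """Indices whose feature value exceeds the quantile threshold."""
    feature = np.asarray(feature, dtype=float)
    return np.flatnonzero(feature > dynamic_threshold(feature, p))

def points_to_resolution_units(n_points):
    """Express a chunk size in resolution units."""
    return n_points / POINTS_PER_RESOLUTION_UNIT
```

With this conversion, the 90 to 150 point chunk range corresponds to 3 to 5 resolution units.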

3.3. Synergy Between Multi-Scale Feature Space Construction and Density Clustering

The formation of spectral signals is affected by multiple factors and has complex characteristics in the time, frequency, and spatial domains. The construction of multi-scale feature space is aimed at integrating the multi-dimensional features in the time domain, frequency domain, and morphology, solving the problems of noise interference, overlapping peak separation, and local density difference and providing a highly discriminative feature base for the subsequent density clustering. Key features and their contributions are summarized in Table 1.

4. Experiment

4.1. Data Sources

All experimental data were acquired from the diagnostic systems of the EAST device during deuterium plasma discharges. The NBI system parameters were as follows: extraction voltage = 46 kV, beam current = 28.8 A.

The spectral data (stored as NBI__23983.sif) were acquired using a high-resolution spectrometer (model: Andor Shamrock 500i) with a wavelength range of 650–660 nm and a spectral resolution of 0.08 nm. The five known characteristic peaks comprise (1) an Hα line from background hydrogen and (2) four deuterium impurity lines verified by EAST’s auxiliary optical diagnostic system. The specific parameter settings are shown in Table 2.

4.2. Multi-Scale Feature Fusion for Optimal Clustering Detection

To verify the effectiveness of multi-scale feature fusion in spectral data clustering analysis, especially for enhancing spectral peak detection performance, this study selects the complete waveform segments from the preprocessed spectral data file “NBI__23983.sif” (see Figure 3). Two processing methods, multi-scale feature fusion and single feature extraction, are each combined with the DBSCAN clustering algorithm, and their performance in the spectral data clustering task is compared.

Single-scale feature extraction used only normalized spectral intensity as the input feature, ignoring structural and domain-specific information. Figure 4 demonstrates the multi-scale feature information, where a 7-dimensional feature matrix was constructed by normalizing and integrating multi-dimensional information, including normalized wavelength, normalized intensity, gradient, curvature, local maxima, frequency domain harmonic amplitude, and local SNR. DBSCAN parameters were optimized via the k-distance method (k = 5, consistent with the minimum number of points required for a valid peak), with ε = 0.3 ± 0.02 and min_samples = 5 for multi-scale features and ε = 0.5 ± 0.03 and min_samples = 3 for single-scale features. Clustering quality was assessed using the silhouette coefficient (SC = 0.72 ± 0.03) and cluster purity (CP = 0.91 ± 0.02) for multi-scale feature fusion, outperforming single-scale feature extraction (SC = 0.45 ± 0.05, CP = 0.63 ± 0.04), where SC > 0.5 indicates good intra-cluster homogeneity and CP = 1 means all clusters contain only valid peak points.
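The k-distance method mentioned above ranks every point by the distance to its k-th nearest neighbour; ε is then read off near the knee of the sorted curve. The sketch below computes that curve with NumPy only. It illustrates the general heuristic, not the authors' exact procedure, and the function name `k_distance_curve` is hypothetical.

```python
import numpy as np

def k_distance_curve(features, k=5):
    """Sorted distance from each point to its k-th nearest neighbour.

    `features` has shape (n_points, n_features), e.g. the 7-dimensional
    feature matrix described in the text, with n_points > k.
    """
    features = np.asarray(features, dtype=float)
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    dist.sort(axis=1)            # column 0 is each point's zero distance to itself
    return np.sort(dist[:, k])   # column k is the k-th nearest other point
```

Plotting the returned curve against the point rank shows a flat region followed by a sharp rise; a value of ε near that transition separates dense peak regions from sparse background points.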

The DBSCAN parameters are calculated dynamically from the local characteristics of the data so that the clustering results adapt to the data distribution, and the clustering quality is assessed using the silhouette coefficient. To verify the necessity of feature selection, the indispensability of each feature is clarified by sequentially removing individual features and comparing the changes in clustering performance.

A visualization of the clustering results is shown in Figure 5, with scatter colors characterizing the cluster labels to which they belong. From left to right and top to bottom, each panel displays the result obtained after sequentially removing one feature. Compared with non-full-feature clustering, the full multi-scale feature-fusion strategy clearly splits the main peak into two independent clusters: No speckle points appear between groups, the boundary is sharp, and the transition zone is extremely narrow, which shows that data points within each group are highly consistent while data points from different groups are distinctly different. As features are progressively discarded, the overlapping area between groups expands, boundary ambiguity increases, and the clustering process degenerates into a simple intensity-threshold segmentation, completely losing its ability to identify manifold structures. This enhanced topological discriminability effectively decouples the coupled physical process features embedded in the spectrum. Experiments confirm that the multi-scale feature-fusion strategy offers significant advantages in excavating deep correlation features within spectral data.

4.3. Dynamic Threshold Quantile Parameter p and Three-Signature Fusion Mechanism

Three sets of typical NBI spectral data from the EAST device (NBI__23984.sif, NBI__23983.sif, NBI__23982.sif) were selected. To comprehensively investigate the impact of the quantile p on the performance of peak boundary detection in spectral data analysis, a series of controlled experiments were conducted. Three key quantile values, namely p = 0.9, p = 0.95, and p = 0.99, were selected as the test parameters. For each quantile, peak boundary detection was performed on the three sets of spectral data. The detection results were evaluated on three criteria: peak boundary accuracy, false boundary rate, and weak peak edge retention rate.

In Figure 6, the test results reveal distinct performance characteristics for different quantile values. In terms of peak boundary accuracy, the case of p = 0.95 exhibited the optimal performance, with all the tested spectral data achieving a 100% accuracy rate. When p = 0.9, the accuracy rate ranged from 94.4% to 95.0%. In contrast, for p = 0.99, the accuracy rate decreased, falling within the range of 88.9–90.0%.

Regarding the false boundary rate, the p = 0.95 scenario showed almost no false boundaries, with a false boundary rate of 0. For p = 0.9, the spectral data 23983 had a relatively high false boundary rate of 15%. When p = 0.99, the spectral data 23983 also exhibited a false boundary rate of 5.9%.

In terms of the weak peak edge retention rate, all the spectral data maintained a 100% retention rate across the three p values. This indicates that the algorithm is highly stable in detecting weak peak edges, and that the detection result is not significantly affected by variation in the quantile p.

As shown in Figure 7, the wavelength interval of interest is from 655.0 nm to 659.0 nm. (a) Gradient: the gradient peaks near 656.35 nm, indicating that the rate of change in the signal is largest here, which marks the starting position of the peak. There is also a significant gradient change near 658.0 nm. (b) Laplacian: the plot shows a significant negative spike around 656.4 nm followed by a positive spike near 656.5 nm, indicating that the concavity of the signal changes here, which marks the edge location of the peak. (c) Curvature: the curvature peaks around 656.4 nm, indicating that the signal bends most strongly here and that this is the location of the inflection point of the peak. (d) Chunking boundaries: these are located in the regions where the signal characteristics change significantly. For example, at 656.4 nm the signal shows an obvious peak, and the chunking boundaries are positioned near the start and end of this peak, at approximately 656.28 nm and 656.48 nm. This demonstrates that the adaptive chunking strategy can reasonably divide the region based on the characteristic changes in the signal.

4.4. Comparative Analysis of Adaptive Chunking and Fixed Chunking Strategies

The core region of the NBI spectrum contains approximately 2500 data points, within which 3–5 characteristic peaks are distributed, with each peak having an average width of 120 data points. To systematically evaluate the performance of the fixed chunking strategy and avoid subjectivity in parameter selection, this study selected three representative chunk sizes for testing: 60 data points (0.5 times the average peak width), 120 data points (matching the average peak width), and 150 data points (1.3 times the average peak width, balancing integrity and computational efficiency). This design not only meets the core requirement of completely containing individual peaks but also verifies the robustness of the results through multi-size comparison.

The adaptive chunking strategy divides spectral data into dynamic chunks based on three local features: gradient, Laplacian second-order derivative, and curvature. The chunk size is adaptively adjusted within the range of 90–130 data points: smaller chunks are employed in peak-dense regions to capture adjacent peaks, while larger chunks are used in sparse regions to suppress noise.

In Figure 8, there are significant differences among different chunk sizes. A chunk size of 30 data points is an overly small chunk. Since it is much smaller than the spectral peak width, a single peak is over-segmented into small chunks, and the algorithm cannot recognize the complete peak shape, with the missed detection rate being as high as about 95%. Meanwhile, overly small chunks usually only contain noise and do not trigger the detection threshold, so the false positive rate is as low as about 0%. A chunk size of 60 data points belongs to a moderate-sized chunk. When the chunk size is close to the peak width, the algorithm is sensitive to local noise and tends to misjudge fluctuations as peaks or over-segment adjacent peaks into pseudo-peaks, causing the false positive rate to surge to about 90%. Moreover, this size can cover the complete peak shape, and most real peaks are effectively captured, with the missed detection rate significantly reduced to about 20%. Chunk sizes of 90–180 data points are overly large chunks. When the chunk size exceeds the peak width, it may merge multiple peaks or cause false peaks due to background noise, and the false positive rate remains at a high level of 85–95%. Additionally, overly large chunks tend to mask weak peaks and overlapping peaks, resulting in some real peaks being merged and submerged, and the missed detection rate fluctuates and rises to 40–75%, and increases as the chunk size grows.

Figure 9 and Figure 10 show the differential performance of the fixed chunking strategy and adaptive chunking strategy in feature clustering and candidate peak recognition effects, respectively. In DBSCAN clustering, labels are used to mark the cluster that data points in the same high-density connected region belong to with non-negative integers, and to mark noise points with −1; these serve as the basic identifier for distinguishing clusters from noise and conducting subsequent analysis of clustering results.
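The label convention described here (non-negative integers for clusters, −1 for noise) makes the downstream bookkeeping simple. The sketch below takes an array of labels in that convention and separates cluster members from noise points; the labels in the usage note are made up for illustration and do not come from the EAST data, and the function name is hypothetical.

```python
import numpy as np

def split_clusters(labels):
    """Group point indices by cluster label and collect noise indices.

    Follows the convention described in the text: cluster labels are
    non-negative integers and noise points carry the label -1.
    """
    labels = np.asarray(labels)
    noise = np.flatnonzero(labels == -1)
    clusters = {
        int(c): np.flatnonzero(labels == c)
        for c in np.unique(labels)
        if c != -1
    }
    return clusters, noise
```

For example, `split_clusters([0, 0, -1, 1, 1, -1, 0])` yields two clusters, with indices 0, 1, 6 and 3, 4, plus noise indices 2 and 5. Each cluster's index range can then be mapped back to wavelengths to obtain candidate peak boundaries.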

The fixed chunking in Figure 9 is limited by its preset partition size, which can easily cause feature dimension collapse in complex data space. Uniform chunking ensures the consistency of the peak area but breaks DBSCAN density clusters in non-uniformly distributed data, damaging the topological integrity of the structure. Noise in feature-sparse areas is easily misclassified as a peak, resulting in a high false alarm rate. Although fixed chunking is simple to implement, its overall feature continuity is poor, its noise sensitivity is high, and its processing performance is clearly weaker than that of the dynamic strategy.

As can be seen in Figure 10, the adaptive chunking and fixed chunking strategies show significant differences in processing non-uniform spreading spectral data. Adaptive chunking adjusts the chunk size adaptively, uses smaller chunks to accurately capture neighboring peaks in the feature-dense area, expands the chunks to suppress noise in the sparse area, and the candidate peaks are concentrated in the real peaks with a low false alarm rate, which has both high specificity and sensitivity. Its flexible demarcation effectively maintains the continuity of features and is suitable for processing complex spectral data.

Integrating the detection data across different signal-to-noise ratios (SNRs) in Table 3’s comparison of adaptive chunking and fixed chunking detection results reveals that the fixed chunking strategy exhibits significant performance sensitivity to SNR variations: In low SNR scenarios, its missed detection rate reaches 100%, with false positive rates consistently maintaining between 75 and 100%, indicating its near-total inability to effectively identify true characteristic peaks under noise interference. In contrast, the adaptive chunking strategy demonstrates robust stability across the full SNR range (5–14 dB), with missed detection rates consistently controlled between 0 and 20% and false positive rates limited to 16.67–50.00%. Notably, even under extremely low SNR conditions (5–7 dB), it still achieves 0% missed detection. These results confirm that dynamically adjusting chunk size based on local spectral features effectively enhances the ability to identify characteristic peaks, thereby simultaneously reducing both false positives and missed detections compared to the fixed chunking approach with uniform partition granularity.
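The two rates quoted above can be computed by matching detected peak centres to known peak positions within a tolerance. The article does not spell out its matching rule or the denominator of the false positive rate. The sketch below assumes that a detection counts as correct when it lies within `tol` of any known peak, and that the false positive rate is the share of detections with no match. That reading is consistent with the reported 16.67% (one spurious detection out of six), but it remains an assumption. The wavelengths in the usage note are hypothetical.

```python
def detection_rates(true_peaks, detected, tol):
    """Return (missed_rate, false_positive_rate) as fractions.

    missed_rate         = known peaks with no detection within tol / known peaks
    false_positive_rate = detections with no known peak within tol / detections
    Both definitions are assumptions; see the lead-in above.
    """
    hit_true = [any(abs(d - t) <= tol for d in detected) for t in true_peaks]
    hit_det = [any(abs(d - t) <= tol for t in true_peaks) for d in detected]
    missed = 1.0 - sum(hit_true) / len(true_peaks)
    false_pos = 1.0 - sum(hit_det) / len(detected) if detected else 0.0
    return missed, false_pos
```

With five hypothetical known peaks and six detections of which one is spurious, for example `detection_rates([656.1, 656.3, 656.5, 656.8, 657.2], [656.1, 656.3, 656.5, 656.8, 657.2, 658.0], tol=0.05)`, the missed detection rate is 0 and the false positive rate is 1/6.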

To more intuitively compare the robustness performance between dynamic chunking and fixed chunking methods under low signal-to-noise ratio (SNR) conditions, we selected three spectral samples with varying intensities and SNRs ranging from 5 to 9 dB for testing based on the data shown in Table 3. The test samples included data files NBI_110195.sif, NBI_110067.sif, and NBI_110069.sif, with particular focus on evaluating their peak identification accuracy and localization precision in high-noise environments.

As shown in the spectral detection results in Figure 11, the proposed method demonstrates superior performance compared to the fixed chunking approach, effectively suppressing background noise interference and accurately distinguishing true peaks from noise signals. It achieves precise identification even for weak peaks with signal intensities approaching noise levels. For three spectral datasets with different signal-to-noise ratios (SNRs), the detected peak centers are precisely localized with consistently low miss rates, fully demonstrating its excellent robustness against signal intensity fluctuations.

5. Conclusions

Experimental results based on the measured spectral data from EAST device have comprehensively verified the superiority and robustness of the proposed morphological segmentation and optimized clustering detection method. Through systematic comparative analysis, the multi-scale feature fusion strategy—integrating standardized basic features, differential features, morphological gradients, frequency-domain harmonics, and local signal-to-noise ratio (SNR) characterization—significantly outperforms single-scale feature extraction, achieving a silhouette coefficient of 0.72 ± 0.03 and a cluster purity of 0.91 ± 0.02. Feature selection experiments confirm that full-feature fusion ensures that data points within each group are highly consistent and those between different groups are distinctly different, effectively decoupling the coupled physical process features in the spectrum. In contrast, sequential feature removal degrades clustering into simple intensity-threshold segmentation, resulting in blurred boundaries and overlapping clusters. The optimal quantile parameter for the dynamic threshold is determined as p = 0.95, which achieves 100% peak boundary accuracy and a 0% false boundary rate across all tested datasets while maintaining a 100% weak peak edge retention rate. Compared with p = 0.9 and p = 0.99, this parameter achieves a better balance between boundary precision and weak feature preservation. Comparative evaluations of adaptive and fixed chunking strategies show that the adaptive method—dynamically adjusting chunk sizes (90–130 data points) based on gradient, Laplacian, and curvature features—outperforms fixed chunking (60, 120, 150 data points) across the entire SNR range (5–14 dB). 
Fixed chunking is highly sensitive to chunk size and noise: overly small chunks (30 data points) lead to a missed detection rate of approximately 95%, moderate chunks (60 data points) cause the false positive rate to soar to around 90%, and large chunks (90–180 data points) result in a missed detection rate of 40–75% and a false positive rate of 85–95%. In contrast, the adaptive chunking strategy controls the missed detection rate within 0–20% and the false positive rate within 16.67–50.00%, achieving 0% missed detection even under low SNR conditions (5–7 dB). Specifically, in low SNR environments, compared with fixed chunking, adaptive chunking reduces the mean false alarm rate by 41.5% and the mean missed alarm rate by 66.7%; in extreme noise scenarios, its correct detection rate remains higher than that of fixed chunking, with a 60% reduction in the false alarm rate. Collectively, these results demonstrate that the proposed method effectively addresses key challenges in Beam Emission Spectroscopy diagnostics, including low SNR, peak overlapping, and dynamic broadening, providing a reliable and efficient solution for high-precision plasma parameter extraction.

Author Contributions

Conceptualization, T.L. and L.-Z.L.; methodology, T.L. and Z.-K.C.; validation, T.L.; resources, L.-Z.L., S.-X.Z. and G.-N.H.; data curation, X.-Q.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Comprehensive Research Facility for Fusion Technology Program of China (Contract No. 2018-000052-73-01-001228), the Natural Science Foundation of Anhui Province (Contract No. 2208085MA19), and the Institute of Energy, Hefei Comprehensive National Science Center (Grant No. 21KZS202).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Figure 1. Flowchart of morphological segmentation and optimized clustering detection method.

Figure 2. Adaptive chunking flowchart.

Figure 3. Real spectral data slicing.

Figure 4. Multi-dimensional feature matrix.

Figure 5. Comparative evaluation of clustering performance with multi-scale feature selection.

Figure 6. Impact of quantile p on peak boundary detection metrics.

Figure 7. Adaptive chunking strategy with three-feature fusion detection. (a) Gradient; (b) fusion of the signal’s second-order derivative with Laplacian edge features; (c) curvature; (d) chunking results for spectral data (different colors represent different chunk regions).

Figure 8. Sensitivity analysis of fixed chunk size on NBI spectral peak detection.

Figure 9. Clustering results and candidate peak finding results in fixed chunking strategy.

Figure 10. Clustering results and candidate peak finding results in adaptive chunking strategy.

Figure 11. Comparison of peak detection results between adaptive chunking (left) and fixed chunking (right) under low signal-to-noise ratio.

Table 1. Key features and their contributions in multi-scale feature space.

Feature Type	Mathematical Definition	Parameters	Purpose	Performance Impact
Standardized basic features	$X_{\mathrm{norm}} = \frac{X - \mathrm{median}(X)}{\mathrm{IQR}(X)}$	$X$: original signal; IQR: interquartile range	Eliminates instrument calibration errors and baseline drift, normalizing the dynamic range to [−1, 1].	Enhances cross-band comparability and provides a uniform baseline reference for density clustering.
Differential features	$\nabla X = \frac{X_{i+1} - X_{i}}{\Delta\lambda}$	$\Delta\lambda$: wavelength step size	Captures local trends in spectral signals: 1st-order derivatives identify rising/falling edges; 2nd-order derivatives identify concavities and inflection points.	Improves detection of abrupt changes and turning points in signals.
Morphological gradients	$\nabla X = (X \oplus B) - (X \ominus B)$	$\oplus$: dilation operation; $\ominus$: erosion operation; $B$: structuring element	Extracts local structural changes using dynamic window sizes adjusted to typical peak widths.	Enhances edge contrast in weak signal regions and adapts to varying peak widths.
Frequency-domain harmonics	$X(f) = \int_{-\infty}^{\infty} x(t)\, e^{-j 2 \pi f t}\, dt$	$x(t)$: time-domain signal; $X(f)$: frequency-domain representation; $f$: frequency	Utilizes harmonic correlation differences to improve robustness against periodic disturbances.	Helps separate overlapping peaks and reduces noise interference in the frequency domain.
Local signal-to-noise ratio characterization	$\mathrm{SNR}_{\mathrm{local}} = \frac{\mu_{\mathrm{signal}}}{\sigma_{\mathrm{noise}}}$	$\mu_{\mathrm{signal}}$: local signal mean; $\sigma_{\mathrm{noise}}$: local noise standard deviation	Constructs spatial attention weights to enhance detection sensitivity in low SNR regions.	Automatically down-weights noisy dense regions, reducing false positives in sparse noise areas.
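
The definitions in Table 1 can be made concrete with a short standard-library sketch. It covers four of the five rows (robust normalization, the first-order difference quotient, the morphological gradient as dilation minus erosion with a flat window, and the local SNR ratio) and omits the Fourier row. The function names, the window half-widths, and the choice to pass the noise standard deviation in as a known value are assumptions for illustration, not the settings used in the experiments.

```python
import statistics


def robust_normalize(x):
    """(x - median) / IQR, following the first row of Table 1."""
    med = statistics.median(x)
    q1, _, q3 = statistics.quantiles(x, n=4)
    iqr = (q3 - q1) or 1.0  # guard against a flat signal
    return [(v - med) / iqr for v in x]


def forward_difference(x, dlam):
    """First-order difference quotient (X[i+1] - X[i]) / dlam."""
    return [(b - a) / dlam for a, b in zip(x[:-1], x[1:])]


def morphological_gradient(x, half_width=1):
    """Dilation minus erosion with a flat window of 2*half_width + 1 samples."""
    out = []
    for i in range(len(x)):
        window = x[max(0, i - half_width): i + half_width + 1]
        out.append(max(window) - min(window))  # sliding max minus sliding min
    return out


def local_snr(x, noise_sigma, half_width=2):
    """Local window mean divided by a supplied noise standard deviation."""
    out = []
    for i in range(len(x)):
        window = x[max(0, i - half_width): i + half_width + 1]
        out.append(statistics.fmean(window) / noise_sigma)
    return out
```

Stacking these per-sample outputs column by column gives one row of features per wavelength sample, which is the kind of multi-dimensional feature matrix that a density-based clustering step can consume.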

Table 2. Parameter settings anchored to 0.08 nm resolution.

Parameter	Value	Physical Basis
Resolution ($\Delta\lambda$)	0.08 nm	Spectrometer specification
Safety factor	2–4 (typical: 3)	Ensures coverage of peak separation considering Doppler broadening (Δλ_D ≈ 0.18 nm)
Min peak width	0.20–0.30 nm	Derived from 2.5 to 3.75 × Δλ to accommodate instrumental and physical broadening
Validation discharges	15–25	Covers typical SNR regimes (5–14 dB) in EAST NBI operations
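
The minimum peak width row follows directly from the resolution, and the arithmetic can be checked in a few lines. The constant and function names below are hypothetical; only the numeric values come from Table 2.

```python
RESOLUTION_NM = 0.08  # spectrometer resolution from Table 2
DOPPLER_NM = 0.18     # Doppler broadening quoted in Table 2


def min_peak_width_nm(factor, resolution_nm=RESOLUTION_NM):
    """Minimum peak width expressed as a multiple of the resolution."""
    return factor * resolution_nm


low = min_peak_width_nm(2.5)    # lower bound of the range
high = min_peak_width_nm(3.75)  # upper bound of the range
doppler_in_steps = DOPPLER_NM / RESOLUTION_NM

print(f"min peak width: {low:.2f}-{high:.2f} nm")
print(f"Doppler broadening spans about {doppler_in_steps:.2f} resolution steps")
```

The lower bound of the minimum peak width (2.5 resolution steps) therefore sits just above the quoted Doppler broadening (about 2.25 steps).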

Table 3. Comparison of adaptive chunking and fixed chunking detection results.

Data File	SNR (dB)	Strategy Type	Number of Known Characteristic Peaks	Number of Detected Peaks	Number of False Positives	Number of Omissions	Number of Correct Detections	False Positive Rate	Missed Detection Rate
NBI__110195.sif	5	Adaptive chunking	4	5	1	0	4	20.00%	0.00%
NBI__110195.sif	5	Fixed chunking	4	10	10	4	0	100.00%	100.00%
NBI__23082.sif	6	Adaptive chunking	5	8	3	1	4	37.50%	20.00%
NBI__23082.sif	6	Fixed chunking	5	20	15	5	0	75.00%	100.00%
NBI__110067.sif	7	Adaptive chunking	4	5	1	0	4	20.00%	0.00%
NBI__110067.sif	7	Fixed chunking	4	16	16	4	0	100.00%	100.00%
NBI__110072.sif	7	Adaptive chunking	3	6	3	0	3	50.00%	0.00%
NBI__110072.sif	7	Fixed chunking	3	15	14	2	1	93.33%	66.67%
NBI__110069.sif	9	Adaptive chunking	4	6	2	0	4	33.33%	0.00%
NBI__110069.sif	9	Fixed chunking	4	15	14	3	1	93.33%	75.00%
NBI__23981.sif	12	Adaptive chunking	5	9	4	1	4	44.44%	20.00%
NBI__23981.sif	12	Fixed chunking	5	21	16	4	1	76.19%	80.00%
NBI__23982.sif	12	Adaptive chunking	5	7	2	1	4	28.57%	20.00%
NBI__23982.sif	12	Fixed chunking	5	20	15	4	1	75.00%	80.00%
NBI__23983.sif	13	Adaptive chunking	5	7	2	0	5	28.57%	0.00%
NBI__23983.sif	13	Fixed chunking	5	23	18	4	1	78.26%	80.00%
NBI__23083.sif	13	Adaptive chunking	5	8	3	0	5	37.50%	0.00%
NBI__23083.sif	13	Fixed chunking	5	20	15	4	1	75.00%	80.00%
NBI__23984.sif	14	Adaptive chunking	5	6	1	0	5	16.67%	0.00%
NBI__23984.sif	14	Fixed chunking	5	21	16	3	2	76.19%	60.00%
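
The two rate columns in Table 3 are consistent with computing the false positive rate as false positives divided by detected peaks, and the missed detection rate as omissions divided by known peaks. These definitions are inferred from the tabulated values, not quoted from the text. The sketch below recomputes both rates for the adaptive chunking rows; the tuple layout and function names are assumptions for illustration.

```python
# (file, known peaks, detected peaks, false positives, omissions)
# Adaptive chunking rows copied from Table 3.
ADAPTIVE_ROWS = [
    ("NBI__110195.sif", 4, 5, 1, 0),
    ("NBI__23082.sif", 5, 8, 3, 1),
    ("NBI__110067.sif", 4, 5, 1, 0),
    ("NBI__110072.sif", 3, 6, 3, 0),
    ("NBI__110069.sif", 4, 6, 2, 0),
    ("NBI__23981.sif", 5, 9, 4, 1),
    ("NBI__23982.sif", 5, 7, 2, 1),
    ("NBI__23983.sif", 5, 7, 2, 0),
    ("NBI__23083.sif", 5, 8, 3, 0),
    ("NBI__23984.sif", 5, 6, 1, 0),
]


def false_positive_rate(detected, false_positives):
    """Share of detected peaks that are false positives."""
    return false_positives / detected if detected else 0.0


def missed_detection_rate(known, omissions):
    """Share of known characteristic peaks that were not detected."""
    return omissions / known if known else 0.0


fp_rates = [false_positive_rate(d, fp) for _, _, d, fp, _ in ADAPTIVE_ROWS]
miss_rates = [missed_detection_rate(k, om) for _, k, _, _, om in ADAPTIVE_ROWS]

for (name, *_), fp, miss in zip(ADAPTIVE_ROWS, fp_rates, miss_rates):
    print(f"{name}: FP {fp:.2%}, missed {miss:.2%}")
```

Taking the minimum and maximum of `fp_rates` and `miss_rates` gives the ranges stated for adaptive chunking (16.67–50.00% and 0–20%).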
