Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel

Kim, Yumin; Bang, Jonghyun; Kim, Taehyoung

doi:10.3390/math14030400

Open AccessArticle

Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel

by

Yumin Kim

¹,

Jonghyun Bang

^2,* and

Taehyoung Kim

^1,*

¹

School of Electrical Engineering, Kookmin University, Seoul 02707, Republic of Korea

²

Department of Electronic Engineering, Myongji University, Yongin-si 17058, Republic of Korea

^*

Authors to whom correspondence should be addressed.

Mathematics 2026, 14(3), 400; https://doi.org/10.3390/math14030400

Submission received: 10 December 2025 / Revised: 11 January 2026 / Accepted: 21 January 2026 / Published: 23 January 2026

(This article belongs to the Special Issue Computational Methods in Wireless Communications with Applications)

Download

Browse Figures

Versions Notes

Abstract

This paper proposes a novel blind channel estimation method based on K-means clustering algorithm with efficient time–frequency resource grouping. Existing K-means-based blind channel estimation techniques assume that received symbols within the coherence time and coherence bandwidth experience the same channel response, which is not valid under fading channel with severe time variation or frequency selectivity. To overcome this limitation, this paper proposes an efficient time–frequency resource grouping pattern selection algorithm. The proposed method introduces the concept of an effective number of data symbols, which eliminates patterns that are computationally expensive yet performance-irrelevant, thereby reducing the search space compared to exhaustive search. Two strategies are applied: Time-main, which prioritizes grouping in the time domain, and Freq-main, which prioritizes grouping in the frequency domain. Simulation results demonstrate that the proposed method consistently outperforms conventional and fixed-pattern approaches across various channel conditions.

Keywords:

blind channel estimation; fading channel; K-means clustering; resource grouping

MSC:

94A13

1. Introduction

The 5G New Radio (NR) standard defines three major use cases—enhanced mobile broadband (eMBB), massive machine-type communications (mMTC), and ultra-reliable low-latency communications (URLLC)—and has become a core technology for next-generation wireless communication systems [1]. To support these use cases, numerous studies have been conducted [2,3,4,5,6,7,8,9]. These studies employ clustering techniques to classify data effectively and apply the K-means algorithm to improve beamforming efficiency, resource allocation, multiple-input multiple-output (MIMO) system performance, energy efficiency, and power control. As a representative unsupervised learning method, the K-means algorithm is computationally simple and exhibits strong clustering capability. Owing to these advantages, it has been widely applied to various physical-layer technologies [10,11].

In particular, clustering based on signal similarity enhances resource utilization and improves channel estimation accuracy even in multipath environments [12,13]. Channel estimation plays a crucial role in compensating for signal distortion by accurately identifying time-varying channel conditions and multipath fading. In 5G NR, demodulation reference signal (DM-RS)-based channel estimation has been adopted as a standard technique [14,15,16,17]. While DM-RS enables effective channel estimation, it occupies time–frequency resources together with data symbols, thereby consuming resources that could otherwise be used for data transmission. Hence, minimizing such resource consumption is essential. To address this limitation, a K-means-based channel estimation method has been proposed [18]. This approach performs pilotless channel estimation using only received signals without DM-RS transmission, thereby eliminating pilot overhead and improving spectral efficiency. However, this method assumes that all symbols within the resource block to which K-means is applied experience identical channel responses—namely, that the block lies entirely within the coherence time and coherence bandwidth of the channel.

In practice, when the coherence time and coherence bandwidth are exceeded, channel characteristics vary rapidly. In particular, high-mobility environments exhibit strong time selectivity, where the channel changes quickly over time, whereas environments with large delay spreads experience significant frequency selectivity, where the channel response differs across subcarriers [14,19]. Under such highly dynamic channel conditions, the assumption of channel uniformity within a single resource block no longer holds. Consequently, K-means-based estimation fails to fully capture actual channel variations, leading to increased estimation errors and potential degradation of overall system performance.

To overcome these limitations, time–frequency grouping is applied. By partitioning the resource grid into smaller subgroups and performing K-means-based channel estimation independently within each group, more accurate estimation can be achieved within local coherence regions. However, excessively fine grouping reduces the number of data samples available in each group, weakening the noise-averaging effect of K-means and consequently degrading clustering accuracy. Conversely, identifying the optimal group size through an exhaustive search over all possible time–frequency partition combinations results in prohibitively high computational complexity, making such approaches impractical for real-time communication systems.

To address this issue, this paper introduces a signal-to-noise ratio (SNR)-based effective data count constraint. The effective data count specifies the minimum number of data symbols required in each group to ensure reliable K-means estimation, thereby excluding grouping patterns that are computationally expensive yet performance-irrelevant. As a result, the proposed constraint significantly reduces the search space and improves computational efficiency while maintaining the required channel estimation accuracy.

Previous studies have investigated adaptive channel estimation strategies based on channel statistics or SNR-related criteria, such as switching estimation methods according to Doppler characteristics or sparsity levels [14,20]. However, these approaches primarily focus on estimator selection and do not explicitly consider adaptive time–frequency grouping. In contrast, the proposed method determines the time–frequency grouping structure within a unified K-means-based framework by incorporating an SNR-dependent effective data count constraint, without requiring prior channel statistics or pilot signals.

Furthermore, the proposed algorithm employs two search strategies: the Time-main strategy, which is effective in rapidly time-varying channels with high time selectivity, and the Freq-main strategy, which performs better in channels with large delay spreads and high frequency selectivity. Both strategies conduct efficient searches under the effective data count constraint, significantly reducing computational load compared to exhaustive search while maintaining high estimation performance across diverse channel environments. Finally, the performance of the proposed method is evaluated through simulations and compared with the conventional K-means-based approach [18] and a fixed-pattern method.

2. System Model

This study is based on the 5G NR data transmission structure, in which the transmitted signal is allocated to a resource grid (RG). The RG consists of one or more slots in the time domain and one or more resource blocks (RBs) in the frequency domain. A single slot contains 14 orthogonal frequency division multiplexing (OFDM) symbols, while a single RB contains 12 subcarriers. The resource element (RE), defined by one OFDM symbol in the time domain and one subcarrier in the frequency domain, represents the smallest unit of time–frequency resources [21].

In this study, the RG is configured with one slot and ten RBs, resulting in a total of 1680 REs. The transmitted data undergo cyclic redundancy check (CRC) attachment, channel coding, and digital modulation before being mapped onto the RG. Quadrature phase shift keying (QPSK) is employed for digital modulation. The modulated signal is OFDM-modulated, transmitted through a wireless fading channel, and received at the user equipment (UE) with additive noise [21,22]. The received signal model is given by

Y_{k, l} = H_{k, l} X_{k, l} + N_{k, l},

(1)

where

k

denotes the subcarrier index in the frequency domain and

l

denotes the OFDM symbol index in the time domain.

Y_{k, l}

is the received signal at the

k

-th subcarrier and

l

-th OFDM symbol,

X_{k, l}

is the transmitted signal, and

H_{k, l}

is the wireless fading channel,

N_{k, l}

is the noise. The UE performs K-means-based blind channel estimation based on the QPSK constellation defined as

X_{q} = e^{\frac{i (2 q - 4) π}{4}}, (q = 1, 2, 3, 4),

(2)

where

q

denotes the constellation index. Since QPSK consists of four constellation points, the number of clusters is fixed to K = 4 when applying the K-means algorithm. To improve clustering stability, K-means++ initialization is employed, where the initial centroids are selected according to a distance-based probability distribution [23]. Accordingly, the received signals

Y_{k, l}

are partitioned into four clusters using the K-means algorithm, and the centroid of each cluster, denoted by

C_{p}

is obtained, where

p \in {1, 2, 3, 4}

represents the cluster index. The channel estimate is derived from the cluster centroids as

{\hat{H}}_{p, q} = \frac{C_{p}}{X_{q}} .

(3)

By combining the four cluster centroids

C_{p}

and the four QPSK symbols

X_{q}

, a total of 16 candidate channel estimates

{\hat{H}}_{p, q}

are obtained. For each candidate channel estimate, the transmitted symbol is reconstructed as

{\hat{X}}_{k, l, p, q} = \frac{Y_{k, l}}{{\hat{H}}_{p, q}} .

(4)

QPSK demodulation is then applied, and the bit error rate (BER) is computed. The estimated channel

{\hat{H}}_{p, q}

that minimizes the BER is selected as the final estimated channel, denoted by

\tilde{H}

. Since all resource elements classified within the same cluster are assumed to share an identical channel response, the final estimated channel

\tilde{H}

is uniformly assigned to all corresponding

(k, l)

, yielding

{\tilde{H}}_{k, l}

. The mean squared error (MSE) between the estimated channel and the actual channel is calculated as

M S E = \frac{1}{K L} \sum_{l = 1}^{L} \sum_{k = 1}^{K} {|{\tilde{H}}_{k, l} - H_{k, l}|}^{2},

(5)

where

K

and

L

denote the total numbers of subcarriers and OFDM symbols in the considered resource grid, respectively. The MSE is used as a performance metric to evaluate the accuracy of channel estimation.

3. Proposed Method

The conventional K-means-based channel estimation method [18] assumes that all symbols within a resource block experience identical channel characteristics. However, in practical wireless environments, this assumption is often violated due to time- and frequency-selective fading caused by user mobility and multipath propagation. In the time domain, channel variations are primarily governed by the Doppler frequency shift. The maximum Doppler frequency

f_{d}

is given by

f_{d} = \frac{v f_{c}}{c},

(6)

where

v

denotes the user velocity,

f_{c}

is the carrier frequency, and

c

is the speed of light. The corresponding coherence time

T_{c}

, which represents the duration over which the channel can be regarded as approximately time-invariant, is commonly approximated in the literature as

T_{c} \approx \frac{1}{2 f_{d}} .

(7)

This relationship indicates that as user mobility increases, the coherence time decreases. Consequently, assuming a constant channel response over a long time interval becomes increasingly inaccurate in high-mobility scenarios, and time-domain grouping should be constrained within the channel coherence time to preserve channel homogeneity.

Similarly, in the frequency domain, channel selectivity is characterized by the delay spread

τ

. The coherence bandwidth

B_{c}

, which quantifies the frequency range over which the channel response remains highly correlated, can be approximated as

B_{c} \approx \frac{1}{τ} .

(8)

As the delay spread increases, the coherence bandwidth decreases, implying that grouping across a wide frequency span becomes increasingly unfavorable due to significant channel variations within a group. These fundamental relationships indicate that the feasible resolution of time–frequency grouping is inherently constrained by the physical characteristics of the wireless channel. In particular, stronger time selectivity requires finer grouping in the time domain, while stronger frequency selectivity requires finer grouping in the frequency domain to maintain channel consistency within each group.

Although applying various time–frequency grouping patterns can improve channel estimation accuracy, an exhaustive search over all possible grouping combinations leads to an exponentially growing search space, resulting in prohibitive computational complexity that limits real-time applicability. Moreover, excessively fine partitioning reduces the number of data symbols available per group, which degrades the clustering reliability of the K-means algorithm and ultimately deteriorates channel estimation performance. This behavior is consistent with well-established observations that insufficient data samples impair clustering stability in K-means-based methods [24,25,26].

To address these limitations, this paper introduces an SNR-based effective data count criterion that excludes grouping patterns insufficient to ensure reliable channel estimation. This criterion significantly reduces the search space and computational burden while preserving estimation accuracy. Furthermore, two pattern search strategies—Time-main and Freq-main—are proposed to prioritize time-domain and frequency-domain grouping, respectively, within the proposed pattern selection framework.

3.1. Resource Grouping Pattern Definition

The resource grouping patterns, which form the basis of the proposed search algorithm, are defined separately in the time and frequency domains. Channel estimation is then performed for each group using K-means clustering based on these patterns.

The frequency-domain pattern

P_{f} (F)

is defined according to the group size

F

, which is set as a divisor of the total number of subcarriers (120). Each pattern groups

F

subcarriers to form equally sized groups. For example, when

F = 40

, the pattern is represented as [40, 40, 40], resulting in three equal groups.

The time-domain pattern

P_{t} (T)

is defined based on 14 OFDM symbols within one slot, where the group size

T

is set as an integer between 1 and 14. Each pattern groups

T

consecutive OFDM symbols starting from the first symbol, and the remaining symbols are included in the final group. For example, when

T = 4

, the pattern [4, 4, 4, 2] has the smallest group size of 2. The square brackets [] denote the number of elements in each group, representing the number of subcarriers in the frequency domain and the number of OFDM symbols in the time domain.

To reduce computational complexity, a limited set of candidate grouping patterns is defined based on the effective data count constraint introduced in Section 3.2. In the frequency domain,

F

is restricted to divisors of 120 so that all frequency groups have equal size. This design avoids data-count imbalance across groups and ensures stable K-means clustering. As a result, asymmetric grouping is not adopted in the frequency domain.

In contrast, in the time domain, 15 representative grouping patterns are predefined to limit the search space while allowing diverse symbol grouping lengths. For

T = 3

, the conventional pattern

P_{t} (3) = [3, 3, 3, 3, 2]

includes a small minimum group. To address this issue, an asymmetric pattern

P_{t} (3 - 1) = [3, 3, 3, 5]

is additionally included to expand feasible grouping options. The final set of 15 time-domain pattern candidates is summarized in Table 1, and Figure 1 illustrates examples of these resource grouping patterns for different values of

F

and

T

.

3.2. SNR-Based Determination of Effective Data Count

This section presents the procedure for determining an appropriate data count under different SNR conditions. Since the number of data symbols within a group directly affects the performance of K-means-based channel estimation, it must be adjusted according to the channel environment. The manner in which the data count increases differs depending on whether the main axis is set to the time domain or the frequency domain.

In the Freq-main strategy, the time domain is fixed to 14 OFDM symbols, and the data count is expanded along the subcarrier axis. Consequently, the number of resource elements increases from 14 up to 1680 in increments of 14, resulting in a total of 120 configurations.

In the Time-main strategy, the frequency domain is fixed to 10 resource blocks, corresponding to 120 subcarriers, and the data count is expanded along the OFDM symbol axis. In this case, the number of resource elements increases from 120 up to 1680 in increments of 120, resulting in a total of 14 configurations.

For each configuration, K-means-based channel estimation is performed, and the MSE between the estimated and actual channels is computed. As the data count increases, the MSE decreases rapidly at first and then gradually approaches a saturated region with diminishing improvement. Accordingly, the effective data count is defined as the point at which this performance saturation begins. To identify this point in a consistent manner, the standard deviation of the MSE is computed over consecutive configurations, and the first data count at which it falls below a predefined threshold is selected. The resulting effective data count is then used as a reference criterion for the subsequent grouping pattern selection algorithm.

Figure 2 illustrates examples of how the data count expands in both the frequency and time domains. In the frequency domain, additional subcarriers are grouped incrementally, whereas in the time domain, the number of OFDM symbols is progressively increased. This visualization highlights the difference in how effective data counts are determined under the Freq-main and Time-main strategies.

3.3. Time–Freq Pattern Selection Algorithm

Based on the candidate resource grouping patterns and the effective data count constraint, this section presents the proposed time–frequency pattern selection algorithm. The algorithm is implemented using two strategies: Time-main and Freq-main. In the Time-main strategy, the time-domain pattern

P_{t}

is designated as the main axis pattern

P_{m a i n}

, and the frequency-domain pattern

P_{F}

is assigned as the sub axis pattern

P_{s u b}

. Conversely, in the Freq-main strategy,

P_{f}

is selected as the main axis and

P_{t}

as the sub axis. The overall selection procedure consists of two steps, as summarized in Algorithm 1.

Algorithm 1 Time-Frequency Grouping Selection
Input:
	$P_{m a i n}$ : Candidate set for the main-axis pattern.
	$P_{s u b}$ : Candidate set for the sub-axis pattern.
	$N_{e f f}$ : Effective data count required for the current SNR.
	$\{Y_{k, l}\}$ : Received signals on the resource grid.
	$\{H_{k, l}\}$ : True channel responses.
1:	Step 1: Generation of Candidate Combinations
2:	Initialize candidate set $C_{c a n d}$ $\leftarrow$ $\emptyset$
3:	for each $p_{m a i n} \in$ $P_{m a i n}$ do
4:		$d_{m i n} \leftarrow \min (p_{m a i n})$
5:		$s_{r e q} \leftarrow ⌈\frac{N_{e f f}}{d_{m i n}}⌉$
6:			$S \leftarrow {P \in P_{s u b} \| m i n (P) \geq s_{r e q}}$
7:			if $S \neq \emptyset$ then
8:				$p_{s u b} \leftarrow a r g m i n_{P \in S} (\min (P))$ } ▷ select the smallest feasible sub-axis
9:				$C_{c a n d} \leftarrow C_{c a n d} \cup \{(p_{m a i n}, p_{s u b})\}$
10:			end if
11:		end for
12:	Step 2: Performance Evaluation and Final Selection
13:	Initialize ${M S E}_{b e s t}$ $\leftarrow$ $\infty$ , ${(P}_{m a i n}^{}$ , $P_{s u b}^{}) \leftarrow$ $n u l l$
14:	for each $(P_{m a i n}$ , $P_{s u b}) \in C_{c a n d}$ do
15:		$G \leftarrow A p p l y G r o u p i n g (\{Y_{k, l}\}, P_{m a i n}, P_{s u b})$
16:		for each group $G_{n} \in G$ do
17:			${\tilde{H}}_{n} \leftarrow {K m e a n s E s t i m a t e (G}_{n})$
18:			for all $(k, l) \in G_{n}$
19:				${\tilde{H}}_{k, l} \leftarrow {\tilde{H}}_{n}$
20:			end for
21:		end for
21:		$M S E_{c u r r e n t} \leftarrow$ evaluate using Equation (5) with $\{{\tilde{H}}_{k, l}\}$ and ${{H}_{k, l}}$
22:		$i f M S E_{c u r r e n t} < M S E_{b e s t}$ then
23:			$M S E_{b e s t} \leftarrow M S E_{c u r r e n t}$
25:			${(P}_{m a i n}^{}$ , $P_{s u b}^{}) \leftarrow (P_{m a i n}, P_{s u b})$
26:		end if
27:	end for
Output: Final selected pair ${(P}_{m a i n}^{}$ , $P_{s u b}^{}$ )

Algorithm 1 illustrates the detailed procedure of the proposed time–frequency pattern selection algorithm. The algorithm consists of two main steps. In the first step, for each main-axis pattern

P_{m a i n}

, the minimum group size

d_{m i n}

is computed, and sub-axis pattern candidates that satisfy the

N_{e f f}

condition are identified. If multiple sub-axis candidates exist, only the finest-grained feasible sub-axis pattern is selected to reduce unnecessary exploration. Therefore, each main-axis pattern generates at most one candidate combination.

In the second step, performance evaluation is conducted on the candidate set

C_{c a n d}

. For each combination

{(P}_{m a i n}

,

P_{s u b})

, the resource grid is divided into groups, and K-means-based channel estimation is performed for each group to obtain the representative channel

{\tilde{H}}_{n}

. The estimated channel is uniformly assigned to all resource elements within the group, and performance is evaluated using the MSE criterion defined in the system model. Among all candidates, the combination that achieves the lowest MSE is selected as the final grouping pattern.

Unlike exhaustive search methods that evaluate all possible main–sub axis pattern combinations, the proposed approach excludes unnecessary candidates in advance through the

N_{e f f}

constraint. As a result, each main-axis pattern yields at most one candidate, and the total number of evaluated combinations is limited to no more than the number of main-axis patterns. This significantly reduces the computational burden while maintaining stable channel estimation performance.

3.4. Computational Complexity Analysis

This section analyzes the computational complexity of the proposed time–frequency pattern selection algorithm and compares it with an exhaustive search-based approach.

In K-means-based channel estimation, the computational complexity of a single K-means run applied to a group

G_{n}

can be expressed as

O (|G_{n}| \cdot K \cdot I_{g} \cdot D),

(9)

where

|G_{n}|

denotes the number of data samples in the group,

K

is the number of clusters,

I_{g}

is the number of iterations required for convergence, and

D

is the data dimension.

When evaluating a single time–frequency pattern combination, the K-means operation is applied repeatedly to all groups formed by that pattern. Consequently, the total computational cost for one pattern combination is determined by the cumulative cost of K-means executions over all groups.

In an exhaustive search-based scheme, all combinations of the time-domain pattern set

P_{t}

and the frequency-domain pattern set

P_{f}

must be evaluated. As a result, the total computational complexity scales proportionally to

|P_{t}| \cdot |P_{f}| .

Since K-means-based channel estimation is repeatedly performed for multiple groups under each pattern combination, the overall computational burden becomes very large, making real-time implementation challenging.

In contrast, the proposed method employs the effective data count constraint

N_{e f f}

, which allows only one sub-axis pattern to be evaluated for each main-axis pattern

P_{m a i n}

. Accordingly, the number of pattern combinations considered is limited to

|P_{m a i n}|

, which is significantly smaller than

|P_{t}| \cdot |P_{f}|

required by exhaustive search-based approaches.

Although the number of data samples

|G_{n}|

in each K-means execution may vary depending on the grouping configuration, the total number of samples processed over the entire resource grid remains unchanged. Therefore, the proposed grouping strategy does not alter the asymptotic order of the computational complexity per K-means execution. Instead, it substantially reduces the total number of K-means executions required during the pattern selection stage, leading to a significant reduction in overall computational complexity compared to exhaustive search-based methods.

4. Simulation Results

In this study, after determining the effective data count as a function of SNR, the performance of the proposed method was evaluated under various channel conditions. Four schemes were compared in terms of MSE and BER: the conventional K-means-based method, the fixed-pattern method, and the proposed methods with Time-main and Freq-main strategies. In addition, the average runtime was measured to assess the computational overhead of the proposed method.

All simulations were conducted based on the 5G NR tapped-delay line (TDL-A) channel model, and the key simulation parameters are summarized in Table 2. For reproducibility and fair comparison, the random seed was reinitialized at each Monte Carlo iteration and set to the iteration index, ensuring that all schemes were evaluated under identical data, channel, and noise realizations. All reported results were obtained by averaging over 100,000 Monte Carlo iterations.

The analysis results of the effective data count are presented in Table 3. The simulation results indicate that, at lower SNR values, noise effects become more pronounced, requiring a larger number of data samples to ensure stable clustering. Conversely, as the SNR increases, reliable channel estimation can be achieved with fewer data samples, leading to a gradual reduction in the effective data count.

In the frequency-domain case (Freq-main), the data count increases in increments of 14 resource elements, and the MSE variation across consecutive data configurations remains relatively smooth. Accordingly, to conservatively identify the stabilization point, the consecutive data segment size was set to 3, and the standard deviation threshold was set to 0.0013.

In the time-domain case (Time-main), the data count increases in increments of 120 resource elements, which results in larger MSE fluctuations in the initial region. Therefore, the consecutive data segment size was set to 2, and a more relaxed standard deviation threshold of 0.005 was applied. With these domain-specific criteria, the stabilization point of the MSE was consistently identified across various SNR conditions, and the effective data count was determined in a reproducible manner.

The proposed algorithm determines the grouping pattern based on the SNR-based effective data count and evaluates its performance through MSE and BER analyses. Figure 3 shows the MSE performance versus SNR under various channel conditions with different delay spreads and UE velocities. Figure 4 analyzes the impact of increasing UE velocity on MSE performance under a fixed delay spread, whereas Figure 5 focuses on the effect of increasing delay spread under a fixed UE velocity. Figure 6 presents the BER performance versus SNR under the same channel conditions used for the MSE evaluation, enabling consistent comparisons across different metrics. In addition, Table 4 compares the average elapsed time of the proposed K-means-based channel estimation with that of conventional methods.

The Conventional Pattern, introduced in [8], represents the baseline K-means-based channel estimation method, which assumes identical channel conditions for all resources without grouping. The Fixed Pattern applies a predefined grouping scheme with

F

set to 60 in the frequency domain and

T

set to 7 in the time domain. The Proposed Pattern represents the time–frequency pattern selection algorithm presented in this paper, which applies two strategies: Time-main, where the time domain is the primary axis, and Freq-main, where the frequency domain is the primary axis.

4.1. MSE Performance Versus SNR

Figure 3 presents the MSE performance versus SNR under three channel conditions, where the delay spread and UE velocity are varied across the scenarios.

Figure 3a shows the MSE performance in an environment with a delay spread of 30 ns and a UE velocity of 30 km/h. In this relatively stable environment, with limited variation in both the time and frequency domains, the performance curves of Time-main and Freq-main are nearly identical. Both strategies exhibit a consistent reduction in MSE as SNR increases, achieving approximately 86% improvement over the Conventional Pattern and 52% improvement over the Fixed Pattern at an SNR of 10 dB.

Figure 3b illustrates the MSE performance in an environment with a delay spread of 300 ns and a UE velocity of 30 km/h. In this environment, where frequency selectivity dominates, the Freq-main strategy demonstrates clear superiority. For example, at an SNR of 10 dB, Freq-main achieves an MSE that is approximately 32% lower than that of Time-main, along with an 86% improvement over the Conventional Pattern and 77% improvement over the Fixed Pattern. In addition, the Time-main curve shows a sharp improvement between 5 and 6 dB, which can be attributed to the relaxation of the effective data count constraint from 240 to 120, enabling additional candidate patterns to be included in the search.

Figure 3c presents the MSE performance in an environment with a delay spread of 30 ns and a UE velocity of 120 km/h. In this case, where time variation dominates, Freq-main still shows an overall advantage; however, the performance gap between the two strategies narrows significantly, with Time-main converging to a similar level of accuracy. Overall, these results confirm that the proposed pattern selection strategy effectively exploits the dominant channel variation characteristics, achieving robust performance over a wide range of SNR conditions.

4.2. MSE Performance Versus UE Velocities

Figure 4 presents the MSE performance with respect to different UE velocities when the delay spread is fixed at 100 ns, evaluated at SNR levels of 0 dB, 6 dB, and 10 dB.

Figure 4a illustrates the performance variation in a low-SNR environment at an SNR of 0 dB. As the UE velocity increases from 30 km/h to 150 km/h, the MSE performance of all methods deteriorates. In this case, the performance curves of the proposed methods, Time-main and Freq-main, are nearly identical. This indicates that, in low-SNR environments, noise effects outweigh the impact of channel variations, resulting in negligible differences between the two pattern selection strategies.

Figure 4b shows the performance variation with respect to UE velocity at an SNR of 6 dB. Across the entire range, the Time-main method consistently outperforms the Freq-main method, with the performance gap becoming more pronounced in high-mobility environments. At 30 km/h, both methods show similar MSE values; however, at 150 km/h, Time-main achieves approximately 31% lower MSE than Freq-main. This indicates that the advantage of Time-main becomes more evident as the UE velocity increases. This behavior can be attributed to the relaxation of the effective data count constraint from 240 to 120 at an SNR of 6 dB, which enables finer exploration of time-domain patterns and allows the algorithm to better capture time variations under high-mobility conditions.

Figure 4c presents the MSE performance at an SNR of 10 dB. At the lower velocity of 30 km/h, Freq-main achieves superior performance with approximately 12% lower MSE than Time-main. However, as velocity increases, the MSE of both methods degrades. Despite this degradation, the performance gap between Time-main and Freq-main does not widen significantly, and even at the highest velocity of 150 km/h, the difference remains below 20%.

4.3. MSE Performance Versus Delay Spread

Figure 5 illustrates the impact of increasing delay spread on MSE performance when the UE velocity is fixed at 90 km/h, evaluated at SNR levels of 0 dB, 6 dB, and 10 dB.

Figure 5a shows the MSE performance variation with respect to delay spread in an environment with an SNR of 0 dB and a UE velocity of 90 km/h. For all methods, the MSE increases as the delay spread becomes larger. At low delay spread values, the proposed pattern selection methods outperform both the Conventional and Fixed Patterns. However, as the delay spread increases, the frequency selectivity of the channel becomes more significant, and the performance of the proposed methods gradually approaches that of the Fixed Pattern. This indicates that, in environments with high noise and strong channel variations, the effectiveness of pattern selection becomes limited.

Figure 5b presents the MSE performance for varying delay spreads at an SNR of 6 dB and a UE velocity of 90 km/h. As the delay spread increases, the performance degradation of the Conventional and Fixed Patterns becomes more pronounced. In contrast, the proposed Time-main and Freq-main methods consistently achieve lower MSE values across all delay spread conditions. In particular, for delay spreads between 10 ns and 30 ns, both methods perform similarly. However, at delay spreads of 100 ns and 300 ns, the Time-main method outperforms Freq-main. This behavior can be attributed to the relaxation of the effective data count constraint from 240 to 120 in this range, which increases the number of candidate patterns available for Time-main. As the delay spread further increases to 1000 ns, the performance of both methods becomes similar.

Figure 5c illustrates the MSE performance variation with respect to delay spread in an environment with an SNR of 10 dB and a UE velocity of 90 km/h. While all methods exhibit performance degradation as the delay spread increases, the differences between the patterns become more pronounced under high-SNR conditions. At a delay spread of 10 ns, the Time-main and Freq-main methods achieve nearly identical performance. However, as the delay spread increases, the performance gap between the two methods widens. This result suggests that, under larger delay spreads, frequency selectivity has a stronger impact on channel distortion, thereby amplifying the relative performance advantage of the Freq-main method.

4.4. BER Performance Versus SNR

Figure 6 illustrates the BER performance as a function of SNR under different channel conditions. Figure 6a and Figure 6b, and 6c correspond to the scenarios of (a) DS = 30 ns and V = 30 km/h, (b) DS = 300 ns and V = 30 km/h, and (c) DS = 30 ns and V = 120 km/h, respectively. When evaluated under the same channel conditions as in Section 4.1, the BER results exhibit performance trends consistent with the corresponding MSE results, indicating that improvements in channel estimation accuracy lead to consistent gains in detection performance.

In the relatively stable channel environment shown in Figure 6a, where both time- and frequency-domain variations are limited, the performance difference between the two proposed strategies remains marginal. At SNR = 10 dB, the proposed strategies achieve approximately a 74–76% relative BER reduction compared with the Conventional Pattern and about a 37–41% relative reduction compared with the Fixed Pattern, indicating that the proposed grouping approach provides tangible gains even under mild channel variations.

As the delay spread increases, as shown in Figure 6b, frequency selectivity becomes the dominant impairment, and the advantage of the Freq-main strategy becomes more noticeable. At SNR = 10 dB, Freq-main achieves approximately a 74.5% relative BER reduction compared with the Conventional Pattern and about a 64.4% relative reduction compared with the Fixed Pattern. Furthermore, compared with Time-main, Freq-main achieves approximately a 22% relative BER reduction, suggesting that frequency-domain-oriented grouping may be more effective under highly frequency-selective channels.

In contrast, in the high-mobility scenario depicted in Figure 6c, the BER performances of Time-main and Freq-main are very similar over most of the SNR range. A slight performance difference is observed around SNR = 6 dB, where Time-main achieves approximately 6% lower BER than Freq-main; however, the overall performance gap between the two strategies remains marginal in this scenario.

4.5. Runtime Analysis

To assess the practical feasibility of the proposed method, the average execution time of the K-means-based channel estimation stage was compared for each scheme using the selected resource patterns. The experiments were conducted under three representative channel conditions, including a low-mobility environment with short delay spread, an environment with large delay spread, and a high-mobility environment. The average execution time was analyzed over the entire SNR range from 0 to 10 dB. The comparative results of the average execution time for each method are presented in Table 4.

As shown in Table 4, the Conventional method exhibits the lowest execution time across all channel conditions, since it does not involve resource grouping. The Fixed Pattern method applies a predefined grouping structure, which increases the execution time compared with the Conventional method; however, the runtime variation across the considered channel conditions remains limited. In contrast, the proposed Time-main and Freq-main methods incur additional computational cost due to group-wise processing and repeated K-means operations under the selected grouping patterns, resulting in higher execution times than the Fixed Pattern method. Although the measured execution time can vary depending on the selected grouping configuration, the overall runtime remains stable across the evaluated channel conditions without abrupt increases.

By jointly considering the runtime results with the MSE and BER performance, it is observed that the proposed method provides consistent performance gains across diverse channel environments with a bounded computational overhead. In particular, the Freq-main strategy tends to be more effective in frequency-selective conditions (large delay spread), whereas the Time-main strategy becomes more competitive and achieves comparable performance in high-mobility conditions dominated by time selectivity. These trends suggest that, in practical operational scenarios, an appropriate strategy can be selected according to channel characteristics (e.g., SNR, delay spread, and UE velocity). Moreover, under the considered experimental settings, the average execution time remains within 30 ms for all conditions, supporting the practical applicability of the proposed method in terms of computational cost relative to the achieved performance gains.

5. Conclusions

This paper proposes an efficient grouping pattern selection algorithm that combines SNR-based determination of the effective data count with prioritized time–frequency grouping. While conventional K-means-based channel estimation methods improve resource efficiency by operating without DM-RS, their assumption of channel uniformity over a fixed resource region can lead to degraded estimation accuracy in high-mobility scenarios or environments with large delay spread. To address this limitation, the proposed method applies two complementary strategies, namely Time-main and Freq-main, to better reflect channel characteristics, and excludes performance-irrelevant grouping patterns through the effective data count constraint. Simulation results demonstrate that the proposed method consistently outperforms conventional approaches in terms of both MSE and BER across diverse channel environments. In particular, at SNR = 10 dB, the Time-main strategy achieves approximately an 84% reduction in MSE compared to the Conventional method in a high-mobility scenario with DS = 100 ns and V = 150 km/h. In addition, under a frequency-selective scenario characterized by DS = 1000 ns and V = 90 km/h, the Freq-main strategy provides approximately a 53% reduction in MSE relative to the Conventional method. These results confirm that the proposed method effectively adapts to different channel characteristics. Overall, the proposed time–frequency grouping pattern selection approach significantly reduces the number of evaluated pattern combinations compared to exhaustive search methods while maintaining accurate channel estimation performance. These findings indicate that the proposed method is practically applicable to 5G NR and future wireless communication systems. Future work will focus on extending the proposed framework by considering diverse modulation schemes and expanding its applicability to MIMO environments.

Author Contributions

Conceptualization, Methodology, Validation, Investigation, Data Analysis, Writing—Original Draft, and Visualization, Y.K.; Software, Writing—Review & Editing, and Supervision, J.B.; Supervision, Writing—Review & Editing, and Funding Acquisition, T.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by an Institute of Information Communications Technology Planning Evaluation (IITP) grant, funded by the Korean Government (MSIT) (No. RS-2024-00397789, Development and Standardization of 6G Radio Transmission Standard Technology).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Kim, Y.; Park, J.; Lee, S.; Cho, J.; Kim, H. New radio and its evolution toward 5G-Advanced. IEEE Wirel. Commun. 2019, 26, 2–7. [Google Scholar] [CrossRef]
Park, J.C.; Kang, K.-M.; Choi, J. K-means clustering-aided power control for UAV-enabled OFDM networks. IEEE Access 2024, 12, 15549–15560. [Google Scholar] [CrossRef]
Park, S.; Sung, W. Advanced codebook generation using high-resolution K-means clustering to PMI data. IEEE Wirel. Commun. Lett. 2025, 14, 549–553. [Google Scholar] [CrossRef]
Alouzi, M.; Yanikomeroglu, H.; Kurt, G.K. Adaptive phase shifters for hybrid beamforming in mmWave systems. IEEE Trans. Wirel. Commun. 2025, 24, 1104–1116. [Google Scholar] [CrossRef]
Du, P.; Xiao, T.; Chakraborty, C.; Cao, H.; Alfarraj, O.; Yu, K. Energy-efficient drones and BS management in distributed edge intelligence empowered IoV networks. IEEE Internet Things J. 2025, 12, 4667–4680. [Google Scholar] [CrossRef]
Tran, T.-N.; Nguyen, T.-L.; Hoang, V.T.; Voznak, M. Sensor clustering using a K-means algorithm in combination with optimized unmanned aerial vehicle trajectory in wireless sensor networks. Sensors 2023, 23, 2345. [Google Scholar] [CrossRef] [PubMed]
Khammari, H.; Ahmed, I.; Bhatti, G.; Alajmi, M. Spatio-radio resource management and hybrid beamforming for limited feedback massive MIMO systems. Electronics 2019, 8, 1061. [Google Scholar] [CrossRef]
Trifan, R.-F.; Enescu, A.-A.; Paleologu, C. Hybrid MU-MIMO precoding based on K-means user clustering. Algorithms 2019, 12, 146. [Google Scholar] [CrossRef]
Ito, M.; Fukue, S.; Ando, K.; Kanno, I.; Yamazaki, K.; Ishibashi, K. Clustering and beamforming for user-centric cell-free massive MIMO with backhaul capacity limitation. IEEE Access 2024, 12, 382–395. [Google Scholar] [CrossRef]
Koesnadi, F.X.K.; Chung, S.-H. Improved cell allocation strategies using K-means clustering in congested 6TiSCH environments. Sensors 2024, 24, 5608. [Google Scholar] [CrossRef] [PubMed]
Zeng, B.; Li, S.; Gao, X. Threshold-driven K-means sector clustering algorithm for wireless sensor networks. EURASIP J. Wirel. Commun. Netw. 2024, 2024, 68. [Google Scholar] [CrossRef]
Abou Yassin, M.R.; Abou Chahine, S.; Issa, H. AI-driven pilot overhead reduction in 5G mmWave massive MIMO systems. Appl. Syst. Innov. 2025, 8, 24. [Google Scholar] [CrossRef]
Terré, M.; Féty, L.; Horsin, T. Blind channel response estimation by constrained clustering. In Proceedings of the 2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia, 22–24 September 2022; pp. 1–6. [Google Scholar] [CrossRef]
Tarafder, P.; Chun, C.; Ullah, A.; Kim, Y.; Choi, W. Channel estimation in 5G-and-beyond wireless communication: A comprehensive survey. Electronics 2025, 14, 750. [Google Scholar] [CrossRef]
Peng, Z.; Pan, C.; Zhou, G.; Ren, H.; Jin, S.; Popovski, P. Two-stage channel estimation for RIS-aided multiuser mmWave systems with reduced error propagation and pilot overhead. IEEE Trans. Signal Process. 2023, 71, 3607–3622. [Google Scholar] [CrossRef]
Lee, S.; Sim, D. Deep learning-based channel estimation method for MIMO systems in spatially correlated channels. IEEE Access 2024, 12, 79082–79090. [Google Scholar] [CrossRef]
Sabapathy, S.; Prabhu, J.S.; Maruthu, S.; Jayakody, D.N.K. Profuse channel estimation and signal detection techniques for orthogonal time frequency space in 6G epoch: A survey. IEEE Access 2023, 11, 129963–129993. [Google Scholar] [CrossRef]
Jung, K.; Wang, H. Pilotless channel estimation scheme using clustering-based unsupervised learning. In Proceedings of the 15th International Symposium on Wireless Communication Systems (ISWCS), Lisbon, Portugal, 28–31 August 2018; pp. 1–5. [Google Scholar] [CrossRef]
Bernadó, L.; Zemen, T.; Tufvesson, F.; Molisch, A.F.; Mecklenbräuker, C.F. Time- and frequency-varying K-factor of non-stationary vehicular channels for safety-relevant scenarios. IEEE Trans. Intell. Transp. Syst. 2015, 16, 1007–1017. [Google Scholar] [CrossRef]
Zhu, X.; Wang, J.; Dai, L.; Wang, Z. Sparsity-aware adaptive channel estimation based on SNR detection. IEEE Trans. Broadcast. 2015, 61, 119–126. [Google Scholar] [CrossRef]
3GPP. 5G.; NR.; Physical Channels and Modulation. Technical Specification (TS) 38.211, 3rd Generation Partnership Project (3GPP), Version 18.7.0. 2025. Available online: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3213 (accessed on 20 October 2025).
3GPP. 5G.; NR.; Multiplexing and Channel Coding. Technical Specification (TS) 38.212, 3rd Generation Partnership Project (3GPP), Version 18.7.0. 2025. Available online: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3214 (accessed on 20 October 2025).
Arthur, D.; Vassilvitskii, S. k-means++: The advantages of careful seeding. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (SODA), New Orleans, LA, USA, 7–9 January 2007; pp. 1027–1035. [Google Scholar]
Jain, A.K.; Murty, M.N.; Flynn, P.J. Data clustering: A review. ACM Comput. Surv. 1999, 31, 264–323. [Google Scholar] [CrossRef]
Zhu, A.; Hua, Z.; Shi, Y.; Tang, Y.; Miao, L. An improved K-means algorithm based on evidence distance. Entropy 2021, 23, 1550. [Google Scholar] [CrossRef] [PubMed]
Liao, J.; Qing, X.; Zhong, X.; Yang, Y. K-means algorithm for optimizing initial clustering center based on density peak. In Proceedings of the 2024 International Conference on Information Technology, Communication Ecosystem and Management (ITCEM), Bangkok, Thailand, 20–22 December 2024; pp. 91–95. [Google Scholar] [CrossRef]

Figure 1. Resource grouping patterns: (a)

T = 7, F = 60

, (b)

T = 4, F = 60

.

Figure 1. Resource grouping patterns: (a)

T = 7, F = 60

, (b)

T = 4, F = 60

.

Figure 2. Illustration of Effective Data Count.

Figure 3. MSE performance versus SNR under different channel conditions: (a) DS = 30 ns and V = 30 km/h, (b) DS = 300 ns and V = 30 km/h, and (c) DS = 30 ns and V = 120 km/h.

Figure 4. MSE performance versus UE velocity with DS = 100 ns: (a) SNR = 0 dB, (b) SNR = 6 dB, and (c) SNR = 10 dB.

Figure 5. MSE performance versus delay spread with V = 90 km/h: (a) SNR = 0 dB, (b) SNR = 6 dB, and (c) SNR = 10 dB.

Figure 6. BER performance versus SNR under different channel conditions: (a) DS = 30 ns and V = 30 km/h, (b) DS = 300 ns and V = 30 km/h, and (c) DS = 30 ns and V = 120 km/h.

Table 1. Candidate time-domain patterns.

$T$	$P_{t} (T)$	$T$	$P_{t} (T)$	$T$	$P_{t} (T)$
1	[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]	5	[5, 5, 4]	10	[10, 4]
2	[2, 2, 2, 2, 2, 2, 2]	6	[6, 6, 2]	11	[11, 3]
3	[3, 3, 3, 3, 2]	7	[7, 7]	12	[12, 2]
3-1	[3, 3, 3, 5]	8	[8, 6]	13	[13, 1]
4	[4, 4, 4, 2]	9	[9, 5]	14	[14]

Table 2. Simulation parameters.

Category	Parameters	Value
General Settings	Number of OFDM Symbols	14
	Number of RBs	10
	Number of REs	1680
	Sampling Rate	3.84 MHz
	Carrier Frequency	6 GHz
	SNR	0:1:10 [dB]
	Noise	AWGN
	Iteration	100,000
Effective Data Count Analysis	Delay Spread	30 ns
	Velocity	10 km/h
	Data Count Configuration	Freq. = 14:14:1680, Time = 120:120:1680
	Continuous Data Segment Size	Freq. = 3, Time = 2
	Threshold	Freq. = 0.0013, Time = 0.005
K-means Settings	Number of clusters	4
	Initialization method	K-means++
	Max iterations	100
	Replicates	3

Table 3. Effective data counts according to SNR.

SNR	0	1		2		3		4		5
Frequency	308	266		238		196		182		154
Time	360			240
SNR	6		7		8		9		10
Frequency	126		112		98		84		70
Time	120

Table 4. Average elapsed time (s).

Method	DS = 30 ns, V = 30 km/h	DS = 300 ns, V = 30 km/h	DS = 30 ns, V = 120 km/h
Conventional	0.0069	0.0083	0.0077
Fixed Pattern	0.0139	0.0147	0.0142
Proposed (Time-main)	0.0188	0.0227	0.0240
Proposed (Freq-main)	0.0229	0.0278	0.0228

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, Y.; Bang, J.; Kim, T. Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel. Mathematics 2026, 14, 400. https://doi.org/10.3390/math14030400

AMA Style

Kim Y, Bang J, Kim T. Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel. Mathematics. 2026; 14(3):400. https://doi.org/10.3390/math14030400

Chicago/Turabian Style

Kim, Yumin, Jonghyun Bang, and Taehyoung Kim. 2026. "Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel" Mathematics 14, no. 3: 400. https://doi.org/10.3390/math14030400

APA Style

Kim, Y., Bang, J., & Kim, T. (2026). Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel. Mathematics, 14(3), 400. https://doi.org/10.3390/math14030400

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Blind Channel Estimation Based on K-Means Clustering with Resource Grouping in Fading Channel

Abstract

1. Introduction

2. System Model

3. Proposed Method

3.1. Resource Grouping Pattern Definition

3.2. SNR-Based Determination of Effective Data Count

3.3. Time–Freq Pattern Selection Algorithm

3.4. Computational Complexity Analysis

4. Simulation Results

4.1. MSE Performance Versus SNR

4.2. MSE Performance Versus UE Velocities

4.3. MSE Performance Versus Delay Spread

4.4. BER Performance Versus SNR

4.5. Runtime Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI