An Improved Blind Recognition Algorithm of Frame Parameters Based on Self-Correlation

: The blind recognition of the frame parameter plays a crucial role in frame synchronization in the background of a non-cooperation communication system. This paper proposes an algorithm based on self-correlation on the foundation of existed cumulative ﬁltering algorithm. To overcome high BER, the peak-to-average ratio (PAR) is calculated to improve the algorithm. The simulation results proved that the performance of the algorithm has been improved.


Introduction
In digital communication systems, frame synchronization is a prerequisite for subsequent signal processing such as error correction and information extraction [1][2][3][4].Frame synchronization words can mark the place for mapping receiving binary stream and eliminating redundant information.However, the manner of frame synchronization is different under different communication conditions.This paper focuses on the frame synchronization methods in the background of the non-cooperative communication.The frame parameters of the received binary signals must be blindly recognized in the context of non-cooperative communication.The cumulative filtering algorithm is the most commonly used blind identification method, the core of which is to use the randomness and synchronization features of asynchronous codes to identify the fame parameters by traversing frame lengths [5].However, there are some disadvantages of this method, including weak resistance against bit errors, low correctness rate, too many memory matrix lines and also more consumption of the calculation resources.Based on the existed cumulative filtering algorithm, this paper proposes an improved frame parameter blind identification method based on self-correlation, and its performance is greatly improved.
The rest of the paper is arranged as follows: in Section 2, the cumulative filtering algorithm is briefly described.In Section 3, algorithm based on self-correlation is demonstrated detailly.In Section 4, the improved algorithm based on self-correlation is explored.Section 5 demonstrates the simulation results and Section 6 concludes the whole paper.

The Cumulative Filtering Algorithm
The signals are usually transmitted in the form of frames in the channel, through the location of which, it can achieve the synchronization between the receiver and the transmitter.The typical frame format can be seen in Figure 1. S represents the frame synchronization word, the length of which is M. D represents the data payload, the length of which is N.The length of a whole frame L = M + N. The transmitter packages the frame synchronization words and data payloads into the channel.At the receiver, the frame synchronization word S is repeated in cycles, while the data payload is random.
The start of each frame can be found by detecting the location of the frame synchronization word S, and then subsequent operations can be performed.random.The start of each frame can be found by detecting the location of the frame synchronization word S, and then subsequent operations can be performed.
In the background of non-cooperating communication, three key parameters: S, M, N are unknown parameters, which have to be recognized blindly [6].Blind recognition of frame parameters can be achieved by using the periodicity of the frame synchronization sequence.Y is the received bit stream while X is a receiving matrix in the size of u × l, which are as shown in the Equation (1).
When the number of matrix columns l is equal to the frame length, all elements of certain columns are either 0 or 1, while elements of other columns are randomly occurring as 0 or 1.The former ones are defined as synchronization columns, while others are defined as nonsynchronization ones.The sum of each column is calculated, based on the results of which it can be recognized as a synchronization column when the result of a column is closed to 0 or u.Therefore, reasonable thresholds have to be elaborately designed [7,8].
δ up and δ down are the upper and lower threshold respectively.Equations ( 2) and ( 3) can be acquired according to reference [9] and 3δ-criterion [10].

Frame Parameters Recognition Algorithm Based on Self-Correlation
The frame synchronization code sequence has a local self-correlation function with sharp unimodal characteristics.Therefore, the frame synchronization parameters can be recognized by analyzing the self-correlation property.First, the mathematical model is analyzed and derived.
The signal transmitted s(t) can be calculated through the following equation: In Equation ( 5), g T represents signal generation filter while T represents symbol interval.The received signal r(t) can be calculated as: In Equation ( 6), a represents channel attenuation factor, θ represents the phase shift during transmission, τ represents the channel group delay, w(t) represents complex Gaussian white noise.In the background of non-cooperating communication, three key parameters: S, M, N are unknown parameters, which have to be recognized blindly [6].
Blind recognition of frame parameters can be achieved by using the periodicity of the frame synchronization sequence.Y is the received bit stream while X is a receiving matrix in the size of u × l, which are as shown in the Equation (1).
When the number of matrix columns l is equal to the frame length, all elements of certain columns are either 0 or 1, while elements of other columns are randomly occurring as 0 or 1.The former ones are defined as synchronization columns, while others are defined as non-synchronization ones.The sum of each column is calculated, based on the results of which it can be recognized as a synchronization column when the result of a column is closed to 0 or u.Therefore, reasonable thresholds have to be elaborately designed [7,8].δ up and δ down are the upper and lower threshold respectively.Equations ( 2) and ( 3) can be acquired according to reference [9] and 3δ-criterion [10].

Frame Parameters Recognition Algorithm Based on Self-Correlation
The frame synchronization code sequence has a local self-correlation function with sharp unimodal characteristics.Therefore, the frame synchronization parameters can be recognized by analyzing the self-correlation property.First, the mathematical model is analyzed and derived.
The signal transmitted s(t) can be calculated through the following equation: In Equation ( 5), g T represents signal generation filter while T represents symbol interval.The received signal r(t) can be calculated as: In Equation ( 6), a represents channel attenuation factor, θ represents the phase shift during transmission, τ represents the channel group delay, w(t) represents complex Gaussian white noise.
At the receiving end, the signal is determined by the ADC sampling to become a discrete time signal.The sampling interval is T s , and then T = T s N s (N s is an integer).The sampling signal at the receiving end can be expressed as: Under cooperative communication conditions, maximum likelihood estimation is a common algorithm for the implementation of frame synchronization.The method uses a sliding time window to sample the signal and calculate a likelihood function between the sampled data and a known sequence of frame synchronization codes.The frame synchronization parameter is identified by searching for the maximum likelihood function value.
When the sliding window obtains the optimal delay estimate, its corresponding likelihood function takes the maximum value, which can be expressed as: Likelihood function can be expressed as: In Equation ( 9), T 0 is the observation interval of the sliding time window.Equation ( 9) can be simplified as Equation (10) according to the reference [11].
Re{(a exp(jθ)) * y i After obtaining the logarithm of Equation (10), it can generate: In Equations ( 10) and ( 11), y i τ 0 represents discrete data sampled by the ADC.Using the known frame synchronization word sequence to perform cross-correlation calculation with the sampled data in the sliding window, Equation ( 11) can be equivalently simplified to: In Equation (12), M represents the length of synchronization code sequence, p i represents the ith data in the sync code sequence while τ m indicates the starting moment of the sliding window.
In the context of non-cooperative communication, since the bits of synchronization word sequence are unknown, the cross-correlation calculation in Equation ( 12) cannot be directly used to recognize the frame synchronization word sequence.Therefore, the cross-correlation calculation in Equation ( 12) should be changed to the self-correlation calculation, which is shown in Equation (13).
In Equation ( 13), y i τ m and y i τ n are the sampling data of different times in the sliding windows while the interval of the two sliding time windows are integer multiples of M.
In order to realize the synchronization word recognition in the non-cooperative communication background, the sampling data storage matrix Y is defined, and the multiplication results of the sampling data at corresponding positions of different moments in the sliding window are sequentially put into the storage matrix, as shown in Equation ( 14).
In Equation ( 14), n is the number of sampled signals in the sliding window When n equals to the frame length or an integer multiple of the frame length, assuming that the received bit stream error rate is 0, the columns corresponding to the frame synchronization word should be all 0 or all 1.The data in other columns should be distributed randomly.When n is not the frame length or an integer multiple of the frame length, elements of each column in the storage matrix should be randomly distributed.Meanwhile, there is almost no case of all "0"s or all "1"s.
Considering the case of a data payload sequence, the probability that each element in the storage vector is 0 and 1 is: According to the knowledge of mathematical statistics [12], y i obeys the binomial distribution, and its mathematical expectation and variance are: In the synchronization word sequence and the data payload sequence, the probability distributions of 0 or 1 are different.Therefore, in the case where the starting time and the length of sliding time window are constant, the presence of synchronization word sequence can be detected by calculating the distributions of 0 and 1 corresponding to each position in the sliding time window.After multiple iterations of the search, the final conclusion is reached.The sampling matrix Y * is defined as: In Equation (17), N is the number of matrix rows.By summing the data in each column of Y * : In Equation (18), y * i obeys the binomial distribution with the parameter (N, p), where p = 0.5.Supposing that the random variable X n obeys the binomial distribution with the parameter (n, p) (0 < p < 1, n ≥ 1), when n is large enough, the binomial distribution has a normal distribution as its law of limit distribution according to the DeMoivre-Laplace central limit theorem.Thus, when the column y * i corresponds to the data payload sequence, taking a sufficient number of sliding window sample data, the probability distribution of y * i can be equivalent to that generated by obeying N(N p, N p(1 − p)).When the column y * i corresponds synchronization word sequence, the probability distribution of y * i does not conform to the normal distribution characteristics, and y * i Information 2019, 10, 64 5 of 9 should be closed to N or 0 instead.The up and down decision threshold are δ up and δ down , Equation ( 19) can be achieved according to the 3δ-criterion: If y * i ∈ (u − 3δ, u + 3δ), the column represented by y * i is determined to correspond to the data payload sequence, otherwise it is determined to correspond to the synchronization word sequence.

Improved Frame Parameters Recognizing Algorithm Based on Self-Correlation
The algorithm described in the previous section uses the 3δ criterion to determine the threshold during the decision phase.Actually, if the number of rows of the sampling matrix Y * is too low or the bit error rate (BER) is high, this decision method might lead to low accuracy.In this section, the peak-to-average ratio (PAR) and 3δ-criterion are combined to search for the length of the frame.Afterwards, the frame length value obtained by the search is substituted into the self-correlation algorithm described in the previous section to obtain the synchronization word sequence.The definition of the PAR is as shown in Equation (20).Suppose array X contains n elements, then: where R is PAR, X i (0 ≤ i ≤ n) is any element in array X. X max is the maximum value in the array.While searching for the frame length value, it needs to first apply the upper limit value δ up = u + 3σ for coarse filtering and record the estimated frame length values and corresponding Y * sum arrays.Afterwards, for the filtered Y * sum arrays, it needs to calculate the peak-to-average ratio R of each array and determine the frame length corresponding to the array Y * sum with the largest R value as a reasonable frame length estimate.It can be shown from the simulation results that the correctness of frame length recognition and error-resistance performance can be greatly improved by applying this kind of judgment method.
After obtaining the credible frame length estimation value, it is substituted into the algorithm in the previous section.When determining the 0 and 1, the up and down threshold are in the form of u ± ασ, where α is a constant slightly less than 3.Because the peak value is not as high as the ideal case when the bit error rate is high, if α takes 3, it will cause the problem of missing the "1".However, the impact on the "0" decision is not obvious.The recommended value of α is 2.7.
As is shown in Figure 2, the α of up and down thresholds are all 3. Figure 2a shows the result of Y * sum when the frame length is correctly detected at bit error rate (BER) = 0.08. Figure 2b is the partial enlargement of Figure 2a at the frame header.There are two "1"s below the up threshold, which leads to the missing judgement.In Figure 3, α is 2.7, no missing judgement occurs when the receiving bit stream is unchanged, indicating that the fault tolerance is enhanced.It needs to be noticed that for Figures 2-4, all the coordinates on y-axis have been normalized while x-axis represents bit numbers.3a at the frame header.There are no "1"s below the up threshold, indicating that all "1"s are determined correctly.

Simulation
In the algorithm simulation experiments, the frame length is 1024 bits while the frame synchronization word sequence is 0xf628.The length of sliding window and frame initial values are 0 and 500 respectively.When the BER is 0, the result of the frame parameters recognition algorithm based on the self-correlation is shown in Figure 4.When the frame estimation is right, apparent selfcorrelation peaks appear at the head of the frame (Figure 4a).When the frame estimation has a fault, the sum of each column of the storage matrix Y sum * is distributed randomly and there is no apparent peak (Figure 4b).
The two straight lines paralleling to the X-axis are the upper and lower threshold.From Figure 4a, the frame synchronization word can be recognized, which is 0xf628.3a at the frame header.There are no "1"s below the up threshold, indicating that all "1"s are determined correctly.

Simulation
In the algorithm simulation experiments, the frame length is 1024 bits while the frame synchronization word sequence is 0xf628.The length of sliding window and frame initial values are 0 and 500 respectively.When the BER is 0, the result of the frame parameters recognition algorithm based on the self-correlation is shown in Figure 4.When the frame estimation is right, apparent selfcorrelation peaks appear at the head of the frame (Figure 4a).When the frame estimation has a fault, the sum of each column of the storage matrix Y sum * is distributed randomly and there is no apparent peak (Figure 4b).
The two straight lines paralleling to the X-axis are the upper and lower threshold.From Figure 4a, the frame synchronization word can be recognized, which is 0xf628.3a at the frame header.There are no "1"s below the up threshold, indicating that all "1"s are determined correctly.0 and 500 respectively.When the BER is 0, the result of the frame parameters recognition algorithm based on the self-correlation is shown in Figure 4.When the frame estimation is right, apparent selfcorrelation peaks appear at the head of the frame (Figure 4a).When the frame estimation has a fault, the sum of each column of the storage matrix Y sum * is distributed randomly and there is no apparent peak (Figure 4b).
The two straight lines paralleling to the X-axis are the upper and lower threshold.From Figure 4a, the frame synchronization word can be recognized, which is 0xf628.Assume that the bit stream is transmitted by QPSK system with BER [13,14].The recognition accuracy rate is defined as the ratio of the times of recognizing correctly to the total times.100 simulations are performed at each BER level.After 1000 algorithm simulations, the recognition accuracy rates of algorithms in Sections 3 and 4 are demonstrated in Figure 5.As the result shows, when the BER is over 0.03, the recognition accuracy rate of the unimproved algorithm is lower than 0.8, while that of the improved algorithm is 1.When the BER increases to 0.1,

Simulation
In the algorithm simulation experiments, the frame length is 1024 bits while the frame synchronization word sequence is 0xf628.The length of sliding window and frame initial values are 0 and 500 respectively.When the BER is 0, the result of the frame parameters recognition algorithm based on the self-correlation is shown in Figure 4.When the frame estimation is right, apparent self-correlation peaks appear at the head of the frame (Figure 4a).When the frame estimation has a fault, the sum of each column of the storage matrix Y * sum is distributed randomly and there is no apparent peak (Figure 4b).
The two straight lines paralleling to the X-axis are the upper and lower threshold.From Figure 4a, the frame synchronization word can be recognized, which is 0xf628.
Assume that the bit stream is transmitted by QPSK system with BER [13].The recognition accuracy rate is defined as the ratio of the times of recognizing correctly to the total times.100 simulations are performed at each BER level.After 1000 algorithm simulations, the recognition accuracy rates of algorithms in Sections 3 and 4 are demonstrated in Figure 5. Assume that the bit stream is transmitted by QPSK system with BER [13,14].The recognition accuracy rate is defined as the ratio of the times of recognizing correctly to the total times.100 simulations are performed at each BER level.After 1000 algorithm simulations, the recognition accuracy rates of algorithms in Sections 3 and 4 are demonstrated in Figure 5.As the result shows, when the BER is over 0.03, the recognition accuracy rate of the unimproved algorithm is lower than 0.8, while that of the improved algorithm is 1.When the BER increases to 0.1, the unimproved algorithm can hardly recognize the frame parameters.However, the improved algorithm in Section 4 still works at the recognition accuracy rate of 0.43.

Conclusions
Based on the cumulative filtering algorithm, this paper proposes a frame parameters recognition algorithm based on self-correlation.In order to optimize the performance in the case of high BER, an improved frame parameters recognition algorithm based on self-correlation is proposed.The peakto-average ratio (PAR) is introduced to estimate the reasonable frame length.Simulation results show that the performance of the algorithm is significantly improved, and this algorithm is of vital significance in practical engineering due to its reliability at high BER.As the result shows, when the BER is over 0.03, the recognition accuracy rate of the unimproved algorithm is lower than 0.8, while that of the improved algorithm is 1.When the BER increases to 0.1, the unimproved algorithm can hardly recognize the frame parameters.However, the improved algorithm in Section 4 still works at the recognition accuracy rate of 0.43.

Conclusions
Based on the cumulative filtering algorithm, this paper proposes a frame parameters recognition algorithm based on self-correlation.In order to optimize the performance in the case of high BER, an improved frame parameters recognition algorithm based on self-correlation is proposed.The peak-to-average ratio (PAR) is introduced to estimate the reasonable frame length.Simulation results show that the performance of the algorithm is significantly improved, and this algorithm is of vital significance in practical engineering due to its reliability at high BER.

Figure 2 .Figure 3 .
Figure 2. (a) This is the figure of Y sum * with BER = 0.08, α = 3.(b) This is the partial enlargement of Figure2aat the frame header.There are two "1"s below the up threshold, causing the missing judgement.

Figure 2 .Figure 2 .Figure 3 .
Figure 2. (a) This is the figure of Y * sum with BER = 0.08, α = 3.(b) This is the partial enlargement of Figure2aat the frame header.There are two "1"s below the up threshold, causing the missing judgement.

Figure 3 .
Figure 3. (a) This is the figure of Y * sum with BER = 0.08, α = 2.7.(b) This is the partial enlargement of Figure3aat the frame header.There are no "1"s below the up threshold, indicating that all "1"s are determined correctly.

Figure 4 .
Figure 4.This is the result of Y sum * when BER is 0. (a) The peak appears when the frame estimation is right.(b) No peak appears when the frame estimation is fault.

Figure 5 .
Figure 5. Comparation of the improved and unimproved algorithm.Obviously, the improved algorithm performs better under high BER circumstances.

Figure 4 .
Figure 4.This is the result of Y * sum when BER is 0. (a) The peak appears when the frame estimation is right.(b) No peak appears when the frame estimation is fault.

Figure 4 .
Figure 4.This is the result of Y sum * when BER is 0. (a) The peak appears when the frame estimation is right.(b) No peak appears when the frame estimation is fault.

Figure 5 .
Figure 5. Comparation of the improved and unimproved algorithm.Obviously, the improved algorithm performs better under high BER circumstances.

Figure 5 .
Figure 5. Comparation of the improved and unimproved algorithm.Obviously, the improved algorithm performs better under high BER circumstances.