Next Article in Journal
Intuitionistic Fuzzy TOPSIS as a Method for Assessing Socioeconomic Phenomena on the Basis of Survey Data
Previous Article in Journal
Selecting an Effective Entropy Estimator for Short Sequences of Bits and Bytes with Maximum Entropy
Previous Article in Special Issue
Transmitter Diversity Gain Technique Aided Irregular Channel Coding for Mobile Video Transmission
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Performance of Video Resolution, Motion and Dynamism in Transmission Using Near-Capacity Transceiver for Wireless Communication

1
Department of Computer Systems Engineering, University of Engineering and Technology Peshawar, Peshawar 25000, Pakistan
2
Department of Quantitative Methods and Economic Informatics, Faculty of Operation and Economics of Transport and Communications, University of Zilina, 010 26 Zilina, Slovakia
3
Department of Telecommunications, Faculty of Electrical Engineering and Computer Science, VSB—Technical University of Ostrava, 17. Listopadu 15, 708 33 Ostrava-Poruba, Czech Republic
*
Author to whom correspondence should be addressed.
All authors contributed equally to this work.
Entropy 2021, 23(5), 562; https://doi.org/10.3390/e23050562
Submission received: 21 April 2021 / Accepted: 27 April 2021 / Published: 1 May 2021
(This article belongs to the Special Issue Information Theory in Digital Signal Processing)

Abstract

:
This article investigates the performance of various sophisticated channel coding and transmission schemes for achieving reliable transmission of a highly compressed video stream. Novel error protection schemes including Non-Convergent Coding (NCC) scheme, Non-Convergent Coding assisted with Differential Space Time Spreading (DSTS) and Sphere Packing (SP) modulation (NCDSTS-SP) scheme and Convergent Coding assisted with DSTS and SP modulation (CDSTS-SP) are analyzed using Bit Error Ratio (BER) and Peak Signal to Noise Ratio (PSNR) performance metrics. Furthermore, error reduction is achieved using sophisticated transceiver comprising SP modulation technique assisted by Differential Space Time Spreading. The performance of the iterative Soft Bit Source Decoding (SBSD) in combination with channel codes is analyzed using various error protection setups by allocating consistent overall bit-rate budget. Additionally, the iterative behavior of SBSD assisted RSC decoder is analyzed with the aid of Extrinsic Information Transfer (EXIT) Chart in order to analyze the achievable turbo cliff of the iterative decoding process. The subjective and objective video quality performance of the proposed error protection schemes is analyzed while employing H.264 advanced video coding and H.265 high efficient video coding standards, while utilizing diverse video sequences having different resolution, motion and dynamism. It was observed that in the presence of noisy channel the low resolution videos outperforms its high resolution counterparts. Furthermore, it was observed that the performance of video sequence with low motion contents and dynamism outperforms relative to video sequence with high motion contents and dynamism. More specifically, it is observed that while utilizing H.265 video coding standard, the Non-Convergent Coding assisted with DSTS and SP modulation scheme with enhanced transmission mechanism results in E b / N 0 gain of 20 dB with reference to the Non-Convergent Coding and transmission mechanism at the objective PSNR value of 42 dB. It is important to mention that both the schemes have employed identical code rate. Furthermore, the Convergent Coding assisted with DSTS and SP modulation mechanism achieved superior performance with reference to the equivalent rate Non-Convergent Coding assisted with DSTS and SP modulation counterpart mechanism, with a performance gain of 16 dB at the objective PSNR grade of 42 dB. Moreover, it is observed that the maximum achievable PSNR gain through H.265 video coding standard is 45 dB, with a PSNR gain of 3 dB with reference to the identical code rate H.264 coding scheme.

1. Introduction

Video on-demand transmission is the major factor of the estimated increase in network traffic over the cellular network. By the end of 2023, the emerging increase in mobile applications downloading will reach 300 million and the 5G speeds will reach about 575 Mbps, which is 13 times higher than the average mobile connections. There will be a significant need in increase of bandwidth with the highest connectivity requirements of the future networks. A crucial factor leading the increase in mobile speed is the increasing proportion of 4G mobile connection and the rising interest in 5G connections [1,2,3]. The applications of Internet data traffic on 3G and 4G over wireless communication networks increases exponentially, which increases the needs of highly efficient networks providing superior connectivity, reliable and low latency communication [4].
For a raw video sequence with one pixel representation of a gray scale and color video, 8 and 24 bits are used correspondingly. A total of 944 Mbps is required to display one frame of a simple colored TV video with a resolution of 1024 × 768 pixels with a frame rate of 25 fps [5]. Bandwidth requirement is reduced by encoding the video content through source video coding standards such as H.264/AVC or H.265/HEVC for transmission over the communication network. However, the High Definition (HD) videos require higher bandwidth for transmission over communication channels, even with the employment of the video compression mechanism.
For the very first time, Shannon proposed the basics of channel capacity bounds over a noisy channel providing minimum probability of errors by employing complex encoding systems and identified the error-resilient features of the communication systems [6,7]. Shannon’s coding theorem presented signals and messages as a point in space and developed a method that any communication system can be represented geometrically. After that researcher presented different techniques for achieving that limit. The first forward error correction block code for single error correction was the Hamming code [8]. Elias was the first who discovered error-correcting convolutional codes [9,10], in which the encoding dependencies are shifted from finite-length segments to encoding dependencies that exist over the entire block. The authors in [11] used convolutional codes for the burst of error correction in information bit-stream. In [12], sequential algorithms were proposed for decoding the convolutional codes. A heuristic discussion was presented in greater detail about probabilistic decoding [13]. Furthermore, the authors presented in depth analysis about A-priori, A-posteriori knowledge, mutual information, channel degradation, channel quantization and channel capacity i.e., different terminologies linked with designing an encoding and decoding schemes. The Viterbi Algorithm (VA) proposed in [14] is a significant achievement ever in the history of convolutional correction codes. VA primarily works on finding the closest sequence of the transmitted information bits (Maximum A-posteriori Probability (MAP) sequence estimation), which results in minimum Bit Error Ratio (BER) [15,16]. VA can also be used as a maximum likelihood sequence detector for AWGN channel, which finds applications in Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), and various other fields [16,17]. An efficient bidirectional search algorithm for computing the free distance of a convolutional codes is described in [18]. VA works on codeword error minimization for convolutional codes. Therefore, authors proposed the symbol based MAP algorithm in [19], minimizing error rate in symbols or bits through optimal decoding. However, compared to VA, the symbol based MAP decoding is only attractive for short block codes with short constraint lengths due to offering higher complexity algorithm and large storage for larger constraint lengths. Turbo codes are first presented in [20,21], which comprises the concatenation of two Recursive Systematic Convoluational (RSC) codes interfaced through an interleaver. Turbo codes are also used in third-generation (3G) mobile radio systems [22]. In turbo codes, an iterative algorithm is used at the decoder side to extract the transmitted information bits. The authors proposed Soft Output Viterbi Algorithm (SOVA) in [23]. They modified the VA in such a way that it gives the most probable transmitted sequence in a Markov finite-state chain along with the reliability or posteriori information. Koch et.al in [12] proposed the Max Log MAP algorithm for turbo decoder with lower complexity than SOVA.
Authors in [24,25,26,27,28] evaluated the performance of the Joint Source Channel Codes (JSCC) and it was concluded that it could be jointly optimized as a one pair for achieving a lower BER. Authors in [24] proposed an extended curve-fitting algorithm which finds optimal design criteria for JSCC. In [29], the authors used and extended the application of Fully Parallel Turbo Decoder (FPTD), for the implementation of a unary error correction on hardware considering the video transmission of information bits on JSCC. An analog low complexity JSCC system is designed [30] for the transmission of still images. Deep JSCC is proposed [31], which does not explicitly relay on source and channel encoder and instead trained an auto-encoder composed of two convolutional neural network and can be used in a non-trainable layer in the middle of a communication channel. In [32], the authors proposed a lower complexity Robust Distributed Video Coding (RDVC) framework to optimize the quality of video communication for wireless multimedia sensors networks. A new coding scheme is presented based on Wyner-Ziv coding for error resilience and Rate Distortion (RD) performance. In [33], the authors proposed systematic code Low Density Generator Matrix (LDGM), which under maximum likelihood decoding, can achieve the capacity of memoryless binary input/output channel. The extent of performance achievement in iterative decoding is determined from the number of profitable iterations by EXIT chart analysis in [34]. In [35], the authors showed the EXIT chart as a versatile tool for designing of different serial concatenation codes. The authors [36] proposed multi-edge type EXIT chart bit mapping for low density parity check Bit-interleaved Coded Modulation (BiCM).
A delightful source coding candidate for wireless video communication is the compression efficient standard H.264 [37,38,39,40]. In [41], the authors proposed a mobile model called Proposed Generation (Pro-G) for supporting large number of user applications, making usage of wider bandwidth, adoptive modulation and coding for cellular systems. Furthermore, transcoding technique (H.265 pro, incorporating two Highest Efficiency Video Coding (HEVC) structure) is used for adoptive video streaming, which results in providing multiple data rates of a video streams. Authors in [42] studied the compressed video transmission using JSCC based on the stream contents characteristics of HEVC. Clustering algorithm Fuzzy C-Means (FCM) from the field of artificial intelligence is used for reducing the noise effects and for classifications of wireless channel. A new Sample Adaptive Offset (SAO) in loop filter make feasible the higher compression efficiency of H.265/HEVC for both objective and subjective measures. Estimating the best SAO parameter per coding tree unit for low powered or real time encoders results in a number of issues such a high computational complexity or architectural inefficiency. The authors proposed in [43] various SAO policies reducing complexity and removing the inefficiency caused by its implementation. A broadcaster group from Japan plan to stream a television services based on the latest new standard HEVC having twice the compression capability compared to H.264. A studied has been carried out for estimating the required bit rates. Different multiformat videos were evaluated in [44] and suggested that a 10–15 Mbits/s is required for 1080/60/I and 1080/60/P, 30–40 Mbit/s is required for 2160/60/P, while 80–100 Mbit/s is needed for 4320/60/P format. The study deduce a conclusion that such videos can also be transmitted via the existing satellite channel bandwidth. The authors in [45] proposed a method for identifying the moving objects that passes through the specific region without fully decoding the bitstreams. Foreground prediction block is extracted from the video bit-stream according to the motion vector of H.265 and clustered into region of interests. The state of moving object can be find by match the moving objects and the region of interest in the current frame. HEVC chips are incorporated into the application processor system-on-a-chip within the mobile devices due to its wide usage in video transmission. However, the coding bandwidth accessed for the motion estimation (ME) operation within HEVC has changed over time due to the adaption of the intelligent power management with in the process system-on-a-chip [46]. The needs of on-demand coding bandwidth for HEVC must be considered in designing a low power system. The authors proposed an intelligent ME controller algorithm model, which is coding bandwidth efficient and referred to a Dynamic Voltage Frequency Scale aware (DVFS-aware). DVFS-aware can be integrated in ME for coding bandwidth realization, coding bit-rate, and coding-quality-optimized HEVC ME design. The authors in [47] proposed an algorithm for improving the bitrate (<; 1.5 %), while a slight decrease in PSNR (<; 0.15 decibel) has been reported.
Taking in mind the above background, we propose an arrangement for the H.264 and HEVC compressed video bitstream transmission through Soft Bit Source Decoding (SBSD) scheme. The propose system uses H.264 and H.265 [48] as a source encoder for simulation purpose. Artificial redundancy is generated in the transmitter side using two different combination of Over Complete Mapping (OSM) and RSC encoder while keeping the overall bitrate constant. The redundancy is iteratively utilized in the decoder side for the enhancement of BER performance. Its hard to estimate an impulse response for each individual Multiple Input Multiple Output (MIMO) link in Rayleigh fading channel due to experiencing different fading. In order to further improve the performance of the system, a transmitter diversity gain technique is incorporated, such that the coded video bitstream is passed to Sphere Packing (SP) modulation and Differential Space Time Spreading (DSTS) scheme which overcome channel estimation dependency. Due to embedding DSTS MIMO scheme, the proposed transceivers can be deployed in diverse environments for providing better performance.
The following is the main contributions of the proposed work while considered in designing of the three systems i.e., NCC, NCDSTS-SP and CDSTS-SP:
  • H.264 and H.265 source compression standard has been incorporated in order to visualize the quality of the highly compressed video stream at the receiver.
  • SP modulation are included in order to observe the performance of the proposed systems on fading channel.
  • SP modulation is inspired by space time modulation and provides the diversity and coding gain for the proposed system in order to efficiently estimate and recover the actual transmitted information.
  • DSTS scheme is included to remove the Channel State Information (CSI) estimation dependency and the receiver does not required to know the channel fade.
  • Different inner and outer code rates are used in the concerned schemes to visualize the effect of code rates on the convergence property of iterative schemes.
  • EXIT chart are used for finding the number of profitable iterations in the decoding process.
  • Diverse video sequences have been tested to measure the role of the static objects having fine details, dynamism and motion of the object in the background, on the performance metrics.
  • To the best of our knowledge, this is distinctive research study to transmit the highest compression efficient coded video using the H.265/HEVC standard while employing the advocated wireless transmission setup.
The rest of the proposed manuscript has been structured as follows. In Section 2, preliminaries and system design criteria has been presented that is used in designing of our proposed systems. Section 3, a detailed analysis of the proposed system models has been briefly explained. Section 4 presents the performance analysis of our propose simulation. Finally, a conclusion of the resultant work is provided in Section 5.

2. Preliminaries & System Design Criteria

MIMO schemes are mainly used for multiplexing and diversity gain. In multiplexing, different uncorrelated symbols are transmitted from all the antennas while for providing diversity gain, the transmitted symbols should be correlated. Space Time Block Codes (STBC) performs encoding both in space domain as well as in time domain due to using two transmitting (Tx) antennas and requiring two period for transmitting the information symbols. The first Tx antenna transmit the original symbol and all the remaining Tx antennas are transmitting the linear combination or the conjugate version of the symbol that are transmitting from the first antennas. In case of two Tx antennas, symbols d 1 and d 2 are transmitted at the first time period t 1 while at time period t 2 , d 2 and d 1 are transmitted, for which the decoded matrix can be represented as [49]:
[ d 1 d 2 d 2 d 1 ]
From the decoding matrix, the column contains the symbols transmitted at the corresponding time period while the rows comprises of the symbols transmitted from the specific Tx antenna. STBC with the following encoding matrix was proposed for four Tx antennas [50].
1 2 · [ d 1 d 2 z 1 z 2 d 2 d 1 z 2 z 1 d 3 d 4 d 1 d 2 d 4 d 3 d 2 d 1 ]
where d 1 , d 2 , d 3 , and d 4 are the data symbols, z 1 = R e a l ( d 3 ) j i m a g i n a r y ( 2 d 1 d 2 d 4 ) and z 2 = d 1 + d 4 + d 2 2 d 4 + d 1 d 2 x 3 d 1 d 2 d 3 . Differential Space Time Block Codes (DSTBC) does not required CSI estimation at both the Tx or receiving (Rx) antenna. In fast fading channel, the training sequences becomes outdated. Therefore, the overhead will increases if we are transmitting the training sequence at each instance. Differential scheme is suitable in case of fading channels. For PSK modulation having M s p signal points and spectral efficiency m = log 2 M s p , the symbols set S y m t for a transmission period of t using differential PSK from the constellation are:
S y m t = exp ( j 2 π ψ t M s p )
where ψ t ϵ { 0 , 1 , 2 , 3 M s p 1 } . Therefore, the differential encoder, encodes the symbol in such a way that the data information is the difference between the phases of the current and previous symbols such that d t = S y m t · d t 1 . Similarly, DSTBC following the same method for two Tx antennas in transmitting the two symbols having phases d t + 1 , d t + 2 correspondingly, and the previous transmitted symbols d t 1 , d t . Therefore, in each block, the sum of the phases of new vector [ d t + 1 d t + 2 ] T and the previously transmitted vectors, [ d t 1 d t ] T and [ d t d t 1 ] T , is sent as follow [49]:
[ d t + 1 d t + 2 ] = [ d t 1 d t d t d t 1 ] · [ S y m t + 1 S y m t + 2 ]
The design criteria that guarantee maximum diversity and coding gain can be find [49], e.g., the signal received ( r s i g t j ) at antenna j after demodulation is expressed as in the following equation [51].
r s i g n t j = i = 1 N α i , j ψ t i E + η t j
Here, the codeword is expressed with ψ t j and the path gain is represented with η t j . The relationship in Equation (4) between input and output can be also referred to fading channel model. The codewords that are transmitting from a total of N Tx antennas over the total duration of T time period transmission is given below.
ψ 1 = ψ 1 , 1 1 ψ 1 , 2 1 ψ 1 , N 1 ψ 2 , 1 1 ψ 2 , 2 1 ψ 2 , N 1 ψ T , 1 1 ψ T , 2 1 ψ T , N 1
At the receiving side, the signal for a T time period of transmission with a total of M receiving antennas is expressed as in the following equation.
r s i g t = r s i g 1 , 1 r s i g 1 , 2 r s i g 1 , M r s i g 2 , 1 r s i g 2 , 2 r s i g 2 , M r s i g T , 1 r s i g T , 2 r s i g T , M
If the antennas are far away then the path gain is independent from each other, otherwise a spatial correlation exist that can be expressed with the following N × M channel matrix:
H = α 1 , 1 α 1 , 2 α 1 , M α 2 , 1 α 2 , 2 α 2 , M α N , 1 α N , 2 α N , M
r = ψ · H + N
where noise N of a T × N matrix is expressed as given below.
N = η 1 , 1 η 1 , 2 η 1 , M η 2 , 1 η 2 , 2 η 2 , M η T , 1 η T , 2 η T , M
An error occurred if codeword ψ 1 is selected from the codebook at the transmitter antenna and the receiver estimates ψ 2 due to some noise present in the channel.
ψ 2 = ψ 1 , 1 2 ψ 1 , 2 2 ψ 1 , N 2 ψ 2 , 1 2 ψ 2 , 2 2 ψ 2 , N 2 ψ T , 1 2 ψ T , 2 2 ψ T , N 2
The probability of error using the union bound for a set of comprising L number of codewords is given below:
P error ψ 1 is sent i = 2 L P ψ 1 ψ i
For an error matrix D ( ψ 1 , ψ 2 ) = ψ 2 ψ 1 , the pairwise error probability to be define [51] in term of non negatives reals number eigenvalues λ n 0 of matrix A ψ 1 , ψ 2 = D ψ 1 , ψ 2 H · D ψ 1 , ψ 2   = ψ 2 ψ 1 H · ψ 2 ψ 1 where H is conjugate and transpose a the corresponding matrix.
P ψ error n = 1 r λ n M E s 4 N 0 r M
or
P ψ 1 ψ 2 ( E s 4 N 0 ) r M n = 1 r λ n M
or
P ψ 1 ψ 2 4 r M n = 1 r λ n M γ r M
The rank of the matrix A ( ψ 1 , ψ 2 ) represent the diversity gain of the space time codes which can be also expressed with the rank of the difference matrix D ( ψ 1 , ψ 2 ) multiplied with the number of M receiving antennas, i.e., the power of SNR ( r × M ) in the denominator of Equation (14). A rank cafeteria to be defined for guarantee a full diversity scheme as, if the matrix A ( ψ i , ψ i ) is a full rank matrix for all the possible codewords. Contrary to diversity gain, coding gain is the distance between two codes which is related to the determinate of matrix A ( ψ 1 , ψ 2 ) . Coding gain can be improve by maximizing the minimum determinate of matrix A ( ψ 1 , ψ 2 ) [49,51].
The Alamouti code is a full rank matrix and is satisfies the determinant criteria. Considering a time correlated Rayleigh fading channel, SP modulation obtaining a full diversity gain using the joint combination of orthogonal designs with sphere packing [52]. Let, the orthogonal function G 1 ( d ) = d 1 I 1 , where I 1 denotes the identity matrix, then the recursive orthogonal function is expressed as given below [52,53]:
G 2 k d 1 , , d k + 1 = G 2 k 1 d 1 , , d k d k + 1 I 2 k 1 d k + 1 I 2 k 1 G 2 k 1 H d 1 , , d k
where d 1 , d 2 , d k + 1 are the complex variables and their conjugate is expressed as d 1 , d 2 , d k + 1 , H is the transpose and conjugate of G 2 k 1 .
Furthermore, the SP signals are constructed using the above equation as expressed in the following equation.
S p = 2 k / ( k + 1 ) G 2 k d 1 , d 2 , , d k + 1
The symbol rate is denote as k + 1 / 2 k where the 2 k / ( k + 1 ) is used for energy constraint as a normalization factor. Using Equation (16), the set of SP signals for 2 transmitting antennas are given below:
S p = 2 k / ( k + 1 ) G 2 d 1 , d 2 =
2 k / ( k + 1 ) G 1 d 1 d 2 d 2 G 1 H d 1 = 2 k / ( k + 1 ) d 1 d 2 d 2 d 1
Using the same Equation (16), the set of SP signals can be constructed for four and eight transmitting antennas is given below:
S p = 2 k / ( k + 1 ) G 4 d 1 , d 2 , d 3 =
2 k / ( k + 1 ) G 2 d 1 , d 2 d 3 I 2 d 3 I 2 G 2 H d 1 , d 2
= 2 k / ( k + 1 ) d 1 d 2 d 3 0 d 2 d 1 0 d 3 d 3 0 d 1 d 2 0 d 3 d 2 d 1
S p = 2 k / ( k + 1 ) G 8 d 1 , d 2 , d 3 , d 4 =
2 k / ( k + 1 ) G 4 d 1 , d 2 , d 3 d 4 I 4 d 4 I 4 G 4 H d 1 , d 2 , d 3
= 2 k / ( k + 1 ) d 1 d 2 d 3 0 d 4 0 0 0 d 2 d 1 0 d 3 0 d 4 0 0 d 3 0 d 1 d 2 0 0 d 4 0 0 d 3 d 2 d 1 0 0 0 d 4 d 4 0 0 0 d 1 d 2 d 3 0 0 d 4 0 0 d 2 d 1 0 d 3 0 0 d 4 0 d 3 0 d 1 d 2 0 0 0 d 4 0 d 3 d 2 d 1 .
The differential MIMO (DSTS) scheme comprises two encoders i.e., differential encoder and space time spreading (STS) encoder as shown in Figure 1. The symbols are first differential encoded and then spreaded by STS scheme. For the very first time, two dummy symbols d diff 0 1 , d diff 0 2 are transmitted which have no information bits. After that, the data information stream d t k ; k = 1 , 2 are separated into d diff t 1 , d diff t 2 symbols. The encoded symbols at the 2 transmitter antennas are then expressed as with the following equation [54].
d diff t 1 = d t 1 · d diff t 1 1 + x t 2 · d diff t 1 2 | d diff t 1 1 | 2 + | d diff t 1 2 | 2
d diff t 2 = d t 1 · d diff t 1 2 d t 2 · d diff t 1 2 | d diff t 1 1 | 2 + | d diff t 1 2 | 2
After passing from the STS encoder, the two consecutive symbols at both the transmitting antennas is expressed with the following mapping as shown in Figure 2 [54].
d s p r e a d e d t 1 = 1 2 ( S c ¯ 1 · d diff t 1 + S c ¯ 2 · d diff t 2 )
d s p r e a d e d t 2 = 1 2 ( S c ¯ 1 · d diff t 2 S c ¯ 1 · d diff t 2 )
where S c 1 ¯ and S c 2 ¯ are the orthogonal codes at both the transmitting antennas. The proposed work uses DSTS MIMO scheme for attaining diversity and coding gain. In case of Rayleigh fading fading channel, DSTS MIMO scheme is a suitable candidate for removing the CSI estimation at each instance of transmission.

3. Proposed System Model

Typically, the primitive of the transmitter block comprises discrete input, encoder, digital modulator, and the output. Channel encoder is operating for the removal of error effects by adding some redundancy through a controlled methodology in the information bits. Channel flexibility takes place by manipulating RSC codes as a constituent channel codes to efficiently operate in AWGN and as well in Rayleigh fading channels. The preliminaries and system design criteria has been briefly explained in [26]. The proposed system model comprises three video phone arrangements as shown in Figure 3 and the specifications of the transceivers are given in Table 1, Table 2, Table 3, Table 4 and Table 5. All the transceivers follow the same mechanism and are only different by using different modulation schemes and arrangements of inner and outer code rates. In the proposed scheme, video is initially compressed at the transmitter side by using the international compression standard H.264 and H.265 video codec with the parameters settings of the source encoder as given in Table 1. The different building blocks of the proposed system models are as follows.

3.1. H.264 and H.265 Source Video Coding Standard

Advance Video Coding (AVC)/H.264 is a compression standard resulted in the extension of the video service delivery to various networks and applications. The coding architecture of H.264 consists of two basics layers i.e., Video Coding Layer (VCL) and Network Adaptation Layer (NAL). VCL provided efficient representation of video contents while NAL provides network friendly representation. On the other hand, HEVC is the latest state of the art video compression standard designed for achieving the goals of parallel-processing, resilience to data-loss and coding efficiency [55]. It comprises of a high-level syntax for supporting network interface, implementation aspects and a compressed data of video layer. HEVC has been extended with enhanced bit depth capability, scalability of embedded-bits and 3D multi-view representations [56]. The need of H.265 was realized due to the increasing demand of HD videos, higher resolutions videos and for the transmission of multi-view or 3D displays, which encouraged the joint collaborative team to bring new compression standard superior to prior codecs and to incorporate higher usage of parallel processing [57]. The primary goal in designing of HEVC was to reduce the number of bits for the same quality of video, compared to the predecessor standards. Some features retained in the H.265 standard architecture from H.264 standard are:
  • Parameter set structure, containing sharable information deployed in the decoding process. The new Video Parameter Set (VPS) is based on the concepts of sequence and picture parameter sets used in H.264.
  • Network Abstraction Layer (NAL) unit is retained which is helpful in identification of the purpose of payload data.
  • In case of data losses, synchronization is done in H.265 with the help of slices. The same concept of slices is followed in H.264.
  • Information about the timing of a video picture, interpretation of the color space and other display related information is provided by Supplemental Enhancement Information (SEI) and Video usability Information (VUI) metadata, adopted in H.265 and H.264.
The hybrid approach of standard H.261 concerning inter/intra-prediction and 2D transform is also imported in H.265. A prediction residual is computed based on the inter-picture motion compensation or intra-picture spatial predictions. This residual undergoes through block transformation, quantization and entropy coding respectively. A detailed description can be found in [55,56,58].
HEVC fulfilled the target 50% compression efficiency improvement over the predecessor H.264/AVC, providing adequate image quality with the trade-off computational complexity. HEVC consumes half bandwidth to that of the AVC thats why users experiences a reasonable video quality over very small bandwidth. The standard is specifically designed to be compatible with all the existing applications of the previous codecs.
Furthermore, HEVC divides each frame into slices where slice is a sequence of an independent and subsequent dependent slice segments. Each slice has a specific header that enables its independent decoding of all the other slices in a given picture. Slice segments comprises of Coding Tree Units (CTU) which contains one luma and two chroma Code Tree Block (CTB) along with their syntax while CTB is a region of N × N (N = 16, 32, 64) samples of the image, with the fact that larger sizes provide better compression. The higher size of the CTB is largely required when higher resolution videos are to be encoding which is beneficial for reducing the decoding time. Similarly, CTB can be further divided into coding blocks (CB), one luma CB and two Chroma CB along with syntax are called coding unit (CU). The size of CB should be in the range of 8 × 8 ≤ CB Size ≤ CTB size. The CTB blocks are then further divided into Transform Blocks (TB) for Discrete Cosine Transform (DCT) into a block of 4 × 4 and similarly the luma TBs and two chroma TBs combine along with their syntax to form Transform Unit (TU). For example, HEVC divides the frame into two slices while the first slice comprises three slice segments refer to Figure 4. The first slice are divided into 4 CTU independent segments, 32 depended segments and again 24 dependent segments, the second slice is composed of 39 independent CTUs [59].

3.2. Source Channel Encoding

The compressed video bit-stream comprises three partitions types A, B, and C of all the slices per frame. The bit-stream x i is then mapped to bit-string x i using a special bitmapping scheme, referred to as Over-complete Mapping (OCM), where we have i = 1 , 2 , 3 n . Thereafter the mapped bit string x i is interleaved to conquer the burst of error into x i ¯ using the bit interleaver ( Π ). The interleaved sequence x i ¯ is then passed to the RSC encoder which encodes the sequence by different code rates as shown in Table 3. Our proposed NCC and NCDSTS-SP schemes employ Rate-1 OCM, presented in Table 2, as inner code while Rate-1/3 RSC code is employed as outer channel code. Contrary to this and in order to improve the convergence behavior of our employed iterative coding, CDSTS-SP scheme employ Rate-3/4 OCM, presented in Table 2, as inner code while Rate-4/9 RSC code is employed as outer channel code. The channel coded output from this block is forwarded to the next block for further processing.

3.3. Modulation and Transmission

The encoded bit-stream received from the source channel encoding block is then forwarded to modulation and transmission block. Using NCC scheme the received bit-stream is mapped to the stream of QPSK symbols. Afterwards, the Single Input Single Output (SISO) transmitter antenna is used to transmit the QPSK modulated Symbols over Rayleigh Fading channels.
While using NCDSTS-SP and CDSTS-SP schemes the received bit-stream is mapped to the SP symbols S i , where i = 0 , 1 , 2 , 3 L 1 by SP mapper, L is the modulated symbols that represents the number of SP signaling elements [52,60]. Afterwards, the DSTS transmitter antenna is used to transmit the SP modulated Symbols over Rayleigh Fading channels.
DSTS transceiver is liable to diversity gain in wireless network, reduced BER and accomplished a substantial video quality improvement. Due to embedding DSTS MIMO scheme, the proposed transceivers can be deployed in diverse environment for providing better video quality performance.
At the decoder side the received signal is first presented to the DSTS decoder and SP demapper which gives the Log-Likelihood Ratios (LLRs) value for the bit-stream. The generated LLR values are forwarded to the next block for further processing.

3.4. Iterative Soft Decoding

Subsequently, the extrinsic LLRs information received from the modulation and transmission block is then exchanged between the RSC decoders and SBSD in iterative fashion. The extrinsic information for Markov zero order model using SBSD can be found in [61]. The residual redundancy remains in M-ary symbols after the source encoding by H.264 and H.265 in term of non-uniform probability distribution P [ S n ( j ) ] , where S n ( j ) = [ S n ( 1 ) , S n ( 2 ) , S n ( 3 ) S n ( M ) ] refer to Table 2. The results in Table 2 is accomplished by assigning 3 bits/symbols to the H.264/AVC encoded bit-stream, i.e., [ b 1 ( 1 ) , b 1 ( 2 ) , b 1 ( 3 ) ; b 2 ( 1 ) , b 2 ( 2 ) , b 2 ( 3 ) b j ( 3 ) ] . The channel output symbol is generated by the following expression, provided that the bits are independent of each other.
P [ y ^ τ | y τ ] = k = 1 K P [ y ^ τ ( k ) | y τ ( k ) ] ,
where y ^ τ is the received K-bit sequence on transmission of y τ . For a specific bit [ y τ ( λ ) ] , the output extrinsic channel information P [ y ^ τ [ e x t ] | y τ [ e x t ] ] can be achieved as follow.
P [ y ^ τ [ e x t ] | y τ [ e x t ] ] = k = 1 , k λ K P [ y ^ τ ( k ) | y τ ( k ) ] .
Eventually, the channel output information is combined with the A-priori knowledge of that corresponding symbol to achieve the resultant LLR extrinsic value, as expressed below.
L L R [ y τ ( λ ) ] =
l o g y τ [ e x t ] P [ y τ [ e x t ] | y τ ( λ ) = + 1 ] . P [ y ^ τ [ e x t ] | y τ [ e x t ] ] y τ [ e x t ] P [ y τ [ e x t ] | y τ ( λ ) = 1 ] . P [ y ^ τ [ e x t ] | y τ [ e x t ] ] .
Natural redundancy residue in the encoded bit stream x i and is used in conventional SBSD as source of performance to obtain the extrinsic information. The achievable performance in video bit stream is limited when coding standard H.264/H.265 is used as encoder, which remove most of the residual redundancy. Therefore, the proposed scheme uses over complete mapping to generate artificial redundancy in coded video bit stream. To extract extrinsic information, the ingredient inner and outer decoder of iterative channel decoding provides sufficient information to each other in each iteration using the concept of interlacing.
EXIT chart operates on the special property of iterative decoder, which is convergence behavior as proposed by Stephen ten Brink [62], by exploiting the exchange of input and output mutual information. It is useful for serial, parallel and hybrid concatenated systems as well. Our proposed OCM is designed for employing this specific property of EXIT charts. If there is an open tunnel between the EXIT curves of the RSC and SBSD decoder, then the iterative decoding achieve extremely small decoded BER. Consequently, the above condition becomes satisfied when the EXIT curve intersects at a point ( I A , I B ) = ( 1 , 1 ) and it require a hamming distance between any two codewords ( d H , m i n ) = 2 , which is briefly explained in [26,27,28]. This condition stimulates our OCM scheme that it is possible to find a code table that satisfies the above condition by complete search of all codewords having mapping rate reciprocal to that of code rate and having ( d H , m i n ) = 2 , as presented in Table 4. Based on this table, the EXIT curve gets closer to the perfect point of convergence i.e., reaching to ( I A , I E ) = ( 1 , 1 ) of both the OCM and codec block, regardless of the shape of the Exit curve of the standalone encoder.
Moreover, the EXIT curve for NCC scheme of SBSD using OCM rate-1 inner code and RSC rate-1/3 outer code is shown in Figure 5a. It can be observed that OCM rate-1 is not capable to reach to the point of perfect convergence—i.e., to the rightmost corner at ( I A ; I E ) = (1, 1) and hence is not capable to exploit a beneficial advantage of iterative decoding. In contrast, the EXIT curve of SBSD using OCM rate-3/4 inner code reaches the point of perfect convergence, which in combination with Rate-4/9 RSC code can get advantage of iterative decoding by iteratively exchanging extrinsic information to achieve very low BER refer to Figure 5b. The EXIT Trajectory using OCM Rate-3/4 along with Rate-4/9 RSC code of CDSTS-SP at E b / N 0 of −3 to 3 dB is shown in Figure 5c. It can be observed that due to convergence of inner and outer code to the point of perfect convergence—i.e., at ( I A ; I E ) = ( 1 , 1 ) , iterative decoding is capable to iteratively reach at the point of perfect convergence through an open-tunnel available at 0 , 1.5 dB E b / N 0 and above.

4. Simulations Results and Analysis

This section provides a detailed performance analysis of the proposed system. The JM version 15 video and HM master codec developed by the joint video team (JVT) is used as a reference H.264 and H.265 source encoder. The proposed systems is simulated using IT++ signal processing and communications library, coded in C++. To perform a fair analysis between the three schemes, several aspects of the proposed systems have been considered. Various sample of the diverse video sequences is considered and provided as an input to the considered system models. Convergence behavior in iterative source channel decoding is specifically analyzed in the proposed work with different code rates of the inner and outer channel codes. In our simulation, three different video sequences—i.e., AKIYO, FOREMAN and MOBILE as shown in Figure 6, each with three different resolution (CIF, QCIF and 4CIF) in YUV(4:2:0) format is used as testing sequences.
Brief information about the video sequences used in the simulation experiments is as follows:
  • Akiyo: This is a video of newscaster with extremely negligible movement and less details in the background.
  • Foreman: This is a video of a foreman with rapidly moving face, extensive zoom out, average details and regular structures.
  • Mobile: This is a video of a moving toy train with high details and dynamism.
The idea of using these different diverse video sequences with different video resolutions, level of motion and dynamism is to investigate the impact of video contents on its objective video quality while considering similar channel. As the extent of compression and robustness of the coded bit-stream is directly linked with its resolution, dynamism and motion content of the video stream, therefore the objective video quality performance of different video sequences is expected to be variable, while considering similar communication setup.
The number of bits per frame for luminance and chrominance in different sampling format and video sequences are calculated from [5] as depicted in Table 5. The duration of each video comprising of 45 frames at the frame rate of 15 fps is considered. The macro-blocks in H.264/AVC are processed in the raster scan order of a group of MBs called a slice, which represents a region of a given picture that can be processed independently of each other. In our experimental setup each slice consists of 11 Mbs while in HEVC, each frame is divided into slices which comprises CTUs. The considered slice types are intra (I), inter (P) slices. All MBs and CTUs in I slice are encoded using intra mode where as in P all MBs are coded using Intra coded mode with reference to previous frame. The encoder H.264.AVC is set at 15 fps. The resultant video frame sequence followed the pattern I 1 P 2 P 3 P 4 P 14 in which each 15th frame is I and the predecessor 14 are p frames. All the simulation parameters are listed in Table 1 and Table 3.
In order to reduce the computational complexity, iterations between RSC and SBSD decoders are set to 3 and 5 for the OCM rate-1 and 3 4 respectively. The averaged results are obtained by repeating every 45 frames 160 times. The proposed system performance is exploited while considering the same overall code rate for the input H.264 encoded bit-stream.
The measurement of video quality is a major task in a scenario where there is a transmission of a compressed video over noisy channels. Subjective [63] and objective [63] techniques are widely used for measuring the video quality. Human participants are required in subjective technique to assess the streaming quality and therefore it is a time-consuming solution. On the other hand, PSNR is an objective metric for measuring the video quality when the video contents, codec and underlying communication setup remains unchanged [64].
PSNR is the ratio between the signal of the power of the original signal to that of the corrupted signal. PSNR is traditionally used as a quality metric for evaluating algorithms in multimedia streaming systems. It is generally expressed in term of Mean Square Error (MSE). The MSE of an image X having dimension of size M × N and its corrupted image Y can be expressed by the following equation.
M S E = 1 M N i = 1 M j = 1 N [ X ( i , j ) Y ( i , j ) ]
P S N R ( d B ) = 10 × log 10 M A X X M S E
where M A X X is the maximum value of a pixel of image X and can be calculated as 2 B 1 for a B bits per sample.
The BER and PSNR performance of the simulation work were plotted against varying E b / N 0 values, while utilizing diverse video sequences having different resolution, motion and dynamism while employing rayleigh fading channel as a communication medium.
In our simulation scenarios OCM is employed as inner code, while RSC is used as outer channel code. Furthermore, in the considered NCC scheme QPSK modulation is employed, while in NCDSTS-SP and CDSTS-SP sphere packing modulation is used in combination with DSTS.
Furthermore, in order to analyze the impact of varying motion contents and dynamism of video scene on the objective video quality performance of the transmission scheme, its PSNR versus E b / N 0 dB performance trends are plotted in Figure 7a–c for NCC, NCDSTS-SP and CDSTS-SP schemes, respectively, while using AKIYO, FOREMAN and MOBILE video sequences in its QCIF video resolution. In Figure 7a–c similar trend in performance variations due to employment of different video sequences, with varying motion contents and dynamism is observed for all three employed schemes. Frome Figure 7a, the PSNR Vs E B / N 0 [dB] is plotted for the same resolution resolution of different sequences i.e., AKIYO, FOREMAN and MOBILE considering NCC scheme. More specifically, it is observed from Figure 7a–c that the presence of high channel noise, the AKIYO video sequence with low motion contents and dynamism outperforms the FOREMAN and MOBILE video sequence with high motion contents and dynamism. This is due to the fact that in the presence of low motion contents and dynamism of video, the H.264/AVC can compress the corresponding video sequence more robustly within the allocated bit-rate budget. Furthermore, the performance of the H.264/AVC error concealment mechanism is more effective in video sequences with low motion contents and dynamism and as a result its objective quality performance will outperform.
In order to analyze the impact of varying video resolution on the objective video quality performance of the transmission scheme its PSNR versus E b / N 0 dB performance trends were plotted in Figure 8a–c for NCC, NCDSTS-SP and CDSTS-SP schemes, respectively, while using AKIYO video sequence. Figure 8a is plotted for transmitting AKIYO video sequence on NCC scheme from only one Tx antenna varying the E b / N 0 values. As it is clear that the for a higher E b / N 0 , the performance of the NCC scheme in terms of PSNR is reasonable for all the three video resolutions. Moreover, due to using fast fading channel, lower the resolution of the video sequence higher its performance while NCC does not employing any differential detection at the Rx antenna. Most of the time, the SNR comes under deep fade for which the decoder cannot reliably decoder the received signal and hence the performance is degraded for the concerned scheme. In contrast, the performances of NCDSTS-SP and CDSTS-SP schemes for a very low E b / N 0 values are much better due to using differential detection and spreading codes as shown in Figure 8b,c. Moreover, CDSTS-SP scheme outperform in terms of PSNR values compare to NDSTS-SP due to the reason that CDSTS-SP achieved perfect convergence on EXIT chart using the specific inner and outer code rates specified at both the encoders. Therefore, the rule of code rate is visible on quality performance upon using the iterative decoding. The performance difference in terms of resolution used is still valid for which both the schemes outperform for lower resolution. In Figure 8a–c similar trend in performance variations due employment of different resolution videos is observed for all three employed schemes. As from Figure 8c, the PSNR reached to its peak value for a lower E b / N 0 value and remains constant for a remaining range of E b / N 0 . It is due to the fact that no matter how much the E b / N 0 is increased, the performance remains constant after the E b / N 0 range at which perfect convergence is achieved. Therefore, it is observed from Figure 8a–c that the presence of high channel noise the low resolution video outperforms its high resolution counterparts.
The BER versus E b / N 0 dB performance of the NCC, NCDSTS-SP and CDSTS-SP schemes is presented in Figure 9a. BER performance is plotted against E b / N 0 values for all the three schemes. The BER values approaches to 10 7 for a very low E b / N 0 for CDSTS-SP and the same fact is visualize for PSNR from Figure 9b. As it is already discussed that CDSTS-SP is employing differential methods for recovering the SNR values at the receiver and is suitable scheme for fast fading channel while NCDST-SP is also employing differential methods but cannot obtained the perfect convergence at the considered inner and outer code rate. The worst BER performance is noticed for NCC scheme which is neither employing differential scheme nor obtained convergence on the specific inner and outer code rate. Its observed from Figure 9a that the BER performance of CDSTS-SP is best as compared to NCDSTS-SP scheme which is performing better then NCC scheme.
The effects of BER on PSNR performance of the CDSTS-SP scheme can be easily observed from Figure 9b. For a very low range of E b / N 0 [dB] values, the PSNR reached to its peak values and remains constant for the rest of the E b / N 0 range which shows that the differential scheme obtained a sufficient information about a channel on iterative decoding from observing the phases of the current and the the previous symbols that can estimate the value on a lower range of E b / N 0 values. Similar performance is observed for NDSTS-SP which on slightly higher values of E b / N 0 obtained convergence and tends to be constant for the rest of the E b / N 0 range while the NCC scheme is worst compared to the others and showing an increasing trend when E b / N 0 values increases. Finally, its observed from the PSNR versus E b / N 0 dB performance curves of the NCC, NCDSTS-SP and CDSTS-SP schemes, presented in Figure 9b that the sophisticated system design of CDSTS-SP outperform its counterpart NCC, NCDSTS-SP schemes in terms of PSNR. More specifically, it is observed from Figure 9b that NCDSTS-SP results in PSNR gain of 6 dB and CDSTS-SP results in PSNR gain of 28 dB for E b / N 0 value of 10 dB, with reference to bench marker system design of NCC.
The PSNR Vs Eb/N0 performance of the three advocated schemes—i.e., NCC, NCDSTS-SP and CDSTS-SP with reference to the employed H.264 and H.265 standards is presented in Figure 9b. It can be observed from the Figure 9b, that while utilizing H.265 video coding standard, the NCDSTS-SP with enhanced transmission mechanism results in E b / N 0 gain of 20 dB with reference to the NCC coding and transmission mechanism at the objective PSNR value of 42 dB. It is important to mention that both NCC and NCDSTS-SP schemes have employed identical code rate. Furthermore, the CDSTS-SP with convergent coding mechanism achieved superior performance with reference to the competing NCDSTS-SP counterpart mechanism, with a performance gain of 16 dB at the objective PSNR grade of 42 dB. Moreover, it is observed that while employing any of the three advocated schemes—i.e., NCC, NCDSTS-SP and CDSTS-SP the maximum achievable PSNR gain through H.265 video coding standard is 45 dB, with a PSNR gain of 3 dB with reference to the identical code rate H.264 coding scheme.

5. Conclusions

This paper presents performance analysis of sophisticated channel coding and transmission schemes for reliable transmission of H.264/AVC and H.265/HEVC compressed video. Three different coding and transmission schemes were presented, namely Non-Convergent Coding (NCC), Non-Convergent Coding assisted with Differential Space Time Spreading (DSTS) and Sphere Packing (SP) modulation (NCDSTS-SP) and Convergent Coding assisted with Differential Space Time Spreading (DSTS) and Sphere Packing (SP) modulation (CDSTS-SP). Different diverse combination of source mapping techniques and channel coding techniques were employed and their convergence behavior is analyzed with the aid of Extrinsic Information Transfer (EXIT) Charts. Furthermore, to improve the diversity gain of the transceiver advanced Sphere Packing (SP) modulation technique assisted by Differential Space Time Spreading (DSTS) is employed. The BER and PSNR performance of the proposed schemes were plotted against varying E b / N 0 values, while utilizing diverse video sequences having different resolution, motion and dynamism while employing Rayleigh fading channel as a communication medium. It was observed that in the presence of high channel noise the low resolution videos outperforms its high resolution counterparts. Furthermore, it was observed that in the presence of high channel noise, the AKIYO video sequence with low motion contents and dynamism outperforms the FOREMAN and MOBILE video sequence with high motion contents and dynamism. More specifically, it is observed that while utilizing H.265 video coding standard, the NCDSTS-SP scheme with enhanced transmission mechanism results in Eb/N0 gain of 20 dB with reference to the NCC coding and transmission mechanism at the objective PSNR value of 42 dB. It is important to mention that both NCC and NCDSTS-SP schemes have employed identical code rate. Furthermore, the CDSTS-SP with convergent coding mechanism achieved superior performance with reference to the equivalent rate NCDSTS-SP counterpart mechanism, with a performance gain of 16 dB at the objective PSNR grade of 42 dB. Moreover, it is observed that the maximum achievable PSNR gain through H.265 video coding standard is 45 dB, with a PSNR gain of 3 dB with reference to the identical code rate H.264 coding scheme. Furthermore, it is observed from that the BER and PSNR versus E b / N 0 dB performance of the NCC, NCDSTS-SP and CDSTS-SP schemes that the BER performance of CDSTS-SP is best as compared to NCDSTS-SP scheme, which is performing better then NCC scheme. More specifically, it is observed that the NCDSTS-SP results in PSNR gain of 20 dB with reference to the NCC coding and transmission mechanism at the objective PSNR value of 42 dB. Furthermore, the CDSTS-SP with convergent coding mechanism achieved superior performance with reference to the equivalent rate NCDSTS-SP counterpart mechanism, with a performance gain of 16 dB at the objective PSNR grade of 42 dB. Moreover, it is observed that the maximum achievable PSNR gain through H.265 video coding standard is 45 dB, with a PSNR gain of 3 dB with reference to the identical code rate H.264 coding scheme.

Author Contributions

Conceptualization, N.M. and K.U.; methodology, N.M. and L.H.; software, N.M., K.U., J.N.; validation, N.M., J.F., J.N. and L.H.; formal analysis, K.U.; investigation, N.M.; resources, L.H., J.N.; data curation, J.F.; writing—original draft preparation, K.U.; writing—review and editing, N.M., J.F., J.N. and L.H.; visualization, L.H.; supervision, N.M. and L.H.; project administration, N.M. and L.H.; funding acquisition, N.M., J.N. and J.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been funded with support of the project SP2021/45, assigned to VSB-Technical University of Ostrava, the Ministry of Education, Youth and Sports in the Czech Republic.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The financial support of the National Center of Big data and Cloud Computer (NCBC), University of Engineering and Technology, Peshawar, under the auspices of Higher Education Commission, Pakistan is gratefully acknowledged.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Index, C. Cisco visual networking index: Global mobile data traffic forecast update, 2015–2020. In Cisco Technical Report; White Paper; Cisco System: San Jose, CA, USA, 2016. [Google Scholar]
  2. Global Mobile Data Traffic Forecast. Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2018–2023; White Paper; Cisco System: San Jose, CA, USA, 2020. [Google Scholar]
  3. Björnson, E.; Hoydis, J.; Sanguinetti, L. Massive MIMO networks: Spectral, energy, and hardware efficiency. Found. Trends Signal Process. 2017, 11, 154–655. [Google Scholar] [CrossRef]
  4. Pirinen, P. A brief overview of 5G research activities. In Proceedings of the 1st International Conference on 5G for Ubiquitous Connectivity, Akaslompolo, Finland, 26–28 November 2014; pp. 17–22. [Google Scholar]
  5. Richardson, I.E. H. 264 and MPEG-4 Video Compression: Video Coding for Next-Generation Multimedia; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
  6. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  7. Shannon, C.E. Communication in the Presence of Noise. Proc. IRE 1949, 37, 10–21. [Google Scholar] [CrossRef]
  8. Hamming, R.W. Error Detecting and Error Correcting Codes. Bell Syst. Tech. J. 1950, 29, 147–160. [Google Scholar] [CrossRef]
  9. Elias, P. Coding for noisy channels. In IRE Convention Record; Technical Report; Butterworth Scientific Publications: London, UK, 1955; pp. 37–47. [Google Scholar]
  10. Nguyen, H.V.; Xu, C.; Ng, S.X.; Hanzo, L. Near-capacity wireless system design principles. IEEE Commun. Surv. Tutor. 2015, 17, 1806–1833. [Google Scholar] [CrossRef] [Green Version]
  11. Hagelbarger, D.W. Recurrent Codes: Easily Mechanized, Burst-Correcting, Binary Codes. Bell Syst. Tech. J. 1959, 38, 969–984. [Google Scholar] [CrossRef]
  12. Wozencraft, J.M.; Reiffen, B. Sequential Decoding; The MIT Press: Cambridge, MA, USA, 1961; Volume 10. [Google Scholar]
  13. Fano, R. A heuristic discussion of probabilistic decoding. IEEE Trans. Inf. Theory 1963, 9, 64–74. [Google Scholar] [CrossRef]
  14. Viterbi, A. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans. Inf. Theory 1967, 13, 260–269. [Google Scholar] [CrossRef] [Green Version]
  15. Forney, G.D. The viterbi algorithm. Proc. IEEE 1973, 61, 268–278. [Google Scholar] [CrossRef]
  16. Forney, G.D., Jr. The viterbi algorithm: A personal history. arXiv 2005, arXiv:cs/0504020. [Google Scholar]
  17. Forney, G. Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference. IEEE Trans. Inf. Theory 1972, 18, 363–378. [Google Scholar] [CrossRef] [Green Version]
  18. Bahl, L.; Cullum, C.; Frazer, W.; Jelinek, F. An efficient algorithm for computing free distance (Corresp.). IEEE Trans. Inf. Theory 1972, 18, 437–439. [Google Scholar] [CrossRef]
  19. Bahl, L.; Cocke, J.; Jelinek, F.; Raviv, J. Optimal decoding of linear codes for minimizing symbol error rate (Corresp.). IEEE Trans. Inf. Theory 1974, 20, 284–287. [Google Scholar] [CrossRef] [Green Version]
  20. Berrou, C.; Glavieux, A.; Thitimajshima, P. Near Shannon limit error-correcting coding and decoding: Turbo-codes. 1. In Proceedings of the ICC 93—IEEE International Conference on Communications, Geneva, Switzerland, 23–26 May 1993. [Google Scholar] [CrossRef]
  21. Berrou, C.; Glavieux, A. Near optimum error correcting coding and decoding: Turbo-codes. In The Best of the Best: Fifty Years of Communications and Networking Research; Wiley-IEEE Press: Hoboken, NJ, USA, 2007. [Google Scholar]
  22. Steele, R.; Hanzo, L. Mobile Radio Communications: Second and Third Generation Cellular and WATM Systems, 2nd ed.; IEEE Press-John Wiley: Chichester, UK, 1999. [Google Scholar]
  23. Koch, W.; Baier, A. Optimum and sub-optimum detection of coded data disturbed by time-varying intersymbol interference (applicable to digital mobile radio receivers). In Proceedings of the GLOBECOM 90: IEEE Global Telecommunications Conference and Exhibition, San Diego, CA, USA, 2–5 December 1990. [Google Scholar] [CrossRef]
  24. Chen, C.; Wang, L.; Lau, F.C. Joint optimization of protograph LDPC code pair for joint source and channel coding. IEEE Trans. Commun. 2018, 66, 3255–3267. [Google Scholar] [CrossRef]
  25. Burth Kurka, D.; Gündüz, D. Joint Source-Channel Coding of Images with (not very) Deep Learning. In International Zurich Seminar on Information and Communication (IZS 2020) Proceedings; ETH Zurich: Zürich, Switzerland, 2020; pp. 90–94. [Google Scholar]
  26. Minallah, N.; Ullah, K.; Frnda, J.; Cengiz, K.; Awais Javed, M. Transmitter Diversity Gain Technique Aided Irregular Channel Coding for Mobile Video Transmission. Entropy 2021, 23, 235. [Google Scholar] [CrossRef]
  27. Minallah, N.; Butt, M.F.U.; Khan, I.U.; Ahmed, I.; Khattak, K.S.; Qiao, G.; Liu, S. Analysis of Near-Capacity Iterative Decoding Schemes for Wireless Communication Using EXIT Charts. IEEE Access 2020, 8, 124424–124436. [Google Scholar] [CrossRef]
  28. Minallah, N.; Ahmed, I.; Ijaz, M.; Khan, A.S.; Hasan, L.; Rehman, A. On the Performance of Self-Concatenated Coding for Wireless Mobile Video Transmission Using DSTS-SP-Assisted Smart Antenna System. Wirel. Commun. Mob. Comput. 2021, 2021, 8836808. [Google Scholar] [CrossRef]
  29. Brejza, M.F.; Maunder, R.G.; Al-Hashimi, B.M.; Hanzo, L. A High-Throughput FPGA Architecture for Joint Source and Channel Decoding. IEEE Access 2017, 5, 2921–2944. [Google Scholar] [CrossRef]
  30. Balsa, J.; Domínguez-Bolaño, T.; Fresnedo, Ó.; García-Naya, J.A.; Castedo, L. Transmission of Still Images Using Low-Complexity Analog Joint Source-Channel Coding. Sensors 2019, 19, 2932. [Google Scholar] [CrossRef] [Green Version]
  31. Bourtsoulatze, E.; Burth Kurka, D.; Gündüz, D. Deep Joint Source-Channel Coding for Wireless Image Transmission. IEEE Trans. Cogn. Commun. Netw. 2019, 5, 567–579. [Google Scholar] [CrossRef] [Green Version]
  32. Yang, H.; Qing, L.; He, X.; Ou, X.; Liu, X. Robust distributed video coding for wireless multimedia sensor networks. Multimed. Tools Appl. 2018, 77, 4453–4475. [Google Scholar] [CrossRef]
  33. Cai, S.; Lin, W.; Yao, X.; Wei, B.; Ma, X. Systematic Convolutional Low Density Generator Matrix Code. arXiv 2020, arXiv:2001.02854. [Google Scholar]
  34. Hagenauer, J. The exit chart—Introduction to extrinsic information transfer in iterative processing. In Proceedings of the 2004 12th European Signal Processing Conference, Vienna, Austria, 6–10 September 2004; pp. 1541–1548. [Google Scholar]
  35. Brink, S. Designing iterative decoding schemes with the extrinsic information chart. AEU Int. J. Electron. Commun 2000, 54, 389–398. [Google Scholar]
  36. Du, J.; Yang, L.; Yuan, J.; Zhou, L.; He, X. Bit Mapping Design for LDPC Coded BICM Schemes with Multi-Edge Type EXIT Chart. IEEE Commun. Lett. 2017, 21, 722–725. [Google Scholar] [CrossRef]
  37. Stockhammer, T.; Hannuksela, M.; Wiegand, T. H. 264/AVC in wireless environments. IEEE Trans. Circuits Syst. Video Technol. 2003, 13, 657–673. [Google Scholar] [CrossRef]
  38. Nam, C.; Chu, C.; Kim, T.; Han, S. A novel motion recovery using temporal and spatial correlation for a fast temporal error concealment over H. 264 video sequences. Multimed. Tools Appl. 2020, 79, 1221–1240. [Google Scholar] [CrossRef]
  39. Nie, H.; Jiang, X.; Tang, W.; Zhang, S.; Dou, W. Data security over wireless transmission for enterprise multimedia security with fountain codes. Multimed. Tools Appl. 2020, 79, 10781–10803. [Google Scholar] [CrossRef]
  40. Yuan, J. Video data wireless transmission method based on cross-layer bitrate adaptation and error control. Multimed. Tools Appl. 2019, 79, 9255–9266. [Google Scholar] [CrossRef]
  41. Ibrahim, S.K.; Khamiss, N.N. A new wireless generation technology for video streaming. J. Comput. Netw. Commun. 2019, 2019, 3671826. [Google Scholar] [CrossRef]
  42. Ma, Z.; Sun, S. Research on HEVC screen content coding and video transmission technology based on machine learning. Ad Hoc Netw. 2020, 107, 102257. [Google Scholar] [CrossRef]
  43. Choi, Y.; Joo, J. Exploration of practical HEVC/H. 265 sample adaptive offset encoding policies. IEEE Signal Process. Lett. 2014, 22, 465–468. [Google Scholar] [CrossRef]
  44. Ichigaya, A.; Nishida, Y. Required bit rates analysis for a new broadcasting service using HEVC/H. 265. IEEE Trans. Broadcast. 2016, 62, 417–425. [Google Scholar] [CrossRef]
  45. Chen, Y.; Chen, K.; Yuan, S.; Kuo, S. Moving Object Counting Using a Tripwire in H.265/HEVC Bitstreams for Video Surveillance. IEEE Access 2016, 4, 2529–2541. [Google Scholar] [CrossRef]
  46. Hsieh, J.; Cai, J.; Wang, Y.; Guo, Z. ML-Assisted DVFS-Aware HEVC Motion Estimation Design Scheme for Mobile APSoC. IEEE Syst. J. 2019, 13, 4464–4473. [Google Scholar] [CrossRef]
  47. Singhadia, A.; Mamillapalli, M.; Chakrabarti, I. Hardware-efficient 2D-DCT/IDCT architecture for portable HEVC-compliant devices. IEEE Trans. Consum. Electron. 2020, 66, 203–212. [Google Scholar] [CrossRef]
  48. Coding, H.E.V.; Series, H. Audiovisual and Multimedia Systems. In Infrastructure of Audiovisual Services Coding Moving Video; ITU Recommendation: Geneva, Switzerland, 2013; Volume 265. [Google Scholar]
  49. Jafarkhani, H. Space-Time Coding: Theory and Practice; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
  50. Kułakowski, P. The Multiple-Input Multiple-Output Systems in Slow and Fast Varying Radio Channels. Ph.D. Thesis, AGH University of Science and Technology, Kraków, Poland, 2006. [Google Scholar]
  51. Tarokh, V.; Seshadri, N.; Calderbank, A.R. Space-time codes for high data rate wireless communication: Performance criterion and code construction. IEEE Trans. Inf. Theory 1998, 44, 744–765. [Google Scholar] [CrossRef] [Green Version]
  52. Su, W.; Safar, Z.; Liu, K.R. Space-time signal design for time-correlated Rayleigh fading channels. IEEE Int. Conf. Commun. 2003, 5, 3175–3179. [Google Scholar]
  53. Su, W.; Xia, X.G. On space-time block codes from complex orthogonal designs. Wirel. Pers. Commun. 2003, 25, 1–26. [Google Scholar] [CrossRef]
  54. El-Hajjar, M.H. Near-Capacity MIMOs Using Iterative Detection. Ph.D. Thesis, University of Southampton, Southampton, UK, 2008. [Google Scholar]
  55. Ohm, J.R.; Sullivan, G.J.; Schwarz, H.; Tan, T.K.; Wiegand, T. Comparison of the coding efficiency of video coding standards—Including high efficiency video coding (HEVC). IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1669–1684. [Google Scholar] [CrossRef]
  56. Sullivan, G.J.; Boyce, J.M.; Chen, Y.; Ohm, J.R.; Segall, C.A.; Vetro, A. Standardized extensions of high efficiency video coding (HEVC). IEEE J. Sel. Top. Signal Process. 2013, 7, 1001–1016. [Google Scholar] [CrossRef]
  57. Sullivan, G.J.; Ohm, J.R.; Han, W.J.; Wiegand, T. Overview of the High Efficiency Video Coding (HEVC) Standard. IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1649–1668. [Google Scholar] [CrossRef]
  58. Pourazad, M.T.; Doutre, C.; Azimi, M.; Nasiopoulos, P. HEVC: The new gold standard for video compression: How does HEVC compare with H. 264/AVC? IEEE Consum. Electron. Mag. 2012, 1, 36–46. [Google Scholar] [CrossRef]
  59. Telecommunication Standardization Sector. ITU-T Recommendation H. 265: High Efficiency Video Coding; Telecommunication Standardization Sector: Geneva, Switzerland, 2013. [Google Scholar]
  60. El-Hajjar, M.; Alamri, O.; Ng, S.X.; Hanzo, L. Turbo Detection of Precoded Sphere Packing Modulation Using Four Transmit Antennas for Differential Space-Time Spreading. IEEE Trans. Wirel. Commun. 2008, 7, 943–952. [Google Scholar] [CrossRef] [Green Version]
  61. Adrat, M.; Vary, P. Iterative source-channel decoding: Improved system design using EXIT charts. EURASIP J. Adv. Signal Process. 2005, 2005, 178541. [Google Scholar] [CrossRef] [Green Version]
  62. Ten Brink, S. Convergence behavior of iteratively decoded parallel concatenated codes. IEEE Trans. Commun. 2001, 49, 1727–1737. [Google Scholar] [CrossRef]
  63. RECOMMENDATION ITU-R BT. Methodology for the Subjective Assessment of the Quality of Television Pictures; International Telecommunication Union: Geneva, Switzerland, 2002. [Google Scholar]
  64. Huynh-Thu, Q.; Ghanbari, M. Scope of validity of PSNR in image/video quality assessment. Electron. Lett. 2008, 44, 800–801. [Google Scholar] [CrossRef]
Figure 1. Differential Space Time Spreading Scheme.
Figure 1. Differential Space Time Spreading Scheme.
Entropy 23 00562 g001
Figure 2. Space Time Spreading Technique.
Figure 2. Space Time Spreading Technique.
Entropy 23 00562 g002
Figure 3. Proposed Systems (a) NCC; (b) NCDSTS-SP; (c) CDSTS-SP.
Figure 3. Proposed Systems (a) NCC; (b) NCDSTS-SP; (c) CDSTS-SP.
Entropy 23 00562 g003
Figure 4. HEVC division of frame into slices.
Figure 4. HEVC division of frame into slices.
Entropy 23 00562 g004
Figure 5. EXIT chart comparison for the proposed schemes.
Figure 5. EXIT chart comparison for the proposed schemes.
Entropy 23 00562 g005
Figure 6. Video Sequences (a) MOBILE; (b) FOREMAN; (c) AKIYO.
Figure 6. Video Sequences (a) MOBILE; (b) FOREMAN; (c) AKIYO.
Entropy 23 00562 g006
Figure 7. PSNR and BER comparison for the proposed schemes.
Figure 7. PSNR and BER comparison for the proposed schemes.
Entropy 23 00562 g007aEntropy 23 00562 g007b
Figure 8. PSNR comparison for the proposed schemes.
Figure 8. PSNR comparison for the proposed schemes.
Entropy 23 00562 g008aEntropy 23 00562 g008b
Figure 9. BER and PSNR comparison for the proposed schemes.
Figure 9. BER and PSNR comparison for the proposed schemes.
Entropy 23 00562 g009
Table 1. Parameters settings of the source encoder.
Table 1. Parameters settings of the source encoder.
Video CodingParameters
Compression standardH.264 & H.265
Date-Rate for both H.264 and H.26564 kbps
Frame-Rate15 fps
Number of Slices Per frame in H.2649
Macro-Blocks Per Slice in H.26411
Macro-Blocks per frame in Intra-frame in H.2643
Profile in H.264Extended
IntraPeriod of I-pictures15
PartitionMode in H.2643 Partitions per Slice
Entropy coding method in H.264UVLC
SliceMode in H.264Fixed # of MBs in slice
SOA1
Slice Mode1
Number of CTUs in slice 1 and slice 299
MaxCUWidth64
MaxCUHeight64
Quantization Parameter32
Table 2. Probabilities of Symbols with OCM Rate-1 and Rate- 3 4 .
Table 2. Probabilities of Symbols with OCM Rate-1 and Rate- 3 4 .
SymbolsOCM Rate-1OCM Rate- 3 4
0000000000
0010011001
0100101010
0110110011
1001001100
1011010101
1101100110
1111111111
Table 3. Parameters settings of the transceivers.
Table 3. Parameters settings of the transceivers.
SystemParameters
NCCNCDSTS-SPCDSTS-SP
Ourter CodeRate-1 OCMRate-1 OCMRate-3/4 OCM
Inner CodeRate-1/3 RSCRate-1/3 RSCRate-4/9 RSC
ModulationQPSKSPSP
MIMO SchemeNillDSTSDSTS
Tx122
Rx111
ChannelRayleighRayleighRayleigh
Doppler Frequency0.010.010.01
Table 4. Code rate and modulation technique in the prposed models.
Table 4. Code rate and modulation technique in the prposed models.
SchemeBit RateModulation
Outer CodeInner Code
NCCRate-1 OCMRate-1/3 RSC codeQPSK
NCDSTS-SPRate-1 OCMRate-1/3 RSC codeSphere Packing
CDSTS-SPRate-3/4 OCMRate-4/9 RSC codeSphere Packing
Table 5. Analysis of different sampling formats and video sequences.
Table 5. Analysis of different sampling formats and video sequences.
Sampling FormatVideo Sequence
Resolution
Luminance (Y)
Resolution
Luminance
Bits per Frame
Chrominance
(Cb & Cr)
Resolution
Chrominance
Bits per Frame
YUV(4:4:4)QCIF176 × 144608,256176 × 144608,256
CIF352 × 2882,433,024352 × 2882,433,024
4CIF704 × 5769,732,096704 × 5769,732,096
YUV(4:2:2)QCIF176 × 144405,50488 × 144202,752
CIF352 × 2881,422,016176 × 288811,008
4CIF704 × 5766,488,064352 × 5763,244,032
YUV(4:2:0)QCIF176 × 144304,12888 × 7276,032
CIF352 × 2881,216,512176 × 144304,128
4CIF704 × 5764,866,044352 × 2881,216,512
Video Sequence (VS)FramesFrame rateReason for a selection
AIYO4515 fpsLow motion and dynamism
FOREMANMedium motion and dynamism
MOBILEHigh motion and dynamism
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Minallah, N.; Ullah, K.; Frnda, J.; Hasan, L.; Nedoma, J. On the Performance of Video Resolution, Motion and Dynamism in Transmission Using Near-Capacity Transceiver for Wireless Communication. Entropy 2021, 23, 562. https://doi.org/10.3390/e23050562

AMA Style

Minallah N, Ullah K, Frnda J, Hasan L, Nedoma J. On the Performance of Video Resolution, Motion and Dynamism in Transmission Using Near-Capacity Transceiver for Wireless Communication. Entropy. 2021; 23(5):562. https://doi.org/10.3390/e23050562

Chicago/Turabian Style

Minallah, Nasru, Khadem Ullah, Jaroslav Frnda, Laiq Hasan, and Jan Nedoma. 2021. "On the Performance of Video Resolution, Motion and Dynamism in Transmission Using Near-Capacity Transceiver for Wireless Communication" Entropy 23, no. 5: 562. https://doi.org/10.3390/e23050562

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop