Article

Iterative Algorithm for Parameterization of Two-Region Piecewise Uniform Quantizer for the Laplacian Source

1 Faculty of Electronic Engineering, University of Nis, Aleksandra Medvedeva 14, 18000 Nis, Serbia
2 Department of Mobile Network Nis, Telekom Srbija, Vozdova 11, 18000 Nis, Serbia
* Author to whom correspondence should be addressed.
Mathematics 2021, 9(23), 3091; https://doi.org/10.3390/math9233091
Submission received: 11 November 2021 / Revised: 26 November 2021 / Accepted: 27 November 2021 / Published: 30 November 2021
(This article belongs to the Special Issue Coding and Combinatorics)

Abstract

Motivated by the fact that uniform quantization is not suitable for signals having non-uniform probability density functions (pdfs), such as the Laplacian pdf, in this paper we have divided the support region of the quantizer into two disjoint regions and utilized the simplest uniform quantization with equal bit-rates within both regions. In particular, we assumed a narrow central granular region (CGR) covering the peak of the Laplacian pdf and a wider peripheral granular region (PGR) where the pdf is predominantly tailed. We performed optimization of the widths of the CGR and PGR via distortion optimization per border–clipping threshold scaling ratio, which resulted in an iterative formula enabling the parameterization of our piecewise uniform quantizer (PWUQ). For medium and high bit-rates, we demonstrated the advantage of our PWUQ over the uniform quantizer, paying special attention to the case where 99.99% of the signal amplitudes belong to the support region or clipping region. We believe that the resulting formulas for PWUQ design and performance assessment are greatly beneficial in neural networks, where weights and activations are typically modelled by the Laplacian distribution and where uniform quantization is commonly used to decrease the memory footprint.

1. Introduction

One of the growing interests in neural networks (NNs) is directed towards the efficient representation of weights and activations by means of quantization [1,2,3,4,5,6,7,8,9,10,11,12,13,14]. Quantization, as a bit-width compression method, is a desirable mechanism that can dictate the entire NN performance [10,12]. In other words, the overall network complexity reduction provided by the quantization process can lead to a commensurately reduced overall accuracy if the pathway toward this reduction is not chosen prudently. Quantization is significantly beneficial for NN implementation on resource-limited devices since it is capable of fitting the whole NN model into the on-chip memory of edge devices, such that the high overhead incurred by off-chip memory access can be mitigated [9]. Namely, the standard implementation of NNs assumes a 32-bit full-precision (FP32) representation of NN parameters, requiring complex and expensive hardware. By quantizing FP32 weights and activations with low bit-widths, that is, by thoughtfully choosing a quantizer model for NN parameters, one can significantly reduce the bit-width required for the digital representation of NN parameters, greatly reducing the overall complexity of the NN while degrading the network accuracy to some extent [2,3,5,6,8,9]. For that reason, a number of new quantizer models and quantization methodologies have been proposed, for instance in [4,5,11,13], with the main objective of enabling quantized NNs to attain almost the same, or only slightly degraded, accuracy compared to their full-precision counterparts.
In general, to optimize a quantizer model, one has to know the statistical distribution of the input signal, allowing the quantizer to be adapted as well as possible to the statistical characteristics of the signal itself. The symmetric Laplacian probability density function (pdf), with its pronounced peak and heavy tails, has been successfully used for modelling signals in many practical applications [8,10,13,15,16,17]. Furthermore, it is arguably the most suitable pdf form for speech and audio signals because it fits many distinctive attributes of these signals [15,16,17,18,19]. In addition, transformed signals and other quantities derived from original signals often follow the Laplacian pdf [16]. As it is commonly encountered in many applications, in this paper we favor signal modelling by the Laplacian pdf.
It is well known that a nonuniform quantizer model, well accommodated to the signal's amplitude dynamics and a nonuniform pdf, has a lower quantization error compared to the uniform quantizer (UQ) model with an equal number of quantization levels or equal bit-rates [2,11,13,18,20,21,22,23,24,25,26,27]. However, due to the fact that the UQ is the simplest quantizer model, it has been intensively studied, for instance in [23,24,28,29,30,31,32]. Moreover, the high complexity of nonuniform quantizers can outweigh their potential performance advantages over uniform quantizers [21]. Substantial progress in this direction might come from the usage of one well-designed PWUQ, composed of an optimized pair of UQs with equal bit-rates, capable of accommodating the statistical characteristics of the assumed Laplacian pdf in the predefined amplitude regions. For that reason, in this paper we address the parameterization of a PWUQ with the goal of providing beneficial performance improvements over the existing UQ solutions.
By following [24,25], we adopt the definition of the quantizer's support region as a region separating the signal amplitudes into a granular region and an overload region, or alternatively, into an inner and an outer region. For a symmetric quantizer, such as the one we propose in this paper, these regions are separated by the support region thresholds, or clipping thresholds, denoted by ±xclip (see Figure 1). These thresholds have particular real values that define the quantizer's support region [−xclip, xclip], where the distortion due to quantization and clipping is bounded. The problem of determining the value of xclip of the binary scalar quantizer for the assumed Laplacian pdf has recently been addressed in [33]. Since the Laplacian pdf is a long-tailed pdf, a fair percentage of the samples are concentrated around the mean value, whereas a small percentage of the samples lie in the granular region near the support region threshold, or outside the granular region. Observe that the shrinkage of the support region for a fixed number of quantization levels results in a reduction of the granular distortion, while at the same time causing an unwanted increase of the overload distortion [22]. On the other hand, for the given number of quantization levels, with the increase of the support region width, the overload region, and hence the overload distortion, is reduced at the expense of an increase in the granular distortion. In that regard, the main trade-off in quantizer design is making the support region width large enough to accommodate the signal's amplitude dynamics while keeping it small enough to minimize the quantizer distortion.
By inspecting the specific features of the Laplacian pdf, we propose a novel PWUQ whose granular region is wide enough so that the overload distortion can be nullified, and is divided into two non-overlapping regions, the central granular region (CGR) and the peripheral granular region (PGR), with the simplest uniform quantization utilized within each of them. In particular, we assume a narrow region around the mean for the CGR, covering the peak of the Laplacian pdf, while for the PGR we specify a wider region, that is, the rest of the granular region where the pdf is tailed. These two regions are separated by the granular region border thresholds, denoted by ±xb, that are symmetrically placed around the mean. Since our goal is to minimize the overall distortion, especially the granular distortion, the parameterization of our PWUQ so that most of the samples (99.99%) from the assumed Laplacian pdf belong to the granular region can indeed be considered as a ubiquitous optimization task. Nevertheless, we have found a convenient iterative manner to solve this task, explained in detail below.
In brief, the novelty of this paper in the field of quantization is reflected in the following:
- by a studious inspection of the shape of the Laplacian pdf, a novel idea with the partition of the amplitude range of the quantizer into two regions, CGR and PGR, is proposed;
- for given xclip values, the widths of these two regions are optimized using our iterative algorithm so that the distortion of the PWUQ is minimal;
- the simplest model of a uniform quantizer is exploited and applied with equal bit-rates in each of the two regions, which makes the design of our model much simpler compared to many non-uniform quantizer models that are available in the literature (for instance, see [21,22,27]);
- a significant gain in SQNR is achieved in relation to the uniform quantizer, which justifies the meaningfulness of our idea.
The paper is organized as follows: Section 2 describes the iterative algorithm for the parameterization of our novel PWUQ that is optimized for the given support region threshold and the assumed Laplacian pdf. Section 3 discusses the performance gain achieved with the proposed quantizer compared to the UQ. Section 4 summarizes and concludes our research results.

2. Iterative Parameterization of PWUQ

At the beginning of this section, we briefly recall the basic theory of quantization. An N-level quantizer QN is defined by the mapping QN: ℝ → Y [25], where ℝ is the set of real numbers, Y = {y−N/2, …, y−1, y1, …, yN/2} ⊂ ℝ is the code book of size N containing the representation levels yi, N = 2r, and r is the bit-rate. With the N-level quantizer QN, ℝ is partitioned into N granular cells ℜi of bounded width and two unbounded overload cells. The i-th granular cell is given by ℜi = {x | x ∈ [−xclip, xclip], QN(x) = yi}, where it holds that ℜi ∩ ℜj = ∅ for i ≠ j. In other words, yi specifies the i-th codeword and is the only representative of all real values x from ℜi.
Let us define our novel symmetrical PWUQ, which consists of two UQs with the same number of quantization levels N/2 = K. One quantizer is utilized for the quantization of amplitudes belonging to the CGR [−xb, xb] and the second one is used for the PGR [−xclip, −xb) ∪ (xb, xclip] (see Figure 1). Let us also assume that the amplitudes belonging to the two overload cells, that is, to (−∞, −xclip) ∪ (xclip, +∞), are clipped. Further, we define the CGR and PGR as:
\mathrm{CGR} = \bigcup_{i=-K/2}^{-1}\Re_i \cup \bigcup_{i=1}^{K/2}\Re_i = [-x_{K/2},\,x_{K/2}], \qquad \mathrm{PGR} = \bigcup_{i=-K}^{-K/2-1}\Re_i \cup \bigcup_{i=K/2+1}^{K}\Re_i = [-x_K,\,-x_{K/2}) \cup (x_{K/2},\,x_K],   (1)
which are separated by the border thresholds as denoted by ±xb, for which it holds ±xb = ±xK/2. Due to the symmetry of the Laplacian pdf of zero mean and variance σ2 = 1, for which we design our PWUQ:
p(x) = \frac{\exp\{-\sqrt{2}\,|x|/\sigma\}}{\sqrt{2}\,\sigma}\Big|_{\sigma^2 = 1} = \frac{\exp\{-\sqrt{2}\,|x|\}}{\sqrt{2}},   (2)
decision thresholds and representation levels of our quantizer are symmetrically placed about the mean value. Without loss of generality, we restrict our attention to the K positive counterparts, whereas the negative counterparts trivially follow from the symmetry
x_{-i} = -x_i, \quad i = 1, 2, \ldots, K.   (3)
In particular, the non-negative decision thresholds of our PWUQ, consisting of two UQs with the same number of quantization levels N/2 = K, are calculated for given K, xclip, and xb from:
x_i = \begin{cases} i\,\dfrac{2x_b}{K}, & i = 0, 1, \ldots, \dfrac{K}{2}, \\[4pt] x_b + \dfrac{2\left(x_{\mathrm{clip}} - x_b\right)}{K}\left(i - \dfrac{K}{2}\right), & i = \dfrac{K}{2}+1, \dfrac{K}{2}+2, \ldots, K. \end{cases}   (4)
In other words, both regions, the CGR covering [−xb, xb] and the PGR covering [−xclip, −xb) ∪ (xb, xclip], are symmetrically partitioned into K uniform cells (see Figure 2, where the transfer characteristic of the PWUQ is shown for N = 16). Due to symmetry, this equally means that each of the regions [0, xb] and (xb, xclip] is partitioned into K/2 uniform cells. Since these two UQs compose our PWUQ, the notation of the decision thresholds in the CGR ends with index K/2, whereas the index of the decision thresholds in the PGR increases up to K.
In what follows, we describe the other parameters of our PWUQ, provide the derivation of the distortion, and perform its optimization per ψ = xb/xclip for a given xclip value by using the iterative algorithm. In other words, for a given xclip value, we perform the optimization of the widths of the CGR and PGR via distortion optimization per border–clipping threshold scaling ratio ψ, and we end up with an iterative formula enabling the concrete design of our PWUQ.
Let us define the quantization step sizes ΔCGR and ΔPGR as the uniform widths of the cells ℜi from ℜCGR and ℜPGR, respectively:
\Delta_{\mathrm{CGR}} = \frac{2x_b}{K} = \frac{2\psi x_{\mathrm{clip}}}{K},   (5)
\Delta_{\mathrm{PGR}} = \frac{2\left(x_{\mathrm{clip}} - x_b\right)}{K} = \frac{2(1-\psi)\,x_{\mathrm{clip}}}{K}.   (6)
We introduce the parameter ψ, that is, the border–clipping threshold scaling ratio, specified as ψ = xb/xclip, for which, in accordance with our PWUQ model definition (see Figure 1), it holds that ψ < 1, or more precisely ψ < 0.5. Observing the area under the Laplacian pdf, we opt to increase the quantization step size in ℜPGR since this is counterbalanced by a corresponding diminution of the step size in ℜCGR. Our symmetrical PWUQ maps a real value x ∈ ℝ to one of the representation levels yi, where it holds that:
y_{-i} = -y_i, \quad i = 1, 2, \ldots, K,   (7)
and yi is determined as the midpoint of the corresponding quantization cell, ℜi ∈ ℜCGR for i = 1, …, K/2 and ℜi ∈ ℜPGR for i = K/2 + 1, …, K:
y_i = \begin{cases} \left(i - \dfrac{1}{2}\right)\Delta_{\mathrm{CGR}}, & i = 1, \ldots, \dfrac{K}{2}, \\[4pt] x_b + \left(i - \dfrac{K+1}{2}\right)\Delta_{\mathrm{PGR}}, & i = \dfrac{K}{2}+1, \dfrac{K}{2}+2, \ldots, K. \end{cases}   (8)
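To make the parameterization concrete, the following Python sketch builds the non-negative decision thresholds (4) and representation levels (8), together with the transfer characteristic shown in Figure 2, for given N, xclip, and xb; the function names pwuq_thresholds_and_levels and pwuq_quantize, as well as the numerical values in the example (which correspond to the GR case discussed in Section 3), are chosen here only for illustration.

def pwuq_thresholds_and_levels(n_levels, x_clip, x_b):
    """Non-negative decision thresholds (4) and representation levels (8)
    of the two-region PWUQ; negative counterparts follow from (3) and (7)."""
    k = n_levels // 2                       # the two UQs have K = N/2 levels each
    delta_cgr = 2.0 * x_b / k               # cell width in the CGR, Eq. (5)
    delta_pgr = 2.0 * (x_clip - x_b) / k    # cell width in the PGR, Eq. (6)
    thresholds, levels = [], []
    for i in range(k + 1):                  # x_0 = 0, ..., x_{K/2} = x_b, ..., x_K = x_clip
        if i <= k // 2:
            thresholds.append(i * delta_cgr)
        else:
            thresholds.append(x_b + (i - k // 2) * delta_pgr)
    for i in range(1, k + 1):               # y_1, ..., y_K are the cell midpoints
        if i <= k // 2:
            levels.append((i - 0.5) * delta_cgr)
        else:
            levels.append(x_b + (i - (k + 1) / 2.0) * delta_pgr)
    return thresholds, levels

def pwuq_quantize(x, x_clip, thresholds, levels):
    """Transfer characteristic of the PWUQ (cf. Figure 2): a real sample x
    is clipped to [-x_clip, x_clip] and mapped to its representation level."""
    sign = -1.0 if x < 0 else 1.0
    a = min(abs(x), x_clip)
    for i in range(1, len(thresholds)):
        if a <= thresholds[i]:
            return sign * levels[i - 1]
    return sign * levels[-1]

# Example: N = 16 levels, x_clip = 6.5127 and x_b = 0.2674 * x_clip (GR case)
thr, lev = pwuq_thresholds_and_levels(16, 6.5127, 0.2674 * 6.5127)
print(pwuq_quantize(0.3, 6.5127, thr, lev))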
Determining the decision thresholds and representation levels of the quantizer specifies its entire performance. If they are chosen more suitably, the overall distortion is smaller, which translates into a reduction in the number of bits required by the quantizer to achieve a certain distortion. The main idea behind our PWUQ design is to improve the overall quantization performance by the prudent application of the simplest uniform quantization. To assess the performance of our PWUQ for a given bit-rate, r (r = log2N), and a given support region threshold, we have to specify its distortion. Specifically, in accordance with our PWUQ model, we have to determine the sum of the granular distortions originating from quantization in ℜCGR and ℜPGR, that is, to determine DgCGR and DgPGR:
D_g^{\mathrm{PWUQ}} = D_g^{\mathrm{CGR}} + D_g^{\mathrm{PGR}},   (9)
D_g^{\mathrm{PWUQ}} = \frac{1}{12}\left(\left(\Delta_{\mathrm{CGR}}\right)^2 P_{\mathrm{CGR}} + \left(\Delta_{\mathrm{PGR}}\right)^2 P_{\mathrm{PGR}}\right).   (10)
PCGR and PPGR denote the probabilities that the input sample x belongs to ℜCGR and ℜPGR, respectively:
P_{\mathrm{CGR}} = 2\int_{0}^{x_b} p(x)\,dx,   (11)
P_{\mathrm{PGR}} = 2\int_{x_b}^{x_{\mathrm{clip}}} p(x)\,dx.   (12)
Let us recall that the mean is a measure of central tendency and that it specifies where the values of x tend to cluster, whereas the standard deviation σ indicates how the data are spread out from the mean and forms the measure of dispersion [25]. Also, let us recall that, on an unbounded amplitude domain, the cumulative distribution function (CDF), denoted as FCDF, is given by:
F_{\mathrm{CDF}}(b) = \int_{-\infty}^{b} p(x)\,dx,   (13)
where the CDF satisfies FCDF(∞) = 1. For a symmetric pdf, such as the pdf given in (2), it holds that:
\Phi(-b, b) = F_{\mathrm{CDF}}(b) - F_{\mathrm{CDF}}(-b) = 2\int_{0}^{b} p(x)\,dx,   (14)
where Φ(−b, b) is the probability that the value of the input sample x having pdf p(x) belongs to the given interval [−b, b]. By invoking (2), (13), and (14) for PCGR and PPGR, we have:
P_{\mathrm{CGR}} = \Phi(-x_b, x_b) = 2\int_{0}^{x_b} p(x)\,dx = 1 - \exp\{-\sqrt{2}\,x_b\},   (15)
P_{\mathrm{PGR}} = \Phi(-x_{\mathrm{clip}}, x_{\mathrm{clip}}) - \Phi(-x_b, x_b) = 2\int_{x_b}^{x_{\mathrm{clip}}} p(x)\,dx = \exp\{-\sqrt{2}\,x_b\} - \exp\{-\sqrt{2}\,x_{\mathrm{clip}}\}.   (16)
Substituting (5), (6), (15) and (16) in (10) yields
D_g^{\mathrm{PWUQ}} = C\left[\psi^2\left(1 - \exp\{-\sqrt{2}\,\psi x_{\mathrm{clip}}\}\right) + (1-\psi)^2\left(\exp\{-\sqrt{2}\,\psi x_{\mathrm{clip}}\} - \exp\{-\sqrt{2}\,x_{\mathrm{clip}}\}\right)\right],   (17)
where C = 4xclip²/(3N²). By further setting the first derivative of DgPWUQ with respect to ψ equal to zero:
\frac{\partial D_g^{\mathrm{PWUQ}}}{\partial \psi} = 0,   (18)
we obtain:
x_b = \psi x_{\mathrm{clip}} = \frac{1}{\sqrt{2}}\,\ln\!\left[\frac{1 + \frac{x_{\mathrm{clip}}}{\sqrt{2}}\left(1 - 2\psi\right)}{\psi + (1-\psi)\exp\{-\sqrt{2}\,x_{\mathrm{clip}}\}}\right].   (19)
From (19) and the discussion about determining xb, or equivalently ψ, the application of an iterative numerical method is required:
\psi^{(i)} = \frac{1}{\sqrt{2}\,x_{\mathrm{clip}}}\,\ln\!\left(\frac{1 + \frac{x_{\mathrm{clip}}}{\sqrt{2}}\left(1 - 2\psi^{(i-1)}\right)}{\psi^{(i-1)} + \left(1 - \psi^{(i-1)}\right)\exp\{-\sqrt{2}\,x_{\mathrm{clip}}\}}\right).   (20)
Taking the second derivative of (17) with respect to ψ yields:
\frac{\partial^2 D_g^{\mathrm{PWUQ}}}{\partial \psi^2} = 1 - \exp\{-\sqrt{2}\,x_{\mathrm{clip}}\} + x_{\mathrm{clip}}\exp\{-\sqrt{2}\,\psi x_{\mathrm{clip}}\}\left[2\sqrt{2} + x_{\mathrm{clip}}(1 - 2\psi)\right].   (21)
As it holds that xb ≤ xclip/2, we have 2√2 + xclip(1 − 2ψ) ≥ 2√2, so we can conclude that DgPWUQ is a convex function of ψ, and subsequently of the border threshold xb, where xb ∈ (0, xclip/2]. In other words, as ∂²DgPWUQ/∂ψ² > 0 holds, DgPWUQ is also a convex function of the border threshold xb, so that for the given xclip one unique optimal value of xb and one unique optimal value of ψ exist that minimize DgPWUQ.
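The convexity claim can also be checked numerically. The following sketch (with the illustrative helper name pwuq_granular_distortion, chosen here only for illustration) evaluates the granular distortion (17) on a grid of ψ values and locates its minimizer, which is expected to coincide with the fixed point of (20).

import math

def pwuq_granular_distortion(psi, x_clip, n_levels):
    """Granular distortion of the PWUQ, Equation (17), for the zero-mean,
    unit-variance Laplacian pdf."""
    c = 4.0 * x_clip ** 2 / (3.0 * n_levels ** 2)
    e_b = math.exp(-math.sqrt(2.0) * psi * x_clip)   # exp(-sqrt(2) * x_b)
    e_c = math.exp(-math.sqrt(2.0) * x_clip)         # exp(-sqrt(2) * x_clip)
    return c * (psi ** 2 * (1.0 - e_b) + (1.0 - psi) ** 2 * (e_b - e_c))

# Grid search over psi in (0, 0.5]; a single minimum is expected (convexity)
x_clip, n = 6.5127, 256
grid = [i / 10000.0 for i in range(1, 5001)]
psi_min = min(grid, key=lambda p: pwuq_granular_distortion(p, x_clip, n))
print(psi_min)   # approximately 0.2674, i.e. the fixed point of (20)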
The pseudo-code shown here (see Algorithm 1) summarizes our iterative algorithm for the concrete design and parameterization of the PWUQ for a given bit-rate and clipping threshold. PWUQ parameterization implies specifying the clipping thresholds, xclip, and iteratively determining: the border thresholds xb* and the border–clipping threshold scaling ratio ψ* = xb*/xclip; ΔCGR and ΔPGR, the uniform step sizes in ℜCGR and ℜPGR; {y−i, yi}, i = 1, 2, …, K, the symmetrical representation levels; and {x−i, xi}, i = 1, 2, …, K, the symmetrical decision thresholds.
We initialize our PWUQ model with the UQ model, that is, with xb(0) = xclip/2, to follow the performance improvement achieved by the iterative algorithm. In other words, we assume that ψ(0) = 0.5 holds, since for this ψ value and the same number of levels N, the PWUQ and UQ models are matched. We define that the iterative algorithm stopping criterion is satisfied when the absolute error
\varepsilon^{(i)} = \left|\psi^{*} - \psi^{(i-1)}\right|   (22)
is less than εmin = 10−4.
Recalling (11)–(13), we can calculate λ as the probability that an input sample x, with unrestricted pdf p(x), belongs to the granular region:
\lambda = \frac{P_{\mathrm{CGR}} + P_{\mathrm{PGR}}}{F_{\mathrm{CDF}}(\infty)} = P_{\mathrm{CGR}} + P_{\mathrm{PGR}}.   (23)
We should also highlight here that xclip, xb, and N have a direct effect on the distortion. If the clipping threshold xclip has a very small value, the quantization accuracy may decrease because too many samples will be clipped [24]. Note that the clipping effect nullifies the overload distortion, whereas it can indeed increase the granular distortion if the clipping threshold is not appropriately specified. Accordingly, setting a suitable clipping threshold value is crucial for achieving the best possible performance for the given quantization task. We can anticipate that, for the given clipping threshold, the border threshold xb has a large impact on the granular distortion because the CGR and PGR distortions, which compose the total granular distortion, behave oppositely with respect to the border threshold xb. In particular, for a fixed and equal number of quantization levels assumed in ℜCGR and ℜPGR, with the decrease of the border threshold value the CGR distortion is reduced at the expense of an increase in the PGR distortion. Namely, the shrinkage of ℜCGR can cause a significant distortion reduction in the CGR, while at the same time it can result in an unwanted but expected increase of the distortion in the PGR, where the pdf is tailed. Therefore, we can conclude that tuning the values of the border thresholds ±xb and the clipping thresholds ±xclip is one of the key challenges when the heavy-tailed Laplacian pdf is taken into consideration. In what follows, we will show that, for the Laplacian pdf given in (2), our PWUQ, composed of only two UQs, provides significantly better performance than a single UQ. That is, it provides a higher signal-to-quantization-noise ratio (SQNR):
\mathrm{SQNR}^{\mathrm{PWUQ}}\,[\mathrm{dB}] = 10\log_{10}\!\left(\frac{\sigma^2}{D_g^{\mathrm{PWUQ}}}\right),   (24)
compared to the UQ for the same bit-rate:
\mathrm{SQNR}^{\mathrm{UQ}}\,[\mathrm{dB}] = 10\log_{10}\!\left(\frac{\sigma^2}{D_g^{\mathrm{UQ}}}\right),   (25)
D_g^{\mathrm{UQ}} = \frac{x_{\mathrm{UQ}}^2}{3N^2}.   (26)
It is interesting to notice that, if we assume equal values of the support region thresholds, or clipping thresholds, of the PWUQ and UQ, xclip = xUQ, we end up with a simple closed-form formula providing a detailed insight into the performance gain. That is, the SQNR gain achievable with our PWUQ over the UQ can be calculated from:
\delta = 10\log_{10}\!\left(\frac{D_g^{\mathrm{UQ}}}{D_g^{\mathrm{PWUQ}}}\right) = -10\log_{10}\!\left(4\left[\psi^2\left(P_{\mathrm{CGR}} + P_{\mathrm{PGR}}\right) + (1 - 2\psi)P_{\mathrm{PGR}}\right]\right).   (27)
Obviously, for ψ = 0.5 and the same number of levels N, the PWUQ and UQ models are matched so that SQNRPWUQ is equal to SQNRUQ.
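For illustration, the gain (27) can be evaluated directly from the probabilities (15) and (16); the helper name pwuq_sqnr_gain_db in the following sketch is ours and is used only for illustration.

import math

def pwuq_sqnr_gain_db(psi, x_clip):
    """SQNR gain of the PWUQ over the UQ in dB, Equation (27),
    for the unit-variance Laplacian pdf and x_UQ = x_clip."""
    p_cgr = 1.0 - math.exp(-math.sqrt(2.0) * psi * x_clip)              # Eq. (15)
    p_pgr = (math.exp(-math.sqrt(2.0) * psi * x_clip)
             - math.exp(-math.sqrt(2.0) * x_clip))                      # Eq. (16)
    ratio = 4.0 * (psi ** 2 * (p_cgr + p_pgr) + (1.0 - 2.0 * psi) * p_pgr)
    return -10.0 * math.log10(ratio)

print(pwuq_sqnr_gain_db(0.2674, 6.5127))   # about 3.523 dB (the GR case of Section 3)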
Algorithm 1. PWUQ Laplacian (N, xclip, εmin)—iterative parameterization of PWUQ for the Laplacian pdf and the given xclip—determining ψ* and Φ (−xb*, xb*)
Input: Total number of quantization levels N, predefined clipping threshold xclip, εmin << 1
1st Output: ψ*, Φ(−xb*, xb*)
2nd Output: ΔCGR, ΔPGR, {yi, yi}, {xi, xi}, i = 1, 2, …, K
   1:  Initialize i ← 0,
   2:  ψ* ← 0.5,
   3:  ε(0) ← 0.5
   4:  while ε(i) > εmin do
   5:   ii + 1
   6:   ψ(i − 1) ← ψ*
   7:   compute ψ(i) using (20)
   8:   ψ* ← ψ(i)
   9:   compute ε(i) using (22)
   10: end while
   11: xb* ← ψ* × xclip
   12: compute Φ(−xb*, xb*) by using (15)
   13: return ψ*, Φ(−xb*, xb*)
   14: calculate ΔCGR, ΔPGR, {yi, yi}, {xi, xi}, i = 1, 2, …, K
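A minimal Python counterpart of Algorithm 1 can be sketched as follows; the function name pwuq_laplacian and the returned tuple are illustrative choices rather than a prescribed interface.

import math

def pwuq_laplacian(n_levels, x_clip, eps_min=1e-4):
    """Iterative parameterization of the PWUQ (Algorithm 1) for the
    unit-variance Laplacian pdf and a given clipping threshold."""
    psi_star, eps, i = 0.5, 0.5, 0        # initialization: PWUQ matched to the UQ
    while eps > eps_min:
        i += 1
        psi_prev = psi_star
        num = 1.0 + (x_clip / math.sqrt(2.0)) * (1.0 - 2.0 * psi_prev)
        den = psi_prev + (1.0 - psi_prev) * math.exp(-math.sqrt(2.0) * x_clip)
        psi_star = math.log(num / den) / (math.sqrt(2.0) * x_clip)   # Eq. (20)
        eps = abs(psi_star - psi_prev)                               # Eq. (22)
    x_b_star = psi_star * x_clip
    phi = 1.0 - math.exp(-math.sqrt(2.0) * x_b_star)                 # Eq. (15)
    k = n_levels // 2
    delta_cgr = 2.0 * x_b_star / k                                   # Eq. (5)
    delta_pgr = 2.0 * (x_clip - x_b_star) / k                        # Eq. (6)
    return psi_star, x_b_star, phi, delta_cgr, delta_pgr, i

# Example: r = 8 bit/sample (N = 256) with the GR clipping threshold of Section 3
print(pwuq_laplacian(256, 2.0 * math.sqrt(2.0) * math.log(10.0)))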
Let us highlight that we have performed our analysis for the Laplacian source; similar analyses can be derived for some other source. That is, some other pdf can first be specified in (2) and then substituted in (11) and (12), which further affects the derivation of the formulas starting from (15). However, not every pdf will allow the derivation of the expressions in closed form. Moreover, to provide a similar iterative algorithm as in this paper, the distortion should be a convex function of ψ, so this should be taken into consideration as well. In brief, as the Laplacian source is one of the most widely used sources, it can be concluded that the analysis presented in this paper is indeed significant.
Let us also highlight that our piecewise uniform quantizer, with the border threshold between the two segments determined iteratively, can be considered a piecewise linear quantizer if we consider its realization from the standpoint of a companding technique. Taking this fact into account, our novel model can be related to adaptive rejection sampling, as used in [34,35,36,37,38,39] for the generation of samples from a given pdf, where piecewise linearization is performed with linear segments as tangents on the log pdf and with iteratively calculated nodes specifying the linear segments. This approach with piecewise linear approximation is especially useful for pdfs such as the Gaussian, since for the Gaussian pdf one cannot solve the integrals provided in our analysis in closed form. Due to the widespread utilization of both the Laplacian and Gaussian pdfs, our future work will be focused on designing a quantizer based on the piecewise linear approximation of the Gaussian pdf by using a technique similar to the one proposed in [34,35,36,37,38,39].

3. Numerical Results

The most important step in designing our PWUQ is determining the value of its key parameter ψ. For a given xclip, we can calculate ψ* using the above algorithm. Then we can calculate xb* = ψ*xclip and the other parameters of the PWUQ, as well as its performance (SQNRPWUQ). Since the Laplacian pdf is long-tailed and accordingly persistently unbounded, we can assume that most of the pdf's samples are in the granular region for λGR = PCGR + PPGR = 0.9999. We analyse the case where the values of the clipping and border thresholds xclipGR and xbGR, as well as of ψGR, are determined from λGR = 0.9999, where the GR notation indicates the granular region. By invoking (15) and (16) for λGR = 0.9999, we have:
x_{\mathrm{clip}}^{\mathrm{GR}} = -\frac{1}{\sqrt{2}}\,\ln\!\left(1 - \lambda^{\mathrm{GR}}\right) = 2\sqrt{2}\,\ln(10).   (28)
For medium and high bit-rates, where r ranges from 5 bit/sample to 8 bit/sample, we have calculated ψ*, xb* = ψ*xclip, Φ(−xb*, xb*), λ*, SQNRUQ, SQNRPWUQ, and δ for different xclip values (see Table 1). In particular, along with xclipGR, as determined from (28), we assume xclip values as determined in [24,25] by Hui and Jayant, respectively. To distinguish between these three different cases, we use the notation [J], [H], and [GR] (or just GR) in the line or in the superscript. As we can see from Figure 2, and as simply follows from (5) and (6) (ΔCGR/ΔPGR = ψ*/(1 − ψ*)), it holds that ΔPGR ≈ 2ΔCGR, so that from K(ΔPGR + ΔCGR) = 2K ΔUQ it follows that ΔUQ ≈ 1.5ΔCGR and ΔUQ ≈ 0.75ΔPGR. Accordingly, we show that, in comparison to the UQ, more precise quantization is enabled in ℜCGR, to which most of the samples from the assumed Laplacian pdf belong, while in ℜPGR, where the pdf is predominantly tailed, the UQ provides slightly better performance since ΔUQ < ΔPGR. In other words, keeping in mind the shape of the Laplacian pdf, we show that it makes sense to favor the more meticulous quantization of the dominant number of samples concentrated around the mean, belonging to ℜCGR. Since we assume ψ* < 0.5, from ΔCGR/ΔPGR = ψ*/(1 − ψ*) we can write ΔCGR < ΔPGR, proving that in the narrower ℜCGR, to which most of the samples from the assumed Laplacian pdf belong, smaller quantization errors indeed occur. In brief, by taking into account the shape of the assumed Laplacian pdf and the manner in which we perform the parameterization of our PWUQ, the overall gain in SQNR achieved with the PWUQ over the UQ can be completely justified.
Let us observe that for xclip calculated from (28), so that λGR = 0.9999, and for ε < 10−4, ψGR amounts to 0.2674 and does not depend on the bit-rate. Namely, we have calculated the following fixed values: xclip = xclipGR = 6.5127, ψGR = 0.2674, xbGR = 1.74150, and Φ(−xbGR, xbGR) = 0.9148, whereas SQNRPWUQ strictly depends on r. It is interesting to notice that, although SQNRPWUQ depends on r, the gain in SQNR achieved by the PWUQ over the UQ in the so-called GR case is also constant and amounts to 3.523 dB (see Figure 3 for the GR case). To justify this constant gain in SQNR, we can observe (27), from which our conclusion trivially follows for the given fixed ψGR, xclipGR, and xbGR values. Eventually, we can notice that for r = 8 bit/sample, SQNRPWUQ[GR], determined for xclip = xclipGR = 6.5127, is higher than SQNRPWUQ[H] [24] and SQNRPWUQ[J] [25], which can be justified by the suitable clipping effect performed in the GR case in accordance with the assumption that λGR = 0.9999 holds, and also by the fact that in [24,25] the optimization of the support region threshold, here considered as a clipping threshold, has been performed without nullifying the overload distortion, i.e., by avoiding a clipping effect.
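Under the same assumptions, these GR-case figures can be reproduced with a short computation based on (17) and (24)–(28); the sketch below is ours and simply reuses the fixed value ψGR = 0.2674 reported above.

import math

# GR case: choose x_clip so that 99.99% of the Laplacian samples fall into
# the granular region, Eq. (28), and verify the bit-rate-independent gain.
lam_gr = 0.9999
x_clip_gr = -math.log(1.0 - lam_gr) / math.sqrt(2.0)   # = 2*sqrt(2)*ln(10), about 6.5127
psi_gr = 0.2674                                        # fixed point of (20) for this x_clip
x_b_gr = psi_gr * x_clip_gr                            # about 1.7415

for r in range(5, 9):                                  # r = 5, ..., 8 bit/sample
    n = 2 ** r
    d_uq = x_clip_gr ** 2 / (3.0 * n ** 2)             # Eq. (26)
    e_b = math.exp(-math.sqrt(2.0) * x_b_gr)
    e_c = math.exp(-math.sqrt(2.0) * x_clip_gr)
    d_pwuq = (4.0 * x_clip_gr ** 2 / (3.0 * n ** 2)) * (
        psi_gr ** 2 * (1.0 - e_b) + (1.0 - psi_gr) ** 2 * (e_b - e_c))   # Eq. (17)
    print(r,
          10.0 * math.log10(1.0 / d_uq),               # SQNR of the UQ, Eq. (25), sigma^2 = 1
          10.0 * math.log10(1.0 / d_pwuq),             # SQNR of the PWUQ, Eq. (24)
          10.0 * math.log10(d_uq / d_pwuq))            # gain delta, about 3.523 dB, Eq. (27)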
It is worth noticing from Table 1 that samples from the Laplacian pdf are expected to be in the defined granular regions with probabilities λ*[J] = 0.9982 and λ*[H] = 0.9990 for r = 5 bit/sample, whereas for r = 8 bit/sample, λ*[J] and λ*[H] are equal to λGR = 0.9999. By calculating λ* and Φ(−xb*, xb*) for our three different cases, we confirm that, with a convenient choice of clipping values, only a few quantized samples are in the granular region near the support or clipping threshold, or outside the granular region. From Table 1, we can also conclude that the probability that samples x belong to ℜCGR grows with the bit-rate, since the values of xclip[J] and xclip[H] increase with the bit-rate. Also, we can notice that Φ(−xbGR, xbGR) shows dominance compared to Φ(−xb*[J], xb*[J]) and Φ(−xb*[H], xb*[H]) for r equal to 5 bit/sample and 6 bit/sample, whereas Φ(−xbGR, xbGR) takes values close to Φ(−xb*[J], xb*[J]) and Φ(−xb*[H], xb*[H]) for r = 7 bit/sample and r = 8 bit/sample. Since the values of Φ(−xbGR, xbGR) and xclipGR are constant, and for r = 7 bit/sample it holds that xclipGR < xclip[H], as a result we have SQNRPWUQ[GR] > SQNRPWUQ[H]. Similarly, for r = 8 bit/sample, from xclipGR < xclip[J] it follows that SQNRPWUQ[GR] > SQNRPWUQ[J]. For the highest observed bit-rate, r = 8 bit/sample, along with the values of the key design parameters and performances of the UQ and PWUQ given in the last row of Table 1 for the GR case, we present additional descriptive information in Figure 4. From Figure 4, we can notice that 91.48% of the samples belong to ℜCGR, whereas 99.99% of the samples belong to ℜCGR ∪ ℜPGR, meaning that we have determined the value of ψ = ψGR so that a huge percentage of the samples do belong to ℜCGR. In the case of r = 8 bit/sample, we have determined the smallest values of ψ* in the [J] and [H] cases, that is, the largest absolute differences from the initial value ψ(0) = 0.5. In these two cases, the number of iterations for determining ψ* ranges up to 25, where the values of ψ* match the results of the numerical distortion optimization per ψ, meaning that our algorithm converges fast.
To illustrate the importance of determining not only the optimized value of ψ but also of the choice of the xclip values, Figure 5 shows the dependences of SQNRPWUQ on ψ for r = 6 bit/sample for the three considered cases, where xclip is equal to xclipGR, xclip[H], and xclip[J]. Let us highlight that the values of the parameter ψ that result from our iterative algorithm for the three considered cases, marked with asterisks in Figure 5, are indeed optimal and give the corresponding maximum of the SQNR of the PWUQ for each of these three cases. To select one of the values for xclip for a given bit-rate, r, we can choose the one giving the highest SQNR. Eventually, we can conclude that, although the range of values for ψ is relatively narrow, its selection is very significant, since an unfavorable choice of this parameter can significantly degrade the performance of our PWUQ. This observation additionally justifies the importance of our proposal, described in this paper.

4. Summary and Conclusions

To improve upon the uniform quantizer model in terms of SQNR and to utilize the benefits provided by the simplest UQ, in this paper we have proposed a PWUQ model. This model deliberately applies uniform quantization with equal bit-rates in regions called the CGR and PGR, whose widths are optimized in an iterative manner so that, for the assumed clipping thresholds and Laplacian pdf, the distortion is minimal. In other words, we have opted to perform the parameterization of our PWUQ to achieve an increase of the quantization step size in the PGR, which is counterbalanced by a corresponding diminution of the step size in the CGR, to which most of the samples from the assumed Laplacian pdf belong. We have proved that, in comparison to the UQ, smaller quantization errors indeed occur in the CGR, resulting in a significant SQNR gain achieved by the PWUQ over the UQ. Moreover, for the three different cases of specifying the clipping threshold values, we have shown and justified that this SQNR gain originates from the optimization of the border–clipping threshold scaling factor, performed in accordance with the proposed iterative algorithm. We have shown that, although the range of values for the border–clipping threshold scaling factor is relatively narrow, its choice is very significant, since an unfavorable choice of this parameter, or an unfavorable parameterization, can significantly degrade the performance of our PWUQ. Eventually, we found that, with a convenient choice of clipping values, 99.99% of the samples from the assumed Laplacian pdf do belong to the granular region. Accordingly, we can anticipate that our PWUQ model can be deployed as an outstanding replacement for the widely used UQ, not only in traditional but also in contemporary quantized neural network solutions, where weights are typically modelled by the Laplacian distribution, as we assumed in this paper.

Author Contributions

Conceptualization and methodology, J.N. and Z.P.; software and validation, D.A. and J.N.; formal analysis, D.A.; writing—original draft preparation, J.N. and D.A.; writing—review and editing, M.D.; visualization, D.A. and J.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Science Fund of the Republic of Serbia, 6527104, AI-Com-in-AI.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank anonymous reviewers for their constructive comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Hubara, I.; Courbariaux, M.; Soudry, D.; Ran, E.Y.; Bengio, Y. Binarized Neural Networks. In Proceedings of the 30th Conference on Neural Information Processing Systems (NeurIPS 2016), Barcelona, Spain, 1–9 December 2016.
2. Lin, D.; Talathi, S.; Soudry, D.; Annapureddy, S. Fixed Point Quantization of Deep Convolutional Networks. In Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 19–24 June 2016; pp. 2849–2858.
3. Hubara, I.; Courbariaux, M.; Soudry, D.; El-Yaniv, R.; Bengio, Y. Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations. J. Mach. Learn. Res. 2017, 18, 6869–6898.
4. Huang, K.; Ni, B.; Yang, D. Efficient Quantization for Neural Networks with Binary Weights and Low Bit Width Activations. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 2–9 February 2021; pp. 3854–3861.
5. Yang, Z.; Wang, Y.; Han, K.; Xu, C.; Xu, C.; Tao, D.; Xu, C. Searching for Low-Bit Weights in Quantized Neural Networks. In Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, BC, Canada, 6–12 December 2020.
6. Véstias, M.P.; Duarte, R.P.; De Sousa, J.T.; Neto, H.C. Moving Deep Learning to the Edge. Algorithms 2020, 13, 125.
7. Uhlich, S.; Mauch, L.; Cardinaux, F.; Yoshiyama, K. Mixed precision DNNs: All you Need is a Good Parametrization. In Proceedings of the 8th International Conference on Learning Representations, Addis Ababa, Ethiopia, 26–30 April 2020.
8. Peric, Z.H.; Denic, B.D.; Savic, M.S.; Vucic, N.J.; Simic, N.B. Binary Quantization Analysis of Neural Networks Weights on MNIST Dataset. Elektron. Elektrotech. 2021, 27, 55–61.
9. Liu, D.; Kong, H.; Luo, X.; Liu, W.; Subramaniam, R. Bringing AI to Edge: From Deep Learning’s Perspective. arXiv 2020, arXiv:2011.14808.
10. Gholami, A.; Kim, S.; Dong, Z.; Yao, Z.; Mahoney, M.W.; Keutzer, K. A Survey of Quantization Methods for Efficient Neural Network Inference. arXiv 2021, arXiv:2103.13630.
11. Sanghyun, S.; Juntae, K. Efficient Weights Quantization of Convolutional Neural Networks Using Kernel Density Estimation Based Non-Uniform Quantizer. Appl. Sci. 2019, 9, 2559.
12. Guo, Y. A Survey on Methods and Theories of Quantized Neural Networks. arXiv 2018, arXiv:1808.04752.
13. Peric, Z.; Denic, B.; Dincic, M.; Nikolic, J. Robust 2-bit Quantization of Weights in Neural Network Modeled by Laplacian Distribution. Adv. Electr. Comput. Eng. 2021, 21, 3–10.
14. Baskin, C.; Zheltonozhkii, E.; Rozen, T.; Liss, N.; Chai, Y.; Schwartz, E.; Giryes, R.; Bronstein, A.M.; Mendelson, A. NICE: Noise Injection and Clamping Estimation for Neural Network Quantization. Mathematics 2021, 9, 2144.
15. Kotz, S.; Kozubowski, T.; Podgórski, K. The Laplace Distribution and Generalization: A Revisit with Applications to Communications, Economics, Engineering, and Finance; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2001.
16. Gazor, S.; Zhang, W. Speech probability distribution. IEEE Signal Process. Lett. 2003, 10, 204–207.
17. Naik, S.M.; Jagannath, R.P.K.; Kuppili, V. Bat algorithm-based weighted Laplacian probabilistic neural network. Neural Comput. Appl. 2020, 32, 1157–1171.
18. Lee, J.; Na, S. A Rigorous Revisit to the Partial Distortion Theorem in the Case of a Laplacian Source. IEEE Commun. Lett. 2017, 21, 2554–2557.
19. Delić, V.; Perić, Z.; Sečujski, M.; Jakovljević, N.; Nikolić, J.; Mišković, D.; Simić, N.; Suzić, S.; Delić, T. Speech Technology Progress Based on New Machine Learning Paradigm. Comput. Intell. Neurosci. 2019, 2019, 4273290.
20. Shlezinger, N.; Eldar, Y. Deep Task-Based Quantization. Entropy 2021, 23, 104.
21. Perić, Z.; Petković, M.; Nikolić, J. Optimization of Multiple Region Quantizer for Laplacian Source. Digit. Signal Process. 2014, 27, 150–158.
22. Perić, Z.; Aleksić, D. Quasilogarithmic Quantizer for Laplacian Source: Support Region Ubiquitous Optimization Task. Rev. Roum. Sci. Tech. 2019, 64, 403–408.
23. Jovanović, A.; Perić, Z.; Nikolić, J. Iterative Algorithm for Designing Asymptotically Optimal Uniform Scalar Quantization of the One-Sided Rayleigh Density. IET Commun. 2021, 15, 723–729.
24. Hui, D.; Neuhoff, D. Asymptotic analysis of optimal fixed-rate uniform scalar quantization. IEEE Trans. Inf. Theory 2001, 47, 957–977.
25. Jayant, S.; Noll, P. Digital Coding of Waveforms; Prentice Hall: Hoboken, NJ, USA, 1984.
26. Perić, Z.; Savić, M.; Simić, N.; Denić, B.; Despotović, V. Design of a 2-Bit Neural Network Quantizer for Laplacian Source. Entropy 2021, 23, 933.
27. Perić, Z.; Nikolić, J.; Aleksić, D.; Perić, A. Symmetric Quantile Quantizer Parameterization for the Laplacian Source: Qualification for Contemporary Quantization Solutions. Math. Probl. Eng. 2021, 2021, 6647135.
28. Na, S.; Neuhoff, D.L. Monotonicity of Step Sizes of MSE-Optimal Symmetric Uniform Scalar Quantizers. IEEE Trans. Inf. Theory 2018, 65, 1782–1792.
29. Na, S.; Neuhoff, D. On the support of MSE-optimal, fixed-rate, scalar quantizers. IEEE Trans. Inf. Theory 2001, 47, 2972–2982.
30. Na, S.; Neuhoff, D.L. On the Convexity of the MSE Distortion of Symmetric Uniform Scalar Quantization. IEEE Trans. Inf. Theory 2017, 64, 2626–2638.
31. Choi, Y.H.; Yoo, S.J. Quantized-Feedback-Based Adaptive Event-Triggered Control of a Class of Uncertain Nonlinear Systems. Mathematics 2020, 8, 1603.
32. Guo, J.; Wang, Z.; Zou, L.; Zhao, Z. Ultimately Bounded Filtering for Time-Delayed Nonlinear Stochastic Systems with Uniform Quantizations under Random Access Protocol. Sensors 2020, 20, 4134.
33. Peric, Z.; Denic, B.; Savic, M.; Despotovic, V. Design and Analysis of Binary Scalar Quantizer of Laplacian Source with Applications. Information 2020, 11, 501.
34. Gilks, W.R.; Wild, P. Adaptive Rejection Sampling for Gibbs Sampling. J. R. Stat. Soc. Ser. C Appl. Stat. 1992, 41, 337.
35. Gilks, W.R.; Best, N.G.; Tan, K.K.C. Adaptive Rejection Metropolis Sampling within Gibbs Sampling. J. R. Stat. Soc. Ser. C Appl. Stat. 1995, 44, 455.
36. Martino, L.; Read, J.; Luengo, D. Independent Doubly Adaptive Rejection Metropolis Sampling within Gibbs Sampling. IEEE Trans. Signal Process. 2015, 63, 3123–3138.
37. Martino, L. Parsimonious adaptive rejection sampling. Electron. Lett. 2017, 53, 1115–1117.
38. Hörmann, W. A rejection technique for sampling from T-concave distributions. ACM Trans. Math. Softw. 1995, 21, 182–193.
39. Görür, D.; Teh, Y.W. Concave-Convex Adaptive Rejection Sampling. J. Comput. Graph. Stat. 2011, 20, 670–691.
Figure 1. Support region partition into CGR and PGR: ΔCGR and ΔPGR are cell widths in the ℜCGR and ℜPGR, xi, i = 1, …, K are nonnegative decision thresholds, yi, i = 1, …, K are nonnegative representation levels.
Figure 2. Transfer characteristics of PWUQ for N = 2K = 16.
Figure 3. Constant SQNR gain of 3.523 dB that is achieved by PWUQ over UQ in GR case for r ranging from 4 bit/sample to 8 bit/sample.
Figure 4. Samples from Laplacian pdf belonging to ℜCGR and ℜPGR for GR case (r = 8 bit/sample).
Figure 5. Dependences of SQNRPWUQ on ψ for the three cases considered (r = 6 bit/sample).
Table 1. Competitive presentation of key design parameters and performances of UQ and PWUQ.
r [bit/sample]  Case  xclip   ψ*      xb*      Φ(−xb*, xb*)  λ*      SQNRUQ [dB]  SQNRPWUQ [dB]  δ [dB]
5               [J]   4.4800  0.3100  1.38880  0.8597        0.9982  21.8487      24.1089        2.2602
5               [H]   4.9013  0.2996  1.46843  0.8747        0.9990  21.0680      23.6010        2.5330
5               GR    6.5127  0.2674  1.74150  0.9148        0.9999  18.5990      22.1220        3.5230
6               [J]   5.3024  0.2903  1.53929  0.8866        0.9994  26.4054      29.1938        2.7884
6               [H]   5.8815  0.2789  1.64035  0.9017        0.9998  25.5050      28.6522        3.1472
6               GR    6.5127  0.2674  1.74150  0.9148        0.9999  24.6196      28.1426        3.5230
7               [J]   6.1504  0.2740  1.6852   0.9077        0.9998  31.1373      34.4466        3.3093
7               [H]   6.8618  0.2616  1.79505  0.9210        0.9999  30.1866      33.9106        3.7240
7               GR    6.5127  0.2674  1.74150  0.9148        0.9999  30.6402      34.1632        3.5230
8               [J]   7.0272  0.2589  1.81934  0.9237        0.9999  36.0004      39.8178        3.8174
8               [H]   7.8421  0.2467  1.83465  0.9552        0.9999  35.0474      39.3096        4.2622
8               GR    6.5127  0.2674  1.74150  0.9148        0.9999  36.6608      40.1838        3.5230
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
