Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique

Chen, Tung-Shou; Wu, Jie; Chen, Kai Sheng; Yuan, Junying; Hong, Wien

doi:10.3390/app11020619

Open AccessArticle

Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique

by

Tung-Shou Chen

¹,

Jie Wu

²,

Kai Sheng Chen

²

,

Junying Yuan

² and

Wien Hong

^1,*

¹

Department of Computer Science and Information Engineering, National Taichung University of Science and Technology, Taichung 404, Taiwan

²

School of Electrical and Computer Engineering, Nanfang College of Sun Yat-Sen University, Guangzhou 510970, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2021, 11(2), 619; https://doi.org/10.3390/app11020619

Submission received: 22 November 2020 / Revised: 5 January 2021 / Accepted: 7 January 2021 / Published: 10 January 2021

(This article belongs to the Special Issue Advances in Signal, Image and Video Processing)

Download

Browse Figures

Versions Notes

Abstract

:

Absolute moment block truncated coding (AMBTC) is a lossy image compression technique aiming at low computational cost, and has been widely studied. Previous studies have investigated the performance improvement of AMBTC; however, they often over describe the details of image blocks during encoding, causing an increase in bitrate. In this paper, we propose an efficient method to improve the compression performance by classifying image blocks into flat, smooth, and complex blocks according to their complexity. Flat blocks are encoded by their block means, while smooth blocks are encoded by a pair of adjusted quantized values and an index pointing to one of the k representative bitmaps. Complex blocks are encoded by three quantized values and a ternary map obtained by a clustering algorithm. Ternary indicators are used to specify the encoding cases. In our method, the details of most blocks can be retained without significantly increasing the bitrate. Experimental results show that, compared with prior works, the proposed method achieves higher image quality at a better compression ratio for all of the test images.

Keywords:

image compression; AMBTC; ternary representation; k-means

1. Introduction

With the rapid development of imaging technology, digital images are perhaps the most widely used media of the Internet. Because digital images themselves contain significant amounts of spatial redundancy, an efficient lossy image compression technique is required for lower storage requirement and faster transmission. The Joint Photographic Experts Group (JPEG) [1,2], vector quantization (VQ) [3,4], and block truncation coding (BTC) [5,6] are well-known lossy compression methods and have been extensively investigated in the literature. Among these techniques, BTC requires significantly less computation cost than others while offering acceptable image quality. BTC has been widely investigated in the disciplines of remote sensing and portable devices, in which computational costs are limited. BTC was firstly proposed by Delp and Mitchell [7]. This method partitions image into blocks, and each block is represented by two quantized values and a bitmap. Inspired by [7], Lema and Mitchel [8] propose a variant method called absolute moment block truncation coding (AMBTC), which offers a simpler computation than that of BTC.

The applications of AMBTC are studied in video compression [9], image authentication [10,11,12], and image steganography [13,14]. Moreover, some recoverable authentication methods adopt AMBTC codes as the recovery information to recover the tampered regions. Because the recovery codes have to be embedded into the host image, a more efficient coding of AMBTC is always desirable because the burden of the embedment can be reduced and the quality of the recovered regions can be enhanced. To improve the compression efficiency of the AMBTC method, several approaches, including bitmap omission [15], block classification [16,17], and quantized value adjustment [18], are adopted to lower the bitrate while maintaining the image quality. For example, Hu [15] recognizes that if the difference between two quantized values is smaller than a predefined threshold, the bitmap plays an insignificant role in reconstructed image quality. Therefore, Hu employs the bitmap omission approach by neglecting the recording of a bitmap if a block is considered to be flat, and only uses block means to represent the flat block. Chen et al. [17] adopt quadtree partitioning and propose a variable-rate AMBTC compression method for color images. The basic idea of [17] is to partition the image into blocks with various sizes according to their complexities. The AMBTC and bitmap omission technique are then employed to encode the image blocks. In some applications, such as data hiding or image authentication, bitmaps have to be altered to carry some required information, causing a degradation in image quality. Hong [18] optimizes the quantized values so that the impact of bitmap alteration can be reduced. Mathews and Nair [19] propose an adaptive AMBTC method based on edge quantization by considering human visual characteristics. This method separates image blocks into edge and non-edge blocks, and quantized values are calculated based on the edge information. Because the edge characteristics are considered, their method provides better image quality than other AMBTC variants.

Xiang et al. [16] in 2019 proposed a dynamic multi-grouping scheme for AMBTC focusing on improving the reconstructed image quality and reducing the bitrate. Their method partitions an image into non-overlapping blocks. According to the block complexity, varied grouping techniques are designed. An indicator is employed to distinguish the grouping types. In addition, instead of recording the quantized values, the differences between them are recorded so as to reduce the bitrate. Xiang et al.’s method provides better compression performance than those of prior works.

In Xiang et al.’s method, the number of pixel groups of an image block directly affects the reconstructed image quality and bitrate. Their method divides pixels of complex blocks into three or four groups during encoding, which may improve the image quality insignificantly but requires more bits for encoding. In this paper, we propose a ternary representation technique, which uses two thresholds to classify image blocks into three types, namely flat, smooth, and complex. We use the bitmap omission technique [15] to code flat blocks. The adjusted quantized values and an index pointing to one of the representative bitmaps are used to encode the smooth blocks. The complex blocks are encoded using three quantized values and a ternary bitmap. Compared with the AMBTC and Xiang et al.’s work, the proposed method achieves a higher reconstructed image quality with a smaller bitrate.

The reminder of this paper is organized as follows: Section 2 introduces AMBTC and Xiang et al.’s methods. Section 3 introduces the algorithms of this paper in detail. Section 4 presents the experimental results of the proposed method, and concluding remarks are provided in the final section.

2. Related Works

In this section, we briefly introduce AMBTC and Xiang et al.’s methods, which are compared with the proposed method for evaluating the encoding performance.

2.1. The AMBTC Method

The AMBTC method [8] compresses image blocks into two quantized values and a bitmap. The detailed approaches are as follows. Let

I

be the original image of size

w \times h

and partition

I

into non-overlapping blocks

{I_{i}}_{i = 0}^{N - 1}

of size

n \times n

, where

N = (w / n) \times (h / n)

is the total number of blocks. Let

I_{i, j}

be the

j - th

pixel of

i - th

block. Therefore,

I_{i} = {I_{i, j}}_{j = 0}^{n \times n - 1}

. For block

I_{i}

, the averaged value

m_{i}

can be calculated by:

m_{i} = \frac{1}{n \times n} \sum_{j = 0}^{n \times n - 1} I_{i, j} .

(1)

The

j - th

bit of bitmap

B_{i}

, indicated by

B_{i, j}

, is used to indicate the relationship between

I_{i, j}

and

m_{i}

.

B_{i, j}

can be obtained by:

B_{i, j} = {\begin{matrix} 0, & I_{i, j} < m_{i}; \\ 1, & I_{i, j} \geq m_{i} . \end{matrix}

(2)

The lower quantized value

a_{i}

and higher quantized value

b_{i}

are obtained by averaging the pixels in

I_{i}

with values smaller than and larger than or equal to

m_{i}

, respectively. This can be implemented by sequentially visiting pixels in

I_{i}

. The lower quantized value

a_{i}

is obtained by calculating the averaged value of visited pixels with values smaller than

m_{i}

. Similarly, the higher quantized value

b_{i}

is the averaged values of the other pixels. Therefore, the compressed code

Φ_{i}

of

I_{i}

is

{a_{i}, b_{i}, B_{i}}

. Each block is processed using the same manner, and the AMBTC compressed codes

{Φ_{i}}_{i = 0}^{N - 1} = {a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

of image

I

are then obtained.

To decode

{Φ_{i}}_{i = 0}^{N - 1} = {a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

, blocks

{I_{i}^{'}}_{i = 0}^{N - 1}

of size

n \times n

are prepared, where

I_{i}^{'} = {I_{i, j}^{'}}_{j = 0}^{n \times n - 1}

. The

j - th

pixel of

I_{i}^{'}

can be decoded by:

I_{i, j}^{'} = {\begin{array}{l} a_{i}, & B_{i, j} = 0; \\ b_{i}, & B_{i, j} = 1 . \end{array}

(3)

After all of the image blocks are reconstructed, the image

I^{'}

can then be obtained.

2.2. Xiang et al.’s Method

AMBTC uses the same approach to compress all image blocks. However, the same approach may not suitable for flat and complex blocks. As a result, Xiang et al. proposed an improved scheme to efficiently encode blocks according to their complexity, and achieve a better image quality than that of AMBTC with a satisfactory bitrate.

Let

{Φ_{i}}_{i = 0}^{N - 1} = {a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

be the AMBTC compressed code of the original image

I = {I_{i}}_{i = 0}^{N - 1}

. To determine the complexity of block

I_{i}

, a threshold

τ_{0}

is set. If

b_{i} - a_{i} \leq τ_{0}

, the variations of pixel values in block

I_{i}

are relatively small. Therefore, all the pixels in this block are categorized as one group. In this case, the block mean

m_{i}

is calculated, and this block is encoded by

{(m_{i})}_{2}

, which is the 8-bit binary representation of

m_{i}

.

If

b_{i} - a_{i} > τ_{0}

, the variations of pixels in block

I_{i}

are large and these pixels need to be regrouped to achieve a better reconstructed image quality. Let

G_{i}^{0}

and

G_{i}^{1}

be the group of pixels with

B_{i, j} = 0

and

B_{i, j} = 1

, respectively. Apply the AMBTC method to

G_{i}^{0}

and

G_{i}^{1}

to obtain codes

Φ_{i}^{0} = {a_{i}^{0}, b_{i}^{0}, B_{i}^{0}}

and

Φ_{i}^{1} = {a_{i}^{1}, b_{i}^{1}, B_{i}^{1}}

. According to a given threshold

d_{\min}

, this method uses the following rules to determine whether

G_{i}^{0}

and

G_{i}^{1}

should be regrouped:

Rule 1: If

b_{i}^{0} - a_{i}^{0} > τ_{0}

and the total number of pixels in

G_{i}^{0}

is greater than

d_{\min}

.

Rule 2: If

b_{i}^{1} - a_{i}^{1} > τ_{0}

and the total number of pixels in

G_{i}^{1}

is greater than

d_{\min}

.

If neither rule is met, block

I_{i}

does not need to be further divided. Otherwise, block

I_{i}

will be sub-divided into three or four groups using the following rules:

(1): If only rules 1 or 2 are met, group $G_{i}^{0}$ or $G_{i}^{1}$ needs to be subdivided. The number of pixels needing to be subdivided is denoted by $P_{i}$ , and $P_{i} - bit$ bitmap $B_{i}^{0}$ or $B_{i}^{1}$ has to be used to record the bitmap of $G_{i}^{0}$ or $G_{i}^{1}$ . In this case, block $I_{i}$ is eventually divided into three groups.
(2): If both rules 1 and 2 are met, both $G_{i}^{0}$ and $G_{i}^{1}$ need to be subdivided, and block $I_{i}$ is eventually divided into four groups. Bitmap $B_{i}^{0}$ and $B_{i}^{1}$ have to be recorded to maintain the grouping information.

Xiang et al.’s method uses a 2-bit indicator

I_{N D}

to record grouping information of

I_{i}

. When block

I_{i}

is divided into one to four groups, the indicator

I_{N D}

is set to be

00_{2}

,

01_{2}

,

10_{2}

, and

11_{2}

, respectively. Moreover, if

I_{i}

needs to be divided into three groups, an extra indicator is required to show which group is subdivided. Specifically, if

G_{i}^{0}

is sub-divided, then

J_{i} = 0

. On the contrary, if

G_{i}^{1}

is sub-divided, then

J_{i} = 1

.

To record the quantized values, Xiang et al.’s method records the smallest quantized value of a block using 8 bits, and utilizes a difference encoding scheme (DES) to encode the difference

d_{i}

between two quantized values. In DES, if

d_{i} < γ

, where

γ

is a predefined threshold,

d_{i}

is recorded using

\log_{2} (γ)

bits. Otherwise,

d_{i}

is recorded using

⌈ \log_{2} (σ) ⌉

bits, where

σ

is the maximum difference between quantized values in all blocks. An extra indicator

Y_{i}

is used to distinguish these two methods. That is, if

d_{i} < γ

,

Y_{i} = 0

is set. Otherwise,

Y_{i} = 1

. The number of bits

R_{i}

used to record the difference can be expressed as:

R_{i} = {\begin{matrix} \log_{2} (γ) + 1, & d_{i} < γ; \\ ⌈ \log_{2} (σ) ⌉ + 1, & d_{i} \geq γ . \end{matrix}

(4)

We use the symbol

{(x - y)}_{2}

to represent the R-bit encoded result of the difference between

x

and

y

using DES. For example, if

x = 40

,

y = 28

, and

γ = 64

, then

d_{i} = 12 < γ

. Therefore,

R = 7

and the encoding result is

{(40 - 28)}_{2} = 0_{2} | | 001100_{2}

, where

| |

is the concatenation operator.

The compressed code and the number of bits required to record blocks

I_{i}

of different grouping cases are summarized in Table 1. Each block is compressed using the same procedures and the final compressed code stream

C S_{f}

of image

I

is obtained.

To decode

C S_{f}

, the 2-bit indicator

I_{N D}

is read. According to the read bits, four possible compressed codes shown in Table 1 with different lengths can be extracted. The image blocks can be reconstructed from the compressed codes, and the decompressed image can be obtained. The detailed decoding procedures can be referred to [16].

3. Proposed Method

The traditional AMBTC compression method uses the same number of bits to compress each block. However, coding in this way requires more bits than necessary for flat blocks and neglects too much image detail for complex blocks. Xiang et al.’s method improves AMBTC, resulting in better compression effects for both flat and complex blocks. However, in the processing of complex blocks, Xiang et al.’s method reconstructs the gray values of the image block by four quantized values. Although the quality of the reconstructed block is improved, it requires quantized values to be recorded and bitmaps with more bits. In addition, Xiang et al. adopt the traditional AMBTC method to compress the smooth blocks, which may increase the cost of recording bitmaps and quantized values.

In this paper, we propose a more effective solution by classifying image blocks into flat, smooth, and complex blocks based on thresholds

τ_{0}

and

τ_{1}

(

τ_{0} \leq τ_{1}

). Let

Φ_{i} = {a_{i}, b_{i}, B_{i}}

be the AMBTC codes of

I_{i}

. If

b_{i} - a_{i} \leq τ_{0}

,

I_{i}

is classified as a flat block. Because pixel variations in a flat block are small, all pixels in a flat block can be simply reconstructed by their mean to a satisfactory visual quality. If

τ_{0} < b_{i} - a_{i} < τ_{1}

,

I_{i}

is classified as a smooth block. For the smooth block, we use a clustering algorithm to obtain representative bitmaps, and the original bitmaps are replaced by the indices pointing to the obtained bitmap. The two quantized values are also adjusted to reduce the error caused by the bitmap replacement. If

b_{i} - a_{i} \geq τ_{1}

,

I_{i}

is classified as a complex block. We use three quantized values and a ternary map to represent the complex block to maintain better texture details. The encoding algorithms of these three types of blocks will be presented in the following sections.

3.1. Encoding of Flat Blocks

The pixel values of a flat block

I_{i}

(i.e.,

b_{i} - a_{i} \leq τ_{0}

) are relatively close, and thus the bitmap plays an insignificant role in reconstructing the image block. Therefore, we omit the recording of the quantization value in addition to the bitmap, and use an 8-bit mean value

{(m_{i})}_{2}

to represent the flat block, where:

m_{i} = round (\frac{b_{i} + a_{i}}{2})

(5)

and

round (x)

is the function rounding

x

to the nearest integer.

3.2. Encoding of Smooth Blocks

If

τ_{0} < b_{i} - a_{i} < τ_{1}

, the fluctuation of pixel values of block

I_{i}

is more than that of a flat block. Therefore, we refer to

I_{i}

as a smooth block. To reduce the bitrate, a codebook consisting of the

k

most representative bitmaps (codewords) is found, and the bitmap of the smooth block will be replaced by an index pointing to one of the codewords in the codebook. We use the k-means algorithm [20] to obtain the

k

most representative bitmaps. Let

{a_{s}, b_{s}, B_{s}}_{s = 0}^{N_{s} - 1}

be the set of AMBTC codes satisfying

τ_{0} < b_{i} - a_{i} < τ_{1}

for

0 \leq i \leq N - 1

, where

N_{s}

is the number of smooth blocks. Firstly, an initial codebook

{C_{α}^{0}}_{α = 0}^{k - 1}

is constructed by randomly selecting

k

bitmaps from

{B_{s}}_{s = 0}^{N_{s} - 1}

, where

k

is much less than

N_{s}

. Secondly, the bitmaps

{B_{s}}_{s = 0}^{N_{s} - 1}

are classified into

k

clusters according to the similarities between

{B_{s}}_{s = 0}^{N_{s} - 1}

and

{C_{α}^{0}}_{α = 0}^{k - 1}

. That is, if

B_{s}

has more bits identical to

C_{α}^{0}

than other codewords, then

B_{s}

is classified into group

α

, where

0 \leq α \leq k - 1

. Thirdly,

{B_{s}}_{s = 0}^{N_{s} - 1}

of the same group are averaged and rounded to obtain the updated codebook

{C_{α}^{1}}_{α = 0}^{k - 1}

. Repeat the classification process

t

times and the final representative bitmaps

{C_{α}^{t}}_{α = 0}^{k - 1}

are obtained. Normally, setting

t = 6

can already obtain a satisfactory result. We denote the final representative bitmaps as

{C_{α}^{}}_{α = 0}^{k - 1}

. Once the classification process is completed, the classification results

{α_{s}^{*}}_{s = 0}^{N_{s} - 1}

of bitmaps

{B_{s}}_{s = 0}^{N_{s} - 1}

are also obtained. Note that the codeword with index

α_{s}^{*}

has the nearest distance to

B_{s}

, that is:

α_{s}^{*} = \underset{α}{\arg \min} {(\sum_{j = 0}^{n \times n - 1} {(B_{s, j} - C_{α, j})}^{2})}^{1 / 2}

(6)

where

B_{s, j}

and

C_{α, j}

represent the

j - th

element of

B_{s}

and

C_{α}

, respectively. Instead of recording

{B_{s}}_{s = 0}^{N_{s} - 1}

, the proposed method uses the binary representation of

{α_{s}^{*}}_{s = 0}^{N_{s} - 1}

as the required bitmap information. Therefore, the bits required to record the bitmap are reduced from

n \times n

bits to

\log_{2} (k)

bits. To successfully decode the bitmap, we must have cluster centers

{C_{α}}_{α = 0}^{k - 1}

and cluster indices

{α_{s}^{*}}_{s = 0}^{N_{s} - 1}

. Therefore,

{C_{α}}_{α = 0}^{k - 1}

must be included as part of the compressed codes.

When decoding a smooth block, because we use cluster center

C_{α_{s}^{*}}

to replace the original bitmap

B_{s}

, the quality of the reconstructed image block will be reduced. To minimize the reduced quality, a quantized value adjustment (QA) technique [18] is employed. QA is a technique originally used in a data hiding technique to reduce the distortions of the reconstructed AMBTC block when the original bitmap is replaced by secret data. Because bits in the bitmap are altered, distortions of the reconstructed block are inevitable. QA subtly adjusts the quantized values by counting the bit difference between the original bitmap and secret data. In the proposed method, the original bitmap is replaced by a cluster center, which resembles the situations in which the bitmap is replaced by secret data. Therefore, the QA technique can be applied in the proposed method. To find the minimum distortion, the QA technique adjusts

a_{s}

and

b_{s}

to

{\hat{a}}_{s}

and

{\hat{b}}_{s}

by calculating:

{\hat{a}}_{s} = \frac{a_{s} ρ_{00} + b_{s} ρ_{10}}{ρ_{00} + ρ_{10}}

(7)

and:

{\hat{b}}_{s} = \frac{a_{s} ρ_{01} + b_{s} ρ_{11}}{ρ_{01} + ρ_{11}}

(8)

respectively, where

ρ_{p q}

is the number of bits with

B_{s, j} = p

and

C_{α_{s}^{*}, j} = q

,

(p, q) \in {0, 1}

. For example,

ρ_{01}

indicates the number of bits with

B_{s, j} = 0

and

C_{α_{s}^{*}, j} = 1

. After adjustment of quantized values, the distortion due to the bitmap replacement will be smaller than that without adjustment.

3.3. Encoding of Complex Blocks

Blocks with

b_{i} - a_{i} \geq τ_{1}

are classified as complex blocks. Let

{I_{c}}_{c = 0}^{N_{c} - 1}

be the set of

N_{c}

complex blocks in

I

. For a given complex block

I_{c} = {I_{c, j}}_{j = 0}^{n \times n - 1}

, the proposed method uses the k-means clustering algorithm to obtain three most representative quantized values

{q_{c}^{0}, q_{c}^{1}, q_{c}^{2}}

and a ternary map

T_{c} = {T_{c, j}}_{c = 0}^{n \times n - 1}

, where

T_{c, j}

is a ternary digit ranging from 0 to 2 used to indicate which quantized value should be used to reconstruct the

j - th

pixel of

I_{c}

. Because the value of

T_{c, j}

is equally distributed over 0 to 2, we can simply encode the ternary digits

0_{3}

,

1_{3}

, and

2_{3}

by

0_{2}

,

10_{2}

, and

11_{2}

, respectively. We assume the encoded result of

T_{c}

is

{T^{'}}_{c}

of L-bit. Once the decoder has

{{T^{'}}_{c}}_{c = 0}^{N_{c} - 1}

and

{q_{c}^{0}, q_{c}^{1}, q_{c}^{2}}_{c = 0}^{N_{c} - 1}

, blocks

{I_{c}}_{c = 0}^{N_{c} - 1}

can be reconstructed.

When encoding a

4 \times 4

ternary map, the average number of bits required in the proposed method is:

\frac{1 \times 16}{3} + \frac{2 \times 2 \times 16}{3} = 26.67 bits .

Theoretically, recording 16 ternary digits requires

⌈ 16 \times \log_{2} 3 ⌉ = 26

bits, which is almost the same as in the proposed method. Therefore, the encoding of the ternary map used in the proposed method is effective.

3.4. Encoding Procedures

This section describes the procedures of the proposed method. To distinguish the encoding methods of three types of image blocks, an indicator is prepended to the code stream of each encoded block. The indicators

0_{2}

,

10_{2}

, and

11_{2}

are used to indicate a flat, smooth, and complex block is encoded, respectively. The detailed encoding procedures are shown as follows:

Input:

Original image

I

, block size

n \times n

, thresholds

τ_{0}

and

τ_{1}

, parameter

γ

, and cluster size

k

.

Output:

Code stream

C S_{f}

.

Step 1:

Partition the original image

I

into blocks

{I_{i}}_{i = 0}^{N - 1}

of size

n \times n

. Encode

{I_{i}}_{i = 0}^{N - 1}

using the AMBTC encoder and obtain codes

Φ_{i} = {a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

, as described in Section 2.1.

Step 2:

Scan codes

{a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

. Let

{B_{s}}_{s = 0}^{N_{s} - 1}

be the bitmap of smooth blocks. Clustering

{B_{s}}_{s = 0}^{N_{s} - 1}

into

k

groups using the k-means clustering algorithm, we obtain

k

cluster centers

{C_{α}}_{α = 0}^{k - 1}

and

N_{s}

cluster indices

{α_{s}^{*}}_{s = 0}^{N_{s} - 1}

. Concatenate the binary representation of

{C_{α}}_{α = 0}^{k - 1}

and obtain the concatenated code stream

C S_{A}

. The

N_{s}

pairs of adjusted quantized values

{{\hat{a}}_{s}, {\hat{b}}_{s}}_{s = 0}^{N_{s} - 1}

of smooth blocks are also obtained, as described in Section 3.2. Similarly, quantized values

{q_{c}^{0}, q_{c}^{1}, q_{c}^{2}}_{c = 0}^{N_{c} - 1}

and ternary maps

{T_{c}}_{c = 0}^{N_{c} - 1}

of complex blocks are also obtained, as described in Section 3.3.

Step 3:

Scan codes

{a_{i}, b_{i}, B_{i}}_{i = 0}^{N - 1}

again and perform the encoding according to the cases listed below:

Case 1:: If $b_{i} - a_{i} \leq τ_{0}$ , a flat block is visited and the code stream of block $I_{i}$ is $C S_{i} = 0_{2} | | (m_{i})_{2}$ .
Case 2:: If $τ_{0} < b_{i} - a_{i} < τ_{1}$ , a smooth block is visited. Extract ${\hat{a}}_{s}$ , ${\hat{b}}_{s}$ , and $α_{s}^{*}$ from ${{\hat{a}}_{s}, {\hat{b}}_{s}, α_{s}^{*}}_{s = 0}^{N_{s} - 1}$ obtained in Step 2, and block $I_{i}$ is encoded by $C S_{i} = 10_{2} | | ({\hat{a}}_{s})_{2} | | {({\hat{b}}_{s} - {\hat{a}}_{s})}_{2} | | {(α_{s}^{*})}_{2}$ . Note that ${({\hat{b}}_{s} - {\hat{a}}_{s})}_{2}$ is encoded using the DES, as described in Section 2.2.
Case 3:: If $b_{i} - a_{i} \geq τ_{1}$ , block $I_{i}$ is a complex one. Extract $q_{c}^{0}$ , $q_{c}^{1}$ , $q_{c}^{2}$ , and $T_{c}$ from ${q_{c}^{0}, q_{c}^{1}, q_{c}^{2}, T_{c}}_{c = 0}^{N_{c} - 1}$ obtained in Step 2, and block $I_{i}$ is encoded by $C S_{i} = 11_{2} | | (q_{c}^{0})_{2} | | {(q_{c}^{1} - q_{c}^{0})}_{2} | | {(q_{c}^{2} - q_{c}^{1})}_{2} | | {T^{'}}_{c}$ . Note that ${(q_{c}^{1} - q_{c}^{0})}_{2}$ and ${(q_{c}^{2} - q_{c}^{1})}_{2}$ are encoded using the DES (see Section 2.2).

Step 4:

Repeat Step 3 until the code stream

{C S_{i}}_{i = 0}^{N - 1}

of blocks

{I_{i}}_{i = 0}^{N - 1}

are obtained. Concatenate

{C S_{i}}_{i = 0}^{N - 1}

, we have the concatenated code stream

C S_{B}

.

Step 5:

Concatenate

C S_{A}

and

C S_{B}

; we obtain the final code stream

C S_{f}

of image

I

, i.e.,

C S_{f} = C S_{A} | | C S_{B}

.

The encoding of a given image block and the number of required bits for each block types are shown in Figure 1.

We take a simple example to illustrate the encoding of smooth and complex blocks. Let

I_{0}

be a

4 \times 4

block to be encoded, as shown in Figure 2a. Suppose

τ_{0} = 4

,

τ_{1} = 16

,

γ = 64

,

σ = 128

, and

k = 128

are used in this example. The AMBTC compressed code of

I_{0}

is

{a_{0}, b_{0}, B_{0}} = {28, 40, 1110 1110 1100 1100_{2}}

, and

B_{0}

is depicted in Figure 2b. Because

τ_{0} < b_{0} - a_{0} < τ_{1}

,

I_{0}

is a smooth block. Assume

α_{0}^{*} = 43

and

C_{43} = 1010 0110 0101 0100_{2}

(see Figure 2c). By comparing

B_{0}

and

C_{43}

, we have

ρ_{00} = 5

,

ρ_{01} = 1

,

ρ_{10} = 4

, and

ρ_{11} = 6

. Using Equations (7) and (8), we have

{\hat{a}}_{0} = 33

and

{\hat{b}}_{0} = 38

. Because

{\hat{b}}_{0} - {\hat{a}}_{0} = 5 < γ

, we have

Y = 0

. Because

{({\hat{a}}_{0})}_{2} = 00100001_{2}

,

{({\hat{b}}_{0} - {\hat{a}}_{0})}_{2} = 0 | | 000101_{2}

, and

{(α_{0}^{*})}_{2} = 0101011_{2}

, the code stream of

I_{0}

should be

C S_{0} = 10 | | 00100001 | | 0 | | 000101 | | 0101011_{2}

.

Figure 2d shows another block

I_{1}

to be encoded. For this block, quantized values

a_{1} = 25

and

b_{1} = 103

of the AMBTC code are calculated. Because

b_{1} - a_{1} \geq τ_{1}

,

I_{1}

is regarded as a complex block. Suppose after applying the k-means clustering algorithm to

I_{1}

, we obtain three quantized values

{q_{1}^{0}, q_{1}^{1}, q_{1}^{2}} = {19, 85, 133}

and the ternary cluster indices of pixels

T_{1} = {1111 2121 0210 0000}

, as shown in Figure 2e. The difference between the first two quantized values is

q_{1}^{1} - q_{1}^{0} = 66 > γ

. Therefore, indicator

Y_{1, 0} = 1

should be placed in front of the

\log_{2} (σ) = 7 - bit

binary representation of 66 (i.e.,

1 | | 1000010_{2}

). Similarly, because

q_{1}^{2} - q_{1}^{1} = 48 < γ

, indicator

Y_{1, 1} = 0

should be placed in front of the

\log_{2} (γ) = 6 - bit

binary representation of 48 (i.e.,

{0 | | 110000}_{2}

). Finally, the ternary cluster indices

T_{1}

are encoded by

10101010 11101110 011100 0000_{2}

, which is illustrated in Figure 2f. Therefore, according to Step 3 of Case 3 in Section 3.4, the code stream of block

I_{1}

should be

C S_{1} = 11 | | 00010011 | | 1 | | 1000010 | | 0 | | 110000 | | 10101010 11101110 {011100 0000}_{2}

.

3.5. Decoding Procedures

In decoding, data bits are sequentially read and decoded, and image blocks are reconstructed by decoding the read data bits. The detailed steps of decoding are listed as follows:

Input:

Code stream

C S_{f}

, block size

n \times n

, parameter

γ

,

σ

, and cluster size

k

.

Output:

Decompressed image

I^{'} = {I_{i}^{'}}_{i = 0}^{N - 1}

.

Step 1:

Extract

C S_{A}

from

C S_{f}

and reconstruct

k

cluster centers

{C_{α}}_{α = 0}^{k - 1}

.

Step 2:

Extract one bit

b

from

C S_{f}

. According to the extracted bit, one of the following decoding cases is then performed:

Case 1:: If $b = 0_{2}$ , the block to be reconstructed is a flat block. All the pixel values of block $I_{i}^{'}$ are the decimal value of the next 8 bits extracted from $C S_{f}$ .
Case 2:: If $b = 1_{2}$ and the next extracted bit is $0_{2}$ , the block to be reconstructed is a smooth block. Extract the next 8 bits and convert them to a decimal value to obtain the quantized value ${\hat{a}}_{s}$ . Read the next bit from $C S_{f}$ . If the read bit is $0_{2}$ , ${\hat{b}}_{s}$ is reconstructed by the decimal value of the next $\log_{2} (γ)$ bits plus ${\hat{a}}_{s}$ . Otherwise, ${\hat{b}}_{s}$ is reconstructed by the decimal value of next $⌈ \log_{2} (σ) ⌉$ bits plus ${\hat{a}}_{s}$ . The clustering index $α_{s}^{*}$ is the decimal value of next $k$ bits, and the bitmap $C_{α_{s}^{*}}$ can be obtained from ${C_{α}}_{α = 0}^{k - 1}$ . Using the AMBTC decoder to decode ${{\hat{a}}_{s}, {\hat{b}}_{s}, C_{α_{s}^{*}}}$ , the image block can be reconstructed.
Case 3:: If $b = 1_{2}$ and the next extracted bits is $1_{2}$ , the block to be reconstructed is a complex block. Extract the next 8 bits and convert them to a decimal value to obtain the quantized value $q_{c}^{0}$ . Read the next bit from $C S_{f}$ . If the read bit is $0_{2}$ , $q_{c}^{1}$ is reconstructed by the $q_{c}^{0}$ plus the decimal value of the next $\log_{2} (γ)$ bits; otherwise, $q_{c}^{1}$ is reconstructed by the decimal value of the next $⌈ \log_{2} (σ) ⌉$ bits plus $q_{c}^{0}$ . Using a similar manner, $q_{c}^{2}$ is reconstructed. To reconstruct the ternary map ${T_{c, j}}_{j = 0}^{n \times n - 1}$ , we start from $j = 0$ to $j = n \times n - 1$ and repeat the following process: Read a bit $b_{0}$ from $C S_{f}$ . If $b_{0} = 0_{2}$ , we have $T_{c, j} = 0$ . Otherwise, read the next bit $b_{1}$ from $C S_{f}$ . If $b_{0} b_{1} = 10_{2}$ , $T_{c, j} = 1$ . If $b_{0} b_{1} = 11_{2}$ , $T_{i, j} = 2$ . Once we have ${q_{c}^{0}, q_{c}^{1}, q_{c}^{2}}$ and ${T_{c, j}}_{j = 0}^{n \times n - 1}$ , the j-th pixel of the image block is reconstructed by $q_{c}^{0}$ , $q_{c}^{1}$ , or $q_{c}^{2}$ if $T_{c, j} = 0$ , 1, or 2, respectively.

Step 3:

Repeat Step 2 until all image blocks are reconstructed, and the final decompressed image

I^{'}

is obtained.

We continue the example given in Section 3.4 to illustrate the decoding process. The detailed process and the decoded result are depicted in Figure 3. To decode the code stream

C S_{0} = 10 | | 00100001 | | 0 000101 | | 0101011_{2}

, because the first bit is

1_{2}

and the second bit is

0_{2}

, the to-be-reconstructed block is a smooth block. Extract the next 8 bits from

C S_{0}

and convert them into decimal representation; we obtain

{\hat{a}}_{0} = 33

. The next extracted bit is 0. Therefore, the difference

d_{0} = 5

is the decimal value of the next

\log_{2} (γ)

bits, and we have

{\hat{b}}_{0} = {\hat{a}}_{0} + 5 = 38

. Finally, extract

\log_{2} (k)

bits and convert them to a decimal value; we have

α_{s}^{*} = 43

and the bitmap

C_{43}

is obtained. The image block can then be constructed by decoding

{{\hat{a}}_{0}, {\hat{b}}_{0}, C_{43}}

using the AMBTC decompression technique.

To decode

C S_{1} = 11 | | 00010011 | | 1 1000010 | | 0 110000 | | 10101010 11101110 {011100 0000}_{2}

, because the first two extracted bits are

11_{2}

, the block to be decompressed is a complex block. Extract 8 bits and

q_{1}^{0} = 19

is the decimal value of these 8 bits. The next bit is

1_{2}

; therefore,

d_{1}^{1} = 66

is the decimal value of the next

⌈ \log_{2} (σ) ⌉ = 7

bits and

q_{1}^{1} = 19 + 66 = 85

can be obtained. Similarly, the next extracted bit is

0_{2}

; therefore,

d_{1}^{2} = 48

is obtained by converting the next

\log_{2} (γ) = 6

bits to their decimal value, and

q_{1}^{2} = 85 + 48 = 133

can be obtained. Finally, we have to reconstruct

{T_{1, j}}_{j = 0}^{15}

from the remaining bits. Because the next extracted bit is

1_{2}

, we extract one more bit, which is

0_{2}

. Therefore,

T_{1, 0} = 1

is obtained. The remaining 15 ternary digits

{T_{1, j}}_{j = 1}^{15}

can be decoded in the similar manner. Once we have

{q_{1}^{0}, q_{1}^{1}, q_{1}^{2}}

and

{T_{1, j}}_{j = 0}^{15}

, block

I_{1}^{'}

can be reconstructed. Figure 3b illustrates the decoding process of

C S_{1}

.

4. Experimental Results

In this section, we conduct several experiments to show the effectiveness and applicability of the proposed scheme. We take eight grayscale images of size

512 \times 512

, namely, Lena, Jet, Baboon, Tiffany, Boat, Stream, Peppers and House, as the test images, as shown in Figure 4. These images can be obtained from the USC-SIPI image database [21]. We use the peak signal-to-noise ratio (PSNR) and bitrate to measure the performance. The PSNR is calculated by:

PSNR = 10 \log_{10} \frac{255^{2}}{\frac{1}{n \times n \times N} \sum_{i = 0}^{n \times n \times N - 1} {(x_{i} - {x^{'}}_{i})}^{2}},

(9)

where

x_{i}

and

{x^{'}}_{i}

represent the pixel values of the original and decompressed images, respectively. The bitrate metric is measured by the number of bits required to record each pixel (i.e., bit per pixel, bpp).

In all of the experiments, we set

τ_{0} = 4

because the flat blocks under this setting show no apparent block boundary artifacts.

4.1. The Performance of the Proposed Method

Because the number of cluster centers k and threshold greatly affect the coding efficiency in the application of the quantized value adjustment (QA) technique [18], we evaluate how the QA technique and these parameters influence the bitrate and image quality in this section.

4.1.1. Coding Efficiency Comparisons

In the coding of smooth blocks, the original bitmaps of smooth blocks are used to obtain the cluster centers, and the quantized values are adjusted using the QA technique to lower the distortions. Table 2 and Table 3 show how the QA technique improves the image quality when the block size is set to

4 \times 4

and

8 \times 8

, respectively. In this experiment,

τ_{1} = 16

and

γ = 32

are set. As seen from the tables, the QA technique effectively enhances the quality of reconstructed images for every

k

. For example, in Table 2 and

k = 128

, the averaged quality of the reconstructed images with and without the QA technique is 34.63 and 34.52 dB, respectively. The averaged quality has improved by

34.63 - 34.52 = 0.11

dB. Similarly, in Table 3 when

k = 128

, the PSNR improvement is

31.86 - 31.76 = 0.10

dB. Therefore, the QA technique indeed reduces the distortion caused by replacing the original bitmap with a cluster center.

Table 2 and Table 3 also reveal that the increase in cluster size

k

also enhances the image quality. For example, in Table 3 when

k = 128

, the averaged PSNR of eight test images is 31.86 dB. When

k = 256

and 512, the PSNR increases

31.91 - 31.86 = 0.05

dB and

32.00 - 31.86 = 0.14

dB, respectively. The reason is that a larger cluster size provides a greater chance to reduce the difference between the cluster centers and original bitmaps.

To evaluate how threshold

τ_{1}

affects the performance of the QA, we plot the gain of PSNR when using the QA for various

τ_{1}

with

k = 128

and 512. The results are shown in Figure 5. Note that, in this experiment, a block size of

4 \times 4

is set.

Figure 5a,b shows that the gain in PSNR increases as

τ_{1}

increases, and this is mainly because the number of smooth blocks also increases as

τ_{1}

increases. Because more blocks are classified as smooth for larger

τ_{1}

, more blocks will be processed using the QA technique. As a result, the gain in PSNR is higher when

τ_{1}

is larger. It also can be observed that for each test image, the gain in PSNR is larger when

k = 128

than that when

k = 512

. The reason is that a smaller

k

implies larger differences between the original bitmaps and cluster centers. Because the QA technique is capable of reducing the distortion caused by the differences, a larger PSNR improvement can be achieved for smaller

k

.

It is interesting to note that the gain in PSNR of the Stream and Baboon images increases more than that of other test images when varying

τ_{1} = 10

to

τ_{1} = 50

for both

k = 128

and

k = 512

. Because these two images are more complex than the others, their bitmaps of smooth blocks are expected to be more different from the selected cluster centers used to replace the bitmaps. As previously mentioned, the QA technique is effective in reducing the distortion caused by the differences, and the bitmaps of Stream and Baboon images are more different from the cluster centers than the other images. Therefore, the improvement in PSNR after applying the QA technique is more significant than for the others.

4.1.2. Performance Comparison of Various $τ_{1}$

The parameter

τ_{1}

controls the number of smooth and complex blocks. The number of complex blocks decreases as

τ_{1}

increases. To see the distribution of flat, smooth, and complex blocks of a test image, we take the Lena image as an example to illustrate their distribution by varying

τ_{1}

. Figure 6a–d shows the distributions of blocks when

τ_{1} = 8

, 16, 32, and 64 are set. The block sizes in these figures are

8 \times 8

and

τ_{0} = 4

. In this figure, the blue squares, red dots, and black cross marks represent flat, smooth, and complex blocks, respectively.

Because the same

τ_{0}

is applied, it can be seen that the number of blue squares (flat blocks) is the same in Figure 6a–d. However, as

τ_{1}

increases, the red dots increase and black cross marks decrease. The reason is that an increase in

τ_{1}

leads more blocks to be categorized as smooth. It can also be inferred that a better image quality can be achieved at a smaller

τ_{1}

but the bitrate will be higher because more blocks are deemed to be complex. Note that in the proposed method, more bits are required to represent a complex block than a smooth block.

Table 4 and Table 5 show the PSNR and bitrate for all of the test images under various

τ_{1}

with block size

4 \times 4

and

8 \times 8

, respectively. In this experiment,

τ_{0} = 4

,

γ = 64

, and

k = 256

are set. We also list the PSNR and bitrate of the standard AMBTC method as a comparison. Note that the bitrates of the AMBTC with block size

4 \times 4

and

8 \times 8

are 2.0 and 1.25 bpp, respectively.

As seen in Table 4 and Table 5, the PSNR of block size

8 \times 8

is lower than that of block size

4 \times 4

. For example, when

τ_{1} = 16

, the PSNR of the Lena image of block sizes

4 \times 4

and

8 \times 8

are 34.73 and 31.91 dB, respectively. However, the former requires

1.68 - 1.02 = 0.66

more bits per pixel than the latter. In addition, the experiments also reveal the fact that a large

τ_{1}

effectively reduces the bitrate at the expense of image quality. On the contrary, a small

τ_{1}

provides better image quality, but requires more bitrate. This result is expected because a small

τ_{1}

increases the number of complex blocks and, therefore, the bitrate, however, the image quality also increases.

Figure 7 shows the bitrate–PSNR curves of each of the eight test images by varying the threshold

τ_{1}

from 8 to 64. The figure shows that for all of the test images, the PSNR increases as the bitrate increases. Moreover, the figure also reveals that smooth images, such as Tiffany or Jet, have a better compression efficiency than those of complex images, such as Stream or Baboon. The reason is that a smooth block not only requires less bits to record its compressed code but also provides better reconstructed quality. Because the smooth images naturally possess more smooth blocks than complex blocks, their bitrate–PSNR curves are higher than those of complex ones.

It also can be seen from Figure 7 that the PSNR and bitrate vary as the threshold

τ_{1}

changes. A larger

τ_{1}

gives a lower bitrate with lower PSNR. In contrast, a smaller

τ_{1}

offers a higher image quality, but the bitrate is also higher. Therefore, the selection of threshold

τ_{1}

depends on real applications. For example, if an application requires higher image quantity, a smaller

τ_{1}

is required.

It is worth noting that, for most of the test images, the proposed method provides better performance than AMBTC, particularly for smooth images. For example, Lena, Jet, Tiffany, Boat, and Peppers are considered to be smooth. For these smooth images, regardless of the value of

τ_{1}

, the PSNR is always higher and the bitrate is always lower than those of the AMBTC method. In contrast, for the complex images such as Baboon or Stream, few blocks are classified as flat, which require only 8 bits to record them. Therefore, the reduction in bitrate is limited. Nevertheless, the proposed method either provides a better image quality or lower bitrate than those of AMBTC.

4.2. Comparisons with Xiang et al.’s Work

Xiang et al.’s method [16] also improves the AMBTC method by dynamically splitting images into multiple groups and achieves a good performance. In this section, we compare the proposed method with that of Xiang et al. in terms of PSNR and bitrate. To make a fair comparison, threshold

τ_{0} = 4

and

γ = 64

are set in both methods. The proposed method uses

τ_{1}

to control the number of smooth and complex blocks, whereas Xiang et al.’s method uses

d_{\min}

to control the number of pixel groups. We select

τ_{1} = 8

, 16, and 24 in the proposed method and compare the results with those of Xiang et al. by setting

d_{\min} = 6

, 7, and 8 for block size

4 \times 4

. The results are shown in Table 6. Table 7 shows the same experimental results, except block size is

8 \times 8

and

d_{\min} = 28

, 32, and 36. The settings of

d_{\min}

in Xiang et al.’s method ensure that best performance can be achieved.

Table 6 shows that in Xiang et al.’s method, as

d_{\min}

increases, the image quality and bitrate decrease. The reason is that a large

d_{\min}

prevents more blocks from being split, leading to a decrease in bitrate and PSNR. Note that for most of the test images, the proposed method performs better than that of Xiang et al. We take the Lena image as an example: when

τ_{1} = 16

and

d_{\min} = 7

are set, the PSNR of the proposed and Xiang et al.’s methods are 36.05 dB with 1.68 bpp and 35.12 dB with 2.20 bpp, respectively. The PSNR of the proposed method is

36.05 - 35.12 = 0.93

dB higher and the bitrate is

2.20 - 1.68 = 0.52

bpp lower than that of Xiang et al.’s method. Comparisons with other images and another set of parameters also reveal similar results, with the exception of the Baboon image. When

τ_{1} = 8

and

d_{\min} = 6

are set, the PSNR of Xiang et al.’s method is

31.40 - 31.28 = 0.12

dB higher than the proposed method. The reason is that under these settings, more blocks are divided into four groups and thus a better image quality is achieved. However, their method requires

3.57 - 3.12 = 0.45

bpp more than the proposed method.

In the performance comparisons with block size

8 \times 8

, the proposed method shows better results for all test images. For example, as shown in Table 7 when

τ_{1} = 14

and

d_{\min} = 28

, the PSNR of the Baboon image of the proposed method is 28.84 dB at 1.88 bpp. The PSNR is

28.84 - 28.14 = 0.70

dB higher and the bitrate is

2.23 - 1.88 = 0.35

bpp lower than those of Xiang et al.’s method.

Figure 8a–f shows the visual quality comparisons of the AMBTC, Xiang et al.’s, and the proposed methods. As seen from Figure 8a when the block size is

4 \times 4

and the AMBTC method is applied, apparent distortions can be seen in the image edges, and noticeable boundary artifacts are observed (see Figure 8a). Note that the PSNR of the AMBTC is 33.27 dB with 2.0 bpp. Xiang et al. improve the AMBTC method by adding more details to complex blocks. As a result, the PSNR (36.31 dB) is significantly higher and blocks at the edges look more natural than those of AMBTC (Figure 8c,

d_{\min} = 6

). However, their method requires

2.35 - 2 = 0.35

bpp more to achieve this effect. In contrast, the visual quality of the proposed method (Figure 8e,

τ_{1} = 8

) is comparable with that of Xiang et al.’s method, but the bitrate is

2.35 - 2.05 = 0.30

bpp lower with a slightly higher PSNR.

When the block size is

8 \times 8

, the distortion of AMBTC is more apparent (Figure 8b) than that of

4 \times 4

, but the bitrate reduces from 2.0 to 1.25 bpp. The visual quality of Xiang et al.’s method (Figure 8d,

d_{\min} = 28

) is significantly better than that of AMBTC, and has no noticeable block boundary artifacts. However, their method requires

1.64 - 1.25 = 0.39

bpp more to improve the image quality. In addition, some edges in Lena’s face, eyes, and shoulder exhibit apparent distortions because the pixel splitting operation may not be triggered due to the setting of

d_{\min}

. In contrast, the edges of the proposed method exhibit no apparent distortion (see Figure 8f,

τ_{1} = 14

). Moreover, the bitrate required in the proposed method is even lower than that of AMBTC by

1.25 - 1.09 = 0.16

bpp.

5. Conclusions

In this paper, we propose a hybrid encoding scheme for AMBTC compressed images using a ternary representation technique. Considering that the number of quantized values greatly affects the quality of the reconstructed image, the proposed method classifies image blocks into flat, smooth, and complex. These three types of blocks are encoded by using one, two, or three quantized values. Flat blocks require no bitmap, whereas smooth and complex blocks require binary and ternary maps, respectively, to record the quantized values to be used to reconstruct the corresponding pixels. A sophisticated design indicator is prepended before the code stream of a block to signify the block type. The proposed method achieves a better image quality than that of prior works with a smaller bitrate. The effectiveness of the proposed method is observed from the experimental results. Note that although the k-means algorithm used in the proposed method may require slightly higher computational cost than that of the discrete cosine transform (DCT) based methods, it is only applied to smooth blocks in the encoding stage to obtain the representative bitmaps rather than the whole image. Furthermore, the k-means algorithm does not need to be applied again during decoding. Therefore, the overall computational cost of the proposed method is smaller than that of DCT-based compression methods.

Author Contributions

W.H., J.W., and K.S.C. contributed to the conceptualization, methodology, and writing of this paper. J.Y. and T.-S.C conceived the simulation setup, formal analysis and conducted the in-vestigation. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Liu, J.; Tian, Y.-G.; Han, T.; Yang, C.-F.; Liu, W.-B. LSB steganographic payload location for JPEG-decompressed images. Digit. Signal Process. 2015, 38, 66–76. [Google Scholar] [CrossRef]
Liu, J.; Tian, Y.; Han, T.; Wang, J.; Luo, X. Stego key searching for LSB steganography on JPEG decompressed image. Sci. China Inf. Sci. 2016, 59, 1–15. [Google Scholar] [CrossRef]
Qin, C.; Chang, C.-C.; Chiu, Y.-P. A Novel Joint Data-Hiding and Compression Scheme Based on SMVQ and Image Inpainting. IEEE Trans. Image Process. 2014, 23, 969–978. [Google Scholar] [CrossRef]
Qin, C.; Hu, Y.-C. Reversible data hiding in VQ index table with lossless coding and adaptive switching mechanism. Signal Process. 2016, 129, 48–55. [Google Scholar] [CrossRef]
Tsou, C.-C.; Hu, Y.-C.; Chang, C.-C. Efficient optimal pixel grouping schemes for AMBTC. Imaging Sci. J. 2008, 56, 217–231. [Google Scholar] [CrossRef]
Hu, Y.C.; Su, B.H.; Tsai, P.Y. Color image coding scheme using absolute moment block and prediction technique. Imaging Sci. J. 2008, 56, 254–270. [Google Scholar] [CrossRef]
Delp, E.J.; Mitchell, O.R. Image coding using block truncation coding. IEEE Trans. Commun. 1979, 27, 1335–1342. [Google Scholar] [CrossRef]
Lema, M.; Mitchell, O. Absolute Moment Block Truncation Coding and Its Application to Color Images. IEEE Trans. Commun. 1984, 32, 1148–1157. [Google Scholar] [CrossRef]
Kumaravadivelan, A.; Nagaraja, P.; Sudhanesh, R. Video compression technique through block truncation coding. Int. J. Res. Anal. Rev. 2019, 6, 236–242. [Google Scholar]
Hemida, O.; He, H. A self-recovery watermarking scheme based on block truncation coding and quantum chaos map. Multimed. Tools Appl. 2020, 79, 18695–18725. [Google Scholar] [CrossRef]
Qin, C.; Ji, P.; Zhang, X.; Dong, J.; Wang, J. Fragile image watermarking with pixel-wise recovery based on overlapping embedding strategy. Signal Process. 2017, 138, 280–293. [Google Scholar] [CrossRef]
Qin, C.; Ji, P.; Chang, C.-C.; Dong, J.; Sun, X. Non-uniform Watermark Sharing Based on Optimal Iterative BTC for Image Tampering Recovery. IEEE MultiMed. 2018, 25, 36–48. [Google Scholar] [CrossRef]
Ma, Y.Y.; Luo, X.Y.; Li, X.L.; Bao, Z.; Zhang, Y. Selection of rich model steganalysis features based on decision rough set α-positive region reduction. IEEE Trans. Circuits Syst. Video Technol. 2019, 29, 336–350. [Google Scholar] [CrossRef]
Zhang, Y.; Qin, C.; Zhang, W.M.; Liu, F.L.; Luo, X.Y. On the fault-tolerant performance for a class of robust image ste-ganography. Signal Process. 2018, 146, 99–111. [Google Scholar] [CrossRef]
Hu, Y.-C. Low-complexity and low-bit-rate image compression scheme based on absolute moment block truncation coding. Opt. Eng. 2003, 42, 1964–1975. [Google Scholar] [CrossRef]
Xiang, Z.; Hu, Y.-C.; Yao, H.; Qin, C. Adaptive and dynamic multi-grouping scheme for absolute moment block truncation coding. Multimed. Tools Appl. 2018, 78, 7895–7909. [Google Scholar] [CrossRef]
Chen, W.-L.; Hu, Y.-C.; Liu, K.-Y.; Lo, C.-C.; Wen, C.-H. Variable-Rate Quadtree-segmented Block Truncation Coding for Color Image Compression. Int. J. Signal Process. Image Process. Pattern Recognit. 2014, 7, 65–76. [Google Scholar] [CrossRef]
Hong, W. Efficient Data Hiding Based on Block Truncation Coding Using Pixel Pair Matching Technique. Symmetry 2018, 10, 36. [Google Scholar] [CrossRef] [Green Version]
Mathews, J.; Nair, M.S. Adaptive block truncation coding technique using edge-based quantization approach. Comput. Electr. Eng. 2015, 43, 169–179. [Google Scholar] [CrossRef]
Hartigan, J.A.; Wong, M.A. A K-means clustering algorithm. Appl. Stat. 1979, 28, 100–108. [Google Scholar] [CrossRef]
The USC-SIPI Image Database. Available online: http://sipi.usc.edu/database/ (accessed on 1 November 2020).

Figure 1. Illustration of image block encoding.

Figure 2. An example of image encoding.

Figure 3. Illustration of the decoding procedures of image blocks.

Figure 4. Eight test images.

Figure 5. The gain of PSNR when the quantized value adjustment (QA) technique is applied.

Figure 6. Distribution of flat, smooth and complex blocks.

Figure 7. Bitrate–PSNR curves of each of the eight test images.

Figure 8. Visual quality comparisons of the proposed method and that of Xiang et al.

Table 1. Number of bits required to record a compressed block.

Number of Groups	$Compressed Code of I_{i}$	Number of Bits
1	${00_{2}, {(m_{i})}_{2}}$	$2 + 8$
2	${01_{2}, (a_{i})_{2}, {(b_{i} - a_{i})}_{2}, B_{i}}$	$2 + 8 + R_{i} + n \times n$
3	${10_{2}, (a_{i}^{0})_{2}, {(b_{i}^{0} - a_{i}^{0})}_{2}, {(b_{i} - b_{i}^{0})}_{2}, B_{i}, J_{i}, B_{i}^{0}}$ or	$2 + 8 + 2 R_{i} + n \times n + 1 + P_{i}$
3	${10_{2}, (a_{i})_{2}, {(a_{i}^{1} - a_{i})}_{2}, {(b_{i}^{1} - a_{i}^{1})}_{2}, B_{i}, J_{i}, B_{i}^{1}}$	$2 + 8 + 2 R_{i} + n \times n + 1 + P_{i}$
4	${11_{2}, {(a_{i}^{0})}_{2}, {(b_{i}^{0} - a_{i}^{0})}_{2}, {(a_{i}^{1} - b_{i}^{0})}_{2}, {(b_{i}^{1} - a_{i}^{1})}_{2}, B_{i}, B_{i}^{0}, B_{i}^{1}}$	$2 + 8 + 3 R_{i} + 2 \times n \times n$

Table 2. Peak signal-to-noise ratio (PSNR) and bitrate of compressed images with

4 \times 4

block size.

Table 2. Peak signal-to-noise ratio (PSNR) and bitrate of compressed images with

4 \times 4

block size.

Images	$k = 128$			$k = 256$			$k = 512$
Images	Bitrate	w/o QA	w/QA	Bitrate	w/o QA	w/QA	Bitrate	w/o QA	w/QA
Lena	1.60	35.83	35.96	1.63	35.95	36.05	1.68	36.07	36.15
Jet	1.52	35.85	35.91	1.55	35.92	35.97	1.58	35.99	36.03
Baboon	2.73	30.80	30.85	2.76	30.85	30.89	2.79	30.89	30.93
Tiffany	1.47	37.18	37.38	1.50	37.39	37.55	1.55	37.57	37.69
Boat	2.00	33.97	34.13	2.05	34.15	34.27	2.10	34.27	34.36
Stream	2.68	32.43	32.49	2.70	32.49	32.53	2.73	32.55	32.57
Peppers	1.68	35.34	35.57	1.73	35.52	35.71	1.78	35.71	35.85
House	1.95	34.73	34.78	1.97	34.79	34.82	2.01	34.85	34.88
Average	1.95	34.52	34.63	1.99	34.63	34.72	2.03	34.74	34.81

Table 3. PSNR and bitrate of compressed images with

8 \times 8

block size.

Table 3. PSNR and bitrate of compressed images with

8 \times 8

block size.

Images	$k = 128$			$k = 256$			$k = 512$
Images	Bitrate	w/o QA	w/QA	Bitrate	w/o QA	w/QA	Bitrate	w/o QA	w/QA
Lena	0.97	32.96	33.07	1.01	33.03	33.12	1.08	33.13	33.21
Jet	1.00	32.77	32.82	1.04	32.83	32.86	1.11	32.90	32.92
Baboon	1.79	28.67	28.72	1.83	28.72	28.75	1.89	28.80	28.82
Tiffany	0.89	34.63	34.81	0.93	34.79	34.93	1.00	34.99	35.09
Boat	1.30	31.17	31.32	1.34	31.24	31.37	1.41	31.36	31.46
Stream	1.89	29.79	29.82	1.92	29.84	29.86	1.99	29.93	29.93
Peppers	1.01	32.59	32.74	1.05	32.66	32.80	1.12	32.79	32.90
House	1.36	31.52	31.57	1.39	31.57	31.61	1.46	31.65	31.67
Average	1.28	31.76	31.86	1.31	31.83	31.91	1.38	31.94	32.00

Table 4. PSNR and bitrate of the proposed method for various

τ_{1}

(block size

4 \times 4

).

Table 4. PSNR and bitrate of the proposed method for various

τ_{1}

(block size

4 \times 4

).

Images	AMBTC bpp = 2.0	PSNR (dB)			Bitrate (bpp)
Images	AMBTC bpp = 2.0	$τ_{1} = 16$	$τ_{1} = 32$	$τ_{1} = 64$	$τ_{1} = 16$	$τ_{1} = 32$	$τ_{1} = 64$
Lena	33.24	36.05	34.62	32.85	1.68	1.46	1.33
Jet	31.97	35.98	34.79	32.50	1.56	1.39	1.24
Baboon	26.98	30.89	29.55	25.74	2.75	2.34	1.75
Tiffany	35.77	37.55	35.72	34.26	1.54	1.34	1.25
Boat	31.16	34.27	32.87	30.81	2.09	1.77	1.57
Stream	28.59	32.53	30.20	27.27	2.73	2.11	1.66
Peppers	33.42	35.71	34.49	32.80	1.77	1.59	1.49
House	30.89	34.82	32.00	30.42	2.00	1.65	1.38
Average	31.50	34.73	33.03	30.83	2.02	1.71	1.46

Table 5. PSNR and bitrate of the proposed method for various

τ_{1}

(block size

8 \times 8

).

Table 5. PSNR and bitrate of the proposed method for various

τ_{1}

(block size

8 \times 8

).

Images	AMBTC bpp = 1.25	PSNR (dB)			Bitrate (bpp)
Images	AMBTC bpp = 1.25	$τ_{1} = 16$	$τ_{1} = 32$	$τ_{1} = 64$	$τ_{1} = 16$	$τ_{1} = 32$	$τ_{1} = 64$
Lena	29.93	33.12	31.80	29.24	1.02	0.76	0.51
Jet	28.84	32.86	31.72	29.40	1.04	0.82	0.62
Baboon	25.18	28.75	27.66	23.06	1.81	1.43	0.71
Tiffany	32.55	34.93	32.76	30.74	0.94	0.62	0.47
Boat	28.07	31.37	29.78	27.44	1.35	0.90	0.61
Stream	26.10	29.85	27.99	24.18	1.92	1.34	0.63
Peppers	29.66	32.80	31.49	29.43	1.06	0.78	0.58
House	27.68	31.60	30.20	26.92	1.39	1.02	0.62
Average	28.50	31.91	30.43	27.55	1.32	0.96	0.59

Table 6. Comparisons of PSNR and bitrate with block size

4 \times 4

.

Table 6. Comparisons of PSNR and bitrate with block size

4 \times 4

.

Images	Metrics	Proposed $τ_{1} = 8$	[16] $d_{\min} = 6$	Proposed $τ_{1} = 16$	[16] $d_{\min} = 7$	Proposed $τ_{1} = 24$	[16] $d_{\min} = 8$
Lena	PSNR	36.98	36.31	36.05	35.12	35.31	33.91
Lena	Bitrate	2.05	2.35	1.68	2.20	1.54	1.95
Jet	PSNR	36.53	34.96	35.97	33.74	35.36	32.69
Jet	Bitrate	1.80	1.98	1.57	1.87	1.45	1.71
Baboon	PSNR	31.28	31.40	30.89	29.59	30.32	28.11
Baboon	Bitrate	3.12	3.57	2.75	3.24	2.52	2.75
Tiffany	PSNR	38.98	38.42	37.55	37.29	36.46	36.41
Tiffany	Bitrate	1.92	2.12	1.54	1.99	1.40	1.81
Boat	PSNR	35.37	34.81	34.27	33.43	33.44	32.11
Boat	Bitrate	2.65	3.02	2.09	2.79	1.87	2.43
Stream	PSNR	32.98	32.69	32.53	31.14	31.46	29.69
Stream	Bitrate	3.07	3.45	2.73	3.18	2.39	2.75
Peppers	PSNR	37.11	36.38	35.71	35.23	35.03	34.16
Peppers	Bitrate	2.28	2.64	1.77	2.45	1.65	2.20
House	PSNR	35.32	35.06	34.82	33.52	33.99	31.86
House	Bitrate	2.28	2.52	2.00	2.35	1.80	2.05

Table 7. Comparisons of PSNR and bitrate with

8 \times 8

block size.

Table 7. Comparisons of PSNR and bitrate with

8 \times 8

block size.

Images	Metrics	Proposed $τ_{1} = 14$	[16] $d_{\min} = 28$	Proposed $τ_{1} = 22$	[16] $d_{\min} = 32$	Proposed $τ_{1} = 30$	[16] $d_{\min} = 36$
Lena	PSNR	33.32	31.88	32.66	30.85	32.00	30.27
Lena	Bitrate	1.09	1.64	0.90	1.45	0.79	1.27
Jet	PSNR	32.99	30.37	32.44	29.66	31.87	29.23
Jet	Bitrate	1.09	1.36	0.94	1.24	0.84	1.13
Baboon	PSNR	28.84	28.14	28.43	26.44	27.84	25.50
Baboon	Bitrate	1.88	2.23	1.65	1.88	1.47	1.50
Tiffany	PSNR	35.36	34.44	33.97	33.48	32.95	32.89
Tiffany	Bitrate	1.04	1.51	0.77	1.33	0.64	1.16
Boat	PSNR	31.62	30.12	30.68	29.11	29.95	28.54
Boat	Bitrate	1.46	1.99	1.11	1.75	0.93	1.50
Stream	PSNR	29.94	28.62	29.39	27.30	28.31	26.54
Stream	Bitrate	1.99	2.20	1.71	1.90	1.41	1.58
Peppers	PSNR	33.01	31.35	32.27	30.52	31.68	30.03
Peppers	Bitrate	1.14	1.86	0.91	1.62	0.81	1.40
House	PSNR	31.71	30.20	31.22	28.79	30.45	28.09
House	Bitrate	1.45	1.70	1.24	1.47	1.06	1.26

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, T.-S.; Wu, J.; Chen, K.S.; Yuan, J.; Hong, W. Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique. Appl. Sci. 2021, 11, 619. https://doi.org/10.3390/app11020619

AMA Style

Chen T-S, Wu J, Chen KS, Yuan J, Hong W. Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique. Applied Sciences. 2021; 11(2):619. https://doi.org/10.3390/app11020619

Chicago/Turabian Style

Chen, Tung-Shou, Jie Wu, Kai Sheng Chen, Junying Yuan, and Wien Hong. 2021. "Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique" Applied Sciences 11, no. 2: 619. https://doi.org/10.3390/app11020619

APA Style

Chen, T.-S., Wu, J., Chen, K. S., Yuan, J., & Hong, W. (2021). Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique. Applied Sciences, 11(2), 619. https://doi.org/10.3390/app11020619

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique

Abstract

1. Introduction

2. Related Works

2.1. The AMBTC Method

2.2. Xiang et al.’s Method

3. Proposed Method

3.1. Encoding of Flat Blocks

3.2. Encoding of Smooth Blocks

3.3. Encoding of Complex Blocks

3.4. Encoding Procedures

3.5. Decoding Procedures

4. Experimental Results

4.1. The Performance of the Proposed Method

4.1.1. Coding Efficiency Comparisons

4.1.2. Performance Comparison of Various $τ_{1}$

4.2. Comparisons with Xiang et al.’s Work

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Hybrid Encoding Scheme for AMBTC Compressed Images Using Ternary Representation Technique

Abstract

1. Introduction

2. Related Works

2.1. The AMBTC Method

2.2. Xiang et al.’s Method

3. Proposed Method

3.1. Encoding of Flat Blocks

3.2. Encoding of Smooth Blocks

3.3. Encoding of Complex Blocks

3.4. Encoding Procedures

3.5. Decoding Procedures

4. Experimental Results

4.1. The Performance of the Proposed Method

4.1.1. Coding Efficiency Comparisons

4.1.2. Performance Comparison of Various τ 1

4.2. Comparisons with Xiang et al.’s Work

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.1.2. Performance Comparison of Various $τ_{1}$