Next Article in Journal
Ordered Eigenvalue Decomposition Implementation on Systolic Arrays via Virtual Rewiring
Next Article in Special Issue
Standard-Oriented Architecture for AI-Powered Information Security Risk Management
Previous Article in Journal
Low Carbon Economic Dispatch of IES Considering Flexibility and Multi-Entity Participation Based on Improved PSO
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mini-Hide: Generative Image Steganography via Flip Watermarking for Reducing BER

1
Information and Communication Company State Grid Jiangxi Electric Power Co., Ltd., Nanchang 330095, China
2
Center for Modern Analysis and Gene Sequencing, Zhengzhou University, Zhengzhou 450001, China
3
School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(5), 939; https://doi.org/10.3390/electronics15050939
Submission received: 11 January 2026 / Revised: 12 February 2026 / Accepted: 18 February 2026 / Published: 25 February 2026

Abstract

Generative image steganography is a key technology for secure information transmission, but existing deep learning-based generative steganographic methods suffer from an extremely high bit error rate (BER) and degraded steganographic image quality in low-bit-rate embedding tasks in which secret information needs duplication or padding to match the model input size. In addition, it is difficult to balance BER reduction and imperceptibility of stego-images. To address these issues, this paper proposes a novel generative image steganography algorithm based on flip watermarking, with the core novelty of designing a mirror flipping preprocessing mechanism to achieve a redundant watermark and eliminate information errors caused by duplication or padding, and constructing an end-to-end Mini-Hide steganographic framework to integrate flip watermarking with generative steganography for the first time. Specifically, the proposed method first converts the binary bitstream of secret information into a square matrix, and performs vertical, horizontal and vertical–horizontal mirror flipping on the matrix to form a redundant basic watermark, which is then expanded to a secret image with the same size as the cover image. After that, the secret image is preprocessed by a preparation network and then input into an encoding network together with the cover image to generate a stego-image. Finally, the generated stego-image is input into the decoding network to extract the secret image. Subsequently, the inverse operation of flip watermarking is performed on the extracted secret image to recover the original binary bitstream. Extensive experiments are conducted on the public COCO dataset ( 256 × 256 pixels) with BER, PSNR, and SSIM, and the proposed method is compared with state-of-the-art generative steganographic methods. Quantitative results show that the proposed method achieves a 0 % BER for secret information of 8 × 8 to 64 × 64 bits, and the BER is only 0.00002 % for 256 × 256 -bit secret information; the PSNR of stego-images reaches 37.75 dB, and the SSIM hits 0.96, which are 7.07 dB and 0.02 higher than those of the classic HiDDeN method ( 64 × 64 bit) respectively. We also validated the flip watermark module by integrating into other methods; the results also show that the PSNR of FNNS-D is improved by 13.12 dB ( 256 × 256 ), and the BER of SteganoGAN is reduced by 99.99 % ( 256 × 256 bit). In addition, the proposed method breaks the embedding size limit of HiDDeN (≤ 64 × 64 bit) and supports up to 256 × 256 -bit secret information embedding with stable performance. This work significantly reduces the BER of generative image steganography while improving the visual quality of stego-images, provides a new preprocessing and optimization scheme for low-BER generative steganographic algorithm design, and also offers a universal lightweight module for performance improvement of existing steganographic methods, which has important theoretical and practical significance for enhancing the security and reliability of covert information transmission in the field of information security.

1. Introduction

Image steganography is a crucial branch in the field of information steganography, with its core process consisting of three key phases: information embedding, information transmission, and information extraction. In the information embedding phase, the secret information is first converted into a data format suitable for embedding (e.g., binary stream or image) and embedded into the cover signal (e.g, image, audio, or video) via specific steganographic algorithms. During the information transmission phase, the stego-signal is transmitted to the receiver through a public channel; this process typically requires ensuring no significant visual or auditory differences between the stego-signal and the original cover signal to avoid attracting attackers’ attention. In the information extraction phase, the receiver uses the corresponding extraction algorithm to recover the original secret information from the stego-signal. The core objective of this process is to achieve the secure transmission of secret information without arousing external suspicion.
Although existing steganographic methods can hide secret information, they suffer from relatively high detection rates, which seriously restricts the security of their practical applications. Most of these methods are based on simple algorithms (e.g., LSB [1], the Least Significant Bit replacement algorithm). While achieving a certain level of information hiding, the concealed data can be easily extracted through statistical feature-based analysis and other techniques. However, existing generative steganography methods, such as Balujia [2], HiDDeN [3], HiNet [4], etc., have made progress in terms of embedding capacity and imperceptibility, but perform poorly in steganographic tasks with secret information below 1 bit per pixel (bpp). This issue stems from the core design principle of these methods: embedding as much information as possible without compromising the quality of the cover medium. When applied to steganographic tasks with secret information below 1 bpp, the secret data typically requires duplication or padding to match the input size. Such processing easily introduces information errors during network transmission, not only leading to a significant increase in BER but also reducing steganographic efficiency. Furthermore, duplication and padding operations may further amplify errors during the information extraction phase, especially when the cover signal is subjected to noise interference or attacks. Therefore, determining how to reduce the BER while ensuring high imperceptibility has become a critical and urgent issue to be addressed in the field of image steganography.
To address the aforementioned issues, this paper proposes a novel generative image steganographic method based on flip watermarking, aiming to balance steganalysis resistance and bit error rate (BER) reduction. Compared with existing methods, the proposed approach embeds secret information in the form of flip watermark images, optimizing the information embedding process. Meanwhile, leveraging the generative capability of generative steganography, it generates highly imperceptible stego-images while reducing the BER. Extensive experiments have verified the effectiveness of the proposed method, which significantly improves the performance of steganographic tasks. This research not only provides an effective solution for the steganography field but also offers new perspectives and ideas for the further development of information steganographic technologies. The main contributions are summarized as follows:
(1)
A flip watermarking-based preprocessing method is proposed for the binary bitstream of secret information, which converts the binary bitstream into a square matrix and performs vertical, horizontal, and vertical–horizontal mirror flipping to construct a redundant watermark structure. This structure is then expanded to match the size of the cover image, fundamentally solving the problems of high BER and information errors caused by duplication/padding operations for size matching in traditional methods, and realizing efficient and low-error embedding of secret information.
(2)
The study systematically elaborates the intrinsic mechanism by which the redundant structure achieved through flipped watermarking reduces the global BER: it introduces an implicit regularization effect for steganographic perturbations and spatial correlation, optimizes gradient propagation to disperse error propagation during network training, and establishes a dual safeguard mechanism (nonlinear filtering, dynamic range alignment, hard-threshold binarization) for information recovery in the decoding phase. These findings provide a new theoretical basis for the design of low-BER steganographic algorithms.
(3)
By combining flip watermarking technology with generative image steganography, the study realizes the organic integration of the error tolerance of watermarking and the imperceptibility of generative models, breaking the single design idea of traditional generative steganography, which only focuses on “maximizing embedding capacity”. It provides a new research direction for the development of high-security low-BER information steganographic technologies, and has important practical value for the secure transmission of secret information.
The structure of this paper is organized as follows: Section 2 introduces related work; Section 3 elaborates on the generative image steganographic method based on flip watermarking in detail; Section 4 presents the experimental tests of this research in detail; and Section 5 concludes the paper.

2. Related Work

2.1. Information Steganography

The research on information steganography encompasses the embedding of secret information in various forms, including text, images, audio, and video. Based on technical principles and development stages, steganographic methods can be categorized into two main types: steganographic methods based on spatial or frequency domain pixel modification, and deep learning-based steganographic methods. Both types have been continuously optimized for embedding capacity and imperceptibility, but there are still critical shortcomings in BER and stego-image quality, especially for low-bit-rate embedding tasks.

2.1.1. Steganographic Methods Based on Spatial or Frequency Domain Pixel Modification

The LSB method is one of the earliest and most widely used steganographic techniques. It embeds secret information by directly modifying the least significant bits of pixels in the cover image, offering core strengths of simple implementation, high computational efficiency and lossless cover image quality. Similar spatial domain-based approaches include the improved LSB algorithm proposed by Imaizumi et al. [5], multi-bit plane steganography by Nguyen et al. [6], PVD-based steganography by Pan et al. [7], and the Hide-and-Seek [8] algorithm, while histogram shifting steganography [9] and palette steganography [10] are typical variants that optimize embedding capacity on the basis of LSB.
However, such methods are vulnerable to detection by professional steganalyzers: first, pixel-level modification will introduce obvious statistical feature changes in the cover image, which can be easily detected by modern steganalysis techniques (e.g., difference image histogram detection [11], preserved statistics-based steganalysis [12]), leading to extremely low security in practical applications; second, the embedding process lacks redundant design for information recovery, and the BER will rise sharply when the stego-image is subjected to noise interference or geometric distortion; third, the embedding capacity is limited by the pixel bit depth, and it is difficult to adapt to the high-capacity and low-BER requirements of modern covert communication. Studies by Dumitrescu et al. [13], Böhme [12], and Zhang et al. [11] have all confirmed that traditional steganographic methods cannot meet the security and robustness requirements of practical scenarios due to the above defects.

2.1.2. Deep Learning-Based Steganographic Methods

To overcome the limitations of traditional steganographic methods, researchers have proposed novel steganographic approaches based on deep neural networks, which leverage end-to-end learning frameworks to learn the optimal perturbation distribution of cover images, thus generating stego-images that are more difficult to detect. This category of methods has become the mainstream of current research, and has evolved into four main branches—handcrafted feature-based deep steganography, GAN-based generative steganography, invertible neural network (INN)-based steganography, and diffusion/flow-based steganography—each with distinct strengths and targeted weaknesses.
Handcrafted feature-based deep steganography: HUGO [14], WOW [15] and S-UNIWARD [16] are representative works. Their strengths lie in the design of adaptive distortion functions—HUGO prioritizes pixel modification with minimal distortion, while WOW and S-UNIWARD embed information in texture-rich regions of cover images (spatial/frequency domain), which effectively improves the imperceptibility of stego-images. However, their distortion functions are manually designed and thus lack generalization ability, and the embedding process is still based on pixel-level modification, so the anti-steganalysis ability is limited, and the BER is not optimized in the algorithm design.
GAN-based generative steganography: Generative Adversarial Networks (GANs [17]) have brought a paradigm shift to image steganography by generating stego-images through adversarial training, which significantly enhances imperceptibility. AdaSteg [18] realizes adaptive local information hiding via deep reinforcement learning, and SSGAN [19] even synthesizes cover images from scratch to obscure the source of secret information. Nevertheless, such methods have obvious limitations in low-bit-rate tasks: to match the model input size, secret information needs to be duplicated or padded, which directly leads to a sharp increase in BER; in addition, the training of GAN models is prone to mode collapse, which reduces the stability of steganographic performance.
INN-based steganography: Represented by Glow [20,21], RIIS [22], HiNet [4] and DeepMIH [23], this branch uses reversible mapping to realize efficient embedding and extraction of secret information, with high embedding capacity and low computational cost of recovery. However, their reversible network structure is sensitive to noise interference, and the BER will escalate rapidly when the stego-image is damaged; meanwhile, the INN-based model focuses on the design of invertible mapping and ignores the preprocessing optimization of secret information, so it cannot solve the BER problem caused by duplication/padding in low-bit-rate tasks.
Diffusion/flow-based steganography: With the rise in diffusion models and flow-based theory, CRoSS [24] realizes the conversion of secret images to cover images via a public–private key mechanism, and GSN [25] directly generates realistic cover images for covert communication, with high security and no need for explicit stego-image generation. Flow-based methods [26,27,28,29] also show advantages in image hiding and recovery due to their excellent feature modeling ability. However, these models suffer from high training cost, low embedding speed, and limited embedding capacity; in addition, similarly to other deep learning methods, they lack targeted optimization for BER in low-bit-rate tasks.
In recent years, the performance of deep learning-based steganographic methods in terms of imperceptibility has been continuously improved, as shown in Table 1, where the latest SOTA methods (e.g., DWT-GAN, SNR) even achieve a PSNR of over 44 dB and an SSIM of over 0.98. However, these methods all take PSNR and SSIM as the core optimization objectives; for low-bit-rate embedding tasks, the problem of high BER caused by secret information duplication/padding has not been effectively solved, and it is difficult to balance the three core requirements of imperceptibility, embedding capacity and low BER. This is the most critical research gap in the current generative image steganography field.

2.2. Flip Watermarking Methods

The concept of flip watermarking originates from periodic watermarking, which is a classic technology for copyright protection and information authentication. A representative example is the periodic watermarking scheme proposed by Voloshynovskiy et al. [42], which arranges watermark units in a periodic structure, and its main advantage is that the periodicity can be identified via the autocorrelation function (ACF) or magnitude spectrum (MS), thus realizing the recovery of watermarks damaged by geometric distortions. However, the periodic structure can easily be detected, and the robustness to nonlinear geometric warping is insufficient.
Building on periodic watermarking, Ma et al. [43] proposed an improved symmetric watermarking scheme, which performs flipping operations on watermark units and introduces a mask calculation mechanism during the flipped tiling stage. This flip watermarking scheme makes up for the defects of periodic watermarking, with strong resistance to geometric warping distortions and high watermark robustness; the redundant structure of watermark units that have undergone mirror flipping also provides a new idea for error reduction in information transmission. Nevertheless, existing watermark flipping methods have two main limitations: first, they are only applied to the field of watermarking, and have not been combined with image steganography technology to solve the core problems of steganography (e.g., BER, embedding capacity); second, the structure for generating flipped watermarks is not optimized for the input requirements of generative steganographic models, and cannot be directly adapted to the end-to-end steganographic framework.
To the best of our knowledge, there is no research that combines flip watermarking with generative image steganography—this is an important research blank between the two fields. The redundant watermark structure achieved by mirror flipping has a natural advantage in reducing information transmission errors, which can just solve the high BER problem of generative steganography in low-bit-rate tasks; meanwhile, the generative steganographic framework can make up for the defect of flip watermarking in imperceptibility, thus realizing the mutual complementarity of the two technologies.

2.3. Novelty of the Proposed Method Against Prior Works

To clearly highlight the uniqueness and innovation of the proposed flip watermarking-based generative image steganography method, this subsection systematically compares the proposed method with existing SOTA steganography (S) and watermarking (W) methods (Table 1) and classic generative steganography methods, and clarifies the improvements and differences from the perspectives of technical integration, problem solving, performance optimization and framework design. The key innovations and improvements are summarized as follows:
(1)
First integration of flip watermarking and generative image steganography: Existing flip watermarking methods are only used for watermarking, and existing generative steganography methods lack the design of redundant error reduction for secret information. The proposed method is the first to combine these two technologies, and optimizes the flipped-watermark-generation structure to adapt to the input requirements of the generative steganographic model, filling the research gap between the two fields.
(2)
Balancing three core indicators of steganography with a multi-objective loss function: The proposed method constructs a Mini-Hide framework consisting of a preparation network, an encoding network and a decoding network, and designs a total loss function fusing hiding loss and decoding loss. This framework realizes the simultaneous optimization of low BER, high imperceptibility and large embedding capacity, which makes up for the defect whereby existing SOTA methods cannot balance the three core indicators.
(3)
Universal optimization module for existing steganographic methods: The flip watermarking module designed in this paper is a lightweight and general module, and can be directly integrated into existing classic generative steganography methods. The experimental results show that the module can significantly reduce the BER of existing methods and improve the PSNR/SSIM indicators, making it a valuable optimization scheme for the existing steganography method system.

3. Methodology

This section elaborates on the core methodology of this paper, sequentially introducing the model structure and the generation process of flip watermarking, and finally summarizing the overall steganographic workflow and loss function configuration.

3.1. Model Structure

The proposed model is constructed based on Balujia [2], consisting of three core sub-networks—a preparatory network, an encoding network, and a decoding network—aiming to achieve efficient image steganography. Specifically, the preparatory network comprises 6 convolutional layers, each followed by a ReLU activation function, and is designed to extract the initial features of the secret image (denoted as I s ). The encoding network serves as the core module of the model, consisting of 16 pairs of “convolutional layer + ReLU activation function layer” (each pair follows the structure of a convolutional layer immediately succeeded by a ReLU activation function layer). Its function is to fuse the secret image features output by the preparatory network with the cover image, generating the final stego-image (denoted as I s t e g o ) to complete information hiding. The decoding network shares an identical structure with the encoding network, whose core role is to recover the original secret image from the stego-image.

3.2. The Proposed Flip Watermarking

Before embedding secret information into the cover image, it needs to be first converted into an image watermark format suitable for embedding. Inspired by the method proposed by Ma et al. [19], this paper designs a differentiated watermark generation method, whose process is illustrated in Figure 1. Through the custom watermark generation function, this method converts secret information into a watermark image. During the conversion process, the secret information to be hidden is firstly transformed into a binary matrix W o { 0 , 1 } m × m , where the elements take values of 0 or 1, representing black and white pixels, respectively. To expand the secret information and form a secret watermark image, the generated binary matrix W o is subjected to mirror flipping to produce four flipped versions. This process can be described by Equations (1)–(3) as follows:
W v = f l i p v e r t i c a l ( W o ) ,
W h = f l i p h o r i z o n t a l ( W o ) ,
W m = f l i p h o r i z o n t a l ( f l i p v e r t i c a l ( W o ) ) ,
where f l i p h o r i z o n t a l and f l i p v e r t i c a l represent horizontal mirror flipping and vertical mirror flipping, respectively. Then, we obtain a basic watermark W b by concatenating W o with W h , W v and W m as follows:
W b = W o W h W v W m { 0 , 1 } 2 m × 2 m .
Since the size of the base watermark may still be smaller than the dimensions H × W of the cover image, it may remain insufficient to match the size of the cover image. Therefore, the basic watermark W b needs to be copied multiple times until it is stitched to a macro watermark the same size as the cover image and saved as the final flipped watermark image. According to the image dimensions, the number of copies of the basic watermark and the cropping range after tiling can be calculated as
n h = H 2 k , n w = W 2 k .

3.2.1. Watermark Decoding

The process of watermark decoding consists of three key steps:
Step 1: Watermark Image Preprocessing: The receiving end converts the decoded watermark image into a grayscale image and eliminates color space discrepancies through dynamic range alignment. This process is described as follows:
X ^ r = X r 255 × ( m a x ( W b ) m i n ( W b ) ) + m i n ( W b )
where X r denotes the received decrypted secret watermark image.
Step 2: Hard-Decision Binarization: Threshold segmentation is performed on the preprocessed image to recover the binary watermark matrix, as shown in Equation (7):
X b i n = I ( X ^ r > 0.5 ) ,
where I ( · ) is the indicator function, which maps pixel values to the set { 0 , 1 } .
Step 3: Secret Information Extraction: Based on the watermark unit size recorded during watermark generation, the complete macroblock region is cropped, and the original secret block in the top-left corner is extracted, as defined in Equation (9):
a r e c = X b i n [ 0 : m , 0 : m ] ,
where a ^ denotes the recovered secret information, and m × m is the size of the original secret information.

3.2.2. Effects of Flip Watermarking

The redundant structure obtained by flip watermarking achieves error localization and reduces the global bit error rate (BER) through the following mechanisms:
(1) Regularization Effect of Mirror Redundancy: Flip watermarking expands secret information by performing vertical flip, horizontal flip, and vertical–horizontal flip, where each bit is duplicated to four different mirrored positions W o , W h , W v , W m in the watermark image. During the process of training the neural network, this mirror redundancy structure introduces an implicit regularization effect, mainly reflected in the consistency of steganographic perturbations and regularization of spatial correlation. For the consistency of steganographic perturbations, if the steganographic methods among W o , W h , W v , W m are inconsistent, the error between the stego-image and the cover image will increase, degrading steganographic quality. Therefore, the neural network will spontaneously learn that maintaining consistent perturbations across mirrored copies” is the optimal strategy, thereby reducing unnecessary steganographic noise and improving information recovery capability. For the regularization of spatial correlation, due to the symmetry of the watermark image in multiple directions, the network tends to adopt smooth perturbation patterns during optimization, avoiding over-modifying local pixels. This essentially acts as implicit regularization, stabilizing the steganographic embedding method and thus reducing the BER.
Mathematical Modeling of Mirror Redundancy Regularization Effect: Flip watermarking generates four mirrored versions of the original binary matrix W o { 0 , 1 } m × m W o (original), W h (horizontal flip), W v (vertical flip), W m (vertical–horizontal flip)—and concatenates them into the basic watermark W b { 0 , 1 } 2 m × 2 m . We define the steganographic pixel perturbation of the cover image at position ( i , j ) as:
Δ ( i , j ) = I stego ( i , j ) I c ( i , j )
where I c and I stego represent the cover image and stego-image, respectively.
For the mirrored positions ( i , j ) , ( i , 2 m j + 1 ) , ( 2 m i + 1 , j ) and ( 2 m i + 1 , 2 m j + 1 ) in W b , the perturbation consistency constraint imposed by the model’s hiding loss L C = I c I stego (Euclidean norm) can be expressed as:
Δ ( i , j ) = Δ ( i , 2 m j + 1 ) = Δ ( 2 m i + 1 , j ) = Δ ( 2 m i + 1 , 2 m j + 1 ) + ε
where ε is the perturbation error, and the model optimizes L C to minimize ε 0 .
BER reduction derivation: If perturbations at mirrored positions are inconsistent ( ε 0 ), the stego-image will produce unnecessary visual noise, leading to pixel value distortion in the watermark region during decoding and an increase in bit error probability P e . By enforcing perturbation consistency, flip watermarking reduces the random noise of the watermark region by O ( ε 2 ) , and the global BER is reduced as follows:
BER = 1 N k = 1 N P e ( k ) 1 N k = 1 N ε ( k ) 2
where N is the total number of bits of secret information. The mirror redundancy structure makes each bit of secret information correspond to four mirrored pixels, so the error tolerance threshold of a single bit is increased by 4 times, and the probability of a single bit error is reduced from P e to P e 4 .
(2) Optimization Effect of Gradient Propagation: During the training of the encoding/decoding network, the gradient of the total loss L total = L C + β L R with respect to the watermark pixel W b ( i , j ) is W b ( i , j ) L total . For the symmetric structure of W b , the gradient at mirrored positions satisfies:
W b ( i , j ) L total = W b ( i , 2 m j + 1 ) L total = W b ( 2 m i + 1 , j ) L total = W b ( 2 m i + 1 , 2 m j + 1 ) L total
This means the gradient of the loss function is uniformly propagated to the entire mirror region instead of being concentrated in a local pixel area.
Robustness improvement derivation: Local gradient concentration (a common problem in traditional steganographic methods) leads to overfitting of local pixel perturbations, making the stego-image sensitive to noise interference and geometric attacks. The uniform gradient propagation of flip watermarking makes the steganographic perturbation a smooth global pattern, and the perturbation variance of the watermark region is:
Var ( Δ ) = 1 4 m 2 i , j = 1 2 m ( Δ ( i , j ) Δ ¯ ) 2 Var ( Δ non - flip )
where Δ ¯ is the average perturbation of the watermark region, and Var ( Δ non - flip ) is the perturbation variance of non-flip watermarking. A smaller perturbation variance means the stego-image is less affected by external interference, thus improving robustness.
(3) Dual Safeguard Mechanism for Information Recovery: Flip watermarking not only optimizes the steganographic process but also provides additional error suppression capability during the decoding phase, mainly reflected in the following aspects: dynamic range alignment and hard-decision binarization, which eliminate systematic deviations and reduce decoding errors. A strict mathematical derivation for each step is given below:
Dynamic Range Alignment: This step eliminates the color space deviation between the decoded watermark image X r and the original basic watermark W b . The aligned pixel value is
X ^ r = X r 255 × max ( W b ) min ( W b ) + min ( W b ) = X r 255
since W b { 0 , 1 } , max ( W b ) = 1 and min ( W b ) = 0 . This step eliminates the systematic deviation δ = X r / 255 X r in the original pixel value, reducing the deviation-induced error probability by 100% for all pixels.
Hard-Decision Binarization: We use the indicator function I ( · ) to map the aligned pixel value to binary 0/1:
X bin = I X ^ r > 0.5 = 1 , X ^ r > 0.5 0 , X ^ r 0.5
The decoding error probability after binarization is defined as:
P dec = Pr X ^ r > 0.5 W b = 0 + Pr X ^ r 0.5 W b = 1
Dynamic range alignment makes X ^ r concentrate in the neighborhood of 0 or 1, and the probability density function (PDF) of X ^ r satisfies f ( X ^ r | W b = 0 ) δ ( X ^ r ) , and f ( X ^ r | W b = 1 ) δ ( X ^ r 1 ) ( δ ( · ) denotes the Dirac delta function), so P dec 0 .

3.3. Mini-Hide

The steganographic workflow of the proposed method, Mini-Hide, consists of the following main steps: secret information preprocessing, flipped watermark generation, image embedding, image transmission, decoding and extraction, and secret information recovery. The workflow is illustrated in Figure 2.
Step 1: Secret Information Preprocessing: First, this paper converts the secret information in binary form to be hidden in a square matrix. This step serves as the foundation for subsequent steganographic processes, as it determines the type and size of the embedded data.
Step 2: Flipped Watermark Generation: Next, a watermark generation algorithm is utilized to convert the binary secret information into a flipped watermark image.
Step 3: Image Embedding: Once the flipped watermark image is generated, it is input into the preparatory network together with the cover image. The encoding network embeds the watermark image into the cover image in an imperceptible manner, generating a disguised cover image, i.e., the stego-image. This process ensures the visual imperceptibility of the secret information while maintaining the integrity of the cover image.
Step 4: Image Transmission: The disguised cover image can be transmitted and shared like ordinary images. Since the embedded watermark is invisible, it does not arouse any suspicion.
Step 5: Decoding and Extraction: At the receiving end, the receiver uses the decoding network to extract the embedded flipped watermark image.
Step 6: Secret Information Recovery: Finally, the extracted flipped watermark image is converted back to binary data through a decoding function.This decoding function is the inverse operation of the watermark generation process, including inverse transformation and mapping from the matrix to binary data. Thus, the entire steganographic workflow is completed.

3.4. Loss Functions

To optimize the steganographic model proposed in this study, this paper constructs a multi-objective optimization framework, aiming to balance the improvement of secret image recovery capability and the maintenance of cover image integrity. The comprehensive loss function of this study consists of two components: one is the hiding loss, designed to enhance the hiding effectiveness of the model; the other is the decoding loss, which aims to improve the recovery performance of the model and further boost the security of steganography.

3.4.1. Hiding Loss

The hiding loss is applied in the secret encoding process, intended to ensure that after the secret watermark image is embedded into the cover image, the resulting stego-image is as similar as possible to the original cover image. Its mathematical expression is given by:
L C ( X c , X s ) = | | X c X s | | ,
where X c and X s denote the original cover image and the stego-image output by the model, respectively, and | | · | | represents the Euclidean norm. By minimizing L C , the model is trained to minimize the visual impact on the cover image, thereby enhancing the imperceptibility of steganography.

3.4.2. Decoding Loss

The decoding loss is designed to evaluate the model’s ability to recover secret information, quantified by calculating the difference between the secret image decoded by the decoding network and the original secret image. Its mathematical expression is given by
L R ( X m , X r ) = | | X m X r | | ,
where X m and X r denote the original secret image and the secret image output by the decoding network, respectively. By minimizing L R , the network is trained to accurately recover the original secret image from the stego-image.

3.4.3. The Total Loss Function

To balance the importance of the two loss functions, this study adopts a weighted average approach to fuse them into a total loss function. By adjusting the weight parameter β , the relative importance of the two loss functions in the total loss can be controlled. The mathematical expression of the total loss function is given as
L total = L C + β L R ,
where β is a hyperparameter used to balance the impacts of the two loss functions. In the experiments of this study, the optimal β value is selected through cross-validation.

3.5. Model Training

The proposed method was trained with Pytorch 2.0.1 and CUDA 11.8 on a server equipped with two NVIDIA RTX 3090s (24G), an Intel Xeon Gold 6330, and 128G RAM. We chose Adam as the optimizer. The batch size and training epochs were set to 32 and 1000, respectively. The learning rate was set to 1 × 10 3 , and it decayed by 10 % at intervals of 100 epochs. The loss weight was β [ 0.1 , 1.0 ] . Optimal values were determined via 5-fold cross-validation.

4. Experiments and Results

In this section, the experimental setup, datasets, experimental methods, and results of this study are depicted in detail. The goal is to evaluate the performance of our model in steganographic tasks with secret information of varying sizes and compare the performance differences among different methods.

4.1. Experimental Setup

This study designs four experimental schemes, using the COCO dataset and a self-generated watermark image dataset (size: 256 × 256 ). Each scheme undergoes 1000 training and testing iterations, with the test results averaged. The details of the schemes are as follows:
(1)
Mini-Hide: A total of 3200 images are randomly selected from the COCO dataset and divided into two groups (cover images and secret images) for training. In the testing phase, watermark images generated by the proposed method are used.
(2)
CMini-Hide: In the training phase, 1600 images are separately selected from the COCO dataset as cover images and secret images. For testing, watermark images generated by Ma et al. [43]. method are used for validation.
(3)
BSMini-Hide: A total of 1600 images from the COCO dataset are used as cover images. Watermark images containing 256 × 256 -bit secret information are generated using the proposed watermark generation algorithm for training. Testing also employs watermark images generated by the proposed method.
(4)
BCmin-Hide: A total of 1600 images are selected from the COCO dataset as cover images. Watermark images containing 256 × 256 -bit secret information, generated by [43], are used as secret images for training. Testing adopts watermark images generated by [43].
It should be additionally noted that due to the particularity of [43], if the secret information exceeds 64 × 64 bits during watermark image generation, the algorithm steps will be incomplete. Therefore, no comparative experiments are conducted for secret information exceeding this size.
Three evaluation metrics are adopted to measure the performance of the proposed model: BER, PSNR, and SSIM. BER is mainly used to evaluate whether the model can accurately recover the original secret information during watermark extraction. PSNR assesses the degree of image quality preservation after watermark embedding and extraction. As a supplementary metric, SSIM aims to evaluate the model’s overall performance from a more comprehensive perspective.

4.2. Experimental Results

4.2.1. Comparative Method Settings

This study uses 1000 randomly generated watermark images as experimental test subjects to evaluate the proposed watermarking technology. Comparisons with other existing deep learning-based steganographic methods, including Balujia [2], HiDDeN [3], Baluja [44], SteganoGAN [45], FNNS [46], SE-NDEND [35], RWP [40], and HHN-NDEND [41], demonstrate that the proposed method significantly outperforms these state-of-the-art approaches.
To ensure test fairness, the original settings of the steganographic methods from other papers were kept unchanged during testing, and the training methods described in the original papers were followed as closely as possible.

4.2.2. Comparison of Steganographic Quality

This study first conducted tests on the PSNR with the comparative results presented in Table 2. A comprehensive analysis of the experimental data reveals that the proposed watermarking technology exhibits excellent performance in terms of the PSNR metric. This result indicates that steganography based on encoding secret information into watermark images possesses distinct advantages in visual quality and distortion control. In addition, it is evident that the training schemes of BSMini-Hide and BCmin-Hide achieved better PSNR results than those of Mini-Hide and CMini-Hide, and the same trend is observed in the subsequent SSIM test results. Regarding HiDDeN [3], it could not be compared when the secret information exceeded 64 × 64 bits due to its design constraints–it does not support embedding information exceeding 0.2 bits per pixel (bpp). Therefore, this technology is not suitable for comparative analysis when the size of secret information scales beyond 128 × 128 bits. Furthermore, the test results of the SSIM are shown in Table 3.
Figure 3 presents the steganographic effect diagrams of the proposed method. As illustrated in Figure 3, for different sizes of secret information, corresponding watermark images are generated and embedded into cover images to obtain stego-images. It can be clearly observed that the stego-images are almost indistinguishable from the original cover images. Additionally, the proposed method compares the residual maps between stego-images and cover images under different secret information embedding scenarios. It should be noted that due to the similar residual distributions of different variants of the Mini-Hide method, only the residual map of one variant is presented as a reference, as shown in Figure 4. From the residual maps, it can be observed that after embedding secret information of varying sizes, the pixel changes in the residual parts are minimal, mainly concentrated in high-frequency regions (e.g., edges and details), while being almost invisible in low-frequency regions (e.g., sky and flat roads). This demonstrates that the proposed method can effectively preserve the overall structure and visual quality of the image.
Although the aforementioned PSNR and SSIM test results tend to indicate that the training schemes of BSMini-Hide and BCmin-Hide are slightly superior, the exact opposite trend is observed in the subsequent bit error rate tests. The experimental results in Table 3 confirm that the training scheme where both cover images and secret images are selected from the COCO dataset achieves better BER performance than BSMini-Hide and BCmin-Hide.

4.2.3. Comparison of Steganographic Bit Error Rates

Finally, this study further compares the bit error rates (BERs) of various algorithms. The BER is defined as the ratio of the number of erroneous bits in the decoded secret information to the total number of bits in the original secret information. The average value is calculated based on 1000 experimental tests. Detailed comparative results are presented in Table 4. It should be clarified that a lower BER indicates superior performance of the method. Methods found in this study to exhibit excellent BER performance are highlighted in bold. It can be observed that the BER of the proposed watermarking methods is generally maintained at an extremely low level, where “0” indicates an exact BER of 0 rather than a rounded value.
Based on the comprehensive analysis of the above experiments, the following conclusion is drawn: the watermarking-based steganographic method proposed in this paper significantly outperforms other existing steganographic methods in terms of BER.

4.2.4. Performance Gains of Mini-Hide

This subsection verifies the comprehensive advantages of the proposed method by comparing and analyzing the performance gains of the Mini-Hide watermarking method against mainstream existing methods (SteganoGAN, FNNS-D, Baluja) in two dimensions: BER and visual imperceptibility (PSNR/SSIM). Gain experiments were not conducted on HiDDeN due to incompatibility between its information embedding mechanism and the proposed method. Specifically, the watermark generation module of the proposed method was integrated into other mainstream methods for verification. The gain effects are presented as follows: Table 5 shows the PSNR gain effects of different methods, Table 6 presents the SSIM gain effects, and Table 7 illustrates the BER gain effects. By comparing Table 2, Table 3 and Table 4 with the gain tables, it can be observed that the Mini-Hide watermarking method can significantly reduce the decoding BER of existing methods and effectively improve image visual quality.
To evaluate the robustness of the proposed method against common Gaussian noise and JPEG compression attacks in steganographic tasks, we conducted quantitative tests on the PSNR, SSIM and BER for secret information with sizes ranging from 8 × 8 to 256 × 256 bits under different attack intensities, including Gaussian noise with σ = 0.01 , 0.03 , 0.05 and JPEG compression with quality factors Q = 75 , 85 , 95 , as shown in Table 8, Table 9 and Table 10. The experimental results show that the Mini-Hide model exhibits excellent anti-interference performance, with all key performance metrics suffering only minimal degradation under various attack scenarios. Specifically, the PSNR of the model remains above 27.81 dB in all attack settings, with only a slight gradual decrease as attack intensity increases, and there is almost no fluctuation in PSNR values across different secret information sizes. For SSIM, the model achieves nearly consistent results with the attack-free state under low-intensity attacks ( σ 0.03 , Q 85 ); a mild reduction is only observed under high-intensity attacks ( σ = 0.05 , Q = 75 ), and the minimum SSIM still remains at 0.78, which fully reflects the model’s strong ability to preserve the structural similarity of stego-images under interference. In terms of BER—the core metric for secret information recovery—the Mini-Hide model already maintains an ultra-low BER level in the attack-free state, with 0% BER for 8 × 8 to 32 × 32-bit secret information and only 0.00001% and 0.00002% BER for 128 × 128 and 256 × 256-bit secret information, respectively. Under low-intensity attacks ( σ = 0.01 , Q = 95 ), the model’s BER shows no detectable changes compared with the attack-free state; as attack intensity increases, only the BER of secret information with sizes of 64 × 64 bits and above has a marginal rise, while small-sized secret information (8 × 8–32 × 32 bits) consistently maintains a 0% BER. It is worth noting that all BER increments across different secret information sizes are extremely slight, and no obvious error amplification phenomenon occurs in the model during the information recovery process. In summary, the Mini-Hide model has strong robustness against Gaussian noise and JPEG compression attacks, and it can still ensure the high visual quality of stego-images and ultra-low bit error rate for secret information recovery under the interference by such common attacks.

4.3. Ablation Study

The quantitative results of the Flip vs. No-Flip experiment are presented in Table 11, which includes the BER, PSNR, and SSIM of the two models for all secret information sizes and the corresponding performance change rates. The key results are highlighted (all BER values are exact, not rounded):

Key Quantitative Findings

(1) BER Reduction (Core Effect): The flip watermarking module reduces the BER by 0.00418 % to 0.0921 % for all secret information sizes, and achieves 0 % BER for small-to-medium secret information (8 × 8 to 64 × 64 bits). In contrast, the no-flip model has a significantly higher BER that decreases with an increase in secret information size (consistent with the error amplification characteristic of traditional duplication/padding processing). (2) Slight Visual Quality Trade-off: The flip model has a minor drop in PSNR ( 0.23 0.31 dB) and SSIM ( 0.01 ) compared with the no-flip model. This slight trade-off is intentional and reasonable: the mirror redundancy structure of flip watermarking introduces minimal visual perturbations to the cover image, but in return achieves a dramatic reduction in BER. (3) Scalability Advantage: For large-scale secret information (128 × 128/256 × 256 bits), the no-flip model’s BER still reaches 0.0095 % / 0.0042 % , while the flip model’s BER is only 0.00001 % / 0.00002 % ( 99 . 89 % / 99 . 52 % BER reduction), which verifies the excellent scalability of the flip watermarking module for large-capacity steganography—an advantage that traditional duplication/padding processing does not have.

4.4. Discussions

This section systematically interprets the experimental results of the proposed flip watermarking-based generative image steganography method (Mini-Hide), and objectively analyzes the core performance advantages, intrinsic trade-offs between redundancy and imperceptibility, inherent limitations of the method, practical application scenarios, and feasible future research directions.

4.4.1. Interpretation of BER Improvements

The most significant performance gain of the proposed Mini-Hide method is the reduction in the bit error rate of secret information recovery, which is the core solution to the research gap pertaining to high BER in existing generative steganography methods. The experimental results in Table 7 show that the Mini-Hide method achieves 0 % BER for small-to-medium secret information (8 × 8 to 64 × 64 bits), and only 0.00001 % and 0.00002 % BER for large-scale secret information (128 × 128 and 256 × 256 bits), which are far superior values to those achieved by state-of-the-art steganography methods. The fundamental reasons for this BER improvement are threefold, and are mutually reinforcing: Mirror redundancy of flip watermarking eliminates random bit errors: Each bit of the original secret information is mapped to four mirrored positions in the watermark image, forming 4-fold information redundancy. This structure introduces implicit regularization into the network training process, forcing the model to maintain consistent steganographic perturbations at mirrored positions (Equation (10)), thus eliminating unnecessary pixel noise and random bit errors caused by unstructured perturbations in traditional duplication/padding processing (Table 11 shows that the no-flip model has a BER of 0.0642 % for 64 × 64 bits).
Uniform gradient propagation suppresses error amplification: The symmetric structure of the flip watermark ensures that the loss gradient is uniformly propagated to the entire watermark region (Equation (12)), avoiding local gradient concentration in traditional methods that leads to overfitting of pixel perturbations. This disperses the propagation of local bit errors and prevents error amplification for large-scale secret information.
Decoding-stage error suppression optimizes recovery accuracy. The dedicated decoding process eliminates systematic deviations caused by color space discrepancies and floating-point pixel values, reducing the decoding error probability to near 0 (Equation (16)). This forms a dual error suppression mechanism with the embedding-stage optimization, further ensuring the accuracy of secret information recovery.
In addition, the performance gain experiment (Table 4, Table 5 and Table 6) shows that integrating the flip watermarking module into mainstream methods can reduce their BER by 90– 99.9 % , which verifies that the BER improvement effect of flip watermarking is universal and transferable, not limited to the Mini-Hide model itself. This fully demonstrates the rationality and effectiveness of the flip watermarking design for solving the high BER problem in generative steganography.

4.4.2. Trade-Offs Between Redundancy and Imperceptibility

The flip watermarking method achieves ultra-low BER by introducing mirror redundancy, and this redundancy inevitably brings a slight trade-off with the imperceptibility of steganography, which is evaluated by PSNR and SSIM. The experimental results (Table 11) quantitatively show that compared with the no-flip model (without redundancy), the Mini-Hide model has a 0.23–0.31 dB drop in PSNR and a 0.01 drop in SSIM for all secret information sizes, which is the direct cost of introducing flip redundancy. However, this trade-off is reasonable, acceptable, and non-detrimental to steganographic security.

4.5. Limitations

Based on the experimental results and practical application tests, the proposed Mini-Hide method may have the following potential deficiencies: redundancy-induced computational and capacity constraints for ultra-large-scale secret information, insufficient robustness against strong geometric attacks, and slight imperceptibility loss for high-resolution cover images. The flip watermarking method relies on 4-fold mirror redundancy and tiling redundancy to reduce BER, and when the secret information size exceeds 512 × 512 bits when the size of cover images is 256 × 256, the repeated tiling of the basic watermark will increase the computational complexity and lower the redundancy. The proposed flip watermarking method only considers mirror symmetry, and lacks a dedicated geometric distortion correction mechanism (e.g., feature point matching, affine transformation correction) in the decoding stage, so it is sensitive to geometric attacks. The Mini-Hide method is optimized for 256 × 256 cover images, which is the mainstream size in steganography, and when applied to high-resolution cover images such as 512 × 512 and 1024 × 1024, the PSNR drop may increase due to the large number of watermark tilings.

4.6. Potential Applications

The proposed flip watermarking-based generative steganography method has low BER and good imperceptibility, and its application scenarios are highly matched with the demand for secure and covert data transmission in various fields, the method has important practical application value in the power system, and can be extended to digital media, government affairs, and military fields.

5. Conclusions

This study proposes a novel generative image steganography algorithm based on flip watermarking, aiming to solve the critical problem of high BER in existing deep learning-based generative steganography methods. The core innovation of the method is to design a flip watermarking preprocessing strategy for secret binary bitstreams, which constructs a mirror redundancy structure through vertical, horizontal, and vertical–horizontal flipping, and combines it with a three-subnetwork generative model to form the end-to-end Mini-Hide steganography framework. Extensive experimental results show that the proposed method achieves low BER and maintains good steganographic imperceptibility. Integrating the flip watermarking module into mainstream steganography methods can also significantly reduce their BER and improve their visual quality, verifying the universality and effectiveness of the flip watermarking technology. The in-depth discussion of the experimental results further clarifies the core reasons for BER improvement, the rationality of the trade-off between redundancy and imperceptibility, and the inherent limitations of the method, and expands the practical application scenarios. In the future, we will focus on solving the limitations of the current method, optimize the model from the aspects of adaptive redundancy, attack robustness, lightweight design and multi-modal fusion, and further improve the practical application value of the method in the field of information security.

Author Contributions

Conceptualization, R.Q. and Y.W.; methodology, R.Q., R.F. and Z.L.; validation, N.C. and Y.W.; formal analysis, C.Y.; investigation, Y.W. and C.Y.; writing—original draft preparation, R.Q., N.C. and Y.W.; writing—review and editing, Y.W., R.F., and C.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by State Grid JiangXi Information & Telecommunication Branch Project Funding (Contract No.:521835250007).

Institutional Review Board Statement

Ethical review and approval were waived for this study since no actual experimental subjects were involved in the research.

Informed Consent Statement

Informed consent was obtained from all subjects involved in this study.

Data Availability Statement

This study uses the publicly available COCO (Common Objects in Context) dataset, which can be accessed from the official website: https://cocodataset.org/, accessed on 8 January 2026. No new datasets were generated or analyzed during the current study.

Conflicts of Interest

Authors Rixuan Qiu, Zhiyuan Luo, Ruixiang Fan and Na Cao were employed by the Information and communication company, State grid Jiangxi Electric Power, Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Abdelfatah, A.; Ayman, M.A.; Omaima, A. Hiding an image inside another image using variable-rate steganography. Int. J. Adv. Comput. Sci. Appl. 2013, 4, 18–21. [Google Scholar] [CrossRef]
  2. Baluja, S. Hiding Images within Images. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 1685–1697. [Google Scholar] [CrossRef] [PubMed]
  3. Zhu, J.; Kaplan, R.; Johnson, J.; Fei-Fei, L. HiDDeN: Hiding Data with Deep Networks. In Computer Vision—ECCV 2018; Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Eds.; ECCV 2018; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2018; pp. 682–697. [Google Scholar]
  4. Jing, J.; Deng, X.; Xu, M.; Wang, J.; Guan, Z. HiNet: Deep Image Hiding by Invertible Network. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 4713–4722. [Google Scholar]
  5. Imaizumi, S.; Ozawa, K. Multibit Embedding Algorithm for Steganography of Palette-Based Images. In Image and Video Technology. PSIVT 2013; Klette, R., Rivera, M., Satoh, S., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2014; pp. 99–110. [Google Scholar]
  6. Nguyen, B.C.; Yoon, S.M.; Lee, H.K. Multi Bit Plane Image Steganography. In Digital Watermarking. IWDW 2006; Shi, Y.Q., Jeon, B., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2006; pp. 61–70. [Google Scholar]
  7. Pan, F.; Li, J.; Yang, X. Image steganography method based on PVD and modulus function. In Proceedings of the 2011 International Conference on Electronics, Communications and Control (ICECC), Ningbo, China, 9–11 September 2011; pp. 282–284. [Google Scholar]
  8. Provos, N.; Honeyman, P. Hide and seek: An introduction to steganography. IEEE Secur. Priv. 2003, 1, 32–44. [Google Scholar] [CrossRef]
  9. Tsai, P.; Hu, Y.C.; Yeh, H.L. Reversible image hiding scheme using predictive coding and histogram shifting. Signal Process. 2009, 89, 1129–1143. [Google Scholar] [CrossRef]
  10. Niimi, M.; Noda, H.; Kawaguchi, E.; Eason, R.O. High capacity and secure digital steganography to palette-based images. In Proceedings of the International Conference on Image Processing, Rochester, NY, USA, 22–25 September 2002; p. II. [Google Scholar]
  11. Zhang, T.; Ping, X. Reliable detection of LSB steganography based on the difference image histogram. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, 6–10 April 2003; p. III-545. [Google Scholar]
  12. Böhme, R.; Westfeld, A. Exploiting Preserved Statistics for Steganalysis. In Information Hiding. IH 2004; Fridrich, J., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2004; pp. 82–96. [Google Scholar]
  13. Dumitrescu, S.; Wu, X.; Memon, N. On steganalysis of random LSB embedding in continuous-tone images. In Proceedings of the International Conference on Image Processing, Rochester, NY, USA, 22–25 September 2002; pp. 641–644. [Google Scholar]
  14. Pevný, T.; Filler, T.; Bas, P. Using High-Dimensional Image Models to Perform Highly Undetectable Steganography. In Information Hiding. IH 2010; Böhme, R., Fong, P.W.L., Safavi-Naini, R., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2010; pp. 161–177. [Google Scholar]
  15. Holub, V.; Fridrich, J. Designing steganographic distortion using directional filters. In Proceedings of the 2012 IEEE International Workshop on Information Forensics and Security (WIFS), Costa Adeje, Spain, 2–5 December 2012; pp. 234–239. [Google Scholar]
  16. Holub, V.; Fridrich, J.; Denemark, T. Universal distortion function for steganography in an arbitrary domain. EURASIP J. Inf. Secur. 2014, 2014, 1. [Google Scholar] [CrossRef]
  17. Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Proceedings of the 28th International Conference on Neural Information Processing Systems–Volume 2; MIT Press: Cambridge, MA, USA, 2014; pp. 2672–2680. [Google Scholar]
  18. Pan, W.; Yin, Y.; Wang, X.; Jing, Y.; Song, M. Seek-and-Hide: Adversarial Steganography via Deep Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 7871–7884. [Google Scholar] [CrossRef] [PubMed]
  19. Shi, H.; Dong, J.; Wang, W.; Qian, Y.; Zhang, X. SSGAN: Secure Steganography Based on Generative Adversarial Networks. In Advances in Multimedia Information Processing—PCM 2017; Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2018; pp. 534–544. [Google Scholar]
  20. Dinh, L.; Krueger, D.; Bengio, Y. NICE: Non-linear Independent Components Estimation. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
  21. Kingma, D.P.; Dhariwal, P. Glow: Generative Flow with Invertible 1x1 Convolutions. In Advances in Neural Information Processing Systems; Bengio, S., Wallach, H., Larochelle, H., Grauman, K., Cesa-Bianchi, N., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2018; Volume 31. [Google Scholar]
  22. Xu, Y.; Mou, C.; Hu, Y.; Xie, J.; Zhang, J. Robust Invertible Image Steganography. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 7865–7874. [Google Scholar]
  23. Guan, Z.; Jing, J.; Deng, X.; Xu, M.; Jiang, L.; Zhang, Z. DeepMIH: Deep Invertible Network for Multiple Image Hiding. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 372–390. [Google Scholar] [CrossRef] [PubMed]
  24. Yu, J.; Zhang, X.Y.; Xu, Y.M.; Zhang, J. CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography. In Advances in Neural Information Processing Systems; Oh, A., Naumann, T., Globerson, A., Saenko, K., Hardt, M., Levine, S., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2023; pp. 80730–80743. [Google Scholar]
  25. Wei, P.; Li, S.; Zhang, X.P.; Luo, G.; Qian, Z.X.; Zhou, Q. Generative Steganography Network. In Proceedings of the 30th ACM International Conference on Multimedia; Association for Computing Machinery: New York, NY, USA, 2022; pp. 1621–1629. [Google Scholar]
  26. Abdelhamed, A.; Brubaker, M.; Brown, M. Noise Flow: Noise Modeling with Conditional Normalizing Flows. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 3165–3173. [Google Scholar]
  27. Nielsen, D.; Jaini, P.; Hoogeboom, E.; Winther, O.; Welling, M. SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows. In Advances in Neural Information Processing Systems; Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2020; pp. 12685–12696. [Google Scholar]
  28. Liang, J.; Zhang, K.; Gu, S.; Gool, L.V.; Timofte, R. Flow-based Kernel Prior with Application to Blind Super-Resolution. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 10596–10605. [Google Scholar]
  29. Ho, J.; Chen, X.; Srinivas, A.; Duan, Y.; Abbeel, P. Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA, 9–15 June 2019; pp. 2722–2730. [Google Scholar]
  30. Bui, T.; Agarwal, S.; Yu, N.; Collomosse, J. RoSteALS: Robust steganography using autoencoder latent space. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Vancouver, BC, Canada, 17–24 June 2023. [Google Scholar]
  31. Chen, X.; Kishore, V.; Weinberger, K.Q. Learning Iterative Neural Optimizers for Image Steganography. In Proceedings of the International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
  32. Huang, J.; Luo, T.; Li, L.; Yang, G.; Xu, H.; Chang, C.C. ARWGAN: Attention guided robust image watermarking model based on GAN. IEEE Trans. Instrum. Meas. 2023, 72, 5018417. [Google Scholar] [CrossRef]
  33. Wang, B.; Wu, Y.; Wang, G. Adaptor: Improving the robustness and imperceptibility of watermarking by the adaptive strength factor. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 6260–6272. [Google Scholar] [CrossRef]
  34. Yao, Y.; Wang, J.; Chang, Q.; Ren, Y.; Meng, W. High invisibility image steganography with wavelet transform and generative adversarial network. Expert Syst. Appl. 2024, 249, 123540. [Google Scholar] [CrossRef]
  35. Chai, X.; Tang, Z.; Gan, Z.; Lu, Y.; Wang, B.; Zhang, Y. SE-NDEND: A novel symmetric watermarking framework with neural network-based chaotic encryption for Internet of Medical Things. Biomed. Signal Process. Control 2024, 90, 105877. [Google Scholar] [CrossRef]
  36. Duan, D.; Shen, S.; Yu, S.; Yuan, Y.; Zhou, Q.; Lv, H.; Lin, H. DenseJIN: Dense Depth Image Steganography Model with Joint Invertible and Noninvertible Mechanisms. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 1631–1647. [Google Scholar] [CrossRef]
  37. Zhang, L.; Li, T.; Lu, Y.; Xu, Y.; Lu, G. Efficient U-shape invertible neural network for large-capacity image steganography. J. Inf. Secur. Appl. 2025, 94, 104237. [Google Scholar] [CrossRef]
  38. Yang, C.; Wang, S.; Huang, Y.; Guo, M. SNR: One single network for image steganography with robust post-save recovery. Neurocomputing 2025, 651, 130929. [Google Scholar] [CrossRef]
  39. Jiang, J.; Wang, Z.; Zhang, X. Image-to-Image Steganography based on multimodal generative model. Signal Process. 2026, 238, 110106. [Google Scholar] [CrossRef]
  40. Liu, Z.; Zhang, J.; Dong, Y.; Song, B. RWP: A robust watermarking plugin for attribution and protection in stable diffusion models. Neural Netw. 2026, 198, 108626. [Google Scholar] [CrossRef] [PubMed]
  41. Yang, X.; Chai, X.; Zhang, Y.; Gan, Z.; Zhao, X.; Wang, B. HHN-NDEND: A symmetric watermarking framework based on hyperchaotic encryption for medical image protection in IoMT. Expert Syst. Appl. 2026, 306, 130957. [Google Scholar] [CrossRef]
  42. Voloshynovskiy, S.; Deguillaume, F.; Pun, T. Multibit digital watermarking robust against local nonlinear geometrical distortions. In Proceedings of the 2001 International Conference on Image Processing, Thessaloniki, Greece, 7–10 October 2001; pp. 999–1002. [Google Scholar]
  43. Ma, Z.; Zhang, W.; Fang, H.; Dong, X.; Geng, L.; Yu, N. Local Geometric Distortions Resilient Watermarking Scheme Based on Symmetry. IEEE Trans. Circuits Syst. Video Technol. 2021, 31, 4826–4839. [Google Scholar] [CrossRef]
  44. Baluja, S. Hiding Images in Plain Sight: Deep Steganography. In Advances in Neural Information Processing Systems; Guyon, I., Von Luxburg, U., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
  45. Zhang, K.; Kumar, A.; Liao, X. SteganoGAN: High Capacity Image Steganography with GANs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 11444–11452. [Google Scholar]
  46. Kishore, V.; Chen, X.Y.; Wang, Y.; Li, B.Y.; Weinberger, K.Q. Fixed Neural Network Steganography: Train the images, not the network. In Proceedings of the International Conference on Learning Representations, Virtual, 25–29 April 2022. [Google Scholar]
Figure 1. Process of generating the inverted watermark image.
Figure 1. Process of generating the inverted watermark image.
Electronics 15 00939 g001
Figure 2. Framework of Mini-Hide steganography.
Figure 2. Framework of Mini-Hide steganography.
Electronics 15 00939 g002
Figure 3. Comparison of carrier images and stego-images using the Mini-Hide method proposed in this paper.
Figure 3. Comparison of carrier images and stego-images using the Mini-Hide method proposed in this paper.
Electronics 15 00939 g003
Figure 4. Comparison of the residual maps between the carrier image and the stego-image using the BSMini-Hide method in this paper.
Figure 4. Comparison of the residual maps between the carrier image and the stego-image using the BSMini-Hide method in this paper.
Electronics 15 00939 g004
Table 1. Performance comparison of existing SOTA deep learning-based steganography and watermarking methods in terms of PSNR and SSIM. S and W respectively denote steganography and watermarking.
Table 1. Performance comparison of existing SOTA deep learning-based steganography and watermarking methods in terms of PSNR and SSIM. S and W respectively denote steganography and watermarking.
YearModelPub.TypePSNR (dB)SSIM
2023RoSteALS [30]CVPRS34.4600.890
2023LISO [31]ICLRS33.8300.900
2023ARWGAN [32]TIMW36.6600.969
2023Adaptor [33]TSCVTW37.6300.953
2024DWT-GAN [34]ESWAS44.5700.993
2024SE-NDEND [35]BSPCW45.84920.9874
2025DenseJIN [36]ITCSS40.2530.978
2025EUIN-Net [37]JISAS34.9600.980
2025SNR [38]NeurocomputingS44.11310.9876
2026I2IStega [39]SPS36.1600.910
2026RWP [40]NNW32.990.952
2026HHN-NDEND [41]ESWAW47.6830.989
Table 2. PSNR comparison of carrier stego-images with different secret message sizes (bit) using various steganography methods.
Table 2. PSNR comparison of carrier stego-images with different secret message sizes (bit) using various steganography methods.
Models8 × 8
PSNR ↑
16 × 16
PSNR ↑
32 × 32
PSNR ↑
64 × 64
PSNR ↑
128 × 128
PSNR ↑
256 × 256
PSNR ↑
HiDDeN [3] (2018)32.14 ± 0.3129.70 ± 0.3031.07 ± 0.3130.98 ± 0.30
SteganoGAN [45] (2019)16.32 ± 0.1616.32 ± 0.1616.31 ± 0.1616.32 ± 0.1616.30 ± 0.1616.39 ± 0.17
FNNS-D [46] (2022)35.96 ± 0.2835.59 ± 0.2734.98 ± 0.2834.33 ± 0.2832.07 ± 0.2727.70 ± 0.29
SE-NDEND [35] (2024)45.85 ± 0.4345.19 ±0.4344.67 ±0.4343.28 ± 0.4340.03 ± 0.4334.62 ± 0.43
RWP [40] (2026)32.99 ± 0.3732.54 ±0.3731.75 ±0.3631.32 ± 0.3530.17 ± 0.3526.38 ± 0.32
HHN-NDEND [41] (2026)47.68 ± 0.4446.91 ±0.4346.26 ±0.4445.43 ± 0.4143.86 ± 0.4138.20 ± 0.39
Mini-Hide27.92 ± 0.2527.76 ± 0.2527.88 ± 0.2527.89 ± 0.2527.86 ± 0.2527.88 ± 0.25
CMini-Hide28.74 ± 0.2628.59 ± 0.2628.61 ± 0.2632.18 ± 0.27
BSMini-Hide37.72 ± 0.3037.74 ± 0.3037.75 ± 0.3037.71 ± 0.3037.75 ± 0.3037.75 ± 0.30
BCmin-Hide36.85 ± 0.2936.88 ± 0.2936.75 ± 0.2937.71 ± 0.30
Table 3. Comparison of SSIM between cover images and stego-images with different bit sizes of secret information under various steganography methods.
Table 3. Comparison of SSIM between cover images and stego-images with different bit sizes of secret information under various steganography methods.
Models8 × 8
SSIM ↑
16 × 16
SSIM ↑
32 × 32
SSIM ↑
64 × 64
SSIM ↑
128 × 128
SSIM ↑
256 × 256
SSIM ↑
Balujia [2] (2017)0.88 ± 0.020.86 ± 0.020.86 ± 0.020.87 ± 0.020.83 ± 0.020.82 ± 0.02
HiDDeN [3] (2018)0.94 ± 0.010.92 ± 0.010.94 ± 0.010.94 ± 0.01
SteganoGAN [45] (2019)0.69 ± 0.020.68 ± 0.020.66 ± 0.010.63 ± 0.010.60 ± 0.010.60 ± 0.01
FNNS-D [46] (2022)0.94 ± 0.010.94 ± 0.010.93 ± 0.010.92 ± 0.010.85 ± 0.020.66 ± 0.03
SE-NDEND [35] (2024)0.98 ± 0.020.98 ± 0.010.97 ± 0.010.94 ± 0.020.89 ± 0.020.73 ± 0.03
RWP [40] (2026)0.95 ± 0.010.95 ± 0.010.93 ± 0.010.91 ± 0.020.84 ± 0.020.70 ± 0.03
HHN-NDEND [41] (2026)0.99 ± 0.010.97 ± 0.010.94 ± 0.010.93 ± 0.020.90 ± 0.020.81 ± 0.03
Mini-Hide0.83 ± 0.010.82 ± 0.010.82 ± 0.120.82 ± 0.020.82 ± 0.020.82 ± 0.02
CMini-Hide0.86 ± 0.010.86 ± 0.010.86 ± 0.010.93 ± 0.02
BSMini-Hide0.95 ± 0.010.94 ± 0.010.96 ± 0.010.96 ± 0.010.96 ± 0.010.96 ± 0.01
BCmin-Hide0.95 ± 0.010.94 ± 0.010.95 ± 0.010.96 ± 0.01
Table 4. Comparison of steganographic decoding error rates (ER) for secret messages of different bit sizes across various steganography methods.
Table 4. Comparison of steganographic decoding error rates (ER) for secret messages of different bit sizes across various steganography methods.
Models8 × 8
ER (%) ↓
16 × 16
ER (%) ↓
32 × 32
ER (%) ↓
64 × 64
ER (%) ↓
128 × 128
ER (%) ↓
256 × 256
ER (%) ↓
Balujia [2] (2017)1.10000 ± 0.038001.10000 ± 0.039000.10000 ± 0.006000.01000 ± 0.001000.00080 ± 0.000050.00300 ± 0.00040
HiDDeN [3] (2018)0.08750 ± 0.001500.00100 ± 0.000100.00080 ± 0.000050.00180 ± 0.00010
SteganoGAN [45] (2019)0.44032 ± 0.023810.39610 ± 0.018720.32509 ± 0.015640.19246 ± 0.008930.08946 ± 0.004780.03904 ± 0.00259
FNNS-D [46] (2022)0.05523 ± 0.002410.041327 ± 0.001570.02464 ± 0.001380.00547 ± 0.000290.00017 ± 0.000010.00013 ± 0.00001
SE-NDEND [35] (2024)0.05000 ± 0.001890.03800 ± 0.001430.02200 ± 0.000920.00500 ± 0.000170.00015 ± 0.000010.00011 ± 0.00001
RWP [40] (2026)0.04500 ± 0.001650.03500 ± 0.001280.02000 ± 0.000870.00450 ± 0.000150.00013 ± 0.000010.00009 ± 0.00001
HHN-NDEND [41] (2026)0.00010 ± 0.000090.00008 ± 0.000070.00006 ± 0.000050.00005 ± 0.000040.00002 ± 0.000020.00003 ± 0.00003
Mini-Hide0.00000 ± 0.000000.00000 ± 0.000000.00000 ± 0.000000.00000 ± 0.000000.00001 ± 0.000010.00002 ± 0.00001
CMini-Hide0.00000 ± 0.000000.00000 ± 0.000000.00000 ± 0.000000.00000 ± 0.00000
BSMini-Hide0.00000 ± 0.000000.00000 ± 0.000000.00048 ± 0.000020.00007 ± 0.000010.00021 ± 0.000010.00024 ± 0.00001
BCmin-Hide0.00000 ± 0.000000.00000 ± 0.000000.00000 ± 0.000000.00007 ± 0.00001
Table 5. The Mini-Hide watermark module’s PSNR gain for different steganography methods (with varying sizes of secret information).
Table 5. The Mini-Hide watermark module’s PSNR gain for different steganography methods (with varying sizes of secret information).
Models8 × 8
PSNR ↑
16 × 16
PSNR ↑
32 × 32
PSNR ↑
64 × 64
PSNR ↑
128 × 128
PSNR ↑
256 × 256
PSNR ↑
Balujia + Mini-Hide37.5737.6837.7137.7237.7137.73
SteganoGAN + Mini-Hide20.1220.1120.1020.1020.1020.11
FNNS-D + Mini-Hide42.5042.6343.3043.4642.1340.82
Table 6. The Mini-Hide watermark module’s SSIM gain for different steganography methods (with varying sizes of secret information).
Table 6. The Mini-Hide watermark module’s SSIM gain for different steganography methods (with varying sizes of secret information).
Models8 × 8 (bit)
SSIM ↑  
16 × 16 (bit)
SSIM ↑  
32 × 32 (bit)
SSIM ↑  
64 × 64 (bit)
SSIM ↑  
128 × 128 (bit)
SSIM ↑ 
256 × 256 (bit)
SSIM ↑ 
Balujia + Mini-Hide0.960.970.970.960.970.97
SteganoGAN + Mini-Hide0.730.730.730.720.720.72
FNNS-D + Mini-Hide0.990.990.990.990.990.99
Table 7. Error rate (ER) comparison of fusion models on images with various bit sizes.
Table 7. Error rate (ER) comparison of fusion models on images with various bit sizes.
Models8 × 8 (bit)
ER (%) ↓  
16 × 16 (bit)
ER (%) ↓  
32 × 32 (bit)
ER (%) ↓  
64 × 64 (bit)
ER (%) ↓  
128 × 128 (bit)
ER (%) ↓ 
256 × 256 (bit)
ER (%) ↓ 
Balujia + Mini-Hide0.00001000.000010.000010
SteganoGAN + Mini-Hide0.031250.003910.000980.000240.000060.00002
FNNS-D + Mini-Hide0.0001400.000090.0000200
Table 8. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of PSNR (Mini-Hide).
Table 8. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of PSNR (Mini-Hide).
Attack8 × 816 × 1632 × 3264 × 64128 × 128256 × 256
Mini-Hide27.9227.7627.8827.8927.8627.88
Gaussian Noise ( σ = 0.01)27.9127.7427.8627.8727.8427.86
Gaussian Noise ( σ = 0.03)27.8927.7227.8427.8627.8427.84
Gaussian Noise ( σ = 0.05)27.8627.6827.8127.8427.8127.81
JPEG Compression (Q = 75)27.8727.6927.8127.8327.8027.81
JPEG Compression (Q = 85)27.9027.7227.8527.8627.8327.85
JPEG Compression (Q = 95)27.9127.7427.8727.8727.8527.87
Table 9. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of SSIM (Mini-Hide).
Table 9. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of SSIM (Mini-Hide).
Attack8 × 816 × 1632 × 3264 × 64128 × 128256 × 256
Mini-Hide0.830.820.820.820.820.82
Gaussian Noise ( σ = 0.01)0.830.820.820.820.820.81
Gaussian Noise ( σ = 0.03)0.830.820.820.810.810.81
Gaussian Noise ( σ = 0.05)0.820.810.810.800.800.79
JPEG Compression (Q = 75)0.820.810.800.790.790.78
JPEG Compression (Q = 85)0.830.820.820.810.800.80
JPEG Compression (Q = 95)0.830.820.820.820.820.82
Table 10. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of BER (Mini-Hide).
Table 10. Quantitative comparison of Gaussian noise and JPEG compression attacks on different image sizes in terms of BER (Mini-Hide).
Attack8 × 816 × 1632 × 3264 × 64128 × 128256 × 256
Mini-Hide00000.000010.00002
Gaussian Noise ( σ = 0.01)00000.000010.00002
Gaussian Noise ( σ = 0.03)0000.0000020.0000120.000023
Gaussian Noise ( σ = 0.05)0000.0000080.0000170.00003
JPEG Compression (Q = 75)0000.000010.000020.000029
JPEG Compression (Q = 85)0000.0000030.0000140.000022
JPEG Compression (Q = 95)00000.000010.00002
Table 11. Performance comparison of flip watermarking (Mini-Hide) vs. no flipping (Mini-Hide-no-flip).
Table 11. Performance comparison of flip watermarking (Mini-Hide) vs. no flipping (Mini-Hide-no-flip).
Secret Info Size (bit)ModelBER (%)PSNR (dB)SSIM Δ BER (%) Δ PSNR (dB) Δ SSIM
8 × 8 Flip027.920.830.0921−0.28−0.01
No-Flip0.092128.200.84
16 × 16 Flip027.760.820.0857−0.31−0.01
No-Flip0.085728.070.83
32 × 32 Flip027.880.820.0789−0.29−0.01
No-Flip0.078928.170.83
64 × 64 Flip027.890.820.0642−0.27−0.01
No-Flip0.064228.160.83
128 × 128 Flip0.0000127.860.820.00949−0.25−0.01
No-Flip0.009528.110.83
256 × 256 Flip0.0000227.880.820.00418−0.23−0.01
No-Flip0.004228.110.83
Note: Δ BER = BER no - flip BER flip , Δ PSNR = PSNR flip PSNR no - flip , Δ SSIM = SSIM flip SSIM no - flip . All BER values are exact without rounding.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Qiu, R.; Luo, Z.; Fan, R.; Cao, N.; Wang, Y.; Yang, C. Mini-Hide: Generative Image Steganography via Flip Watermarking for Reducing BER. Electronics 2026, 15, 939. https://doi.org/10.3390/electronics15050939

AMA Style

Qiu R, Luo Z, Fan R, Cao N, Wang Y, Yang C. Mini-Hide: Generative Image Steganography via Flip Watermarking for Reducing BER. Electronics. 2026; 15(5):939. https://doi.org/10.3390/electronics15050939

Chicago/Turabian Style

Qiu, Rixuan, Zhiyuan Luo, Ruixiang Fan, Na Cao, Yuan Wang, and Cong Yang. 2026. "Mini-Hide: Generative Image Steganography via Flip Watermarking for Reducing BER" Electronics 15, no. 5: 939. https://doi.org/10.3390/electronics15050939

APA Style

Qiu, R., Luo, Z., Fan, R., Cao, N., Wang, Y., & Yang, C. (2026). Mini-Hide: Generative Image Steganography via Flip Watermarking for Reducing BER. Electronics, 15(5), 939. https://doi.org/10.3390/electronics15050939

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop