1. Introduction
Remote sensing images are a fundamental and strategic resource with broad applications in military, mapping, engineering planning, and disaster monitoring [1,2]. They possess distinctive features, such as spatial resolution, spectral resolution, temporal resolution, radiometric resolution, geometric accuracy, and extensive coverage, which make them highly valuable for detailed analysis and observation. Consequently, ensuring the security of remote sensing images is of paramount importance.
Digital watermarking, an essential technique in data security, establishes a strong relationship between digital data and watermarks. It can be categorized into various types, including robust watermarking [3,4], fragile watermarking [5,6], and reversible watermarking [7,8], among others. Robust watermarking can ensure that the embedded watermark remains unaffected by attacks, a characteristic known as robustness. Thus, it offers an effective solution for copyright protection of remote sensing images, wherein robustness is a crucial metric in assessing digital watermarking algorithms. Present research efforts to improve robustness primarily focus on minor geometric attacks. However, geometric correction, one of the basic processing methods for remote sensing images, usually causes severe geometric deformations to the data. This makes it difficult to maintain the watermark in Level 0 and Level 1 remote sensing image products after geometric correction. Thus, it presents a significant challenge in watermark synchronization under severe geometric distortions [9,10,11].
As a subset of image watermarking, remote sensing image watermarking algorithms have significantly benefited from advancements in general image watermarking research [12,13,14]. The template watermark is key to resisting geometric attacks [15,16,17]. Algorithms based on template watermarks employ regular watermark patterns. After an attack, the template watermark can be extracted and corrected to reestablish watermark synchronization. These algorithms can be divided into two categories.
The first category is the method based on spatial domain template watermarks [18,19]. This method generally involves periodically tiling template watermarks within the spatial domain. During watermark extraction, the template watermark is identified through correlation functions and other methods, which act as references for rectifying geometric transformations. For instance, the literature [20] proposes a watermarking scheme based on symmetry. In this method, watermark information is encoded through random patterns, which serve as units of the template watermark and are subsequently embedded within the spatial domain. Watermark synchronization is then achieved by employing the auto-correlation function to detect the symmetrical template watermark. This type of method couples the template watermark with the data in the spatial domain, effectively ensuring watermark synchronization when data deformation is not severe. However, due to the significant coupling between the template watermark and the data in the spatial domain, severe data deformation will also deform the template watermark, thereby affecting the extraction of the watermark.
The second category is the method based on transform domain template watermarks, typically constructed using discrete wavelet transformation (DWT) [21], discrete cosine transform (DCT), or discrete Fourier transform (DFT). Among these, template watermarks based on DFT are the most extensively studied. Additionally, Chen [22] found that DFT features exhibit greater robustness against screen-cam attacks involving geometric distortions compared to DCT and DWT. These DFT-based watermarking algorithms involve arranging watermark sequences into fixed patterns and replacing coefficients in the DFT domain. A study [23] embedded watermarks in the annular area of the DFT magnitude. Another study [24] used circular DFT template watermarks. This type of method embeds the template watermark in the transform domain, significantly reducing the spatial coupling problem in the spatial domain. However, decoupling between the template watermark and the attacked data is incomplete. For example, under cropping attacks, the proportion of the template watermark in the data will encounter irreversible reduction. Therefore, precisely identifying template watermarks in the transform domain remains challenging.
In summary, template watermarks are crucial for resisting geometric attacks in watermarking algorithms for remote sensing images. The method based on spatial domain template watermarks achieves watermark synchronization under some geometric attacks but struggles with the spatial coupling issue. The method based on transform domain template watermarks partially solves the former problem but still requires enhanced identification of template watermarks. Consequently, how to identify the template watermark efficiently remains a scientific problem.
To address the above problem, this paper proposes a watermarking algorithm for remote sensing images. In this method, we design a ring-shaped template watermark, innovatively converting the watermark into small targets in the DFT domain. This is followed by applying an infrared small target detection method, specifically the multiscale local contrast measure (LCM) [25], to facilitate watermark synchronization. The main contributions of this paper are as follows:
Design of a ring-shaped template watermark.
Conversion of the watermark into small targets.
Enhancement of the watermark using multiscale LCM.
Peak detection based on remapping and column-wise summation.
Watermark extraction based on the circular edge local binarization method.
Robustness against geometric attacks.
The rest of this paper is organized as follows: Section 2 presents the methodology, Section 3 describes the experimental design, Section 4 provides the results and analyses of the experiments, Section 5 offers discussions, and Section 6 concludes the study.
2. Methodology
This paper proposes a watermarking algorithm for remote sensing images based on a ring-shaped template watermark and the multiscale LCM method. Typically, template-based watermarking algorithms divide the watermark into the template watermark and the message watermark [15]. The template watermark is used for attack correction, while the message watermark stores copyright information. In contrast, our algorithm treats the template and message watermarks as one. The main idea of the proposed algorithm is illustrated in Figure 1.
The core components include two main parts. (1) In the embedding stage, the ring-shaped template watermark is utilized. This stage involves converting the watermark into small targets in the DFT domain and embedding the watermark into the magnitude coefficients of the DFT. These coefficients are the features of the remote sensing image used by the proposed watermarking algorithm and serve as the watermark carrier. (2) In the detection stage, multiscale LCM is employed to enhance the small target watermark and obtain a contrast map. Peak detection is then performed on the contrast map to determine the radius of the template watermark. Subsequently, circular edge local binarization is applied to binarize the values at the corresponding radius, thereby extracting the watermark. The specific details of the proposed algorithm are provided below.
2.1. Ring-Shaped Template Watermark
The watermark to be embedded is a binary sequence of 0s and 1s of length $L$, denoted as $W$. A ring-shaped template watermark is constructed using the magnitude coefficients in the DFT domain. Figure 2 shows a schematic diagram of the template watermark, where black represents a DFT magnitude of 0, and white points represent non-zero magnitudes, indicating watermark bits of 1. The watermark information is evenly distributed within the ring. Because the magnitude spectrum is centrally symmetric, only half of the ring (180 degrees) carries independent bits, so the angular interval $\theta$ between two watermark bits, in degrees, is:
$$\theta = \frac{180}{L}$$
Let the rows and columns of image $I$ be denoted as $M$ and $N$, respectively. The center of the image, computed from $M$ and $N$ with the modulo operation, is taken as the center of the template watermark. The center is also the position of the image’s direct current (DC) component after DFT. Around the center, similar to a band-pass filter, a ring-shaped template watermark is constructed over a range of radii centered on $R$. Here, $R$ is the radius of the concentric circle at the center of the ring-shaped template watermark, referred to as the radius of the template watermark. This radius divides the ring into two equal radial segments. A width parameter controls the shape of the template; its recommended value is 2.
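As a quick check of where the DC component sits in a centered spectrum, the following minimal sketch uses NumPy, which is an assumption since the text does not name a toolchain. It uses 0-based indexing, so the centered DC term is at `(M // 2, N // 2)`; in 1-based indexing each coordinate is one larger. The helper names `dc_center` and `locate_dc` are hypothetical.

```python
import numpy as np


def dc_center(rows, cols):
    """0-based index of the DC term after np.fft.fftshift (hypothetical helper)."""
    return rows // 2, cols // 2


def locate_dc(image):
    """Return the index of the largest-magnitude coefficient of the centered spectrum."""
    spectrum = np.fft.fftshift(np.fft.fft2(np.asarray(image, dtype=float)))
    flat_index = int(np.argmax(np.abs(spectrum)))
    return tuple(int(v) for v in np.unravel_index(flat_index, spectrum.shape))
```

For a constant image the only non-zero coefficient is the DC term, so `locate_dc` and `dc_center` agree for both even and odd image sizes.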
Figure 3 shows the magnitude spectrum of the image after embedding the template watermark, where the watermark length is 40 and the watermark is a periodic repetition of the bit pair 01. To better display the effect, the embedding strength of the watermark has been appropriately enhanced. As shown in Figure 3a, the ring-shaped template watermark is distinctly visible, with 40 bright spots in the ring. In Figure 3b, the corresponding bright spots appear as 40 pillars forming a ring centered around the DC component.
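The count of 40 bright spots can be reproduced with a small sketch. It assumes that bits are placed on the upper half of the ring at an angular interval of 180 degrees divided by the watermark length, that each bit of 1 is mirrored through the center, and that positions are rounded to the nearest pixel. The function names and the rounding rule are illustrative, not taken from the paper.

```python
import math


def ring_offsets(n_bits, radius):
    """Offsets (dy, dx) from the DC position for each bit on the upper half-ring."""
    step = 180.0 / n_bits  # assumed angular interval, in degrees
    offsets = []
    for i in range(n_bits):
        angle = math.radians(i * step)
        dy = round(radius * math.sin(angle))
        dx = round(radius * math.cos(angle))
        offsets.append((dy, dx))
    return offsets


def bright_spots(bits, radius):
    """Offsets of all non-zero coefficients, including the mirrored half."""
    spots = set()
    for bit, (dy, dx) in zip(bits, ring_offsets(len(bits), radius)):
        if bit:
            spots.add((dy, dx))
            spots.add((-dy, -dx))  # central symmetry of the magnitude spectrum
    return spots
```

With a 40-bit watermark made of repeated 01 pairs and a radius of 64, the 20 bits equal to 1 and their 20 mirrored positions give `len(bright_spots([0, 1] * 20, 64)) == 40`.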
2.1.1. Radius of Template Watermark
Determining the radius of the template watermark is a critical issue. A smaller radius means more low-frequency data will be replaced, resulting in a lower peak signal-to-noise ratio (PSNR) value between the original and watermarked data. Conversely, a larger radius results in a higher PSNR and better watermark imperceptibility but weaker robustness. The choice of radius also matters for scaling attacks: after the entire image is scaled up or down, the radius of the template watermark remains the same.
For the image $I$, $R$ has a maximum value, denoted as $R_{\max}$, which is calculated as:
$$R_{\max} = \frac{\min(M, N)}{2}$$
Let the image scaling factor be denoted as $s$. To resist scaling, the radius must also satisfy:
$$R < s \cdot R_{\max}$$
This algorithm recommends $R = 64$. When $M$ and $N$ are equal to 256, $R_{\max} = 128$. Thus, $s > 0.5$. In this case, the algorithm can theoretically resist all scaling attacks greater than 0.5. To resist scaling attacks with a smaller scaling factor, the value of $R$ must be correspondingly reduced.
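The scaling bound can be checked numerically. The sketch below assumes that the largest usable radius is half of the smaller image dimension and that a scaled image shrinks this limit in proportion to the scaling factor; both function names are hypothetical.

```python
def max_radius(rows, cols):
    """Largest ring radius that fits in the centered spectrum (assumed: half the smaller side)."""
    return min(rows, cols) / 2


def min_scaling_factor(radius, rows, cols):
    """Scaling factor at which the ring radius reaches the spectrum limit.

    Scaling factors must stay above this value for the ring to survive.
    """
    return radius / max_radius(rows, cols)
```

For a 256 × 256 image, `min_scaling_factor(64, 256, 256)` gives 0.5, and halving the radius to 32 lowers the limit to 0.25, which illustrates why a smaller radius is needed to survive stronger downscaling.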
2.1.2. Embedding of Template Watermark
Watermark embedding is achieved by replacing the magnitude values. First, the region of the ring-shaped template watermark is preprocessed by setting the magnitude values to 0. The embedding positions are then determined: with $R$ as the radius and $\theta$ as the angular interval, each position on the circle is assigned the corresponding watermark bit. Let the DFT coefficients of the image be denoted as $F$. For angles between 0 and 180 degrees, each embedding position is obtained from the center, the radius $R$, and the cosine and sine of the angle, where the cosine and sine functions first convert the input angle from degrees to radians. For angles between 180 and 360 degrees, the symmetric positions are obtained by reflection through the center. At each embedding position, the magnitude is replaced according to the watermark bit, while the phase angle of the original coefficient, taken in the interval $[-\pi, \pi]$, is kept. A fixed magnitude value controls the watermark strength; its recommended value is 40. At the symmetric position, the coefficient is set to the complex conjugate of the embedded coefficient.
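The embedding steps can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: NumPy's unnormalized FFT is used (so a suitable `strength` depends on image size and differs from the recommended value of 40, whose scaling is not given here), positions are rounded to the nearest pixel, the angular interval is taken as 180 degrees divided by the number of bits, and `half_width` stands in for the ring-width parameter. The name `embed_ring` is hypothetical.

```python
import numpy as np


def embed_ring(image, bits, radius, strength, half_width=2):
    """Embed bits on a ring of the centered DFT; returns (watermarked image, offsets)."""
    img = np.asarray(image, dtype=float)
    rows, cols = img.shape
    cy, cx = rows // 2, cols // 2                 # DC position after fftshift (0-based)
    original = np.fft.fftshift(np.fft.fft2(img))
    spectrum = original.copy()

    # Preprocess: clear the whole ring region before writing the bits.
    yy, xx = np.ogrid[:rows, :cols]
    dist = np.hypot(yy - cy, xx - cx)
    spectrum[np.abs(dist - radius) <= half_width] = 0

    step = 180.0 / len(bits)                      # assumed angular interval, degrees
    offsets = []
    for i, bit in enumerate(bits):
        angle = np.deg2rad(i * step)
        dy = int(round(radius * np.sin(angle)))
        dx = int(round(radius * np.cos(angle)))
        phase = np.angle(original[cy + dy, cx + dx])   # keep the original phase
        value = strength * bit * np.exp(1j * phase)    # replace only the magnitude
        spectrum[cy + dy, cx + dx] = value
        spectrum[cy - dy, cx - dx] = np.conj(value)    # mirrored coefficient
        offsets.append((dy, dx))

    # Conjugate symmetry is preserved, so the inverse transform is real up to rounding.
    watermarked = np.fft.ifft2(np.fft.ifftshift(spectrum)).real
    return watermarked, offsets
```

Writing the conjugate at the mirrored position keeps the spectrum conjugate-symmetric, which is why the embedded magnitudes can be read back unchanged from the DFT of the watermarked image.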
2.2. Multiscale LCM
Multiscale LCM was proposed in the literature [26]. The local contrast measure (LCM) is an effective contrast measurement method inspired by biological vision mechanisms, capable of enhancing targets while suppressing the background, thus effectively extracting small objects. A 3 × 3 window of an image, where each grid in the window may represent more than one pixel, is shown in Figure 4, with each grid numbered 0–8. For this window, first calculate the maximum pixel value $L_n$ in area 0. Then, calculate the average grayscale value $m_i$ for each area $i$ in the window. The local contrast is calculated as shown in the following equation:
$$c_i = \frac{L_n}{m_i} \qquad (10)$$
Contrast is used to quantitatively describe the grayscale difference between the target and the background. Based on Equation (10), the contrast between areas 1–8 and area 0 can be calculated. When area 0 contains the target, the contrast obtained for area 0 is significantly higher than that obtained for the surrounding areas. A contrast map of the image can be obtained by sliding a 3 × 3 window across the entire image and calculating the local contrast. The local contrast is enhanced according to the equation below.
$$C_n = \min_{i} \left( L_n \cdot c_i \right) = \min_{i} \frac{L_n^2}{m_i}$$
When the detection result in the contrast map exceeds the threshold $T$, the area is considered to contain the target to be detected. The threshold $T$ is defined by the following equation:
$$T = \mu + k\sigma$$
where $\mu$ is the average grayscale value of the contrast map, $\sigma$ is the standard deviation of the contrast map, and $k$ is an empirical value. The final detection result can be obtained after binarizing with the threshold $T$.
Since the target size is usually unknown and the size of the bright spots that contain watermarks can change due to attacks, it is necessary to introduce a multiscale LCM algorithm. The maximum size of the target, in pixels, has a recommended value of 10 in the proposed method. The LCM is performed sequentially for sizes from 1 up to this maximum, producing a series of contrast maps. A maximum pooling operation is finally applied to this series of contrast maps. Figure 5 shows the results of applying the multiscale LCM to Figure 3a, with the two parameters set to 3 and 4. As seen in Figure 5a, the watermark information is significantly enhanced, and in Figure 5b, the watermark information is precisely extracted as small targets.
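The multiscale LCM procedure described above can be sketched as follows. This is a deliberately simple, loop-based illustration assuming the standard LCM formulation (maximum of the central cell squared, divided by the mean of each neighboring cell, minimized over the eight neighbors) and a mean-plus-k-standard-deviations threshold. Border pixels where the 3 × 3 window does not fit are left at 0, and a small `eps` guards against division by zero; both are choices of this sketch. All function names are hypothetical.

```python
import numpy as np


def lcm(image, cell):
    """Single-scale local contrast map with square cells of side `cell` pixels."""
    img = np.asarray(image, dtype=float)
    rows, cols = img.shape
    out = np.zeros_like(img)
    half = cell // 2
    eps = 1e-12  # avoids division by zero for all-black neighbor cells
    for y in range(rows):
        for x in range(cols):
            top, left = y - half - cell, x - half - cell
            if top < 0 or left < 0 or top + 3 * cell > rows or left + 3 * cell > cols:
                continue  # window does not fit: leave the contrast at 0
            window = img[top:top + 3 * cell, left:left + 3 * cell]
            cells = window.reshape(3, cell, 3, cell)
            l_n = cells.max(axis=(1, 3))[1, 1]                # max of central cell (area 0)
            means = np.delete(cells.mean(axis=(1, 3)).ravel(), 4)  # means of areas 1-8
            out[y, x] = np.min(l_n ** 2 / (means + eps))
    return out


def multiscale_lcm(image, max_size):
    """Max-pool the contrast maps obtained for cell sizes 1..max_size."""
    maps = [lcm(image, cell) for cell in range(1, max_size + 1)]
    return np.max(maps, axis=0)


def detect_targets(contrast_map, k):
    """Binarize with the threshold mean + k * standard deviation."""
    threshold = contrast_map.mean() + k * contrast_map.std()
    return contrast_map > threshold
```

On a flat background with a single bright pixel, the contrast map peaks at that pixel and stays near or below the background level elsewhere, which is the behavior the method relies on to isolate the watermark spots.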
Before applying multiscale LCM, the magnitude coefficients in the DFT domain are enhanced to facilitate watermark identification. The enhancement includes a min-max normalization of the coefficients to the interval [0, 1]. In addition, we found that normalizing the original data before applying multiscale LCM also yields better results.
2.3. Peak Detection
In the process of watermark extraction, the radius of the template watermark is extracted based on the contrast map of multiscale LCM. The main idea is to remap the contrast map to polar coordinate space. The remapping is specified by the destination image size, the transformation center, and the radius of the bounding circle to transform; the interpolation method used is bicubic interpolation. The result of the transformation of Figure 5a is shown in Figure 6. From Figure 6, it is evident that the ring-shaped template watermark is converted into a vertical stripe. It is important to note that if an attack does not deform the ring-shaped template watermark, the stripe will remain vertical. However, if an attack, such as an affine or projective transformation, deforms the ring-shaped template watermark, the resulting stripe will also be distorted.
The warped contrast map is summed column-wise to obtain a one-dimensional sequence, and the pixel position of the maximum prominence in this sequence is located. The vertical stripe from the previous step is crucial for this process. In the remapping result, the position of the center of the template watermark is 1. Subtracting one from the position of the maximum prominence therefore gives the radius of the template watermark. As shown in Figure 7, the maximum prominence occurs at position 65, so the extracted radius equals 64, which matches the radius value used during embedding.
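The remapping and column-wise summation can be sketched as follows. Two simplifications are assumed and should be read as such: nearest-neighbor sampling replaces the bicubic interpolation named in the text, and the radius is taken at the largest column sum rather than at the maximum prominence. With 0-based columns, the column index equals the radius directly, so no subtraction by one is needed. The function names are hypothetical.

```python
import numpy as np


def polar_remap(image, center, max_radius, n_angles=360):
    """Nearest-neighbor polar remap: rows are angles, columns are radii."""
    img = np.asarray(image, dtype=float)
    cy, cx = center
    radii = np.arange(max_radius + 1)
    angles = np.deg2rad(np.arange(n_angles) * 360.0 / n_angles)
    ys = np.rint(cy + np.outer(np.sin(angles), radii)).astype(int)
    xs = np.rint(cx + np.outer(np.cos(angles), radii)).astype(int)
    ys = np.clip(ys, 0, img.shape[0] - 1)   # keep samples inside the image
    xs = np.clip(xs, 0, img.shape[1] - 1)
    return img[ys, xs]


def estimate_radius(contrast_map, center, max_radius):
    """Radius whose column has the largest sum in the polar map."""
    profile = polar_remap(contrast_map, center, max_radius).sum(axis=0)
    return int(np.argmax(profile[1:])) + 1  # skip column 0, the center itself
```

A ring of constant radius maps to a single column, so its column sum dominates the radial profile; this is the vertical stripe that the peak detection step looks for.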
2.4. Watermark Detection
The circular edge local binarization method is primarily used in the watermark extraction stage. Specifically, based on the obtained radius $R$ of the template watermark, a ring with radius $R$ is extracted from the enhanced magnitude results. The average value of this ring is then calculated. If the value at a position is greater than the average, it is considered a watermark bit 1; otherwise, it is considered a watermark bit 0.
To resist rotation attacks, an angular offset from 0 to 180 degrees, stepped at a fixed interval, is applied when computing the extraction positions and their symmetric counterparts. The interval cannot be infinitely small, because each position must correspond to a unique pixel. The minimum interval must be larger than $180 / (\pi R)$ degrees, the angle subtended by one pixel on a circle of radius $R$. When $R$ is 64, this interval is approximately 0.9 degrees. In the proposed method, we use an interval of 1 degree.
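The binarization step and the angular-step bound can be sketched as follows. The sketch assumes nearest-pixel sampling, a ring average taken over samples at 1-degree steps, and an angular interval of 180 degrees divided by the number of bits; how the best rotation offset is selected is not specified in this section, so the offset is left as a parameter. The function names are hypothetical.

```python
import math

import numpy as np


def min_angle_step(radius):
    """Angle in degrees subtended by one pixel on a circle of the given radius."""
    return 180.0 / (math.pi * radius)


def sample_ring(magnitude, center, radius, angles_deg):
    """Read magnitude values at the given angles on the ring (nearest pixel)."""
    mag = np.asarray(magnitude, dtype=float)
    cy, cx = center
    a = np.deg2rad(np.asarray(angles_deg, dtype=float))
    ys = np.rint(cy + radius * np.sin(a)).astype(int)
    xs = np.rint(cx + radius * np.cos(a)).astype(int)
    return mag[ys, xs]


def extract_bits(magnitude, center, radius, n_bits, offset_deg=0.0):
    """Binarize ring samples against the ring average for one rotation offset."""
    ring = sample_ring(magnitude, center, radius, np.arange(360.0))
    threshold = ring.mean()
    angles = offset_deg + np.arange(n_bits) * (180.0 / n_bits)
    values = sample_ring(magnitude, center, radius, angles)
    return [int(v > threshold) for v in values]
```

For a radius of 64, `min_angle_step(64)` is about 0.895 degrees, which matches the figure of approximately 0.9 degrees and motivates the 1-degree search interval.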
4. Results and Analyses
4.1. The Results of Imperceptibility
Table 2 presents the PSNR values for the three methods. The proposed method achieves a PSNR of 45.13 dB, Method Sun achieves a PSNR of 41.30 dB, and Method Heidari achieves a PSNR of 38.24 dB. The results indicate that the proposed method significantly outperforms the other two methods in terms of imperceptibility.
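For reference, PSNR can be computed with the standard definition below. The peak value of 255 assumes 8-bit data, which is an assumption here; remote sensing products with a different bit depth would need a different peak. The function name is hypothetical.

```python
import numpy as np


def psnr(original, watermarked, peak=255.0):
    """Peak signal-to-noise ratio in dB between two images of the same shape."""
    a = np.asarray(original, dtype=float)
    b = np.asarray(watermarked, dtype=float)
    mse = np.mean((a - b) ** 2)          # mean squared error
    if mse == 0:
        return float("inf")              # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))
```

A higher PSNR means the watermarked image is closer to the original, which is why it is used as the imperceptibility measure.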
4.2. The Results of Rotation Attacks
Figure 13 shows the results of rotation attacks. The NC values are used to evaluate the performance of the proposed method, Method Sun, and Method Heidari under various rotation angles. The rotation angle ranges from 15° to 180°, in intervals of 15°.
The NC values of the proposed algorithm remain consistently high across all rotation angles, demonstrating strong robustness against rotation attacks. The NC value stays around 1.00, indicating an almost perfect correlation between the extracted and original watermarks, regardless of the rotation angle. Specifically, the proposed method maintains NC values above 0.93 for all tested angles.
In contrast, Method Sun and Method Heidari exhibit significant fluctuations in the NC values as the rotation angle varies. For Method Sun, the NC values drop significantly at certain angles, reaching as low as approximately 0.32 at 15°, 30°, 60°, and 75°, as well as their symmetrical counterparts around 90°. This indicates that Method Sun is highly susceptible to rotation attacks. Method Heidari also shows considerable variation, with NC values dropping to around 0.21 at 90°, although it performs slightly better than Method Sun at certain angles.
Overall, the proposed method outperforms the other two methods, maintaining high NC values and demonstrating superior robustness against rotation attacks.
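The NC values discussed in this section compare the extracted watermark with the original one. The sketch below uses one common definition of normalized correlation for binary sequences; the exact definition used in the experiments is not given in this text, so this form is an assumption, and the function name is hypothetical.

```python
import math


def normalized_correlation(original_bits, extracted_bits):
    """Normalized correlation between two equal-length bit sequences (assumed definition)."""
    num = sum(a * b for a, b in zip(original_bits, extracted_bits))
    den = math.sqrt(sum(a * a for a in original_bits)) * math.sqrt(
        sum(b * b for b in extracted_bits)
    )
    return num / den if den else 0.0     # an all-zero sequence gives 0
```

Under this definition, identical sequences give a value of 1 and sequences with no bit of 1 in common give 0.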
4.3. The Results of Scaling Attacks
Figure 14 shows the results of scaling attacks. The NC values are used to evaluate the performance of the proposed method, Method Sun, and Method Heidari under various scaling factors. The scaling factors range from 0.5 to 1 in increments of 0.1 and from 2 to 10 in increments of 1.
The NC values of the proposed algorithm remain consistently high across most scaling factors, demonstrating strong robustness against scaling attacks. The NC value stays above 0.87 for all tested scaling factors except 0.5, indicating a high level of correlation between the extracted watermark and the original watermark. At a scaling factor of 0.5, the NC value of the proposed method drops below the threshold of 0.75; this is consistent with the theoretical analysis in Section 2.1.1.
In contrast, Method Sun and Method Heidari exhibit significant fluctuations in NC values as the scaling factor varies. The NC values of Method Sun drop below the threshold of 0.75 at the larger scaling factors, indicating that Method Sun is less robust to stronger upscaling. Method Heidari shows even more pronounced variation: its NC values range between 0 and 1.00 and drop to 0 at scaling factors greater than 1, demonstrating that it is highly susceptible to scaling attacks, particularly at larger scaling factors.
Overall, the proposed method demonstrates better performance than the other two methods, consistently achieving high NC values and showing greater robustness against scaling attacks.
4.4. The Results of Translation Attacks
Figure 15 shows the results of translation attacks. The NC values are used to evaluate the performance of the proposed method, Method Sun, and Method Heidari under various translation distances. The translation distances range from 10 to 120 pixels in increments of 10 pixels.
The proposed algorithm maintains relatively high NC values across most translation distances, with values above 0.86 for all tested distances. This indicates a good level of correlation between the extracted watermark and the original watermark for the proposed method.
However, the proposed method’s performance is slightly lower than that of Method Sun. Method Sun demonstrates superior robustness against translation attacks, maintaining NC values of 1.0 for most translation distances, with a slight drop to around 0.83 at 100 pixels. This indicates that Method Sun is highly resilient to translation attacks. In contrast, Method Heidari shows considerable variation, with NC values dropping below the threshold at several distances. The NC values for Method Heidari fluctuate between 0.53 and 1.00, demonstrating higher susceptibility to translation attacks compared to both the proposed method and Method Sun.
Overall, while Method Sun achieves higher NC values at most tested distances, the proposed method performs well against translation attacks, consistently achieving NC values above the threshold. Method Heidari performs adequately but shows more variability in its NC values than the other methods.
4.5. The Results of Cropping Attacks
Figure 16 shows the results of cropping attacks. The NC values are used to evaluate the performance of the proposed method, Method Sun, and Method Heidari under various cropping ratios. The cropping ratios range from 5% to 70% in increments of 5%.
The proposed algorithm maintains relatively high NC values across most cropping ratios, with values above 0.84 for all tested ratios. This indicates a good level of correlation between the extracted watermark and the original watermark, even as significant portions of the image are cropped.
Method Sun generally demonstrates strong robustness against cropping attacks, maintaining NC values above the threshold of 0.75 for most cropping ratios. However, the NC values fluctuate more compared to the proposed method, with significant drops at specific points, such as 15% and 35%. Method Heidari shows the most significant variation, with NC values gradually declining and then dropping below the threshold at cropping ratios above 45%, indicating higher susceptibility to cropping attacks compared to both the proposed method and Method Sun.
Overall, the proposed method maintains high NC values and demonstrates good robustness against cropping attacks compared with the other two methods.
6. Conclusions and Outlooks
In this paper, we proposed a watermarking algorithm for remote sensing images based on a ring-shaped DFT template watermark and multiscale LCM. The method addresses the significant challenge of accurately identifying and synchronizing template watermarks under severe geometric distortions, which is crucial in watermarking research for remote sensing images. The experimental results demonstrate the proposed method’s strong robustness against various geometric attacks, including rotation, scaling, translation, and cropping. It outperforms the comparative algorithms, Method Sun and Method Heidari, under rotation, scaling, and cropping attacks, remains above the NC threshold under translation attacks, where Method Sun performs slightly better, and exhibits superior imperceptibility. These findings indicate that the proposed method maintains the integrity and synchronization of the watermark under challenging conditions, making it a reliable and practical choice for watermarking remote sensing images. Future work will focus on further enhancing the algorithm’s robustness against a wider variety of attack scenarios, such as affine and projective transformations.