IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields

Chen, Lifeng; Song, Chaoyue; Liu, Jia; Sun, Wenquan; Dong, Weina; Di, Fuqiang

doi:10.3390/app14146184

Open AccessArticle

IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields

by

Lifeng Chen

^†

,

Chaoyue Song

^†,

Jia Liu

^*,

Wenquan Sun

,

Weina Dong

and

Fuqiang Di

Chinese University of Engineering of the Chinese People’s Armed Police Force, Xi’an 710086, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Appl. Sci. 2024, 14(14), 6184; https://doi.org/10.3390/app14146184

Submission received: 28 May 2024 / Revised: 10 July 2024 / Accepted: 12 July 2024 / Published: 16 July 2024

(This article belongs to the Special Issue Recent Advances in Multimedia Steganography and Watermarking)

Download

Browse Figures

Versions Notes

Abstract

The neural radiance field (NeRF) has demonstrated significant advancements in computer vision. However, the training process for NeRF models necessitates extensive computational resources and ample training data. In the event of unauthorized usage or theft of the model, substantial losses can be incurred by the copyright holder. To address this concern, we present a novel algorithm that leverages the implicit neural representation (INR) watermarking technique to safeguard NeRF model copyrights. By encoding the watermark information implicitly, we integrate its parameters into the NeRF model’s network using a unique key. Through this key, the copyright owner can extract the embedded watermarks from the NeRF model for ownership verification. To the best of our knowledge, this is the pioneering implementation of INR watermarking for the protection of NeRF model copyrights. Our experimental results substantiate that our approach not only offers robustness and preserves high-quality 3D reconstructions but also ensures the flawless (100%) extraction of watermark content, thereby effectively securing the copyright of the NeRF model.

Keywords:

neural radiation field; digital watermarking; robustness; neural networks; copyright protection

1. Introduction

Digital watermarking is a copyright protection technology that embeds copyright identifiers into digital media using algorithms [1,2,3]. In the event of copyright disputes, copyright owners can extract copyright information from digital media using the inverse operation of the embedding algorithm to confirm ownership. The concept of neural radiance fields (NeRFs), introduced by Mildenhall et al. [4], utilizes multilayer perceptrons (MLPs) to implicitly represent 3D scenes and render 2D images by synthesizing new perspectives. Due to the NeRF’s robust representational capabilities, high generalization, and ease of learning, it is poised to become a mainstream technique in digital media representation. However, training NeRF models is challenging, and the theft of such models can lead to significant losses for their owners. Therefore, safeguarding NeRF model copyrights is of paramount importance.

Traditional watermarking algorithms [5,6,7] typically rely on specific mathematical functions to embed watermarks into media by altering them. However, these methods often struggle to strike a balance between imperceptibility, robustness, and watermark capacity. The integration of deep learning in watermarking has revolutionized the field, eliminating the need for manually designing complex mathematical functions and showcasing superior performance. In deep learning watermarking, copyright holders encode watermark information into carrier images using an encoder and extract the watermark information from noisy images through a decoder [8], simulating a black-box scenario. While existing deep learning-based watermarking algorithms excel in robustness, imperceptibility, and embedding capacity, they are primarily tailored for multimedia data, like images [9], audio [10], and videos [11], with limited focus on watermarking implicit data such as NeRF.

Uchida et al. [12] pioneered embedding watermarks in deep neural network models, proposing a model watermarking scheme that embeds watermarks in model parameters using a regularization technique. Model watermarking has since evolved rapidly, proving effective in safeguarding network model copyrights. Leveraging the idea of model watermarking, researchers are exploring solutions to protect NeRF copyrights. For instance, Li et al. [13] introduced the StegaNeRF approach for information hiding, while Luo et al. [14] developed the CopyRNeRF scheme, aiming to safeguard NeRF model copyrights while preserving rendering quality and bit accuracy. However, these techniques may suffer from limited watermark capacity when using bit strings or images as watermark information, necessitating comprehensive representation of copyright holders’ logos in embedded watermark data.

To overcome the low watermark capacity and security vulnerabilities in existing NeRF watermarking techniques, we propose an implicit representation-based watermarking algorithm. This method encodes watermark information as continuous functions using implicit representation neurons, embedding these data into carrier networks via a key. By representing watermark information as neural network models instead of bit strings or images, we significantly enhance the watermark capacity. Our approach involves representing watermark information as continuous functions, embedding these parameters into an untrained network using a key, training the carrier network to encapsulate carrier information while sharing the trained watermark-containing carrier model across the network, and enabling copyright owners to extract watermark information using the key if model theft is suspected.

In summary, this work contributes the following:

The first application of implicit representations in NeRF copyright protection to address existing NeRF watermarking challenges.
A key-based carrier network construction method for lossless watermark information extraction.
Validation of our method across various datasets, ensuring model quality and successful watermark extraction.
Testing the robustness of our model to demonstrate that any attempts to remove the watermark would render the NeRF model unusable.

2. Related Work

2.1. Digital Watermarking for 2D Data

Traditionally, digital watermarking algorithms focused on embedding watermarks into two-dimensional data, categorized into spatial domain-based algorithms and transform domain-based algorithms depending on the embedding domains utilized. In spatial domain algorithms, classic examples include least significant bit (LSB) and patchwork algorithms. LSB algorithms [5,6,7] encode watermark information in the least significant bit of image pixels, while patchwork algorithms. utilize statistical characteristics of the carrier image for encoding. Transform domain-based watermarking algorithms [15,16,17] involve transforming the carrier image into a different domain and embedding the watermark by modifying the transformation domain coefficients based on the watermark value. While traditional watermarking algorithms excel in watermark extraction quality, they often lack robustness and struggle to withstand various attacks. The advancement of deep learning has revolutionized digital watermarking, with significant strides made in image watermarking methods leveraging deep learning techniques.

For instance, Kandi et al. [18] implemented watermark embedding using a convolutional neural network autoencoder, while Zhu et al. [19] introduced an end-to-end watermarking framework named HiDDeN. Building upon these innovations, Zhang et al. [20] delved into attack networks within the universal deep learning watermarking paradigm, enhancing the overall robustness of watermarking algorithms. Notably, deep learning-based watermarking algorithms have witnessed remarkable progress, showcasing their potential for robust and efficient watermark embedding and extraction. However, these existing methods do not cater to safeguarding the copyrights of 3D models effectively.

2.2. Digital Watermarking for 3D Data

Within the realm of 3D digital watermarking, traditional methods involve embedding watermarks in various representations of 3D models, such as point clouds [21], voxels [22], or triangular meshes [23], through transformations like translation, rotation, scaling [24], and parameter modifications [25]. Notably, Hou et al. [26] introduced a technique employing layered artifacts of 3D-printed objects for watermarking, while Hamidi et al. [27] enhanced robustness by leveraging wavelet transform to amplify grid saliency. Recognizing the complexities of traditional 3D digital watermarking approaches, researchers have turned towards integrating deep learning techniques in this domain. Wang et al. [28] pioneered a deep learning-based 3D mesh watermarking network, introducing a more versatile framework for 3D mesh watermarking. Building upon this, Yoo et al. [29] successfully embedded messages within 3D meshes and retrieved them from 2D renderings. However, existing methodologies predominantly cater to explicit 3D models, posing challenges when applying them to neural radiance field (NeRF) models lacking specific 3D structures, thereby limiting their efficacy in ensuring copyright protection for NeRF representations.

2.3. Model Watermarking Algorithm

With the burgeoning advancements in deep learning technology, a plethora of sophisticated deep learning models have surfaced, prompting the development of diverse watermarking strategies tailored to safeguard these models. These approaches can be broadly categorized into two groups: watermark embedding based on network weights and watermark insertion based on the classification labels of trigger sets. Uchida et al. [12] pioneered a method for embedding watermarks through network weights, utilizing a parameter regularizer to infuse watermarks into model parameters. Building on Uchida et al’s work, Wang et al. [30] enhanced this technique by introducing an additional neural network that maps network weights to the watermark space, although this is susceptible to ambiguity attacks. In response to these vulnerabilities, Fan et al. [31] devised a novel passport-based ownership verification scheme for deep neural networks (DNNs), showcasing resilience against network modifications and ambiguity attacks. Rouhani et al. [32] proposed DeepSigns, an end-to-end protective framework empowering developers to embed digital watermarks into pertinent deep learning models prior to dissemination. On the other hand, employing trigger set classification labels for watermark incorporation aims to introduce a backdoor into the network architecture, delivering exclusive activation rights to the copyright verifier. Adi et al. [33] introduced a framework integrating author signatures during DNN training to establish a backdoor-based watermarking mechanism, enabling verification of author identity through predefined signature patterns in watermarked DNNs. Additionally, Shafieinejad et al. [34] proposed a backdoor-inspired approach for watermarking DNNs. While these methodologies are tailored for specific network models and not directly applicable to neural radiance field (NeRF) copyright protection, insights from model watermarking can inform the design of watermark algorithms tailored for NeRF security.

2.4. Watermark Algorithm for Neural Radiation Field

The remarkable performance of neural radiance fields (NeRFs) in 3D reconstruction has garnered significant attention from researchers. Li et al. [13] introduced StegaNeRF, a method for embedding secret information in NeRF rendering. Their approach involved an optimization framework, initially training a standard NeRF model and subsequently conducting additional training to achieve the rendering of secret images while preserving the original visual quality. Similarly, Liu et al. [35] embedded secret messages within the implicit representation function of masked data, enabling direct extraction of these messages via a shared key between the sender and receiver. In contrast, Luo et al. [14] proposed an anti-distortion rendering scheme, replacing the original color representation in NeRF with a watermarked color representation to ensure stable extraction of watermark information in 2D rendered images. Additionally, Chen et al. [36] utilized the newly synthesized view of NeRF as a pivotal element for copyright verification, employing parameterized methods to train a watermark extractor for model validation. Notably, the majority of the aforementioned schemes rely on watermark extractors for copyright protection, which may entail certain security risks. Consequently, the development of an implicit representation watermarking algorithm specifically tailored for NeRF models has been pursued to address these concerns.

3. Preliminaries

The neural radiance field (NeRF) model utilizes an MLP network trained with the input of three-dimensional coordinate positions

x = (x, y, z)

and spatial point directions

d = (θ, φ)

, where

θ

and

φ

represent horizontal and vertical azimuths, respectively. The network outputs the color of spatial points

c = (r, g, b)

and the density of corresponding positions (voxels)

σ

. In the specific implementation, the position information of x and d is encoded initially. Subsequently, x is fed into the MLP network to produce

σ

and a 256-dimensional intermediate feature. This intermediate feature, along with d, is jointly input into a fully connected layer for color prediction, culminating in the generation of a two-dimensional image through volume rendering. The network structure of the NeRF is depicted in Figure 1.

Despite the high adaptability and strong learning capabilities of fully connected MLP networks, they exhibit notable parameter and structural redundancies. To validate the substantial redundancy within the NeRF model network, an 8-layer MLP network with a hidden layer size of 128 was trained. Following this, parameters of the trained NeRF model were pruned based on very small absolute values. The experimental outcomes, illustrated in Figure 2, demonstrate that even with a 50% parameter trim, the NeRF model retains high reconstruction quality. Hence, leveraging the inherent redundancy in MLP networks is considered in the design of our proposed solution.

4. Proposed Method

4.1. Framework

Our study centers on INR watermarking, utilizing the implicit representation of a NeRF as a case study to elucidate our approach, as depicted in Figure 3.

In our methodology, we denote the initial network conveying watermark information as

F_{σ} (\cdot)

, the augmented model network with the watermark as

G_{θ} (\cdot)

, the watermark information dataset as

H_{i}, i \in [1, k]

, and the carrier model’s training dataset as

Q_{i}, i \in [1, k]

. The overall framework of our approach, delineated in Figure 4, is partitioned into three sequential stages. Initially, the content owner trains

H_{i}

using

F_{σ} (\cdot)

to derive the watermark information network

{\hat{F}}_{σ} (\cdot)

and integrates the model parameters of

{\hat{F}}_{σ} (\cdot)

into

G_{θ} (\cdot)

via the key K. Subsequently, the content owner trains

G_{θ} (\cdot)

with the carrier dataset to construct a NeRF model

{\hat{G}}_{θ} (\cdot)

encompassing watermarks. In the final phase, during a copyright dispute, the content owner leverages the key K to extract the original parameters of

{\hat{F}}_{σ} (\cdot)

from

{\hat{G}}_{θ} (\cdot)

for watermark information recovery. The embedding and extraction processes of the watermark information described above can be expressed through Equations (1) and (2):

\begin{matrix} G_{θ} (\cdot) = E m b ({\hat{F}}_{σ} (\cdot), K) \end{matrix}

(1)

\begin{matrix} {\hat{F}}_{σ} (\cdot) = E x t ({\hat{G}}_{θ} (\cdot), K) \end{matrix}

(2)

4.2. Data Representation and Transformation

In the initial phase, we depict the watermark information data using implicit neural representations. The structure of the image noise-resilient (INR) model is a multilayer perceptron (MLP) network comprising n hidden layers of size D, employing the activation function

σ (\cdot)

. The output of the INR model can be formulated as shown in Equation (3):

\begin{matrix} \begin{matrix} y = W^{(n)} (g_{n - 1} o \dots o g_{1} o g_{0}) (h_{0}) + b^{(n)}, \\ where h_{i + 1} = g_{i} (h_{i}) = σ (W^{(i)} h_{i} + b^{(i)}) \end{matrix} \end{matrix}

(3)

Here,

h_{i}, i \in {0, 1, 2, \dots, n}

represents the input of the i-th layer, and y denotes the corresponding output value of

h_{0}

. Additionally,

g_{i}

signifies the hidden layer,

W^{(i)}

represents the weight matrix of the i-th layer, and

b^{(i)}

denotes the bias parameter of the i-th layer. The INR network architecture utilizes a 10-layer MLP network structure with the ReLU activation function. In the context of a given input H, the output set M corresponding to Y is associated with a loss function as in Equation (4):

\begin{matrix} L_{w} = min \sum_{(h, y) \in M} {∥F_{σ} (h) - y∥}_{2}^{2} \end{matrix}

(4)

Here,

F_{σ} (\cdot)

denotes the neural representation of the data. For instance, in the context of the NeRF model and a given set of images from diverse perspectives

H_{i}

, the set M comprises three-dimensional coordinates, camera pose

h = (x, y, z, θ, φ)

, corresponding RGB values, and opacity

y = (r, g, b, σ)

.

4.3. Watermark Information Embedding Stage

This article is influenced by advancements in neural network watermarking, focusing on embedding watermark representations within carrier networks. The methodology leverages network unfolding techniques for watermark embedding, ensuring the preservation of watermark network parameters within a newly constructed neural network structure. By adopting implicit representations, especially through straightforward network architectures like MLP, embedding watermark parameters using a designated key is achieved without compromising performance. The new network design is centered on the watermark network, with inspiration drawn from models like the NeRF. Three key network deployment strategies are introduced, depicted in Figure 5, to optimize the embedding process efficiently while maintaining the integrity of the watermark information within the carrier network, as follows:

(1) Horizontal expansion. This strategy entails the insertion of new layers subsequent to the watermark network, preserving the original network structure, to construct a larger network, as illustrated in Figure 5a. (2) Vertical expansion. An alternative approach to expanding the network is to maintain the layer count unchanged. Given that both the watermark and extended networks embody the NeRF model, aside from the input and output layers, we augment solely the count of neurons within the hidden layers, as depicted in Figure 5b. (3) Mixed expansion. Mixed expansion encompasses both horizontal and vertical expansion, concurrently increasing the layer count and the neuron count within existing hidden layers. This constitutes a versatile operation, as demonstrated in Figure 5c, where the entirety of the watermark network’s content and parameters are encapsulated within a single network. Indeed, both horizontal and vertical expansion can be encapsulated under the umbrella of mixed expansion, and our objective is to construct the carrier network’s structure using expansion methodologies that are as simplistic as is feasible.

Employing watermark networks and carrier networks to encapsulate watermark information and NeRF models, we naturally leverage mixed dilation to construct carrier networks. Within implicitly represented 3D scenes, upon the training of the watermark network, we can assemble a carrier network through the watermark network. Due to the necessity of embedding the parameters of the trained watermark network within the carrier network, we designate a shared key K, comprising a randomized binary sequence of zeros and ones, as presented in Figure 2. Within our scheme, the key is defined as

K = {k_{0}, k_{1}, k_{2}, \dots, k_{m}}

, wherein

k_{0}

represents the count of layers of the watermark network neurons within the carrier network, with a binary value of 1 indicating that the layer includes neurons of the watermark network and 0 indicating otherwise. The sequence

{k_{1}, k_{2}, \dots, k_{m}}

denotes the location of the watermark network neurons within the carrier network, where each layer corresponds to a binary stream

k_{m}

of length

d_{k_{m}}

, specifying the neuron count in that layer. Each bit within this sequence represents a neuron, with 1 indicating it belongs to the watermark information and 0 indicating it belongs to the carrier information. For example, in Figure 5c, the corresponding key K is

{01110, 00110, 101101, 01101}

. This method significantly reduces the length of the key through

k_{0}

.

4.4. Training of Carrier Networks

We utilized Equation (4) to enable the implicit representation of the carrier NeRF model through the carrier network

G_{θ} (\cdot)

, where

θ

encompasses two distinct parameter categories: those of the watermark network and others representing carrier information. In ensuring the lossless extraction of

{\hat{F}}_{σ} (\cdot)

from

{\hat{G}}_{θ} (\cdot)

, it becomes imperative to immobilize the parameters associated with the watermark information. Subsequently, training is exclusively directed towards the parameters embodying carrier information, thereby optimizing

{\hat{F}}_{σ} (\cdot)

with precision. In pursuit of this objective, we introduced a binary mask M to facilitate selective optimization of

G_{θ} (\cdot)

in Equation (5):

\begin{matrix} M [p] = \{\begin{matrix} 1, θ [p] \in φ \\ 0, e l s e \end{matrix} \end{matrix}

(5)

Among them,

φ

represents the parameter representing carrier information in

θ

, and

θ [p]

represents the p-th parameter in

θ

. Similar to Equation (6), we use the image set

Q_{i}

to train the carrier network and define the loss of the carrier network as

L_{c}

. The formula for

L_{c}

is as follows:

\begin{matrix} L_{c} = min \sum_{(q, y) \in M} {∥G_{θ} (q) - y∥}_{2}^{2} \end{matrix}

(6)

Let

λ

be the learning rate and ⊙ be the product of elements, and update

G_{θ} (\cdot)

using the gradient descent method, as shown in Equation (7).

\begin{matrix} θ = θ - λ M ⊙ \nabla_{θ} L_{c} \end{matrix}

(7)

Through training, we obtained the NeRF model

{\hat{G}}_{θ} (\cdot)

containing watermarks.

4.5. Watermark Information Extraction

After releasing model

{\hat{G}}_{θ} (\cdot)

online, regular users can engage with and explore 3D environments. Maintaining the watermark information parameters unaltered during the training phase of the carrier network

G_{θ} (\cdot)

ensures that the party responsible for copyright protection can restore the watermark information network

{\hat{F}}_{σ} (\cdot)

without any loss postacquisition of the watermark network parameter details utilizing the key K. Leveraging key K, we not only ascertain the layers within

{\hat{F}}_{σ} (\cdot)

housing watermark information but also identify the neurons containing watermark information along with their pertinent parameter specifics. In essence, through the parameters

P_{{\hat{F}}_{σ} (\cdot)}

and structure

S_{{\hat{F}}_{σ} (\cdot)}

in key K, the watermark network can be regenerated. The overall process of our method is shown in Algorithm 1.

Algorithm 1 Training process of IW-NeRF.

1:: Data: Watermark information dataset $H_{i}, i \in [1, k]$ , Carrier model training dataset $Q_{i}, i \in [1, k]$ , Random key K, learning rate $η$
2:: Output: Watermark information network model ${\hat{F}}_{σ} (\cdot)$
Initial network containing watermark information $G_{θ} (\cdot)$
Watermarked NeRF model ${\hat{G}}_{θ} (\cdot)$ .
Optimizing the parameter $θ$ of model $G_{θ} (\cdot)$ through $Q_{i}$
Compute mask M for $θ$ as in Equation (5)
3:: for each training iteration t do
    Compute Watermark information loss as Equation (4)
    Compute Carrier network information loss as Equation (6)
    Update $θ$ as Equation (7)
4:: end for

5. Experiments

5.1. Experimental Settings

Dataset. We assessed our algorithm using the NeRF Semantic and LLFF datasets sourced from the NeRF dataset. The LLFF dataset encompasses diverse scenes such as flowers, ferns, fortresses, and rooms, while the NeRF Semantic dataset features 360-degree scenes like LEGO structures, drums, and chairs. For evaluating the efficacy of our approach within the NeRF Semantic dataset, akin to a NeRF [1], our training regimen involves feeding in 100 views per scene. To gauge the visual fidelity of our methodology, we handpicked 20 images per scene from the test dataset. Furthermore, we conducted renders of 200 views for each scene to validate the watermark extraction precision across varying camera angles. Our comprehensive experimental methodology entails presenting all outcomes as averaged results for clarity and consistency.

Implicit neural representation setting. Our approach was implemented utilizing PyTorch, and both the watermark information and carrier network were structured employing an MLP network design. We employed 12 hidden layers of size 128 to encode the watermark information and 22 hidden layers of size 256 for the carrier network. The hyperparameters used include a learning rate of

5 \times 10^{- 4}

, a batch size of 512, utilization of the Adam optimizer with default values of

β_{1} = 0.9

,

β_{2} = 0.999

, and regularization parameter

λ = 1 \times 10^{- 8}

. The experiments were conducted on an NVIDIA A100 GPU, with training of both the watermark network and carrier network executed using a stochastic gradient descent algorithm over 20,000 epochs.

Baselines. To the best of our knowledge, there exists limited research on watermarking specifically tailored for NeRF applications. Thus, we conducted a comparative analysis with established watermarking techniques adapted for NeRF to ensure a comprehensive evaluation: (1) LSB [6] + NeRF: here, we employed the LSB algorithm to embed watermark data into the dataset images before NeRF model training; (2) DeepStega [37] + NeRF: preceding NeRF model training, we leveraged the two-dimensional watermarking approach of DeepStega for image processing; (3) HiDDeN [19] + NeRF: prior to NeRF model training, the image underwent processing utilizing the HiDDeN scheme; (4) StegaNeRF [13]; and (5) CopyNeRF [14].

Distortion evaluation metric. In order to evaluate the distortion between the carrier data and the carrier data containing watermarks, as well as the distortion between the watermark data and the extracted watermark data, we visualized the implicitly represented data and used several evaluation metrics. For 2D images rendered from 3D models, we evaluate them using peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and perceived loss (LPIPS).

(1): PSNR (peak signal-to-noise ratio).

PSNR is an indicator used to evaluate image quality based on the concept of root mean square error (MSE), which represents the peak signal-to-noise ratio of image signals. The higher the PSNR value, the better the quality of the evaluated image and the smaller the error. The calculation methods for MSE and PSNR are shown in Equations (8) and (9), respectively.

\begin{matrix} MSE = \frac{1}{W \times H} \sum_{i = 1}^{W} {\sum_{j = 1}^{H} (X_{i, j} - Y_{i, j})}^{2} \end{matrix}

(8)

\begin{matrix} PSNR = 20 \times \lg (s c) - 10 \times \lg (MSE) \end{matrix}

(9)

In the formulas, X and Y represent two images with a size of W × H, respectively, and sc represents the scaling factor, usually taken as 2.

(2): SSIM (structural similarity).

SSIM is a metric that evaluates image quality by measuring the similarity between two images, with higher values indicating higher similarity. The calculation method of SSIM is shown in Equation (10).

\begin{matrix} SSIM (x, y) = \frac{(2 μ_{x} μ_{y} + c_{1}) (2 σ_{x y} + c_{2})}{({μ_{x}}^{2} + {μ_{y}}^{2} + c_{1}) ({σ_{x}}^{2} + {σ_{y}}^{2} + c_{2})} \end{matrix}

(10)

In the above equation,

μ_{x}

and

σ_{x}

are the mean and variance of image X,

μ_{y}

and

σ_{y}

are the mean and variance of image Y, and

σ_{x y}

is the covariance of X and Y.

c_{1} = {(k_{1} L)}^{2}

,

c_{2} = {(k_{2} L)}^{2}

is a constant,

k_{1} = 0.01

,

k_{2} = 0.03

, L is the dynamic range of pixel values, and if the data are uint8-type, L takes a value of 255.

(3): LPIPS (learned perceptual image patch similarity).

LPIPS is a method of measuring image similarity, which does not use mathematical formulas to implement but evaluates the perceptual differences between two images through deep learning models. The lower the value of LPIPS, the more similar the two images are; and the higher the value, the greater the difference. We use a pretrained LPIPS model Alex for evaluation.

5.2. Reconstruction and Watermark Extraction Quality

We conducted a qualitative comparison of the reconstruction quality against all baselines, with the results depicted in Figure 6. The visual analysis from Figure 6 indicates that all methods exhibit commendable reconstruction quality. While other schemes may outperform ours in reconstruction quality due to their limited capacity stemming from text or image information embedding, our approach stands out in terms of embedded capacity. Despite the reduction in reconstruction quality caused by the embedding of watermark network parameters, likely attributed to fixed parameters impacting the carrier network’s fitting effect, the overall NeRF model information remains retrievable visually. Notably, our designed key facilitates lossless recovery of watermark information from a carrier network containing watermarks, offering a significantly larger embedded capacity compared to alternative schemes. For a quantitative assessment, we further analyzed the rendering quality and watermark information extraction effects across various schemes, as detailed in Table 1 and Table 2.

By examining the outcomes outlined in Table 1 and Table 2, a notable observation is the ineffectiveness of the two-dimensional watermarking techniques applied to the NeRF model in retrieving watermark information. The alteration of information embedded via the two-dimensional watermarking method due to NeRF’s view synthesis led to the method’s failure. Conversely, the utilization of three-dimensional watermarking approaches facilitated accurate extraction of watermark information. Notably, while other algorithms incorporate bit strings into the model, our methodology visualizes the 3D reconstruction and watermark extraction effects. Our experimentation involved utilizing diverse 360-degree scenes such as ‘Lego’, ‘drums’, and ‘chairs’ from the NeRF Semantic dataset for training the watermark network and carrier network. The resulting experimental findings, as illustrated in Figure 7, showcase various images representing original watermark information, carrier information, samples from both the watermark and carrier networks, and extracted watermark network samples. Despite our method slightly affecting the NeRF model’s training performance, it demonstrates the ability to extract embedded watermark information from the carrier network without loss. Noteworthy is the incorporation of a neural network capable of implicitly representing a range of copyright materials (e.g., images, sounds, videos) in real-world scenarios.

5.3. Algorithm Capacity

Our approach involves embedding the parameters of a watermark network within a carrier model network, allowing for the assessment of algorithm capacity through scalability analysis. The expansion rate, as defined in Equation (11), serves as a metric to quantify this scalability.

\begin{matrix} e = \frac{N_{c a r r i e r}}{N_{w a t e r m a r k}} - 1 \end{matrix}

(11)

In this context,

N_{w a t e r m a r k}

and

N_{carrier}

represent the parameter counts within the original watermark network and carrier network, respectively. Consequently, with

e ⩾ 0

, the expansion rate is intricately linked to the sizes of the watermark network and carrier network. A lower expansion rate correlates with higher capacity. To assess the influence of watermark capacity on the outcomes of our proposed scheme, we executed the subsequent pair of experiments.

Watermark network layer modification. In our approach, the utilization of INR to represent watermark information enables a smaller watermark neural network to achieve superior carrier data reconstruction during the fitting process. By adjusting the size of the watermark network, we can influence the performance of the carrier network fitting. Specifically, we maintained a fixed size of 22 hidden layers for the carrier network and employed hidden layer networks with 16, 14, 12, 10, and 8 layers for training the watermark information. The hidden layer width for the carrier network was set at 256, while for the watermark network, it was set at 128. Our experiments were carried out using the LLFF dataset. Figure 8 illustrates the performance of carrier models trained with varying watermark networks, showcasing rendered images by both the carrier and watermark networks in the respective columns. Furthermore, Figure 9 displays the SSIM values between the rendered and real images of the carrier network across different watermark network levels. Table 3 depicts the impact of the expansion rate on the PSNR values between the rendered and real images of the carrier network. Notably, our experimental results highlight that with a consistent size for the carrier network, reducing the number of layers in the watermark network leads to an increase in the expansion rate and the complexity of the carrier network, consequently enhancing the quality of the carrier model.

Carrier network layer modification. To further optimize 3D reconstruction outcomes, adjustments to both the watermark network size and the carrier network dimensions were explored. The watermark network was kept at a fixed size of 10 layers, while carrier networks ranging from 18 to 26 layers were utilized in training the carrier model. The hidden layer width was set to 128 for the watermark network and 256 for the carrier network. PSNR values illustrating the comparison between images rendered by the carrier model trained on different layers of the carrier network and the original images is depicted in Figure 10. Additionally, Figure 11 showcases SSIM values representing the comparison between the rendered image and the actual image of the carrier network across varying layers. The impact of the expansion rate on the PSNR values between the rendered and real images of the carrier network is detailed in Table 4. The experimental findings reveal a positive correlation between the depth of the carrier network and the expansion rate, leading to an enhancement in the reconstruction quality of the carrier model. However, it is worth noting that an increase in the number of layers of the carrier network introduces higher computational complexity and training challenges, potentially reducing training efficiency. For future research directions, the adoption of multiresolution hash encoding [38] is under consideration to boost training speed.

5.4. The Robustness of the Model

In practical scenarios, the vulnerability of our model to malicious attacks necessitates an assessment of the robustness of our watermarking scheme against various model alterations. To evaluate the resilience of our algorithm, we employed two common modifications. Model pruning is a vital technique, especially for MLP networks known for their robust fitting capabilities but often burdened with numerous parameters due to deep network layers and high neuron counts. The abundance of parameters not only escalates computational demands in deep learning but also presents an opportunity for optimization through pruning. The primary objective of network pruning is to reduce redundant parameters while preserving the original network’s performance. Our experiments encompassed

L_{1}

unstructured pruning,

L_{n}

structured pruning, random structured pruning, and random unstructured pruning methods. During the pruning process, the parameter with the minimum absolute pruning rate p% (ranging from 10% to 90%) was set to 0 for the NeRF model embedded with watermarks. Subsequently, we compared the extracted watermark information postpruning with the prepruning data to assess its impact on our watermark framework. Ideally, a model thief aims to render the extracted watermark information inaccurate postpruning while retaining the model’s performance. Figure 12 and Figure 13 illustrate the influence of pruning rates on watermark extraction performance under various pruning methodologies. Notably, even with

L_{1}

unstructured pruning removing 90% of parameters, our embedding method demonstrated high accuracy in watermark extraction (with an SSIM value of 0.8829 between the extracted and original watermark information). Conversely,

L_{n}

structured pruning, random structured pruning, and random unstructured pruning methods exhibited a significant reduction in extracted watermark quality beyond a 10% pruning rate. However, while these pruning methods may compromise watermark quality, they also detrimentally impact the model’s performance, rendering the stolen model ineffective. Thus, our watermark algorithm exhibits robustness against pruning modifications, ensuring that any attempts to maintain the model’s quality post-theft come at the cost of significantly reduced model performance.

Model fine-tuning is a crucial aspect in the context of large-scale NeRF models. Training such models from scratch demands a substantial training dataset, and the lack of adequate data can impact model performance. Hence, in practical scenarios, fine-tuning existing models becomes a more feasible approach when data scarcity is a concern. Typically, fine-tuning is preferred when there is minimal variance in experimental performance between the original dataset and the dataset trained using a pretrained model. Consequently, for potential plagiarists, employing fine-tuning methods to train a new model based on the stolen model with limited training data becomes a viable strategy. This new model not only mirrors the performance of the original model but also diverges in experimental outcomes from the original model. In our experimental setup, each dataset underwent a partitioning of the test data into two segments: the first half facilitated the fine-tuning of the trained NeRF model, while the second half served to evaluate the performance of the newly derived model. Subsequently, we utilized the fine-tuned model to assess the resilience of our watermark scheme against modifications induced by the fine-tuning process.

Table 5 presents the PSNR and SSIM metrics comparing the original watermark information with the extracted watermark information post-model fine-tuning. The experimental findings indicate that fine-tuning has a minimal impact on watermark accuracy, regardless of whether synthetic or real datasets are utilized. This resilience to fine-tuning-induced modifications can be attributed to the presence of numerous redundant neurons within the MLP network, enhancing the robustness of our watermark algorithm against such adjustments.

5.5. Security Assessment

Contrary to discrete representations, implicit neural representations introduce uncertainty, leading to varied weight distributions for each dataset across rendering and watermark extraction processes. To assess the algorithm’s security, we conducted a comprehensive statistical analysis and visual examination of the weight distributions within the watermark network model and the carrier network post-watermark embedding. The watermark network comprised 10 layers, while the carrier network featured 22 layers. By comparing the corresponding layers’ weights in both networks, we observed distinct differences. Visualization techniques were employed to juxtapose the weights of networks with equivalent layer numbers in the watermark and carrier networks, depicted in Figure 14. The disparity in weight distributions between the watermark and carrier networks was evident, signifying the successful incorporation of watermark network parameters into the carrier network.

Within our watermarking framework, the pivotal role of the key in extracting watermark information underscores the need for a comprehensive evaluation of key sensitivity. Analogous to cryptographic protocols, the correct key serves as the sole means through which copyright stakeholders can access the embedded watermark data. In contrast to traditional cryptography, the performance of neural networks in extracting watermarks is subject to various influencing factors, including network topology, training datasets, parameter configurations, and optimization strategies, which may introduce performance variability. From an experimental standpoint, the fundamental criterion lies in the successful extraction of the watermark using the correct key, as even minor deviations in the key could result in extraction failures. Figure 15 showcases the average peak signal-to-noise ratio (PSNR) values between the original watermark and the extracted watermark across diverse datasets, highlighting the impact of varying error bits within the 2180-sized key. Additionally, Figure 16 visually demonstrates the consequences of erroneous key bits on the output quality of the extracted watermark network, showcasing a rapid decline in PSNR values as the number of incorrect key bits escalates, leading to perceptible image blurring. This observation underscores the sensitivity of the watermark extraction process to even minor alterations in the key. For individuals lacking direct access to the key, resorting to guesswork may introduce errors, with approximately 50% of incorrect key bits resulting in failed watermark extraction attempts. Future endeavors will focus on refining key generation methodologies to bolster the security of the watermarking scheme.

6. Limitations

The utilization of a network model, as opposed to a conventional bit string or image, in our watermarking scheme introduces a trade-off in the quality of the NeRF model. Our embedding and extraction processes operate akin to a white-box model, necessitating access to the NeRF model’s parameters for successful watermark extraction. This requirement may constrain the practical applicability of our algorithm, particularly in scenarios where parameter accessibility is restricted. Moving forward, our research endeavors will focus on enhancing the watermark embedding methodology to refine the overall efficiency and robustness of our watermarking scheme.

7. Conclusions

In light of the rapid advancements and diverse applications of NeRF technology, the preservation of copyright for NeRF models has emerged as a critical imperative. This study introduces a novel NeRF model watermarking scheme known as implicit neural representation watermarking for NeRF (IW-NeRF). Within this framework, the watermark information is intricately embedded into the NeRF model through implicit neural representations, ensuring a seamless and robust integration of copyright protection mechanisms within the model architecture. Subsequently, the extraction of the watermark information is facilitated through a designated key, effectively fortifying the copyright protection framework for NeRF models.

Advantages. The experimental findings demonstrate the distinct advantages of our proposed watermark scheme: (1) Security: Within our scheme, the watermark information is implicitly represented and then randomly embedded into the NeRF model using a key derived from network parameters. The extensive key space, stemming from the multitude of network parameters, ensures the security of our approach. (2) Capacity: By characterizing the watermark information implicitly through neural methods and incorporating it into a continuous function, the memory required for signal parameterization becomes independent of spatial resolution, solely influenced by the signal’s underlying complexity. This strategy enables the utilization of continuous functions for achieving high-capacity watermarks. (3) Robustness: Neural networks exhibit inherent redundancy in structure and neuron count. By embedding the watermark information within the NeRF model network, our scheme maintains resilience against minor alterations to the model; any significant attacks on the model would consequently impact its performance rather than compromise the embedded watermark. (4) Universality: Implicit neural representations possess the capability to depict diverse data forms. While our experimental focus entailed the utilization of the NeRF model as watermark information, the watermark network framework proposed in our study extends to various media types such as images and videos. Noteworthy is the pioneering introduction of implicit neural representation within the realm of digital watermarking in this research.

Author Contributions

Writing—original draft, L.C.; Formal analysis, C.S.; Writing—review & editing, J.L.; Software, W.S.; Methodology, W.D.; Resources, F.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the National Natural Science Foundation of China, with fund number 62272478.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the author 18792537291@139.com. The data are not publicly available due to privacy.

Conflicts of Interest

The authors declare that they have no conflicts of interest to report regarding the present study.

References

Li, D.; Yang, Z.; Jin, X. Zero watermarking scheme for 3D triangle mesh model based on global and local geometric features. Multimed. Tools Appl. 2023, 82, 43635–43648. [Google Scholar] [CrossRef]
Wu, H.; Liu, G.; Yao, Y.; Zhang, X. Watermarking Neural Networks with Watermarked Images. IEEE Trans. Circuits Syst. Video Technol. 2021, 31, 2591–2601. [Google Scholar] [CrossRef]
Feng, L.; Zhang, X. Watermarking Neural Network with Compensation Mechanism. In Knowledge Science, Engineering and Management; Lecture Notes in Computer Science; Li, G., Shen, H.T., Yuan, Y., Wang, X., Liu, H., Zhao, X., Eds.; Springer: Cham, Swtzerland, 2020; pp. 363–375. [Google Scholar] [CrossRef]
Mildenhall, B.; Srinivasan, P.P.; Tancik, M.; Barron, J.T.; Ramamoorthi, R.; Ng, R. NeRF: Representing scenes as neural radiance fields for view synthesis. Commun. ACM 2022, 65, 99–106. [Google Scholar] [CrossRef]
Kuang, X.; Ling, W.A.; Ke, L.S.; Lei, G.; Ping, P.J.; Yue, L.Z.; Ping, L.F. Watermark embedding and extraction based on LSB and four-step phase shift method. In Proceedings of the 2019 7th International Conference on Information Technology: IoT and Smart City, Shanghai, China, 20–23 December 2019; pp. 243–247. [Google Scholar]
Muyco, S.D.; Hernandez, A.A. Least significant bit hash algorithm for digital image watermarking authentication. In Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence, Bali, Indonesia, 17–20 April 2019; pp. 150–154. [Google Scholar]
Van Schyndel, R.G.; Tirkel, A.Z.; Osborne, C.F. A digital watermark. In Proceedings of the 1st International Conference on Image Processing, Austin, TX, USA, 13–16 November 1994; Volume 2, pp. 86–90. [Google Scholar]
Chen, L.; Liu, J.; Dong, W.; Sun, W. Image Hiding Scheme Based on Dense Residual Networks. Sci. Technol. Eng. 2024, 24, 03719-08. [Google Scholar]
Singh, H.K.; Singh, A.K. Digital image watermarking using deep learning. Multimed. Tools Appl. 2024, 83, 2979–2994. [Google Scholar] [CrossRef]
Charfeddine, M.; Mezghani, E.; Masmoudi, S.; Amar, C.B.; Alhumyani, H. Audio watermarking for security and non-security applications. IEEE Access 2022, 10, 12654–12677. [Google Scholar] [CrossRef]
Luo, X.; Li, Y.; Chang, H.; Liu, C.; Milanfar, P.; Yang, F. DVMark: A deep multiscale framework for video watermarking. arXiv 2023, arXiv:2104.12734. [Google Scholar] [CrossRef] [PubMed]
Uchida, Y.; Nagai, Y.; Sakazawa, S.; Satoh, S. Embedding Watermarks into Deep Neural Networks. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, Bucharest, Romania, 6–9 June 2017; pp. 269–277. [Google Scholar] [CrossRef]
Li, C.; Feng, B.Y.; Fan, Z.; Pan, P.; Wang, Z. StegaNeRF: Embedding Invisible Information within Neural Radiance Fields. arXiv 2022, arXiv:2212.01602. [Google Scholar]
Luo, Z.; Guo, Q.; Cheung, K.C.; See, S.; Wan, R. CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields. arXiv 2023, arXiv:2307.11526. [Google Scholar]
Chen, W.; Zhu, C.; Ren, N.; Seppänen, T.; Keskinarkaus, A. Screen-cam robust and blind watermarking for tile satellite images. IEEE Access 2020, 8, 125274–125294. [Google Scholar] [CrossRef]
Singh, O.P.; Singh, A.K. Image fusion-based watermarking in IWT-SVD domain. In Advanced Machine Intelligence and Signal Processing; Springer: Berlin/Heidelberg, Germany, 2022; pp. 163–175. [Google Scholar]
Thomas, R.; Sucharitha, M. Contourlet and Gould transforms for hybrid image watermarking in RGB color images. Intell. Autom. Soft Comput. 2022, 33, 879–889. [Google Scholar] [CrossRef]
Kandi, H.; Mishra, D.; Gorthi, S.R.S. Exploring the learning capabilities of convolutional neural networks for robust image watermarking. Comput. Secur. 2017, 65, 247–268. [Google Scholar] [CrossRef]
Zhu, J.; Kaplan, R.; Johnson, J.; Fei-Fei, L. Hidden: Hiding data with deep networks. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 657–672. [Google Scholar]
Zhang, C.; Karjauv, A.; Benz, P.; Kweon, I.S. Towards robust deep hiding under non-differentiable distortions for practical blind watermarking. In Proceedings of the 29th ACM International Conference on Multimedia, Virtual Event, 20–24 October 2021; pp. 5158–5166. [Google Scholar]
Zhang, G.; Liu, J. Research on Watermark Algorithm for 3D Color Point Cloud Models. Comput. Technol. Dev. 2023, 33, 62–68. [Google Scholar]
Li, B.; Wu, J.; Zhang, B. Improved Image Digital Watermark Algorithm for Zero Parallax Pixel Reorganization. Comput. Simul. 2023, 40, 244–248. [Google Scholar]
Cui, J.; Zhang, G. Anti simplification blind watermarking algorithm based on vertex norm 3D mesh model. Comput. Eng. Des. 2023, 44, 692–698. [Google Scholar] [CrossRef]
Xiong, X.; Wei, L.; Xie, G. A robust color image watermarking algorithm based on 3D-DCT and SVD. Comput. Eng. Sci. Gongcheng Kexue 2015, 37, 8. [Google Scholar]
Pham, G.N.; Lee, S.H.; Kwon, O.H.; Kwon, K.R. A 3D Printing Model Watermarking Algorithm Based on 3D Slicing and Feature Points. Electronics 2018, 7, 23. [Google Scholar] [CrossRef]
Hou, J.U.; Kim, D.G.; Lee, H.K. Blind 3D mesh watermarking for 3D printed model by analyzing layering artifact. IEEE Trans. Inf. Forensics Secur. 2017, 12, 2712–2725. [Google Scholar] [CrossRef]
Hamidi, M.; Chetouani, A.; El Haziti, M.; El Hassouni, M.; Cherifi, H. Blind robust 3D mesh watermarking based on mesh saliency and wavelet transform for copyright protection. Information 2019, 10, 67. [Google Scholar] [CrossRef]
Wang, F.; Zhou, H.; Fang, H.; Zhang, W.; Yu, N. Deep 3D mesh watermarking with self-adaptive robustness. Cybersecurity 2022, 5, 24. [Google Scholar] [CrossRef]
Yoo, I.; Chang, H.; Luo, X.; Stava, O.; Liu, C.; Milanfar, P.; Yang, F. Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them from 2D Renderings. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 10021–10030. [Google Scholar] [CrossRef]
Wang, J.; Wu, H.; Zhang, X.; Yao, Y. Watermarking in Deep Neural Networks via Error Back-propagation. Electron. Imaging 2020, 32, 22-1–22-9. [Google Scholar] [CrossRef]
Fan, L.; Ng, K.W.; Chan, C.S. Rethinking Deep Neural Network Ownership Verification: Embedding Passports to Defeat Ambiguity Attacks. 2019. Available online: https://proceedings.neurips.cc/paper_files/paper/2019/file/75455e062929d32a333868084286bb68-Paper.pdf (accessed on 16 September 2019).
Rouhani, B.D.; Chen, H.; Koushanfar, F. DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models. arXiv 2018, arXiv:1804.00750. [Google Scholar]
Adi, Y.; Baum, C.; Cisse, M.; Keshet, J.; Pinkas, B. Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring. 2018. Available online: https://www.usenix.org/system/files/conference/usenixsecurity18/sec18-adi.pdf (accessed on 13 February 2018).
Shafieinejad, M.; Wang, J.; Lukas, N.; Li, X.; Kerschbaum, F. On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks. arXiv 2019, arXiv:1906.07745. [Google Scholar]
Liu, J.; Luo, P.; Ke, Y. Hiding Functions within Functions: Steganography by Implicit Neural Representations. arXiv 2023, arXiv:2312.04743. [Google Scholar]
Chen, L.; Liu, J.; Ke, Y.; Sun, W.; Dong, W.; Pan, X. MarkNerf: Watermarking for Neural Radiance Field. arXiv 2023, arXiv:2309.11747. [Google Scholar] [CrossRef]
Baluja, S. Hiding Images in Plain Sight: Deep Steganography. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
Müller, T.; Evans, A.; Schied, C.; Keller, A. Instant neural graphics primitives with a multiresolution hash encoding. ACM Trans. Graph. 2022, 41, 1–15. [Google Scholar] [CrossRef]

Figure 1. The network structure of the NeRF model.

Figure 2. The reconstruction quality of NeRF model under different pruning rates.

Figure 3. We present an implicit representation watermarking algorithm tailored for NeRFs. This approach encapsulates the watermark information implicitly within the NeRF model through a specified key, facilitating subsequent watermark extraction using the same key.

Figure 4. The overall framework of our algorithm IW-NeRF.

Figure 5. Three expansion strategies for constructing carrier networks.

Figure 6. Comparison of reconstruction quality of NeRF models with different baselines.

Figure 7. The NeRF reconstruction and watermark extraction effects of our scheme.

Figure 8. The reconstruction and watermark extraction performance of NeRF models under different watermark network layers with a fixed number of carrier network layers.

Figure 9. The SSIM value between the images rendered by the carrier NeRF model and the images in the original dataset of NeRF under different watermark network layers.

Figure 10. The 3D reconstruction quality of the NeRF model under different carrier network layers after a fixed number of watermark network layers.

Figure 11. SSIM values between the images rendered by the carrier NeRF model and the carrier dataset images under different carrier network layers.

Figure 12. The effect of different pruning methods on watermark extraction.

Figure 13. SSIM values between the original watermark image and the extracted watermark image under different pruning methods.

Figure 14. Comparison of weight visualization between watermark network and carrier network.

Figure 15. The average PSNR between the original watermark and the extracted watermark under different incorrect bit keys.

Figure 16. Visual effects of extracting watermarks under different key error bits.

Table 1. Quantitative analysis of reconstruction quality and watermark extraction effect of NeRF models under different baselines (Synthetic-chair).

Method	NeRF Rendering			Watermark Extraction
Method	PSNR ↑	SSIM ↑	LPIPS ↓	Acc (%) ↑	SSIM ↑
Standard NeRF	33.23	0.9143	0.1113	N/A	N/A
LSB + NeRF	27.45	0.8446	0.1410	N/A	N/A
DeepStega + NeRF	26.41	0.8457	0.1429	N/A	N/A
HiDDeN + NeRF	27.88	0.8964	0.1418	N/A	N/A
StegaNeRF	30.31	0.9847	0.0276	100	0.9643
CopyRNeRF	30.54	0.9689	0.0327	100	N/A
IW-NeRF (ours)	21.32	0.6364	0.2852	100	1

Table 2. Quantitative analysis of reconstruction quality and watermark extraction effect of NeRF models under different baselines (LLFF-trex).

Method	NeRF Rendering			Watermark Extraction
Method	PSNR ↑	SSIM ↑	LPIPS ↓	Acc (%) ↑	SSIM ↑
Standard NeRF	27.76	0.8546	0.1453	N/A	N/A
LSB + NeRF	27.59	0.8435	0.1399	N/A	N/A
DeepStega + NeRF	26.98	0.8356	0.1269	N/A	N/A
HiDDeN + NeRF	27.64	0.8865	0.1512	N/A	N/A
StegaNeRF	28.21	0.8453	0.1423	100	0.9698
CopyRNeRF	30.67	0.9683	0.0457	100	N/A
IW-NeRF (ours)	22.54	0.6539	0.2798	100	1

Table 3. After modifying the number of layers in the watermark network, we download the PSNR value between the rendered image and the real image of the network with different expansion rates.

Expansion Rate	1.75	2.14	2.66	3.40	4.50
PSNR	19.88	21.67	22.99	23.41	25.86

Table 4. After modifying the number of layers in the carrier network, we download the PSNR value between the rendered image and the real image of the body network with different expansion rates.

Expansion Rate	2.60	3.00	3.40	3.80	4.20
PSNR	20.94	21.58	23.21	24.34	25.66

Table 5. The watermark extraction effect on different datasets after model fine-tuning.

Dataset	Hot Dog	Lego	Fern	Flower
PSNR (dB)	46.34	45.28	44.57	46.13
SSIM	0.9843	0.9789	0.9648	0.9881

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, L.; Song, C.; Liu, J.; Sun, W.; Dong, W.; Di, F. IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields. Appl. Sci. 2024, 14, 6184. https://doi.org/10.3390/app14146184

AMA Style

Chen L, Song C, Liu J, Sun W, Dong W, Di F. IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields. Applied Sciences. 2024; 14(14):6184. https://doi.org/10.3390/app14146184

Chicago/Turabian Style

Chen, Lifeng, Chaoyue Song, Jia Liu, Wenquan Sun, Weina Dong, and Fuqiang Di. 2024. "IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields" Applied Sciences 14, no. 14: 6184. https://doi.org/10.3390/app14146184

APA Style

Chen, L., Song, C., Liu, J., Sun, W., Dong, W., & Di, F. (2024). IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields. Applied Sciences, 14(14), 6184. https://doi.org/10.3390/app14146184

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

IW-NeRF: Using Implicit Watermarks to Protect the Copyright of Neural Radiation Fields

Abstract

1. Introduction

2. Related Work

2.1. Digital Watermarking for 2D Data

2.2. Digital Watermarking for 3D Data

2.3. Model Watermarking Algorithm

2.4. Watermark Algorithm for Neural Radiation Field

3. Preliminaries

4. Proposed Method

4.1. Framework

4.2. Data Representation and Transformation

4.3. Watermark Information Embedding Stage

4.4. Training of Carrier Networks

4.5. Watermark Information Extraction

5. Experiments

5.1. Experimental Settings

5.2. Reconstruction and Watermark Extraction Quality

5.3. Algorithm Capacity

5.4. The Robustness of the Model

5.5. Security Assessment

6. Limitations

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI