Article

Comparison of NeRF- and SfM-Based Methods for Point Cloud Reconstruction for Small-Sized Archaeological Artifacts

by Miguel Ángel Maté-González 1,*, Roy Yali 1, Jesús Rodríguez-Hernández 1, Enrique González-González 1 and Julián Aguirre de Mata 2

1 Department of Cartographic and Terrain Engineering, Higher Polytechnic School of Ávila, Universidad de Salamanca, Hornos Caleros 50, 05003 Ávila, Spain
2 Departamento de Ingeniería Cartográfica, Geodésica y Fotogrametría, Escuela Técnica Superior de Ingenieros en Topografía, Geodesia y Cartografía, Universidad Politécnica de Madrid, 28031 Madrid, Spain
* Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(14), 2535; https://doi.org/10.3390/rs17142535
Submission received: 30 May 2025 / Revised: 18 July 2025 / Accepted: 19 July 2025 / Published: 21 July 2025

Abstract

This study presents a critical evaluation of image-based 3D reconstruction techniques for small archaeological artifacts, focusing on a quantitative comparison between Neural Radiance Fields (NeRF), its recent Gaussian Splatting (GS) variant, and traditional Structure-from-Motion (SfM) photogrammetry. The research targets artifacts smaller than 5 cm, characterized by complex geometries and reflective surfaces that pose challenges for conventional recording methods. To address the limitations of traditional methods without resorting to the high costs associated with laser scanning, this study explores NeRF and GS as cost-effective and efficient alternatives. A comprehensive experimental framework was established, incorporating ground-truth data obtained using a metrological articulated arm and a rigorous quantitative evaluation based on root mean square (RMS) error, Chamfer distance, and point cloud density. The results indicate that while NeRF outperforms GS in terms of geometric fidelity, both techniques still exhibit lower accuracy compared to SfM, particularly in preserving fine geometric details. Nonetheless, NeRF demonstrates strong potential for rapid, high-quality 3D documentation suitable for visualization and dissemination purposes in cultural heritage. These findings highlight both the current capabilities and limitations of neural rendering techniques for archaeological documentation and suggest promising future research directions combining AI-based models with traditional photogrammetric pipelines.

1. Introduction

Archaeological artifacts serve as tangible connections to past societies, offering valuable insights into the lives, technologies, and cultural practices of our predecessors. These objects (ranging from stone tools and ceramic vessels to ornate jewelry and inscribed tablets) act as portals to bygone eras, enabling archaeologists and historians to reconstruct patterns of daily life, belief systems, trade networks, and social organization over millennia [1,2,3,4,5,6,7]. Preserving and studying such artifacts is critical not only for advancing scholarly understanding but also for safeguarding cultural heritage for future generations.
Modern conservation efforts employ a wide array of strategies to protect these irreplaceable objects, including environmental monitoring, careful handling, preventive conservation, and systematic documentation [8,9,10]. Among these, digital documentation through high-resolution three-dimensional (3D) modeling has emerged as an essential tool for both research and preservation [1,11,12,13,14,15,16]. Accurate 3D models allow for detailed, non-invasive study of fragile artifacts and enable wider public access through virtual exhibitions and replicas. For small-sized artifacts in particular (those typically under 15 cm in size), achieving high-fidelity digital representations poses notable challenges due to their complex geometries, reflective surfaces, and fine textural details.
In the last two decades, geomatics technologies such as terrestrial laser scanning, structured-light scanning, and SfM photogrammetry have revolutionized 3D feature/object documentation practices in archaeology [17,18,19,20,21]. SfM photogrammetry, in particular, has gained prominence due to its accessibility and ability to reconstruct dense point clouds and textured meshes from overlapping photographic datasets [22]. Nevertheless, when applied to small archaeological artifacts, traditional SfM techniques often encounter limitations: they require controlled lighting and extensive photographic coverage, and they struggle with shiny or translucent materials [23,24,25]. These constraints motivate the exploration of alternative or complementary approaches capable of addressing the specific difficulties posed by small, complex cultural heritage objects.
Recent advances in artificial intelligence have introduced new paradigms for 3D reconstruction, notably NeRF [26]. Unlike conventional methods that explicitly reconstruct geometry, NeRF models learn a volumetric representation of a scene from a set of input images and synthesize photorealistic views from novel perspectives. By leveraging deep neural networks, NeRF can capture intricate lighting effects, subtle textures, and material properties that traditional methods may overlook [27]. Early applications in cultural heritage contexts suggest that NeRF could become a transformative technology for the non-invasive documentation of artifacts, particularly in scenarios where optical challenges hinder feature-based reconstruction techniques [28].
Recent works have explored NeRF approaches in architectural documentation and large-scale outdoor environments [28,29,30,31]. However, few studies have examined the behavior of neural rendering methods in the context of small archaeological artifacts, especially those with metallic or reflective surfaces. Most existing benchmarks focus on synthetic datasets, ideal lighting, or non-reflective materials. This study addresses that gap by applying NeRF and Gaussian Splatting to a challenging but realistic use case: bronze fibulae from the Late Iron Age with high reflectivity and complex geometry. This constitutes a novel application of neural rendering methods to micro-scale, reflective cultural heritage objects, a scenario rarely addressed in the current literature.
This approach complements recent evaluations such as [27,32] but moves the focus toward micro-scale heritage documentation, which remains underrepresented in the literature.
Moreover, rapid developments in NeRF-related methods have significantly improved their efficiency and quality. One of the most promising recent innovations is Gaussian Splatting, introduced by Kerbl et al. (2023) [33], which represents scenes using 3D Gaussians instead of dense voxel grids or implicit functions. This approach offers real-time rendering capabilities while maintaining high visual fidelity, potentially overcoming some of the computational bottlenecks associated with early NeRF models.
Despite these advances, critical questions remain regarding the comparative performance of NeRF techniques and traditional photogrammetry for small artifact reconstruction: How do their outputs differ in terms of geometric accuracy, surface noise, and completeness? Can NeRF or Gaussian Splatting match or surpass the established standards of SfM photogrammetry for heritage applications?
This research aims to systematically compare 3D reconstructions of small archaeological artifacts generated using SfM photogrammetry, NeRF, and GS. To that end, this study examines three horse-shaped fibulae recovered from archaeological sites associated with the Vettones, one of the most famous pre-Roman peoples of western Spain [34]. Currently housed in the Museum of Ávila, these artifacts were selected as case studies due to their refined morphological details and historical significance [35,36,37,38,39,40]. Quantitative evaluation metrics (including root mean square error (RMSE), Hausdorff distance, Chamfer distance, and density analyses) were used to assess the fidelity of the reconstructions against ground-truth measurements obtained with a metrological articulated arm.
One of the main contributions of this study lies in the novel application of NeRF-based methods to the 3D documentation of small archaeological objects with reflective metallic surfaces, a particularly challenging domain for conventional reconstruction techniques. Unlike most previous works focused on large architectural or natural scenes, our research demonstrates the potential of neural rendering approaches for capturing complex microgeometries and materials such as bronze. This application opens up new pathways for non-invasive, low-cost documentation of delicate and hard-to-scan cultural heritage items, especially in museum environments where traditional scanning technologies may be unfeasible. To address this gap, the present study aims to answer the following research questions: (i) How do NeRF and GS perform in reconstructing small-scale, reflective archaeological artifacts compared to traditional SfM photogrammetry? (ii) What are the strengths and limitations of these neural approaches in terms of geometric accuracy and detail preservation? (iii) Can these methods offer a viable alternative for cultural heritage documentation in constrained environments such as museums?
The studied objects exhibit specular reflections and complex geometry; feature-based photogrammetry loses feature correspondences, and the projected patterns of structured-light scanners become saturated. NeRF has been shown to tolerate non-ideal illumination and to capture subtle radiometric properties. Its integration into Nerfstudio allows models to be trained from smartphone video, aligning with the “low-cost/low-contact” objective of the project.
The structure of this article is organized as follows: Section 2 describes the materials and methods, including the artifacts, equipment, and data acquisition protocols. Section 3 presents the results of the comparative analysis. Section 4 discusses the findings in relation to prior work and identifies avenues for future research. Finally, Section 5 summarizes the conclusions and practical implications of this study for 3D documentation of small archaeological objects.

2. Materials and Methods

This section describes the experimental design, acquisition methods, and reconstruction pipelines employed to generate 3D models of small archaeological artifacts. The workflow included (i) the selection and documentation of representative objects, (ii) the acquisition of ground-truth data using a metrological articulated arm, (iii) photogrammetric image acquisition and 3D reconstruction, (iv) Neural Radiance Fields (NeRF) training and point cloud extraction, and (v) quantitative evaluation of the reconstructed models (Figure 1).
The selection of NeRF and GS methods was driven by the specific challenges posed by the artifacts under study: their small size, metallic surfaces, and intricate geometry. These conditions make conventional methods like laser scanning or even structured-light scanning less effective due to issues like reflection, occlusion, or loss of surface detail. Neural rendering techniques, by contrast, are more tolerant to such complexities, providing robust reconstructions using standard RGB imaging and offering significant advantages in environments where physical contact with artifacts is restricted or lighting cannot be fully controlled.
For the NeRF and GS reconstructions, video sequences were captured using an iPhone 13 under diffused daylight conditions, while the photogrammetric images were acquired using a Canon EOS R camera equipped with a macro lens and artificial lighting. The use of different acquisition devices (DSLR for SfM, mobile phone for NeRF/GS) was intentional, aiming to test the robustness of neural rendering methods under realistic, low-cost documentation scenarios. This setup reflects common constraints in museum or fieldwork environments, where access to professional imaging equipment may be limited.

2.1. Archaeological Artifacts

Three horse-shaped fibulae from the Late Iron Age (3rd–1st centuries BC), associated with the Vettones, were selected from the collection of the Museum of Ávila, Spain (Figure 2). These bronze artifacts, approximately 5 cm in length, are characterized by thin, complex geometries and metallic reflective surfaces, posing significant challenges for 3D digitization. Their intricate morphology and cultural significance make them ideal candidates for assessing reconstruction accuracy at small scales.
With a design inspired by the form of a horse, these bronze archaeological pieces are usually associated with the equestrian elite who assumed leadership roles in the pre-Roman societies of Celtic Iberia [35,40]. Fibula 1 (museum number: 06/56/MS/343; size: 4.1 × 3.6 cm) was found in the walls of the archaeological site of La Mesa de Miranda (Chamartín, Ávila) [39,40,41]. Fibula 2 (museum number: 1989/41/3461; size: 4.1 × 3.6 cm) was discovered during Juan Cabré’s excavations at the archaeological site of Las Cogotas (Cardeñosa, Ávila), carried out between 1927 and 1932 [35,39,42,43,44,45]. Fibula 3 (museum number: 04/112/2571; size: 4.5 × 3.7 cm) was found in the walls of Las Cogotas oppidum [39,44,45,46]. In all cases, their excellent state of preservation gives us a glimpse of the dexterity and skill of the craftsmen who created them.

2.2. Ground-Truth Acquisition: Metrological Arm Measurements

Precise 3D coordinates were acquired for each fibula using a Hexagon ROMER Absolute Arm 7325SI metrological system (Hexagon Manufacturing Intelligence, USA) (Figure 3). This instrument is used at the industrial level for metrological control of parts and components, ensuring that they meet the required dimensional specifications.
During data acquisition, the object was kept completely still to avoid any distortion in the measurements, and a total of 22–24 control points were recorded per brooch, distributed strategically over flat areas, morphological details, and high-contrast color regions to ensure comprehensive coverage (Figure 4). Measurements were performed in a controlled indoor environment to minimize external influences, and each fibula was immobilized during acquisition. These datasets served as ground truth for subsequent accuracy assessments.

2.3. Photogrammetric 3D Reconstruction

Photographic datasets were acquired using a Canon EOS 700D DSLR camera equipped with a 60 mm macro lens (Table 1). Each piece was placed on a white matte turntable inside a lightbox with diffuse illumination to minimize reflections. The camera was fixed on a tripod while the object was rotated, following a convergent photogrammetry scheme [23].
Acquisition parameters included:
  • Manual focus and exposure settings (f/8 aperture, low ISO).
  • 150–200 images per artifact at 70–80% overlap.
  • A focus distance of approximately 100–120 mm.
To obtain high-quality images, an efficient setup with the camera mounted on a fixed tripod was implemented, together with a lighting box with a neutral, matte white background. To improve handling of the piece, a white turntable was placed inside the illumination box, which allowed for better control of the object. In this way, good contrast and soft, homogeneous illumination around the object of study were achieved. It should be noted that all three objects are metallic, so correct illumination is vital for the data acquisition process: avoiding reflections is paramount.
After adjusting the composition, the object was slowly rotated on the turntable, enabling images to be taken from multiple angles without altering the camera position. Although this method requires additional preprocessing time to remove the white background and avoid interference in the 3D reconstruction, it is highly effective (Figure 5).
The 3D models were scaled using the data obtained with the metrological arm. A total of 150 to 200 photographs were taken of each artifact, with the exact number varying according to the geometry of the archaeological piece.
We performed 3D reconstructions using Agisoft Metashape (version 2.2.1; https://www.agisoft.com/), a commercial photogrammetric software package, applying standard SfM and Multi-View Stereo (MVS) pipelines. Masks were applied to each image to remove the background prior to processing. The models were scaled using the ground-truth points. The resulting models achieved an average Ground Sample Distance (GSD) of 0.0079 mm, an average scale error of 0.0165 mm, and a mean photogrammetric error of 0.0056 mm.
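For readers who script this step, the sequence below is a minimal sketch of an equivalent SfM/MVS pipeline using Metashape’s Python API; the method names follow the Metashape 2.x API, and the file paths and downscale factors are illustrative assumptions rather than the exact settings used in this study.

```python
# Minimal sketch of a scripted SfM/MVS pipeline (Metashape 2.x Python API).
# Paths and downscale factors are illustrative assumptions.
import Metashape

doc = Metashape.Document()
chunk = doc.addChunk()
chunk.addPhotos(["fibula1/IMG_0001.JPG", "fibula1/IMG_0002.JPG"])  # 150-200 masked images in practice

# SfM: feature matching and camera alignment
chunk.matchPhotos(downscale=1)      # full-resolution keypoint matching
chunk.alignCameras()

# MVS: depth maps and dense point cloud
chunk.buildDepthMaps(downscale=2)
chunk.buildPointCloud()             # buildDenseCloud() in Metashape 1.x

doc.save("fibula1.psx")
```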

2.4. NeRF-Based 3D Reconstruction

To complement the photogrammetric reconstructions, NeRF models were trained using the Nerfstudio framework [32], a modular, open-source platform that facilitates the implementation and training of volumetric NeRF models from image datasets. This approach enables high-fidelity reconstructions of complex geometries and surface textures from dense visual information.
Data preparation involved extracting frames from high-resolution video footage captured around each artifact. The videos were recorded using a 50-megapixel HDR-enabled smartphone, ensuring detailed coverage from multiple viewpoints while maintaining consistent exposure and dynamic range. This method allowed for efficient acquisition of training data with sufficient angular diversity.
Camera poses were estimated using COLMAP [22], a well-established SfM tool, which provided the necessary orientation data to initialize the NeRF training process. The sparse point clouds and calibrated camera parameters obtained from COLMAP served as the geometric backbone for radiance field learning.
NeRF models were trained using the Nerfacto [32] configuration within Nerfstudio. This setup utilizes multi-resolution hash encoding and adaptive ray sampling to accelerate training and convergence. A total of 200,000 training iterations were performed using default hyperparameters, optimizing the volumetric representation of the scene for accurate photo-consistent rendering. The resulting radiance fields effectively captured the spatial and radiometric properties of the objects, including fine surface details and material variation (Figure 6).
From the trained models, depth maps were extracted and subsequently converted into dense 3D point clouds. These point clouds provided geometric representations suitable for comparative analysis and downstream processing.
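As an illustration of this workflow, the sketch below drives the documented Nerfstudio CLI entry points (ns-process-data, ns-train, ns-export) from Python; exact flags vary across Nerfstudio versions, and the paths and the <run> identifier are placeholders.

```python
# Minimal sketch of the video-to-point-cloud NeRF workflow via Nerfstudio's
# CLI (flags vary by version; paths and <run> are placeholders).
import subprocess

# 1. Extract frames from the smartphone video and estimate poses with COLMAP.
subprocess.run(["ns-process-data", "video",
                "--data", "fibula1.mp4",
                "--output-dir", "data/fibula1"], check=True)

# 2. Train the Nerfacto model with default hyperparameters (200k iterations).
subprocess.run(["ns-train", "nerfacto",
                "--data", "data/fibula1",
                "--max-num-iterations", "200000"], check=True)

# 3. Export a dense point cloud from the trained radiance field.
subprocess.run(["ns-export", "pointcloud",
                "--load-config", "outputs/fibula1/nerfacto/<run>/config.yml",
                "--output-dir", "exports/fibula1"], check=True)
```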

2.5. Gaussian Splatting-Based 3D Reconstruction

In parallel, reconstructions using GS [33] were also performed through Nerfstudio’s GS module. GS represents the scene as a collection of 3D anisotropic Gaussian primitives, enabling faster training and real-time rendering capabilities. This method is particularly beneficial in scenarios where interactive visualization and responsiveness are required.
Gaussian Splatting (GS) represents the scene as a set $\mathcal{G} = \{ g_i \}_{i=1}^{N}$ of anisotropic Gaussian primitives in $\mathbb{R}^3$, each defined by a center $\mu_i$, a covariance matrix $\Sigma_i$, and a color $c_i$. Following Kerbl et al. [33] and the Nerfstudio implementation, we trained the parameters by minimizing the photometric error:

$$\mathcal{L} = \sum_{r \in R} \left\| \hat{C}(r; \mathcal{G}) - C_{\mathrm{ref}}(r) \right\|_2^2$$

where $\hat{C}$ is the alpha-composited integration of the Gaussians along the ray $r$ and $R$ is the batch of training rays.
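As a concrete reading of this objective, the following PyTorch sketch computes the summed squared photometric error over a ray batch; it is a didactic stand-in, not the optimized CUDA rasterizer used by Nerfstudio’s splatfacto.

```python
# Didactic sketch of the GS photometric objective: summed squared L2 error
# between alpha-composited ray colors C_hat and reference pixels C_ref.
import torch

def photometric_loss(c_hat: torch.Tensor, c_ref: torch.Tensor) -> torch.Tensor:
    """c_hat, c_ref: (num_rays, 3) RGB colors for one training batch R."""
    return ((c_hat - c_ref) ** 2).sum(dim=-1).sum()
```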

2.6. Experimental Configurations

Alignment of the NeRF and GS models to the ground-truth coordinate system was initially conducted manually and later refined using the Iterative Closest Point (ICP) algorithm. This ensured precise spatial correspondence between the reconstructed models and reference datasets, facilitating quantitative accuracy assessments and integration with other datasets.
The NeRF- and GS-based 3D reconstructions, generated using the Nerfstudio framework, were systematically compared with scaled photogrammetric models obtained using Agisoft Metashape. The photogrammetric models were accurately scaled using ground-truth reference points acquired with a robotic arm-based measurement system, which provided high-precision spatial data for alignment. This comparison was carried out to evaluate the geometric accuracy, spatial consistency, and level of surface detail captured by each method. Alignment of all reconstructions within a unified coordinate system enabled the calculation of key quantitative metrics, including Ground Sample Distance (GSD), average scale error, and photogrammetric residuals.
All experiments were conducted using an RTX 3070 GPU (8 GB VRAM) in a Python 3.13.3 environment with Nerfstudio 1.0.0. For the NeRF reconstructions, the Nerfacto pipeline was used with default parameters, with key settings including 4096 rays per batch ($2^{12}$) and 32,768 rays per chunk ($2^{15}$). The average training time per object was approximately 32 min.
For GS, the splatfacto method was used, with key settings including an initial radius randomly sampled within the range 0.04–0.08 mm; training was conducted for 15,000 Adam iterations with a learning rate of $1 \times 10^{-3}$. The renderer used the antialiased rasterization mode, and refinement steps were performed every 100 iterations. The average training time per object was 21 min.
In all cases, the reconstructed volumetric representations were converted into point clouds for evaluation. After convergence, volumetric density was sampled on a voxel grid, with approximately 33 million voxels for NeRF and 20 million voxels for GS. The resulting 3D point clouds were exported in .PLY format and used for quantitative geometric comparison.
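A minimal sketch of this conversion step is given below, assuming the sampled density volume is available as a NumPy array; the threshold and voxel size are illustrative, as the paper does not report them, and Open3D is one possible export route.

```python
# Sketch: turn a sampled volumetric density grid into a .PLY point cloud.
# Threshold and voxel size are illustrative assumptions.
import numpy as np
import open3d as o3d

def density_to_ply(density: np.ndarray, origin: np.ndarray,
                   voxel_size: float, threshold: float, path: str) -> None:
    """density: (X, Y, Z) grid of densities sampled from the trained model."""
    idx = np.argwhere(density > threshold)        # indices of occupied voxels
    points = origin + (idx + 0.5) * voxel_size    # voxel centers in world units
    pcd = o3d.geometry.PointCloud()
    pcd.points = o3d.utility.Vector3dVector(points.astype(np.float64))
    o3d.io.write_point_cloud(path, pcd)

# e.g., density_to_ply(sigma, origin=np.zeros(3), voxel_size=0.05,
#                      threshold=10.0, path="fibula1_nerf.ply")
```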

2.7. Evaluation Metrics

Quantitative evaluation between the reconstructed point clouds and ground-truth data was performed according to standard geometric metrics [47]:

2.7.1. Initial Alignment (ICP)

Prior to the calculation of the comparison metrics, an alignment was performed using the Iterative Closest Point (ICP) algorithm [48] to minimize the position discrepancies between the reconstructed cloud and the reference cloud. The associated fitness score measures the proportion of points that, after alignment, are sufficiently close (inliers) according to a defined threshold; a high fitness therefore indicates good overlap between the two point clouds, which is essential before evaluating point errors. The ICP RMSE, which measures the root mean square error between the matched points after alignment, provides an overall measure of alignment deviation and is sensitive to outliers, which can help to detect mismatches.
$$E(\mathbf{R}, \mathbf{t}) = \frac{1}{N_p} \sum_{i=1}^{N_p} \left\| \mathbf{x}_i - (\mathbf{R}\,\mathbf{p}_i + \mathbf{t}) \right\|^2$$

where the alignment error $E(\mathbf{R}, \mathbf{t})$ represents the mean squared distance between corresponding points of the two point clouds after applying a rigid transformation with rotation $\mathbf{R}$ and translation $\mathbf{t}$. In this context, $N_p$ denotes the total number of point correspondences between the source (reconstructed) point cloud and the target (reference) point cloud. The term $\mathbf{x}_i$ refers to the coordinates of the i-th point in the reference cloud, while $\mathbf{p}_i$ corresponds to the coordinates of the i-th point in the source cloud. The expression $\left\| \mathbf{x}_i - (\mathbf{R}\,\mathbf{p}_i + \mathbf{t}) \right\|^2$ calculates the squared Euclidean distance between the transformed source point and its corresponding point in the reference. This function is minimized during the ICP optimization process to determine the rigid transformation that achieves the best possible alignment between both clouds.
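For reference, the snippet below reproduces this alignment and its fitness/RMSE diagnostics with Open3D’s ICP implementation; the 1 mm correspondence threshold and the file names are illustrative assumptions.

```python
# Sketch: ICP refinement after manual pre-alignment, with fitness and RMSE
# diagnostics (threshold and file names are illustrative assumptions).
import numpy as np
import open3d as o3d

source = o3d.io.read_point_cloud("fibula1_nerf.ply")       # reconstructed cloud
target = o3d.io.read_point_cloud("fibula1_reference.ply")  # reference cloud

result = o3d.pipelines.registration.registration_icp(
    source, target,
    max_correspondence_distance=1.0,  # inlier threshold (model units, e.g. mm)
    init=np.eye(4),                   # identity: manual pre-alignment already applied
    estimation_method=o3d.pipelines.registration.TransformationEstimationPointToPoint(),
)
print("ICP fitness:", result.fitness)          # inlier fraction after alignment
print("ICP inlier RMSE:", result.inlier_rmse)  # RMSE over matched inliers
```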

2.7.2. Geometric Discrepancy Metrics

To quantify the differences between the reconstructed cloud and the reference cloud, metrics based on point-to-point distances were calculated.
RMS Error: Measures the average deviation, giving higher weight to large errors, which helps to identify possible outliers in the reconstruction.

$$\mathrm{RMS} = \sqrt{\frac{1}{|P_s|} \sum_{p \in P_s} d(p, P_r)^2}$$

where $d(p, P_r)$ is the distance from the point $p$ in $P_s$ to its nearest point in $P_r$.
Hausdorff distance: Evaluates the worst-case discrepancy between the two clouds, identifying the maximum distance between a point in the reference and its closest point in the reconstruction. This value is critical in applications where it is necessary to ensure that there are no significant error regions.
$$d_H(P_s, P_r) = \max\left\{ \sup_{p \in P_s} d(p, P_r),\; \sup_{q \in P_r} d(q, P_s) \right\}$$

where the first term evaluates the worst distance from the reconstruction to the reference and the second from the reference to the reconstruction.
Chamfer distance: Calculates the sum of the nearest-point distances in both directions (from reference to reconstruction and vice versa), providing a symmetric metric that allows for comparison of the completeness of the reconstruction.

$$d_C(P_s, P_r) = \frac{1}{|P_s|} \sum_{p \in P_s} d(p, P_r) + \frac{1}{|P_r|} \sum_{q \in P_r} d(q, P_s)$$

In this equation, the Chamfer distance $d_C(P_s, P_r)$ quantifies the geometric difference between the source point cloud $P_s$ and the reference cloud $P_r$ by averaging the nearest-neighbor distances in both directions.
Mean Absolute Distance: Represents the average of all measured discrepancies between the two clouds, providing a value less sensitive to outliers than the RMS.
$$d_{\mathrm{mean}} = \frac{1}{N} \sum_{i=1}^{N} d_i$$
Here, the equation measures the average distance between corresponding points in the two clouds. Unlike RMSE, it is less affected by outliers, offering a more robust estimate of the typical point-to-point deviation.
Symmetric distance: Calculated as the average of the distances between the two clouds in both directions, ensuring that the comparison does not depend on the density of points in one of the two representations.
$$d_{\mathrm{sym}} = \frac{1}{2} \left( \frac{1}{|P_s|} \sum_{p \in P_s} d(p, P_r) + \frac{1}{|P_r|} \sum_{q \in P_r} d(q, P_s) \right)$$
This equation averages the mean distances from each cloud to the other, ensuring that the comparison is unbiased with respect to point density or sampling differences.

2.7.3. Density Characterization and Spatial Distribution

In addition to geometric accuracy, the structure of the reconstructed point cloud was evaluated in terms of its density and spatial volume:
Density of points: Estimated as the number of points per unit volume, using the approximate bounding volume defined by the extent of the cloud along the three axes. This metric allows us to identify whether the reconstruction contains a detailed representation or presents deficiencies in capturing fine details.
$$\rho = \frac{N}{(x_{\max} - x_{\min})\,(y_{\max} - y_{\min})\,(z_{\max} - z_{\min})}$$

where $N$ is the total number of points and $(x_{\min}, x_{\max})$, $(y_{\min}, y_{\max})$, and $(z_{\min}, z_{\max})$ are the minimum and maximum values along each axis of the cloud.
These metrics provide a comprehensive assessment of reconstruction fidelity in terms of both local and global deviations.
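To make these definitions concrete, the sketch below computes all of the metrics in Section 2.7 (RMS, Hausdorff, Chamfer, mean absolute, symmetric distance, and point density) from two point clouds using nearest-neighbor queries; it is a straightforward re-implementation of the stated formulas, not the exact tooling used in this study.

```python
# Sketch: the discrepancy and density metrics of Section 2.7, computed with
# k-d tree nearest-neighbor queries (SciPy).
import numpy as np
from scipy.spatial import cKDTree

def cloud_metrics(Ps: np.ndarray, Pr: np.ndarray) -> dict:
    """Ps: (n, 3) source/reconstructed cloud; Pr: (m, 3) reference cloud."""
    d_sr = cKDTree(Pr).query(Ps)[0]  # d(p, Pr) for every p in Ps
    d_rs = cKDTree(Ps).query(Pr)[0]  # d(q, Ps) for every q in Pr
    extent = Ps.max(axis=0) - Ps.min(axis=0)
    return {
        "rms": float(np.sqrt(np.mean(d_sr**2))),
        "hausdorff": float(max(d_sr.max(), d_rs.max())),
        "chamfer": float(d_sr.mean() + d_rs.mean()),
        "mean_abs": float(d_sr.mean()),
        "symmetric": float(0.5 * (d_sr.mean() + d_rs.mean())),
        "density": float(len(Ps) / np.prod(extent)),  # points per unit volume
    }
```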

3. Results

The central purpose of this study is to systematically compare the performance of neural rendering-based 3D reconstruction methods, specifically NeRF and GS, against conventional SfM photogrammetry, using small archaeological artifacts as test subjects. To this end, we evaluate and interpret the accuracy of each method relative to a high-precision ground truth generated with a metrological articulated arm.

3.1. Reconstruction Performance Metrics

Table 2 presents the comparative results across three fibula samples using several quantitative metrics, including ICP Fitness, ICP RMSE, RMS Error, Hausdorff distance, Chamfer distance, Mean Absolute Error, and Symmetric Error.
NeRF consistently achieved lower RMS, Chamfer, and Symmetric Errors compared to GS across all samples, indicating better overall geometric fidelity. Hausdorff distances, which captured the worst-case deviations, were also smaller for NeRF in two of the three brooches. However, both methods demonstrated increased errors when reconstructing Fibula 3, likely due to its more complex geometry and reflective surface properties.
Although GS produced denser point clouds in certain cases, this did not translate into better reconstruction accuracy, suggesting that point density alone is insufficient to guarantee model fidelity.

3.2. Point Density Comparison

Table 3 shows the density of the reconstructed point clouds relative to the ground-truth models.
Both NeRF and GS produced point clouds significantly less dense than the ground-truth data, which was expected given the differences in acquisition technologies. Although GS sometimes generated a higher number of points than NeRF, this did not correlate with superior metric performance. In particular, Fibula 3 exhibited very low densities in both NeRF and GS reconstructions, suggesting that complex or reflective artifacts pose challenges to image-based methods.

3.3. Visual Comparison of Point Clouds

The spatial distribution of errors between the photogrammetric reference models, NeRF reconstructions, and GS models was assessed using the Cloud-to-Cloud Distance tool in CloudCompare. Figure 7 and Figure 8 present deviation maps for the three fibulae, illustrating geometric discrepancies between each reconstruction method and the ground-truth photogrammetric model.
Figure 7 illustrates the deviations between the NeRF models and the photogrammetric reference. The predominance of blue and green tones across the fibulae indicates that the NeRF reconstructions maintain a high degree of geometric fidelity, especially in flat and less detailed areas. Minor deviations (green to light yellow) appear mainly along edges and small ornamental features, but they remain limited in both extent and magnitude.
In contrast, Figure 8 presents the deviation maps for the GS-based models, where a larger proportion of yellow and red areas can be observed, particularly in regions containing fine geometrical details such as contours, cavities, and narrow features. This suggests that GS has more difficulty in accurately reproducing intricate elements and tends to produce higher deviations overall.
While both techniques can approximate the overall structure of the archaeological pieces, the NeRF reconstructions outperform GS in terms of spatial accuracy. The results confirm that NeRF is more effective in preserving geometric integrity, making it a more suitable approach for documenting small and complex archaeological artifacts that require precise 3D representation.

3.4. Quantitative Evaluation of Geometric Accuracy Using Distance Distributions

To complement the visual deviation maps, quantitative analysis was performed to assess the geometric fidelity of the NeRF and GS reconstructions with respect to the photogrammetric reference models. Specifically, the Euclidean distances between corresponding points in the reconstructed and reference clouds were computed and analyzed using histograms that reflect the frequency distribution of point-to-point distances.
This method provides a more detailed understanding of how each technique behaves across the full surface of the object, highlighting not only average performance but also the presence of outliers or areas with significant error. The analysis was carried out separately for each of the three fibulae under study, and the resulting distributions are presented in the following figure (Figure 9).
Figure 9 presents the point-to-point error density (top) and cumulative distribution curves (bottom) for the three fibulae. In models 1 and 2, the NeRF distribution is concentrated in the 0.1–0.4 mm range, while GS exhibits longer tails, indicating greater local deviations. Eighty percent of the NeRF points fall within 0.3 mm of the metrological control, compared to 55–65% for GS. In Fibula 3, both methods exhibit larger errors due to the strong specular reflectance; however, NeRF maintains a median error 30% lower than that of GS.
The F-score at 0.5 mm (tabulated in the same figure) quantifies this observation: 0.73 and 0.83 for NeRF versus 0.55 and 0.50 for GS in the first two cases. In the third, both fall below 0.40. These results confirm NeRF’s geometric advantage, already indicated by the RMS and Chamfer metrics.
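For clarity, the F-score at a threshold τ combines precision (the fraction of reconstructed points within τ of the reference) and recall (the converse) in the usual harmonic mean; the sketch below follows this standard definition, since the formula itself is not spelled out above.

```python
# Sketch: F-score at threshold tau (standard precision/recall definition).
import numpy as np
from scipy.spatial import cKDTree

def f_score(Ps: np.ndarray, Pr: np.ndarray, tau: float = 0.5) -> float:
    """Ps: reconstructed cloud; Pr: reference cloud; tau in model units (mm)."""
    precision = float((cKDTree(Pr).query(Ps)[0] < tau).mean())
    recall = float((cKDTree(Ps).query(Pr)[0] < tau).mean())
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```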
Across all three brooches, the NeRF reconstructions clearly achieve superior geometric fidelity, particularly in areas requiring detailed reproduction. These results reinforce the conclusion that NeRF is better suited than GS for high-accuracy 3D modeling of small archaeological objects, especially when precise spatial measurements are essential for documentation, conservation, or scientific analysis.

3.5. Summary of Reconstruction Performance

The results of this comparative study provide a comprehensive overview of the capabilities and limitations of NeRF and GS for the 3D reconstruction of small archaeological artifacts, using high-precision photogrammetric models as ground truth.
Quantitatively, NeRF consistently outperforms GS across all evaluated metrics, including Root Mean Square Error (RMS), Chamfer distance, and Symmetric Error, as reported in Table 2. These lower error values indicate a higher level of geometric accuracy and surface fidelity. Although GS occasionally produces denser point clouds (Table 3), this does not translate into improved performance, suggesting that point count alone is not a reliable indicator of model quality.
Visually, the deviation maps (Figure 7 and Figure 8) further emphasize NeRF’s superiority in spatial accuracy. While both NeRF and GS can reproduce the overall shape of the objects, NeRF more accurately captures fine details and preserves the structural integrity of the fibulae. GS, on the other hand, exhibits noticeable errors in complex or narrow regions, particularly evident in Fibula 3, where it struggles with reflective surfaces and intricate geometry.
The histogram-based distance analysis (Figure 9) confirms these findings, showing that NeRF consistently maintains a higher concentration of points within minimal deviation ranges (<0.3 mm), while GS distributions are broader and skewed toward higher errors. This pattern is consistent across all three fibulae and highlights NeRF’s robustness and precision.
In conclusion, while both NeRF and GS represent promising approaches for image-based 3D reconstruction, NeRF demonstrates greater reliability and accuracy for small, complex cultural heritage objects. Its ability to generate metrically accurate models makes it more suitable for applications in digital preservation, scientific analysis, and heritage documentation, where geometric precision is critical. GS may still offer advantages in terms of rendering speed and visual smoothness, but its geometric limitations must be carefully considered when fidelity is a priority.

4. Discussion

4.1. Comparative Performance of NeRF, GS, and SfM

The findings of this study contribute significant insights into the practical capabilities, limitations, and future potential of neural rendering-based 3D reconstruction methods, specifically NeRF and GS, when applied to the documentation of small-scale archaeological artifacts. By benchmarking these approaches against high-precision photogrammetric models derived from SfM, we have been able to draw robust conclusions regarding their geometric accuracy, reliability, and applicability in cultural heritage workflows.
Across all evaluated metrics (RMS Error, Chamfer distance, Mean Absolute Error, and Symmetric Error; see Table 2 and Figure 10), NeRF consistently outperformed GS. On average, NeRF reduced RMS and Chamfer errors by 35–40% relative to GS and by >40% relative to SfM. While GS occasionally produced denser point clouds, these did not translate into higher accuracy, reinforcing the notion that point density alone is not a reliable proxy for model quality. Indeed, GS exhibited the highest Hausdorff distances (up to 20% larger than NeRF), signaling a greater frequency of localized outliers and boundary artifacts. These patterns align with previous benchmarks [33], which report that GS favors visual smoothness and real-time rendering speed at the expense of sub-millimeter precision.
Our tests demonstrate that NeRF offers superior geometric fidelity, achieving lower RMS, Chamfer, and symmetric errors across all three artifacts analyzed. This is because NeRF’s continuous volumetric rendering better captures microcurvatures and fine edges, while GS tends to smooth out details to optimize real-time rendering. Furthermore, NeRF enables the extraction of dense, reliable depth maps that feed a coherent point cloud, ideal for comparison against metrological control. In addition, hierarchical ray sampling reduces artifacts in poorly observed areas. Finally, while GS renders more efficiently, NeRF keeps the VRAM footprint within manageable limits (<8 GB in our tests).

4.2. Performance Under Complex Geometries and Reflective Surfaces

One of the most revealing outcomes emerged from the analysis of Fibula 3, an artifact characterized by a more complex morphology and reflective metallic surfaces. Both NeRF and GS showed markedly reduced performance on this object, with increased reconstruction errors and significantly lower point densities (as reflected in Table 3). This decline can be attributed to specular reflections, occlusions, and the overall difficulty of modeling highly detailed or shiny surfaces using passive image-based methods. Reflective artifacts disrupt the feature correspondence mechanisms fundamental to SfM and interfere with radiance field estimation in neural methods, resulting in noise, over-smoothing, or missing geometry.
The deviation maps (Figure 7 and Figure 8) and histogram analyses (Figure 9) confirmed that the major discrepancies in NeRF and GS reconstructions are concentrated in small-scale features, such as edges, decorative engravings, and narrow voids. Although both methods effectively capture the overall shape and volumetric proportions of the fibulae, GS displayed higher spatial deviations across all samples, particularly in high-curvature areas. NeRF, by contrast, maintained a tighter error distribution, with most deviations below 0.3 mm and localized mainly in ornamented zones or occluded surfaces. These findings align with previous research in the cultural heritage domain, where NeRF has demonstrated strong performance in texture reproduction and general geometry, but some difficulty in achieving submillimetric fidelity required for high-precision applications.

4.3. Applicability in Cultural Heritage Documentation

Despite these limitations, NeRF reconstructions offer significant advantages in terms of workflow efficiency, visual realism, and user accessibility. The relatively short acquisition and training times (especially when compared to traditional photogrammetric modeling and dense SfM reconstructions) make NeRF a promising option for preliminary documentation, interactive exhibitions, digital storytelling, and remote research collaboration. Its ability to produce compelling visual outputs with reduced human intervention also makes it highly suitable for public dissemination and educational use.
However, for tasks requiring strict geometric accuracy (such as digital conservation archives, physical replication (e.g., 3D printing), morphometric analyses, or architectural integration), photogrammetry remains the gold standard. SfM still provides superior control over point cloud resolution, alignment, and error traceability, essential in projects demanding traceable metrological validity.
Beyond the quantitative performance evaluation, the present study offers an original contribution by exploring how NeRF systems perform in a particularly underexplored domain: the documentation of small-scale, high-reflectivity heritage artifacts. This niche application area introduces unique challenges that are rarely addressed in NeRF-related literature, including issues related to scale, lighting control, and reflective surface properties. By rigorously testing these technologies under such constraints, we demonstrate that neural rendering can extend its utility beyond synthetic benchmarks and architectural datasets, providing heritage scientists with alternative tools for documenting fragile or visually complex items. These challenges underscore the importance of developing material-aware or photometrically robust NeRF variants to improve accuracy in cultural heritage scenarios involving metal, glass, or polished stone.

4.4. Methodological Scope and Future Research Directions

While other reconstruction techniques such as structured-light scanning or micro-CT imaging are also used for small object digitization, their inclusion was beyond the scope of this study, which aimed to evaluate the performance of NeRF methods against a widely accepted standard (SfM photogrammetry). Nonetheless, future work could expand this comparison to include additional methods, especially those designed for metrological accuracy in micro-scale applications.
It is worth noting that this study did not seek to provide a comprehensive benchmark of all existing 3D reconstruction techniques. Instead, it focuses on evaluating NeRF approaches in contrast with the most widely adopted photogrammetric method (SfM) and a real-time neural variant (GS). Although techniques such as laser scanning or structured-light scanning are also used in heritage digitization, they were excluded from this comparison due to practical constraints and to maintain a clear focus on the performance of neural rendering models. Future research may benefit from including additional methods to further validate and contextualize the comparative performance of AI-based reconstructions.
Looking ahead, the integration of NeRF with hybrid or multimodal approaches presents a promising avenue for enhancing accuracy without sacrificing efficiency. For instance, combining NeRF with SfM-derived sparse geometry may provide stronger geometric priors during training. Adaptive-resolution NeRFs, capable of focusing on high-detail areas, could address the current limitations in fine-feature reproduction. Material-aware rendering models, incorporating reflectance properties or multispectral data, may mitigate the challenges posed by reflective surfaces. Improvements in camera pose estimation and radiance field optimization under complex lighting could further enhance the reliability of neural reconstructions in cultural heritage settings. Moreover, the incorporation of photometric consistency constraints, BRDF-aware shading models, and integrated uncertainty quantification could position future NeRF systems as more robust tools for scientific documentation.
It is important to note that the input data used for the neural methods (NeRF and GS) were derived from mobile video capture, whereas the SfM model was built from a controlled DSLR-based image set. This divergence reflects a deliberate choice to simulate realistic, low-cost documentation conditions using consumer devices. While this introduces variability in the comparison, it also highlights one of the core advantages of neural rendering: its capacity to generate plausible reconstructions from less controlled, lower-quality data sources. Future work will explore direct comparisons using matched datasets from identical cameras to further isolate the effects of algorithmic differences.

4.5. Trade-Offs Between Accuracy, Processing Time, and Usability

While SfM photogrammetry remains the most accurate method for sub-millimeter documentation, it requires a carefully controlled acquisition environment, a large number of well-planned images, and time-intensive processing (including masking, alignment, and manual scaling). In contrast, NeRF and GS reconstructions (particularly when implemented via frameworks like Nerfstudio) require significantly less human intervention and can be deployed using video frames or simple acquisition protocols.
In terms of computational cost, GS required 21 ± 3 min of training and rendered at 28 FPS at 1080p on an 8 GB RTX 3070, while NeRF-Nerfacto required 32 ± 5 min and yielded 12 FPS. While GS improves interactive performance, our results confirm that this improvement comes at the cost of an 18–35% higher RMS error. The choice between the two approaches must therefore weigh rendering speed against the geometric fidelity required by the final application. However, the reduced acquisition and processing effort of the neural methods also comes at the cost of geometric accuracy, as they still struggle with fine-feature preservation and reflective surfaces. In field applications such as emergency documentation, museum digitization, or educational VR content creation, NeRF provides an accessible, fast, and visually rich alternative. For applications requiring reproducible metrological accuracy (e.g., conservation or 3D printing), SfM remains preferable despite the higher processing burden.
These trade-offs underscore the importance of aligning method selection with project goals and constraints, particularly in time-sensitive or resource-limited archaeological and heritage contexts.

4.6. Final Remarks

In summary, this study confirms the potential of NeRF-based approaches to serve as efficient and visually compelling tools in the digitization of archaeological heritage. While not yet a replacement for high-precision photogrammetry in rigorous conservation or analytical contexts, NeRF’s strengths in volumetric fidelity and fast deployment make it an attractive complement to existing 3D documentation pipelines [33]. Continued research and cross-disciplinary development will be essential to unlocking its full potential for use in cultural heritage science.

5. Conclusions

This study has systematically evaluated the performance of Neural Radiance Fields (NeRF) and Gaussian Splatting (GS) in comparison to conventional Structure-from-Motion (SfM) photogrammetry for the 3D reconstruction of small archaeological artifacts. The results clearly demonstrate that NeRF consistently outperforms GS across multiple quantitative metrics, including RMS Error, Chamfer distance, and Symmetric Error, indicating a higher degree of geometric fidelity. However, both neural methods still fall short of the geometric precision achieved by SfM photogrammetry, which remains the most accurate approach for capturing sub-millimeter details essential in cultural heritage conservation. Despite the longer acquisition and processing times associated with photogrammetry, its reliability and metrological robustness continue to make it the gold standard for rigorous documentation.
NeRF, in contrast, offers significant advantages in terms of efficiency and visual quality. It effectively reconstructs the global shape and surface curvature of artifacts while producing visually compelling models suitable for applications in virtual exhibitions, educational tools, and public outreach initiatives. Nevertheless, both NeRF and GS exhibit notable limitations when applied to objects with reflective or highly complex surfaces, such as metallic fibulae, where specular reflections and occlusions negatively impact reconstruction accuracy. These findings highlight the need for continued development of neural rendering techniques, particularly in enhancing their ability to handle challenging materials and detailed features.
While not yet a replacement for photogrammetry in precision-critical contexts, NeRF and GS present promising alternatives for rapid and accessible 3D documentation workflows. Future research should explore hybrid approaches that combine the photometric realism of NeRF with the geometric precision of SfM, potentially through the integration of sparse geometric priors or adaptive-resolution neural models. Additionally, further investigation is warranted to examine how variables such as lighting, camera calibration, and dataset diversity influence reconstruction quality in neural pipelines. Developments in material-aware radiance fields and uncertainty quantification may also help bridge the current accuracy gap.
In summary, although SfM photogrammetry remains the most accurate method for 3D reconstruction in archaeology, NeRF-based techniques are emerging as valuable complementary tools that offer a balance between visual realism, processing speed, and usability, particularly in scenarios where extreme precision is not the primary requirement. Their ongoing evolution holds great potential for enriching the digital preservation and dissemination of cultural heritage.
The originality of this work also stems from its focus on a highly specific yet largely overlooked use case: the neural-based reconstruction of small, metallic archaeological artifacts. This constitutes a meaningful advancement in the field of digital heritage, as it illustrates the feasibility of applying NeRF and GS techniques in scenarios traditionally dominated by photogrammetry or laser scanning. Our findings encourage further exploration into how AI-based methods can be adapted or optimized for heritage contexts that demand both accuracy and accessibility.

Author Contributions

Conceptualization, M.Á.M.-G., R.Y. and J.R.-H.; methodology, M.Á.M.-G., R.Y. and J.R.-H.; software, R.Y.; validation, R.Y., J.R.-H. and E.G.-G.; formal analysis, J.R.-H.; investigation, R.Y. and J.R.-H.; resources, M.Á.M.-G.; data curation, J.R.-H., E.G.-G., J.A.d.M. and R.Y.; writing—original draft preparation, M.Á.M.-G. and R.Y.; writing—review and editing, J.R.-H., E.G.-G. and J.A.d.M.; visualization, R.Y.; supervision, M.Á.M.-G.; project administration, M.Á.M.-G.; funding acquisition, M.Á.M.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been funded by the Ministerio de Ciencia, Innovación y Universidades, Agencia Estatal de Investigación, through grants from the European Union “NextGenerationEU”/PRTR (RF.CNS2023-144126), and by the Junta de Castilla y León (SA080P24).

Data Availability Statement

Data are contained within the article.

Acknowledgments

M.Á.M.-G. acknowledges the grant RYC2021-034813-I funded by MCIN/AEI/10.13039/501100011033 and by European Union “NextGenerationEU”/PRTR. J.R.-H. acknowledges a postdoctoral contract (IIb Program) funded by the University of Salamanca. We would like to thank the Museum of Ávila for providing access to the archaeological artifacts.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Álvarez-Alonso, D.; de Andrés-Herrero, M.; Díez-Herrero, A.; Miralles-Mosquera, S.; Sastre Barrio, M.C.; Maté-González, M.Á.; Nieva Gómez, E.; Díaz Delgado, M.R.; Ruiz Mediavilla, E. More than a fingerprint on a pebble: A pigment-marked object from San Lázaro rock-shelter in the context of Neanderthal symbolic behavior. Archaeol. Anthropol. Sci. 2025, 17, 131. [Google Scholar] [CrossRef]
  2. Álvarez-Sanchís, J.R.; Rodríguez-Hernández, J.; Ruiz Zapatero, G. El askos de Ulaca (Solosancho, Ávila) y el simbolismo del toro entre los vettones. Trab. De Prehist. 2021, 78, 356–365. [Google Scholar] [CrossRef]
  3. Baquedano, E.; Arsuaga, J.L.; Pérez-González, A.; Laplana, C.; Márquez, B.; Huguet, R.; Higham, T. A symbolic Neanderthal accumulation of large herbivore crania. Nat. Hum. Behav. 2023, 7, 342–352. [Google Scholar] [CrossRef] [PubMed]
  4. Estaca-Gómez, V.; Liceras-Garrido, R.; Quintero-Cabello, S.A.; Jimeno, A. Zooarchaeology of Celtiberian-Roman households: Animal use in Numantia (Soria, Spain). J. Archaeol. Sci. Rep. 2025, 62, 105021. [Google Scholar] [CrossRef]
  5. Rodríguez-Hidalgo, A.; Morales, J.I.; Cebrià, A.; Courtenay, L.A.; Fernández-Marchena, J.L.; García-Argudo, G.; Fullola, J.M. The Châtelperronian Neanderthals of Cova Foradada (Calafell, Spain) used imperial eagle phalanges for symbolic purposes. Sci. Adv. 2019, 5, eaax1984. [Google Scholar] [CrossRef] [PubMed]
  6. García Sanjuán, L.; Ramírez-Cruzado, S.; Díaz-Guardamino, M.; Lozano Rodríguez, J.A.; Donaire Romero, T.; Afonso Vargas, J.A.; Muñiz Guinea, F. A multi-analytical study of the Montelirio beaded attires: Marine resources, sumptuary crafts, and female power in copper age Iberia. Sci. Adv. 2025, 11, eadp1917. [Google Scholar] [CrossRef] [PubMed]
  7. Matesanz Gascón, R. Simbolismo calendárico en la iconografía del equipo metálico de tipo Monte Bernorio de la tumba 32 de la necrópolis de Las Ruedas (Padilla de Duero, Peñafiel, Valladolid). Vaccea Anu. 2022, 15, 71–83. [Google Scholar] [CrossRef]
  8. Childs, S.T.; Warner, M.S. (Eds.) Using and Curating Archaeological Collections; University Press of Colorado: Louisville, CO, USA, 2020. [Google Scholar]
  9. Geweely, N.S. New frontiers review of some recent conservation techniques of organic and inorganic archaeological artefacts against microbial deterioration. Front. Microbiol. 2023, 14, 1146582. [Google Scholar] [CrossRef] [PubMed]
  10. Smith, M.H. A Necessary Duty, a Hideous Fault: Digital Technology and the Ethics of Archaeological Conservation. Ph.D. Thesis, Texas A&M University, College Station, TX, USA, 2010. Available online: https://oaktrust.library.tamu.edu/items/a83f91d0-8fb3-4299-bcc5-319bf4dcad6a (accessed on 1 January 2025).
  11. Aramendi, J.; Maté-González, M.Á.; Yravedra, J.; Ortega, M.C.; Arriaza, M.C.; González-Aguilera, D.; Domínguez-Rodrigo, M. Discerning carnivore agency through the three-dimensional study of tooth pits: Revisiting crocodile feeding behaviour at FLK-Zinj and FLK NN3 (Olduvai Gorge, Tanzania). Palaeogeogr. Palaeoclimatol. Palaeoecol. 2017, 488, 93–102. [Google Scholar] [CrossRef]
  12. Clini, P.; Frapiccini, N.; Mengoni, M.; Nespeca, R.; Ruggeri, L. SFM Technique and Focus Stacking for Digital Documentation of Archaeological Artifacts. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 2016, XLI-B5, 229–236. [Google Scholar] [CrossRef]
13. Grosman, L.; Muller, A.; Dag, I.; Goldgeier, H.; Harush, O.; Herzlinger, G.; Dick, N. Artifact3-D: New software for accurate, objective and efficient 3D analysis and documentation of archaeological artifacts. PLoS ONE 2022, 17, e0268401.
14. Kantaros, A.; Douros, P.; Soulis, E.; Brachos, K.; Ganetsos, T.; Peppa, E.; Manta, E.; Alysandratou, E. 3D Imaging and Additive Manufacturing for Original Artifact Preservation Purposes: A Case Study from the Archaeological Museum of Alexandroupolis. Heritage 2025, 8, 80.
15. Santos, P.; Ritz, M.; Fuhrmann, C.; Fellner, D. 3D mass digitization: A milestone for archeological documentation. Virtual Archaeol. Rev. 2017, 8, 1–11.
16. Sapirstein, P. A high-precision photogrammetric recording system for small artifacts. J. Cult. Herit. 2018, 31, 33–45.
17. Amico, F.; Ronzino, P.; Vassallo, V.; Miltiadous, N.; Hermon, S.; Niccolucci, F. Theorizing authenticity–practising reality: The 3D replica of the Kazaphani boat. In Authenticity and Cultural Heritage in the Age of 3D Digital Reproductions; Di Giuseppantonio Di Franco, P., Galeazzi, F., Vassallo, V., Eds.; McDonald Institute for Archaeological Research: Cambridge, UK, 2018; pp. 111–122.
18. Counts, D.B.; Averett, E.W.; Garstki, K.; Toumazou, M.K. Visualizing Votive Practice: Exploring Limestone and Terracotta Sculpture from Athienou-Malloura through 3D Models; The Digital Press at the University of North Dakota: Grand Forks, ND, USA, 2020.
19. Papadopoulos, C.; Hamilakis, Y.; Kyparissi-Apostolika, N.; Díaz-Guardamino, M. Digital sensoriality: The Neolithic figurines from Koutroulou Magoula, Greece. Camb. Archaeol. J. 2019, 29, 625–652.
20. Remondino, F. Heritage Recording and 3D Modeling with Photogrammetry and 3D Scanning. Remote Sens. 2011, 3, 1104–1138.
21. Stanco, F.; Tanasi, D. Experiencing the Past: Computer Graphics in Archaeology. In Digital Imaging for Cultural Heritage Preservation: Analysis, Restoration, and Reconstruction of Ancient Artworks; Stanco, F., Battiato, S., Gallo, G., Eds.; CRC Press: Boca Raton, FL, USA, 2011; pp. 1–36.
22. Schönberger, J.L.; Frahm, J.M. Structure-from-Motion Revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016.
23. Guidi, G.; Micoli, L.; Russo, M.; Angheleddu, D. Optimization of automatic image-based 3D reconstruction of cultural heritage artifacts. Remote Sens. 2014, 6, 7324–7345.
24. Kaneda, A.; Nakagawa, T.; Tamura, K.; Noshita, K.; Nakao, H. A proposal of a new automated method for SfM/MVS 3D reconstruction through comparisons of 3D data by SfM/MVS and handheld laser scanners. PLoS ONE 2022, 17, e0270660.
25. Lamanna, C. Evaluation of SfM and LiDAR technology for mapping inscribed artifacts: Preliminary observations starting from the research project ‘Ravenna and Its Late Antique and Medieval Epigraphic Landscape’. In Linking Pasts and Sharing Knowledge: Mapping Archaeological Heritage, Legacy Data Integration and Web Technologies for Modelling Historical Landscapes (Archeologia e Calcolatori), Naples, Italy, 13–14 November 2023; Brancato, R., Bogdani, J., Vitale, V., Eds.; University of Bologna: Bologna, Italy, 2024; pp. 411–420. Available online: https://www.archcalc.cnr.it/journal/articles/1363 (accessed on 1 January 2025).
26. Mildenhall, B.; Srinivasan, P.P.; Tancik, M.; Barron, J.T.; Ramamoorthi, R.; Ng, R. NeRF: Representing scenes as neural radiance fields for view synthesis. Commun. ACM 2022, 65, 99–106.
27. Gao, K.; Gao, Y.; He, H.; Lu, D.; Xu, L.; Li, J. NeRF: Neural Radiance Field in 3D Vision, a Comprehensive Review. arXiv 2022, arXiv:2210.00379.
28. Croce, V.; Billi, D.; Caroti, G.; Piemonte, A.; De Luca, L.; Veron, P. Comparative Assessment of Neural Radiance Fields and Photogrammetry in Digital Heritage: Impact of Varying Image Conditions on 3D Reconstruction. Remote Sens. 2024, 16, 301.
29. Mazzacca, G.; Karami, A.; Rigon, S.; Farella, E.M.; Trybala, P.; Remondino, F. NeRF for Heritage 3D Reconstruction. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, XLVIII-M-2-2023, 1051–1058.
30. Remondino, F.; Karami, A.; Yan, Z.; Mazzacca, G.; Rigon, S.; Qin, R. A critical analysis of NeRF-based 3D reconstruction. Remote Sens. 2023, 15, 3585.
31. Li, S.; Li, C.; Zhu, W.; Yu, B.; Zhao, Y.; Wan, C.; Lin, Y. Instant-3D: Instant neural radiance field training towards on-device AR/VR 3D reconstruction. In Proceedings of the 50th Annual International Symposium on Computer Architecture, Orlando, FL, USA, 17–21 June 2023.
32. Tancik, M.; Weber, E.; Ng, E.; Li, R.; Yi, B.; Wang, T.; Kanazawa, A. Nerfstudio: A modular framework for neural radiance field development. In Proceedings of ACM SIGGRAPH, Los Angeles, CA, USA, 6–10 August 2023.
33. Kerbl, B.; Kopanas, G.; Leimkühler, T.; Drettakis, G. 3D Gaussian Splatting for Real-Time Radiance Field Rendering. arXiv 2023, arXiv:2308.04079.
34. Álvarez-Sanchís, J.R. The Iron Age in Western Spain (800 BC–AD 50): An Overview. Oxf. J. Archaeol. 2000, 19, 65–89.
35. Almagro-Gorbea, M.; Torres, M. Las Fíbulas de Jinete y Caballito: Aproximación a las Élites Ecuestres y su Expansión en la Hispania Céltica; Institución Fernando el Católico: Zaragoza, Spain, 1999.
36. Wells, P.S. How Ancient Europeans Saw the World: Visions, Patterns and the Shaping of the Mind in Prehistoric Times; Princeton University Press: Princeton, NJ, USA, 2012.
37. Arnold, B.; Hagmann, S. Fibulae and dress in Iron Age Europe. In Encyclopedia of Dress and Fashion; New York, NY, USA, 2015. Available online: http://www.bergfashionlibrary.com/view/bewdf/BEWDFv8/EDch81013.xml (accessed on 10 November 2021).
38. Adams, S. Personal objects and personal identity in the Iron Age: The case of the earliest brooches. In Dress and Society: Contributions from Archaeology; Martin, T.F., Weetch, R., Eds.; Oxbow Books: Oxford, UK, 2017; pp. 48–68.
39. Camacho, P. Las Fíbulas de la Vettonia: Adorno Personal e Identidades en la Edad del Hierro; Universidad de Alicante: Alicante, Spain, 2020.
40. Pereira, J.; Chapa Brunet, T.; Montero Ruiz, I.; Rovira Llorens, S.; Charro Lobato, C.; Rodero Riaza, A.; Cabrera Díez, A. Las fíbulas de caballito y jinete “tipo Castellares”: Un símbolo compartido por los jefes de caballería de los pueblos prerromanos de la meseta. Trab. Prehist. 2023, 80, e07.
41. Fabián García, J.F. Guía de la Ruta de los Castros Vettones de Ávila y su Entorno; Institución Gran Duque de Alba: Ávila, Spain, 2009.
42. Cabré Aguiló, J. Excavaciones en Las Cogotas, Cardeñosa (Ávila). I. El Castro; Junta Superior de Excavaciones y Antigüedades: Madrid, Spain, 1930.
43. Almagro-Gorbea, M.; Mariné Isidro, M.; Álvarez-Sanchís, J.R. Celtas y Vettones; Diputación Provincial: Ávila, Spain, 2004.
44. Mariné Isidro, M. Fíbulas de caballito, Las Cogotas. In Cien Piezas del Museo de Ávila; Junta de Castilla y León: Ávila, Spain, 2011.
45. Camacho Rodríguez, P. Las fíbulas del Castro de Las Cogotas (Cardeñosa, Ávila). BSAA Arqueol. 2017, LXXXIII, 123–156.
46. Álvarez-Sanchís, J.R. Vettones: Pastores y Guerreros en la Edad del Hierro; Museo Arqueológico Regional: Alcalá de Henares, Spain, 2008.
47. Berger, M.; Tagliasacchi, A.; Seversky, L.; Alliez, P.; Levine, J.A.; Sharf, A.; Silva, C.T. A survey of surface reconstruction from point clouds. Comput. Graph. Forum 2017, 36, 301–329.
48. Besl, P.J.; McKay, N.D. A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 1992, 14, 239–256.
Figure 1. Workflow of the methodology applied for 3D reconstruction and evaluation.
Figure 2. Brooches selected for this study and preserved in the Museum of Ávila (Spain). (a) Fibula 1; (b) Fibula 2; (c) Fibula 3.
Figure 3. Articulated-arm coordinate measuring machine (Hexagon Metrology Absolute Arm 7325SI) employed for ground-truth acquisition.
Figure 4. Distribution of control points in each of the brooches. (a) Fibula 1; (b) Fibula 2; (c) Fibula 3.
Figure 5. Image acquisition setup used for the photogrammetric techniques.
Figure 6. Multi-view camera positioning and volume sampling strategy in NeRF training. Each image frame is associated with rays projected into the 3D scene, along which radiance values are estimated.
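To make the compositing step summarized in Figure 6 concrete, the following minimal NumPy sketch renders a single ray with the standard NeRF volume-rendering quadrature. It is an illustration only, not code from the study's pipeline: `radiance_field` is a hypothetical stand-in for the trained network, and the near/far bounds and sample count are arbitrary.

```python
import numpy as np

def render_ray(origin, direction, radiance_field, near=0.1, far=2.0, n_samples=64):
    """Composite the color of one camera ray via NeRF's volume-rendering quadrature.

    `radiance_field` is a hypothetical stand-in for the trained MLP: it maps
    sample positions (n, 3) and a view direction (3,) to per-sample colors
    rgb (n, 3) in [0, 1] and non-negative densities sigma (n,).
    """
    # Sample depths between the near and far planes (stratification omitted).
    t = np.linspace(near, far, n_samples)
    points = origin + t[:, None] * direction              # (n_samples, 3)

    rgb, sigma = radiance_field(points, direction)

    # Spacing between adjacent samples; the last interval reuses the mean spacing.
    delta = np.diff(t, append=t[-1] + (far - near) / n_samples)

    alpha = 1.0 - np.exp(-sigma * delta)                  # per-sample opacity
    # Transmittance: probability the ray travels to each sample unoccluded.
    trans = np.cumprod(np.concatenate(([1.0], 1.0 - alpha[:-1])))
    weights = trans * alpha                               # compositing weights

    color = (weights[:, None] * rgb).sum(axis=0)          # expected pixel color
    depth = (weights * t).sum()                           # expected ray termination
    return color, depth
```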
Figure 7. Deviation map showing differences between the photogrammetric point cloud and the NeRF reconstruction (in millimeters). Blue regions correspond to minimal deviations, while red regions indicate areas of higher error. (a) Fibula 1; (b) Fibula 2; (c) Fibula 3.
Figure 8. Deviation map showing differences between the photogrammetric point cloud and the GS-based reconstruction (in millimeters). As with Figure 7, blue regions denote low error, and red regions represent greater deviations. (a) Fibula 1; (b) Fibula 2; (c) Fibula 3.
Figure 9. Error distributions for the NeRF and GS reconstructions of the three brooches, illustrating that NeRF consistently produces tighter error distributions, especially in artifacts with complex geometries (e.g., Fibula 3), as discussed in Section 4.2.
Figure 10. Comparison of NeRF and GS after normalizing the five key error metrics across all three brooches. NeRF encloses a consistently larger area, confirming its 15–50% advantage in geometric fidelity, particularly on metrics that penalize outliers such as Hausdorff distance and Symmetric Error.
Table 1. Specifications of the Canon EOS 700D camera used for photogrammetric acquisition.

| Specification | Value |
|---|---|
| Sensor Type | CMOS |
| Sensor Size | 22.3 × 14.9 mm |
| Pixel Size | 4.3 µm |
| Image Resolution | 5184 × 3456 pixels |
| Effective Pixels | 18.0 MP |
| Focal Length | 60 mm (macro lens) |
| Focus Distance | 100–120 mm |
Table 2. Comparison of metrics between NeRF and GS for the three brooch samples, where an up arrow (↑) indicates greater alignment overlap, while a down arrow (↓) indicates less error, better accuracy, or lower deviation.

| Metric | Fibula 1 NeRF | Fibula 1 GS | Fibula 2 NeRF | Fibula 2 GS | Fibula 3 NeRF | Fibula 3 GS |
|---|---|---|---|---|---|---|
| ICP–RMSE (↓) | 0.3533 | 0.6644 | 0.3542 | 0.6562 | 1.3127 | 1.9147 |
| RMS Error (↓) | 0.7422 | 0.7309 | 0.3861 | 0.7244 | 1.0224 | 2.6065 |
| Hausdorff Distance (↓) | 2.9873 | 2.9648 | 1.6023 | 2.1987 | 11.1304 | 14.0759 |
| Chamfer Distance (↓) | 0.8158 | 1.1372 | 0.5999 | 1.1420 | 1.7289 | 3.7459 |
| Mean Absolute Error (↓) | 0.5421 | 0.6090 | 0.3269 | 0.6259 | 0.7539 | 2.2358 |
| Symmetric Error (↓) | 0.4079 | 0.5686 | 0.2999 | 0.5710 | 0.8645 | 1.8729 |
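For reference, the distance metrics reported in Table 2 admit straightforward definitions in terms of nearest-neighbor distances between the reconstructed and ground-truth clouds. The sketch below is a minimal SciPy-based illustration of one consistent set of definitions (under which the Symmetric Error is exactly half the Chamfer distance, as the values above reflect); it is not the study's exact implementation, and it assumes the clouds have already been aligned, e.g., with ICP [48].

```python
import numpy as np
from scipy.spatial import cKDTree

def cloud_metrics(recon, gt):
    """Nearest-neighbor error metrics between aligned point clouds.

    recon: (N, 3) reconstructed cloud; gt: (M, 3) ground-truth cloud.
    A minimal illustration of the Table 2 metrics; the study's exact
    definitions (e.g., of the Chamfer distance) may differ.
    """
    d_rg = cKDTree(gt).query(recon)[0]    # reconstruction -> ground truth
    d_gr = cKDTree(recon).query(gt)[0]    # ground truth -> reconstruction

    return {
        "rms":       float(np.sqrt(np.mean(d_rg ** 2))),
        "hausdorff": float(max(d_rg.max(), d_gr.max())),
        "chamfer":   float(d_rg.mean() + d_gr.mean()),
        "mae":       float(d_rg.mean()),
        "symmetric": float(0.5 * (d_rg.mean() + d_gr.mean())),
    }
```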
Table 3. Point cloud densities (points per cubic millimeter) for NeRF, GS, and ground-truth models.

| Model | Fibula 1 | Fibula 2 | Fibula 3 |
|---|---|---|---|
| NeRF | 1.542 | 1.3362 | 0.0623 |
| GS | 0.4561 | 0.479 | 0.0020 |
| GT | 19.5702 | 16.2367 | 0.7225 |
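The densities in Table 3 follow from dividing each cloud's point count by an estimate of its occupied volume. The sketch below shows one plausible computation, using the convex-hull volume as that estimate; this is an assumption for illustration, and the study's exact procedure (e.g., a bounding-box or voxel-based volume) may differ.

```python
import numpy as np
from scipy.spatial import ConvexHull

def point_density(points_mm):
    """Approximate point cloud density in points per cubic millimeter.

    points_mm: (N, 3) array of coordinates in millimeters. The convex-hull
    volume is used as the occupied volume, which is an illustrative choice;
    a bounding box or voxel occupancy would yield different absolute values.
    """
    hull = ConvexHull(np.asarray(points_mm))
    return len(points_mm) / hull.volume
```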