According to the above analysis, the far-field SR-CLSM via the longitudinal nano-optical field is suitable for exploring the morphology and/or physical properties of a sample with a feature size below the diffraction limit. It also offers a large longitudinal detection range at a certain level, , during a single lateral scanning process across the tightly focused RP beam. With the far-field excitation and detection method, a large workspace is provided. Under no special treatment (such as fluorescence staining), in this Super-Resolution (SR) scheme, sample detection relies solely on the intrinsic response of the sample to the incident excitation light field.. To establish a general imaging theory, a DDA for nanostructures in the sample system is utilized so that its whole response can be regarded as the superposition of all induced dipoles. Since the analytical expressions of the excitation field and the detection Green’s function tensor for this SR system have been derived in detail, the exact analytical description of light–matter interaction is vital, as it relates the excitation and detection processes and contains the sample information.
However, when the SR scheme is used to explore a strong interaction sample system, such as a plasmonic system composed of an extended metal structure and/or several densely distributed metal nanoparticles, the dipole–dipole interaction within the sample system may show that they have a strong influence on each other and even on the final image pattern. In order to make the imaging principle better understood, the imaging simulation for lattice arrays composed of ideal dipole sources with identical or modulated response intensity will be first simulated using a weak interaction CLSM imaging theory. Specifically, the weak interaction model refers to systems such as isolated nanoparticle arrays, where the nanoparticle spacing exceeds the constraint condition for dipole spacing (d) in the DDA model, while the nanoparticle size itself is smaller than the constraint condition for d. In such weak interaction systems, the incident field can be approximated as the driving field, and the influence of the scattered field on other dipoles can be neglected. Then, a general theory will be presented and demonstrated on several strong interaction models referring to bulk nanostructures whose dimensions exceed the constraint condition for d, such as the nanodisks and nanotriangles in the paper. These are discretized into small units on the scale of a few nanometers, with each unit approximated as a dipole located at its center.
3.1. Weak Interaction Model
In the LEDM-based far-field SR-CLSM, the detected electric field at the pinhole position for any ZP dipole moment,
, at a given position,
, can be expressed as
[
25]. In a weak interaction system, the dipole moment is expressed as
, where
is the effective polarizability tensor,
is the longitudinal component of the focused RP field, and
is the Green’s function for the ZP dipole, which only has an x-polarized component.
is the vacuum permeability, and
is the angular frequency of the incident optical wave.
For isotropic material, its polarizability tensor will degrade into a scalar quantity, . Therefore, the detected signal for each scanning position is proportional to the detected field intensity, , where denotes the infinitesimal detector area. In CLSM, the total PSF is defined as the image of a dipole source that scans through the excitation field. And it can be approximately calculated by . In this SR scheme, if the laser source is supposed as a point source, then . As a result, the image signal of a weak interaction system for a certain wavelength excitation can be approximately expressed as , where is the effective polarizability of each dipole and represents the number of ideal dipole sources. On the basis of this weak interaction CLSM imaging theory, the images of both an identical dipole lattice array and a modulated dipole lattice structure will be calculated. For a certain excitation wavelength, an isolated dipole’s responding intensity is intrinsically represented by its effective polarizability.
As shown in
Figure 2a, five lattice arrays are arranged with different gaps, where each dipole is assumed to be polarized along the z-direction and to have the same polarization intensity. In the simulation, the polarizability is set as a unit: one. In practice, it can be replaced by any practical value.
Figure 2b demonstrates the simulated CLSM image for such artificially set dipole arrays. It can be seen that besides the dipoles in the fourth and fifth arrays, all the dipoles in the other arrays can be distinguished from each other. However, with the decreasing gap distance, the signal contrast defined by
drops dramatically from
to
, as shown in
Figure 2c, when the gap shrinks from
to
beyond the resolving limit. For the array that has a gap period below the resolving power of the SR—CLSM, such as
or
, its imaging pattern looks like a rod, from which the spatially separated dipoles cannot be reversely guessed. At the same time, the whole image intensity is increased by
(from 1 to 1.4 for a gap distance of
) and
(from 1 to 2 for a gap distance of
), whereas the averaged signal contrast is just at the level of
to
. From the simulated results, we can see that the diffraction-induced broadening of the dipole image spot and the overlaying among dipole image fields both give rise to the synthetic performance on the final CLSM image.
After understanding the relationship between the gap distance and the final image information, the influence of the dipole’s response intensity on the final image information will also be explored through simulation, as shown in
Figure 3. The response intensity of dipoles in each row is alternately modulated by a factor,
, which is defined as the ratio of the response intensity of the even-numbered dipoles to that of the odd-numbered dipoles. Actually, an ideal dipole source does not exist but is proposed for analyzing physical problems. For a practical nanoparticle, it can be approximated as a single dipole or a series of dipoles. The response intensity of these dipoles is generally characterized by the effective polarizability, which is related to its own material composition, geometrical structure, and surroundings.
In
Figure 3a–d, each dipole array can be regarded as composed of two sets of identical dipole arrays (a strong dipole array and a weak dipole array) that have twice the period of the original one shown in
Figure 2a. From the simulated LSCM imaging results shown in
Figure 3e,f, it can be seen that the dipoles in the strong dipole set can be distinguished from each other, while those in the weak set cannot be observed. If compared with the image result of a homogeneous dipole array with gaps of
and
, the signal from the weak dipoles in each array contributes a considerable background field to the final image, which may reduce the relative signal contrast. From the simulated results, it can be concluded that the dipole’s responding intensity can also influence the final CLSM imaging. The weak responding dipoles cannot be directly perceived through the image pattern.
From the simulated results for the weak interaction sample system, it is shown that the final image is a synthetic result of all parameters in the sample system, including the geometrical pattern and material components. In the next part, a general CLSM imaging theory will be presented and applied to a practical nanostructure with an extended geometrical size.
3.2. General Imaging Model
When the longitudinal nano-optical field excitation and detection-based SR approach is used to explore the extended nanostructure or densely distributed nanoparticles with a feature size below the diffraction limit in terms of their geometrical shape and/or physical property pattern, the dipole–dipole interaction within the sample system should be considered for the diffraction-induced broadened excitation and detection. If the strong interaction sample system is still analyzed using the weak interaction CLSM imaging theory, this may lead to a considerable discrepancy between the simulated image results and the experimental ones. Although the image of any sample can be directly obtained through the experiment, the nanoimaging system may still function like a black box without a complete theoretical analysis.
Taking the imaging of a nanodisk as an example, as shown in
Figure 4a, the complex light–matter interaction can be analytically expressed by the volume integral equation [
25,
26] (also called the Lippmann–Schwinger equation):
where
,
, and
respectively denote the position vector, the relative dielectric function inside the nanostructure, and the wavenumber in the vacuum. The integral should be taken over the entire volume of the nanostructure,
. And
represents the free-space Green’s function that characterizes the field at any position,
, generated by an induced dipole at
.
is the incident tightly focused RP beam.
denotes the actual field when the nanodisk is present in the excitation field area, and
denotes any position vector in the object space.
Instead of directly solving the interaction Equation (1), which will become a complex equation set if the considered nanostructure is divided into a series of discrete dipoles, a commercial finite-difference time-domain (FDTD) software (Ansys Lumerical 2020 R2 Finite Difference IDE) system is utilized to perform the light–matter interaction. In terms of mesh accuracy, the computational domain employs a non-uniform grid division, with local grid refinement applied to the nanostructures and near-field regions, setting the minimum grid size to Δx = Δy = Δz = 2 nm to ensure accurate resolution of near-field interactions and evanescent waves. Regarding boundary conditions, perfectly matched layer (PML) boundaries are applied in the x-, y-, and z-directions to minimize non-physical reflections. For the light-source setup, a custom source mode is used, generating a nanoscale vector light-field consistent with
Figure 1b through programming for excitation, with an excitation wavelength of 405 nm. In constructing the material model, the optical properties (dielectric constants) of all materials are based on experimental data or reliable literature values and defined using dispersion models such as the Drude–Lorentz model. For the determination of the simulation domain and runtime, the computational domain size is set to fully encompass the near-field interaction region. The simulation time is sufficiently tested to ensure that the electromagnetic fields reach a steady state. In the arrangement of near-field monitors, field monitors are placed near the tip of the optical needle and the sample surface to accurately extract the electric field enhancement distribution and near-field–sample interaction information. The longitudinal (upper) and transverse (lower) cross-sections of the actual field intensity distribution are displayed in
Figure 4b for the RP (first column), ZP (second column), and total field (third column) components. From the resulting field distribution, it can be seen that the incident excitation field is hardly coupled deep into the inner part of the Au nanodisk. However, the field above the nanodisk shows a more than two-fold enhancement compared to the original incident field.
Combining the scattered field distribution outside the nanodisk, as demonstrated in
Figure 5a, which is acquired by removing the incident excitation field,
, from the actual local field,
, as
, we can conclude that the scattered field intensity has nearly reached the level of the incident field in the near area of the Au nanodisk. However, it shows a rapid decline from the near field to the far field in the whole space. Although the excited nanodisk scatters into the whole space, it exhibits an inhomogeneous distribution in different directions, where the backward-scattered field appears to be slightly stronger than the forward-scattered field.
However, in the back-scattering far-field CLSM, only the backward-scattered far field can be partially collected by the detection system. Using the near-to-far field (NTFF) method for the full field on a monitor plane located at a distance beyond half a wavelength from the nanostructure in FDTD simulation, or directly through the volume integral expression, the scattered or radiation field at any position in the object space for the entire excited nanostructure can be written as
where
should adopt its far-field form in the far-field area and the near-field form in the near-field area. After being partially collected by the objective lens, the scattered field from the simultaneously excited parts of the sample will subsequently pass through a series of optical modulation devices arranged in the detection system.
As the image field for a free-space-induced vector dipole at focus point
has been studied in detail, where it was defined as the detection Green’s function, and in addition to the reasonable assumption of linear translation invariance in the vicinity of focus (several micrometers is reasonable for the working distance of the objective, which is usually beyond one-hand micrometers), the single-shot image field of the actual longitudinal field-induced dipoles in the nanostructure can be calculated by
The single-point response image field represents the single-frame information obtained at a specific scanning position, serving as the fundamental data unit for synthesizing the final scanning image. During the evolution from the scattered field,
, in object space to the detected field,
, in image space, by combining the longitudinal excitation and detection mechanism in this far-field SR-CLSM, the
in Equation (2) is replaced by its longitudinal component,
, and the free-space Green’s function tensor,
, in Equation (2) is replaced by the detected field component on the confocal image plane for the ZP dipole,
, which only possesses the x-polarized component, as shown in
Figure 1c (the third column).
Equation (3) indicates that the single-shot image field for a sample can be regarded as the convolution operation between the detected Green’s function and the local-field-induced dipoles inside the sample. As demonstrated in
Figure 5b, it is obvious that the induced field at each transverse cross-section inside the nanodisk has a completely different distribution, which depends on the geometrical shape, material elements, and its surroundings, besides the property of the incident excitation field. Moreover, the induced field on the upper surface of the Au nanodisk is much larger than that on other lateral cross-sections. From this, it can be easily predicted in theory that the scattered field above the nanodisk may be stronger than that below the nanodisk, as the oscillation amplitude of the dipole is directly proportional to the actual driving field at the dipole position.
The single-shot image field on the confocal image plane is calculated and demonstrated in
Figure 5c. If the full width at half maximum (FWHM) of the central image spot is used to characterize the diameter of the golden nanodisk, there is an evaluation error of
because the FWHM is about
, but the diameter of the nanodisk is about
. In confocal laser scanning microscopy (CLSM), a pinhole filter is applied to the single-shot image, and the filtered signal is registered as pixel information. A complete CLSM image is reconstructed by a series of continuously and temporally detected signals at the pinhole, while the sample is raster-scanned through the excitation field. However, it is time-consuming to obtain the CLSM reconstructed image in theory in the same way as in the experiment by point-to-point scanning excitation and detection.
In theory, through a series of derivations, the induced field inside the nanostructure can be directly linked to the incident excitation field via its physical property of local density of optical states (LDOS) [
30,
34] as
Combining Equations (3) and (4), we can see that the single-shot image field is related to the following parameters: the material components of the nanostructure and its surroundings, which are characterized by the relative dielectric function ; the incident focusing field that is polarized along the z-direction when the sample is absent (for point-source illumination, its intensity can be denoted as ); the effective detection field distribution, , whose intensity is denoted as ; and the physical property, LDOS, in the nanostructure.
By systematically traversing the light needle positions and acquiring the single-frame imaging field corresponding to each position, a complete two-dimensional CLSM image is ultimately synthesized. When the sample experiences a whole displacement vector,
, at the
scanning to its original position, the corresponding pixel signal can be derived as
It should be pointed out that denotes the position vector of any position inside the nanodisk in terms of the local Cartesian coordinate system that is set at any reference point inside the nanodisk, such as its central position . represents the position vector of the reference point, , in the Cartesian coordinate system that is set at the focus of the objective lens. If the sample is transversely scanned through the incident excitation field near the focus, the final constructed CLSM image can be regarded as the convolution between the total PSF and the product of the relative dielectric function and LDOS.
As can be seen in
Figure 6a, the local density of states (LDOS) on the upper surface and in several nearby cross-sections, such as at
, is overall stronger than that in other cross-sections. Nevertheless, almost all the LDOSs show stronger intensity near the edge area than in the inner parts. The picture at the bottom right of
Figure 6a represents the averaged LDOS distribution in the projected transverse plane along the whole thickness direction. The confocal laser scanning microscopy (CLSM) image of the Au nanodisk calculated according to Equation (5) is demonstrated in
Figure 6b. Compared with the single-shot image field displayed in
Figure 5b, the CLSM image shows better imaging properties, including a higher resolution and lower background. The FWHM of the image spot is evaluated as
, which only has an error of
with respect to the diameter of the nanodisk. However, the LDOS pattern cannot be resolved from the final CLSM image due to the limited resolving power of the microscopy system.
To further resolve the LDOS pattern, not just its projected geometrical shape, a simulation on a nanoprism with an assumed higher resolving power was performed, as shown in
Figure 7. In the back-scattering CLSM system that adopts the conventional PC and AA-based VFM, the resolving power is a little better than that
with the excitation wavelength of
, and all the previous discussion is based on this. However, as mentioned in the
Section 1, by adopting a powerful metasurface design and manufacturing method that can provide arbitrary manipulation of the light field as required, a better resolving power can be obtained in the near future, though the research on versatile metasurfaces is not within the scope of this paper. The assumed better resolving powers, such as
and
, also evaluated by the total PSF of a free-space dipole, are utilized in the imaging simulation for the same excitation wavelength of
and the same LEDM. From the simulated CLSM image, it can be seen that all the images in
Figure 7b–d appear as a triangle pattern but with different spreads from their geometrical boundaries. And the main features of the LDOS distribution can be distinguished for the case with a resolving power of
. It can be speculated that more details can be observed in its CLSM image if a higher-resolution scheme is used.
As the general CLSM imaging theory has been presented and demonstrated through a simulation for a nanodisk whose lateral size is comparable to the excitation spot, the geometrical shape can be preliminarily distinguished from its image pattern. However, the more sophisticated pattern of the LDOS distribution cannot be recognized unless the appropriate resolving power is provided, as demonstrated for a nanoprism in
Figure 7. In the following section, it will be applied to a nanostructure composed of two smaller nanoparticles with different gaps. From the simulated results, as shown in
Figure 8, the two nano-objects cannot be distinguished from each other until the edge-to-edge gap is beyond the resolving power of the system according to the Rayleigh criterion. However, the imaging pattern is also related to the actual geometrical pattern, as in
Figure 8(a3,a4), where the two image spots begin to separate from each other when the gap is
, and they are separated far beyond the Rayleigh criterion when the gap is beyond the resolving power. In actuality, this system is also applicable to the characterization of dielectric materials, such as silicon, titanium dioxide, and polymer nanoparticles. Naturally, when contrasted with metallic nanostructures, discrepancies may be present in signal intensity and contrast.