1. Introduction
Over the past few decades, significant advancements have been made to overcome the spatial resolution limitations of conventional optical microscopy in order to probe nano-scale materials. The earliest example is near field scanning optical microscopy [
1], which helped shift the interest of the microscopy community towards superresolution techniques. The most prominent of these techniques can be divided into two categories: localization microscopy and excitation engineering microscopy. Photoactivated localization microscopy (PALM) [
2] and stochastic reconstruction microscopy (STORM) [
3] are examples of localization microscopy. Excitation engineering microscopy includes techniques such as stimulated emission depletion microscopy (STED) [
4] and structured illumination microscopy (SIM) [
5,
6]. While each of these techniques have provided important insight into materials, they each also have limitations. Most notably, PALM, STORM, and STED require fluorescent samples, although STED-like approaches have been theoretically proposed for coherent Raman scattering [
7,
8,
9].
Excitation engineering superresolution techniques often rely on applying an amplitude or phase mask to the excitation beam. One example is a Toraldo-style phase mask [
10]. In this case a series of concentric phase steps are applied to the excitation beam, which results in a focus with a narrowed centroid at the expense of increasing energy in the side lobes. The simplest example of this type of mask is a single
phase step, where the diameter of the
phase step determines the width of the centroid. While this type of phase mask has been extensively explored theoretically in the paraxial limit [
11,
12,
13,
14,
15], investigating full theoretical foundation for strongly focused non-paraxial beams has been limited [
16,
17]. In addition, the substantial side lobes in the focus generated by the phase mask have limited the experimental realizations for superresolution imaging. The side lobe can be suppressed by exploiting the multiplicative nature of microscopy and has been demonstrated using two methods. The first utilizes multi-beam nonlinear processes such as four wave-mixing (FWM) and coherent anti-Stokes Raman scattering (CARS). A Toraldo-style phase mask is applied to one excitation beam and side lobes are removed by ensuring that the second beam only overlaps with the narrowed centroid of the phase masked beam [
17]. In the second approach a spatial filter is implemented in the detection path to mitigate the contribution from the side lobes in the measured signal. This approach can be viewed as a multiplication of the excitation and the detection volumes. The simplest example of spatially filtered detection is tight confocal detection, but more advanced filters can also be used. This approach has been demonstrated for confocal microscopy, resulting in a ≈20% resolution improvement [
18]. In addition, promising experimental and theoretical works have been carried out using amplitude masks combined with confocal detection [
19,
20], as well as super-oscillatory lenses [
21,
22,
23,
24,
25], again showing an improvement in resolution. Significant research has also been invested into eigenmode decomposition using amplitude and phase masks [
26,
27,
28].
In this work, we demonstrate that phase masked excitation in combination with selective detection can be utilized as a highly versatile superresolution technique that is suitable for a wide range of optical signals. In this case the side lobes are removed using spatially filtered detection, which results in artifact-free superresolution images. In comparison to previous reports using this type of technique, we have outlined the theoretical framework for calculating a non-paraxial focus from an arbitrary azimuthally symmetric phase mask for linear and two-photon (2P) excitation. We also explore the practical resolution limitations of the technique. This technique is experimentally demonstrated by acquiring two-photon luminescence (TPL) images of 80 nm spheres as well as fluorescence lifetime images of fluorescent polystyrene beads. To demonstrate the multimodal capabilities of this technique, we simultaneously acquire images from multiple optical signals with lateral resolution beyond what was previously accomplished [
18]. Since this technique does not require a specific optical process, prior information about the sample, or post-processing, it is suitable for most types of scanning-based microscopy. To demonstrate the versatility of this approach we acquired second harmonic generation (SHG), two-photon fluorescence (TPF), and TPL lifetime images of the same sample with <120 nm spatial resolution using 808 nm excitation. While all of these processes are nonlinear, this technique is suitable for linear optical processes as well.
2. Theory
To determine the ideal phase mask for superresolution microscopy, it is crucial to develop the theoretical frame work to calculate the resulting point-spread function (PSF). While the PSF is straight-forward to calculate in the paraxial limit, this is not valid for strongly the focused fields generated by high numerical aperture objectives used in superresolution applications. However, the formalism developed in Reference [
29] can be adapted for amplitude and phase mask calculations. This task has been addressed for amplitude masks [
20,
30,
31], however, it has only limited applications to Toraldo-style phase masks [
16,
17]. While genetic algorithm have been implemented to optimize Toraldo-style phase masks in the paraxial limit [
15], these calculations are insufficient for applications using strongly focused fields.
The focal fields can be calculated using the formalism for the full vectoral electric fields developed in Reference [
29]. In this case the electric fields generated by a lens with focal length,
f, can be written as,
where
,
,
are the electric fields polarized in the
x,
y, and
z directions, respectively,
and
are the refractive indices on either side of the interface, and
k is the wavevector. In Equation (
1),
are defined by the integrals
where
,
, and
are Bessel functions of the zeroth, first, and second kind.
is the radial coordinate in the focus. In Equation (
2),
and
are calculated using the + and −, respectively. While similar to the integral definitions for plane waves [
29], there is an additional phase term,
, for the angular integration limits
to
to account for the phase mask. This is conceptually illustrated in
Figure 1. Finally, the total intensity in the focus can be written as,
Using Equations (
1)–(
4), strongly focused fields resulting from a phase mask with an arbitrary number of azimuthally symmetric discrete phase steps can be calculated. While Equation (
4) is for a linear process, it is straight forward to extend it to nonlinear multiphoton processes. In this case the phase mask is applied to each excitation beam. For simplicity, we will restrict the discussion to linear and 2P excitation processes, where the focal fields for 2P excitation can be calculated by squaring Equation (
4) to give
.
Theoretical calculations of the focus with different phase masks are shown in
Figure 2. These calculations are for a 1.4 NA oil immersion objective and a phase mask with a single
phase step is applied to the excitation beam with the indicated radii. The applied phase masks are illustrated in
Figure 2a,e, where the dashed line indicates the back-aperture of the microscope objective of radius
and the gray circle is the applied
phase mask with the radius,
r, indicated.
Figure 2b–d shows the calculations for the transverse intensity distribution in the paraxial limit as well as the transverse and longitudinal vectorial intensity distributions, respectively. Comparing the paraxial and vectoral calculations illustrates that the focus elongates in the direction of the incident polarization (white arrow in
Figure 2b,f) as a result of the polarization mixing from the
component. The first column shows the case with no applied phase mask (blank), which results in a conventional focus described by an Airy function. Applying a phase mask spatial redistributes the energy in the focus and in this case results in a narrowed central lobe at the expense of increasing the intensity of the side lobes. As
r increases, the central lobe continues to narrow and the intensity of the side lobes increase further until
, where the central lobe disappears. At
central lobe reemerges and the original Airy function is recovered for
. Due to the polarization mixing, the central lobe also continues to elongate as
r increases. Since the elongation is caused by the
field components, it does not impact the focus for samples that are insensitive to
in which case the focus closely resembles the paraxial limit. While this phase mask reduces the size of the central lobe, it also increases the longitudinal extent of the focus. Therefore the phase mask should be chosen for the specific application. While more complicated excitation masks can be used, the width of the centroid cannot be significantly reduced by going beyond a single
phase step [
15].
The calculated PSF for 2P excitation processes using the same phase masks as the linear cases are shown in
Figure 2f–h. In this case the phase mask is applied to both excitation beams, resulting in a multiplicative effect. While these PSF calculations are for degenerative nonlinear processes where the two excitation beams are at the same wavelength, the same procedure can be applied to nondegenerative processes like four-wave mixing [
17]. Comparing the 2P PSF to the linear PSF the central lobe is further narrowed and there is a greater spatial separation between the central lobe and side lobes. In addition, the side lobes are suppressed in the 2P case for phase masks that result in the central lobe having higher intensity than the side lobe, as can be seen by comparing
Figure 2c,g.
Figure 2 illustrates that applying a single
phase step can narrow the central lobe of a PSF and therefore improve the lateral resolution of an optical microscope. However for superresolution imaging applications, the side lobes need to be removed or suppressed. As discussed earlier, the side lobe can be removed by spatially filtering the signal during collection. The total PSF of the optical system is then a multiplication of the excitation and collection PSF. Therefore the detection volume can be chosen to suppress the side lobes while maintaining the narrowed central lobe. The confocal detection PSFs, including diffraction effects, can be calculated by convolving a circle function representing the pinhole in object space with the scalar PSF, which is defined as,
The radius of the circle function representing the pinhole can be specified in Airy units (AU) to provide a straight forward comparison conventional confocal microscopy. In the theoretical limit for confocal detection, the circle function is replaced with a Dirac delta function, which results in a confocal detection PSF defined by Equation (
5). In the case of detection, there is no polarization mixing and therefore the collection volume is azimuthally symmetric.
Figure 3a–c and
Figure 3e–g show the lateral PSFs for the linear and 2P cases, respectively, for a 0.55
phase masked with confocal detection of ∞ AU, 1 AU and 0.5 AU. The longitudinal PSF for the linear and 2P cases are shown in
Figure 3d,h, respectively. Interestingly a pinhole of 1 AU is insufficient to fully suppress the side lobes. To more directly exhibit the resolution improvement from applying a phase-mask combined with confocal detection,
Figure 4a,b shows cross-sections of the focus for a blank phase mask (black dashed-dotted line), a 0.55
phase mask without (blue dashed line) and with confocal detection of 0.5 AU (solid red line) for the linear and 2P cases, respectively. These calculations illustrate that superresolution images can be acquired by combining a Toraldo-style phase mask with spatially filtered detection to suppress the side lobes.
While increasing the radius of the phase-mask does continue to reduce the size of the central lobe and therefore improve the resolution, it also re-distributes more of the energy into the side lobes as discussed earlier. Therefore, applications using this type of phase-mask requires balancing these two effect since the side lobes become more difficult to suppress as the strength increases and could potentially result in photo-damage to the sample. The practicality of a given phase-mask can be evaluated using the ratio of the amplitudes of the side and central lobes,
, as a metric. Plot of the full width at half maximum (FWHM) of the central lobe (black lines) on the left vertical axis and
(blue lines) on the right vertical axis as a function of
r for the linear and 2P cases are shown in
Figure 4c,d, respectively. The cases with (solid lines) and without (dashed lines) confocal detection are shown. The red dashed lines indicate a phase mask of
used experimentally. As these plots illustrate, while the central lobe can become arbitrarily narrow, the intensity of side lobes dominate for
and imposes a practical limitation on the maximum phase-mask radius.
3. Experimental Setup and Results
Figure 5 shows a diagram of the experimental setup. A phase mask is applied using a spatial light modulator (SLM) to the excitation beam from Ti:Sapphire with ≈140 fs pulses centered at 808 nm. The applied phase mask included a
phase step and a background phase to remove the inherent wavefront distortion of the SLM, which was measured using a Shack-Hartmann wavefront sensor. The SLM was over-filled by a factor 2 to ensure a uniform intensity pattern across the SLM. The pattern on the SLM is imaged onto the back aperture of a high numerical aperture (1.4 NA) microscope objective using relay lenses and then focused onto the sample that is raster-scanned through the focus to create images. The resulting signal is collected with the same microscope objective and sent through a single mode fiber after passing through a shortpass filter to remove the laser excitation. The lens focusing onto the single mode fiber was chosen such that the detection area was 0.5 AU for 808 nm excitation. The signal is spectrally separated using dichroic beamsplitters and sent to three time-resolved single photon counting modules (SPCM). The three SPCM detection regions as selected with bandpass filters are: 400 nm–410 nm (SHG), 420 nm–500 nm (TPF + TPL), and 555 nm–565 nm (TPL). The images presented are 256 × 256 pixels with dwell times of a 10 ms–30 ms depending on the signal levels. To compensate for the signal decrease by applying a phase mask, the incident excitation power was increased by a factor 3 compared to the blank images.
To experimentally demonstrate this technique, 80 nm gold spheres were drop-cast onto a glass coverslip and imaged using TPL. In this case, the photons were sent to a single SPCM.
Figure 6a shows the conventional image with a blank phase mask and without spatially filtered detection. Applying a phase mask of 0.55
to the excitation beam without spatially filtered detection results in a narrowed centroid and increased intensity in the side lobe (
Figure 6b), as expected from the calculations. The results from applying a phase mask and spatially filtering the detection are shown in
Figure 6c. In this case the narrow centroid remains while the side lobes are strongly suppressed. Particles in close proximity can be resolved in the superresolution image, as indicated by the white arrows. Interestingly, the focal pattern is slight rotated for some of the particles, indicating sensitivity to the orientation and symmetry of the particles. For the same excitation power, the signal decreases by a factor 4 due to the reduced intensity of centroid and spatial filtered detection. Therefore the excitation power was increased by a factor 3 in the superresolution case from 0.37 mW to 1.1 mW. The increased energy in the side lobes is not a significant source of photo-damage due to the large spatial area that the lobes cover.
Figure 6d shows cross-sections of the signal from the particle along the white lines in
Figure 6a–c for the blank mask (black), phase mask only (blue), and a phase mask with spatially filtered detection (red). As a comparison, confocal detection using a blank phase mask is shown in green. The full width at half maximum for the blank mask without and with confocal detection are ≈194 nm and ≈159 nm, whereas superresolution cross-section is ≈118 nm. Note that vectoral polarization mixing is seen by an elongation of the images of the particles in
Figure 6a as well as the intensity asymmetry in the side lobe in
Figure 6b, which is in agreement with the calculations shown in
Figure 2. These results experimentally demonstrate that this technique is capable of acquiring superresolution images without any post-processing or prior information about the sample.
This phase-mask technique is suitable to use for a variety of optical processes. As an example, fluorescent beads (175 nm diameter) were deposited on a coverslip and the resulting fluorescence lifetime images without and with the phase mask (
) are shown in
Figure 7a,b, respectively. In this case, an air microscope objective (0.95 NA) was used. Comparing the two images shows a distinct resolution improvement due to the phase-mask. Furthermore, this data set demonstrates the suitability of this technique for 2P superresolution fluorescence lifetime imaging microscopy (FLIM), which has broad applications in biology.
Finally, to illustrate the versatility of this technique we fabricated a sample with fluorophores in addition to the 80 nm gold particles. This sample was fabricated by spin coating a concentrated (
M) solution of coumarin derivative fluorophores onto a glass coverslip followed by 5–10 nm of poly(methyl methacrylate) (PMMA) to increase the photostability of the fluorophores. Then 80 nm gold spheres were drop-cast onto the sample. This sample was chosen because it provides three spectrally separated and unique photophysical signals. The gold particles provide SHG and TPL signals and the fluorophores provide longer lifetime TPF.
Figure 8a shows spectra acquired on (red) and off (black) a gold particle and the detection window for each SPCM is indicated. Since our technique is scanning-based, time-resolved SPCM can be used, as has also been shown for STED [
32,
33]. The SHG signal from the gold particles is due to asymmetries in the shape of the particles that break the inversion symmetry [
34]. The fluorophore used was selected such that the peak fluorescence is red-shifted from the SHG and blue-shifted from the TPL. The three signals were spectrally separated with dichroic beamsplitters and bandpass filters, although a fraction of the tail from the TPL is present in the TPF channel. The spectral widths of the bandpass filters were chosen to balance the relative signal levels. The fluorescence from the fluorophores can be seen on and off the particle as expected since the fluorophores were spin coated prior to drop casting the gold particles, whereas the SHG and TPL signals are only seen on the particle. In addition to spectral separation, the lifetime of the TPL and TPF are also quite different.
Figure 8b shows lifetime decay curves on (red) and off (black) the particles indicated by the white arrow in
Figure 8e. The lifetime is longer away from the particle as expected for fluorescence. On the particle both short and longer lifetimes can be seen. The shorter lifetimes (<0.5 ns) originate from the gold TPL as well as the fluorophores in close proximity to the gold particles.
Lifetime images of the same location for the SHG, TPF + TPL, and TPL signals are shown in
Figure 8c–h, respectively. The left column shows the imaging with a blank phase mask and without spatially filtered detection. The right column shows superresolution images using a phase mask and spatially filtered detection. The decay curve at each pixel was fit with double exponential. The weighted average lifetime and the signal intensity are shown using the hue and saturation of the color scale, respectively. As expected, the lifetimes of the SHG and TPL from the gold particles are instrument response limited and the TPF image shows a mixture of longer lifetimes from the coumarin as well as short lifetimes from the gold. The TPF and TPL signals were acquired simultaneously with an excitation power of 0.4 mW and 1.2 mW for a blank and 0.55
phase mask, respectively. The SHG images were acquired afterward using a higher laser power due to the weak signal (1.75 mW and 5.25 mW for the blank and phase masked images, respectively). A slightly smaller phase mask (0.5
) was also used due to the chromatic aberrations of the lens focusing onto the single mode fiber. Since SHG is a
process, requiring broken inversion symmetry, and the TPL signal is a
, the differences in SHG and TPL responses/images are expected. These images demonstrate the ability of this superresolution imaging technique to measure a wide range of optical processes and acquire lifetime images.