Article

An Economically Viable Minimalistic Solution for 3D Display Discomfort in Virtual Reality Headsets Using Vibrating Varifocal Fluidic Lenses

by Tridib Ghosh 1,*, Mohit Karkhanis 2 and Carlos H. Mastrangelo 1,2
1 NewEyes, Inc., Salt Lake City, UT 84102, USA
2 Department of Electrical and Computer Engineering, University of Utah, Salt Lake City, UT 84112, USA
* Author to whom correspondence should be addressed.
Virtual Worlds 2025, 4(3), 38; https://doi.org/10.3390/virtualworlds4030038
Submission received: 19 June 2025 / Revised: 16 August 2025 / Accepted: 20 August 2025 / Published: 26 August 2025

Abstract

Herein, we report a USB-powered VR-HMD prototype integrated with our 33 mm aperture varifocal liquid lenses and electronic drive components, all assembled in a conventional VR-HMD form factor. In this volumetric-display-based VR system, a sequence of virtual images is rapidly flash-projected at different plane depths in front of the observer and synchronized with the correct accommodation provided by the varifocal lenses, giving depth-matched focusing at a chosen sweep frequency. This projection mechanism resolves the vergence–accommodation conflict (VAC) present in conventional fixed-depth VR. Additionally, the system can provide refractive error corrections, such as for myopia and hyperopia, for prescription users, and it does not require any eye-tracking system. We experimentally demonstrate that these lenses can vibrate at frequencies approaching 100 Hz and report the frequency response of the varifocal lenses and their focal characteristics in real time as a function of the drive frequency. When integrated with the prototype’s 120 fps VR display system, the lenses produce a net diopter change of 2.3 D at a sweep frequency of 45 Hz while operating at ~70% of their maximum actuation voltage. The components add a total weight of around 50 g to the off-the-shelf VR set, making it a cost-effective and lightweight minimal solution.

1. Introduction

Since the first report on stereoscopic systems by Wheatstone in 1838 [1], technological advances in electronic hardware and computational capabilities have transformed stereoscopy-based virtual reality (VR) head-mounted displays (HMDs) into viable candidates for near-eye wearable devices that are both digitally interactive and capable of supporting time-intensive applications. This emerging VR technology is based on stereoscopic observation of a simulated virtual environment in a near-eye HMD that facilitates interaction [2] and creates a sense of presence and immersion in the virtual environment, eliciting a heightened degree of “engagement” with the virtual world while the user remains in a different physical world [3,4,5]. Owing to its inclusive and interactive nature, providing sensory “stimuli” of virtual immersion, VR technology has been shown to accelerate learning and training compared with two-dimensional (2D) display modules like desktops, tablets, and mobile devices. Jensen et al. [6] recently reported that such an immersive experience can positively affect the acquisition of cognitive, psychomotor, and affective skills. In fact, a recent study suggests that virtual memory palaces in the VR HMD condition provide a 9% improvement in memory recall ability compared to the desktop condition [7]. Beyond gaming and entertainment, VR has many emerging applications in the areas of education, industrial design and manufacturing, retail, cinema, healthcare, sports, and aviation [8,9,10,11,12,13,14,15], to name a few. Stereoscopic VR displays, however, suffer from certain drawbacks and challenges. In terms of display performance, AR and VR face several common challenges in satisfying demanding human vision requirements, including field of view (FoV), angular resolution, dynamic range, eyebox, and correct depth cues. Other considerations include ergonomics, user experience, weight and compactness, and affordability, among others. A detailed discussion of these challenges and trade-offs is reported elsewhere [16,17].
Perhaps the most critical of these challenges to have been widely studied to date is the absence of the appropriate depth cue for natural human viewing in 3D environments and the consequent need to alter the conventional optical design and configuration. Of the eight visual cues required for depth perception [18], VR stereoscopic displays offer all but one: the “convergence and accommodation” cue in the personal space (0.1~10 m). Hence, depth perception, which forms an integral part of a truly focused 3D immersive experience, is lost. Generally speaking, vergence and accommodation cues are synchronized by the brain–eye neural circuitry for disparity-free natural viewing; changes in one drive the other, and vice versa [19]. The situation changes when viewing virtual objects in 3D stereoscopic displays: the observer’s eyes are focused on a fixed plane, while the brain interprets the images as originating from a different (farther or closer) plane. This mismatch between the focus cues and depth perception disrupts the neural synchronization of the two cues and is commonly referred to as the Vergence–Accommodation Conflict (VAC) [20,21]. It results in perceptual distortions and difficulty fusing and focusing on a stimulus, often manifesting as blurred, shifted, or double images that induce eye fatigue and dizziness after even short use of VR HMDs [20,22,23,24,25]. Munafo et al. reported that the incidence of motion sickness in commercially available VR devices can range from 33% for male participants up to 78% for their female counterparts [26], with the experience worsening as the relative speed of object motion increases. This limits VR applications with objects up close that require accurately focused information, such as aviation, diagnostic and surgical training, and HAZMAT operations. While gaming and entertainment are opt-in activities, the wider adoption of VR as a professional training or learning tool for extended periods of time by and large remains unrealized. Another drawback of these VR HMDs is their inability to provide refractive error corrections for the sizable user base that requires prescription eyeglasses [27].

2. Background

In order to resolve the VAC problem, the projection system must be able to project virtual images at various distances; the projection optics of the HMD must therefore be tunable. Three-dimensional displays with variable image distances have been proposed and constructed using various approaches [27,28]. Some of the proposed solutions to the problem of providing such depth cues attempt to approximate focus cues; varifocal displays, monovision displays, and even some implementations of multifocal displays fall into this category. As an alternative to focus tuning, the simpler approach of monovision has also been proposed [29,30]. Monovision has, however, been reported to reduce sharpness with only a marginal change in fusion or accommodation compared to conventional near-eye displays [27]. Since digital images are offered to only one eye, this approach is better suited to AR applications than to immersive VR [16]. Software approaches like retinal blur or gaze-contingent disparity rendering [29,30,31,32] provide neither natural focus cues nor refractive error corrections.
Volume-sweeping varifocal and multifocal displays are the most straightforward approach to projecting 3D images in a volumetric space, using tunable lens optics that project the corresponding virtual images at different depths [33,34]. In this approach, the 3D volume in front of the observer is swept periodically, and planar images of objects at different depths are projected at different times. Fundamentally, this method of rendering is the most natural, and it is able to render object slices at any distance from the observer. Varifocal near-eye displays have a single image plane where the vergence and focus cues match, and this plane is moved using focus-tunable lenses [27,35], deformable membrane mirrors [36], or actuated fixed-focus optical components [37]. In a varifocal display, all pixels are at the same focal plane, and therefore virtual pixels that do not lie on the plane of focus need to be synthetically blurred in proportion to their distance from the plane of focus. Varifocal displays need to track the accommodation state of the pupil [27] or assume that the pupils are accommodated to the eye convergence distance [27,36]. Multifocal near-eye displays, first proposed by Akeley et al. [38], display a small number of images at different depths; the images are perceived additively [34,38,39,40,41]. In Akeley et al. [38] and MacKenzie et al. [41], subregions of an LCD panel were mapped to different focal planes using beam splitters. Liu and Hua [40] and Love et al. [34] proposed a switchable lens to multiplex between the multiple focal planes. Several variations of the varifocal schemes have been reported that partially overcome the disadvantages of the time-multiplexing approach. Hu and Hua [42,43] proposed using high-speed optical components to achieve a larger number of focal planes. Because a relatively small number of depth planes are used to represent objects occupying a large volume, multifocal plane displays need scene decomposition algorithms to optimally represent a 3D scene using a few 2D image planes. Content generated by these scene decomposition algorithms provides synthetic focus cues to represent objects that lie in between the focal planes. MacKenzie et al. [41] proposed a per-pixel linear blending approach. Narain et al. [44] proposed an optimized blending algorithm that can demonstrate occlusion, reflection, and non-Lambertian effects, while Mercier et al. [45] and Lee et al. [46] proposed new scene decomposition techniques that are tolerant to eye movements. While scene decomposition algorithms help to depict imagery that lies between the focal planes, the spatial frequency of the fused image is inversely related to the focal plane separation [39,47]. Recently, swept-volume displays that utilize fast-moving lenses (~60 Hz) coupled with DMD light projection systems have been implemented; Chang et al. [48] and Rathinavel et al. [49] reported such volumetric displays with large numbers of focal depths, 40 and 280, respectively. Although the depth-scan-based volume-sweeping method is very simple, the scheme has several disadvantages. First, the tunable lens must be swept at very high frequencies so that the eye does not notice gradual focus changes; hence, a fast lens is needed. Second, the duty cycle of the image at any given depth is small, proportionally reducing the object brightness. Third, because each virtual image is projected during only part of the volume sweep, the observer can also see the objects in front of or behind it. This gives the objects a ghostly appearance in the absence of software object occlusion checks. Temporal multiplexing can also introduce perceived flicker and requires display refresh rates higher than the flicker-fusion threshold frequency.
Multifocal displays present multiple focal image planes simultaneously, allowing the viewer to refocus on any object, while varifocal displays utilize eye vergence to determine the observed object distance, refocusing or blurring object images. A key disadvantage of the varifocal scheme is that it must sense the eye vergence correctly; hence, multifocal displays fundamentally provide a more realistic display experience. The oldest and most common VAC-resolved multifocal display scheme utilizes a fixed-distance projection screen observed through a varifocal lens placed at a fixed distance from the screen smaller than the lens’s focal length. This produces a virtual image in front of the observer at a depth that depends on the lens power. In order to produce a 3D display, the lens power must oscillate at high frequency, producing a periodic series of virtual images projected at different depths. Such a scheme has been implemented in both HMD and 3D display applications, utilizing both mechanical shape-changing varifocal lenses and solid-state digital polarization and tunable Fresnel varifocal lenses. In a third approach, more suitable for augmented reality (AR), a similar effect is achieved using solid-state waveguide multi-plane displays. Light field displays [50,51,52,53,54,55,56,57] and holographic displays [58,59,60] have also been proposed as a potential fix for VAC, since they can approximate the wavefront perceived by the observer for different views and provide relatively accurate focus cues. Light field display approaches are interesting as they do not require tunable optical elements or tunable lenses, but they require high-resolution displays and are computation-intensive. Current implementations of light field displays are either diffraction-limited [51,56] or have poor resolution due to spatial-angular resolution trade-offs [50,57]. Current implementations of holographic near-eye displays have a very small eyebox [58] and, like light field displays, have high computational requirements. In addition, the phase-only spatial light modulator (SLM) technologies that holographic displays are based on also need improvement for practical implementation in near-eye VR HMDs. These devices are fundamentally very complex and are currently at an early development stage. The multifocal approaches have many trade-offs. Solid-state approaches offer low weight, very high reliability, and drop resistance, but the complexity and cost of these devices do not scale down with volume, resulting in very high production unit costs. Thus, these devices are only practical in high-end, high-cost applications. Solid-state multifocal display devices are, in general, very fast, with some implementations reaching kHz image frame rates, but they suffer from strong chromatic aberration and consequently poor image quality. On the other hand, mechanical varifocal lens approaches are less reliable than solid-state devices and can lead to higher set weights and unwanted vibrations, but they provide polarization-independent imaging and can deliver higher-quality images with larger apertures and low chromatic aberration. Mechanical varifocal schemes are also much simpler than the solid-state approaches, leading to much lower unit costs.
In order to synchronize the virtual object depth with the appropriate focus for normal visual cues, several research studies have reported on the use of gaze detection technology in combination with either optically tunable lenses or movable mechanical display systems to address these discomforts. While the use of tunable varifocal optics has emerged as a viable solution, the vast majority of these varifocal systems have apertures of 20 mm or less. Digital crosstalk between the gaze detection and tunable actuation modules usually makes current varifocal VR techniques computation-intensive, sometimes adding noticeable lag to these adaptive systems. Additionally, eye-tracking sensory systems do not provide the desired depth-sensing accuracy, especially for high-frequency depth variations and surfaces of varying transparency and/or reflectivity [61,62]. Ebner et al. [63] recently reported a video see-through mixed reality system with full support for focus cues using an eye-tracking module. Of the methods discussed above, the most realistic images are produced by the varifocal technique, which, when combined with very bright light sources, high-speed projection systems, and appropriate signal processing, can produce very realistic 3D scenery images. The major problem that remains to be solved is the practical implementation of these methods in a manner that is inexpensive and computationally balanced for a given application. In this paper, we present the implementation of a low-cost VAC-free multifocal display that utilizes a single-plane screen and an oscillating varifocal mechanical lens, producing a sequence of virtual images at a volume sweep rate of 30 Hz and a frame rate of 90 fps. This true-focus system does not require an eye-tracking module, thus avoiding the additional complexity and inaccuracies reported previously [63]. The sections below describe the system construction, the low-cost varifocal mechanical lens, the multifocal display design parameters, and the experimental performance.

3. Electro-Optical Design

3.1. Optical Layout

Figure 1 below shows a schematic of our minimalistic low-cost multifocal VR set. It consists of a high-speed, high-brightness screen placed in front of a varifocal lens set, one per eye. A pair of convex lenses of fixed optical power (~16 D) is placed 20 mm away from the observer’s eyes, and a mobile smartphone screen projecting stereoscopic images of an object is placed 58 mm away from the lenses. Because the screen is closer than the focal point of the lens (62.5 mm), the screen-lens system produces a virtual image in front of the observer, about 0.8 m away from the lens. In conventional VR HMDs, the virtual image distance is fixed no matter where the object is placed in the virtual space; most inexpensive fixed-lens virtual reality sets use a virtual image projected between 0.8 and 3 m away from the observer. The root of the VAC in conventional VR sets is this inability to project virtual images of objects at different distances. In order to resolve the VAC problem, the projection optics of the set must be tunable. For the particular distances used, a virtual image location between 25 cm (reading distance) and infinity requires a varifocal lens that ranges between 13 D and 17 D, a 4 D change in lens power. In order to display a realistic volumetric image, the varifocal lens power is periodically swept between these two power limits while the image of the objects at each virtual depth is projected by the smartphone screen with consistent object occlusions.
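As a quick sanity check, the thin-lens equation reproduces the distances quoted above. The short sketch below is illustrative only and assumes an ideal thin lens; the function names are ours and are not part of the system software.

```python
# Thin-lens sanity check of the distances quoted above (illustrative sketch).
# Sign convention: virtual image distances are negative (image in front of observer).

def virtual_image_distance(power_d, screen_dist_m):
    """Image distance (m) for a screen placed screen_dist_m from a lens of power_d diopters."""
    # 1/f = 1/d_o + 1/d_i  =>  1/d_i = P - 1/d_o
    return 1.0 / (power_d - 1.0 / screen_dist_m)

def required_power(image_dist_m, screen_dist_m):
    """Lens power (D) that places the virtual image at image_dist_m (negative = virtual)."""
    return 1.0 / screen_dist_m + 1.0 / image_dist_m

screen = 0.058  # screen 58 mm from the lens
print(virtual_image_distance(16.0, screen))   # ~ -0.81 m: virtual image ~0.8 m away
print(required_power(float("-inf"), screen))  # ~17.2 D for an image at infinity
print(required_power(-0.230, screen))         # ~12.9 D for ~25 cm from the eye
                                              # (the lens sits 20 mm from the eye)
```

The two extreme cases recover the 13-17 D tuning range stated in the text.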
Fundamentally, for the observer to disregard screen flickering, the frequency of the periodic display of virtual images should be at least 60 Hz, and preferably 90 Hz. We therefore chose a varifocal lens sweep frequency of 30 Hz with three or more virtual image displays per period. Within each 30 Hz sweep, we can send multiple frames for different object depths. In this prototype, we used three frames per scene at a 90 Hz frame rate (using a display system with a 120 fps refresh rate) at this 30 Hz sweep frequency. The faster this oscillation and the higher the number of images displayed to approximate the 3D volume scenery in front of the observer, the less time is available to present each image. Therefore, as the number of images presented per period increases, one needs a progressively brighter and faster screen. This is a limitation of the multifocal imaging scheme that requires careful selection of display hardware.
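The frame budget implied by this choice reduces to one relation: the required display rate is the sweep frequency times the number of depth planes per sweep. A minimal sketch of this arithmetic (our own illustration, not the prototype firmware):

```python
# Frame budget for a swept-volume multifocal display (illustrative).
sweep_hz = 30        # volume sweep frequency
planes = 3           # depth planes displayed per sweep in this prototype
required_fps = sweep_hz * planes
frame_time_ms = 1000.0 / required_fps
print(required_fps, frame_time_ms)   # 90 fps, ~11.1 ms available per depth plane
# Each plane is lit for only ~1/planes of the sweep, so perceived brightness
# falls roughly in proportion to the plane count; hence the need for a
# brighter and faster screen as the number of planes grows.
```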

3.2. Piezoelectric Varifocal Tunable Liquid Lenses and Drive Electronics

We have selected the lightweight piezoelectric liquid varifocal tunable lenses (VTLs) demonstrated elsewhere by the authors [64]. Figure 2 shows a schematic diagram and photographs of the VTLs used in the VAC-free VR set. We drive the lenses with a periodic sinusoidal excitation; the full multifocal range (0.25 m to infinity) is thus swept within each sinusoidal cycle. Since the dynamics of the liquid lenses are linear to first order, the sinusoidal VTL power dynamic range is somewhat increased by driving the lenses near resonance. The piezoelectric sinusoidal drive for the lenses is generated using low-cost amplifier electronics.
Figure 3 below shows a simplified schematic of the lens driving electronics. Each piezoelectric lens is driven differentially between the outputs of the two single-ended high-voltage (HV) op-amps of a dual operational-amplifier chip (PA79DK), connected to the lens in a bridge configuration. This permits the generation of a bipolar lens drive from a single-ended high-voltage supply. The gains of the two high-voltage op-amps on each chip are set to −10× and −1×, respectively. The (−) input of the −10× PA79DK op-amp is connected to the output of a cascade level-shifter preamplifier, while the (+) input is set to a fixed preset voltage VR2. The level-shifter preamplifier input is driven by a single-ended op-amp that provides both a high input impedance and a linear −5× preamplifier gain. A common-gate high-voltage signal MOSFET is utilized to shift the preamplifier output near +VR2. The preamp input is connected to the output of a digital-to-analog converter (DAC). The DAC is driven by a microcontroller that synthesizes the sinusoidal excitation waveforms in sync with the mobile screen cycle. The varifocal lens voltage is thus 100× larger than the DAC output signal. In the current HV amplifier configuration, the high-voltage amplifier board is driven by a single 120 V DC power supply generated from a 5 V supply using a miniature high-voltage DC converter (EMCO A-series). All electronic components were purchased from www.digikey.com (accessed on 10 June 2023).
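The waveform synthesis is simple enough to sketch. The snippet below generates one period of a DAC lookup table under the gain chain described above (−5× preamp into a −10×/−1× bridge, i.e., ~100× from DAC to lens); the DAC full-scale voltage, table length, and phase value are our assumptions for illustration, not specifications of the actual firmware.

```python
import numpy as np

# Illustrative sketch of the microcontroller's sinusoid synthesis (not the
# actual firmware). Assumed values: 3.3 V DAC full scale, 256-entry table.
DAC_FS = 3.3
CHAIN_GAIN = 100.0   # -5x preamp followed by the -10x / -1x bridge (differential)

def dac_table(lens_vpp, n=256, phase_comp_deg=0.0):
    """One period of the DAC waveform for a desired lens peak-to-peak voltage.

    phase_comp_deg pre-advances the waveform to cancel the lens phase lag
    measured near resonance (see Section 4.2)."""
    vpp_dac = lens_vpp / CHAIN_GAIN              # e.g. 104 Vpp at lens -> 1.04 Vpp at DAC
    t = np.arange(n) / n
    phase = np.deg2rad(phase_comp_deg)
    return DAC_FS / 2 + 0.5 * vpp_dac * np.sin(2 * np.pi * t + phase)

table = dac_table(lens_vpp=104, phase_comp_deg=40.0)  # 40 deg is a placeholder value
```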
The HV board and additional digital microcontroller boards fit inside the VR system shell. The VAC-compensated VR prototype integrates two varifocal tunable eyepiece lenses and the specialized associated hardware (mounts, drivers, display and communication interfaces) necessary to display 3D virtual imagery. The VAC-compensated VR is shown in Figure 4 integrated within a conventional Google Cardboard-type VR HMD set (dimensions of 18.42 × 12.07 × 9.14 cm). These sets utilize a mobile phone CPU to produce stereoscopic images and the phone screen as the object that is virtually projected in front of the observer. Note that, in our prototype, the tunable lenses and the drive electronics fit within the void space between the lenses and the mobile phone screen. In this experimental implementation, the electronics driving the lenses are connected to the phone via a Wi-Fi connection, and the phone executes a simple scene-rendering app built with Unity and the Google Cardboard SDK. The phone app adjusts the location of the virtual images as the objects in the scene are sequentially displayed. For this experimental prototype set, we utilized three different depth zones for the virtual images, displayed at 90 frames/s.
Figure 5a shows examples of virtual images of far and close objects projected by this set at two different vergence distances. Note that the blurring of objects differs between the two images, indicative of the VAC compensation in action: in the left image the vergence is near, and in the right it is farther. This simple image set demonstrates that VAC compensation can be realized inexpensively with our piezo lenses. Figure 5b shows examples of images captured by a simulated eye camera system, consisting of a 35 mm SLR camera and a focus-adjustable lens, as projected through the variable virtual image system using three different virtual image distances. The complete image is obtained by projecting, at each virtual depth, the objects corresponding to the virtual distance zone centered at that depth, while all other objects are blacked out. The system cycles through all the depth zones and objects in a repeating manner. The left and right images were obtained by adjusting the camera lens focus to near and middle distances, respectively. Two challenges affecting image quality are evident in the photographs. The first is the compensation of magnification differences between projection zones: the virtual image size is a function of the virtual image depth, necessitating projected image scaling and accurate image registration. The second is the overall darkening of the images: the more zones the virtual space is divided into, the lower the image brightness.
From these preliminary tests performed on the prototype set, we identified the features that can be improved for the realization of commercial-grade prototypes. Namely, two improvements are required to enhance the quality of the 3D virtual image projection. The first is the utilization of faster lenses; the second is the utilization of faster, brighter screens. Faster tunable lens operation translates into smoother projected images and improved display quality. Faster displays permit the projection of more image planes per cycle, resulting in less flickering and smoother, more natural images.

3.3. Lens Vibration Frequency, Virtual Image Frame Rate and Visual Perception

Central to the volumetric display are the vibrating lens frequency and the number of image planes needed to represent the scenery with good quality. Since depth binning is used, with virtual images displayed at specific distances, the tunable lens must first reach and settle at the correct power; then the image of the object is flashed in focus, with all objects that fall in other bins displayed out of focus. The settling time of the lens power can, for example, limit the achievable frame rate. Lee et al. [65] recently showed that a more efficient way to drive the 3D display is not to drive the lens to the right power and hold it, but instead to excite the lens motion continuously at resonance while the correct images with proper occlusion are displayed on the object screens. Using this concept, Rathinavel et al. [49] displayed low-flicker 3D images at 280 different depths. We believe the continuous sweep approach is effective in producing excellent 3D images, but it is not necessary to project such a large number of depths to produce reasonably good-quality 3D images.
There are two major known effects behind this reasoning, both consequences of the workings and limitations of the human visual system. The first is the flicker-fusion threshold. The flicker-fusion threshold, or flicker-fusion rate, is a concept in the psychophysics of vision, defined as the frequency at which an intermittent light stimulus appears completely steady to the average human observer. The flicker-fusion threshold is related to persistence of vision. Although flicker can be detected for many waveforms representing time-variant fluctuations of intensity, it is conventionally, and most easily, studied in terms of sinusoidal modulation of intensity. Flicker fusion is important in all technologies for presenting moving images, nearly all of which depend on presenting a rapid succession of static images (e.g., the frames in a cinema film, TV show, or digital video file). If the frame rate falls below the flicker-fusion threshold for the given viewing conditions, the flicker will be apparent to the observer, and the movements of objects will appear jerky. For the purposes of presenting moving images, the human flicker-fusion threshold is usually between 60 and 90 Hz.
The second effect is the inability of the human eye to resolve distances accurately. Intrinsically, the human eye accommodates for the vergence of the object or projected virtual image. This vergence is proportional to the inverse of the image distance; therefore, as distances become larger, the lens power adjustments become progressively smaller. Physiologically, the human eye cannot distinguish refractive errors that produce wavefront errors smaller than the average aberration of the eye itself. The average human eye’s RMS aberration is 0.35 µm [66]. This aberration can be translated into an equivalent defocus [67]:
$$P_{eye} = \frac{4\pi\sqrt{3}\cdot a_{rms}}{A_p} \qquad (1)$$
where $a_{rms}$ is the average RMS aberration (in microns) and $A_p$ is the pupil area in mm². For a typical pupil diameter of 3 mm, this translates into a defocusing error equivalence of 0.28 D. The smaller the pupil area, the larger the equivalent defocus for a fixed RMS aberration. The pupil diameter is also a function of the object distance [68]: the closer the object (or virtual image), the smaller the pupil. If a good volumetric VR set projects virtual images from reading distance (0.25 m = 4 D) to infinity (= 0 D), then, from Equation (1), the diopter accuracy of the tunable lens should be between 0.28 D and 0.6 D, as the aberrated eye behaves as a low-pass power-averaging filter. For the calculated tunable power range of 4 D of the VAC-corrected set, the number of depth planes at which images should be projected is therefore 7 to 15. A higher number of projected depth images would be perceived no differently than ~10 images per cycle in the volumetric sweep for the majority of the population. The number of displayed frames also depends on the visual acuity of the subject as well as the level of illumination. An additional frame-number-reducing effect is the depth of field:
$$DOF = \frac{2u^2 N c}{f^2} \qquad (2)$$
where u is the distance of the object, N is the lens system f-number, c is the circle of confusion radius, and f is the system focal length. In optics and photography, the depth of field is the distance between the nearest and farthest objects that are in acceptably sharp focus in the image. In our system, the f-number is the ratio of the system focal distance to the pupil aperture diameter. The depth of field depends entirely on what level of sharpness is considered acceptable, based on the selected radius of the circle of confusion, but it helps narrow down the number of image planes [63], as the observer perceives all objects in focus within a depth bin equal to the depth of field. The tolerable limit for the circle of confusion diameter is traditionally set at 0.05 mm. Since we are using a finite number of virtual image frames projected at different depths, for a varifocal range of 4 D, about eight virtual images or more are sufficient to produce a reasonable representation of the 3D scenery. At a 30 Hz periodic lens drive cycle, an effective virtual image frame rate of 240 fps or more is thus required of the display screen. For experimental and viability study purposes, we used a 120 fps display system (Samsung S20); the added cost of higher-fps display systems is justifiable for bulk commercial manufacturing. A third important consideration is the screen brightness setting. The total energy of the optical light emitted per frame is proportional to the display time for that frame. Since the position and power change sinusoidally, the speed of the virtual image is not constant; one must therefore reduce or increase the display time to normalize the object brightness for the different virtual image depth bin volumes.
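The arithmetic in this section is compact enough to check directly. The sketch below evaluates Equations (1) and (2) and derives the plane count from the tolerance band quoted above; the function names are our own illustration, not the authors' software.

```python
import math

# Illustrative check of Equations (1) and (2) and the depth-plane count.

def eye_defocus_tolerance(a_rms_um, pupil_diam_mm):
    """Equivalent defocus (D) of the eye's own aberration, Eq. (1)."""
    pupil_area = math.pi * (pupil_diam_mm / 2.0) ** 2
    return 4 * math.pi * math.sqrt(3) * a_rms_um / pupil_area

def depth_planes(power_range_d, tolerance_d):
    """Planes needed so adjacent planes differ by no more than the tolerance."""
    return math.ceil(power_range_d / tolerance_d)

def depth_of_field(u_mm, f_number, coc_mm, focal_mm):
    """Depth of field, Eq. (2); all lengths in mm."""
    return 2.0 * u_mm**2 * f_number * coc_mm / focal_mm**2

# 4 D varifocal range with the 0.28-0.6 D tolerance band quoted above:
print(depth_planes(4.0, 0.28), depth_planes(4.0, 0.6))   # 15 and 7 planes
```

The two tolerance limits reproduce the 7-15 plane range stated in the text.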

4. VTL Dynamics Experiments

The faster the VTL is vibrated, the smoother the 3D image produced. It is therefore very important to utilize a high-frequency lens. For our low-cost system, we measured both the frequency response of the lens’s central deflection and the power vs. time under sinusoidal excitation. These two measurements are discussed below.

4.1. Varifocal Lens Frequency Response

In this experiment, we utilized the HV amplifier driven by an audio signal generator to vibrate a single varifocal lens at different frequencies at low voltage amplitude. The mechanical deflection vs. time of the lens at its apex was measured and recorded using a high-frequency laser spot distance probe (Micro-Epsilon optoNCDT 1420, Micro-Epsilon, Ortenburg, Germany) sampled at 2 kHz. The amplitude of the lens peak deflection is related to the power of the lens as follows: to first order, the optical power of this type of fluid lens depends only on the membrane dimensions, the liquid refractive index, and the lens peak deflection Δh:
$$P = \frac{4(n-1)\cdot r_r^2 \cdot \Delta h}{r_o^4} \qquad (3)$$
where $r_r$ and $r_o$ are the piston and membrane aperture radii, and n is the index of refraction of the lens liquid [64]. Equation (3) indicates that the optical power is independent of any other system parameters. The audio signal driving the HV amplifier was swept between 10 and 100 Hz. Figure 6 shows the normalized frequency response amplitude vs. frequency under a sinusoidal lens excitation voltage of 75 V peak to peak. The lens exhibits a pole, with the amplitude dropping 6 dB (50% in amplitude) at about 45 Hz. While the drop is steep, the lens can be driven at higher frequencies by utilizing a higher drive voltage. A significant effect of the resonance peak is the phase shift change near resonance. This implies that, near the peak, the motion of the lens is out of phase with respect to the drive signal. The phase shift must therefore be measured and compensated by the microcontroller to ensure that the lens power corresponds to the actual object virtual image distance. The phase shift is measured as described in the following section.
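To make the role of the resonance concrete, the following sketch evaluates a generic second-order (mass-spring-damper) model of the lens deflection; the resonance frequency and quality factor are illustrative assumptions, not values fitted to the measured response of Figure 6.

```python
import numpy as np

# Generic second-order resonator model of the lens deflection (illustrative;
# f0 and Q are assumed, not fitted to the measured lens).
f0, Q = 60.0, 1.5

def response(f_hz):
    """Complex deflection response, normalized to the quasi-static deflection."""
    s = 1j * f_hz / f0
    return 1.0 / (1.0 + s / Q + s**2)

for f in (10, 30, 45, 60, 90):
    h = response(f)
    print(f, round(abs(h), 2), round(np.degrees(np.angle(h)), 1))
# Near f0 the phase approaches -90 degrees: the lens motion lags the drive,
# which is why the controller must measure and compensate this phase shift.
```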

4.2. Stroboscopic Varifocal Lens Power vs. Time Profile

The previous measurement provides the lens peak deflection vs. drive frequency, but it does not ensure that the lens surface is not distorted or mode-switched during the periodic lens excitation. In this experiment, we utilized the HV amplifier driven by a dual-output signal generator, as shown in Figure 7 below. The HV amplifier signal is a sinusoidal voltage of frequency f with a DC offset. The amplified sinusoidal voltage $V_D(t)$ drives one piezoelectric lens. The second output of the dual signal generator is used to produce a narrow voltage strobe pulse $V_S(t)$, with a width of typically 1% of the sinusoidal period, offset by a fixed phase ϕ, as shown in the upper-right inset of Figure 7. The narrow strobe pulse, generated by a DDS signal generator (Koolertron GH-CJDS66, Koolertron, Hong Kong, China), is used to strobe a collimated LED light source (ThorLabs M625L3-C1, ThorLabs, Newton, NJ, USA) through an intensity modulator (ThorLabs LEDD1B, ThorLabs, Newton, NJ, USA). This light passes through the VTL and is captured by a Shack–Hartmann wavefront sensor (SHS) (Thorlabs WFS150-7AR, ThorLabs, Newton, NJ, USA). Since the phase of the pulse is fixed, the wavefront produced at the SHS is static; hence, it can easily be captured by the SHS. The power of the lens at phase ϕ can be determined from the wavefront radius of curvature (ROC). The lens is placed 55 mm away from the SHS sensor. When the ROC is measured in mm, the power of the lens is then $P_\phi = 1000\,(ROC - 55 - 13.6)^{-1}$. Figure 8 shows an example screenshot of the wavefront captured by the SHS sensor. By sweeping the phase ϕ between 0 and 360 degrees while keeping the frequency constant, one can observe the actual P(t). Figure 9 (Top) shows the waveform of P(ϕ) for different drive frequencies. Note that the waveform is not exactly a sinusoid, indicating possible partial excitation of higher-order modes or nonlinearities in the lens. Also note that the waveforms are shifted in phase due to the resonant nature of the piezoelectric VTL. Figure 9 (Bottom) shows the relationship between phase and excitation frequency. At a drive voltage of 104 V at 45 Hz, the lens experiences a diopter change of 2.3 D, sufficient to span a VAC-free virtual depth range of 43.5 cm to infinity in the VR set.
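A sketch of the ROC-to-power reduction of such a phase sweep follows; it assumes the relation as reconstructed above (ROC in mm) and uses placeholder ROC readings, since the actual values come from the SHS software.

```python
import numpy as np

# Illustrative reduction of a stroboscopic phase sweep (hypothetical data).
# Assumes the power-vs-ROC relation reconstructed above: P = 1000/(ROC - 55 - 13.6).

def lens_power_from_roc(roc_mm):
    """Lens power (D) from the wavefront ROC (mm) measured 55 mm from the lens."""
    return 1000.0 / (roc_mm - 55.0 - 13.6)

phases_deg = np.arange(0, 360, 15)               # strobe offsets swept at fixed frequency
roc_readings = np.full(phases_deg.size, 140.0)   # placeholder; real values from the SHS
power_vs_phase = lens_power_from_roc(roc_readings)
print(dict(zip(phases_deg.tolist()[:3], power_vs_phase[:3])))  # ~14 D per placeholder ROC
```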

5. Summary

We present the implementation of a swept-volume VAC-free VR headset that utilizes low-cost components: two lightweight VTLs and an HV amplifier, with imagery supplied by a conventional smartphone screen. The tested set can present virtual images at multiple depths at a swept-volume frequency of 30 Hz, with frame rates determined by the mobile device’s maximum frame rate (120 fps for a Samsung S20), without the need for any eye-tracking system. Our tunable lenses can be tuned for higher operation frequencies subject to aperture modifications. We also present dynamic measurements of the VTL characteristics using time-domain and stroboscopic measurement setups. The field of view (FoV) for this large-aperture-lens VR system is 105 degrees with 1520 × 1440 AMOLED display pixels per eye. The depth range of 43.5 cm to infinity, driven at around 100 V, can be improved with higher drive voltages (up to 150 V, depending on the maximum application voltage of the piezo material). A magnetometer-based feedback circuitry mechanism [69] can be integrated with the tunable lens system to enhance long-term system performance and stability.

Author Contributions

Conceptualization, T.G. and C.H.M.; methodology, M.K. and C.H.M.; software, M.K.; validation, T.G., M.K. and C.H.M.; formal analysis, M.K.; investigation, T.G.; resources, T.G. and C.H.M.; data curation, C.H.M.; writing—original draft preparation, T.G.; writing—review and editing, T.G.; visualization, T.G. and C.H.M.; supervision, T.G.; project administration, T.G.; funding acquisition, T.G. All authors have read and agreed to the published version of the manuscript.

Funding

Private equity funding was used for all research purposes.

Data Availability Statement

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Conflicts of Interest

Author Tridib Ghosh is employed by the company NewEyes, Inc. The remaining authors declare no conflicts of interest.

References

  1. Wheatstone, C. Contributions to the physiology of vision Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision. Philos. Trans. R. Soc. Lond. 1838, 128, 371–394. [Google Scholar] [CrossRef]
  2. Steuer, J. Defining virtual reality: Dimensions determining presence. In Communication in the Age of Virtual Reality; Hillsdale, N.J., Ed.; Lawrence Erlbaum Associates: Mahwah, NJ, USA, 1995; pp. 33–56. [Google Scholar]
  3. Slater, M.; Wilbur, S. A framework for immersive virtual environments (five): Speculations on the role of presence in virtual environments. Presence Teleoperators Virtual Environ. 1997, 6, 603–616. [Google Scholar] [CrossRef]
  4. Walsh, K.R.; Pawlowski, S.D. Virtual reality: A technology in need of is research. Commun. Assoc. Inf. Syst. 2002, 8, 20. [Google Scholar] [CrossRef]
  5. Witmer, B.G.; Singer, M.J. Measuring presence in virtual environments: A presence questionnaire. Presence 1998, 7, 225–240. [Google Scholar] [CrossRef]
  6. Jensen, L.; Konradsen, F. A review of the use of virtual reality head-mounted displays in education and training. Educ. Inf. Technol. 2018, 23, 1515–1529. [Google Scholar] [CrossRef]
  7. Krokos, E.; Plaisant, C.; Varshney, A. Virtual Memory Palaces: Immersion aids Recall. Virtual Real. 2019, 23, 1–15. [Google Scholar] [CrossRef]
  8. Choi, S.; Jung, K.; Noh, S.D. Virtual reality applications in manufacturing industries: Past research, present findings, and future directions. Concurr. Eng. 2015, 23, 40–63. [Google Scholar] [CrossRef]
  9. Li, X.; Yi, W.; Chi, H.-L.; Wang, X.; Chan, A.P. A critical review of virtual and augmented reality (VR/AR) applications in construction safety. Autom. Constr. 2018, 86, 150–162. [Google Scholar] [CrossRef]
  10. Hu, Y.; Malthaner, R.A. The feasibility of three-dimensional displays of the thorax for preoperative planning in the surgical treatment of lung cancer. Eur. J. Cardiothorac. Surg. 2007, 31, 506–511. [Google Scholar] [CrossRef]
  11. Radianti, J.; Majchrzak, T.A.; Fromm, J.; Wohlgenannt, I. A systematic review of immersive virtual reality applications for higher education: Design elements, lessons learned, and research agenda. Comput. Educ. 2020, 147, 103778. [Google Scholar] [CrossRef]
  12. Mihelj, M.; Novak, D.; Beguš, S. Virtual Reality Technology and Applications; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
  13. Eichenberg, C. (Ed.) Virtual Reality in Psychological, Medical and Pedagogical Applications; InTech: Vienna, Austria, 2012. [Google Scholar]
  14. Mendiburu, B. 3D Movie Making: Stereoscopic Digital Cinema from Script to Screen; Focal Press: Wellington, OX, USA, 2009. [Google Scholar]
  15. Rendon, A.A.; Lohman, E.B.; Thorpe, D.; Johnson, E.G.; Medina, E.; Bradley, B. The effect of virtual reality gaming on dynamic balance in older adults. Age Ageing 2012, 41, 549–552. [Google Scholar] [CrossRef] [PubMed]
  16. Zhan, T.; Yin, K.; Xiong, J.; He, Z.; Wu, S.-T. Augmented Reality and Virtual Reality Displays: Perspectives and Challenges. iScience 2020, 23, 101397. [Google Scholar] [CrossRef] [PubMed]
  17. Xiong, J.; Hsiang, E.-L.; He, Z.; Zhan, T.; Wu, S.-T. Augmented reality and virtual reality displays: Emerging technologies and future perspectives. Light Sci. Appl. 2021, 10, 216. [Google Scholar] [CrossRef] [PubMed]
  18. Cutting, J.E.; Vishton, P.M. Perceiving layout and knowing distances: The integration, relative potency, and contextual use of different information about depth. In Perception of Space and Motion; Handbook of Perception and Cognition; Academic Press: London, UK, 1995; Volume 5, pp. 69–117. [Google Scholar]
  19. Fincham, E.F.; Walton, J. The reciprocal actions of accommodation and convergence. J. Physiol. 1957, 137, 488–508. [Google Scholar] [CrossRef]
  20. Hoffman, D.M.; Girshick, A.R.; Akeley, K.; Banks, M.S. Vergence-accommodation conflicts hinder visual performance and cause visual fatigue. J. Vis. 2008, 8, 33. [Google Scholar] [CrossRef]
  21. Watt, S.J.; Akeley, K.; Ernst, M.O.; Banks, M.S. Focus cues affect perceived depth. J. Vis. 2005, 5, 834–862. [Google Scholar] [CrossRef]
  22. Emoto, M.; Niida, T.; Okano, F. Repeated vergence adaptation causes the decline of visual functions in watching stereoscopic television. J. Disp. Technol. 2005, 1, 328–340. [Google Scholar] [CrossRef]
  23. Carnegie, K.; Rhee, T. Reducing Visual Discomfort with HMDs Using Dynamic Depth of Field. IEEE Comput. Graph. Appl. 2015, 35, 34–41. [Google Scholar] [CrossRef]
  24. Mark, F.; Marsh, J.P. Overcoming Vergence Accommodation Conflict in Near Eye Display Systems; Whitepaper. 2019. Available online: https://docslib.org/doc/4636635/overcoming-vergence-accommodation-conflict-in-near-eye-display-systems (accessed on 18 June 2025).
  25. Lambooij, M.; Fortuin, M.; Heynderickx, I.; IJsselsteijn, W. Visual discomfort and visual fatigue of stereoscopic displays: A review. J. Imaging Sci. Technol. 2009, 53, 1–14. [Google Scholar] [CrossRef]
  26. Munafo, J.; Diedrick, M.; Stoffregen, T. The virtual reality head-mounted display Oculus Rift induces motion sickness and is sexist in its effects. Exp. Brain Res. 2017, 235, 889–901. [Google Scholar] [CrossRef]
  27. Padmanaban, N.; Konrad, R.; Stramer, T.; Cooper, E.A.; Wetzstein, G. Optimizing virtual reality for all users through gaze-contingent and adaptive focus displays. Proc. Natl. Acad. Sci. USA 2017, 114, 2183–2188. [Google Scholar] [CrossRef]
  28. Kramida, G. Resolving the vergence accommodation conflict in head-mounted displays. IEEE Trans. Vis. Comput. Graph. 2015, 22, 1912–1931. [Google Scholar] [CrossRef] [PubMed]
  29. Konrad, R.; Cooper, E.A.; Wetzstein, G. Novel optical configurations for virtual reality: Evaluating user preference and performance with focus-tunable and monovision near-eye displays. In Proceedings of the ACM CHI Conference on Human Factors in Computing System, New York, NY, USA, 7–12 May 2016; pp. 1211–1220. [Google Scholar]
  30. Johnson, P.V.; Parnell, J.A.; Kim, J.; Saunter, C.D.; Love, G.D.; Banks, M.S. Dynamic lens and monovision 3D displays to improve viewer comfort. Opt. Express 2016, 24, 11808–11827. [Google Scholar] [CrossRef] [PubMed]
  31. Mauderer, M.; Conte, S.; Nacenta, M.A.; Vishwanath, D. Depth perception with gaze contingent depth of field. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, ACM SIGCHI, Toronto, ON, Canada, 26 April 2014; pp. 217–226. [Google Scholar]
  32. Maiello, G.; Chessa, M.; Solari, F.; Bex, P.J. The (in)effectiveness of simulated blur for depth perception in naturalistic images. PLoS ONE 2015, 10, e0140230. [Google Scholar] [CrossRef] [PubMed]
  33. Suyama, S.; Date, M.; Takada, H. Three-Dimensional Display System with Dual-Frequency Liquid-Crystal Varifocal Lens. Jpn. J. Appl. Phys. 2000, 39, 480. [Google Scholar] [CrossRef]
  34. Love, G.D.; Hoffman, D.M.; Hands, P.J.; Gao, J.; Kirby, A.K.; Banks, M.S. High-speed switchable lens enables the development of a volumetric stereoscopic display. Opt. Express 2009, 17, 15716–15725. [Google Scholar] [CrossRef]
  35. Liu, S.; Cheng, D.; Hua, H. An optical see-through head mounted display with addressable focal planes. In Proceedings of the 2008 7th IEEE/ACM International Symposium on Mixed and Augmented Reality, Cambridge, UK, 15–18 September 2008; pp. 33–42. [Google Scholar] [CrossRef]
  36. Dunn, D.; Tippets, C.; Torell, K.; Kellnhofer, P.; Akşit, K.; Didyk, P.; Myszkowski, K.; Luebke, D.; Fuchs, H. Wide Field Of View Varifocal Near-Eye Display Using See-Through Deformable Membrane Mirrors. IEEE Trans. Vis. Comput. Graph. 2017, 23, 1322–1331. [Google Scholar] [CrossRef]
  37. Akşit, K.; Lopes, W.; Kim, J.; Shirley, P.; Luebke, D. Near-eye Varifocal Augmented Reality Display Using See-through Screens. ACM Trans. Graph. 2017, 36, 1–13. [Google Scholar] [CrossRef]
  38. Akeley, K.; Watt, S.J.; Girshick, A.R.; Banks, M.S. A Stereo Display Prototype with Multiple Focal Distances. In ACM SIGGRAPH 2004 Papers, SIGGRAPH ’04; ACM: New York, NY, USA, 2004; pp. 804–813. [Google Scholar] [CrossRef]
  39. Hu, X.; Hua, H. Design and tolerance of a free-form optical system for an optical see-through multi-focal-plane display. Appl. Opt. 2015, 54, 9990–9999. [Google Scholar] [CrossRef]
  40. Liu, S.; Hua, H.; Cheng, D. A Novel Prototype for an Optical See-Through Head-Mounted Display with Addressable Focus Cues. IEEE Trans. Vis. Comput. Graph. 2010, 16, 381–393. [Google Scholar] [CrossRef]
  41. MacKenzie, K.J.; Hoffman, D.M.; Watt, S.J. Accommodation to multiple focal plane displays: Implications for improving stereoscopic displays and for accommodation control. J. Vis. 2010, 10, 22. [Google Scholar] [CrossRef]
  42. Hu, X.; Hua, H. Design and Assessment of a Depth-Fused Multi-Focal-Plane Display Prototype. J. Disp. Technol. 2014, 10, 308–316. [Google Scholar] [CrossRef]
  43. Hu, X.; Hua, H. High-resolution optical see-through multi-focal-plane head-mounted display using freeform optics. Opt. Express 2014, 22, 13896–13903. [Google Scholar] [CrossRef] [PubMed]
  44. Narain, R.; Albert, R.A.; Bulbul, A.; Ward, G.J.; Banks, M.S.; O’Brien, J.F. Optimal Presentation of Imagery with Focus Cues on Multiplane Displays. ACM Trans. Graph. 2015, 34, 1–12. [Google Scholar] [CrossRef]
  45. Mercier, O.; Sulai, Y.; Mackenzie, K.; Zannoli, M.; Hillis, J.; Nowrouzezahrai, D.; Lanman, D. Fast Gaze-contingent Optimal Decompositions for Multifocal Displays. ACM Trans. Graph. 2017, 36, 237. [Google Scholar] [CrossRef]
  46. Lee, S.; Cho, J.; Lee, B.; Jo, Y.; Jang, C.; Kim, D.; Lee, B. Foveated Retinal Optimization for See-Through Near-Eye Multi-Layer Displays. IEEE Access 2018, 6, 2170–2180. [Google Scholar] [CrossRef]
  47. Hua, H. Enabling Focus Cues in Head-Mounted Displays. Proc. IEEE 2017, 105, 805–824. [Google Scholar] [CrossRef]
  48. Chang, J.H.R.; Kumar, B.V.K.V.; Sankaranarayanan, A.C. Towards multifocal displays with dense focal stacks. ACM Trans. Graph. 2018, 37, 1–13. [Google Scholar] [CrossRef]
  49. Rathinavel, K.; Wang, H.; Blate, A.; Fuchs, H. An Extended Depth-at-Field Volumetric Near-Eye Augmented Reality Display. IEEE Trans. Vis. Comput. Graph. 2018, 24, 2857–2866. [Google Scholar] [CrossRef]
  50. Lanman, D.; Luebke, D. Near-eye light field displays. ACM Trans. Graph. 2013, 32, 1–10. [Google Scholar] [CrossRef]
  51. Maimone, A.; Lanman, D.; Rathinavel, K.; Keller, K.; Luebke, D.; Fuchs, H. Pinlight Displays: Wide Field of View Augmented Reality Eyeglasses Using Defocused Point Light Sources. In ACM SIGGRAPH 2014 Emerging Technologies, SIGGRAPH’14; ACM: New York, NY, USA, 2014; p. 1. [Google Scholar] [CrossRef]
  52. Wetzstein, G.; Lanman, D.; Hirsch, M.; Raskar, R. Tensor displays: Compressive light field synthesis using multilayer displays with directional backlighting. ACM Trans. Graph. 2012, 31, 1–11. [Google Scholar] [CrossRef]
  53. Pamplona, V.F.; Oliveira, M.M.; Aliaga, D.G.; Raskar, R. Tailored displays to compensate for visual aberrations. ACM Trans. Graph. 2012, 31, 1–12. [Google Scholar] [CrossRef]
  54. Javidi, B.; Okano, F. (Eds.) Three-Dimensional Television, Video and Display Technologies; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
  55. Matusik, W.; Pfister, H. 3D TV: A Scalable System for Real-Time Acquisition, Transmission, and Autostereoscopic Display of Dynamic Scenes. ACM Trans. Graph. 2004, 23, 814–824. [Google Scholar] [CrossRef]
  56. Huang, F.-C.; Chen, K.; Wetzstein, G. The Light Field Stereoscope: Immersive Computer Graphics via Factored Near-eye Light Field Displays with Focus Cues. ACM Trans. Graph. 2015, 34, 1–60. [Google Scholar] [CrossRef]
  57. Hua, H.; Javidi, B. A 3D integral imaging optical see-through head mounted display. Opt. Express. 2014, 22, 13484–13491. [Google Scholar] [CrossRef] [PubMed]
  58. Maimone, A.; Georgiou, A.; Kollin, J.S. Holographic Near-eye Displays for Virtual and Augmented Reality. ACM Trans. Graph. 2017, 36, 1–16. [Google Scholar] [CrossRef]
  59. Matsuda, N.; Fix, A.; Lanman, D. Focal Surface Displays. ACM Trans. Graph. 2017, 36, 1–14. [Google Scholar] [CrossRef]
  60. Shi, L.; Huang, F.-C.; Lopes, W.; Matusik, W.; Luebke, D. Near-eye light field holographic rendering with spherical waves for wide field of view interactive 3d computer graphics. ACM Trans. Graph. 2017, 36, 236. [Google Scholar] [CrossRef]
  61. Dunn, D. Required accuracy of gaze tracking for varifocal displays. In Proceedings of the IEEE Virtual Reality (VR), Osaka, Japan, 23–27 March 2019; pp. 1838–1842. [Google Scholar]
  62. Padmanaban, N.; Konrad, R.; Wetzstein, G. Autofocals: Evaluating gaze-contingent eyeglasses for presbyopes. Sci. Adv. 2018, 5, 1–2. [Google Scholar]
  63. Ebner, C.; Mori, S.; Mohr, P.; Peng, Y.; Schmalstieg, D.; Wetzstein, G.; Kalkofen, D. Video See-Through Mixed Reality with Focus Cues. IEEE Trans. Vis. Comput. Graph. 2022, 28, 2256–2266. [Google Scholar] [CrossRef]
  64. Hasan, N.; Kim, H.; Mastrangelo, C. Tunable-focus lens for adaptive eyeglasses. Opt. Express 2017, 25, 1221–1233. [Google Scholar] [CrossRef]
  65. Lee, S.; Jo, Y.; Yoo, D.; Cho, J.; Lee, D.; Lee, B. Tomoreal: Tomographic displays. arXiv 2018, arXiv:1804.04619. [Google Scholar] [CrossRef]
  66. Thibos, L.N.; Hong, X.; Bradley, A.; Cheng, X. Statistical variation of aberration structure and image quality in a normal population of healthy eyes. J. Opt. Soc. Am. A 2002, 19, 2329–2348. [Google Scholar] [CrossRef] [PubMed]
  67. Watson, A.B.; Ahumada, A.J., Jr. Predicting visual acuity from wavefront aberrations. J. Vis. 2008, 8, 1–19. [Google Scholar] [CrossRef] [PubMed]
  68. Lee, E.C.; Lee, J.W.; Park, K.R. Experimental Investigations of Pupil Accommodation Factors. Investig. Ophthalmol. Vis. Sci. 2011, 52, 6478–6485. [Google Scholar] [CrossRef] [PubMed]
  69. Ghosh, T.; Majumder, A. Adaptable Lenses for Smart Eyeglasses. U.S. Patent 11,927,774 B2, 12 March 2024. [Google Scholar]
Figure 1. Simplified schematic of our low-cost VAC-compensated VR set. The set consists of a conventional VR set with the addition of two fast varifocal tunable lenses (VTLs). The VTLs sweep the virtual image space in front of the observer at ≥30 Hz. The VAC compensation is realized by (1) partitioning of the virtual image depth space into 3–7 bins of acceptable focus clarity and (2) time-multiplexed display with object occlusion.
Figure 2. (Left) Simplified schematic of the tunable-focus lens excluding actuators, (Center) photograph of the actual lightweight tunable lens, and (Right) lens optical power as a function of actuator voltage [64]. This lens weighs ~15 g and can operate at frequencies of ~100 Hz.
Figure 3. (Left) Schematic circuit of the low-cost single-channel lens drive amplifier. The amplifier has one dual HV op-amp and a linear preamplifier stage. (Right) Photograph of the dual-channel drive amplifier board.
Figure 4. Photographs of the low-cost VAC-resolved VR set. (Left) The tunable lenses and the drive electronics all fit within the VR casing. (Right) This particular set was driven at a virtual image sweeping frequency of 30 Hz, directly from a phone set and screen running a Unity Google VR application switching between three different virtual image planes. The VAC compensation hardware costs only an additional $200–300 compared to a conventional VR set. Note that the cost reduces considerably at higher volumes.
Figure 5. (a) Examples of true VAC-compensated 3D virtual images generated by the experimental set of Figure 4 at a volumetric scan rate of 30 Hz. The left photo is the virtual image projected for close objects (flower and bus) and the right is the virtual image projected far (at infinity, building wall). The images were generated using the Unity VR system with the Google lens package. Note that the blurring of objects differs between the two images. This simple example demonstrates that VAC compensation can be realized with our inexpensive focus-tunable lenses. (b) Imaging of virtual images using a camera system emulating a human eye. In the left photograph, the camera was focused on the near range; in the right, the camera was focused at the mid-range. The adjustments in the size of the objects as they are projected further out are critical in producing sharp images with minimum distortion and additional defocusing.
Figure 6. Normalized frequency response of VTL lens under low drive excitation. The normalization is performed with respect to the maximum observed deflection.
Figure 7. Schematic of the stroboscopic system for the measurement of the lens power vs. time. The image seen by the SHS wavefront sensor is stationary, corresponding to the lens wavefront at phase ϕ.
Figure 8. Example of stroboscopic screen capture of the lens wavefront by the SHS sensor as displayed by the display software V4.5.
Figure 9. (Left) Stroboscopic measurement of lens power change vs. phase at different drive frequencies. (Right) Phase lag with respect to the input vs. driving frequency.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
