Article

Self-Calibration Spherical Video Stabilization Based on Gyroscope

Zhengwei Ren, Ming Fang and Chunyi Chen
1 School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China
2 School of Artificial Intelligence, Changchun University of Science and Technology, Changchun 130022, China
* Authors to whom correspondence should be addressed.
Information 2021, 12(8), 299; https://doi.org/10.3390/info12080299
Submission received: 19 June 2021 / Revised: 22 July 2021 / Accepted: 23 July 2021 / Published: 27 July 2021

Abstract

With the development of handheld video-capturing devices, video stabilization is becoming increasingly important. Gyroscope-based video stabilization methods show promising performance, since they provide more reliable three-dimensional (3D) camera rotation estimates, especially when scenes contain many moving objects or suffer from severe motion blur or illumination changes. However, gyroscope-based methods depend on the camera intrinsic parameters to perform video stabilization. Therefore, a self-calibrated spherical video stabilization method is proposed. It builds a virtual sphere whose radius is calibrated automatically and projects each frame of the video onto the sphere. By inversely rotating the spherical image according to the rotational jitter component, the dependence on the camera intrinsic parameters is relaxed. The experimental results showed that the proposed method needs no camera calibration and suppresses camera jitter with a gyroscope bound to the camera. Moreover, compared with other state-of-the-art methods, the proposed method improves the peak signal-to-noise ratio, the structural similarity metric, the cropping ratio, the distortion score, and the stability score.

1. Introduction

In recent years, with the development and popularization of handheld cameras and mobile phones, videos have been widely used to record interesting and important moments. However, when shooting while moving, the instability of the carrier often introduces varying degrees of image jitter into the captured video [1,2]. Such jitter not only reduces video quality, resulting in poor perception, but also hinders subsequent video processing [3]. Therefore, video stabilization is important.
The purpose of video stabilization is to suppress or weaken the impact of camera jitter on video quality, for the sake of both perception and subsequent processing. Video stabilization methods can be divided into three categories: mechanical image stabilization, optical image stabilization, and electronic image stabilization [4,5,6]. Mechanical image stabilization detects the jitter of the camera platform through gyro sensors and other devices and then adjusts a servo system to stabilize the image [7]. Optical image stabilization uses active optical components to adaptively adjust the optical path, compensating the image motion caused by the shaking of the camera platform [8]. Electronic image stabilization computes motion estimates between consecutive images, then performs motion smoothing and motion compensation on each frame of the video to obtain a stable video [9]. Although mechanical and optical image stabilization achieve good performance, they suffer from large volume, inconvenient carrying, and high cost. Therefore, electronic image stabilization has become a research hotspot in video stabilization.
Generally, electronic image stabilization includes three stages: camera motion estimation, motion smoothing, and video motion compensation [10]. According to the camera motion estimation method, video stabilization can be divided into vision-based methods and attitude-sensor-based methods [11]. Vision-based methods usually estimate camera motion from image sequences; most existing methods model the transformation between two consecutive frames as an affine or homography transformation. Such a transformation cannot model parallax, and it is affected by moving objects and illumination changes in real-world scenes. Attitude-sensor-based methods mainly use gyroscopes to estimate the camera rotation. Compared with vision-based methods, they return more reliable three-dimensional (3D) camera rotation estimates, especially when there are many moving objects in the scene or severe motion blur or illumination changes in the video. Jia et al. proposed a video motion smoothing method based on a pure 3D rotation motion model and camera intrinsic parameters, which smooths the rotation matrices through Riemannian geometry on a manifold [12]. Yang et al. proposed a Kalman filter on a Lie group manifold for video stabilization, using a gyroscope to obtain the rotation components for smoothing and an intrinsic parameter matrix to project the rotation onto the two-dimensional (2D) image for motion compensation [13]. Zhou et al. proposed a video stabilization method based on an optical flow sensor, which obtains rotation information from a gyroscope and describes image rotation as rotation around the z axis [14]. After using a gyroscope to compensate rotational jitter, Zhuang et al. used visual information to estimate the residual 2D translation in the image plane [15]. All these methods presume a calibrated camera and complete video stabilization using the camera intrinsic parameters. However, the intrinsic parameters may not be available during video acquisition, which hampers video stabilization.
Ren et al. proposed a virtual sphere model for video stabilization [16]. It focuses on spherical motion estimation of the image and does not need the camera intrinsic parameters; however, the spherical radius must be calibrated manually in advance. This paper proposes a self-calibration spherical video stabilization method based on a gyroscope, which improves the way the spherical radius is obtained and realizes automatic calibration. To the best of our knowledge, this is the first time that spherical radius self-calibration has been performed with a gyroscope in the area of video stabilization. In the 3D rotation model, the camera motion is regarded as a sequence of 3D rotation matrices. Motion smoothing is cast as a constrained regression problem, and the manifold structure of the rotation matrix sequence is exploited to smooth the path. Finally, the image sequence is compensated to obtain a stable image sequence. Compared with the state-of-the-art methods, the method described in this paper uses a spherical model to compensate the image: it obtains the optimal spherical radius through self-calibration, projects the image onto the spherical surface, and inversely rotates the sphere to compensate the image. The proposal relaxes the dependence on the camera intrinsic parameters and needs no manual calibration. Moreover, it can be applied to scenes with insufficient features.
The paper is organized as follows: Section 2 introduces the overall framework of the proposed method. Section 3 introduces the implementation details of the proposed method. Section 4 presents the experimental results. Section 5 states the conclusion.

2. Proposed Framework

As shown in Figure 1, the overall framework of spherical video stabilization based on a gyroscope consists of three parts (motion estimation, motion smoothing, and motion compensation) operating on the input data.
Input data: when a jitter video is obtained, the gyroscope data are collected at the same time.
Motion estimation: the 3D rotation transformation of the camera attitude at an adjacent time is calculated by the gyroscope data, and the camera rotation path is obtained cumulatively.
Motion smoothing: the camera rotation path smoothing is transformed into a constrained regression problem on a Riemannian manifold, and the optimal solution is calculated to obtain a smooth rotation path.
Motion compensation: the image is projected on a spherical surface, and the jitter rotation component is compensated by rotating the spherical surface. Then, the image is projected inversely onto a plane to obtain a stable video.

3. Methodology

The proposed self-calibration spherical video stabilization based on a gyroscope includes three main steps: motion estimation, motion smoothing, and motion compensation.

3.1. Motion Estimation and Smoothing

In the 3D rotation estimation module, the gyroscope data are used to estimate the 3D rotation of the camera. The rotational angular velocity $\omega = (\omega_\alpha, \omega_\beta, \omega_\gamma)$ of the camera in the 3D coordinate system is measured by the gyroscope, and the rotation angle $\theta = (\alpha, \beta, \gamma)^T = \Delta t\,\omega$ is obtained by integrating the angular velocity over time, where $\Delta t$ is the sampling interval of the gyroscope. The inter-frame rotation matrix $R$ corresponding to the rotation angle is then given by Equation (1):
$$R = \begin{bmatrix} \cos\beta\cos\gamma & \cos\beta\sin\gamma & -\sin\beta \\ \sin\alpha\sin\beta\cos\gamma - \cos\alpha\sin\gamma & \sin\alpha\sin\beta\sin\gamma + \cos\alpha\cos\gamma & \sin\alpha\cos\beta \\ \cos\alpha\sin\beta\cos\gamma + \sin\alpha\sin\gamma & \cos\alpha\sin\beta\sin\gamma - \sin\alpha\cos\gamma & \cos\alpha\cos\beta \end{bmatrix}. \tag{1}$$
The camera path $\mathrm{Path}_n$ is represented by Equation (2):
$$\mathrm{Path}_n = \prod_{i=1}^{n} R_{(i,\,i+1)}, \tag{2}$$
where $R_{(i,\,i+1)}$ is the rotation matrix between the $i$th frame and the $(i+1)$th frame.
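As a concrete illustration of Equations (1) and (2), the following minimal NumPy sketch (our own illustration; the function names and the assumption of one gyroscope sample per frame interval are ours, not the paper's) builds an inter-frame rotation from a gyroscope sample and accumulates the camera path:

```python
import numpy as np

def rotation_from_gyro(omega, dt):
    """Inter-frame rotation matrix of Eq. (1) from one gyroscope sample
    omega = (w_alpha, w_beta, w_gamma) in rad/s over the interval dt."""
    a, b, g = dt * np.asarray(omega, dtype=float)   # theta = (alpha, beta, gamma)
    ca, sa = np.cos(a), np.sin(a)
    cb, sb = np.cos(b), np.sin(b)
    cg, sg = np.cos(g), np.sin(g)
    return np.array([
        [cb * cg,                cb * sg,               -sb     ],
        [sa * sb * cg - ca * sg, sa * sb * sg + ca * cg, sa * cb],
        [ca * sb * cg + sa * sg, ca * sb * sg - sa * cg, ca * cb],
    ])

def camera_path(gyro_samples, dt):
    """Cumulative rotation path of Eq. (2): Path_n is the left-to-right
    product of the inter-frame rotations R_(1,2), ..., R_(n,n+1)."""
    path, R = [], np.eye(3)
    for omega in gyro_samples:
        R = R @ rotation_from_gyro(omega, dt)
        path.append(R)
    return path
```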
All the rotation matrices constitute the special orthogonal group $SO(3)$, in which every element $R$ satisfies the constraints $R R^T = I$ and $\det(R) = 1$; the group can also be regarded as an embedded Riemannian submanifold. The metric on this manifold is the geodesic distance shown in Equation (3):
$$d_g(R_m, R_n) = \left\| \operatorname{logm}\!\left( R_m^{T} R_n \right) \right\|_F, \tag{3}$$
where $\operatorname{logm}(\cdot)$ is the matrix logarithm operator and $\|\cdot\|_F$ is the Frobenius norm of a matrix.
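A direct transcription of Equation (3), assuming (as reconstructed above) that the matrix logarithm is taken of the relative rotation $R_m^T R_n$, might look like this:

```python
import numpy as np
from scipy.linalg import logm

def geodesic_distance(Rm, Rn):
    """Geodesic distance of Eq. (3): Frobenius norm of the matrix
    logarithm of the relative rotation between Rm and Rn."""
    return np.linalg.norm(logm(Rm.T @ Rn), 'fro')
```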
Motion smoothing can be formulated as an objective function, and the smoothed trajectory is obtained by solving the minimization problem in Equation (4):
$$\min_{\mathrm{Path}_n^{\mathrm{cur}}} \; \sum_{n=1}^{N} \frac{1}{2} d_g^2\!\left(\mathrm{Path}_n^{\mathrm{pre}}, \mathrm{Path}_n^{\mathrm{cur}}\right) + \alpha \sum_{n=1}^{N-1} \frac{1}{2} d_g^2\!\left(\mathrm{Path}_n^{\mathrm{cur}}, \mathrm{Path}_{n+1}^{\mathrm{cur}}\right), \tag{4}$$
where $\mathrm{Path}_n^{\mathrm{cur}}$ is the smoothed trajectory, $\mathrm{Path}_n^{\mathrm{pre}}$ is the original trajectory, and $\alpha$ is the weight controlling the smoothness of the stabilized trajectory. For each video sequence, the camera's rotation in 3D space can be mapped to a curve on a Riemannian manifold, and a stable 3D camera rotation is obtained by optimizing the geodesic distances along this curve. As shown in Figure 2, the rotation matrices are converted to Euler angles by Equation (5) in order to visualize the original and smoothed paths:
$$\begin{cases} \alpha = \operatorname{atan}\!\left( R_{23} / R_{33} \right) \\ \beta = -\operatorname{asin}\!\left( R_{13} \right) \\ \gamma = \operatorname{atan}\!\left( R_{12} / R_{11} \right) \end{cases}, \tag{5}$$
where $R_{ij}$ is the element in the $i$th row and $j$th column of the rotation matrix $R$ in Equation (1).
In Figure 2, the solid line is the original path and the dotted line is the smoothed path; the smoothed path suppresses the jitter while retaining the intentional motion.
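The paper solves Equation (4) directly on the manifold; the sketch below is only an illustrative approximation (our assumption, not the authors' solver) that linearizes the problem in the rotation-vector tangent space, where the objective reduces to a tridiagonal linear system. It also includes the Euler-angle extraction of Equation (5) used for plotting the paths:

```python
import numpy as np
from scipy.spatial.transform import Rotation

def smooth_path(path, alpha=50.0):
    """Tangent-space approximation of Eq. (4): minimize
    sum ||x_n - p_n||^2 + alpha * sum ||x_n - x_{n+1}||^2 over rotation
    vectors, i.e. solve (I + alpha * L) x = p with L the path-graph
    Laplacian. Assumes consecutive rotations stay well within pi."""
    p = Rotation.from_matrix(np.stack(path)).as_rotvec()     # N x 3
    N = len(p)
    A = np.eye(N)
    for n in range(N - 1):                                   # add alpha * L
        A[n, n] += alpha; A[n + 1, n + 1] += alpha
        A[n, n + 1] -= alpha; A[n + 1, n] -= alpha
    x = np.linalg.solve(A, p)
    return list(Rotation.from_rotvec(x).as_matrix())

def euler_angles(R):
    """Euler angles (alpha, beta, gamma) of Eq. (5) for plotting."""
    return (np.arctan2(R[1, 2], R[2, 2]),    # alpha
            -np.arcsin(R[0, 2]),             # beta
            np.arctan2(R[0, 1], R[0, 0]))    # gamma
```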

3.2. Motion Compensation

Motion compensation is a key operation of video stabilization. A virtual sphere is first established with the optical center of the camera as its center, and each frame is projected onto the virtual spherical surface [16]. Next, the component causing the camera jitter is obtained from the difference between the smoothed camera path and the original path. Finally, motion compensation is carried out by inversely rotating the spherical surface to compensate the jittered images.

3.2.1. Spherical Projection

According to the pinhole camera model, a 2D image coordinate system can be converted into a 3D spherical coordinate system. In this paper, the angle-based spherical projection method is used, and the model is shown in Figure 3.
The resolution of the image captured by the camera is $W \times H$, where $W$ is the image width and $H$ is the image height. In the spherical model with a right-handed coordinate system, the origin $O$ is the optical center of the pinhole camera; the $y$ axis is the optical axis and passes through the central point $o$ of the image; $Oo$ is the radius of the sphere, plotted as a red line in Figure 3; the projection of a point $P$ in the world coordinate system is denoted as $p$ in the image coordinate system $uv$, which is centered at $(u_0, v_0)$; and the projection of $P$ on the virtual sphere is denoted as $P_S$, which can be represented by angular coordinates: $\varphi$ is the angle between $Op_{xoy}$ (the projection of $Op$ onto the $xOy$ plane) and the $y$ axis, and $\theta$ is the angle between $Op_{yoz}$ (the projection onto the $yOz$ plane) and the $z$ axis. Thus, $(\varphi, \theta)$ can be calculated by Equation (6):
$$\begin{cases} \theta = \dfrac{\pi}{2} - \arctan\dfrac{v - H/2}{r} \\[6pt] \varphi = \arctan\dfrac{W/2 - u}{r} \end{cases}, \tag{6}$$
where $r$ is the radius of the sphere and $(u, v)$ are the pixel coordinates in the image.
The spherical coordinates of point $P_S$ are calculated by Equation (7):
$$\begin{cases} x = \dfrac{r \sin\theta \tan\varphi}{\sqrt{1 + \sin^2\theta \tan^2\varphi}} \\[6pt] y = \dfrac{r \sin\theta}{\sqrt{1 + \sin^2\theta \tan^2\varphi}} \\[6pt] z = \dfrac{r \cos\theta}{\sqrt{1 + \sin^2\theta \tan^2\varphi}} \end{cases}. \tag{7}$$
In this way, the points in the image plane are converted to the corresponding points on the 3D sphere, realizing the conversion of 2D plane images into 3D spherical images. As shown in Figure 4, taking the data published by Jia [12] as an example, the camera's focal length of $f = 649$ pixels was used as the radius for the spherical projection: Figure 4a shows the 2D plane image, and Figure 4b shows the corresponding spherical projection.
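Equations (6) and (7) vectorize naturally over the whole pixel grid; the following NumPy sketch (our illustration) maps every pixel of a $W \times H$ image to a point on a sphere of radius $r$:

```python
import numpy as np

def project_to_sphere(W, H, r):
    """Angle-based spherical projection of Eqs. (6)-(7): returns an
    H x W x 3 array of sphere points, one per pixel (u, v)."""
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    theta = np.pi / 2 - np.arctan((v - H / 2) / r)       # Eq. (6)
    phi = np.arctan((W / 2 - u) / r)
    st, tp = np.sin(theta), np.tan(phi)
    denom = np.sqrt(1 + st ** 2 * tp ** 2)               # normalizes onto the sphere
    x = r * st * tp / denom                              # Eq. (7)
    y = r * st / denom
    z = r * np.cos(theta) / denom
    return np.stack([x, y, z], axis=-1)
```

One can check that $x^2 + y^2 + z^2 = r^2$ for every pixel, so the projected points indeed lie on the sphere.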

3.2.2. Self-Calibration of the Spherical Radius

As shown in Figure 5, the projections of a point $P$ on two adjacent frames $I_1$ and $I_2$ are $P_1$ and $P_2$, respectively. The relative position deviation of $P_1$ and $P_2$ between the two frames can be regarded as a jitter component. $\theta_1$ and $\theta_2$ are the angles between the corresponding spherical projected points and the optical axis. The rotation angle of $I_2$ relative to $I_1$ is denoted by $\theta$, which conforms to the spherical rotation model with radius $r_b$. Since the gyroscope is bound to the camera, $\theta$ can be obtained from the gyroscope data. The key problem is how to determine the spherical radius $r_b$, which decides the stabilization effectiveness. Consider the example in Figure 5: when the radius of the sphere $r \neq r_b$, the rotation angle $\theta'$ implied by the sphere is not equal to the rotation angle $\theta$ obtained by the gyroscope, which does not conform to the rotation model of the gyroscope; when the radius equals $r_b$, the two angles coincide, which conforms to both the imaging model of the camera and the rotation model of the gyroscope.
Therefore, it is necessary to calibrate the spherical radius $r$ so that it is consistent with the gyroscope rotation; in theory, the spherical radius should equal the focal length of the camera. To achieve self-calibration, the mean square error (MSE), commonly used in image processing to measure the difference between two images, is used to select the optimal spherical radius: the MSE of the video stabilized with each candidate radius is evaluated, and the radius is chosen by Equation (8):
$$\min_r \; \frac{1}{N} \sum_{k=1}^{N-1} \left( \frac{1}{W \times H} \sum_{x=1}^{W} \sum_{y=1}^{H} \left( I_k^r(x, y) - I_{k+1}^r(x, y) \right)^2 \right), \tag{8}$$
where $N$ is the total number of frames in the video and $I_k^r(x, y)$ is the $k$th frame stabilized with radius $r$. Finding the optimal spherical radius $r_b$ is thus transformed into solving the optimization problem of Equation (8).
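The radius search of Equation (8) can be sketched as a simple grid search. In the sketch below, `stabilize` is a placeholder (our assumption) for the full spherical stabilization pipeline, and the candidate range mirrors the 200 to 3000 pixel sweep used in Section 4; the step size is our choice:

```python
import numpy as np

def mean_interframe_mse(frames):
    """Mean per-pixel MSE between consecutive stabilized frames, i.e.
    the quantity of Eq. (8) averaged over the sequence."""
    f = np.asarray(frames, dtype=np.float64)
    return np.mean((f[:-1] - f[1:]) ** 2)

def calibrate_radius(frames, gyro_path, stabilize, radii=range(200, 3001, 50)):
    """Self-calibration of the spherical radius: pick the candidate r
    that minimizes Eq. (8). `stabilize(frames, gyro_path, r)` is assumed
    to return the sequence stabilized with spherical radius r."""
    return min(radii, key=lambda r: mean_interframe_mse(stabilize(frames, gyro_path, r)))
```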

3.2.3. Spherical Rotation Compensation

The existing motion compensation methods transform the 3D rotation into a 2D image coordinate transformation through the camera intrinsic parameter matrix [13]. The intrinsic matrix $K$ contains five parameters: $f_x$ and $f_y$ are the focal lengths in pixels, $s$ is the skew coefficient between the $x$ axis and the $y$ axis, and $(c_x, c_y)$ is the principal point. It can be written as:
$$K = \begin{bmatrix} f_x & s & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}.$$
Under a pure 3D camera rotation, the pixel $[u'_{ij}, v'_{ij}]$ in a stabilized frame can be calculated from the original pixel by Equation (9):
$$\begin{bmatrix} u'_{ij} \\ v'_{ij} \end{bmatrix} = g\!\left( K \, \mathrm{Path}_k^{\mathrm{pre}} \left( \mathrm{Path}_k^{\mathrm{cur}} \right)^{T} K^{-1} \begin{bmatrix} u_{ij} \\ v_{ij} \\ 1 \end{bmatrix} \right), \tag{9}$$
where $\mathrm{Path}_k^{\mathrm{cur}}$ is the smoothed rotation matrix, $\mathrm{Path}_k^{\mathrm{pre}}$ is the original rotation matrix, $g([x, y, z]^T) = [x/z, y/z]^T$, and $[u_{ij}, v_{ij}]$ is a pixel in the original image. These existing motion compensation methods therefore depend on the intrinsic parameter matrix.
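For reference, the conventional compensation of Equation (9) amounts to warping pixels through the homography $K\,\mathrm{Path}_k^{\mathrm{pre}} (\mathrm{Path}_k^{\mathrm{cur}})^T K^{-1}$; a minimal sketch (our illustration, with hypothetical names):

```python
import numpy as np

def warp_with_intrinsics(K, path_pre_k, path_cur_k, uv):
    """Eq. (9): map original pixels (uv, an N x 2 array) to stabilized
    positions via the homography K * R_pre * R_cur^T * K^-1."""
    H_mat = K @ path_pre_k @ path_cur_k.T @ np.linalg.inv(K)
    pts = np.column_stack([uv, np.ones(len(uv))]) @ H_mat.T   # homogeneous rows
    return pts[:, :2] / pts[:, 2:3]                           # g(.): divide by z
```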
Different from the existing methods, the proposed method realizes video stabilization without the camera intrinsic parameter matrix: the image is projected onto a sphere, and a rotation matrix is used to rotate the sphere and thereby compensate the image. The rotational jitter component is calculated from the difference between the smoothed rotation matrix path and the original rotation matrix path, and a relatively stable spherical image is obtained by inversely rotating the spherical surface with this component, as shown in Equation (10):
$$\begin{bmatrix} \tilde{x}_{ij} \\ \tilde{y}_{ij} \\ \tilde{z}_{ij} \end{bmatrix} = R_{\mathrm{res}} \begin{bmatrix} x_{ij} \\ y_{ij} \\ z_{ij} \end{bmatrix}, \tag{10}$$
where $[x_{ij}, y_{ij}, z_{ij}]^T$ is the original spherical point, $[\tilde{x}_{ij}, \tilde{y}_{ij}, \tilde{z}_{ij}]^T$ is the spherical point after rotation, and $R_{\mathrm{res}} = \mathrm{Path}^{\mathrm{cur}} \left( \mathrm{Path}^{\mathrm{pre}} \right)^{-1}$ is the rotational jitter component. Finally, the stable image sequence is obtained by unwrapping the spherical surface back to the plane. In Figure 6, taking the $k$th frame as the reference, the target point $P$ is misaligned between the $k$th and $(k+1)$th frames; in the third sphere, the spherical image of the $(k+1)$th frame is rotated so that the position of $P$ becomes consistent, achieving the image stabilization effect.
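Given the spherical points from the projection step, the compensation of Equation (10) is a single matrix product per frame; a minimal sketch (assuming the H x W x 3 layout of the projection sketch above):

```python
import numpy as np

def compensate_sphere(points, path_pre, path_cur):
    """Eq. (10): rotate every spherical point by the residual jitter
    rotation R_res = Path_cur * Path_pre^-1."""
    R_res = path_cur @ path_pre.T          # inverse of a rotation is its transpose
    return points @ R_res.T                # applies R_res to each (x, y, z) row
```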

4. Experiment and Result Analysis

In this section, two groups of experiments are presented. The first compares different spherical radius values to verify the optimality of the proposed self-calibration method, and the second compares different motion compensation methods to demonstrate the effectiveness of the proposed spherical motion compensation.

4.1. Experiment Setting and Videos

In order to verify the effectiveness of the proposed method, it was implemented in Visual Studio 2015 on a PC (Intel Core i5-8500 CPU, 3.00 GHz, 8 GB RAM). To test the general applicability of the proposal, we collected experimental data from different cameras, each bound with a gyroscope. The joint calibration of the camera and the gyroscope proposed by Fang and Tian [17] was used to align the images with the gyroscope data.
To evaluate the video stabilization effect quantitatively, the peak signal-to-noise ratio (PSNR), the structural similarity index (SSIM) [18], the cropping ratio, the distortion score, and the stability score [19] were used. The PSNR and the SSIM are metrics commonly used in image processing to evaluate the degree of registration between image sequences. The principle of the PSNR is that if the relative change between two adjacent frames is fully compensated, the pixel difference between two stabilized frames should be zero. The SSIM is widely used in video stability estimation; it considers brightness, contrast, and structure information to measure the similarity of two given images. The larger the PSNR value and the closer the SSIM value is to 1, the better the image stabilization effect [18]. The PSNR and the SSIM are defined in Equation (11):
$$\mathrm{PSNR}(I_k, I_{k+1}) = 10 \log_{10}\!\left( \frac{255^2}{\mathrm{MSE}(I_k, I_{k+1})} \right), \qquad \mathrm{SSIM}(I_k, I_{k+1}) = \frac{\left( 2 \mu_{I_k} \mu_{I_{k+1}} + c_1 \right)\left( 2 \sigma_{I_k I_{k+1}} + c_2 \right)}{\left( \mu_{I_k}^2 + \mu_{I_{k+1}}^2 + c_1 \right)\left( \sigma_{I_k}^2 + \sigma_{I_{k+1}}^2 + c_2 \right)}, \tag{11}$$
where $I_k$ and $I_{k+1}$ are the gray images of two adjacent frames; $\mu_{I_k}$ and $\mu_{I_{k+1}}$ are their gray-level means; $\sigma_{I_k}^2$ and $\sigma_{I_{k+1}}^2$ are their gray-level variances; $\sigma_{I_k I_{k+1}}$ is their gray-level covariance; and $c_1 = (k_1 L)^2$ and $c_2 = (k_2 L)^2$, with $k_1 = 0.01$, $k_2 = 0.03$, and $L = 255$, are constants used to maintain numerical stability.
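The metrics of Equation (11) are straightforward to compute. The sketch below evaluates the PSNR and a global (single-window) SSIM; production SSIM implementations average over local windows, so this simplification is our assumption:

```python
import numpy as np

def psnr(I1, I2):
    """PSNR of Eq. (11) for 8-bit grayscale frames (inf for identical frames)."""
    mse = np.mean((I1.astype(np.float64) - I2.astype(np.float64)) ** 2)
    return 10 * np.log10(255.0 ** 2 / mse)

def ssim_global(I1, I2, k1=0.01, k2=0.03, L=255):
    """Global SSIM of Eq. (11) computed over the whole frame."""
    I1, I2 = I1.astype(np.float64), I2.astype(np.float64)
    c1, c2 = (k1 * L) ** 2, (k2 * L) ** 2
    mu1, mu2 = I1.mean(), I2.mean()
    cov = ((I1 - mu1) * (I2 - mu2)).mean()
    num = (2 * mu1 * mu2 + c1) * (2 * cov + c2)
    den = (mu1 ** 2 + mu2 ** 2 + c1) * (I1.var() + I2.var() + c2)
    return num / den
```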
The cropping ratio measures the frame area remaining after empty regions are cropped away. The distortion score is estimated from the affine part of the homography between the input and output frames. The stability score measures the smoothness of the stabilized videos [19].

4.2. Comparison of Different Spherical Radius Values

In order to verify the effectiveness of the spherical radius self-calibration, two groups of experimental data were used: (1) public video and gyroscope data; (2) video collected from our camera with a rigidly attached gyroscope.
Firstly, the public video and gyroscope data of [12] were used; the focal length of the camera was $f_1 = 649$ pixels. The optimal spherical radius calibrated by the method in Section 3.2.2 was $r = 656$ pixels. The deviation between the calibrated value and the focal length is small and acceptable, since the centers of the gyroscope and the camera cannot coincide exactly, and the calibrated value still basically conforms to the imaging model. We varied the spherical radius from 200 to 3000 pixels; the PSNR and SSIM values for the different radii are shown in Figure 7. The best values were obtained at $r = 656$ pixels, which indicates that the calibration result of the spherical radius is reliable. In addition, the farther a radius was from the optimal one, the worse the video stabilization effect.
Secondly, video data were collected from our camera with the attached gyroscope, with the image data and gyroscope data registered. The relationship between the camera and the gyroscope is shown in Figure 8. The focal length of $f = 1468$ pixels was computed through camera calibration, while the spherical radius obtained by the proposed self-calibration method was $r = 1450$ pixels. Three groups of data, each including video data and gyroscope data, were collected to verify the validity of the calibration result. We varied the spherical radius from 200 to 3000 pixels; Figure 9 shows the PSNR and SSIM values of the three videos for the different radii. The PSNR values of all three videos reach their maxima at $r = 1450$ pixels, and the SSIM values are closest to 1 at $r = 1450$ pixels, which indicates that the calibration result of the spherical radius is reliable.
Therefore, the optimal spherical radius was basically consistent with the focal length of the camera, and the results obtained by the proposed self-calibration method conformed to the rotation model of the camera.

4.3. Comparison with the Intrinsic Parameter Matrix Method

In this experiment, data from three different cameras were used to compare the image stabilization effect of the spherical motion compensation against the intrinsic-parameter-matrix compensation methods [12,13,15]. Figure 10 shows thumbnails of the three videos, where video 1 is an indoor scene, video 2 is an outdoor scene, and video 3 is a feature-deficient scene. The resolution of all three videos was 1280 × 720 pixels. We used the PSNR, the SSIM, the cropping ratio, the distortion score, and the stability score to compare the video stabilization effect. The results are shown in Table 1, Table 2 and Table 3; the proposed method achieves better indices. Meanwhile, we compared the runtime, as shown in Table 4. The proposed method is slower than the methods of [13,15], but it remains promising for real-time processing while achieving the best stabilization effect. Moreover, the proposed method does not need to be calibrated in advance.

4.4. Discussion

The proposed stabilization method uses a gyroscope to suppress random jitter effectively through a self-calibrated spherical compensation model. Representative classical gyroscope-based video stabilization methods include [12,13,14,15]. We carried out a comparative analysis against [12,13,15], which demonstrates that the proposed method has advantages in stabilization effect and convenience. The method in [14] designs a special optical flow sensor to assist video stabilization; since neither its sensor data nor its video data are publicly available, a direct comparison with the results reported in [14] is difficult.
In addition, a notable characteristic of the proposed method is that it avoids extra calibration work. In practical applications, any scene can be stabilized simply by fixing a gyroscope on the camera, which is more flexible while preserving the video stabilization effect. The runtime of the proposed method is not the fastest among the compared methods, but it remains promising for real-time processing while achieving the best stabilization effect. In future work, we will optimize the method to reduce its runtime.

5. Conclusions

In this paper, a self-calibration spherical compensation video stabilization method based on a gyroscope has been proposed. The camera motion trajectory is obtained from the gyroscope and smoothed on a Riemannian manifold to obtain the jitter component; a virtual sphere is established at the optical center of the camera, and an objective function over the spherical radius is built from the mean square error of the stabilized video. The optimal spherical radius is determined by solving this objective function, completing the spherical radius calibration. The image is then projected onto the spherical surface, which is rotated inversely according to the jitter component for motion compensation; finally, the spherical image is unwrapped to obtain a stable video sequence. The experimental results showed that the stability metrics, i.e., the PSNR, the SSIM, the cropping ratio, the distortion score, and the stability score, were improved, demonstrating that the proposed method outperforms traditional intrinsic-parameter-matrix compensation methods. Moreover, the proposed method not only maintains the effectiveness of video stabilization but also relaxes the dependence on camera calibration.

Author Contributions

All three authors contributed to this work. Methodology, Z.R. and M.F.; writing—original draft preparation, Z.R.; writing—review and editing, M.F. and C.C.; supervision, C.C.; project administration, M.F.; funding acquisition, M.F. and C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number: U19A2063) and the Jilin Provincial Science & Technology Development Program of China (grant numbers: 20190302113GX and 20170307002GX).

Data Availability Statement

The data used to support this study’s findings are available from the author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhao, M.D.; Ling, Q. Adaptively Meshed Video Stabilization. IEEE Trans. Circuits Syst. Video Technol. 2020, 1.
  2. Wu, R.; Xu, Z.; Zhang, J.; Zhang, L. Robust Global Motion Estimation for Video Stabilization Based on Improved K-Means Clustering and Superpixel. Sensors 2021, 21, 2505.
  3. Akira, H.; Katsuhiro, H. Sensorless Attitude Estimation of Three-degree-of-freedom Actuator for Image Stabilization. Int. J. Appl. Electromagn. Mech. 2021, 6, 249–263.
  4. Shankarpure, M.R.; Abin, D. Video stabilization by mobile sensor fusion. J. Crit. Rev. 2020, 7, 1012–1018.
  5. Raj, R.; Rajiv, P.; Kumar, P.; Khari, M.; Verdú, E.; Crespo, R.G.; Manogaran, G. Feature based video stabilization based on boosted HAAR Cascade and representative point matching algorithm. Image Vis. Comput. 2020, 2020, 103957.
  6. Hu, X.; Olesen, D.; Knudsen, P. Gyroscope Aided Video Stabilization Using Nonlinear Regression on Special Orthogonal Group. In Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 4–8 May 2020; pp. 2707–2711.
  7. Wang, Z.M.; Xu, Z.G. A Survey on Electronic Image Stabilization. J. Image Graph. 2010, 15, 470–480.
  8. Huang, W.Q.; Dang, B.N.; Wang, Y.; Sun, M.X. Two-freedom image stabilization institution of large stroke of bidirectional actuation. Opt. Precis. Eng. 2017, 25, 1494–1501.
  9. Rodriguez-Padilla, I.; Castelle, B.; Marieu, V.; Morichon, D. A Simple and Efficient Image Stabilization Method for Coastal Monitoring Video Systems. Remote Sens. 2020, 12, 70.
  10. Guilluy, W.; Oudre, L.; Beghdadi, A. Video stabilization: Overview, challenges and perspectives. Signal Process. Image Commun. 2021, 2021, 116015.
  11. Cao, M.; Zheng, L.; Jia, W.; Liu, X. Real-time video stabilization via camera path correction and its applications to augmented reality on edge devices. Comput. Commun. 2020, 158, 104–115.
  12. Jia, C.; Evans, B.L. Constrained 3D Rotation Smoothing via Global Manifold Regression for Video Stabilization. IEEE Trans. Signal Process. 2014, 64, 3293–3304.
  13. Yang, J.; Lai, L.; Zhang, L.; Huang, H. Online Video Stabilization Algorithm on Lie Group Manifold. Pattern Recognit. Artif. Intell. 2019, 32, 295–305.
  14. Pengwei, Z.; Yuanji, J.; Chao, D.; Tian, L.; Shichuan, H. Video Stabilization Technique Based on Optical Flow Sensor. Opto-Electron. Eng. 2019, 46, 180581.
  15. Zhuang, B.; Bai, D.; Lee, J. 5D Video Stabilization through Sensor Vision Fusion. In Proceedings of the 2019 IEEE International Conference on Image Processing, Taipei, Taiwan, 22–25 September 2019; pp. 4340–4344.
  16. Ren, Z.W.; Fang, M.; Chen, C.Y.; Kaneko, S.I. Video stabilization algorithm based on virtual sphere model. Electron. Imaging 2021, 30, 1–18.
  17. Fang, M.; Tian, Y. Robust Electronic Image Stabilization Method Based on IMU-Camera Calibration. Inf. Control 2018, 47, 156–165.
  18. Chen, B.H.; Kopylov, A.; Huang, S.C.; Seredin, O.; Karpov, R.; Kuo, S.Y.; Lai, K.R.; Tan, T.-H.; Gochoo, M.; Hung, P.C.K. Improved global motion estimation via motion vector clustering for video stabilization. Eng. Appl. Artif. Intell. 2016, 54, 39–48.
  19. Zhao, M.D.; Ling, Q. PWStableNet: Learning pixel-wise warping maps for video stabilization. IEEE Trans. Image Process. 2020, 2020, 3582–3595.
Figure 1. Framework of the proposed algorithm.
Figure 2. Gyroscope path smoothing: (a) α path smoothing; (b) β path smoothing; (c) γ path smoothing.
Figure 3. Spherical projection model.
Figure 4. Example of spherical projection: (a) two-dimensional (2D) image; (b) spherical projection image.
Figure 5. Camera rotation model.
Figure 6. Spherical rotation compensation model.
Figure 7. Stability assessment of different spherical radii: (a) peak signal-to-noise ratio (PSNR) stability assessment; (b) structural similarity index (SSIM) stability assessment.
Figure 8. The relationship between the camera and the gyroscope.
Figure 9. Assessment of video stabilization results with different spherical radii: (a) comparison of PSNRs; (b) comparison of SSIMs.
Figure 10. Video thumbnails: (a) video 1; (b) video 2; (c) video 3.
Table 1. Video stabilization effect comparison of video 1.

| Evaluation | Method [12] | Method [13] | Method [15] | Proposed |
|---|---|---|---|---|
| PSNR | 24.79 | 24.58 | 25.13 | 25.81 |
| SSIM | 0.83 | 0.87 | 0.85 | 0.88 |
| Cropping ratio | 0.71 | 0.75 | 0.72 | 0.76 |
| Distortion score | 0.68 | 0.65 | 0.62 | 0.69 |
| Stability score | 0.73 | 0.71 | 0.74 | 0.76 |
Table 2. Video stabilization effect comparison of video 2.

| Evaluation | Method [12] | Method [13] | Method [15] | Proposed |
|---|---|---|---|---|
| PSNR | 22.87 | 23.07 | 22.45 | 24.30 |
| SSIM | 0.79 | 0.81 | 0.76 | 0.85 |
| Cropping ratio | 0.68 | 0.62 | 0.65 | 0.69 |
| Distortion score | 0.76 | 0.72 | 0.69 | 0.76 |
| Stability score | 0.71 | 0.73 | 0.71 | 0.74 |
Table 3. Video stabilization effect comparison of video 3.

| Evaluation | Method [12] | Method [13] | Method [15] | Proposed |
|---|---|---|---|---|
| PSNR | 25.84 | 26.21 | 26.08 | 27.89 |
| SSIM | 0.82 | 0.89 | 0.86 | 0.91 |
| Cropping ratio | 0.85 | 0.83 | 0.82 | 0.87 |
| Distortion score | 0.87 | 0.82 | 0.88 | 0.91 |
| Stability score | 0.81 | 0.85 | 0.82 | 0.86 |
Table 4. Single-frame time consumptions of different video stabilization methods.

| | Method [12] | Method [13] | Method [15] | Proposed |
|---|---|---|---|---|
| Time (ms) | 35 | 20 | 25 | 31 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

