Proc. Asilomar Conference on Signals, Systems and Computers,
Nov. 2-5, 2014, pp. 673-677, Pacific Grove, CA.
Real-Time 3D Rotation Smoothing for Video Stabilization
Chao Jia,
Zeina Sinno and
Brian L. Evans
Department of Electrical
and Computer Engineering,
Wireless Networking and Communications Group,
The University of Texas at Austin,
Austin, TX 78712 USA
kurtjc@gmail.com -
zeina@utexas.edu -
bevans@ece.utexas.edu
Paper(1 MB) -
PDF Slides (1 MB) -
PowerPoint Slides (160 MB) -
Project Site
Videos:
Figure 1:
(a) Original Video -
(b) Proposed IIR-like smoothing -
(c) Proposed UKF-like smoothing
Figure 2:
(a) Original Video -
(b) Proposed IIR-like smoothing -
(c) Proposed UKF-like smoothing
Original videos are 480 x 720 and stablized videos are 360 x 540
Abstract
We propose two real-time motion smoothing algorithms for video
stabilization using a pure 3D rotation motion model with known
camera projection parameters.
Both proposed algorithms aim at smoothing 3D rotation matrix
sequences in a causal way.
The first algorithm smooths the 3D rotation sequences in a way
similar to 1st-order IIR filtering.
The second algorithm uses sequential probabilistic estimation
under a constant angular velocity model.
These two algorithms are generalized from classical 2D motion
smoothing algorithms.
We exploit the manifold structure of the rotation matrices so
that the proposed algorithms directly smooth the 3D rotation
sequences on the manifold.
In addition, we introduce a simple projection step in order to
guarantee that no black borders intrude into the stabilized video
frames.
Experimental results show that the proposed algorithms
are able to effectively stabilize video sequences and outperform
their 2D counterparts with less jitter and distortion.
Questions and Answers
- Why do you consider the methods presented real time?
Answer: The first method, the IIR-based, executes a frame every
1.57 ms while the second one, the KF based, takes 6.97 ms.
Given that the acquisition is done with a frame rate below
30 frames/s, the delay for the first method would be less than
50 ms and for the second one less than 250 ms.
As a result we can consider the methods to be online.
- The presented videos are subject to very high distortions.
Are the videos always distorted as such?
If you have less distortion would you expect good results?
Answer: Handheld devices are used by a wide variety of users, so
we would expect a wide range of distortions.
We would expect such distorted videos under some conditions,
e.g if the road is not smooth.
For a proof-of-concept, we present highly distorted cases.
We would expect our algorithm to perform as well when we have
less distortions.
- What is the frame size of the new video?
Answer: 540x360. Originally it was 720x480 in the two provided examples.
COPYRIGHT NOTICE: All the documents on this server have been submitted
by their authors to scholarly journals or conferences as indicated,
for the purpose of non-commercial dissemination of scientific work.
The manuscripts are put on-line to facilitate this purpose.
These manuscripts are copyrighted by the authors or the journals in
which they were published.
You may copy a manuscript for scholarly, non-commercial purposes,
such as research or instruction, provided that you agree to respect
these copyrights.
Last updated 01/31/16.