Depth Enhancement via Low-rank Matrix Completion

Depth Enhancement via Low-rank Matrix Completion

¹Department of Computer Science, Portland State University
²Department of Computer Science and Engineering, University of Washington

Abstract

Depth captured by consumer RGB-D cameras is often noisy and misses values at some pixels, especially around object boundaries. Most existing methods complete the missing depth values guided by the corresponding color image. When the color image is noisy or the correlation between color and depth is weak, the depth map cannot be properly enhanced. In this paper, we present a depth map enhancement algorithm that performs depth map completion and de-noising simultaneously. Our method is based on the observation that similar RGB-D patches lie in a very low-dimensional subspace. We can then assemble the similar patches into a matrix and enforce this low-rank subspace constraint. This low-rank subspace constraint essentially captures the underlying structure in the RGB-D patches and enables robust depth enhancement against the noise or weak correlation between color and depth. Based on this subspace constraint, our method formulates depth map enhancement as a low-rank matrix completion problem. Since the rank of a matrix changes over matrices, we develop a data-driven method to automatically determine the rank number for each matrix. The experiments on both public benchmark and our own captured RGB-D images show that our method can effectively enhance depth maps.

Paper

Si Lu, Xiaofeng Ren, and Feng Liu. Depth Enhancement via Low-rank Matrix Completion.
IEEE CVPR 2014. PDF

Data

We apply our method on images from three datasets, namely the Middlebury dataset [1, 2, 3, 4], the RGBZ dataset [5] and our own RGBD dataset captured by ASUS Xtion Pro.

You can download our data as a zip package or click the links below for details
Middlebury dataset
RGBZ dataset
RGBD dataset

References

[1] H. Hirschmuller and D. Scharstein. Evaluation of cost functions for stereo matching. In IEEE CVPR, 2007.
[2] D. Scharstein and C. Pal. Learning conditional random fields for stereo. In IEEE CVPR, 2007.
[3] D. Scharstein and R. Szeliski. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vision,
47(1-3):7–42, 2002.
[4] D. Scharstein and R. Szeliski. High-accuracy stereo depth maps using structured light. In IEEE CVPR, 2003.
[5] Christian Richardt, Carsten Stoll, Neil A. Dodgson, Hans-Peter Seidel, and Christian Theobalt. Coherent Spatiotemporal Filtering,
Upsampling and Rendering of RGBZ Videos. Comp. Graph. Forum, 31(2):247–256, 2012.