© 2015 IEEE. Objects' spatial layout estimation and clutter identification are two important tasks to understand indoor scenes. We propose to solve both of these problems in a joint framework using RGBD images of indoor scenes. In contrast to recent approaches which focus on either one of these two problems, we perform 'fine grained structure categorization' by predicting all the major objects and simultaneously labeling the cluttered regions. A conditional random field model is proposed to incorporate a rich set of local appearance, geometric features and interactions between the scene elements. We take a structural learning approach with a loss of 3D localisation to estimate the model parameters from a large annotated RGBD dataset, and a mixed integer linear programming formulation for inference. We demonstrate that our approach is able to detect cuboids and estimate cluttered regions across many different object and scene categories in the presence of occlusion, illumination and appearance variations.
|Title of host publication||IEEE Conference on Computer Vision and Pattern Recognition|
|Publisher||IEEE, Institute of Electrical and Electronics Engineers|
|Publication status||Published - 2015|
|Event||Separating objects and clutter in indoor scenes - Boston, Massachusetts, USA|
Duration: 1 Jan 2015 → …
|Conference||Separating objects and clutter in indoor scenes|
|Period||1/01/15 → …|