TY - JOUR
T1 - Improving Semantic Image Segmentation with a Probabilistic Superpixel-Based Dense Conditional Random Field
AU - Zhang, Liang
AU - Li, Huan
AU - Shen, Peiyi
AU - Zhu, Guangming
AU - Song, Juan
AU - Shah, Syed Afaq Ali
AU - Zhang, Li
AU - Bennamoun, Mohammed
PY - 2018/3/9
Y1 - 2018/3/9
N2 - Deep convolutional neural networks (DCNNs) have been driving significant advances in semantic image segmentation due to their powerful feature representation for recognition. However, their performance in preserving object boundaries is still not satisfactory. Visual mechanism theory indicates that image segmentation tasks require not only recognition, like DCNNs, but also local visual attention capability. Considering that superpixel is good at grasping detailed local structure, we propose a probabilistic superpixel-based dense conditional random field model (PSP-CRF) to refine label assignments as a post-processing optimization method. First, the well-known fully convolutional networks (FCN) and Deeplab-ResNet are employed to produce coarse prediction probabilistic maps at each pixel. Second, we construct a fully connected CRF model based on the PSP generated by the simple linear iterative clustering algorithm. In our approach, an effective refining algorithm with entropy is developed to convert the pixel-level appearance and position features to the normalized PSP, which works well for CRF. Third, our method optimizes the PSP-CRF to obtain the final label assignment results by employing a highly efficient mean field inference algorithm and some quadratic programming relaxation related algorithms. The experiments on the PASCAL VOC segmentation dataset demonstrate the effectiveness of our methods which can improve the segmentation performance of DCNNs to 82% in mIoU while increasing the computational efficiency by 47%.
AB - Deep convolutional neural networks (DCNNs) have been driving significant advances in semantic image segmentation due to their powerful feature representation for recognition. However, their performance in preserving object boundaries is still not satisfactory. Visual mechanism theory indicates that image segmentation tasks require not only recognition, like DCNNs, but also local visual attention capability. Considering that superpixel is good at grasping detailed local structure, we propose a probabilistic superpixel-based dense conditional random field model (PSP-CRF) to refine label assignments as a post-processing optimization method. First, the well-known fully convolutional networks (FCN) and Deeplab-ResNet are employed to produce coarse prediction probabilistic maps at each pixel. Second, we construct a fully connected CRF model based on the PSP generated by the simple linear iterative clustering algorithm. In our approach, an effective refining algorithm with entropy is developed to convert the pixel-level appearance and position features to the normalized PSP, which works well for CRF. Third, our method optimizes the PSP-CRF to obtain the final label assignment results by employing a highly efficient mean field inference algorithm and some quadratic programming relaxation related algorithms. The experiments on the PASCAL VOC segmentation dataset demonstrate the effectiveness of our methods which can improve the segmentation performance of DCNNs to 82% in mIoU while increasing the computational efficiency by 47%.
KW - DCNNs
KW - dense conditional random field (CRF)
KW - semantic image segmentation
KW - superpixel segmentation
UR - http://www.scopus.com/inward/record.url?scp=85043468668&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2018.2814568
DO - 10.1109/ACCESS.2018.2814568
M3 - Article
AN - SCOPUS:85043468668
SN - 2169-3536
VL - 6
SP - 15297
EP - 15310
JO - IEEE Access
JF - IEEE Access
M1 - 2169-3536
ER -