Regional Video Object Segmentation by Efficient Motion-Aware Mask Propagation

Research output: Chapter in Book/Conference paperConference paperpeer-review

Abstract

The use of optical flow to aid feature matching has been employed in recent self-supervised video object segmentation (VOS) methods and has shown promising results. However, computing pixel-wise optical flow is costly, and the optical flow can also be further utilized for efficient regional segmentation. To address these challenges, we propose an efficient motion-aware mask propagation approach, dubbed EMMP, for self-supervised VOS. EMMP introduces an efficient patch optical flow to compute the motion offsets of image patches for dynamic matching ROI generation. Fine-grained pixel-wise feature matching is performed based on the dynamic matching ROIs for mask propagation. To reduce redundant segmentation while avoiding unnecessary computations, we re-use the patch optical flow to estimate reliable foreground ROIs in the next frame and perform regional segmentation. Evaluation on benchmark VOS datasets shows that EMMP achieves competitive performance with significant wall-clock speed-ups compared to existing self-supervised training methods, e.g., EMMP slightly outperforms MAMP and runs about 2× faster on segmentation. In addition, EMMP performs on par with many supervised training methods.
Original languageEnglish
Title of host publication2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages1-6
Number of pages6
ISBN (Electronic)9781665456425
ISBN (Print)978-1-6654-5643-2
DOIs
Publication statusPublished - 2 Dec 2022
Event2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA) - Sydney, Australia
Duration: 30 Nov 20222 Dec 2022

Publication series

Name2022 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2022

Conference

Conference2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Period30/11/222/12/22

Fingerprint

Dive into the research topics of 'Regional Video Object Segmentation by Efficient Motion-Aware Mask Propagation'. Together they form a unique fingerprint.

Cite this