Learning to Segment Dynamic Objects Using SLAM Outliers. Bojko, A., Dupont, R., Tamaazousti, M., & Le Borgne, H. In International Conference on Pattern Recognition (ICPR), pages 9780–9787, 2021.
Learning to Segment Dynamic Objects Using SLAM Outliers [link]Pdf  doi  abstract   bibtex   1 download  
We present a method to automatically learn to segment dynamic objects using SLAM outliers. It requires only one monocular sequence per dynamic object for training and consists in localizing dynamic objects using SLAM outliers, creating their masks, and using these masks to train a semantic segmentation network. We integrate the trained network in ORB-SLAM 2 and LDSO. At runtime we remove features on dynamic objects, making the SLAM unaffected by them. We also propose a new stereo dataset and new metrics to evaluate SLAM robustness. Our dataset includes consensus inversions, i.e., situations where the SLAM uses more features on dynamic objects that on the static background. Consensus inversions are challenging for SLAM as they may cause major SLAM failures. Our approach performs better than the State-of-the-Art on the TUM RGB-D dataset in monocular mode and on our dataset in both monocular and stereo modes.
@inproceedings{bojko2020icpr,
  title     = {Learning to Segment Dynamic Objects Using SLAM Outliers},
  author    = {Adrian Bojko and Romain Dupont and Mohamed Tamaazousti and Herv{\'e} {Le Borgne}},
  booktitle = "International Conference on Pattern Recognition (ICPR)",
  pages     = {9780--9787},
  doi       = {10.1109/ICPR48806.2021.9412341},
  url_PDF   = {https://arxiv.org/pdf/2011.06259},
  year      = {2021},
  abstract  = {We present a method to automatically learn to segment dynamic objects using SLAM outliers. It requires only one monocular sequence per dynamic object for training and consists in localizing dynamic objects using SLAM outliers, creating their masks, and using these masks to train a semantic segmentation network. We integrate the trained network in ORB-SLAM 2 and LDSO. At runtime we remove features on dynamic objects, making the SLAM unaffected by them. We also propose a new stereo dataset and new metrics to evaluate SLAM robustness. Our dataset includes consensus inversions, i.e., situations where the SLAM uses more features on dynamic objects that on the static background. Consensus inversions are challenging for SLAM as they may cause major SLAM failures. Our approach performs better than the State-of-the-Art on the TUM RGB-D dataset in monocular mode and on our dataset in both monocular and stereo modes. },
  keywords  = {slam}
}

Downloads: 1