Multi-scale synergy approach for real-time semantic segmentation

Quyen Van Toan, Min Young Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In deep convolution neural network based models for semantic segmentation, diverse receptive fields improve the performance by capturing disparate context information. Multiscale inference is good for both thin and large objects. However, the final result is not optimal through averaging or Max pooling combination. In this paper, we propose an approach to take advantage of multi-scale predictions. Our uncertain-pixels part discovers the worse prediction of a low scale and chooses the complement from a high scale. The final output is effectively merged from two scales. We validate our proposed model with a series of experiments on different datasets. The results achieve the accuracy and speed for real-time semantic segmentation. On Cityscapes dataset, our network achieves 76.3 % mIoU at 50 FPS, and on Mapillary, 42.6 % mIoU.

Original languageEnglish
Title of host publication4th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages216-220
Number of pages5
ISBN (Electronic)9781665458184
DOIs
StatePublished - 2022
Event4th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2022 - Jeju lsland, Korea, Republic of
Duration: 21 Feb 202224 Feb 2022

Publication series

Name4th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2022 - Proceedings

Conference

Conference4th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2022
Country/TerritoryKorea, Republic of
CityJeju lsland
Period21/02/2224/02/22

Keywords

  • Multi-scale
  • real time
  • semantic segmentation

Fingerprint

Dive into the research topics of 'Multi-scale synergy approach for real-time semantic segmentation'. Together they form a unique fingerprint.

Cite this