TY - GEN
T1 - Expandable spherical projection and feature fusion methods for object detection from fisheye images
AU - Kim, Songeun
AU - Park, Soon Yong
N1 - Publisher Copyright:
© 2021 MVA Organization.
PY - 2021/7/25
Y1 - 2021/7/25
N2 - One of the key requirements for enhanced autonomous driving systems is accurate detection of the objects from a wide range of view. Large-angle images from a fisheye lens camera can be an effective solution for automotive applications. However, it comes with the cost of strong radial distortions. In particular, the fisheye camera has a photographic effect of exaggerating the size of objects in central regions of the image, while making objects near the marginal area appear smaller. Therefore, we propose the Expandable Spherical Projection that expands center or margin regions to produce straight edges of de-warped objects with less unwanted background in the bounding boxes. In addition to this, we analyze the influence of multi-scale feature fusion in a real-time object detector, which learns to extract more meaningful information for small objects. We present three different types of concatenated YOLOv3-SPP architectures. Moreover, we demonstrate the effectiveness of our proposed projection and feature-fusion using multiple fisheye lens datasets, which shows up to 4.7% AP improvement compared to fisheye images and baseline model.
AB - One of the key requirements for enhanced autonomous driving systems is accurate detection of the objects from a wide range of view. Large-angle images from a fisheye lens camera can be an effective solution for automotive applications. However, it comes with the cost of strong radial distortions. In particular, the fisheye camera has a photographic effect of exaggerating the size of objects in central regions of the image, while making objects near the marginal area appear smaller. Therefore, we propose the Expandable Spherical Projection that expands center or margin regions to produce straight edges of de-warped objects with less unwanted background in the bounding boxes. In addition to this, we analyze the influence of multi-scale feature fusion in a real-time object detector, which learns to extract more meaningful information for small objects. We present three different types of concatenated YOLOv3-SPP architectures. Moreover, we demonstrate the effectiveness of our proposed projection and feature-fusion using multiple fisheye lens datasets, which shows up to 4.7% AP improvement compared to fisheye images and baseline model.
UR - http://www.scopus.com/inward/record.url?scp=85113987286&partnerID=8YFLogxK
U2 - 10.23919/MVA51890.2021.9511379
DO - 10.23919/MVA51890.2021.9511379
M3 - Conference contribution
AN - SCOPUS:85113987286
T3 - Proceedings of MVA 2021 - 17th International Conference on Machine Vision Applications
BT - Proceedings of MVA 2021 - 17th International Conference on Machine Vision Applications
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th International Conference on Machine Vision Applications, MVA 2021
Y2 - 25 July 2021 through 27 July 2021
ER -