Skip to main navigation Skip to search Skip to main content

Multi-Modal Integration of 2D and 3D Attributes for Multi-Vehicles Tracking

  • Kyungpook National University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Tracking multiple objects is crucial in autonomous vehicles, but relying on one sensor is unreliable due to potential failures in challenging scenarios. 2D cameras provide texture information, whereas LiDAR offers 3D structural data, each excelling under different conditions. Therefore, combining the features of these two sensors is essential for learning distinct characteristics. Effective fusion is challenging because the modal-ities contain fundamentally different data. In this study, we introduce multi-modal integration of point-level and pixel-level features to enhance feature distinctiveness. We utilize VoxelNet for obtaining multi-scale point cloud representations, and ResNet-50 for 2D image-based feature extraction. Additionally, we assess the benefits of pre-training individual modalities followed by fine-tuning the multi-modal. Our technique achieves MOTA 91.28% and 73.53% HOTA on the KITTI dataset, surpassing many methods without multi-modal integration.

Original languageEnglish
Title of host publicationICTC 2024 - 15th International Conference on ICT Convergence
Subtitle of host publicationAI-Empowered Digital Innovation
PublisherIEEE Computer Society
Pages364-369
Number of pages6
ISBN (Electronic)9798350364637
DOIs
StatePublished - 2024
Event15th International Conference on Information and Communication Technology Convergence, ICTC 2024 - Jeju Island, Korea, Republic of
Duration: 16 Oct 202418 Oct 2024

Publication series

NameInternational Conference on ICT Convergence
ISSN (Print)2162-1233
ISSN (Electronic)2162-1241

Conference

Conference15th International Conference on Information and Communication Technology Convergence, ICTC 2024
Country/TerritoryKorea, Republic of
CityJeju Island
Period16/10/2418/10/24

Keywords

  • autonomous vehicles
  • deep learning
  • merge features
  • Multiple object tracking
  • neural networks

Fingerprint

Dive into the research topics of 'Multi-Modal Integration of 2D and 3D Attributes for Multi-Vehicles Tracking'. Together they form a unique fingerprint.

Cite this