Electronics Engineering Perspectives on Computer Vision Applications: An Overview of Techniques, Sub-areas, Advancements and Future Challenges

Yu Xun Zheng, K. W.G.H.A. Chee, Anand Paul, Jeonghong Kim, H. Lv

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

Abstract

This chapter provides a strategic overview of applications in the computer vision domain. We initially introduce the etymology of computer vision, main tasks, key techniques, and algorithms. Traditional feature extraction methods and deep learning techniques, including prominent algorithms like Region-Based Convolutional Neural Network (R-CNN) and You Only Look Once (YOLO), are explored. We discuss important sub-areas such as image classification, object detection, and image semantic segmentation. The versatility of computer vision is showcased, particularly in autonomous vehicles, healthcare, and surveillance. Furthermore, we delve into the challenges and potential of computer vision, highlighting the necessity for advanced algorithmic methodologies, efficient hardware, robust privacy protections, and conscientious ethical considerations. We also explore upcoming trends, including cross-modal learning, sophisticated ‘vision GPT’ models, and unified models that share architecture and parameters across different tasks. These future directions indicate a transformative impact across various sectors, encompassing autonomous driving, healthcare imaging, and e-commerce. Additionally, we outline the future challenges and trends in the field, underscoring the significance of continuous research and development to address issues such as data scarcity, model interpretability, and privacy concerns. By effectively addressing these challenges and capitalizing on emerging trends, computer vision stands poised to make profound advancements with far-reaching implications. This comprehensive overview aims to provide a solid foundation for understanding the field of computer vision and its potential impact across multiple industries and applications.

Original language: English
Title of host publication: Studies in Computational Intelligence
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 113-142
Number of pages: 30
State: Published - 2023

Publication series

Name: Studies in Computational Intelligence
Volume: 1118
ISSN (Print): 1860-949X
ISSN (Electronic): 1860-9503

Keywords

  • Deep learning
  • Image classification
  • Neural networks
  • Object detection
  • Scientific datasets
