TY - CHAP
T1 - Electronics Engineering Perspectives on Computer Vision Applications
T2 - An Overview of Techniques, Sub-areas, Advancements and Future Challenges
AU - Zheng, Yu Xun
AU - Chee, K. W.G.H.A.
AU - Paul, Anand
AU - Kim, Jeonghong
AU - Lv, H.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.
PY - 2023
Y1 - 2023
N2 - This chapter provides a strategic overview of applications in the computer vision domain. We initially introduce the etymology of computer vision, main tasks, key techniques, and algorithms. Traditional feature extraction methods and deep learning techniques, including prominent algorithms like Region-Based Convolutional Neural Network (R-CNN) and You Only Look Once (YOLO), are explored. We discuss important sub-areas such as image classification, object detection, and image semantic segmentation. The versatility of computer vision is showcased, particularly in autonomous vehicles, healthcare, and surveillance. Furthermore, we delve into the challenges and potential of computer vision, highlighting the necessity for advanced algorithmic methodologies, efficient hardware, robust privacy protections, and conscientious ethical considerations. We also explore upcoming trends, including cross-modal learning, sophisticated ‘vision GPT’ models, and unified models that share architecture and parameters across different tasks. These future directions indicate a transformative impact across various sectors, encompassing autonomous driving, healthcare imaging, and e-commerce. Additionally, we outline the future challenges and trends in the field, underscoring the significance of continuous research and development to address issues such as data scarcity, model interpretability, and privacy concerns. By effectively addressing these challenges and capitalizing on emerging trends, computer vision stands poised to make profound advancements with far-reaching implications. This comprehensive overview aims to provide a solid foundation for understanding the field of computer vision and its potential impact across multiple industries and applications.
AB - This chapter provides a strategic overview of applications in the computer vision domain. We initially introduce the etymology of computer vision, main tasks, key techniques, and algorithms. Traditional feature extraction methods and deep learning techniques, including prominent algorithms like Region-Based Convolutional Neural Network (R-CNN) and You Only Look Once (YOLO), are explored. We discuss important sub-areas such as image classification, object detection, and image semantic segmentation. The versatility of computer vision is showcased, particularly in autonomous vehicles, healthcare, and surveillance. Furthermore, we delve into the challenges and potential of computer vision, highlighting the necessity for advanced algorithmic methodologies, efficient hardware, robust privacy protections, and conscientious ethical considerations. We also explore upcoming trends, including cross-modal learning, sophisticated ‘vision GPT’ models, and unified models that share architecture and parameters across different tasks. These future directions indicate a transformative impact across various sectors, encompassing autonomous driving, healthcare imaging, and e-commerce. Additionally, we outline the future challenges and trends in the field, underscoring the significance of continuous research and development to address issues such as data scarcity, model interpretability, and privacy concerns. By effectively addressing these challenges and capitalizing on emerging trends, computer vision stands poised to make profound advancements with far-reaching implications. This comprehensive overview aims to provide a solid foundation for understanding the field of computer vision and its potential impact across multiple industries and applications.
KW - Deep learning
KW - Image classification
KW - Neural networks
KW - Object detection
KW - Scientific datasets
UR - http://www.scopus.com/inward/record.url?scp=85179707551&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-44127-1_6
DO - 10.1007/978-3-031-44127-1_6
M3 - Chapter
AN - SCOPUS:85179707551
T3 - Studies in Computational Intelligence
SP - 113
EP - 142
BT - Studies in Computational Intelligence
PB - Springer Science and Business Media Deutschland GmbH
ER -