Cloud Memory Enabled Code Generation via Online Computing for Seamless Edge AI Operation

Myeongjin Kang, Daejin Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper introduces an innovative architecture designed to enhance the execution of Artificial Intelligence (AI) software on edge devices, which are often constrained by limited hardware resources. The core of proposal is to dynamically adapt AI models through server-mediated parameter updates and learning, thus allowing edge devices to efficiently process AI tasks in real-time and adapt to various operational conditions. By leveraging the computational power of cloud resources for the heavy lifting of AI model training, the computational burden on edge devices is alleviated, enabling them to focus on inference tasks with updated models. This approach significantly improves the operational efficiency and adaptability of edge computing in AI applications. Our architecture employs server-based emulation to monitor and dynamically update edge devices, ensuring their execution is optimized for current conditions. Experimental results demonstrate a substantial reduction in operational time up to 75% compared to traditional edge devices without accelerators and 49 % when compared to devices equipped with accelerators. Moreover, proposed model shows an ability to improve accuracy by 20 % in scenarios with biased inputs through continuous learning and parameter updating, highlighting its adaptability to changing environments. This research contributes to the field of edge computing by demonstrating a viable solution for deploying sophisticated AI models in resource-constrained environments. By offloading computationally intensive tasks to the cloud, proposed architecture ensures that edge devices can operate more efficiently and handle a broader range of AI applications. This study not only underscores the potential of integrating cloud and edge computing to overcome the limitations of edge devices but also opens new avenues for future research in intelligent edge computing systems.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 48th Annual Computers, Software, and Applications Conference, COMPSAC 2024
EditorsHossain Shahriar, Hiroyuki Ohsaki, Moushumi Sharmin, Dave Towey, AKM Jahangir Alam Majumder, Yoshiaki Hori, Ji-Jiang Yang, Michiharu Takemoto, Nazmus Sakib, Ryohei Banno, Sheikh Iqbal Ahamed
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2361-2368
Number of pages8
ISBN (Electronic)9798350376968
DOIs
StatePublished - 2024
Event48th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2024 - Osaka, Japan
Duration: 2 Jul 20244 Jul 2024

Publication series

NameProceedings - 2024 IEEE 48th Annual Computers, Software, and Applications Conference, COMPSAC 2024

Conference

Conference48th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2024
Country/TerritoryJapan
CityOsaka
Period2/07/244/07/24

Keywords

  • Edge computing
  • Embedded system
  • Remote software execution

Fingerprint

Dive into the research topics of 'Cloud Memory Enabled Code Generation via Online Computing for Seamless Edge AI Operation'. Together they form a unique fingerprint.

Cite this