Skip to main navigation Skip to search Skip to main content

DeCo-MeSC: Deep Compression-Based Memory-Constrained Split Computing Framework for Cooperative Inference of Neural Network

  • Mingyu Sung
  • , Vikas Palakonda
  • , Il Min Kim
  • , Sangseok Yun
  • , Jae Mo Kang
  • Kyungpook National University
  • Queen's University Kingston
  • Pukyong National University

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Split computing (SC) of a deep neural network (DNN) across end nodes is a key enabling technology to realize energy-efficient and low-latency cooperative inference in wireless networks and Internet-of-Things (IoT). In this paper, we propose a novel SC framework based on the concept of deep compression (DC) of DNN considering the strictly limited memory footprint of a mobile device, namely, DeCo-MeSC. In our proposed DeCo-MeSC framework, an initial part of a target DNN (up to so-called the split layer) for the mobile device is compressed with the DC technique to satisfy the memory constraint, while the remaining part of the target DNN (after the split layer) for a cloud server is uncompressed, yet fine-tuned to compensate for the performance loss due to the compression of the initial part. Furthermore, the jointly optimal pair of the split layer and data rate is determined efficiently to maximize the end-to-end inference accuracy under both the end-to-end inference latency and memory constraints. Extensive experimental results demonstrate that the proposed scheme performs markedly better than the existing schemes.

Original languageEnglish
Pages (from-to)13319-13324
Number of pages6
JournalIEEE Transactions on Vehicular Technology
Volume74
Issue number8
DOIs
StatePublished - 2025

Keywords

  • Cooperative inference
  • deep compression
  • deep neural network (DNN)
  • memory constraint
  • split computing

Fingerprint

Dive into the research topics of 'DeCo-MeSC: Deep Compression-Based Memory-Constrained Split Computing Framework for Cooperative Inference of Neural Network'. Together they form a unique fingerprint.

Cite this