Sparrow ECC: A Lightweight ECC Approach for HBM Refresh Reduction towards Energy-efficient DNN Inference

Hoseok Kim, Seung Hun Choi, Joonho Kong, Young Ho Gong, Sung Woo Chung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Exponential growth in deep neural network (DNN) model size has created significant demand for memory bandwidth, leading to the extensive adoption of high bandwidth memory (HBM) in DNN inference. However, because high operating temperatures shorten retention time, HBM requires more frequent refresh operations and incurs larger refresh energy and performance overheads. In this paper, we propose Sparrow ECC, a lightweight but stronger HBM ECC technique that enables fewer refresh operations while preserving inference accuracy. Sparrow ECC exploits the dominant exponent pattern (i.e., value similarity) in pre-trained DNN weights, limiting the exponent value range of the pre-trained weights to prevent anomalously large changes in weight values caused by bit errors. In addition, through duplication and a single error correction (SEC) code, Sparrow ECC strongly protects the critical bits in DNN weights. In our evaluation, when the proportion of 1→0 bit errors is 100% and 99%, Sparrow ECC reduces refresh energy consumption by 90.40% and 93.22% on average, respectively, compared to the state-of-the-art refresh reduction technique (RS(19,17)+ZEM [22]), while preserving inference accuracy.
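
To illustrate why limiting the exponent range matters, the following minimal Python sketch shows how one flipped exponent bit can inflate a small weight by many orders of magnitude, and how clamping the exponent field bounds the damage. It is not the Sparrow ECC encoding, which the abstract does not specify. The IEEE 754 float32 weight format, the helper names, and the limit MAX_EXP = 127 (magnitudes below 2) are all assumptions made for illustration.

```python
import struct

# Assumption: weights are IEEE 754 float32 (1 sign, 8 exponent, 23 mantissa bits).
# Assumption: MAX_EXP is a hypothetical upper bound on the biased exponent.
MAX_EXP = 127  # biased exponent 127 corresponds to magnitudes in [1, 2)


def float_to_bits(x: float) -> int:
    """Return the 32-bit pattern of x stored as float32."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_float(bits: int) -> float:
    """Interpret a 32-bit pattern as a float32 value."""
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def flip_bit(x: float, bit: int) -> float:
    """Model a single-bit memory error at position `bit` (0 = LSB)."""
    return bits_to_float(float_to_bits(x) ^ (1 << bit))


def exponent_field(x: float) -> int:
    """Extract the 8-bit biased exponent of x."""
    return (float_to_bits(x) >> 23) & 0xFF


def clamp_exponent(x: float, max_exp: int = MAX_EXP) -> float:
    """Replace an out-of-range exponent with max_exp, keeping sign and mantissa."""
    bits = float_to_bits(x)
    if ((bits >> 23) & 0xFF) > max_exp:
        bits = (bits & ~(0xFF << 23)) | (max_exp << 23)
    return bits_to_float(bits)


weight = 0.0625                      # 2**-4, biased exponent 123
corrupted = flip_bit(weight, 30)     # flip the exponent's most significant bit
repaired = clamp_exponent(corrupted)

print(exponent_field(weight), exponent_field(corrupted))
print(weight, corrupted, repaired)
```

In this sketch the clamped value is still wrong, but its magnitude stays within the assumed weight range instead of growing to about 2**124. Clamping alone does not recover the original weight; in the paper that is the role of the duplication and SEC protection.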

Original language: English
Title of host publication: Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024
Publisher: Association for Computing Machinery, Inc
ISBN (Electronic): 9798400706882
State: Published - 5 Aug 2024
Event: 29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024 - Newport Beach, United States
Duration: 5 Aug 2024 → 7 Aug 2024

Publication series

Name: Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024

Conference

Conference: 29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024
Country/Territory: United States
City: Newport Beach
Period: 5/08/24 → 7/08/24

Keywords

  • DRAM refresh
  • ECC
  • deep neural networks
  • energy efficiency
