TY - GEN
T1 - Sparrow ECC
T2 - 29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024
AU - Kim, Hoseok
AU - Choi, Seung Hun
AU - Kong, Joonho
AU - Gong, Young Ho
AU - Chung, Sung Woo
N1 - Publisher Copyright:
© 2024 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
PY - 2024/8/5
Y1 - 2024/8/5
N2 - Exponential growth in deep neural network (DNN) model size has resulted in significant demands for memory bandwidth, leading to the extensive adoption of high bandwidth memory (HBM) in DNN inference. However, with the shorter retention time due to high operating temperature, HBM requires more frequent refresh operations, suffering larger refresh energy/performance overhead. In this paper, we propose Sparrow ECC, a lightweight but stronger HBM ECC technique for less refresh operations while preserving inference accuracy. Sparrow ECC exploits the dominant exponent pattern (i.e., value similarity) in pre-trained DNN weights, limiting the exponent value range of the pre-trained weights to prevent anomalously large weight value change due to the errors. In addition, through duplication and single error correction (SEC) code, Sparrow ECC strongly protects the critical bits in DNN weights. In our evaluation, when the proportion of 1→0 bit errors is 100% and 99%, Sparrow ECC reduces the refresh energy consumption by 90.40% and 93.22%, on average, respectively, compared to the state-of-the-art (RS(19,17)+ZEM [22]) refresh reduction technique, while preserving inference accuracy.
AB - Exponential growth in deep neural network (DNN) model size has resulted in significant demands for memory bandwidth, leading to the extensive adoption of high bandwidth memory (HBM) in DNN inference. However, with the shorter retention time due to high operating temperature, HBM requires more frequent refresh operations, suffering larger refresh energy/performance overhead. In this paper, we propose Sparrow ECC, a lightweight but stronger HBM ECC technique for less refresh operations while preserving inference accuracy. Sparrow ECC exploits the dominant exponent pattern (i.e., value similarity) in pre-trained DNN weights, limiting the exponent value range of the pre-trained weights to prevent anomalously large weight value change due to the errors. In addition, through duplication and single error correction (SEC) code, Sparrow ECC strongly protects the critical bits in DNN weights. In our evaluation, when the proportion of 1→0 bit errors is 100% and 99%, Sparrow ECC reduces the refresh energy consumption by 90.40% and 93.22%, on average, respectively, compared to the state-of-the-art (RS(19,17)+ZEM [22]) refresh reduction technique, while preserving inference accuracy.
KW - DRAM refresh
KW - ECC
KW - deep neural networks
KW - energy efficiency
UR - http://www.scopus.com/inward/record.url?scp=85204966367&partnerID=8YFLogxK
U2 - 10.1145/3665314.3670825
DO - 10.1145/3665314.3670825
M3 - Conference contribution
AN - SCOPUS:85204966367
T3 - Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024
BT - Proceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024
PB - Association for Computing Machinery, Inc
Y2 - 5 August 2024 through 7 August 2024
ER -