TY - GEN
T1 - K-RAF
T2 - 33rd International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2024
AU - Jang, Youngwoo
AU - Byun, Jiseob
AU - Kwon, Soonbeom
AU - Choi, Illyoung
AU - Nam, Dukyun
AU - Tak, Byungchul
AU - Na, Gap Joo
AU - Suh, Young Kyoon
N1 - Publisher Copyright:
© 2024 held by the owner/author(s).
PY - 2024/6/3
Y1 - 2024/6/3
N2 - Internet of Things (IoT) (or edge) devices are typically resource-constrained in terms of CPU, memory, and storage. Thus, it is viable for the devices to request resource provisioning to an edge server in the presence of growing data and heavy computation, as the edge server provides better accessibility than cloud servers. Consequently, the edge devices often perform computation and storage provisioning to the edge servers in large-scale data operations. However, the conventional methods for provisioning edge devices take into little consideration the characteristics of resources that jobs executed at the devices rely on. In particular, fully migrating computation jobs from the device to the server may waste valuable resources of the server without considering the computation and I/O characteristics of the jobs, thereby making the devices' resources idle. To overcome these limitations, we propose a novel Kubernetes-based resource augmentation framework, termed K-RAF, for provisioning edge devices with limited capabilities and accelerating the devices' job processing. Our experiment demonstrates that utilizing GPU acceleration, on average, K-RAF can run tasks 306 times faster than local computation on an edge device. Also, we show that utilizing the task distribution between an edge device and K-RAF can offer an average speedup of about 40% compared to K-RAF alone.
AB - Internet of Things (IoT) (or edge) devices are typically resource-constrained in terms of CPU, memory, and storage. Thus, it is viable for the devices to request resource provisioning to an edge server in the presence of growing data and heavy computation, as the edge server provides better accessibility than cloud servers. Consequently, the edge devices often perform computation and storage provisioning to the edge servers in large-scale data operations. However, the conventional methods for provisioning edge devices take into little consideration the characteristics of resources that jobs executed at the devices rely on. In particular, fully migrating computation jobs from the device to the server may waste valuable resources of the server without considering the computation and I/O characteristics of the jobs, thereby making the devices' resources idle. To overcome these limitations, we propose a novel Kubernetes-based resource augmentation framework, termed K-RAF, for provisioning edge devices with limited capabilities and accelerating the devices' job processing. Our experiment demonstrates that utilizing GPU acceleration, on average, K-RAF can run tasks 306 times faster than local computation on an edge device. Also, we show that utilizing the task distribution between an edge device and K-RAF can offer an average speedup of about 40% compared to K-RAF alone.
KW - edge devices
KW - kubernetes
KW - private cloud
KW - resource augmentation
UR - http://www.scopus.com/inward/record.url?scp=85204935362&partnerID=8YFLogxK
U2 - 10.1145/3625549.3658826
DO - 10.1145/3625549.3658826
M3 - Conference contribution
AN - SCOPUS:85204935362
T3 - HPDC 2024 - Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing
SP - 364
EP - 366
BT - HPDC 2024 - Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing
PB - Association for Computing Machinery, Inc
Y2 - 3 June 2024 through 7 June 2024
ER -