TY - GEN
T1 - Quantifying the performance impact of large pages on in-memory big-data workloads
AU - Park, Jinsu
AU - Han, Myeonggyun
AU - Baek, Woongki
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/10/3
Y1 - 2016/10/3
N2 - In-memory big-data processing is rapidly emerging as a promising solution for large-scale data analytics with highperformance and/or real-time requirements. In-memory bigdata workloads are often hosted on servers that consist of a few multi-core CPUs and large physical memory, exhibiting the non-uniform memory access (NUMA) characteristics. While large pages are commonly known as an effective technique to reduce the performance overheads of virtual memory and widely supported across the modern hardware and system software stacks, relatively little work has been done to investigate their performance impact on in-memory big-data workloads hosted on NUMA systems. To bridge this gap, this work quantifies the performance impact of large pages on in-memory big-data workloads running on a large-scale NUMA system. Our experimental results show that large pages provide no or little performance gains over the 4KB pages when the in-memory big-data workloads process sufficiently large datasets. In addition, our experimental results show that large pages achieve higher performance gains as the dataset size of the in-memory big-data workloads decreases and the NUMA system scale increases. We also discuss the possible performance optimizations for large pages and estimate the potential performance improvements.
AB - In-memory big-data processing is rapidly emerging as a promising solution for large-scale data analytics with highperformance and/or real-time requirements. In-memory bigdata workloads are often hosted on servers that consist of a few multi-core CPUs and large physical memory, exhibiting the non-uniform memory access (NUMA) characteristics. While large pages are commonly known as an effective technique to reduce the performance overheads of virtual memory and widely supported across the modern hardware and system software stacks, relatively little work has been done to investigate their performance impact on in-memory big-data workloads hosted on NUMA systems. To bridge this gap, this work quantifies the performance impact of large pages on in-memory big-data workloads running on a large-scale NUMA system. Our experimental results show that large pages provide no or little performance gains over the 4KB pages when the in-memory big-data workloads process sufficiently large datasets. In addition, our experimental results show that large pages achieve higher performance gains as the dataset size of the in-memory big-data workloads decreases and the NUMA system scale increases. We also discuss the possible performance optimizations for large pages and estimate the potential performance improvements.
UR - http://www.scopus.com/inward/record.url?scp=84994750532&partnerID=8YFLogxK
U2 - 10.1109/IISWC.2016.7581281
DO - 10.1109/IISWC.2016.7581281
M3 - Conference contribution
AN - SCOPUS:84994750532
T3 - Proceedings of the 2016 IEEE International Symposium on Workload Characterization, IISWC 2016
SP - 209
EP - 218
BT - Proceedings of the 2016 IEEE International Symposium on Workload Characterization, IISWC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 IEEE International Symposium on Workload Characterization, IISWC 2016
Y2 - 25 September 2016 through 27 September 2016
ER -