A Novel Data Management Scheme in Cloud for Micromachines

Gurwinder Singh, Rathinaraja Jeyaraj, Anil Sharma, Anand Paul

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

In cyber-physical systems (CPS), micromachines are typically deployed across a wide range of applications, including smart industry, smart healthcare, and smart cities. Providing on-premises resources for the storage and processing of huge data collected by such CPS applications is crucial. The cloud provides scalable storage and computation resources, typically through a cluster of virtual machines (VMs) with big data tools such as Hadoop MapReduce. In such a distributed environment, job latency and makespan are highly affected by excessive non-local executions due to various heterogeneities (hardware, VM, performance, and workload level). Existing approaches handle one or more of these heterogeneities; however, they do not account for the varying performance of storage disks. In this paper, we propose a prediction-based method for placing data blocks in virtual clusters to minimize the number of non-local executions. This is accomplished by applying a linear regression algorithm to determine the performance of disk storage on each physical machine hosting a virtual cluster. This allows us to place data blocks and execute map tasks where the data blocks are located. Furthermore, map tasks are scheduled based on VM performance to reduce job latency and makespan. We simulated our ideas and compared them with the existing schedulers in the Hadoop framework. The results show that the proposed method improves MapReduce performance in terms of job latency and makespan by minimizing non-local executions compared to other methods taken for evaluation.

Original languageEnglish
Article number3807
JournalElectronics (Switzerland)
Volume12
Issue number18
DOIs
StatePublished - Sep 2023

Keywords

  • MapReduce scheduling
  • cyber-physical system
  • data block placement
  • data locality

Fingerprint

Dive into the research topics of 'A Novel Data Management Scheme in Cloud for Micromachines'. Together they form a unique fingerprint.

Cite this