TY - GEN
T1 - An efficient and transparent transaction management based on the data workflow of HVEM DataGrid
AU - Jung, Im Y.
AU - Yeom, Heon Y.
PY - 2008
Y1 - 2008
N2 - This paper proposes an efficient and transparent transaction management scheme for HVEM DataGrid. HVEM DataGrid has its own data workflow, which defines the precedence of the data to be saved. This precedence gives rise to many states in a data management transaction on HVEM DataGrid. We divide the transaction into several sub-transactions according to these states and classify the sub-transactions. Through the resulting transaction hierarchy and parallelism, large image data can be uploaded to HVEM DataGrid efficiently and safely. The transaction management also guarantees effective garbage collection. Garbage collection is important in HVEM DataGrid because the datagrid allows incomplete data insertion and update, so it is not easy to decide whether experimental results and their metadata have been stored completely. In addition, as a shared storage system, HVEM DataGrid is composed of heterogeneous storage systems such as databases and file storage. A transaction that saves data to the datagrid, however, must access those storage systems at the same time, and the access should be transparent to users. The transaction hierarchy handles the simultaneous access and satisfies these requirements. We show through analysis that the proposed scheme is sound, and through performance evaluation and overhead estimation that it is effective. The paper shows that HVEM DataGrid can be a good model of an e-Science datagrid, owing to its efficient and transparent transaction scheme across many heterogeneous storage systems.
AB - This paper proposes an efficient and transparent transaction management scheme for HVEM DataGrid. HVEM DataGrid has its own data workflow, which defines the precedence of the data to be saved. This precedence gives rise to many states in a data management transaction on HVEM DataGrid. We divide the transaction into several sub-transactions according to these states and classify the sub-transactions. Through the resulting transaction hierarchy and parallelism, large image data can be uploaded to HVEM DataGrid efficiently and safely. The transaction management also guarantees effective garbage collection. Garbage collection is important in HVEM DataGrid because the datagrid allows incomplete data insertion and update, so it is not easy to decide whether experimental results and their metadata have been stored completely. In addition, as a shared storage system, HVEM DataGrid is composed of heterogeneous storage systems such as databases and file storage. A transaction that saves data to the datagrid, however, must access those storage systems at the same time, and the access should be transparent to users. The transaction hierarchy handles the simultaneous access and satisfies these requirements. We show through analysis that the proposed scheme is sound, and through performance evaluation and overhead estimation that it is effective. The paper shows that HVEM DataGrid can be a good model of an e-Science datagrid, owing to its efficient and transparent transaction scheme across many heterogeneous storage systems.
KW - Data workflow
KW - HVEM DataGrid
KW - Transaction management
UR - http://www.scopus.com/inward/record.url?scp=57349109935&partnerID=8YFLogxK
U2 - 10.1145/1383529.1383536
DO - 10.1145/1383529.1383536
M3 - Conference contribution
AN - SCOPUS:57349109935
SN - 9781605581569
T3 - CLADE - Proceedings of the 6th International Workshop on Challenges of Large Applications in Distributed Environments 2008, CLADE'08
SP - 35
EP - 43
BT - Proceedings of the 6th International Workshop on Challenges of Large Applications in Distributed Environments 2008, CLADE'08
T2 - 6th International Workshop on Challenges of Large Applications in Distributed Environments 2008, CLADE'08
Y2 - 23 June 2008 through 23 June 2008
ER -