TY - JOUR
T1 - Data-driven sequence labeling methods incorporating the long-range spatial variation of geological data for lithofacies sequence estimation
AU - Park, Gyeong Tae
AU - Jeong, Jina
AU - Emelyanova, Irina
AU - Pervukhina, Marina
AU - Esteban, Lionel
AU - Yun, Seong Taek
N1 - Publisher Copyright:
© 2021
PY - 2022/1
Y1 - 2022/1
N2 - The use of geophysical well-log data for interpreting the stratigraphic lithofacies sequence is cost effective. In this study, several data-driven lithofacies sequence estimation models are developed, where long- and short-term memory (LSTM) and bidirectional LSTM (BLSTM) are applied to efficiently complement the long-range spatial variation of successive well-log and lithofacies measurements. During the development, the models using the autoregressive (AR) input variables of lithofacies are designed to incorporate the lithofacies sequence pattern into the estimation. The performances of the proposed models are comparatively validated with an artificial deep neural network (DNN)-based model that does not consider long-range variation. Accordingly, a total of six estimation models are examined: DNN, AR-DNN, LSTM, AR-LSTM, BLSTM, and AR-BLSTM. For model implementation, synthetic data and actual data acquired from the Satyr-5 well in Western Australia are used. For the synthetic data, the results indicate that the incorporation of nonstationary statistical information improves the performance of BLSTM-based models. In addition, AR-input information is effective with respect to the estimation of the vertical thickness of lithofacies. The advantage of using AR inputs can also be observed for actual data, where AR-based models perform significantly better than the other models. Quantitatively speaking, the fitness of DNN-, LSTM-, and BLSTM-based models is 81.79 %, 84.53 %, and 85.08 %, respectively, whereas that of AR-DNN-, AR-LSTM-, and AR-BLSTM-based models is 85.43 %, 85.72 %, and 86.56 %, respectively. The proposed models are expected to be useful with respect to interpreting the heterogeneity of lithofacies distribution in a cost-effective and computationally efficient way. Particularly, BLSTM-based models are widely applicable because they perform well regardless the spatial statistics of lithofacies sequences.
AB - The use of geophysical well-log data for interpreting the stratigraphic lithofacies sequence is cost effective. In this study, several data-driven lithofacies sequence estimation models are developed, where long- and short-term memory (LSTM) and bidirectional LSTM (BLSTM) are applied to efficiently complement the long-range spatial variation of successive well-log and lithofacies measurements. During the development, the models using the autoregressive (AR) input variables of lithofacies are designed to incorporate the lithofacies sequence pattern into the estimation. The performances of the proposed models are comparatively validated with an artificial deep neural network (DNN)-based model that does not consider long-range variation. Accordingly, a total of six estimation models are examined: DNN, AR-DNN, LSTM, AR-LSTM, BLSTM, and AR-BLSTM. For model implementation, synthetic data and actual data acquired from the Satyr-5 well in Western Australia are used. For the synthetic data, the results indicate that the incorporation of nonstationary statistical information improves the performance of BLSTM-based models. In addition, AR-input information is effective with respect to the estimation of the vertical thickness of lithofacies. The advantage of using AR inputs can also be observed for actual data, where AR-based models perform significantly better than the other models. Quantitatively speaking, the fitness of DNN-, LSTM-, and BLSTM-based models is 81.79 %, 84.53 %, and 85.08 %, respectively, whereas that of AR-DNN-, AR-LSTM-, and AR-BLSTM-based models is 85.43 %, 85.72 %, and 86.56 %, respectively. The proposed models are expected to be useful with respect to interpreting the heterogeneity of lithofacies distribution in a cost-effective and computationally efficient way. Particularly, BLSTM-based models are widely applicable because they perform well regardless the spatial statistics of lithofacies sequences.
KW - A long-range spatial information
KW - Autoregressive lithofacies
KW - Bidirectional long- and short-term memory
KW - Geophysical well-log data
KW - Lithofacies sequence estimation
KW - Long- and short-term memory
UR - http://www.scopus.com/inward/record.url?scp=85112551786&partnerID=8YFLogxK
U2 - 10.1016/j.petrol.2021.109345
DO - 10.1016/j.petrol.2021.109345
M3 - Article
AN - SCOPUS:85112551786
SN - 0920-4105
VL - 208
JO - Journal of Petroleum Science and Engineering
JF - Journal of Petroleum Science and Engineering
M1 - 109345
ER -