Interpretation-A Journal of Subsurface Characterization 杂志投稿-刊鹿论文编译

Convolutional neural networks as aid in core lithofacies classification

Abstract:

Artificial intelligence methods have a very wide range of applications. From speech recognition to selfdriving cars, the development of modern deep-learning architectures is helping researchers to achieve new levels of accuracy in different fields. Although deep convolutional neural networks (CNNs) (a kind of deeplearning technique) have reached or surpassed human-level performance in image recognition tasks, little has been done to transport this new image classification technology to geoscientific problems. We have developed what we believe to be the first use of CNNs to identify lithofacies in cores. We use highly accurate models (trained with millions of images) and transfer learning to classify images of cored carbonate rocks. We found that different modern CNN architectures can achieve high levels of lithologic image classification accuracy (approximately 90%) and can aid in the core description task. This core image classification technique has the potential to greatly standardize and accelerate the description process. We also provide the community with a new set of labeled data that can be used for further geologic/data science studies. Introduction Advances in deep learning and artificial intelligence promise to not only drive our cars but to also taste our beer (Gardner et al., 1994; Daily et al., 2017). Specifically, recent advances in the architecture of deep-learning convolutional neural networks (CNNs) have brought the field of image classification and computer vision to a new level. Very deep CNNs emerged in 2014 and have achieved new levels of accuracy in several artificial intelligence classification problems (Szegedy et al., 2014). The current benchmark in object category classification and detection, called ImageNet, consists of hundreds of mixed-object categories and millions of images (Deng et al., 2009; Russakovsky et al., 2015), and it is commonly used to train CNNs. Current CNN models are able to differentiate the image of a leopard from that of a container ship; moreover, they can differentiate images of leopards from their biological cousins — cheetahs and snow leopards (Krizhevsky et al., 2012). Although machine learning has been significantly used in geoscience fields, the application of this technique in core-based lithofacies identification, a key component to better understanding oil and gas reservoirs, is still limited. Machine-learning techniques have been intensely used to aid seismic-facies classification (de Matos et al., 2007, 2011; Roy et al., 2014; Qi et al., 2016; Zhao et al., 2016, 2017; Qian et al., 2018), electrofacies classification (Allen and Pranter, 2016), lithofacies classification from well logs (Baldwin et al., 1990; Zhang et al., 1999; Bestagini et al., 2017), to predict permeability in tight sands (Zhang et al., 2018), and even for seismicity studies (Kortström et al., 2016; Perol et al., 2018; Sinha et al., 2018; Wu et al., 2018). Cored wells are important because they are the only data that provide the ground truth of subsurface reservoirs including the lithofacies variations. The goals of corebased rock-type descriptions are to identify key lithofacies and facies associations; evaluate facies stacking and identify and interpret depositional environments; evaluate the relationships among porosity, permeability, and lithofacies; and help operators to identify optimal zones for designing completions. Traditional corebased lithofacies identification is challenging because it is costly, time consuming, and subjective (e.g., different geologists describing the same core might yield different results). To address some of the core-based lithofacies identification challenges, we evaluate whether a CNN can help a specialist on their image-recognition task. CNN goes hand in hand with the construction and archival of digital databases. Many museums are now The University of Oklahoma, School of Geology and Geophysics, 100 East Boyd Street, RM 710, Norman, Oklahoma 73019, USA and The Geological Survey of Brazil-CPRM, 55 Rua Costa, São Paulo, Brazil. E-mail: rlima@ou.edu (corresponding author). The University of Oklahoma, School of Geology and Geophysics, 100 East Boyd Street, RM 710, Norman, Oklahoma 73019, USA and Oklahoma Geological Survey, 100 East Boyd Street, Room N-131, Norman, Oklahoma 73019, USA. E-mail: huangcienming-1@ou.edu. The University of Oklahoma, School of Geology and Geophysics, 100 East Boyd Street, RM 710, Norman, Oklahoma 73019, USA. E-mail: kmarfurt@ou.edu; matthew.pranter@ou.edu. Manuscript received by the Editor 18 December 2018; published ahead of production 02 April 2019; published online 28 May 2019. This paper appears in Interpretation, Vol. 7, No. 3 (August 2019); p. SF27–SF40, 15 FIGS., 11 TABLES. http://dx.doi.org/10.1190/INT-2018-0245.1. © 2019 Society of Exploration Geophysicists and American Association of Petroleum Geologists. All rights reserved. t Special section: Insights into digital oil field data using artificial intelligence and big data analytics Interpretation / August 2019 SF27 D ow nl oa de d 07 /3 1/ 19 to 6 8. 97 .1 15 .2 6. R ed is tr ib ut io n su bj ec t t o SE G li ce ns e or c op yr ig ht ; s ee T er m s of U se a t h ttp :// lib ra ry .s eg .o rg / busy digitizing and sharing their collections (Blagoderov et al., 2012; Ellwood et al., 2015) With the exception of core measured by deep-sea drilling projects and the like (e.g., NOAA, 2016), core images are not readily available. As an example, more than 100 mi of cores are stored in the Oklahoma Petroleum Information Center, managed by the Oklahoma Geologic Survey. Other states and countries have similar repositories (USGS Core Research Center, 2018). Further digitization of this valuable resource resulting in core images will not only facilitate access to data for traditional analysis but will also provide the information needed to build and calibrate innovative machine-learning algorithms. The work we use here has the potential to organize many miles of slabbed cores into a reliable and coherent system easily accessible to a variety of users. In this paper, we provide one of the first attempts to conduct automated core lithofacies classification using CNN. We begin with an overview of the methodology, which includes data preparation and transfer learning. The details of the CNN method are summarized in tutorial form in Appendix A. Then, we apply CNN to our core data set, and we use confusion matrices, test and validation accuracies, as well as precision, recall, and the F1 score (Fawcett, 2006) computed with the final test set as a means to analyze our results. We conclude with a summary of our findings and suggestions on how our workflow can be extended and improved. Methodology The deep-learning methodology and CNN techniques are now very well-disseminated in diverse fields. LeCun et al. (2015) present details in the construction and the value of deep learning. Dumoulin and Visin (2016) give details on convolutions and other arithmetic steps used in deep-learning algorithms. Although carefully constructed interative papers have been published detailing CNN image transformations and image understanding (e.g., Olah et al., 2017, 2018), CNN may appear to be “magic” and therefore somewhat suspect to the practicing geoscientist. For this reason, Appendix A provides a tutorial that looks under the covers, providing a simple CNN application to classify images into three groups. The work for this paper was developed using open-source computational packages described by Hunter (2007), Chollet (2015), and Abadi et al. (2016) When used for image recognition tasks, CNN models need examples (images) to understand the properties of each “class” that they try to discriminate. Part of the parameters learned for a primary task (such as the ImageNet classification) can be transferred to a secondary task (e.g., lithofacies classification) through the use of transfer learning (Pan and Yang, 2010; Oquab et al., 2014; Yosinski et al., 2014). Our work focuses on using transfer learning of complex CNN architectures to serve our specific image recognition task. The following subsections detail how we prepared our data sets and give a brief explanation of transfer learning. Data preparation We used cores described using traditional methods published by Suriamin and Pranter (2018), capturing images using modern photographic equipment to generate the set of labeled data to feed our CNN. The total section used for this project consists of approximately 700 ft from one core from the Mississippian limestone and chert reservoirs in the Anadarko Shelf, Grant County, Oklahoma. The set of core images shown in Table 1 includes 17 different lithofacies. Two pairs of lithofacies exhibit similar lithology and appearance; we grouped these into a single class for this project. We carefully cropped the images in a standardized fashion, providing consistent input to the CNN. We used a sliding window technique to extract consistent squared cropped sections from the original core images (Figure 1), generating 180 × 180 pixels images representing Table 1. Class number assigned to each lithofacies in the core used in this study. Class Lithofacies Training set Test set 01 Chert breccia in greenish shale matrix 218 3 02 Chert breccia 236 3 03 Skeletal mudstonewackestone 258 4 04 Skeletal grainstone 160 3 05 Splotchy packstone grainstone 344 4 06 Bedded skeletal peloidal packstone-grainstone 416 4 07 Nodular packstone-grainstone 445 11 08 Skeletal peloidal packstonegrainstone Not used Not used 09 Bioturbated skeletal peloidal packstone-grainstone 795 19 10 Bioturbated mudstonewackestone 150 4 11 Brecciated spiculitic mudstone Not used Not used 12 Intraclast spiculitic mudstone Not used Not used 13 Spiculitic mudstonewackestone 3077 79 14 Argillaceous spiculitic mudstone-wackestone 15 Glauconitic sandstone Not used Not used 16 Shale 789 17 17 Shaly claystone Total number of images in each set 6888 151 Note: Classe

Author Listing： Rafael Pires de Lima;Fnu Suriamin;Kurt J. Marfurt;Matthew J. Pranter

Volume： 7

Pages： None

DOI： 10.1190/INT-2018-0245.1

Language： English

Journal： Interpretation

Interpretation-A Journal of Subsurface Characterization

INTERPRETATION-J SUB

影响因子：1.1

是否综述期刊：否

是否OA：否

是否预警：不在预警名单内

发行时间：-

ISSN：2324-8858

发刊频率：-

收录数据库：SCIE/Scopus收录

出版国家/地区：UNITED STATES

出版社：Society of Exploration Geophysicists

期刊介绍

***Jointly published by the American Association of Petroleum Geologists (AAPG) and the Society of Exploration Geophysicists (SEG)***Interpretation is a new, peer-reviewed journal for advancing the practice of subsurface interpretation.

*** 由美国石油地质学家协会（AAPG）和勘探地球物理学家协会（SEG）联合出版 ***《解释》是一本新的同行评审期刊，旨在促进地下解释实践。

年发文量	81
国人发稿量	50
国人发文占比	61.73%
自引率	9.1%
平均录取率	-
平均审稿周期	-
版面费	-
偏重研究方向	GEOCHEMISTRY & GEOPHYSICS-
期刊官网	-
投稿链接	-

质量指标占比

研究类文章占比	OA被引用占比	撤稿占比	出版后修正文章占比
98.77%	2.37%	0.00%	0.98%

预警情况查看说明

时间	预警情况
2024年02月发布的2024版	不在预警名单中
2023年01月发布的2023版	不在预警名单中
2021年12月发布的2021版	不在预警名单中
2020年12月发布的2020版	不在预警名单中

JCR分区 WOS分区等级：Q3区

版本	按学科	分区
WOS期刊SCI分区 WOS期刊SCI分区是指SCI官方（Web of Science）为每个学科内的期刊按照IF数值排序，将期刊按照四等分的方法划分的Q1-Q4等级，Q1代表质量最高，即常说的1区期刊。（2021-2022年最新版）
	GEOCHEMISTRY & GEOPHYSICS	Q3

关于2019年中科院分区升级版（试行）

分区表升级版（试行）旨在解决期刊学科体系划分与学科发展以及融合趋势的不相容问题。由于学科交叉在当代科研活动的趋势愈发显著，学科体系构建容易引发争议。为了打破学科体系给期刊评价带来的桎梏，“升级版方案”首先构建了论文层级的主题体系，然后分别计算每篇论文在所属主题的影响力，最后汇总各期刊每篇论文分值，得到“期刊超越指数”，作为分区依据。

分区表升级版（试行）的优势：一是论文层级的主题体系既能体现学科交叉特点，又可以精准揭示期刊载文的多学科性；二是采用“期刊超越指数”替代影响因子指标，解决了影响因子数学性质缺陷对评价结果的干扰。整体而言，分区表升级版（试行）突破了期刊评价中学科体系构建、评价指标选择等瓶颈问题，能够更为全面地揭示学术期刊的影响力，为科研评价“去四唯”提供解决思路。相关研究成果经过国际同行的认可，已经发表在科学计量学领域国际重要期刊。

《2019年中国科学院文献情报中心期刊分区表升级版（试行）》首次将社会科学引文数据库（SSCI）期刊纳入到分区评估中。升级版分区表（试行）设置了包括自然科学和社会科学在内的18个大类学科。基础版和升级版（试行）将过渡共存三年时间，推测在此期间各大高校和科研院所仍可能会以基础版为考核参考标准。提示：中科院分区官方微信公众号“fenqubiao”仅提供基础版数据查询，暂无升级版数据，请注意区分。

中科院分区查看说明

版本	大类学科	小类学科	Top期刊	综述期刊
	地球科学 4区	GEOCHEMISTRY & GEOPHYSICS 地球化学与地球物理 4区	否	否
2021年12月基础版	地学 4区	GEOCHEMISTRY & GEOPHYSICS 地球化学与地球物理 4区	否	否
2021年12月升级版	地球科学 4区	GEOCHEMISTRY & GEOPHYSICS 地球化学与地球物理 4区	否	否
2020年12月旧的升级版	地球科学 4区	GEOCHEMISTRY & GEOPHYSICS 地球化学与地球物理 4区	否	否
2022年12月最新升级版	地球科学 4区	GEOCHEMISTRY & GEOPHYSICS 地球化学与地球物理 4区	否	否