AI-Powered Lithofacies Prediction and Fault Detection in Complex Thin Gas Reservoirs Offshore Eastern India

Rada, F., Chaveste, A., Roden, R., Smith, T., Dunn, M. | March 2025 | Published with permission: Geophysical Society of Houston

Download PDF

Technical Paper

Summary
Introduction
Geologic Setting
Methodology
Machine Learning (Self-Organizing Maps) for Multi-Attribute Volume Generation
Fault Detection Using Deep Learning
Results
Discussion and Conclusions

Summary

Accurately predicting lithofacies and detecting faults in thin, compartmentalized gas reservoirs is challenging due to complex stratigraphy and limited data resolution. This study addresses these challenges in an Offshore area in Eastern India, where one well intersects thin Pleistocene gas sands. Traditional lithofacies prediction methods, such as seismic inversion, are often constrained by the quality and extent of data. We applied K-means clustering to well log data and used Self-Organizing Maps (SOM) on multi-attribute seismic volumes to develop a high-resolution lithofacies model. This process produced a 95% match between lithofacies from logs and lithofacies from seismic. Convolutional Neural Networks (CNN) further refined fault detection, enhancing the structural understanding of reservoir compartments. Integrating these AI-driven methods resulted in a unified model of lithofacies and faults, significantly improving reservoir characterization. This approach offers an efficient alternative to conventional methods, aiding connectivity assessment and informing exploration strategies in complex geological settings.

Figure 1. Left: Cenozoic stratigraphy of the basin (modified from Indian Plate Lexicon of Stratigraphic Units, 2024). Right: Geological section (modified from Harikrishna et al., 2023).

Introduction

Lithofacies prediction and fault detection in thin, highly compartmentalized gas reservoirs pose significant technical and geological challenges. Complex stratigraphic and structural features hinder hydrocarbon flow and complicate connectivity assessment across reservoir blocks. In this study area, there is only one well intersecting a few thin, gas-bearing Pleistocene sandstones. The limited quantity and variable quality of well data often constrain robust characterization, while the seismic data’s insufficient vertical resolution further challenges the detection of thin reservoirs. Traditional lithofacies prediction relies on rock physics, expected geological trends, and techniques like amplitude versus offset (AVO) analysis and seismic inversion (Avseth et al., 2005; Mavko and Mukerji, 2009; Dvorkin and Nur, 1996). These approaches, though powerful, require extensive expertise, high-quality data, and are often costly, especially in heterogeneous, low-resolution areas (Russell and Hampson, 1991; Veeken and Silva, 2004). In this context, artificial intelligence (AI) offers a promising alternative. Neural networks and classification algorithms, like SOM, are powerful nonlinear cluster analysis and pattern recognition techniques that help geoscientists identify patterns that can be related to lithofacies (Chaveste et al., 2023).

In this study, AI predicts lithofacies volumes by combining K-means clustering of well log data with multi-attribute seismic classification volumes sampled at a 4-millisecond interval. This approach can match or exceed traditional inversion methods in accuracy, with less human intervention and greater tolerance for incomplete data. Additionally, convolutional neural networks (CNN) were employed to generate fault classification and probability volumes (Wu et al., 2019), which, alongside lithofacies volumes, help unravel the area’s complex stratigraphic and structural framework, essential for constructing static and dynamic reservoir models.

Figure 2. Lithofacies log using three petrphysical curves: PHIE, VSH and SW, clustered by K-means in four lithofacies: Shale, GasSand Clean, Gas Sand Shaly and Wet Sand.

Geologic Setting

This study is in a peri-cratonic passive margin basin off the east coast of India (Gupta, 2006). The basin has a layered tectono-sedimentary history shaped by rifting and sedimentation, with two major ectonic phases. This study focuses on the Rajahmundri Sandstone and Ravva Formation. The Ravva is glauconitic and calcareous, while the Rajahmundri is medium-grained and lateritized. Pleistocene-Miocene sedimentation created a complex structure with growth faults, intraslope lobes, and bypass channels (Figure 1).

Methodology

Machine Learning (K-means) for Lithofacies Log Generation

K-means clustering is highly valued in petroleum engineering for classifying well log data due to its efficiency with large datasets common in reservoir analysis. Its straightforward iterative approach enables rapid processing of data across various reservoir scales (Han et al., 2011). When applied to high-dimensional well log data, such as porosity, resistivity, and permeability, K-means effectively uncovers patterns within these various data types to identify lithofacies and assess reservoir quality ( Jain et al., 2010). A significant advantage of K-means is its interpretability; each cluster centroid represents the average properties of a lithofacies, aiding geoscientists in evaluating reservoir characteristics. Compared to more complex algorithms like SVMs or Neural Networks, K-means produces clusters that are easier to interpret, which is valuable for initial exploratory analysis when labeled data may be unavailable or costly (Chen et al., 2018). Additionally, K-means is highly scalable, processing data across numerous wells effectively. By selecting an optimal number of clusters (K), the approach can adapt to specific geological needs for a more comprehensive reservoir characterization. In this study, clustering used Effective Porosity, Volume of Shale, and Water Saturation as input properties. Initially, eight clusters were established, and histograms of each petrophysical curve response helped define four lithofacies: Gas Sand Clean, Gas Sand Shaly, Shale, and Wet Sand.

Machine Learning (Self-Organizing Maps) for Multi-Attribute Volume Generation

The seismic data used in this study originates from a Pre-Stack Time Migration amplitude volume, with a 4-millisecond sample interval and 12.5 meters squared bin size. This amplitude volume served as the base for generating eleven instantaneous seismic attributes: Phase Acceleration, Attenuation, Bandwidth, Envelope, Envelope Slope, Envelope Second Derivative, Hilbert, Instantaneous Frequency, Relative Acoustic Impedance, Sweetness, and Thin Bed. These attributes were classified simultaneously through the Self-Organizing Maps (SOM) neural network (Kohonen, 1982). Smith and Treitel (2015) stated that Self-organizing maps at single sampling interval are a practical way to identify natural clusters in multi-attribute seismic data. Here, the SOM volume was transformed into a log along the well path, allowing direct statistical comparison with the K-means lithofacies log, allowing the geoscientists to link both types of data through the Lithofacies Prediction tool in Paradise®.

Fault Detection Using Deep Learning

The Convolutional Neural Network (CNN) was trained using thousands of synthetic volumes with variations in the characteristics of reflectors (density, noise, dominant frequency, vertical shearing variation, dipping beds) and faults (dip, azimuth, location, displacements, attenuation, among others), allowing the algorithm to detect even the most ‘imperceptible’ faults to the naked eye (Qi et al, 2022).
The workflow starts with a pre-conditioning step of the original seismic amplitude, using a Structure Oriented Filter (SOF3D) that involves the computation of Dip, Similarity and SOF3D volumes for suppressing noise. The CNN will perform the “computer vision” task of comparing the training data with the real filtered data and producing a set of fault attributes that include Fault Probability and Fault
Classification. The next step is the post-processing of the fault images; creating more continuous features for faults (fault enhancement) and then sharpening them in the skeletonization process (Qi et al, 2020).

Figure 3. Lithofacies-to-SOM Process. A) Cartoon of the Lithofacies log versus SOM neuron cross tabulation. B) Stacked bars showinglithofacies log count by each SOM neuron, Contingency Table highlighting Gas Sand Clean Probability by neuron in descending order and Lithofacies Probability honeycomb. C) Actual comparison of Lithofacies log versus SOM with Litho-Log template in the well. — Figure 3. Lithofacies-to-SOM Process. A) Cartoon of the Lithofacies log versus SOM neuron cross tabulation. B) Stacked bars showing lithofacies log count by each SOM neuron, Contingency Table highlighting Gas Sand Clean Probability by neuron in descending order and Lithofacies Probability honeycomb. C) Actual comparison of Lithofacies log versus SOM with Litho-Log template in the well.

Results

In this study, we combined three key methodologies: K-means clustering for lithofacies classification from well logs, Self-Organizing Maps (SOM) for seismic multiattribute classification at 4 milliseconds scale, and seismic fault detection using Convolutional Neural Networks (CNN) to build a high-resolution reservoir model. We first applied K-means clustering to well log data, generating a lithofacies log based on effective porosity, water saturation, and shale volume. Four distinct clusters or lithofacies were identified in the histograms, reflecting their respective data distributions: Shale, Clean Gas
Sand, Shaly Gas Sand, and Wet (Brine) Sand (Figure 2, corners). These clusters were subsequently used to create a lithofacies log for the well, where the most significant sandstone layer encountered measures only 5 meters in thickness (Figure 2, center). As illustrated in Figure 3, the lithofacies log provided the basis for extending facies classification across the seismic SOM volume. The sampling interval scale-based SOM multiattribute seismic analysis enabled resolving the sandstone mentioned above. Each one of the SOM voxels (neurons) extracted along the well path was analyzed according to its lithofacies log, determining in a quantitively manner through contingency tables the probability of each neuron representing a specific lithofacies. With a total of 7549 samples evaluated, the percentage of fit between the lithofacies log and the predicted lithofacies volume is 95%. After grouping similar neurons in the contingency table, several parameters were computed to obtain Cramér’s V equal to 0.57, with a predefined Type I error equal to 0.05, the Chi-square statistic of 2486.26, three degrees of freedom, p-value of 0 and Statistical Power of 0.96. All these statistical parameters provide evidence of a strong
association between the seismic lithofacies prediction (LP) classification volume and the lithofacies log.

Figure 4. 3D Lithofacies Prediction (LP) volume. A) Unlabeled SOM classification Volume without 3D CNN faults. B) Predicted Lithofacies volume with embedded 3D CNN faults.

Figure 5. Perspective view of sculpted Gas Sand geobodies and CNN Faults showing detected structural compartments in the well vicinity and in new zones of interest.

Discussion and Conclusions

The integration of lithofacies log classification, seismic multi-attribute analysis, and fault detection has significantly refined our understanding of the Rajahmundri Sandstone and Ravva Formation within the basin. By combining K-means clustering on well log data with Self-Organizing Maps (SOM) on seismic attributes, we created a highresolution lithofacies model that aligns well with the depositional complexity and structural heterogeneity of the basin.

K-means clustering allowed us to define lithofacies based on porosity, shale volume, and water saturation, providing a direct, interpretable lithofacies log to guide reservoir characterization. This approach is particularly valuable given the stratigraphic unit thin sandstone layers. Importantly, the powerful SOM’s sampling interval multiattribute classification extended lithofacies beyond the well, capturing seismic responses of each lithofacies at a fine spatial scale. This integration is crucial in complex environments where conventional seismic alone might not sufficiently differentiate between lithofacies.

The fault detection using CNN-based seismic analysis added another layer of precision, defining structural compartments and enhancing the lithofacies model’s accuracy. Fault planes, mapped directly into the Gas Sand and Shale lithofacies volumes, revealed structural entrapment mechanisms essential for understanding reservoir connectivity. This integrated approach not only refines the interpretation of reservoir architecture but also streamlines the interpreter’s workflow by bringing together essential elements in a unified model, allowing for a comprehensive view of lithofacies distribution and structural compartmentalization.

By incorporating faults within lithofacies distributions, the model provides a robust framework for geologic continuity and compartmentalization, aligning with observed growth fault systems and toe-thrusts from previous basin studies. This combined approach ultimately delivers a more accurate reservoir model that can drive future exploration and development strategies in the basin by addressing both stratigraphic and structural elements in an integrated, data-driven manner.

References

Avseth et al., 2005; Mavko and Mukerji, 2009; Dvorkin and Nur, 1996.Russell and Hampson, 1991; Veeken and Silva,
2004

Russell and Hampson, 1991; Veeken and Silva, 2004.

Chaveste et al., 2023.

Wu et al., 2019).

Gupta, 2006.

Han et al., 2011.

Jain et al., 2010.

Chen et al., 2018.

Kohonen, 1982.

Qi et al, 2020.

Download PDF

Welcome Back!

Download PDF here

OR

Request access by filling the form below to download full PDF.

Name(Required)

First Last

Email(Required)

CAPTCHA

Case Study: An Integrated Machine Learning-Based Fault Classification Workflow

We build a fully automated machine learning-based fault detection workflow to compute the parameterized fault classification results ...

Case Study with Petrobras: Applying Unsupervised Multi-Attribute Machine Learning for 3D Stratigraphic Facies Classification in a Carbonate Field, Offshore Brazil

This study with Petrobras presents the results of a multi-attribute, machine learning study over a pre-salt carbonate field in the ...

Applying Machine Learning Technologies in the Niobrara Formation, DJ Basin, to Quickly Produce an Integrated Structural and Stratigraphic Seismic Classification Volume Calibrated to Wells

Carolan Laudon, Jie Qi, Yin-Kai Wang, Geophysical Research, LLC (d/b/a Geophysical Insights), University of Houston | Published with permission: Unconventional Resources ...

The Relationship of Self-Organization, Geology, and Machine Learning

Self-organization is the nonlinear formation of spatial and temporal structures, patterns or functions in complex systems (Aschwanden et al., 2018). Simple examples of self-organization include flocks of birds, schools of fish, crystal development, formation of snowflakes, and fractals. What these examples have in common is the appearance of structure or patterns without centralized control. Self-organizing systems are typically governed by power laws, such as the Gutenberg-Richter law of earthquake frequency and magnitude. In addition, the time frames of such systems display a characteristic self-similar (fractal) response, where earthquakes or avalanches for example, occur over all possible time scales (Baas, 2002).

The existence of nonlinear dynamic systems and ordered structures in the earth are well known and have been studied for centuries and can appear as sedimentary features, layered and folded structures, stratigraphic formations, diapirs, eolian dune systems, channelized fluvial and deltaic systems, and many more (Budd, et al., 2014; Dietrich and Jacob, 2018). Each of these geologic processes and features exhibit patterns through the action of undirected local dynamics and is generally termed “self-organization” (Paola, 2014).

Artificial intelligence and specifically neural networks exhibit and reveal self-organization characteristics. The reason for the interest in applying neural networks stems from the fact that they are universal approximators for various kinds of nonlinear dynamical systems of arbitrary complexity (Pessa, 2008). A special class of artificial neural networks is aptly named self-organizing map (SOM) (Kohonen, 1982). It has been found that SOM can identify significant organizational structure in the form of clusters from seismic attributes that relate to geologic features (Strecker and Uden, 2002; Coleou et al., 2003; de Matos, 2006; Roy et al., 2013; Roden et al., 2015; Zhao et al., 2016; Roden et al., 2017; Zhao et al., 2017; Roden and Chen, 2017; Sacrey and Roden, 2018; Leal et al, 2019; Hussein et al., 2020; Hardage et al., 2020; Manauchehri et al., 2020). As a consequence, SOM is an excellent machine learning neural network approach utilizing seismic attributes to help identify self-organization features and define natural geologic patterns not easily seen or seen at all in the data.

Table of contents

Summary

Introduction

Geologic Setting

Methodology

Machine Learning (K-means) for Lithofacies Log Generation

Machine Learning (Self-Organizing Maps) for Multi-Attribute Volume Generation

Fault Detection Using Deep Learning

Results

Discussion and Conclusions

References

Welcome Back!

OR

Request access by filling the form below to download full PDF.

Most Popular Papers

Jan Van De Mortel

Mike Powney

Tim Gibbons

Applying Unsupervised Multi-Attribute Machine Learning for 3D Stratigraphic Facies Classification in a Carbonate Field, Offshore Brazil

Automatic Fault Detection and Applying Machine Learning to Detect Thin Beds

Paradise: A Day in The Life of the Geoscientist

Machine Learning Fault Detection: A Case Study

How to Use Paradise to Interpret Clastic Reservoirs

Machine Learning in the Cloud

Senior Geoscientist

Stratton Field Case Study – New Solutions to Old Problems

Practice Lead, Advertas

Young Professionals – Managing Your Personal Brand to Level-up Your Career

Practice Lead, Advertas

Sr. Geophysicist, Petroleum Oil & Gas Services

Statistical Calibration of SOM results with Well Log Data (Case Study)

Assistant Professor, University of Oklahoma

Research Geophysicist

An Integrated Fault Detection Workflow

Research Geophysicist

An integrated machine learning-based fault classification workflow

Senior Research Geophysicist

Connecting Multi-attribute Classification to Reservoir Properties

Assistant Professor, University of Oklahoma

Gas Hydrates, Reefs, Channel Architecture, and Fizz Gas: SOM Applications in a Variety of Geologic Settings

Sr. Geophysicist, Petroleum Oil & Gas Servicest

Introduction to Automatic Fault Detection and Applying Machine Learning to Detect Thin Beds

Senior Geoscientist and Lead Trainer

President & CEO

Applying Machine Learning Technologies in the Niobrara Formation, DJ Basin, to Quickly Produce an Integrated Structural and Stratigraphic Seismic Classification Volume Calibrated to Wells

Senior Research Geophysicist

Research Geophysicist

Senior Consulting Geophysicist

The Relationship of Self-Organization, Geology, and Machine Learning

Senior Consulting Geophysicist

Investigating the Internal Fabric of VSP data with Attribute Analysis and Unsupervised Machine Learning

President and CEO, Geophysical Insights

Machine Learning for Incomplete Geoscientists

Owner, Auburn Energy

Senior Vice President Business Development

Product Manager