Geophysics for Data Scientists


Trainer(s): Jaap Mondt
Duration: 5 days

Introduction

More and more Deep Learning will play a role not only in society in general but also in the geosciences. Deep Learning resorts under the overall heading of Machine Learning / Artificial Intelligence. In this domain often the word “Algorithms” is used to indicate that computer algorithms are used to obtain results. Also, “Big Data” is mentioned, indicating that these algorithms need a large amount of training data to produce useful results. Many scientists mention “Let the data speak for itself” when referring to Deep Learning, indicating that hidden or latent relationships between observations and classes or values of (desired) outcomes can be derived using these algorithms. Examples are in the field of seismic processing (first arrival picking), interpretation (facies prediction), etc. Often, we resort to statistical relationships. Then Deep Learning enters the game. From a range of labelled data (called instances) we can derive a linear/nonlinear relationship (model in DL terminology) that predicts the label or value (supervised learning) of new data (instances in DL terminology). But sometimes it is already useful if an algorithm can define separate groupings / clusters, which then still need to be interpreted (unsupervised learning). Even more sophisticated is Semisupervised learning: labelled and unlabelled data together are clustered whereby the unlabelled data receives the label of the dominant class present in the cluster.

Domain Experts and Data Scientists

In discussions at the EAGE Digital conference 2024, it was emphasized that not only the Subject Matter Experts (SME’s) had to become familiar with the terminology and methods used by the Data Scientists, but also the Data Scientists must understand what geology and geophysics is about. That doesn’t mean they need to know the ins-and-outs of these subjects, but at least know the terminology and the overall context for which they need to provide the Machine / Deep learning tools. Therefore, this course will be a first step in providing the necessary geophysical background.

The Course

As it is assumed that the Data Scientists are familiar with mathematics and statistics, the course will include advanced geophysical subjects. A general overview of seismic and non-seismic acquisition, processing and interpretation will be followed by various uses of Machine / Deep learning for Geophysical Applications. We will predict lithology and pore fluids as well as Facies to learn the Deep Learning workflows and algorithms needed in geophysics. Use will be made of open-source software: TensorFlow and Keras. Power-point presentations and videos will introduce various aspects, but the emphasis is on computer-based exercises. The exercises deal with pre-conditioning the datasets and applying several methods to classify / cluster the data: Multilayer Perceptron, Support Vector, Nearest Neighbour, AdaBoost, Trees. Non-linear Regression is used to predict porosity. Use will be made of Google Colab and Scikit-Learn. It runs on the Cloud and allows use of a GPU. It is “the way” to learn using a whole range of open-source Deep Learning algorithms for geophysical applications. The course consists of many exercises as I am a strong believer in the paradigm: Tell me and I will forget, show me and I might remember, involve me (through exercises) and I will truly learn.

Learning

At the end of the course participants will have a clear idea of what goes on in Geophysics and how Artificial Intelligence will impact the future of Geosciences. Interactive quizzes using “Mentimeter” are used to enhance the learning.

Intended Audience

Data Scientists who will be cooperating with geoscientists to develop AI methods for exploration and development of hydrocarbons or mineral resources. Also, application for geothermal and CO2 storage are discussed.

Pre-requisites

A good understanding of mathematics, statistics and to some degree of physics.

Note: The course can be adapted to comply with the needs of participants.

Daily Programme:

Day 1
09:00-09:30 Program, Biography, Teams
09:30-10:00 Moodle, Geophysical Methods
10:00-10:15 Refreshments
10:15-10:45 Seismic Data
10:45-11:15 Non-Seismic Data
11:15-12:00 Ex Shot raypaths
12:30-13:00 Gravity
13:00-14:00 Lunch
14:00-14:30 Magnetics
14:30-15:00 Ex Sampling and Aliasing
15:00-15:30 Refreshments
15:30-16:00 Ex Field Record
16:00-16:30 Quiz 1
16:30-17:00 Team A preparation

Day 2
09:00-09:30 Team A: Summary Day 1
09:30-10:00 Seismic Acquisition
10:00-10:15 Refreshments
10:15-10:45 Ex Gravity & Magnetics
10:45-11:15 Wave propagation
11:15-12:00 Ex Reflection & Transmissio
12:30-13:00 Seismic Processing
13:00-14:00 Lunch
14:00-14:30 Stacking & Migration
14:30-15:00 Ex Correlation & Convolution
15:00-15:30 Refreshments
15:30-16:00 Ex Diffraction
16:00-16:30 Quiz 2
16:30-17:00 Team B preparation

Day 3
09:00-09:30 Team B: Summary Day 2
09:30-10:00 Ex Migration diffraction curves
10:00-10:15 Refreshments
10:15-10:45 Quantitative Interpretation
10:45-11:15 Ex Migration wavefronts,
11:15-12:00 AVO/AVA
12:30-13:00 Ex Time-Depth-Conversion: Stretch
13:00-14:00 Lunch
14:00-14:30 Ex Amplitudes
14:30-15:00 Ex Time-Depth-Conversion: Raytracing
15:00-15:30 Refreshments
15:30-16:00 Ex Direct Hydrocarbon Indicators
16:00-16:30 Quiz 3
16:30-17:00 Team C preparation

Day 4
09:00-09:30 Team C: Summary Day
09:30-10:00 Machine Learning for Geophysics
10:00-10:15 Refreshments
10:15-10:45 Ex Lith Classification
10:45-11:15 Clustering
11:15-12:00 Ex Facies Clustering
12:30-13:00 DL 4D Brazil
13:00-14:00 Lunch
14:00-14:30 Ex Facies-Merged_Reload_Test-Well_Classification
14:30-15:00 Deep Learning for Geophysics
15:00-15:30 Refreshments
15:30-16:00 Ex Oil Saturation Regression
16:00-16:30 Quiz 4
16:30-17:00 Team D preparation

Day 5
09:00-09:30 Team D: Summary Day 4
09:30-10:00 You ain’t seen nothing yet
10:00-10:15 Refreshments
10:15-10:45 Ex Salt-Segmentation-CNN
10:45-11:15 Semi-supervised Learning in Geophysics
11:15-12:00 Ex Salt-Segmentation-U-net
12:30-13:00 Inversion versus AI
13:00-14:00 Lunch
14:00-14:30 VOI-Geothermal & CO2 Sequestration
14:30-15:00 Ex AI Inversion
15:00-16:00 Refreshments
15:30-16:30 Quiz 5
16:00-16:30 ChatGPT: questions you never dared to ask!

Interactive Blended Learning Programme