Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data

Varouchakis, Emmanouil A.

doi:10.3390/jmse9070717

Open AccessTechnical Note

Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data

by

Emmanouil A. Varouchakis

^1,2

¹

School of Sciences and Engineering, University of Crete, 70013 Heraklion, Greece

²

Technical University of Crete, 73100 Chania, Greece

J. Mar. Sci. Eng. 2021, 9(7), 717; https://doi.org/10.3390/jmse9070717

Submission received: 4 June 2021 / Revised: 26 June 2021 / Accepted: 26 June 2021 / Published: 29 June 2021

(This article belongs to the Special Issue Spatial and Spatiotemporal Methods in Marine Science)

Download

Browse Figures

Versions Notes

Abstract

:

In this technical note, a geostatistical model was applied to explore the spatial distribution of source rock data in terms of total organic carbon weight concentration. The median polish kriging method was used to approximate the “row and column effect” in the generated array data, in order for the ordinary kriging methodology to be applied by means of the residuals. Moreover, the sequential Gaussian simulation was employed to quantify the uncertainty of the estimates. The modified Box–Cox technique was applied to normalize the residuals and a cross-validation analysis was performed to evaluate the efficiency of the method. A map of the spatial distribution of total organic carbon weight concentration was constructed along with the 5% and 95% confidence intervals. This work encourages the use of the median polish kriging method for similar applications.

Keywords:

Geostatistics; rock samples; residuals; hydrocarbons; uncertainty; reservoir; Tuscaloosa marine shale

1. Introduction

Onshore and coastal petroleum exploration commonly involves rock samples from different locations, geological formations and depths, which are analyzed for their mineral composition and organic properties. Geostatistics can be used to spatially analyze these properties and provide their spatial distribution in terms of mathematical principles of interdependence. In petroleum resources evaluation studies, geostatistics in the form of kriging have traditionally been used successfully [1,2,3,4,5]. Over the last decades, multi-point techniques, simulations and seismic inversion methods have been developed and successfully applied in reservoir modeling [6,7].

In this work, we present a combined methodology for spatial analysis of the distribution of total organic carbon weight concentration (TOC wt%) from rock samples. The weight percentage of organic carbon in source rocks represents the concentration of organic material. A 0.5% value by weight of TOC is the minimum allowed for an effective source rock sample. Besides, a value of 2% is considered the minimum for shale gas reservoirs. Values greater than 10% are also probable. However, such values indicate the likelihood that kerogen, rather than other types of hydrocarbons, fills the pore space [8].

The median polish method (MEP) combined with the sequentially Gaussian simulation (SGS) is applied to estimate the spatial distribution of the available dataset in an attempt to propose an alternative method to previous approaches for mapping TOC wt% [9]. In addition, it can designate areas that would be potentially under environmental risk during prospective explorations [10]. MEP approximates large scale variations by iteratively removing the median from the rows and columns of the gridded data. The determined residuals can be analyzed with the kriging method by means of the variogram to study the small-scale spatial structure variation of the study variable and to be combined with the estimated median polish trend [11,12]. MEP is widely used in geostatistical applications to overcome bias as well as the influence of extreme values [13,14], while SGS is used in geostatistical simulation applications in various disciplines [15,16,17]. However, the two methods have never been combined for a specific application, especially in the discipline of hydrocarbons. A similar work involving spatial polycyclic aromatic hydrocarbons (PAHs) analysis using indicator kriging and SGS has been successfully conducted [18], suggesting that such approaches can be useful for the spatial analysis of data related to hydrocarbons.

2. Materials and Methods

The data evaluated were from subsurface Mesozoic rock samples from the eastern onshore Gulf Coast Basin (mainly Mississippi and Louisiana), USA [19]. Specifically, 561 samples were collected by the USGS from 2011–2017 to investigate potential undiscovered petroleum resources. Part of the data from the aforementioned dataset was used in a recent study and showed that most samples were derived from a common mixed marine terrigenous source rock [20]. In this work, the determined TOC wt% was geostatistically evaluated in terms of its mean value at each sampling location. Thus, 132 unevenly distributed samples were used to perform the spatial analysis of TOC wt% in the study area (Figure 1). The sample was decomposed into trend and residuals according to the principles of MEP method. The residuals were examined to see if they followed a normal distribution, and a modification of the classic Box–Cox technique [21,22,23] was implemented to transform them.

2.1. Median Polish Kriging (MEPK)

Let

Z (s)

be a random field of sample size

N

. A mean structure can be obtained by additive decomposition of the row and column effect to define irregular gridded spatial data in which grid spacings do not have to be equal in either the horizonal or vertical directions:

u (s_{i}) = a + r_{k} + c_{l}, s_{i} = (x_{l}, y_{k})

(1)

where

r_{k}

and

c_{l}

are the row and column effects, respectively, and a is a constant. The nodes of an overlaid rectangular grid are allocated to the data positions if they are irregularly spaced.

A method called median polish has been proposed to estimate the additive effects given above using median theory in order to avoid bias and the impact of extreme values. [11]. Median polish is applied by recurrent mining until convergence of the row and column medians by means of a convergence criterion. It gives new estimators of

a, r_{k}, c_{l}

which we write as

\tilde{a}, \tilde{r_{k}}, \tilde{c_{l}}

. Thus, the initial spatial data are defined as:

Z (s_{i}) = \tilde{a} + {\tilde{r}}_{k} + {\tilde{c}}_{l} + R (s_{i})

(2)

where

R (s_{i})

corresponds to the residual term which is trend free to consent the application of ordinary kriging (OK),

\tilde{R} (s_{0}) = \sum_{i = 1}^{N} λ_{i} R (s_{i})

(3)

For

s = (x, y)'

in the area bounded by the lines that connect the four nodes:

(x_{l}, y_{k}) ’

;

(x_{l + 1}, y_{k}) ’; (x_{l}, y_{k + 1}) ’

;

(x_{l + 1}, y_{k + 1}) ’

, subject to

x_{l} < x_{l + 1}

and

y_{k} < y_{k + 1},

defines the planar interpolant,

\tilde{u} (s) \equiv \tilde{a} + {\tilde{r}}_{k} + (\frac{y - y_{k}}{y_{k + 1} - y_{k}}) ({\tilde{r}}_{k + 1} - {\tilde{r}}_{k}) + {\tilde{c}}_{l} + (\frac{x - x_{k}}{x_{k + 1} - x_{k}}) ({\tilde{c}}_{k + 1} - {\tilde{c}}_{l})

(4)

Details for the corresponding extrapolation equation can be found in Martínez et al. [13].

Thus, the median polish kriging predictor is provided from Equation (5),

\tilde{Z} (s_{0}) \equiv \tilde{u} (s_{0}) + \tilde{R} (s_{0}) .

(5)

Furthermore, the median polish kriging variance is defined by the following equation:

σ_{M P}^{2} (s_{0}) = γ' Γ^{- 1} γ - {(1' Γ^{- 1} γ - 1)}^{2} / (1' Γ^{- 1} 1)

(6)

where 1 is an

n \times 1

vector of ones,

γ

the variogram vector of the residuals between

s_{j}

(observation point) and

s_{0}

(estimation point).

Γ

is the variogram matrix of the residuals,

(N_{0} + 1) \times (N_{0} + 1)

, at the observation locations [11,13]. The decomposed median predictors are not considered in the MEP variance error estimation. Therefore, to calculate the uncertainty of estimations in a reliable way, SGS is applied.

2.2. Sequential Gaussian Simulation

The sequential Gaussian simulation (SGS) is a stochastic approach for producing equiprobable realizations (maps) of spatial distribution of a variable on a grid by means of kriging methodology. SGS fundamentals are based on drawing from a collection of conditional distributions of univariate realizations. The conditional cumulative distribution function (CDF) is defined as,

F (s_{1} |N) = F (s_{1} |N) \dots F (s_{N} |N - 1)

(7)

SGS is obtained using kriging estimates, where each simulated map is a realization of a multivariate normal process. For conditional SGS, the kriging variance at sampling locations is zero to ensure that the only possible drawings are those of the sampled values. SGS is applied in terms of kriging estimator by means of a linear combination of random variables at location

s_{i}

to estimate the value at location

s_{0}

[4,18]. OK method applies that

z (s)

and

z \in Z

is a random function with a constant but unknown mean. The OK estimate

\hat{z} (s_{0})

at

s_{0}

is calculated based on a weighted sum of the data,

\hat{z} (s_{0}) = \sum_{{i : s_{i} \in S_{0}}} λ_{i} z_{i} (s_{i}) .

(8)

The weights

λ_{i}

depend on the variogram model

γ_{z} (r)

[24] and are calculated by minimizing the mean square estimation error conditionally on the zero-bias constraint [11]. In detail, the following

(N_{0} + 1) \times (N_{0} + 1)

linear system of equations provides the weights

λ_{i}

,

\sum_{{i : s_{i} \in S_{0}}} λ_{i} γ_{z} (s_{i}, s_{j}) + μ = γ_{z} (s_{j}, s_{0}), j = 1, \dots, N_{0}

(9)

\sum_{{i : s_{i} \in S_{0}}} λ_{i} = 1

(10)

where

N_{0}

is the set of points included in the search neighborhood of

s_{0}^{}

,

γ_{z} (s_{i}, s_{j})

and

γ_{z} (s_{j}, s_{0})

are the variograms between two observation points

s_{i}

,

s_{j}

, and between

s_{j}

and the estimation point

s_{0}

, respectively, by means of a theoretical variogram model. The term

μ

corresponds to the Lagrange multiplier imposing the no-bias constraint. Equation (10) implements the zero-bias condition.

Kriging methods provide the estimation of a variable

\hat{z} (s_{0})

accompanied by the associated uncertainty, i.e., corresponding estimation’s error variance. The error variance of OK is independent of data values, is zero at monitoring locations and increases away from them, depends on the data configuration, i.e., complexity of the random field spatial variability as modeled by the variogram, while two identical spatially distant pairs of sampled points have the same variance independently of their values [25].

The SGS algorithm steps are outlined below as follows [18]:

(1): Examine if the original data follow normal distribution and apply transformation.
(2): A node at location $s$ is randomly selected that has not been yet simulated.
(3): Apply kriging estimation at $\hat{z} (s_{0})$ and calculate the corresponding kriging variance $σ_{E}^{2} (s_{0})$ .
(4): Draw a random value from the normal distribution $N (\hat{z} (s_{0}), σ_{E}^{2} (s_{0}))$ , which corresponds to the simulated value.
(5): The newly simulated value is added in the dataset and the process moves to another location.
(6): Repeat the procedure above until there are no locations left.
(7): If needed, back transformation to the original data scale applies to all values.

The main advantage of using geostatistical simulation is that any realization has the same variogram as the data. This is due to the sampling that applies from the conditional distribution during the simulation process [24].

2.3. Variogram

The experimental variogram of the transformed data was first calculated using the method of moments,

{\hat{γ}}_{Z} (r_{k}) = \frac{1}{2 N (r_{k})} \sum_{i, j = 1}^{N (r_{k})} \{{[Z (s_{i}) - Z (s_{j})]}^{2}\}

(11)

where

N (r_{k})

denotes the number of point pairs within class

r_{k}

. The theoretical variogram fit was held using the Spartan variogram function considering the model parameters

θ

,

γ_{Z} (r; θ) = C_{Z} (0; θ) - C_{Z} (r; θ)

(12)

The Spartan function is defined as follows [22,26,27]:

C_{Z} (h; θ) = \{\begin{cases} \frac{η_{0} e^{- h β_{2}}}{2 π \sqrt{| η_{1}^{2} - 4 |}} [\frac{\sin (h β_{1})}{h β_{1}}], for |η_{1}| < 2, σ_{z}^{2} = \frac{η_{0}}{2 π \sqrt{| η_{1}^{2} - 4 |}} \\ \frac{η_{0} e^{- h}}{8 π}, for η_{1} = 2, σ_{z}^{2} = \frac{η_{0}}{8 π} \\ \frac{η_{0} (e^{- h ω_{1}} - e^{- h ω_{2}})}{4 π (ω_{2} - ω_{1}) h \sqrt{| η_{1}^{2} - 4 |}}, for η_{1} > 2, σ_{z}^{2} = \frac{η_{0}}{4 π \sqrt{| η_{1}^{2} - 4 |}} \end{cases}

(13)

where

η_{0}

is the scale factor,

η_{1}

is the rigidity coefficient,

β_{1} = {|2 - η_{1}|}^{1 / 2} / 2

is a dimensionless wavenumber,

β_{2} = {|2 + η_{1}|}^{1 / 2} / 2

and

ω_{1, 2} = {(|η_{1} \mp Δ| / 2)}^{1 / 2}

,

Δ = {|η_{1}^{2} - 4|}^{1 / 2}

,

h = r / ξ

is the normalized lag vector,

ξ

is the correlation length,

h = | h |

is the distance norm and

σ_{z}^{2}

is the variance. The Spartan family models depending on

η_{1}^{}

coefficient values have a characteristic wave behavior near the sill that can capture important increments or decrements of the experimental variogram values. Therefore, this characteristic can provide optimum fitting.

The geostatistical analysis using the proposed methods and tools was performed in Matlab^® environment developing original codes.

2.4. Cross-Validation

The proposed method performance is assessed using a leave one out cross-validation procedure. The estimated values are compared with the corresponding observations using a series of performance metrics. The bias error, the mean absolute error and the linear correlation coefficient were examined:

Bias (optimum value close to 0, positive or negative sign denotes overestimation or underestimation):

ε_{B I A S} = \frac{1}{N} \sum_{i = 1}^{N} \hat{z} (s_{i}) - z (s_{i})

(14)

Mean absolute error (MAE) (optimum value close to 0):

ε_{MA} = \frac{1}{N} \sum_{i = 1}^{N} |\hat{z} (s_{i}) - z (s_{i})|

(15)

Linear correlation coefficient (optimum value close to 1):

R = \frac{\sum_{i = 1}^{N} [z (s_{i}) - \bar{z (s_{i})}] [\hat{z} (s_{i}) - \bar{\hat{z} (s_{i})}]}{\sqrt{\sum_{i = 1}^{N} [z (s_{i}) - \bar{z (s_{i})}]^{2}} \sqrt{\sum_{i = 1}^{N} [\hat{z} (s_{i}) - \bar{\hat{z} (s_{i})}]^{2}}}

(16)

The term

\hat{z} (s_{i})

is the estimation at point

s_{i}

,

z (s_{i})

the observed value,

\bar{z (s_{i})}

the spatial average of the observation data and

\bar{\hat{z} (s_{i})}

the spatial average of the estimations [28].

2.5. Methodology Flowchart

The proposed methodological steps described previously in detail are summarized in a flowchart (Figure 2) presenting the basic steps of the combined geostatistical tools applied to provide a reliable map with the spatial distribution of TOC wt% and the relative uncertainty of estimations.

3. Results

In this technical note, we apply a robust method for generating maps of TOC wt% using non-uniformly distributed monitoring stations and consider the joint application of MEPK with SGS. The sample data (Figure 3) are aligned with the nearest grid nodes according to the methodology, receiving new coordinates that correspond to the grid lines. In the case of several data aligned to a grid node, the median substitutes the data values [14].

The advantage of MEP is the robust fitting of a smooth and flexible trend surface. The MEP procedure first decomposes the data into trend and residuals, as explained in the methodology section. It then tests whether the residuals follow a normal distribution. The characteristics of the sample are shown in Table 1. The residuals’ skewness is equal to 1.68 and kurtosis to 4.86. Thus, TOC wt% residuals do not follow the normal distribution in order for OK and SGS to be suitable for application. Therefore, the modified Box–Cox technique [22,23] was applied to transform the residuals. The metrics are now improved to s = 0.02 and kurtosis, k = 3.26, very close to the desired Gaussian distribution metrics (s = 0 and k = 3.00). The transformed residuals’ histogram is presented in Figure 4.

The residuals’ spatial dependence structure was fitted using the Spartan model (Figure 5). The calculated variogram parameters are

σ_{z}^{2} = 1.38

,

η_{1} = - 1.96

,

ξ = 0.17

and nugget

c = 0.23

wt%².

The MEPK method prediction error by means of cross-validation error (after back transformation of residuals) is presented in Table 2. The cross-validation shows that errors between estimated and measured values are close to zero and significantly correlated. Thus, the method provides satisfactory estimation capability.

Furthermore, in this work, the mean of the TOC wt% simulated estimations (Figure 6) is presented from MEPK using SGS, while error maps (uncertainty) were provided using the 95% (Figure 7) and 5% (Figure 8) confidence intervals of the CDF of the simulations. The provided maps were produced after back-transforming estimations to the original scale.

The spatial analysis of the TOC wt% shows the areas of the case study that have concentrations close to 2%, taking into account the associated uncertainty, which can be further investigated employing information from petroleum geology properties. The results of this work, in comparison to previous studies that employ partial information from the same dataset [20,29], are in direct agreement in terms of the designated locations that have the highest TOC wt% potential. The data used in those works were located in the core of the measurements’ location, using geochemical and spatial analysis for hydrocarbon resources evaluation. Overall, similarly to previous works, the produced map highlights localized areas of high-quality source rock properties (TOC wt%) but of poor source rock properties on average. However, according to [29], additional work to map areas and better zones that may provide an improved potential for recoverable hydrocarbons should apply. Therefore, this work exploits the entire dataset [19], in terms of the mean TOC wt% in variable geological formations, to spatially map, by means of an efficient geostatistical methodology, the average TOC wt% potential in the study area. New locations forming zones of significant TOC wt% are presented at the south-west of the study area. Additionally, this research, compared to previous works [29], contributes, apart from the more detailed and high accuracy spatial analysis, to the uncertainty estimation of the spatial analysis by applying a simulation method that presents possible realizations of the TOC wt% providing the 5% and 95% bounds of the geostatistical method predictions uncertainty. The estimation of uncertainty can help to an integrated analysis of TOC wt% spatial distribution.

SGS is a robust stochastic simulation method that provides realizations that represent spatial patterns without smoothing effect. It exploits the capability to calculate the conditional probability of a univariate random variable given only a number of conditioning values. SGS realizations can be applied to represent all the possible spatial distribution patterns of the study variable and model the estimations uncertainty in TOC wt% evaluation.

4. Conclusions

In summary, MEPK can be characterized as a hybrid method that applies in a two-dimensional surface to estimate spatial data distribution. It combines a geostatistical approach, by means of kriging methodology, and a row and column effect analysis. This work suggests that MEPK combined with SGS can model the spatial distribution of TOC wt% and provide satisfactory estimation metrics and uncertainty quantification. The main advantage of this approach is the application of a robust trend surface based on the statistical properties of the sample. Moreover, depending on the coefficient values, the Spartan variogram model can capture important increments, or decrements, of the experimental variogram shape and provide optimum fitting. Finally, this study presents an alternative methodology for the geostatistical analysis of hydrocarbon-related spatial data.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The author would like to thank the United States Geological Survey (USGS) for providing the data online. Datasets for this research are available in the following website link: https://www.sciencebase.gov/catalog/item/5a96cfd0e4b06990606c4dc3 (accessed on 1 March 2021).

Conflicts of Interest

The author declares no conflict of interest.

References

Goovaerts, P. Geostatistics for Natural Resources Evaluation; Oxford University Press: New York, NY, USA, 1997. [Google Scholar]
Caers, J. Petroleum Geostatistics; Society of Petroleum Engineers: Richardson, TX, USA, 2005. [Google Scholar]
Hohn, M.E. Geostatistics and Petroleum Geology; Springer: Dordrecht, The Netherlands, 1999. [Google Scholar]
Pyrcz, M.J.; Deutsch, C.V. Geostatistical Reservoir Modeling; Oxford University Press: Oxford, UK, 2014. [Google Scholar]
Pereira, H.G.; Costa e Silva, A.; Ribeiro, L.; Guerreiro, L. Estimation of Reserves at Different Phases in the History of an Oil Field; Armstrong, M., Ed.; Geostatistics: Dordrecht, The Netherlands; Springer: Dordrecht, The Netherlands, 1989; pp. 543–555. [Google Scholar]
Azevedo, L.; Grana, D.; Amaro, C. Geostatistical rock physics ava inversion. GeoJI 2019, 216, 1728–1739. [Google Scholar] [CrossRef]
González, E.F.; Mukerji, T.; Mavko, G. Seismic inversion combining rock physics and multiple-point geostatistics. Geophysics 2008, 73, R11–R21. [Google Scholar] [CrossRef]
Peters, K.E.; Xia, X.; Pomerantz, A.E.; Mullins, O.C. Chapter 3-geochemistry applied to evaluation of unconventional resources. In Unconventional Oil and Gas Resources Handbook; Ma, Y.Z., Holditch, S.A., Eds.; Gulf Professional Publishing: Boston, MA, USA, 2016; pp. 71–126. [Google Scholar]
Hristopulos, D.T. Random Fields for Spatial Data Modeling; Springer/Nature: Dordrecht, The Netherlands, 2020. [Google Scholar]
Keramea, P.; Spanoudaki, K.; Zodiatis, G.; Gikas, G.; Sylaios, G. Oil spill modeling: A critical review on current trends, perspectives, and challenges. J. Mar. Sci. Eng. 2021, 9, 181. [Google Scholar] [CrossRef]
Cressie, N. Statistics for Spatial Data, revised ed.; Wiley: New York, NY, USA, 1993; p. 900. [Google Scholar]
Costa, J.F. Interpolating datasets with trends: A modified median polish approach. Comput. Geosci. 2009, 35, 2222–2230. [Google Scholar] [CrossRef]
Martínez, W.A.; Melo, C.E.; Melo, O.O. Median polish kriging for space–time analysis of precipitation. Spat. Stat. 2017, 19, 1–20. [Google Scholar] [CrossRef]
Berke, O. Modified median polish kriging and its application to the wolfcamp–aquifer data. Environmetrics 2001, 12, 731–748. [Google Scholar] [CrossRef]
Ersoy, A.; Yünsel, T.Y. Geostatistical conditional simulation for the assessment of the quality characteristics of cayırhan lignite deposits. Energy Explor. Exploit. 2006, 24, 391–416. [Google Scholar] [CrossRef]
Dimitrakopoulos, R.; Luo, X. Generalized sequential gaussian simulation on group size ν and screen-effect approximations for large field simulations. Math. Geol. 2004, 36, 567–591. [Google Scholar] [CrossRef]
Chen, M.; Zhou, Z.; Zhao, L.; Lin, M.; Guo, Q.; Li, M. Study of the scale effect on permeability in the interlayer shear weakness zone using sequential indicator simulation and sequential gaussian simulation. Water 2018, 10, 779. [Google Scholar] [CrossRef] [Green Version]
Bengtsson, G.; Törneman, N. A spatial approach to environmental risk assessment of pah contamination. Risk Anal. 2009, 29, 48–61. [Google Scholar] [CrossRef] [PubMed]
Enomoto, C.; Lohr, C.; Hackley, P.; Valentine, B.; Dulong, F.; Hatcherian, J. Petroleum geology data from mesozoic rock samples in the eastern us gulf coast collected 2011 to 2017. US Geol. Surv. Data Release 2018. [Google Scholar] [CrossRef]
Hackley, P.C.; Dennen, K.O.; Garza, D.; Lohr, C.D.; Valentine, B.J.; Hatcherian, J.J.; Enomoto, C.B.; Dulong, F.T. Oil-source rock correlation studies in the unconventional upper cretaceous tuscaloosa marine shale (tms) petroleum system, mississippi and louisiana, USA. J. Pet. Sci. Eng. 2020, 190, 107015. [Google Scholar] [CrossRef]
Box, G.E.P.; Cox, D.R. An analysis of transformations. J. R. Stat. Soc. Ser. B 1964, 26, 211–252. [Google Scholar] [CrossRef]
Varouchakis, E.A.; Hristopulos, D.T. Improvement of groundwater level prediction in sparsely gauged basins using physical laws and local geographic features as auxiliary variables. Adv. Water Resour. 2013, 52, 34–49. [Google Scholar] [CrossRef]
Varouchakis, E.A. Gaussian transformation methods for spatial data. Geosciences 2021, 11, 196. [Google Scholar] [CrossRef]
Deutsch, C.V.; Journel, A.G. Gslib. Geostatistical Software Library and User’s Guide; Oxford University Press: New York, NY, USA, 1992. [Google Scholar]
Kitanidis, P.K. Introduction to Geostatistics; Cambridge University Press: Cambridge, MA, USA, 1997. [Google Scholar]
Hristopulos, D.T.; Elogne, S.N. Analytic properties and covariance functions for a new class of generalized gibbs random fields. IEEE Trans. Inf. Theory 2007, 53, 4667–4679. [Google Scholar] [CrossRef] [Green Version]
Varouchakis, E.A.; Hristopulos, D.T. Comparison of spatiotemporal variogram functions based on a sparse dataset of groundwater level variations. Spat. Stat. 2019, 34, 100245. [Google Scholar] [CrossRef]
Pham, H. Springer Handbook of Engineering Statistics; Springer: London, UK, 2006. [Google Scholar]
Lohr, C.D.; Valentine, B.J.; Hackley, P.C.; Dulong, F.T. Characterization of the unconventional tuscaloosa marine shale reservoir in southwestern mississippi, USA: Insights from optical and sem petrography. Mar. Pet. Geol. 2020, 121, 104580. [Google Scholar] [CrossRef]

Figure 1. Spatial distribution of sampling points and geological formations (created in ArcGIS v10).

Figure 2. Methodology flowchart of the combined median polish (MEP) kriging and sequential Gaussian simulation (SGS) geostatistical approach (CDF stands for cumulative distribution function).

Figure 3. Distribution of spatial measurements on a rectangular grid.

Figure 4. Transformed residuals values.

Figure 5. Theoretical variogram fitted on the experimental variogram of the measurements. Stars denote the experimental variogram and the continued black line the Spartan model fit.

Figure 6. Spatial distribution of the mean TOC wt% values of the cumulative distribution function at each estimation point.

Figure 7. Spatial distribution of the 95% percentile TOC wt% values of the cumulative distribution function at each estimation point.

Figure 8. Spatial distribution of the 5% percentile TOC wt% values of the cumulative distribution function at each estimation point.

Table 1. Statistical measures of TOC wt% data.

z_{\min}

: minimum value;

z_{\max}

: maximum value;

z_{0.50}

: median;

m_{z}

: mean;

{\hat{σ}}_{z}

: standard deviation;

{\hat{s}}_{z}

: skewness factor;

{\hat{k}}_{z}

: kurtosis factor.

Table 1. Statistical measures of TOC wt% data.

z_{\min}

: minimum value;

z_{\max}

: maximum value;

z_{0.50}

: median;

m_{z}

: mean;

{\hat{σ}}_{z}

: standard deviation;

{\hat{s}}_{z}

: skewness factor;

{\hat{k}}_{z}

: kurtosis factor.

$z_{\min}$	$m_{z}$	$z_{\max}$	${\hat{σ}}_{z}$	${\hat{s}}_{z}$	${\hat{k}}_{z}$	$z_{0.50}$
0.05 wt%	0.82 wt%	5.24 wt%	0.80 wt%	2.80	13.74	0.67 wt%

Table 2. Leave one out cross-validation estimates of the median polish kriging method.

MAE (TOC wt%)	BIAS (TOC wt%)	R
0.10	0.03	0.94

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Varouchakis, E.A. Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data. J. Mar. Sci. Eng. 2021, 9, 717. https://doi.org/10.3390/jmse9070717

AMA Style

Varouchakis EA. Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data. Journal of Marine Science and Engineering. 2021; 9(7):717. https://doi.org/10.3390/jmse9070717

Chicago/Turabian Style

Varouchakis, Emmanouil A. 2021. "Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data" Journal of Marine Science and Engineering 9, no. 7: 717. https://doi.org/10.3390/jmse9070717

APA Style

Varouchakis, E. A. (2021). Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data. Journal of Marine Science and Engineering, 9(7), 717. https://doi.org/10.3390/jmse9070717

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Median Polish Kriging and Sequential Gaussian Simulation for the Spatial Analysis of Source Rock Data

Abstract

1. Introduction

2. Materials and Methods

2.1. Median Polish Kriging (MEPK)

2.2. Sequential Gaussian Simulation

2.3. Variogram

2.4. Cross-Validation

2.5. Methodology Flowchart

3. Results

4. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI