Hubbry Logo
KrigingKrigingMain
Open search
Kriging
Community hub
Kriging
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Kriging
Kriging
from Wikipedia
Example of one-dimensional data interpolation by kriging, with credible intervals. Squares indicate the location of the data. The kriging interpolation, shown in red, runs along the means of the normally distributed credible intervals shown in gray. The dashed curve shows a spline that is smooth, but departs significantly from the expected values given by those means.

In statistics, originally in geostatistics, kriging or Kriging (/ˈkrɡɪŋ/), also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging gives the best linear unbiased prediction (BLUP) at unsampled locations.[1] Interpolating methods based on other criteria such as smoothness (e.g., smoothing spline) may not yield the BLUP. The method is widely used in the domain of spatial analysis and computer experiments. The technique is also known as Wiener–Kolmogorov prediction, after Norbert Wiener and Andrey Kolmogorov.

The theoretical basis for the method was developed by the French mathematician Georges Matheron in 1960, based on the master's thesis of Danie G. Krige, the pioneering plotter of distance-weighted average gold grades at the Witwatersrand reef complex in South Africa. Krige sought to estimate the most likely distribution of gold based on samples from a few boreholes. The English verb is to krige, and the most common noun is kriging. The word is sometimes capitalized as Kriging in the literature.

Though computationally intensive in its basic formulation, kriging can be scaled to larger problems using various approximation methods.

Main principles

[edit]
[edit]

Kriging predicts the value of a function at a given point by computing a weighted average of the known values of the function in the neighborhood of the point. The method is closely related to regression analysis. Both theories derive a best linear unbiased estimator based on assumptions on covariances, make use of Gauss–Markov theorem to prove independence of the estimate and error, and use very similar formulae. Even so, they are useful in different frameworks: kriging is made for estimation of a single realization of a random field, while regression models are based on multiple observations of a multivariate data set.

The kriging estimation may also be seen as a spline in a reproducing kernel Hilbert space, with the reproducing kernel given by the covariance function.[2] The difference with the classical kriging approach is provided by the interpretation: while the spline is motivated by a minimum-norm interpolation based on a Hilbert-space structure, kriging is motivated by an expected squared prediction error based on a stochastic model.

Kriging with polynomial trend surfaces is mathematically identical to generalized least squares polynomial curve fitting.

Kriging can also be understood as a form of Bayesian optimization.[3] Kriging starts with a prior distribution over functions. This prior takes the form of a Gaussian process: samples from a function will be normally distributed, where the covariance between any two samples is the covariance function (or kernel) of the Gaussian process evaluated at the spatial location of two points. A set of values is then observed, each value associated with a spatial location. Now, a new value can be predicted at any new spatial location by combining the Gaussian prior with a Gaussian likelihood function for each of the observed values. The resulting posterior distribution is also Gaussian, with a mean and covariance that can be simply computed from the observed values, their variance, and the kernel matrix derived from the prior.

Geostatistical estimator

[edit]

In geostatistical models, sampled data are interpreted as the result of a random process. The fact that these models incorporate uncertainty in their conceptualization does not mean that the phenomenon – the forest, the aquifer, the mineral deposit – has resulted from a random process, but rather it allows one to build a methodological basis for the spatial inference of quantities in unobserved locations and to quantify the uncertainty associated with the estimator.

A real-life DEM patch refined using ordinary kriging (visualized with matplotlib)

A stochastic process is, in the context of this model, simply a way to approach the set of data collected from the samples. The first step in geostatistical modulation is to create a random process that best describes the set of observed data.

A value from location (generic denomination of a set of geographic coordinates) is interpreted as a realization of the random variable . In the space , where the set of samples is dispersed, there are realizations of the random variables , correlated between themselves.

The set of random variables constitutes a random function, of which only one realization is known – the set of observed data. With only one realization of each random variable, it's theoretically impossible to determine any statistical parameter of the individual variables or the function. The proposed solution in the geostatistical formalism consists in assuming various degrees of stationarity in the random function, in order to make the inference of some statistic values possible.

For instance, if one assumes, based on the homogeneity of samples in area where the variable is distributed, the hypothesis that the first moment is stationary (i.e. all random variables have the same mean), then one is assuming that the mean can be estimated by the arithmetic mean of sampled values.

The hypothesis of stationarity related to the second moment is defined in the following way: the correlation between two random variables solely depends on the spatial distance between them and is independent of their location. Thus if and , then:

For simplicity, we define and .

This hypothesis allows one to infer those two measures – the variogram and the covariogram:

where:

;
denotes the set of pairs of observations such that , and is the number of pairs in the set.

In this set, and denote the same element. Generally an "approximate distance" is used, implemented using a certain tolerance.

Linear estimation

[edit]

Spatial inference, or estimation, of a quantity , at an unobserved location , is calculated from a linear combination of the observed values and weights :

The weights are intended to summarize two extremely important procedures in a spatial inference process:

  • reflect the structural "proximity" of samples to the estimation location ;
  • at the same time, they should have a desegregation effect, in order to avoid bias caused by eventual sample clusters.

When calculating the weights , there are two objectives in the geostatistical formalism: unbias and minimal variance of estimation.

If the cloud of real values is plotted against the estimated values , the criterion for global unbias, intrinsic stationarity or wide sense stationarity of the field, implies that the mean of the estimations must be equal to mean of the real values.

The second criterion says that the mean of the squared deviations must be minimal, which means that when the cloud of estimated values versus the cloud real values is more disperse, the estimator is more imprecise.

Methods

[edit]

Depending on the stochastic properties of the random field and the various degrees of stationarity assumed, different methods for calculating the weights can be deduced, i.e. different types of kriging apply. Classical methods are:

  • Ordinary kriging assumes constant unknown mean only over the search neighborhood of .
  • Simple kriging assumes stationarity of the first moment over the entire domain with a known mean: , where is the known mean.
  • Universal kriging assumes a general polynomial trend model, such as linear trend model .
  • IRFk-kriging assumes to be an unknown polynomial in .
  • Indicator kriging uses indicator functions instead of the process itself, in order to estimate transition probabilities.
    • Multiple-indicator kriging is a version of indicator kriging working with a family of indicators. Initially, MIK showed considerable promise as a new method that could more accurately estimate overall global mineral deposit concentrations or grades. However, these benefits have been outweighed by other inherent problems of practicality in modelling due to the inherently large block sizes used and also the lack of mining scale resolution. Conditional simulation is fast, becoming the accepted replacement technique in this case.[citation needed]
  • Disjunctive kriging is a nonlinear generalisation of kriging.
  • Log-normal kriging interpolates positive data by means of logarithms.
  • Latent kriging assumes the various krigings on the latent level (second stage) of the nonlinear mixed-effects model to produce a spatial functional prediction.[4] This technique is useful when analyzing a spatial functional data , where is a time series data over period, is a vector of covariates, and is a spatial location (longitude, latitude) of the -th subject.
  • Co-kriging denotes the joint kriging of data from multiple sources with a relationship between the different data sources.[5] Co-kriging is also possible in a Bayesian approach.[6][7]
  • Bayesian kriging departs from the optimization of unknown coefficients and hyperparameters, which is understood as a maximum likelihood estimate from the Bayesian perspective. Instead, the coefficients and hyperparameters are estimated from their expectation values. An advantage of Bayesian kriging is, that it allows to quantify the evidence for and the uncertainty of the kriging emulator.[8] If the emulator is employed to propagate uncertainties, the quality of the kriging emulator can be assessed by comparing the emulator uncertainty to the total uncertainty (see also Bayesian Polynomial Chaos). Bayesian kriging can also be mixed with co-kriging.[6][7]

Ordinary kriging

[edit]

The unknown value is interpreted as a random variable located in , as well as the values of neighbors samples . The estimator is also interpreted as a random variable located in , a result of the linear combination of variables.

Kriging seeks to minimize the mean square value of the following error in estimating , subject to lack of bias:

The two quality criteria referred to previously can now be expressed in terms of the mean and variance of the new random variable :

Lack of bias

Since the random function is stationary, , the weights must sum to 1 in order to ensure that the model is unbiased. This can be seen as follows:

Minimum variance

Two estimators can have , but the dispersion around their mean determines the difference between the quality of estimators. To find an estimator with minimum variance, we need to minimize .

See covariance matrix for a detailed explanation.

where the literals stand for

Once defined the covariance model or variogram, or , valid in all field of analysis of , then we can write an expression for the estimation variance of any estimator in function of the covariance between the samples and the covariances between the samples and the point to estimate:

Some conclusions can be asserted from this expression. The variance of estimation:

  • is not quantifiable to any linear estimator, once the stationarity of the mean and of the spatial covariances, or variograms, are assumed;
  • grows when the covariance between the samples and the point to estimate decreases. This means that, when the samples are farther away from , the estimation becomes worse;
  • grows with the a priori variance of the variable ; when the variable is less disperse, the variance is lower in any point of the area ;
  • does not depend on the values of the samples, which means that the same spatial configuration (with the same geometrical relations between samples and the point to estimate) always reproduces the same estimation variance in any part of the area ; this way, the variance does not measure the uncertainty of estimation produced by the local variable.
System of equations

Solving this optimization problem (see Lagrange multipliers) results in the kriging system:

The additional parameter is a Lagrange multiplier used in the minimization of the kriging error to honor the unbiasedness condition.

Simple kriging

[edit]
Simple kriging can be seen as the mean and envelope of Brownian random walks passing through the data points.

Simple kriging is mathematically the simplest, but the least general.[9] It assumes the expectation of the random field is known and relies on a covariance function. However, in most applications neither the expectation nor the covariance are known beforehand.

The practical assumptions for the application of simple kriging are:

  • Wide-sense stationarity of the field (variance stationary).
  • The expectation is zero everywhere: .
  • Known covariance function .

The covariance function is a crucial design choice, since it stipulates the properties of the Gaussian process and thereby the behaviour of the model. The covariance function encodes information about, for instance, smoothness and periodicity, which is reflected in the estimate produced. A very common covariance function is the squared exponential, which heavily favours smooth function estimates.[10] For this reason, it can produce poor estimates in many real-world applications, especially when the true underlying function contains discontinuities and rapid changes.

System of equations

The kriging weights of simple kriging have no unbiasedness condition and are given by the simple kriging equation system:

This is analogous to a linear regression of on the other .

Estimation

The interpolation by simple kriging is given by

The kriging error is given by

which leads to the generalised least-squares version of the Gauss–Markov theorem (Chiles & Delfiner 1999, p. 159):

Bayesian kriging

[edit]

See also Bayesian Polynomial Chaos

Properties

[edit]
  • The kriging estimation is unbiased: .
  • The kriging estimation honors the actually observed value: (assuming no measurement error is incurred).
  • The kriging estimation is the best linear unbiased estimator of if the assumptions hold. However (e.g. Cressie 1993):[11]
    • As with any method, if the assumptions do not hold, kriging might be bad.
    • There might be better nonlinear and/or biased methods.
    • No properties are guaranteed when the wrong variogram is used. However, typically still a "good" interpolation is achieved.
    • Best is not necessarily good: e.g. in case of no spatial dependence the kriging interpolation is only as good as the arithmetic mean.
  • Kriging provides as a measure of precision. However, this measure relies on the correctness of the variogram.

Applications

[edit]

Although kriging was developed originally for applications in geostatistics, it is a general method of statistical interpolation and can be applied within any discipline to sampled data from random fields that satisfy the appropriate mathematical assumptions. It can be used where spatially related data has been collected (in 2-D or 3-D) and estimates of "fill-in" data are desired in the locations (spatial gaps) between the actual measurements.

To date kriging has been used in a variety of disciplines, including the following:

Design and analysis of computer experiments

[edit]

Another very important and rapidly growing field of application, in engineering, is the interpolation of data coming out as response variables of deterministic computer simulations,[28] e.g. finite element method (FEM) simulations. In this case, kriging is used as a metamodeling tool, i.e. a black-box model built over a designed set of computer experiments. In many practical engineering problems, such as the design of a metal forming process, a single FEM simulation might be several hours or even a few days long. It is therefore more efficient to design and run a limited number of computer simulations, and then use a kriging interpolator to rapidly predict the response in any other design point. Kriging is therefore used very often as a so-called surrogate model, implemented inside optimization routines.[29] Kriging-based surrogate models may also be used in the case of mixed integer inputs.[30]

See also

[edit]

References

[edit]

Further reading

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Kriging is a geostatistical method of spatial that estimates values of a spatial at unsampled locations using a weighted of known observations, incorporating the spatial structure to provide optimal unbiased predictions along with associated measures. Originating in the field of , it was pioneered in the 1950s by South African Danie G. Krige through empirical techniques for estimating gold ore grades from samples. The approach was formalized in 1963 by French mathematician Georges Matheron, who developed the theoretical framework of and named the method "kriging" in Krige's honor. At its core, kriging models the spatial dependence of data through the , a function that quantifies how dissimilarity between observations increases with , enabling the of weights that minimize prediction error. It assumes stationarity in the statistical properties of the field and serves as an exact interpolator, meaning predictions at observed locations match the data exactly. Common variants include ordinary kriging, which estimates a constant unknown ; simple kriging, assuming a known ; and universal kriging, which incorporates deterministic trends like spatial coordinates. These extensions allow kriging to handle non-stationarities and provide the best linear unbiased under Gaussian assumptions, often viewed as a form of regression. Kriging has become a foundational tool in , applied across diverse domains such as for mapping soil contaminants, for groundwater modeling, and for reservoir characterization. Its ability to generate probabilistic estimates distinguishes it from deterministic interpolators like , supporting decision-making in resource estimation and . Since its , advancements in computational tools have expanded its use to large-scale datasets, including and climate modeling, while maintaining its emphasis on rigorous .

Introduction

Definition and Purpose

Kriging is a geostatistical technique that estimates values of a spatial at unsampled locations using observed data points, under the assumption of spatial among nearby observations. This method originated in the field of mining and is now widely applied in , , and resource estimation to predict continuous spatial phenomena such as pollutant concentrations or properties. By modeling the structure of the data, kriging produces predictions that account for the inherent variability and dependence in spatial datasets. The primary purpose of kriging is to deliver unbiased estimates with the minimum possible variance, making it the best linear unbiased (BLUE) for spatial prediction. Unlike simpler deterministic methods like (IDW), which assign weights based solely on Euclidean distances and often oversmooth data, kriging incorporates a probabilistic model of spatial continuity to yield more precise and reliable interpolations, particularly in heterogeneous environments. This optimality ensures that predictions are not systematically biased and have the lowest prediction error among linear combinations of the observed data. In essence, kriging functions as a tailored form of regression for geospatial applications, where the spatial field is treated as a realization of a multivariate Gaussian distribution conditioned on the observations. A practical example is in mineral resource evaluation, where kriging predicts grades at untested sites within a mine based on limited core samples, enabling informed decisions on extraction feasibility and reserve volumes. The method models spatial dependence through tools like the to quantify how similarity decreases with distance.

Historical Development

The origins of kriging trace back to the work of South African mining engineer Danie G. Krige, who in 1951 developed an empirical method for estimating ore grades in mines using weighted averages of nearby drill hole samples to account for spatial variability. This approach, detailed in Krige's MSc and subsequent publication, addressed practical challenges in mine valuation by improving the accuracy of reserve estimates over traditional methods like polygonal estimation. Krige's technique was initially applied routinely in South African mines during the early , marking the first systematic use of spatial in mineral resource evaluation. The theoretical formalization of kriging occurred in 1963 through the efforts of French mathematician Georges Matheron at the , who built upon Krige's empirical results to establish a rigorous geostatistical framework. Matheron coined the term "kriging" in 1962 as a to Krige, integrating concepts like the —introduced to quantify spatial continuity and dependence in data—to derive unbiased linear predictors for unsampled locations. This development transformed Krige's practical tool into a statistically grounded method, emphasizing best linear unbiased under assumptions of spatial stationarity. Kriging evolved rapidly from its roots in the to a foundational geostatistical technique by the , with Matheron's seminal two-volume "Traité de Géostatistique Appliquée" (1962–1963) providing the comprehensive theoretical basis and applications that popularized it beyond ore evaluation. By the , kriging had gained traction in environmental sciences for tasks such as mapping pollutant distributions and groundwater contamination, further facilitated in the by accessible software like the U.S. Agency's Geo-EAS package, which implemented kriging routines for of environmental data. This period solidified kriging's role as a versatile method across disciplines, bridging empirical practices with broader statistical applications.

Mathematical Foundations

Spatial Dependence and Stationarity

Spatial dependence is a fundamental concept in , positing that values of a spatial process at nearby locations exhibit greater similarity than those at distant locations. This principle, known as Tobler's First Law of , states that "everything is related to everything else, but near things are more related than distant things," providing the theoretical foundation for methods like kriging, where predictive weights are derived from spatial proximity and correlation structures. Kriging relies on this dependence to model spatial , assuming that the influence of observed data points diminishes with increasing , thereby enabling unbiased predictions at unsampled s. In practice, this dependence is quantified through tools like the , which measures dissimilarity as a function of separation , though detailed modeling is addressed elsewhere. Stationarity assumptions underpin the validity of these models by ensuring that statistical properties of the spatial process remain consistent across the domain. Strict stationarity requires that the of the process is invariant under spatial translations, implying uniformity in all moments and higher-order dependencies. Second-order stationarity, a weaker and more commonly invoked condition in , assumes a constant mean throughout the domain and a function that depends solely on the lag vector between points, allowing the process variance to be well-defined and separable from . Intrinsic stationarity relaxes these further by focusing on the stationarity of increments rather than the process itself, where the variance of differences between values at points separated by a fixed lag is constant, facilitating the use of variograms even when means or variances vary spatially. This form is particularly useful for processes exhibiting local homogeneity in fluctuations but global trends. For more complex non-stationary scenarios, intrinsic random functions of order kk (IRF-kk) generalize the framework by assuming that the kk-th order differences of the process are intrinsically stationary, accommodating drifts or trends of order up to k1k-1. Introduced by Matheron, these functions extend geostatistical to datasets with underlying smooth variations, such as geological formations, by stabilizing higher-order increments. Examples illustrate these concepts in real-world applications. Climate variables like daily in a flat agricultural often satisfy second-order stationarity over moderate scales, with constant means and lag-dependent covariances reflecting atmospheric mixing. In contrast, data across varied typically exhibit non-stationarity due to systematic trends from underlying , necessitating IRF approaches or trend removal to model residual fluctuations effectively.

Variogram and Covariance Functions

The semivariogram, denoted as γ(h)\gamma(\mathbf{h}), quantifies the average dissimilarity between values of a spatial Z(x)Z(\mathbf{x}) separated by a lag vector h\mathbf{h}, and is defined as γ(h)=12E[(Z(x)Z(x+h))2]\gamma(\mathbf{h}) = \frac{1}{2} \mathbb{E} \left[ (Z(\mathbf{x}) - Z(\mathbf{x} + \mathbf{h}))^2 \right]. This measure arises under the assumption of intrinsic stationarity, where the first two moments of the increments are translation-invariant. The empirical semivariogram γ^(h)\hat{\gamma}(\mathbf{h}) is estimated from observed data pairs {Z(xi),Z(xi+h)}\{Z(\mathbf{x}_i), Z(\mathbf{x}_i + \mathbf{h})\} using the method-of-moments γ^(h)=12N(h)N(h)[Z(xi)Z(xi+h)]2\hat{\gamma}(\mathbf{h}) = \frac{1}{2 |N(\mathbf{h})|} \sum_{N(\mathbf{h})} [Z(\mathbf{x}_i) - Z(\mathbf{x}_i + \mathbf{h})]^2, where N(h)N(\mathbf{h}) is the set of pairs separated by h\mathbf{h}. Estimation considers potential by directional binning of lags, allowing separate models for different orientations if spatial dependence varies with direction. The nugget effect, appearing as γ^(0+)\hat{\gamma}(0^+), captures microscale variability or measurement error unresolved at the sampling scale. Theoretical semivariogram models are fitted to the empirical values to ensure validity for kriging. Common isotropic models include the spherical γ(h)={c0+c(32ha12(ha)3)hac0+ch>a\gamma(\mathbf{h}) = \begin{cases} c_0 + c \left( \frac{3}{2} \frac{|\mathbf{h}|}{a} - \frac{1}{2} \left( \frac{|\mathbf{h}|}{a} \right)^3 \right) & |\mathbf{h}| \leq a \\ c_0 + c & |\mathbf{h}| > a \end{cases}
Add your contribution
Related Hubs
User Avatar
No comments yet.