Recent from talks
Nothing was collected or created yet.
Kriging
View on Wikipedia
In statistics, originally in geostatistics, kriging or Kriging (/ˈkriːɡɪŋ/), also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging gives the best linear unbiased prediction (BLUP) at unsampled locations.[1] Interpolating methods based on other criteria such as smoothness (e.g., smoothing spline) may not yield the BLUP. The method is widely used in the domain of spatial analysis and computer experiments. The technique is also known as Wiener–Kolmogorov prediction, after Norbert Wiener and Andrey Kolmogorov.
The theoretical basis for the method was developed by the French mathematician Georges Matheron in 1960, based on the master's thesis of Danie G. Krige, the pioneering plotter of distance-weighted average gold grades at the Witwatersrand reef complex in South Africa. Krige sought to estimate the most likely distribution of gold based on samples from a few boreholes. The English verb is to krige, and the most common noun is kriging. The word is sometimes capitalized as Kriging in the literature.
Though computationally intensive in its basic formulation, kriging can be scaled to larger problems using various approximation methods.
Main principles
[edit]Related terms and techniques
[edit]Kriging predicts the value of a function at a given point by computing a weighted average of the known values of the function in the neighborhood of the point. The method is closely related to regression analysis. Both theories derive a best linear unbiased estimator based on assumptions on covariances, make use of Gauss–Markov theorem to prove independence of the estimate and error, and use very similar formulae. Even so, they are useful in different frameworks: kriging is made for estimation of a single realization of a random field, while regression models are based on multiple observations of a multivariate data set.
The kriging estimation may also be seen as a spline in a reproducing kernel Hilbert space, with the reproducing kernel given by the covariance function.[2] The difference with the classical kriging approach is provided by the interpretation: while the spline is motivated by a minimum-norm interpolation based on a Hilbert-space structure, kriging is motivated by an expected squared prediction error based on a stochastic model.
Kriging with polynomial trend surfaces is mathematically identical to generalized least squares polynomial curve fitting.
Kriging can also be understood as a form of Bayesian optimization.[3] Kriging starts with a prior distribution over functions. This prior takes the form of a Gaussian process: samples from a function will be normally distributed, where the covariance between any two samples is the covariance function (or kernel) of the Gaussian process evaluated at the spatial location of two points. A set of values is then observed, each value associated with a spatial location. Now, a new value can be predicted at any new spatial location by combining the Gaussian prior with a Gaussian likelihood function for each of the observed values. The resulting posterior distribution is also Gaussian, with a mean and covariance that can be simply computed from the observed values, their variance, and the kernel matrix derived from the prior.
Geostatistical estimator
[edit]In geostatistical models, sampled data are interpreted as the result of a random process. The fact that these models incorporate uncertainty in their conceptualization does not mean that the phenomenon – the forest, the aquifer, the mineral deposit – has resulted from a random process, but rather it allows one to build a methodological basis for the spatial inference of quantities in unobserved locations and to quantify the uncertainty associated with the estimator.

A stochastic process is, in the context of this model, simply a way to approach the set of data collected from the samples. The first step in geostatistical modulation is to create a random process that best describes the set of observed data.
A value from location (generic denomination of a set of geographic coordinates) is interpreted as a realization of the random variable . In the space , where the set of samples is dispersed, there are realizations of the random variables , correlated between themselves.
The set of random variables constitutes a random function, of which only one realization is known – the set of observed data. With only one realization of each random variable, it's theoretically impossible to determine any statistical parameter of the individual variables or the function. The proposed solution in the geostatistical formalism consists in assuming various degrees of stationarity in the random function, in order to make the inference of some statistic values possible.
For instance, if one assumes, based on the homogeneity of samples in area where the variable is distributed, the hypothesis that the first moment is stationary (i.e. all random variables have the same mean), then one is assuming that the mean can be estimated by the arithmetic mean of sampled values.
The hypothesis of stationarity related to the second moment is defined in the following way: the correlation between two random variables solely depends on the spatial distance between them and is independent of their location. Thus if and , then:
For simplicity, we define and .
This hypothesis allows one to infer those two measures – the variogram and the covariogram:
where:
- ;
- denotes the set of pairs of observations such that , and is the number of pairs in the set.
In this set, and denote the same element. Generally an "approximate distance" is used, implemented using a certain tolerance.
Linear estimation
[edit]Spatial inference, or estimation, of a quantity , at an unobserved location , is calculated from a linear combination of the observed values and weights :
The weights are intended to summarize two extremely important procedures in a spatial inference process:
- reflect the structural "proximity" of samples to the estimation location ;
- at the same time, they should have a desegregation effect, in order to avoid bias caused by eventual sample clusters.
When calculating the weights , there are two objectives in the geostatistical formalism: unbias and minimal variance of estimation.
If the cloud of real values is plotted against the estimated values , the criterion for global unbias, intrinsic stationarity or wide sense stationarity of the field, implies that the mean of the estimations must be equal to mean of the real values.
The second criterion says that the mean of the squared deviations must be minimal, which means that when the cloud of estimated values versus the cloud real values is more disperse, the estimator is more imprecise.
Methods
[edit]Depending on the stochastic properties of the random field and the various degrees of stationarity assumed, different methods for calculating the weights can be deduced, i.e. different types of kriging apply. Classical methods are:
- Ordinary kriging assumes constant unknown mean only over the search neighborhood of .
- Simple kriging assumes stationarity of the first moment over the entire domain with a known mean: , where is the known mean.
- Universal kriging assumes a general polynomial trend model, such as linear trend model .
- IRFk-kriging assumes to be an unknown polynomial in .
- Indicator kriging uses indicator functions instead of the process itself, in order to estimate transition probabilities.
- Multiple-indicator kriging is a version of indicator kriging working with a family of indicators. Initially, MIK showed considerable promise as a new method that could more accurately estimate overall global mineral deposit concentrations or grades. However, these benefits have been outweighed by other inherent problems of practicality in modelling due to the inherently large block sizes used and also the lack of mining scale resolution. Conditional simulation is fast, becoming the accepted replacement technique in this case.[citation needed]
- Disjunctive kriging is a nonlinear generalisation of kriging.
- Log-normal kriging interpolates positive data by means of logarithms.
- Latent kriging assumes the various krigings on the latent level (second stage) of the nonlinear mixed-effects model to produce a spatial functional prediction.[4] This technique is useful when analyzing a spatial functional data , where is a time series data over period, is a vector of covariates, and is a spatial location (longitude, latitude) of the -th subject.
- Co-kriging denotes the joint kriging of data from multiple sources with a relationship between the different data sources.[5] Co-kriging is also possible in a Bayesian approach.[6][7]
- Bayesian kriging departs from the optimization of unknown coefficients and hyperparameters, which is understood as a maximum likelihood estimate from the Bayesian perspective. Instead, the coefficients and hyperparameters are estimated from their expectation values. An advantage of Bayesian kriging is, that it allows to quantify the evidence for and the uncertainty of the kriging emulator.[8] If the emulator is employed to propagate uncertainties, the quality of the kriging emulator can be assessed by comparing the emulator uncertainty to the total uncertainty (see also Bayesian Polynomial Chaos). Bayesian kriging can also be mixed with co-kriging.[6][7]
Ordinary kriging
[edit]The unknown value is interpreted as a random variable located in , as well as the values of neighbors samples . The estimator is also interpreted as a random variable located in , a result of the linear combination of variables.
Kriging seeks to minimize the mean square value of the following error in estimating , subject to lack of bias:
The two quality criteria referred to previously can now be expressed in terms of the mean and variance of the new random variable :
- Lack of bias
Since the random function is stationary, , the weights must sum to 1 in order to ensure that the model is unbiased. This can be seen as follows:
- Minimum variance
Two estimators can have , but the dispersion around their mean determines the difference between the quality of estimators. To find an estimator with minimum variance, we need to minimize .
See covariance matrix for a detailed explanation.
where the literals stand for
Once defined the covariance model or variogram, or , valid in all field of analysis of , then we can write an expression for the estimation variance of any estimator in function of the covariance between the samples and the covariances between the samples and the point to estimate:
Some conclusions can be asserted from this expression. The variance of estimation:
- is not quantifiable to any linear estimator, once the stationarity of the mean and of the spatial covariances, or variograms, are assumed;
- grows when the covariance between the samples and the point to estimate decreases. This means that, when the samples are farther away from , the estimation becomes worse;
- grows with the a priori variance of the variable ; when the variable is less disperse, the variance is lower in any point of the area ;
- does not depend on the values of the samples, which means that the same spatial configuration (with the same geometrical relations between samples and the point to estimate) always reproduces the same estimation variance in any part of the area ; this way, the variance does not measure the uncertainty of estimation produced by the local variable.
- System of equations
Solving this optimization problem (see Lagrange multipliers) results in the kriging system:
The additional parameter is a Lagrange multiplier used in the minimization of the kriging error to honor the unbiasedness condition.
Simple kriging
[edit]This section may require cleanup to meet Wikipedia's quality standards. The specific problem is: this section is very poor and needs to be improved. (January 2021) |

Simple kriging is mathematically the simplest, but the least general.[9] It assumes the expectation of the random field is known and relies on a covariance function. However, in most applications neither the expectation nor the covariance are known beforehand.
The practical assumptions for the application of simple kriging are:
- Wide-sense stationarity of the field (variance stationary).
- The expectation is zero everywhere: .
- Known covariance function .
The covariance function is a crucial design choice, since it stipulates the properties of the Gaussian process and thereby the behaviour of the model. The covariance function encodes information about, for instance, smoothness and periodicity, which is reflected in the estimate produced. A very common covariance function is the squared exponential, which heavily favours smooth function estimates.[10] For this reason, it can produce poor estimates in many real-world applications, especially when the true underlying function contains discontinuities and rapid changes.
- System of equations
The kriging weights of simple kriging have no unbiasedness condition and are given by the simple kriging equation system:
This is analogous to a linear regression of on the other .
- Estimation
The interpolation by simple kriging is given by
The kriging error is given by
which leads to the generalised least-squares version of the Gauss–Markov theorem (Chiles & Delfiner 1999, p. 159):
Bayesian kriging
[edit]See also Bayesian Polynomial Chaos
Properties
[edit]This section may require cleanup to meet Wikipedia's quality standards. The specific problem is: this section needs revision. Incorrect or confusing text should be removed. (January 2021) |
- The kriging estimation is unbiased: .
- The kriging estimation honors the actually observed value: (assuming no measurement error is incurred).
- The kriging estimation is the best linear unbiased estimator of if the assumptions hold. However (e.g. Cressie 1993):[11]
- As with any method, if the assumptions do not hold, kriging might be bad.
- There might be better nonlinear and/or biased methods.
- No properties are guaranteed when the wrong variogram is used. However, typically still a "good" interpolation is achieved.
- Best is not necessarily good: e.g. in case of no spatial dependence the kriging interpolation is only as good as the arithmetic mean.
- Kriging provides as a measure of precision. However, this measure relies on the correctness of the variogram.
Applications
[edit]This section may require cleanup to meet Wikipedia's quality standards. The specific problem is: this section is very poor and needs to be improved. (January 2021) |
Although kriging was developed originally for applications in geostatistics, it is a general method of statistical interpolation and can be applied within any discipline to sampled data from random fields that satisfy the appropriate mathematical assumptions. It can be used where spatially related data has been collected (in 2-D or 3-D) and estimates of "fill-in" data are desired in the locations (spatial gaps) between the actual measurements.
To date kriging has been used in a variety of disciplines, including the following:
- Environmental science[12]
- Hydrogeology[13][14][15]
- Mining[16][17]
- Natural resources[18][19]
- Remote sensing[20]
- Real estate appraisal[21]
- Integrated circuit analysis and optimization[22]
- Modelling of microwave devices[23]
- Astronomy[24][25][26]
- Prediction of oil production curve of shale oil wells[27]
Design and analysis of computer experiments
[edit]Another very important and rapidly growing field of application, in engineering, is the interpolation of data coming out as response variables of deterministic computer simulations,[28] e.g. finite element method (FEM) simulations. In this case, kriging is used as a metamodeling tool, i.e. a black-box model built over a designed set of computer experiments. In many practical engineering problems, such as the design of a metal forming process, a single FEM simulation might be several hours or even a few days long. It is therefore more efficient to design and run a limited number of computer simulations, and then use a kriging interpolator to rapidly predict the response in any other design point. Kriging is therefore used very often as a so-called surrogate model, implemented inside optimization routines.[29] Kriging-based surrogate models may also be used in the case of mixed integer inputs.[30]
See also
[edit]References
[edit]- ^ Chung, Sang Yong; Venkatramanan, S.; Elzain, Hussam Eldin; Selvam, S.; Prasanna, M. V. (2019). "Supplement of Missing Data in Groundwater-Level Variations of Peak Type Using Geostatistical Methods". GIS and Geostatistical Techniques for Groundwater Science. Elsevier. pp. 33–41. doi:10.1016/b978-0-12-815413-7.00004-3. ISBN 978-0-12-815413-7. S2CID 189989265.
- ^ Wahba, Grace (1990). Spline Models for Observational Data. Vol. 59. SIAM. doi:10.1137/1.9781611970128. ISBN 978-0-89871-244-5.
- ^ Williams, C. K. I. (1998). "Prediction with Gaussian Processes: From Linear Regression to Linear Prediction and Beyond". Learning in Graphical Models. pp. 599–621. doi:10.1007/978-94-011-5014-9_23. ISBN 978-94-010-6104-9.
- ^ Lee, Se Yoon; Mallick, Bani (2021). "Bayesian Hierarchical Modeling: Application Towards Production Results in the Eagle Ford Shale of South Texas". Sankhya B. 84: 1–43. doi:10.1007/s13571-020-00245-8.
- ^ Le Gratiet, Loic; Garnier, Josselin (2014). "Recursive Co-Kriging Model for Design of Computer Experiments with Multiple Levels of Fidelity". International Journal for Uncertainty Quantification. 4 (5): 365–386. doi:10.1615/Int.J.UncertaintyQuantification.2014006914. ISSN 2152-5080. S2CID 14157948.
- ^ a b Ranftl, Sascha; Melito, Gian Marco; Badeli, Vahid; Reinbacher-Köstinger, Alice; Ellermann, Katrin; Linden, Wolfgang von der (2019-12-09). "On the Diagnosis of Aortic Dissection with Impedance Cardiography: A Bayesian Feasibility Study Framework with Multi-Fidelity Simulation Data". Proceedings. 33 (1): 24. doi:10.3390/proceedings2019033024. ISSN 2504-3900.
- ^ a b Ranftl, Sascha; Melito, Gian Marco; Badeli, Vahid; Reinbacher-Köstinger, Alice; Ellermann, Katrin; von der Linden, Wolfgang (2019-12-31). "Bayesian Uncertainty Quantification with Multi-Fidelity Data and Gaussian Processes for Impedance Cardiography of Aortic Dissection". Entropy. 22 (1): 58. Bibcode:2019Entrp..22...58R. doi:10.3390/e22010058. ISSN 1099-4300. PMC 7516489. PMID 33285833.
- ^ Ranftl, Sascha; von der Linden, Wolfgang (2021-11-13). "Bayesian Surrogate Analysis and Uncertainty Propagation". Physical Sciences Forum. 3 (1): 6. arXiv:2101.04038. doi:10.3390/psf2021003006. ISSN 2673-9984.
- ^ Olea, Ricardo A. (1999). Geostatistics for Engineers and Earth Scientists. Kluwer Academic. ISBN 978-1-4615-5001-3.
- ^ Rasmussen, Carl Edward; Williams, Christopher K. I. (2005-11-23). Gaussian Processes for Machine Learning. doi:10.7551/mitpress/3206.001.0001. ISBN 978-0-262-25683-4.
- ^ Cressie 1993, Chiles&Delfiner 1999, Wackernagel 1995.
- ^ Bayraktar, Hanefi; Sezer, Turalioglu (2005). "A Kriging-based approach for locating a sampling site—in the assessment of air quality". SERRA. 19 (4): 301–305. Bibcode:2005SERRA..19..301B. doi:10.1007/s00477-005-0234-8. S2CID 122643497.
- ^ Chiles, J.-P. and P. Delfiner (1999) Geostatistics, Modeling Spatial Uncertainty, Wiley Series in Probability and statistics.
- ^ Zimmerman, D. A.; De Marsily, G.; Gotway, C. A.; Marietta, M. G.; Axness, C. L.; Beauheim, R. L.; Bras, R. L.; Carrera, J.; Dagan, G.; Davies, P. B.; Gallegos, D. P.; Galli, A.; Gómez-Hernández, J.; Grindrod, P.; Gutjahr, A. L.; Kitanidis, P. K.; Lavenue, A. M.; McLaughlin, D.; Neuman, S. P.; Ramarao, B. S.; Ravenne, C.; Rubin, Y. (1998). "A comparison of seven geostatistically based inverse approaches to estimate transmissivities for modeling advective transport by groundwater flow" (PDF). Water Resources Research. 34 (6): 1373–1413. Bibcode:1998WRR....34.1373Z. doi:10.1029/98WR00003.
- ^ Tonkin, M. J.; Larson, S. P. (2002). "Kriging Water Levels with a Regional-Linear and Point-Logarithmic Drift". Ground Water. 40 (2): 185–193. Bibcode:2002GrWat..40..185T. doi:10.1111/j.1745-6584.2002.tb02503.x. PMID 11916123. S2CID 23008603.
- ^ Journel, A. G.; Huijbregts, C. J. (1978). Mining Geostatistics. London: Academic Press. ISBN 0-12-391050-1.
- ^ Richmond, A. (2003). "Financially Efficient Ore Selections Incorporating Grade Uncertainty". Mathematical Geology. 35 (2): 195–215. Bibcode:2003MatG...35..195R. doi:10.1023/A:1023239606028. S2CID 116703619.
- ^ Goovaerts (1997) Geostatistics for natural resource evaluation, OUP. ISBN 0-19-511538-4
- ^ Emery, X. (2005). "Simple and Ordinary Multigaussian Kriging for Estimating Recoverable Reserves". Mathematical Geology. 37 (3): 295–319. Bibcode:2005MatGe..37..295E. doi:10.1007/s11004-005-1560-6. S2CID 92993524.
- ^ Papritz, A.; Stein, A. (2002). "Spatial prediction by linear kriging". Spatial Statistics for Remote Sensing. Remote Sensing and Digital Image Processing. Vol. 1. p. 83. doi:10.1007/0-306-47647-9_6. ISBN 0-7923-5978-X.
- ^ Barris, J.; Garcia Almirall, P. (2010). "A density function of the appraisal value" (PDF). European Real Estate Society.
- ^ Oghenekarho Okobiah, Saraju Mohanty, and Elias Kougianos (2013) Geostatistical-Inspired Fast Layout Optimization of a Nano-CMOS Thermal Sensor. Archived 2014-07-14 at the Wayback Machine, IET Circuits, Devices and Systems (CDS), Vol. 7, No. 5, Sep. 2013, pp. 253–262.
- ^ Koziel, Slawomir (2011). "Accurate modeling of microwave devices using kriging-corrected space mapping surrogates". International Journal of Numerical Modelling: Electronic Networks, Devices and Fields. 25: 1–14. doi:10.1002/jnm.803. S2CID 62683207.
- ^ Pastorello, Nicola (2014). "The SLUGGS survey: exploring the metallicity gradients of nearby early-type galaxies to large radii". Monthly Notices of the Royal Astronomical Society. 442 (2): 1003–1039. arXiv:1405.2338. Bibcode:2014MNRAS.442.1003P. doi:10.1093/mnras/stu937. S2CID 119221897.
- ^ Foster, Caroline; Pastorello, Nicola; Roediger, Joel; Brodie, Jean; Forbes, Duncan; Kartha, Sreeja; Pota, Vincenzo; Romanowsky, Aaron; Spitler, Lee; Strader, Jay; Usher, Christopher; Arnold, Jacob (2016). "The SLUGGS survey: stellar kinematics, kinemetry and trends at large radii in 25 early-type galaxies". Monthly Notices of the Royal Astronomical Society. 457 (1): 147–171. arXiv:1512.06130. Bibcode:2016MNRAS.457..147F. doi:10.1093/mnras/stv2947. S2CID 53472235.
- ^ Bellstedt, Sabine; Forbes, Duncan; Foster, Caroline; Romanowsky, Aaron; Brodie, Jean; Pastorello, Nicola; Alabi, Adebusola; Villaume, Alexa (2017). "The SLUGGS survey: using extended stellar kinematics to disentangle the formation histories of low-mass S) galaxies". Monthly Notices of the Royal Astronomical Society. 467 (4): 4540–4557. arXiv:1702.05099. Bibcode:2017MNRAS.467.4540B. doi:10.1093/mnras/stx418. S2CID 54521046.
- ^ Lee, Se Yoon; Mallick, Bani (2021). "Bayesian Hierarchical Modeling: Application Towards Production Results in the Eagle Ford Shale of South Texas". Sankhya B. 84: 1–43. doi:10.1007/s13571-020-00245-8.
- ^ Sacks, J.; Welch, W. J.; Mitchell, T. J.; Wynn, H. P. (1989). "Design and Analysis of Computer Experiments". Statistical Science. 4 (4): 409–435. doi:10.1214/ss/1177012413. JSTOR 2245858.
- ^ Strano, M. (March 2008). "A technique for FEM optimization under reliability constraint of process variables in sheet metal forming". International Journal of Material Forming. 1 (1): 13–20. doi:10.1007/s12289-008-0001-8. S2CID 136682565.
- ^ Saves, Paul; Diouane, Youssef; Bartoli, Nathalie; Lefebvre, Thierry; Morlier, Joseph (2023). "A mixed-categorical correlation kernel for Gaussian process". Neurocomputing. 550 126472. arXiv:2211.08262. doi:10.1016/j.neucom.2023.126472.
Further reading
[edit]This "further reading" section may need cleanup. (November 2014) |
Historical references
[edit]- Chilès, Jean-Paul; Desassis, Nicolas (2018). "Fifty Years of Kriging". Handbook of Mathematical Geosciences. Cham: Springer International Publishing. pp. 589–612. doi:10.1007/978-3-319-78999-6_29. ISBN 978-3-319-78998-9. S2CID 125362741.
- Agterberg, F. P., Geomathematics, Mathematical Background and Geo-Science Applications, Elsevier Scientific Publishing Company, Amsterdam, 1974.
- Cressie, N. A. C., The origins of kriging, Mathematical Geology, v. 22, pp. 239–252, 1990.
- Krige, D. G., A statistical approach to some mine valuations and allied problems at the Witwatersrand, Master's thesis of the University of Witwatersrand, 1951.
- Link, R. F. and Koch, G. S., Experimental Designs and Trend-Surface Analsysis, Geostatistics, A colloquium, Plenum Press, New York, 1970.
- Matheron, G., "Principles of geostatistics", Economic Geology, 58, pp. 1246–1266, 1963.
- Matheron, G., "The intrinsic random functions, and their applications", Adv. Appl. Prob., 5, pp. 439–468, 1973.
- Merriam, D. F. (editor), Geostatistics, a colloquium, Plenum Press, New York, 1970.
- Mockus, J., "On Bayesian methods for seeking the extremum." Proceedings of the IFIP Technical Conference. 1974.
Books
[edit]- Abramowitz, M., and Stegun, I. (1972), Handbook of Mathematical Functions, Dover Publications, New York.
- Banerjee, S., Carlin, B. P. and Gelfand, A. E. (2004). Hierarchical Modeling and Analysis for Spatial Data. Chapman and Hall/CRC Press, Taylor and Francis Group.
- Chiles, J.-P. and P. Delfiner (1999) Geostatistics, Modeling Spatial uncertainty, Wiley Series in Probability and statistics.
- Clark, I., and Harper, W. V., (2000) Practical Geostatistics 2000, Ecosse North America, USA.
- Cressie, N. (1993) Statistics for spatial data, Wiley, New York.
- David, M. (1988) Handbook of Applied Advanced Geostatistical Ore Reserve Estimation, Elsevier Scientific Publishing
- Deutsch, C. V., and Journel, A. G. (1992), GSLIB – Geostatistical Software Library and User's Guide, Oxford University Press, New York, 338 pp.
- Goovaerts, P. (1997) Geostatistics for Natural Resources Evaluation, Oxford University Press, New York, ISBN 0-19-511538-4.
- Isaaks, E. H., and Srivastava, R. M. (1989), An Introduction to Applied Geostatistics, Oxford University Press, New York, 561 pp.
- Journel, A. G. and C. J. Huijbregts (1978) Mining Geostatistics, Academic Press London.
- Journel, A. G. (1989), Fundamentals of Geostatistics in Five Lessons, American Geophysical Union, Washington D.C.
- Press, W. H.; Teukolsky, S. A.; Vetterling, W. T.; Flannery, B. P. (2007), "Section 3.7.4. Interpolation by Kriging", Numerical Recipes: The Art of Scientific Computing (3rd ed.), New York: Cambridge University Press, ISBN 978-0-521-88068-8. Also, "Section 15.9. Gaussian Process Regression".
- Stein, M. L. (1999), Statistical Interpolation of Spatial Data: Some Theory for Kriging, Springer, New York.
- Wackernagel, H. (1995) Multivariate Geostatistics - An Introduction with Applications, Springer Berlin
Kriging
View on GrokipediaIntroduction
Definition and Purpose
Kriging is a geostatistical interpolation technique that estimates values of a spatial random field at unsampled locations using observed data points, under the assumption of spatial autocorrelation among nearby observations.[3] This method originated in the field of mining geology and is now widely applied in environmental science, hydrology, and resource estimation to predict continuous spatial phenomena such as pollutant concentrations or soil properties.[2] By modeling the covariance structure of the data, kriging produces predictions that account for the inherent variability and dependence in spatial datasets.[3] The primary purpose of kriging is to deliver unbiased estimates with the minimum possible variance, making it the best linear unbiased estimator (BLUE) for spatial prediction.[3] Unlike simpler deterministic methods like inverse distance weighting (IDW), which assign weights based solely on Euclidean distances and often oversmooth data, kriging incorporates a probabilistic model of spatial continuity to yield more precise and reliable interpolations, particularly in heterogeneous environments.[4] This optimality ensures that predictions are not systematically biased and have the lowest prediction error among linear combinations of the observed data.[3] In essence, kriging functions as a tailored form of Gaussian process regression for geospatial applications, where the spatial field is treated as a realization of a multivariate Gaussian distribution conditioned on the observations.[5] A practical example is in mineral resource evaluation, where kriging predicts ore grades at untested sites within a mine based on limited drill core samples, enabling informed decisions on extraction feasibility and reserve volumes.[6] The method models spatial dependence through tools like the variogram to quantify how similarity decreases with distance.[3]Historical Development
The origins of kriging trace back to the work of South African mining engineer Danie G. Krige, who in 1951 developed an empirical method for estimating gold ore grades in Witwatersrand mines using weighted averages of nearby drill hole samples to account for spatial variability.[7] This approach, detailed in Krige's MSc thesis and subsequent publication, addressed practical challenges in mine valuation by improving the accuracy of reserve estimates over traditional methods like polygonal estimation.[8] Krige's technique was initially applied routinely in South African gold mines during the early 1950s, marking the first systematic use of spatial interpolation in mineral resource evaluation.[9] The theoretical formalization of kriging occurred in 1963 through the efforts of French mathematician Georges Matheron at the École des Mines de Paris, who built upon Krige's empirical results to establish a rigorous geostatistical framework.[10] Matheron coined the term "kriging" in 1962 as a tribute to Krige, integrating concepts like the variogram—introduced to quantify spatial continuity and dependence in data—to derive unbiased linear predictors for unsampled locations.[11] This development transformed Krige's practical tool into a statistically grounded method, emphasizing best linear unbiased estimation under assumptions of spatial stationarity. Kriging evolved rapidly from its mining roots in the 1950s to a foundational geostatistical technique by the 1970s, with Matheron's seminal two-volume "Traité de Géostatistique Appliquée" (1962–1963) providing the comprehensive theoretical basis and applications that popularized it beyond ore evaluation.[12] By the 1980s, kriging had gained traction in environmental sciences for tasks such as mapping pollutant distributions and groundwater contamination, further facilitated in the 1990s by accessible software like the U.S. Environmental Protection Agency's Geo-EAS package, which implemented kriging routines for spatial analysis of environmental data.[13] This period solidified kriging's role as a versatile interpolation method across disciplines, bridging empirical mining practices with broader statistical applications.[14]Mathematical Foundations
Spatial Dependence and Stationarity
Spatial dependence is a fundamental concept in geostatistics, positing that values of a spatial process at nearby locations exhibit greater similarity than those at distant locations. This principle, known as Tobler's First Law of Geography, states that "everything is related to everything else, but near things are more related than distant things," providing the theoretical foundation for interpolation methods like kriging, where predictive weights are derived from spatial proximity and correlation structures. Kriging relies on this dependence to model spatial autocorrelation, assuming that the influence of observed data points diminishes with increasing distance, thereby enabling unbiased predictions at unsampled locations. In practice, this dependence is quantified through tools like the variogram, which measures dissimilarity as a function of separation distance, though detailed modeling is addressed elsewhere. Stationarity assumptions underpin the validity of these models by ensuring that statistical properties of the spatial process remain consistent across the domain. Strict stationarity requires that the joint probability distribution of the process is invariant under spatial translations, implying uniformity in all moments and higher-order dependencies. Second-order stationarity, a weaker and more commonly invoked condition in geostatistics, assumes a constant mean throughout the domain and a covariance function that depends solely on the lag vector between points, allowing the process variance to be well-defined and separable from location. Intrinsic stationarity relaxes these further by focusing on the stationarity of increments rather than the process itself, where the variance of differences between values at points separated by a fixed lag is constant, facilitating the use of variograms even when means or variances vary spatially. This form is particularly useful for processes exhibiting local homogeneity in fluctuations but global trends.[15] For more complex non-stationary scenarios, intrinsic random functions of order (IRF-) generalize the framework by assuming that the -th order differences of the process are intrinsically stationary, accommodating polynomial drifts or trends of order up to . Introduced by Matheron, these functions extend geostatistical inference to datasets with underlying smooth variations, such as geological formations, by stabilizing higher-order increments.[15] Examples illustrate these concepts in real-world applications. Climate variables like daily temperature in a flat agricultural region often satisfy second-order stationarity over moderate scales, with constant means and lag-dependent covariances reflecting atmospheric mixing. In contrast, elevation data across varied topography typically exhibit non-stationarity due to systematic trends from underlying geology, necessitating IRF approaches or trend removal to model residual fluctuations effectively.[16]Variogram and Covariance Functions
The semivariogram, denoted as , quantifies the average dissimilarity between values of a spatial random field separated by a lag vector , and is defined as .[17] This measure arises under the assumption of intrinsic stationarity, where the first two moments of the increments are translation-invariant.[17] The empirical semivariogram is estimated from observed data pairs using the method-of-moments estimator , where is the set of pairs separated by . Estimation considers potential anisotropy by directional binning of lags, allowing separate models for different orientations if spatial dependence varies with direction. The nugget effect, appearing as , captures microscale variability or measurement error unresolved at the sampling scale. Theoretical semivariogram models are fitted to the empirical values to ensure validity for kriging. Common isotropic models include the spherical , exponential , and Gaussian , where is the nugget, the sill, and the range. Fitting proceeds via weighted least squares, minimizing with weights accounting for estimation variance, or maximum likelihood, maximizing the Gaussian log-likelihood under the model parameters .[18] Under second-order stationarity, the covariance function relates to the semivariogram by , where is the variance (sill). Valid covariance functions must be positive definite, ensuring non-negative variances in any linear combination of the field, a property verified via Bochner's theorem for continuous models or spectral analysis. Cross-validation assesses model adequacy by omitting each data point, predicting it via kriging, and evaluating errors such as mean error (near zero for unbiasedness) and mean squared error (minimized for accuracy). This technique guides selection among candidate models by comparing prediction performance across the dataset.The Kriging Predictor
General Formulation
Kriging provides a method for predicting the value of a spatial random process at an unsampled location based on observed values at known locations . The general kriging predictor is formulated as a linear combination of the observations: where the weights are chosen to minimize the prediction error variance while satisfying the unbiasedness condition .[19] This condition ensures that the expected value of the predictor equals the true value at , i.e., , assuming the process has a constant mean.[19] To determine the optimal weights, the kriging system solves a constrained optimization problem that minimizes the prediction variance subject to the unbiasedness constraint. For ordinary kriging, this leads to the augmented linear system where is the variogram matrix with entries , is the vector of variograms between and the observation points with , denotes the semivariogram function, is a vector of ones, and is the Lagrange multiplier.[19] The variogram matrix captures the spatial dependence structure derived from the second-order properties of the process.[19] The associated prediction variance, or kriging variance, quantifies the uncertainty in the estimate and is given by where is the Lagrange multiplier from the kriging system. Note that by definition of the semivariogram, while the sill of the variogram, , is equivalent to the process variance (or nugget plus sill if microscale variation is present).[20] This variance is always non-negative and achieves zero at observed locations, reflecting exact interpolation.[21] In the context of Gaussian processes, simple kriging (with known zero mean) corresponds exactly to the conditional mean of a zero-mean Gaussian process given the observations, providing exact interpolation where the posterior mean passes through the data points and the posterior variance vanishes there. Ordinary kriging approximates this under an unknown constant mean, providing similar exact interpolation properties and serving as a form of empirical Bayesian inference under Gaussian assumptions with a constant mean.[21][22]Best Linear Unbiased Estimation
Kriging provides the best linear unbiased estimator (BLUE) for predicting the value of a spatial random field at an unobserved location , based on observations for . The estimator takes the linear form , where the weights are chosen to satisfy two key criteria: unbiasedness, meaning (which implies under stationarity assumptions), and minimum variance of the prediction error among all linear unbiased estimators. This optimality follows from the Gauss-Markov theorem applied to spatial processes, ensuring the kriging predictor has the smallest mean squared error in the class of linear estimators.[3] The derivation of the BLUE weights proceeds by minimizing the prediction error variance subject to the unbiasedness constraint . This variance is expressed as where denotes the covariance function. To solve this constrained optimization, the method of Lagrange multipliers is employed, forming the Lagrangian Taking partial derivatives with respect to each and setting them to zero yields along with the constraint equation. In matrix form, this results in the kriging system where is the covariance matrix with entries , is the vector with entries , and is a vector of ones. Solving this system provides the optimal weights that minimize the error variance while ensuring unbiasedness.[23][3] The BLUE property of kriging exhibits invariance under affine transformations of the data. Specifically, if the observed values are transformed as for constants and , the kriging estimator transforms accordingly to , preserving unbiasedness and minimum variance in the transformed space. This robustness stems from the linear structure and the use of second-moment properties.[3] Compared to other linear estimators, such as those based on geometric distances (e.g., inverse distance weighting), kriging achieves superior variance reduction by explicitly accounting for the spatial covariance structure, which captures dependencies beyond mere proximity. This incorporation of the covariance function ensures that the prediction error variance is lower, providing more reliable uncertainty quantification in spatially correlated data.[23]Kriging Variants
Simple Kriging
Simple kriging is a foundational geostatistical method for interpolating values at unsampled locations in a spatial field, assuming the underlying process has a known constant mean and exhibits second-order stationarity. Developed as part of the early geostatistical framework, it minimizes the mean squared prediction error among linear unbiased estimators under these conditions.[24] This approach is particularly suited to scenarios where prior knowledge of the global mean allows for straightforward application without additional estimation steps.[25] The method relies on two primary assumptions: the spatial random field possesses a constant known mean , and it is second-order stationary, such that the expected value is constant and the covariance between observations depends solely on their spatial separation .[24] These assumptions ensure that the covariance structure fully captures the spatial dependence, enabling reliable predictions without needing to model local variations in the mean.[26] Given observations at sampled locations , the simple kriging predictor at an unsampled location is formulated as the known mean plus a weighted sum of deviations from that mean: where is the vector of observations, is the vector of ones, is the vector of covariances between and the sampled locations, is the covariance matrix among the sampled locations, and the weights for the centered process satisfy .[25] This is equivalent to . The associated prediction variance, which quantifies uncertainty, is with denoting the process variance at lag zero.[24] Simple kriging offers advantages in simplicity and efficiency compared to more general variants when the mean is reliably known from prior data, reducing computational demands by avoiding mean estimation.[26] Additionally, when the spatial process is Gaussian, simple kriging provides the exact conditional expectation, making it optimal beyond mere linearity. A representative application involves interpolating rainfall in a uniform climatic region, where the global mean is derived from historical averages across the field, allowing simple kriging to leverage the covariance structure for precise spatial predictions.[27]Ordinary Kriging
Ordinary kriging is a geostatistical technique for predicting values at unsampled locations in a spatial process where the mean is assumed to be constant but unknown across the domain. It achieves best linear unbiased prediction by solving for optimal weights that minimize the prediction variance while enforcing an unbiasedness constraint through a Lagrange multiplier. This method was formalized as part of the foundational geostatistical framework developed by Georges Matheron in the 1960s and 1970s.[28][29] The primary assumptions of ordinary kriging are that the spatial process has an unknown constant mean and satisfies second-order stationarity, implying that the mean is constant and the covariance between any two points depends solely on their separation distance. These assumptions ensure that the spatial dependence structure can be modeled reliably using a covariance or variogram function, allowing for the estimation of the process without prior information on the mean value.[30][31] To obtain the kriging weights , the method solves an augmented linear system that incorporates the unbiasedness constraint : where is the variogram matrix among the sampled locations (with ), is the vector of ones, is the vector of variogram values between the sampled locations and the prediction point (with ), contains the weights, and is the Lagrange multiplier enforcing the constraint. The prediction at is then , where is the vector of observed values.[30][31][29] The associated prediction variance, or kriging variance, quantifies the uncertainty and is calculated as , where . This variance expression accounts for the spatial structure and the enforced constraint, providing a measure of reliability for the prediction.[30][31] Ordinary kriging approximates simple kriging when the sample mean is close to the true mean, as the Lagrange multiplier then aligns closely with the known mean assumption in simple kriging, reducing the additional complexity of the constraint.[32][30] A classic application of ordinary kriging is in ore grade estimation within mining, where sample data from drill holes are used to predict metal concentrations at unsampled sites without assuming prior knowledge of the average grade, thereby supporting resource evaluation and mine planning. For instance, in gold deposit modeling, ordinary kriging weights nearby samples to estimate grades while ensuring the predictions are unbiased and variance-minimized, leading to more accurate block models for extraction decisions.[29][30]Universal Kriging
Universal kriging extends the kriging framework to handle spatial processes with non-stationary means, where the expected value varies systematically according to a known functional form rather than being constant. Developed by Georges Matheron, this method models the process as , with representing the deterministic trend—typically a linear combination of basis functions such as polynomials in coordinates (e.g., constant, linear, or quadratic terms)—and denoting the zero-mean stochastic residuals. The coefficients are unknown and estimated from the data, while the residuals are assumed to be second-order stationary with a known covariance structure or variogram, ensuring the covariance of residuals depends only on spatial separation. The core of universal kriging involves solving a generalized linear system that simultaneously determines the interpolation weights and the trend parameters to achieve the best linear unbiased predictor (BLUP). In matrix notation using the covariance form, the system is given by where is the covariance matrix of the observations , is the design matrix with rows ( basis functions), is the vector of covariances between and the observations, , are the weights, and are the Lagrange multipliers associated with the trend constraints. The predictor is then , where , ensuring unbiasedness by satisfying . An equivalent variogram-based system replaces covariances with the variogram matrix and vector . The kriging variance for universal kriging, which quantifies prediction uncertainty, is where is the process variance; in variogram terms, it becomes under intrinsic stationarity assumptions for residuals. This variance accounts for both the spatial correlation of residuals and the uncertainty in the trend estimate, generally exceeding that of ordinary kriging due to the additional variability from . An alternative implementation involves trend removal: first fit the trend via generalized least squares using the residual covariance structure, subtract it from observations to yield residuals, apply ordinary kriging to the residuals, and add the fitted trend back at ; this is computationally equivalent under the model assumptions. A practical example arises in topographic modeling, where elevation data exhibit a linear trend due to underlying geology. The mean is modeled as , with basis functions , and residuals assumed to follow a spherical variogram with nugget effect, sill, and range fitted from data. For a dataset of elevations at irregular points, universal kriging yields predictions that capture both the regional slope and local fluctuations, with variances reflecting sparser trend information in peripheral areas.Other Variants
Indicator kriging is a geostatistical technique designed for handling categorical, binary, or non-normal continuous data by transforming the variable into a set of binary indicators at specified thresholds, allowing estimation of conditional cumulative distribution functions via ordinary kriging applied to each indicator variogram.[33] This approach enables probabilistic modeling of spatial distributions without assuming normality, making it suitable for applications like ore grade estimation where high and low values exhibit asymmetric continuity.[33] Multiple indicators can be combined to recover the full conditional distribution at unsampled locations, providing both point estimates and uncertainty measures for decision-making in resource evaluation.[33] Co-kriging extends the kriging framework to multivariate spatial data where variables are correlated, incorporating cross-covariances between primary and secondary variables to improve prediction accuracy beyond univariate kriging.[29] By solving a system of equations that includes auto- and cross-variograms, co-kriging leverages auxiliary data—such as densely sampled secondary variables—to enhance estimates of the target variable, particularly when direct observations are sparse.[29] This method is widely applied in scenarios like environmental monitoring, where correlated pollutants or geochemical indicators are jointly analyzed for more precise spatial interpolation.[29] Bayesian kriging integrates prior distributions on model parameters, such as the mean, trend, and covariance structure, to derive posterior predictions, offering a probabilistic framework that contrasts with the frequentist approach of classical kriging by explicitly accounting for uncertainty in hyperparameters.[34] Markov chain Monte Carlo (MCMC) methods are commonly employed to sample from the posterior, enabling flexible incorporation of expert knowledge or historical data into the spatial prediction process.[34] This variant is particularly useful in complex settings with limited data, where it provides full posterior distributions for predictions rather than point estimates and variances.[34] Log-normal kriging addresses positively skewed data, such as mineral grades or rainfall amounts, by applying a logarithmic transformation to achieve approximate normality before performing kriging, followed by a bias-corrected back-transformation to the original scale.[35] Disjunctive kriging, a related nonlinear extension, further handles skewed distributions using expansions like Hermite polynomials to model the variable as a sum of uncorrelated factors, allowing estimation of nonlinear functions of the spatial variable without transformation.[36] These methods preserve the log-normal or multimodal characteristics of the data, improving accuracy in applications involving multiplicative processes or bounded positive values.[35][36] Block kriging modifies the standard point kriging system to estimate average values over finite areas or volumes, such as mine blocks, by adjusting the variogram to account for the support size and shape of the block relative to point samples. This involves computing block-to-point covariances or using regularization in the variogram model, which reduces smoothing effects and provides variance estimates tailored to the larger support, essential for resource recovery planning. It is routinely used in mining to assess tonnage and grade over practical exploitation units, ensuring unbiased predictions that reflect the averaging inherent in block sampling.Computation and Implementation
Estimation Steps
The estimation of spatial predictions using kriging follows a structured workflow that ensures the method's assumptions are met and its results are reliable. This process begins with preparing the input data and culminates in generating predictions along with measures of uncertainty. The steps are iterative, often requiring validation and adjustment to account for the spatial structure of the data.[24] Step 1: Data Exploration and Declustering. Initial data preparation involves exploratory analysis to identify outliers, trends, and sampling irregularities, such as uneven density that can bias estimates. For uneven sampling, declustering techniques are applied to weight observations inversely proportional to local density, providing a more representative global mean and reducing overrepresentation from clustered points. This step ensures stationarity assumptions are reasonably satisfied before proceeding.[24][25] Step 2: Variogram Modeling. Next, the spatial dependence structure is modeled using an empirical variogram, calculated from pairwise differences in observed values as a function of separation distance. A theoretical variogram model (e.g., spherical or exponential) is fitted to the empirical points, often through least-squares or maximum likelihood methods. Validation occurs via cross-validation, where each point is temporarily removed, predicted from neighbors, and errors assessed for bias and variance; this confirms the model's adequacy for capturing spatial continuity. As detailed in the variogram and covariance functions section, this modeling is crucial for determining covariance weights.[24][25] Step 3: Choosing the Kriging Variant and Setting Up the System Matrix. The appropriate kriging variant is selected based on knowledge of the mean: ordinary kriging is typically chosen when the mean is unknown but assumed constant, as it incorporates a Lagrange multiplier to enforce unbiasedness without requiring a prior mean estimate. The system of equations is then formulated as a matrix, where the covariance matrix between observed points and the target location defines the weights for linear combination of observations.[24][25] Step 4: Solving the Kriging Equations. The kriging weights are obtained by solving the linear system, typically via direct matrix inversion for small datasets (e.g., fewer than 100 points), which yields exact solutions. For larger datasets, iterative methods like conjugate gradient solvers are employed to avoid computational instability and high memory demands.[24][25] Step 5: Generating Predictions and Variance Maps; Diagnostic Checks. Predictions at unsampled locations are computed as the weighted sum of neighboring observations, while the kriging variance—derived from the model—provides a map of prediction uncertainty, highlighting areas of high reliability or extrapolation risk. Diagnostic checks include verifying that the mean prediction error approximates zero and that the standardized errors follow a standard normal distribution, often through leave-one-out cross-validation to assess overall performance.[24][25] For large datasets, approximations such as kriging with moving neighborhoods are used, where predictions at each location incorporate only a local subset of nearby points (e.g., 10-50) within a defined search radius and anisotropy, balancing accuracy with computational efficiency. This approach mitigates the quadratic scaling of full kriging while preserving local spatial structure.[24][25]Software Tools
Kriging implementations are available in various open-source software libraries, particularly in statistical programming environments. In R, the gstat package provides tools for variogram fitting and spatial predictions using methods such as ordinary and universal kriging, along with support for spatio-temporal extensions.[37] The geoR package extends this capability with Bayesian geostatistical analysis, enabling uncertainty quantification in model parameters for Gaussian processes akin to kriging.[38] In Python, scikit-learn offers Gaussian process regression, which mathematically aligns with kriging for spatial interpolation and prediction under stationarity assumptions.[39] Additionally, PyKrige serves as a dedicated toolkit for 2D and 3D ordinary and universal kriging, incorporating standard variogram models like spherical and exponential. Commercial software caters to professional applications, especially in resource estimation and visualization. Surfer, developed by Golden Software, facilitates 2D and 3D mapping through kriging gridding, allowing users to create contoured surfaces from irregularly spaced data.[40] Isatis.neo by Geovariances supports comprehensive mining workflows, including kriging for mineral resource estimation with features for domaining and simulation-based uncertainty assessment.[41] Similarly, the Groundwater Modeling System (GMS) from Aquaveo integrates kriging routines based on the Geostatistical Software Library (GSLIB) for subsurface modeling in hydrogeological contexts.[42] These tools commonly include automated variogram modeling to fit experimental semivariograms, streamlining the estimation of spatial covariance structures.[43] Support for kriging variants, such as co-kriging, enables multivariate predictions by incorporating auxiliary variables with cross-variograms.[44] Visualization of uncertainty is a key feature, often through prediction variance maps that highlight prediction reliability across the spatial domain.[27] Open-source options like gstat, geoR, scikit-learn, and PyKrige are freely accessible to academics and researchers, promoting widespread adoption in educational and non-commercial settings. Commercial tools such as Surfer and Isatis.neo require licensing but offer robust support for industry-scale computations. Many integrate with geographic information systems (GIS); for instance, ArcGIS provides the Geostatistical Analyst extension for kriging within a broader spatial analysis framework.[45] Recent trends emphasize cloud-based platforms for handling large datasets. Google Earth Engine supports kriging interpolation on feature collections, enabling scalable environmental applications like temperature mapping over vast regions without local computational limits.[46]Applications
Mining and Earth Sciences
In mining and earth sciences, kriging is extensively applied for ore reserve estimation, where block kriging is particularly valued for interpolating grades across discretized blocks that represent smelting units, thereby providing averaged estimates that reduce uncertainty in tonnage and grade calculations.[47] This method accounts for spatial variability in mineral deposits by minimizing estimation variance through variogram-based weighting, enabling more reliable predictions of recoverable resources compared to simpler interpolation techniques.[48] For instance, in iron ore deposits like Choghart in Iran, block kriging has been used to construct three-dimensional models that integrate drillhole data, yielding tonnage estimates with quantified errors that inform mine planning and economic viability assessments.[47] Grade control during excavation relies heavily on ordinary kriging to deliver real-time estimates of mineral grades, allowing operators to optimize blasting patterns and selective mining that minimize dilution and ore loss.[49] By applying ordinary kriging to closely spaced drillhole samples as mining progresses, estimators can delineate high-grade zones with sufficient precision to guide excavation decisions, such as adjusting blast designs to target ore selectively while avoiding waste.[50] This approach is common in open-pit operations, where rapid kriging computations support on-site decisions that enhance recovery rates and reduce operational costs, as demonstrated in various gold and base metal mines.[51] In petroleum exploration and reservoir characterization, co-kriging integrates sparse well data with densely sampled seismic attributes to estimate properties like porosity, improving the spatial continuity of models for hydrocarbon reservoirs.[52] This multivariate extension of kriging leverages cross-correlations between primary (e.g., well-logged porosity) and secondary (e.g., seismic impedance) variables, resulting in more accurate interpolations across unsampled areas and better delineation of productive zones.[53] Applications in fields like those in southwest Iran have shown co-kriging to outperform univariate methods by incorporating seismic trends, leading to refined reservoir simulations that support enhanced oil recovery strategies.[52] A seminal case study traces kriging's origins to the Witwatersrand gold mines in South Africa during the 1950s, where D. G. Krige developed early distance-weighted averaging techniques to estimate gold grades from drillhole data, laying the foundation for modern geostatistics.[54] This work evolved into routine applications by the early 1960s on large Anglovaal gold mines, marking one of the first industrial uses of kriging for reserve evaluation in the region. Today, these methods persist in constructing three-dimensional geological models for Witwatersrand operations, integrating kriging with seismic data to map complex reef structures and update reserves amid ongoing extraction.[55] One key benefit of kriging in these contexts is its ability to quantify estimation risk through the kriging variance, which serves as a measure of local uncertainty and informs economic decision-making, such as cutoff grade optimization and investment risk assessment.[56] By providing not only point or block estimates but also variance metrics, kriging enables mining engineers to evaluate the reliability of tonnage and grade projections, supporting probabilistic analyses that balance resource recovery against operational hazards.[57] This risk quantification has become integral to standards like those from the Canadian Institute of Mining, ensuring that reserve statements reflect both mean values and their associated uncertainties for sustainable mine development.[58]Environmental and Hydrological Modeling
In environmental and hydrological modeling, kriging techniques are widely applied to interpolate sparse spatial data for simulating processes such as pollutant dispersion and water flow dynamics. Universal kriging, which accounts for underlying trends in the data, is particularly effective for estimating hydraulic conductivity fields in groundwater systems, where heterogeneity and regional drifts influence flow patterns. For instance, in aquifer studies, universal kriging has been used to generate continuous maps of hydraulic conductivity from borehole measurements, enabling more accurate simulations of groundwater movement and contaminant transport.[59][60] For air quality assessment, indicator kriging is employed to map exceedance probabilities of pollutant thresholds, transforming concentration data into binary indicators (exceedance or non-exceedance) to better capture non-linear spatial relationships. This approach facilitates probabilistic risk maps for pollutants like PM2.5 and NO2, integrating monitoring station data to predict areas at risk of violating air quality standards. By providing variance estimates, indicator kriging quantifies uncertainty in these predictions, supporting regulatory decisions on emission controls.[61][62][63] In climate modeling, simple kriging serves as a foundational method for interpolating temperature data across grids derived from weather station networks, assuming stationarity in the mean and variance. This technique produces smooth surfaces of monthly or annual temperatures, essential for downscaling climate models and assessing regional warming trends. Variogram models in simple kriging may incorporate anisotropy to reflect directional influences, such as prevailing winds affecting temperature gradients in hydrological catchments.[64][65][66] A notable case study involves co-kriging to map nitrate contamination risks in vulnerable aquifers, where nitrate measurements from wells are jointly interpolated with auxiliary land-use data (e.g., agricultural intensity) to enhance prediction accuracy. In the Vega de Granada aquifer, Spain, compositional co-kriging revealed high-risk zones linked to intensive farming, with probability maps showing exceedance risks above 50 mg/L in 30-40% of the area, aiding targeted remediation efforts. This multivariate extension leverages correlations between nitrate levels and land-use patterns to reduce estimation errors in sparse datasets.[67] Overall, kriging's advantages in these applications include its robustness to sparse monitoring networks typical of remote environmental sites, allowing reliable interpolations from limited observations, and its provision of kriging variance for uncertainty quantification, which is critical for risk-based hydrological assessments like flood or contamination forecasting.[68][69][70]Engineering and Computer Experiments
In engineering and computer experiments, kriging serves as a surrogate modeling technique within the design and analysis of computer experiments (DACE) framework to emulate computationally expensive simulations, such as computational fluid dynamics (CFD) analyses, thereby facilitating efficient optimization processes.[71] Introduced in seminal work on DACE, kriging models the deterministic output of a computer code as a realization of a stochastic process, enabling interpolation and prediction across the input space with associated uncertainty estimates.[71] This approach is particularly valuable in engineering design, where direct evaluations of high-fidelity simulations are prohibitive due to time and resource constraints, allowing for rapid exploration of design spaces.[72] Space-filling designs, such as Latin hypercube sampling (LHS), are commonly employed to generate initial datasets for fitting kriging models efficiently in DACE. LHS stratifies the input space into equally probable intervals, ensuring that sample points are maximally dispersed and representative of the design domain, which enhances the kriging model's predictive accuracy with fewer evaluations. For instance, in mechanical engineering applications, LHS combined with kriging reduces the number of required simulation runs while maintaining robust coverage of multidimensional parameter spaces, as demonstrated in surrogate-based optimization workflows.[73] In robust optimization, kriging's prediction variance is leveraged to propagate uncertainties in mechanical designs, enabling the identification of solutions that minimize sensitivity to input variations. By incorporating the kriging variance as a measure of model uncertainty and noise propagation, engineers can formulate objectives that balance mean performance with robustness, such as minimizing the variance of structural responses under parametric fluctuations.[74] This statistics-based approach, using kriging metamodels to approximate both mean and variance, has been applied to optimize components like cantilever beams, where it efficiently handles noisy evaluations and achieves designs with improved reliability.[74] A representative case study involves aerodynamic shape optimization, where kriging predicts lift coefficients from sparse CFD simulations of airfoil geometries. In optimizing transonic airfoils, such as the RAE 2822, kriging surrogates trained on a limited set of high-fidelity evaluations (e.g., 50-100 points) enable global search algorithms to maximize lift-to-drag ratios, reducing the overall computational cost by over 90% compared to direct evaluations while converging to near-optimal shapes.[75] Extensions to multi-fidelity modeling via co-kriging further enhance efficiency by combining low- and high-resolution simulations. Co-kriging constructs a hierarchical surrogate that uses inexpensive low-fidelity data to inform the high-fidelity model, scaling the correlation structure across fidelity levels to improve predictions in engineering optimization. For example, in structural design problems, this approach integrates coarse mesh finite element analyses with refined ones, achieving accuracy comparable to full high-fidelity kriging with significantly fewer expensive evaluations.[76]Limitations and Extensions
Assumptions and Challenges
Kriging methods rely on several foundational assumptions to ensure the validity of their predictions as best linear unbiased estimators (BLUE). A primary assumption is second-order stationarity of the underlying spatial process, which requires a constant mean across the domain and a covariance structure that depends solely on the lag distance between points, allowing the variogram or covariance function to model spatial dependence consistently.[77] For simple kriging, an additional assumption of Gaussianity—or normality of the spatial random field—is often invoked to achieve optimality in the mean squared prediction error sense, though ordinary kriging maintains unbiasedness without full normality by estimating the mean from the data.[78] Violations of these assumptions, such as non-stationarity or non-Gaussian distributions, can propagate errors through the interpolation process.[2] Equally critical is the assumption of a correctly specified variogram model, which captures the spatial autocorrelation structure; this model directly determines the weights assigned to observed data points in the kriging predictor. Misspecification of the variogram—such as choosing an inappropriate functional form (e.g., exponential versus spherical) or inaccurate parameter estimation—leads to suboptimal weights and biased predictions, potentially underestimating variability or introducing systematic errors in the kriging surface.[79] Kriging also implicitly assumes the absence of significant outliers in the dataset, as anomalous points can inflate the nugget effect in the variogram or skew weights, resulting in distorted predictions that fail to represent true spatial patterns.[26] When these assumptions hold, kriging provides reliable interpolations, but real-world data often deviate, necessitating careful model diagnostics. Computational challenges further complicate kriging applications, particularly the O(n³) time complexity required for inverting the n × n covariance matrix during weight estimation, which becomes infeasible for datasets with thousands of points and demands substantial memory and processing resources.[80] This issue is exacerbated by ill-conditioned matrices, often arising from collinear or densely clustered data points, which cause near-singularity and numerical instability, leading to unreliable or divergent solutions unless regularization techniques are applied.[81] Non-ergodicity presents a theoretical hurdle, as the finite sample variogram may not ergodically represent the population variogram, resulting in over-smoothed kriging estimates that regress extremes toward the mean and underrepresent local heterogeneity or sharp gradients in the spatial field.[82] Boundary effects in finite spatial domains introduce additional practical challenges, manifesting as bias near edges where fewer surrounding observations are available to inform predictions, often causing underprediction of variability or directional artifacts in the interpolated surface.[83] To evaluate adherence to assumptions and overall model performance, cross-validation diagnostics are essential; leave-one-out cross-validation, for instance, assesses unbiasedness by verifying that the mean error (ME) approximates zero and optimizes accuracy by minimizing the root mean square error (RMSE), providing quantitative measures of prediction reliability without requiring independent validation data.[84]Modern Developments
Since the 2000s, kriging has seen significant advancements in scalability to handle big data, particularly through approximations like fixed rank kriging, which employs low-rank covariance structures to reduce computational complexity for datasets exceeding millions of points, enabling efficient predictions on massive spatial grids.[85] Cluster-based kriging further enhances this by partitioning large datasets into subsets for local modeling, followed by aggregation, which has proven effective for high-dimensional problems while maintaining prediction accuracy.[86] Sparse Gaussian process approximations, akin to kriging, leverage inducing points to approximate full covariance matrices, facilitating applications to satellite imagery analysis where data volumes from remote sensing exceed traditional limits.[87] Hybrid approaches integrating kriging with machine learning have addressed non-stationarity in spatial data, such as random forest residuals kriging, where a random forest model captures nonlinear trends and kriging interpolates the residuals, improving predictions for complex environmental variables like soil organic matter.[88] Deep Gaussian processes, an extension of kriging's Gaussian process foundation, stack multiple layers to model hierarchical non-stationarities, offering superior flexibility for geospatial regression tasks compared to single-layer kriging.[89] Universal kriging has been applied in climate change modeling on global datasets to incorporate trends from general circulation models. Bayesian variants of kriging have been used to map COVID-19 spatial spread, employing empirical Bayesian kriging to interpolate incidence rates across regions while accounting for uncertainty in underreported data.[90] For uncertainty quantification, ensemble kriging methods combine multiple kriging models with other surrogates like artificial neural networks, yielding robust variance estimates in AI-driven predictions and reducing overfitting in high-stakes scenarios.[91] In the 2020s, trends emphasize kriging's integration with deep learning for multi-scale spatial prediction in environmental AI, such as deep kriging neural networks that fuse convolutional layers with Gaussian processes to enhance resolution in remote sensing tasks like land cover mapping.[92]References
- https://www.coastalwiki.org/wiki/Data_interpolation_with_Kriging
