Recent from talks
Nothing was collected or created yet.
Interpolation
View on WikipediaThis article includes a list of general references, but it lacks sufficient corresponding inline citations. (October 2016) |
In the mathematical field of numerical analysis, interpolation is a type of estimation, a method of constructing (finding) new data points based on the range of a discrete set of known data points.[1][2]
In engineering and science, one often has a number of data points, obtained by sampling or experimentation, which represent the values of a function for a limited number of values of the independent variable. It is often required to interpolate; that is, estimate the value of that function for an intermediate value of the independent variable.
A closely related problem is the approximation of a complicated function by a simple function. Suppose the formula for some given function is known, but too complicated to evaluate efficiently. A few data points from the original function can be interpolated to produce a simpler function which is still fairly close to the original. The resulting gain in simplicity may outweigh the loss from interpolation error and give better performance in calculation process.

Example
[edit]This table gives some values of an unknown function .

| 0 | 0 | ||||
| 1 | 0 | . | 8415 | ||
| 2 | 0 | . | 9093 | ||
| 3 | 0 | . | 1411 | ||
| 4 | −0 | . | 7568 | ||
| 5 | −0 | . | 9589 | ||
| 6 | −0 | . | 2794 | ||
Interpolation provides a means of estimating the function at intermediate points, such as
We describe some methods of interpolation, differing in such properties as: accuracy, cost, number of data points needed, and smoothness of the resulting interpolant function.
Piecewise constant interpolation
[edit]
The simplest interpolation method is to locate the nearest data value, and assign the same value. In simple problems, this method is unlikely to be used, as linear interpolation (see below) is almost as easy, but in higher-dimensional multivariate interpolation, this could be a favourable choice for its speed and simplicity.
Linear interpolation
[edit]
One of the simplest methods is linear interpolation (sometimes known as lerp). Consider the above example of estimating f(2.5). Since 2.5 is midway between 2 and 3, it is reasonable to take f(2.5) midway between f(2) = 0.9093 and f(3) = 0.1411, which yields 0.5252.
Generally, linear interpolation takes two data points, say (xa,ya) and (xb,yb), and the interpolant is given by:
This previous equation states that the slope of the new line between and is the same as the slope of the line between and
Linear interpolation is quick and easy, but it is not very precise. Another disadvantage is that the interpolant is not differentiable at the point xk.
The following error estimate shows that linear interpolation is not very precise. Denote the function which we want to interpolate by g, and suppose that x lies between xa and xb and that g is twice continuously differentiable. Then the linear interpolation error is
In words, the error is proportional to the square of the distance between the data points. The error in some other methods, including polynomial interpolation and spline interpolation (described below), is proportional to higher powers of the distance between the data points. These methods also produce smoother interpolants.
Polynomial interpolation
[edit]
Polynomial interpolation is a generalization of linear interpolation. Note that the linear interpolant is a linear function. We now replace this interpolant with a polynomial of higher degree.
Consider again the problem given above. The following sixth degree polynomial goes through all the seven points:
Substituting x = 2.5, we find that f(2.5) = ~0.59678.
Generally, if we have n data points, there is exactly one polynomial of degree at most n−1 going through all the data points. The interpolation error is proportional to the distance between the data points to the power n. Furthermore, the interpolant is a polynomial and thus infinitely differentiable. So, we see that polynomial interpolation overcomes most of the problems of linear interpolation.
However, polynomial interpolation also has some disadvantages. Calculating the interpolating polynomial is computationally expensive (see computational complexity) compared to linear interpolation. Furthermore, polynomial interpolation may exhibit oscillatory artifacts, especially at the end points (see Runge's phenomenon).
Polynomial interpolation can estimate local maxima and minima that are outside the range of the samples, unlike linear interpolation. For example, the interpolant above has a local maximum at x ≈ 1.566, f(x) ≈ 1.003 and a local minimum at x ≈ 4.708, f(x) ≈ −1.003. However, these maxima and minima may exceed the theoretical range of the function; for example, a function that is always positive may have an interpolant with negative values, and whose inverse therefore contains false vertical asymptotes.
More generally, the shape of the resulting curve, especially for very high or low values of the independent variable, may be contrary to commonsense; that is, to what is known about the experimental system which has generated the data points. These disadvantages can be reduced by using spline interpolation or restricting attention to Chebyshev polynomials.
Spline interpolation
[edit]
Linear interpolation uses a linear function for each of intervals [xk,xk+1]. Spline interpolation uses low-degree polynomials in each of the intervals, and chooses the polynomial pieces such that they fit smoothly together. The resulting function is called a spline.
For instance, the natural cubic spline is piecewise cubic and twice continuously differentiable. Furthermore, its second derivative is zero at the end points. The natural cubic spline interpolating the points in the table above is given by
In this case we get f(2.5) = 0.5972.
Like polynomial interpolation, spline interpolation incurs a smaller error than linear interpolation, while the interpolant is smoother and easier to evaluate than the high-degree polynomials used in polynomial interpolation. However, the global nature of the basis functions leads to ill-conditioning. This is completely mitigated by using splines of compact support, such as are implemented in Boost.Math and discussed in Kress.[3]
Mimetic interpolation
[edit]Depending on the underlying discretisation of fields, different interpolants may be required. In contrast to other interpolation methods, which estimate functions on target points, mimetic interpolation evaluates the integral of fields on target lines, areas or volumes, depending on the type of field (scalar, vector, pseudo-vector or pseudo-scalar).
A key feature of mimetic interpolation is that vector calculus identities are satisfied, including Stokes' theorem and the divergence theorem. As a result, mimetic interpolation conserves line, area and volume integrals.[4] Conservation of line integrals might be desirable when interpolating the electric field, for instance, since the line integral gives the electric potential difference at the endpoints of the integration path.[5] Mimetic interpolation ensures that the error of estimating the line integral of an electric field is the same as the error obtained by interpolating the potential at the end points of the integration path, regardless of the length of the integration path.
Linear, bilinear and trilinear interpolation are also considered mimetic, even if it is the field values that are conserved (not the integral of the field). Apart from linear interpolation, area weighted interpolation can be considered one of the first mimetic interpolation methods to have been developed.[6]
Functional interpolation
[edit]The Theory of Functional Connections (TFC) is a mathematical framework specifically developed for functional interpolation. Given any interpolant that satisfies a set of constraints, TFC derives a functional that represents the entire family of interpolants satisfying those constraints, including those that are discontinuous or partially defined. These functionals identify the subspace of functions where the solution to a constrained optimization problem resides. Consequently, TFC transforms constrained optimization problems into equivalent unconstrained formulations. This transformation has proven highly effective in the solution of differential equations. TFC achieves this by constructing a constrained functional (a function of a free function), that inherently satisfies given constraints regardless of the expression of the free function. This simplifies solving various types of equations and significantly improves the efficiency and accuracy of methods like Physics-Informed Neural Networks (PINNs). TFC offers advantages over traditional methods like Lagrange multipliers and spectral methods by directly addressing constraints analytically and avoiding iterative procedures, although it cannot currently handle inequality constraints.
Function approximation
[edit]Interpolation is a common way to approximate functions. Given a function with a set of points one can form a function such that for (that is, that interpolates at these points). In general, an interpolant need not be a good approximation, but there are well known and often reasonable conditions where it will. For example, if (four times continuously differentiable) then cubic spline interpolation has an error bound given by where and is a constant.[7]
Via Gaussian processes
[edit]Gaussian process is a powerful non-linear interpolation tool. Many popular interpolation tools are actually equivalent to particular Gaussian processes. Gaussian processes can be used not only for fitting an interpolant that passes exactly through the given data points but also for regression; that is, for fitting a curve through noisy data. In the geostatistics community Gaussian process regression is also known as Kriging.
Inverse Distance Weighting
[edit]Inverse Distance Weighting (IDW) is a spatial interpolation method that estimates values based on nearby data points, with closer points having more influence.[8] It uses an inverse power law for weighting, where higher power values emphasize local effects, while lower values create a smoother surface. IDW is widely used in GIS, meteorology, and environmental modeling for its simplicity but may produce artifacts in clustered or uneven data.[9]
Other forms
[edit]Other forms of interpolation can be constructed by picking a different class of interpolants. For instance, rational interpolation is interpolation by rational functions using Padé approximant, and trigonometric interpolation is interpolation by trigonometric polynomials using Fourier series. Another possibility is to use wavelets.
The Whittaker–Shannon interpolation formula can be used if the number of data points is infinite or if the function to be interpolated has compact support.
Sometimes, we know not only the value of the function that we want to interpolate, at some points, but also its derivative. This leads to Hermite interpolation problems.
When each data point is itself a function, it can be useful to see the interpolation problem as a partial advection problem between each data point. This idea leads to the displacement interpolation problem used in transportation theory.
In higher dimensions
[edit]
Black and red/yellow/green/blue dots correspond to the interpolated point and neighbouring samples, respectively.
Their heights above the ground correspond to their values.
Multivariate interpolation is the interpolation of functions of more than one variable. Methods include nearest-neighbor interpolation, bilinear interpolation and bicubic interpolation in two dimensions, and trilinear interpolation in three dimensions. They can be applied to gridded or scattered data. Mimetic interpolation generalizes to dimensional spaces where .[10][11]
-
Nearest neighbor
-
Bilinear
-
Bicubic
In digital signal processing
[edit]In the domain of digital signal processing, the term interpolation refers to the process of converting a sampled digital signal (such as a sampled audio signal) to that of a higher sampling rate (Upsampling) using various digital filtering techniques (for example, convolution with a frequency-limited impulse signal). In this application there is a specific requirement that the harmonic content of the original signal be preserved without creating aliased harmonic content of the original signal above the original Nyquist limit of the signal (that is, above fs/2 of the original signal sample rate). An early and fairly elementary discussion on this subject can be found in Rabiner and Crochiere's book Multirate Digital Signal Processing.[12]
Related concepts
[edit]The term extrapolation is used to find data points outside the range of known data points.
In curve fitting problems, the constraint that the interpolant has to go exactly through the data points is relaxed. It is only required to approach the data points as closely as possible (within some other constraints). This requires parameterizing the potential interpolants and having some way of measuring the error. In the simplest case this leads to least squares approximation.
Approximation theory studies how to find the best approximation to a given function by another function from some predetermined class, and how good this approximation is. This clearly yields a bound on how well the interpolant can approximate the unknown function.
Generalization
[edit]If we consider as a variable in a topological space, and the function mapping to a Banach space, then the problem is treated as "interpolation of operators".[13] The classical results about interpolation of operators are the Riesz–Thorin theorem and the Marcinkiewicz theorem. There are also many other subsequent results.
See also
[edit]- Barycentric coordinates – for interpolating within on a triangle or tetrahedron
- Brahmagupta's interpolation formula
- Discretization
- Fractal interpolation
- Imputation (statistics)
- Lagrange interpolation
- Missing data
- Newton–Cotes formulas
- Radial basis function interpolation
- Simple rational approximation
- Smoothing
References
[edit]- ^ Sheppard, William Fleetwood (1911). . In Chisholm, Hugh (ed.). Encyclopædia Britannica. Vol. 14 (11th ed.). Cambridge University Press. pp. 706–710.
- ^ Steffensen, J. F. (2006). Interpolation (Second ed.). Mineola, N.Y. ISBN 978-0-486-15483-1. OCLC 867770894.
{{cite book}}: CS1 maint: location missing publisher (link) - ^ Kress, Rainer (1998). Numerical Analysis. Springer. ISBN 9781461205999.
- ^ Pletzer, Alexander; Hayek, Wolfgang (2019-01-01). "Mimetic Interpolation of Vector Fields on Arakawa C/D Grids". Monthly Weather Review. 147 (1): 3–16. Bibcode:2019MWRv..147....3P. doi:10.1175/MWR-D-18-0146.1. ISSN 1520-0493. S2CID 125214770. Archived from the original on 2022-06-07. Retrieved 2022-06-07.
- ^ Stern, Ari; Tong, Yiying; Desbrun, Mathieu; Marsden, Jerrold E. (2015), Chang, Dong Eui; Holm, Darryl D.; Patrick, George; Ratiu, Tudor (eds.), "Geometric Computational Electrodynamics with Variational Integrators and Discrete Differential Forms", Geometry, Mechanics, and Dynamics, Fields Institute Communications, vol. 73, New York, NY: Springer New York, pp. 437–475, arXiv:0707.4470, doi:10.1007/978-1-4939-2441-7_19, ISBN 978-1-4939-2440-0, S2CID 15194760, retrieved 2022-06-15
- ^ Jones, Philip (1999). "First- and Second-Order Conservative Remapping Schemes for Grids in Spherical Coordinates". Monthly Weather Review. 127 (9): 2204–2210. Bibcode:1999MWRv..127.2204J. doi:10.1175/1520-0493(1999)127<2204:FASOCR>2.0.CO;2. S2CID 122744293.
- ^ Hall, Charles A.; Meyer, Weston W. (1976). "Optimal Error Bounds for Cubic Spline Interpolation". Journal of Approximation Theory. 16 (2): 105–122. doi:10.1016/0021-9045(76)90040-X.
- ^ Donald, Shepard (1968). "A two-dimensional interpolation function for irregularly-spaced data". 23rd ACM National Conference.
- ^ Ben Moshe, Nir (2025). "A Simple Solution for the Inverse Distance Weighting Interpolation (IDW) Clustering Problem". Sci. 7 (1): 30. doi:10.3390/sci7010030.
- ^ Whitney, Hassler (1957). Geometric Integration Theory. Dover Books on Mathematics. ISBN 978-0486445830.
{{cite book}}: ISBN / Date incompatibility (help) - ^ Pletzer, Alexander; Fillmore, David (2015). "Conservative interpolation of edge and face data on n dimensional structured grids using differential forms". Journal of Computational Physics. 302: 21–40. Bibcode:2015JCoPh.302...21P. doi:10.1016/j.jcp.2015.08.029.
- ^ Crochiere, Ronald E.; Rabiner, Lawrence R. (1983). R.E. Crochiere and L.R. Rabiner. (1983). Multirate Digital Signal Processing. Englewood Cliffs, NJ: Prentice–Hall. Prentice-Hall. ISBN 0136051626.
- ^ Colin Bennett, Robert C. Sharpley, Interpolation of Operators, Academic Press 1988
External links
[edit]- Online tools for linear Archived 2016-09-18 at the Wayback Machine, quadratic Archived 2016-09-18 at the Wayback Machine, cubic spline Archived 2016-08-20 at the Wayback Machine, and polynomial Archived 2016-09-18 at the Wayback Machine interpolation with visualisation and JavaScript source code.
- Sol Tutorials - Interpolation Tricks Archived 2021-01-31 at the Wayback Machine
- Barycentric rational interpolation in Boost.Math
- Interpolation via the Chebyshev transform in Boost.Math
Interpolation
View on GrokipediaFundamentals
Definition
In numerical analysis, interpolation is the process of constructing a function that passes exactly through a given set of discrete data points to estimate values at intermediate locations.[1] Specifically, given a set of distinct points where the are the nodes and are the corresponding function values, interpolation seeks a function such that for , allowing evaluation of for not coinciding with any .[1] This approach assumes basic knowledge of functions and emphasizes the role of data points as nodes where the interpolant must fit exactly.[4] A key distinction from function approximation is that interpolation requires an exact fit at the specified nodes, whereas approximation methods, such as least-squares fitting, seek to minimize overall error without necessarily passing through every point—often when the number of data points exceeds the degree of the approximating function.[1] For instance, in polynomial interpolation of degree at most through points, the solution is unique, ensuring precise reproduction at the nodes.[1] Common formulations include the Lagrange interpolating polynomial, expressed as where are the basis polynomials, or the Newton form, which builds the interpolant incrementally using divided differences without deriving the full expressions here.[1] These provide introductory ways to represent the interpolating function while maintaining the exact fit requirement.[5]Motivation and History
Interpolation serves as a fundamental tool in numerical analysis for estimating unknown values within discrete datasets, enabling the approximation of continuous functions from sampled points in fields such as scientific measurement, engineering design, and data visualization.[6] This need arises because real-world data is often collected at irregular or finite intervals, requiring interpolation to fill gaps, smooth curves, or predict intermediate values for practical applications like modeling physical phenomena or generating graphical representations.[7] The practice of interpolation dates back to ancient astronomy, where Claudius Ptolemy employed linear interpolation techniques in the 2nd century AD to construct tables of chord functions in his Almagest, facilitating the prediction of planetary positions from discrete observations.[8] In the 17th century, Isaac Newton laid foundational work on finite differences and interpolation in a 1675 letter, establishing methods for polynomial approximation that influenced classical theory.[9] Joseph-Louis Lagrange advanced this in 1795 with his explicit formula for polynomial interpolation, providing a systematic way to construct unique polynomials passing through given points.[9] By the early 20th century, Carl Runge highlighted limitations in 1901, demonstrating through examples that high-degree polynomial interpolation on equispaced points could lead to oscillatory errors, known as Runge's phenomenon, which underscored the need for careful node selection in approximations.[10] Key advancements in the mid-20th century included the mathematical formalization of splines by I. J. Schoenberg in 1946, inspired by flexible wooden or metal strips used in shipbuilding to draw smooth hull curves, leading to piecewise polynomial methods that avoid global oscillations.[11] In geostatistics, D. G. Krige developed early statistical interpolation techniques in 1951 for estimating ore grades in South African mining, which Georges Matheron formalized as kriging in 1960, introducing optimal unbiased prediction under spatial correlation assumptions.[12] The advent of digital computing in the 1950s propelled interpolation into numerical methods for solving differential equations and data processing on early machines like ENIAC, enabling efficient implementation of algorithms for engineering simulations. In the modern era, interpolation has integrated with artificial intelligence, particularly post-2020, for data imputation in large-scale machine learning datasets, where methods like Gaussian process regression variants enhance missing value estimation while preserving statistical properties in high-dimensional data.[13]Univariate Interpolation Methods
Nearest-Neighbor Interpolation
Nearest-neighbor interpolation, also known as piecewise constant interpolation or zero-order hold, is the simplest method for estimating values between known data points in univariate interpolation. It assigns to a query point the value of the closest data point from a given set of pairs for to , without any blending or smoothing.[14][15] This approach is particularly suited for categorical data or scenarios where smoothness is not required, producing a step-like function with constant values in intervals defined by midpoints between neighboring points.[16] The mathematical formulation is given by: In cases of ties (equal distances to multiple points), a convention such as selecting the leftmost or rightmost point, or rounding toward even indices, is typically applied to ensure determinism.[15] The algorithm involves, for a query point , computing the Euclidean distance (absolute difference in one dimension) to each and selecting the with the minimum distance; this naive implementation runs in time per query. With preprocessing, such as sorting the or using a search structure like a binary search tree, the time complexity can be reduced to or for uniform grids.[15][16] Consider a simple example with data points , , and . For a query at , the distances are , , and , so the closest point is and . This results in a discontinuous step function: constant at 1 for , 3 for , and 2 for , visualizing as a staircase with jumps at decision boundaries.[16] Nearest-neighbor interpolation offers significant computational efficiency, requiring minimal operations beyond distance comparisons, making it ideal for real-time applications or large datasets where speed trumps accuracy. It also preserves the exact range of input values, avoiding extrapolation beyond the data. However, it produces non-differentiable, discontinuous outputs that poorly approximate smooth underlying functions, leading to visual artifacts like blockiness in images or aliasing in signals.[14][15][16]Linear Interpolation
Linear interpolation is a fundamental method in numerical analysis for estimating values between known data points by constructing a piecewise linear function that connects consecutive points with straight line segments, assuming the data points are ordered by their independent variable values . This approach provides a simple, first-order approximation that is computationally efficient and preserves monotonicity within each interval.[17] The interpolated value for a query point lying within the interval , where the known points are and with , is given by the formula: This expression ensures exact reproduction of the data points at the endpoints.[17] The formula derives from the concept of a weighted average, where is a convex combination of and . The weight for is the relative distance , which ranges from 0 to 1 across the interval, and the weight for is the complement . This linear weighting follows directly from the equation of a straight line passing through the two points, parameterized by the slope .[18] To illustrate, consider the data points , , and . The piecewise linear interpolant consists of two segments: from to , and from to . The following table shows interpolated values at selected points within these intervals:| Interval | ||
|---|---|---|
| 0.0 | [0,1] | 0.0 |
| 0.25 | [0,1] | 0.25 |
| 0.5 | [0,1] | 0.5 |
| 0.75 | [0,1] | 0.75 |
| 1.0 | [0,1] or [1,2] | 1.0 |
| 1.25 | [1,2] | 0.75 |
| 1.5 | [1,2] | 0.5 |
| 1.75 | [1,2] | 0.25 |
| 2.0 | [1,2] | 0.0 |
Polynomial Interpolation
Polynomial interpolation constructs a unique polynomial of degree at most that passes exactly through given distinct points for , where for some underlying function . This global method applies the same polynomial across the entire domain, making it suitable for exact fitting but prone to instability for high degrees or ill-conditioned points.[22][23] One common representation is the Lagrange form, which expresses directly in terms of the data points without solving a system of equations: where the Lagrange basis polynomials are Each is 1 at and 0 at the other points (), ensuring the interpolation conditions are satisfied. This form is intuitive for theoretical analysis but computationally inefficient for large due to the product evaluations.[22][23] An alternative is the Newton form, which builds the polynomial incrementally using divided differences and is more efficient for adding points or evaluating at multiple locations: where the coefficients are the divided differences defined recursively: , and for , These can be computed via a divided-difference table, facilitating numerical stability and error estimation. For equispaced points, forward differences simplify the process, but the general form handles arbitrary spacing.[24][23] Example: Cubic Interpolation for Rocket Velocity Consider interpolating the upward velocity of a rocket at times seconds, with data:| (s) | (m/s) |
|---|---|
| 0 | 0 |
| 2 | 227 |
| 4 | 362 |
| 6 | 517 |
| First-order | Second-order | Third-order | ||
|---|---|---|---|---|
| 0 | 0 | |||
| 113.5 | ||||
| 2 | 227 | -11.5 | ||
| 67.5 | 2.333 | |||
| 4 | 362 | 2.5 | ||
| 77.5 | ||||
| 6 | 517 |