Hubbry Logo
search
logo
1602027

Jackknife resampling

logo
Community Hub0 Subscribers
Write something...
Be the first to start a discussion here.
Be the first to start a discussion here.
See all
Jackknife resampling

In statistics, the jackknife (jackknife cross-validation) is a cross-validation technique and, therefore, a form of resampling. It is especially useful for bias and variance estimation. The jackknife pre-dates other common resampling methods such as the bootstrap. Given a sample of size , a jackknife estimator can be built by aggregating the parameter estimates from each subsample of size obtained by omitting one observation. The jackknife is a linear approximation of the bootstrap.

The jackknife technique was developed by Maurice Quenouille (1924–1973) from 1949 and refined in 1956. John Tukey expanded on the technique in 1958 and proposed the name "jackknife" because, like a physical jack-knife (a compact folding knife), it is a rough-and-ready tool that can improvise a solution for a variety of problems even though specific problems may be more efficiently solved with a purpose-designed tool.

The jackknife estimator of a parameter is found by systematically leaving out each observation from a dataset and calculating the parameter estimate over the remaining observations and then aggregating these calculations.

For example, if the parameter to be estimated is the population mean of random variable , then for a given set of i.i.d. observations the natural estimator is the sample mean:

where the last sum used another way to indicate that the index runs over the set .

Then we proceed as follows: For each we compute the mean of the jackknife subsample consisting of all but the -th data point, and this is called the -th jackknife replicate:

It could help to think that these jackknife replicates approximate the distribution of the sample mean . A larger improves the approximation. Then finally to get the jackknife estimator, the jackknife replicates are averaged:

One may ask about the bias and the variance of . From the definition of as the average of the jackknife replicates one could try to calculate explicitly. The bias is a trivial calculation, but the variance of is more involved since the jackknife replicates are not independent.

See all
User Avatar
No comments yet.