Variance-stabilizing transformation

In applied statistics, a variance-stabilizing transformation is a data transformation that is specifically chosen either to simplify considerations in graphical exploratory data analysis or to allow the application of simple regression-based or analysis of variance techniques.[1]

Overview


The aim behind the choice of a variance-stabilizing transformation is to find a simple function f to apply to values x in a data set to create new values y = f(x) such that the variability of the values y is not related to their mean value. For example, suppose that the values x are realizations from different Poisson distributions: i.e. the distributions each have different mean values μ. Then, because for the Poisson distribution the variance is identical to the mean, the variance varies with the mean. However, if the simple variance-stabilizing transformation

y = \sqrt{x}

is applied, the sampling variance associated with an observation will be nearly constant: see Anscombe transform for details and some alternative transformations.
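As a quick illustration, the following sketch (assuming NumPy is available; the particular means are arbitrary choices) simulates Poisson samples at several means and compares the variance before and after the square-root transformation:

import numpy as np

rng = np.random.default_rng(0)
means = [1, 5, 25, 100]  # arbitrary Poisson means for illustration

for mu in means:
    x = rng.poisson(mu, size=100_000)
    # Raw variance grows with the mean; after y = sqrt(x) it settles near 1/4.
    print(f"mu={mu:>3}  var(x)={x.var():7.2f}  var(sqrt(x))={np.sqrt(x).var():.3f}")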

While variance-stabilizing transformations are well known for certain parametric families of distributions, such as the Poisson and the binomial distribution, some types of data analysis proceed more empirically: for example, by searching among power transformations to find a suitable fixed transformation. Alternatively, if data analysis suggests a functional form for the relation between variance and mean, this can be used to deduce a variance-stabilizing transformation.[2] Thus if, for a mean μ,

\operatorname{Var}(X) = h(\mu),

a suitable basis for a variance-stabilizing transformation would be

y \propto \int^{x} \frac{d\mu}{\sqrt{h(\mu)}},

where the arbitrary constant of integration and an arbitrary scaling factor can be chosen for convenience.
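As a sketch of this recipe (assuming SymPy is available; the Poisson case h(μ) = μ is used purely as an example), the integral can be carried out symbolically:

import sympy as sp

mu = sp.symbols('mu', positive=True)
h = mu  # Poisson: the variance equals the mean

# y ∝ ∫ dμ / sqrt(h(μ)); the constant of integration and scale are free.
y = sp.integrate(1 / sp.sqrt(h), mu)
print(y)  # 2*sqrt(mu): the square-root transformation, up to scaling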

Example: relative variance


If X is a positive random variable and, for some constant s, the variance is given as h(μ) = s²μ², then the standard deviation is proportional to the mean, which is called fixed relative error. In this case, the variance-stabilizing transformation is

y = \int^{x} \frac{d\mu}{\sqrt{s^{2}\mu^{2}}} = \frac{1}{s} \ln(x) \propto \log(x).

That is, the variance-stabilizing transformation is the logarithmic transformation.
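A minimal simulation sketch (assuming NumPy; the value of s and the means are illustrative) showing that the log transform removes the dependence of the variance on the mean when the relative error is fixed:

import numpy as np

rng = np.random.default_rng(0)
s = 0.1  # fixed relative error: sd(X) = s * mean(X)

for mu in [1.0, 10.0, 100.0]:
    x = rng.normal(loc=mu, scale=s * mu, size=100_000)
    # The raw variance grows like (s*mu)^2, but var(log x) stays near s^2.
    print(f"mu={mu:>5}  var(x)={x.var():9.2f}  var(log(x))={np.var(np.log(x)):.4f}")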

Example: absolute plus relative variance


If the variance is given as h(μ) = σ² + s²μ², then the variance is dominated by a fixed variance σ² when |μ| is small enough and is dominated by the relative variance s²μ² when |μ| is large enough. In this case, the variance-stabilizing transformation is

y = \int^{x} \frac{d\mu}{\sqrt{\sigma^{2} + s^{2}\mu^{2}}} = \frac{1}{s} \operatorname{asinh}\!\left(\frac{x}{\sigma/s}\right) \propto \operatorname{asinh}\!\left(\frac{x}{\lambda}\right) \quad \text{for } \lambda = \sigma/s.

That is, the variance-stabilizing transformation is the inverse hyperbolic sine of the scaled value x / λ for λ = σ / s.
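A simulation sketch along the same lines (assuming NumPy; σ, s, and the means are illustrative choices) showing that asinh(x/λ) gives a roughly constant variance of about s² across very different means:

import numpy as np

rng = np.random.default_rng(0)
sigma, s = 2.0, 0.1          # additive and relative noise scales
lam = sigma / s              # λ = σ / s

for mu in [0.0, 5.0, 50.0, 500.0]:
    sd = np.sqrt(sigma**2 + (s * mu)**2)   # h(μ) = σ² + s²μ²
    x = rng.normal(loc=mu, scale=sd, size=100_000)
    y = np.arcsinh(x / lam)
    # var(y) is roughly s^2 = 0.01, whatever the mean.
    print(f"mu={mu:>6}  var(x)={x.var():10.2f}  var(asinh(x/lam))={y.var():.4f}")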


Example: Pearson correlation

The Fisher transformation is a variance-stabilizing transformation for the Pearson correlation coefficient.
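A short simulation sketch (assuming NumPy; the sample size and correlation values are illustrative) showing that the variance of z = artanh(r) stays close to 1/(n − 3) regardless of the true correlation ρ:

import numpy as np

rng = np.random.default_rng(0)
n = 30  # illustrative sample size per correlation estimate

for rho in [0.0, 0.5, 0.9]:
    cov = [[1.0, rho], [rho, 1.0]]
    zs = []
    for _ in range(5_000):
        xy = rng.multivariate_normal([0.0, 0.0], cov, size=n)
        r = np.corrcoef(xy[:, 0], xy[:, 1])[0, 1]
        zs.append(np.arctanh(r))  # Fisher z = artanh(r)
    # var(z) stays near 1/(n - 3), whatever the true correlation.
    print(f"rho={rho}  var(z)={np.var(zs):.4f}  1/(n-3)={1 / (n - 3):.4f}")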

Relationship to the delta method


Here the delta method is presented in a rough way, but it is enough to see the relation with variance-stabilizing transformations. For a more formal treatment, see the delta method.

Let X be a random variable with \operatorname{E}[X] = \mu and \operatorname{Var}[X] = \sigma^{2}. Define Y = g(X), where g is a regular function. A first-order Taylor approximation for Y = g(X) is:

Y = g(X) \approx g(\mu) + g'(\mu)(X - \mu)

From the equation above, we obtain:

\operatorname{E}[Y] \approx g(\mu)

and

\operatorname{Var}[Y] \approx \sigma^{2} g'(\mu)^{2}
This approximation method is called the delta method.
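A quick numeric check of this approximation (assuming NumPy; the choice g = log and the parameter values are purely illustrative):

import numpy as np

rng = np.random.default_rng(0)
mu, sigma = 50.0, 2.0
x = rng.normal(mu, sigma, size=1_000_000)

# Delta method with g = log: Var[log X] ≈ sigma^2 * (g'(mu))^2 = sigma^2 / mu^2.
approx = sigma**2 / mu**2
empirical = np.var(np.log(x))
print(f"delta-method approx: {approx:.6f}  simulated: {empirical:.6f}")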

Consider now a random variable X such that \operatorname{E}[X] = \mu and \operatorname{Var}[X] = h(\mu). Notice the relation between the variance and the mean, which implies, for example, heteroscedasticity in a linear model. Therefore, the goal is to find a function g such that Y = g(X) has a variance independent (at least approximately) of its expectation.

Imposing the condition \operatorname{Var}[Y] \approx h(\mu) g'(\mu)^{2} = \text{constant}, this equality implies the differential equation:

\frac{dg}{d\mu} = \frac{C}{\sqrt{h(\mu)}}

This ordinary differential equation has, by separation of variables, the following solution:

g(\mu) = \int \frac{C \, d\mu}{\sqrt{h(\mu)}}
This last expression appeared for the first time in an M. S. Bartlett paper.[3]
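As an illustration of this solution (assuming SymPy; the binomial-proportion variance h(μ) = μ(1 − μ) is chosen only as an example), carrying out the integral recovers the classic angular transformation:

import sympy as sp

mu, C = sp.symbols('mu C', positive=True)

# Binomial-proportion-type variance function: h(μ) = μ(1 − μ).
h = mu * (1 - mu)

# g(μ) = ∫ C dμ / sqrt(h(μ))
g = sp.integrate(C / sp.sqrt(h), mu)
print(sp.simplify(g))
# Up to additive and multiplicative constants this equals 2*asin(sqrt(mu)),
# the classic angular (arcsine) transformation for proportions.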

References
