
Pushforward measure

from Wikipedia

In measure theory, a pushforward measure (also known as push forward, push-forward or image measure) is obtained by transferring ("pushing forward") a measure from one measurable space to another using a measurable function.

Definition


Given measurable spaces $(X_1, \Sigma_1)$ and $(X_2, \Sigma_2)$, a measurable function $f \colon X_1 \to X_2$ and a measure $\mu \colon \Sigma_1 \to [0, +\infty]$, the pushforward of $\mu$ by $f$ is defined to be the measure $f_*\mu \colon \Sigma_2 \to [0, +\infty]$ given by

$(f_*\mu)(B) = \mu\bigl(f^{-1}(B)\bigr)$ for $B \in \Sigma_2.$

This definition applies mutatis mutandis for a signed or complex measure. The pushforward measure is also denoted $f_*\mu$, $f_\#\mu$, $f\mu$, or $\mu \circ f^{-1}$.
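For a finitely supported measure the definition can be realized directly. A minimal Python sketch (the `pushforward` helper and the example measure are our own, purely illustrative choices):

```python
from collections import Counter

def pushforward(mu, f):
    """Pushforward of a discrete measure under a map f.

    mu : dict mapping points of X to their masses
    f  : measurable map X -> Y (any Python function)
    Returns a dict on Y with (f_* mu)(y) = mu(f^{-1}({y})).
    """
    nu = Counter()
    for x, mass in mu.items():
        nu[f(x)] += mass        # all mass over the fibre of f(x) accumulates
    return dict(nu)

# A measure on {-2, -1, 0, 1, 2} pushed forward by x -> x**2:
mu = {-2: 0.1, -1: 0.2, 0: 0.4, 1: 0.2, 2: 0.1}
nu = pushforward(mu, lambda x: x * x)
print(nu)   # masses of points with the same image add up, total mass unchanged
```

Note that the total mass is preserved, so a probability measure pushes forward to a probability measure.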

Properties


Change of variable formula


Theorem:[1] A measurable function $g$ on $X_2$ is integrable with respect to the pushforward measure $f_*\mu$ if and only if the composition $g \circ f$ is integrable with respect to the measure $\mu$. In that case, the integrals coincide, i.e.,

$\int_{X_2} g \, d(f_*\mu) = \int_{X_1} g \circ f \, d\mu.$

Note that in the previous formula $\int_{X_1} g \circ f \, d\mu = \int_{X_1} g(f(x)) \, d\mu(x)$.
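For a finitely supported measure the two integrals in the theorem are finite sums, and the identity can be checked directly. A small sketch with an arbitrary toy measure, map, and integrand:

```python
from collections import Counter

# Discrete measure mu on X = {0,1,2,3}, a map f, and an observable g on Y.
mu = {0: 0.1, 1: 0.2, 2: 0.3, 3: 0.4}
f = lambda x: x % 2          # X -> Y = {0, 1}
g = lambda y: 3 * y + 1      # integrand on Y

# Pushforward f_* mu
fmu = Counter()
for x, m in mu.items():
    fmu[f(x)] += m

lhs = sum(g(y) * m for y, m in fmu.items())      # integral of g  d(f_* mu)
rhs = sum(g(f(x)) * m for x, m in mu.items())    # integral of g∘f  dmu
assert abs(lhs - rhs) < 1e-12
```

Both sides group the same terms, just in a different order, which is exactly why the theorem holds for simple functions.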

Functoriality


Pushforwards of measures make it possible to induce, from a measurable function $f \colon X \to Y$ between measurable spaces, a map $f_* \colon M(X) \to M(Y)$ between their spaces of measures. As with many induced mappings, this construction has the structure of a functor on the category of measurable spaces.

For the special case of probability measures, this property amounts to functoriality of the Giry monad.
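Functoriality means $(g \circ f)_*\mu = g_*(f_*\mu)$, which can be verified mechanically for discrete measures. A sketch with illustrative toy maps:

```python
from collections import Counter

def push(mu, f):
    """Pushforward of a discrete measure (as a Counter) under f."""
    nu = Counter()
    for x, m in mu.items():
        nu[f(x)] += m
    return nu

mu = Counter({0: 0.25, 1: 0.25, 2: 0.5})
f = lambda x: x + 1      # X -> Y
g = lambda y: y % 2      # Y -> Z

lhs = push(mu, lambda x: g(f(x)))   # (g ∘ f)_* mu
rhs = push(push(mu, f), g)          # g_* (f_* mu)
assert lhs == rhs                   # composition rule holds exactly
```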

Examples and applications

  • If $(\Omega, \mathcal{F}, P)$ is a probability space, $(E, \mathcal{E})$ is a measurable space, and $X \colon \Omega \to E$ is an $E$-valued random variable, then the probability distribution of $X$ is the pushforward measure $X_*P$ of $P$ by $X$ onto $(E, \mathcal{E})$.
  • A natural "Lebesgue measure" on the unit circle $S^1$ (here thought of as a subset of the complex plane $\mathbb{C}$) may be defined using a push-forward construction and Lebesgue measure $\lambda$ on the real line $\mathbb{R}$. Let $\lambda$ also denote the restriction of Lebesgue measure to the interval $[0, 2\pi)$ and let $f \colon [0, 2\pi) \to S^1$ be the natural bijection defined by $f(t) = \exp(it)$. The natural "Lebesgue measure" on $S^1$ is then the push-forward measure $f_*(\lambda)$. The measure $f_*(\lambda)$ might also be called "arc length measure" or "angle measure", since the $f_*(\lambda)$-measure of an arc in $S^1$ is precisely its arc length (or, equivalently, the angle that it subtends at the centre of the circle).
  • The previous example extends nicely to give a natural "Lebesgue measure" on the n-dimensional torus Tn. The previous example is a special case, since S1 = T1. This Lebesgue measure on Tn is, up to normalization, the Haar measure for the compact, connected Lie group Tn.
  • Gaussian measures on infinite-dimensional vector spaces are defined using the push-forward and the standard Gaussian measure on the real line: a Borel measure γ on a separable Banach space X is called Gaussian if the push-forward of γ by any non-zero linear functional in the continuous dual space to X is a Gaussian measure on R.
  • Consider a measurable function $f \colon X \to X$ and the composition of $f$ with itself $n$ times:

$f^{(n)} = \underbrace{f \circ f \circ \cdots \circ f}_{n \text{ times}} \colon X \to X.$

This iterated function forms a dynamical system. It is often of interest in the study of such systems to find a measure $\mu$ on $X$ that the map $f$ leaves unchanged, a so-called invariant measure, i.e. one for which $f_*(\mu) = \mu$.
  • One can also consider quasi-invariant measures for such a dynamical system: a measure $\mu$ on $X$ is called quasi-invariant under $f$ if the push-forward of $\mu$ by $f$ is merely equivalent to the original measure $\mu$, not necessarily equal to it. A pair of measures on the same space are equivalent if and only if each is absolutely continuous with respect to the other, so $\mu$ is quasi-invariant under $f$ if $f_*(\mu) \sim \mu$.
  • Many natural probability distributions, such as the chi distribution, can be obtained via this construction.
  • Random variables induce pushforward measures: a random variable maps a probability space into a codomain space and endows that space with the pushforward probability measure. Because a random variable is a (total) function, the inverse image of the whole codomain is the whole domain, whose measure is 1, so the pushforward measure of the whole codomain is 1. This means that random variables can be composed indefinitely: the composites remain random variables and endow their codomain spaces with probability measures.
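The circle construction above can be checked numerically: pushing Lebesgue measure on $[0, 2\pi)$ through $t \mapsto e^{it}$ should assign each arc its arc length. A Monte Carlo sketch (the arc endpoints and sample size are arbitrary choices):

```python
import cmath
import math
import random

random.seed(0)

# Push Lebesgue measure on [0, 2π) forward through t -> exp(it) and
# estimate the measure of the arc of angles between a and b.
a, b = 0.5, 2.0
n = 200_000
hits = 0
for _ in range(n):
    t = random.uniform(0.0, 2 * math.pi)   # sample the base measure
    z = cmath.exp(1j * t)                  # point on the unit circle S^1
    if a < cmath.phase(z) < b:             # lies on the open arc (a, b)
        hits += 1

arc_measure = (hits / n) * 2 * math.pi     # rescale: total mass is 2π
print(arc_measure)                         # should be close to b - a = 1.5
```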

A generalization


In general, any measurable function can be pushed forward. The push-forward then becomes a linear operator, known as the transfer operator or Frobenius–Perron operator. In finite spaces this operator typically satisfies the requirements of the Frobenius–Perron theorem, and the maximal eigenvalue of the operator corresponds to the invariant measure.
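On a finite space the transfer operator is just a column-stochastic matrix, and power iteration recovers the invariant measure as the eigenvector for the maximal eigenvalue 1. A sketch with an illustrative 3-state kernel (the matrix entries are arbitrary):

```python
# P[i][j] = mass moved from state j to state i; columns sum to 1,
# so P acts on measures the way a transfer operator does.
P = [
    [0.5, 0.2, 0.3],
    [0.3, 0.5, 0.3],
    [0.2, 0.3, 0.4],
]

mu = [1.0, 0.0, 0.0]            # any starting probability vector
for _ in range(200):            # power iteration toward the eigenvalue-1 vector
    mu = [sum(P[i][j] * mu[j] for j in range(3)) for i in range(3)]

# mu is now (approximately) invariant: P mu = mu, and total mass is preserved
Pmu = [sum(P[i][j] * mu[j] for j in range(3)) for i in range(3)]
assert all(abs(Pmu[i] - mu[i]) < 1e-10 for i in range(3))
assert abs(sum(mu) - 1.0) < 1e-10
```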

The adjoint to the push-forward is the pullback; as an operator on spaces of functions on measurable spaces, it is the composition operator or Koopman operator.

from Grokipedia
In measure theory, the pushforward measure (also known as the image measure) of a measure $\mu$ on a measurable space $(X, \mathcal{A})$ under a measurable map $f \colon X \to Y$ to another measurable space $(Y, \mathcal{B})$ is the measure $f_*\mu$ on $(Y, \mathcal{B})$ defined by $(f_*\mu)(B) = \mu(f^{-1}(B))$ for every $B \in \mathcal{B}$.[1] This construction transfers the "mass" of $\mu$ from $X$ to $Y$ via $f$, preserving the total measure; if $\mu$ is a probability measure, then $f_*\mu(Y) = \mu(X) = 1$.[2]

A key property of the pushforward measure is its compatibility with integration: for any measurable function $g \colon Y \to [0, \infty]$, the integral transforms as $\int_Y g \, d(f_*\mu) = \int_X (g \circ f) \, d\mu$.[3] This ensures that expectations and probabilities are preserved under the mapping, making pushforward measures essential for change-of-variables formulas in multiple integrals.[1] For instance, under an invertible linear transformation $T \colon \mathbb{R}^d \to \mathbb{R}^d$ with Lebesgue measure $m$, the pushforward satisfies $T_*m = \frac{1}{|\det T|}\, m$, which scales the measure by the reciprocal of the absolute value of the determinant.[4]

In probability theory, the pushforward measure $f_*\mathbb{P}$ induced by a random variable $f$ on a probability space $(\Omega, \mathcal{F}, \mathbb{P})$ is precisely the distribution (or law) of $f$, describing the probabilities of events in the codomain.[2] This concept extends to more advanced applications, such as optimal transport, where pushforwards model the displacement of mass between measures, and differential geometry, where they facilitate the study of measures under diffeomorphisms via Jacobian adjustments.[3] Pushforward measures also play a role in product spaces and in Haar measures on groups, ensuring invariance under group actions.[1]

Fundamentals

Definition

In measure theory, consider two measurable spaces $(X, \Sigma_X)$ and $(Y, \Sigma_Y)$, where $\Sigma_X$ and $\Sigma_Y$ are $\sigma$-algebras on sets $X$ and $Y$, respectively. Let $f \colon X \to Y$ be a measurable function, meaning that $f^{-1}(B) \in \Sigma_X$ for every $B \in \Sigma_Y$, and let $\mu$ be a measure on the measurable space $(X, \Sigma_X)$.[1] The pushforward measure (also known as the image measure) $f_*\mu$ induced by $f$ is the measure on $(Y, \Sigma_Y)$ defined by

$(f_*\mu)(B) = \mu(f^{-1}(B))$

for every $B \in \Sigma_Y$.[1]

To verify that $f_*\mu$ is indeed a measure, first note non-negativity: for any $B \in \Sigma_Y$, $(f_*\mu)(B) = \mu(f^{-1}(B)) \geq 0$ since $\mu$ is non-negative. For $\sigma$-additivity, suppose $\{B_n\}_{n=1}^\infty$ is a countable collection of pairwise disjoint sets in $\Sigma_Y$. Then $f^{-1}\left(\bigcup_{n=1}^\infty B_n\right) = \bigcup_{n=1}^\infty f^{-1}(B_n)$, and the preimages are also pairwise disjoint and measurable, so

$(f_*\mu)\left(\bigcup_{n=1}^\infty B_n\right) = \mu\left(\bigcup_{n=1}^\infty f^{-1}(B_n)\right) = \sum_{n=1}^\infty \mu(f^{-1}(B_n)) = \sum_{n=1}^\infty (f_*\mu)(B_n),$

by the $\sigma$-additivity of $\mu$. Additionally, $(f_*\mu)(\emptyset) = \mu(f^{-1}(\emptyset)) = \mu(\emptyset) = 0$.[1]

The pushforward measure $f_*\mu$ inherits key finiteness properties from $\mu$. It is finite if $\mu(X) < \infty$, since $(f_*\mu)(Y) = \mu(f^{-1}(Y)) = \mu(X) < \infty$. If $\mu$ is a probability measure (i.e., $\mu(X) = 1$), then $f_*\mu$ is also a probability measure on $Y$.[1]

Notation

The standard notation for the pushforward of a measure $\mu$ on a measurable space $(X, \mathcal{A})$ under a measurable map $f \colon X \to Y$ is $f_*\mu$, defined such that $(f_*\mu)(B) = \mu(f^{-1}(B))$ for $B \in \mathcal{B}$, where $\mathcal{B}$ is the $\sigma$-algebra on $Y$.[5] An equivalent common notation is $f_\#\mu$ or $f\#\mu$, particularly prevalent in the probability and optimal transport literature.[6] Alternative notations include $\mu \circ f^{-1}$, which explicitly emphasizes the preimage operation, and the term "image measure" $\mu_f$ for the transferred measure.[7]

A key convention distinguishes pushforward from pullback notations: subscripts like $f_*$ or $f_\#$ indicate pushforward (forward along $f$), while superscripts such as $f^*$ denote pullback (backward along $f$). This subscript-superscript dichotomy is standard in analysis and geometry to avoid ambiguity with differential forms or densities. In probability theory, the pushforward $f_*\mu$, where $\mu$ is the law of a random variable $X$, is often called the law of $f(X)$, highlighting its role in describing the distribution of transformed random variables.[8] In real analysis, it is frequently referred to as the transferred measure or pushed-forward Lebesgue measure when $\mu$ is Lebesgue measure on $\mathbb{R}^n$ and $f$ is a diffeomorphism, underscoring its use in change-of-variables formulas.[5]

Properties

Change of Variable Formula

The change of variable formula, also known as the substitution rule for integrals with respect to pushforward measures, establishes a fundamental relationship between integration over a measure space and its image under a measurable map. Specifically, let $(X, \mathcal{A}, \mu)$ be a measure space, $(Y, \mathcal{B})$ a measurable space, and $f \colon X \to Y$ a measurable function defining the pushforward measure $\mu_f$ on $\mathcal{B}$ by $\mu_f(B) = \mu(f^{-1}(B))$ for $B \in \mathcal{B}$. For a non-negative measurable function $g \colon Y \to [0, \infty]$, the formula asserts that

$\int_Y g \, d\mu_f = \int_X (g \circ f) \, d\mu.$

This holds whenever $g \circ f$ is measurable, which follows from the measurability of $g$ and $f$.[9][10]

For the formula to apply in the sense of Lebesgue integration, additional conditions ensure integrability: $g$ must be $\mu_f$-integrable, meaning $\int_Y |g| \, d\mu_f < \infty$, which is equivalent to $\int_X |g \circ f| \, d\mu < \infty$ by the formula itself applied to $|g|$. Absolute integrability of $g \circ f$ with respect to $\mu$ thus guarantees the validity of the equality for signed or complex-valued functions, as detailed below. These conditions prevent issues with infinite integrals and ensure that the integrals are well-defined.[9][10]

A proof outline proceeds first for non-negative functions via approximation by simple functions. For a simple function $g = \sum_{i=1}^n c_i \chi_{E_i}$ with $c_i \geq 0$ and $E_i \in \mathcal{B}$, the integral $\int_Y g \, d\mu_f = \sum_{i=1}^n c_i \mu_f(E_i) = \sum_{i=1}^n c_i \mu(f^{-1}(E_i))$, while $\int_X (g \circ f) \, d\mu = \sum_{i=1}^n c_i \mu(f^{-1}(E_i))$, yielding equality by the definition of the pushforward. Any non-negative measurable $g$ can then be approximated from below by an increasing sequence of simple functions $g_n \uparrow g$, and the monotone convergence theorem implies $\int_Y g_n \, d\mu_f \uparrow \int_Y g \, d\mu_f$ and similarly $\int_X (g_n \circ f) \, d\mu \uparrow \int_X (g \circ f) \, d\mu$, establishing the result.[9][10]

The formula extends naturally to signed functions and signed measures on the domain. For a signed function $g = g^+ - g^-$ with $g^+, g^- \geq 0$ and $g$ $\mu_f$-integrable (i.e., $\int_Y |g| \, d\mu_f < \infty$), define $\int_Y g \, d\mu_f = \int_Y g^+ \, d\mu_f - \int_Y g^- \, d\mu_f$; the formula then yields $\int_Y g \, d\mu_f = \int_X (g \circ f) \, d\mu$ by linearity. Similarly, if $\sigma$ is a signed measure on $X$, decomposable as $\sigma = \sigma^+ - \sigma^-$ with positive measures $\sigma^+$, $\sigma^-$, define the pushforward $f_*\sigma = f_*\sigma^+ - f_*\sigma^-$. Then, for $g$ integrable with respect to $|f_*\sigma|$, $\int_Y g \, d(f_*\sigma) = \int_X (g \circ f) \, d\sigma$. For complex-valued $g = g_1 + i g_2$ with real and imaginary parts $\mu_f$-integrable, the equality follows by linearity over the reals. These extensions preserve the measurability and integrability conditions on $g \circ f$.[9][10]

In the context of Lebesgue integration on $\mathbb{R}^n$, the change of variable formula recovers the classical substitution rule as a special case. When $\mu$ is Lebesgue measure $\lambda^n$, $Y = X = \mathbb{R}^n$, and $f$ is a $C^1$-diffeomorphism, the pushforward $f_*\lambda^n$ has density $|\det Df(f^{-1}(y))|^{-1}$ with respect to $\lambda^n$, so for integrable $g \colon \mathbb{R}^n \to \mathbb{R}$,

$\int_{\mathbb{R}^n} g(y) \, d\lambda^n(y) = \int_{\mathbb{R}^n} g(f(x)) \, |\det Df(x)| \, d\lambda^n(x),$

aligning with the standard Jacobian formula in multivariable calculus.[10]
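The Jacobian formula above can be checked numerically in one dimension, where $|\det Df(x)| = |f'(x)|$. A sketch using midpoint Riemann sums for the diffeomorphism $f(x) = e^x$ from $[0, 1]$ onto $[1, e]$ (the integrand $g$ and the grid size are arbitrary choices):

```python
import math

# Check:  ∫_[1,e] g(y) dy  =  ∫_[0,1] g(f(x)) |f'(x)| dx  for f(x) = exp(x).
g = lambda y: y * y
n = 100_000

h1 = (math.e - 1.0) / n       # midpoint Riemann sum over [1, e]
lhs = sum(g(1.0 + (k + 0.5) * h1) for k in range(n)) * h1

h2 = 1.0 / n                  # midpoint Riemann sum over [0, 1]; f'(x) = exp(x)
rhs = sum(g(math.exp((k + 0.5) * h2)) * math.exp((k + 0.5) * h2)
          for k in range(n)) * h2

assert abs(lhs - rhs) < 1e-6
assert abs(lhs - (math.e ** 3 - 1.0) / 3.0) < 1e-6   # exact value of ∫ y² dy
```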

Functoriality

The pushforward operation on measures induces a covariant functor on the category of measure spaces. Specifically, given measurable spaces $(X, \mathcal{A}, \mu)$ and $(Y, \mathcal{B})$, a measurable map $f \colon X \to Y$ defines the pushforward $f_*\mu$ on $(Y, \mathcal{B})$ by $f_*\mu(B) = \mu(f^{-1}(B))$ for $B \in \mathcal{B}$. This assignment extends functorially: if $g \colon Y \to Z$ is another measurable map to a measurable space $(Z, \mathcal{C})$, then the pushforward satisfies the composition rule

$(g \circ f)_*\mu = g_*(f_*\mu),$

meaning the pushforward of the composite map equals the composite of the pushforwards. This functoriality ensures that the pushforward preserves the structure of measurable maps, acting covariantly on the category where objects are measurable spaces equipped with measures and morphisms are measurable functions.[1]

The pushforward preserves certain measure-theoretic properties under appropriate conditions. For $\sigma$-finiteness: if $\mu$ is $\sigma$-finite on $X$, then $f_*\mu$ is $\sigma$-finite provided that the conditional measures on the fibers of $f$ have finite total mass almost everywhere with respect to the pushforward; otherwise it may fail, as for the projection $\phi(x, y) = x$ from the unit square $I^2$ to $I$ under a $\sigma$-finite measure whose density integrates to infinity over vertical fibers. Completeness is preserved if the domain measure space is complete and $f$ is such that null sets in the codomain correspond to null preimages, though this requires the codomain $\sigma$-algebra to be completed accordingly; in general, pushforwards of complete measures need not be complete without additional assumptions on $f$, such as injectivity. These preservations highlight the pushforward's role in maintaining structural integrity across measure spaces.[11]

In the subcategory of probability measures, the pushforward relates closely to the Giry monad, which equips the category of measurable spaces with a monad structure whose functor assigns to each space the space of probability measures on it. Here, a deterministic measurable map $f \colon X \to Y$ induces a Kleisli arrow of the Giry monad by pushing forward probability measures via $f_* \colon \mathcal{P}(X) \to \mathcal{P}(Y)$, where $\mathcal{P}$ denotes the space of probability measures; this corresponds to the Markov kernel that maps $x \in X$ to the Dirac measure $\delta_{f(x)}$ on $Y$. The monad's unit provides the Dirac embedding, and the multiplication averages measures over measures, making pushforwards the deterministic special case of probabilistic morphisms.[12]

Pushforwards can form algebraic structures when the underlying measurable maps do. If a set of measurable maps closed under composition forms a monoid (e.g., the iterates of a map in a dynamical system), the associated pushforward operators on the space of measures inherit a monoid structure via the functorial composition rule, acting as a representation of the original monoid on measures. This occurs, for instance, in the symmetries of probability distributions, where the set of measure-preserving transformations forms a monoid under composition, and pushforwards preserve this algebraic action. Such structures underpin applications in ergodic theory and stochastic processes.[13]

Examples

Probability Distributions

In probability theory, the distribution of a random variable $X$ defined on a probability space $(\Omega, \mathcal{F}, P)$ is given by the pushforward measure $P \circ X^{-1}$, often denoted $X_*P$ and called the law of $X$.[14] This measure assigns to each measurable set $B$ in the codomain the probability $P(X^{-1}(B))$, capturing how the original probability $P$ is transferred through the mapping induced by $X$.[2]

A concrete example arises when $X$ follows a uniform distribution on $[0, 1]$, so $P$ is the Lebesgue measure restricted to this interval. Consider the transformation $Y = X^2$; the pushforward measure of $P$ under this map yields the distribution of $Y$, with probability density function $f_Y(y) = \frac{1}{2\sqrt{y}}$ for $y \in (0, 1]$.[15] This density reflects the piling up of probability near zero under the quadratic map, while values of $X$ near 1 are spread out and so contribute less densely to $Y$.

For transformations by monotone functions, the cumulative distribution function (CDF) of the resulting random variable connects directly to the pushforward. If $Y = g(X)$ where $g$ is strictly increasing and continuous, then the CDF of $Y$ is $F_Y(y) = P(g(X) \leq y) = P(X \leq g^{-1}(y)) = F_X(g^{-1}(y))$, illustrating how the pushforward preserves cumulative probabilities through the inverse mapping.[16]

The chi-squared distribution provides another illustration, arising as the pushforward under the sum-of-squares map of independent standard Gaussian random variables. Specifically, if $Z_1, \dots, Z_k$ are independent standard normals on $\mathbb{R}^k$ with product measure $P$, then the pushforward under the function $(z_1, \dots, z_k) \mapsto \sum_{i=1}^k z_i^2$ yields the chi-squared distribution with $k$ degrees of freedom.[15] For $k = 1$, this reduces to the square of a single standard normal, with density $f(y) = \frac{1}{\sqrt{2\pi y}} e^{-y/2}$ for $y > 0$.[17]
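The $Y = X^2$ example above can be checked empirically: the pushforward law has CDF $F_Y(y) = \sqrt{y}$, so the empirical CDF of squared uniform samples should match $\sqrt{y}$. A Monte Carlo sketch (sample size and test points are arbitrary choices):

```python
import random

random.seed(1)

# X uniform on [0, 1]; the law of Y = X**2 is the pushforward with
# CDF F_Y(y) = sqrt(y), equivalently density 1/(2 sqrt(y)) on (0, 1].
n = 100_000
samples = [random.random() ** 2 for _ in range(n)]

for y in (0.04, 0.25, 0.81):
    empirical = sum(s <= y for s in samples) / n   # empirical CDF at y
    assert abs(empirical - y ** 0.5) < 0.02        # matches sqrt(y)
```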

Geometric Constructions

One prominent geometric construction using pushforward measures induces the standard Lebesgue measure (or Hausdorff 1-measure) on the unit circle $S^1 \subset \mathbb{R}^2$. Consider the parametrization $f \colon [0, 2\pi) \to S^1$ defined by $f(t) = (\cos t, \sin t)$. The pushforward $f_*\lambda$, where $\lambda$ denotes the Lebesgue measure on $[0, 2\pi)$, coincides with the arc-length measure on $S^1$, which has total mass $2\pi$ and agrees with the 1-dimensional Hausdorff measure $\mathcal{H}^1$ on $S^1$.[18] This construction is absolutely continuous with respect to $\mathcal{H}^1$ on $S^1$, as the map $f$ is Lipschitz with constant speed $|f'(t)| = 1$.[19]

Another key example arises in Euclidean spaces, where Gaussian measures on $\mathbb{R}^n$ are pushed forward under linear transformations. Let $\gamma$ be the standard Gaussian measure on $\mathbb{R}^n$ with mean zero and identity covariance, and let $A \colon \mathbb{R}^n \to \mathbb{R}^m$ be a linear map represented by an $m \times n$ matrix. The pushforward $A_*\gamma$ is a Gaussian measure on $\mathbb{R}^m$ with mean zero and covariance matrix $AA^T$, concentrating its mass along the range of $A$, with quadratic form given by the inverse covariance.[20] If $A$ has full rank $m \leq n$, then $A_*\gamma$ is absolutely continuous with respect to Lebesgue measure $\lambda_m$ on $\mathbb{R}^m$, with density proportional to $\exp\left(-\frac{1}{2} x^T (AA^T)^{-1} x\right)$; otherwise, it is singular with respect to $\lambda_m$.[21]

Pushforward measures also facilitate the construction of Hausdorff measures on fractal sets via parametrizations from measures on suitable parameter domains. For self-similar fractals generated by an iterated function system (IFS), such as the middle-thirds Cantor set in $[0, 1]$, the Hausdorff measure $\mathcal{H}^d$ (where $d = \log 2 / \log 3 \approx 0.631$) on the attractor can be realized, up to normalization, as the pushforward of the infinite product Bernoulli measure with equal probabilities $1/2$ on the symbolic space $\{0, 1\}^{\mathbb{N}}$ under the coding map $\Psi \colon \{0, 1\}^{\mathbb{N}} \to [0, 1]$, $\Psi((i_k)) = \sum_{k=1}^\infty 2 i_k 3^{-k}$.[22] More generally, for the Sierpinski gasket in $\mathbb{R}^2$ (with $d = \log 3 / \log 2 \approx 1.585$), the symbolic space is $\{1, 2, 3\}^{\mathbb{N}}$ equipped with the infinite product Bernoulli measure with probabilities $1/3$ each, and the pushforward under the coding map yields a measure equivalent to $\mathcal{H}^d$ on the gasket, which is singular with respect to Lebesgue measure $\lambda_2$ on $\mathbb{R}^2$.[23] These constructions align the pushforward with the intrinsic dimension of the fractal, capturing its geometric scaling properties.
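The Cantor coding map can be simulated directly: pushing the fair-coin Bernoulli measure through $\Psi$ lands every sample in $[0, 1]$, and the two halves of the Cantor set each receive mass $1/2$ (the first bit decides whether $\Psi((i_k)) \le 1/3$ or $\ge 2/3$). A Monte Carlo sketch with truncated bit sequences:

```python
import random

random.seed(2)

def psi(bits):
    """Coding map Ψ((i_k)) = Σ 2 i_k 3^{-k}, truncated to finitely many bits."""
    return sum(2 * b * 3.0 ** -(k + 1) for k, b in enumerate(bits))

n, depth = 50_000, 40
left = 0
for _ in range(n):
    bits = [random.randint(0, 1) for _ in range(depth)]   # fair Bernoulli bits
    x = psi(bits)
    assert 0.0 <= x <= 1.0
    if x <= 1.0 / 3.0:       # first bit 0  <=>  x lies in the left third
        left += 1

assert abs(left / n - 0.5) < 0.02    # pushforward gives each half mass 1/2
```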

Applications

Dynamical Systems

In dynamical systems, the pushforward measure plays a central role in defining invariant measures for transformations. Given a measurable map $f \colon X \to X$ on a measure space $(X, \mathcal{A}, \mu)$, the measure $\mu$ is said to be $f$-invariant if $f_*\mu = \mu$, meaning that $\mu(f^{-1}(A)) = \mu(A)$ for every measurable set $A \in \mathcal{A}$. This condition ensures that the measure remains unchanged under the action of $f$, preserving the probabilistic or geometric structure of the space. Such invariant measures are fundamental for studying long-term behavior in systems governed by iterative application of $f$.

A classic example arises for rotations of the unit circle $\mathbb{T} = \mathbb{R}/\mathbb{Z}$, equipped with Lebesgue measure $\lambda$. For an irrational rotation $R_\alpha \colon x \mapsto x + \alpha \pmod{1}$, where $\alpha \in \mathbb{R} \setminus \mathbb{Q}$, the pushforward satisfies $R_{\alpha*}\lambda = \lambda$, making $\lambda$ invariant. This invariance reflects the uniform distribution preserved by the irrational rotation, leading to dense orbits and equidistribution properties essential in ergodic analysis.

More generally, quasi-invariant measures extend this framework to transformations that may distort volumes but preserve null sets. A measure $\mu$ is quasi-invariant under $f$ if $f_*\mu \sim \mu$, meaning $f_*\mu$ and $\mu$ are equivalent (they share the same null sets), and the Radon-Nikodym derivative $\frac{d(f_*\mu)}{d\mu}$ exists and is positive $\mu$-almost everywhere. This derivative quantifies the local stretching or contraction induced by $f$, allowing the study of non-volume-preserving dynamics, such as non-singular transformations.

The connection to ergodic theory highlights how pushforwards underpin the preservation of integrals. For an invariant measure $\mu$, the pushforward ensures $\int g \, d(f_*\mu) = \int (g \circ f) \, d\mu$ for integrable $g$, so that under ergodicity, where invariant sets have $\mu$-measure $0$ or $1$, time averages along orbits converge to space averages. This integral preservation facilitates the analysis of mixing and recurrence in such systems.

An illustrative example is the Bernoulli shift on the symbolic space $\{0, 1\}^{\mathbb{Z}}$, where the shift map $\sigma \colon (x_n)_{n \in \mathbb{Z}} \mapsto (x_{n+1})_{n \in \mathbb{Z}}$ acts by shifting sequences. The product measure $\mu = \left(\frac{1}{2}\delta_0 + \frac{1}{2}\delta_1\right)^{\mathbb{Z}}$ is invariant under $\sigma$, as $\sigma_*\mu = \mu$, and the system is ergodic, modeling independent coin flips whose uniform distribution is preserved across shifts. This construction exemplifies mixing and serves as a prototype for isomorphism classifications in ergodic theory.
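The invariance $R_{\alpha*}\lambda = \lambda$ manifests in orbits: by equidistribution, the fraction of an irrational-rotation orbit falling in an interval approaches the interval's Lebesgue measure. A sketch (the rotation number $\sqrt{2} - 1$ and the test interval are arbitrary choices):

```python
import math

# Orbit of an irrational rotation x -> x + α (mod 1) equidistributes with
# respect to Lebesgue measure, reflecting the invariance R_α* λ = λ.
alpha = math.sqrt(2) - 1          # irrational rotation number
a, b = 0.2, 0.7                   # test interval [a, b)
x, n, hits = 0.0, 100_000, 0
for _ in range(n):
    if a <= x < b:
        hits += 1
    x = (x + alpha) % 1.0

assert abs(hits / n - (b - a)) < 0.01   # time average ≈ λ([a, b)) = 0.5
```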

Statistics and Optimal Transport

In statistical inference, pushforward measures describe the induced distributions of transformed random variables, such as test statistics or estimators derived from observed data. For instance, under a null hypothesis in non-parametric testing, the distribution of a kernel-based test statistic is the pushforward of the empirical data measure under the statistic's mapping, enabling the computation of p-values and critical regions for detecting distributional differences. Similarly, the sampling distribution of an estimator $\hat{\theta}$ in a parametric model is the pushforward of the data-generating measure under the estimation map, which facilitates asymptotic analysis and confidence-interval construction in procedures like maximum likelihood estimation. This transformation perspective unifies various inference tasks by framing them as measure operations that preserve probabilistic structure while adapting to model assumptions.

In optimal transport theory, pushforward measures are central to the definition and computation of Wasserstein distances, which quantify dissimilarity between probability distributions. In the Monge formulation, the $p$-Wasserstein distance $W_p(\mu, \nu)$ between measures $\mu$ and $\nu$ on a metric space is given by

$W_p(\mu, \nu) = \left( \inf_{T_\#\mu = \nu} \int \|x - T(x)\|^p \, d\mu(x) \right)^{1/p},$

where the infimum is over measurable maps $T$ such that the pushforward $T_\#\mu = \nu$, and $\|\cdot\|$ denotes the ground metric; this formulation treats deterministic transport plans as pushforwards that minimize the expected transport cost. The distance has become a cornerstone for comparing empirical distributions in high-dimensional settings, with applications in domain adaptation and generative modeling, where aligning pushforwards ensures metric-aware matching.

Normalizing flows extend this idea to machine learning by using compositions of invertible neural network transformations to push forward a simple base measure, often a standard Gaussian, onto complex target distributions approximating real data. These flows enable exact likelihood computation via the change-of-variables formula while generating samples by applying the inverse map, making them powerful for density estimation and variational inference in tasks like anomaly detection and image synthesis.

Computationally, the Sinkhorn algorithm addresses the scalability of optimal transport by iteratively solving an entropy-regularized problem that approximates optimal couplings, from which near-optimal transport maps can be extracted to define pushforwards between marginals. By adding an entropy term to the transport cost, the algorithm yields smooth, differentiable solutions amenable to gradient-based optimization, with convergence scaling favorably for large problems in machine learning pipelines. The method is widely used for fast computation of Wasserstein barycenters and for unbalanced transport involving pushforwards.
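The Sinkhorn iterations can be sketched in a few lines for discrete measures; the marginal constraints are exactly pushforward constraints on the coupling. A minimal toy sketch (the point locations, weights, and regularization strength are arbitrary choices, not a production implementation):

```python
import math

# Entropic optimal transport between two discrete measures a (on xs) and
# b (on ys), via Sinkhorn iterations on the Gibbs kernel K = exp(-C/eps).
xs, a = [0.0, 1.0], [0.5, 0.5]       # toy source measure
ys, b = [0.0, 2.0], [0.5, 0.5]       # toy target measure
eps = 1.0                            # entropic regularization strength
C = [[(x - y) ** 2 for y in ys] for x in xs]          # squared-distance cost
K = [[math.exp(-c / eps) for c in row] for row in C]  # Gibbs kernel

u, v = [1.0, 1.0], [1.0, 1.0]
for _ in range(500):                 # alternating marginal-matching scalings
    u = [a[i] / sum(K[i][j] * v[j] for j in range(2)) for i in range(2)]
    v = [b[j] / sum(K[i][j] * u[i] for i in range(2)) for j in range(2)]

# Transport plan P = diag(u) K diag(v); its marginals match a and b,
# i.e. the coupling pushes forward onto the two prescribed measures.
P = [[u[i] * K[i][j] * v[j] for j in range(2)] for i in range(2)]
row = [sum(P[i][j] for j in range(2)) for i in range(2)]
col = [sum(P[i][j] for i in range(2)) for j in range(2)]
assert all(abs(row[i] - a[i]) < 1e-6 for i in range(2))
assert all(abs(col[j] - b[j]) < 1e-6 for j in range(2))
```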

Generalizations

Transfer Operators

In the context of dynamical systems, the pushforward measure induces a transfer operator on the space of probability densities, describing how densities evolve under a map $T \colon X \to X$. For a nonsingular map $T$ on an interval, the transfer operator $P_T$, also known as the Frobenius-Perron operator, acts on a density $\rho$ by

$(P_T \rho)(y) = \sum_{x \colon T(x) = y} \frac{\rho(x)}{|T'(x)|},$

where the sum is over all preimages of $y$ under $T$, assuming $T'$ exists and is nonzero at those points.[24] For invertible maps, this reduces to a single term reflecting the change-of-variables formula. The operator preserves the total mass of the density, so $\int (P_T \rho)(y) \, dy = 1$ if $\rho$ is a probability density.[24]

The Frobenius-Perron operator is the adjoint of the Koopman operator $U_T g = g \circ T$, which acts on observables (bounded functions) by composition with $T$. Specifically, for densities in $L^1$ and observables in $L^\infty$, the duality relation $\int (P_T \rho)\, g \, d\mu = \int \rho \,(U_T g) \, d\mu$ holds with respect to a reference measure $\mu$, such as Lebesgue measure. The pushforward measure $T_*\mu$ is the measure-theoretic counterpart of this operator: applying $P_T$ to the density of $\mu$ yields the density of $T_*\mu$.[25]

Spectral properties of the Frobenius-Perron operator reveal key features of the underlying dynamics; in particular, its fixed points $\rho$ satisfying $P_T \rho = \rho$ are precisely the densities of $T$-invariant probability measures absolutely continuous with respect to the reference measure. The leading eigenvalue is typically $1$, corresponding to these invariant densities, while subleading eigenvalues govern the decay of correlations and mixing rates.[26]

A concrete example is the logistic map $T_r(x) = r x (1 - x)$ on $[0, 1]$ with parameter $r > 1$, where the transfer operator $P_{T_r}$ explicitly sums contributions from the two preimages of each point $y$, weighted by the reciprocal of the derivative's magnitude at those preimages. For $r = 4$, the invariant density is explicitly $\rho(x) = \frac{1}{\pi\sqrt{x(1-x)}}$, a fixed point of $P_{T_4}$, illustrating ergodicity and mixing.[27]
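The logistic-map claim can be verified pointwise: for $T(x) = 4x(1-x)$, each $y \in (0, 1)$ has preimages $x_\pm = (1 \pm \sqrt{1-y})/2$, where $|T'(x_\pm)| = 4\sqrt{1-y}$, and the arcsine density is fixed by $P_{T_4}$ exactly. A short check:

```python
import math

def rho(x):
    """Invariant (arcsine) density of the r = 4 logistic map."""
    return 1.0 / (math.pi * math.sqrt(x * (1.0 - x)))

def transfer(y):
    """(P_T rho)(y): sum over the two preimages of y under T(x) = 4x(1-x)."""
    s = math.sqrt(1.0 - y)
    x1, x2 = (1.0 - s) / 2.0, (1.0 + s) / 2.0
    dT = 4.0 * s                   # |T'(x)| = |4 - 8x| at both preimages
    return rho(x1) / dT + rho(x2) / dT

for y in (0.1, 0.37, 0.5, 0.9):
    assert abs(transfer(y) - rho(y)) < 1e-12   # fixed point: P_T rho = rho
```

The identity is exact: both preimages satisfy $x_\pm(1 - x_\pm) = y/4$, so each contributes $\frac{2}{\pi\sqrt{y}} \cdot \frac{1}{4\sqrt{1-y}}$, and the two terms sum to $\rho(y)$.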

Disintegration and Extensions

The disintegration theorem provides a fundamental decomposition of a measure $\mu$ on a measurable space $(X, \Sigma)$ with respect to a measurable map $f \colon X \to Y$ to another measurable space $(Y, T)$, where the pushforward measure $\nu = f_*\mu$ on $Y$ serves as the base. Specifically, under suitable conditions, such as $\mu$ being $\sigma$-finite and the spaces being standard Borel or Polish, there exists a family of probability measures $\{\nu_y\}_{y \in Y}$ on $X$, unique up to $\nu$-almost-everywhere equality, such that each $\nu_y$ is concentrated on the fiber $f^{-1}(y)$ (i.e., $\nu_y(f^{-1}(y)) = 1$) and the original measure disintegrates as

$\mu(E) = \int_Y \nu_y(E \cap f^{-1}(y)) \, d\nu(y)$

for every $E \in \Sigma$.[28] This formulation extends the intuitive notion of conditional measures along the fibers of $f$, with the pushforward $\nu$ parameterizing the decomposition.[29]

Extensions of the disintegration theorem apply beyond probability measures to more general settings, including $\sigma$-finite or Radon measures on non-compact spaces. For instance, when $\mu$ is a totally finite Radon measure and $f$ is a Borel map between locally compact Hausdorff spaces, the family $\{\mu_y\}$ consists of Radon measures on the fibers, satisfying the integral decomposition without requiring normalization to probabilities.[28] In infinite-dimensional or non-locally-compact spaces, such as separable metric spaces with countably compact measures, the theorem holds provided the pushforward $\nu$ is analytic, ensuring the existence of measurable selections for the fiber measures.[28] These generalizations facilitate applications to infinite product measures and Gaussian processes on Hilbert spaces, where fibers may be uncountable.[28]

The disintegration theorem is intimately related to conditional expectations in $L^1(\mu)$-spaces and to martingale theory in stochastic processes. For an integrable function $g \colon X \to \mathbb{R}$, the function $h(x) = \int g \, d\nu_{f(x)}$ provides a version of the conditional expectation $\mathbb{E}[g \mid f]$, which is $\Sigma_f$-measurable (where $\Sigma_f = \{f^{-1}(F) \colon F \in T\}$) and satisfies $\int_A h \, d\mu = \int_A g \, d\mu$ for every $A \in \Sigma_f$.[28] In stochastic processes, this connection manifests in filtrations, where disintegrations yield conditional distributions given stopping times, underpinning martingale convergence and optional sampling theorems; for example, in Brownian-motion filtrations, the fiber measures $\nu_y$ represent conditional laws that preserve martingale properties almost surely.[29]

Pushforward measures also arise naturally in the theory of fiber bundles, where they describe integration over the fibers of a fibered measure space. For a fiber bundle $\pi \colon E \to B$ with a measure $\mu$ on the total space $E$, the disintegration theorem gives

$\mu(\pi^{-1}(F)) = \int_F \mu_b(\pi^{-1}(b)) \, d(\pi_*\mu)(b)$

for measurable $F \subseteq B$, where the $\mu_b$ are the fiber measures. In homogeneous spaces modeled as principal bundles $G \to G/H$ for locally compact groups, explicit formulas for pushforwards incorporate modular functions to account for the geometry: for a density $\phi$ on $G$, the pushforward satisfies relations like

$\int_{G/H} \left( \int_H \nu(gh) \, dh \right) d\bigl(p_*(\phi \, dg)\bigr) = \int_G \nu(g) \left( \int_H \phi(gh) \frac{\Delta_G(h)}{\Delta_H(h)} \, dh \right) dg,$

enabling computations in ergodic theory and representation theory.[30]
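For a discrete measure and a projection map, the disintegration is just conditioning, and the reassembly formula can be checked exactly. A sketch with an arbitrary toy measure on pairs:

```python
from collections import Counter, defaultdict

# Disintegration of a discrete measure mu on X = pairs (y, i) along the
# projection f(y, i) = y: base measure nu = f_* mu on Y, plus conditional
# probability measures nu_y on each fiber f^{-1}(y).
mu = {("a", 0): 0.1, ("a", 1): 0.3, ("b", 0): 0.2, ("b", 1): 0.4}
f = lambda p: p[0]

nu = Counter()
for p, m in mu.items():
    nu[f(p)] += m                 # pushforward: nu(y) = mu(fiber over y)

nu_y = defaultdict(dict)
for p, m in mu.items():
    nu_y[f(p)][p] = m / nu[f(p)]  # normalized fiber (conditional) measure

# Reassemble mu(E) = ∫_Y nu_y(E ∩ f^{-1}(y)) dnu(y) for a test set E:
E = {("a", 1), ("b", 0)}
reassembled = sum(nu[y] * sum(w for p, w in nu_y[y].items() if p in E)
                  for y in nu)
assert abs(reassembled - (0.3 + 0.2)) < 1e-12   # agrees with mu(E)
```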

