Belief propagation

Belief propagation

current hub

Write something...

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

About hubStatsRules

See all

Wikipedia

Grokipedia

Belief propagation, also known as sum–product message passing, is a message-passing algorithm for performing inference on graphical models, such as Bayesian networks and Markov random fields. It calculates the marginal distribution for each unobserved node (or variable), conditional on any observed nodes (or variables). Belief propagation is commonly used in artificial intelligence and information theory, and has demonstrated empirical success in numerous applications, including low-density parity-check codes, turbo codes, free energy approximation, and satisfiability.

The algorithm was first proposed by Judea Pearl in 1982, who formulated it as an exact inference algorithm on trees, later extended to polytrees. While the algorithm is not exact on general graphs, it has been shown to be a useful approximate algorithm.

Given a finite set of discrete random variables $X_{1},\ldots ,X_{n}$ with joint probability mass function $p$ , a common task is to compute the marginal distributions of the $X_{i}$ . The marginal of a single $X_{i}$ is defined to be

where $\mathbf {x} '=(x'_{1},\ldots ,x'_{n})$ is a vector of possible values for the $X_{i}$ , and the notation $\mathbf {x} ':x'_{i}=x_{i}$ means that the sum is taken over those $\mathbf {x} '$ whose $i$ th coordinate is equal to $x_{i}$ .

Computing marginal distributions using this formula quickly becomes computationally prohibitive as the number of variables grows. For example, given 100 binary variables $X_{1},\ldots ,X_{100}$ , computing a single marginal $X_{i}$ using $p$ and the above formula would involve summing over $2^{99}\approx 6.34\times 10^{29}$ possible values for $\mathbf {x} '$ . If it is known that the probability mass function $p$ factors in a convenient way, belief propagation allows the marginals to be computed much more efficiently.

Variants of the belief propagation algorithm exist for several types of graphical models (Bayesian networks and Markov random fields in particular). We describe here the variant that operates on a factor graph. A factor graph is a bipartite graph containing nodes corresponding to variables $V$ and factors $F$ , with edges between variables and the factors in which they appear. We can write the joint mass function:

where $\mathbf {x} _{a}$ is the vector of neighboring variable nodes to the factor node $a$ . Any Bayesian network or Markov random field can be represented as a factor graph by using a factor for each node with its parents or a factor for each node with its neighborhood respectively.

The algorithm works by passing real valued functions called messages along the edges between the nodes. More precisely, if $v$ is a variable node and $a$ is a factor node connected to $v$ in the factor graph, then the messages $\mu _{v\to a}$ from $v$ to $a$ and the messages $\mu _{a\to v}$ from $a$ to $v$ are real-valued functions $\mu _{v\to a},\mu _{a\to v}:\operatorname {Dom} (v)\to \mathbb {R}$ , whose domain is the set of values that can be taken by the random variable associated with $v$ , denoted $\operatorname {Dom} (v)$ . These messages contain the "influence" that one variable exerts on another. The messages are computed differently depending on whether the node receiving the message is a variable node or a factor node. Keeping the same notation: