Hubbry Logo
Multinomial theoremMultinomial theoremMain
Open search
Multinomial theorem
Community hub
Multinomial theorem
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Multinomial theorem
Multinomial theorem
from Wikipedia

In mathematics, the multinomial theorem describes how to expand a power of a sum in terms of powers of the terms in that sum. It is the generalization of the binomial theorem from binomials to multinomials.

Theorem

[edit]

For any positive integer m and any non-negative integer n, the multinomial theorem describes how a sum with m terms expands when raised to the nth power: where is a multinomial coefficient.[1] The sum is taken over all combinations of nonnegative integer indices k1 through km such that the sum of all ki is n. That is, for each term in the expansion, the exponents of the xi must add up to n.[2][a]

In the case m = 2, this statement reduces to that of the binomial theorem.[2]

Example

[edit]

The third power of the trinomial a + b + c is given by This can be computed by hand using the distributive property of multiplication over addition and combining like terms, but it can also be done (perhaps more easily) with the multinomial theorem. It is possible to "read off" the multinomial coefficients from the terms by using the multinomial coefficient formula. For example, the term has coefficient , the term has coefficient , and so on.

Alternate expression

[edit]

The statement of the theorem can be written concisely using multiindices:

where

and

Proof

[edit]

This proof of the multinomial theorem uses the binomial theorem and induction on m.

First, for m = 1, both sides equal x1n since there is only one term k1 = n in the sum. For the induction step, suppose the multinomial theorem holds for m. Then

by the induction hypothesis. Applying the binomial theorem to the last factor,

which completes the induction. The last step follows because

as can easily be seen by writing the three coefficients using factorials as follows:

Multinomial coefficients

[edit]

The numbers

appearing in the theorem are the multinomial coefficients. They can be expressed in numerous ways, including as a product of binomial coefficients or of factorials:

Sum of all multinomial coefficients

[edit]

The substitution of xi = 1 for all i into the multinomial theorem

gives immediately that

Number of multinomial coefficients

[edit]

The number of terms in a multinomial sum, #n,m, is equal to the number of monomials of degree n on the variables x1, …, xm:

The count can be performed easily using the method of stars and bars.

Valuation of multinomial coefficients

[edit]

The largest power of a prime p that divides a multinomial coefficient may be computed using a generalization of Kummer's theorem.

Asymptotics

[edit]

By Stirling's approximation, or equivalently the log-gamma function's asymptotic expansion, so for example,

Interpretations

[edit]

Ways to put objects into bins

[edit]

The multinomial coefficients have a direct combinatorial interpretation, as the number of ways of depositing n distinct objects into m distinct bins, with k1 objects in the first bin, k2 objects in the second bin, and so on.[3]

Number of ways to select according to a distribution

[edit]

In statistical mechanics and combinatorics, if one has a number distribution of labels, then the multinomial coefficients naturally arise from the binomial coefficients. Given a number distribution {ni} on a set of N total items, ni represents the number of items to be given the label i. (In statistical mechanics i is the label of the energy state.)

The number of arrangements is found by

  • Choosing n1 of the total N to be labeled 1. This can be done ways.
  • From the remaining Nn1 items choose n2 to label 2. This can be done ways.
  • From the remaining Nn1n2 items choose n3 to label 3. Again, this can be done ways.

Multiplying the number of choices at each step results in:

Cancellation results in the formula given above.

Number of unique permutations of words

[edit]
Multinomial coefficient as a product of binomial coefficients, counting the permutations of the letters of MISSISSIPPI.

The multinomial coefficient

is also the number of distinct ways to permute a multiset of n elements, where ki is the multiplicity of each of the ith element. For example, the number of distinct permutations of the letters of the word MISSISSIPPI, which has 1 M, 4 Is, 4 Ss, and 2 Ps, is

Generalized Pascal's triangle

[edit]

One can use the multinomial theorem to generalize Pascal's triangle or Pascal's pyramid to Pascal's simplex. This provides a quick way to generate a lookup table for multinomial coefficients.

A related structure is the multinomial triangle, or generalized Pascal triangle of order m, which may be constructed using the recurrence relation: from which Pascal's rule is recovered when . These multinomial coefficients can be written as closed-form expressions with bounded integer compositions:

and without:[4] (sequence A008287 in the OEIS)

See also

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The multinomial theorem is a fundamental result in that generalizes the to the expansion of powers of sums involving more than two terms. It states that for a non-negative nn and indeterminates x1,x2,,xmx_1, x_2, \dots, x_m, (x1+x2++xm)n=k1+k2++km=nn!k1!k2!km!x1k1x2k2xmkm,(x_1 + x_2 + \dots + x_m)^n = \sum_{k_1 + k_2 + \dots + k_m = n} \frac{n!}{k_1! k_2! \dots k_m!} x_1^{k_1} x_2^{k_2} \dots x_m^{k_m}, where the sum is over all non-negative integers k1,k2,,kmk_1, k_2, \dots, k_m satisfying the equation, and the coefficients n!k1!k2!km!\frac{n!}{k_1! k_2! \dots k_m!} are known as multinomial coefficients. This theorem, which applies specifically to positive integer exponents, originated in the late as an extension of earlier work on binomial expansions, with formalizations attributed to mathematicians such as Jacques Bernoulli and , who generalized expansions to multiple terms during that period. Later contributions, including Abraham de Moivre's 1697 publication on raising multinomials to powers, further developed its theoretical framework. In , the multinomial coefficients count the number of ways to divide nn distinct objects into mm labeled groups of sizes k1,,kmk_1, \dots, k_m, providing a key tool for solving partitioning problems. The theorem also underpins the in , where it models the probabilities of outcomes in experiments with multiple categories, such as dice rolls or genetic patterns. Beyond these areas, it facilitates algebraic manipulations in generating functions and series expansions, though the series may diverge for non-integer or negative exponents, limiting its direct applicability in those cases.

Statement and Coefficients

Theorem Statement

The multinomial theorem generalizes the to the expansion of a power of a sum involving an arbitrary number of terms. For non-negative nn and indeterminates x1,x2,,xmx_1, x_2, \dots, x_m, the theorem states that (x1+x2++xm)n=k1+k2++km=nk1,k2,,km0n!k1!k2!km!x1k1x2k2xmkm,(x_1 + x_2 + \dots + x_m)^n = \sum_{k_1 + k_2 + \dots + k_m = n \atop k_1, k_2, \dots, k_m \geq 0} \frac{n!}{k_1! k_2! \dots k_m!} x_1^{k_1} x_2^{k_2} \dots x_m^{k_m}, where the sum is taken over all ordered mm-tuples of non-negative integers (k1,k2,,km)(k_1, k_2, \dots, k_m) satisfying the condition k1+k2++km=nk_1 + k_2 + \dots + k_m = n. This summation can be expressed more compactly using , where a multi-index k=(k1,k2,,km)\mathbf{k} = (k_1, k_2, \dots, k_m) is a vector of non-negative integers with k=k1+k2++km=n|\mathbf{k}| = k_1 + k_2 + \dots + k_m = n, and the is n!k!=n!k1!k2!km!\frac{n!}{\mathbf{k}!} = \frac{n!}{k_1! k_2! \dots k_m!}.

Multinomial Coefficients

The multinomial is denoted by (nk1,k2,,km)\binom{n}{k_1, k_2, \dots, k_m}, where nn and the kik_i are non-negative integers satisfying i=1mki=n\sum_{i=1}^m k_i = n. This notation generalizes the , which corresponds to the case m=2m=2. The explicit formula for the multinomial coefficient is (nk1,k2,,km)=n!k1!k2!km!.\binom{n}{k_1, k_2, \dots, k_m} = \frac{n!}{k_1! \, k_2! \cdots k_m!}. This expression arises from the representation of permutations adjusted for repeated elements. The multinomial coefficient relates recursively to binomial coefficients through the product identity (nk1,k2,,km)=(nk1)(nk1k2)(nk1km1km),\binom{n}{k_1, k_2, \dots, k_m} = \binom{n}{k_1} \binom{n - k_1}{k_2} \cdots \binom{n - k_1 - \cdots - k_{m-1}}{k_m}, which reflects a sequential partitioning of the total nn into the subgroups of sizes k1,k2,,kmk_1, k_2, \dots, k_m. This chained product of binomials equals the direct multinomial formula for any such partition. Each multinomial coefficient corresponds uniquely to a multi-index (k1,k2,,km)(k_1, k_2, \dots, k_m) consisting of non-negative integers that sum to nn, distinguishing it within the expansion of the multinomial theorem.

Examples and Alternate Forms

Basic Example

A basic illustration of the multinomial theorem is the expansion of (x+y+z)3(x + y + z)^3. The theorem states that this equals the sum over all non-negative integers k1,k2,k3k_1, k_2, k_3 with k1+k2+k3=3k_1 + k_2 + k_3 = 3 of the terms 3!k1!k2!k3!xk1yk2zk3\frac{3!}{k_1! \, k_2! \, k_3!} x^{k_1} y^{k_2} z^{k_3}, where the multinomial coefficients 3!k1!k2!k3!\frac{3!}{k_1! \, k_2! \, k_3!} serve as multipliers for each . To compute step-by-step, consider the possible exponent combinations:
  • For (k1,k2,k3)=(3,0,0)(k_1, k_2, k_3) = (3,0,0): 3!3!0!0!x3=x3\frac{3!}{3! \, 0! \, 0!} x^3 = x^3
  • For (0,3,0)(0,3,0): y3y^3
  • For (0,0,3)(0,0,3): z3z^3
  • For (2,1,0)(2,1,0): 3!2!1!0!x2y=3x2y\frac{3!}{2! \, 1! \, 0!} x^2 y = 3 x^2 y
  • For (2,0,1)(2,0,1): 3x2z3 x^2 z
  • For (1,2,0)(1,2,0): 3xy23 x y^2
  • For (0,2,1)(0,2,1): 3y2z3 y^2 z
  • For (1,0,2)(1,0,2): 3xz23 x z^2
  • For (0,1,2)(0,1,2): 3yz23 y z^2
  • For (1,1,1)(1,1,1): 3!1!1!1!xyz=6xyz\frac{3!}{1! \, 1! \, 1!} x y z = 6 x y z
Collecting like terms yields the full polynomial: x3+y3+z3+3x2y+3x2z+3xy2+3y2z+3xz2+3yz2+6xyzx^3 + y^3 + z^3 + 3x^2 y + 3x^2 z + 3x y^2 + 3y^2 z + 3x z^2 + 3y z^2 + 6 x y z. This expansion reveals symmetry in the coefficients: the pure cubic terms have coefficient 1, the terms with one variable squared and another to the first power have coefficient 3 (arising from the three permutations of the exponents), and the fully mixed term has coefficient 6, reflecting the number of distinct ways to assign the exponents.

Alternate Expressions

One common notational variant of the multinomial theorem employs , where a multi-index α=(α1,,αm)\alpha = (\alpha_1, \dots, \alpha_m) is an mm- of non-negative integers satisfying α=i=1mαi=n|\alpha| = \sum_{i=1}^m \alpha_i = n. In this form, the theorem states that (x1++xm)n=α=nn!α!xα,(x_1 + \cdots + x_m)^n = \sum_{|\alpha| = n} \frac{n!}{\alpha!} x^\alpha, where xα=x1α1xmαmx^\alpha = x_1^{\alpha_1} \cdots x_m^{\alpha_m} and α!=α1!αm!\alpha! = \alpha_1! \cdots \alpha_m!. This symmetric expression compactly captures the expansion by summing over all multi-indices of nn, emphasizing the combinatorial structure of the coefficients. An equivalent expression utilizes falling factorials, defined as (n)k=n(n1)(nk+1)(n)^{\underline{k}} = n(n-1)\cdots(n-k+1) for non-negative integers nn and kk. Since the exponents satisfy k1++km=nk_1 + \cdots + k_m = n, it follows that (n)k1++km=(n)n=n!(n)^{\underline{k_1 + \cdots + k_m}} = (n)^{\underline{n}} = n!, yielding (x1++xm)n=k1++km=n(n)k1++kmk1!km!x1k1xmkm.(x_1 + \cdots + x_m)^n = \sum_{k_1 + \cdots + k_m = n} \frac{(n)^{\underline{k_1 + \cdots + k_m}}}{k_1! \cdots k_m!} x_1^{k_1} \cdots x_m^{k_m}. This form highlights the sequential selection interpretation underlying the multinomial coefficients, akin to chained binomial expansions. The multinomial theorem extends naturally to the ring of over a , where the finite expansion (i=1mxi)n( \sum_{i=1}^m x_i )^n holds identically as a in the ring R[[x1,,xm]]\mathbb{R}[[x_1, \dots, x_m]], without requiring convergence. This context is particularly useful for algebraic manipulations in , treating the variables as indeterminates. The theorem also relates to exponential generating functions as a foundational setup: the exponential generating function exp(ti=1mxi)=n=0tnn!(i=1mxi)n\exp\left( t \sum_{i=1}^m x_i \right) = \sum_{n=0}^\infty \frac{t^n}{n!} \left( \sum_{i=1}^m x_i \right)^n encodes the multinomial expansions, where the coefficient of tn/n!t^n / n! is precisely (i=1mxi)n=k=nn!k!xk\left( \sum_{i=1}^m x_i \right)^n = \sum_{|k|=n} \frac{n!}{k!} x^k.

Proofs

Combinatorial Proof

The of the multinomial theorem proceeds by equating two ways of counting the number of sequences of length nn over an alphabet of mm symbols, where each symbol corresponds to one of the variables x1,,xmx_1, \dots, x_m. One the one hand, each position in the sequence can be assigned any of the mm symbols independently, yielding a total of mnm^n possible sequences. On the other hand, group these sequences by their type multiplicities: for a fixed multi-index (k1,,km)(k_1, \dots, k_m) with ki=n\sum k_i = n and each ki0k_i \geq 0, the sequences with exactly kik_i occurrences of symbol ii (for each ii) contribute to the x1k1xmkmx_1^{k_1} \cdots x_m^{k_m}. The number of such sequences is the number of ways to assign the positions to symbols, accounting for indistinguishability within each symbol type: choose k1k_1 positions out of nn for symbol 1, then k2k_2 out of the remaining nk1n - k_1 for symbol 2, and so on, giving (nk1)(nk1k2)(nk1km1km)=n!k1!k2!km!.\binom{n}{k_1} \binom{n - k_1}{k_2} \cdots \binom{n - k_1 - \cdots - k_{m-1}}{k_m} = \frac{n!}{k_1! k_2! \cdots k_m!}. Thus, the of x1k1xmkmx_1^{k_1} \cdots x_m^{k_m} in the expansion is n!k1!km!\frac{n!}{k_1! \cdots k_m!}. Summing over all multi-indices (k1,,km)(k_1, \dots, k_m) with ki=n\sum k_i = n accounts for every possible sequence exactly once, so the full expansion of (x1++xm)n(x_1 + \cdots + x_m)^n is n!k1!km!x1k1xmkm\sum \frac{n!}{k_1! \cdots k_m!} x_1^{k_1} \cdots x_m^{k_m}, where the sum is over all such multi-indices. This matches the total count of mnm^n sequences, confirming the theorem. Equivalently, the multinomial coefficient n!k1!km!\frac{n!}{k_1! \cdots k_m!} counts the number of distinct permutations of a multiset consisting of kik_i indistinguishable items of type ii (for each ii), which aligns with the sequential position assignments above.

Generating Function Proof

The generating function proof of the multinomial theorem utilizes exponential generating functions and to derive the expansion of (x1++xm)n(x_1 + \dots + x_m)^n. Consider the exponential generating function exp(t(x1++xm))\exp(t (x_1 + \dots + x_m)), which expands as n=0tnn!(x1++xm)n\sum_{n=0}^\infty \frac{t^n}{n!} (x_1 + \dots + x_m)^n. This can equivalently be expressed as the product i=1mexp(txi)\prod_{i=1}^m \exp(t x_i), where each factor exp(txi)=ki=0(txi)kiki!\exp(t x_i) = \sum_{k_i=0}^\infty \frac{(t x_i)^{k_i}}{k_i!}. The product of these series yields k1,,km0(i=1m(txi)kiki!)=k1,,km0tk1++kmk1!km!x1k1xmkm\sum_{k_1,\dots,k_m \geq 0} \left( \prod_{i=1}^m \frac{(t x_i)^{k_i}}{k_i!} \right) = \sum_{k_1,\dots,k_m \geq 0} \frac{t^{k_1 + \dots + k_m}}{k_1! \dots k_m!} x_1^{k_1} \dots x_m^{k_m}. To extract the coefficient of tnn!\frac{t^n}{n!}, group terms where k1++km=nk_1 + \dots + k_m = n. For such terms, tnk1!km!=tnn!n!k1!km!\frac{t^n}{k_1! \dots k_m!} = \frac{t^n}{n!} \cdot \frac{n!}{k_1! \dots k_m!}, so the is k1++km=nn!k1!km!x1k1xmkm\sum_{k_1 + \dots + k_m = n} \frac{n!}{k_1! \dots k_m!} x_1^{k_1} \dots x_m^{k_m}. Equating this to the earlier expansion gives (x1++xm)n=k1++km=nn!k1!km!x1k1xmkm(x_1 + \dots + x_m)^n = \sum_{k_1 + \dots + k_m = n} \frac{n!}{k_1! \dots k_m!} x_1^{k_1} \dots x_m^{k_m}. This derivation holds in the ring of , where operations are algebraic and convergence is not required, ensuring the equality is valid for coefficient extraction regardless of the specific values of the variables.

Properties of Coefficients

Sums and Identities

The sum of all multinomial coefficients (nk1,k2,,km)\binom{n}{k_1, k_2, \dots, k_m} over all non-negative integers k1+k2++km=nk_1 + k_2 + \dots + k_m = n equals mnm^n. This identity arises directly from the multinomial theorem by substituting xi=1x_i = 1 for each i=1,,mi = 1, \dots, m, yielding (1+1++1)n=mn(1 + 1 + \dots + 1)^n = m^n on the left side. The number of such multinomial coefficients, corresponding to the number of distinct terms in the multinomial expansion, is the number of non-negative solutions to k1++km=nk_1 + \dots + k_m = n, which is (n+m1m1)\binom{n + m - 1}{m - 1}. This count reflects the of the of homogeneous polynomials of degree nn in mm variables. A key identity involving sums of products of multinomial coefficients is the multinomial Vandermonde , which generalizes the binomial case. It states that (rj1,,jm)(sk1j1,,kmjm)=(r+sk1,,km),\sum \binom{r}{j_1, \dots, j_m} \binom{s}{k_1 - j_1, \dots, k_m - j_m} = \binom{r + s}{k_1, \dots, k_m}, where the sum is over all non-negative integers j1,,jmj_1, \dots, j_m such that jikij_i \leq k_i for each ii and j1++jm=rj_1 + \dots + j_m = r. This arises in the product of two multinomial expansions and has applications in generating functions. Sums of multinomial coefficients weighted by powers related to subsets of variables provide additional identities. Consider a subset S{1,,m}S \subseteq \{1, \dots, m\}; the sum (nk1,,km)aiSki\sum \binom{n}{k_1, \dots, k_m} a^{\sum_{i \in S} k_i}, taken over k1++km=nk_1 + \dots + k_m = n, equals (Sa+(mS))n(|S| \cdot a + (m - |S|))^n. This follows from setting xi=ax_i = a for iSi \in S and xi=1x_i = 1 otherwise in the multinomial theorem, establishing a direct relation to powers of linear combinations.

Asymptotic Behavior

The asymptotic behavior of multinomial coefficients (nk1,,km)\binom{n}{k_1, \dots, k_m} for large nn depends on the regime of the kik_i. When the kik_i are proportional to nn, with pi=ki/np_i = k_i / n fixed and pi=1\sum p_i = 1, applied to the factorials yields log(nk1,,km)nH(p)\log \binom{n}{k_1, \dots, k_m} \approx n H(p), where H(p)=i=1mpilogpiH(p) = -\sum_{i=1}^m p_i \log p_i is the Shannon entropy function. This approximation captures the leading exponential growth, with the maximum occurring at the uniform distribution pi=1/mp_i = 1/m for fixed mm. The local central limit theorem provides a refined approximation for the multinomial distribution, describing concentration around the mean μ=np\mathbf{\mu} = n \mathbf{p} for specified probabilities p\mathbf{p}, including the uniform case. Specifically, for k\mathbf{k} near μ\mathbf{\mu}, the probability mass P(K=k)(2πn)(m1)/2(detΣ)1/2exp(12n(kμ)TΣ1(kμ))P(\mathbf{K} = \mathbf{k}) \approx (2\pi n)^{-(m-1)/2} (\det \Sigma)^{-1/2} \exp\left( -\frac{1}{2n} (\mathbf{k} - \mathbf{\mu})^T \Sigma^{-1} (\mathbf{k} - \mathbf{\mu}) \right), where Σ=diag(p)ppT\Sigma = \operatorname{diag}(\mathbf{p}) - \mathbf{p} \mathbf{p}^T is the , with error terms explicit up to order O(n1)O(n^{-1}). This implies corresponding asymptotics for the coefficients via normalization by (nk1,,km)=mn\sum \binom{n}{k_1, \dots, k_m} = m^n. Ratios of consecutive multinomial coefficients, such as those differing by one in a single kik_i, admit bounds and asymptotics expressible via ratios of gamma functions for large nn. Error terms in these approximations vary by regime: for kink_i \propto n, the relative error in the -entropy formula is O((logn)/n)O((\log n)/n), improving with higher-order expansions; for fixed kik_i with ki=s\sum k_i = s and the nsn - s large, (nk1,,km,ns)ns/ki!\binom{n}{k_1, \dots, k_m, n-s} \sim n^s / \prod k_i!, with error O(ns1)O(n^{s-1}). These distinguish the entropic growth for balanced partitions from the regime for sparse ones.

p-adic Valuation

The p-adic valuation vpv_p of the multinomial coefficient (nk1,,km)\dbinom{n}{k_1, \dots, k_m}, where n=k1++kmn = k_1 + \dots + k_m and the kik_i are nonnegative integers, is defined as the highest exponent of a prime pp dividing this coefficient. It is given by the formula vp((nk1,,km))=i=1msp(ki)sp(n)p1,v_p\left( \dbinom{n}{k_1, \dots, k_m} \right) = \frac{ \sum_{i=1}^m s_p(k_i) - s_p(n) }{p-1}, where sp()s_p(\cdot) denotes the sum of the digits of its argument when expressed in base pp. This expression arises directly from applying de Polignac's formula (also known as Legendre's formula) to the factorial valuations in the definition (nk1,,km)=n!/(k1!km!)\dbinom{n}{k_1, \dots, k_m} = n! / (k_1! \cdots k_m!), since vp(n!)=(nsp(n))/(p1)v_p(n!) = (n - s_p(n))/(p-1). An equivalent formulation is provided by the generalization of Kummer's theorem to multinomial coefficients: the valuation vp((nk1,,km))v_p\left( \dbinom{n}{k_1, \dots, k_m} \right) equals the total number of carries that occur when adding the base-pp expansions of k1,,kmk_1, \dots, k_m to obtain the base-pp expansion of nn. This count is independent of the order of addition or any parenthesization of the sum. For instance, consider p=2p = 2 and (42,1,1)=12\dbinom{4}{2, 1, 1} = 12, so v2(12)=2v_2(12) = 2. In base 2, 2=1022 = 10_2, 1=0121 = 01_2, 1=0121 = 01_2, and 4=10024 = 100_2. Adding the least significant digits: 0+1+1=2=1020 + 1 + 1 = 2 = 10_2 (write 0, carry 1); next digits: 1+0+0+1=2=1021 + 0 + 0 + 1 = 2 = 10_2 (write 0, carry 1); highest digit: 0+0+0+1=10 + 0 + 0 + 1 = 1 (no carry). Thus, there are two carries, matching the valuation. This valuation connects to Lucas' theorem for multinomial coefficients modulo pp, which states that if the base-pp digits of nn are n=njpjn = \sum n_j p^j and of the kik_i are ki=ki,jpjk_i = \sum k_{i,j} p^j with 0nj,ki,j<p0 \leq n_j, k_{i,j} < p, then (nk1,,km)j0(njk1,j,,km,j)(modp),\dbinom{n}{k_1, \dots, k_m} \equiv \prod_{j \geq 0} \dbinom{n_j}{k_{1,j}, \dots, k_{m,j}} \pmod{p}, where each smaller multinomial is computed over integers less than pp (hence directly). The coefficient is divisible by pp (i.e., congruent to 0 modulo pp) precisely when there is at least one carry in the base-pp addition for some digit position jj, as that makes at least one factor (njk1,j,,km,j)=0\dbinom{n_j}{k_{1,j}, \dots, k_{m,j}} = 0 since ki,j>nj\sum k_{i,j} > n_j would be impossible without carry-over. For example, with p=3p = 3 and (31,1,1)=60(mod3)\dbinom{3}{1,1,1} = 6 \equiv 0 \pmod{3}, the base-3 digits are all 1 for the kik_i and 3 = 10310_3 for nn; adding three 1's in the units place gives 1+1+1=3=1031+1+1=3=10_3 (one carry), so the units multinomial (01,1,1)=0\dbinom{0}{1,1,1} = 0 (invalid), confirming divisibility by 3.

Combinatorial Interpretations

Objects into Bins

The multinomial coefficient (nk1,k2,,km)\binom{n}{k_1, k_2, \dots, k_m} counts the number of ways to distribute nn distinct objects into mm distinct bins such that the ii-th bin receives exactly kik_i objects, where i=1mki=n\sum_{i=1}^m k_i = n. This combinatorial interpretation arises from partitioning a set of nn distinct elements into an ordered sequence of mm labeled subsets of the specified sizes. The bins are distinct (labeled), which distinguishes this from unordered partitions where the subsets are indistinguishable except for their contents. This setup extends the stars-and-bars method, traditionally used for distributing nn indistinguishable objects into mm distinct bins without size restrictions (yielding (n+m1m1)\binom{n + m - 1}{m - 1} ways, allowing empty bins). In the multinomial case, the fixed sizes k1,,kmk_1, \dots, k_m and distinct objects adjust the analogy to count ordered partitions of the set, emphasizing the labeled nature of the bins over mere numerical compositions of the nn. For indistinguishable objects with fixed sizes, there would be only one such distribution per labeling, but the distinguishability of objects introduces the structure in the coefficient. A useful visualization involves up the nn distinct objects in a row (in n!n! possible orders) and inserting m1m-1 dividers to separate them into consecutive segments of lengths k1,k2,,kmk_1, k_2, \dots, k_m, assigning each segment to the corresponding labeled bin. Since the order within each bin does not matter, divide by ki!k_i! for each segment to account for the internal s, yielding the multinomial . This linear highlights the ordered partitioning, contrasting with unordered set partitions that would require additional division by symmetries among equal-sized subsets.

Permutations of Multisets

The multinomial coefficient (nk1,k2,,km)=n!k1!k2!km!\binom{n}{k_1, k_2, \dots, k_m} = \frac{n!}{k_1! k_2! \dots k_m!} gives the number of distinct permutations of a multiset with mm types of elements, where there are kik_i identical elements of type ii for each i=1,,mi = 1, \dots, m and n=k1++kmn = k_1 + \dots + k_m. This formula arises because the total arrangements of nn items treating all as distinct would be n!n!, but identical items within each type lead to overcounting by a factor of ki!k_i! for each type, which must be divided out. A classic example is the word "," which forms a with 1 M, 4 I's, 4 S's, and 2 P's (n=11n=11). The number of distinct rearrangements is 11!1!4!4!2!=34650\frac{11!}{1! \cdot 4! \cdot 4! \cdot 2!} = 34650. This counting principle connects to symmetry groups via the action of the SnS_n on the set of all sequences of length nn with the specified multiplicities. By the orbit-stabilizer theorem, the size of the orbit corresponding to the distinct permutations equals Sn|S_n| divided by the order of the stabilizer subgroup, which is the Sk1××SkmS_{k_1} \times \dots \times S_{k_m} of order k1!km!k_1! \dots k_m!. The interpretation extends to general arrangements in sequences, where the multinomial coefficient quantifies the ways to order elements under repetition constraints, such as in forming distinct strings or linear compositions from repeated components. This is combinatorially equivalent to assigning nn distinct positions to the types with fixed counts kik_i per type.

Pascal's Triangle Generalization

The multinomial triangle serves as a higher-dimensional analog to , extending the binomial structure to account for expansions involving m variables. In this array, rows are indexed by the non-negative integer n, with entries corresponding to all non-negative integer compositions k1+k2++km=nk_1 + k_2 + \dots + k_m = n, typically ordered lexicographically. There are (n+m1m1)\binom{n + m - 1}{m - 1} entries in row n. Each multinomial coefficient (nk1,k2,,km)\binom{n}{k_1, k_2, \dots, k_m} in row n is computed recursively as the sum of the up to m predecessor coefficients from row n-1 obtained by decreasing exactly one kjk_j by 1 (for j=1j = 1 to mm), provided kj1k_j \geq 1. A key property of the multinomial triangle arises from the multinomial theorem: the sum of all entries in row n equals mnm^n, reflecting the evaluation of (x1+x2++xm)n(x_1 + x_2 + \dots + x_m)^n at x1=x2==xm=1x_1 = x_2 = \dots = x_m = 1. This generalizes the fact that rows in (m=2) sum to 2n2^n. For visualization, consider m=3, where the row for n=2 in is [(22,0,0)\binom{2}{2,0,0}, (21,1,0)\binom{2}{1,1,0}, (21,0,1)\binom{2}{1,0,1}, (20,2,0)\binom{2}{0,2,0}, (20,1,1)\binom{2}{0,1,1}, (20,0,2)\binom{2}{0,0,2}] = [1, 2, 2, 1, 2, 1], summing to 9 = 3^2. The multinomial triangle inherits and extends several properties from , including symmetry and generalized summation identities. Symmetry manifests in the equality (nk1,k2,,km)=(nkσ(1),kσ(2),,kσ(m))\binom{n}{k_1, k_2, \dots, k_m} = \binom{n}{k_{\sigma(1)}, k_{\sigma(2)}, \dots, k_{\sigma(m)}} for any σ\sigma of the indices, ensuring the array is invariant under reordering of the parts.

Applications

Probability Distributions

The arises in as a generalization of the to scenarios involving multiple categories. It models the outcomes of nn independent trials, each with mm possible results having probabilities p1,p2,,pmp_1, p_2, \dots, p_m where i=1mpi=1\sum_{i=1}^m p_i = 1. Let K=(K1,K2,,Km)\mathbf{K} = (K_1, K_2, \dots, K_m) denote the counts of occurrences for each category, with i=1mKi=n\sum_{i=1}^m K_i = n. The is given by P(K=k)=(nk1,k2,,km)p1k1p2k2pmkm,P(\mathbf{K} = \mathbf{k}) = \binom{n}{k_1, k_2, \dots, k_m} p_1^{k_1} p_2^{k_2} \cdots p_m^{k_m}, where k=(k1,k2,,km)\mathbf{k} = (k_1, k_2, \dots, k_m) satisfies i=1mki=n\sum_{i=1}^m k_i = n and ki0k_i \geq 0 are integers, and the multinomial coefficient is (nk1,,km)=n!k1!k2!km!\binom{n}{k_1, \dots, k_m} = \frac{n!}{k_1! k_2! \cdots k_m!}. This derives its validity directly from the multinomial theorem. The theorem states that (p1+p2++pm)n=(nk1,,km)p1k1pmkm(p_1 + p_2 + \dots + p_m)^n = \sum \binom{n}{k_1, \dots, k_m} p_1^{k_1} \cdots p_m^{k_m}, where the sum is over all non-negative integers kik_i with ki=n\sum k_i = n. Since pi=1\sum p_i = 1, the left side equals 1n=11^n = 1, confirming that the probabilities sum to unity and normalize the distribution. The moments of the multinomial distribution reflect its structure as counts from categorized trials. The expected value for each component is E[Ki]=npiE[K_i] = n p_i. The variance of each component is Var(Ki)=npi(1pi)\mathrm{Var}(K_i) = n p_i (1 - p_i), and the covariance between distinct components is Cov(Ki,Kj)=npipj\mathrm{Cov}(K_i, K_j) = -n p_i p_j for iji \neq j, indicating negative dependence since the total count is fixed at nn. As a natural extension of the , which corresponds to the case m=2m=2, the multinomial arises from nn independent trials each with mm outcomes. Moreover, the of any single KiK_i follows a Bin(n,pi)\mathrm{Bin}(n, p_i), obtained by grouping all other categories as a single "failure" outcome.

Generating Functions in Algebra

The multinomial theorem provides a foundational expansion for ordinary generating functions in multiple variables, particularly for fixed degree nn. It states that (x1++xm)n=k1++km=nki0(nk1,,km)x1k1xmkm,(x_1 + \dots + x_m)^n = \sum_{\substack{k_1 + \dots + k_m = n \\ k_i \geq 0}} \binom{n}{k_1, \dots, k_m} x_1^{k_1} \cdots x_m^{k_m}, where (nk1,,km)=n!k1!km!\binom{n}{k_1, \dots, k_m} = \frac{n!}{k_1! \cdots k_m!} is the multinomial . This directly represents the ordinary whose are the multinomial coefficients, enabling algebraic manipulation and extraction in polynomial rings. Extending to an infinite series, the n=0(x1++xm)ntn=11t(x1++xm)\sum_{n=0}^\infty (x_1 + \dots + x_m)^n t^n = \frac{1}{1 - t(x_1 + \dots + x_m)} incorporates the theorem's expansion term by term, facilitating the study of and their algebraic properties in . In the realm of exponential generating functions, the multinomial theorem plays a crucial role in enumerating labeled algebraic structures through series expansions. The key identity is exp(x1++xm)=n=0(x1++xm)nn!=k1,,km0x1k1xmkmk1!km!,\exp(x_1 + \dots + x_m) = \sum_{n=0}^\infty \frac{(x_1 + \dots + x_m)^n}{n!} = \sum_{k_1, \dots, k_m \geq 0} \frac{x_1^{k_1} \cdots x_m^{k_m}}{k_1! \cdots k_m!}, where the inner power is expanded via the multinomial theorem to yield coefficients 1k1!km!\frac{1}{k_1! \cdots k_m!} for each term with n=k1++kmn = k_1 + \dots + k_m. This form arises in the multiplication principle for exponential generating functions: the product of such functions A1(x)Ad(x)A_1(x) \cdots A_d(x) has coefficients given by \sum_{\substack{i_1 + \dots + i_d = n \\ i_j \geq 0}} \binom{n}{i_1, \dots, i_d} a_1^{(1)}_{i_1} \cdots a_d^{(d)}_{i_d}, directly invoking the theorem to partition labels among components in algebraic compositions. These expansions extend to applications in combinatorial , such as theory and . In combinatorial , the exponential formula constructs generating functions for composite structures, where multinomial coefficients account for labeled partitions into connected components, as in the EGF exp(i=1mxi)\exp\left(\sum_{i=1}^m x_i\right) for assemblies of typed atoms. Similarly, the of the SnS_n, used in algebraic of symmetries, groups terms by cycle types via multinomial-like factorials: the number of permutations with cycle lengths specified by multiplicities mjm_j is n!jjmjmj!\frac{n!}{\prod_j j^{m_j} m_j!}, derived from partitioning nn elements. The theorem thus enables inversion by expanding powers to isolate specific coefficients in these series, supporting algebraic derivations in problems.

Computational Aspects

The multinomial coefficient (nk1,k2,,km)\binom{n}{k_1, k_2, \dots, k_m} for k1+k2++km=nk_1 + k_2 + \dots + k_m = n can be computed directly as the product (nk1)(nk1k2)(i=jmkikj)\binom{n}{k_1} \binom{n-k_1}{k_2} \cdots \binom{\sum_{i=j}^m k_i}{k_j} over successive binomial coefficients, avoiding the direct evaluation of large factorials n!n! and ki!k_i! that may cause intermediate overflow. This recursive decomposition leverages efficient algorithms for individual binomial coefficients, such as the multiplicative formula that computes (ab)\binom{a}{b} in O(b)O(b) arithmetic operations by iteratively multiplying terms ai+1i\frac{a - i + 1}{i} for i=1i = 1 to bb. For computing multiple multinomial coefficients or the full expansion of (x1++xm)n(x_1 + \dots + x_m)^n, dynamic programming algorithms construct a table of intermediate values using the (nk1,,km)=i=1m(n1k1,,ki1,,km)\binom{n}{k_1, \dots, k_m} = \sum_{i=1}^m \binom{n-1}{k_1, \dots, k_i - 1, \dots, k_m}, with to avoid redundant calculations; this achieves O(m(n+m1m1))O\left(m \binom{n + m - 1}{m - 1}\right), equivalent to O(m)O(m) operations per when mm is the number of terms and the table is built sequentially by adding one unit at a time. Such approaches are particularly useful for generating all coefficients in the expansion efficiently, especially when mm is small relative to nn. Symbolic mathematics libraries provide built-in functions for exact computation and expansion under the multinomial theorem. In , the multinomial function computes coefficients symbolically, while the expand method with multivariate polynomials handles full expansions; these support arbitrary precision via the underlying MPmath library. Similarly, Mathematica's Multinomial[{k1, k2, ..., km}] evaluates coefficients exactly, and Expand[(x1 + x2 + ... + xm)^n] generates the full multinomial expansion, with built-in support for high-precision arithmetic. Computing multinomial coefficients for large nn poses challenges due to their rapid growth, often exceeding fixed-precision integer limits and causing overflow. To address this, multiprecision arithmetic libraries such as GMP in C++ or Python's built-in int type (with arbitrary precision) are employed to store and manipulate the large integers required. Alternatively, logarithmic forms compute log(nk1,,km)=logn!logki!\log \binom{n}{k_1, \dots, k_m} = \log n! - \sum \log k_i! using summed log-gamma functions for approximation without overflow, suitable for probabilistic applications. For extremely large nn, where exact computation is infeasible, extended to multinomials provides estimates via (nk1,,km)2πn(ne)n/2πki(kie)ki\binom{n}{k_1, \dots, k_m} \approx \sqrt{2\pi n} \left( \frac{n}{e} \right)^n / \prod \sqrt{2\pi k_i} \left( \frac{k_i}{e} \right)^{k_i}
Add your contribution
Related Hubs
User Avatar
No comments yet.