Multinomial distribution

From Wikipedia, the free encyclopedia

Generalization of the binomial distribution

Multinomial Distribution
Parameters	$n\in \{0,1,2,\ldots \}$: number of trials; $k>0$: number of mutually exclusive events (integer); $p_{1},\ldots ,p_{k}$: event probabilities, where $p_{1}+\dots +p_{k}=1$
Support	$\left\lbrace (x_{1},\dots ,x_{k})\,{\Big \vert }\,\sum _{i=1}^{k}x_{i}=n,x_{i}\geq 0\ (i=1,\dots ,k)\right\rbrace$
PMF	${\frac {n!}{x_{1}!\cdots x_{k}!}}p_{1}^{x_{1}}\cdots p_{k}^{x_{k}}$
Mean	$\operatorname {E} (X_{i})=np_{i}$
Variance	$\operatorname {Var} (X_{i})=np_{i}(1-p_{i})$; $\operatorname {Cov} (X_{i},X_{j})=-np_{i}p_{j}~~(i\neq j)$
Entropy	${\begin{aligned}&-\log(n!)-n\sum _{i=1}^{k}p_{i}\log(p_{i})\\&+\sum _{i=1}^{k}\sum _{x_{i}=0}^{n}{\binom {n}{x_{i}}}p_{i}^{x_{i}}(1-p_{i})^{n-x_{i}}\log(x_{i}!)\end{aligned}}$
MGF	$\left(\sum _{i=1}^{k}p_{i}e^{t_{i}}\right)^{n}$
CF	$\left(\sum _{j=1}^{k}p_{j}e^{it_{j}}\right)^{n}$ where $i^{2}=-1$
PGF	$\left(\sum _{i=1}^{k}p_{i}z_{i}\right)^{n}$ for $(z_{1},\ldots ,z_{k})\in \mathbb {C} ^{k}$

In probability theory, the multinomial distribution is a generalization of the binomial distribution. For example, it models the probability of counts for each side of a k-sided die rolled n times. For n independent trials, each of which leads to a success for exactly one of k categories, with each category having a given fixed success probability, the multinomial distribution gives the probability of any particular combination of numbers of successes for the various categories.
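As a minimal sketch of this description, the following Python snippet evaluates the probability mass function $\frac{n!}{x_1!\cdots x_k!}p_1^{x_1}\cdots p_k^{x_k}$ for a given vector of counts. The function name `multinomial_pmf` and the fair six-sided die are illustrative assumptions, not part of any particular library; exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction
from math import factorial


def multinomial_pmf(counts, probs):
    """Probability of observing exactly `counts` in n = sum(counts) trials.

    Hypothetical helper written for this example only.
    """
    if len(counts) != len(probs):
        raise ValueError("counts and probs must have the same length")
    n = sum(counts)
    # Multinomial coefficient n! / (x_1! * ... * x_k!); every partial
    # quotient is an integer, so floor division is exact here.
    coeff = factorial(n)
    for x in counts:
        coeff //= factorial(x)
    # Multiply by p_i ** x_i for each category.
    prob = Fraction(coeff)
    for x, p in zip(counts, probs):
        prob *= Fraction(p) ** x
    return prob


# A fair six-sided die rolled six times: chance that every side shows once.
fair_die = [Fraction(1, 6)] * 6
each_side_once = multinomial_pmf([1, 1, 1, 1, 1, 1], fair_die)
print(each_side_once)  # 5/324
```

The result is $6!/6^{6} = 720/46656 = 5/324$, or about 1.5%.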

When k is 2 and n is 1, the multinomial distribution is the Bernoulli distribution. When k is 2 and n is bigger than 1, it is the binomial distribution. When k is bigger than 2 and n is 1, it is the categorical distribution. The term "multinoulli" is sometimes used for the categorical distribution to emphasize this four-way relationship (so n determines the suffix, and k the prefix).
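The reduction to the binomial and Bernoulli cases can be checked numerically. The sketch below assumes a small hand-written `multinomial_pmf` helper (hypothetical, defined here so the block is self-contained) and compares it with the binomial formula $\binom{n}{x}p^{x}(1-p)^{n-x}$ for k = 2; the values n = 5 and p = 3/10 are arbitrary.

```python
from fractions import Fraction
from math import comb, factorial, prod


def multinomial_pmf(counts, probs):
    # Illustrative helper: n! / (x_1! ... x_k!) * p_1**x_1 * ... * p_k**x_k
    coeff = factorial(sum(counts)) // prod(factorial(x) for x in counts)
    return coeff * prod(p ** x for p, x in zip(probs, counts))


def binomial_pmf(x, n, p):
    return comb(n, x) * p ** x * (1 - p) ** (n - x)


n, p = 5, Fraction(3, 10)

# k = 2: the count vector is (x, n - x) with probabilities (p, 1 - p).
binomial_matches = all(
    multinomial_pmf([x, n - x], [p, 1 - p]) == binomial_pmf(x, n, p)
    for x in range(n + 1)
)

# k = 2 and n = 1: a single Bernoulli trial with success probability p.
bernoulli_success = multinomial_pmf([1, 0], [p, 1 - p])

print(binomial_matches)   # True
print(bernoulli_success)  # 3/10
```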

The Bernoulli distribution models the outcome of a single Bernoulli trial. In other words, it models whether flipping a (possibly biased) coin one time will result in either a success (obtaining a head) or failure (obtaining a tail). The binomial distribution generalizes this to the number of heads from performing n independent flips (Bernoulli trials) of the same coin. The multinomial distribution models the outcome of n trials, where the outcome of each trial has a categorical distribution, such as rolling a (possibly biased) k-sided die n times.
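The die-rolling picture translates directly into a simulation. The following sketch draws n categorical outcomes with the standard library's `random.choices` and tallies them; the function name `roll_counts`, the seed, and the probabilities of the biased three-sided die are assumptions chosen only for illustration.

```python
import random
from collections import Counter


def roll_counts(n, probs, rng):
    """Roll a k-sided die n times and return how often each side came up."""
    sides = range(len(probs))
    # Each roll is one categorical trial with the given side probabilities.
    rolls = rng.choices(sides, weights=probs, k=n)
    tally = Counter(rolls)
    return [tally[side] for side in sides]


rng = random.Random(0)   # fixed seed so the run is repeatable
probs = [0.5, 0.3, 0.2]  # a biased 3-sided die (illustrative values)
counts = roll_counts(10, probs, rng)

# The count vector is one draw from a multinomial distribution
# with n = 10 and the probabilities above; its entries always sum to n.
print(counts, sum(counts))
```

Tallying independent categorical draws is the most direct way to sample a multinomial vector, though it takes time proportional to n.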

Let k be a fixed finite number. Mathematically, we have k possible mutually exclusive outcomes, with corresponding probabilities p₁, ..., p_k, and n independent trials. Since the k outcomes are mutually exclusive and one must occur, we have p_i ≥ 0 for i = 1, ..., k and ${\textstyle \sum _{i=1}^{k}p_{i}=1}$. Then if the random variables X_i indicate the number of times outcome number i is observed over the n trials, the vector X = (X₁, ..., X_k) follows a multinomial distribution with parameters n and p, where p = (p₁, ..., p_k). While the trials are independent, their outcomes X_i are dependent because they must sum to n.
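The dependence between the counts can be made concrete by enumerating the whole support for a small case. The sketch below assumes n = 4 and p = (1/2, 1/3, 1/6), both arbitrary, and uses a hypothetical `multinomial_pmf` helper with exact fractions. It checks that the probabilities sum to 1, that the means equal $np_i$, and that the covariance of two different counts equals $-np_ip_j$.

```python
from fractions import Fraction
from itertools import product
from math import factorial, prod


def multinomial_pmf(counts, probs):
    # Illustrative helper: n! / (x_1! ... x_k!) * p_1**x_1 * ... * p_k**x_k
    coeff = factorial(sum(counts)) // prod(factorial(x) for x in counts)
    return coeff * prod(p ** x for p, x in zip(probs, counts))


n = 4
probs = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]
k = len(probs)

# Support: all non-negative integer vectors of length k that sum to n.
support = [x for x in product(range(n + 1), repeat=k) if sum(x) == n]
pmf = {x: multinomial_pmf(x, probs) for x in support}

total = sum(pmf.values())
mean = [sum(x[i] * q for x, q in pmf.items()) for i in range(k)]
var_0 = sum(x[0] ** 2 * q for x, q in pmf.items()) - mean[0] ** 2
cov_01 = sum(x[0] * x[1] * q for x, q in pmf.items()) - mean[0] * mean[1]

print(total)                   # 1
print([str(m) for m in mean])  # ['2', '4/3', '2/3']
print(var_0)                   # 1
print(cov_01)                  # -2/3
```

The negative covariance reflects the constraint that the counts sum to n: a larger count for one outcome leaves fewer trials for the others.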

	Test 2 positive	Test 2 negative	Row total
Test 1 positive	$f_{11}$	$f_{10}$	$f_{1*}=f_{11}+f_{10}$
Test 1 negative	$f_{01}$	$f_{00}$	$f_{0*}=f_{01}+f_{00}$
Column total	$f_{*1}=f_{11}+f_{01}$	$f_{*0}=f_{10}+f_{00}$	$n$

Definitions

Probability mass function

Example

Properties

Normalization

Expected value and variance

Matrix notation

Visualization

As slices of generalized Pascal's triangle

As polynomial coefficients

Large deviation theory

Asymptotics

Concentration at large n

Conditional concentration at large n

Related distributions

Statistical inference

Equivalence tests for multinomial distributions

Confidence intervals for the difference of two proportions

Occurrence and applications

Confidence intervals for the difference in matched-pairs binary data (using multinomial with k = 4)

Computational methods

Random variate generation

Sampling using repeated conditional binomial samples

Algorithm: Sequential conditional binomial sampling

Software implementations

See also

References

Further reading