Movatterモバイル変換

Negative binomial distribution

From Wikipedia, the free encyclopedia

Probability distribution

Different texts (and even different parts of this article) adopt slightly different definitions for the negative binomial distribution. They can be distinguished by whether the support starts atk = 0 or atk = r, whetherp denotes the probability of a success or of a failure, and whetherr represents success or failure,^[1] so identifying the specific parametrization used is crucial in any given text.
Probability mass function The orange line represents the mean, which is equal to 10 in each of these plots; the green line shows the standard deviation.
Notation	$\mathrm {NB} (r,\,p)$
Parameters	r > 0 — number of successes until the experiment is stopped (integer, but the definition can also be extended toreals) p ∈ [0,1] — success probability in each experiment (real)
Support	k ∈ { 0, 1, 2, 3, … } — number of failures
PMF	$k\mapsto {k+r-1 \choose k}\cdot (1-p)^{k}p^{r},$ involving abinomial coefficient
CDF	$k\mapsto I_{p}(r,\,k+1),$ theregularized incomplete beta function
Mean	${\frac {r(1-p)}{p}}$
Mode	${\begin{cases}\left\lfloor {\frac {(r-1)(1-p)}{p}}\right\rfloor &{\text{if }}r>1\\0&{\text{if }}r\leq 1\end{cases}}$
Variance	${\frac {r(1-p)}{p^{2}}}$
Skewness	${\frac {2-p}{\sqrt {(1-p)r}}}$
Excess kurtosis	${\frac {6}{r}}+{\frac {p^{2}}{(1-p)r}}$
MGF	${\biggl (}{\frac {p}{1-(1-p)e^{t}}}{\biggr )}^{\!r}{\text{ for }}t<-\log(1-p)$
CF	${\biggl (}{\frac {p}{1-(1-p)e^{i\,t}}}{\biggr )}^{\!r}{\text{ with }}t\in \mathbb {R}$
PGF	${\biggl (}{\frac {p}{1-(1-p)z}}{\biggr )}^{\!r}{\text{ for }}\|z\|<{\frac {1}{p}}$
Fisher information	${\frac {r}{p^{2}(1-p)}}$
Method of moments	$r={\frac {E[X]^{2}}{V[X]-E[X]}}$ $p={\frac {E[X]}{V[X]}}$

Inprobability theory andstatistics, thenegative binomial distribution, also called aPascal distribution,^[2] is adiscrete probability distribution that models the number of failures in a sequence of independent and identically distributedBernoulli trials before a specified/constant/fixed number of successes $r {\displaystyle r}$ occur.^[3] For example, we can define rolling a 6 on some dice as a success, and rolling any other number as a failure, and ask how many failure rolls will occur before we see the third success ( $r=3$ ). In such a case, the probability distribution of the number of failures that appear will be a negative binomial distribution.

An alternative formulation is to model the number of total trials (instead of the number of failures). In fact, for a specified (non-random) number of successes(r), the number of failures(n −r) is random because the number of total trials(n) is random. For example, we could use the negative binomial distribution to model the number of daysn (random) a certain machine works (specified byr) before it breaks down.

The negative binomial distribution has a variance $\mu /p$ , with the distribution becoming identical to Poisson in the limit $p\to 1$ for a given mean $\mu$ (i.e. when the failures are increasingly rare). Here $p\in [0,1]$ is the success probability of each Bernoulli trial. This can make the distribution a usefuloverdispersed alternative to the Poisson distribution, for example for arobust modification ofPoisson regression. In epidemiology, it has been used to model disease transmission for infectious diseases where the likely number of onward infections may vary considerably from individual to individual and from setting to setting.^[4] More generally, it may be appropriate where events have positively correlated occurrences causing a largervariance than if the occurrences were independent, due to a positivecovariance term.

The term "negative binomial" is likely due to the fact that a certainbinomial coefficient that appears in the formula for theprobability mass function of the distribution can be written more simply with negative numbers.^[5]

	X is counting...	Probability mass function	Formula	Alternate formula (using equivalent binomial)	Alternate formula (simplified using: ${\textstyle n=k+r}$ )	Support
1	k failures, givenr successes	${\textstyle f(k;r,p)\equiv \Pr(X=k)=}$	${\textstyle {\binom {k+r-1}{k}}p^{r}(1-p)^{k}}$ ^[8]^[6]^[9]	${\textstyle {\binom {k+r-1}{r-1}}p^{r}(1-p)^{k}}$ ^[3]^[10]^[11]^[12]	${\textstyle {\binom {n-1}{k}}p^{r}(1-p)^{k}}$	${\text{for }}k=0,1,2,\ldots$
2	n trials, givenr successes	${\textstyle f(n;r,p)\equiv \Pr(X=n)=}$	${\textstyle {\binom {n-1}{r-1}}p^{r}(1-p)^{n-r}}$ ^[6]^[12]^[13]^[14]^[15]	${\textstyle {\binom {n-1}{n-r}}p^{r}(1-p)^{n-r}}$	${\textstyle {\binom {n-1}{k}}p^{r}(1-p)^{k}}$	${\text{for }}n=r,r+1,r+2,\dotsc$
3	n trials, givenr failures	${\textstyle f(n;r,p)\equiv \Pr(X=n)=}$	${\textstyle {\binom {n-1}{r-1}}p^{n-r}(1-p)^{r}}$	${\textstyle {\binom {n-1}{n-r}}p^{n-r}(1-p)^{r}}$	${\textstyle {\binom {n-1}{k}}p^{k}(1-p)^{r}}$	${\text{for }}n=r,r+1,r+2,\dotsc$
4	k successes, givenr failures	${\textstyle f(k;r,p)\equiv \Pr(X=k)=}$	${\textstyle {\binom {k+r-1}{k}}p^{k}(1-p)^{r}}$	${\textstyle {\binom {k+r-1}{r-1}}p^{k}(1-p)^{r}}$	${\textstyle {\binom {n-1}{k}}p^{k}(1-p)^{r}}$	${\text{for }}k=0,1,2,\ldots$
-	k successes, givenn trials	${\textstyle f(k;n,p)\equiv \Pr(X=k)=}$	This is thebinomial distribution not the negative binomial: ${\textstyle {\binom {n}{k}}p^{k}(1-p)^{n-k}={\binom {n}{n-k}}p^{k}(1-p)^{n-k}={\binom {n}{k}}p^{k}(1-p)^{r}}$			${\text{for }}k=0,1,2,\dotsc ,n$

	With replacements	No replacements
Given number of draws	binomial distribution	hypergeometric distribution
Given number of failures	negative binomial distribution	negative hypergeometric distribution

Movatterモバイル変換

Definitions

Probability mass function

Cumulative distribution function

Alternative formulations

Alternative parameterizations

Examples

Length of hospital stay

Selling candy

Properties

Expectation

Expectation of successes

Variance

Relation to the binomial theorem

Recurrence relations

Related distributions

Poisson distribution

Gamma–Poisson mixture

Distribution of a sum of geometrically distributed random variables

Representation as compound Poisson distribution

(a,b,0) class of distributions

Statistical inference

Parameter estimation

MVUE forp

Maximum likelihood estimation

Occurrence and applications

Waiting time in a Bernoulli process

Overdispersed Poisson

Multiplicity observations (physics)

History

See also

References