Movatterモバイル変換

[0]ホーム

Jump to content

Pearson distribution

Edit links

From Wikipedia, the free encyclopedia

Family of continuous probability distributions

Diagram of the Pearson system, showing distributions of types I, III, VI, V, and IV in terms of β₁ (squared skewness) and β₂ (traditional kurtosis)

ThePearson distribution is a family ofcontinuous probability distributions. It was first published byKarl Pearson in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles onbiostatistics.

History

[edit]

The Pearson system was originally devised in an effort to model visiblyskewed observations. It was well known at the time how to adjust a theoretical model to fit the first twocumulants ormoments of observed data: Anyprobability distribution can be extended straightforwardly to form alocation-scale family. Except inpathological cases, a location-scale family can be made to fit the observedmean (first cumulant) andvariance (second cumulant) arbitrarily well. However, it was not known how to construct probability distributions in which theskewness (standardized third cumulant) andkurtosis (standardized fourth cumulant) could be adjusted equally freely. This need became apparent when trying to fit known theoretical models to observed data that exhibited skewness. Pearson's examples include survival data, which are usually asymmetric.

In his original paper, Pearson (1895, p. 360) identified four types of distributions (numbered I through IV) in addition to thenormal distribution (which was originally known as type V). The classification depended on whether the distributions weresupported on a bounded interval, on a half-line, or on the wholereal line; and whether they were potentially skewed or necessarily symmetric. A second paper (Pearson 1901) fixed two omissions: it redefined the type V distribution (originally just thenormal distribution, but now theinverse-gamma distribution) and introduced the type VI distribution. Together the first two papers cover the five main types of the Pearson system (I, III, IV, V, and VI). In a third paper, Pearson (1916) introduced further special cases and subtypes (VII through XII).

Rhind (1909, pp. 430–432) devised a simple way of visualizing the parameter space of the Pearson system, which was subsequently adopted by Pearson (1916, plate 1 and pp. 430ff., 448ff.). The Pearson types are characterized by two quantities, commonly referred to as β₁ and β₂. The first is the square of theskewness: β₁ = γ₁² where γ₁ is the skewness, or thirdstandardized moment. The second is the traditionalkurtosis, or fourth standardized moment: β₂ = γ₂ + 3. (Modern treatments define kurtosis γ₂ in terms of cumulants instead of moments, so that for a normal distribution we have γ₂ = 0 and β₂ = 3. Here we follow the historical precedent and use β₂.) The diagram shows which Pearson type a given concrete distribution (identified by a point (β₁, β₂)) belongs to.

Many of the skewed or non-mesokurtic distributions familiar to statisticians today were still unknown in the early 1890s. What is now known as thebeta distribution had been used byThomas Bayes as aposterior distribution of the parameter of aBernoulli distribution in his 1763 work oninverse probability. The beta distribution gained prominence due to its membership in Pearson's system and was known until the 1940s as the Pearson type I distribution.^[1] (Pearson's type II distribution is a special case of type I, but is usually no longer singled out.) Thegamma distribution originated from Pearson's work (Pearson 1893, p. 331; Pearson 1895, pp. 357, 360, 373–376) and was known as the Pearson type III distribution, before acquiring its modern name in the 1930s and 1940s.^[2] Pearson's 1895 paper introduced the type IV distribution, which containsStudent'st-distribution as a special case, predatingWilliam Sealy Gosset's subsequent use by several years. His 1901 paper introduced theinverse-gamma distribution (type V) and thebeta prime distribution (type VI).

Definition

[edit]

A Pearsondensityp is defined to be any valid solution to thedifferential equation (cf. Pearson 1895, p. 381)

{\frac {p'(x)}{p(x)}}+{\frac {a+(x-\mu )}{b_{0}+b_{1}(x-\mu )+b_{2}(x-\mu )^{2}}}=0.\qquad (1)

with:

{\begin{aligned}b_{0}&={\frac {4\beta _{2}-3\beta _{1}}{10\beta _{2}-12\beta _{1}-18}}\mu _{2},\\[5pt]a=b_{1}&={\sqrt {\mu _{2}}}{\sqrt {\beta _{1}}}{\frac {\beta _{2}+3}{10\beta _{2}-12\beta _{1}-18}},\\[5pt]b_{2}&={\frac {2\beta _{2}-3\beta _{1}-6}{10\beta _{2}-12\beta _{1}-18}}.\end{aligned}}

According to Ord,^[3] Pearson devised the underlying form of Equation (1) on the basis of, firstly, the formula for the derivative of the logarithm of the density function of thenormal distribution (which gives a linear function) and, secondly, from a recurrence relation for values in theprobability mass function of thehypergeometric distribution (which yields the linear-divided-by-quadratic structure).

In Equation (1), the parametera determines astationary point, and hence under some conditions amode of the distribution, since

p'(\mu -a)=0

follows directly from the differential equation.

Since we are confronted with afirst-order linear differential equation with variable coefficients, its solution is straightforward:

p(x)\propto \exp \left(-\int {\frac {x+a}{b_{2}x^{2}+b_{1}x+b_{0}}}\,dx\right).

The integral in this solution simplifies considerably when certain special cases of the integrand are considered. Pearson (1895, p. 367) distinguished two main cases, determined by the sign of thediscriminant (and hence the number of realroots) of thequadratic function

f(x)=b_{2}x^{2}+b_{1}x+b_{0}.\qquad (2)

Particular types of distribution

[edit]

Case 1, negative discriminant

[edit]

The Pearson type IV distribution

[edit]

If the discriminant of the quadratic function (2) is negative ( $b_{1}^{2}-4b_{2}b_{0}<0$ ), it has no real roots. Then define

{\begin{aligned}y&=x+{\frac {b_{1}}{2b_{2}}},\\[5pt]\alpha &={\frac {\sqrt {4b_{2}b_{0}-b_{1}^{2}}}{2b_{2}}}.\end{aligned}}

Observe thatα is a well-defined real number andα ≠ 0, because by assumption $4b_{2}b_{0}-b_{1}^{2}>0$ and thereforeb₂ ≠ 0. Applying these substitutions, the quadratic function (2) is transformed into

f(x)=b_{2}(y^{2}+\alpha ^{2}).

The absence of real roots is obvious from this formulation, because α² is necessarily positive.

We now express the solution to the differential equation (1) as a function ofy:

p(y)\propto \exp \left(-{\frac {1}{b_{2}}}\int {\frac {y-{\frac {b_{1}}{2b_{2}}}+a}{y^{2}+\alpha ^{2}}}\,dy\right).

Pearson (1895, p. 362) called this the "trigonometrical case", because the integral

\int {\frac {y-{\frac {2b_{2}a-b_{1}}{2b_{2}}}}{y^{2}+\alpha ^{2}}}\,dy={\frac {1}{2}}\ln(y^{2}+\alpha ^{2})-{\frac {2b_{2}a-b_{1}}{2b_{2}\alpha }}\arctan \left({\frac {y}{\alpha }}\right)+C_{0}

involves theinverse trigonometric arctan function. Then

p(y)\propto \exp \left[-{\frac {1}{2b_{2}}}\ln \left(1+{\frac {y^{2}}{\alpha ^{2}}}\right)-{\frac {\ln \alpha }{b_{2}}}+{\frac {2b_{2}a-b_{1}}{2b_{2}^{2}\alpha }}\arctan \left({\frac {y}{\alpha }}\right)+C_{1}\right].

Finally, let

{\begin{aligned}m&={\frac {1}{2b_{2}}},\\[5pt]\nu &=-{\frac {2b_{2}a-b_{1}}{2b_{2}^{2}\alpha }}.\end{aligned}}

Applying these substitutions, we obtain the parametric function:

p(y)\propto \left[1+{\frac {y^{2}}{\alpha ^{2}}}\right]^{-m}\exp \left[-\nu \arctan \left({\frac {y}{\alpha }}\right)\right].

This unnormalized density hassupport on the entirereal line. It depends on ascale parameter α > 0 andshape parametersm > 1/2 and ν. One parameter was lost when we chose to find the solution to the differential equation (1) as a function ofy rather thanx. We therefore reintroduce a fourth parameter, namely thelocation parameterλ. We have thus derived the density of thePearson type IV distribution:

p(x)={\frac {\left|{\frac {\operatorname {\Gamma } \left(m+{\frac {\nu }{2}}i\right)}{\Gamma (m)}}\right|^{2}}{\alpha \operatorname {\mathrm {B} } \left(m-{\frac {1}{2}},{\frac {1}{2}}\right)}}\left[1+\left({\frac {x-\lambda }{\alpha }}\right)^{2}\right]^{-m}\exp \left[-\nu \arctan \left({\frac {x-\lambda }{\alpha }}\right)\right].

Thenormalizing constant involves thecomplex Gamma function (Γ) and theBeta function (B).Notice that thelocation parameterλ here is not the same as the original location parameter introduced in the general formulation, but is related via

\lambda =\lambda _{original}+{\frac {\alpha \nu }{2(m-1)}}.

The Pearson type VII distribution

[edit]

Plot of Pearson type VII densities withλ = 0,σ = 1, and:γ₂ = ∞ (red);γ₂ = 4 (blue); andγ₂ = 0 (black)

The shape parameterν of the Pearson type IV distribution controls itsskewness. If we fix its value at zero, we obtain a symmetric three-parameter family. This special case is known as thePearson type VII distribution (cf. Pearson 1916, p. 450). Its density is

p(x)={\frac {1}{\alpha \operatorname {\mathrm {B} } \left(m-{\frac {1}{2}},{\frac {1}{2}}\right)}}\left[1+\left({\frac {x-\lambda }{\alpha }}\right)^{2}\right]^{-m},

where B is theBeta function.

An alternative parameterization (and slight specialization) of the type VII distribution is obtained by letting

\alpha =\sigma {\sqrt {2m-3}},

which requiresm > 3/2. This entails a minor loss of generality but ensures that thevariance of the distribution exists and is equal to σ². Now the parameterm only controls thekurtosis of the distribution. Ifm approaches infinity asλ andσ are held constant, thenormal distribution arises as a special case:

{\begin{aligned}&\lim _{m\to \infty }{\frac {1}{\sigma {\sqrt {2m-3}}\,\operatorname {\mathrm {B} } \left(m-{\frac {1}{2}},{\frac {1}{2}}\right)}}\left[1+\left({\frac {x-\lambda }{\sigma {\sqrt {2m-3}}}}\right)^{2}\right]^{-m}\\[5pt]={}&{\frac {1}{\sigma {\sqrt {2}}\,\operatorname {\Gamma } \left({\frac {1}{2}}\right)}}\cdot \lim _{m\to \infty }{\frac {\Gamma (m)}{\operatorname {\Gamma } \left(m-{\frac {1}{2}}\right){\sqrt {m-{\frac {3}{2}}}}}}\cdot \lim _{m\to \infty }\left[1+{\frac {\left({\frac {x-\lambda }{\sigma }}\right)^{2}}{2m-3}}\right]^{-m}\\[5pt]={}&{\frac {1}{\sigma {\sqrt {2\pi }}}}\cdot 1\cdot \exp \left[-{\frac {1}{2}}\left({\frac {x-\lambda }{\sigma }}\right)^{2}\right].\end{aligned}}

This is the density of a normal distribution with meanλ and standard deviationσ.

It is convenient to require thatm > 5/2 and to let

m={\frac {5}{2}}+{\frac {3}{\gamma _{2}}}.

This is another specialization, and it guarantees that the first four moments of the distribution exist. More specifically, the Pearson type VII distribution parameterized in terms of (λ, σ, γ₂) has a mean ofλ,standard deviation ofσ,skewness of zero, and positiveexcess kurtosis of γ₂.

Student'st-distribution

[edit]

The Pearson type VII distribution is equivalent to the non-standardizedStudent'st-distribution with parameters ν > 0, μ, σ² by applying the following substitutions to its original parameterization:

{\begin{aligned}\lambda &=\mu ,\\[5pt]\alpha &={\sqrt {\nu \sigma ^{2}}},\\[5pt]m&={\frac {\nu +1}{2}},\end{aligned}}

Observe that the constraintm > 1/2 is satisfied.

The resulting density is

p(x\mid \mu ,\sigma ^{2},\nu )={\frac {1}{{\sqrt {\nu \sigma ^{2}}}\,\operatorname {\mathrm {B} } \left({\frac {\nu }{2}},{\frac {1}{2}}\right)}}\left(1+{\frac {1}{\nu }}{\frac {(x-\mu )^{2}}{\sigma ^{2}}}\right)^{-{\frac {\nu +1}{2}}},

which is easily recognized as the density of a Student'st-distribution.

This implies that the Pearson type VII distribution subsumes the standardStudent'st-distribution and also the standardCauchy distribution. In particular, the standard Student'st-distribution arises as a subcase, whenμ = 0 andσ² = 1, equivalent to the following substitutions:

{\begin{aligned}\lambda &=0,\\[5pt]\alpha &={\sqrt {\nu }},\\[5pt]m&={\frac {\nu +1}{2}},\end{aligned}}

The density of this restricted one-parameter family is a standard Student'st:

p(x)={\frac {1}{{\sqrt {\nu }}\,\operatorname {\mathrm {B} } \left({\frac {\nu }{2}},{\frac {1}{2}}\right)}}\left(1+{\frac {x^{2}}{\nu }}\right)^{-{\frac {\nu +1}{2}}},

Case 2, non-negative discriminant

[edit]

If the quadratic function (2) has a non-negative discriminant ( $b_{1}^{2}-4b_{2}b_{0}\geq 0$ ), it has real rootsa₁ anda₂ (not necessarily distinct):

{\begin{aligned}a_{1}&={\frac {-b_{1}-{\sqrt {b_{1}^{2}-4b_{2}b_{0}}}}{2b_{2}}},\\[5pt]a_{2}&={\frac {-b_{1}+{\sqrt {b_{1}^{2}-4b_{2}b_{0}}}}{2b_{2}}}.\end{aligned}}

In the presence of real roots the quadratic function (2) can be written as

f(x)=b_{2}(x-a_{1})(x-a_{2}),

and the solution to the differential equation is therefore

p(x)\propto \exp \left(-{\frac {1}{b_{2}}}\int {\frac {x-a}{(x-a_{1})(x-a_{2})}}\,dx\right).

Pearson (1895, p. 362) called this the "logarithmic case", because the integral

\int {\frac {x-a}{(x-a_{1})(x-a_{2})}}\,dx={\frac {(a_{1}-a)\ln(x-a_{1})-(a_{2}-a)\ln(x-a_{2})}{a_{1}-a_{2}}}+C

involves only thelogarithm function and not the arctan function as in the previous case.

Using the substitution

\nu ={\frac {1}{b_{2}(a_{1}-a_{2})}},

we obtain the following solution to the differential equation (1):

p(x)\propto (x-a_{1})^{-\nu (a_{1}-a)}(x-a_{2})^{\nu (a_{2}-a)}.

Since this density is only known up to a hidden constant of proportionality, that constant can be changed and the density written as follows:

p(x)\propto \left(1-{\frac {x}{a_{1}}}\right)^{-\nu (a_{1}-a)}\left(1-{\frac {x}{a_{2}}}\right)^{\nu (a_{2}-a)}.

The Pearson type I distribution

[edit]

ThePearson type I distribution (a generalization of thebeta distribution to more general finite region of support) arises when the roots of the quadratic equation (2) are of opposite sign, that is, $a_{1}<0<a_{2}$ . Then the solutionp is supported on the interval $(a_{1},a_{2})$ . Apply the substitution

x=a_{1}+y(a_{2}-a_{1}),

where $0<y<1$ , which yields a solution in terms ofy that is supported on the interval (0, 1):

p(y)\propto \left({\frac {a_{1}-a_{2}}{a_{1}}}y\right)^{(-a_{1}+a)\nu }\left({\frac {a_{2}-a_{1}}{a_{2}}}(1-y)\right)^{(a_{2}-a)\nu }.

One may define:

{\begin{aligned}m_{1}&={\frac {a-a_{1}}{b_{2}(a_{1}-a_{2})}},\\[5pt]m_{2}&={\frac {a-a_{2}}{b_{2}(a_{2}-a_{1})}}.\end{aligned}}

Regrouping constants and parameters, this simplifies to

p(y)\propto y^{m_{1}}(1-y)^{m_{2}},

Thus ${\frac {x-\lambda -a_{1}}{a_{2}-a_{1}}}$ follows abeta distribution $\mathrm {B} (m_{1}+1,m_{2}+1)$ with $\lambda =\mu _{1}-(a_{2}-a_{1}){\frac {m_{1}+1}{m_{1}+m_{2}+2}}-a_{1}$ . It turns out thatm₁,m₂ > −1 is necessary and sufficient forp to be a proper probability density function.

The Pearson type II distribution

[edit]

ThePearson type II distribution is a special case of the Pearson type I family restricted to symmetric distributions. Using formulae from the type I section, with $m_{1}=m_{2}=m$ and $-a_{1}=a_{2}=a$ , on the interval (−a, a) it can be written as:

p(x)\propto \left(1-{\frac {x^{2}}{a^{2}}}\right)^{m}.

Or with

x=-a+2ya,

$y {\displaystyle y}$ is distributed according to thebeta distribution on the interval (0, 1),

p(y)\propto \left(1-4\left(y-{\frac {1}{2}}\right)^{2}\right)^{m}\propto y^{m}(1-y)^{m}.

with appropriate constant of proportionality the PDF becomes

p(y)=y^{m}(1-y)^{m}{\frac {\Gamma (2m+2)}{\Gamma (m+1)^{2}}}.

The Pearson type III distribution

[edit]

Defining

\lambda =\mu _{1}+{\frac {b_{0}}{b_{1}}}-(m+1)b_{1},

$b_{0}+b_{1}(x-\lambda )$ is $\operatorname {Gamma} (m+1,b_{1}^{2})$ . The Pearson type III distribution is agamma distribution orchi-squared distribution.

The Pearson type V distribution

[edit]

Defining new parameters:

{\begin{aligned}C_{1}&={\frac {b_{1}}{2b_{2}}},\\\lambda &=\mu _{1}-{\frac {a-C_{1}}{1-2b_{2}}},\end{aligned}}

$x-\lambda$ follows an $\operatorname {InverseGamma} ({\frac {1}{b_{2}}}-1,{\frac {a-C_{1}}{b_{2}}})$ . The Pearson type V distribution is aninverse-gamma distribution.

The Pearson type VI distribution

[edit]

Defining

\lambda =\mu _{1}+(a_{2}-a_{1}){\frac {m_{2}+1}{m_{2}+m_{1}+2}}-a_{2},

${\frac {x-\lambda -a_{2}}{a_{2}-a_{1}}}$ follows a $\beta ^{\prime }(m_{2}+1,-m_{2}-m_{1}-1)$ . The Pearson type VI distribution is abeta prime distribution orF-distribution.

Relation to other distributions

[edit]

The Pearson family subsumes the following distributions, among others:

Bernoulli distribution (type B, limit of type I)
Beta distribution (type I and its symmetric subtype II)
Beta prime distribution (type VI)
Cauchy distribution (subtype of type VII)
Chi-squared distribution (subtype of type III)
Continuous uniform distribution (type R, subtype of types II, VIII, IX, and XII)
Exponential distribution (type X/E, subtype of type III, limit of types IX and XI)
Gamma distribution (type III, limit of types I and VI)
Generalized Pareto distribution (types VIII, IX, X, and XI)
F-distribution (subtype of type VI)
Inverse-chi-squared distribution (subtype of type V)
Inverse-gamma distribution (type V, limit of types IV and VI)
Normal distribution (type G, limit of types I/II, III, IV/VII, V, and VI)
Pareto distribution (type XI, subtype of type VI)
Power function distribution (types VIII and IX, subtypes of type I)
Student'st-distribution (type VII, symmetric subtype of type IV)

As of 2025, the only types without a name are types IV (seeabove) and XII (beta distribution with $\alpha +\beta =2$ ).

Alternatives to the Pearson system of distributions for the purpose of fitting distributions to data are thequantile-parameterized distributions (QPDs) and themetalog distributions. QPDs and metalogs can provide greater shape and bounds flexibility than the Pearson system. Instead of fitting moments, QPDs are typically fit toempirical CDF or other data withlinear least squares.

Examples of modern alternatives to the Pearson skewness-vs-kurtosis diagram are: (i)https://github.com/SchildCode/PearsonPlot and (ii) the "Cullen and Frey graph" in the statistical application R.

Applications

[edit]

These models are used in financial markets, given their ability to be parametrized in a way that has intuitive meaning for market traders. A number of models are in current use that capture the stochastic nature of the volatility of rates, stocks, etc.,^[which?]^{[citation needed]} and this family of distributions may prove to be one of the more important.

In the United States, thelog-gamma distribution (historically named Log-Pearson III) is the default distribution for flood frequency analysis.^[4]

Recently, there have been alternatives developed to the Pearson distributions that are more flexible and easier to fit to data. See themetalog distributions.

Notes

[edit]

^Miller, Jeff; et al. (2006-07-09)."Beta distribution".Earliest Known Uses of Some of the Words of Mathematics. Retrieved2006-12-09.
^Miller, Jeff; et al. (2006-12-07)."Gamma distribution".Earliest Known Uses of Some of the Words of Mathematics. Retrieved2006-12-09.
^Ord J.K. (1972) p. 2
^"Guidelines for Determine Flood Flow Frequency"(PDF).USGS Water. March 1982. Retrieved2019-06-14.

Sources

[edit]

Elderton, Sir W.P, Johnson, N.L. (1969)Systems of Frequency Curves. Cambridge University Press.
Ord J.K. (1972)Families of Frequency Distributions. Griffin, London.

Probability distributions (list)

Discrete
univariate

with finite support	Benford Bernoulli Beta-binomial Binomial Categorical Hypergeometric Negative Poisson binomial Rademacher Soliton Discrete uniform Zipf Zipf–Mandelbrot
with infinite support	Beta negative binomial Borel Conway–Maxwell–Poisson Discrete phase-type Delaporte Extended negative binomial Flory–Schulz Gauss–Kuzmin Geometric Logarithmic Mixed Poisson Negative binomial Panjer Parabolic fractal Poisson Skellam Yule–Simon Zeta

Continuous
univariate

supported on a bounded interval	Arcsine ARGUS Balding–Nichols Bates Beta Generalized Beta rectangular Continuous Bernoulli Irwin–Hall Kumaraswamy Logit-normal Noncentral beta PERT Power function Raised cosine Reciprocal Triangular U-quadratic Uniform Wigner semicircle
supported on a semi-infinite interval	Benini Benktander 1st kind Benktander 2nd kind Beta prime Burr Chi Chi-squared Noncentral Inverse Scaled Dagum Davis Erlang Hyper Exponential Hyperexponential Hypoexponential Logarithmic F Noncentral Folded normal Fréchet Gamma Generalized Inverse gamma/Gompertz Gompertz Shifted Half-logistic Half-normal Hotelling'sT-squared Hartman–Watson Inverse Gaussian Generalized Kolmogorov Lévy Log-Cauchy Log-Laplace Log-logistic Log-normal Log-t Lomax Matrix-exponential Maxwell–Boltzmann Maxwell–Jüttner Mittag-Leffler Nakagami Pareto Phase-type Poly-Weibull Rayleigh Relativistic Breit–Wigner Rice Truncated normal type-2 Gumbel Weibull Discrete Wilks's lambda
supported on the whole real line	Cauchy Exponential power Fisher'sz Kaniadakis κ-Gaussian Gaussianq Generalized hyperbolic Generalized logistic (logistic-beta) Generalized normal Geometric stable Gumbel Holtsmark Hyperbolic secant Johnson'sS_U Landau Laplace Asymmetric Logistic Noncentralt Normal (Gaussian) Normal-inverse Gaussian Skew normal Slash Stable Student'st Tracy–Widom Variance-gamma Voigt
with support whose type varies	Generalized chi-squared Generalized extreme value Generalized Pareto Marchenko–Pastur Kaniadakisκ-exponential Kaniadakisκ-Gamma Kaniadakisκ-Weibull Kaniadakisκ-Logistic Kaniadakisκ-Erlang q-exponential q-Gaussian q-Weibull Shifted log-logistic Tukey lambda