Family of multivariate continuous probability distributions
Normal-inverse-gamma

Parameters: $\mu$ location (real); $\lambda > 0$ (real); $\alpha > 0$ (real); $\beta > 0$ (real)

Support: $x \in (-\infty, \infty)$, $\sigma^2 \in (0, \infty)$

PDF: $\dfrac{\sqrt{\lambda}}{\sqrt{2\pi\sigma^2}}\,\dfrac{\beta^\alpha}{\Gamma(\alpha)}\left(\dfrac{1}{\sigma^2}\right)^{\alpha+1}\exp\left(-\dfrac{2\beta+\lambda(x-\mu)^2}{2\sigma^2}\right)$

Mean: $\operatorname{E}[x] = \mu$; $\operatorname{E}[\sigma^2] = \dfrac{\beta}{\alpha-1}$, for $\alpha > 1$

Mode: $x = \mu$ (univariate), $\mathbf{x} = \boldsymbol{\mu}$ (multivariate); $\sigma^2 = \dfrac{\beta}{\alpha+3/2}$ (univariate), $\sigma^2 = \dfrac{\beta}{\alpha+1+k/2}$ (multivariate)

Variance: $\operatorname{Var}[x] = \dfrac{\beta}{(\alpha-1)\lambda}$, for $\alpha > 1$; $\operatorname{Var}[\sigma^2] = \dfrac{\beta^2}{(\alpha-1)^2(\alpha-2)}$, for $\alpha > 2$; $\operatorname{Cov}[x,\sigma^2] = 0$, for $\alpha > 1$
In probability theory and statistics, the normal-inverse-gamma distribution (or Gaussian-inverse-gamma distribution) is a four-parameter family of multivariate continuous probability distributions. It is the conjugate prior of a normal distribution with unknown mean and variance.
Definition

Suppose

$$x \mid \sigma^2, \mu, \lambda \sim \mathrm{N}(\mu, \sigma^2/\lambda)$$

has a normal distribution with mean $\mu$ and variance $\sigma^2/\lambda$, where

$$\sigma^2 \mid \alpha, \beta \sim \Gamma^{-1}(\alpha, \beta)$$

has an inverse-gamma distribution. Then $(x, \sigma^2)$ has a normal-inverse-gamma distribution, denoted as

$$(x, \sigma^2) \sim \text{N-}\Gamma^{-1}(\mu, \lambda, \alpha, \beta).$$

($\text{NIG}$ is also used instead of $\text{N-}\Gamma^{-1}$.)
The normal-inverse-Wishart distribution is a generalization of the normal-inverse-gamma distribution that is defined over multivariate random variables.
Probability density function

$$f(x,\sigma^2 \mid \mu,\lambda,\alpha,\beta) = \frac{\sqrt{\lambda}}{\sigma\sqrt{2\pi}}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{\sigma^2}\right)^{\alpha+1}\exp\left(-\frac{2\beta+\lambda(x-\mu)^2}{2\sigma^2}\right)$$

For the multivariate form where $\mathbf{x}$ is a $k \times 1$ random vector,

$$f(\mathbf{x},\sigma^2 \mid \boldsymbol{\mu},\mathbf{V}^{-1},\alpha,\beta) = |\mathbf{V}|^{-1/2}(2\pi)^{-k/2}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{\sigma^2}\right)^{\alpha+1+k/2}\exp\left(-\frac{2\beta+(\mathbf{x}-\boldsymbol{\mu})^{T}\mathbf{V}^{-1}(\mathbf{x}-\boldsymbol{\mu})}{2\sigma^2}\right),$$

where $|\mathbf{V}|$ is the determinant of the $k \times k$ matrix $\mathbf{V}$. Note how this last equation reduces to the first form if $k = 1$, so that $\mathbf{x}, \mathbf{V}, \boldsymbol{\mu}$ are scalars.
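As a numerical sanity check, the univariate density can be evaluated either from the closed form above or as the product of the normal and inverse-gamma factors from the definition. A minimal Python sketch (the function names are mine, not from the article; SciPy's invgamma matches the $(\alpha, \beta)$ convention here via its a and scale arguments):

```python
import numpy as np
from scipy import stats
from scipy.special import gammaln

def nig_logpdf(x, sigma2, mu, lam, alpha, beta):
    """Closed-form log-density of the normal-inverse-gamma distribution."""
    return (0.5 * np.log(lam) - 0.5 * np.log(2 * np.pi * sigma2)
            + alpha * np.log(beta) - gammaln(alpha)
            - (alpha + 1) * np.log(sigma2)
            - (2 * beta + lam * (x - mu) ** 2) / (2 * sigma2))

def nig_logpdf_factored(x, sigma2, mu, lam, alpha, beta):
    """Same density via the defining factorization:
    N(x | mu, sigma2/lam) * InvGamma(sigma2 | alpha, beta)."""
    return (stats.norm.logpdf(x, loc=mu, scale=np.sqrt(sigma2 / lam))
            + stats.invgamma.logpdf(sigma2, a=alpha, scale=beta))

print(nig_logpdf(0.3, 1.2, 0.0, 2.0, 3.0, 1.5))
print(nig_logpdf_factored(0.3, 1.2, 0.0, 2.0, 3.0, 1.5))  # the two should agree
```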
Alternative parameterization

It is also possible to let $\gamma = 1/\lambda$, in which case the PDF becomes

$$f(x,\sigma^2 \mid \mu,\gamma,\alpha,\beta) = \frac{1}{\sigma\sqrt{2\pi\gamma}}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{\sigma^2}\right)^{\alpha+1}\exp\left(-\frac{2\gamma\beta+(x-\mu)^2}{2\gamma\sigma^2}\right)$$

In the multivariate form, the corresponding change would be to regard the covariance matrix $\mathbf{V}$ instead of its inverse $\mathbf{V}^{-1}$ as a parameter.
Cumulative distribution function

$$F(x,\sigma^2 \mid \mu,\lambda,\alpha,\beta) = \frac{e^{-\frac{\beta}{\sigma^2}}\left(\frac{\beta}{\sigma^2}\right)^{\alpha}\left(\operatorname{erf}\left(\frac{\sqrt{\lambda}(x-\mu)}{\sqrt{2}\,\sigma}\right)+1\right)}{2\sigma^2\,\Gamma(\alpha)}$$

Marginal distributions

Given $(x,\sigma^2) \sim \text{N-}\Gamma^{-1}(\mu,\lambda,\alpha,\beta)$ as above, $\sigma^2$ by itself follows an inverse-gamma distribution:
$$\sigma^2 \sim \Gamma^{-1}(\alpha,\beta),$$

while $\sqrt{\frac{\alpha\lambda}{\beta}}\,(x-\mu)$ follows a t distribution with $2\alpha$ degrees of freedom.[1]
Proof for $\lambda = 1$

For $\lambda = 1$ the probability density function is

$$f(x,\sigma^2 \mid \mu,\alpha,\beta) = \frac{1}{\sigma\sqrt{2\pi}}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{\sigma^2}\right)^{\alpha+1}\exp\left(-\frac{2\beta+(x-\mu)^2}{2\sigma^2}\right)$$

The marginal distribution over $x$ is

$$f(x \mid \mu,\alpha,\beta) = \int_0^\infty f(x,\sigma^2 \mid \mu,\alpha,\beta)\,d\sigma^2 = \frac{1}{\sqrt{2\pi}}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\int_0^\infty \left(\frac{1}{\sigma^2}\right)^{\alpha+1/2+1}\exp\left(-\frac{2\beta+(x-\mu)^2}{2\sigma^2}\right)d\sigma^2$$

Except for the normalization factor, the expression under the integral coincides with the inverse-gamma density

$$\Gamma^{-1}(x;a,b) = \frac{b^a}{\Gamma(a)}\,\frac{e^{-b/x}}{x^{a+1}},$$

with $x = \sigma^2$, $a = \alpha + 1/2$, $b = \frac{2\beta+(x-\mu)^2}{2}$.

Since $\int_0^\infty \Gamma^{-1}(x;a,b)\,dx = 1$, it follows that $\int_0^\infty x^{-(a+1)}e^{-b/x}\,dx = \Gamma(a)\,b^{-a}$, and therefore

$$\int_0^\infty \left(\frac{1}{\sigma^2}\right)^{\alpha+1/2+1}\exp\left(-\frac{2\beta+(x-\mu)^2}{2\sigma^2}\right)d\sigma^2 = \Gamma(\alpha+1/2)\left(\frac{2\beta+(x-\mu)^2}{2}\right)^{-(\alpha+1/2)}$$

Substituting this expression and factoring out the dependence on $x$,

$$f(x \mid \mu,\alpha,\beta) \propto_x \left(1+\frac{(x-\mu)^2}{2\beta}\right)^{-(\alpha+1/2)}.$$

The shape of the generalized Student's t-distribution is

$$t(x \mid \nu,\hat{\mu},\hat{\sigma}^2) \propto_x \left(1+\frac{1}{\nu}\,\frac{(x-\hat{\mu})^2}{\hat{\sigma}^2}\right)^{-(\nu+1)/2}.$$

Matching the two shapes, the marginal distribution $f(x \mid \mu,\alpha,\beta)$ therefore follows a t distribution with $2\alpha$ degrees of freedom:

$$f(x \mid \mu,\alpha,\beta) = t(x \mid \nu = 2\alpha,\ \hat{\mu} = \mu,\ \hat{\sigma}^2 = \beta/\alpha).$$
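This marginal is easy to check by simulation: draw $(x, \sigma^2)$ pairs, standardize $x$, and compare against a t distribution with $2\alpha$ degrees of freedom. A minimal Python sketch (parameter values are arbitrary choices of mine):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
mu, lam, alpha, beta = 1.0, 2.0, 3.0, 1.5
n = 200_000

# Draw (x, sigma^2) from the normal-inverse-gamma distribution
sigma2 = stats.invgamma.rvs(a=alpha, scale=beta, size=n, random_state=rng)
x = rng.normal(loc=mu, scale=np.sqrt(sigma2 / lam))

# sqrt(alpha * lam / beta) * (x - mu) should follow a t distribution
# with 2 * alpha degrees of freedom
z = np.sqrt(alpha * lam / beta) * (x - mu)
print(stats.kstest(z, stats.t(df=2 * alpha).cdf))  # small p-values are unexpected
```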
In the multivariate case, the marginal distribution of $\mathbf{x}$ is a multivariate t distribution:

$$\mathbf{x} \sim t_{2\alpha}\!\left(\boldsymbol{\mu},\ \frac{\beta}{\alpha}\mathbf{V}\right)$$

Scaling

Suppose
$$(x,\sigma^2) \sim \text{N-}\Gamma^{-1}(\mu,\lambda,\alpha,\beta).$$

Then for $c > 0$,

$$(cx, c\sigma^2) \sim \text{N-}\Gamma^{-1}(c\mu, \lambda/c, \alpha, c\beta).$$

Proof: Let $(x,\sigma^2) \sim \text{N-}\Gamma^{-1}(\mu,\lambda,\alpha,\beta)$ and fix $c > 0$. Defining $Y = (Y_1, Y_2) = (cx, c\sigma^2)$, observe that the PDF of the random variable $Y$ evaluated at $(y_1, y_2)$ is given by $1/c^2$ times the PDF of a $\text{N-}\Gamma^{-1}(\mu,\lambda,\alpha,\beta)$ random variable evaluated at $(y_1/c, y_2/c)$. Hence the PDF of $Y$ evaluated at $(y_1, y_2)$ is

$$f_Y(y_1,y_2) = \frac{1}{c^2}\,\frac{\sqrt{\lambda}}{\sqrt{2\pi y_2/c}}\,\frac{\beta^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{y_2/c}\right)^{\alpha+1}\exp\left(-\frac{2\beta+\lambda(y_1/c-\mu)^2}{2y_2/c}\right) = \frac{\sqrt{\lambda/c}}{\sqrt{2\pi y_2}}\,\frac{(c\beta)^\alpha}{\Gamma(\alpha)}\,\left(\frac{1}{y_2}\right)^{\alpha+1}\exp\left(-\frac{2c\beta+(\lambda/c)\,(y_1-c\mu)^2}{2y_2}\right).$$

The right-hand expression is the PDF of a $\text{N-}\Gamma^{-1}(c\mu, \lambda/c, \alpha, c\beta)$ random variable evaluated at $(y_1, y_2)$, which completes the proof.
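The scaling property also lends itself to a quick Monte Carlo check: scaled draws from one normal-inverse-gamma distribution should be indistinguishable from direct draws of the rescaled one. A sketch under the same SciPy conventions as above (the helper sample_nig is my own):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
mu, lam, alpha, beta, c, n = 0.5, 2.0, 4.0, 1.5, 3.0, 200_000

def sample_nig(mu, lam, alpha, beta, n, rng):
    """Draw n (x, sigma^2) pairs from N-InvGamma(mu, lam, alpha, beta)."""
    sigma2 = stats.invgamma.rvs(a=alpha, scale=beta, size=n, random_state=rng)
    x = rng.normal(loc=mu, scale=np.sqrt(sigma2 / lam))
    return x, sigma2

# Scaled draws from N-InvGamma(mu, lam, alpha, beta) ...
x, s2 = sample_nig(mu, lam, alpha, beta, n, rng)
# ... versus direct draws from N-InvGamma(c*mu, lam/c, alpha, c*beta)
x_ref, s2_ref = sample_nig(c * mu, lam / c, alpha, c * beta, n, rng)

# Two-sample KS tests on each marginal; small p-values are unexpected
print(stats.ks_2samp(c * x, x_ref).pvalue)
print(stats.ks_2samp(c * s2, s2_ref).pvalue)
```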
Exponential family

Normal-inverse-gamma distributions form an exponential family with natural parameters $\theta_1 = -\frac{\lambda}{2}$, $\theta_2 = \lambda\mu$, $\theta_3 = \alpha$, and $\theta_4 = -\beta - \frac{\lambda\mu^2}{2}$, and sufficient statistics $T_1 = \frac{x^2}{\sigma^2}$, $T_2 = \frac{x}{\sigma^2}$, $T_3 = \log\left(\frac{1}{\sigma^2}\right)$, and $T_4 = \frac{1}{\sigma^2}$.
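To make this explicit, one can expand the log of the density given above; the following arrangement is my own bookkeeping (the leftover $\tfrac{3}{2}\log\tfrac{1}{\sigma^2}$ term, not matched by $\theta_3 T_3$, is absorbed into the base measure):

$$\log f(x,\sigma^2 \mid \mu,\lambda,\alpha,\beta) = \underbrace{-\frac{\lambda}{2}}_{\theta_1}\underbrace{\frac{x^2}{\sigma^2}}_{T_1} + \underbrace{\lambda\mu}_{\theta_2}\underbrace{\frac{x}{\sigma^2}}_{T_2} + \underbrace{\alpha}_{\theta_3}\underbrace{\log\frac{1}{\sigma^2}}_{T_3} + \underbrace{\left(-\beta-\frac{\lambda\mu^2}{2}\right)}_{\theta_4}\underbrace{\frac{1}{\sigma^2}}_{T_4} + \frac{3}{2}\log\frac{1}{\sigma^2} + \frac{1}{2}\log\frac{\lambda}{2\pi} + \alpha\log\beta - \log\Gamma(\alpha).$$

The last three terms depend only on $\sigma^2$ (the base measure) or only on the parameters (the negative log-normalizer), as the exponential-family form requires.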
Information entropy

Kullback–Leibler divergence

The Kullback–Leibler divergence measures the difference between two distributions.
Maximum likelihood estimation
Posterior distribution of the parameters

See the articles on normal-gamma distribution and conjugate prior.
Interpretation of the parameters

See the articles on normal-gamma distribution and conjugate prior.
Generating normal-inverse-gamma random variates

Generation of random variates is straightforward:
1. Sample $\sigma^2$ from an inverse-gamma distribution with parameters $\alpha$ and $\beta$.
2. Sample $x$ from a normal distribution with mean $\mu$ and variance $\sigma^2/\lambda$.

A code sketch of these two steps is given below.
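A minimal Python sketch of the two-step sampler, assuming SciPy's invgamma parameterization (a $= \alpha$, scale $= \beta$); the function name sample_nig is my own:

```python
import numpy as np
from scipy import stats

def sample_nig(mu, lam, alpha, beta, size, rng):
    """Draw (x, sigma^2) pairs from N-InvGamma(mu, lam, alpha, beta)."""
    # Step 1: sigma^2 ~ InvGamma(alpha, beta)
    sigma2 = stats.invgamma.rvs(a=alpha, scale=beta, size=size, random_state=rng)
    # Step 2: x | sigma^2 ~ Normal(mu, sigma^2 / lam)
    x = rng.normal(loc=mu, scale=np.sqrt(sigma2 / lam))
    return x, sigma2

rng = np.random.default_rng(42)
x, sigma2 = sample_nig(mu=0.0, lam=2.0, alpha=3.0, beta=1.5, size=100_000, rng=rng)
# Sample moments should be close to mu and beta / (alpha - 1) respectively
print(x.mean(), sigma2.mean())
```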
Related distributions

The normal-gamma distribution is the same family parameterized in terms of the precision $\tau = 1/\sigma^2$ rather than the variance; the normal-inverse-Wishart distribution is its multivariate generalization.

References

- Denison, David G. T.; et al. (2002). Bayesian Methods for Nonlinear Classification and Regression. Wiley. ISBN 0471490369.
- Koch, Karl-Rudolf (2007). Introduction to Bayesian Statistics (2nd ed.). Springer. ISBN 354072723X.