
Dirichlet process

Family of stochastic processes
Draws from the Dirichlet process $\operatorname{DP}(N(0,1),\alpha)$. The four rows use different $\alpha$ (top to bottom: 1, 10, 100 and 1000) and each row contains three repetitions of the same experiment. As seen from the graphs, draws from a Dirichlet process are discrete distributions and they become less concentrated (more spread out) with increasing $\alpha$. The graphs were generated using the stick-breaking process view of the Dirichlet process.

In probability theory, Dirichlet processes (after the distribution associated with Peter Gustav Lejeune Dirichlet) are a family of stochastic processes whose realizations are probability distributions. In other words, a Dirichlet process is a probability distribution whose range is itself a set of probability distributions. It is often used in Bayesian inference to describe the prior knowledge about the distribution of random variables: how likely it is that the random variables are distributed according to one or another particular distribution.

As an example, a bag of 100 real-world dice is a random probability mass function (random pmf): to sample this random pmf you put your hand in the bag and draw out a die, that is, you draw a pmf. A bag of dice manufactured using a crude process 100 years ago will likely have probabilities that deviate wildly from the uniform pmf, whereas a bag of state-of-the-art dice used by Las Vegas casinos may have barely perceptible imperfections. We can model the randomness of pmfs with the Dirichlet distribution.[1]

The Dirichlet process is specified by a base distribution $H$ and a positive real number $\alpha$ called the concentration parameter (also known as scaling parameter). The base distribution is the expected value of the process, i.e., the Dirichlet process draws distributions "around" the base distribution the way a normal distribution draws real numbers around its mean. However, even if the base distribution is continuous, the distributions drawn from the Dirichlet process are almost surely discrete. The scaling parameter specifies how strong this discretization is: in the limit of $\alpha \rightarrow 0$, the realizations are all concentrated at a single value, while in the limit of $\alpha \rightarrow \infty$ the realizations become continuous. Between the two extremes the realizations are discrete distributions with less and less concentration as $\alpha$ increases.

The Dirichlet process can also be seen as the infinite-dimensional generalization of the Dirichlet distribution. In the same way as the Dirichlet distribution is the conjugate prior for the categorical distribution, the Dirichlet process is the conjugate prior for infinite, nonparametric discrete distributions. A particularly important application of Dirichlet processes is as a prior probability distribution in infinite mixture models.

The Dirichlet process was formally introduced by Thomas S. Ferguson in 1973.[2] It has since been applied in data mining and machine learning, among others for natural language processing, computer vision and bioinformatics.

Introduction


Dirichlet processes are usually used when modelling data that tends to repeat previous values in a so-called "rich get richer" fashion. Specifically, suppose that the generation of values $X_1, X_2, \dots$ can be simulated by the following algorithm.

Input: $H$ (a probability distribution called base distribution), $\alpha$ (a positive real number called scaling parameter)
For $n \geq 1$:

a) With probability $\frac{\alpha}{\alpha + n - 1}$ draw $X_n$ from $H$.

b) With probability $\frac{n_x}{\alpha + n - 1}$ set $X_n = x$, where $n_x$ is the number of previous observations of $x$.
(Formally, $n_x := \bigl|\{ j : X_j = x \text{ and } j < n \}\bigr|$, where $|\cdot|$ denotes the number of elements in the set.)
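
A minimal Python sketch of this sampling rule, assuming for illustration a standard normal base distribution $H = N(0,1)$; the function and parameter names are illustrative rather than standard:

```python
import random

def draw_sequence(n, alpha, base_draw=lambda: random.gauss(0.0, 1.0)):
    """Simulate X_1, ..., X_n with the rule above; base_draw samples from H."""
    draws = []
    for i in range(n):                    # i previous observations exist at step n = i + 1
        if random.random() < alpha / (alpha + i):
            x = base_draw()               # case a): a fresh draw from H
        else:
            x = random.choice(draws)      # case b): repeat a past value; a value seen
                                          # n_x times is chosen with prob. n_x / (alpha + i)
        draws.append(x)
    return draws

print(draw_sequence(20, alpha=1.0))
```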

At the same time, another common model for data is that the observations $X_1, X_2, \dots$ are assumed to be independent and identically distributed (i.i.d.) according to some (random) distribution $P$. The goal of introducing Dirichlet processes is to be able to describe the procedure outlined above in this i.i.d. model.

The observations $X_1, X_2, \dots$ in the algorithm are not independent, since we have to consider the previous results when generating the next value. They are, however, exchangeable. This fact can be shown by calculating the joint probability distribution of the observations and noticing that the resulting formula only depends on which $x$ values occur among the observations and how many repetitions they each have. Because of this exchangeability, de Finetti's representation theorem applies and it implies that the observations $X_1, X_2, \dots$ are conditionally independent given a (latent) distribution $P$. This $P$ is a random variable itself and has a distribution. This distribution (over distributions) is called a Dirichlet process ($\operatorname{DP}$). In summary, this means that we get an equivalent procedure to the above algorithm:

  1. Draw a distribution $P$ from $\operatorname{DP}(H, \alpha)$.
  2. Draw observations $X_1, X_2, \dots$ independently from $P$.

In practice, however, drawing a concrete distribution $P$ is impossible, since its specification requires an infinite amount of information. This is a common phenomenon in the context of Bayesian non-parametric statistics, where a typical task is to learn distributions on function spaces, which involve effectively infinitely many parameters. The key insight is that in many applications the infinite-dimensional distributions appear only as an intermediary computational device and are not required for either the initial specification of prior beliefs or for the statement of the final inference.

Formal definition


Given a measurable set $S$, a base probability distribution $H$ and a positive real number $\alpha$, the Dirichlet process $\operatorname{DP}(H, \alpha)$ is a stochastic process whose sample path (or realization, i.e. an infinite sequence of random variates drawn from the process) is a probability distribution over $S$, such that the following holds. For any measurable finite partition of $S$, denoted $\{B_i\}_{i=1}^{n}$,

if $X \sim \operatorname{DP}(H, \alpha)$,
then $(X(B_1), \dots, X(B_n)) \sim \operatorname{Dir}(\alpha H(B_1), \dots, \alpha H(B_n)),$

where $\operatorname{Dir}$ denotes the Dirichlet distribution and the notation $X \sim D$ means that the random variable $X$ has the distribution $D$. For example, for the two-set partition $\{B, B^{c}\}$ this says that the random mass $X(B)$ assigned to $B$ follows a $\operatorname{Beta}(\alpha H(B), \alpha H(B^{c}))$ distribution.
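
The defining property can be illustrated numerically in a finite setting. The sketch below is a rough check rather than a proof: it takes $S = \{1, 2, 3\}$, samples $(X(\{1\}), X(\{2\}), X(\{3\})) \sim \operatorname{Dir}(\alpha H)$ as the definition requires, and verifies that merging cells of the partition again yields the Dirichlet (here beta) marginals the definition predicts. The NumPy-based code and the chosen values of $H$ and $\alpha$ are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative choices: a base measure H on S = {1, 2, 3} and concentration alpha.
H = np.array([0.2, 0.3, 0.5])
alpha = 4.0

# By the definition, for the partition {1}, {2}, {3} of S:
#   (X({1}), X({2}), X({3})) ~ Dir(alpha * H).
X = rng.dirichlet(alpha * H, size=100_000)

# Merging cells, e.g. B1 = {1} and B2 = {2, 3}, must again give a Dirichlet
# (here beta) law: (X(B1), X(B2)) ~ Dir(alpha * H(B1), alpha * H(B2)).
XB1 = X[:, 0]
print("mean of X(B1):", XB1.mean(), " theory:", H[0])
print("var  of X(B1):", XB1.var(),  " theory:", H[0] * (1 - H[0]) / (alpha + 1))
```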

Alternative views


There are several equivalent views of the Dirichlet process. Besides the formal definition above, the Dirichlet process can be defined implicitly through de Finetti's theorem as described in the first section; this is often called the Chinese restaurant process. A third alternative is the stick-breaking process, which defines the Dirichlet process constructively by writing a distribution sampled from the process as $f(x) = \sum_{k=1}^{\infty} \beta_k \delta_{x_k}(x)$, where $\{x_k\}_{k=1}^{\infty}$ are samples from the base distribution $H$, $\delta_{x_k}$ is an indicator function centered on $x_k$ (zero everywhere except for $\delta_{x_k}(x_k) = 1$) and the $\beta_k$ are defined by a recursive scheme that repeatedly samples from the beta distribution $\operatorname{Beta}(1, \alpha)$.

The Chinese restaurant process

Main article: Chinese restaurant process
Animation of a Chinese restaurant process with scaling parameter $\alpha = 0.5$. Tables are hidden once the customers of a table can no longer be displayed; however, every table has infinitely many seats. (Recording of an interactive animation.[3])

A widely employed metaphor for the Dirichlet process is based on the so-called Chinese restaurant process. The metaphor is as follows:

Imagine a Chinese restaurant into which customers enter. A new customer sits down at a table with a probability proportional to the number of customers already sitting there. Additionally, a customer opens a new table with a probability proportional to the scaling parameter $\alpha$. After infinitely many customers have entered, one obtains a probability distribution over infinitely many tables to be chosen. This probability distribution over the tables is a random sample of the probabilities of observations drawn from a Dirichlet process with scaling parameter $\alpha$.

If one associates draws from the base measure $H$ with every table, the resulting distribution over the sample space $S$ is a random sample of a Dirichlet process. The Chinese restaurant process is related to the Pólya urn sampling scheme which yields samples from finite Dirichlet distributions.

Because customers sit at a table with a probability proportional to the number of customers already sitting at the table, two properties of the DP can be deduced:

  1. The Dirichlet process exhibits a self-reinforcing property: The more often a given value has been sampled in the past, the more likely it is to be sampled again.
  2. Even if $H$ is a distribution over an uncountable set, there is a nonzero probability that two samples will have exactly the same value because the probability mass will concentrate on a small number of tables.
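
A short simulation of the seating process makes this "rich get richer" behaviour visible: a few tables end up holding most of the customers. The Python sketch below is an illustrative rendering of the metaphor, not a canonical implementation.

```python
import random

def chinese_restaurant(n_customers, alpha):
    """Seat n_customers one by one and return the resulting table sizes."""
    tables = []                                  # tables[k] = customers at table k
    for n in range(n_customers):
        if random.random() < alpha / (alpha + n):
            tables.append(1)                     # open a new table: prob. alpha / (alpha + n)
        else:
            # join table k with probability tables[k] / (alpha + n)
            k = random.choices(range(len(tables)), weights=tables)[0]
            tables[k] += 1
    return sorted(tables, reverse=True)

print(chinese_restaurant(1000, alpha=0.5))       # typically a handful of large tables
```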

The stick-breaking process


A third approach to the Dirichlet process is the so-called stick-breaking process view. Conceptually, this involves repeatedly breaking off and discarding a random fraction (sampled from a beta distribution) of a "stick" that is initially of length 1. Remember that draws from a Dirichlet process are distributions over a set $S$. As noted previously, the distribution drawn is discrete with probability 1. In the stick-breaking process view, we explicitly use the discreteness and give the probability mass function of this (random) discrete distribution as:

$f(\theta) = \sum_{k=1}^{\infty} \beta_k \cdot \delta_{\theta_k}(\theta)$

where $\delta_{\theta_k}$ is the indicator function which evaluates to zero everywhere, except for $\delta_{\theta_k}(\theta_k) = 1$. Since this distribution is random itself, its mass function is parameterized by two sets of random variables: the locations $\{\theta_k\}_{k=1}^{\infty}$ and the corresponding probabilities $\{\beta_k\}_{k=1}^{\infty}$. In the following, we present without proof what these random variables are.

The locations $\theta_k$ are independent and identically distributed according to $H$, the base distribution of the Dirichlet process. The probabilities $\beta_k$ are given by a procedure resembling the breaking of a unit-length stick (hence the name):

$\beta_k = \beta'_k \cdot \prod_{i=1}^{k-1} \left(1 - \beta'_i\right)$

where $\beta'_k$ are independent random variables with the beta distribution $\operatorname{Beta}(1, \alpha)$. The resemblance to 'stick-breaking' can be seen by considering $\beta_k$ as the length of a piece of a stick. We start with a unit-length stick and in each step we break off a portion of the remaining stick according to $\beta'_k$ and assign this broken-off piece to $\beta_k$. The formula can be understood by noting that after the first $k-1$ values have their portions assigned, the length of the remainder of the stick is $\prod_{i=1}^{k-1}\left(1 - \beta'_i\right)$, and this piece is broken according to $\beta'_k$ and gets assigned to $\beta_k$.

The smaller $\alpha$ is, the less of the stick will be left for subsequent values (on average), yielding more concentrated distributions.

The stick-breaking process is similar to the construction where one samples sequentially from marginal beta distributions in order to generate a sample from a Dirichlet distribution.[4]
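
As an illustration of the concentration behaviour described above, the following sketch generates truncated stick-breaking weights and counts how many atoms are needed to cover 99% of the mass for a few values of $\alpha$; the truncation level and the NumPy-based style are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def stick_breaking(alpha, trunc=10_000):
    """Truncated stick-breaking weights beta_1, ..., beta_trunc."""
    b = rng.beta(1.0, alpha, size=trunc)                       # beta'_k ~ Beta(1, alpha)
    left = np.concatenate(([1.0], np.cumprod(1.0 - b)[:-1]))   # stick remaining before break k
    return b * left                                            # beta_k = beta'_k * prod_{i<k}(1 - beta'_i)

for alpha in (1.0, 10.0, 100.0):
    w = np.sort(stick_breaking(alpha))[::-1]
    atoms_99 = np.searchsorted(np.cumsum(w), 0.99) + 1
    print(f"alpha = {alpha:6.1f}: {atoms_99} atoms carry 99% of the total mass")
```

A draw from the Dirichlet process is then obtained by attaching these weights to atoms $\theta_k$ sampled independently from $H$.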

The Pólya urn scheme


Yet another way to visualize the Dirichlet process and Chinese restaurant process is as a modified Pólya urn scheme, sometimes called the Blackwell–MacQueen sampling scheme. Imagine that we start with an urn filled with $\alpha$ black balls. Then we proceed as follows:

  1. Each time we need an observation, we draw a ball from the urn.
  2. If the ball is black, we generate a new (non-black) colour uniformly, label a new ball this colour, drop the new ball into the urn along with the ball we drew, and return the colour we generated.
  3. Otherwise, label a new ball with the colour of the ball we drew, drop the new ball into the urn along with the ball we drew, and return the colour we observed.

The resulting distribution over colours is the same as the distribution over tables in the Chinese restaurant process. Furthermore, when we draw a black ball, if rather than generating a new colour, we instead pick a random value from a base distribution $H$ and use that value to label the new ball, the resulting distribution over labels will be the same as the distribution over the values in a Dirichlet process.

Use as a prior distribution


The Dirichlet process can be used as a prior distribution to estimate the probability distribution that generates the data. In this section, we consider the model

$P \sim \operatorname{DP}(H, \alpha),$
$X_1, \ldots, X_n \mid P \;\overset{\text{i.i.d.}}{\sim}\; P.$

The Dirichlet process prior satisfies prior conjugacy, posterior consistency, and a Bernstein–von Mises theorem.[5]

Prior conjugacy


In this model, the posterior distribution is again a Dirichlet process. This means that the Dirichlet process is a conjugate prior for this model. The posterior distribution is given by

$P \mid X_1, \ldots, X_n \sim \operatorname{DP}\left(\frac{\alpha}{\alpha+n} H + \frac{1}{\alpha+n} \sum_{i=1}^{n} \delta_{X_i},\; \alpha + n\right) = \operatorname{DP}\left(\frac{\alpha}{\alpha+n} H + \frac{n}{\alpha+n} \mathbb{P}_n,\; \alpha + n\right)$

where $\mathbb{P}_n$ is defined below.
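
Since the base measure of the posterior Dirichlet process is the mixture $\frac{\alpha}{\alpha+n} H + \frac{n}{\alpha+n}\mathbb{P}_n$, sampling from it amounts to flipping a biased coin between a fresh draw from $H$ and a uniformly chosen past observation; this mixture is also the posterior predictive distribution of the next observation. A minimal sketch, with illustrative function names and an assumed normal base distribution:

```python
import random

def posterior_base_draw(data, alpha, base_draw):
    """One draw from the base measure of the posterior Dirichlet process,
    (alpha * H + sum_i delta_{X_i}) / (alpha + n); equivalently, one draw from
    the posterior predictive distribution of the next observation."""
    n = len(data)
    if random.random() < alpha / (alpha + n):
        return base_draw()          # with probability alpha / (alpha + n): fresh draw from H
    return random.choice(data)      # otherwise: a past observation, each X_i
                                    # chosen with probability 1 / (alpha + n)

data = [0.3, 0.3, 1.7, -0.2]        # toy observations
draws = [posterior_base_draw(data, alpha=1.0,
                             base_draw=lambda: random.gauss(0.0, 1.0))
         for _ in range(5)]
print(draws)
```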

Posterior consistency


If we take the frequentist view of probability, we believe there is a true probability distribution $P_0$ that generated the data. Then it turns out that the Dirichlet process is consistent in the weak topology, which means that for every weak neighbourhood $U$ of $P_0$, the posterior probability of $U$ converges to 1.

Bernstein–von Mises theorem


In order to interpret the credible sets as confidence sets, a Bernstein–von Mises theorem is needed. In the case of the Dirichlet process we compare the posterior distribution with the empirical process $\mathbb{P}_n = \frac{1}{n} \sum_{i=1}^{n} \delta_{X_i}$. Suppose $\mathcal{F}$ is a $P_0$-Donsker class, i.e.

$\sqrt{n}\left(\mathbb{P}_n - P_0\right) \rightsquigarrow G_{P_0}$

for some Brownian bridge $G_{P_0}$. Suppose also that there exists a function $F$ with $F(x) \geq \sup_{f \in \mathcal{F}} f(x)$ and $\int F^2 \, \mathrm{d}H < \infty$. Then, $P_0$-almost surely,

$\sqrt{n}\left(P - \mathbb{P}_n\right) \mid X_1, \ldots, X_n \rightsquigarrow G_{P_0}.$

This implies that the credible sets constructed from the posterior are asymptotic confidence sets, and the Bayesian inference based on the Dirichlet process is asymptotically also valid frequentist inference.

Use in Dirichlet mixture models

Simulation of 1000 observations drawn from a Dirichlet mixture model. Each observation within a cluster is drawn independently from the multivariate normal distribution $N(\mu_k, 1/4)$. The cluster means $\mu_k$ are drawn from a distribution $G$ which itself is drawn from a Dirichlet process with concentration parameter $\alpha = 0.5$ and base distribution $H = N(2, 16)$. Each row is a new simulation.

To understand what Dirichlet processes are and the problem they solve we consider the example of data clustering. It is a common situation that data points are assumed to be distributed in a hierarchical fashion where each data point belongs to a (randomly chosen) cluster and the members of a cluster are further distributed randomly within that cluster.

Example 1


For example, we might be interested in how people will vote on a number of questions in an upcoming election. A reasonable model for this situation might be to classify each voter as a liberal, a conservative or a moderate and then model the event that a voter says "Yes" to any particular question as a Bernoulli random variable with the probability dependent on which political cluster they belong to. By looking at how votes were cast in previous years on similar pieces of legislation one could fit a predictive model using a simple clustering algorithm such as k-means. That algorithm, however, requires knowing in advance the number of clusters that generated the data. In many situations, it is not possible to determine this ahead of time, and even when we can reasonably assume a number of clusters we would still like to be able to check this assumption. For example, in the voting example above the division into liberal, conservative and moderate might not be finely tuned enough; attributes such as religion, class or race could also be critical for modelling voter behaviour, resulting in more clusters in the model.

Example 2


As another example, we might be interested in modelling the velocities of galaxies using a simple model assuming that the velocities are clustered, for instance by assuming each velocity is distributed according to the normal distribution $v_i \sim N(\mu_k, \sigma^2)$, where the $i$th observation belongs to the $k$th cluster of galaxies with common expected velocity. In this case it is far from obvious how to determine a priori how many clusters (of common velocities) there should be and any model for this would be highly suspect and should be checked against the data. By using a Dirichlet process prior for the distribution of cluster means we circumvent the need to explicitly specify ahead of time how many clusters there are, although the concentration parameter still controls it implicitly.

We consider this example in more detail. A first naive model is to presuppose that there are $K$ clusters of normally distributed velocities with common known fixed variance $\sigma^2$. Denoting the event that the $i$th observation is in the $k$th cluster as $z_i = k$ we can write this model as:

$\begin{aligned} (v_i \mid z_i = k, \mu_k) &\sim N(\mu_k, \sigma^2) \\ \operatorname{P}(z_i = k) &= \pi_k \\ (\boldsymbol{\pi} \mid \alpha) &\sim \operatorname{Dir}\left(\tfrac{\alpha}{K} \cdot \mathbf{1}_K\right) \\ \mu_k &\sim H(\lambda) \end{aligned}$

That is, we assume that the data belongs to $K$ distinct clusters with means $\mu_k$ and that $\pi_k$ is the (unknown) prior probability of a data point belonging to the $k$th cluster. We assume that we have no initial information distinguishing the clusters, which is captured by the symmetric prior $\operatorname{Dir}\left(\alpha/K \cdot \mathbf{1}_K\right)$. Here $\operatorname{Dir}$ denotes the Dirichlet distribution and $\mathbf{1}_K$ denotes a vector of length $K$ where each element is 1. We further assign independent and identical prior distributions $H(\lambda)$ to each of the cluster means, where $H$ may be any parametric distribution with parameters denoted as $\lambda$. The hyper-parameters $\alpha$ and $\lambda$ are taken to be known fixed constants, chosen to reflect our prior beliefs about the system. To understand the connection to Dirichlet process priors we rewrite this model in an equivalent but more suggestive form:

$\begin{aligned} (v_i \mid \tilde{\mu}_i) &\sim N(\tilde{\mu}_i, \sigma^2) \\ \tilde{\mu}_i &\sim G = \sum_{k=1}^{K} \pi_k \delta_{\mu_k}(\tilde{\mu}_i) \\ (\boldsymbol{\pi} \mid \alpha) &\sim \operatorname{Dir}\left(\tfrac{\alpha}{K} \cdot \mathbf{1}_K\right) \\ \mu_k &\sim H(\lambda) \end{aligned}$

Instead of imagining that each data point is first assigned a cluster and then drawn from the distribution associated to that cluster we now think of each observation being associated with a parameter $\tilde{\mu}_i$ drawn from some discrete distribution $G$ with support on the $K$ means. That is, we are now treating the $\tilde{\mu}_i$ as being drawn from the random distribution $G$ and our prior information is incorporated into the model by the distribution over distributions $G$.

Animation of the clustering process for one-dimensional data using Gaussian distributions drawn from a Dirichlet process. The histograms of the clusters are shown in different colours. During the parameter estimation process, new clusters are created and grow on the data. The legend shows the cluster colours and the number of datapoints assigned to each cluster.

We would now like to extend this model to work without pre-specifying a fixed number of clusters $K$. Mathematically, this means we would like to select a random prior distribution $G(\tilde{\mu}_i) = \sum_{k=1}^{\infty} \pi_k \delta_{\mu_k}(\tilde{\mu}_i)$ where the values of the cluster means $\mu_k$ are again independently distributed according to $H(\lambda)$ and the distribution over $\pi_k$ is symmetric over the infinite set of clusters. This is exactly what is accomplished by the model:

$\begin{aligned} (v_i \mid \tilde{\mu}_i) &\sim N(\tilde{\mu}_i, \sigma^2) \\ \tilde{\mu}_i &\sim G \\ G &\sim \operatorname{DP}(H(\lambda), \alpha) \end{aligned}$

With this in hand we can better understand the computational merits of the Dirichlet process. Suppose that we wanted to draw $n$ observations from the naive model with exactly $K$ clusters. A simple algorithm for doing this would be to draw $K$ values of $\mu_k$ from $H(\lambda)$, a distribution $\boldsymbol{\pi}$ from $\operatorname{Dir}\left(\alpha/K \cdot \mathbf{1}_K\right)$, and then for each observation independently sample the cluster $k$ with probability $\pi_k$ and the value of the observation according to $N\left(\mu_k, \sigma^2\right)$. It is easy to see that this algorithm does not work in the case where we allow infinitely many clusters, because this would require sampling an infinite-dimensional parameter $\boldsymbol{\pi}$. However, it is still possible to sample observations $v_i$: one can, for example, use the Chinese restaurant representation described above and calculate the probability for used clusters and for a new cluster to be created. This avoids having to explicitly specify $\boldsymbol{\pi}$. Other solutions are based on a truncation of clusters: a (high) upper bound on the true number of clusters is introduced, and cluster numbers higher than this bound are treated as one cluster.
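
A sketch of this sampling strategy: observations are generated one at a time via the Chinese restaurant representation, with each point either joining an existing cluster (with probability proportional to its size) or opening a new cluster whose mean is drawn from $H$, so the infinite vector $\boldsymbol{\pi}$ is never instantiated. The parameter values mirror the simulation figure above ($\alpha = 0.5$, $H = N(2, 16)$, component variance $1/4$), but the code itself is an illustrative assumption.

```python
import random

def sample_dp_mixture(n, alpha, sigma, base_draw):
    """Draw n observations from the Dirichlet process mixture via the
    Chinese restaurant representation; the infinite vector pi is never built."""
    means, counts, obs = [], [], []
    for i in range(n):
        if random.random() < alpha / (alpha + i):
            means.append(base_draw())      # open a new cluster with mean mu_k ~ H(lambda)
            counts.append(0)
            k = len(means) - 1
        else:
            # join an existing cluster k with probability counts[k] / (alpha + i)
            k = random.choices(range(len(means)), weights=counts)[0]
        counts[k] += 1
        obs.append(random.gauss(means[k], sigma))   # v_i ~ N(mu_k, sigma^2)
    return obs, counts

# parameters chosen to mirror the simulation figure above
obs, counts = sample_dp_mixture(1000, alpha=0.5, sigma=0.5,
                                base_draw=lambda: random.gauss(2.0, 4.0))  # H = N(2, 16)
print("cluster sizes:", sorted(counts, reverse=True))
```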

Fitting the model described above based on observed data $D$ means finding the posterior distribution $p\left(\boldsymbol{\pi}, \boldsymbol{\mu} \mid D\right)$ over cluster probabilities and their associated means. In the infinite-dimensional case it is obviously impossible to write down the posterior explicitly. It is, however, possible to draw samples from this posterior using a modified Gibbs sampler.[6] This is the critical fact that makes the Dirichlet process prior useful for inference.
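
A compact sketch of one collapsed Gibbs sweep for this kind of model, assuming for concreteness a conjugate normal base distribution $H = N(\mu_0, \tau_0^2)$ and a known component variance $\sigma^2$, so that the cluster means can be integrated out analytically. This is a generic textbook-style scheme, not necessarily the specific sampler of the cited reference, and the variable names are illustrative.

```python
import math
import random

def log_predictive(v, pts, sigma2, mu0, tau02):
    """log-density of v under the posterior predictive of a cluster containing
    the points pts, with H = N(mu0, tau0^2) and known component variance sigma2."""
    prec = 1.0 / tau02 + len(pts) / sigma2
    mean = (mu0 / tau02 + sum(pts) / sigma2) / prec
    var = 1.0 / prec + sigma2
    return -0.5 * (math.log(2.0 * math.pi * var) + (v - mean) ** 2 / var)

def gibbs_sweep(v, z, alpha, sigma2, mu0, tau02):
    """One collapsed Gibbs sweep over the cluster labels z (means integrated out)."""
    for i, vi in enumerate(v):
        others = [j for j in range(len(v)) if j != i]
        labels = sorted({z[j] for j in others})
        logw = []
        for k in labels:
            pts = [v[j] for j in others if z[j] == k]
            # existing cluster k: weight n_{-i,k} * p(v_i | points already in k)
            logw.append(math.log(len(pts)) + log_predictive(vi, pts, sigma2, mu0, tau02))
        # brand-new cluster: weight alpha * p(v_i | empty cluster)
        logw.append(math.log(alpha) + log_predictive(vi, [], sigma2, mu0, tau02))
        m = max(logw)
        pick = random.choices(range(len(logw)),
                              weights=[math.exp(x - m) for x in logw])[0]
        z[i] = labels[pick] if pick < len(labels) else max(z) + 1
    return z

# toy run: two well-separated groups of "velocities"
v = [random.gauss(0.0, 0.5) for _ in range(30)] + [random.gauss(5.0, 0.5) for _ in range(30)]
z = [0] * len(v)
for _ in range(50):
    z = gibbs_sweep(v, z, alpha=1.0, sigma2=0.25, mu0=0.0, tau02=25.0)
print("number of occupied clusters:", len(set(z)))
```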

Applications of the Dirichlet process


Dirichlet processes are frequently used in Bayesian nonparametric statistics. "Nonparametric" here does not mean a parameter-less model, but rather a model in which representations grow as more data are observed. Bayesian nonparametric models have gained considerable popularity in the field of machine learning because of the above-mentioned flexibility, especially in unsupervised learning. In a Bayesian nonparametric model, the prior and posterior distributions are not parametric distributions, but stochastic processes.[7] The fact that the Dirichlet distribution is a probability distribution on the simplex of vectors of non-negative numbers that sum to one makes it a good candidate to model distributions over distributions or distributions over functions. Additionally, the nonparametric nature of this model makes it an ideal candidate for clustering problems where the distinct number of clusters is unknown beforehand. The Dirichlet process has also been used for developing mixture-of-experts models in the context of supervised learning algorithms (regression or classification settings), for instance mixtures of Gaussian process experts, where the number of required experts must be inferred from the data.[8][9]

As draws from a Dirichlet process are discrete, an important use is as a prior probability in infinite mixture models. In this case, $S$ is the parametric set of component distributions. The generative process is therefore that a sample is drawn from a Dirichlet process, and for each data point, in turn, a value is drawn from this sample distribution and used as the component distribution for that data point. The fact that there is no limit to the number of distinct components which may be generated makes this kind of model appropriate for the case when the number of mixture components is not well-defined in advance. Examples include the infinite mixture of Gaussians model[10] as well as associated mixture regression models.[11]

The infinite nature of these models also lends them to natural language processing applications, where it is often desirable to treat the vocabulary as an infinite, discrete set.

The Dirichlet process can also be used for nonparametric hypothesis testing, i.e. to develop Bayesian nonparametric versions of the classical nonparametric hypothesis tests, e.g. the sign test, the Wilcoxon rank-sum test, the Wilcoxon signed-rank test, etc. For instance, Bayesian nonparametric versions of the Wilcoxon rank-sum test and the Wilcoxon signed-rank test have been developed by using the imprecise Dirichlet process, a prior-ignorance Dirichlet process.[citation needed]


References

  1. ^ Frigyik, Bela A.; Kapila, Amol; Gupta, Maya R. "Introduction to the Dirichlet Distribution and Related Processes" (PDF). Retrieved 2 September 2021.
  2. ^ Ferguson, Thomas (1973). "Bayesian analysis of some nonparametric problems". Annals of Statistics. 1 (2): 209–230. doi:10.1214/aos/1176342360. MR 0350949.
  3. ^ "Dirichlet Process and Dirichlet Distribution – Polya Restaurant Scheme and Chinese Restaurant Process".
  4. ^ For the proof, see Paisley, John (August 2010). "A simple proof of the stick-breaking construction of the Dirichlet Process" (PDF). Columbia University. Archived from the original (PDF) on January 22, 2015.
  5. ^ Aad van der Vaart; Subhashis Ghosal (2017). Fundamentals of Bayesian Nonparametric Inference. Cambridge University Press. ISBN 978-0-521-87826-5.
  6. ^ Sudderth, Erik (2006). Graphical Models for Visual Object Recognition and Tracking (PDF) (Ph.D. thesis). MIT.
  7. ^ Nils Lid Hjort; Chris Holmes; Peter Müller; Stephen G. Walker (2010). Bayesian Nonparametrics. Cambridge University Press. ISBN 978-0-521-51346-3.
  8. ^ Sotirios P. Chatzis, "A Latent Variable Gaussian Process Model with Pitman–Yor Process Priors for Multiclass Classification," Neurocomputing, vol. 120, pp. 482–489, Nov. 2013. doi:10.1016/j.neucom.2013.04.029
  9. ^ Sotirios P. Chatzis; Yiannis Demiris, "Nonparametric mixtures of Gaussian processes with power-law behaviour," IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 12, pp. 1862–1871, Dec. 2012. doi:10.1109/TNNLS.2012.2217986
  10. ^ Rasmussen, Carl (2000). "The Infinite Gaussian Mixture Model" (PDF). Advances in Neural Information Processing Systems. 12: 554–560.
  11. ^ Sotirios P. Chatzis; Dimitrios Korkinof; Yiannis Demiris, "A nonparametric Bayesian approach toward robot learning by demonstration," Robotics and Autonomous Systems, vol. 60, no. 6, pp. 789–802, June 2012. doi:10.1016/j.robot.2012.02.005
