Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Exchangeable random variables

From Wikipedia, the free encyclopedia
Concept in statistics

Instatistics, anexchangeable sequence of random variables (also sometimesinterchangeable)[1] is a sequenceX1X2X3, ... (which may be finitely or infinitely long) whosejoint probability distribution does not change when the positions in the sequence in which finitely many of them appear are altered. In other words, the joint distribution is invariant to finite permutation. Thus, for example the sequences

X1,X2,X3,X4,X5,X6 and X3,X6,X1,X5,X2,X4{\displaystyle X_{1},X_{2},X_{3},X_{4},X_{5},X_{6}\quad {\text{ and }}\quad X_{3},X_{6},X_{1},X_{5},X_{2},X_{4}}

both have the same joint probability distribution.

It is closely related to the use ofindependent and identically distributed random variables in statistical models. Exchangeable sequences of random variables arise in cases ofsimple random sampling.

Definition

[edit]

Formally, anexchangeable sequence of random variables is a finite or infinite sequenceX1X2X3, … ofrandom variables such that for any finitepermutation σ of the indices 1, 2, 3, … (the permutation acts on only finitely many indices, with the rest fixed) thejoint probability distribution of the permuted sequence

Xσ(1),Xσ(2),Xσ(3),{\displaystyle X_{\sigma (1)},X_{\sigma (2)},X_{\sigma (3)},\dots }

is the same as the joint probability distribution of the original sequence.[1][2] A sequenceE1,E2,E3, … of events is said to be exchangeable iff the sequence of itsindicator functions is exchangeable.

The distribution functionFX1, …,Xn(x1, …,xn) of a finite sequence of exchangeable random variables is symmetric in its argumentsx1, …,xn.Olav Kallenberg provided an appropriate definition of exchangeability for continuous-time stochastic processes.[3][4]

History

[edit]

The concept was introduced byWilliam Ernest Johnson in his 1924 bookLogic, Part III: The Logical Foundations of Science.[5] Exchangeability is equivalent to the concept ofstatistical control introduced byWalter Shewhart also in 1924.[6][7]

Exchangeability and the i.i.d. statistical model

[edit]

The property of exchangeability is closely related to the use ofindependent and identically distributed (i.i.d.) random variables in statistical models.[8] A sequence of random variables that are i.i.d., conditional on some underlying distributional form, is exchangeable. This follows directly from the structure of the joint probability distribution generated by the i.i.d. form.

Mixtures of exchangeable sequences (in particular, sequences of i.i.d. variables) are exchangeable. The converse can be established for infinite sequences, through an importantrepresentation theorem byBruno de Finetti (later extended by other probability theorists such asHalmos andSavage).[9] The extended versions of the theorem show that in any infinite sequence of exchangeable random variables, the random variables are conditionally i.i.d., given the underlying distributional form. This theorem is stated briefly below. (De Finetti’s original theorem only showed this to be true for random indicator variables, but this was later extended to encompass all sequences of random variables.) Another way of putting this is that de Finetti’s theorem characterizes exchangeable sequences as mixtures of i.i.d. sequences—while an exchangeable sequence need not itself be unconditionally i.i.d., it can be expressed as a mixture of underlying i.i.d. sequences.[1]

This means that infinite sequences of exchangeable random variables can be regarded equivalently as sequences of conditionally i.i.d. random variables, based on some underlying distributional form. (Note that this equivalence does not quite hold for finite exchangeability. However, for finite vectors of random variables there is a close approximation to the i.i.d. model.) An infinite exchangeable sequence isstrictly stationary and so alaw of large numbers in the form ofBirkhoff–Khinchin theorem applies.[4] This means that the underlying distribution can be given an operational interpretation as the limiting empirical distribution of the sequence of values. The close relationship between exchangeable sequences of random variables and the i.i.d. form means that the latter can be justified on the basis of infinite exchangeability. This notion is central toBruno de Finetti’s development ofpredictive inference and toBayesian statistics. It can also be shown to be a useful foundational assumption infrequentist statistics and to link the two paradigms.[10]

The representation theorem: This statement is based on the presentation in O’Neill (2009) in the references below. Given an infinite sequence of random variablesX=(X1,X2,X3,){\displaystyle \mathbf {X} =(X_{1},X_{2},X_{3},\ldots )} we define the limitingempirical distribution functionFX{\displaystyle F_{\mathbf {X} }} by

FX(x)=limn1ni=1nI(Xix).{\displaystyle F_{\mathbf {X} }(x)=\lim _{n\to \infty }{\frac {1}{n}}\sum _{i=1}^{n}I(X_{i}\leq x).}

(This is theCesàro limit of the indicator functions. In cases where the Cesàro limit does not exist this function can actually be defined as theBanach limit of the indicator functions, which is an extension of this limit. This latter limit always exists for sums of indicator functions, so that the empirical distribution is always well-defined.) This means that for any vector of random variables in the sequence we have joint distribution function given by

Pr(X1x1,X2x2,,Xnxn)=i=1nFX(xi)dP(FX).{\displaystyle \Pr(X_{1}\leq x_{1},X_{2}\leq x_{2},\ldots ,X_{n}\leq x_{n})=\int \prod _{i=1}^{n}F_{\mathbf {X} }(x_{i})\,dP(F_{\mathbf {X} }).}

If the distribution functionFX{\displaystyle F_{\mathbf {X} }} is indexed by another parameterθ{\displaystyle \theta } then (with densities appropriately defined) we have

pX1,,Xn(x1,,xn)=i=1npXi(xiθ)dP(θ).{\displaystyle p_{X_{1},\ldots ,X_{n}}(x_{1},\ldots ,x_{n})=\int \prod _{i=1}^{n}p_{X_{i}}(x_{i}\mid \theta )\,dP(\theta ).}

These equations show the joint distribution or density characterised as a mixture distribution based on the underlying limiting empirical distribution (or a parameter indexing this distribution).

Note that not all finite exchangeable sequences are mixtures of i.i.d.. To see this, consider sampling without replacement from afinite set until no elements are left. The resulting sequence is exchangeable, but not a mixture of i.i.d.. Indeed, conditioned on all other elements in the sequence, the remaining element is known.

Covariance and correlation

[edit]

Exchangeable sequences have some basiccovariance and correlation properties which mean that they are generally positively correlated. For infinite sequences of exchangeable random variables, the covariance between the random variables is equal to the variance of the mean of the underlying distribution function.[10] For finite exchangeable sequences the covariance is also a fixed value which does not depend on the particular random variables in the sequence. There is a weaker lower bound than for infinite exchangeability and it is possible for negative correlation to exist.

Covariance for exchangeable sequences (infinite): If the sequenceX1,X2,X3,{\displaystyle X_{1},X_{2},X_{3},\ldots } is exchangeable, then

cov(Xi,Xj)=var(E(XiFX))=var(E(Xiθ))0for ij.{\displaystyle \operatorname {cov} (X_{i},X_{j})=\operatorname {var} (\operatorname {E} (X_{i}\mid F_{\mathbf {X} }))=\operatorname {var} (\operatorname {E} (X_{i}\mid \theta ))\geq 0\quad {\text{for }}i\neq j.}

Covariance for exchangeable sequences (finite): IfX1,X2,,Xn{\displaystyle X_{1},X_{2},\ldots ,X_{n}} is exchangeable withσ2=var(Xi){\displaystyle \sigma ^{2}=\operatorname {var} (X_{i})}, then

cov(Xi,Xj)σ2n1for ij.{\displaystyle \operatorname {cov} (X_{i},X_{j})\geq -{\frac {\sigma ^{2}}{n-1}}\quad {\text{for }}i\neq j.}

The finite sequence result may be proved as follows. Using the fact that the values are exchangeable, we have

0var(X1++Xn)=var(X1)++var(Xn)+cov(X1,X2)+all ordered pairs=nσ2+n(n1)cov(X1,X2).{\displaystyle {\begin{aligned}0&\leq \operatorname {var} (X_{1}+\cdots +X_{n})\\&=\operatorname {var} (X_{1})+\cdots +\operatorname {var} (X_{n})+\underbrace {\operatorname {cov} (X_{1},X_{2})+\cdots \quad {}} _{\text{all ordered pairs}}\\&=n\sigma ^{2}+n(n-1)\operatorname {cov} (X_{1},X_{2}).\end{aligned}}}

We can then solve the inequality for the covariance yielding the stated lower bound. The non-negativity of the covariance for the infinite sequence can then be obtained as a limiting result from this finite sequence result.

Equality of the lower bound for finite sequences is achieved in a simple urn model: An urn contains 1 red marble andn − 1 green marbles, and these are sampled without replacement until the urn is empty. LetXi = 1 if the red marble is drawn on thei-th trial and 0 otherwise. A finite sequence that achieves the lower covariance bound cannot be extended to a longer exchangeable sequence.[11]

Examples

[edit]

Applications

[edit]

Thevon Neumann extractor is arandomness extractor that depends on exchangeability: it gives a method to take an exchangeable sequence of 0s and 1s (Bernoulli trials), with some probabilityp of 0 andq=1p{\displaystyle q=1-p} of 1, and produce a (shorter) exchangeable sequence of 0s and 1s with probability 1/2.

Partition the sequence into non-overlapping pairs: if the two elements of the pair are equal (00 or 11), discard it; if the two elements of the pair are unequal (01 or 10), keep the first. This yields a sequence of Bernoulli trials withp=1/2,{\displaystyle p=1/2,} as, by exchangeability, the odds of a given pair being 01 or 10 are equal.

Exchangeable random variables arise in the study ofU statistics, particularly in the Hoeffding decomposition.[13]

Exchangeability is a key assumption of the distribution-free inference method ofconformal prediction.[14]

See also

[edit]

References

[edit]
  1. ^abcIn short, the order of the sequence of random variables does not affect its joint probability distribution.
    • Chow, Yuan Shih and Teicher, Henry,Probability theory. Independence, interchangeability, martingales, Springer Texts in Statistics, 3rd ed., Springer, New York, 1997. xxii+488 pp. ISBN 0-387-98228-0
  2. ^Aldous, David J.,Exchangeability and related topics, in: École d'Été de Probabilités de Saint-Flour XIII — 1983, Lecture Notes in Math. 1117, pp. 1–198, Springer, Berlin, 1985.ISBN 978-3-540-15203-3doi:10.1007/BFb0099421
  3. ^Diaconis, Persi (2009)."Book review:Probabilistic symmetries and invariance principles (Olav Kallenberg, Springer, New York, 2005)".Bulletin of the American Mathematical Society. New Series.46 (4):691–696.doi:10.1090/S0273-0979-09-01262-2.MR 2525743.
  4. ^abKallenberg, O.,Probabilistic symmetries and invariance principles. Springer-Verlag, New York (2005). 510 pp. ISBN 0-387-25115-4.
  5. ^Zabell, S. L. (1992). "Predicting the unpredictable".Synthese.90 (2): 205.doi:10.1007/bf00485351.S2CID 9416747.
  6. ^Barlow, R. E. & Irony, T. Z. (1992) "Foundations of statistical quality control" in Ghosh, M. & Pathak, P.K. (eds.)Current Issues in Statistical Inference: Essays in Honor of D. Basu, Hayward, CA: Institute of Mathematical Statistics, 99-112.
  7. ^Bergman, B. (2009) "Conceptualistic Pragmatism: A framework for Bayesian analysis?",IIE Transactions,41, 86–93
  8. ^Cordani, L. K.; Wechsler, S. (2006)."Teaching independence and exchangeability"(PDF).Proceedings of the International Conference on Teaching Statistics. Den Haag: International Association for Statistical Education.
  9. ^Diaconis, P. (1988). "Recent Progress on de Finetti's Notions of Exchangeability". InBernardo, J. M.; et al. (eds.).Bayesian Statistics. Vol. 3. Oxford University Press. pp. 111–125.ISBN 0-19-852220-7.
  10. ^abO'Neill, B. (2009). "Exchangeability, Correlation and Bayes' Effect".International Statistical Review.77 (2):241–250.doi:10.1111/j.1751-5823.2008.00059.x.
  11. ^Taylor, Robert Lee; Daffer, Peter Z.; Patterson, Ronald F. (1985).Limit theorems for sums of exchangeable random variables. Rowman and Allanheld. pp. 1–152.ISBN 9780847674350.
  12. ^Spizzichino, FabioSubjective probability models for lifetimes. Monographs on Statistics and Applied Probability, 91.Chapman & Hall/CRC, Boca Raton, FL, 2001. xx+248 pp. ISBN 1-58488-060-0
  13. ^Borovskikh, Yu. V. (1996). "Chapter 10 Dependent variables".U-statistics in Banach spaces. Utrecht: VSP. pp. 365–376.ISBN 90-6764-200-2.MR 1419498.
  14. ^Shafer, Glenn; Vovk, Vladimir (2008)."A Tutorial on Conformal Prediction".Journal of Machine Learning Research.9:371–421.

Further reading

[edit]
Discrete time
Continuous time
Both
Fields and other
Time series models
Financial models
Actuarial models
Queueing models
Properties
Limit theorems
Inequalities
Tools
Disciplines
Retrieved from "https://en.wikipedia.org/w/index.php?title=Exchangeable_random_variables&oldid=1334892634"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp