Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

BRS-inequality

From Wikipedia, the free encyclopedia
This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(February 2022) (Learn how and when to remove this message)

BRS-inequality is the short name forBruss-Robertson-Steele inequality. This inequality gives a convenient upper bound for the expected maximum number of non-negative random variables one can sum up without exceeding a given upper bounds>0{\displaystyle s>0}.

For example, suppose 100 random variablesX1,X2,...,X100{\displaystyle X_{1},X_{2},...,X_{100}} are all uniformly distributed on[0,1]{\displaystyle [0,1]}, not necessarily independent, and lets=10{\displaystyle s=10}, say. LetN[n,s]:=N[100,10]{\displaystyle N[n,s]:=N[100,10]} be the maximum number ofXj{\displaystyle X_{j}} one can select in{X1,X2,...,X100}{\displaystyle \{X_{1},X_{2},...,X_{100}\}} such that their sum does not exceeds=10{\displaystyle s=10}.N[100,10]{\displaystyle N[100,10]} is arandom variable, so what can one say about bounds for its expectation? How would an upper bound forE(N[n,s]){\displaystyle E(N[n,s])} behave, if one changes the sizen{\displaystyle n} of the sample and keepss{\displaystyle s} fixed, or alternatively, if one keepsn{\displaystyle n} fixed but variess{\displaystyle s}? What can one say aboutE(N[n,s]){\displaystyle E(N[n,s])}, if the uniform distribution is replaced by another continuous distribution? In all generality, what can one say if eachXk{\displaystyle X_{k}} may have its own continuous distribution functionFk{\displaystyle F_{k}}?

General problem

[edit]

LetX1,X2,...{\displaystyle X_{1},X_{2},...} be a sequence of non-negative random variables (possibly dependent) that are jointly continuously distributed. Forn{1,2,...}{\displaystyle n\in \{1,2,...\}} andsR+{\displaystyle s\in \mathbb {R} ^{+}} letN[n,s]{\displaystyle N[n,s]} be the maximum number of observations amongX1,X2,...,Xn{\displaystyle X_{1},X_{2},...,X_{n}} that one can sum up without exceedings{\displaystyle s}.

Now, to obtainN[n,s]{\displaystyle N[n,s]} one may think of looking at the list of all observations, first select the smallest one, then add the second smallest, then the third and so on, as long as the accumulated sum does not exceeds{\displaystyle s}. HenceN[s,n]{\displaystyle N[s,n]} can be defined in terms of the increasingorder statistics ofX1,X2,,Xn{\displaystyle X_{1},X_{2},\cdots ,X_{n}}, denoted byX1,nX2,nXn,n,{\displaystyle X_{1,n}\leq X_{2,n}\leq \cdots \leq X_{n,n},}, namely by

N[n,s]={0, if  X1,n>s,max{ kN: X1,n+X2,n++Xk,ns}, otherwise.(1){\displaystyle {\begin{aligned}N[n,s]={\begin{cases}0&,{\rm {~if~}}~X_{1,n}>s,\\\max\{~k\in \mathbb {N} :~X_{1,n}+X_{2,n}+\cdots +X_{k,n}\leq s\}&,{\rm {~otherwise}}.\end{cases}}\end{aligned}}(1)}

What is the best possiblegeneral upper bound forE(N[n,s]){\displaystyle E(N[n,s])} if one requires only the continuity of the joint distribution of all variables? And then, how to compute this bound?

Identically distributed random variables.

[edit]

Theorem 1 LetX1,X2,,Xn{\displaystyle X_{1},X_{2},\cdots ,X_{n}} be identically distributed non-negative random variables with absolutely continuous distribution functionF{\displaystyle F}. Then

E(N[n,s])nF(t),{\displaystyle E(N[n,s])\leq nF(t),} (2)

wheret:=t(n,s){\displaystyle t:=t(n,s)} solves the so-called BRS-equation

n0txdF(x)=s{\displaystyle n\int _{0}^{t}x\,dF(x)\,=\,s}. (3)

As an example, here are the answers for the questions posed at the beginning.Thus let allX1,X2,,Xn{\displaystyle X_{1},X_{2},\cdots ,X_{n}} be uniformly distributed on[0,1]{\displaystyle [0,1]}. ThenF(t)=t{\displaystyle F(t)=t} on[0,1]{\displaystyle [0,1]}, and hencedF(x)/dx=1{\displaystyle dF(x)/dx=1} on[0,1]{\displaystyle [0,1]}.The BRS-equation becomes

n0txdx=nt2/2=s.{\displaystyle n\int _{0}^{t}xdx=nt^{2}/2=s.}

The solution ist=2s/n{\displaystyle t={\sqrt {2s/n}}}, and thus from the inequality (2)

E(N[n,s])nF(t)=n2s/n=2sn{\displaystyle E(N[n,s])\leq n\,F(t)=n{\sqrt {2s/n}}={\sqrt {2sn}}}. (4)

Since one always hasN[n,s]n{\displaystyle N[n,s]\leq n}, this bound becomes trivial forsnE(X)=n/2{\displaystyle s\geq nE(X)=n/2}.

For the example questions withn=100,s=10{\displaystyle n=100,s=10} this yieldsE(N[100,10])200044.7{\displaystyle E(N[100,10])\leq {\sqrt {2000}}\approx 44.7}.As one sees from (4), doubling the sample sizen{\displaystyle n} and keepings{\displaystyle s} fixed, or vice versa, yield for the uniform distribution in the non-trivial case the same upper bound.

Generalised BRS-inequality

[edit]

Theorem 2. LetX1,X2,,Xn{\displaystyle X_{1},X_{2},\cdots ,X_{n}} be positive random variables that are jointly distributed such thatXk{\displaystyle X_{k}} has an absolutely continuous distribution functionFk, k=1,2,,n.{\displaystyle F_{k},~k=1,2,\cdots ,n.}.IfN[n,s]{\displaystyle N[n,s]} is defined as before, then

E(N[n,s])k=1nFk(t){\displaystyle E(N[n,s])\leq \sum _{k=1}^{n}F_{k}(t)}, (5)

wheret:=t(n,s){\displaystyle t:=t(n,s)} is the unique solution of the BRS-equation

k=1n0txdFk(x)=s.{\displaystyle \sum _{k=1}^{n}\int _{0}^{t}\,x\,dF_{k}(x)=s.} (6)

Clearly, if all random variablesXi,i=1,2,,n{\displaystyle X_{i},i=1,2,\cdots ,n} have the same marginal distributionF{\displaystyle F}, then (6) recaptures (3), and (5) recaptures (2).Again it should be pointed out thatno independence hypothesis whatsoever is needed.

Refinements of the BRS-inequality

[edit]

Depending on the type of the distributionsFk{\displaystyle F_{k}}, refinements of Theorem 2 can be of true interest. We just mention one of them.

LetA[n,s]{\displaystyle A[n,s]} be the random set of those variables one can sum up to yield the maximum random numberN[n,s]{\displaystyle N[n,s]}, that is,

#A[n,s]=N[n,s]{\displaystyle \#A[n,s]=N[n,s]},

and letSA[n,s]{\displaystyle S_{A[n,s]}} denote the sum of these variables.The so-calledresidualsSA[n,s]{\displaystyle s-S_{A[n,s]}}is by definition always non-negative, and one has:

Theorem 3. LetX1,X2,,Xn{\displaystyle X_{1},X_{2},\cdots ,X_{n}} be jointly continuously distributed withmarginal distribution functionsFk,k=1,2,,n{\displaystyle F_{k},k=1,2,\cdots ,n}, and lett:=t(n,s){\displaystyle t:=t(n,s)} be the solution of (6). Then

E(N[n,s])(k=1nFk(t(n,s)))sE(SA[n,s])t(n,s){\displaystyle E(N[n,s])\leq \left(\sum _{k=1}^{n}F_{k}(t(n,s))\right)-{\frac {s-E(S_{A[n,s]})}{t(n,s)}}}. (7)

The improvement in (7) compared with (5) therefore consists of

sE(SA[n,s])t(n,s){\displaystyle {\frac {s-E(S_{A[n,s]})}{t(n,s)}}}.

The expected residual in the numerator is typically difficult to compute or estimate, but there exist nice exceptions. For example, if allXk{\displaystyle X_{k}} are independent exponential random variables, then the memoryless property implies (if s is exceeded) the distributional symmetry of the residual and the overshoot overs{\displaystyle s}. For fixeds{\displaystyle s} one can then show that :sE(SA[n,s])t(n,s)1/2 as n{\displaystyle {\frac {s-E(S_{A[n,s]})}{t(n,s)}}\to 1/2{\rm {~as~}}n\to \infty }. This improvement fluctuates around1/2{\displaystyle 1/2}, and the convergence to1/2{\displaystyle 1/2}, (simulations) seems quick.

Source

[edit]

The first version of the BRS-inequality (Theorem 1) was proved in Lemma 4.1 ofF. Thomas Bruss and James B.Robertson (1991). This paper proves moreover that the upper bound is asymptotically tight if the random variables are independent of each other. The generalisation to arbitrary continuous distributions (Theorem 2) is due toJ. Michael Steele (2016). Theorem 3 and other refinements of the BRS-inequality are more recent and proved in Bruss (2021).

Applications

[edit]

The BRS-inequality is a versatile tool since it applies to many types of problems, and since the computation of the BRS-equation is often not very involved. Also, and in particular, one notes that the maximum numberN[n,s]{\displaystyle N[n,s]} always dominates the maximum number of selections underany additional constraint, such as e.g. foronline selections without recall. Examples studied in Steele (2016) and Bruss (2021) touch many applications, including comparisons between i.i.d. sequences and non-i.i.d. sequences, problems of condensingpoint processes, “awkward” processes,selection algorithms,knapsack problems,Borel-Cantelli-type problems, theBruss-Duerinckx theorem, and onlineTiling strategies.

References

[edit]

Bruss F. T. and Robertson J. B. (1991) ’Wald's Lemma’ for Sums of Order Statistics of i.i.d. Random Variables, Adv. Appl. Probab., Vol. 23, 612-623.

Bruss F. T. and Duerinckx M. (2015), Resource dependent branching processes and the envelope of societie, Ann. of Appl. Probab., Vol. 25 (1), 324-372.

Steele J.M. (2016), The Bruss-Robertson Inequality: Elaborations, Extensions, and Applications, Math. Applicanda, Vol. 44, No 1, 3-16.

Bruss F. T. (2021),The BRS-inequality and its applications, Probab. Surveys, Vol.18, 44-76.

Retrieved from "https://en.wikipedia.org/w/index.php?title=BRS-inequality&oldid=1312480525"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp