Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Rate of convergence

From Wikipedia, the free encyclopedia
Speed of convergence of a mathematical sequence

Inmathematical analysis, particularlynumerical analysis, therate of convergence andorder of convergence of asequence that converges to alimit are any of several characterizations of how quickly that sequence approaches its limit. These are broadly divided into rates and orders of convergence that describe how quickly a sequence further approaches its limit once it is already close to it, calledasymptotic rates and orders of convergence, and those that describe how quickly sequences approach their limits from starting points that are not necessarily close to their limits, called non-asymptotic rates and orders of convergence.

Asymptotic behavior is particularly useful for deciding when to stop a sequence of numerical computations, for instance once a target precision has been reached with an iterativeroot-finding algorithm, but pre-asymptotic behavior is often crucial for determining whether to begin a sequence of computations at all, since it may be impossible or impractical to ever reach a target precision with a poorly chosen approach. Asymptotic rates and orders of convergence are the focus of this article.

In practical numerical computations, asymptotic rates and orders of convergence follow two common conventions for two types of sequences: the first for sequences of iterations of aniterative numerical method and the second for sequences of successively more accurate numericaldiscretizations of a target. In formal mathematics, rates of convergence and orders of convergence are often described comparatively usingasymptotic notation commonly called "big O notation," which can be used to encompass both of the prior conventions; this is an application ofasymptotic analysis.

For iterative methods, a sequence(xk){\displaystyle (x_{k})} that converges toL{\displaystyle L} is said to have asymptoticorder of convergenceq1{\displaystyle q\geq 1} and asymptoticrate of convergenceμ{\displaystyle \mu } if

limk|xk+1L||xkL|q=μ.{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|x_{k+1}-L\right|}{\left|x_{k}-L\right|^{q}}}=\mu .}[1]

Where methodological precision is required, these rates and orders of convergence are known specifically as the rates and orders of Q-convergence, short for quotient-convergence, since the limit in question is a quotient of error terms.[1] The rate of convergenceμ{\displaystyle \mu } may also be called theasymptotic error constant, and some authors will userate where this article usesorder.[2]Series acceleration methods are techniques for improving the rate of convergence of the sequence of partial sums of aseries and possibly its order of convergence, also.

Similar concepts are used for sequences of discretizations. For instance, ideally the solution of adifferential equation discretized via aregular grid will converge to the solution of the continuous equation as the grid spacing goes to zero, and if so the asymptotic rate and order of that convergence are important properties of the gridding method. A sequence of approximate grid solutions(yk){\displaystyle (y_{k})} of some problem that converges to a true solutionS{\displaystyle S} with a corresponding sequence of regular grid spacings(hk){\displaystyle (h_{k})} that converge to 0 is said to have asymptoticorder of convergenceq{\displaystyle q} and asymptoticrate of convergenceμ{\displaystyle \mu } if

limk|ykS|hkq=μ,{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|y_{k}-S\right|}{h_{k}^{q}}}=\mu ,}

where the absolute value symbols stand for ametric for the space of solutions such as theuniform norm. Similar definitions also apply for non-grid discretization schemes such as thepolygon meshes of afinite element method or thebasis sets incomputational chemistry: in general, the appropriate definition of the asymptotic rateμ{\displaystyle \mu } will involve the asymptotic limit of the ratio of an approximation error term above to an asymptotic orderq{\displaystyle q} power of a discretization scale parameter below.

In general, comparatively, one sequence(ak){\displaystyle (a_{k})} that converges to a limitLa{\displaystyle L_{a}} is said to asymptotically converge more quickly than another sequence(bk){\displaystyle (b_{k})} that converges to a limitLb{\displaystyle L_{b}} if

limk|akLa||bkLb|=0,{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|a_{k}-L_{a}\right|}{|b_{k}-L_{b}|}}=0,}

and the two are said to asymptotically converge with the same order of convergence if the limit is any positive finite value. The two are said to be asymptotically equivalent if the limit is equal to one. These comparative definitions of rate and order of asymptotic convergence are fundamental in asymptotic analysis and find wide application in mathematical analysis as a whole, including numerical analysis,real analysis,complex analysis, andfunctional analysis.

Asymptotic rates of convergence for iterative methods

[edit]

Definitions

[edit]

Q-convergence

[edit]

Suppose that thesequence(xk){\displaystyle (x_{k})} of iterates of aniterative method converges to thelimit numberL{\displaystyle L} ask{\displaystyle k\rightarrow \infty }. The sequence is said toconverge with orderq{\displaystyle q} toL{\displaystyle L} and with arate of convergenceμ{\displaystyle \mu } if thek{\displaystyle k\rightarrow \infty } limit of quotients ofabsolute differences of sequential iteratesxk,xk+1{\displaystyle x_{k},x_{k+1}} from their limitL{\displaystyle L} satisfies

limk|xk+1L||xkL|q=μ{\displaystyle \lim _{k\to \infty }{\frac {|x_{k+1}-L|}{|x_{k}-L|^{q}}}=\mu }

for some positive constantμ(0,1){\displaystyle \mu \in (0,1)} ifq=1{\displaystyle q=1} andμ(0,){\displaystyle \mu \in (0,\infty )} ifq>1{\displaystyle q>1}.[1][3][4] Other more technical rate definitions are needed if the sequence converges butlimk|xk+1L||xkL|=1{\textstyle \lim _{k\to \infty }{\frac {|x_{k+1}-L|}{|x_{k}-L|}}=1}[5] or the limit does not exist.[1] This definition is technically called Q-convergence, short for quotient-convergence, and the rates and orders are called rates and orders of Q-convergence when that technical specificity is needed.§ R-convergence, below, is an appropriate alternative when this limit does not exist.

Sequences with larger ordersq{\displaystyle q} converge more quickly than those with smaller order, and those with smaller ratesμ{\displaystyle \mu } converge more quickly than those with larger rates for a given order. This "smaller rates converge more quickly" behavior among sequences of the same order is standard but it can be counterintuitive. Therefore it is also common to definelog10μ{\displaystyle -\log _{10}\mu } as the rate; this is the "number of extra decimals of precision per iterate" for sequences that converge with order 1.[1]

Integer powers ofq{\displaystyle q} are common and are given common names. Convergence with orderq=1{\displaystyle q=1} andμ(0,1){\displaystyle \mu \in (0,1)} is calledlinear convergence and the sequence is said toconverge linearly toL{\displaystyle L}. Convergence withq=2{\displaystyle q=2} and anyμ{\displaystyle \mu } is calledquadratic convergence and the sequence is said toconverge quadratically. Convergence withq=3{\displaystyle q=3} and anyμ{\displaystyle \mu } is calledcubic convergence. However, it is not necessary thatq{\displaystyle q} be an integer. For example, thesecant method, when converging to a regular,simple root, has an order of thegolden ratio φ ≈ 1.618.[6]

The common names for integer orders of convergence connect toasymptotic big O notation, where the convergence of the quotient implies|xk+1L|=O(|xkL|q).{\textstyle |x_{k+1}-L|=O(|x_{k}-L|^{q}).} These are linear, quadratic, and cubic polynomial expressions whenq{\displaystyle q} is 1, 2, and 3, respectively. More precisely, the limits imply the leading order error is exactlyμ|xkL|q,{\textstyle \mu |x_{k}-L|^{q},} which can be expressed usingasymptotic small o notation as|xk+1L|=μ|xkL|q+o(|xkL|q).{\textstyle |x_{k+1}-L|=\mu |x_{k}-L|^{q}+o(|x_{k}-L|^{q}).}

In general, whenq>1{\displaystyle q>1} for a sequence or for any sequence that satisfieslimk|xk+1L||xkL|=0,{\textstyle \lim _{k\to \infty }{\frac {|x_{k+1}-L|}{|x_{k}-L|}}=0,} those sequences are said toconverge superlinearly (i.e., faster than linearly).[1] A sequence is said toconverge sublinearly (i.e., slower than linearly) if it converges andlimk|xk+1L||xkL|=1.{\textstyle \lim _{k\to \infty }{\frac {|x_{k+1}-L|}{|x_{k}-L|}}=1.} Importantly, it is incorrect to say that these sublinear-order sequences converge linearly with an asymptotic rate of convergence of 1. A sequence(xk){\displaystyle (x_{k})}converges logarithmically toL{\displaystyle L} if the sequence converges sublinearly and alsolimk|xk+1xk||xkxk1|=1.{\textstyle \lim _{k\to \infty }{\frac {|x_{k+1}-x_{k}|}{|x_{k}-x_{k-1}|}}=1.}[5]

R-convergence

[edit]

The definitions of Q-convergence rates have the shortcoming that they do not naturally capture the convergence behavior of sequences that do converge, but do not converge with an asymptotically constant rate with every step, so that the Q-convergence limit does not exist. One class of examples is the staggered geometric progressions that get closer to their limits only every other step or every several steps, for instance the example(bk)=1,1,1/4,1/4,1/16,1/16,,1/4k2,{\textstyle (b_{k})=1,1,1/4,1/4,1/16,1/16,\ldots ,1/4^{\left\lfloor {\frac {k}{2}}\right\rfloor },\ldots } detailed below (wherex{\textstyle \lfloor x\rfloor } is thefloor function applied tox{\displaystyle x}). The defining Q-linear convergence limits do not exist for this sequence because one subsequence of error quotients starting from odd steps converges to 1 and another subsequence of quotients starting from even steps converges to 1/4. When two subsequences of a sequence converge to different limits, the sequence does not itself converge to a limit.

In cases like these, a closely related but more technical definition of rate of convergence called R-convergence is more appropriate. The "R-" prefix stands for "root."[1][7]: 620  A sequence(xk){\displaystyle (x_{k})} that converges toL{\displaystyle L} is said toconverge at least R-linearly if there exists an error-bounding sequence(εk){\displaystyle (\varepsilon _{k})} such that|xkL|εkfor all k{\textstyle |x_{k}-L|\leq \varepsilon _{k}\quad {\text{for all }}k} and(εk){\displaystyle (\varepsilon _{k})} converges Q-linearly to zero; analogous definitions hold for R-superlinear convergence, R-sublinear convergence, R-quadratic convergence, and so on.[1]

Any error bounding sequence(εk){\displaystyle (\varepsilon _{k})} provides a lower bound on the rate and order of R-convergence and the greatest lower bound gives the exact rate and order of R-convergence. As for Q-convergence, sequences with larger ordersq{\displaystyle q} converge more quickly and those with smaller ratesμ{\displaystyle \mu } converge more quickly for a given order, so these greatest-rate-lower-bound error-upper-bound sequences are those that have the greatest possibleq{\displaystyle q} and the smallest possibleμ{\displaystyle \mu } given thatq{\displaystyle q}.

For the example(bk){\textstyle (b_{k})} given above, the tight bounding sequence(εk)=2,1,1/2,1/4,1/8,1/16,,1/2k1,{\textstyle (\varepsilon _{k})=2,1,1/2,1/4,1/8,1/16,\ldots ,1/2^{k-1},\ldots }converges Q-linearly with rate 1/2, so(bk){\textstyle (b_{k})} converges R-linearly with rate 1/2. Generally, for any staggered geometric progression(ark/m){\displaystyle (ar^{\lfloor k/m\rfloor })}, the sequence will not converge Q-linearly but will converge R-linearly with rate|r|m.{\textstyle {\sqrt[{m}]{|r|}}.} These examples demonstrate why the "R" in R-linear convergence is short for "root."

Examples

[edit]

Thegeometric progression(ak)=1,12,14,18,116,132,,(12)k,{\textstyle (a_{k})=1,{\frac {1}{2}},{\frac {1}{4}},{\frac {1}{8}},{\frac {1}{16}},{\frac {1}{32}},\ldots ,{\bigl (}{\tfrac {1}{2}}{\bigr )}^{k},\dots } converges toL=0{\displaystyle L=0}. Plugging the sequence into the definition of Q-linear convergence (i.e., order of convergence 1) shows that

limk|1/2k+10||1/2k0|=limk2k2k+1=12.{\displaystyle \lim _{k\to \infty }{\frac {\left|1/2^{k+1}-0\right|}{\left|1/2^{k}-0\right|}}=\lim _{k\to \infty }{\frac {2^{k}}{2^{k+1}}}={\frac {1}{2}}.}

Thus(ak){\displaystyle (a_{k})} converges Q-linearly with a convergence rate ofμ=1/2{\displaystyle \mu =1/2}; see the first plot of the figure below.

More generally, for any initial valuea{\displaystyle a} in the real numbers and a real number common ratior{\displaystyle r} between -1 and 1, a geometric progression(ark){\displaystyle (ar^{k})} converges linearly with rate|r|{\displaystyle |r|} and the sequence of partial sums of ageometric series(n=0karn){\textstyle {\bigl (}\sum _{n=0}^{k}ar^{n}{\bigr )}} also converges linearly with rate|r|{\displaystyle |r|}. The same holds also for geometric progressions and geometric series parameterized by anycomplex numbersaC,rC,|r|<1.{\displaystyle a\in \mathbb {C} ,r\in \mathbb {C} ,|r|<1.}

The staggered geometric progression(bk)=1,1,14,14,116,116,,(14)k/2,,{\textstyle (b_{k})=1,1,{\frac {1}{4}},{\frac {1}{4}},{\frac {1}{16}},{\frac {1}{16}},\ldots ,{\bigl (}{\tfrac {1}{4}}{\bigr )}^{\left\lfloor k/2\right\rfloor },\ldots ,} using thefloor functionx{\textstyle \lfloor x\rfloor } that gives the largest integer that is less than or equal tox,{\displaystyle x,} converges R-linearly to 0 with rate 1/2, but it does not converge Q-linearly; see the second plot of the figure below. The defining Q-linear convergence limits do not exist for this sequence because one subsequence of error quotients starting from odd steps converges to 1 and another subsequence of quotients starting from even steps converges to 1/4. When two subsequences of a sequence converge to different limits, the sequence does not itself converge to a limit. Generally, for any staggered geometric progression(ark/m){\displaystyle (ar^{\lfloor k/m\rfloor })}, the sequence will not converge Q-linearly but will converge R-linearly with rate|r|m;{\textstyle {\sqrt[{m}]{|r|}};} these examples demonstrate why the "R" in R-linear convergence is short for "root."

The sequence(ck)=12,14,116,1256,165,536,,122k,{\displaystyle (c_{k})={\frac {1}{2}},{\frac {1}{4}},{\frac {1}{16}},{\frac {1}{256}},{\frac {1}{65,\!536}},\ldots ,{\frac {1}{2^{2^{k}}}},\ldots }converges to zero Q-superlinearly. In fact, it is quadratically convergent with a quadratic convergence rate of 1. It is shown in the third plot of the figure below.

Finally, the sequence(dk)=1,12,13,14,15,16,,1k+1,{\displaystyle (d_{k})=1,{\frac {1}{2}},{\frac {1}{3}},{\frac {1}{4}},{\frac {1}{5}},{\frac {1}{6}},\ldots ,{\frac {1}{k+1}},\ldots }converges to zero Q-sublinearly and logarithmically and its convergence is shown as the fourth plot of the figure below.

Plot showing the different rates of convergence for the sequences ak, bk, ck and dk.
Log-linear plots of the example sequencesak,bk,ck, anddk that exemplify linear, linear, superlinear (quadratic), and sublinear rates of convergence, respectively.

Convergence rates to fixed points of recurrent sequences

[edit]

Recurrent sequencesxk+1:=f(xk){\textstyle x_{k+1}:=f(x_{k})}, calledfixed point iterations, define discrete time autonomousdynamical systems and have important general applications in mathematics through variousfixed-point theorems about their convergence behavior. Whenf iscontinuously differentiable, given afixed pointp,f(p)=p,{\textstyle f(p)=p,} such that|f(p)|<1{\textstyle |f'(p)|<1}, the fixed point is anattractive fixed point and the recurrent sequence will converge at least linearly top for any starting valuex0{\displaystyle x_{0}} sufficiently close top. If|f(p)|=0{\displaystyle |f'(p)|=0} and|f(p)|<1{\textstyle |f''(p)|<1}, then the recurrent sequence will converge at least quadratically, and so on. If|f(p)|>1{\displaystyle |f'(p)|>1}, then the fixed point is arepulsive fixed point and sequences cannot converge top from its immediateneighborhoods, though they may still jump top directly from outside of its local neighborhoods.

Order estimation

[edit]

A practical method to calculate the order of convergence for a sequence generated by a fixed point iteration is to calculate the following sequence, which converges to the orderq{\displaystyle q}:[8]qlog|xk+1xkxkxk1|log|xkxk1xk1xk2|.{\displaystyle q\approx {\frac {\log \left|\displaystyle {\frac {x_{k+1}-x_{k}}{x_{k}-x_{k-1}}}\right|}{\log \left|\displaystyle {\frac {x_{k}-x_{k-1}}{x_{k-1}-x_{k-2}}}\right|}}.}

For numerical approximation of an exact value through a numerical method of orderq{\displaystyle q} see.[9]

Accelerating convergence rates

[edit]
Main article:Series acceleration

Many methods exist to accelerate the convergence of a given sequence, i.e., totransform one sequence into a second sequence that converges more quickly to the same limit. Such techniques are in general known as "series acceleration" methods. These may reduce thecomputational costs of approximating the limits of the original sequences. One example of series acceleration by sequence transformation isAitken's delta-squared process. These methods in general, and in particular Aitken's method, do not typically increase the order of convergence and thus they are useful only if initially the convergence is not faster than linear: if(xk){\displaystyle (x_{k})} converges linearly, Aitken's method transforms it into a sequence(ak){\displaystyle (a_{k})} that still converges linearly (except for pathologically designed special cases), but faster in the sense thatlimk(akL)/(xkL)=0{\textstyle \lim _{k\rightarrow \infty }(a_{k}-L)/(x_{k}-L)=0}. On the other hand, if the convergence is already of order ≥ 2, Aitken's method will bring no improvement.

Asymptotic rates of convergence for discretization methods

[edit]
icon
This sectionneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources in this section. Unsourced material may be challenged and removed.(August 2020) (Learn how and when to remove this message)

Definitions

[edit]

A sequence of discretized approximations(yk){\displaystyle (y_{k})} of some continuous-domain functionS{\displaystyle S} that converges to this target, together with a corresponding sequence of discretization scale parameters(hk){\displaystyle (h_{k})} that converge to 0, is said to have asymptoticorder of convergenceq{\displaystyle q} and asymptoticrate of convergenceμ{\displaystyle \mu } if

limk|ykS|hkq=μ,{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|y_{k}-S\right|}{h_{k}^{q}}}=\mu ,}

for some positive constantsμ{\displaystyle \mu } andq{\displaystyle q} and using|x|{\displaystyle |x|} to stand for an appropriatedistance metric on thespace of solutions, most often either theuniform norm, theabsolute difference, or theEuclidean distance. Discretization scale parameters may be spacings of aregular grid in space or in time, the inverse of the number of points of a grid in one dimension, an average or maximum distance between points in apolygon mesh, the single-dimension spacings of an irregularsparse grid, or a characteristic quantum of energy or momentum in aquantum mechanicalbasis set.

When all the discretizations are generated using a single common method, it is common to discuss the asymptotic rate and order of convergence for the method itself rather than any particular discrete sequences of discretized solutions. In these cases one considers a single abstract discretized solutionyh{\displaystyle y_{h}} generated using the method with a scale parameterh{\displaystyle h} and then the method is said to have asymptoticorder of convergenceq{\displaystyle q} and asymptoticrate of convergenceμ{\displaystyle \mu } if

limh0|yhS|hq=μ,{\displaystyle \lim _{h\rightarrow 0}{\frac {\left|y_{h}-S\right|}{h^{q}}}=\mu ,}

again for some positive constantsμ{\displaystyle \mu } andq{\displaystyle q} and an appropriate metric|x|.{\displaystyle |x|.} This implies that the error of a discretization asymptotically scales like the discretization's scale parameter to theq{\displaystyle q} power, or|yhS|=O(hq){\textstyle \left|y_{h}-S\right|=O(h^{q})} usingasymptotic big O notation. More precisely, it implies the leading order error isμhq,{\displaystyle \mu h^{q},} which can be expressed usingasymptotic small o notation as|yhS|=μhq+o(hq).{\textstyle \left|y_{h}-S\right|=\mu h^{q}+o(h^{q}).}

In some cases multiple rates and orders for the same method but with different choices of scale parameter may be important, for instance forfinite difference methods based on multidimensional grids where the different dimensions have different grid spacings or forfinite element methods based on polygon meshes where choosing either average distance between mesh points or maximum distance between mesh points as scale parameters may imply different orders of convergence. In some especially technical contexts, discretization methods' asymptotic rates and orders of convergence will be characterized by several scale parameters at once with the value of each scale parameter possibly affecting the asymptotic rate and order of convergence of the method with respect to the other scale parameters.

Example

[edit]

Consider the ordinary differential equation

dydx=κy{\displaystyle {\frac {dy}{dx}}=-\kappa y}

with initial conditiony(0)=y0{\displaystyle y(0)=y_{0}}. We can approximate a solution to this one-dimensional equation using a sequence(yn){\displaystyle (y_{n})} applying theforward Euler method for numerical discretization using any regular grid spacingh{\displaystyle h} and grid points indexed byn{\displaystyle n} as follows:

yn+1ynh=κyn,{\displaystyle {\frac {y_{n+1}-y_{n}}{h}}=-\kappa y_{n},}

which implies the first-orderlinear recurrence with constant coefficients

yn+1=yn(1hκ).{\displaystyle y_{n+1}=y_{n}(1-h\kappa ).}

Giveny(0)=y0{\displaystyle y(0)=y_{0}}, the sequence satisfying that recurrence is thegeometric progression

yn=y0(1hκ)n=y0(1nhκ+n(n1)2h2κ2+....).{\displaystyle y_{n}=y_{0}(1-h\kappa )^{n}=y_{0}\left(1-nh\kappa +{\frac {n(n-1)}{2}}h^{2}\kappa ^{2}+....\right).}

The exact analytical solution to the differential equation isy=f(x)=y0exp(κx){\displaystyle y=f(x)=y_{0}\exp(-\kappa x)}, corresponding to the followingTaylor expansion innhκ{\displaystyle nh\kappa }:f(xn)=f(nh)=y0exp(κnh)=y0(1nhκ+n2h2κ22+...).{\displaystyle f(x_{n})=f(nh)=y_{0}\exp(-\kappa nh)=y_{0}\left(1-nh\kappa +{\frac {n^{2}h^{2}\kappa ^{2}}{2}}+...\right).}

Therefore the error of the discrete approximation at each discrete point is

|ynf(xn)|=nh2κ22+{\displaystyle |y_{n}-f(x_{n})|={\frac {nh^{2}\kappa ^{2}}{2}}+\ldots }

For any specificx=p{\displaystyle x=p}, given a sequence of forward Euler approximations((yn)k){\displaystyle ((y_{n})_{k})}, each using grid spacingshk{\displaystyle h_{k}} that dividep{\displaystyle p} so thatnp,k=p/hk{\displaystyle n_{p,k}=p/h_{k}}, one has

limhk0|yk(p)f(p)|hk=limhk0|yk,np,kf(hknp,k)|hk=hknp,kκ22=pκ22{\displaystyle \lim _{h_{k}\rightarrow 0}{\frac {|y_{k}(p)-f(p)|}{h_{k}}}=\lim _{h_{k}\rightarrow 0}{\frac {|y_{k,n_{p,k}}-f(h_{k}n_{p,k})|}{h_{k}}}={\frac {h_{k}n_{p,k}\kappa ^{2}}{2}}={\frac {p\kappa ^{2}}{2}}}

for any sequence of grids with successively smaller grid spacingshk{\displaystyle h_{k}}. Thus((yn)k){\displaystyle ((y_{n})_{k})} converges tof(x){\displaystyle f(x)}pointwise with a convergence orderq=1{\displaystyle q=1} and asymptotic error constantpκ2/2{\displaystyle p\kappa ^{2}/2} at each pointp>0.{\displaystyle p>0.} Similarly, the sequence convergesuniformly with the same order and with rateLκ2/2{\displaystyle L\kappa ^{2}/2} on any bounded interval ofpL{\displaystyle p\leq L}, but it does not converge uniformly on the unbounded set of all positive real values,[0,).{\displaystyle [0,\infty ).}

Comparing asymptotic rates of convergence

[edit]

Definitions

[edit]

Inasymptotic analysis in general, one sequence(ak)kN{\displaystyle (a_{k})_{k\in \mathbb {N} }} that converges to alimitL{\displaystyle L} is said to asymptotically converge toL{\displaystyle L} with a faster order of convergence than another sequence(bk)kN{\displaystyle (b_{k})_{k\in \mathbb {N} }} that converges toL{\displaystyle L} in a sharedmetric space withdistance metric||,{\displaystyle |\cdot |,} such as thereal numbers orcomplex numbers with the ordinaryabsolute difference metrics, if

limk|akL||bkL|=0,{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|a_{k}-L\right|}{|b_{k}-L|}}=0,}

the two are said to asymptotically converge toL{\displaystyle L} with the same order of convergence if

limk|akL||bkL|=μ{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|a_{k}-L\right|}{|b_{k}-L|}}=\mu }

for some positive finite constantμ,{\displaystyle \mu ,} and the two are said to asymptotically converge toL{\displaystyle L} with the same rate and order of convergence if

limk|akL||bkL|=1.{\displaystyle \lim _{k\rightarrow \infty }{\frac {\left|a_{k}-L\right|}{|b_{k}-L|}}=1.}

These comparative definitions of rate and order of asymptotic convergence are fundamental inasymptotic analysis.[10][11] For the first two of these there are associated expressions inasymptotic O notation: the first is thatakL=o(bkL){\displaystyle a_{k}-L=o(b_{k}-L)} in small o notation[12] and the second is thatakL=Θ(bkL){\displaystyle a_{k}-L=\Theta (b_{k}-L)} in Knuth notation.[13] The third is also called asymptotic equivalence, expressedakLbkL.{\displaystyle a_{k}-L\sim b_{k}-L.}[14][15]

Examples

[edit]

For any twogeometric progressions(ark)kN{\displaystyle (ar^{k})_{k\in \mathbb {N} }} and(bsk)kN,{\displaystyle (bs^{k})_{k\in \mathbb {N} },} with shared limit zero, the two sequences are asymptotically equivalent if and only if botha=b{\displaystyle a=b} andr=s.{\displaystyle r=s.} They converge with the same order if and only ifr=s.{\displaystyle r=s.}(ark){\displaystyle (ar^{k})} converges with a faster order than(bsk){\displaystyle (bs^{k})} if and only ifr<s.{\displaystyle r<s.} The convergence of anygeometric series to its limit has error terms that are equal to a geometric progression, so similar relationships hold among geometric series as well. Any sequence that is asymptotically equivalent to a convergent geometric sequence may be either be said to "converge geometrically" or "converge exponentially" with respect to the absolute difference from its limit, or it may be said to "converge linearly" relative to a logarithm of the absolute difference such as the "number of decimals of precision." The latter is standard in numerical analysis.

For any two sequences of elements proportional to an inverse power ofk,{\displaystyle k,}(akn)kN{\displaystyle (ak^{-n})_{k\in \mathbb {N} }} and(bkm)kN,{\displaystyle (bk^{-m})_{k\in \mathbb {N} },} with shared limit zero, the two sequences are asymptotically equivalent if and only if botha=b{\displaystyle a=b} andn=m.{\displaystyle n=m.} They converge with the same order if and only ifn=m.{\displaystyle n=m.}(akn){\displaystyle (ak^{-n})} converges with a faster order than(bkm){\displaystyle (bk^{-m})} if and only ifn>m.{\displaystyle n>m.}

For any sequence(ak)kN{\displaystyle (a_{k})_{k\in \mathbb {N} }} with a limit of zero, its convergence can be compared to the convergence of the shifted sequence(ak1)kN,{\displaystyle (a_{k-1})_{k\in \mathbb {N} },} rescalings of the shifted sequence by a constantμ,{\displaystyle \mu ,}(μak1)kN,{\displaystyle (\mu a_{k-1})_{k\in \mathbb {N} },} and scaledq{\displaystyle q}-powers of the shifted sequence,(μak1q)kN.{\displaystyle (\mu a_{k-1}^{q})_{k\in \mathbb {N} }.} These comparisons are the basis for the Q-convergence classifications for iterative numerical methods as described above: when a sequence of iterate errors from a numerical method(|xkL|)kN{\displaystyle (|x_{k}-L|)_{k\in \mathbb {N} }} is asymptotically equivalent to the shifted, exponentiated, and rescaled sequence of iterate errors(μ|xk1L|q)kN,{\displaystyle (\mu |x_{k-1}-L|^{q})_{k\in \mathbb {N} },} it is said to converge with orderq{\displaystyle q} and rateμ.{\displaystyle \mu .}

Non-asymptotic rates of convergence

[edit]
icon
This sectionneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources in this section. Unsourced material may be challenged and removed.(October 2024) (Learn how and when to remove this message)

Non-asymptotic rates of convergence do not have the common, standard definitions that asymptotic rates of convergence have. Among formal techniques,Lyapunov theory is one of the most powerful and widely applied frameworks for characterizing and analyzing non-asymptotic convergence behavior.

Foriterative methods, one common practical approach is to discuss these rates in terms of the number of iterates or thecomputer time required to reach closeneighborhoods of a limit from starting points far from the limit. The non-asymptotic rate is then an inverse of that number of iterates or computer time. In practical applications, an iterative method that required fewer steps or less computer time than another to reach target accuracy will be said to have converged faster than the other, even if its asymptotic convergence is slower. These rates will generally be different for different starting points and different error thresholds for defining the neighborhoods. It is most common to discuss summaries ofstatistical distributions of these single point rates corresponding to distributions of possible starting points, such as the "average non-asymptotic rate," the "median non-asymptotic rate," or the "worst-case non-asymptotic rate" for some method applied to some problem with some fixed error threshold. These ensembles of starting points can be chosen according to parameters like initial distance from the eventual limit in order to define quantities like "average non-asymptotic rate of convergence from a given distance."

Fordiscretized approximation methods, similar approaches can be used with a discretization scale parameter such as an inverse of a number ofgrid ormesh points or aFourier seriescutoff frequency playing the role of inverse iterate number, though it is not especially common. For any problem, there is a greatest discretization scale parameter compatible with a desired accuracy of approximation, and it may not be as small as required for the asymptotic rate and order of convergence to provide accurate estimates of the error. In practical applications, when one discretization method gives a desired accuracy with a larger discretization scale parameter than another it will often be said to converge faster than the other, even if its eventual asymptotic convergence is slower.

References

[edit]
  1. ^abcdefghNocedal, Jorge; Wright, Stephen J. (1999).Numerical Optimization (1st ed.). New York, NY: Springer. pp. 28–29.ISBN 978-0-387-98793-4.
  2. ^Senning, Jonathan R."Computing and Estimating the Rate of Convergence"(PDF).gordon.edu. Retrieved2020-08-07.
  3. ^Hundley, Douglas."Rate of Convergence"(PDF).Whitman College. Retrieved2020-12-13.
  4. ^Porta, F. A. (1989)."On Q-Order and R-Order of Convergence"(PDF).Journal of Optimization Theory and Applications.63 (3):415–431.doi:10.1007/BF00939805.S2CID 116192710. Retrieved2020-07-31.
  5. ^abVan Tuyl, Andrew H. (1994)."Acceleration of convergence of a family of logarithmically convergent sequences"(PDF).Mathematics of Computation.63 (207):229–246.doi:10.2307/2153571.JSTOR 2153571. Retrieved2020-08-02.
  6. ^Chanson, Jeffrey R. (October 3, 2024)."Order of Convergence".LibreTexts Mathematics. RetrievedOctober 3, 2024.
  7. ^Nocedal, Jorge; Wright, Stephen J. (2006).Numerical Optimization (2nd ed.). Berlin, New York:Springer-Verlag.ISBN 978-0-387-30303-1.
  8. ^Senning, Jonathan R."Computing and Estimating the Rate of Convergence"(PDF).gordon.edu. Retrieved2020-08-07.
  9. ^Senning, Jonathan R."Verifying Numerical Convergence Rates"(PDF). Retrieved2024-02-09.
  10. ^Balcázar, José L.; Gabarró, Joaquim."Nonuniform complexity classes specified by lower and upper bounds"(PDF).RAIRO – Theoretical Informatics and Applications – Informatique Théorique et Applications.23 (2): 180.ISSN 0988-3754.Archived(PDF) from the original on 14 March 2017. Retrieved14 March 2017 – via Numdam.
  11. ^Cucker, Felipe; Bürgisser, Peter (2013)."A.1 Big Oh, Little Oh, and Other Comparisons".Condition: The Geometry of Numerical Algorithms. Berlin, Heidelberg: Springer. pp. 467–468.doi:10.1007/978-3-642-38896-5.ISBN 978-3-642-38896-5.
  12. ^Apostol, Tom M. (1967).Calculus. Vol. 1 (2nd ed.). USA: John Wiley & Sons. p. 286.ISBN 0-471-00005-1.
  13. ^Knuth, Donald (April–June 1976)."Big Omicron and big Omega and big Theta".SIGACT News.8 (2):18–24.doi:10.1145/1008328.1008329.S2CID 5230246.
  14. ^Apostol, Tom M. (1967).Calculus. Vol. 1 (2nd ed.). USA: John Wiley & Sons. p. 396.ISBN 0-471-00005-1.
  15. ^"Asymptotic equality",Encyclopedia of Mathematics,EMS Press, 2001 [1994]
Classification
Operations
Attributes of variables
Relation to processes
Solutions
Existence/uniqueness
Solution topics
Solution methods
Examples
Mathematicians
Retrieved from "https://en.wikipedia.org/w/index.php?title=Rate_of_convergence&oldid=1310464537"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp