Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Integration by substitution

From Wikipedia, the free encyclopedia
Part of a series of articles about
Calculus
abf(t)dt=f(b)f(a){\displaystyle \int _{a}^{b}f'(t)\,dt=f(b)-f(a)}
Technique in integral evaluation

Incalculus,integration by substitution, also known asu-substitution,reverse chain rule orchange of variables,[1] is a method for evaluatingintegrals andantiderivatives. It is the counterpart to thechain rule fordifferentiation, and can loosely be thought of as using the chain rule "backwards." This involvesdifferential forms.

Substitution for a single variable

[edit]

Introduction (indefinite integrals)

[edit]

Before stating the resultrigorously, consider a simple case usingindefinite integrals.

Compute(2x3+1)7(x2)dx.{\textstyle \int (2x^{3}+1)^{7}(x^{2})\,dx.}[2]

Setu=2x3+1.{\displaystyle u=2x^{3}+1.} This meansdudx=6x2,{\textstyle {\frac {du}{dx}}=6x^{2},} or as adifferential form,du=6x2dx.{\textstyle du=6x^{2}\,dx.} Now:(2x3+1)7(x2)dx=16(2x3+1)7u7(6x2)dxdu=16u7du=16(18u8)+C=148(2x3+1)8+C,{\displaystyle {\begin{aligned}\int (2x^{3}+1)^{7}(x^{2})\,dx&={\frac {1}{6}}\int \underbrace {(2x^{3}+1)^{7}} _{u^{7}}\underbrace {(6x^{2})\,dx} _{du}\\&={\frac {1}{6}}\int u^{7}\,du\\&={\frac {1}{6}}\left({\frac {1}{8}}u^{8}\right)+C\\&={\frac {1}{48}}(2x^{3}+1)^{8}+C,\end{aligned}}}whereC{\displaystyle C} is an arbitraryconstant of integration.

This procedure is frequently used, but not all integrals are of a form that permits its use. In any event, the result should be verified by differentiating and comparing to the original integrand.ddx[148(2x3+1)8+C]=16(2x3+1)7(6x2)=(2x3+1)7(x2).{\displaystyle {\frac {d}{dx}}\left[{\frac {1}{48}}(2x^{3}+1)^{8}+C\right]={\frac {1}{6}}(2x^{3}+1)^{7}(6x^{2})=(2x^{3}+1)^{7}(x^{2}).}For definite integrals, the limits of integration must also be adjusted, but the procedure is mostly the same.

Statement for definite integrals

[edit]

Letg:[a,b]I{\displaystyle g:[a,b]\to I} be adifferentiable function with acontinuous derivative, whereIR{\displaystyle I\subset \mathbb {R} } is aninterval. Suppose thatf:IR{\displaystyle f:I\to \mathbb {R} } is acontinuous function. Then:[3]abf(g(x))g(x)dx=g(a)g(b)f(u) du.{\displaystyle \int _{a}^{b}f(g(x))\cdot g'(x)\,dx=\int _{g(a)}^{g(b)}f(u)\ du.}

In Leibniz notation, the substitutionu=g(x){\displaystyle u=g(x)} yields:dudx=g(x).{\displaystyle {\frac {du}{dx}}=g'(x).}Working heuristically withinfinitesimals yields the equationdu=g(x)dx,{\displaystyle du=g'(x)\,dx,}which suggests the substitution formula above. (This equation may be put on a rigorous foundation by interpreting it as a statement aboutdifferential forms.) One may view the method of integration by substitution as a partial justification ofLeibniz's notation for integrals and derivatives.

The formula is used to transform one integral into another integral that is easier to compute. Thus, the formula can be read from left to right or from right to left in order to simplify a given integral. When used in the former manner, it is sometimes known asu-substitution orw-substitution in which a new variable is defined to be a function of the original variable found inside thecomposite function multiplied by the derivative of the inner function. The latter manner is commonly used intrigonometric substitution, replacing the original variable with atrigonometric function of a new variable and the originaldifferential with the differential of the trigonometric function.

Proof

[edit]

Integration by substitution can be derived from thefundamental theorem of calculus as follows. Letf{\displaystyle f} andg{\displaystyle g} be two functions satisfying the above hypothesis thatf{\displaystyle f} is continuous onI{\displaystyle I} andg{\displaystyle g'} is integrable on the closed interval[a,b]{\displaystyle [a,b]}. Then the functionf(g(x))g(x){\displaystyle f(g(x))\cdot g'(x)} is also integrable on[a,b]{\displaystyle [a,b]}. Hence the integralsabf(g(x))g(x) dx{\displaystyle \int _{a}^{b}f(g(x))\cdot g'(x)\ dx}andg(a)g(b)f(u) du{\displaystyle \int _{g(a)}^{g(b)}f(u)\ du}in fact exist, and it remains to show that they are equal.

Sincef{\displaystyle f} is continuous, it has anantiderivativeF{\displaystyle F}. Thecomposite functionFg{\displaystyle F\circ g} is then defined. Sinceg{\displaystyle g} is differentiable, combining thechain rule and the definition of an antiderivative gives:(Fg)(x)=F(g(x))g(x)=f(g(x))g(x).{\displaystyle (F\circ g)'(x)=F'(g(x))\cdot g'(x)=f(g(x))\cdot g'(x).}

Applying thefundamental theorem of calculus twice gives:abf(g(x))g(x) dx=ab(Fg)(x) dx=(Fg)(b)(Fg)(a)=F(g(b))F(g(a))=g(a)g(b)F(u)du=g(a)g(b)f(u)du,{\displaystyle {\begin{aligned}\int _{a}^{b}f(g(x))\cdot g'(x)\ dx&=\int _{a}^{b}(F\circ g)'(x)\ dx\\&=(F\circ g)(b)-(F\circ g)(a)\\&=F(g(b))-F(g(a))\\&=\int _{g(a)}^{g(b)}F'(u)\,du=\int _{g(a)}^{g(b)}f(u)\,du,\end{aligned}}}which is the substitution rule.

Examples: Antiderivatives (indefinite integrals)

[edit]

Substitution can be used to determineantiderivatives. One chooses a relation betweenx{\displaystyle x} andu,{\displaystyle u,} determines the corresponding relation betweendx{\displaystyle dx} anddu{\displaystyle du} by differentiating, and performs the substitutions. An antiderivative for the substituted function can hopefully be determined; the original substitution betweenx{\displaystyle x} andu{\displaystyle u} is then undone.

Example 1

[edit]

Consider the integral:xcos(x2+1) dx.{\displaystyle \int x\cos(x^{2}+1)\ dx.}Make the substitutionu=x2+1{\textstyle u=x^{2}+1} to obtaindu=2x dx,{\displaystyle du=2x\ dx,} meaningx dx=12 du.{\textstyle x\ dx={\frac {1}{2}}\ du.} Therefore:xcos(x2+1)dx=122xcos(x2+1)dx=12cosudu=12sinu+C=12sin(x2+1)+C,{\displaystyle {\begin{aligned}\int x\cos(x^{2}+1)\,dx&={\frac {1}{2}}\int 2x\cos(x^{2}+1)\,dx\\[6pt]&={\frac {1}{2}}\int \cos u\,du\\[6pt]&={\frac {1}{2}}\sin u+C\\[6pt]&={\frac {1}{2}}\sin(x^{2}+1)+C,\end{aligned}}}whereC{\displaystyle C} is an arbitraryconstant of integration.

Example 2: Antiderivatives of tangent and cotangent

[edit]

Thetangent function can be integrated using substitution by expressing it in terms of the sine and cosine:tanx=sinxcosx{\displaystyle \tan x={\tfrac {\sin x}{\cos x}}}.

Using the substitutionu=cosx{\displaystyle u=\cos x} givesdu=sinxdx{\displaystyle du=-\sin x\,dx} andtanxdx=sinxcosxdx=duu=ln|u|+C=ln|cosx|+C=ln|secx|+C.{\displaystyle {\begin{aligned}\int \tan x\,dx&=\int {\frac {\sin x}{\cos x}}\,dx\\&=\int -{\frac {du}{u}}\\&=-\ln \left|u\right|+C\\&=-\ln \left|\cos x\right|+C\\&=\ln \left|\sec x\right|+C.\end{aligned}}}

Thecotangent function can be integrated similarly by expressing it ascotx=cosxsinx{\displaystyle \cot x={\tfrac {\cos x}{\sin x}}} and using the substitutionu=sinx,du=cosxdx{\displaystyle u=\sin {x},du=\cos {x}\,dx}:cotxdx=cosxsinxdx=duu=ln|u|+C=ln|sinx|+C.{\displaystyle {\begin{aligned}\int \cot x\,dx&=\int {\frac {\cos x}{\sin x}}\,dx\\&=\int {\frac {du}{u}}\\&=\ln \left|u\right|+C\\&=\ln \left|\sin x\right|+C.\end{aligned}}}

Examples: Definite integrals

[edit]

When evaluating definite integrals by substitution, one may calculate the antiderivative fully first, then apply the boundary conditions. In that case, there is no need to transform the boundary terms. Alternatively, one may fully evaluate the indefinite integral (see above) first then apply the boundary conditions. This becomes especially handy when multiple substitutions are used.

Example 1

[edit]

Consider the integral:02xx2+1dx.{\displaystyle \int _{0}^{2}{\frac {x}{\sqrt {x^{2}+1}}}dx.}Make the substitutionu=x2+1{\textstyle u=x^{2}+1} to obtaindu=2x dx,{\displaystyle du=2x\ dx,} meaningx dx=12 du.{\textstyle x\ dx={\frac {1}{2}}\ du.} Therefore:x=0x=2xx2+1 dx=12u=1u=5duu=12(2521)=51.{\displaystyle {\begin{aligned}\int _{x=0}^{x=2}{\frac {x}{\sqrt {x^{2}+1}}}\ dx&={\frac {1}{2}}\int _{u=1}^{u=5}{\frac {du}{\sqrt {u}}}\\[6pt]&={\frac {1}{2}}\left(2{\sqrt {5}}-2{\sqrt {1}}\right)\\[6pt]&={\sqrt {5}}-1.\end{aligned}}}Since the lower limitx=0{\displaystyle x=0} was replaced withu=1,{\displaystyle u=1,} and the upper limitx=2{\displaystyle x=2} with22+1=5,{\displaystyle 2^{2}+1=5,} a transformation back into terms ofx{\displaystyle x} was unnecessary.

Example 2:Trigonometric substitution

[edit]

For the integral011x2dx,{\displaystyle \int _{0}^{1}{\sqrt {1-x^{2}}}\,dx,}a variation of the above procedure is needed. The substitutionx=sinu{\displaystyle x=\sin u} implyingdx=cosudu{\displaystyle dx=\cos u\,du} is useful because1sin2u=cosu.{\textstyle {\sqrt {1-\sin ^{2}u}}=\cos u.} We thus have:011x2 dx=0π/21sin2ucosu du=0π/2cos2u du=[u2+sin(2u)4]0π/2=π4+0=π4.{\displaystyle {\begin{aligned}\int _{0}^{1}{\sqrt {1-x^{2}}}\ dx&=\int _{0}^{\pi /2}{\sqrt {1-\sin ^{2}u}}\cos u\ du\\[6pt]&=\int _{0}^{\pi /2}\cos ^{2}u\ du\\[6pt]&=\left[{\frac {u}{2}}+{\frac {\sin(2u)}{4}}\right]_{0}^{\pi /2}\\[6pt]&={\frac {\pi }{4}}+0\\[6pt]&={\frac {\pi }{4}}.\end{aligned}}}

The resulting integral can be computed usingintegration by parts or adouble angle formula,2cos2u=1+cos(2u),{\textstyle 2\cos ^{2}u=1+\cos(2u),} followed by one more substitution. One can also note that the function being integrated is the upper right quarter of a circle with a radius of one, and hence integrating the upper right quarter from zero to one is the geometric equivalent to the area of one quarter of the unit circle, orπ4.{\displaystyle {\tfrac {\pi }{4}}.}

Substitution for multiple variables

[edit]

One may also use substitution when integratingfunctions of several variables.

Here, the substitution function(v1,...,vn) =φ(u1, ...,un) needs to beinjective and continuously differentiable, and the differentials transform as:dv1dvn=|det(Dφ)(u1,,un)|du1dun,{\displaystyle dv_{1}\cdots dv_{n}=\left|\det(D\varphi )(u_{1},\ldots ,u_{n})\right|\,du_{1}\cdots du_{n},}wheredet()(u1, ...,un) denotes thedeterminant of theJacobian matrix ofpartial derivatives ofφ at the point(u1, ...,un). This formula expresses the fact that theabsolute value of the determinant of a matrix equals the volume of theparallelotope spanned by its columns or rows.

More precisely, thechange of variables formula is stated in the next theorem:

TheoremLetU be an open set inRn andφ :URn aninjective differentiable function with continuous partial derivatives, the Jacobian of which is nonzero for everyx inU. Then for any real-valued, compactly supported, continuous functionf, with support contained inφ(U):φ(U)f(v)dv=Uf(φ(u))|det(Dφ)(u)|du.{\displaystyle \int _{\varphi (U)}f(\mathbf {v} )\,d\mathbf {v} =\int _{U}f(\varphi (\mathbf {u} ))\,\,\left|\!\det(D\varphi )(\mathbf {u} )\right|\,d\mathbf {u} .}

The conditions on the theorem can be weakened in various ways. First, the requirement thatφ be continuously differentiable can be replaced by the weaker assumption thatφ be merely differentiable and have a continuous inverse.[4] This is guaranteed to hold ifφ is continuously differentiable by theinverse function theorem. Alternatively, the requirement thatdet() ≠ 0 can be eliminated by applyingSard's theorem.[5]

For Lebesgue measurable functions, the theorem can be stated in the following form:[6]

TheoremLetU be a measurable subset ofRn andφ :URn aninjective function, and suppose for everyx inU there existsφ′(x) inRn,n such thatφ(y) =φ(x) +φ′(x)(yx) +o(‖yx‖) asyx (hereo islittle-o notation). Thenφ(U) is measurable, and for any real-valued functionf defined onφ(U):φ(U)f(v)dv=Uf(φ(u))|detφ(u)|du{\displaystyle \int _{\varphi (U)}f(v)\,dv=\int _{U}f(\varphi (u))\,\,\left|\!\det \varphi '(u)\right|\,du}in the sense that if either integral exists (including the possibility of being properly infinite), then so does the other one, and they have the same value.

Another very general version inmeasure theory is the following:[7]

TheoremLetX be alocally compactHausdorff space equipped with a finiteRadon measureμ, and letY be aσ-compact Hausdorff space with aσ-finite Radon measureρ. Letφ :XY be anabsolutely continuous function (where the latter means thatρ(φ(E)) = 0 wheneverμ(E) = 0). Then there exists a real-valuedBorel measurable functionw onX such that for everyLebesgue integrable functionf :YR, the function(fφ) ⋅w is Lebesgue integrable onX, andYf(y)dρ(y)=X(fφ)(x)w(x)dμ(x).{\displaystyle \int _{Y}f(y)\,d\rho (y)=\int _{X}(f\circ \varphi )(x)\,w(x)\,d\mu (x).}Furthermore, it is possible to writew(x)=(gφ)(x){\displaystyle w(x)=(g\circ \varphi )(x)}for some Borel measurable functiong onY.

Ingeometric measure theory, integration by substitution is used withLipschitz functions. A bi-Lipschitz function is a Lipschitz functionφ :URn which is injective and whose inverse functionφ−1 :φ(U) →U is also Lipschitz. ByRademacher's theorem, a bi-Lipschitz mapping is differentiablealmost everywhere. In particular, the Jacobian determinant of a bi-Lipschitz mappingdet is well-defined almost everywhere. The following result then holds:

TheoremLetU be an open subset ofRn andφ :URn be a bi-Lipschitz mapping. Letf :φ(U) →R be measurable. Thenφ(U)f(x)dx=U(fφ)(x)|detDφ(x)|dx{\displaystyle \int _{\varphi (U)}f(x)\,dx=\int _{U}(f\circ \varphi )(x)\,\,\left|\!\det D\varphi (x)\right|\,dx}in the sense that if either integral exists (or is properly infinite), then so does the other one, and they have the same value.

The above theorem was first proposed byEuler when he developed the notion ofdouble integrals in 1769. Although generalized to triple integrals byLagrange in 1773, and used byLegendre,Laplace, andGauss, and first generalized ton variables byMikhail Ostrogradsky in 1836, it resisted a fully rigorous formal proof for a surprisingly long time, and was first satisfactorily resolved 125 years later, byÉlie Cartan in a series of papers beginning in the mid-1890s.[8][9]

Application in probability

[edit]

Substitution can be used to answer the following important question in probability: given a random variableX with probability densitypX and another random variableY such thatY=ϕ(X) forinjective (one-to-one)ϕ, what is the probability density forY?

It is easiest to answer this question by first answering a slightly different question: what is the probability thatY takes a value in some particular subsetS? Denote this probabilityP(YS). Of course, ifY has probability densitypY, then the answer is:P(YS)=SpY(y)dy,{\displaystyle P(Y\in S)=\int _{S}p_{Y}(y)\,dy,}but this is not really useful because we do not knowpY; it is what we are trying to find. We can make progress by considering the problem in the variableX.Y takes a value inS wheneverX takes a value inϕ1(S),{\textstyle \phi ^{-1}(S),} so:P(YS)=P(Xϕ1(S))=ϕ1(S)pX(x)dx.{\displaystyle P(Y\in S)=P(X\in \phi ^{-1}(S))=\int _{\phi ^{-1}(S)}p_{X}(x)\,dx.}

Changing from variablex toy gives:P(YS)=ϕ1(S)pX(x)dx=SpX(ϕ1(y))|dϕ1dy|dy.{\displaystyle P(Y\in S)=\int _{\phi ^{-1}(S)}p_{X}(x)\,dx=\int _{S}p_{X}(\phi ^{-1}(y))\left|{\frac {d\phi ^{-1}}{dy}}\right|\,dy.}Combining this with our first equation gives:SpY(y)dy=SpX(ϕ1(y))|dϕ1dy|dy,{\displaystyle \int _{S}p_{Y}(y)\,dy=\int _{S}p_{X}(\phi ^{-1}(y))\left|{\frac {d\phi ^{-1}}{dy}}\right|\,dy,}so:pY(y)=pX(ϕ1(y))|dϕ1dy|.{\displaystyle p_{Y}(y)=p_{X}(\phi ^{-1}(y))\left|{\frac {d\phi ^{-1}}{dy}}\right|.}

In the case whereX andY depend on several uncorrelated variables (i.e.,pX=pX(x1,,xn){\textstyle p_{X}=p_{X}(x_{1},\ldots ,x_{n})} andy=ϕ(x){\displaystyle y=\phi (x)}),pY{\displaystyle p_{Y}}can be found by substitution in several variables discussed above. The result is:pY(y)=pX(ϕ1(y))|detDϕ1(y)|.{\displaystyle p_{Y}(y)=p_{X}(\phi ^{-1}(y))\left|\det D\phi ^{-1}(y)\right|.}

See also

[edit]

Notes

[edit]
  1. ^Swokowski 1983, p. 257
  2. ^Swokowski 1983, p. 258
  3. ^Briggs & Cochran 2011, p. 361
  4. ^Rudin 1987, Theorem 7.26
  5. ^Spivak 1965, p. 72
  6. ^Fremlin 2010, Theorem 263D
  7. ^Hewitt & Stromberg 1965, Theorem 20.3
  8. ^Katz 1982
  9. ^Ferzola 1994

References

[edit]

External links

[edit]
The WikibookCalculus has a page on the topic of:The Substitution Rule
Wikiversity has learning resources aboutIntegration by Substitution
Precalculus
Limits
Differential calculus
Integral calculus
Vector calculus
Multivariable calculus
Sequences and series
Special functions
and numbers
History of calculus
Lists
Integrals
Miscellaneous topics
Types of
integrals
Integration
techniques
Improper integrals
Stochastic integrals
Miscellaneous
Retrieved from "https://en.wikipedia.org/w/index.php?title=Integration_by_substitution&oldid=1323465235"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp