Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Method of characteristics

From Wikipedia, the free encyclopedia
Technique for solving hyperbolic partial differential equations
Differential equations
Scope
Classification
Solution
People
Specific equations

Inmathematics, themethod of characteristics is a technique for solving particularpartial differential equations. Typically, it applies tofirst-order equations, though in generalcharacteristic curves can also be found forhyperbolic andparabolic partial differential equation. The method is to reduce a partial differential equation (PDE) to a family ofordinary differential equations (ODEs) along which the solution can be integrated from some initial data given on a suitablehypersurface.

Characteristics of first-order partial differential equation

[edit]

For a first-order PDE, the method of characteristics discovers so calledcharacteristic curves along which the PDE becomes an ODE.[1][2] Once the ODE is found, it can be solved along the characteristic curves and transformed into a solution for the original PDE.

Two-dimensional quasilinear PDE

[edit]

For the sake of simplicity, we initially direct our attention to the case of a function of two independent variablesx andy. Consider aquasilinear PDE of the form[3]

a(x,y,u)ux+b(x,y,u)uy=c(x,y,u).{\displaystyle a(x,y,u){\frac {\partial u}{\partial x}}+b(x,y,u){\frac {\partial u}{\partial y}}=c(x,y,u).}1

For a differentiable function(x,y)u(x,y){\displaystyle (x,y)\mapsto u(x,y)}, consider thegraph ofu, which is the setgph(u)={(x,y,z)R3z=u(x,y)}{\displaystyle \operatorname {gph} (u)=\{(x,y,z)\in \mathbb {R} ^{3}\mid z=u(x,y)\}} Anormal vector togph(u){\displaystyle \operatorname {gph} (u)} is given by[4]

n(x,y)=(ux(x,y),uy(x,y),1).{\displaystyle n(x,y)=\left({\frac {\partial u}{\partial x}}(x,y),{\frac {\partial u}{\partial y}}(x,y),-1\right).}

Consider the vector field

(x,y,z)[a(x,y,z)b(x,y,z)c(x,y,z)].{\displaystyle (x,y,z)\mapsto {\begin{bmatrix}a(x,y,z)\\b(x,y,z)\\c(x,y,z)\end{bmatrix}}.}2

Thedot product of the vector field (2) with the normal vector togph(u){\displaystyle \operatorname {gph} (u)} at each(x,y,u(x,y))gph(u){\displaystyle (x,y,u(x,y))\in \operatorname {gph} (u)} is[ux(x,y)uy(x,y)1][a(x,y,u(x,y))b(x,y,u(x,y))c(x,y,u(x,y))]=a(x,y,u(x,y))ux(x,y)+b(x,y,u(x,y))uy(x,y)c(x,y,u(x,y)).{\displaystyle {\begin{bmatrix}{\dfrac {\partial u}{\partial x}}(x,y)\\{\dfrac {\partial u}{\partial y}}(x,y)\\-1\end{bmatrix}}\cdot {\begin{bmatrix}a{\big (}x,y,u(x,y){\big )}\\b{\big (}x,y,u(x,y){\big )}\\c{\big (}x,y,u(x,y){\big )}\end{bmatrix}}=a{\big (}x,y,u(x,y){\big )}{\frac {\partial u}{\partial x}}(x,y)+b{\big (}x,y,u(x,y){\big )}{\frac {\partial u}{\partial y}}(x,y)-c{\big (}x,y,u(x,y){\big )}.}

Comparing the right-hand side of the above equation with (1), it is evident the following statements are equivalent:

In other words, the graph of the solution to (1) is the union ofintegral curves of the vector field (2). Each integral curve is called acharacteristic curve of the PDE (1) equation and follow as the solutions of thecharacteristic equations:[3]

{dxdt=a(x,y,z),dydt=b(x,y,z),dzdt=c(x,y,z).{\displaystyle \left\{{\begin{aligned}{\dfrac {dx}{dt}}&=a(x,y,z),\\[4pt]{\dfrac {dy}{dt}}&=b(x,y,z),\\[4pt]{\dfrac {dz}{dt}}&=c(x,y,z).\end{aligned}}\right.}

A parametrization invariant form of theLagrange–Charpit equations is:[5]

dxa(x,y,z)=dyb(x,y,z)=dzc(x,y,z).{\displaystyle {\frac {dx}{a(x,y,z)}}={\frac {dy}{b(x,y,z)}}={\frac {dz}{c(x,y,z)}}.}

N-dimensional linear and quasilinear PDE

[edit]
Example. The solution to the equationyuxxuyeu=0{\displaystyle yu_{x}-xu_{y}-e^{u}=0} with boundary condition{(x,y,u)=(s,sins,0):sR}{\displaystyle \{(x,y,u)=(s,\sin s,0):s\in \mathbb {R} \}} is obtained by drawing all characteristic curves through the boundary condition set.

Consider now a PDE of the form

i=1nai(x1,,xn,u)uxi=c(x1,,xn,u).{\displaystyle \sum _{i=1}^{n}a_{i}(x_{1},\dots ,x_{n},u){\frac {\partial u}{\partial x_{i}}}=c(x_{1},\dots ,x_{n},u).}

For this PDE to belinear, the coefficientsai may be functions of the spatial variables only, and independent ofu. For it to be quasilinear,[6]ai may also depend on the value of the function, but not on any derivatives. The distinction between these two cases is inessential for the discussion here.

For a linear or quasilinear PDE, the characteristic curves are given parametrically by

(x1,,xn,u)=(X1(s),,Xn(s),U(s)){\displaystyle (x_{1},\dots ,x_{n},u)=(X_{1}(s),\dots ,X_{n}(s),U(s))}
u(X(s))=U(s){\displaystyle u(\mathbf {X} (s))=U(s)}

for some univariate functionss(Xi(s))i,U(s){\displaystyle s\mapsto (X_{i}(s))_{i},U(s)}of one real variables{\displaystyle s}satisfying the following system of ordinary differential equations

Xi=ai(X1,,Xn,U) for i=1,,n{\displaystyle X_{i}'=a_{i}(X_{1},\dots ,X_{n},U){\text{ for }}i=1,\dotsc ,n}4
U=c(X1,,Xn,U).{\displaystyle U'=c(X_{1},\dots ,X_{n},U).}5

Equations (4) and (5) give the characteristics of the PDE.

Proof for quasilinear case

In the quasilinear case, the use of the method of characteristics is justified byGrönwall's inequality. The above equation may be written asa(x,u)u(x)=c(x,u){\displaystyle \mathbf {a} (\mathbf {x} ,u)\cdot \nabla u(\mathbf {x} )=c(\mathbf {x} ,u)}

We must distinguish between the solutions to the ODE and the solutions to the PDE, which we do not know are equala priori. Letting capital letters be the solutions to the ODE we findX(s)=a(X(s),U(s)){\displaystyle \mathbf {X} '(s)=\mathbf {a} (\mathbf {X} (s),U(s))}U(s)=c(X(s),U(s)){\displaystyle U'(s)=c(\mathbf {X} (s),U(s))}

ExaminingΔ(s)=|u(X(s))U(s)|2{\displaystyle \Delta (s)=|u(\mathbf {X} (s))-U(s)|^{2}}, we find, upon differentiating thatΔ(s)=2(u(X(s))U(s))(X(s)u(X(s))U(s)){\displaystyle \Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {X} '(s)\cdot \nabla u(\mathbf {X} (s))-U'(s){\Big )}}which is the same asΔ(s)=2(u(X(s))U(s))(a(X(s),U(s))u(X(s))c(X(s),U(s))){\displaystyle \Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {a} (\mathbf {X} (s),U(s))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),U(s)){\Big )}}

We cannot conclude the above is 0 as we would like, since the PDE only guarantees us that this relationship is satisfied foru(x){\displaystyle u(\mathbf {x} )},a(x,u)u(x)=c(x,u){\displaystyle \mathbf {a} (\mathbf {x} ,u)\cdot \nabla u(\mathbf {x} )=c(\mathbf {x} ,u)},and we do not yet know thatU(s)=u(X(s)){\displaystyle U(s)=u(\mathbf {X} (s))}.

However, we can see thatΔ(s)=2(u(X(s))U(s))(a(X(s),U(s))u(X(s))c(X(s),U(s))(a(X(s),u(X(s)))u(X(s))c(X(s),u(X(s))))){\displaystyle \Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}\mathbf {a} (\mathbf {X} (s),U(s))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),U(s))-{\big (}\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s)))\cdot \nabla u(\mathbf {X} (s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big )}{\Big )}}since by the PDE, the last term is 0. This equalsΔ(s)=2(u(X(s))U(s))((a(X(s),U(s))a(X(s),u(X(s))))u(X(s))(c(X(s),U(s))c(X(s),u(X(s))))){\displaystyle \Delta '(s)=2{\big (}u(\mathbf {X} (s))-U(s){\big )}{\Big (}{\big (}\mathbf {a} (\mathbf {X} (s),U(s))-\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s))){\big )}\cdot \nabla u(\mathbf {X} (s))-{\big (}c(\mathbf {X} (s),U(s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big )}{\Big )}}

By the triangle inequality, we have|Δ(s)|2|u(X(s))U(s)|(a(X(s),U(s))a(X(s),u(X(s))) u(X(s))+|c(X(s),U(s))c(X(s),u(X(s)))|){\displaystyle |\Delta '(s)|\leq 2{\big |}u(\mathbf {X} (s))-U(s){\big |}{\Big (}{\big \|}\mathbf {a} (\mathbf {X} (s),U(s))-\mathbf {a} (\mathbf {X} (s),u(\mathbf {X} (s))){\big \|}\ \|\nabla u(\mathbf {X} (s))\|+{\big |}c(\mathbf {X} (s),U(s))-c(\mathbf {X} (s),u(\mathbf {X} (s))){\big |}{\Big )}}

Assuminga,c{\displaystyle \mathbf {a} ,c} are at leastC1{\displaystyle C^{1}}, we can bound this for small times. Choose a neighborhoodΩ{\displaystyle \Omega } aroundX(0),U(0){\displaystyle \mathbf {X} (0),U(0)} small enough such thata,c{\displaystyle \mathbf {a} ,c} arelocally Lipschitz. By continuity,(X(s),U(s)){\displaystyle (\mathbf {X} (s),U(s))} will remain inΩ{\displaystyle \Omega } for small enoughs{\displaystyle s}. SinceU(0)=u(X(0)){\displaystyle U(0)=u(\mathbf {X} (0))}, we also have that(X(s),u(X(s))){\displaystyle (\mathbf {X} (s),u(\mathbf {X} (s)))} will be inΩ{\displaystyle \Omega } for small enoughs{\displaystyle s} by continuity. So,(X(s),U(s))Ω{\displaystyle (\mathbf {X} (s),U(s))\in \Omega } and(X(s),u(X(s)))Ω{\displaystyle (\mathbf {X} (s),u(\mathbf {X} (s)))\in \Omega } fors[0,s0]{\displaystyle s\in [0,s_{0}]}. Additionally,u(X(s))M{\displaystyle \|\nabla u(\mathbf {X} (s))\|\leq M} for someMR{\displaystyle M\in \mathbb {R} } fors[0,s0]{\displaystyle s\in [0,s_{0}]} by compactness. From this, we find the above is bounded as|Δ(s)|C|u(X(s))U(s)|2=C|Δ(s)|{\displaystyle |\Delta '(s)|\leq C|u(\mathbf {X} (s))-U(s)|^{2}=C|\Delta (s)|}for someCR{\displaystyle C\in \mathbb {R} }. It is a straightforward application of Grönwall's Inequality to show that sinceΔ(0)=0{\displaystyle \Delta (0)=0} we haveΔ(s)=0{\displaystyle \Delta (s)=0} for as long as this inequality holds. We have some interval[0,ε){\displaystyle [0,\varepsilon )} such thatu(X(s))=U(s){\displaystyle u(X(s))=U(s)} in this interval. Choose the largestε{\displaystyle \varepsilon } such that this is true. Then, by continuity,U(ε)=u(X(ε)){\displaystyle U(\varepsilon )=u(\mathbf {X} (\varepsilon ))}. Provided the ODE still has a solution in some interval afterε{\displaystyle \varepsilon }, we can repeat the argument above to find thatu(X(s))=U(s){\displaystyle u(X(s))=U(s)} in a larger interval. Thus, so long as the ODE has a solution, we haveu(X(s))=U(s){\displaystyle u(X(s))=U(s)}.

Fully nonlinear PDE

[edit]

Consider the partial differential equation

F(x1,,xn,u,p1,,pn)=0{\displaystyle F(x_{1},\dots ,x_{n},u,p_{1},\dots ,p_{n})=0}6

where the variablespi are shorthand for the partial derivatives

pi=uxi.{\displaystyle p_{i}={\frac {\partial u}{\partial x_{i}}}.}

Lets(x1(s),,xn(s),u(s),p1(s),,pn(s)){\displaystyle s\mapsto (x_{1}(s),\dots ,x_{n}(s),u(s),p_{1}(s),\dots ,p_{n}(s))} be a curve inR2n+1. Suppose thatu is any solution, and that

u(s)=u(x1(s),,xn(s)).{\displaystyle u(s)=u(x_{1}(s),\dots ,x_{n}(s)).}

The derivatives with respect tos{\displaystyle s} ofxi,{\displaystyle x_{i},}u,{\displaystyle u,} andpi{\displaystyle p_{i}} are written asx˙i,{\displaystyle {\dot {x}}_{i},},u˙,{\displaystyle {\dot {u}},} andp˙i,{\displaystyle {\dot {p}}_{i},} respectively.Along a solution, differentiating (6) with respect tos gives[7]

i(Fxi+Fupi)x˙i+iFpip˙i=0{\displaystyle \sum _{i}(F_{x_{i}}+F_{u}p_{i}){\dot {x}}_{i}+\sum _{i}F_{p_{i}}{\dot {p}}_{i}=0}
u˙ipix˙i=0{\displaystyle {\dot {u}}-\sum _{i}p_{i}{\dot {x}}_{i}=0}
i(x˙idpip˙idxi)=0.{\displaystyle \sum _{i}({\dot {x}}_{i}dp_{i}-{\dot {p}}_{i}dx_{i})=0.}

The second equation follows from applying thechain rule to a solutionu, and the third follows by taking anexterior derivative of the relationduipidxi=0{\displaystyle du-\sum _{i}p_{i}\,dx_{i}=0}. Manipulating these equations gives

{x˙i=λFpi,p˙i=λ(Fxi+Fupi),u˙=λipiFpi{\displaystyle \left\{{\begin{aligned}{\dot {x}}_{i}&=\lambda F_{p_{i}},\\[5pt]{\dot {p}}_{i}&=-\lambda (F_{x_{i}}+F_{u}p_{i}),\\[5pt]{\dot {u}}&=\lambda \sum _{i}p_{i}F_{p_{i}}\end{aligned}}\right.}

where λ is a constant. Writing these equations more symmetrically, one obtains the Lagrange–Charpit equations for the characteristic

x˙iFpi=p˙iFxi+Fupi=u˙piFpi.{\displaystyle {\frac {{\dot {x}}_{i}}{F_{p_{i}}}}=-{\frac {{\dot {p}}_{i}}{F_{x_{i}}+F_{u}p_{i}}}={\frac {\dot {u}}{\sum p_{i}F_{p_{i}}}}.}

Geometrically, the method of characteristics in the fully nonlinear case can be interpreted as requiring that theMonge cone of the differential equation should everywhere be tangent to the graph of the solution.

Example

[edit]

As an example, consider theadvection equation (this example assumes familiarity with PDE notation, and solutions to basic ODEs).

aux+ut=0{\displaystyle a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}=0}

wherea{\displaystyle a} is constant andu{\displaystyle u} is a function ofx{\displaystyle x} andt{\displaystyle t}. We want to transform this linear first-order PDE into an ODE along the appropriate curve; i.e. something of the form

ddsu(x(s),t(s))=F(u,x(s),t(s)),{\displaystyle {\frac {d}{ds}}u(x(s),t(s))=F(u,x(s),t(s)),}

where(x(s),t(s)){\displaystyle (x(s),t(s))} is a characteristic line. First, we find

ddsu(x(s),t(s))=uxdxds+utdtds{\displaystyle {\frac {d}{ds}}u(x(s),t(s))={\frac {\partial u}{\partial x}}{\frac {dx}{ds}}+{\frac {\partial u}{\partial t}}{\frac {dt}{ds}}}

by the chain rule. Now, if we setdxds=a{\displaystyle {\frac {dx}{ds}}=a} anddtds=1{\displaystyle {\frac {dt}{ds}}=1} we get

aux+ut{\displaystyle a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}}

which is the left hand side of the PDE we started with. Thus

ddsu=aux+ut=0.{\displaystyle {\frac {d}{ds}}u=a{\frac {\partial u}{\partial x}}+{\frac {\partial u}{\partial t}}=0.}

So, along the characteristic line(x(s),t(s)){\displaystyle (x(s),t(s))}, the original PDE becomes the ODEus=F(u,x(s),t(s))=0{\displaystyle u_{s}=F(u,x(s),t(s))=0}. That is to say that along the characteristics, the solution is constant. Thus,u(xs,ts)=u(x0,0){\displaystyle u(x_{s},t_{s})=u(x_{0},0)} where(xs,ts){\displaystyle (x_{s},t_{s})\,} and(x0,0){\displaystyle (x_{0},0)} lie on the same characteristic. Therefore, to determine the general solution, it is enough to find the characteristics by solving the characteristic system of ODEs:

In this case, the characteristic lines are straight lines with slopea{\displaystyle a}, and the value ofu{\displaystyle u} remains constant along any characteristic line.

Characteristics of linear differential operators

[edit]

LetX be adifferentiable manifold andP a lineardifferential operator

P:C(X)C(X){\displaystyle P:C^{\infty }(X)\to C^{\infty }(X)}

of orderk. In a local coordinate systemxi,

P=|α|kPα(x)xα{\displaystyle P=\sum _{|\alpha |\leq k}P^{\alpha }(x){\frac {\partial }{\partial x^{\alpha }}}}

in whichα denotes amulti-index. The principalsymbol ofP, denotedσP, is the function on thecotangent bundle TX defined in these local coordinates by

σP(x,ξ)=|α|=kPα(x)ξα{\displaystyle \sigma _{P}(x,\xi )=\sum _{|\alpha |=k}P^{\alpha }(x)\xi _{\alpha }}

where theξi are the fiber coordinates on the cotangent bundle induced by the coordinate differentialsdxi. Although this is defined using a particular coordinate system, the transformation law relating theξi and thexi ensures thatσP is a well-defined function on the cotangent bundle.

The functionσP ishomogeneous of degreek in theξ variable. The zeros ofσP, away from the zero section of TX, are the characteristics ofP. A hypersurface ofX defined by the equationF(x) = c is called a characteristic hypersurface atx if

σP(x,dF(x))=0.{\displaystyle \sigma _{P}(x,dF(x))=0.}

Invariantly, a characteristic hypersurface is a hypersurface whoseconormal bundle is in the characteristic set ofP.

Qualitative analysis of characteristics

[edit]

Characteristics are also a powerful tool for gaining qualitative insight into a PDE.

One can use the crossings of the characteristics to findshock waves forpotential flow in acompressible fluid. Intuitively, we can think of each characteristic line implying a solution tou{\displaystyle u} along itself. Thus, when two characteristics cross, the function becomes multi-valued resulting in a non-physical solution. Physically, this contradiction is removed by the formation of a shock wave, a tangential discontinuity or a weak discontinuity and can result in non-potential flow, violating the initial assumptions.[8]

Characteristics may fail to cover part of the domain of the PDE. This is called ararefaction, and indicates the solution typically exists only in aweak, i.e.integral equation, sense.

The direction of the characteristic lines indicates the flow of values through the solution, as the example above demonstrates. This kind of knowledge is useful when solving PDEs numerically as it can indicate whichfinite difference scheme is best for the problem.

See also

[edit]

Notes

[edit]
  1. ^Zachmanoglou & Thoe 1986, pp. 112–152.
  2. ^Pinchover & Rubinstein 2005, pp. 25–28.
  3. ^abJohn 1991, p. 9.
  4. ^Zauderer 2006, p. 82.
  5. ^Demidov 1982, pp. 331–333.
  6. ^"Partial Differential Equations (PDEs)—Wolfram Language Documentation".
  7. ^John 1991, pp. 19–24.
  8. ^Debnath, Lokenath (2005), "Conservation Laws and Shock Waves",Nonlinear Partial Differential Equations for Scientists and Engineers (2nd ed.), Boston: Birkhäuser, pp. 251–276,ISBN 0-8176-4323-0

References

[edit]

External links

[edit]
Finite difference
Parabolic
Hyperbolic
Others
Finite volume
Finite element
Meshless/Meshfree
Domain decomposition
Others
Related
Retrieved from "https://en.wikipedia.org/w/index.php?title=Method_of_characteristics&oldid=1317947977"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp