| Differential equations | |||||
|---|---|---|---|---|---|
| Scope | |||||
| Classification | |||||
Types
| |||||
Relation to processes | |||||
| Solution | |||||
Solution methods | |||||
| People | |||||
Inmathematics, alinear differential equation is adifferential equation that islinear in the unknown function and its derivatives, so it can be written in the formwherea0(x), ...,an(x) andb(x) are arbitrarydifferentiable functions that do not need to be linear, andy′, ...,y(n) are the successive derivatives of an unknown functiony of the variablex.
Such an equation is anordinary differential equation (ODE). Alinear differential equation may also be a linearpartial differential equation (PDE), if the unknown function depends on several variables, and the derivatives that appear in the equation arepartial derivatives.
A linear differential equation or asystem of linear equations such that the associated homogeneous equations have constant coefficients may besolved by quadrature, which means that the solutions may be expressed in terms ofintegrals. This is also true for a linear equation of order one, with non-constant coefficients. An equation of order two or higher with non-constant coefficients cannot, in general, be solved by quadrature. For order two,Kovacic's algorithm allows deciding whether there are solutions in terms of integrals, and computing them if any.
The solutions of homogeneous linear differential equations withpolynomial coefficients are calledholonomic functions. This class of functions is stable under sums, products,differentiation,integration, and contains many usual functions andspecial functions such asexponential function,logarithm,sine,cosine,inverse trigonometric functions,error function,Bessel functions andhypergeometric functions. Their representation by the defining differential equation and initial conditions allows making algorithmic (on these functions) most operations ofcalculus, such as computation ofantiderivatives,limits,asymptotic expansion, and numerical evaluation to any precision, with a certified error bound.
The highestorder of derivation that appears in a (linear) differential equation is theorder of the equation. The termb(x), which does not depend on the unknown function and its derivatives, is sometimes called theconstant term of the equation (by analogy withalgebraic equations), even when this term is a non-constant function. If the constant term is thezero function, then the differential equation is said to behomogeneous, as it is ahomogeneous polynomial in the unknown function and its derivatives. The equation obtained by replacing, in a linear differential equation, the constant term by the zero function is theassociated homogeneous equation. A differential equation hasconstant coefficients if onlyconstant functions appear as coefficients in the associated homogeneous equation.
Asolution of a differential equation is a function that satisfies the equation.The solutions of a homogeneous linear differential equation form avector space. In the ordinary case, this vector space has a finite dimension, equal to the order of the equation. All solutions of a linear differential equation are found by adding to a particular solution any solution of the associated homogeneous equation.
Abasic differential operator of orderi is a mapping that maps anydifferentiable function to itsith derivative, or, in the case of several variables, to one of itspartial derivatives of orderi. It is commonly denotedin the case ofunivariate functions, andin the case of functions ofn variables. The basic differential operators include the derivative of order 0, which is the identity mapping.
Alinear differential operator (abbreviated, in this article, aslinear operator or, simply,operator) is alinear combination of basic differential operators, with differentiable functions as coefficients. In the univariate case, a linear operator has thus the form[1]wherea0(x), ...,an(x) are differentiable functions, and the nonnegative integern is theorder of the operator (ifan(x) is not thezero function).
LetL be a linear differential operator. The application ofL to a functionf is usually denotedLf orLf(X), if one needs to specify the variable (this must not be confused with a multiplication). A linear differential operator is alinear operator, since it maps sums to sums and the product by ascalar to the product by the same scalar.
As the sum of two linear operators is a linear operator, as well as the product (on the left) of a linear operator by a differentiable function, the linear differential operators form avector space over thereal numbers or thecomplex numbers (depending on the nature of the functions that are considered). They form also afree module over thering of differentiable functions.
The language of operators allows a compact writing for differentiable equations: ifis a linear differential operator, then the equationmay be rewritten
There may be several variants to this notation; in particular the variable of differentiation may appear explicitly or not iny and the right-hand and of the equation, such asLy(x) =b(x) orLy =b.
Thekernel of a linear differential operator is itskernel as a linear mapping, that is thevector space of the solutions of the (homogeneous) differential equationLy = 0.
In the case of an ordinary differential operator of ordern,Carathéodory's existence theorem implies that, under very mild conditions, the kernel ofL is a vector space of dimensionn, and that the solutions of the equationLy(x) =b(x) have the formwherec1, ...,cn are arbitrary numbers. Typically, the hypotheses of Carathéodory's theorem are satisfied in an intervalI, if the functionsb,a0, ...,an are continuous inI, and there is a positive real numberk such that|an(x)| >k for everyx inI.
A homogeneous linear differential equation hasconstant coefficients if it has the formwherea1, ...,an are (real or complex) numbers. In other words, it has constant coefficients if it is defined by a linear operator with constant coefficients.
The study of these differential equations with constant coefficients dates back toLeonhard Euler, who introduced theexponential function, which is the unique solution of the equation, such that. It follows that thenth derivative of is, and this allows solving homogeneous linear differential equations rather easily.
Letbe a homogeneous linear differential equation with constant coefficients (that isa0, ...,an are real or complex numbers).
Searching solutions of this equation that have the formeαx is equivalent to searching the constantsα such thatFactoring outeαx (which is never zero), shows thatα must be a root of thecharacteristic polynomialof the differential equation, which is the left-hand side of thecharacteristic equation
When these roots are alldistinct, one hasn distinct solutions that are not necessarily real, even if the coefficients of the equation are real. These solutions can be shown to belinearly independent, by considering theVandermonde determinant of the values of these solutions atx = 0, ...,n – 1. Together they form abasis of thevector space of solutions of the differential equation (that is, the kernel of the differential operator).
| Example |
|---|
has the characteristic equationThis has zeros,i,−i, and1 (multiplicity 2). The solution basis is thusA real basis of solution is thus |
In the case where the characteristic polynomial has onlysimple roots, the preceding provides a complete basis of the solutions vector space. In the case ofmultiple roots, more linearly independent solutions are needed for having a basis. These have the formwherek is a nonnegative integer,α is a root of the characteristic polynomial of multiplicitym, andk <m. For proving that these functions are solutions, one may remark that ifα is a root of the characteristic polynomial of multiplicitym, the characteristic polynomial may be factored asP(t)(t −α)m. Thus, applying the differential operator of the equation is equivalent with applying firstm times the operator, and then the operator that hasP as characteristic polynomial. By theexponential shift theorem,
and thus one gets zero afterk + 1 application of.
As, by thefundamental theorem of algebra, the sum of the multiplicities of the roots of a polynomial equals the degree of the polynomial, the number of above solutions equals the order of the differential equation, and these solutions form a basis of the vector space of the solutions.
In the common case where the coefficients of the equation are real, it is generally more convenient to have a basis of the solutions consisting ofreal-valued functions. Such a basis may be obtained from the preceding basis by remarking that, ifa +ib is a root of the characteristic polynomial, thena –ib is also a root, of the same multiplicity. Thus a real basis is obtained by usingEuler's formula, and replacing and by and.
A homogeneous linear differential equation of the second order may be writtenand its characteristic polynomial is
Ifa andb arereal, there are three cases for the solutions, depending on the discriminantD =a2 − 4b. In all three cases, the general solution depends on two arbitrary constantsc1 andc2.
Finding the solutiony(x) satisfyingy(0) =d1 andy′(0) =d2, one equates the values of the above general solution at0 and its derivative there tod1 andd2, respectively. This results in a linear system of two linear equations in the two unknownsc1 andc2. Solving this system gives the solution for a so-calledCauchy problem, in which the values at0 for the solution of the DEQ and its derivative are specified.
A non-homogeneous equation of ordern with constant coefficients may be writtenwherea1, ...,an are real or complex numbers,f is a given function ofx, andy is the unknown function (for sake of simplicity, "(x)" will be omitted in the following).
There are several methods for solving such an equation. The best method depends on the nature of the functionf that makes the equation non-homogeneous. Iff is a linear combination of exponential and sinusoidal functions, then theexponential response formula may be used. If, more generally,f is a linear combination of functions of the formxneax,xn cos(ax), andxn sin(ax), wheren is a nonnegative integer, anda a constant (which need not be the same in each term), then themethod of undetermined coefficients may be used. Still more general, theannihilator method applies whenf satisfies a homogeneous linear differential equation, typically, aholonomic function.
The most general method is thevariation of constants, which is presented here.
The general solution of the associated homogeneous equationiswhere(y1, ...,yn) is a basis of the vector space of the solutions andu1, ...,un are arbitrary constants. The method of variation of constants takes its name from the following idea. Instead of consideringu1, ...,un as constants, they can be considered as unknown functions that have to be determined for makingy a solution of the non-homogeneous equation. For this purpose, one adds the constraintswhich imply (byproduct rule andinduction)fori = 1, ...,n – 1, and
Replacing in the original equationy and its derivatives by these expressions, and using the fact thaty1, ...,yn are solutions of the original homogeneous equation, one gets
This equation and the above ones with0 as left-hand side form a system ofn linear equations inu′1, ...,u′n whose coefficients are known functions (f, theyi, and their derivatives). This system can be solved by any method oflinear algebra. The computation ofantiderivatives givesu1, ...,un, and theny =u1y1 + ⋯ +unyn.
As antiderivatives are defined up to the addition of a constant, one finds again that the general solution of the non-homogeneous equation is the sum of an arbitrary solution and the general solution of the associated homogeneous equation.
The general form of a linear ordinary differential equation of order 1, after dividing out the coefficient ofy′(x), is:
If the equation is homogeneous, i.e.g(x) = 0, one may rewrite and integrate:wherek is an arbitraryconstant of integration and is anyantiderivative off. Thus, the general solution of the homogeneous equation iswherec =ek is an arbitrary constant.
For the general non-homogeneous equation, it is useful to multiply both sides of the equation by thereciprocale−F of a solution of the homogeneous equation.[2] This givesAs theproduct rule allows rewriting the equation asThus, the general solution iswherec is a constant of integration, andF is any antiderivative off (changing of antiderivative amounts to change the constant of integration).
Solving the equationThe associated homogeneous equation givesthat is
Dividing the original equation by one of these solutions givesThat isandFor the initial conditionone gets the particular solution
A system of linear differential equations consists of several linear differential equations that involve several unknown functions. In general one restricts the study to systems such that the number of unknown functions equals the number of equations.
An arbitrary linear ordinary differential equation and a system of such equations can be converted into a first order system of linear differential equations by adding variables for all but the highest order derivatives. That is, if appear in an equation, one may replace them by new unknown functions that must satisfy the equations and fori = 1, ...,k – 1.
A linear system of the first order, which hasn unknown functions andn differential equations may normally be solved for the derivatives of the unknown functions. If it is not the case this is adifferential-algebraic system, and this is a different theory. Therefore, the systems that are considered here have the formwhere and the are functions ofx. In matrix notation, this system may be written (omitting "(x)")
The solving method is similar to that of a single first order linear differential equations, but with complications stemming from noncommutativity of matrix multiplication.
Letbe the homogeneous equation associated to the above matrix equation.Its solutions form avector space of dimensionn, and are therefore the columns of asquare matrix of functions, whosedeterminant is not the zero function. Ifn = 1, orA is a matrix of constants, or, more generally, ifA commutes with itsantiderivative, then one may chooseU equal theexponential ofB. In fact, in these cases, one hasIn the general case there is no closed-form solution for the homogeneous equation, and one has to use either anumerical method, or an approximation method such asMagnus expansion.
Knowing the matrixU, the general solution of the non-homogeneous equation iswhere the column matrix is an arbitraryconstant of integration.
If initial conditions are given asthe solution that satisfies these initial conditions is
A linear ordinary equation of order one with variable coefficients may be solved byquadrature, which means that the solutions may be expressed in terms ofintegrals. This is not the case for order at least two. This is the main result ofPicard–Vessiot theory which was initiated byÉmile Picard andErnest Vessiot, and whose recent developments are calleddifferential Galois theory.
The impossibility of solving by quadrature can be compared with theAbel–Ruffini theorem, which states that analgebraic equation of degree at least five cannot, in general, be solved by radicals. This analogy extends to the proof methods and motivates the denomination ofdifferential Galois theory.
Similarly to the algebraic case, the theory allows deciding which equations may be solved by quadrature, and if possible solving them. However, for both theories, the necessary computations are extremely difficult, even with the most powerful computers.
Nevertheless, the case of order two with rational coefficients has been completely solved byKovacic's algorithm.
Cauchy–Euler equations are examples of equations of any order, with variable coefficients, that can be solved explicitly. These are the equations of the form where are constant coefficients.
Aholonomic function, also called aD-finite function, is a function that is a solution of a homogeneous linear differential equation with polynomial coefficients.
Most functions that are commonly considered in mathematics are holonomic or quotients of holonomic functions. In fact, holonomic functions includepolynomials,algebraic functions,logarithm,exponential function,sine,cosine,hyperbolic sine,hyperbolic cosine,inverse trigonometric andinverse hyperbolic functions, and manyspecial functions such asBessel functions andhypergeometric functions.
Holonomic functions have severalclosure properties; in particular, sums, products,derivative andintegrals of holonomic functions are holonomic. Moreover, these closure properties are effective, in the sense that there arealgorithms for computing the differential equation of the result of any of these operations, knowing the differential equations of the input.[3]
Usefulness of the concept of holonomic functions results of Zeilberger's theorem, which follows.[3]
Aholonomic sequence is a sequence of numbers that may be generated by arecurrence relation with polynomial coefficients. The coefficients of theTaylor series at a point of a holonomic function form a holonomic sequence. Conversely, if the sequence of the coefficients of apower series is holonomic, then the series defines a holonomic function (even if theradius of convergence is zero). There are efficient algorithms for both conversions, that is for computing the recurrence relation from the differential equation, andvice versa.[3]
It follows that, if one represents (in a computer) holonomic functions by their defining differential equations and initial conditions, mostcalculus operations can be done automatically on these functions, such asderivative,indefinite anddefinite integral, fast computation of Taylor series (thanks of the recurrence relation on its coefficients), evaluation to a high precision with certified bound of theapproximation error,limits, localization ofsingularities,asymptotic behavior at infinity and near singularities, proof of identities, etc.[4]