Mathematical induction can be informally illustrated by reference to the sequential effect offalling dominoes.[1][2]
Mathematical induction is a method forproving that a statement is true for everynatural number, that is, that the infinitely many cases all hold. This is done by first proving a simple case, then also showing that if we assume the claim is true for a given case, then the next case is also true. Informal metaphors help to explain this technique, such as falling dominoes or climbing a ladder:
Mathematical induction proves that we can climb as high as we like on a ladder, by proving that we can climb onto the bottom rung (thebasis) and that from each rung we can climb up to the next one (thestep).
Aproof by induction consists of two cases. The first, thebase case, proves the statement for without assuming any knowledge of other cases. The second case, theinduction step, proves thatif the statement holds for any given case,then it must also hold for the next case. These two steps establish that the statement holds for every natural number. The base case does not necessarily begin with, but often with, and possibly with any fixed natural number, establishing the truth of the statement for all natural numbers.
Despite its name, mathematical induction differs fundamentally frominductive reasoning asused in philosophy, in which the examination of many cases results in a probable conclusion. The mathematical method examines infinitely many cases to prove a general statement, but it does so by a finite chain ofdeductive reasoning involving thevariable, which can take infinitely many values. The result is a rigorous proof of the statement, not an assertion of its probability.[4]
In 370 BC,Plato'sParmenides may have contained traces of an early example of an implicit inductive proof,[5] however, the earliest implicit proof by mathematical induction was written byal-Karaji around 1000 AD, who applied it toarithmetic sequences to prove thebinomial theorem and properties ofPascal's triangle. Whilst the original work was lost, it was later referenced byAl-Samawal al-Maghribi in his treatiseal-Bahir fi'l-jabr (The Brilliant in Algebra) in around 1150 AD.[6][7][8]
Katz says in his history of mathematics
Another important idea introduced by al-Karaji and continued by al-Samaw'al and others was that of an inductive argument for dealing with certain arithmetic sequences. Thus al-Karaji used such an argument to prove the result on the sums of integral cubes already known toAryabhata [...] Al-Karaji did not, however, state a general result for arbitraryn. He stated his theorem for the particular integer 10 [...] His proof, nevertheless, was clearly designed to be extendable to any other integer. [...] Al-Karaji's argument includes in essence the two basic components of a modern argument by induction, namely thetruth of the statement forn = 1 (1 = 13) and the deriving of the truth forn =k from that ofn =k − 1. Of course, this second component is not explicit since, in some sense, al-Karaji's argument is in reverse; this is, he starts fromn = 10 and goes down to 1 rather than proceeding upward. Nevertheless, his argument inal-Fakhri is the earliest extant proof ofthe sum formula for integral cubes.[9]
None of these ancient mathematicians, however, explicitly stated the induction hypothesis. Another similar case (contrary to what Vacca has written, as Freudenthal carefully showed)[11] was that ofFrancesco Maurolico in hisArithmeticorum libri duo (1575), who used the technique to prove that the sum of the firstnoddintegers isn2.
The earliestrigorous use of induction was byGersonides (1288–1344).[12][13] The first explicit formulation of the principle of induction was given byPascal in hisTraité du triangle arithmétique (1665). Another Frenchman,Fermat, made ample use of a related principle: indirect proof byinfinite descent.
The simplest and most common form of mathematical induction infers that a statement involving anatural numbern (that is, an integern ≥ 0 or 1) holds for all values ofn. The proof consists of two steps:
Thebase case (orinitial case): prove that the statement holds for 0, or 1.
Theinduction step (orinductive step, orstep case): prove that for everyn, if the statement holds forn, then it holds forn + 1. In other words, assume that the statement holds for some arbitrary natural numbern, and prove that the statement holds forn + 1.
The hypothesis in the induction step, that the statement holds for a particularn, is called theinduction hypothesis orinductive hypothesis. To prove the induction step, one assumes the induction hypothesis forn and then uses this assumption to prove that the statement holds forn + 1.
Authors who prefer to define natural numbers to begin at 0 use that value in the base case; those who define natural numbers to begin at 1 use that value.
Mathematical induction can be used to prove the following statement for all natural numbers:
This states a general formula for the sum of the natural numbers less than or equal to a given number; in fact an infinite sequence of statements:,,, etc.
Proposition. For every, we have that
Proof. Let be the statement We give a proof by induction on.
Base case: Show that the statement holds for the smallest natural numbern = 0.
is clearly true:
Induction step: Show that for every, if holds, then also holds.
Assume the induction hypothesis that for a particular, the single case holds, meaning is true:It follows that:
Equating the extreme left hand and right hand sides, we deduce that: That is, the statement also holds true, establishing the induction step.
Conclusion: Since both the base case and the induction step have been proved as true, by mathematical induction the statement holds for every natural number.Q.E.D.
Induction is often used to proveinequalities. As an example, we prove that for anyreal number and natural number.
At first glance, it may appear that a more general version, for anyreal numbers, could be proven without induction; but the case shows it may be false for non-integer values of. This suggests we examine the statement specifically fornatural values of, and induction is the readiest tool.
Proposition. For any and,.
Proof. Fix an arbitrary real number, and let be the statement. We induce on.
Base case: The calculation verifies.
Induction step: We show theimplication for any natural number. Assume the induction hypothesis: for a given value, the single case is true. Using theangle addition formula and thetriangle inequality, we deduce:
The inequality between the extreme left-hand and right-hand quantities shows that is true, which completes the induction step.
Conclusion: The proposition holds for all natural numbers Q.E.D.
In practice, proofs by induction are often structured differently, depending on the exact nature of the property to be proven.All variants of induction are special cases oftransfinite induction; seebelow.
If one wishes to prove a statement, not for all natural numbers, but only for all numbersn greater than or equal to a certain numberb, then the proof by induction consists of the following:
Showing that the statement holds whenn =b.
Showing that if the statement holds for an arbitrary numbern ≥b, then the same statement also holds forn + 1.
This can be used, for example, to show that2n ≥n + 5 forn ≥ 3.
In this way, one can prove that some statementP(n) holds for alln ≥ 1, or even for alln ≥ −5. This form of mathematical induction is actually a special case of the previous form, because if the statement to be proved isP(n) then proving it with these two rules is equivalent with provingP(n +b) for all natural numbersn with an induction base case0.[17]
Assume an infinite supply of 4- and 5-dollar coins. Induction can be used to prove that any whole amount of dollars greater than or equal to12 can be formed by a combination of such coins. LetS(k) denote the statement "k dollars can be formed by a combination of 4- and 5-dollar coins". The proof thatS(k) is true for allk ≥ 12 can then be achieved by induction onk as follows:
Base case: Showing thatS(k) holds fork = 12 is simple: take three 4-dollar coins.
Induction step: Given thatS(k) holds for some value ofk ≥ 12 (induction hypothesis), prove thatS(k + 1) holds, too. AssumeS(k) is true for some arbitraryk ≥ 12. If there is a solution fork dollars that includes at least one 4-dollar coin, replace it by a 5-dollar coin to makek + 1 dollars. Otherwise, if only 5-dollar coins are used,k must be a multiple of 5 and so at least 15; but then we can replace three 5-dollar coins by four 4-dollar coins to makek + 1 dollars. In each case,S(k + 1) is true.
Therefore, by the principle of induction,S(k) holds for allk ≥ 12, and the proof is complete.
In this example, althoughS(k) also holds for, the above proof cannot be modified to replace the minimum amount of12 dollar to any lower valuem. Form = 11, the base case is actually false; form = 10, the second case in the induction step (replacing three 5- by four 4-dollar coins) will not work; let alone for even lowerm.
It is sometimes desirable to prove a statement involving two natural numbers,n andm, by iterating the induction process. That is, one proves a base case and an induction step forn, and in each of those proves a base case and an induction step form. See, for example, theproof of commutativity accompanyingaddition of natural numbers. More complicated arguments involving three or more counters are also possible.
The method of infinite descent is a variation of mathematical induction which was used byPierre de Fermat. It is used to show that some statementQ(n) is false for all natural numbersn. Its traditional form consists of showing that ifQ(n) is true for some natural numbern, it also holds for some strictly smaller natural numberm. Because there are no infinite decreasing sequences of natural numbers, this situation would be impossible, thereby showing (by contradiction) thatQ(n) cannot be true for anyn.
The validity of this method can be verified from the usual principle of mathematical induction. Using mathematical induction on the statementP(n) defined as "Q(m) is false for all natural numbersm less than or equal ton", it follows thatP(n) holds for alln, which means thatQ(n) is false for every natural numbern.
If one wishes to prove that a propertyP holds for all natural numbers less than or equal to a fixedN, proving thatP satisfies the following conditions suffices:[18]
P holds for 0,
For any natural numberx less thanN, ifP holds forx, thenP holds forx + 1
The most common form of proof by mathematical induction requires proving in the induction step that
whereupon the induction principle "automates"n applications of this step in getting fromP(0) toP(n). This could be called "predecessor induction" because each step proves something about a number from something about that number's predecessor.
A variant of interest incomputational complexity is "prefix induction", in which one proves the following statement in the induction step:or equivalently
The induction principle then "automates"log2n applications of this inference in getting fromP(0) toP(n). In fact, it is called "prefix induction" because each step proves something about a number from something about the "prefix" of that number — as formed by truncating the low bit of itsbinary representation. It can also be viewed as an application of traditional induction on the length of that binary representation.
If traditional predecessor induction is interpreted computationally as ann-step loop, then prefix induction would correspond to a log-n-step loop. Because of that, proofs using prefix induction are "more feasibly constructive" than proofs using predecessor induction.
Predecessor induction can trivially simulate prefix induction on the same statement. Prefix induction can simulate predecessor induction, but only at the cost of making the statement more syntactically complex (adding aboundeduniversal quantifier), so the interesting results relating prefix induction topolynomial-time computation depend on excluding unbounded quantifiers entirely, and limiting the alternation of bounded universal andexistential quantifiers allowed in the statement.[19]
One can take the idea a step further: one must provewhereupon the induction principle "automates"log logn applications of this inference in getting fromP(0) toP(n). This form of induction has been used, analogously, to study log-time parallel computation.[citation needed]
Another variant, calledcomplete induction,course of values induction orstrong induction (in contrast to which the basic form of induction is sometimes known asweak induction), makes the induction step easier to prove by using a stronger hypothesis: one proves the statement under the assumption that holds forall natural numbers less than; by contrast, the basic form only assumes. The name "strong induction" does not mean that this method can prove more than "weak induction", but merely refers to the stronger hypothesis used in the induction step.
In fact, it can be shown that the two methods are actually equivalent, as explained below. In this form of complete induction, one still has to prove the base case,, and it may even be necessary to prove extra-base cases such as before the general argument applies, as in the example below of theFibonacci number.
Although the form just described requires one to prove the base case, this is unnecessary if one can prove (assuming for all lower) for all. This is a special case oftransfinite induction as described below, although it is no longer equivalent to ordinary induction. In this form the base case is subsumed by the case, where is proved with no other assumed; this case may need to be handled separately, but sometimes the same argument applies for and, making the proof simpler and more elegant.In this method, however, it is vital to ensure that the proof of does not implicitly assume that, e.g. by saying "choose an arbitrary", or by assuming that a set ofm elements has an element.
Complete induction is equivalent to ordinary mathematical induction as described above, in the sense that a proof by one method can be transformed into a proof by the other. Suppose there is a proof of by complete induction. Then, this proof can be transformed into an ordinary induction proof by assuming a stronger inductive hypothesis. Let be the statement " holds for all such that"—this becomes the inductive hypothesis for ordinary induction. We can then show and for assuming only and show that implies.[20]
If, on the other hand, had been proven by ordinary induction, the proof would already effectively be one by complete induction: is proved in the base case, using no assumptions, and is proved in the induction step, in which one may assume all earlier cases but need only use the case.
Complete induction is most useful when several instances of the inductive hypothesis are required for each induction step. For example, complete induction can be used to show thatwhere is then-thFibonacci number, and (thegolden ratio) and are theroots of thepolynomial. By using the fact that for each, the identity above can be verified by direct calculation for if one assumes that it already holds for both and. To complete the proof, the identity must be verified in the two base cases: and.
Another proof by complete induction uses the hypothesis that the statement holds forall smaller more thoroughly. Consider the statement that "everynatural number greater than 1 is a product of (one or more)prime numbers", which is the "existence" part of thefundamental theorem of arithmetic. For proving the induction step, the induction hypothesis is that for a given the statement holds for all smaller. If is prime then it is certainly a product of primes, and if not, then by definition it is a product:, where neither of the factors is equal to 1; hence neither is equal to, and so both are greater than 1 and smaller than. The induction hypothesis now applies to and, so each one is a product of primes. Thus is a product of products of primes, and hence by extension a product of primes itself.
We shall look to prove the same example asabove, this time withstrong induction. The statement remains the same:
However, there will be slight differences in the structure and the assumptions of the proof, starting with the extended base case.
Proof.
Base case: Show that holds for.
The base case holds.
Induction step: Given some, assume holds for all with. Prove that holds.
Choosing, and observing that shows that holds, by the inductive hypothesis. That is, the sum can be formed by some combination of and dollar coins. Then, simply adding a dollar coin to that combination yields the sum. That is, holds[21] Q.E.D.
Sometimes, it is more convenient to deduce backwards, proving the statement for, given its validity for. However, proving the validity of the statement for no single number suffices to establish the base case; instead, one needs to prove the statement for an infinite subset of the natural numbers. For example,Augustin Louis Cauchy first used forward (regular) induction to prove theinequality of arithmetic and geometric means for allpowers of 2, and then used backwards induction to show it for all natural numbers.[22][23]
The induction step must be proved for all values ofn. To illustrate this, Joel E. Cohen proposed the following argument, which purports to prove by mathematical induction thatall horses are of the same color:[24]
Base case: in a set of onlyone horse, there is only one color.
Induction step: assume as induction hypothesis that within any set of horses, there is only one color. Now look at any set of horses. Number them:. Consider the sets and. Each is a set of only horses, therefore within each there is only one color. But the two sets overlap, so there must be only one color among all horses.
The base case is trivial, and the induction step is correct in all cases. However, the argument used in the induction step is incorrect for, because the statement that "the two sets overlap" is false for and.
In words, the base caseP(0) and the induction step (namely, that the induction hypothesisP(k) impliesP(k + 1)) together imply thatP(n) for any natural numbern. The axiom of induction asserts the validity of inferring thatP(n) holds for any natural numbern from the base case and the induction step.
The first quantifier in the axiom ranges overpredicates rather than over individual numbers. This is a second-order quantifier, which means that this axiom is stated insecond-order logic. Axiomatizing arithmetic induction infirst-order logic requires anaxiom schema containing a separate axiom for each possible predicate. The articlePeano axioms contains further discussion of this issue.
The axiom of structural induction for the natural numbers was first formulated by Peano, who used it to specify the natural numbers together with the following four other axioms:
0 is a natural number.
The successor functions of every natural number yields a natural number(s(x) =x + 1).
Infirst-orderZFC set theory, quantification over predicates is not allowed, but one can still express induction by quantification over sets:A may be read as a set representing a proposition, and containing natural numbers, for which the proposition holds. This is not an axiom, but a theorem, given that natural numbers are defined in the language of ZFC set theory by axioms, analogous to Peano's. Seeconstruction of the natural numbers using theaxiom of infinity andaxiom schema of specification.
One variation of the principle of complete induction can be generalized for statements about elements of anywell-founded set, that is, a set with anirreflexive relation < that contains noinfinite descending chains. Every set representing anordinal number is well-founded, the set of natural numbers is one of them.
Applied to a well-founded set, transfinite induction can be formulated as a single step. To prove that a statementP(n) holds for each ordinal number:
Show, for each ordinal numbern, that ifP(m) holds for allm <n, thenP(n) also holds.
This form of induction, when applied to a set of ordinal numbers (which form awell-ordered and hence well-foundedclass), is calledtransfinite induction. It is an important proof technique inset theory,topology and other fields.
Proofs by transfinite induction typically distinguish three cases:
whenn is a minimal element, i.e. there is no element smaller thann;
whenn has a direct predecessor, i.e. the set of elements which are smaller thann has a largest element;
whenn has no direct predecessor, i.e.n is a so-calledlimit ordinal.
Strictly speaking, it is not necessary in transfinite induction to prove a base case, because it is avacuous special case of the proposition that ifP is true of alln <m, thenP is true ofm. It is vacuously true precisely because there are no values ofn <m that could serve as counterexamples. So the special cases are special cases of the general case.
The principle of mathematical induction is usually stated as anaxiom of the natural numbers; seePeano axioms. It is strictly stronger than thewell-ordering principle in the context of the other Peano axioms. Suppose the following:
Thetrichotomy axiom: For any natural numbersn andm,n is less than or equal tom if and only ifm is not less thann.
For any natural numbern,n + 1 is greaterthann.
For any natural numbern, no natural number isbetweenn andn + 1.
No natural number is less than zero.
It can then be proved that induction, given the above-listed axioms, implies the well-ordering principle. The following proof uses complete induction and the first and fourth axioms.
Proof. Suppose there exists anon-empty set,S, of natural numbers that has no least element. LetP(n) be the assertion thatn is not inS. ThenP(0) is true, for if it were false then 0 is the least element ofS. Furthermore, letn be a natural number, and supposeP(m) is true for all natural numbersm less thann + 1. Then ifP(n + 1) is falsen + 1 is inS, thus being a minimal element inS, a contradiction. ThusP(n + 1) is true. Therefore, by the complete induction principle,P(n) holds for all natural numbersn; soS is empty, a contradiction. Q.E.D.
"Number line" for the set{(0,n):n ∈N} ∪{(1,n):n ∈N}. Numbers refer to the second component of pairs; the first can be obtained from color or location.
On the other hand, the set, shown in the picture, is well-ordered[25]: 35lf by thelexicographic order.Moreover, except for the induction axiom, it satisfies all Peano axioms, where Peano's constant 0 is interpreted as the pair (0, 0), and Peano'ssuccessor function is defined on pairs bysucc(x,n) = (x,n + 1) for all and.As an example for the violation of the induction axiom, define the predicateP(x,n) as(x,n) = (0, 0) or(x,n) = succ(y,m) for some and. Then the base caseP(0, 0) is trivially true, and so is the induction step: ifP(x,n), thenP(succ(x,n)). However,P is not true for all pairs in the set, sinceP(1,0) is false.
Peano's axioms with the induction principle uniquely model the natural numbers. Replacing the induction principle with the well-ordering principle allows for more exotic models that fulfill all the axioms.[25]
It is mistakenly printed in several books[25] and sources that the well-ordering principle is equivalent to the induction axiom. In the context of the other Peano axioms, this is not the case, but in the context of other axioms, they are equivalent;[25] specifically, the well-ordering principle implies the induction axiom in the context of the first two above listed axioms and
Every natural number is either 0 orn + 1 for some natural numbern.
A common mistake in many erroneous proofs is to assume thatn − 1 is a unique and well-defined natural number, a property which is not implied by the other Peano axioms.[25]
^"The Binomial Theorem".mathcenter.oxford.emory.edu. Retrieved2 December 2024.That said, he was not the first person to study it. The Persian mathematician and engineer Al-Karaji, who lived from 935 to 1029 is currently credited with its discovery. (Interesting tidbit: Al-Karaji also introduced the powerful idea of arguing by mathematical induction.)
^abCajori (1918), p. 197: 'The process of reasoning called "Mathematical Induction" has had several independent origins. It has been traced back to the Swiss Jakob (James) Bernoulli, the Frenchman B. Pascal and P. Fermat, and the Italian F. Maurolycus. [...] By reading a little between the lines one can find traces of mathematical induction still earlier, in the writings of the Hindus and the Greeks, as, for instance, in the "cyclic method" of Bhaskara, and in Euclid's proof that the number of primes is infinite.'
^"It is sometimes required to prove a theorem which shall be true whenever a certain quantityn which it involves shall be an integer or whole number and the method of proof is usually of the following kind.1st. The theorem is proved to be truewhenn = 1.2ndly. It is proved that if the theorem is true whenn is a given whole number, it will be true ifn is the next greater integer. Hence the theorem is true universally. … This species of argument may be termed a continuedsorites" (Boole c. 1849Elementary Treatise on Logic not mathematical pp. 40–41 reprinted inGrattan-Guinness, Ivor and Bornet, Gérard (1997),George Boole: Selected Manuscripts on Logic and its Philosophy, Birkhäuser Verlag, Berlin,ISBN3-7643-5456-9)
^Cohen, Joel E. (1961). "On the nature of mathematical proof".Opus.. Reprinted inA Random Walk in Science (R. L. Weber, ed.), Crane, Russak & Co., 1973.