Inmathematical logic, theLöwenheim–Skolem theorem is a theorem on the existence andcardinality ofmodels, named afterLeopold Löwenheim andThoralf Skolem.
The precise formulation is given below. It implies that if acountablefirst-ordertheory has an infinitemodel, then for every infinitecardinal numberκ it has a model of sizeκ, and that no first-order theory with an infinite model can have a unique modelup to isomorphism. As a consequence, first-order theories are unable to control the cardinality of their infinite models.
The (downward) Löwenheim–Skolem theorem is one of the two key properties, along with thecompactness theorem, that are used inLindström's theorem to characterizefirst-order logic. In general, the Löwenheim–Skolem theorem does not hold in stronger logics such assecond-order logic.
In its general form, theLöwenheim–Skolem Theorem states that for everysignatureσ, every infiniteσ-structureM and every infinite cardinal numberκ ≥ |σ|, there is aσ-structureN such that|N| =κ and such that
The theorem is often divided into two parts corresponding to the two cases above. The part of the theorem asserting that a structure has elementary substructures of all smaller infinite cardinalities is known as thedownward Löwenheim–Skolem Theorem.[1]: 160–162 The part of the theorem asserting that a structure has elementary extensions of all larger cardinalities is known as theupward Löwenheim–Skolem Theorem.[2]
Below we elaborate on the general concept of signatures and structures.
Asignature consists of a set of function symbolsSfunc, a set of relation symbolsSrel, and a function representing thearity of function and relation symbols. (A nullary function symbol is called a constant symbol.) In the context of first-order logic, a signature is sometimes called a language. It is called countable if the set of function and relation symbols in it is countable, and in general the cardinality of a signature is the cardinality of the set of all the symbols it contains.
A first-ordertheory consists of a fixed signature and a fixed set of sentences (formulas with no free variables) in that signature.[3]: 40 Theories are often specified by giving a list of axioms that generate the theory, or by giving a structure and taking the theory to consist of the sentences satisfied by the structure.
Given a signatureσ, aσ-structureMis a concrete interpretation of the symbols inσ. It consists of an underlying set (often also denoted by "M") together with an interpretation of the function and relation symbols ofσ. An interpretation of a constant symbol ofσ inM is simply an element ofM. More generally, an interpretation of ann-ary function symbolf is a function fromMn toM. Similarly, an interpretation of a relation symbolR is ann-ary relation onM, i.e. a subset of Mn.
A substructure of aσ-structureM is obtained by taking a subsetN ofM which is closed under the interpretations of all the function symbols inσ (hence includes the interpretations of all constant symbols inσ), and then restricting the interpretations of the relation symbols toN. Anelementary substructure is a very special case of this; in particular an elementary substructure satisfies exactly the same first-order sentences as the original structure (its elementary extension).
The statement given in the introduction follows immediately by takingM to be an infinite model of the theory. The proof of the upward part of the theorem also shows that a theory with arbitrarily large finite models must have an infinite model; sometimes this is considered to be part of the theorem.[1]
A theory is calledcategorical if it has only one model, up to isomorphism. This term was introduced byVeblen (1904), and for some time thereafter mathematicians hoped they could put mathematics on a solid foundation by describing a categorical first-order theory of some version of set theory. The Löwenheim–Skolem theorem dealt a first blow to this hope, as it implies that a first-order theory which has an infinite model cannot be categorical. Later, in 1931, the hope was shattered completely byGödel's incompleteness theorem.[1]
Many consequences of the Löwenheim–Skolem theorem seemed counterintuitive to logicians in the early 20th century, as the distinction between first-order and non-first-order properties was not yet understood. One such consequence is the existence of uncountable models oftrue arithmetic, which satisfy every first-orderinduction axiom but have non-inductive subsets.
LetN denote the natural numbers andR the reals. It follows from the theorem that the theory of (N, +, ×, 0, 1) (the theory of true first-order arithmetic) has uncountable models, and that the theory of (R, +, ×, 0, 1) (the theory ofreal closed fields) has a countable model. There are, of course, axiomatizations characterizing (N, +, ×, 0, 1) and (R, +, ×, 0, 1) up to isomorphism. The Löwenheim–Skolem theorem shows that these axiomatizations cannot be first-order. For example, in the theory of the real numbers, the completeness of a linear order used to characterizeR as a complete ordered field, is anon-first-order property.[1]: 161
Another consequence that was considered particularly troubling is the existence of a countable model of set theory, which nevertheless must satisfy the sentence saying the real numbers are uncountable.Cantor's theorem states that some sets are uncountable. This counterintuitive situation came to be known asSkolem's paradox; it shows that the notion of countability is notabsolute.[4]
For each first-order-formula, theaxiom of choice implies the existence of a function
such that, for all, either
or
Applying the axiom of choice again we get a function from the first-order formulas to such functions.
The family of functions gives rise to apreclosure operator on thepower set of
for.
Iterating countably many times results in aclosure operator. Taking an arbitrary subset such that, and having defined, one can see that also. Then is an elementary substructure of by theTarski–Vaught test.
The trick used in this proof is essentially due to Skolem, who introduced function symbols for theSkolem functions into the language. One could also define the aspartial functions such that is defined if and only if. The only important point is that is a preclosure operator such that contains a solution for every formula with parameters in which has a solution in and that
First, one extends the signature by adding a new constant symbol for every element of. The complete theory of for the extended signature is called theelementary diagram of. In the next step one adds many new constant symbols to the signature and adds to the elementary diagram of the sentences for any two distinct new constant symbols and. Using thecompactness theorem, the resulting theory is easily seen to be consistent. Since its models must have cardinality at least, the downward part of this theorem guarantees the existence of a model which has cardinality exactly. It contains an isomorphic copy of as an elementary substructure.[5][6]: 100–102
Although the (classical) Löwenheim–Skolem theorem is tied very closely to first-order logic, variants hold for other logics. For example, every consistent theory insecond-order logic has a model smaller than the firstsupercompact cardinal (assuming one exists). The minimum size at which a (downward) Löwenheim–Skolem–type theorem applies in a logic is known as the Löwenheim number, and can be used to characterize that logic's strength. Moreover, if we go beyond first-order logic, we must give up one of three things: countable compactness, the downward Löwenheim–Skolem Theorem, or the properties of anabstract logic.[7]: 134
This account is based mainly onDawson (1993). To understand the early history of model theory one must distinguish betweensyntactical consistency (no contradiction can be derived using the deduction rules for first-order logic) andsatisfiability (there is a model). Somewhat surprisingly, even before thecompleteness theorem made the distinction unnecessary, the termconsistent was used sometimes in one sense and sometimes in the other.
The first significant result in what later becamemodel theory wasLöwenheim's theorem inLeopold Löwenheim's publication "Über Möglichkeiten im Relativkalkül" (1915):
Löwenheim's paper was actually concerned with the more generalPeirce–Schrödercalculus of relatives (relation algebra with quantifiers).[1] He also used the now antiquated notations ofErnst Schröder. For a summary of the paper in English and using modern notations seeBrady (2000, chapter 8).
According to the received historical view, Löwenheim's proof was faulty because it implicitly usedKőnig's lemma without proving it, although the lemma was not yet a published result at the time. In arevisionist account,Badesa (2004) considers that Löwenheim's proof was complete.
Skolem (1920) gave a (correct) proof using formulas in what would later be calledSkolem normal form and relying on the axiom of choice:
Skolem (1922) also proved the following weaker version without the axiom of choice:
Skolem (1929) simplifiedSkolem (1920). Finally,Anatoly Ivanovich Maltsev (Анато́лий Ива́нович Ма́льцев, 1936) proved the Löwenheim–Skolem theorem in its full generality (Maltsev 1936). He cited a note by Skolem, according to which the theorem had been proved byAlfred Tarski in a seminar in 1928. Therefore, the general theorem is sometimes known as theLöwenheim–Skolem–Tarski theorem. But Tarski did not remember his proof, and it remains a mystery how he could do it without thecompactness theorem.
It is somewhat ironic that Skolem's name is connected with the upward direction of the theorem as well as with the downward direction:
The Löwenheim–Skolem theorem is treated in all introductory texts onmodel theory ormathematical logic.