Traditionally, expressions in formal systems have been regarded assignifying finite inscriptions which are—at least inprinciple—capable of actually being written out in primitivenotation. However, the fact that (first-order) formulas may beidentified with natural numbers (via “Gödelnumbering”) and hence with finitesets makes it nolonger necessary to regard formulas as inscriptions, and suggests thepossibility of fashioning “languages” some of whoseformulas would be naturally identified asinfinite sets. A“language” of this kind is called aninfinitarylanguage: in this article I discuss those infinitary languageswhich can be obtained in a straightforward manner from first-orderlanguages by allowing conjunctions, disjunctions and, possibly,quantifier sequences, to be of infinite length. In the course of thediscussion it will be seen that, while the expressive power of suchlanguages far exceeds that of their finitary (first-order)counterparts, very few of them possess the “attractive”features (e.g., compactness and completeness) of thelatter. Accordingly, the infinitary languages that do in fact possessthese features merit special attention.
In §1 the basic syntax and semantics of infinitary languages arelaid down; their expressive power is then displayed by means ofexamples. §2 is devoted to those infinitary languages whichpermit only finite quantifier sequences: these languages turn out tobe relatively well-behaved. §3 is devoted to a discussion ofthecompactness problem for infinitary languages and itsconnection with purely set-theoretical questions concerning“large” cardinal numbers. In §4 an argument issketched which shows that most “infinite quantifier”languages have asecond-order nature and are,ipsofacto, highly incomplete. §5 provides a brief account of acertain special class of sublanguages of infinitary languages forwhich a satisfactory generalization of the compactness theorem can beproved. This section includes a subsection on the definition ofadmissible sets. Historical and bibliographical remarks are providedin §6.
Given a pair κ, λ of infinite cardinals such that λ≤ κ, we define a class of infinitary languages in each ofwhich we may form conjunctions and disjunctions of sets of formulas ofcardinality < κ, and quantifications over sequences ofvariables of length < λ.
LetL — the (finitary)base language — be an arbitrary but fixed first-orderlanguage with any number of extralogical symbols. The infinitarylanguageL(κ,λ) hasthe followingbasic symbols:
The class ofpreformulas ofL(κ,λ) is defined recursively asfollows:
If Φ is a set of preformulas indexed by a setI, sayΦ = {φi :i ∈I},then we agree to write Φfor:
i∈I φ
or, ifI is the set of natural numbers, we write Φ for:
φ0 ∧φ1 ∧…
IfX is a set of individual variables indexed by an ordinalα, sayX = {xξ : ξ <α}, we agree to write(∃xξ)ξ<αφ for∃Xφ.
The logical operators ∨, →,↔ are defined in the customary manner. We also introduce theoperators (infinitarydisjunction) and ∀ (universal quantification)by
Φ=df ¬{¬φ : φ ∈ Φ}∀Xφ =df ¬∃X¬φ,
and employ similar conventions as for , ∃ .
ThusL(κ,λ) isthe infinitary language obtained fromL by permitting conjunctions and disjunctions of length< κ and quantifications[1] of length <λ. LanguagesL(κ,ω) are calledfinite-quantifierlanguages, the restinfinite-quantifier languages. ObservethatL(ω,ω) is justL itself.
Notice the followinganomaly which can arise in aninfinitary language but not in a finitary one. In the languageL(ω1,ω), whichallows countably infinite conjunctions but only finite quantifications,there are preformulas with so many free variables that they cannot be“closed” into sentences ofL(ω1,ω) by prefixing quantifiers.Such is the case, for example, for theL(ω1,ω)-preformula
x0 <x1 ∧ x1<x2 ∧ … ∧ xn <xn+1 …,
whereL contains the binaryrelation symbol <. For this reason we make the following
Definition. Aformula ofL(κ,λ) is a preformulawhich contains < λ free variables. The set of all formulas ofL(κ,λ) will bedenoted byForm(L(κ,λ)) or simplyForm(κ,λ) and the set of all sentences bySent(L(κ,λ)) or simplySent(κ,λ).
In this connection, observe that, in general, nothing would be gainedby considering “languages”L(κ,λ) with λ > κ. Forexample, in the “language”L(ω,ω1), formulas will have onlyfinitely many free variables, while there will be a host of “useless”quantifiers able to bind infinitely many free variables.[2]
Having defined the syntax ofL(κ,λ), we next sketch itssemantics. Since the extralogical symbols ofL(κ,λ) are just those ofL, and it is these symbols whichdetermine the form of the structures in which a given first-orderlanguage is to be interpreted, it is natural to define anL(κ,λ)-structure to besimply anL-structure. The notionof a formula ofL(κ,λ) beingsatisfied in anL-structureA (by a sequence of elements from the domainofA) is defined in the same inductive manneras for formulas ofL except thatwe must add two extra clauses corresponding to the clauses for Φ and ∃Xφ in thedefinition of preformula. In these two cases we naturally define:
Φ is satisfiedinA (by a given sequence) ⇔ for all φ ∈ Φ, φ is satisfied inA (by the sequence);∃Xφ is satisfied inA ⇔ there is a sequence of elements from the domain ofA in bijective correspondence withXwhich satisfies φ inA.
These informal definitions need to be tightened up in a rigorousdevelopment, but their meaning should be clear to the reader. Now theusual notions oftruth, validity, satisfiability, andmodel for formulas and sentences ofL(κ,λ) become available. In particular, ifA is anL-structure and σ ∈Sent(κ,λ), we shall writeA ⊨σ forA is a model of σ, and ⊨ σ for σis valid,that is, for allA,A ⊨σ. If Δ ⊆Sent(κ,λ), weshall write Δ ⊨ σ forσis a logical consequence of Δ, that is, eachmodel of Δ is a model of σ.
We now give some examples intended to display the expressive powerof the infinitary languagesL(κ,λ) with κ ≥ω1. In each case it is well-known that the notion inquestion cannot be expressed in any first-order language.
Characterization of the standard model of arithmeticinL(ω1,ω). Here thestandardmodel of arithmetic is the structureN =〈N, +, ·,s, 0〉, whereN is the setof natural numbers, +, ·, and 0 have their usual meanings, ands is the successor operation. LetL be the first-order language appropriate forN. Then the class ofL-structures isomorphic toN coincideswith the class of models of the conjunction of the followingL(ω1,ω)sentences (where0 is a name of 0):
m∈ωn∈ωsm0 +sn0 =sm+n0
m∈ωn∈ωsm0·sn0 =sm·n0
m∈ωn∈ω−{m}sm0≠sn0
∀xm∈ωx =sm0
The termssnx are defined recursively by
s0x = x sn+1x = s(snx)
Characterization of the class of all finite sets inL(ω1,ω).Here the base language has no extralogical symbols. The class of allfinite sets then coincides with the class of models of theL(ω1,ω)-sentence
n∈ω∃v0 …∃vn∀x(x =v0 ∨ … ∨x =vn).
Truth definition inL(ω1,ω)for a countablebase languageL. LetL be a countable first-orderlanguage (for example, the language of arithmetic or set theory) whichcontains a namen for each natural numbern, and let σ0, σ1, …be an enumeration of its sentences. Then theL(ω1,ω)-formula
Tr(x) =df n∈ω (x =n ∧σn)
is atruth predicate forL inasmuch as the sentence
Tr(n) ↔σn
is valid for eachn.
Characterization of well-orderings inL(ω1,ω1). The baselanguageL here includes a binarypredicate symbol ≤. Let σ1 be the usualL-sentence characterizing linearorderings. Then the class ofL-structures in which the interpretation of ≤ is awell-ordering coincides with the class of models of theL(ω1,ω1) sentenceσ = σ1 ∧σ2, where
σ2 =df(∀vn)n∈ω∃x [n∈ω (x =vn) ∧n∈ω(x ≤vn)].
Notice that the sentence σ2 contains aninfinitequantifier: it expresses the essentiallysecond-orderassertion that every countable subset has a least member. It can infact be shown that the presence of this infinite quantifier isessential: the class of well-ordered structures cannot be characterizedin any finite-quantifier language. This example indicates thatinfinite-quantifier languages such asL(ω1,ω1) behave ratherlike second-order languages; we shall see that they share the latters'defects (incompleteness) as well as some of their advantages (strongexpressive power).
Many extensions of first-order languages can betranslatedinto infinitary languages. For example, consider the generalizedquantifier languageL(Q0) obtained fromL by adding a new quantifier symbolQ0 and interpretingQ0xφ(x) asthere existinfinitely many x such that φ(x). It is easily seenthat the sentenceQ0xφ(x) hasthe same models as theL(ω1,ω)-sentence
¬n∈ω∃v0…∃vn∀x[φ(x) → (x = v0 ∨ … ∨x = vn)].
ThusL(Q0)is, in a natural sense, translatable intoL(ω1,ω). Another languagetranslatable intoL(ω1,ω) in this sense is theweak second-order language obtained by adding a countable setof monadic predicate variables toL which are then interpreted as ranging over allfinite sets of individuals.
Languages with arbitrarily long conjunctions, disjunctions and(possibly) quantifications may also be introduced. For a fixed infinitecardinal λ, the languageL(∞,λ) is defined by specifying its classof formulas,Form(∞,λ), to be the union,over all κ ≥ λ, of the setsForm(κ,λ). ThusL(∞,λ) allows arbitrarily long conjunctionsand disjunctions, in the sense that if Φ is an arbitrary subset ofForm(∞,λ), then both Φ and Φ are members ofForm(∞,λ). ButL(∞,λ) admits only quantifications oflength < λ: all its formulas have < λ freevariables. The languageL(∞,∞) is defined in turn by specifying itsclass of formulas,Form(∞,∞), to be theunion, over all infinite cardinals λ, of the classesForm(∞,λ). SoL(∞,∞) allows arbitrarily longquantifications in addition to arbitrarily long conjunctions anddisjunctions. Note thatForm(∞,λ) andForm(∞,∞) are proper classes in the senseof Gödel-Bernays set theory. Satisfaction of formulas ofL(∞,λ) andL(∞,∞) in a structure maybe defined by an obvious extension of the corresponding notion forL(κ,λ).
We have remarked that infinite-quantifier languages such asL(ω1,ω1) resemblesecond-order languages inasmuch as they allow quantification overinfinite sets of individuals. The fact that this is not permitted infinite-quantifier languages suggests that these may be in certainrespects closer to their first-order counterparts than might be evidentat first sight. We shall see that this is indeed the case, notably inthe case ofL(ω1,ω).
The languageL(ω1,ω) occupies a special placeamong infinitary languages because—like first-orderlanguages—it admits an effectivedeductive apparatus. Infact, let us add to the usual first-order axioms and rules of inferencethe new axiom scheme
Φ →φ
for any countable set Φ ⊆Form(ω1,ω) and any φ ∈Φ, together with the new rule of inference
φ0, φ1, …, φn, … n∈ωφn
and allow deductions to be of countable length. Writing ⊢* for deducibility in this sense, wethen have the
L(ω1,ω)-CompletenessTheorem. For any σ ∈Sent(ω1,ω), ⊨ σ ⇔ ⊢*σ
As an immediate corollary we infer that this deductive apparatus isadequate for deductions from countable sets of premises inL(ω1,ω).That is, with the obvious extension of notation, we have, for anycountable set Δ ⊆Sent(ω1,ω)
(2.1) Δ ⊨σ ⇔ Δ⊢*σ
This completeness theorem can be proved by modifying the usualHenkin completeness proof for first-order logic, or by employingBoolean-algebraic methods. Similar arguments, applied to suitablefurther augmentations of the axioms and rules of inference, yieldanalogous completeness theorems for many other finite-quantifierlanguages.
If just deductions of countable length are admitted, then nodeductive apparatus forL(ω1,ω) can be set up which isadequate for deductions fromarbitrary sets of premises, thatis, for which (2.1) would hold for every set Δ ⊆Sent(ω1,ω),regardless ofcardinality. This follows from the simple observation that thereis a first-order languageL andan uncountable set Γ ofL(ω1,ω)-sentences such thatΓhas no model but every countable subset of Γdoes. To see this, letLbe the language of arithmetic augmented by ω1 newconstant symbols {cξ : ξ< ω1} and let Γ be the set ofL(ω1,ω)-sentences {σ} ∪{cξ ≠cη : ξ ≠ η}, whereσ is theL(ω1,ω)-sentence characterizingthe standard model of arithmetic. This example also shows that thecompactness theorem fails forL(ω1,ω) and so also for anyL(κ,λ) with κ ≥ω1.
Another result which holds in the first-order case but fails forL(κ,ω) with κ≥ ω1 (and also forL(ω1,ω1), althoughthis is more difficult to prove) is theprenex normal formtheorem. A sentence isprenex if all its quantifiersappear at the front; we give an example of anL(ω1,ω)-sentence which is notequivalent to a conjunction of prenex sentences. LetL be the first-order language withoutextralogical symbols and let σ be theL(ω1,ω)-sentence whichcharacterizes the class of finite sets. Suppose that σ wereequivalent to a conjunction
i∈Iσi
of prenexL(ω1,ω)-sentencesσi. Then each σi isof the form
Q1x1 …Qnxnφi(x1,…,xn),
where eachQk is ∀ or ∃ andφi is a (possibly infinitary) conjunction ordisjunction of formulas of the formxk =xl orxk ≠xl. Since each σi is asentence, there are only finitely many variables in eachφi, and it is easy to see that eachφi is then equivalent to a first-order formula.Accordingly each σi may be taken to be afirst-order sentence. Since σ is assumed to be equivalent to theconjunction of the σi, it follows thatσ and the set Δ = {σi :i ∈I} have the same models. But obviouslyσ, and hence also Δ, have models of all finitecardinalities; the compactness theorem for sets of first-ordersentences now implies that Δ, and hence also σ, has aninfinite model, contradicting the definition of σ.
Turning to theLöwenheim-Skolem theorem, we find thatthedownward version has adequate generalizations toL(ω1,ω) (and,indeed, to all infinitary languages). In fact, one can show in much thesame way as for sets of first-order sentences that if Δ ⊆Sent(ω1,ω) has an infinitemodel of cardinality ≥ |Δ|, it has a model of cardinality thelarger of ℵ0, |Δ|.In particular, anyL(ω1,ω)-sentence with an infinitemodel has a countable model.
On the other hand, theupward Löwenheim-Skolem theoremin its usual formfails for all infinitary languages. Forexample, theL(ω1,ω)-sentence characterizingthe standard model of arithmetic has a model of cardinality ℵ0 but no models of any othercardinality. However, all is not lost here, as we shall see.
We define theHanf numberh(L) of alanguage L to be the least cardinal κ such that, if anL-sentence has a model of cardinality κ, it hasmodels of arbitrarily large cardinality. The existence ofh(L) is readily established. For eachL-sentence σ not possessing models ofarbitrarily large cardinality let κ(σ) be the leastcardinal κ such that σ does not have a model of cardinalityκ. If λ is the supremum of all the κ(σ), then,if a sentence ofL has a model of cardinalityλ, it has models of arbitrarily large cardinality.
Define the cardinals μ(α) recursively by
μ(0) = ℵ0 μ(α+1) = 2μ(α) μ(λ) = ∑α<λ μ(α),for limit λ.
Then it can be shown that
h(L(ω1,ω)) =μ(ω1),
similar results holding for other finite-quantifier languages. Thevalues of the Hanf numbers of infinite-quantifier languages such asL(ω1,ω1) are sensitiveto the presence or otherwise of large cardinals, but must in any casegreatly exceed that ofL(ω1,ω).
A result forL whichgeneralizes toL(ω1,ω) but to no otherinfinitary language is the
Craig Interpolation Theorem: Ifσ,τ ∈Sent(ω1,ω)are such that ⊨ σ →τ, then there is θ ∈Sent(ω1,ω) such that ⊨ σ → θ and ⊨ θ → τ, and eachextralogical symbol occurring in θ occurs in both σ andτ.
The proof is a reasonably straightforward extension of the first-ordercase.
Finally, we mention one further result which generalizes nicely toL(ω1,ω)but to no other infinitary language. It is well known that, ifA is any finiteL-structure with only finitely many relations, there isanL-sentence σcharacterizingA up to isomorphism. ForL(ω1,ω) we havethe following generalization known as
Scott's Isomorphism Theorem. IfA is a countableL-structure with only countably many relations, thenthere is anL(ω1,ω)-sentence whose class ofcountable models coincides with the class ofL-structures isomorphic withA.
The restriction tocountable structures is essential becausecountability cannot in general be expressed by anL(ω1,ω)-sentence.
The languageL(∞,ω) may also be counted as afinite-quantifier language. The concept of equivalence of structureswith respect to this language is of especial significance: we call two(similar) structuresA andB (∞,ω)-equivalent,writtenA ≡∞ωB, if the same sentences ofL(∞,ω) hold in bothA andB. Thisrelation can, first of all, be characterized in terms of the notion ofpartial isomorphism. Apartial isomorphism betweenA andB is anonempty familyP of maps such that:
If a partial isomorphism exists betweenAandB, we say thatA andB arepartially isomorphic and writeApB. Wethen have
Karp's Partial Isomorphism Theorem.
For any similar structuresA,B,A≡∞ωB ⇔ApB.
There is also a version of Scott's isomorphism theorem forL(∞,ω), namely,
(2.2) Given any structureA,there is anL(∞,ω)-sentence σ such that, for allstructuresB,ApB ⇔B ⊨ σ.
Partial isomorphism and (∞,ω)-equivalence are related tothe notion ofBoolean isomorphism. To define this we need tointroduce the idea of a Boolean-valued model of set theory. Given acomplete Boolean algebraB, theuniverseV(B)of B-valued sets, also knownas theB-extension of the universe V of sets, is obtained byfirst defining, recursively on α,
Vα(B) ={x:x is a function ∧ range(x) ⊆B ∧ ∃ξ<α[domain(x) ⊆Vξ(B)]}
and then setting
V(B) = {x:∃α(x ∈Vα(B))}.
Members ofV(B) are calledB-valuedsets. It is now easily seen that aB-valued set isprecisely aB-valued function with domain a set ofB-valued sets. Now letL be the first-orderlanguage of set theory and letL(B) be the language obtained byadding toL a name for each element ofV(B) (we shall use the same symbol for theelement and its name). One can now construct a mapping[·](B) of the (sentences of the) languageL(B) intoB: for eachsentence σ ofL(B), theelement [σ](B) ofB is the “Booleantruth value” of σ inV(B). Thismapping [·](B) is defined so as to send allthe theorems of Zermelo-Fraenkel set theory to the top element 1 ofB, i.e., to “truth”; accordingly,V(B) may be thought of as aBoolean-valued model of set theory. In general, if[σ](B) = 1, we say that σ isvalid inV(B), and writeV(B) ⊨σ.
Now eachx ∈V has a canonical representativex inV(B), satisfying
x =y iffV(B) ⊨x =y
x ∈y iffV(B) ⊨x ∈y
We say that two similar structuresA,B are Boolean isomorphic, writtenAbB, if, forsome complete Boolean algebraB, we haveV(B) ⊨AB, that is, if there is a Booleanextension of the universe of sets in which the canonicalrepresentatives ofA andB are isomorphic with Boolean value 1. It canthen be shown that:
(2.3)A≡∞ωB ⇔AbB.
This result can be strengthened through category-theoretic formulation.For this we require the concept of a(n) (elementary)topos. Tointroduce this concept, we start with the familiary category ofSet of sets and mappings.Set has thefollowing key properties:
All four of these conditions can be formulated in category-theoreticlanguage — a category satisfying them is called atopos.The categorySet is a topos; so also are (i) thecategorySet(B) of Boolean-valuedsets and mappings in any Boolean extensionV(B) of the universe of sets; (ii) thecategory of sheaves of sets on a topological space; (iii) the categoryof all diagrams of maps of sets
X0 →X1 →X2 → …
The objects of each of these categories may be regarded as setswhich arevarying in some manner: in case (i) over aBoolean algebra; in case (ii) over a topologicalspace; in case (iii) over (discrete) time. A topos may beconceived, then, as a universe of “variable” sets. The familiarcategorySet is the special limiting case of a toposin which the “variation” of the objects has been reduced to zero.
Just as in set theory, “logical operators” can be defined on thetruth-value object in any topos. These are maps ¬: Ω →Ω; ∧, ∨, ⇒: Ω × Ω → Ωcorresponding to the logical operations of negation, conjunction,disjunction and implication. With these operations, Ω becomes aHeyting algebra, thus embodying in general the laws not of classicalbut of intutionistic logic. In this sense intuitionistic logic is“internalized” in a topos: intuitionistic logic is the logic ofvariable sets. (Of course, classical logic is internalized in certaintoposes, for instanceSet andSet(B) for any complete BooleanalgebraB.)
Any topos may be conceived as possible “universe of discourse” inwhich mathematical assertions may be interpreted and mathematicalconstructions may be performed. Mathematical assertions are renderedinterpretable in a toposE byexpression withinE'sinternal language — a type-theoretic version of theusual language of set theory. In a manner analogous to Boolean-valuedvalidity, one can introduce an appropriate notion of validity inE of a sentence σ of itsinternal language. Again, we writeE ⊨ σ for“σ is valid inE”.
A toposE is said to befull if, for any setI, theI-foldcopower[3] ∐I1 of its terminal object exists inE. ∐I1 may be thought of as thecanonical representative inE ofthe setI; accordingly, we write it simply asI. (InV(B) this coincideswithI as previously defined.) All the toposesmentioned above are full.
Now letE be a full topos. IfA = (A,R, …) is astructure, writeA for(A,R, …). Two structuresA andB are said tobetopos isomorphic, writtenAtB, if, for some toposE defined over the category of sets, we haveE ⊨AB. In other words two structures are topos isomorphicif their canonical representatives are isomorphic in the internallanguage of some topos. It can then be shown that
(2.4)A≡∞ωB ⇔AtB.
Accordingly (∞,ω)-equivalence may be regarded asisomorphism in the extremely general context of universes of “variable”sets. In this respect (∞,ω)-equivalence is an “invariant”notion of isomorphism.
As we have seen, the compactness theorem in its usual form fails forall infinitary languages. Nevertheless, it is of some interest todetermine whether infinitary languages satisfy some suitably modifiedversion of the theorem. This so-calledcompactness problemturns out to have a natural connection with purely set-theoreticquestions involving “large” cardinal numbers.
We construct the following definition. Let κ be an infinitecardinal. A languageL is said to beκ-compact (resp.weaklyκ-compact) if whenever Δ is a set ofL-sentences (resp. a set ofL-sentences of cardinality ≤ κ) and eachsubset of Δ of cardinality < κ has a model, so doesΔ. Notice that the usual compactness theorem forL is precisely the assertion thatL is ω-compact. One reason for according significance to the κ-compactness property is the following. CallL κ-complete (resp.weakly κ-complete) if there is a deductive systemP forL with deductions oflength < κ such that, if Δ is aP-consistent[4] set ofL-sentences (resp. such that |Δ| ≤ κ),then Δ has a model. Observe that such aP will be adequate for deductions fromarbitrary sets of premises (of cardinality ≤ κ) in the senseof §2. It is easily seen that ifL isκ-complete or weakly κ-complete, thenL isκ-compact or weakly κ-compact. Thus, if we can show that agiven language isnot (weakly) κ-compact, then there canbe no deductive system for it with deductions of length < κadequate for deductions from arbitrary sets of premises (of cardinality≤ κ).
It turns out, in fact, that most languagesL(κ,λ) fail to be even weaklyκ-compact, and, for those that are, κ must be anexceedinglylarge cardinal. We shall need somedefinitions.
An infinite cardinal κ is said to beweaklyinaccessible if
(a) λ < κ → λ+ <κ, (where λ+ denotes the cardinal successor ofλ), and
(b) | I | < κ and λi <κ (for alli ∈I) ⇒∑i∈Iλi < κ.
If in addition
(c) λ < κ ⇒ 2λ <κ,
then κ is said to be (strongly)inaccessible.Since ℵ0 is inaccessible,it is normal practice to confine attention to those inaccessible, orweakly inaccessible, cardinals that exceed ℵ0. Accordingly, inaccessible or weaklyinaccessible cardinals will always be taken to beuncountable.It is clear that such cardinals—if they exist—must beextremely large; and indeed the Gödel incompleteness theoremimplies that the existence of even weakly inaccessible cardinals cannotbe proved from the usual axioms of set theory.
Let us call a cardinal κcompact (resp.weaklycompact) if the languageL(κ,κ) is κ-compact (resp. weaklyκ-compact). Then we have the following results:
(3.1) ℵ0is compact. This is, of course, just a succinct way ofexpressing the compactness theorem for first-order languages.(3.2) κ isweakly compact ⇒L(κ,ω) isweaklyκ-compact ⇒ κ isweaklyinaccessible. Accordingly, it is consistent (with the usual axiomsof set theory) to assume that no languageL(κ,ω) with κ ≥ ω1is weakly κ-compact, or,a fortiori, weaklyκ-complete.
(3.3) Suppose κ is inaccessible. Then κ isweaklycompact ⇔ L(κ,ω) isweaklyκ-compact. Also, Also κ is weakly compact ⇒there is a set of κinaccessibles beforeκ. Thus a weakly compact inaccessible cardinal is exceedinglylarge; in particular it cannot be the first, second, …,nth, … inaccessible.
(3.4) κis compact ⇒ κisinaccessible. (But, by the result immediately above, the conversefails.)
LetConstr stand for Gödel's axiom ofconstructibility; recall thatConstr is consistentwith the usual axioms of set theory.
(3.5)IfConstrholds, thenthere are no compact cardinals.(3.6)AssumeConstrand letκbe inaccessible. Then κis weaklycompact ⇔L(ω1,ω)is weaklyκ-compact for allL.
(3.7)IfConstrholds, then there areno cardinals κfor whichL(ω1,ω)is compact.Accordingly, it is consistent with the usual axioms of set theory tosuppose that there is no cardinal κ such that all languagesL(ω1,ω) areκ-complete. This result is to be contrasted with the fact thatall first-order languages are ω-complete.
The import of these results is that the compactness theorem failsvery badly for most languagesL(κ,λ) with κ ≥ω1.
Some historical remarks are in order here. In the 1930smathematicians investigated various versions of the so-calledmeasure problem for sets, a problem which arose in connectionwith the theory of Lebesgue measure on the continuum. In particular,the following very simple notion of measure was formulated. IfX is a set, a (countably additive two-valued nontrivial)measure onX is a map μ on the power setPX to the set {0, 1} satisfying:
(a) μ(X) = 1,(b) μ({x}) = μ(∅) = 0 for allx∈X, and
(c) ifA is any countable family ofmutually disjoint subsets ofX, then μ(∪A) =∑{μ(Y) :Y ∈A}.
Obviously, whether a given set supports such a measure depends onlyon its cardinality, so it is natural to define a cardinal κ to bemeasurable if all sets of cardinality κ support ameasure of this sort. It was quickly realized that a measurablecardinal must be inaccessible, but the falsity of the converse was notestablished until the 1960s when Tarski showed that measurablecardinals are weakly compact and his student Hanf showed that thefirst, second, etc. inaccessibles are not weakly compact (cf. (3.3)).Although the conclusion that measurable cardinals must be monstrouslylarge is now normally proved without making the detour through weakcompactness and infinitary languages, the fact remains that these ideaswere used to establish the result in the first instance.
Probably the most important result about first-order languages is theGödel completeness theorem which of course says that theset of all valid formulas of any first-order languageL can be generated from a simple setof axioms by means of a few straightforward rules of inference. A majorconsequence of this theorem is that, if the formulas ofL are coded as natural numbers in someconstructive way, then the set of (codes of) valid sentences isrecursively enumerable. Thus, the completeness of afirst-order language implies that the set of its valid sentences isdefinable in a particularly simple way. It would accordinglyseem reasonable, given anarbitrary languageL, to turn this implication around and suggest that,if the set of validL-sentences isnotdefinable in some simple fashion, thenno meaningfulcompleteness result can be established forL, or, aswe shall say, thatL isincomplete. In thissection we are going to employ this suggestion in sketching a proofthat “most”infinite quantifier languages are incomplete inthis sense.
Let us first introduce the formal notion ofdefinability asfollows. IfL is a language,A anL-structure, andX a subset of the domainA ofA, we say thatX isdefinableinA by a formula φ(x,y1,…,yn) ofL if there is a sequencea1,…,an of elements ofA such thatX is the subset of all elementsx ∈A for which φ(x,a1,…,an) holds inA.
Now writeVal(L) for the set of all thevalidL-sentences, i.e., those that hold ineveryL-structure. In order to assign a meaning to thestatement “Val(L) is definable”, we have tospecify
Then, if we identifyVal(L) with its image inC(L) under the coding map,we shall interpret the statement “Val(L) isdefinable” as the statement “Val(L), regardedas a subset of the domain ofC(L), is definable inC(L) by a formula ofL.”
For example, whenL is the first-order languageL of arithmetic, Gödeloriginally used as coding structure the standard model ofarithmetic ℕ and as coding map the well-knownfunction obtained from the prime factorization theorem for naturalnumbers. The recursive enumerability ofVal(L) then means simply that the set ofcodes (“Gödel numbers”) of members ofVal(L) is definable in ℕ by anL-formula of the form ∃yφ(x, y), where φ(x,y) is a recursive formula.
Another, equivalent, coding structure for the first-order languageof arithmetic is the structure[5]〈H(ω), ∈ ⨡H(ω)〉 ofhereditarily finite sets, where a setx ishereditarily finite ifx, its members, its members ofmembers, etc., are all finite. This coding structure takes account ofthe fact that first-order formulas are naturally regarded as finitesets.
Turning now to the case in whichL is an infinitarylanguageL(κ,λ),what would be a suitable coding structure in this case? We remarked atthe beginning that infinitary languages were suggested by thepossibility of thinking of formulas as set-theoretical objects, so letus try to obtain our coding structure by thinking about what kind ofset-theoretical objects we should take infinitary formulas to be. Giventhe fact that, for eachφ∈Form(κ,λ), φ and itssubformulas, subsubformulas, etc., are all of length <κ,[6] a moment's reflection reveals thatformulas ofL(κ,λ)“correspond” to setsx hereditarily of cardinality <κ in the sense thatx, its members, its members ofmembers, etc., are all of cardinality < κ. The collection ofall such sets is writtenH(κ).H(ω) isthe collection ofhereditarily finite sets introduced above,andH(ω1) that of allhereditarilycountable sets.
For simplicity let us suppose that the only extralogical symbol ofthe base languageL is the binarypredicate symbol∈ (the discussion is easily extended tothe case in whichL containsadditional extralogical symbols). Guided by the remarks above, ascoding structure forL(κ,λ) we take the structure,
H(κ)=df 〈H(κ), ∈ ⨡H(κ)〉.
Now we can define the coding map ofForm(κ,λ) intoH(κ). First, to each basic symbols ofL(κ,λ) we assign acode object⌈s⌉ ∈H(κ) asfollows. Let {vξ: ξ < κ} be anenumeration of the individual variables ofL(κ,λ).
Symbol Code Object Notation ¬ 1 ⌈¬⌉ ∧ 2 ⌈∧⌉ 3 ⌈⌉ ∃ 4 ⌈∃⌉ ∈ 5 ⌈∈⌉ = 6 ⌈=⌉ vξ 〈0,ξ〉 ⌈vξ⌉
Then, to each φ ∈Form(κ,λ) weassign the code object⌈φ⌉ recursively as follows:
⌈vξ =vη⌉ =df 〈⌈vξ⌉,⌈=⌉,⌈vη⌉〉,⌈vξ∈vη⌉ =df 〈⌈vξ⌉,⌈∈⌉,⌈vη⌉〉;
for φ, ψ ∈Form(κ,λ),
⌈φ ∧ ψ⌉ =df 〈⌈φ⌉,⌈∧⌉,⌈ψ⌉〉⌈¬φ⌉ =df 〈⌈¬⌉,⌈φ⌉〉
⌈∃Xφ⌉ =df 〈⌈∃⌉, {⌈x⌉:x ∈X},⌈φ⌉〉;
and finally if Φ ⊆Form(κ,λ)with |Φ|< κ,
⌈Φ⌉ =df 〈⌈⌉, {⌈φ⌉: φ ∈ Φ}〉.
The map φ ↦⌈φ⌉ fromForm(κ,λ) intoH(κ) is easily seen to be one-one and is the requiredcoding map. Accordingly, we agree to identifyVal(L(κ,λ)) with its image inH(κ) under this coding map.
When isVal(L(κ,λ)) adefinable subset ofH(κ)? In order to answer thisquestion we require the following definitions.
AnL-formula is called aΔ0-formula if it is equivalent to a formulain which all quantifiers are of the form∀x∈y or ∃x∈y(i.e., ∀x(x∈y → …)or ∃x(x∈y ∧ …)). AnL-formula is a Σ1-formula ifit is equivalent to one which can be built up from atomic formulas andtheir negations using only the logical operators ∧, ∨,∀x∈y, ∃x. A subsetX of a setA is said to be Δ0 (resp.Σ1)on A if it is definable in the structure〈A, ∈ ⨡A〉 by a Δ0- (resp.Σ1-) formula ofL.
For example, if we identify the set of natural numbers with the setH(ω) of hereditarily finite sets in the usual way, thenfor eachX ⊆H(ω) we have:
X is Δ0 onH(ω)⇔X is recursiveX is Σ1 onH(ω) ⇔X is recursively enumerable.
Thus the notions of Δ0- and Σ1-setmay be regarded as generalizations of the notions ofrecursiveandrecursively enumerable set, respectively.
The completeness theorem forLimplies thatVal(L)— regarded as a subset ofH(ω) — isrecursively enumerable, and hence Σ1 onH(ω). Similarly, the completeness theorem forL(ω1,ω) (see§2) implies thatVal(L(ω1,ω)) — regarded as asubset ofH(ω1) — isΣ1 onH(ω1). However, thispleasant state of affairs collapses completely as soon asL(ω1,ω1) is reached.For one can prove
Scott's Undefinability Theorem forL(ω1,ω1).Val(L(ω1,ω1))is notdefinable inH(ω1)even by anL(ω1,ω1)-formula;hence a fortioriVal(L(ω1,ω1))isnot Σ1onH(ω1).
This theorem is proved in much the same way as the well-known resultthat the set of (codes of) valid sentences of the second-order languageof arithemeticL2 isnot second-order definable in its coding structure ℕ. To get this latter result, one first observes that ℕ is characterized by a singleL2-sentence, and then showsthat, if the result were false, then “truth in ℕ” forL2-sentences would be definable by anL2-formula, therebyviolating Tarski's theorem on the undefinability of truth.
Accordingly, to prove Scott's undefinability theorem along the abovelines, one needs to establish:
(4.1)Characterizability of the coding structureH(ω1)inL(ω1,ω1): there is anL(ω1,ω1)-sentenceτ0 such that, for allL-structuresA,A ⊨ τ0 ⇔ AH(ω1).
(4.2)Undefinability of truth forL(ω1,ω1)-sentencesin the coding structure: there is noL(ω1,ω1)-formulaφ(v0) such that, for allL(ω1, ω1)-sentencesσ,H(ω1) ⊨ σ↔φ(⌈σ⌉).
(4.3)There is a termt(v0,v1)ofL(ω1,ω1)suchthat, for each pair of sentences σ, τofL(ω1,ω1),H(ω1) ⊨ [t(⌈σ⌉,⌈τ⌉) =⌈σ → τ⌉].
(4.1) is proved by analyzing the set-theoretic definition ofH(ω1) and showingthat it can be “internally” formulated inL(ω1,ω1). (4.2) isestablished in much the same way as Tarski's theorem on theundefinability of truth for first- or second-order languages. (4.3) isobtained by formalizing the definition of the coding map σ ↦⌈σ⌉ inL(ω1,ω1).
Armed with these facts, we can obtain Scott's undefinability theoremin the following way. Suppose it were false; then there would be anL(ω1,ω1)-formulaθ(v0) such that, for allL(ω1,ω1)-sentencesσ,
(4.4)H(ω1) ⊨ θ(⌈σ⌉) iff σ ∈Val(L(ω1,ω1)).
Let τ0 be the sentence given in (4.1). Then we have, forallL(ω1,ω1)-sentencesσ,
H(ω1) ⊨ σ iff (τ0 →σ) ∈Val(L(ω1,ω1)),
so that, by (4.4),
H(ω1) ⊨ σ iff H(ω1) ⊨ θ(⌈τ0 → σ⌉).
Ift is the term given in (4.3), it would follow that
H(ω1) ⊨ σ↔θ(t(⌈τ0⌉,⌈σ⌉)).
Now write φ(v0) for theL(ω1,ω1)-formulaθ(t(⌈τ0⌉,⌈σ⌉)). Then
H(ω1) ⊨ σ↔φ(⌈σ⌉),
contradicting (4.2), and completing the proof.
ThusVal(L(ω1,ω1)) is notdefinableeven by anL(ω1,ω1)-formula,soa fortioriL(ω1,ω1) isincomplete. Similar arguments show that Scott's undefinability theoremcontinues to hold when ω1 is replaced by any successorcardinal κ+; accordingly the languagesL(κ+,κ+) are allincomplete.[7]
Given what we now know about infinitary languages, it would seem thatL(ω1,ω) isthe only one to be reasonably well behaved. On the other hand, thefailure of the compactness theorem to generalize toL(ω1,ω) in anyuseful fashion is a severe drawback as far as applications areconcerned. Let us attempt to analyze this failure in more detail.
Recall from §4 that we may code the formulas of a first-orderlanguageL as hereditarily finitesets, i.e., as members ofH(ω). In that case each finiteset of (codes of)L-sentences isalso a member ofH(ω), and it follows that thecompactness theorem forL can bestated in the form:
(5.1) If Δ ⊆Sent(L) is such that each subsetΔ0 ⊆ Δ, Δ0 ∈H(ω) has a model, so does Δ.
Now it is well-known that (5.1) is an immediate consequence of thegeneralized completeness theorem forL, which, stated in a form similar to that of (5.1),becomes the assertion:
(5.2) If Δ ⊆Sent(L) and σ ∈Sent(L) satisfyΔ ⊨ σ, then there is adeductionD of σ from Δ such thatD ∈H(ω).[8]
In §2 we remarked that the compactness theorem forL(ω1,ω) failsvery strongly; in fact, we constructed a set Γ ⊆Sent(ω1,ω) such that
(5.3) Each countable subset of Γ has a model butΓ does not.
Recall also that we introduced the notion ofdeduction inL(ω1,ω); sincesuch deductions are of countable length it quickly follows from (5.3)that
(5.4) There is a sentence[9]σ ∈Sent(ω1,ω) such that Γ ⊨ σ, but there is nodeduction of σ inL(ω1,ω) fromΓ.
Now the formulas ofL(ω1, ω) can be coded as membersofH(ω1), and itis clear thatH(ω1) is closed under the formation ofcountable subsets and sequences. Accordingly (5.3) and (5.4) may bewritten:
(5.3bis) Each Γ0 ⊆ Γsuch that Γ0 ∈H(ω1) has a model, but Γ doesnot;(5.4bis) There is a sentence σ ∈Sent(ω1,ω) such that Γ ⊨ σ, but there is nodeductionD ∈H(ω1) of σ from Γ.
It follows that (5.1) and (5.2) fail when “L” is replaced by “L(ω1,ω)” and “H(ω)” by “H(ω1)”. Moreover, it can be shown thatthe set Γ ⊆Sent(ω1,ω) in (5.3bis) and (5.4bis) may be taken to beΣ1 onH(ω1). Thus thecompactness and generalized completeness theorems fail even forΣ1-sets ofL(ω1, ω)-sentences.
We see from (5.4bis) that the reason why the generalizedcompleteness theorem fails for Σ1-sets inL(ω1,ω) isthat, roughly speaking,H(ω1) is not “closed”under the formation of deductions from Σ1-sets ofsentences inH(ω1). So in order to remedythis it would seem natural to replaceH(ω1)by setsA which are, in some sense, closed under the formationof such deductions, and then to consider just those formulas whosecodes are inA.
We now give a sketch of how this can be done.
First, we identify the symbols and formulas ofL(ω1,ω) withtheir codes inH(ω1), as in §4. For eachcountable transitive[10] setA, let
LA =Form(L(ω1,ω)) ∩A.
We say thatLA is asublanguage ofL(ω1,ω) if thefollowing conditions are satisfied:
The notion of deduction inLA is defined in the customary way;if Δ is a set of sentences ofLA and φ ∈LA, then adeduction of φ from Δ inLA is a deduction of φ fromΔ inL(ω1,ω) every formula of which is inLA. We say that φ isdeducible from Δ inLA if there is a deductionD of φ from Δ inLA; under theseconditions we write Δ ⊢A φ. In general,D will not be a member ofA; inorder to ensure that such a deduction can be found inA itwill be necessary to impose further conditions onA.
LetA be a countable transitive set such thatLA is asublanguage ofL(ω1, ω) and let Δ be a setof sentences ofLA. We say thatA (or, byabuse of terminology,LA) is Δ-closed if,for any formula φ ofLA such that Δ ⊢A φ, there is adeductionD of φ from Δ such thatD ∈A. It can be shown that theonly countable language which is Δ-closed forarbitraryΔ is the first-order languageL, i.e., whenA =H(ω). HoweverJ. Barwise discovered that there are countable setsA ⊆H(ω1) whose corresponding languagesLA differ fromL and yet are Δ-closedfor all Σ1-sets of sentencesΔ. Such setsA are calledadmissible sets;roughly speaking, they are extensions of the hereditarily finite setsin which recursion theory—and hence proof theory—are still possible (for the full definition, see Section 5.1 below).
From Barwise's result one obtains immediately the
Barwise Compactness Theorem.LetAbe a countable admissible set and let Δbe a setof sentences ofLAwhich isΣ1on A.If each Δ′⊆ Δsuch that Δ′ ∈ Ahas amodel, then so does Δ.
The presence of “Σ1” here indicates that this theoremis a generalization of the compactness theorem forrecursivelyenumerable sets of sentences.
Another version of the Barwise compactness theorem, useful forconstructing models of set theory, is the following. LetZFC be the usual set of axioms for Zermelo-Fraenkelset theory, including the axiom of choice. Then we have:
5.5 Theorem.Let A be a countabletransitive set such thatA = 〈A,∈ ⨡A〉is a model ofZFC.If Δis a set of sentences ofLAwhich isdefinable inA by a formula of the language of settheory and if each Δ′ ⊆ Δsuchthat Δ′ ∈A has a model, so doesΔ.
To conclude, we give a simple application of this theorem. LetA = 〈A, ∈ ⨡A〉 bea model ofZFC. A modelB =〈B, E〉 ofZFC is said to be aproper end-extension ofA if (i)A ⊆B, (ii)A ≠B, (iii)a ∈A, b ∈B,bEa ⇒b∈A. Thus a proper end-extension of a model ofZFC is a proper extension in which no “new” elementcomes “before” any “old” element. As our application of5.5 we prove
5.6 Theorem.Each countabletransitive model ofZFC has a properend-extension.Proof. LetA =〈A, ∈ ⨡A〉 be a transitive model ofZFC and letL bethe first-order language of set theory augmented by a namea for eacha ∈A, and anadditional constantc. Let Δ be the set ofLA-sentencescomprising:
- all axioms ofZFC;
- c ≠a, for eacha∈A;
- ∀x(x∈a→b∈ax =b), for eacha ∈A;
- a∈b, for eacha ∈b ∈A.
It is easily shown that Δ is a subset ofA which isdefinable inA by a formula of the languageof set theory. Also, each subset Δ′ ⊆ Δ suchthat Δ′ ∈A has a model. For the setC of alla ∈A for whicha occurs in Δ′ belongs toA— since Δ′ does — and so, if we interpretc as any member of the (necessarily nonempty) setA − C, thenA is a model ofΔ′. Accordingly, (5.5) implies that Δ has a model〈B, E〉. If we interpret each constantaas the elementa ∈A, then 〈B, E〉is a proper end-extension ofA. The proof iscomplete.
The reader will quickly see that the first-order compactness theoremwill not yield this result.
A nonempty transitive setA is said to beadmissiblewhen the following conditions are satisfied:
Condition (ii) — the Δ0-separation scheme —is a restricted version of Zermelo's axiom of separation. Condition(iii) — a similarly weakened version of the axiom of replacement —may be called the Δ0-replacement scheme.
It is quite easy to see that ifA is a transitive set suchthat <A, ∈ |A> is a model ofZFC, thenA is admissible. More generally,the result continues to hold when the power set axiom is omitted fromZFC, so that bothH(ω) andH(ω1) are admissible. However, since thelatter is uncountable, the Barwise compactness theorem fails to applyto it.
§§1 and2. Infinitarypropositional and predicate languages seem to have made their firstexplicit appearance in print with the papers of Scott and Tarski [1958]and Tarski [1958]. The completeness theorem forL(ω1,ω), as well as for otherinfinitary languages, was proved by Karp [1964]. The Hanf numbercalculations forL(ω1,ω) were first performed byMorley [1965]. The nondefinability of well-orderings infinite-quantifier languages was proved by Karp [1965] and Lopez-Escobar[1966]. The interpolation theorem forL(ω1,ω) was proved byLopez-Escobar [1965] and Scott's isomorphism theorem forL(ω1,ω) byScott [1965]. Negri [2021] extends the idea of geometric logic originating intopos theory toL(ω1,ω).
Karp's partial isomorphism theorem was first proved in Karp [1965];see also Barwise [1973]. Result (2.2) appears in Chang [1968], result(2.3) in Ellentuck [1976] and result (2.4) in Bell [1981].
§3. Results (3.2) and (3.3) are due to Hanf[1964], with some refinements by Lopez-Escobar [1966] and Dickmann[1975], while (3.4) was proved by Tarski. Result (3.5) is due to Scott[1961], (3.6) to Bell [1970] and [1972]; and (3.7) to Bell [1974].Measurable cardinals were first considered by Ulam [1930] and Tarski[1939]. The fact that measurable cardinals are weakly compact was notedin Tarski [1962].
§4. Concerning the undefinability theoremforL(ω1,ω1).Carol Karp remarks (1964, 166), “At the International Congressof Logic, Methodology and the Philosophy of Science at StanfordUniversity in 1960, Dana Scott circulated an outline of a proof of theimpossibility of a complete definable formal system for (γ+,γ+) languages with a single two-place predicate symbol inaddition to the equality symbol.” Scott never published hisresult, and a fully detailed proof first appeared in Karp [1964].The approach to the theorem adopted here is based on the account givenin Dickmann [1975].
§5. The original motivation for the resultspresented in this section came from Kreisel; in his [1965] he pointedout that there were no compelling grounds for choosing infinitaryformulas solely on the grounds of “length”, and proposed instead thatdefinability or “closure” criteria be employed. Kreisel's suggestionwas taken up with great success by Barwise [1967], where hiscompactness theorem was proved. The notion of admissible set is due toPlatek [1966]. Theorem (5.6) is taken from Keisler [1974].
For further reading on the subject of infinitary languages, see Aczel[1973], Dickmann [1975], Karp [1964], Keisler [1974], and Makkai[1977]. A useful account of the connection between infinitarylanguages and large cardinals can be found in Chapter 10 of Drake[1974].
How to cite this entry. Preview the PDF version of this entry at theFriends of the SEP Society. Look up topics and thinkers related to this entry at the Internet Philosophy Ontology Project (InPhO). Enhanced bibliography for this entryatPhilPapers, with links to its database.
View this site from another server:
The Stanford Encyclopedia of Philosophy iscopyright © 2023 byThe Metaphysics Research Lab, Department of Philosophy, Stanford University
Library of Congress Catalog Data: ISSN 1095-5054