Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Von Neumann–Bernays–Gödel set theory

From Wikipedia, the free encyclopedia
System of mathematical set theory

In thefoundations of mathematics,von Neumann–Bernays–Gödel set theory (NBG) is anaxiomatic set theory that is aconservative extension ofZermelo–Fraenkel–choice set theory (ZFC). NBG introduces thenotion ofclass, which is a collection ofsets defined by aformula whosequantifiers range only over sets. NBG can define classes that are larger than sets, such as the class of all sets and the class of allordinals.Morse–Kelley set theory (MK) allows classes to be defined by formulas whose quantifiers range over classes. NBG is finitely axiomatizable, while ZFC and MK are not.

A key theorem of NBG is the class existence theorem, which states that for every formula whose quantifiers range only over sets, there is a class consisting of the sets satisfying the formula. This class is built by mirroring the step-by-step construction of the formula with classes. Since all set-theoretic formulas are constructed from two kinds ofatomic formulas (membership andequality) and finitely manylogical symbols, only finitely manyaxioms are needed to build the classes satisfying them. This is why NBG is finitely axiomatizable. Classes are also used for other constructions, for handling theset-theoretic paradoxes, and for stating theaxiom of global choice, which is stronger than ZFC'saxiom of choice.

John von Neumann introduced classes into set theory in 1925. Theprimitive notions of his theory werefunction andargument. Using these notions, he defined class and set.[1]Paul Bernays reformulated von Neumann's theory by taking class and set as primitive notions.[2]Kurt Gödel simplified Bernays' theory for hisrelative consistency proof of theaxiom of choice and thegeneralized continuum hypothesis.[3]

Classes in set theory

[edit]

The uses of classes

[edit]

Classes have several uses in NBG:

Axiom schema versus class existence theorem

[edit]

Once classes are added to the language of ZFC, it is easy to transform ZFC into a set theory with classes. First, theaxiom schema of class comprehension is added. This axiom schema states: For every formulaϕ(x1,,xn){\displaystyle \phi (x_{1},\ldots ,x_{n})} that quantifies only over sets, there exists a classA{\displaystyle A} consisting of then{\displaystyle n}-tuples satisfying the formula—that is,x1xn[(x1,,xn)Aϕ(x1,,xn)].{\displaystyle \forall x_{1}\cdots \,\forall x_{n}[(x_{1},\ldots ,x_{n})\in A\iff \phi (x_{1},\ldots ,x_{n})].} Then theaxiom schema of replacement is replaced by asingle axiom that uses a class. Finally, ZFC'saxiom of extensionality is modified to handle classes: If two classes have the same elements, then they are identical. The other axioms of ZFC are not modified.[8]

This theory is not finitely axiomatized. ZFC's replacement schema has been replaced by a single axiom, but the axiom schema of class comprehension has been introduced.

To produce a theory with finitely many axioms, the axiom schema of class comprehension is first replaced with finitely manyclass existence axioms. Then these axioms are used to prove the class existence theorem, which implies every instance of the axiom schema.[8] Theproof of this theorem requires only seven class existence axioms, which are used to convert the construction of a formula into the construction of a class satisfying the formula.

Axiomatization of NBG

[edit]

Classes and sets

[edit]

NBG has two types of objects: classes and sets. Intuitively, every set is also a class. There are two ways to axiomatize this.[non-primary source needed] Bernays usedmany-sorted logic with two sorts: classes and sets.[2] Gödel avoided sorts by introducing primitive predicates:Cls(A){\displaystyle {\mathfrak {Cls}}(A)} for "A{\displaystyle A} is a class" andM(A){\displaystyle {\mathfrak {M}}(A)} for "A{\displaystyle A} is a set" (in German, "set" isMenge). He also introduced axioms stating that every set is a class and that if classA{\displaystyle A} is a member of a class, thenA{\displaystyle A} is a set.[9] Using predicates is the standard way to eliminate sorts.Elliott Mendelson modified Gödel's approach by having everything be a class and defining the set predicateM(A){\displaystyle M(A)} asC(AC).{\displaystyle \exists C(A\in C).}[10] This modification eliminates Gödel's class predicate and his two axioms.

Bernays' two-sorted approach may appear more natural at first, but it creates a more complex theory.[b] In Bernays' theory, every set has two representations: one as a set and the other as a class. Also, there are twomembership relations: the first, denoted by "∈", is between two sets; the second, denoted by "η", is between a set and a class.[2] This redundancy is required by many-sorted logic because variables of different sorts range over disjoint subdomains of thedomain of discourse.

The differences between these two approaches do not affect what can be proved, but they do affect how statements are written. In Gödel's approach,AC{\displaystyle A\in C} whereA{\displaystyle A} andC{\displaystyle C} are classes is a valid statement. In Bernays' approach this statement has no meaning. However, ifA{\displaystyle A} is a set, there is an equivalent statement: Define "seta{\displaystyle a} represents classA{\displaystyle A}" if they have the same sets as members—that is,x(xaxηA).{\displaystyle \forall x(x\in a\iff x\;\eta \;A).} The statementaηC{\displaystyle a\;\eta \;C} where seta{\displaystyle a} represents classA{\displaystyle A} is equivalent to Gödel'sAC.{\displaystyle A\in C.}[2]

The approach adopted in this article is that of Gödel with Mendelson's modification. This means that NBG is anaxiomatic system infirst-order predicate logic withequality, and its onlyprimitive notions are of classes and the membership relation.

Definitions and axioms of extensionality and pairing

[edit]

A set is a class that belongs to at least one class:A{\displaystyle A} is a set if and only ifC(AC){\displaystyle \exists C(A\in C)}.A class that is not a set is called a proper class:A{\displaystyle A} is a proper class if and only ifC(AC){\displaystyle \forall C(A\notin C)}.[12]Therefore, every class is either a set or a proper class, and no class is both.

Gödel introduced the convention that uppercase variables range over classes, while lowercase variables range over sets.[9] Gödel also used names that begin with an uppercase letter to denote particular classes, including functions andrelations defined on the class of all sets. Gödel's convention is used in this article. It allows us to write:

The following axioms and definitions are needed for the proof of the class existence theorem.

Axiom of extensionality. If two classes have the same elements, then they are identical.

AB[x(xAxB)A=B]{\displaystyle \forall A\,\forall B\,[\forall x(x\in A\iff x\in B)\implies A=B]}[13]

This axiom generalizes ZFC'saxiom of extensionality to classes.

Axiom of pairing. Ifx{\displaystyle x} andy{\displaystyle y} are sets, then there exists a setp{\displaystyle p} whose only members arex{\displaystyle x} andy{\displaystyle y}.

xypz[zp(z=xz=y)]{\displaystyle \forall x\,\forall y\,\exists p\,\forall z\,[z\in p\iff (z=x\,\lor \,z=y)]}[14]

As in ZFC, the axiom ofextensionality implies the uniqueness of the setp{\displaystyle p}, which allows us to introduce the notation{x,y}.{\displaystyle \{x,y\}.}

Ordered pairs are defined by:

(x,y)={{x},{x,y}}{\displaystyle (x,y)=\{\{x\},\{x,y\}\}}

Tuples are definedinductively using ordered pairs:

(x1)=x1,{\displaystyle (x_{1})=x_{1},}
For n>1:(x1,,xn1,xn)=((x1,,xn1),xn).{\displaystyle {\text{For }}n>1\!:(x_{1},\ldots ,x_{n-1},x_{n})=((x_{1},\ldots ,x_{n-1}),x_{n}).}[c]

Class existence axioms and axiom of regularity

[edit]

Class existence axioms will be used to prove the class existence theorem: For every formula inn{\displaystyle n}free set variables that quantifies only over sets, there exists a class ofn{\displaystyle n}-tuples that satisfy it. The following example starts with two classes that arefunctions and builds acomposite function. This example illustrates the techniques that are needed to prove the class existence theorem, which lead to the class existence axioms that are needed.

Example 1: If the classesF{\displaystyle F} andG{\displaystyle G} are functions, then the composite functionGF{\displaystyle G\circ F} is defined by the formula:t[(x,t)F(t,y)G].{\displaystyle \exists t[(x,t)\in F\,\land \,(t,y)\in G].} Since this formula has two free set variables,x{\displaystyle x} andy,{\displaystyle y,} the class existence theorem constructs the class of ordered pairs:GF={(x,y):t[(x,t)F(t,y)G]}.{\displaystyle G\circ F\,=\,\{(x,y):\exists t[(x,t)\in F\,\land \,(t,y)\in G]\}.}

Because this formula is built from simpler formulas usingconjunction{\displaystyle \land } andexistential quantification{\displaystyle \exists }, classoperations are needed that take classes representing the simpler formulas and produce classes representing the formulas with{\displaystyle \land } and{\displaystyle \exists }. To produce a class representing a formula with{\displaystyle \land },intersection used sincexABxAxB.{\displaystyle x\in A\cap B\iff x\in A\land x\in B.} To produce a class representing a formula with{\displaystyle \exists }, thedomain is used sincexDom(A)t[(x,t)A].{\displaystyle x\in Dom(A)\iff \exists t[(x,t)\in A].}

Before taking the intersection, the tuples inF{\displaystyle F} andG{\displaystyle G} need an extra component so they have the same variables. The componenty{\displaystyle y} is added to the tuples ofF{\displaystyle F} andx{\displaystyle x} is added to the tuples ofG{\displaystyle G}:F={(x,t,y):(x,t)F}{\displaystyle F'=\{(x,t,y):(x,t)\in F\}\,} andG={(t,y,x):(t,y)G}{\displaystyle \,G'=\{(t,y,x):(t,y)\in G\}} In the definition ofF,{\displaystyle F',} the variabley{\displaystyle y} is not restricted by the statement(x,t)F,{\displaystyle (x,t)\in F,} soy{\displaystyle y} ranges over the classV{\displaystyle V} of all sets. Similarly, in the definition ofG,{\displaystyle G',} the variablex{\displaystyle x} ranges overV.{\displaystyle V.} So an axiom is needed that adds an extra component (whose values range overV{\displaystyle V}) to the tuples of a given class.

Next, the variables are put in the same order to prepare for the intersection:F={(x,y,t):(x,t)F}{\displaystyle F''=\{(x,y,t):(x,t)\in F\}\,} andG={(x,y,t):(t,y)G}{\displaystyle \,G''=\{(x,y,t):(t,y)\in G\}}To go fromF{\displaystyle F'} toF{\displaystyle F''} and fromG{\displaystyle G'} toG{\displaystyle G''} requires two differentpermutations, so axioms that support permutations of tuple components are needed.

The intersection ofF{\displaystyle F''} andG{\displaystyle G''} handles{\displaystyle \land }:FG={(x,y,t):(x,t)F(t,y)G}{\displaystyle F''\cap G''=\{(x,y,t):(x,t)\in F\,\land \,(t,y)\in G\}}

Since(x,y,t){\displaystyle (x,y,t)} is defined as((x,y),t){\displaystyle ((x,y),t)}, taking the domain ofFG{\displaystyle F''\cap G''} handlest{\displaystyle \exists t} and produces the composite function:GF=Dom(FG)={(x,y):t((x,t)F(t,y)G)}{\displaystyle G\circ F=Dom(F''\cap G'')=\{(x,y):\exists t((x,t)\in F\,\land \,(t,y)\in G)\}}So axioms of intersection and domain are needed.

The class existence axioms are divided into two groups: axioms handling language primitives and axioms handling tuples. There are four axioms in the first group and three axioms in the second group.[d]

Axioms for handling language primitives:

Membership. There exists a classE{\displaystyle E} containing all the ordered pairs whose first component is a member of the second component.

Exy[(x,y)Exy]{\displaystyle \exists E\,\forall x\,\forall y\,[(x,y)\in E\iff x\in y]\!}[18]

Intersection (conjunction). For any two classesA{\displaystyle A} andB{\displaystyle B}, there is a classC{\displaystyle C} consisting precisely of the sets that belong to bothA{\displaystyle A} andB{\displaystyle B}.

ABCx[xC(xAxB)]{\displaystyle \forall A\,\forall B\,\exists C\,\forall x\,[x\in C\iff (x\in A\,\land \,x\in B)]}[19]

Complement (negation). For any classA{\displaystyle A}, there is a classB{\displaystyle B} consisting precisely of the sets not belonging toA{\displaystyle A}.

ABx[xB¬(xA)]{\displaystyle \forall A\,\exists B\,\forall x\,[x\in B\iff \neg (x\in A)]}[20]

Domain (existential quantifier). For any classA{\displaystyle A}, there is a classB{\displaystyle B} consisting precisely of the first components of the ordered pairs ofA{\displaystyle A}.

ABx[xBy((x,y)A)]{\displaystyle \forall A\,\exists B\,\forall x\,[x\in B\iff \exists y((x,y)\in A)]}[21]

By the axiom of extensionality, classC{\displaystyle C} in the intersection axiom and classB{\displaystyle B} in the complement and domain axioms are unique. They will be denoted by:AB,{\displaystyle A\cap B,}A,{\displaystyle \complement A,} andDom(A),{\displaystyle Dom(A),} respectively.[e]

The first three axioms imply the existence of the empty class and the class of all sets: The membership axiom implies the existence of a classE.{\displaystyle E.} The intersection and complement axioms imply the existence ofEE{\displaystyle E\cap \complement E}, which is empty. By the axiom of extensionality, this class is unique; it is denoted by.{\displaystyle \emptyset .} The complement of{\displaystyle \emptyset } is the classV{\displaystyle V} of all sets, which is also unique by extensionality. The set predicateM(A){\displaystyle M(A)}, which was defined asC(AC){\displaystyle \exists C(A\in C)}, is now redefined asAV{\displaystyle A\in V} to avoid quantifying over classes.

Axioms for handling tuples:

Product byV{\displaystyle V}. For any classA{\displaystyle A}, there is a classB{\displaystyle B} consisting of the ordered pairs whose first component belongs toA{\displaystyle A}.

ABu[uBxy(u=(x,y)xA)]{\displaystyle \forall A\,\exists B\,\forall u\,[u\in B\iff \exists x\,\exists y\,(u=(x,y)\land x\in A)]}[23]

Circular permutation. For any classA{\displaystyle A}, there is a classB{\displaystyle B} whose 3‑tuples are obtained by applying the circular permutation(y,z,x)(x,y,z){\displaystyle (y,z,x)\mapsto (x,y,z)} to the 3‑tuples ofA{\displaystyle A}.

ABxyz[(x,y,z)B(y,z,x)A]{\displaystyle \forall A\,\exists B\,\forall x\,\forall y\,\forall z\,[(x,y,z)\in B\iff (y,z,x)\in A]}[24]

Transposition. For any classA{\displaystyle A}, there is a classB{\displaystyle B} whose 3‑tuples are obtained by transposing the last two components of the 3‑tuples ofA{\displaystyle A}.

ABxyz[(x,y,z)B(x,z,y)A]{\displaystyle \forall A\,\exists B\,\forall x\,\forall y\,\forall z\,[(x,y,z)\in B\iff (x,z,y)\in A]}[25]

By extensionality, the product byV{\displaystyle V} axiom implies the existence of a unique class, which is denoted byA×V.{\displaystyle A\times V.} This axiom is used to define the classVn{\displaystyle V^{n}} of alln{\displaystyle n}-tuples:V1=V{\displaystyle V^{1}=V} andVn+1=Vn×V.{\displaystyle V^{n+1}=V^{n}\times V.\,} IfA{\displaystyle A} is a class, extensionality implies thatAVn{\displaystyle A\cap V^{n}} is the unique class consisting of then{\displaystyle n}-tuples ofA.{\displaystyle A.} For example, the membership axiom produces a classE{\displaystyle E} that may contain elements that are not ordered pairs, while the intersectionEV2{\displaystyle E\cap V^{2}} contains only the ordered pairs ofE{\displaystyle E}.

The circular permutation and transposition axioms do not imply the existence of unique classes because they specify only the 3‑tuples of classB.{\displaystyle B.} By specifying the 3‑tuples, these axioms also specify then{\displaystyle n}-tuples forn4{\displaystyle n\geq 4} since:(x1,,xn2,xn1,xn)=((x1,,xn2),xn1,xn).{\displaystyle (x_{1},\ldots ,x_{n-2},x_{n-1},x_{n})=((x_{1},\ldots ,x_{n-2}),x_{n-1},x_{n}).} The axioms for handling tuples and the domain axiom imply the following lemma, which is used in the proof of the class existence theorem.

Tuple lemma

  1. AB1xyz[(z,x,y)B1(x,y)A]{\displaystyle \forall A\,\exists B_{1}\,\forall x\,\forall y\,\forall z\,[(z,x,y)\in B_{1}\iff (x,y)\in A]}
  2. AB2xyz[(x,z,y)B2(x,y)A]{\displaystyle \forall A\,\exists B_{2}\,\forall x\,\forall y\,\forall z\,[(x,z,y)\in B_{2}\iff (x,y)\in A]}
  3. AB3xyz[(x,y,z)B3(x,y)A]{\displaystyle \forall A\,\exists B_{3}\,\forall x\,\forall y\,\forall z\,[(x,y,z)\in B_{3}\iff (x,y)\in A]}
  4. AB4xyz[(y,x)B4(x,y)A]{\displaystyle \forall A\,\exists B_{4}\,\forall x\,\forall y\,\forall z\,[(y,x)\in B_{4}\iff (x,y)\in A]}
Proof

One more axiom is needed to prove the class existence theorem: theaxiom of regularity. Since the existence of the empty class has been proved, the usual statement of this axiom is given.[f]

Axiom of regularity. Every nonempty set has at least one element with which it has no element in common.a[au(uaua=)].{\displaystyle \forall a\,[a\neq \emptyset \implies \exists u(u\in a\land u\cap a=\emptyset )].}

This axiom implies that a set cannot belong to itself: Assume thatxx{\displaystyle x\in x} and leta={x}.{\displaystyle a=\{x\}.} Thenxa{\displaystyle x\cap a\neq \emptyset } sincexxa.{\displaystyle x\in x\cap a.} This contradicts the axiom of regularity becausex{\displaystyle x} is the only element ina.{\displaystyle a.} Therefore,xx.{\displaystyle x\notin x.} The axiom of regularity also prohibits infinite descending membership sequences of sets:xn+1xnx1x0.{\displaystyle \cdots \in x_{n+1}\in x_{n}\in \cdots \in x_{1}\in x_{0}.}

Gödel stated regularity for classes rather than for sets in his 1940 monograph, which was based on lectures given in 1938.[26] In 1939, he proved that regularity for sets implies regularity for classes.[27]

Class existence theorem

[edit]

Class existence theoremLetϕ(x1,,xn,Y1,,Ym){\displaystyle \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})} be a formula that quantifies only over sets and contains nofree variables other thanx1,,xn,Y1,,Ym{\displaystyle x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m}} (not necessarily all of these). Then for allY1,,Ym{\displaystyle Y_{1},\dots ,Y_{m}}, there exists a unique classA{\displaystyle A} ofn{\displaystyle n}-tuples such that:x1xn[(x1,,xn)Aϕ(x1,,xn,Y1,,Ym)].{\displaystyle \forall x_{1}\cdots \,\forall x_{n}[(x_{1},\dots ,x_{n})\in A\iff \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})].} The classA{\displaystyle A} is denoted by{(x1,,xn):ϕ(x1,,xn,Y1,,Ym)}.{\displaystyle \{(x_{1},\dots ,x_{n}):\phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})\}.}[g]

The theorem's proof will be done in two steps:

  1. Transformation rules are used to transform the given formulaϕ{\displaystyle \phi } into anequivalent formula that simplifies theinductive part of the proof. For example, the only logical symbols in the transformed formula are¬{\displaystyle \neg },{\displaystyle \land }, and{\displaystyle \exists }, so the induction handles logical symbols with just three cases.
  2. The class existence theorem is proved inductively for transformed formulas. Guided by the structure of the transformed formula, the class existence axioms are used to produce the unique class ofn{\displaystyle n}-tuples satisfying the formula.

Transformation rules. In rules 1 and 2 below,Δ{\displaystyle \Delta } andΓ{\displaystyle \Gamma } denote set or class variables. These two rules eliminate all occurrences of class variables before an{\displaystyle \in } and all occurrences of equality. Each time rule 1 or 2 is applied to a subformula,i{\displaystyle i} is chosen so thatzi{\displaystyle z_{i}} differs from the other variables in the current formula. The three rules are repeated until there are no subformulas to which they can be applied. This produces a formula that is built only with¬{\displaystyle \neg },{\displaystyle \land },{\displaystyle \exists },{\displaystyle \in }, set variables, and class variablesYk{\displaystyle Y_{k}} whereYk{\displaystyle Y_{k}} does not appear before an{\displaystyle \in }.

  1. YkΓ{\displaystyle \,Y_{k}\in \Gamma } is transformed intozi(zi=YkziΓ).{\displaystyle \exists z_{i}(z_{i}=Y_{k}\,\land \,z_{i}\in \Gamma ).}
  2. Extensionality is used to transformΔ=Γ{\displaystyle \Delta =\Gamma } intozi(ziΔziΓ).{\displaystyle \forall z_{i}(z_{i}\in \Delta \iff z_{i}\in \Gamma ).}
  3. Logical identities are used to transform subformulas containing,,,{\displaystyle \lor ,\implies ,\iff ,} and{\displaystyle \forall } to subformulas that only use¬,,{\displaystyle \neg ,\land ,} and.{\displaystyle \exists .}

Transformation rules:bound variables. Consider the composite function formula ofexample 1 with its free set variables replaced byx1{\displaystyle x_{1}} andx2{\displaystyle x_{2}}:t[(x1,t)F(t,x2)G].{\displaystyle \exists t[(x_{1},t)\in F\,\land \,(t,x_{2})\in G].} The inductive proof will removet{\displaystyle \exists t}, which produces the formula(x1,t)F(t,x2)G.{\displaystyle (x_{1},t)\in F\land (t,x_{2})\in G.} However, since the class existence theorem is stated for subscripted variables, this formula does not have the form expected by theinduction hypothesis. This problem is solved by replacing the variablet{\displaystyle t} withx3.{\displaystyle x_{3}.} Bound variables within nested quantifiers are handled by increasing the subscript by one for each successive quantifier. This leads to rule 4, which must be applied after the other rules since rules 1 and 2 produce quantified variables.

  1. If a formula contains no free set variables other thanx1,,xn,{\displaystyle x_{1},\dots ,x_{n},} then bound variables that are nested withinq{\displaystyle q} quantifiers are replaced withxn+q{\displaystyle x_{n+q}}. These variables have(quantifier) nesting depthq{\displaystyle q}.

Example 2:Rule 4 is applied to the formulaϕ(x1){\displaystyle \phi (x_{1})} that defines the class consisting of all sets of the form{,{,},}.{\displaystyle \{\emptyset ,\{\emptyset ,\dots \},\dots \}.} That is, sets that contain at least{\displaystyle \emptyset } and a set containing{\displaystyle \emptyset } — for example,{,{,a,b,c},d,e}{\displaystyle \{\emptyset ,\{\emptyset ,a,b,c\},d,e\}} wherea,b,c,d,{\displaystyle a,b,c,d,} ande{\displaystyle e} are sets.

ϕ(x1)=u[ux1¬v(vu)]w(wx1y[(yw¬z(zy)])ϕr(x1)=x2[x2x1¬x3(x3x2)]x2(x2x1x3[(x3x2¬x4(x4x3)]){\displaystyle {\begin{aligned}\phi (x_{1})\,&=\,\exists u\;\,[\,u\in x_{1}\,\land \,\neg \exists v\;\,(\;v\,\in \,u\,)]\,\land \,\,\exists w\;{\bigl (}w\in x_{1}\,\land \,\exists y\;\,[(\;y\,\in w\;\land \;\neg \exists z\;\,(\;z\,\in \,y\,)]{\bigr )}\\\phi _{r}(x_{1})\,&=\,\exists x_{2}[x_{2}\!\in \!x_{1}\,\land \,\neg \exists x_{3}(x_{3}\!\in \!x_{2})]\,\land \,\,\exists x_{2}{\bigl (}x_{2}\!\in \!x_{1}\,\land \,\exists x_{3}[(x_{3}\!\in \!x_{2}\,\land \,\neg \exists x_{4}(x_{4}\!\in \!x_{3})]{\bigr )}\end{aligned}}}

Sincex1{\displaystyle x_{1}} is the only free variable,n=1.{\displaystyle n=1.} The quantified variablex3{\displaystyle x_{3}} appears twice inx3x2{\displaystyle x_{3}\in x_{2}} at nesting depth 2. Its subscript is 3 becausen+q=1+2=3.{\displaystyle n+q=1+2=3.} If two quantifier scopes are at the same nesting depth, they are either identical or disjoint. The two occurrences ofx3{\displaystyle x_{3}} are in disjoint quantifier scopes, so they do not interact with each other.

Proof of the class existence theorem. The proof starts by applying the transformation rules to the given formula to produce a transformed formula. Since this formula is equivalent to the given formula, the proof is completed by proving the class existence theorem for transformed formulas.

Proof of the class existence theorem for transformed formulas

The following lemma is used in the proof.

Expansion lemmaLet1i<jn,{\displaystyle 1\leq i<j\leq n,} and letP{\displaystyle P} be a class containing all the ordered pairs(xi,xj){\displaystyle (x_{i},x_{j})} satisfyingR(xi,xj).{\displaystyle R(x_{i},x_{j}).} That is,P{(xi,xj):R(xi,xj)}.{\displaystyle P\supseteq \{(x_{i},x_{j}):R(x_{i},x_{j})\}.} ThenP{\displaystyle P} can be expanded into the unique classQ{\displaystyle Q} ofn{\displaystyle n}-tuples satisfyingR(xi,xj){\displaystyle R(x_{i},x_{j})}. That is,Q={(x1,,xn):R(xi,xj)}.{\displaystyle Q=\{(x_{1},\ldots ,x_{n}):R(x_{i},x_{j})\}.}

Proof:

  1. Ifi=1,{\displaystyle i=1,} letP1=P.{\displaystyle P_{1}=P.}
    Otherwise,i>1,{\displaystyle i>1,} so components are added in front ofxi:{\displaystyle x_{i}{\text{:}}} apply thetuple lemma's statement 1 toP{\displaystyle P} withz=(x1,,xi1).{\displaystyle z=(x_{1},\dots ,x_{i-1}).} This produces a classP1{\displaystyle P_{1}} containing all the(i+1){\displaystyle (i+1)}-tuples((x1,,xi1),xi,xj)=(x1,,xi1,xi,xj){\displaystyle ((x_{1},\dots ,x_{i-1}),x_{i},x_{j})=(x_{1},\dots ,x_{i-1},x_{i},x_{j})} satisfyingR(xi,xj).{\displaystyle R(x_{i},x_{j}).}
  2. Ifj=i+1,{\displaystyle j=i+1,} letP2=P1.{\displaystyle P_{2}=P_{1}.}
    Otherwise,j>i+1,{\displaystyle j>i+1,} so components are added betweenxi{\displaystyle x_{i}} andxj:{\displaystyle x_{j}{\text{:}}} add the componentsxi+1,,xj1{\displaystyle x_{i+1},\dots ,x_{j-1}} one by one using the tuple lemma's statement 2. This produces a classP2{\displaystyle P_{2}} containing all thej{\displaystyle j}-tuples(((((x1,,xi),xi+1),),xj1),xj)=(x1,,xj){\displaystyle (((\cdots ((x_{1},\dots ,x_{i}),x_{i+1}),\cdots ),x_{j-1}),x_{j})=(x_{1},\dots ,x_{j})} satisfyingR(xi,xj).{\displaystyle R(x_{i},x_{j}).}
  3. Ifj=n,{\displaystyle j=n,} letP3=P2.{\displaystyle P_{3}=P_{2}.}
    Otherwise,j<n,{\displaystyle j<n,} so components are added afterxj:{\displaystyle x_{j}{\text{:}}} add the componentsxj+1,,xn{\displaystyle x_{j+1},\dots ,x_{n}} one by one using the tuple lemma's statement 3. This produces a classP3{\displaystyle P_{3}} containing all then{\displaystyle n}-tuples((((x1,,xj),xj+1),),xn)=(x1,,xn){\displaystyle ((\cdots ((x_{1},\dots ,x_{j}),x_{j+1}),\cdots ),x_{n})=(x_{1},\dots ,x_{n})} satisfyingR(xi,xj).{\displaystyle R(x_{i},x_{j}).}
  4. LetQ=P3Vn.{\displaystyle Q=P_{3}\cap V^{n}.} Extensionality implies thatQ{\displaystyle Q} is the unique class ofn{\displaystyle n}-tuples satisfyingR(xi,xj).{\displaystyle R(x_{i},x_{j}).}

Class existence theorem for transformed formulasLetϕ(x1,,xn,Y1,,Ym){\displaystyle \phi (x_{1},\ldots ,x_{n},Y_{1},\ldots ,Y_{m})} be a formula that:

  1. contains no free variables other thanx1,,xn,Y1,,Ym{\displaystyle x_{1},\ldots ,x_{n},Y_{1},\ldots ,Y_{m}};
  2. contains only{\displaystyle \in },¬{\displaystyle \neg },{\displaystyle \land },{\displaystyle \exists }, set variables, and the class variablesYk{\displaystyle Y_{k}} whereYk{\displaystyle Y_{k}} does not appear before an{\displaystyle \in };
  3. only quantifies set variablesxn+q{\displaystyle x_{n+q}} whereq{\displaystyle q} is the quantifier nesting depth of the variable.

Then for allY1,,Ym{\displaystyle Y_{1},\dots ,Y_{m}}, there exists a unique classA{\displaystyle A} ofn{\displaystyle n}-tuples such that:x1xn[(x1,,xn)Aϕ(x1,,xn,Y1,,Ym)].{\displaystyle \forall x_{1}\cdots \,\forall x_{n}[(x_{1},\ldots ,x_{n})\in A\iff \phi (x_{1},\ldots ,x_{n},Y_{1},\ldots ,Y_{m})].}

Proof: Basis step:ϕ{\displaystyle \phi } has 0 logical symbols. The theorem's hypothesis implies thatϕ{\displaystyle \phi } is an atomic formula of the formxixj{\displaystyle x_{i}\in x_{j}} orxiYk.{\displaystyle x_{i}\in Y_{k}.}

Case 1: Ifϕ{\displaystyle \phi } isxixj{\displaystyle x_{i}\in x_{j}}, we build the classEi,j,n={(x1,,xn):xixj},{\displaystyle E_{i,j,n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in x_{j}\},} the unique class ofn{\displaystyle n}-tuples satisfyingxixj.{\displaystyle x_{i}\in x_{j}.}

Case a:ϕ{\displaystyle \phi } isxixj{\displaystyle x_{i}\in x_{j}} wherei<j.{\displaystyle i<j.} The axiom of membership produces a classP{\displaystyle P} containing all the ordered pairs(xi,xj){\displaystyle (x_{i},x_{j})} satisfyingxixj.{\displaystyle x_{i}\in x_{j}.} Apply the expansion lemma toP{\displaystyle P} to obtainEi,j,n={(x1,,xn):xixj}.{\displaystyle E_{i,j,n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in x_{j}\}.}

Case b:ϕ{\displaystyle \phi } isxixj{\displaystyle x_{i}\in x_{j}} wherei>j.{\displaystyle i>j.} The axiom of membership produces a classP{\displaystyle P} containing all the ordered pairs(xi,xj){\displaystyle (x_{i},x_{j})} satisfyingxixj.{\displaystyle x_{i}\in x_{j}.} Apply the tuple lemma's statement 4 toP{\displaystyle P} to obtainP{\displaystyle P'} containing all the ordered pairs(xj,xi){\displaystyle (x_{j},x_{i})} satisfyingxixj.{\displaystyle x_{i}\in x_{j}.} Apply the expansion lemma toP{\displaystyle P'} to obtainEi,j,n={(x1,,xn):xixj}.{\displaystyle E_{i,j,n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in x_{j}\}.}

Case c:ϕ{\displaystyle \phi } isxixj{\displaystyle x_{i}\in x_{j}} wherei=j.{\displaystyle i=j.} Since this formula is false by theaxiom of regularity, non{\displaystyle n}-tuples satisfy it, soEi,j,n=.{\displaystyle E_{i,j,n}=\emptyset .}

Case 2: Ifϕ{\displaystyle \phi } isxiYk{\displaystyle x_{i}\in Y_{k}}, we build the classEi,Yk,n={(x1,,xn):xiYk},{\displaystyle E_{i,Y_{k},n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in Y_{k}\},} the unique class ofn{\displaystyle n}-tuples satisfyingxiYk.{\displaystyle x_{i}\in Y_{k}.}

Case a:ϕ{\displaystyle \phi } isxiYk{\displaystyle x_{i}\in Y_{k}} wherei<n.{\displaystyle i<n.} Apply the axiom of product byV{\displaystyle V} toYk{\displaystyle Y_{k}} to produce the classP=Yk×V={(xi,xi+1):xiYk}.{\displaystyle P=Y_{k}\times V=\{(x_{i},x_{i+1}):x_{i}\in Y_{k}\}.} Apply the expansion lemma toP{\displaystyle P} to obtainEi,Yk,n={(x1,,xn):xiYk}.{\displaystyle E_{i,Y_{k},n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in Y_{k}\}.}

Case b:ϕ{\displaystyle \phi } isxiYk{\displaystyle x_{i}\in Y_{k}} wherei=n>1.{\displaystyle i=n>1.} Apply the axiom of product byV{\displaystyle V} toYk{\displaystyle Y_{k}} to produce the classP=Yk×V={(xi,xi1):xiYk}.{\displaystyle P=Y_{k}\times V=\{(x_{i},x_{i-1}):x_{i}\in Y_{k}\}.} Apply the tuple lemma's statement 4 toP{\displaystyle P} to obtainP=V×Yk={(xi1,xi):xiYk}.{\displaystyle P'=V\times Y_{k}=\{(x_{i-1},x_{i}):x_{i}\in Y_{k}\}.} Apply the expansion lemma toP{\displaystyle P'} to obtainEi,Yk,n={(x1,,xn):xiYk}.{\displaystyle E_{i,Y_{k},n}=\{(x_{1},\ldots ,x_{n}):x_{i}\in Y_{k}\}.}

Case c:ϕ{\displaystyle \phi } isxiYk{\displaystyle x_{i}\in Y_{k}} wherei=n=1.{\displaystyle i=n=1.} ThenEi,Yk,n=Yk.{\displaystyle E_{i,Y_{k},n}=Y_{k}.}

Inductive step:ϕ{\displaystyle \phi } hask{\displaystyle k} logical symbols wherek>0{\displaystyle k>0}. Assume the induction hypothesis that the theorem is true for allψ{\displaystyle \psi } with less thank{\displaystyle k} logical symbols. We now prove the theorem forϕ{\displaystyle \phi } withk{\displaystyle k} logical symbols. In this proof, the list of class variablesY1,,Ym{\displaystyle Y_{1},\dots ,Y_{m}} is abbreviated byY{\displaystyle {\vec {Y}}}, so a formula—such asϕ(x1,,xn,Y1,,Ym){\displaystyle \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})}—can be written asϕ(x1,,xn,Y).{\displaystyle \phi (x_{1},\dots ,x_{n},{\vec {Y}}).}

Case 1:ϕ(x1,,xn,Y)=¬ψ(x1,,xn,Y).{\displaystyle \phi (x_{1},\ldots ,x_{n},{\vec {Y}})=\neg \psi (x_{1},\ldots ,x_{n},{\vec {Y}}).} Sinceψ{\displaystyle \psi } hask1{\displaystyle k-1} logical symbols, the induction hypothesis implies that there is a unique classA{\displaystyle A} ofn{\displaystyle n}-tuples such that:(x1,,xn)Aψ(x1,,xn,Y).{\displaystyle \quad (x_{1},\ldots ,x_{n})\in A\iff \psi (x_{1},\ldots ,x_{n},{\vec {Y}}).} By the complement axiom, there is a classA{\displaystyle \complement A} such thatu[uA¬(uA)].{\displaystyle \forall u\,[u\in \complement A\iff \neg (u\in A)].} However,A{\displaystyle \complement A} contains elements other thann{\displaystyle n}-tuples ifn>1.{\displaystyle n>1.} To eliminate these elements, useVnA={\displaystyle \complement _{V^{n}}A=\,}AVn={\displaystyle \complement A\cap V^{n}=\,}VnA,{\displaystyle V^{n}\setminus A,} which is the complement relative to the classVn{\displaystyle V^{n}} of alln{\displaystyle n}-tuples.[e] Then, by extensionality,VnA{\displaystyle \complement _{V^{n}}A} is the unique class ofn{\displaystyle n}-tuples such that:(x1,,xn)VnA¬[(x1,,xn)A]¬ψ(x1,,xn,Y)ϕ(x1,,xn,Y).{\displaystyle {\begin{alignedat}{2}\quad &(x_{1},\ldots ,x_{n})\in \complement _{V^{n}}A&&\iff \neg [(x_{1},\ldots ,x_{n})\in A]\\&&&\iff \neg \psi (x_{1},\ldots ,x_{n},{\vec {Y}})\\&&&\iff \phi (x_{1},\ldots ,x_{n},{\vec {Y}}).\end{alignedat}}}

Case 2:ϕ(x1,,xn,Y)=ψ1(x1,,xn,Y)ψ2(x1,,xn,Y).{\displaystyle \phi (x_{1},\ldots ,x_{n},{\vec {Y}})=\psi _{1}(x_{1},\ldots ,x_{n},{\vec {Y}})\land \psi _{2}(x_{1},\ldots ,x_{n},{\vec {Y}}).} Since bothψ1{\displaystyle \psi _{1}} andψ2{\displaystyle \psi _{2}} have less thank{\displaystyle k} logical symbols, the induction hypothesis implies that there are unique classes ofn{\displaystyle n}-tuples,A1{\displaystyle A_{1}} andA2{\displaystyle A_{2}}, such that:

(x1,,xn)A1ψ1(x1,,xn,Y).(x1,,xn)A2ψ2(x1,,xn,Y).{\displaystyle {\begin{aligned}\quad &(x_{1},\ldots ,x_{n})\in A_{1}\iff \psi _{1}(x_{1},\ldots ,x_{n},{\vec {Y}}).\\&(x_{1},\ldots ,x_{n})\in A_{2}\iff \psi _{2}(x_{1},\ldots ,x_{n},{\vec {Y}}).\end{aligned}}}

By the axioms of intersection and extensionality,A1A2{\displaystyle A_{1}\cap A_{2}} is the unique class ofn{\displaystyle n}-tuples such that:(x1,,xn)A1A2(x1,,xn)A1(x1,,xn)A2ψ1(x1,,xn,Y)ψ2(x1,,xn,Y)ϕ(x1,,xn,Y).{\displaystyle {\begin{alignedat}{2}\quad &(x_{1},\ldots ,x_{n})\in A_{1}\cap A_{2}&&\iff (x_{1},\ldots ,x_{n})\in A_{1}\land (x_{1},\ldots ,x_{n})\in A_{2}\\&&&\iff \psi _{1}(x_{1},\ldots ,x_{n},{\vec {Y}})\land \psi _{2}(x_{1},\ldots ,x_{n},{\vec {Y}})\\&&&\iff \phi (x_{1},\ldots ,x_{n},{\vec {Y}}).\end{alignedat}}}

Case 3:ϕ(x1,,xn,Y)=xn+1ψ(x1,,xn,xn+1,Y).{\displaystyle \phi (x_{1},\ldots ,x_{n},{\vec {Y}})=\exists x_{n+1}\psi (x_{1},\ldots ,x_{n},x_{n+1},{\vec {Y}}).} The quantifier nesting depth ofψ{\displaystyle \psi } is one more than that ofϕ{\displaystyle \phi } and the additional free variable isxn+1.{\displaystyle x_{n+1}.} Sinceψ{\displaystyle \psi } hask1{\displaystyle k-1} logical symbols, the induction hypothesis implies that there is a unique classA{\displaystyle A} of(n+1){\displaystyle (n+1)}-tuples such that:(x1,,xn,xn+1)Aψ(x1,,xn,xn+1,Y).{\displaystyle \quad (x_{1},\ldots ,x_{n},x_{n+1})\in A\iff \psi (x_{1},\ldots ,x_{n},x_{n+1},{\vec {Y}}).}By the axioms of domain and extensionality,Dom(A){\displaystyle Dom(A)} is the unique class ofn{\displaystyle n}-tuples such that:[h](x1,,xn)Dom(A)xn+1[((x1,,xn),xn+1)A]xn+1[(x1,,xn,xn+1)A]xn+1ψ(x1,,xn,xn+1,Y)ϕ(x1,,xn,Y).{\displaystyle {\begin{alignedat}{2}\quad &(x_{1},\ldots ,x_{n})\in Dom(A)&&\iff \exists x_{n+1}[((x_{1},\ldots ,x_{n}),x_{n+1})\in A]\\&&&\iff \exists x_{n+1}[(x_{1},\ldots ,x_{n},x_{n+1})\in A]\\&&&\iff \exists x_{n+1}\,\psi (x_{1},\ldots ,x_{n},x_{n+1},{\vec {Y}})\\&&&\iff \phi (x_{1},\ldots ,x_{n},{\vec {Y}}).\end{alignedat}}}

Gödel pointed out that the class existence theorem "is ametatheorem, that is, a theorem about the system [NBG], not in the system …"[30] It is a theorem about NBG because it is proved in themetatheory by induction on NBG formulas. Also, its proof—instead of invoking finitely many NBG axioms—inductively describes how to use NBG axioms to construct a class satisfying a given formula. For every formula, this description can be turned into a constructive existence proof that is in NBG. Therefore, this metatheorem can generate the NBG proofs that replaceuses of NBG's class existence theorem.

Arecursivecomputer program succinctly captures the construction of a class from a given formula. The definition of this program does not depend on the proof of the class existence theorem. However, the proof is needed to prove that the class constructed by the program satisfies the given formula and is built using the axioms. This program is written inpseudocode that uses aPascal-stylecase statement.[i]functionClass(ϕ,n)input:ϕ is a transformed formula of the form ϕ(x1,,xn,Y1,,Ym);n specifies that a class of n-tuples is returned.output:class A of n-tuples satisfying x1xn[(x1,,xn)Aϕ(x1,,xn,Y1,,Ym)].begincaseϕofxixj:returnEi,j,n;// Ei,j,n={(x1,,xn):xixj}xiYk:returnEi,Yk,n;// Ei,Yk,n={(x1,,xn):xiYk}¬ψ:returnVnClass(ψ,n);// VnClass(ψ,n)=VnClass(ψ,n)ψ1ψ2:returnClass(ψ1,n)Class(ψ2,n);xn+1(ψ):returnDom(Class(ψ,n+1));// xn+1 is free in ψ; Class(ψ,n+1) // returns a class of (n+1)-tuplesendend{\displaystyle {\begin{array}{l}\mathbf {function} \;{\text{Class}}(\phi ,\,n)\\\quad {\begin{array}{rl}\mathbf {input} \!:\;\,&\phi {\text{ is a transformed formula of the form }}\phi (x_{1},\ldots ,x_{n},Y_{1},\ldots ,Y_{m});\\&n{\text{ specifies that a class of }}n{\text{-tuples is returned.}}\\\;\;\;\;\mathbf {output} \!:\;\,&{\text{class }}A{\text{ of }}n{\text{-tuples satisfying }}\\&\,\forall x_{1}\cdots \,\forall x_{n}[(x_{1},\ldots ,x_{n})\in A\iff \phi (x_{1},\ldots ,x_{n},Y_{1},\ldots ,Y_{m})].\end{array}}\\\mathbf {begin} \\\quad \mathbf {case} \;\phi \;\mathbf {of} \\\qquad {\begin{alignedat}{2}x_{i}\in x_{j}:\;\;&\mathbf {return} \;\,E_{i,j,n};&&{\text{// }}E_{i,j,n}\;\,=\{(x_{1},\dots ,x_{n}):x_{i}\in x_{j}\}\\x_{i}\in Y_{k}:\;\;&\mathbf {return} \;\,E_{i,Y_{k},n};&&{\text{// }}E_{i,Y_{k},n}=\{(x_{1},\dots ,x_{n}):x_{i}\in Y_{k}\}\\\neg \psi :\;\;&\mathbf {return} \;\,\complement _{V^{n}}{\text{Class}}(\psi ,\,n);&&{\text{// }}\complement _{V^{n}}{\text{Class}}(\psi ,\,n)=V^{n}\setminus {\text{Class}}(\psi ,\,n)\\\psi _{1}\land \psi _{2}:\;\;&\mathbf {return} \;\,{\text{Class}}(\psi _{1},\,n)\cap {\text{Class}}(\psi _{2},\,n);&&\\\;\;\;\;\,\exists x_{n+1}(\psi ):\;\;&\mathbf {return} \;\,Dom({\text{Class}}(\psi ,\,n+1));&&{\text{// }}x_{n+1}{\text{ is free in }}\psi ;{\text{ Class}}(\psi ,\,n+1)\\&\ &&{\text{// returns a class of }}(n+1){\text{-tuples}}\end{alignedat}}\\\quad \mathbf {end} \\\mathbf {end} \end{array}}}

Letϕ{\displaystyle \phi } be the formula ofexample 2. The function callA=Class(ϕ,1){\displaystyle A=Class(\phi ,1)} generates the classA,{\displaystyle A,} which is compared below withϕ.{\displaystyle \phi .} This shows that the construction of the classA{\displaystyle A} mirrors the construction of its defining formulaϕ.{\displaystyle \phi .}

ϕ=x2(x2x1¬x3(x3x2))x2(x2x1x3(x3x2¬x4(x4x3)))A=Dom(E2,1,2V2Dom(E3,2,3))Dom(E2,1,2Dom(E3,2,3V3Dom(E4,3,4))){\displaystyle {\begin{alignedat}{2}&\phi \;&&=\;\;\exists x_{2}\,(x_{2}\!\in \!x_{1}\land \;\;\neg \;\;\;\;\exists x_{3}\;(x_{3}\!\in \!x_{2}))\,\land \;\;\,\exists x_{2}\,(x_{2}\!\in \!x_{1}\land \;\;\,\exists x_{3}\,(x_{3}\!\in \!x_{2}\,\land \;\;\neg \;\;\;\;\exists x_{4}\;(x_{4}\!\in \!x_{3})))\\&A\;&&=Dom\,(\;E_{2,1,2}\;\cap \;\complement _{V^{2}}\,Dom\,(\;E_{3,2,3}\;))\,\cap \,Dom\,(\;E_{2,1,2}\;\cap \,Dom\,(\;\,E_{3,2,3}\;\cap \;\complement _{V^{3}}\,Dom\,(\;E_{4,3,4}\;)))\end{alignedat}}}

Extending the class existence theorem

[edit]

Gödel extended the class existence theorem to formulasϕ{\displaystyle \phi } containingrelations over classes (such asY1Y2{\displaystyle Y_{1}\subseteq Y_{2}} and theunary relationM(Y1){\displaystyle M(Y_{1})}), special classes (such asOrd{\displaystyle Ord}), andoperations (such as(x1,x2){\displaystyle (x_{1},x_{2})} andx1Y1{\displaystyle x_{1}\cap Y_{1}}).[32] To extend the class existence theorem, the formulas defining relations, special classes, and operations must quantify only over sets. Thenϕ{\displaystyle \phi } can be transformed into an equivalent formula satisfying thehypothesis of the class existence theorem.

The following definitions specify how formulas define relations, special classes, and operations:

  1. A relationR{\displaystyle R} is defined by:R(Z1,,Zk)ψR(Z1,,Zk).{\displaystyle R(Z_{1},\dots ,Z_{k})\iff \psi _{R}(Z_{1},\dots ,Z_{k}).}
  2. A special classC{\displaystyle C} is defined by:uCψC(u).{\displaystyle u\in C\iff \psi _{C}(u).}
  3. An operationP{\displaystyle P} is defined by:uP(Z1,,Zk)ψP(u,Z1,,Zk).{\displaystyle u\in P(Z_{1},\dots ,Z_{k})\iff \psi _{P}(u,Z_{1},\dots ,Z_{k}).}

Aterm is defined by:

  1. Variables and special classes are terms.
  2. IfP{\displaystyle P} is an operation withk{\displaystyle k} arguments andΓ1,,Γk{\displaystyle \Gamma _{1},\dots ,\Gamma _{k}} are terms, thenP(Γ1,,Γk){\displaystyle P(\Gamma _{1},\dots ,\Gamma _{k})} is a term.

The following transformation rules eliminate relations, special classes, and operations. Each time rule 2b, 3b, or 4 is applied to a subformula,i{\displaystyle i} is chosen so thatzi{\displaystyle z_{i}} differs from the other variables in the current formula. The rules are repeated until there are no subformulas to which they can be applied.Γ1,,Γk,Γ,{\displaystyle \,\Gamma _{1},\dots ,\Gamma _{k},\Gamma ,} andΔ{\displaystyle \Delta } denote terms.

  1. A relationR(Z1,,Zk){\displaystyle R(Z_{1},\dots ,Z_{k})} is replaced by its defining formulaψR(Z1,,Zk).{\displaystyle \psi _{R}(Z_{1},\dots ,Z_{k}).}
  2. LetψC(u){\displaystyle \psi _{C}(u)} be the defining formula for the special classC.{\displaystyle C.}
  3. LetψP(u,Z1,,Zk){\displaystyle \psi _{P}(u,Z_{1},\dots ,Z_{k})} be the defining formula for the operationP(Z1,,Zk).{\displaystyle P(Z_{1},\dots ,Z_{k}).}
  4. Extensionality is used to transformΔ=Γ{\displaystyle \Delta =\Gamma } intozi(ziΔziΓ).{\displaystyle \forall z_{i}(z_{i}\in \Delta \iff z_{i}\in \Gamma ).}
Example 3: TransformingY1Y2.{\displaystyle Y_{1}\subseteq Y_{2}.}

Y1Y2z1(z1Y1z1Y2)(rule 1){\displaystyle Y_{1}\subseteq Y_{2}\iff \forall z_{1}(z_{1}\in Y_{1}\implies z_{1}\in Y_{2})\quad {\text{(rule 1)}}}

Example 4: Transformingx1Y1x2.{\displaystyle x_{1}\cap Y_{1}\in x_{2}.}

x1Y1x2z1[z1=x1Y1z1x2](rule 3b)z1[z2(z2z1z2x1Y1)z1x2](rule 4)z1[z2(z2z1z2x1z2Y1)z1x2](rule 3a){\displaystyle {\begin{alignedat}{2}x_{1}\cap Y_{1}\in x_{2}&\iff \exists z_{1}[z_{1}=x_{1}\cap Y_{1}\,\land \,z_{1}\in x_{2}]&&{\text{(rule 3b)}}\\&\iff \exists z_{1}[\forall z_{2}(z_{2}\in z_{1}\iff z_{2}\in x_{1}\cap Y_{1})\,\land \,z_{1}\in x_{2}]&&{\text{(rule 4)}}\\&\iff \exists z_{1}[\forall z_{2}(z_{2}\in z_{1}\iff z_{2}\in x_{1}\land z_{2}\in Y_{1})\,\land \,z_{1}\in x_{2}]\quad &&{\text{(rule 3a)}}\\\end{alignedat}}}

This example illustrates how the transformation rules work together to eliminate an operation.

Class existence theorem (extended version)Letϕ(x1,,xn,Y1,,Ym){\displaystyle \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})} be a formula that quantifies only over sets, contains no free variables other thanx1,,xn,Y1,,Ym{\displaystyle x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m}}, and may contain relations, special classes, and operations defined by formulas that quantify only over sets. Then for allY1,,Ym,{\displaystyle Y_{1},\dots ,Y_{m},} there exists a unique classA{\displaystyle A} ofn{\displaystyle n}-tuples such thatx1xn[(x1,,xn)Aϕ(x1,,xn,Y1,,Ym)].{\displaystyle \forall x_{1}\cdots \,\forall x_{n}[(x_{1},\dots ,x_{n})\in A\iff \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})].}[j]

Proof

Apply the transformation rules toϕ{\displaystyle \phi } to produce an equivalent formula containing no relations, special classes, or operations. This formula satisfies the hypothesis of the class existence theorem. Therefore, for allY1,,Ym,{\displaystyle Y_{1},\dots ,Y_{m},} there is a unique classA{\displaystyle A} ofn{\displaystyle n}-tuples satisfyingx1xn[(x1,,xn)Aϕ(x1,,xn,Y1,,Ym)].{\displaystyle \forall x_{1}\cdots \,\forall x_{n}[(x_{1},\dots ,x_{n})\in A\iff \phi (x_{1},\dots ,x_{n},Y_{1},\dots ,Y_{m})].}

Set axioms

[edit]

The axioms of pairing and regularity, which were needed for the proof of the class existence theorem, have been given above. NBG contains four other set axioms. Three of these axioms deal with class operations being applied to sets.

Definition.F{\displaystyle F} is afunction ifFV2xyz[(x,y)F(x,z)Fy=z].{\displaystyle F\subseteq V^{2}\land \forall x\,\forall y\,\forall z\,[(x,y)\in F\,\land \,(x,z)\in F\implies y=z].}

In set theory, the definition of a function does not require specifying the domain or codomain of the function (seeFunction (set theory)). NBG's definition of function generalizes ZFC's definition from a set of ordered pairs to a class of ordered pairs.

ZFC's definitions of the set operations ofimage,union, andpower set are also generalized to class operations. The image of classA{\displaystyle A} under the functionF{\displaystyle F} isF[A]={y:x(xA(x,y)F)}.{\displaystyle F[A]=\{y:\exists x(x\in A\,\land \,(x,y)\in F)\}.} This definition does not require thatADom(F).{\displaystyle A\subseteq Dom(F).} The union of classA{\displaystyle A} isA={x:y(xyyA)}.{\displaystyle \cup A=\{x:\exists y(x\in y\,\,\land \,y\in A)\}.} The power class ofA{\displaystyle A} isP(A)={x:xA}.{\displaystyle {\mathcal {P}}(A)=\{x:x\subseteq A\}.} The extended version of the class existence theorem implies the existence of these classes. The axioms of replacement,union, andpower set imply that when these operations are applied to sets, they produce sets.[34]

Axiom of replacement. IfF{\displaystyle F} is a function anda{\displaystyle a} is a set, thenF[a]{\displaystyle F[a]}, theimage ofa{\displaystyle a} underF{\displaystyle F}, is a set.Fa[F is a functionby(ybx(xa(x,y)F))].{\displaystyle \forall F\,\forall a\,[F{\text{ is a function}}\implies \exists b\,\forall y\,(y\in b\iff \exists x(x\in a\,\land \,(x,y)\in F))].}

Not having the requirementADom(F){\displaystyle A\subseteq Dom(F)} in the definition ofF[A]{\displaystyle F[A]} produces a stronger axiom of replacement, which is used in the following proof.

Theorem (NBG'saxiom of separation)Ifa{\displaystyle a} is a set andB{\displaystyle B} is a subclass ofa,{\displaystyle a,} thenB{\displaystyle B} is a set.

Proof

The class existence theorem constructs therestriction of theidentity function toB{\displaystyle B}:IB={(x1,x2):x1Bx2=x1}.{\displaystyle I{\upharpoonright _{B}}=\{(x_{1},x_{2}):x_{1}\in B\land x_{2}=x_{1}\}.} Since the image ofa{\displaystyle a} underIB{\displaystyle I{\upharpoonright _{B}}} isB{\displaystyle B}, the axiom of replacement implies thatB{\displaystyle B} is a set. This proof depends on the definition of image not having the requirementaDom(F){\displaystyle a\subseteq Dom(F)} sinceDom(IB)=Ba{\displaystyle Dom(I{\upharpoonright _{B}})=B\subseteq a} rather thanaDom(IB).{\displaystyle a\subseteq Dom(I{\upharpoonright _{B}}).}

Axiom of union. Ifa{\displaystyle a} is a set, then there is a set containinga.{\displaystyle \cup a.}abx[y(xyya)xb].{\displaystyle \forall a\,\exists b\,\forall x\,[\,\exists y(x\in y\,\,\land \,y\in a)\implies x\in b\,].}

Axiom of power set. Ifa{\displaystyle a} is a set, then there is a set containingP(a).{\displaystyle {\mathcal {P}}(a).}

abx(xaxb).{\displaystyle \forall a\,\exists b\,\forall x\,(x\subseteq a\implies x\in b).}[k]

Theorem Ifa{\displaystyle a} is a set, thena{\displaystyle \cup a} andP(a){\displaystyle {\mathcal {P}}(a)} are sets.

Proof

The axiom of union states thata{\displaystyle \cup a} is a subclass of a setb{\displaystyle b}, so the axiom of separation impliesa{\displaystyle \cup a} is a set. Likewise, the axiom of power set states thatP(a){\displaystyle {\mathcal {P}}(a)} is a subclass of a setb{\displaystyle b}, so the axiom of separation implies thatP(a){\displaystyle {\mathcal {P}}(a)} is a set.

Axiom of infinity. There exists a nonempty seta{\displaystyle a} such that for allx{\displaystyle x} ina{\displaystyle a}, there exists ay{\displaystyle y} ina{\displaystyle a} such thatx{\displaystyle x} is a proper subset ofy{\displaystyle y}.a[u(ua)x(xay(yaxy))].{\displaystyle \exists a\,[\exists u(u\in a)\,\land \,\forall x(x\in a\implies \exists y(y\in a\,\land \,x\subset y))].}

The axioms of infinity and replacement prove the existence of theempty set. In thediscussion of the class existence axioms, the existence of the empty class{\displaystyle \emptyset } was proved. We now prove that{\displaystyle \emptyset } is a set. Let functionF={\displaystyle F=\emptyset } and leta{\displaystyle a} be the set given by the axiom of infinity. By replacement, the image ofa{\displaystyle a} underF{\displaystyle F}, which equals{\displaystyle \emptyset }, is a set.

NBG's axiom of infinity is implied by ZFC'saxiom of infinity:a[ax(xax{x}a)].{\displaystyle \,\exists a\,[\emptyset \in a\,\land \,\forall x(x\in a\implies x\cup \{x\}\in a)].\,} The firstconjunct of ZFC's axiom,a{\displaystyle \emptyset \in a}, implies the first conjunct of NBG's axiom. The second conjunct of ZFC's axiom,x(xax{x}a){\displaystyle \forall x(x\in a\implies x\cup \{x\}\in a)}, implies the second conjunct of NBG's axiom sincexx{x}.{\displaystyle x\subset x\cup \{x\}.} To prove ZFC's axiom of infinity from NBG's axiom of infinity requires some of the other NBG axioms (seeWeak axiom of infinity).[l]

Axiom of global choice

[edit]

The class concept allows NBG to have a stronger axiom of choice than ZFC. Achoice function is a functionf{\displaystyle f} defined on a sets{\displaystyle s} of nonempty sets such thatf(x)x{\displaystyle f(x)\in x} for allxs.{\displaystyle x\in s.} ZFC's axiom of choice states that there exists a choice function for every set of nonempty sets. A global choice function is a functionG{\displaystyle G} defined on the class of all nonempty sets such thatG(x)x{\displaystyle G(x)\in x} for every nonempty setx.{\displaystyle x.} The axiom of global choice states that there exists a global choice function. This axiom implies ZFC's axiom of choice since for every sets{\displaystyle s} of nonempty sets,G|s{\displaystyle G\vert _{s}} (therestriction ofG{\displaystyle G} tos{\displaystyle s}) is a choice function fors.{\displaystyle s.} In 1964,William B. Easton proved that global choice is stronger than the axiom of choice by usingforcing to construct amodel that satisfies the axiom of choice and all the axioms of NBG except the axiom of global choice.[38] The axiom of global choice is equivalent to every class having a well-ordering, while ZFC's axiom of choice is equivalent to every set having a well-ordering.[m]

Axiom of global choice. There exists a function that chooses an element from every nonempty set.

G[G is a functionx(xy(yx(x,y)G))].{\displaystyle \exists G\,[G{\text{ is a function}}\,\land \forall x(x\neq \emptyset \implies \exists y(y\in x\land (x,y)\in G))].}

History

[edit]
refer to caption
History of approaches that led to NBG set theory

Von Neumann's 1925 axiom system

[edit]

Von Neumann published an introductory article on his axiom system in 1925. In 1928, he provided a detailed treatment of his system.[39] Von Neumann based his axiom system on two domains ofprimitive objects: functions and arguments. These domains overlap—objects that are in both domains are called argument-functions. Functions correspond to classes in NBG, and argument-functions correspond to sets. Von Neumann's primitive operation isfunction application, denoted by [ax] rather thana(x) wherea is a function andx is an argument. This operation produces an argument. Von Neumann defined classes and sets using functions and argument-functions that take only two values,A andB. He definedx ∈ a if [ax] ≠ A.[1]

Von Neumann's work in set theory was influenced byGeorg Cantor's articles,Ernst Zermelo's 1908 axioms for set theory, and the 1922 critiques ofZermelo's set theory that were given independently byAbraham Fraenkel andThoralf Skolem. Both Fraenkel and Skolem pointed out that Zermelo's axioms cannot prove the existence of the set {Z0Z1Z2, ...} whereZ0 is the set ofnatural numbers andZn+1 is thepower set ofZn. They then introduced the axiom of replacement, which would guarantee the existence of such sets.[40][n] However, they were reluctant to adopt this axiom: Fraenkel stated "that Replacement was too strong an axiom for 'general set theory'", while "Skolem only wrote that 'we could introduce' Replacement".[42]

Von Neumann worked on the problems ofZermelo set theory and provided solutions for some of them:

  • A theory of ordinals
    • Problem: Cantor's theory ofordinal numbers cannot be developed in Zermelo set theory because it lacks the axiom of replacement.[o]
    • Solution: Von Neumann recovered Cantor's theory by defining the ordinals using sets that arewell-ordered by the ∈-relation,[p] and by using the axiom of replacement to prove key theorems about the ordinals, such as every well-ordered set isorder-isomorphic with an ordinal.[o] In contrast to Fraenkel and Skolem, von Neumann emphasized how important the replacement axiom is for set theory: "In fact, I believe that no theory of ordinals is possible at all without this axiom."[45]
  • A criterion identifying classes that are too large to be sets
    • Problem: Zermelo did not provide such a criterion. His set theory avoids the large classes that lead to theparadoxes, but it leaves out many sets, such as the one mentioned by Fraenkel and Skolem.[q]
    • Solution: Von Neumann introduced the criterion: A class is too large to be a set if and only if it can be mappedonto the classV of all sets. Von Neumann realized that the set-theoretic paradoxes could be avoided by not allowing such large classes to be members of any class. Combining this restriction with his criterion, he obtained hisaxiom of limitation of size: A classC is not a member of any class if and only ifC can be mapped ontoV.[48][r]
  • Finite axiomatization
    • Problem: Zermelo had used the imprecise concept of "definitepropositional function" inhis axiom of separation.
    • Solutions: Skolem introduced theaxiom schema of separation that was later used in ZFC, and Fraenkel introduced an equivalent solution.[50] However, Zermelo rejected both approaches "particularly because they implicitly involve the concept of natural number which, in Zermelo's view, should be based upon set theory."[s] Von Neumann avoidedaxiom schemas by formalizing the concept of "definite propositional function" with his functions, whose construction requires only finitely many axioms. This led to his set theory having finitely many axioms.[51] In 1961,Richard Montague proved that ZFC cannot be finitely axiomatized.[52]
  • The axiom of regularity
    • Problem: Zermelo set theory starts with the empty set and an infinite set, and iterates the axioms of pairing, union, power set, separation, and choice to generate new sets. However, it does not restrict sets to these. For example, it allows sets that are notwell-founded, such as a setx satisfyingx ∈ x.[t]
    • Solutions: Fraenkel introduced an axiom to exclude these sets. Von Neumann analyzed Fraenkel's axiom and stated that it was not "precisely formulated", but it would approximately say: "Besides the sets ... whose existence is absolutely required by the axioms, there are no further sets."[54] Von Neumann proposed the axiom of regularity as a way to exclude non-well-founded sets, but did not include it in his axiom system. In 1930, Zermelo became the first to publish an axiom system that included regularity.[u]

Von Neumann's 1929 axiom system

[edit]
refer to caption
John von Neumann

In 1929, von Neumann published an article containing the axioms that would lead to NBG.This article was motivated by his concern about the consistency of the axiom of limitation of size. He stated that this axiom "does a lot, actually too much." Besides implying the axioms of separation and replacement, and thewell-ordering theorem, it also implies that any class whosecardinality is less than that ofV is a set. Von Neumann thought that this last implication went beyond Cantorian set theory and concluded: "We must therefore discuss whether its [the axiom's] consistency is not even more problematic than an axiomatization of set theory that does not go beyond the necessary Cantorian framework."[57]

Von Neumann started his consistency investigation by introducing his 1929 axiom system, which contains all the axioms of his 1925 axiom system except the axiom of limitation of size. He replaced this axiom with two of its consequences, the axiom of replacement and a choice axiom. Von Neumann's choice axiom states: "Every relationR has a subclass that is a function with the same domain asR."[58]

LetS be von Neumann's 1929 axiom system. Von Neumann introduced the axiom systemS + Regularity (which consists ofS and the axiom of regularity) to demonstrate that his 1925 system isconsistent relative toS. He proved:

  1. IfS is consistent, thenS + Regularity is consistent.
  2. S + Regularity implies the axiom of limitation of size. Since this is the only axiom of his 1925 axiom system thatS + Regularity does not have,S + Regularity implies all the axioms of his 1925 system.

These results imply: IfS is consistent, then von Neumann's 1925 axiom system is consistent. Proof: IfS is consistent, thenS + Regularity is consistent (result 1). Usingproof by contradiction, assume that the 1925 axiom system is inconsistent, or equivalently: the 1925 axiom system implies a contradiction. SinceS + Regularity implies the axioms of the 1925 system (result 2),S + Regularity also implies a contradiction. However, this contradicts the consistency ofS + Regularity. Therefore, ifS is consistent, then von Neumann's 1925 axiom system is consistent.

SinceS is his 1929 axiom system, von Neumann's 1925 axiom system is consistent relative to his 1929 axiom system, which is closer to Cantorian set theory. The major differences between Cantorian set theory and the 1929 axiom system are classes and von Neumann's choice axiom. The axiom systemS + Regularity was modified by Bernays and Gödel to produce the equivalent NBG axiom system.

Bernays' axiom system

[edit]
Paul Bernays

In 1929,Paul Bernays started modifying von Neumann's new axiom system by taking classes and sets as primitives. He published his work in a series of articles appearing from 1937 to 1954.[59] Bernays stated that:

The purpose of modifying the von Neumann system is to remain nearer to the structure of the original Zermelo system and to utilize at the same time some of the set-theoretic concepts of theSchröder logic and ofPrincipia Mathematica which have become familiar to logicians. As will be seen, a considerable simplification results from this arrangement.[60]

Bernays handled sets and classes in atwo-sorted logic and introduced two membership primitives: one for membership in sets and one for membership in classes. With these primitives, he rewrote and simplified von Neumann's 1929 axioms. Bernays also included the axiom of regularity in his axiom system.[61]

Gödel's axiom system (NBG)

[edit]
refer to caption
Kurt Gödel,    c. 1926

In 1931, Bernays sent a letter containing his set theory toKurt Gödel.[36] Gödel simplified Bernays' theory by making every set a class, which allowed him to use just one sort and one membership primitive. He also weakened some of Bernays' axioms and replaced von Neumann's choice axiom with the equivalent axiom of global choice.[62][v] Gödel used his axioms in his 1940 monograph on the relative consistency of global choice and the generalized continuum hypothesis.[63]

Several reasons have been given for Gödel choosing NBG for his monograph:[w]

  • Gödel gave a mathematical reason—NBG's global choice produces a stronger consistency theorem: "This stronger form of the axiom [of choice], if consistent with the other axioms, implies, of course, that a weaker form is also consistent."[5]
  • Robert Solovay conjectured: "My guess is that he [Gödel] wished to avoid a discussion of the technicalities involved in developing the rudiments ofmodel theory within axiomatic set theory."[67][x]
  • Kenneth Kunen gave a reason for Gödel avoiding this discussion: "There is also a much more combinatorial approach toL [theconstructible universe], developed by ... [Gödel in his 1940 monograph] in an attempt to explain his work to non-logicians. ... This approach has the merit of removing all vestiges of logic from the treatment ofL."[68]
  • Charles Parsons provided a philosophical reason for Gödel's choice: "This view [that 'property of set' is a primitive of set theory] may be reflected in Gödel's choice of a theory with class variables as the framework for ... [his monograph]."[69]

Gödel's achievement together with the details of his presentation led to the prominence that NBG would enjoy for the next two decades.[70] In 1963,Paul Cohen proved hisindependence proofs for ZF with the help of some tools that Gödel had developed for his relative consistency proofs for NBG.[71] Later, ZFC became more popular than NBG. This was caused by several factors, including the extra work required to handleforcing in NBG,[72] Cohen's 1966 presentation of forcing, which used ZF,[73][y] and the proof that NBG is a conservative extension of ZFC.[z]

NBG, ZFC, and MK

[edit]

NBG is not logically equivalent to ZFC because its language is more expressive: it can make statements about classes, which cannot be made in ZFC. However, NBG and ZFC imply the same statements about sets. Therefore, NBG is aconservative extension of ZFC. NBG implies theorems that ZFC does not imply, but since NBG is a conservative extension, these theorems must involve proper classes. For example, it is a theorem of NBG that the global axiom of choice implies that the proper classV can be well-ordered and that every proper class can be put into one-to-one correspondence withV.[aa]

One consequence of conservative extension is that ZFC and NBG areequiconsistent. Proving this uses theprinciple of explosion: from acontradiction, everything is provable. Assume that either ZFC or NBG is inconsistent. Then the inconsistent theory implies the contradictory statements ∅ = ∅ and ∅ ≠ ∅, which are statements about sets. By the conservative extension property, the other theory also implies these statements. Therefore, it is also inconsistent. So although NBG is more expressive, it is equiconsistent with ZFC. This result together with von Neumann's 1929 relative consistency proof implies that his 1925 axiom system with the axiom of limitation of size is equiconsistent with ZFC. This completely resolvesvon Neumann's concern about the relative consistency of this powerful axiom since ZFC is within the Cantorian framework.

Even though NBG is a conservative extension of ZFC, a theorem may have a shorter and more elegant proof in NBG than in ZFC (or vice versa). For a survey of known results of this nature, seePudlák 1998.

Morse–Kelley set theory has an axiom schema of class comprehension that includes formulas whose quantifiers range over classes. MK is a stronger theory than NBG because MK proves the consistency of NBG,[76] whileGödel's second incompleteness theorem implies that NBG cannot prove the consistency of NBG.

For a discussion of someontological and other philosophical issues posed by NBG, especially when contrasted with ZFC and MK, see Appendix C ofPotter 2004.

Models

[edit]

ZFC, NBG, and MK havemodels describable in terms of thecumulative hierarchyVα and theconstructible hierarchyLα. LetV include aninaccessible cardinal κ, letXVκ, and let Def(X) denote the class offirst-order definable subsets ofX with parameters. In symbols where "(X,){\displaystyle (X,\in )}" denotes the model withdomainX{\displaystyle X} and relation{\displaystyle \in }, and "{\displaystyle \models }" denotes thesatisfaction relation:Def(X):={{xxX and (X,)ϕ(x,y1,,yn)}:ϕ is a first-order formula and y1,,ynX}.{\displaystyle \operatorname {Def} (X):={\Bigl \{}\{x\mid x\in X{\text{ and }}(X,\in )\models \phi (x,y_{1},\ldots ,y_{n})\}:\phi {\text{ is a first-order formula and }}y_{1},\ldots ,y_{n}\in X{\Bigr \}}.}

Then:

Category theory

[edit]

The ontology of NBG provides scaffolding for speaking about "large objects" without risking paradox. For instance, in some developments ofcategory theory, a "large category" is defined as one whoseobjects andmorphisms make up a proper class. On the other hand, a "small category" is one whose objects and morphisms are members of a set. Thus, we can speak of the "category of all sets" or "category of all small categories" without risking paradox since NBG supports large categories.

However, NBG does not support a "category of all categories" since large categories would be members of it and NBG does not allow proper classes to be members of anything. An ontological extension that enables us to talk formally about such a "category" is theconglomerate, which is a collection of classes. Then the "category of all categories" is defined by its objects: the conglomerate of all categories; and its morphisms: the conglomerate of all morphisms fromA toB whereA andB are objects.[83] On whether an ontology including classes as well as sets is adequate for category theory, seeMuller 2001.

Notes

[edit]
  1. ^Axiom of global choice explains why it is provably stronger.
  2. ^The historical development suggests that the two-sorted approach does appear more natural at first. In introducing his theory, Bernays stated: "According to the leading idea of von Neumann set theory we have to deal with two kinds of individuals, which we may distinguish assets andclasses."[11]
  3. ^Gödel defined(x1,x2,,xn)=(x1,(x2,,xn)){\displaystyle (x_{1},x_{2},\ldots ,x_{n})=(x_{1},(x_{2},\ldots ,x_{n}))}.[15] This affects the statements of some of his definitions, axioms, and theorems. This article uses Mendelson's definition.[16]
  4. ^Bernays' class existence axioms specify unique classes. Gödel weakened all but three of Bernays' axioms (intersection, complement, domain) by replacingbiconditionals withimplications, which means they specify only the ordered pairs or the 3-tuples of the class. The axioms in this section are Gödel's except for Bernays' strongerproduct by V axiom, which specifies a unique class of ordered pairs. Bernays' axiom simplifies the proof of theclass existence theorem. Gödel's axiom B6 appears as the fourth statement of thetuple lemma. Bernays later realized that one of his axioms is redundant, which implies that one of Gödel's axioms is redundant. Using the other axioms, axiom B6 can be proved from axiom B8, and B8 can be proved from B6, so either axiom can be considered the redundant axiom.[17] The names for the tuple-handling axioms are from the French Wikipédia article:Théorie des ensembles de von Neumann.
  5. ^abThis article usesBourbaki's complement notationA{\displaystyle \complement A} and relative complement notationXA=AX{\displaystyle \complement _{X}A=\complement A\cap X}.[22] This prefix relative complement notation is used by the class existence theorem tomirror the prefix logical not (¬{\displaystyle \neg }).
  6. ^Since Gödel states this axiom before he proves the existence of the empty class, he states it without using the empty class.[5]
  7. ^The proofs in this and the next section come from Gödel's proofs, which he gave at theInstitute for Advanced Study where he "could count upon an audience well versed inmathematical logic".[28] To make Gödel's proofs more accessible to Wikipedia readers, a few modifications have been made. The goal in this and the next section is to prove Gödel's M4, his fourth class existence theorem. The proof in this section mostly follows the M1 proof,[29] but it also uses techniques from the M3 and M4 proofs. The theorem is stated with class variables rather than M1's symbols for special classes (universal quantification over the class variables is equivalent to being true for any instantiation of the class variables). The major differences from the M1 proof are: unique classes ofn{\displaystyle n}-tuples are generated at the end of the basis and inductive steps (which require Bernays' stronger product byV{\displaystyle V} axiom), andbound variables are replaced by subscripted variables that continue the numbering of the free set variables. Since bound variables are free for part of the induction, this guarantees that, when they are free, they are treated the same as the original free variables. One of the benefits of this proof is theexample output of the function Class, which shows that a class's construction mirrors its defining formula's construction.
  8. ^One detail has been left out of this proof. Gödel's convention is being used, soxϕ(x){\displaystyle \exists x\,\phi (x)} is defined to bex[C(xC)ϕ(x)].{\displaystyle \exists x[\exists C(x\in C)\land \phi (x)].} Since this formula quantifies over classes, it must be replaced with the equivalentx[xVϕ(x)].{\displaystyle \exists x[x\in V\land \phi (x)].} Then the three formulas in the proof having the formxn+1[xn+1]{\displaystyle \exists x_{n+1}[x_{n+1}\land \dots ]} becomexn+1[xn+1V],{\displaystyle \exists x_{n+1}[x_{n+1}\in V\land \dots ],} which produces a valid proof.
  9. ^Recursive computer programs written in pseudocode have been used elsewhere inpure mathematics. For example, they have been used to prove theHeine-Borel theorem and other theorems ofanalysis.[31]
  10. ^This theorem is Gödel's theorem M4. He proved it by first proving M1, a class existence theorem that uses symbols for special classes rather than free class variables. M1 produces a class containing all then{\displaystyle n}-tuples satisfyingϕ{\displaystyle \phi }, but which may contain elements that are notn{\displaystyle n}-tuples. Theorem M2 extends this theorem to formulas containing relations, special classes, and operations. Theorem M3 is obtained from M2 by replacing the symbols for special classes with free variables. Gödel used M3 to defineA×B={x:yz[x=(y,z)yAzB]},{\displaystyle A\times B=\{x:\exists y\exists z[x=(y,z)\land y\in A\land z\in B]\},} which is unique by extensionality. He usedA×B{\displaystyle A\times B} to defineVn.{\displaystyle V^{n}.} Theorem M4 is obtained from M3 by intersecting the class produced by M3 withVn{\displaystyle V^{n}} to produce the unique class ofn{\displaystyle n}-tuples satisfying the given formula. Gödel's approach, especially his use of M3 to defineA×B{\displaystyle A\times B}, eliminates the need for Bernays' stronger form of the product byV{\displaystyle V} axiom.[33]
  11. ^Gödel weakened Bernays' axioms of union and power set, which state the existence of these sets, to the above axioms that state there is a set containing the union and a set containing the power set.[35] Bernays published his axioms after Gödel, but had sent them to Gödel in 1931.[36]
  12. ^Since ZFC's axiom requires the existence of the empty set, an advantage of NBG's axiom is that the axiom of the empty set is not needed. Mendelson's axiom system uses the ZFC's axiom of infinity and also has the axiom of the empty set.[37]
  13. ^ForV{\displaystyle V} having a well-ordering implying global choice, seeImplications of the axiom of limitation of size. For global choice implying the well-ordering of any class, seeKanamori 2009, p. 53.
  14. ^In 1917,Dmitry Mirimanoff published a form of replacement based on cardinal equivalence.[41]
  15. ^abIn 1928, von Neumann stated: "A treatment of ordinal number closely related to mine was known to Zermelo in 1916, as I learned subsequently from a personal communication. Nevertheless, the fundamental theorem, according to which to each well-ordered set there is a similar ordinal, could not be rigorously proved because the replacement axiom was unknown."[43]
  16. ^von Neumann 1923. Von Neumann's definition also used the theory of well-ordered sets. Later, his definition was simplified to the current one: An ordinal is atransitive set that is well-ordered by ∈.[44]
  17. ^After introducing thecumulative hierarchy, von Neumann could show that Zermelo's axioms do not prove the existence of ordinals α ≥ ω + ω, which include uncountably manyhereditarily countable sets. This follows from Skolem's result that Vω+ω satisfies Zermelo's axioms[46] and from α ∈ Vβ implying α < β.[47]
  18. ^Von Neumann stated his axiom in an equivalent functional form.[49]
  19. ^Skolem's approach implicitly involves natural numbers because the formulas of an axiom schema are built usingstructural recursion, which is a generalization ofmathematical recursion over the natural numbers.
  20. ^Mirimanoff defined well-founded sets in 1917.[53]
  21. ^Akihiro Kanamori points out that Bernays lectured on his axiom system in 1929-1930 and states that "… he and Zermelo must have arrived at the idea of incorporating Foundation [regularity] almost at the same time."[55] However, Bernays did not publish the part of his axiom system containing regularity until 1941.[56]
  22. ^Proof that von Neumann's axiom implies global choice: LetR={(x,y):xyx}.{\displaystyle R=\{(x,y):x\neq \emptyset \land y\in x\}.} Von Neumann's axiom implies there is a functionGR{\displaystyle G\subseteq R} such thatDom(G)=Dom(R).{\displaystyle Dom(G)=Dom(R).} The functionG{\displaystyle G} is a global choice function since for all nonempty setsx,{\displaystyle x,}G(x)x.{\displaystyle G(x)\in x.}
    Proof that global choice implies von Neumann's axiom: LetG{\displaystyle G} be a global choice function, and letR{\displaystyle R} be a relation. ForxDom(R),{\displaystyle x\in Dom(R),} letα(x)=least{α:y[(x,y)RVα]}{\displaystyle \alpha (x)={\text{least}}\,\{\alpha :\exists y[(x,y)\in R\cap V_{\alpha }]\}} whereVα{\displaystyle V_{\alpha }} is the set of all sets havingrank less thanα.{\displaystyle \alpha .} Letzx={y:(x,y)RVα(x)}.{\displaystyle z_{x}=\{y:(x,y)\in R\cap V_{\alpha (x)}\}.} ThenF={(x,G(zx)):xDom(R)}{\displaystyle F=\{(x,G(z_{x})):x\in Dom(R)\}} is a function that satisfies von Neumann's axiom sinceFR{\displaystyle F\subseteq R} andDom(F)=Dom(R).{\displaystyle Dom(F)=Dom(R).}
  23. ^Gödel used von Neumann's 1929 axioms in his 1938 announcement of his relative consistency theorem and stated "A corresponding theorem holds ifT denotes the system ofPrincipia mathematica".[64] His 1939 sketch of his proof is for Zermelo set theory and ZF.[65] Proving a theorem in multipleformal systems was not unusual for Gödel. For example, he proved hisincompleteness theorem for the system ofPrincipia mathematica, but pointed out that it "holds for a wide class of formal systems ...".[66]
  24. ^Gödel's consistency proof builds theconstructible universe. To build this in ZF requires some model theory. Gödel built it in NBG without model theory. For Gödel's construction, seeGödel 1940, pp. 35–46 orCohen 1966, pp. 99–103.
  25. ^Cohen also gave a detailed proof of Gödel's relative consistency theorems using ZF.[74]
  26. ^In the 1960s, this conservative extension theorem was proved independently by Paul Cohen,Saul Kripke, and Robert Solovay. In his 1966 book, Cohen mentioned this theorem and stated that its proof requires forcing. It was also proved independently byRonald Jensen and Ulrich Felgner, who published his proof in 1971.[75]
  27. ^Both conclusions follow from the conclusion that every proper class can be put into one-to-one correspondence with the class of all ordinals. A proof of this is outlined inKanamori 2009, p. 53.
  28. ^Easton built amodel of Mendelson's version of NBG in which ZFC's axiom of choice holds but global choice fails.
  29. ^In the cumulative hierarchy Vκ, the subsets of Vκ are in Vκ+1. The constructible hierarchyLκ produces subsets more slowly, which is why the subsets ofLκ are inLκ+ rather thanLκ+1.[80]

References

[edit]
  1. ^abvon Neumann 1925, pp. 221–224, 226, 229; English translation:van Heijenoort 2002b, pp. 396–398, 400, 403.
  2. ^abcdBernays 1937, pp. 66–67.
  3. ^Gödel 1940.
  4. ^Gödel 1940, pp. 3–7.
  5. ^abcGödel 1940, p. 6.
  6. ^Gödel 1940, p. 25.
  7. ^Gödel 1940, pp. 35–38.
  8. ^ab"The Neumann-Bernays-Gödel axioms".Encyclopædia Britannica. Retrieved17 January 2019.
  9. ^abGödel 1940, p. 3.
  10. ^Mendelson 1997, pp. 225–226.
  11. ^Bernays 1937, p. 66.
  12. ^Mendelson 1997, p. 226.
  13. ^Gödel's axiom A3 (Gödel 1940, p. 3).
  14. ^Gödel's axiom A4 (Gödel 1940, p. 3).
  15. ^Gödel 1940, p. 4).
  16. ^Mendelson 1997, p. 230.
  17. ^Kanamori 2009, p. 56;Bernays 1937, p. 69;Gödel 1940, pp. 5, 9;Mendelson 1997, p. 231.
  18. ^Gödel's axiom B1 (Gödel 1940, p. 5).
  19. ^Gödel's axiom B2 (Gödel 1940, p. 5).
  20. ^Gödel's axiom B3 (Gödel 1940, p. 5).
  21. ^Gödel's axiom B4 (Gödel 1940, p. 5).
  22. ^Bourbaki 2004, p. 71.
  23. ^Bernays' axiom b(3) (Bernays 1937, p. 5).
  24. ^Gödel's axiom B7 (Gödel 1940, p. 5).
  25. ^Gödel's axiom B8 (Gödel 1940, p. 5).
  26. ^Gödel 1940, p. 6;Kanamori 2012, p. 70.
  27. ^Kanamori 2009, p. 57;Gödel 2003, p. 121. Both references contain Gödel's proof but Kanamori's is easier to follow since he uses modern terminology.
  28. ^Dawson 1997, p. 134.
  29. ^Gödel 1940, pp. 8–11
  30. ^Gödel 1940, p. 11.
  31. ^Gray 1991.
  32. ^Gödel 1940, pp. 11–13.
  33. ^Gödel 1940, pp. 8–15.
  34. ^Gödel 1940, pp. 16–18.
  35. ^Bernays 1941, p. 2;Gödel 1940, p. 5).
  36. ^abKanamori 2009, p. 48;Gödel 2003, pp. 104–115.
  37. ^Mendelson 1997, pp. 228, 239.
  38. ^Easton 1964, pp. 56a–64.
  39. ^von Neumann 1925,von Neumann 1928.
  40. ^Ferreirós 2007, p. 369.
  41. ^Mirimanoff 1917, p. 49.
  42. ^Kanamori 2012, p. 62.
  43. ^Hallett 1984, p. 280.
  44. ^Kunen 1980, p. 16.
  45. ^von Neumann 1925, p. 223 (footnote); English translation:van Heijenoort 2002b, p. 398 (footnote).
  46. ^Kanamori 2012, p. 61
  47. ^Kunen 1980, pp. 95–96. Uses the notation R(β) instead ofVβ.
  48. ^Hallett 1984, pp. 288–290.
  49. ^von Neumann 1925, p. 225; English translation:van Heijenoort 2002b, p. 400.
  50. ^Fraenkel,Historical Introduction inBernays 1991, p. 13.
  51. ^von Neumann 1925, pp. 224–226; English translation:van Heijenoort 2002b, pp. 399–401.
  52. ^Montague 1961.
  53. ^Mirimanoff 1917, p. 41.
  54. ^von Neumann 1925, pp. 230–232; English translation:van Heijenoort 2002b, pp. 404–405.
  55. ^Kanamori 2009, pp. 53–54.
  56. ^Bernays 1941, p. 6.
  57. ^von Neumann 1929, p. 229;Ferreirós 2007, pp. 379–380.
  58. ^Kanamori 2009, pp. 49, 53.
  59. ^Kanamori 2009, pp. 48, 58. Bernays' articles are reprinted inMüller 1976, pp. 1–117.
  60. ^Bernays 1937, p. 65.
  61. ^Kanamori 2009, pp. 48–54.
  62. ^Kanamori 2009, p. 56.
  63. ^Kanamori 2009, pp. 56–58;Gödel 1940, Chapter I: The axioms of abstract set theory, pp. 3–7.
  64. ^Gödel 1990, p. 26.
  65. ^Gödel 1990, pp. 28–32.
  66. ^Gödel 1986, p. 145.
  67. ^Solovay 1990, p. 13.
  68. ^Kunen 1980, p. 176.
  69. ^Gödel 1990, p. 108, footnote i. The paragraph containing this footnote discusses why Gödel considered "property of set" a primitive of set theory and how it fit into hisontology. "Property of set" corresponds to the "class" primitive in NBG.
  70. ^Kanamori 2009, p. 57.
  71. ^Cohen 1963.
  72. ^Kanamori 2009, p. 65: "Forcing itself went a considerable distance in downgrading any formal theory of classes because of the added encumbrance of having to specify the classes of generic extensions."
  73. ^Cohen 1966, pp. 107–147.
  74. ^Cohen 1966, pp. 85–99.
  75. ^Ferreirós 2007, pp. 381–382;Cohen 1966, p. 77;Felgner 1971.
  76. ^Mostowski 1950, p. 113, footnote 11. Footnote referencesWang's NQ set theory, which later evolved into MK.
  77. ^Kanamori 2009b, pp. 18, 29.
  78. ^Chuaqui 1981, p. 313 proves that (VκVκ+1, ∈) is a model of MKTR + AxC. MKT is Tarski's axioms for MK without Choice or Replacement. MKTR + AxC is MKT with Replacement and Choice (Chuaqui 1981, pp. 4, 125), which is equivalent to MK.
  79. ^Mendelson 1997, p. 275.
  80. ^Gödel 1940, p. 54;Solovay 1990, pp. 9–11.
  81. ^Gödel 1940, p. 54.
  82. ^A. Enayat, "Set theoretical analogues of the Barwise-Schlipf theorem". Annals of Pure and Applied Logic vol. 173 (2022).
  83. ^Adámek, Herrlich & Strecker 2004, pp. 15–16, 40.

Bibliography

[edit]

External links

[edit]
Overview
Venn diagram of set intersection
Axioms
Operations
  • Concepts
  • Methods
Set types
Theories
Set theorists
General
Theorems (list)
 and paradoxes
Logics
Traditional
Propositional
Predicate
Set theory
Types ofsets
Maps and cardinality
Set theories
Formal systems (list),
language and syntax
Example axiomatic
systems
 (list)
Proof theory
Model theory
Computability theory
Related
Retrieved from "https://en.wikipedia.org/w/index.php?title=Von_Neumann–Bernays–Gödel_set_theory&oldid=1307958421"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp