Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Constructive set theory

From Wikipedia, the free encyclopedia
Axiomatic set theories based on the principles of mathematical constructivism

Axiomaticconstructive set theory is an approach tomathematical constructivism following the program ofaxiomatic set theory.The samefirst-order language with "={\displaystyle =}" and "{\displaystyle \in }" of classical set theory is usually used, so this is not to be confused with aconstructive types approach.On the other hand, some constructive theories are indeed motivated by their interpretability intype theories.

In addition to rejecting theprinciple of excluded middle (PEM{\displaystyle {\mathrm {PEM} }}), constructive set theories often require some logical quantifiers in their axioms to be setbounded. The latter is motivated by results tied toimpredicativity.

Introduction

[edit]

Constructive outlook

[edit]

Preliminary on the use of intuitionistic logic

[edit]

The logic of the set theories discussed here isconstructive in that it rejects the principle of excluded middlePEM{\displaystyle {\mathrm {PEM} }}, i.e. that thedisjunctionϕ¬ϕ{\displaystyle \phi \lor \neg \phi } automatically holds for all propositionsϕ{\displaystyle \phi }. This is also often called thelaw of excluded middle (LEM{\displaystyle {\mathrm {LEM} }}) in contexts where it is assumed. Constructively, as a rule, to prove the excluded middle for a propositionP{\displaystyle P}, i.e. to prove the particular disjunctionP¬P{\displaystyle P\lor \neg P}, eitherP{\displaystyle P} or¬P{\displaystyle \neg P} needs to be explicitly proven. When either such proof is established, one says the proposition is decidable, and this then logically implies the disjunction holds. Similarly and more commonly, a predicateQ(x){\displaystyle Q(x)} forx{\displaystyle x} in a domainX{\displaystyle X} is said to be decidable when the more intricate statement(xX).(Q(x)¬Q(x)){\displaystyle \forall (x\in X).{\big (}Q(x)\lor \neg Q(x){\big )}} is provable. Non-constructive axioms may enable proofs that formally claim decidability of suchP{\displaystyle P} (and/orQ{\displaystyle Q}) in the sense that they prove excluded middle forP{\displaystyle P} (resp. the statement using the quantifier above) without demonstrating the truth of either side of the disjunction(s). This is often the case in classical logic. In contrast, axiomatic theories deemed constructive tend to not permit many classical proofs of statements involving properties that are provenly computationallyundecidable.

Thelaw of noncontradiction is a special case of the propositional form ofmodus ponens. Using the former with any negated statement¬P{\displaystyle \neg P}, one validDe Morgan's law thus implies¬¬(P¬P){\displaystyle \neg \neg (P\lor \neg P)} already in the more conservativeminimal logic. In words,intuitionistic logic still posits: It is impossible to rule out a proposition and rule out its negation both at once, and thus the rejection of any instantiated excluded middle statement for an individual proposition is inconsistent. Here the double-negation captures that the disjunction statement now provenly can never be ruled out or rejected, even in cases where the disjunction may not be provable (for example, by demonstrating one of the disjuncts, thus decidingP{\displaystyle P}) from the assumed axioms.

More generally, constructive mathematical theories tend to proveclassically equivalent reformulations of classical theorems. For example, inconstructive analysis, one cannot prove theintermediate value theorem in its textbook formulation, but one can prove theorems with algorithmic content that, as soon as double negation elimination and its consequences are assumed legal, are at once classically equivalent to the classical statement. The difference is that the constructive proofs are harder to find.

The intuitionistic logic underlying the set theories discussed here, unlike minimal logic, still permitsdouble negation elimination for individual propositionsP{\displaystyle P} for which excluded middle holds. In turn the theorem formulations regarding finite objects tends to not differ from their classical counterparts. Given a model of all natural numbers, the equivalent for predicates, namelyMarkov's principle, does not automatically hold, but may be considered as an additional principle.

In an inhabited domain and usingexplosion, the disjunctionP(xX).¬Q(x){\displaystyle P\lor \exists (x\in X).\neg Q(x)} implies the existence claim(xX).(Q(x)P){\displaystyle \exists (x\in X).(Q(x)\to P)}, which in turn implies((xX).Q(x))P{\displaystyle {\big (}\forall (x\in X).Q(x){\big )}\to P}. Classically, these implications are always reversible. If one of the former is classically valid, it can be worth trying to establish it in the latter form. For the special case whereP{\displaystyle P} is rejected, one deals with a counter-example existence claim(xX).¬Q(x){\displaystyle \exists (x\in X).\neg Q(x)}, which is generally constructively stronger than a rejection claim¬(xX).Q(x){\displaystyle \neg \forall (x\in X).Q(x)}: Exemplifying at{\displaystyle t} such thatQ(t){\displaystyle Q(t)} is contradictory of course means it is not the case thatQ{\displaystyle Q} holds for all possiblex{\displaystyle x}. But one may also demonstrate thatQ{\displaystyle Q} holding for allx{\displaystyle x} would logically lead to a contradiction without the aid of a specific counter-example, and even while not being able to construct one. In the latter case, constructively, here one does not stipulate an existence claim.

Imposed restrictions on a set theory

[edit]

Compared to the classical counterpart, one is generally less likely to prove the existence of relations that cannot be realized. A restriction to the constructive reading of existence apriori leads to stricter requirements regarding which characterizations of a setfX×Y{\displaystyle f\subset X\times Y} involving unbounded collections constitute a (mathematical, and so always meaningtotal) function. This is often because the predicate in a case-wise would-be definition may not be decidable. Adopting the standard definition of set equality via extensionality, the fullAxiom of Choice is such a non-constructive principle that impliesPEM{\displaystyle {\mathrm {PEM} }} for the formulas permitted in one's adopted Separation schema, byDiaconescu's theorem. Similar results hold for theAxiom of Regularity existence claim, as shown below. The latter has a classically equivalentinductive substitute.So a genuinely intuitionistic development of set theory requires the rewording of some standard axioms to classically equivalent ones. Apart from demands for computability and reservations regrading of impredicativity,[1] technical question regarding which non-logical axioms effectively extend the underlying logic of a theory is also a research subject in its own right.

Metalogic

[edit]

With computably undecidable propositions already arising inRobinson arithmetic, even justPredicative separation lets one define elusive subsets easily. In stark contrast to the classical framework, constructive set theories may be closed under the rule that any property that is decidablefor all sets is already equivalent to one of the two trivial ones,{\displaystyle \top } or{\displaystyle \bot }. Also the real line may be taken to beindecomposable in this sense. Undecidability of disjunctions also affects the claims about total orders such as that of allordinal numbers, expressed by the provability and rejection of the clauses in the order defining disjunction(αβ)(α=β)(βα){\displaystyle (\alpha \in \beta )\lor (\alpha =\beta )\lor (\beta \in \alpha )}. This determines whether the relation istrichotomous. A weakened theory of ordinals in turn affects the proof theoretic strength defined inordinal analysis.

In exchange, constructive set theories can exhibit attractivedisjunction and existence properties, as is familiar from the study of constructive arithmetic theories. These are features of a fixed theory which metalogically relatejudgements of propositions provable in the theory. Particularly well-studied are those such features that can be expressed inHeyting arithmetic, with quantifiers over numbers and which can often berealized by numbers, as formalized inproof theory. In particular, those are the numerical existence property and the closely related disjunctive property, as well as being closed underChurch's rule, witnessing any given function to becomputable.[2]

A set theory does not only express theorems about numbers, and so one may consider a more general so-called strong existence property that is harder to come by, as will be discussed. A theory has this property if the following can be established: For any propertyϕ{\displaystyle \phi }, if the theory proves that a set exist that has that property, i.e. if the theory claims the existence statement, then there is also a propertyψ{\displaystyle \psi } thatuniquely describes such a set instance. More formally, for any predicateϕ{\displaystyle \phi } there is a predicateψ{\displaystyle \psi } so that

Tx.ϕ(x)T!x.ϕ(x)ψ(x){\displaystyle {\mathsf {T}}\vdash \exists x.\phi (x)\implies {\mathsf {T}}\vdash \exists !x.\phi (x)\land \psi (x)}

The role analogous to that of realized numbers in arithmetic is played here by defined sets proven to exist by (or according to) the theory. Questions concerning the axiomatic set theory's strength and its relation to term construction are subtle. While many theories discussed tend have all the various numerical properties, the existence property can easily be spoiled, as will be discussed. Weaker forms of existence properties have been formulated.

Some theories with a classical reading of existence can in fact also be constrained so as to exhibit the strong existence property. InZermelo–Fraenkel set theory with sets all taken to beordinal-definable, a theory denotedZF+(V=HOD){\displaystyle {\mathsf {ZF}}+({\mathrm {V} }={\mathrm {HOD} })}, no sets without such definability exist. The property is also enforced via theconstructible universe postulate inZF+(V=L){\displaystyle {\mathsf {ZF}}+({\mathrm {V} }={\mathrm {L} })}.For contrast, consider the theoryZFC{\displaystyle {\mathsf {ZFC}}} given byZF{\displaystyle {\mathsf {ZF}}} plus the fullaxiom of choiceexistence postulate: Recall that this collection of axioms proves thewell-ordering theorem, implying well-orderings exists for any set. In particular, this means that relationsWR×R{\displaystyle W\subset {\mathbb {R} }\times {\mathbb {R} }} formally exist that establish the well-ordering ofR{\displaystyle {\mathbb {R} }} (i.e. the theory claims the existence of a least element for all subsets ofR{\displaystyle {\mathbb {R} }} with respect to those relations). This is despite the fact that definability of such an ordering isknown to beindependent ofZFC{\displaystyle {\mathsf {ZFC}}}. The latter implies that for no particular formulaψ{\displaystyle \psi } in the language of the theory does the theory prove that the corresponding set is a well-ordering relation of the reals. SoZFC{\displaystyle {\mathsf {ZFC}}} formallyproves the existence of a subsetWR×R{\displaystyle W\subset {\mathbb {R} }\times {\mathbb {R} }} with the property of being a well-ordering relation, but at the same time no particular setW{\displaystyle W} for which the property could be validated can possibly be defined.

Anti-classical principles

[edit]

As mentioned above, a constructive theoryT{\displaystyle {\mathsf {T}}} may exhibit the numerical existence property,Te.ψ(e)Tψ(e_){\displaystyle {\mathsf {T}}\vdash \exists e.\psi (e)\implies {\mathsf {T}}\vdash \psi ({\underline {\mathrm {e} }})}, for some numbere{\displaystyle {\mathrm {e} }} and wheree_{\displaystyle {\underline {\mathrm {e} }}} denotes the corresponding numeral in the formal theory. Here one must carefully distinguish between provable implications between two propositions,TPQ{\displaystyle {\mathsf {T}}\vdash P\to Q}, and a theory's properties of the formTPTQ{\displaystyle {\mathsf {T}}\vdash P\implies {\mathsf {T}}\vdash Q}. When adopting a metalogically established schema of the latter type as an inference rule of one's proof calculus and nothing new can be proven, one says the theoryT{\displaystyle {\mathsf {T}}} is closed under that rule.

One may instead consider adjoining the rule corresponding to the meta-theoretical property as an implication (in the sense of "{\displaystyle \to }") toT{\displaystyle {\mathsf {T}}}, as anaxiom schema or in quantified form. A situation commonly studied is that of a fixedT{\displaystyle {\mathsf {T}}} exhibiting the meta-theoretical property of the following type: For an instance from some collection of formulas of a particular form, here captured viaϕ{\displaystyle \phi } andψ{\displaystyle \psi }, one established the existence of a numbere{\displaystyle {\mathrm {e} }} so thatTϕTψ(e_){\displaystyle {\mathsf {T}}\vdash \phi \implies {\mathsf {T}}\vdash \psi ({\underline {\mathrm {e} }})}. Here one may then postulateϕ(eN).ψ(e){\displaystyle \phi \to \exists (e\in {\mathbb {N} }).\psi (e)}, where the bounde{\displaystyle e} is a number variable in language of the theory. For example, Church's rule is anadmissible rule in first-order Heyting arithmeticHA{\displaystyle {\mathsf {HA}}} and, furthermore, thecorresponding Church's thesis principleCT0{\displaystyle {\mathrm {CT} }_{0}} may consistently be adopted as an axiom. The new theory with the principle added is anti-classical, in that it may not be consistent anymore to also adoptPEM{\displaystyle {\mathrm {PEM} }}. Similarly, adjoining the excluded middle principlePEM{\displaystyle {\mathrm {PEM} }} to some theoryT{\displaystyle {\mathsf {T}}}, the theory thus obtained may prove new, strictly classical statements, and this may spoil some of the meta-theoretical properties that were previously established forT{\displaystyle {\mathsf {T}}}. In such a fashion,CT0{\displaystyle {\mathrm {CT} }_{0}} may not be adopted inHA+PEM{\displaystyle {\mathsf {HA}}+{\mathrm {PEM} }}, also known asPeano arithmeticPA{\displaystyle {\mathsf {PA}}}.

The focus in this subsection shall be on set theories with quantification over a fully formal notion of an infinite sequences space, i.e. function space, as it will be introduced further below.A translation of Church'srule into the language of the theory itself may here read

(fNN).(eN).((nN).(wN).T(e,n,w)U(w,f(n))){\displaystyle \forall (f\in {\mathbb {N} }^{\mathbb {N} }).\exists (e\in {\mathbb {N} }).{\Big (}\forall (n\in {\mathbb {N} }).\exists (w\in {\mathbb {N} }).T(e,n,w)\land U(w,f(n)){\Big )}}

Kleene's T predicate together with the result extraction expresses that any input numbern{\displaystyle n} being mapped to the numberf(n){\displaystyle f(n)} is, throughw{\displaystyle w}, witnessed to be a computable mapping. HereN{\displaystyle {\mathbb {N} }} now denotes a set theory model of the standard natural numbers ande{\displaystyle e} is an index with respect to a fixed program enumeration. Stronger variants have been used, which extend this principle to functionsfNX{\displaystyle f\in {\mathbb {N} }^{X}} defined on domainsXN{\displaystyle X\subset {\mathbb {N} }} of low complexity. The principle rejects decidability for the predicateQ(e){\displaystyle Q(e)} defined as(wN).T(e,e,w){\displaystyle \exists (w\in {\mathbb {N} }).T(e,e,w)}, expressing thate{\displaystyle e} is the index of a computable functionhalting on its own index. Weaker, double negated forms of the principle may be considered too, which do not require the existence of a recursive implementation for everyf{\displaystyle f}, but which still make principles inconsistent that claim the existence of functions which provenly have no recursive realization. Some forms of a Church's thesis as principle are even consistent with the classical, weak so calledsecond-order arithmetic theoryRCA0{\displaystyle {\mathsf {RCA}}_{0}}, a subsystem of thetwo-sorted first-order theoryZ2{\displaystyle {\mathsf {Z}}_{2}}.

The collection of computable functions is classicallysubcountable, which classically is the same as being countable. But classical set theories will generally claim thatNN{\displaystyle {\mathbb {N} }^{\mathbb {N} }} holds also other functions than the computable ones. For example there is a proof inZF{\displaystyle {\mathsf {ZF}}} that total functions (in the set theory sense) do exist that cannot be captured by aTuring machine.Taking the computable world seriously as ontology, a prime example of an anti-classical conception related the Markovian school is the permitted subcountability of various uncountable collections. When adopting the subcountability of the collection of all unending sequences of natural numbers (NN{\displaystyle {\mathbb {N} }^{\mathbb {N} }}) as an axiom in a constructive theory, the "smallness" (in classical terms) of this collection, in some set theoretical realizations, is then already captured by the theory itself.A constructive theory may also adopt neither classical nor anti-classical axioms and so stay agnostic towards either possibility.

Constructive principles already prove(xX).¬¬(Q(x)¬Q(x)){\displaystyle \forall (x\in X).\neg \neg {\big (}Q(x)\lor \neg Q(x){\big )}} for anyQ{\displaystyle Q}. And so for any given elementx{\displaystyle x} ofX{\displaystyle X}, the corresponding excluded middle statement for the proposition cannot be negated. Indeed, for any givenx{\displaystyle x}, by noncontradiction it is impossible to rule outQ(x){\displaystyle Q(x)} and rule out its negation both at once, and the relevant De Morgan's rule applies as above. But a theory may in some instances also permit the rejection claim¬(xX).(Q(x)¬Q(x)){\displaystyle \neg \forall (x\in X).{\big (}Q(x)\lor \neg Q(x){\big )}}. Adopting this does not necessitate providing a particulartX{\displaystyle t\in X} witnessing the failure of excluded middle for the particular propositionQ(t){\displaystyle Q(t)}, i.e. witnessing the inconsistent¬(Q(t)¬Q(t)){\displaystyle \neg {\big (}Q(t)\lor \neg Q(t){\big )}}. PredicatesQ(x){\displaystyle Q(x)} on an infinite domainX{\displaystyle X} correspond todecision problems. Motivated by provenlycomputably undecidable problems, one may reject the possibility of decidability of a predicate without also making any existence claim inX{\displaystyle X}.As another example, such a situation is enforced inBrouwerian intuitionistic analysis, in a case where the quantifier ranges over infinitely manyunending binary sequences andQ(x){\displaystyle Q(x)} states that a sequencex{\displaystyle x} is everywhere zero. Concerning this property, of being conclusively identified as the sequence which is forever constant, adopting Brouwer's continuity principle strictly rules out that this could be proven decidable for all the sequences.

So in a constructive context with a so-callednon-classical logic as used here, one may consistently adopt axioms which are both in contradiction to quantified forms of excluded middle, but also non-constructive in the computable sense or as gauged by meta-logical existence properties discussed previously. In that way, a constructive set theory can also provide the framework to study non-classical theories, say rings modelingsmooth infinitesimal analysis.

History and overview

[edit]

Historically, the subject of constructive set theory (often also "CST{\displaystyle {\mathsf {CST}}}") begun withJohn Myhill's work on the theories also calledIZF{\displaystyle {\mathsf {IZF}}} andCST{\displaystyle {\mathsf {CST}}}.[3][4][5]In 1973, he had proposed the former as a first-order set theory based on intuitionistic logic, taking the most common foundationZFC{\displaystyle {\mathsf {ZFC}}} and throwing out the Axiom of choice as well as the principle of the excluded middle, initially leaving everything else as is. However, different forms of some of theZFC{\displaystyle {\mathsf {ZFC}}} axioms which are equivalent in the classical setting are inequivalent in the constructive setting, and some forms implyPEM{\displaystyle {\mathrm {PEM} }}, as will be demonstrated. In those cases, the intuitionistically weaker formulations were consequently adopted. The far more conservative systemCST{\displaystyle {\mathsf {CST}}} is also a first-order theory, but of several sorts and bounded quantification, aiming to provide a formal foundation forErrett Bishop's program of constructive mathematics.

The main discussion presents a sequence of theories in the same language asZF{\displaystyle {\mathsf {ZF}}}, leading up toPeter Aczel's well studiedCZF{\displaystyle {\mathsf {CZF}}},[6] and beyond. Many modern results trace back to Rathjen and his students.CZF{\displaystyle {\mathsf {CZF}}} is also characterized by the two features present also in Myhill's theory:On the one hand, it is using thePredicative Separation instead of the full, unbounded Separation schema. Boundedness can be handled as a syntactic property or, alternatively, the theories can be conservatively extended with a higher boundedness predicate and its axioms. Secondly, theimpredicativePowerset axiom is discarded, generally in favor of related but weaker axioms. The strong form is very casually used in classicalgeneral topology.AddingPEM{\displaystyle {\mathrm {PEM} }} to a theory even weaker thanCZF{\displaystyle {\mathsf {CZF}}} recoversZF{\displaystyle {\mathsf {ZF}}}, as detailed below.[7]The system, which has come to be known as Intuitionistic Zermelo–Fraenkel set theory (IZF{\displaystyle {\mathsf {IZF}}}), is a strong set theory withoutPEM{\displaystyle {\mathrm {PEM} }}. It is similar toCZF{\displaystyle {\mathsf {CZF}}}, but less conservative orpredicative.The theory denotedIKP{\displaystyle {\mathsf {IKP}}} is the constructive version ofKP{\displaystyle {\mathsf {KP}}}, the classicalKripke–Platek set theory without a form of Powerset and where even the Axiom of Collection is bounded.

Models

[edit]

Many theories studied in constructive set theory are mere restrictions ofZermelo–Fraenkel set theory (ZF{\displaystyle {\mathsf {ZF}}}) with respect to their axiom as well as their underlying logic. Such theories can then also be interpreted in any model ofZF{\displaystyle {\mathsf {ZF}}}.

Peano arithmeticPA{\displaystyle {\mathsf {PA}}} isbi-interpretable with the theory given byZF{\displaystyle {\mathsf {ZF}}} minus Infinity and without infinite sets, plus the existence of alltransitive closures. (The latter is also implied after promotingRegularity to theSet Induction schema, which is discussed below.) Likewise, constructive arithmetic can also be taken as an apology for most axioms adopted inCZF{\displaystyle {\mathsf {CZF}}}:Heyting arithmeticHA{\displaystyle {\mathsf {HA}}} is bi-interpretable with a weak constructive set theory,[8][9] as also described in the article onHA{\displaystyle {\mathsf {HA}}}. One may arithmetically characterize a membership relation "{\displaystyle \in }" and with it prove - instead of the existence of a set of natural numbersω{\displaystyle \omega } - that all sets in its theory are in bijection with a (finite)von Neumann natural, a principle denotedV=Fin{\displaystyle {\mathrm {V} }={\mathrm {Fin} }}. This context further validates Extensionality, Pairing, Union, Binary Intersection (which is related to theAxiom schema of predicative separation) and the Set Induction schema. Taken as axioms, the aforementioned principles constitute a set theory that is already identical with the theory given byCZF{\displaystyle {\mathsf {CZF}}} minus the existence ofω{\displaystyle \omega } but plusV=Fin{\displaystyle {\mathrm {V} }={\mathrm {Fin} }} as axiom. All those axioms are discussed in detail below.Relatedly,CZF{\displaystyle {\mathsf {CZF}}} also proves that thehereditarily finite sets fulfill all the previous axioms. This is a result which persists when passing on toPA{\displaystyle {\mathsf {PA}}} andZF{\displaystyle {\mathsf {ZF}}} minus Infinity.

As far as constructive realizations go there is a relevantrealizability theory. Relatedly, Aczel's theory constructive Zermelo-FraenkelCZF{\displaystyle {\mathsf {CZF}}} has been interpreted in aMartin-Löf type theories, as sketched in the section onCZF{\displaystyle {\mathsf {CZF}}}. In this way, theorems provable in this and weaker set theories are candidates for a computer realization.

Presheaf models for constructive set theories have also been introduced. These are analogous to presheaf models for intuitionistic set theory developed byDana Scott in the 1980s.[10][11] Realizability models ofCZF{\displaystyle {\mathsf {CZF}}} within theeffective topos have been identified, which, say, at once validate full Separation, relativized dependent choiceRDC{\displaystyle {\mathrm {RDC} }},independence of premiseIP{\displaystyle {\mathrm {IP} }} for sets, but also the subcountability of all sets, Markov's principleMP{\displaystyle {\mathrm {MP} }} and Church's thesisCT0{\displaystyle {\mathrm {CT} _{0}}} in the formulation for all predicates.[12]

Notation

[edit]

In an axiomaticset theory, sets are the entities exhibiting properties. But there is then a more intricate relation between the set concept and logic. For example, theproperty of being a natural number smaller than 100 may be reformulated as being a member of theset of numbers with that property. The set theory axioms govern set existence and thus govern which predicates can be materialized as entity in itself, in this sense. Specification is also directly governed by the axioms, as discussed below. For a practical consideration, consider for example the property of being a sequence of coin flip outcomes that overall show more heads than tails. This property may be used to separate out a corresponding subset of any set of finite sequences of coin flips. Relatedly, themeasure theoretic formalization of aprobabilistic event is explicitly based around sets and provides many more examples.

This section introduces the object language and auxiliary notions used to formalize this materialization.

Language

[edit]

Thepropositional connective symbols used to form syntactic formulas are standard. The axioms of set theory give a means to proveequality "={\displaystyle =}" of sets and that symbol may, byabuse of notation, be used for classes. A set in which the equality predicate is decidable is also calleddiscrete. Negation "¬{\displaystyle \neg }" of equality is sometimes called the denial of equality, and is commonly written "{\displaystyle \neq }". However, in a context withapartness relations, for example when dealing with sequences, the latter symbol is also sometimes used for something different.

The common treatment, as also adopted here, formally only extends the underlying logic by one primitive binary predicate of set theory, "{\displaystyle \in }". As with equality, negation of elementhood "{\displaystyle \in }" is often written "{\displaystyle \notin }".

Variables

[edit]

Below the Greekϕ{\displaystyle \phi } denotes a proposition or predicate variable inaxiom schemas andP{\displaystyle P} orQ{\displaystyle Q} is used for particular such predicates. The word "predicate" is sometimes used interchangeably with "formulas" as well, even in theunary case.

Quantifiers only ever range over sets and those are denoted by lower case letters.As is common, one may use argument brackets to express predicates, for the sake of highlighting particular free variables in their syntactic expression, as in "Q(z){\displaystyle Q(z)}".Unique existence!x.Q(x){\displaystyle \exists !x.Q(x)} here meansx.y.(y=xQ(y)){\displaystyle \exists x.\forall y.{\big (}y=x\leftrightarrow Q(y){\big )}}.

Classes

[edit]

As is also common, one makes use set builder notation forclasses, which, in most contexts, are not part of the object language but used for concise discussion. In particular, one may introduce notation declarations of the correspondingclass via "A={zQ(z)}{\displaystyle A=\{z\mid Q(z)\}}", for the purpose of expressing anyQ(a){\displaystyle Q(a)} asaA{\displaystyle a\in A}. Logically equivalent predicates can be used to introduce the same class.One also writes{zBQ(z)}{\displaystyle \{z\in B\mid Q(z)\}} as shorthand for{zzBQ(z)}{\displaystyle \{z\mid z\in B\land Q(z)\}}. For example, one may consider{zBzC}{\displaystyle \{z\in B\mid z\notin C\}} and this is also denotedBC{\displaystyle B\setminus C}.

One abbreviatesz.(zAQ(z)){\displaystyle \forall z.{\big (}z\in A\to Q(z){\big )}} by(zA).Q(z){\displaystyle \forall (z\in A).Q(z)} andz.(zAQ(z)){\displaystyle \exists z.{\big (}z\in A\land Q(z){\big )}} by(zA).Q(z){\displaystyle \exists (z\in A).Q(z)}. The syntactic notion of bounded quantification in this sense can play a role in the formulation of axiom schemas, as seen in the discussion of axioms below. Express thesubclass claim(zA).zB{\displaystyle \forall (z\in A).z\in B}, i.e.z.(zAzB){\displaystyle \forall z.(z\in A\to z\in B)}, byAB{\displaystyle A\subset B}.For a predicateQ{\displaystyle Q}, triviallyz.((zBQ(z))zB){\displaystyle \forall z.{\big (}(z\in B\land Q(z))\to z\in B{\big )}}. And so follows that{zBQ(z)}B{\displaystyle \{z\in B\mid Q(z)\}\subset B}.The notion of subset-bounded quantifiers, as in(zA).zB{\displaystyle \forall (z\subset A).z\in B}, has been used in set theoretical investigation as well, but will not be further highlighted here.

If there provenlyexists a set inside a class, meaningz.(zA){\displaystyle \exists z.(z\in A)}, then one calls itinhabited. One may also use quantification inA{\displaystyle A} to express this as(zA).(z=z){\displaystyle \exists (z\in A).(z=z)}. The classA{\displaystyle A} is then provenly not the empty set, introduced below. While classically equivalent, constructivelynon-empty is a weaker notion with two negations and ought to be callednot uninhabited. Unfortunately, the word for the more useful notion of 'inhabited' is rarely used in classical mathematics.

Two ways to express that classes aredisjoint does capture many of the intuitionistically valid negation rules:((xA).xB)¬(xA).xB{\displaystyle {\big (}\forall (x\in A).x\notin B{\big )}\leftrightarrow \neg \exists (x\in A).x\in B}. Using the above notation, this is a purely logical equivalence and in this article the proposition will furthermore be expressible asAB={}{\displaystyle A\cap B=\{\}}.

A subclassAB{\displaystyle A\subset B} is calleddetachable fromB{\displaystyle B} if the relativized membership predicate is decidable, i.e. if(xB).xAxA{\displaystyle \forall (x\in B).x\in A\lor x\notin A} holds. It is also called decidable if the superclass is clear from the context - often this is the set of natural numbers.

Extensional equivalence

[edit]

Denote byAB{\displaystyle A\simeq B} the statement expressing that two classes have exactly the same elements, i.e.z.(zAzB){\displaystyle \forall z.(z\in A\leftrightarrow z\in B)}, or equivalently(AB)(BA){\displaystyle (A\subset B)\land (B\subset A)}. This is not to be conflated with the concept ofequinumerosity also used below.

WithA{\displaystyle A} standing for{zQ(z)}{\displaystyle \{z\mid Q(z)\}}, the convenient notational relation betweenxA{\displaystyle x\in A} andQ(x){\displaystyle Q(x)}, axioms of the forma.z.(zaQ(z)){\displaystyle \exists a.\forall z.{\big (}z\in a\leftrightarrow Q(z){\big )}} postulate that theclass of all sets for whichQ{\displaystyle Q} holds actually forms aset. Less formally, this may be expressed asa.aA{\displaystyle \exists a.a\simeq A}. Likewise, the propositiona.(aA)P(a){\displaystyle \forall a.(a\simeq A)\to P(a)} conveys "P(A){\displaystyle P(A)} whenA{\displaystyle A} is among the theory's sets." For the case whereP{\displaystyle P} is the trivially false predicate, the proposition is equivalent to the negation of the former existence claim, expressing the non-existence ofA{\displaystyle A} as a set.

Further extensions of class comprehension notation as above are in common used in set theory, giving meaning to statements such as "{f(z)Q(z)}{x,y,zT(x,y,z)}{\displaystyle \{f(z)\mid Q(z)\}\simeq \{\langle x,y,z\rangle \mid T(x,y,z)\}}", and so on.

Syntactically more general, a setw{\displaystyle w} may also be characterized using another 2-ary predicateR{\displaystyle R} troughx.xwR(x,w){\displaystyle \forall x.x\in w\leftrightarrow R(x,w)}, where the right hand side may depend on the actual variablew{\displaystyle w}, and possibly even on membership inw{\displaystyle w} itself.

Subtheories of ZF

[edit]

Here a series of familiar axioms is presented, or the relevant slight reformulations thereof. It is emphasized how the absence ofPEM{\displaystyle {\mathrm {PEM} }} in the logic affects what is provable and it is highlighted which non-classical axioms are, in turn, consistent.

Equality

[edit]

Using the notation introduced above, the following axiom gives a means to prove equality "={\displaystyle =}" of twosets, so that through substitution, any predicate aboutx{\displaystyle x} translates to one ofy{\displaystyle y}.By the logical properties of equality, the converse direction of the postulated implication holds automatically.

Extensionality

x.y.  xyx=y{\displaystyle \forall x.\forall y.\ \ x\simeq y\to x=y}

In a constructive interpretation, the elements of a subclassA={zBQ(z)¬Q(z)}{\displaystyle A=\{z\in B\mid Q(z)\lor \neg Q(z)\}} ofB{\displaystyle B} may come equipped with more information than those ofB{\displaystyle B}, in the sense that being able to judgebA{\displaystyle b\in A} is being able to judgeQ(b)¬Q(b){\displaystyle Q(b)\lor \neg Q(b)}. And (unless the whole disjunction follows from axioms) in theBrouwer–Heyting–Kolmogorov interpretation, this means to have provenQ(b){\displaystyle Q(b)} or having rejected it.As{zBQ(z)}{\displaystyle \{z\in B\mid Q(z)\}} may not be detachable fromB{\displaystyle B}, i.e. asQ{\displaystyle Q}may be not decidable for all elements inB{\displaystyle B}, the two classesA{\displaystyle A} andB{\displaystyle B} must a priori be distinguished.

Consider a predicateQ{\displaystyle Q} that provenly holds for all elements of a sety{\displaystyle y}, so thaty{zyQ(z)}{\displaystyle y\simeq \{z\in y\mid Q(z)\}}, and assume that the class on the right hand side is established to be a set. Note that, even if this set on the right informally also ties to proof-relevant information about the validity ofQ{\displaystyle Q} for all the elements, the Extensionality axiom postulates that, in our set theory, the set on the right hand side is judged equal to the one on the left hand side.This above analysis also shows that a statement of the form(xw).Q(x){\displaystyle \forall (x\in w).Q(x)}, which in informal class notation may be expressed asw{xQ(x)}{\displaystyle w\subset \{x\mid Q(x)\}}, is then equivalently expressed as{xwQ(x)}=w{\displaystyle \{x\in w\mid Q(x)\}=w}. This means that establishing such{\displaystyle \forall }-theorems (e.g. the ones provable from full mathematical induction) enables substituting the subclass ofw{\displaystyle w} on the left hand side of the equality for justw{\displaystyle w}, in any formula.

Note that adopting "={\displaystyle =}" as a symbol in a predicate logic theory makes equality of two terms a quantifier-free expression.

Alternative approaches

[edit]

While often adopted, this axiom has been criticized in constructive thought, as it effectively collapses differently defined properties, or at least the sets viewed as the extension of these properties, aFregian notion.

Modern type theories may instead aim at defining the demanded equivalence "{\displaystyle \simeq }" in terms of functions, see e.g.type equivalence. The related concept of functionextensionality is often not adopted in type theory.

Other frameworks for constructive mathematics might instead demand a particular rule for equality orapartness come for the elementszx{\displaystyle z\in x} of each and every setx{\displaystyle x} discussed. But also in an approach to sets emphasizing apartness may the above definition in terms of subsets be used to characterize a notion of equality "{\displaystyle \simeq }" of those subsets.Relatedly, a loose notion of complementation of twosubsetsux{\displaystyle u\subset x} andvx{\displaystyle v\subset x} is given when any two memberssu{\displaystyle s\in u} andtv{\displaystyle t\in v} are provably apart from each other. The collection of complementing pairsu,v{\displaystyle \langle u,v\rangle } isalgebraically well behaved.

Merging sets

[edit]

Define class notation for the pairing of a few given elements via disjunctions. E.g.z{a,b}{\displaystyle z\in \{a,b\}} is the quantifier-free statement(z=a)(z=b){\displaystyle (z=a)\lor (z=b)}, and likewisez{a,b,c}{\displaystyle z\in \{a,b,c\}} says(z=a)(z=b)(z=c){\displaystyle (z=a)\lor (z=b)\lor (z=c)}, and so on.

Two other basic existence postulates given some other sets are as follows. Firstly,

Pairing

x.y.  p.{x,y}p{\displaystyle \forall x.\forall y.\ \ \exists p.\{x,y\}\subset p}

Given the definitions above,{x,y}p{\displaystyle \{x,y\}\subset p} expands toz.(z=xz=y)zp{\displaystyle \forall z.(z=x\lor z=y)\to z\in p}, so this is making use of equality and a disjunction. The axiom says that for any two setsx{\displaystyle x} andy{\displaystyle y}, there is at least one setp{\displaystyle p}, which hold at least those two sets.

With bounded Separation below, also the class{x,y}{\displaystyle \{x,y\}} exists as a set. Denote byx,y{\displaystyle \langle x,y\rangle } the standardordered pair model{{x},{x,y}}{\displaystyle \{\{x\},\{x,y\}\}}, so that e.g.q=x,y{\displaystyle q=\langle x,y\rangle }denotes another bounded formula in the formal language of the theory.

And then, using existential quantification and a conjunction,

Union

x.  u.z.(((yx).zy)zu){\displaystyle \forall x.\ \ \exists u.\forall z.{\Big (}{\big (}\exists (y\in x).z\in y{\big )}\to z\in u{\Big )}}

saying that for any setx{\displaystyle x}, there is at least one setu{\displaystyle u}, which holds all the membersz{\displaystyle z}, ofx{\displaystyle x}'s membersy{\displaystyle y}.The minimal such set is theunion.

The two axioms are commonly formulated stronger, in terms of "{\displaystyle \leftrightarrow }" instead of just "{\displaystyle \to }", although this is technically redundant in the context ofBCST{\displaystyle {\mathsf {BCST}}}: As the Separation axiom below is formulated with "{\displaystyle \leftrightarrow }", for statementst.z.ϕ(z)zt{\displaystyle \exists t.\forall z.\phi (z)\to z\in t} the equivalence can be derived, given the theory allows for separation usingϕ{\displaystyle \phi }. In cases whereϕ{\displaystyle \phi } is an existential statement, like here in the union axiom, there is alsoanother formulation using a universal quantifier.

Also using bounded Separation, the two axioms just stated together imply the existence of a binary union of two classesa{\displaystyle a} andb{\displaystyle b}, when they have been established to be sets, denoted by{a,b}{\displaystyle \bigcup \{a,b\}} orab{\displaystyle a\cup b}. For a fixed setz{\displaystyle z}, to validate membershipzab{\displaystyle z\in a\cup b} in the union of two given setsy=a{\displaystyle y=a} andy=b{\displaystyle y=b}, one needs to validate thezy{\displaystyle z\in y} part of the axiom, which can be done by validating the disjunction of the predicates defining the setsa{\displaystyle a} andb{\displaystyle b}, forz{\displaystyle z}. In terms of the associated sets, it is done by validating the disjunctionzazb{\displaystyle z\in a\lor z\in b}.

The union and other set forming notations are also used for classes. For instance, the propositionzAzC{\displaystyle z\in A\land z\notin C} is writtenzAC{\displaystyle z\in A\setminus C}. Let nowBA{\displaystyle B\subset A}. GivenzA{\displaystyle z\in A}, the decidability of membership inB{\displaystyle B}, i.e. the potentially independent statementzBzB{\displaystyle z\in B\lor z\notin B}, can also be expressed aszB(AB){\displaystyle z\in B\cup (A\setminus B)}. But, as for any excluded middle statement, the double-negation of the latter holds: That union isn't not inhabited byz{\displaystyle z}. This goes to show that partitioning is also a more involved notion, constructively.

Set existence

[edit]

The property that is false for any set corresponds to theempty class, which is denoted by{}{\displaystyle \{\}} or zero,0{\displaystyle 0}. That the empty class is a set readily follows from other existence axioms, such as the Axiom of Infinity below. But if, e.g., one is explicitly interested in excluding infinite sets in one's study, one may at this point adopt the

Axiom of empty set:

x.y.¬(yx){\displaystyle \exists x.\forall y.\,\neg (y\in x)}

Introduction of the symbol{}{\displaystyle \{\}} (as abbreviating notation for expressions in involving characterizing properties) is justified as uniqueness for this set can be proven. Asy{}{\displaystyle y\in \{\}} is false for anyy{\displaystyle y}, the axiom then readsx.x{}{\displaystyle \exists x.x\simeq \{\}}.

Write1{\displaystyle 1} forS0{\displaystyle S0}, which equals{{}}{\displaystyle \{\{\}\}}, i.e.{1}{\displaystyle \{1\}}. Likewise, write2{\displaystyle 2} forS1{\displaystyle S1}, which equals{{},{{}}}{\displaystyle \{\{\},\{\{\}\}\}}, i.e.{0,1}{\displaystyle \{0,1\}}. A simple and provenly false proposition then is, for example,{}{}{\displaystyle \{\}\in \{\}}, corresponding to0<0{\displaystyle 0<0} in the standard arithmetic model.Again, here symbols such as{}{\displaystyle \{\}} are treated as convenient notation and any proposition really translates to an expression using only "{\displaystyle \in }" and logical symbols, including quantifiers. Accompanied by a metamathematical analysis that the capabilities of the new theories are equivalent in an effective manner,formal extensions by symbols such as0{\displaystyle 0} may also be considered.

More generally, for a setx{\displaystyle x}, define thesuccessor setSx{\displaystyle Sx} asx{x}{\displaystyle x\cup \{x\}}. The interplay of the successor operation with the membership relation has a recursive clause, in the sense that(ySx)(yxy=x){\displaystyle (y\in Sx)\leftrightarrow (y\in x\lor y=x)}. By reflexivity of equality,xSx{\displaystyle x\in Sx}, and in particularSx{\displaystyle Sx} is always inhabited.

BCST

[edit]

The following makes use ofaxiom schemas, i.e. axioms for some collection of predicates.Some of the stated axiom schemas shall allow for any collection of set parameters as well (meaning any particular named variablesv0,v1,,vn{\displaystyle v_{0},v_{1},\dots ,v_{n}}). That is, instantiations of the schema are permitted in which the predicate (some particularϕ{\displaystyle \phi }) also depends on a number of further set variables and the statement of the axiom is understood with corresponding extra outer universal closures (as inv0.v1.vn.{\displaystyle \forall v_{0}.\forall v_{1}.\cdots \forall v_{n}.}).

Separation

[edit]

Basic constructive set theoryBCST{\displaystyle {\mathsf {BCST}}} consists of several axioms also part of standard set theory, except the so called "full"Separation axiom is weakened.Beyond the four axioms above, it postulates Predicative Separation as well as the Replacement schema.

Axiom schema of predicative separation: For anybounded predicateϕ{\displaystyle \phi }, with parameters and with set variabley{\displaystyle y} not free in it,

y.s.x.(xs(xyϕ(x))){\displaystyle \forall y.\,\exists s.\forall x.{\big (}x\in s\,\leftrightarrow \,(x\in y\land \phi (x)){\big )}}

This axiom amounts to postulating the existence of a sets{\displaystyle s} obtained by theintersection of any sety{\displaystyle y} and any predicatively described class{xϕ(x)}{\displaystyle \{x\mid \phi (x)\}}.For anyz{\displaystyle z} proven to be a set, when the predicate is taken asϕ(x):=xz{\displaystyle \phi (x):=x\in z}, one obtains the binary intersection of sets and writess=yz{\displaystyle s=y\cap z}. Intersection corresponds to conjunction in an analog way to how union corresponds to disjunction.

When the predicate is taken as the negationϕ(x):=xz{\displaystyle \phi (x):=x\notin z}, one obtains the difference principle, granting existence of any setyz{\displaystyle y\setminus z}. Note that sets likeyy{\displaystyle y\setminus y} or{xy¬(x=x)}{\displaystyle \{x\in y\mid \neg (x=x)\}} are always empty. So, as noted, from Separation and the existence of at least one set (e.g. Infinity below) will follow the existence of the empty set{}{\displaystyle \{\}} (also denoted0{\displaystyle 0}).Within this conservative context ofBCST{\displaystyle {\mathsf {BCST}}}, the Predicative Separation schema is actually equivalent to Empty Set plus the existence of the binary intersection for any two sets. The latter variant of axiomatization does not make use of a formula schema.

Predicative Separation is a schema that takes into account syntactic aspects of set defining predicates, up to provable equivalence. The permitted formulas are denoted byΔ0{\displaystyle \Delta _{0}}, the lowest level in the set theoreticalLévy hierarchy.[13]General predicates in set theory are never syntactically restricted in such a way and so, in praxis, generic subclasses of sets are still part of the mathematical language. As the scope of subclasses that are provably sets is sensitive to what sets already exist, this scope is expanded when further set existence postulates added added.

For a propositionP{\displaystyle P}, a recurring trope in the constructive analysis of set theory is to view the predicatex=0P{\displaystyle x=0\land P} as the subclassB:={x1P}{\displaystyle B:=\{x\in 1\mid P\}} of the second ordinal1:=S0={0}{\displaystyle 1:=S0=\{0\}}. If it is provable thatP{\displaystyle P} holds, or¬P{\displaystyle \neg P}, or¬¬P{\displaystyle \neg \neg P}, thenB{\displaystyle B} is inhabited, or empty (uninhabited), or non-empty (not uninhabited), respectively.Clearly,P{\displaystyle P} is equivalent to both the proposition0B{\displaystyle 0\in B}, and alsoB=1{\displaystyle B=1}. Likewise,¬P{\displaystyle \neg P} is equivalent toB=0{\displaystyle B=0} and, equivalently, also¬(0B){\displaystyle \neg (0\in B)}. So, here,B{\displaystyle B} being detachable from1{\displaystyle 1} exactly meansP¬P{\displaystyle P\lor \neg P}.In the model of the naturals, ifB{\displaystyle B} is a number,0B{\displaystyle 0\in B} also expresses that0{\displaystyle 0} is smaller thanB{\displaystyle B}. The union that is part of the successor operation definition above may be used to express the excluded middle statement as0SB{\displaystyle 0\in SB}. In words,P{\displaystyle P} is decidable if and only if the successor ofB{\displaystyle B} is larger than the smallest ordinal0{\displaystyle 0}. The propositionP{\displaystyle P} is decided either way through establishinghow0{\displaystyle 0} is smaller: By0{\displaystyle 0} already being smaller thanB{\displaystyle B}, or by0{\displaystyle 0} beingSB{\displaystyle SB}'s direct predecessor. Yet another way to express excluded middle forP{\displaystyle P} is as the existence of a least number member of the inhabited classb:=B{1}{\displaystyle b:=B\cup \{1\}}.

If one's separation axiom allows for separation withP{\displaystyle P}, thenB{\displaystyle B} is a subset, which may be called thetruth value associated withP{\displaystyle P}. Two truth values can be proven equal, as sets, by proving an equivalence. In terms of this terminology, the collection of proof values can a priori be understood to be rich. Unsurprisingly, decidable propositions have one of a binary set of truth values. The excluded middle disjunction for thatP{\displaystyle P} is then also implied by the global statementb.(0b)(0b){\displaystyle \forall b.(0\in b)\lor (0\notin b)}.

No universal set

[edit]

When using the informal class terminology, any set is also considered a class. At the same time, there do arise so calledproper classes that can have no extension as a set. When in a theory there is a proof of¬x.Ax{\displaystyle \neg \exists x.A\subset x}, thenA{\displaystyle A} must be proper. (When taking up the perspective ofZF{\displaystyle {\mathsf {ZF}}} on sets, a theory which has full Separation, proper classes are generally thought of as those that are "too big" to be a set. More technically, they are subclasses of thecumulative hierarchy that extend beyond any ordinal bound.)

By a remark in the section on merging sets, a set cannot consistently ruled out to be a member of a class of the formA{xxA}{\displaystyle A\cup \{x\mid x\notin A\}}. A constructive proof that it is in that class contains information. Now ifA{\displaystyle A} is a set, then the class{xxA}{\displaystyle \{x\mid x\notin A\}} is provably proper. The following demonstrates this in the special case whenA{\displaystyle A} is empty, i.e. when the right side is the universal class. Being negative results, it reads as in the classical theory.

The following holds for any relationE{\displaystyle E}. It gives a purely logical condition such that two termss{\displaystyle s} andy{\displaystyle y} cannot beE{\displaystyle E}-related to one another.

(x.xEs(xEy¬xEx))¬(yEssEssEy){\displaystyle {\big (}\forall x.xEs\leftrightarrow (xEy\land \neg xEx){\big )}\to \neg (yEs\lor sEs\lor sEy)}

Most important here is the rejection of the final disjunct,¬sEy{\displaystyle \neg sEy}. The expression¬(xx){\displaystyle \neg (x\in x)} does not involve unbounded quantification and is thus allowed in Separation.Russel's construction in turn shows that{xyxx}y{\displaystyle \{x\in y\mid x\notin x\}\notin y}. So for any sety{\displaystyle y}, Predicative Separation alone implies that there exists a set which is not a member ofy{\displaystyle y}. In particular, nouniversal set can exist in this theory.

In a theory further adopting theaxiom of regularity, likeZF{\displaystyle {\mathsf {ZF}}}, provenlyxx{\displaystyle x\in x} is false for any setx{\displaystyle x}. There, this then means that the subset{xyxx}{\displaystyle \{x\in y\mid x\notin x\}} is equal toy{\displaystyle y} itself, and that the class{xxx}{\displaystyle \{x\mid x\in x\}} is the empty set.

For anyE{\displaystyle E} andy{\displaystyle y}, the special cases=y{\displaystyle s=y} in the formula above gives

¬(x.xEy¬xEx){\displaystyle \neg {\big (}\forall x.xEy\leftrightarrow \neg xEx{\big )}}

This already implies that no sety{\displaystyle y} equals the subclass{xxx}{\displaystyle \{x\mid x\notin x\}} of the universal class is, i.e. that subclass is a proper one as well. But even inZF{\displaystyle {\mathsf {ZF}}} without Regularity it is consistent for there to be a proper class of singletons which each contain exactly themselves.

As an aside, in a theory withstratification likeIntuitionistic New Foundations, the syntactic expressionxx{\displaystyle x\in x} may be disallowed in Separation. In turn, the above proof of negation of the existence of a universal set cannot be performed, in that theory.

Predicativity

[edit]

The axiom schema of Predicative Separation is also calledΔ0{\displaystyle \Delta _{0}}-Separation or Bounded Separation, as in Separation for set-bounded quantifiers only. (Warning note: The Lévy hierarchy nomenclature is in analogy toΔ00{\displaystyle \Delta _{0}^{0}} in thearithmetical hierarchy, albeit comparison can be subtle: The arithmetic classification is sometimes expressed not syntactically but in terms of subclasses of the naturals. Also, the bottom level of the arithmetical hierarchy has several common definitions, some not allowing the use of some total functions. A similar distinction is not relevant on the levelΣ10{\displaystyle \Sigma _{1}^{0}} or higher. Finally note that aΔ0{\displaystyle \Delta _{0}} classification of a formula may be expressed up to equivalence in the theory.)

The schema is also the way in whichMac Lane weakens a system close toZermelo set theoryZ{\displaystyle {\mathsf {Z}}}, for mathematical foundations related totopos theory. It is also used in the study ofabsoluteness, and there part of the formulation ofKripke-Platek set theory.

The restriction in the axiom is also gatekeepingimpredicative definitions: Existence should at best not be claimed for objects that are not explicitly describable, or whose definition involves themselves or reference to a proper class, such as when a property to be checked involves a universal quantifier. So in a constructive theory withoutAxiom of power set, whenR{\displaystyle R} denotes some 2-ary predicate, one should not generally expect a subclasss{\displaystyle s} ofy{\displaystyle y} to be a set, in case that it is defined, for example, as in

{xyt.((ty)R(x,t))}{\displaystyle \{x\in y\mid \forall t.{\big (}(t\subset y)\to R(x,t){\big )}\}},

or via a similar definitions involving any quantification over the setsty{\displaystyle t\subset y}. Note that if this subclasss{\displaystyle s} ofy{\displaystyle y} is provenly a set, then this subset itself is also in the unbounded scope of set variablet{\displaystyle t}. In other words, as the subclass propertysy{\displaystyle s\subset y} is fulfilled, this exact sets{\displaystyle s}, defined using the expressionR(x,s){\displaystyle R(x,s)}, would play a role in its own characterization.

While predicative Separation leads to fewer given class definitions being sets, it may be emphasized that many class definitions that are classically equivalent are not so when restricting oneself to the weaker logic. Due to the potentialundecidability of general predicates, the notion of subset and subclass is automatically more elaborate in constructive set theories than in classical ones. So in this way one has obtained a broader theory. This remains true if full Separation is adopted, such as in the theoryIZF{\displaystyle {\mathsf {IZF}}}, which however spoils theexistence property as well as the standard type theoretical interpretations, and in this way spoils a bottom-up view of constructive sets.As an aside, assubtyping is not a necessary feature ofconstructive type theory, constructive set theory can be said to quite differ from that framework.

Replacement

[edit]

Next consider the

Axiom schema of Replacement: For any predicateϕ{\displaystyle \phi } with set variabler{\displaystyle r} not free in it,

d.  (xd).!y.ϕ(x,y)r.y.(yr(xd).ϕ(x,y)){\displaystyle \forall d.\ \ \forall (x\in d).\exists !y.\phi (x,y)\to \exists r.\forall y.{\big (}y\in r\leftrightarrow \exists (x\in d).\phi (x,y){\big )}}

It is granting existence, as sets, of the range of function-like predicates, obtained via their domains. In the above formulation, the predicate is not restricted akin to the Separation schema, but this axiom already involves an existential quantifier in the antecedent. Of course, weaker schemas could be considered as well.

Via Replacement, the existence of any pair{x,y}{\displaystyle \{x,y\}} also follows from that of any other particular pair, such as{0,1}=2=SS0{\displaystyle \{0,1\}=2=SS0}. But as the binary union used inS{\displaystyle S} already made use of the Pairing axiom, this approach then necessitates postulating the existence of2{\displaystyle 2} over that of0{\displaystyle 0}. In a theory with the impredicative Powerset axiom, the existence of2PP0{\displaystyle 2\subset {\mathcal {P}}{\mathcal {P}}0} can also be demonstrated using Separation.

With the Replacement schema, the theory outlined thus far proves that theequivalence classes orindexed sums are sets. In particular, theCartesian product, holding all pairs of elements of two sets, is a set. In turn, for any fixed number (in the metatheory), the corresponding product expression, sayx×x×x×x{\displaystyle x\times x\times x\times x}, can be constructed as a set. The axiomatic requirements for sets recursively defined in the language are discussed further below. A setx{\displaystyle x} is discrete, i.e. equality of elements inside a setx{\displaystyle x} is decidable, if the corresponding relation as a subset ofx×x{\displaystyle x\times x} is decidable.

Replacement is relevant forfunction comprehension and can be seen as a form of comprehension more generally.Only when assumingPEM{\displaystyle {\mathrm {PEM} }} does Replacement already imply full Separation. InZF{\displaystyle {\mathsf {ZF}}}, Replacement is mostly important to prove the existence of sets of highrank, namely via instances of the axiom schema whereϕ(x,y){\displaystyle \phi (x,y)} relates relatively small setx{\displaystyle x} to bigger ones,y{\displaystyle y}.

Constructive set theories commonly have Axiom schema of Replacement, sometimes restricted to bounded formulas. However, when other axioms are dropped, this schema is actually often strengthened - not beyondZF{\displaystyle {\mathsf {ZF}}}, but instead merely to gain back some provability strength. Such stronger axioms exist that do not spoil the strongexistence properties of a theory, as discussed further below.

IfiX{\displaystyle i_{X}} is provenly a function onX{\displaystyle X} and it is equipped with a codomainY{\displaystyle Y} (all discussed in detail below), then theimage ofiX{\displaystyle i_{X}} is a subset ofY{\displaystyle Y}. In other approaches to the set concept, the notion of subsets is defined in terms of "operations", in this fashion.

Hereditarily finite sets

[edit]

Pendants of the elements of the class ofhereditarily finite setsH0{\displaystyle H_{\aleph _{0}}} can be implemented in any common programming language. The axioms discussed above abstract from common operations on theset data type: Pairing and Union are related to nesting andflattening, or taken together concatenation. Replacement is related tocomprehension and Separation is then related to the often simplerfiltering. Replacement together withSet Induction (introduced below) suffices to axiomizeH0{\displaystyle H_{\aleph _{0}}} constructively and that theory is also studied without Infinity.

A sort of blend between pairing and union, an axiom more readily related to the successor is theAxiom of adjunction.[14][15] Such principles are relevant for the standard modeling of individualNeumann ordinals. Axiom formulations also exist that pair Union and Replacement in one. While postulating Replacement is not a necessity in the design of a weak constructive set theory that is bi-interpretable with Heyting arithmeticHA{\displaystyle {\mathsf {HA}}}, some form of induction is. For comparison, consider the very weak classical theory calledGeneral set theory that interprets the class of natural numbers and their arithmetic via just Extensionality, Adjunction and full Separation.

The discussion now proceeds with axioms granting existence of objects which, in different but related form, are also found independent type theories, namelyproducts and the collection of natural numbers as a completed set. Infinite sets are particularly handy to reason about operations applied to sequences defined on unbounded indexdomains, say the formal differentiation of agenerating function or the addition of two Cauchy sequences.

ECST

[edit]

For some fixed predicateI{\displaystyle I} and a seta{\displaystyle a}, the statementI(a)(y.I(y)ay){\displaystyle I(a)\land {\big (}\forall y.I(y)\to a\subset y{\big )}} expresses thata{\displaystyle a} is the smallest (in the sense of "{\displaystyle \subset }") among all setsy{\displaystyle y} for whichI(y){\displaystyle I(y)} holds true, and that it is always a subset of suchy{\displaystyle y}. The aim of the axiom of infinity is to eventually obtainunique smallest inductive set.

In the context of common set theory axioms, one statement of infinitude is to state that a class is inhabited and also includes a chain of membership (or alternatively a chain of supersets). That is,

(z.zA)(xA).(sA).xs{\displaystyle {\big (}\exists z.z\in A{\big )}\land \forall (x\in A).\exists (s\in A).x\in s}.

More concretely, denote byIndA{\displaystyle \mathrm {Ind} _{A}} the inductive property,

(0A)(xA).SxA{\displaystyle (0\in A)\land \forall (x\in A).Sx\in A}.

In terms of a predicateQ{\displaystyle Q} underlying the class so thatx.(xA)Q(x){\displaystyle \forall x.(x\in A)\leftrightarrow Q(x)}, the latter translates toQ(0)x.(Q(x)Q(Sx)){\displaystyle Q(0)\land \forall x.{\big (}Q(x)\to Q(Sx){\big )}}.

WriteB{\displaystyle \bigcap B} for the general intersection{x(yB).xy}{\displaystyle \{x\mid \forall (y\in B).x\in y\}}. (A variant of this definition may be considered which requiresBB{\displaystyle \cap B\subset \cup B}, but we only use this notion for the following auxiliary definition.)

One commonly defines a classω={yIndy}{\displaystyle \omega =\bigcap \{y\mid \mathrm {Ind} _{y}\}}, the intersection of all inductive sets. (Variants of this treatment may work in terms of a formula that depends on a set parameterw{\displaystyle w} so thatωw{\displaystyle \omega \subset w}.) The classω{\displaystyle \omega } exactly holds allx{\displaystyle x} fulfilling the unbounded propertyy.Indyxy{\displaystyle \forall y.\mathrm {Ind} _{y}\to x\in y}. The intention is that if inductive sets exist at all, then the classω{\displaystyle \omega } shares each common natural number with them, and then the propositionωA{\displaystyle \omega \subset A}, by definition of "{\displaystyle \subset }", implies thatQ{\displaystyle Q} holds for each of these naturals. While bounded separation does not suffice to proveω{\displaystyle \omega } to be the desired set, the language here forms the basis for the following axiom, granting natural number induction for predicates that constitute a set.

The elementary constructive Set TheoryECST{\displaystyle {\mathsf {ECST}}} has the axiom ofBCST{\displaystyle {\mathsf {BCST}}} as well as the postulate

Strong Infinity

w.(Indw(y.Indywy)){\displaystyle \exists w.{\Big (}\mathrm {Ind} _{w}\,\land \,{\big (}\forall y.\mathrm {Ind} _{y}\to w\subset y{\big )}{\Big )}}

Going on, one takes the symbolω{\displaystyle \omega } to denote the now unique smallest inductive set, an unboundedvon Neumann ordinal. It contains the empty set and, for each set inω{\displaystyle \omega }, another set inω{\displaystyle \omega } that contains one element more.

Symbols called zero and successor are in thesignature of the theory ofPeano. InBCST{\displaystyle {\mathsf {BCST}}}, the above defined successor of any number also being in the classω{\displaystyle \omega } follow directly from the characterization of the natural naturals by our von Neumann model. Since the successor of such a set contains itself, one also finds that no successor equals zero. So two of thePeano axioms regarding the symbols zero and the one regarding closedness ofS{\displaystyle S} come easily. Fourthly, inECST{\displaystyle {\mathsf {ECST}}}, whereω{\displaystyle \omega } is a set,S{\displaystyle S} onω{\displaystyle \omega } can be proven to be an injective operation.

For some predicate of setsP{\displaystyle P}, the statementS.(SωP(S)){\displaystyle \forall S.(S\subset \omega \to P(S))} claimsP{\displaystyle P} holds for all subsets of the set of naturals. And the axiom now proves such sets do exist. Such quantification is also possible insecond-order arithmetic.

The pairwise order "<{\displaystyle <}" on the naturals is captured by their membership relation "{\displaystyle \in }". The theory proves the order as well as the equality relation on this set to be decidable. Not only is nonumber smaller than0{\displaystyle 0}, but induction implies that among subsets ofω{\displaystyle \omega }, it is exactly the empty set which has no least member. The contrapositive of this proves the double-negatedleast number existence for all non-empty subsets ofω{\displaystyle \omega }. Another valid principle also classically equivalent to it is least number existence for all inhabited detachable subsets. That said, the bare existence claim for the inhabited subsetb:={z1P}{1}{\displaystyle b:=\{z\in 1\mid P\}\cup \{1\}} ofω{\displaystyle \omega } is equivalent to excluded middle forP{\displaystyle P}, and a constructive theory will therefore not proveω{\displaystyle \omega } to bewell-ordered.

Weaker formulations of infinity

[edit]

Should it need motivation, the handiness of postulating an unbounded set of numbers in relation to other inductive properties becomes clear in the discussion of arithmetic in set theory further below. But as is familiar from classical set theory, also weak forms of Infinity can be formulated. For example, one may just postulate the existence of some inductive set,y.Indy{\displaystyle \exists y.\mathrm {Ind} _{y}} - such an existence postulate suffices when full Separation may then be used to carve out the inductive subsetw{\displaystyle w} of natural numbers, the shared subset of all inductive classes. Alternatively, more specific mere existence postulates may be adopted. Either which way, the inductive set then fulfills the followingΔ0{\displaystyle \Delta _{0}} predecessor existence property in the sense of thevon Neumann model:

m.(mw)(m=0(pw).Sp=m){\displaystyle \forall m.(m\in w)\leftrightarrow {\big (}m=0\lor \exists (p\in w).Sp=m{\big )}}

Without making use of the notation for the previously defined successor notation, the extensional equality to a successorSp=m{\displaystyle Sp=m} is captured byn.(nm)(n=pnp){\displaystyle \forall n.(n\in m)\leftrightarrow (n=p\lor n\in p)}. This expresses that all elementsm{\displaystyle m} are either equal to0{\displaystyle 0} or themselves hold a predecessor setpw{\displaystyle p\in w} which shares all other members withm{\displaystyle m}.

Observe that through the expression "(pw){\displaystyle \exists (p\in w)}" on the right hand side, the property characterizingw{\displaystyle w} by its membersm{\displaystyle m} here syntactically again contains the symbolw{\displaystyle w} itself. Due to the bottom-up nature of the natural numbers, this is tame here. AssumingΔ0{\displaystyle \Delta _{0}}-set induction on top ofECST{\displaystyle {\mathsf {ECST}}}, no two different sets have this property.Also note that there are alsolonger formulations of this property, avoiding "(pw){\displaystyle \exists (p\in w)}" in favor unbounded quantifiers.

Number bounds

[edit]

Adopting an Axiom of Infinity, the set-bounded quantification legal in predicates used inΔ0{\displaystyle \Delta _{0}}-Separation then explicitly permits numerically unbounded quantifiers - the two meanings of "bounded" should not be confused.Withω{\displaystyle \omega } at hand, call a class of numbersIω{\displaystyle I\subset \omega }bounded if the followingexistence statement holds

(mω).(nω).(nIn<m){\displaystyle \exists (m\in \omega ).\forall (n\in \omega ).(n\in I\to n<m)}

This is a statements of finiteness, also equivalently formulated viamnnI{\displaystyle m\leq n\to n\notin I}. Similarly, to reflect more closely the discussion of functions below, consider the above condition in the form(mω).(nI).(n<m){\displaystyle \exists (m\in \omega ).\forall (n\in I).(n<m)}.For decidable properties, these areΣ20{\displaystyle \Sigma _{2}^{0}}-statements in arithmetic, but with the Axiom of Infinity, the two quantifiers are set-bound.

For a classC{\displaystyle C}, the logically positive unboundedness statement

(kω).(jω).(kjjC){\displaystyle \forall (k\in \omega ).\exists (j\in \omega ).(k\leq j\land j\in C)}

is now also one of infinitude. It isΠ20{\displaystyle \Pi _{2}^{0}} in the decidable arithmetic case. To validate infinitude of a set, this property even works if the set holds other elements besides infinitely many of members ofω{\displaystyle \omega }.

Moderate induction in ECST

[edit]

In the following, an initial segment of the natural numbers, i.e.{nωn<m}{\displaystyle \{n\in \omega \mid n<m\}} for anymω{\displaystyle m\in \omega } and including the empty set, is denoted by{0,1,,m1}{\displaystyle \{0,1,\dots ,m-1\}}. This set equalsm{\displaystyle m} and so at this point "m1{\displaystyle m-1}" is mere notation for its predecessor (i.e. not involving subtraction function).

It is instructive to recall the way in which a theory with set comprehension and extensionality ends up encoding predicate logic. Like any class in set theory, a set can be read as corresponding to predicates on sets. For example, an integer is even if it is a member of the set of even integers, or a natural number has a successor if it is a member of the set of natural numbers that have a successor. For a less primitive example, fix some sety{\displaystyle y} and letQ(n){\displaystyle Q(n)} denote the existential statement that the function space on the finite ordinal intoy{\displaystyle y} exist. The predicate will be denotedh.hy{0,1,,n1}{\displaystyle \exists h.h\simeq y^{\{0,1,\dots ,n-1\}}} below, and here the existential quantifier is not merely one over natural numbers, nor is it bounded by any other set. Now a proposition like the finite exponentiation principle(nω).Q(n){\displaystyle \forall (n\in \omega ).Q(n)} and, less formally, the equalityω={nωQ(n)}{\displaystyle \omega =\{n\in \omega \mid Q(n)\}} are just two ways of formulating the same desired statement, namely ann{\displaystyle n}-indexed conjunction of existential propositions wheren{\displaystyle n} ranges over the set of all naturals. Via extensional identification, the second form expresses the claim using notation for subclass comprehension and the bracketed object on the right hand side may not even constitute a set. If that subclass is not provably a set, it may not actually be used in many set theory principles in proofs, and establishing the universal closure(nω).Q(n){\displaystyle \forall (n\in \omega ).Q(n)} as a theorem may not be possible. The set theory can thus be strengthened by more set existence axioms, to be used with predicativebounded Separation, but also by just postulating stronger{\displaystyle \forall }-statements.

The second universally quantified conjunct in the strong axiom of Infinity expresses mathematical induction for ally{\displaystyle y} in the universe of discourse, i.e. for sets. This is because the consequent of this clause,ωy{\displaystyle \omega \subset y}, states that allnω{\displaystyle n\in \omega } fulfill the associated predicate. Being able to use predicative separation to define subsets ofω{\displaystyle \omega }, the theory proves induction for all predicatesϕ(n){\displaystyle \phi (n)} involving only set-bounded quantifiers. This role of set-bounded quantifiers also means that more set existence axioms impact the strength of this induction principle, further motivating the function space and collection axioms that will be a focus of the rest of the article. Notably,ECST{\displaystyle {\mathsf {ECST}}} already validates induction with quantifiers over the naturals, and hence induction as in the first-order arithmetic theoryHA{\displaystyle {\mathsf {HA}}}. The so called axiom offull mathematical induction for any predicate (i.e. class) expressed through set theory language is far stronger than the bounded induction principle valid inECST{\displaystyle {\mathsf {ECST}}}. The former induction principle could be directly adopted, closer mirroring second-order arithmetic. In set theory it also follows from full (i.e. unbounded) Separation, which says that all predicates on{\displaystyle \forall } are sets. Mathematical induction is also superseded by the (full) Set induction axiom.

Warning note: In naming induction statements, one must take care not to conflate terminology with arithmetic theories. The first-order induction schema of natural number arithmetic theory claims induction for all predicates definable in the language offirst-order arithmetic, namely predicates of just numbers. So to interpret the axiom schema ofHA{\displaystyle {\mathsf {HA}}}, one interprets these arithmetical formulas. In that context, thebounded quantification specifically means quantification over a finite range of numbers.One may also speak about the induction in the first-order but two-sorted theory of so-called second-order arithmeticZ2{\displaystyle {\mathsf {Z}}_{2}}, in a form explicitly expressed for subsets of the naturals. That class of subsets can be taken to correspond to a richer collection of formulas than the first-order arithmetic definable ones. In the program ofreverse mathematics, all mathematical objects discussed are encoded as naturals or subsets of naturals. Subsystems ofZ2{\displaystyle {\mathsf {Z}}_{2}} with very lowcomplexity comprehension studied in that framework have a language that does not merely expressarithmetical sets, while all sets of naturals particular such theories prove to exist are justcomputable sets. Theorems therein can be a relevant reference point for weak set theories with a set of naturals, predicative separation and only some further restricted form of induction. Constructive reverse mathematics exists as a field but is less developed than its classical counterpart.[16]Z2{\displaystyle {\mathsf {Z}}_{2}} shall moreover not be confused with the second-order formulation of Peano arithmeticPA2{\displaystyle {\mathsf {PA}}_{2}}. Typical set theories like the one discussed here are also first-order, but those theories are not arithmetics and so formulas may also quantify over the subsets of the naturals. When discussing the strength of axioms concerning numbers, it is also important to keep in mind that the arithmetical and the set theoretical framework do not share a commonsignature. Likewise, care must always be taken with insights abouttotality of functions. Incomputability theory, theμ operator enables all partialgeneral recursive functions (or programs, in the sense that they are Turing computable), including ones e.g. non-primitive recursive butPA{\displaystyle {\mathsf {PA}}}-total, such as theAckermann function. The definition of the operator involves predicates over the naturals and so the theoretical analysis of functions and their totality depends on the formal framework and proof calculus at hand.

Functions

[edit]

General note on programs and functions

[edit]

Naturally, the meaning of existence claims is a topic of interest in constructivism, be it for a theory of sets or any other framework.LetR{\displaystyle R} express a property such that a mathematical framework validates what amounts to the statement

(aA).(cC).R(a,c){\displaystyle \forall (a\in A).\exists (c\in C).R(a,c)}

A constructive proof calculus may validate such a judgement in terms of programs on represented domains and some object representing a concrete assignmentaca{\displaystyle a\mapsto c_{a}}, providing a particular choice of value inC{\displaystyle C} (aunique one), for each input fromA{\displaystyle A}. Expressed through therewriting(aA).R(a,ca){\displaystyle \forall (a\in A).R(a,c_{a})}, this function object may be understood aswitnessing the proposition. Consider for example the notions of proof in throughrealizability theory or function terms in atype theory with a notion of quantifiers. The latter captures proof of logical proposition through programs via theCurry–Howard correspondence.

Depending on the context, the word "function" may be used in association with a particularmodel of computation, and this is a priori narrower than what is discussed in the present set theory context. One notion ofprogram is formalized bypartial recursive "functions" incomputability theory. But beware that here the word "function" is used in a way that also comprisespartial functions, and not just "total functions". The scare quotes are used for clarity here, as in a set theory context there is technically no need to speak oftotal functions, because this requirement is part of the definition of a set theoretical function and partial function spaces can be modeled via unions. At the same time, when combined with a formal arithmetic, partial function programs provides one particularly sharp notion of totality for functions. ByKleene's normal form theorem, each partial recursive function on the naturals computes, for the values where it terminates, the same asaU(μw.T1(e,a,w)){\displaystyle a\mapsto U(\mu w.T_{1}(e,a,w))}, for some partial function program indexeN{\displaystyle e\in {\mathbb {N} }}, and any index will constitute some partial function. A program can be associated with ae{\displaystyle e} and may be said to beT1{\displaystyle T_{1}}-total whenever a theory provesa.w.T1(e,a,w){\displaystyle \forall a.\exists w.T_{1}(e,a,w)}, whereT1{\displaystyle T_{1}} amounts to a primitive recursive program andw{\displaystyle w} is related to the execution ofe{\displaystyle e}.Kreisel proved that the class of partial recursive functions provenT1{\displaystyle T_{1}}-total byHA{\displaystyle {\mathsf {HA}}} is not enriched whenPEM{\displaystyle {\mathrm {PEM} }} is added.[17] As a predicate ine{\displaystyle e}, this totality constitutes anundecidable subset of indices, highlighting that the recursive world of functions between the naturals is already captured by a set dominated byN{\displaystyle {\mathbb {N} }}. As a third warning, note that this notion is really about programs and several indices will in fact constitute the same function, in theextensional sense.

A theory infirst-order logic, such as the axiomatic set theories discussed here, comes with a joint notion oftotal andfunctional for a binary predicateR{\displaystyle R}, namelya.!c.R(a,c){\displaystyle \forall a.\exists !c.R(a,c)}. Such theories relate to programs only indirectly. IfS{\displaystyle S} denotes the successor operation in a formal language of a theory being studied, then any number, e.g.SSS0{\displaystyle {\mathrm {SSS0} }} (the number three), maymetalogically be related to the standard numeral, e.g.SSS0_=SSS0{\displaystyle {\underline {\mathrm {SSS0} }}=SSS0}. Similarly, programs in the partial recursive sense may be unrolled to predicates and weak assumptions suffice so that such a translation respects equality of their return values. Among finitely axiomizable sub-theories ofPA{\displaystyle {\mathsf {PA}}}, classicalRobinson arithmeticQ{\displaystyle {\mathsf {Q}}} exactly fulfills this. Its existence claims are intended to only concern natural numbers and instead of using the fullmathematical induction schema for arithmetic formulas, the theories' axioms postulate that every number is either zero or that there exists a predecessor number to it. Focusing onT1{\displaystyle T_{1}}-total recursive functions here, it is a meta-theorem that the language of arithmetic expresses them byΣ1{\displaystyle \Sigma _{1}}-predicatesG{\displaystyle G} encoding their graph such thatQ{\displaystyle {\mathsf {Q}}}represents them, in the sense that it correctly proves or rejectsG(a_,c_){\displaystyle G({\underline {\mathrm {a} }},{\underline {\mathrm {c} }})} for any input-output pair of numbersa{\displaystyle \mathrm {a} } andc{\displaystyle \mathrm {c} } in the meta-theory. Now given a correctly representingG{\displaystyle G}, the predicateGleast(a,c){\displaystyle G_{\mathrm {least} }(a,c)} defined byG(a,c)(n<c).¬G(a,n){\displaystyle G(a,c)\land \forall (n<c).\neg G(a,n)} represents the recursive function just as well, and as this explicitly only validates the smallest return value, the theory also proves functionality for all inputsa{\displaystyle {\mathrm {a} }} in the sense ofQ!c.Gleast(a_,c){\displaystyle {\mathsf {Q}}\vdash \exists !c.G_{\mathrm {least} }({\underline {\mathrm {a} }},c)}. Given a representing predicate, then at the cost of making use ofPEM{\displaystyle {\mathrm {PEM} }}, one can always also systematically (i.e. with aa.{\displaystyle \forall a.}) prove the graph to be total functional.[18]

Which predicates are provably functional for various inputs, or even total functional on their domain, generally depends on the adopted axioms of a theory and proof calculus. For example, for thediagonal halting problem, which cannot have aT1{\displaystyle T_{1}}-total index, it isHA{\displaystyle {\mathsf {HA}}}-independent whether the corresponding graph predicate onN×{0,1}{\displaystyle {\mathbb {N} }\times \{0,1\}} (adecision problem) is total functional, butPEM{\displaystyle {\mathrm {PEM} }} implies that it is. Proof theoreticalfunction hierarchies provide examples of predicates proven total functional in systems going beyondPA{\displaystyle {\mathsf {PA}}}.Which sets proven to exist do constitute a total function, in the sense introduced next, also always depends on the axioms and the proof calculus. Finally, note that the soundness of halting claims is a metalogical property beyond consistency, i.e. a theory may be consistent and from it one may prove that some program will eventually halt, despite this never actually occurring when said program is run. More formally, assuming consistency of a theory does not imply it is also arithmeticallyΣ1{\displaystyle \Sigma _{1}}-sound.

Total functional relations

[edit]

In set theory language here, speak of afunction class whenfA×C{\displaystyle f\subset A\times C} and provenly

(aA).!(cC).a,cf{\displaystyle \forall (a\in A).\,\exists !(c\in C).\langle a,c\rangle \in f}.

Notably, this definition involves quantifier explicitly asking for existence - an aspect which is particularly important in the constructive context. In words: For everya{\displaystyle a}, it demands the unique existence of ac{\displaystyle c} so thata,cf{\displaystyle \langle a,c\rangle \in f}. In the case that this holds one may usefunction application bracket notation and writef(a)=c{\displaystyle f(a)=c}. The above property may then be stated as(aA).!(cC).f(a)=c{\displaystyle \forall (a\in A).\,\exists !(c\in C).f(a)=c}. This notation may be extended to equality of function values. Some notational conveniences involving function application will only work when a set has indeed been established to be a function.LetCA{\displaystyle C^{A}} (also writtenAC{\displaystyle ^{A}C}) denote the class of sets that fulfill the function property. This is the class of functions fromA{\displaystyle A} toC{\displaystyle C} in a pure set theory. Below the notationxy{\displaystyle x\to y} is also used foryx{\displaystyle y^{x}}, for the sake of distinguishing it from ordinal exponentiation. When functions are understood as just function graphs as here, the membership propositionfCA{\displaystyle f\in C^{A}} is also writtenf:AC{\displaystyle f\colon A\to C}. The Boolean-valuedχB:A{0,1}{\displaystyle \chi _{B}\colon A\to \{0,1\}} are among the classes discussed in the next section.

By construction, any such function respects equality in the sense that(x=y)f(x)=f(y){\displaystyle (x=y)\to f(x)=f(y)}, for any inputs fromA{\displaystyle A}. This is worth mentioning since also more broader concepts of "assignment routines" or "operations" exist in the mathematical literature, which may not in general respect this. Variants of the functional predicate definition usingapartness relations onsetoids have been defined as well. A subset of a function is still a function and the function predicate may also be proven for enlarged chosen codomain sets. As noted, care must be taken with nomenclature "function", a word which sees use in most mathematical frameworks. When a function set itself is not tied to a particular codomain, then this set of pairs is also member of a function space with larger codomain. This do not happen when by the word one denotes the subset of pairs paired with a codomain set, i.e. a formalization in terms of(A×C)×{C}{\displaystyle (A\times C)\times \{C\}}. This is mostly a matter of bookkeeping, but affects how other predicates are defined, question of size. This choice is also just enforced by some mathematical frameworks. Similar considerations apply to any treatment ofpartial functions and their domains.

If both the domainA{\displaystyle A} and considered codomainC{\displaystyle C} are sets, then the above function predicate only involves bounded quantifiers. Common notions such asinjectivity andsurjectivity can be expressed in a bounded fashion as well, and thus so isbijectivity. Both of these tie in to notions of size. Importantly, injection existence between any two sets provides apreorder. A power class does not inject into its underlying set and the latter does not map onto the former. Surjectivity is formally a more complex definition. Note that injectivity shall be defined positively, not by its contrapositive, which is common practice in classical mathematics. The version without negations is sometimes called weakly injective. The existence of value collisions is a strong notion of non-injectivity. And regarding surjectivity, similar considerations exist for outlier-production in the codomain.

Whether a subclass (or predicate for that matter) can be judged to be a function set, or even total functional to begin with, will depend on the strength of the theory, which is to say the axioms one adopts. And notably, a general class could also fulfill the above defining predicate without being a subclass of the productA×C{\displaystyle A\times C}, i.e. the property is expressing not more or less than functionality w.r.t. the inputs fromA{\displaystyle A}.Now if the domain is a set, the function comprehension principle, also calledaxiom of unique choice or non-choice, says that a function as a set, with some codomain, exists well. (And this principle is valid in a theory likeCZF{\displaystyle {\mathsf {CZF}}}. Also compare with theReplacement axiom.) That is, the mapping information exists as set and it has a pair for each element in the domain. Of course, for any set from some class, one may always associate unique element of the singleton1{\displaystyle 1}, which shows that merely a chosen range being a set does not suffice to be granted a function set.It is a metatheorem for theories containingBCST{\displaystyle {\mathsf {BCST}}} that adding a function symbol for a provenly total class function is a conservative extension, despite this formally changing the scope ofbounded Separation.In summary, in the set theory context the focus is on capturing particulartotal relations that are functional. To delineate the notion offunction in the theories of the previous subsection (a 2-ary logical predicate defined to express a functions graph, together with a proposition that it is total and functional) from the "material" set theoretical notion here, one may explicitly call the lattergraph of a function,anafunction orset function. The axiom schema of Replacement can also be formulated in terms of the ranges of such set functions.

Finitude

[edit]

One defines three distinct notions involving surjections. For a general set to be (Bishop-)finite shall mean there is a bijective function to a natural. If the existence of such a bijection is proven impossible, the set is callednon-finite. Secondly, for a notion weaker than finite, to befinitely indexed (orKuratowski-finite) shall mean that there is a surjection from a von Neumann natural number onto it. In programming terms, the element of such a set are accessible in a (ending)for-loop, and only those, while it may not be decidable whether repetition occurred. Thirdly, call a setsubfinite if it is the subset of a finite set, which thus injects into that finite set. Here, a for-loop will access all of the set's members, but also possibly others. For another combined notion, one weaker than finitely indexed, to besubfinitely indexed means to be in the surjective image of a subfinite set, and inETCS{\displaystyle {\mathsf {ETCS}}} this just means to be the subset of a finitely indexed set, meaning the subset can also be taken on the image side instead of the domain side. A set exhibiting either of those notions can be understood to be majorized by a finite set, but in the second case the relation between the sets members is not necessarily fully understood. In the third case, validating membership in the set is generally more difficult, and not even membership of its member with respect to some superset of the set is necessary fully understood.The claim that being finite is equivalent to being subfinite, for all sets, is equivalent toPEM{\displaystyle {\mathrm {PEM} }}.More finiteness properties for a setX{\displaystyle X} can be defined, e.g. expressing the existence of some large enough natural such that a certain class of functions on the naturals always fail to map to distinct elements inX{\displaystyle X}. One definition considers some notion of non-injectivity intoX{\displaystyle X}. Other definitions consider functions to a fixed superset ofX{\displaystyle X} with more elements.

Terminology for conditions of finiteness and infinitude may vary. Notably, subfinitely indexed sets (a notion necessarily involving surjections) are sometimes called subfinite (which can be defined without functions). The property of being finitely indexed could also be denoted "finitely countable", to fit the naming logic, but is by some authors also called finitely enumerable (which might be confusing as this suggest an injection in the other direction). Relatedly, the existence of a bijection with a finite set has not established, one may say a set is not finite, but this use of language is then weaker than to claim the set to be non-finite. The same issue applies to countable sets (not proven countable vs. proven non-countable), et cetera. A surjective map may also be called an enumeration.

Infinitude

[edit]

The setω{\displaystyle \omega } itself is clearly unbounded. In fact, for any surjection from a finite range ontoω{\displaystyle \omega }, one may construct an element that is different from any element in the functions range. Where needed, this notion of infinitude can also be expressed in terms of an apartness relation on the set in question. Being not Kuratowski-finite implies being non-finite and indeed the naturals shall not be finite in any sense. Commonly, the word infinite is used for the negative notion of being non-finite. Further, observe thatω{\displaystyle \omega }, unlike any of its members, can be put in bijection with some of its proper unbounded subsets, e.g. those of the formwm:={kωk>m}{\displaystyle w_{m}:=\{k\in \omega \mid k>m\}} for anymω{\displaystyle m\in \omega }. This validates the formulations ofDedekind-infinite. So more generally than the property of infinitude in the previous section on number bounds, one may call a setinfinite in the logically positive sense if one can injectω{\displaystyle \omega } into it. A set that is even in bijection withω{\displaystyle \omega } may be called countably infinite. A set isTarski-infinite if there is a chain of{\displaystyle \subset }-increasing subsets of it. Here each set has new elements compared to its predecessor and the definition does not speak of sets growing rank. There are indeed plenty of properties characterizing infinitude even in classicalZF{\displaystyle {\mathsf {ZF}}} and that theory does not prove all non-finite sets to be infinite in the injection existence sense, albeitit there holds when further assuming countable choice.ZF{\displaystyle {\mathsf {ZF}}} without any choice even permits cardinals aside thealeph numbers, and there can then be sets that negate both of the above properties, i.e. they are both non-Dedekind-infinite and non-finite (also called Dedekind-finite infinite sets).

Call an inhabited setcountable if there exists a surjection fromω{\displaystyle \omega } onto it andsubcountable if this can be done from some subset ofω{\displaystyle \omega }. Call a setenumerable if there exists an injection toω{\displaystyle \omega }, which renders the set discrete. Notably, all of these are function existence claims. The empty set is not inhabited but generally deemed countable too, and note that the successor set of any countable set is countable. The setω{\displaystyle \omega } is trivially infinite, countable and enumerable, as witnessed by the identity function. Also here, in strong classical theories many of these notions coincide in general and, as a result, the naming conventions in the literature are inconsistent. An infinite, countable set isequinumeros toω{\displaystyle \omega }.

There are also various ways to characterize logically negative notion. The notion of uncountability, in the sense of being not countable, is also discussed in conjunction with the exponentiation axiom further below. Another notion of uncountability ofX{\displaystyle X} is given when one can produce a member in the compliment of any ofX{\displaystyle X}'s countable subsets. Moreproperties of finiteness may be defined as negations of such properties, et cetera.

Characteristic functions

[edit]

Separation lets us cut out subsets of productsA×C{\displaystyle A\times C}, at least when they are described in a bounded fashion. Given anyBA{\displaystyle B\subset A}, one is now led to reason about classes such as

XB:={x,yA×{0,1}(xBy=1)(xBy=0)}.{\displaystyle X_{B}:={\big \{}\langle x,y\rangle \in A\times \{0,1\}\mid (x\in B\land y=1)\lor (x\notin B\land y=0){\big \}}.}

Since¬(0=1){\displaystyle \neg (0=1)}, one has

(aB a,1XB)(aB a,0XB){\displaystyle {\big (}a\in B\ \leftrightarrow \,\langle a,1\rangle \in X_{B}{\big )}\,\land \,{\big (}a\notin B\ \leftrightarrow \,\langle a,0\rangle \in X_{B}{\big )}}

and so

(aBaB)  !(y{0,1}).a,yXB{\displaystyle {\big (}a\in B\lor a\notin B{\big )}\ \leftrightarrow \ \exists !(y\in \{0,1\}).\langle a,y\rangle \in X_{B}}.

But be aware that in absence of any non-constructive axiomsaB{\displaystyle a\in B} may in generallynot be decidable, since one requires an explicit proof of either disjunct. Constructively, when(y{0,1}).x,yXB{\displaystyle \exists (y\in \{0,1\}).\langle x,y\rangle \in X_{B}} cannot bewitnessed for all thexA{\displaystyle x\in A}, or uniqueness of the termsy{\displaystyle y} associated with eachx{\displaystyle x} cannot be proven, then one cannot judge the comprehended collection to be total functional.Case in point: The classical derivation ofSchröder–Bernstein relies on case analysis - but to constitute afunction, particular cases shall actually be specifiable, given any input from the domain. It has been established that Schröder–Bernstein cannot have a proof on the base ofIZF{\displaystyle {\mathsf {IZF}}} plus constructive principles.[19] So to the extent thatintuitionistic inference does not go beyond what is formalized here, there is no generic construction of a bijection from two injections in opposing directions.

But being compatible withZF{\displaystyle {\mathsf {ZF}}}, the development in this section still always permits "function onω{\displaystyle \omega }" to be interpreted as a completed object that is also not necessarily given aslawlike sequence. Applications may be found in the common models for claims about probability, e.g. statements involving the notion of "being given" an unending random sequence of coin flips, even if many predictions can also be expressed in terms ofspreads.

If indeed one is given a functionχB:A{0,1}{\displaystyle \chi _{B}\colon A\to \{0,1\}}, it is thecharacteristic function actually deciding membership in some detachable subsetBA{\displaystyle B\subset A} and

B={nωχB(n)=1}.{\displaystyle B=\{n\in \omega \mid \chi _{B}(n)=1\}.}

Per convention, the detachable subsetB{\displaystyle B},χB{\displaystyle \chi _{B}} as well as any equivalent of the formulasnB{\displaystyle n\in B} andχB(n)=1{\displaystyle \chi _{B}(n)=1} (withn{\displaystyle n} free) may be referred to as adecidable property or set onA{\displaystyle A}.

One may call a collectionA{\displaystyle A} searchable forχB{\displaystyle \chi _{B}} if existence is actually decidable,

(xA).χB(x)=1  (xA).χB(x)=0.{\displaystyle \exists (x\in A).\chi _{B}(x)=1\ \lor \ \forall (x\in A).\chi _{B}(x)=0.}

Now consider the caseA=ω{\displaystyle A=\omega }. IfχB(0)=0{\displaystyle \chi _{B}(0)=0}, say, then the range{0}R{0,1}{\displaystyle \{0\}\subset R\subset \{0,1\}} ofχB{\displaystyle \chi _{B}} is an inhabited, counted set, by Replacement. However, theR{\displaystyle R} need not be again a decidable set itself, since the claimR={0}{\displaystyle R=\{0\}} is equivalent to the rather strongn.χB(n)=0{\displaystyle \forall n.\chi _{B}(n)=0}. Moreover,R={0}{\displaystyle R=\{0\}} is also equivalent toB={}{\displaystyle B=\{\}} and so one can state undecidable propositions aboutB{\displaystyle B} also when membership inB{\displaystyle B} is decidable.This also plays out like this classically in the sense that statements aboutB{\displaystyle B} may beindependent, but any classical theory then nonetheless claims the joint propositionB={}¬(B={}){\displaystyle B=\{\}\lor \neg (B=\{\})}. Consider the setB{\displaystyle B} of all indices of proofs of an inconsistency of the theory at hand, in which case the universally closed statementB={}{\displaystyle B=\{\}} is a consistency claim. In terms of arithmetic principles, assuming decidability of this would beΠ10{\displaystyle \Pi _{1}^{0}}-PEM{\displaystyle {\mathrm {PEM} }} or arithmetic{\displaystyle \forall }-PEM{\displaystyle {\mathrm {PEM} }}. This and thestronger relatedLPO{\displaystyle {\mathrm {LPO} }}, or arithmetic{\displaystyle \exists }-PEM{\displaystyle {\mathrm {PEM} }}, is discussed below.

Witness of apartness

[edit]

Theidentity of indiscernibles, which in the first-order context is a higher order principle, holds that the equalityx=y{\displaystyle x=y} of two termsx{\displaystyle x} andy{\displaystyle y} necessitates thatall predicatesP{\displaystyle P} agree on them. And soif there exists a predicateP{\displaystyle P} that distinguishes two termsx{\displaystyle x} andy{\displaystyle y} in the sense thatP(x)¬P(y){\displaystyle P(x)\land \neg P(y)}, then the principle implies that the two terms do not coincide. A form of this may be expressed set theoretically:x,yA{\displaystyle x,y\in A} may be deemed apart if there exists a subsetBA{\displaystyle B\subset A} such that one is a member and the other is not. Restricted to detachable subsets, this may also be formulated concisely using characteristic functionsχB{0,1}A{\displaystyle \chi _{B}\in \{0,1\}^{A}}. Indeed, the latter does not actually depend on the codomain being a binary set: Equality is rejected, i.e.xy{\displaystyle x\neq y} is proven, as soon it is established thatnot all functionsf{\displaystyle f} onA{\displaystyle A} validatef(x)=f(y){\displaystyle f(x)=f(y)}, a logically negative condition.

One may on any setA{\displaystyle A} define the logically positive apartness relation

x#Ay:=(fNA).f(x)f(y){\displaystyle x\,\#_{A}\,y\,:=\,\exists (f\in {\mathbb {N} }^{A}).f(x)\neq f(y)}

As the naturals are discrete, for these functions the negative condition is equivalent to the (weaker) double-negation of this relation. Again in words, equality ofx{\displaystyle x} andy{\displaystyle y} implies that no coloringfNA{\displaystyle f\in {\mathbb {N} }^{A}} can distinguish them - and so to rule out the former, i.e. to provexy{\displaystyle x\neq y}, one must merely rule out the latter, i.e. merely prove¬¬(x#Ay){\displaystyle \neg \neg (x\,\#_{A}\,y)}.

Computable sets

[edit]

Going back to more generality, given a general predicateQ{\displaystyle Q} on the numbers (say one defined fromKleene's T predicate), let again

B:={nωQ(n)}.{\displaystyle B:=\{n\in \omega \mid Q(n)\}.}

Given any naturalnω{\displaystyle n\in \omega }, then

(Q(n)¬Q(n))(nBnB).{\displaystyle {\big (}Q(n)\lor \neg Q(n){\big )}\leftrightarrow {\big (}n\in B\lor n\notin B{\big )}.}

In classical set theory,(nω).Q(n)¬Q(n){\displaystyle \forall (n\in \omega ).Q(n)\lor \neg Q(n)} byPEM{\displaystyle {\mathrm {PEM} }} and so excluded middle also holds for subclass membership. If the classB{\displaystyle B} has no numerical bound, then successively going through the natural numbersn{\displaystyle n}, and thus "listing" all numbers inB{\displaystyle B} by simply skipping those withnB{\displaystyle n\notin B}, classically always constitutes an increasing surjective sequenceb:ωB{\displaystyle b\colon \omega \twoheadrightarrow B}. There, one can obtain a bijectivefunction. In this way, the class of functions in typical classical set theories is provenly rich, as it also contains objects that are beyond what we know to beeffectively computable, or programmatically listable in praxis.

Incomputability theory, thecomputable sets are ranges of non-decreasing total functionsin the recursive sense, at the levelΣ10Π10=Δ10{\displaystyle \Sigma _{1}^{0}\cap \Pi _{1}^{0}=\Delta _{1}^{0}} of thearithmetical hierarchy, and not higher. Deciding a predicate at that level amounts to solving the task of eventually finding acertificate that either validates or rejects membership.As not every predicateQ{\displaystyle Q} is computably decidable, also the theoryCZF{\displaystyle {\mathsf {CZF}}} alone will not claim (prove) that all unboundedBω{\displaystyle B\subset \omega } are the range of some bijective function with domainω{\displaystyle \omega }. See also Kripke's schema.Note that bounded Separation nonetheless proves the more complicated arithmetical predicates to still constitute sets, the next level being thecomputably enumerable ones atΣ10{\displaystyle \Sigma _{1}^{0}}.

There is a large corpus of computability theory notions regarding how general subsets of naturals relate to one another. For example, one way to establish a bijection of two such sets is by relating them through acomputable isomorphism, which is a computable permutation of all the naturals. The latter may in turn be established by a pair ofparticular injections in opposing directions.

Boundedness criteria

[edit]

Any subsetBω{\displaystyle B\subset \omega } injects intoω{\displaystyle \omega }. IfB{\displaystyle B} is decidable and inhabited withy0B{\displaystyle y_{0}\in B}, the sequence

q:={x,yω×B(xBy=x)(xBy=y0)}{\displaystyle q:={\big \{}\langle x,y\rangle \in \omega \times B\mid (x\in B\land y=x)\lor (x\notin B\land y=y_{0}){\big \}}}

i.e.

q(x):={xxBy0xB{\displaystyle q(x):={\begin{cases}x&x\in B\\y_{0}&x\notin B\\\end{cases}}}

is surjective ontoB{\displaystyle B}, making it a counted set. That function also has the property(xB).q(x)=x{\displaystyle \forall (x\in B).q(x)=x}.

Now consider a countable setRω{\displaystyle R\subset \omega } that isbounded in the sense defined previously. Any sequence taking values inR{\displaystyle R} is then numerically capped as well, and in particular eventually does not exceed the identity function on its input indices. Formally,

(r:ωR).(mω).(kω).k>mr(k)<k{\displaystyle \forall (r\colon \omega \to R).\exists (m\in \omega ).\forall (k\in \omega ).k>m\to r(k)<k}

A setI{\displaystyle I} such that this loose bounding statement holds for all sequences taking values inI{\displaystyle I} (or an equivalent formulation of this property) is calledpseudo-bounded. The intention of this property would be to still capture thatIω{\displaystyle I\subset \omega } is eventually exhausted, albeit now this is expressed in terms of the function spaceIω{\displaystyle I^{\omega }} (which is bigger thanI{\displaystyle I} in the sense thatI{\displaystyle I} always injects intoIω{\displaystyle I^{\omega }}). Therelated notion familiar from topological vector space theory is formulated in terms of ratios going to zero for all sequences (r(k)k{\displaystyle {\tfrac {r(k)}{k}}} in the above notation). For a decidable, inhabited set, validity of pseudo-boundedness, together with the counting sequence defined above, grants a bound for all the elements ofI{\displaystyle I}.

The principle that any inhabited, pseudo-bounded subset ofω{\displaystyle \omega } that is just countable (but not necessarily decidable) is always also bounded is calledBD{\displaystyle \mathrm {BD} }-N{\displaystyle {\mathbb {N} }}. The principle also holds generally in many constructive frameworks, such as the Markovian base theoryHA+ECT0+MP{\displaystyle {\mathsf {HA}}+{\mathrm {ECT} }_{0}+{\mathrm {MP} }}, which is a theory postulating exclusively lawlike sequences with nice number search termination properties. However,BD{\displaystyle \mathrm {BD} }-N{\displaystyle {\mathbb {N} }} is independent ofIZF{\displaystyle {\mathsf {IZF}}}.

Choice functions

[edit]

Not even classicalZF{\displaystyle {\mathsf {ZF}}} proves each union of a countable set of two-element sets to be countable again. Indeed,models ofZF{\displaystyle {\mathsf {ZF}}} have beendefined that negate the countability of such a countable union of pairs. Assumingcountable choice rules out that model as an interpretation of the resulting theory. This principle is still independent ofZF{\displaystyle {\mathsf {ZF}}} - A naive proof strategy for that statement fails at the accounting of infinitely manyexistential instantiations.

A choice principle postulates that certain selections can always be made in a joint fashion in the sense that they are also manifested as a single set function in the theory. As with any independent axiom, this raises the proving capabilities while restricting the scope of possible (model-theoretic) interpretations of the (syntactic) theory. A function existence claim can often be translated to the existence of inverses, orderings, and so on. Choice moreover implies statements about cardinalities of different sets, e.g. they imply or rule out countability of sets. Adding full choice toZF{\displaystyle {\mathsf {ZF}}} doesnot prove any newΠ41{\displaystyle \Pi _{4}^{1}}-theorems, but it is strictly non-constructive, as shown below. The development here proceeds in a fashion agnostic to any of the variants described next.[20]

Diaconescu's theorem

[edit]

To highlight the strength of full Choice and its relation to matters ofintentionality, one should consider the classes

a={u{0,1}(u=0)P}{\displaystyle a=\{u\in \{0,1\}\mid (u=0)\lor P\}}
b={u{0,1}(u=1)P}{\displaystyle b=\{u\in \{0,1\}\mid (u=1)\lor P\}}

from the proof ofDiaconescu's theorem. They are as contingent as the propositionP{\displaystyle P} involved in their definition and they are not proven finite. Nonetheless, the setup entails several consequences. Referring back to the introductory elaboration on the meaning of such convenient class notation, as well as to theprinciple of distributivity,ta(t=0(t=1P)){\displaystyle t\in a\leftrightarrow {\big (}t=0\lor (t=1\land P){\big )}}. So unconditionally,0a{\displaystyle 0\in a} as well as1b{\displaystyle 1\in b}, and in particular they are inhabited. As¬(0=1){\displaystyle \neg (0=1)} in any model of Heyting arithmetic, using thedisjunctive syllogism both0b{\displaystyle 0\in b} and1a{\displaystyle 1\in a} each implyP{\displaystyle P}. The two statements are indeed equivalent to the proposition, as clearlyP(a={0,1}b={0,1}){\displaystyle P\to (a=\{0,1\}\land b=\{0,1\})}. The latter also says that validity ofP{\displaystyle P} meansa{\displaystyle a} andb{\displaystyle b} share all members, and there are two of these. Asa{\displaystyle a} areb{\displaystyle b} are then sets, alsoP(a=b{a,b}={a}){\displaystyle P\to (a=b\land \{a,b\}=\{a\})} by extensionality. Conversely, assuming they are equal meansxaxb{\displaystyle x\in a\leftrightarrow x\in b} for anyx{\displaystyle x}, validating all membership statements. So both the membership statements as well as the equalities are found to be equivalent toP{\displaystyle P}. Using thecontrapositive results in the weaker equivalence of disjuncts(P¬P)(a=b¬(a=b)){\displaystyle (P\lor \neg P)\leftrightarrow (a=b\lor \neg (a=b))}. Of course, explicitly¬P(a={0}b={1}){\displaystyle \neg P\to (a=\{0\}\land b=\{1\})} and so one actually finds in which way the sets can end up being different. As functions preserves equality by definition,¬(g(a)=g(b))¬P{\displaystyle \neg {\big (}g(a)=g(b){\big )}\to \neg P} indeed holds for anyg{\displaystyle g} with domain{a,b}{\displaystyle \{a,b\}}.

In the following assume a context in whicha,b{\displaystyle a,b} are indeed established to be sets, and thus subfinite sets. The general axiom of choice claims existence of a functionf:{a,b}ab{\displaystyle f\colon \{a,b\}\to a\cup b} withf(z)z{\displaystyle f(z)\in z}. It is important that the elementsa,b{\displaystyle a,b} of the function's domain are different than the natural numbers0,1{\displaystyle 0,1} in the sense that a priori less is known about the former. When forming then union of the two classes,u=0u=1{\displaystyle u=0\lor u=1} is a necessary but then also sufficient condition. Thusab={0,1}{\displaystyle a\cup b=\{0,1\}} and one is dealing with functionsf{\displaystyle f} into a set of twodistinguishable values. With choice come the conjunctionf(a)af(b)b{\displaystyle f(a)\in a\land f(b)\in b} in the codomain of the function, but thepossible function return values are known to be just0{\displaystyle 0} or1{\displaystyle 1}. Using the distributivity, there arises a list of conditions, another disjunction. Expanding what is then established, one finds that either bothP{\displaystyle P} as well as the sets equality holds, or that the return values are different andP{\displaystyle P} can be rejected. The conclusion is that the choice postulate actually impliesP¬P{\displaystyle P\lor \neg P} whenever a Separation axiom allows for set comprehension using undecidable propositionP{\displaystyle P}.

Analysis of Diaconescu's theorem

[edit]

So full choice is non-constructive in set theory as defined here. The issue is that when propositions are part of set comprehension (like whenP{\displaystyle P} is used to separate, and thereby define, the classesa{\displaystyle a} andb{\displaystyle b} from{0,1}{\displaystyle \{0,1\}}), the notion of their truth values are ramified into set terms of the theory. Equality defined by the set theoreticalaxiom of extensionality, which itself is not related to functions, in turn couples knowledge about the proposition to information about function values. To recapitulate the final step in terms function values: On the one hand, witnessingf(a)=1{\displaystyle f(a)=1} impliesP{\displaystyle P} anda=b{\displaystyle a=b} and this conclusion independently also applies to witnessingf(b)=0{\displaystyle f(b)=0}. On the other hand, witnessingf(a)=0f(b)=1{\displaystyle f(a)=0\land f(b)=1} implies the two function arguments are not equal and this rules outP{\displaystyle P}. There are really only three combinations, as the axiom of extensionality in the given setup makesf(a)=1f(b)=0{\displaystyle f(a)=1\land f(b)=0} inconsistent. So if the constructive reading of existence is to be preserved, full choice may be not adopted in the set theory, because the mere claim of function existence does not realize a particular function.

To better understand why one cannot expect to be granted a definitive (total) choice function with domain{a,b}{\displaystyle \{a,b\}}, consider naive function candidates. Firstly, an analysis of the domain is in order. The surjection{0,a,1,b}{\displaystyle \{\langle 0,a\rangle ,\langle 1,b\rangle \}} witnesses that{a,b}{\displaystyle \{a,b\}} is finitely indexed. It was noted that its members are subfinite and also inhabited, since regardless ofP{\displaystyle P} it is the case that0a{\displaystyle 0\in a} and1b{\displaystyle 1\in b}. So naively, this would seem to makef={a,0,b,1}{\displaystyle f=\{\langle a,0\rangle ,\langle b,1\rangle \}} a contender for a choice function. WhenP{\displaystyle P} can be rejected, then this is indeed the only option. But in the case of provability ofP{\displaystyle P}, when{a,b}={a}{\displaystyle \{a,b\}=\{a\}}, there is extensionally only one possible function input to a choice function. So in that situation, a choice function would explicitly have typef:{a}{0,1}{\displaystyle f\colon \{a\}\to \{0,1\}}, for examplef={a,0}{\displaystyle f=\{\langle a,0\rangle \}} and this would rule out the initial contender. For generalP{\displaystyle P}, the domain of a would-be choice function is not concrete but contingent onP{\displaystyle P} and not proven finite. When considering the above functional assignmentf(a)=0{\displaystyle f(a)=0}, then neither unconditionally declaringf(b)=1{\displaystyle f(b)=1} norf(b)=0{\displaystyle f(b)=0} is necessarily consistent. Having identified1{\displaystyle 1} with{0}{\displaystyle \{0\}}, the two candidates described above can be represented simultaneously viaf={a,0,b,B¬}{\displaystyle f=\{\langle a,0\rangle ,\langle b,B_{\neg }\rangle \}} (which is not proven finite either) with the subfinite "truth value of¬P{\displaystyle \neg P}" given asB¬:={u{0}¬P}{\displaystyle B_{\neg }:=\{u\in \{0\}\mid \neg P\}}. As(P(B¬=0))(¬P(B¬=1)){\displaystyle (P\to (B_{\neg }=0))\land (\neg P\to (B_{\neg }=1))}, postulatingP{\displaystyle P}, or¬P{\displaystyle \neg P}, or the classical principleP¬P{\displaystyle P\lor \neg P} here would indeed imply thatB¬{\displaystyle B_{\neg }} is a natural, so that the latter setf{\displaystyle f} constitutes a choice function into{0,1}{\displaystyle \{0,1\}}. And as in the constructive case, given aparticular choice function - a set holding either exactly one or exactly two pairs - one could actually infer whetherP{\displaystyle P} or whether¬P{\displaystyle \neg P} does hold. Vice versa, the third and last candidatef={b,1}{\displaystyle f=\{\langle b,1\rangle \}} can be captured as part off={a,B,b,1}{\displaystyle f=\{\langle a,B\rangle ,\langle b,1\rangle \}}, whereB:={u{0}P}{\displaystyle B:=\{u\in \{0\}\mid P\}}. Such aB{\displaystyle B} had already been considered in the early section on the axiom of separation. Again, the latterf{\displaystyle f} here is a classical choice function either way also, whereP{\displaystyle P} functions as a (potentially undecidable) "if-clause". Constructively, the domain and values of suchP{\displaystyle P}-dependent would-be functions are not understood enough to prove them to be a total functional relation into{0,1}{\displaystyle \{0,1\}}.

For computable semantics, set theory axioms postulating (total) function existence lead to the requirement for halting recursive functions. From their function graph in individual interpretations, one can infer the branches taken by the "if-clauses" that were undecided in the interpreted theory. But on the level of the synthetic frameworks, when they broadly become classical from adopting full choice, these extensional set theories theories contradict the constructive Church's rule.

Regularity implies PEM

[edit]

The axiom of choice grants existence a function associated with every set-sized collection of inhabited elementss{\displaystyle s}, with which one can then at once pick unique elementst{\displaystyle t}.Theaxiom of regularity states that for every inhabited sets{\displaystyle s} in the universal collection, thereexists an elementt{\displaystyle t} ins{\displaystyle s}, which shares no elements withs{\displaystyle s}. This formulation does not involve functions or unique existence claims, but instead directly guarantees setsts{\displaystyle t\in s} with a specific property.As the axiom correlates membership claims at different rank, the axiom also ends up implyingPEM{\displaystyle {\mathrm {PEM} }}:

The proof from Choice above had used1:={0}{\displaystyle 1:=\{0\}} and a particular set{a,b}{\displaystyle \{a,b\}}. The proof in this paragraph also assumes Separation applies toP{\displaystyle P} and usesb{\displaystyle b}, for which{0}b{\displaystyle \{0\}\in b} by definition. It was already explained thatP0b{\displaystyle P\leftrightarrow 0\in b} and so one may prove excluded middle forP{\displaystyle P} in the form0b0b{\displaystyle 0\in b\lor 0\notin b}. Now lettb{\displaystyle t\in b} be the postulated member with the empty intersection property. The setb{\displaystyle b} was defined as a subset of{0,1}{\displaystyle \{0,1\}} and so any giventb{\displaystyle t\in b} fulfills the disjunctiont=0t=1{\displaystyle t=0\lor t=1}. The left clauset=0{\displaystyle t=0} implies0b{\displaystyle 0\in b}, while for the right clauset=1{\displaystyle t=1} one may use that the special non-intersecting elementt{\displaystyle t} fulfills(t={0})(0b){\displaystyle (t=\{0\})\leftrightarrow (0\notin b)}.

Demanding that the set of naturals iswell-ordered with respect to it standard order relation imposes the same condition on the inhabited setbω{\displaystyle b\subset \omega }. So theleast number principle has the same non-constructive implication.As with the proof from Choice, the scope of propositions for which these results hold is governed by one's Separation axiom.

Arithmetic

[edit]

The fourPeano axioms for0{\displaystyle 0} andS{\displaystyle S}, characterizing the setω{\displaystyle \omega } as a model of the natural numbers in the constructive set theoryECST{\displaystyle {\mathsf {ECST}}}, have been discussed. The order "<{\displaystyle <}" of natural numbers is captured by membership "{\displaystyle \in }" in this von Neumann model and this set is discrete, i.e. alsoϕ(n,m):=(n=m){\displaystyle \phi (n,m):=(n=m)} is decidable.

As discussed, induction for arithmetic formulas is a theorem. However, when not assuming full mathematical induction (or stronger axioms like full Separation) in a set theory, there is a pitfall regarding the existence of arithmetic operations. The first-order theory ofHeyting arithmeticHA{\displaystyle {\mathsf {HA}}} has the samesignature and non-logical axioms as Peano arithmeticPA{\displaystyle {\mathsf {PA}}}. In contrast, the signature of set theory does not contain addition "+{\displaystyle +}" or multiplication "×{\displaystyle \times }".ECST{\displaystyle {\mathsf {ECST}}} does actually not enableprimitive recursion inω{\displaystyle \omega } for function definitions of what would beh:(x×ω)y{\displaystyle h\colon (x\times \omega )\to y} (where "×{\displaystyle \times }" here denotes theCartesian product of set, not to be confused with multiplication above). Indeed, despite having the Replacement axiom, the theory does not prove there to be a set capturing the addition function+:(ω×ω)ω{\displaystyle +\colon (\omega \times \omega )\to \omega }. In the next section, it is clarified which set theoretical axiom may be asserted to prove existence of the latter as a function set, together with their desired relation to zero and successor.

Far beyond just the equality predicate, the obtained model of arithmetic then validates

HAn.m.(ϕ(n,m)¬ϕ(n,m)){\displaystyle {\mathsf {HA}}\vdash \forall n.\forall m.{\big (}\phi (n,m)\lor \neg \phi (n,m){\big )}}

for any quantifier-free formula. Indeed,PA{\displaystyle {\mathsf {PA}}} isΠ20{\displaystyle \Pi _{2}^{0}}-conservative overHA{\displaystyle {\mathsf {HA}}} and double-negation elimination is possible for anyHarrop formula.

Arithmetic functions from recursion

[edit]

Going a step beyondECST{\displaystyle {\mathsf {ECST}}}, theaxiom granting definition of set functions via iteration-step set functions must be added: For any sety{\displaystyle y}, setzy{\displaystyle z\in y} andf:yy{\displaystyle f\colon y\to y}, there must also exist a functiong:ωy{\displaystyle g\colon \omega \to y} attained by making use of the former, namely such thatg(0)=z{\displaystyle g(0)=z} andg(Sn)=f(g(n)){\displaystyle g(Sn)=f(g(n))}. This iteration- orrecursion principle is akin to thetransfinite recursion theorem, except it is restricted to set functions and finite ordinal arguments, i.e. there is no clause aboutlimit ordinals. It functions as the set theoretical equivalent of anatural numbers object incategory theory.This then enables a full interpretation of Heyting arithmeticHA{\displaystyle {\mathsf {HA}}} in our set theory, including addition and multiplication functions.

With this,N{\displaystyle {\mathbb {N} }} andZ{\displaystyle {\mathbb {Z} }} are well-founded, in the sense of theinductive subsets formulation. Further, arithmetic of rational numbersQ{\displaystyle {\mathbb {Q} }} can then also be defined and its properties, like uniqueness and countability, be proven.

Recursion from set theory axioms

[edit]

Recall thathyx{\displaystyle h\simeq y^{x}} is short forf.(fhfyx){\displaystyle \forall f.{\big (}f\in h\leftrightarrow f\in y^{x}{\big )}}, wherefyx{\displaystyle f\in y^{x}} is short for the total function predicate, a proposition in terms of uses bounded quantifiers. If both sides are sets, then by extensionality this is also equivalent toh=yx{\displaystyle h=y^{x}}. (Although by slight abuse of formal notation, as with the symbol "{\displaystyle \in }", the symbol "={\displaystyle =}" is also commonly used with classes anyhow.)

A set theory with theHA{\displaystyle {\mathsf {HA}}}-model enabling recursion principle, spelled out above, will also prove that, for all naturalsn{\displaystyle n} andm{\displaystyle m}, the function spaces

{0,1,,n1}{0,1,,m1}{\displaystyle {\{0,1,\dots ,n-1\}}\to {\{0,1,\dots ,m-1\}}}

are sets. Indeed, bounded recursion suffices, i.e. the principle forΔ0{\displaystyle \Delta _{0}}-defined classes.

Conversely, the recursion principle can be proven from a definition involving the union of recursive functions on finite domains. Relevant for this is the class of partial functions onω{\displaystyle \omega } such that all of its members have a return values only up to some natural number bound, which may be expressed bynωy{0,1,,n1}{\displaystyle \cup _{n\in \omega }y^{\{0,1,\dots ,n-1\}}}. Existence of this as a set becomes provable assuming that the individual function spacesyn{\displaystyle y^{n}} all form sets themselves. To this end

Exponentiation for finite domains

(nω).y.  h.hyn{\displaystyle \forall (n\in \omega ).\forall y.\ \ \exists h.h\simeq y^{n}}

With this axiom, any such space is now aset of subsets ofn×y{\displaystyle n\times y} and this is strictly weaker than full Separation. Notably, adoption of this principle has genuine set theoretical flavor, in contrast to a direct embedding of arithmetic principles into our theory. And it is a modest principle insofar as these function spaces are tame: When instead assuming full induction or full exponentiation, takingy{\displaystyle y} to function spacesyn{\displaystyle y^{n}}, or to n-fold Cartesian products, provably does preserve countability.

InECTS{\displaystyle {\mathsf {ECTS}}} plus finite exponentiation, the recursion principle is a theorem. Moreover, enumerable forms of thepigeon hole principle can now also be proven, e.g. that on a finitely indexed set, every auto-injection is also a surjection. As a consequence, thecardinality of finite sets, i.e. the finite von Neumann ordinal, is provably unique. The finitely indexed discrete sets are just the finite sets. In particular, finitely indexed subsets ofω{\displaystyle \omega } are finite. Taking quotients or taking the binary union or Cartesian product of two sets preserve finiteness, sub-finiteness and being finitely indexed.

The set theory axioms listed so far incorporates first-order arithemtic and suffices as formalized framework for a good portion of common mathematics. The restriction to finite domains is lifted in the strictly stronger exponentiation axiom below. However, also that axiom does not entail the full induction schema for formulas with unbound quantifiers over the domain of sets, nor a dependent choice principle. Likewise, there are Collection principles that are constructively not implied by Replacement, as discussed further below. A consequence of this is that for some statements of higher complexity or indirection, even if concrete instances of interest may well be provable, the theory may not prove the universal closure.Stronger than this theory with finite exponentiation isECTS{\displaystyle {\mathsf {ECTS}}} plus full induction. It implies the recursion principle even for classes and such thatg{\displaystyle g} is unique. Already that recursion principle when restricted toΔ0{\displaystyle \Delta _{0}} does prove finite exponentiation, and also the existence of a transitive closure for every set with respect to{\displaystyle \in } (since union formation isΔ0{\displaystyle \Delta _{0}}). With it more common constructions preserve countability. General unions over a finitely indexed set of finitely indexed sets are again finitely indexed, when at least assuming induction forΣ1{\displaystyle \Sigma _{1}}-predicates (with respect to the set theory language, and this then holds regardless of the decidability of their equality relations.)

Induction without infinite sets

[edit]

Before discussing even classically uncountable sets, this last section takes a step back to a context more akin toBCST{\displaystyle {\mathsf {BCST}}}. The addition of numbers, considered as relation on triples, is an infinite collection, just like collection of natural numbers themselves. But note that induction schemas may be adopted (for sets, ordinals or in conjunction with a natural number sort), without ever postulating that the collection of naturals exists as a set. As noted,Heyting arithmeticHA{\displaystyle {\mathsf {HA}}} isbi-interpretable with such a constructive set theory, in which all sets are postulated to be in bijection with an ordinal. TheBIT predicate is a common means to encode sets in arithmetic.

This paragraph lists a few weak natural number induction principles studied in theproof theory of arithmetic theories with addition and multiplication in their signature. This is the framework where these principles are most well understood. The theories may be defined viabounded formulations orvariations on induction schemas that may furthermore only allow for predicates of restricted complexity. On the classical first-order side, this leads to theories between theRobinson arithmeticQ{\displaystyle {\mathsf {Q}}} andPeano arithmeticPA{\displaystyle {\mathsf {PA}}}: The theoryQ{\displaystyle {\mathsf {Q}}} does not have any induction.PA{\displaystyle {\mathsf {PA}}} has full mathematical induction for arithmetical formulas and hasordinalε0{\displaystyle \varepsilon _{0}}, meaning the theory lets one encode ordinals of weaker theories as recursive relation on just the naturals. Theories may also include additional symbols for particular functions.Many of the well studied arithmetic theories are weak regarding proof of totality for some morefast growing functions. Some of the most basic examples of arithmetics includeelementary function arithmeticEFA{\displaystyle {\mathsf {EFA}}}, which includes induction for just bounded arithmetical formulas, here meaning with quantifiers over finite number ranges. The theory has aproof theoretic ordinal (the least not provenly recursivewell-ordering) ofω3{\displaystyle \omega ^{3}}.TheΣ10{\displaystyle \Sigma _{1}^{0}}-induction schema for arithmetical existential formulas allows for induction for those properties of naturals a validation of which is computable via a finite search with unbound (any, but finite) runtime. The schema is also classically equivalent to theΠ10{\displaystyle \Pi _{1}^{0}}-induction schema.The relatively weak classical first-order arithmetic which adopts that schema is denotedIΣ1{\displaystyle {\mathsf {I\Sigma }}_{1}} and proves the primitive recursive functions total. The theoryIΣ1{\displaystyle {\mathsf {I\Sigma }}_{1}} isΠ20{\displaystyle \Pi _{2}^{0}}-conservative overprimitive recursive arithmeticPRA{\displaystyle {\mathsf {PRA}}}.Note that theΣ10{\displaystyle \Sigma _{1}^{0}}-induction is also part of thesecond-order reverse mathematicsbase systemRCA0{\displaystyle {\mathsf {RCA}}_{0}}, its other axioms beingQ{\displaystyle {\mathsf {Q}}} plusΔ10{\displaystyle \Delta _{1}^{0}}-comprehension of subsets of naturals. The theoryRCA0{\displaystyle {\mathsf {RCA}}_{0}} isΠ11{\displaystyle \Pi _{1}^{1}}-conservative overIΣ1{\displaystyle {\mathsf {I\Sigma }}_{1}}. Those last mentioned arithmetic theories all have ordinalωω{\displaystyle \omega ^{\omega }}.

Let us mention one more step beyond theΣ10{\displaystyle \Sigma _{1}^{0}}-induction schema. Lack of stronger induction schemas means, for example, that some unbounded versions of thepigeon hole principle are unprovable. One relatively weak one being theRamsey theorem type claim here expressed as follows:For anym>0{\displaystyle m>0} and coding of a coloring mapf{\displaystyle f}, associating eachnω{\displaystyle n\in \omega } with a color{0,1,,m1}{\displaystyle \{0,1,\dots ,m-1\}}, it is not the case that for every colorc<m{\displaystyle c<m} there exists a threshold input numbernc{\displaystyle n_{c}} beyond whichc{\displaystyle c} is not ever the mappings return value anymore. (In the classical context and in terms of sets, this claim about coloring may be phrased positively, as saying that there always exists at least one return valuek{\displaystyle k} such that, in effect, for some unbounded domainKω{\displaystyle K\subset \omega } it holds that(nK).f(n)=k{\displaystyle \forall (n\in K).f(n)=k}. In words, whenf{\displaystyle f} provides infinite enumerated assignments, each being of one ofm{\displaystyle m} different possible colors, it is claimed that a particulark{\displaystyle k} coloring infinitely many numbers always exists and that the set can thus be specified, without even having to inspect properties off{\displaystyle f}. When read constructively, one would wantk{\displaystyle k} to be concretely specifiable and so that formulation is a stronger claim.) Higher indirection, than in induction for mere existential statements, is needed to formally reformulate such a negation (the Ramsey theorem type claim in the original formulation above) and prove it. Namely to restate the problem in terms of the negation of the existence of one joint threshold number, depending on all the hypotheticalnc{\displaystyle n_{c}}'s, beyond which the function would still have to attain some color value. More specifically, the strength of the requiredbounding principle isstrictly between the induction schema inIΣ10{\displaystyle {\mathsf {I\Sigma }}_{1}^{0}} andIΣ20{\displaystyle {\mathsf {I\Sigma }}_{2}^{0}}.For properties in terms of return values of functions on finite domains, brute force verification through checking all possible inputs has computational overhead which is larger for larger domains, but always finite. Acceptance of an induction schema as inIΣ20{\displaystyle {\mathsf {I\Sigma }}_{2}^{0}} validates the former so called infinite pigeon hole principle, which concerns unbounded domains, and so is about mappings with infinitely many inputs.

It is worth noting that in the program ofpredicative arithmetic, even the mathematical induction schema has been criticized as possibly beingimpredicative, when natural numbers are defined as the object which fulfill this schema, which itself is defined in terms of all naturals.

Exponentiation

[edit]

ClassicalZFC{\displaystyle {\mathsf {ZFC}}} without thePowerset axiom has natural models in classes of sets ofhereditary size less than certain uncountable cardinals.[21] In particular, it is still consistent with all existing sets (including sets holding reals) beingsubcountable, and there even countable. Such a theory essentially amounts tosecond-order arithmetic. All sets being subcountable can constructively be consistent even in the present of uncountable sets, as introduced now.

Possible choice principles were discussed, a weakened form of the Separation schema was already adopted, and more of the standardZFC{\displaystyle {\mathsf {ZFC}}} axioms shall be weakened for a more predicative and constructive theory. The first one of those is the Powerset axiom, which is adopted in the form of the space of characteristic functions. The following axiomExp{\displaystyle {\mathrm {Exp} }} is strictly stronger than its pendant for finite domains discussed above:

Exponentiation

x.y.  h.hyx{\displaystyle \forall x.\forall y.\ \ \exists h.h\simeq y^{x}}

The formulation here again uses the convenient notation for function spaces, as discussed above. In words, the axiom says that given two setsx,y{\displaystyle x,y}, the classyx{\displaystyle y^{x}} of all functions is, in fact, also a set.This is certainly required, for example, to formalize the object map of an internalhom-functor likehom(N,).{\displaystyle {\mathrm {hom} }({\mathbb {N} },-).}

Adopting such an existence statement also the quantificationf{\displaystyle \forall f} over the elements of certain classes of (total) functions now only range over sets. Consider the collection of pairsa,bx×x{\displaystyle \langle a,b\rangle \in x\times x} validating the apartness relation(fNx).f(a)f(b){\displaystyle \exists (f\in {\mathbb {N} }^{x}).f(a)\neq f(b)}. Via bounded Separation, this now constitutes a subset ofx×x{\displaystyle x\times x}. This examples shows that the Exponentiation axiom not only enriches the domain of sets directly, but via separation also enables the derivation of yet more sets, and this then furthermore also strengthens other axioms.

Notably, these bounded quantifiers now range over function spaces that are provablyuncountable, and hence even classically uncountable. E.g. the collection of all functionsf:ω2{\displaystyle f\colon \omega \to 2} where2:=SS0={0,1}{\displaystyle 2:=SS0=\{0,1\}}, i.e. the set2N{\displaystyle 2^{\mathbb {N} }} of points underlying theCantor space, is uncountable, byCantor's diagonal argument, and can at best be taken to be a subcountable set. In this theory one may now also quantify over subspaces of spaces like2N{\displaystyle 2^{\mathbb {N} }}, which is a third order notion on the naturals. (In this section and beyond, the symbol for the semiring of natural numbers in expressions likeyN{\displaystyle y^{\mathbb {N} }} is used, or writtenωy{\displaystyle \omega \to y}, just to avoid conflation of cardinal- with ordinal exponentiation.)Roughly, classically uncountable sets, like for example these function spaces, tend to not have computably decidable equality.

By taking the general union over anx{\displaystyle x}-indexed family{yi}i{\displaystyle \{y_{i}\}_{i}}, also the dependent or indexed product, writtenΠixyi{\displaystyle \Pi _{i\in x}\,y_{i}}, is now a set. For constantyi{\displaystyle y_{i}}, this again reduces to the function spaceyx{\displaystyle y^{x}}.And taking the general union over function spaces themselves, whenever the powerclass ofx{\displaystyle x} is a set, then also the supersetsxys{\displaystyle \cup _{s\subset x}y^{s}} ofyx{\displaystyle y^{x}} is now a set - giving a means to talk about the space of partial functions onx{\displaystyle x}.

Unions and countability

[edit]

With Exponentiation, the theory proves the existence of anyprimitive recursive function inx×ωy{\displaystyle x\times \omega \to y}, and in particular in the uncountable function spaces out ofω{\displaystyle \omega }. Indeed, with function spaces and the finite von Neumann ordinals as domains, we can modelHA{\displaystyle {\mathsf {HA}}} as discussed, and thus encode ordinals in the arithmetic. One then furthermore obtains theordinal-exponentiated numberωω{\displaystyle \omega ^{\omega }} as a set, which may be characterized asnωωn{\displaystyle \cup _{n\in \omega }\omega ^{n}}, the counted set of words over an infinitealphabet. The union of all finite sequences over a countable set is now a countable set. Further, for any countable family of counting functions together with their ranges, the theory proves the union of those ranges to be countable. In contrast, not assuming countable choice, evenZF{\displaystyle {\mathsf {ZF}}} is consistent with the uncountable setR{\displaystyle {\mathbb {R} }} being the union of a countable set of countable sets.

The list here is by no means complete. Many theorems about the various function existence predicates hold, especially when assuming countable choice - which as noted is never implicitly assumed in this discussion.

At last, with Exponentiation, any finitely indexed union of a family of subfinitely indexed resp. subcountable sets is itself subfinitely indexed resp. subcountable as well.The theory also proves the collection of all thecountable subsets of any setx{\displaystyle x} to be a set itself. Concerning this subset of the powerclassPx{\displaystyle {\mathcal {P}}_{x}}, some natural cardinality questions can also classically only be settled with Choice, at least for uncountablex{\displaystyle x}.

The class of all subsets of a set

[edit]

Given a sequence of sets, one may define new such sequences, e.g. ina,b,a,b,a,b{\displaystyle \langle a,b\rangle \mapsto \langle \langle \rangle ,\langle a\rangle ,\langle b\rangle ,\langle a,b\rangle \rangle }. But notably, in a mathematical set theory framework, the collection of all subsets of a set is defined not in a bottom-up construction from its constituents but via a comprehension overall sets in the domain of discourse. The standard, standalone characterization of the powerclass of a setx{\displaystyle x} involves unbounded universal quantification, namelyu.(uPxux){\displaystyle \forall u.\left(u\in {\mathcal {P}}_{x}\leftrightarrow u\subset x\right)}, where{\displaystyle \subset } was previously defined also in terms of the membership predicate{\displaystyle \in }. Here, a statement expressed as(uPx).Q(x){\displaystyle \forall (u\in {\mathcal {P}}_{x}).Q(x)} must a priori be taken foru.(uxQ(x)){\displaystyle \forall u.{\big (}u\subset x\to Q(x){\big )}} and is not equivalent to a set-bounded proposition. Indeed, the statementy=Px{\displaystyle y={\mathcal {P}}_{x}} itselfisΠ1{\displaystyle \Pi _{1}}. IfPx{\displaystyle {\mathcal {P}}_{x}} is a set, then the defining quantification even ranges acrossPx{\displaystyle {\mathcal {P}}_{x}}, which makes theaxiom of powersetimpredicative.

Recall that a member of the set of characteristic functions2x{\displaystyle 2^{x}} corresponds to a predicate that is decidable on a setx{\displaystyle x}, which it thus determines a detachable subsetsx{\displaystyle s\subset x}. In turn, the classDxPx{\displaystyle {\mathcal {D}}_{x}\subset {\mathcal {P}}_{x}} of all detachable subsets ofx{\displaystyle x} is now also a set, via Replacement. However,Dx{\displaystyle {\mathcal {D}}_{x}} may fail to provably have desirable properties, e.g. being closed under unending operations such as the unions over countably infinite index sets: For a countable sequenceunDx{\displaystyle u_{n}\in {\mathcal {D}}_{x}}, the subsetU:=kωuk{\displaystyle U:=\cup _{k\in \omega }u_{k}} ofx{\displaystyle x} validating(aU)(mω).aum{\displaystyle (a\in U)\leftrightarrow \exists (m\in \omega ).a\in u_{m}} for allax{\displaystyle a\in x} does exist as a set. But it may fail to be detachable and is therefore then not necessarily provably itself a member ofDx{\displaystyle {\mathcal {D}}_{x}}. Meanwhile, over classical logic, all subsets of a setx{\displaystyle x} are trivially detachable, meaningDx=Px{\displaystyle {\mathcal {D}}_{x}={\mathcal {P}}_{x}} and thenDx{\displaystyle {\mathcal {D}}_{x}} of course holds any subset. Over classical logic, this furthermore means that Exponentiation turns the power class into a set.

Translating results of set theory based mathematical theories like point-set topology or measure theory to a constructive framework is a subtle back and forth. For example, whileDx{\displaystyle {\mathcal {D}}_{x}} is afield of sets, for it to form aσ-algebra per definition also requires the above mentioned closedness under unions. But while a domain of subsets may fail to exhibit such closure property constructively, classically a measureμ{\displaystyle \mu } iscontinuous from below and so its value on an infinite union can in any case also be expressed without reference to that set as function input, namely aslimn{\displaystyle \lim _{n\to \infty }} of the growing sequenceμ(knuk){\displaystyle \mu (\cup _{k\leq n}u_{k})} of the function's values at finite unions.

Apart from the class of detachable sets, also various other subclasses of any powerclass are now provenly sets. For example, the theory also proves this for the collection of all the countable subsets of any set.

The richness of the full powerclass in a theory without excluded middle can best be understood by considering small classically finite sets. For any propositionP{\displaystyle P}, consider the subclassB:={x1P}{\displaystyle B:=\{x\in 1\mid P\}} of1{\displaystyle 1} (i.e.{0}{\displaystyle \{0\}} orS0{\displaystyle S0}). It equalsB=0{\displaystyle B=0} whenP{\displaystyle P} can be rejected and it equalsB=1{\displaystyle B=1} (i.e.B=S0{\displaystyle B=S0}), whenP{\displaystyle P} can be proven. ButP{\displaystyle P} may also not be decidable at all. Consider three different undecidable proposition, none of which provenly imply another. They can be used to define three subclasses of the singleton1{\displaystyle 1}, none of which are provenly the same. In this view, the powerclassP1{\displaystyle {\mathcal {P}}_{1}} of the singleton, usually denoted byΩ{\displaystyle \Omega }, is called thetruth valuealgebra and does not necessarily provenly have only two elements.

With Exponentiation, the powerclass of the singleton,P1{\displaystyle {\mathcal {P}}_{1}}, being a set already implies Powerset for sets in general. The proof is via replacement for the association offP1x{\displaystyle f\in {{\mathcal {P}}_{1}}^{x}} to{zx0f(z)}Px{\displaystyle \{z\in x\mid 0\in f(z)\}\in {\mathcal {P}}_{x}}, and an argument why all subsets are covered. The set2x{\displaystyle 2^{x}} injects into the function spaceP1x{\displaystyle {{\mathcal {P}}_{1}}^{x}} also.

If the theory provesB{\displaystyle B} above a set (as for exampleIZF{\displaystyle {\mathsf {IZF}}} unconditionally does), then the subsetb:={0,B}{\displaystyle b:=\{\langle 0,B\rangle \}} of1×P1{\displaystyle 1\times {\mathcal {P}}_{1}} is a functionb:1P1{\displaystyle b\colon 1\to {\mathcal {P}}_{1}} with(b(0)=1)P{\displaystyle {\big (}b(0)=1{\big )}\leftrightarrow P}. To claim thatP1=2{\displaystyle {\mathcal {P}}_{1}=2} is to claim that excluded middle holds forP{\displaystyle P}.

It has been pointed out that the empty set0{\displaystyle 0} and the set1{\displaystyle 1} itself are of course two subsets of1{\displaystyle 1}, meaning2P1{\displaystyle 2\subset {\mathcal {P}}_{1}}. Whether alsoP12{\displaystyle {\mathcal {P}}_{1}\subset 2} is true in a theory is contingent on a simple disjunction:

((xP1).(0x0x))P12{\displaystyle {\big (}\forall (x\in {\mathcal {P}}_{1}).(0\in x\lor 0\notin x){\big )}\to \,{\mathcal {P}}_{1}\subset 2}.

So assumingPEM{\displaystyle {\mathrm {PEM} }} for just bounded formulas, predicative Separation then lets one demonstrate that the powerclassP1{\displaystyle {\mathcal {P}}_{1}} is a set. And so in this context, also full Choice proves Powerset. (In the context ofIZF{\displaystyle {\mathsf {IZF}}}, bounded excluded middle in fact already turns set theory classical, as discussed further below.)

Full Separation is equivalent to just assuming that each individual subclass of1{\displaystyle 1} is a set. Assuming full Separation, both full Choice and Regularity provePEM{\displaystyle {\mathrm {PEM} }}.

AssumingPEM{\displaystyle {\mathrm {PEM} }} in this theory, Set induction becomes equivalent to Regularity and Replacement becomes capable of proving full Separation.

Note that cardinal relations involving uncountable sets are also elusive inZFC{\displaystyle {\mathsf {ZFC}}}, where the characterization of uncountability simplifies to|ω|<|x|{\displaystyle |\omega |<|x|}. For example, regarding the uncountable power2|ω|{\displaystyle 2^{|\omega |}}, it is independent of that classical theory whether all suchx{\displaystyle x} have2|ω||x|{\displaystyle 2^{|\omega |}\leq |x|}, nor does it prove that2|ω|<2|x|{\displaystyle 2^{|\omega |}<2^{|x|}}. Seecontinuum hypothesis and the relatedEaston's theorem.

Category and type theoretic notions

[edit]

So in this context with Exponentiation, first-order arithmetic has a model and all function spaces between sets exist. The latter are more accessible than the classes containing all subsets of a set, as is the case withexponential objects resp.subobjects in category theory.Incategory theoretical terms, the theoryBCST+Exp{\displaystyle {\mathsf {BCST}}+{\mathrm {Exp} }} essentially corresponds to constructivelywell-pointedCartesian closedHeyting pretoposes with (whenever Infinity is adopted) anatural numbers object. Existence of powerset is what would turn a Heyting pretopos into anelementary topos.[22]Every such topos that interpretsZF{\displaystyle {\mathsf {ZF}}} is of course a model of these weaker theories, but locally Cartesian closed pretoposes have been defined that e.g. interpret theories with Exponentiation but reject full Separation and Powerset.A form ofPEM{\displaystyle {\mathrm {PEM} }} corresponds to any subobject having a complement, in which case we call the topos Boolean. Diaconescu's theorem in its original topos form says that this hold iff anycoequalizer of two nonintersectingmonomorphisms has a section. The latter is aformulation of choice.Barr's theorem states that any topos admits a surjection from a Boolean topos onto it, relating to classical statements being provable intuitionistically.

In type theory, the expression "xy{\displaystyle x\to y}" exists on its own and denotesfunction spaces, a primitive notion. These types (or, in set theory, classes or sets) naturally appear, for example, as the type of thecurrying bijection between(z×x)y{\displaystyle (z\times x)\to y} andzyx{\displaystyle z\to y^{x}}, anadjunction.A typical type theory with general programming capability - and certainly those that can modelCZF{\displaystyle {\mathsf {CZF}}}, which is considered a constructive set theory - will have a type of integers and function spaces representingZZ{\displaystyle {\mathbb {Z} }\to {\mathbb {Z} }}, and as such also include types that are not countable. This is just to say, or implies, that among the function termsf:Z(ZZ){\displaystyle f\colon {\mathbb {Z} }\to ({\mathbb {Z} }\to {\mathbb {Z} })}, none have the property of being a surjection.

Constructive set theories are also studied in the context ofapplicative axioms.

Metalogic

[edit]

While the theoryECST+Exp{\displaystyle {\mathsf {ECST}}+{\mathrm {Exp} }} does not exceed the consistency strength of Heyting arithmetic, adding Excluded Middle gives a theory proving the same theorems as classicalZF{\displaystyle {\mathsf {ZF}}} minus Regularity!Thus, adding Regularity as well as eitherPEM{\displaystyle {\mathrm {PEM} }} or full Separation toECST+Exp{\displaystyle {\mathsf {ECST}}+{\mathrm {Exp} }} gives full classicalZF{\displaystyle {\mathsf {ZF}}}.Adding full Choice and full Separation givesZFC{\displaystyle {\mathsf {ZFC}}} minus Regularity.So this would thus lead to a theory beyond the strength of typicaltype theory.

The presented theory does not prove a function space likeNN{\displaystyle {\mathbb {N} }^{\mathbb {N} }} to be not enumerable, in the sense of injections out of it. Without further axioms, intuitionistic mathematics has models in recursive functions but also forms ofhypercomputation.

Analysis

[edit]

In this section the strength ofECST+Exp{\displaystyle {\mathsf {ECST}}+{\mathrm {Exp} }} is elaborated on.For context, possible further principles are mentioned, which are not necessarily classical and also not generally considered constructive.Here a general warning is in order: When reading proposition equivalence claims in the computable context, one shall always be aware whichchoice,induction andcomprehension principles are silently assumed.See also the relatedconstructive analysis,[23]feasible analysis andcomputable analysis.

The theory so far provesuniqueness ofArchimedean,Dedekind complete (pseudo-)ordered fields, with equivalence by a unique isomorphism. The prefix "pseudo" here highlights that the order will, in any case, constructively not always be decidable. This result is relevant assuming complete such models exist as sets.

Topology

[edit]

Regardless of the choice of model, the characteristic flavor of a constructive theory of numbers can be explicated using an independent propositionP{\displaystyle P}. Consider a counter-example to the constructive provability of the well-orderedness of the naturals, but now embedded in the reals. Say

M:={xR(x=0P)(x=1)}{\displaystyle M:=\{x\in {\mathbb {R} }\mid (x=0\land P)\lor (x=1)\}}.

The infimum metric distance between some point and such a subset, what may be expressed asρ(0,M){\displaystyle \rho (0,M)} for example, may constructively fail to provably exist. More generally, this locatedness property of subsets governs the well-developed constructive metric space theory.

Whether Cauchy or Dedekind reals, among others, also fewer statements about the arithmetic of the reals aredecidable, compared to the classical theory.

Cauchy sequences

[edit]

Exponentiation implies recursion principles and so inECST+Exp{\displaystyle {\mathsf {ECST}}+{\mathrm {Exp} }}, one can comfortably reason about sequencess:ωQ{\displaystyle s\colon \omega \to {\mathbb {Q} }}, their regularity properties such as|snsm|1n+1m{\displaystyle |s_{n}-s_{m}|\leq {\tfrac {1}{n}}+{\tfrac {1}{m}}}, or about shrinking intervals inω(Q×Q){\displaystyle \omega \to ({\mathbb {Q} }\times {\mathbb {Q} })}. So this enables speaking ofCauchy sequences and their arithmetic. This is also the approach to analysis taken inZ2{\displaystyle {\mathsf {Z}}_{2}}.

Cauchy reals

[edit]

Any Cauchy real number is a collection of such sequences, i.e. a subset of a set of functions onω{\displaystyle \omega } constructed with respect to anequivalence relation. Exponentiation together with bounded separation prove the collection of Cauchy reals to be a set, thus somewhat simplifying the logically treatment of the reals.

Even in the strong theoryIZF{\displaystyle {\mathsf {IZF}}} with a strengthened form of Collection, the Cauchy reals are poorly behaved when not assuming a form ofcountable choice, andACω,2{\displaystyle {\mathrm {AC} _{\omega ,2}}} suffices for most results. This concernscompleteness of equivalence classes of such sequences, equivalence of the whole set to the Dedekind reals, existence of amodulus of convergence for all Cauchy sequences and the preservation of such a modulus when taking limits.[24] An alternative approach that is slightly better behaved is to work a collection of Cauchy reals together a choice of modulus, i.e. not with just the real numbers but with a set of pairs, or even with a fixed modulus shared by all real numbers.

Towards the Dedekind reals

[edit]

As in the classical theory,Dedekind cuts are characterized using subsets of algebraic structures such asQ{\displaystyle {\mathbb {Q} }}: The properties of being inhabited, numerically bounded above, "closed downwards" and "open upwards" are all bounded formulas with respect to the given set underlying the algebraic structure. A standard example of a cut, the first component indeed exhibiting these properties, is the representation of2{\displaystyle {\sqrt {2}}} given by

{xQx<0x2<2},{xQ0<x2<x2}  PQ×PQ{\displaystyle {\big \langle }\{x\in {\mathbb {Q} }\mid x<0\lor x^{2}<2\},\,\{x\in {\mathbb {Q} }\mid 0<x\land 2<x^{2}\}{\big \rangle }\,\ \in \,\ {{\mathcal {P}}_{\mathbb {Q} }}\times {{\mathcal {P}}_{\mathbb {Q} }}}

(Depending on the convention for cuts, either of the two parts or neither, like here, may makes use of the sign{\displaystyle \leq }.)

The theory given by the axioms so far validates that apseudo-ordered field that is also Archimedean and Dedekind complete, if it exists at all, is in this way characterized uniquely, up to isomorphism.However, the existence of just function spaces such as{0,1}Q{\displaystyle \{0,1\}^{\mathbb {Q} }} does not grantPQ{\displaystyle {{\mathcal {P}}_{\mathbb {Q} }}} to be a set, and so neither is the class of all subsets ofQ{\displaystyle {\mathbb {Q} }} that do fulfill the named properties.What is required for the class of Dedekind reals to be a set is an axiom regarding existence of a set of subsets and this is discussed further below in the section on Binary refinement.In a context withoutPEM{\displaystyle {\mathrm {PEM} }} or Powerset,countable choice into finite sets is assumed to prove the uncountability of the set of all Dedekind reals.

Constructive schools

[edit]

Most schools for constructive analysis validate some choice and alsoBD{\displaystyle \mathrm {BD} }-N{\displaystyle {\mathbb {N} }}, as defined in the second section on number bounds. Here are some other propositions employed in theories of constructive analysis that are not provable using just base intuitionistic logic:

Certain laws in both of those schools contradictWLPO{\displaystyle {\mathrm {WLPO} }}, so that choosing to adopt all principles from either school disproves theorems from classical analysis.CT0{\displaystyle {\mathrm {CT} }_{0}} is still consistent with some choice, but contradicts the classicalWKL{\displaystyle {\mathrm {WKL} }} andLLPO{\displaystyle {\mathrm {LLPO} }}, explained below. Theindependence of premise rule with set existence premises is not fully understood, but as a number theoretic principle it is in conflict with the Russian school axioms in some frameworks. Notably,CT0{\displaystyle {\mathrm {CT} }_{0}} also contradictsFANΔ{\displaystyle {\mathrm {FAN} }_{\Delta }}, meaning the constructive schools also cannot be combined in full. Some of the principles cannot be combined constructively to the extent that together they imply forms ofPEM{\displaystyle {\mathrm {PEM} }} - for exampleMP{\displaystyle {\mathrm {MP} }} plus the countability of all subsets of the naturals. These combinations are then naturally also not consistent with further anti-classical principles.

Indecomposability

[edit]

Denote the class of all sets byV{\displaystyle {\mathcal {V}}}. Decidability of membership in a classR{\displaystyle R} can be expressed as membership inR(VR){\displaystyle R\cup ({\mathcal {V}}\setminus R)}. We also note that, by definition, the two extremal classesV{\displaystyle {\mathcal {V}}} and{}{\displaystyle \{\}} are trivially decidable. Membership in those two is equivalent to the trivial propositionsx=x{\displaystyle x=x} resp.¬(x=x){\displaystyle \neg (x=x)}.

Call a classR{\displaystyle R}indecomposable or cohesive if, for any predicateχ{\displaystyle \chi },

((xR).χ(x)¬χ(x))(((xR).χ(x))((xR).¬χ(x))){\displaystyle {\big (}\forall (x\in R).\chi (x)\lor \neg \chi (x){\big )}\to {\Big (}{\big (}\forall (x\in R).\chi (x){\big )}\lor {\big (}\forall (x\in R).\neg \chi (x){\big )}{\Big )}}

This expresses that the only properties that are decidable onR{\displaystyle R} are the trivial properties. This is well studied in intuitionistic analysis.

The so called indecomposability schemaUZ{\displaystyle {\mathrm {UZ} }} (Unzerlegbarkeit) for set theory is a possible principle which states that the whole classV{\displaystyle {\mathcal {V}}} is indecomposable. Extensionally speaking,UZ{\displaystyle {\mathrm {UZ} }} postulates that the two trivial classes are the only classes that are decidable with respect to the class of all sets. For a simple motivating predicate, consider membershipx1{\displaystyle x\in 1} in the first non-trivial class, which is to say the propertyx={}{\displaystyle x=\{\}} of being empty. This property is non-trivial to the extent that it separates some sets: The empty set is a member of1{\displaystyle 1}, by definition, while a plethora of sets are provenly not members of1{\displaystyle 1}. But, using Separation, one may of course also define various sets for which emptiness is not decidable in a constructive theory at all, i.e. membership in1(V1){\displaystyle 1\cup ({\mathcal {V}}\setminus 1)} is not provable for all sets. So here the property of emptiness does not partition the set theoretical domain of discourse into two decidable parts. For any such non-trivial property, the contrapositive ofUZ{\displaystyle {\mathrm {UZ} }} says that it cannot be decidable over all sets.

UZ{\displaystyle {\mathrm {UZ} }} is implied by the uniformity principleUP{\displaystyle {\mathrm {UP} }}, which is consistent withCZF{\displaystyle {\mathsf {CZF}}} and discussed below.

Non-constructive principles

[edit]

Of coursePEM{\displaystyle {\mathrm {PEM} }} and many principles definingintermediate logics are non-constructive.PEM{\displaystyle {\mathrm {PEM} }} andWPEM{\displaystyle {\mathrm {WPEM} }}, which isPEM{\displaystyle {\mathrm {PEM} }} for just negated propositions, can be presented asDe Morgan's rules. More specifically, this section shall be concerned with statements in terms of predicates - especially weaker ones, expressed in terms of a few quantifiers over sets, on top of decidable predicates on numbers. Referring back to the section on characteristic functions, one may call a collectionA{\displaystyle A} searchable if it is searchable for all its detachable subsets, which itself corresponds to{0,1}A{\displaystyle \{0,1\}^{A}}. This is a form of{\displaystyle \exists }-PEM{\displaystyle {\mathrm {PEM} }} forA{\displaystyle A}. Note that in the context of Exponentiation, such proposition on sets are now set-bound.

Particularly valuable in the study ofconstructive analysis are non-constructive claims commonly formulated in terms of the collection of all binary sequences and the characteristic functionsf{\displaystyle f} on the arithmetic domainA=ω{\displaystyle A=\omega } are well studied. Heref(n_)=0{\displaystyle f({\underline {\mathrm {n} }})=0} is a decidable proposition at each numeraln{\displaystyle {\mathrm {n} }}, but, as demonstrated previously, quantified statements in terms off{\displaystyle f} may not be. As is known from theincompleteness theorem and its variations, already in first-order arithmetic, example functions onN{\displaystyle {\mathbb {N} }} can be characterized such that ifPA{\displaystyle {\mathsf {PA}}} is consistent, the competing{\displaystyle \exists }-PEM{\displaystyle {\mathrm {PEM} }} disjuncts, of low complexity, are eachPA{\displaystyle {\mathsf {PA}}}-unprovable (even ifPA{\displaystyle {\mathsf {PA}}} proves the disjunction of the two axiomatically.)

More generally, the arithmetic{\displaystyle \exists }-PEM{\displaystyle {\mathrm {PEM} }}, a most prominent non-constructive, essentially logical statement goes by the namelimited principle of omniscienceLPO{\displaystyle {\mathrm {LPO} }}. In the constructive set theoryCZF{\displaystyle {\mathsf {CZF}}} introduced below, it impliesBD{\displaystyle {\mathrm {BD} }}-N{\displaystyle {\mathbb {N} }},MP{\displaystyle {\mathrm {MP} }}, theΠ10{\displaystyle \Pi _{1}^{0}}-version of the fan theorem, but alsoWKL{\displaystyle {\mathrm {WKL} }} discussed below. Recall examples of famous sentences that can be written down in aΠ10{\displaystyle \Pi _{1}^{0}}-fashion, i.e. of Goldbach-type:Goldbach conjecture,Fermat's last theorem but also theRiemann hypothesis are among them. Assuming relativized dependent choiceRDP{\displaystyle {\mathrm {RDP} }} and the classicalLPO{\displaystyle {\mathrm {LPO} }} overCZF{\displaystyle {\mathsf {CZF}}} does not enable proofs of moreΠ02{\displaystyle \Pi _{0}^{2}}-statements.LPO{\displaystyle {\mathrm {LPO} }} postulates a disjunctive property, as does the weaker decidability statement for functions being constant (Π10{\displaystyle \Pi _{1}^{0}}-sentences)WLPO{\displaystyle {\mathrm {WLPO} }}, the arithmetic{\displaystyle \forall }-PEM{\displaystyle {\mathrm {PEM} }}. The two are related in a similar way as isPEM{\displaystyle {\mathrm {PEM} }} versusWPEM{\displaystyle {\mathrm {WPEM} }} and they essentially differ byMP{\displaystyle {\mathrm {MP} }}.WLPO{\displaystyle {\mathrm {WLPO} }} in turn implies the so-called "lesser" versionLLPO{\displaystyle {\mathrm {LLPO} }}. This is the (arithmetic){\displaystyle \exists }-version of the non-constructive De Morgan's rule for a negated conjunction. There are, for example, models of the strong set theoryIZF{\displaystyle {\mathsf {IZF}}} which separate such statements, in the sense that they may validateLLPO{\displaystyle {\mathrm {LLPO} }} but rejectWLPO{\displaystyle {\mathrm {WLPO} }}.

Disjunctive principles aboutΠ10{\displaystyle \Pi _{1}^{0}}-sentences generally hint at equivalent formulations decidingapartness in analysis in a context with mild choice orMP{\displaystyle {\mathrm {MP} }}. The claim expressed byLPO{\displaystyle {\mathrm {LPO} }} translated to real numbers is equivalent to the claim that either equality or apartness of any two reals is decidable (it in fact decides the trichotomy). It is then also equivalent to the statement that every real is either rational or irrational - without the requirement for or construction of a witness for either disjunct. Likewise, the claim expressed byLLPO{\displaystyle {\mathrm {LLPO} }} for real numbers is equivalent that the ordering{\displaystyle \leq } of any two reals is decidable (dichotomy). It is then also equivalent to the statement that if the product of two reals is zero, then either of the reals is zero - again without a witness. Indeed, formulations of the three omniscience principles are then each equivalent to theorems of the apartness, equality or order of two reals in this way. Yet more can be said about the Cauchy sequences that are augmented with a modulus of convergence.

Afamous source of computable undecidability - and in turn also of a broad range of undecidable propositions - is the predicate expressing a computer program to be total.

Infinite trees

[edit]

Through the relation between computability and the arithmetical hierarchy, insights in this classical study are also revealing for constructive considerations. A basic insight ofreverse mathematics concerns computable infinite finitely branching binary trees. Such a tree may e.g. be encoded as an infinite set of finite sets

Tnω{0,1}{0,1,,n1}{\displaystyle T\,\subset \,\bigcup _{n\in \omega }\{0,1\}^{\{0,1,\dots ,n-1\}}},

with decidable membership, and those trees then provenly contain elements of arbitrary big finite size. The so called WeakKőnig's lemmaWKL{\displaystyle {\mathrm {WKL} }} states: For suchT{\displaystyle T}, there always exists an infinite path inω{0,1}{\displaystyle \omega \to \{0,1\}}, i.e. an infinite sequence such that all its initial segments are part of the tree. In reverse mathematics, the second-order arithmetic subsystemRCA0{\displaystyle {\mathsf {RCA}}_{0}} does not proveWKL{\displaystyle {\mathrm {WKL} }}. To understand this, note that there are computable treesK{\displaystyle K} for which nocomputable such path through it exists. To prove this, oneenumerates the partial computable sequences and then diagonalizes all total computable sequences in one partial computable sequencesd{\displaystyle d}. One can then roll out a certain treeK{\displaystyle K}, one exactly compatible with the still possible values ofd{\displaystyle d} everywhere, which by construction is incompatible with any total computable path.

InCZF{\displaystyle {\mathsf {CZF}}}, the principleWKL{\displaystyle {\mathrm {WKL} }} impliesLLPO{\displaystyle {\mathrm {LLPO} }} andΠ10{\displaystyle \Pi _{1}^{0}}-ACω,2{\displaystyle {\mathrm {AC} }_{\omega ,2}}, a very modest form of countable choice introduced above. The former two are equivalent assuming that choice principle already in the more conservative arithmetic context.WKL{\displaystyle {\mathrm {WKL} }} is also equivalent to theBrouwer fixed point theorem and other theorems regarding values of continuous functions on the reals. The fixed point theorem in turn implies theintermediate value theorem, but always be aware that these claims may depend on the formulation, as the classical theorems about encoded reals can translate to different variants when expressed in a constructive context.[25]

TheWKL{\displaystyle {\mathrm {WKL} }}, and some variants thereof, concernsinfinite graphs and so its contrapositives gives a condition for finiteness. Again to connect to analysis, over theclassical arithmetic theoryRCA0{\displaystyle {\mathsf {RCA}}_{0}}, the claim ofWKL{\displaystyle {\mathrm {WKL} }} is for example equivalent to theBorel compactness regarding finite subcovers of the real unit interval.FANΔ{\displaystyle {\mathrm {FAN} }_{\Delta }} is a closely related existence claim involving finite sequences in an infinite context. OverRCA0{\displaystyle {\mathsf {RCA}}_{0}}, they are actually equivalent. InCZF{\displaystyle {\mathsf {CZF}}} those are distinct, but, after again assuming some choice, here thenWKL{\displaystyle {\mathrm {WKL} }} impliesFANΔ{\displaystyle {\mathrm {FAN} }_{\Delta }}.

Induction

[edit]

Mathematical induction

[edit]

It was observed that in set language, induction principles can readIndAωA{\displaystyle \mathrm {Ind} _{A}\to \omega \subset A}, with the antecedentIndA{\displaystyle \mathrm {Ind} _{A}} defined as further above, and withωA{\displaystyle \omega \subset A} meaning(nω).nA{\displaystyle \forall (n\in \omega ).n\in A} where the setω{\displaystyle \omega } always denotes the standard model of natural numbers. Via the strong axiom of Infinity and predicative Separation, the validity of induction for set-bounded orΔ0{\displaystyle \Delta _{0}}-definitions was already established and thoroughly discussed. For those predicates involving only quantifiers overω{\displaystyle \omega }, it validates induction in the sense of the first-order arithmetic theory. In a set theory context whereω{\displaystyle \omega } is a set, this induction principle can be used to prove predicatively defined subclasses ofω{\displaystyle \omega } to be thesetω{\displaystyle \omega } itself. The so called full mathematical induction schema now postulates set equality ofω{\displaystyle \omega } toall its inductive subclasses. As in the classical theory, it is also implied when passing to the impredicative full Separation schema. As stated in the section on Choice, induction principles such as this are also implied by various forms of choice principles.

The recursion principle for set functions mentioned in the section dedicated to arithmetic is also implied by the full mathematical induction schema over one's structure modeling the naturals (e.g.ω{\displaystyle \omega }). So for that theorem, granting a model of Heyting arithmetic, it represents an alternative to exponentiation principles.

Axiom schema of fullmathematical induction: For any predicateϕ{\displaystyle \phi } onω{\displaystyle \omega },

(ϕ(0)  (kω).(ϕ(k)ϕ(Sk)))(nω).ϕ(n){\displaystyle {\Big (}\phi (0)\ \land \ \forall (k\in \omega ).{\big (}\phi (k)\to \phi (Sk){\big )}{\Big )}\to \forall (n\in \omega ).\phi (n)}

Predicate formulas used with the schema are to be understood as formulas in first-order set theory. The zero0{\displaystyle 0} denotes the set{}{\displaystyle \{\}} as above, and the setSn{\displaystyle Sn} denotes the successor set ofnω{\displaystyle n\in \omega }, withnSn{\displaystyle n\in Sn}. By Axiom of Infinity above, it is again a member ofω{\displaystyle \omega }. Beware that unlike in an arithmetic theory, the naturals here are not the abstract elements in the domain of discourse, but elements of a model. As has been observed in previous discussions, when acceptingECST{\displaystyle {\mathsf {ECST}}}, not even for all predicatively defined sets is the equality to such a finite von Neumann ordinal necessarily decidable.

Set Induction

[edit]

Going beyond the previous induction principles, one has full set induction, which is to be compared towell-founded induction. Like mathematical induction above, the following axiom is formulated as a schema in terms of predicates, and thus has a different character than induction principles proven from predicative set theory axioms. A variant of the axiom just forbounded formulas is also studied independently and may be derived from other axioms.

Axiom schema of Set induction: For any predicateϕ{\displaystyle \phi },

(x.((yx). ϕ(y))ϕ(x))z.ϕ(z){\displaystyle {\Big (}\forall x.\,{\big (}\forall (y\in x).\ \phi (y){\big )}\to \phi (x){\Big )}\to \forall z.\phi (z)}

Here(z{}).ϕ(z){\displaystyle \forall (z\in \{\}).\phi (z)} holds trivially and so this covers to the "bottom case"ϕ({}){\displaystyle \phi (\{\})} in the standard framework. This (as well as natural number induction) may again be restricted to just the bounded set formulas, in which case arithmetic is not impacted.

InECST{\displaystyle {\mathsf {ECST}}}, the axiom proves induction intransitive sets and so in particular also for transitive sets of transitive sets. The latter then is an adequate definition of the ordinals, and even aΔ0{\displaystyle \Delta _{0}}-formulation. Set induction in turn enables ordinal arithmetic in this sense. It further allows definitions of class functions bytransfinite recursion. The study of the various principles that grant set definitions by induction, i.e. inductive definitions, is a main topic in the context of constructive set theory and their comparatively weakstrengths. This also holds for theircounterparts in type theory. Replacement is not required to prove induction over the set of naturals from set induction, but that axiom is necessary for their arithmetic modeled within the set theory.

Theaxiom of regularity is a single statement with universal quantifier over sets and not a schema. As show, it impliesPEM{\displaystyle {\mathrm {PEM} }}, and so is non-constructive. Now forϕ{\displaystyle \phi } taken to be the negation of some predicate¬S{\displaystyle \neg S} and writingΣ{\displaystyle \Sigma } for the class{yS(y)}{\displaystyle \{y\mid S(y)\}}, induction reads

(xΣ).¬(xΣ={})Σ={}{\displaystyle \forall (x\in \Sigma ).\neg (x\cap \Sigma =\{\})\,\,\leftrightarrow \,\,\Sigma =\{\}}

Via the contrapositive, set induction implies all instances of regularity but only with double-negated existence in the conclusion. In the other direction, given enoughtransitive sets, regularity implies each instance of set induction.

Metalogic

[edit]

The theory formulated above can be expressed asCZF{\displaystyle {\mathsf {CZF}}} with its collection axioms discarded in favour of the weaker Replacement and Exponentiation axioms. It proves the Cauchy reals to be a set, but not the class of Dedekind reals.

Call an ordinal itself trichotomous if the irreflexive membership relation "{\displaystyle \in }" among its members istrichotomous. Like the axiom of regularity, set induction restricts the possible models of "{\displaystyle \in }" and thus that of a set theory, as was the motivation for the principle in the 20's. But the constructive theory here does not prove a trichotomy for all ordinals, while the trichotomous ordinals are not well behaved with respect to the notion of successor and rank.

The added proof-theoretical strength attained with Induction in the constructive context is significant, even if dropping Regularity in the context ofZF{\displaystyle {\mathsf {ZF}}} does not reduce the proof-theoretical strength.Even without Exponentiation, the present theory with set induction has the same proof theoretic strength asCZF{\displaystyle {\mathsf {CZF}}} and proves the same functions recursive. Specifically, its proof-theoreticlarge countable ordinal is theBachmann–Howard ordinal. This is also the ordinal of classical or intuitionisticKripke–Platek set theory.It is consistent even withIZF{\displaystyle {\mathsf {IZF}}} to assume that the class of trichotomous ordinals form a set. The current theory augmented with this ordinal set existence postulate proves the consistency ofCZF{\displaystyle {\mathsf {CZF}}}.

Aczel was also one of the main developers orNon-well-founded set theory, which rejects set induction.

Relation to ZF

[edit]

The theory also constitutes a presentation ofZermelo–Fraenkel set theoryZF{\displaystyle {\mathsf {ZF}}} in the sense that variants of all its eight axioms are present. Extensionality, Pairing, Union and Replacement are indeed identical. Separation is adopted in a weak predicative form while Infinity is stated in a strong formulation. Akin to the classical formulation, this Separation axiom and the existence of any set already proves the Empty Set axiom. Exponentiation for finite domains and full mathematical induction are also implied by their stronger adopted variants. Without the principle of excluded middle, the theory here is lacking, in its classical form, full Separation, Powerset as well as Regularity. AcceptingPEM{\displaystyle {\mathrm {PEM} }} now exactly leads into the classical theory.

The following highlights the different readings of a formal theory. LetCH{\displaystyle \mathrm {CH} } denote the continuum hypothesis andB:={z1CH}{\displaystyle B:=\{z\in 1\mid \mathrm {CH} \}} so that0BCH{\displaystyle 0\in B\leftrightarrow \mathrm {CH} }. Thenb:=B{1}{\displaystyle b:=B\cup \{1\}} is inhabited by1{\displaystyle 1} and any set that is established to be a member ofb{\displaystyle b} either equals0{\displaystyle 0} or1{\displaystyle 1}. Induction onω{\displaystyle \omega } implies that it cannot consistently be negated thatb{\displaystyle b} has some least natural number member. The value of such a member can be shown to be independent of theories such asZFC{\displaystyle {\mathsf {ZFC}}}. Nonetheless, any classical set theory likeZFC{\displaystyle {\mathsf {ZFC}}} also proves thereexists such a number.

Strong Collection

[edit]

Having discussed all the weakened forms of the classical set theory axioms, Replacement and Exponentiation can be further strengthened without losing a type theoretical interpretation, and in a way that is not going beyondZF{\displaystyle {\mathsf {ZF}}}.

So firstly, one may reflect upon the strength of theaxiom of replacement, also in the context of the classical set theory. For any sety{\displaystyle y} and any naturaln{\displaystyle n}, there exists the productyn{\displaystyle y^{n}} recursively given byyn1×y{\displaystyle y^{n-1}\times y}, which have ever deeperrank. Induction for unbound predicates proves that these sets exist for all of the infinitely many naturals. Replacement "fornyn{\displaystyle n\mapsto y^{n}}" now moreover states that this infinite class of products can be turned into the infinite set,{p(nω).p=yn}{\displaystyle \{p\mid \exists (n\in \omega ).p=y^{n}\}}. This is also not a subset of any previously established set.

Going beyond those axioms also seen in Myhill's typed approach, consider the discussed constructive theory with Exponentiation and Induction, but now strengthened by thecollection schema. InZF{\displaystyle {\mathsf {ZF}}} it is equivalent to Replacement, unless the powerset axiom is dropped. In the current context the strong axiom presented supersedes Replacement, due to not requiring thebinary relation definition to be functional, but possibly multi-valued.

Axiom schema of Strong Collection: For any predicateϕ{\displaystyle \phi },

a.  ((xa).y.ϕ(x,y))b.((xa).(yb).ϕ(x,y)  (yb).(xa).ϕ(x,y)){\displaystyle \forall a.\ \ {\Big (}\forall (x\in a).\exists y.\phi (x,y){\Big )}\to \exists b.{\Big (}\forall (x\in a).\exists (y\in b).\phi (x,y)\ \land \ \forall (y\in b).\exists (x\in a).\phi (x,y){\Big )}}

In words, for every total relation, there exists an image setb{\displaystyle b} such that the relation is total in both directions. Expressing this via a raw first-order formulation leads to a somewhat repetitive format. The antecedent states that one considers relationϕ{\displaystyle \phi } between setsx{\displaystyle x} andy{\displaystyle y} that are total over a certain domain seta{\displaystyle a}, that is,ϕ{\displaystyle \phi } has at least one "image value"y{\displaystyle y} for every elementx{\displaystyle x} in the domain. This is more general than an inhabitance conditionxy{\displaystyle x\in y} in a set theoretical choice axiom, but also more general than the condition of Replacement, which demands unique existence!y{\displaystyle \exists !y}. In the consequent, firstly, the axioms states that then there exists a setb{\displaystyle b} which contains at least one "image" valuey{\displaystyle y} underϕ{\displaystyle \phi }, for every element of the domain. Secondly, in this axioms formulation it then moreover states that only such imagesy{\displaystyle y} are elements of that new codomain setb{\displaystyle b}. It is guaranteeing thatb{\displaystyle b} does not overshoot the codomain ofϕ{\displaystyle \phi } and thus the axiom is also expressing some power akin to a Separation procedure. The principle may be used in the constructive study of larger sets beyond the everyday need of analysis.

Weak collection andpredicative separation together imply strong collection: separation cuts out the subset ofb{\displaystyle b} consisting of thosey{\displaystyle y} such thatϕ(x,y){\displaystyle \phi (x,y)} for somexa{\displaystyle x\in a}.

Metalogic

[edit]

This theory withoutPEM{\displaystyle {\mathrm {PEM} }}, without unbounded separation and without "naive" Power set enjoys various nice properties. For example, as opposed toCZF{\displaystyle {\mathsf {CZF}}} with its subset collection schema below, it has theexistence property.

Constructive Zermelo–Fraenkel

[edit]

Binary refinement

[edit]

The so called binary refinement axiom says that for anya{\displaystyle a} there exists a setBaPa{\displaystyle {\mathcal {B}}_{a}\subset {\mathcal {P}}_{a}} such that for any coveringa=xy{\displaystyle a=x\cup y}, the setBa{\displaystyle {\mathcal {B}}_{a}} holds two subsetscx{\displaystyle c\subset x} anddy{\displaystyle d\subset y} that also do this covering job,a=cd{\displaystyle a=c\cup d}. It is a weakest form of the powerset axiom and at the core of some important mathematical proofs.Fullness below, for relations between the seta{\displaystyle a} and the finite{0,1}{\displaystyle \{0,1\}}, implies that this is indeed possible.

Taking another step back,ECST{\displaystyle {\mathsf {ECST}}} plus Recursion and plus Binary refinement already proves that there exists an Archimedean, Dedekind complete pseudo-ordered field. That set theory also proves that the class of leftDedekind cuts is a set, not requiring Induction or Collection. And it moreover proves that function spaces into discrete sets are sets (there e.g.ωω{\displaystyle \omega \to \omega }), without assumingExp{\displaystyle {\mathrm {Exp} }}. Already over the weak theoryBCST{\displaystyle {\mathsf {BCST}}} (which is to say without Infinity) does binary refinement prove that function spaces into discrete sets are sets, and therefore e.g. the existence of allcharacteristic function spaces{0,1}a{\displaystyle \{0,1\}^{a}}.

Subset Collection

[edit]

The theory known asCZF{\displaystyle {\mathsf {CZF}}} adopts the axioms of the previous sections plus a stronger form of Exponentiation. It is by adopting the following alternative to Exponentiation, which can again be seen as a constructive version of thePower set axiom:

Axiom schema of Subset Collection: For any predicateϕ{\displaystyle \phi },

a.b.  u.z.((xa).(yb).ϕ(x,y,z))(vu).((xa).(yv).ϕ(x,y,z)(yv).(xa).ϕ(x,y,z)){\displaystyle \forall a.\forall b.\ \ \exists u.\forall z.{\big (}\forall (x\in a).\exists (y\in b).\phi (x,y,z){\big )}\to \exists (v\in u).{\big (}\forall (x\in a).\exists (y\in v).\phi (x,y,z)\;\land \;\forall (y\in v).\exists (x\in a).\phi (x,y,z){\big )}}

An alternative that is not a schema is elaborated on below.

Fullness

[edit]

For givena{\displaystyle a} andb{\displaystyle b}, letRab{\displaystyle {\mathcal {R}}_{ab}} be the class of alltotal relations betweena{\displaystyle a} andb{\displaystyle b}. This class is given as

rRab(((xa).(yb).x,yr)((pr).(xa).(yb).p=x,y)){\displaystyle r\in {\mathcal {R}}_{ab}\leftrightarrow {\Big (}{\big (}\forall (x\in a).\exists (y\in b).\langle x,y\rangle \in r{\big )}\,\land \,{\big (}\forall (p\in r).\exists (x\in a).\exists (y\in b).p=\langle x,y\rangle {\big )}{\Big )}}

As opposed to the function definition, there is no unique existence quantifier in!(yb){\displaystyle \exists !(y\in b)}. The classRab{\displaystyle {\mathcal {R}}_{ab}} represents the space of "non-unique-valued functions" or "multivalued functions" froma{\displaystyle a} tob{\displaystyle b}, but as set of individual pairs with right projection inb{\displaystyle b}. The second clause says that one is concerned with only these relations, not those which are total ona{\displaystyle a} but also extend their domain beyonda{\displaystyle a}.

One does not postulateRab{\displaystyle {\mathcal {R}}_{ab}} to be a set, since with Replacement one can use this collection of relations between a seta{\displaystyle a} and the finiteb={0,1}{\displaystyle b=\{0,1\}}, i.e. the "bi-valued functions ona{\displaystyle a}", to extract the setPa{\displaystyle {\mathcal {P}}_{a}} of all its subsets. In other wordsRab{\displaystyle {\mathcal {R}}_{ab}} being a set would imply the Powerset axiom.

OverECTS+Strong Collection{\displaystyle {\mathsf {ECTS}}+{\text{Strong Collection}}}, there is a single, somewhat cleareralternative axiom to the Subset Collection schema. It postulates the existence of a sufficiently largesetSab{\displaystyle {\mathcal {S}}_{ab}} of total relations betweena{\displaystyle a} andb{\displaystyle b}.

Axiom of Fullness:

a.b. Sab.(SabRab)(rRab).(sSab).sr{\displaystyle \forall a.\forall b.\ \exists {\mathcal {S}}_{ab}.\,({\mathcal {S}}_{ab}\subset {\mathcal {R}}_{ab})\,\land \,\forall (r\in {\mathcal {R}}_{ab}).\exists (s\in {\mathcal {S}}_{ab}).s\subset r}

This says that for any two setsa{\displaystyle a} andb{\displaystyle b}, there exists a setSabRab{\displaystyle {\mathcal {S}}_{ab}\subset {\mathcal {R}}_{ab}} which among its members inhabits a still total relationsSab{\displaystyle s\in {\mathcal {S}}_{ab}} for any given total relationrRab{\displaystyle r\in {\mathcal {R}}_{ab}}.

On a given domaina{\displaystyle a}, the functions are exactly the sparsest total relations, namely the unique valued ones. Therefore, the axiom implies that there is a set such that all functions are in it. In this way, Fullness implies Exponentiation. It further implies binary refinement, already overBCST{\displaystyle {\mathsf {BCST}}}.

The Fullness axiom, as well as dependent choice, is in turn also implied by the so-called Presentation Axiom about sections, which can also be formulatedcategory theoretically.

Metalogic of CZF

[edit]

CZF{\displaystyle {\mathsf {CZF}}} has thenumerical existence property and the disjunctive property, but there are concessions:CZF{\displaystyle {\mathsf {CZF}}} lacks the existence property due to the Subset Collection Schema or Fullness axiom. The schema can also be an obstacle for realizability models. The existence property is not lacking when the weaker Exponentiation or the stronger but impredicative Powerset axiom axiom is adopted instead. The latter is in general lacking a constructive interpretation.

Unprovable claims

[edit]

The theory is consistent with some anti-classical assertions, but on its own proves nothing not provable inZF{\displaystyle {\mathsf {ZF}}}. Some prominent statements not proven by the theory (nor byIZF{\displaystyle {\mathsf {IZF}}}, for that matter) are part of the principles listed above, in the sections on constructive schools in analysis, on the Cauchy construction and on non-constructive principles. What follows concerns set theoretical concepts:

The bounded notion of a transitive set of transitive sets is a good way to define ordinals and enables induction on ordinals. But notably, this definition includes someΔ0{\displaystyle \Delta _{0}}-subsets inCZF{\displaystyle {\mathsf {CZF}}}. So assuming that the membership of0{\displaystyle 0} is decidable in all successor ordinalsSα{\displaystyle S\alpha } provesPEM{\displaystyle {\mathrm {PEM} }} for bounded formulas inCZF{\displaystyle {\mathsf {CZF}}}. Also, neitherlinearity ofordinals, nor existence of power sets of finite sets are derivable in this theory, as assuming either implies Power set. The circumstance that ordinals are better behaved in the classical than in the constructive context manifests in a different theory of large set existence postulates.

Consider the functions the domain of which isω{\displaystyle \omega } or somenω{\displaystyle n\in \omega }. These are sequences and their ranges are counted sets. Denote byC{\displaystyle C} the class characterized as the smallest codomain such that the ranges of the aforementioned functions intoC{\displaystyle C} are also itself members ofC{\displaystyle C}. InZF{\displaystyle {\mathsf {ZF}}}, this is thesetH1{\displaystyle H_{\aleph _{1}}} ofhereditarily countable sets and has ordinal rank at mostω2{\displaystyle \omega _{2}}. InZF{\displaystyle {\mathsf {ZF}}}, it is uncountable (as it also contains all countable ordinals, the cardinality of which is denoted1{\displaystyle \aleph _{1}}) but its cardinality is not necessarily that ofR{\displaystyle {\mathbb {R} }}. Meanwhile,CZF{\displaystyle {\mathsf {CZF}}} does not proveC{\displaystyle C} even constitutes a set, even whencountable choice is assumed.

Finally, the theory does not prove that all function spaces formed from sets in theconstructible universeL{\displaystyle L} are setsinsideL{\displaystyle L}, and this holds even when assuming Powerset instead of the weaker Exponentiation axiom. So this is a particular statement preventingCZF{\displaystyle {\mathsf {CZF}}} from proving the classL{\displaystyle L} to be a model ofCZF{\displaystyle {\mathsf {CZF}}}.

Ordinal analysis

[edit]

TakingCZF{\displaystyle {\mathsf {CZF}}} and dropping set induction gives a theory that is conservative overHA{\displaystyle {\mathsf {HA}}} for arithmetic statements, in that sense that it proves the same arithmetical statements for itsHA{\displaystyle {\mathsf {HA}}}-modelω{\displaystyle \omega }. Adding back just mathematical induction gives a theory withproof theoretic ordinalφ(ε0,0){\displaystyle \varphi (\varepsilon _{0},0)}, which is the first common fixed point of theVeblen functionsφβ{\displaystyle \varphi _{\beta }} forβ<ε0{\displaystyle \beta <\varepsilon _{0}}. This is the same ordinal as forML1{\displaystyle {\mathsf {ML_{1}}}} and is below theFeferman–Schütte ordinalΓ0{\displaystyle \Gamma _{0}}.Exhibiting a type theoretical model, the full theoryCZF{\displaystyle {\mathsf {CZF}}} goes beyondΓ0{\displaystyle \Gamma _{0}}, its ordinal still being the modest Bachmann–Howard ordinal.Assuming the class of trichotomous ordinals is a set raises the proof theoretical strength ofCZF{\displaystyle {\mathsf {CZF}}} (but not ofIZF{\displaystyle {\mathsf {IZF}}}).

Being related to inductive definitions orbar induction, the regular extension axiomREA{\displaystyle {\mathrm {REA} }} raises the proof theoretical strength ofCZF{\displaystyle {\mathsf {CZF}}}. This large set axiom, granting the existence of certain nice supersets for every set, is proven byZFC{\displaystyle {\mathsf {ZFC}}}.

Models

[edit]

The category of sets and functions ofCZF+REA{\displaystyle {\mathsf {CZF}}+{\mathrm {REA} }} is aΠW{\displaystyle \Pi W}-pretopos. Without diverging intotopos theory, certain extended suchΠW{\displaystyle \Pi W}-pretopoi contain models ofCZF+REA{\displaystyle {\mathsf {CZF}}+{\mathrm {REA} }}. Theeffective topos contains a model of thisCZF+REA{\displaystyle {\mathsf {CZF}}+{\mathrm {REA} }} based on maps characterized by certain good subcountability properties.

Separation, stated redundantly in a classical context, is constructively not implied by Replacement. The discussion so far only committed to the predicatively justified bounded Separation. Note that full Separation (together withRDC{\displaystyle {\mathrm {RDC} }},MP{\displaystyle {\mathrm {MP} }} and alsoIP{\displaystyle {\mathrm {IP} }} for sets) is validated in some effective topos models, meaning the axiom does not spoil cornerstones of the restrictive recursive school.

Related are type theoretical interpretations.In 1977 Aczel showed thatCZF{\displaystyle {\mathsf {CZF}}} can still be interpreted inMartin-Löf type theory,[26] using thepropositions-as-types approach. More specifically, this uses one universe andW{\displaystyle W}-types, providing what is now seen a standard model ofCZF{\displaystyle {\mathsf {CZF}}} inML1V{\displaystyle {\mathsf {ML_{1}V}}}.[27]This is done in terms of the images of its functions and has a fairly direct constructive and predicative justification, while retaining the language of set theory. Roughly, there are two "big" typesU,V{\displaystyle U,V}, the sets are all given through anyf:AV{\displaystyle f\colon A\to V} on someA:U{\displaystyle A\colon U}, and membership of ax{\displaystyle x} in the set is defined to hold when(a:A).f(a)=x{\displaystyle \exists (a\colon A).f(a)=x}.Conversely,CZF{\displaystyle {\mathsf {CZF}}} interpretsML1V{\displaystyle {\mathsf {ML_{1}V}}}.All statements validated in the subcountable model of the set theory can be proven exactly viaCZF{\displaystyle {\mathsf {CZF}}} plus thechoice principleΠΣ{\displaystyle \Pi \Sigma }-AC{\displaystyle \mathrm {AC} }, stated further above.As noted, theories likeCZF{\displaystyle {\mathsf {CZF}}}, and also together with choice, have the existence property for a broad class of sets in common mathematics.Martin-Löf type theories with additional induction principles validate corresponding set theoretical axioms.

Soundness andCompleteness theorems ofCZF{\displaystyle {\mathsf {CZF}}}, with respect torealizability, have been established.

Breaking with ZF

[edit]

One may of course add a Church's thesis.

One may postulate thesubcountability of all sets. This already holds true in the type theoretical interpretation and the model in the effective topos. By Infinity and Exponentiation,ωω{\displaystyle \omega \to \omega } is an uncountable set, while the classPω{\displaystyle {\mathcal {P}}_{\omega }} or evenP1{\displaystyle {\mathcal {P}}_{1}} is then provenly not a set, byCantor's diagonal argument. So this theory then logically rejects Powerset and of coursePEM{\displaystyle {\mathrm {PEM} }}.Subcountability is also in contradiction with variouslarge set axioms. (On the other hand, also usingCZF{\displaystyle {\mathsf {CZF}}}, some such axioms imply the consistency of theories such asZF{\displaystyle {\mathsf {ZF}}} and stronger.)

As a rule of inference,CZF{\displaystyle {\mathsf {CZF}}} is closed underTroelstra's general uniformity for bothz=ω{\displaystyle z=\omega } andz={0,1}{\displaystyle z=\{0,1\}}. One may adopt it as an anti-classical axiom schema, the uniformity principle which may be denotedUP{\displaystyle {\mathrm {UP} }},

z.(x.(yz).ϕ(x,y))(yz).x.ϕ(x,y){\displaystyle \forall z.{\big (}\forall x.\exists (y\in z).\phi (x,y){\big )}\to \exists (y\in z).\forall x.\phi (x,y)}

This also is incompatible with the powerset axiom. The principle is also often formulated forz=ω{\displaystyle z=\omega }. Now for a binary set of labelsz={0,1}{\displaystyle z=\{0,1\}},UP{\displaystyle {\mathrm {UP} }} implies the indecomposability schemaUZ{\displaystyle {\mathrm {UZ} }}, as noted.

In 1989 Ingrid Lindström showed thatnon-well-founded sets can also be interpreted in Martin-Löf type theory, which are obtained by replacing Set Induction inCZF{\displaystyle {\mathsf {CZF}}} withAczel's anti-foundation axiom.[28] The resulting theoryCZFA{\displaystyle {\mathsf {CZFA}}} may be studied by also adding back theω{\displaystyle \omega }-induction schema or relativizeddependent choice, as well as the assertion that every set is member of atransitive set.

Intuitionistic Zermelo–Fraenkel

[edit]

The theoryIZF{\displaystyle {\mathsf {IZF}}} isCZF{\displaystyle {\mathsf {CZF}}}adopting both the standardSeparation as well asPower set and, as inZF{\displaystyle {\mathsf {ZF}}}, one conventionally formulates the theory with Collection below. As such,IZF{\displaystyle {\mathsf {IZF}}} can be seen as the most straight forward variant ofZF{\displaystyle {\mathsf {ZF}}} withoutPEM.So as noted, inIZF{\displaystyle {\mathsf {IZF}}}, in place of Replacement, one may use the

Axiom schema of collection: For any predicateϕ{\displaystyle \phi },

z.((xz).y.ϕ(x,y))w.(xz).(yw).ϕ(x,y){\displaystyle \forall z.{\big (}\forall (x\in z).\exists y.\phi (x,y){\big )}\to \exists w.\forall (x\in z).\exists (y\in w).\phi (x,y)}

While the axiom of replacement requires the relationϕ to befunctional over the setz (as in, for everyx inz there is associated exactly oney), the Axiom of Collection does not.It merely requires there be associated at least oney, and it asserts the existence of a set which collects at least one suchy for each suchx.In classicalZFC{\displaystyle {\mathsf {ZFC}}}, the Collection schema implies theAxiom schema of replacement. When making use of Powerset (and only then), they can be shown to be classically equivalent.

WhileIZF{\displaystyle {\mathsf {IZF}}} is based on intuitionistic rather than classical logic, it is consideredimpredicative.It allows formation of sets via a power set operation and using the generalAxiom of Separation with any proposition, including ones which containquantifiers which are not bounded. Thus new sets can be formed in terms of the universe of all sets, distancing the theory from the bottom-up constructive perspective. So it is even easier to definesets{xBQ(x)}{\displaystyle \{x\in B\mid Q(x)\}} with undecidable membership, namely by making use of undecidable predicates defined on a set.The power set axiom further implies the existence of a set oftruth values. In the presence of excluded middle, this set has two elements. In the absence of it, the set of truth values is also considered impredicative. The axioms ofIZF{\displaystyle {\mathsf {IZF}}} are strong enough so that fullPEM is already implied byPEM for bounded formulas. See also the previous discussion in the section on the Exponentiation axiom. And by the discussion about Separation, it is thus already implied by the particular formulax.(0x0x){\displaystyle \forall x.{\big (}0\in x\lor 0\notin x{\big )}}, the principle that knowledge of membership of0{\displaystyle 0} shall always be decidable, no matter the set.

Metalogic

[edit]

As implied above, the subcountability property cannot be adopted for all sets, given the theory provesPω{\displaystyle {\mathcal {P}}_{\omega }} to be a set.The theory has many of the nicenumerical existence properties and is e.g. consistent with Church's thesis principle as well as withωω{\displaystyle \omega \to \omega } being subcountable. It also has the disjunctive property.

IZF{\displaystyle {\mathsf {IZF}}} with Replacement instead of Collection has the general existence property, even when adopting relativized dependent choice on top of it all. But justIZF{\displaystyle {\mathsf {IZF}}} as formulated does not. The combination of schemas including full separation spoils it.

Even withoutPEM, theproof theoretic strength ofIZF{\displaystyle {\mathsf {IZF}}} equals that ofZF{\displaystyle {\mathsf {ZF}}}. AndHA{\displaystyle {\mathsf {HA}}} proves them equiconsistent and they prove the sameΠ10{\displaystyle \Pi _{1}^{0}}-sentences.

Intuitionistic Z

[edit]

Again on the weaker end, as with its historical counterpartZermelo set theory, one may denote byIZ{\displaystyle {\mathsf {IZ}}} the intuitionistic theory set up likeIZF{\displaystyle {\mathsf {IZF}}} but without Replacement, Collection or Induction.

Intuitionistic KP

[edit]

Let us mention another very weak theory that has been investigated, namely Intuitionistic (or constructive)Kripke–Platek set theoryIKP{\displaystyle {\mathsf {IKP}}}.It has not only Separation but also Collection restricted toΔ0{\displaystyle \Delta _{0}}-formulas, i.e. it is similar toBCST{\displaystyle {\mathsf {BCST}}} but with Induction instead of full Replacement. The theory does not fit into the hierarchy as presented above, simply because it hasAxiom schema of Set Induction from the start. This enables theorems involving the class of ordinals. The theory has the disjunction property.Of course, weaker versions ofIKP{\displaystyle {\mathsf {IKP}}} are obtained by restricting the induction schema to narrower classes of formulas, sayΣ1{\displaystyle \Sigma _{1}}. The theory is especially weak when studied without Infinity.

Sorted theories

[edit]

Constructive set theory

[edit]

As he presented it, Myhill's systemCST{\displaystyle {\mathsf {CST}}} is a theory using constructive first-order logic with identity and two moresorts beyond sets, namelynatural numbers andfunctions. Its axioms are:

  • The usualAxiom of Extensionality for sets, as well as one for functions, and the usualAxiom of union.
  • The Axiom of restricted, or predicative,separation, which is a weakened form of theSeparation axiom from classical set theory, requiring that anyquantifications be bounded to another set, as discussed.
  • A form of theAxiom of Infinity asserting that the collection of natural numbers (for which he introduces a constantω{\displaystyle \omega }) is in fact a set.
  • The axiom of Exponentiation, asserting that for any two sets, there is a third set which contains all (and only) the functions whose domain is the first set, and whose range is the second set. This is a greatly weakened form of theAxiom of power set in classical set theory, to which Myhill, among others, objected on the grounds of itsimpredicativity.

And furthermore:

  • The usualPeano axioms for natural numbers.
  • Axioms asserting that thedomain andrange of a function are both sets. Additionally, anAxiom of non-choice asserts the existence of a choice function in cases where the choice is already made. Together these act like the usualReplacement axiom in classical set theory.

One can roughly identify the strength of this theory with a constructive subtheories ofZF{\displaystyle {\mathsf {ZF}}} when comparing with the previous sections.

And finally the theory adopts

Bishop style set theory

[edit]

Set theory in the flavor ofErrett Bishop's constructivist school mirrors that of Myhill, but is set up in a way that sets come equipped with relations that govern their discreteness.Commonly, Dependent Choice is adopted.

A lot ofanalysis andmodule theory has been developed in this context.

Category theories

[edit]

Not all formal logic theories of sets need to axiomize the binary membership predicate "{\displaystyle \in }" directly. A theory like the Elementary Theory of the Categories Of Set (ETCS{\displaystyle {\mathsf {ETCS}}}), e.g. capturing pairs of composable mappings between objects, can also be expressed with a constructive background logic.Category theory can be set up as a theory of arrows and objects, althoughfirst-order axiomatizations only in terms of arrows are possible.

Beyond that, topoi also haveinternal languages that can be intuitionistic themselves and capture anotion of sets.

Good models of constructive set theories in category theory are the pretoposes mentioned in the Exponentiation section. For some good set theory, this may require enoughprojectives, an axiom about surjective "presentations" of set, implying Countable and Dependent Choice.

See also

[edit]

References

[edit]
  1. ^Feferman, Solomon (1998),In the Light of Logic, New York: Oxford University Press, pp. 280–283,293–294,ISBN 0-195-08030-0
  2. ^Troelstra, A. S., van Dalen D.,Constructivism in mathematics: an introduction 1; Studies in Logic and the Foundations of Mathematics; Springer, 1988;
  3. ^Bridges D., Ishihara H., Rathjen M., Schwichtenberg H. (Editors),Handbook of Constructive Mathematics; Studies in Logic and the Foundations of Mathematics; (2023) pp. 20-56
  4. ^Myhill, "Some properties of Intuitionistic Zermelo-Fraenkel set theory", Proceedings of the 1971 Cambridge Summer School in Mathematical Logic (Lecture Notes in Mathematics 337) (1973) pp. 206-231
  5. ^Crosilla, Laura;Set Theory: Constructive and Intuitionistic ZF; Stanford Encyclopedia of Philosophy; 2009
  6. ^Peter Aczel and Michael Rathjen,Notes on Constructive Set Theory, Reports Institut Mittag-Leffler, Mathematical Logic - 2000/2001, No. 40
  7. ^John L. Bell,Intuitionistic Set Theorys, 2018
  8. ^Jeon, Hanul (2022), "Constructive Ackermann's interpretation",Annals of Pure and Applied Logic,173 (5): 103086,arXiv:2010.04270,doi:10.1016/j.apal.2021.103086,S2CID 222271938
  9. ^Shapiro, S., McCarty, C. & Rathjen, M.,Intuitionistic sets and numbers: small set theory and Heyting arithmetic,https://doi.org/10.1007/s00153-024-00935-4, Arch. Math. Logic (2024)
  10. ^Gambino, N. (2005)."Presheaf models for constructive set theories"(PDF). In Laura Crosilla and Peter Schuster (ed.).From Sets and Types to Topology and Analysis(PDF). pp. 62–96.doi:10.1093/acprof:oso/9780198566519.003.0004.ISBN 9780198566519.
  11. ^Scott, D. S. (1985). Category-theoretic models for Intuitionistic Set Theory. Manuscript slides of a talk given at Carnegie-Mellon University
  12. ^Benno van den Berg,Predicative topos theory and models for constructive set theory, Netherlands University, PhD thesis, 2006
  13. ^Jech, Thomas (2003),Set Theory, Springer Monographs in Mathematics (Third Millennium ed.), Berlin, New York:Springer-Verlag, p. 642,ISBN 978-3-540-44085-7,Zbl 1007.03002
  14. ^Gert Smolka,Set Theory in Type Theory, Lecture Notes, Saarland University, Jan. 2015
  15. ^Gert Smolka and Kathrin Stark,Hereditarily Finite Sets in Constructive Type Theory, Proc. of ITP 2016, Nancy, France, Springer LNCS, May 2015
  16. ^Diener, Hannes (2020). "Constructive Reverse Mathematics".arXiv:1804.05495 [math.LO].
  17. ^Sørenson, Morten; Urzyczyn, Paweł (1998),Lectures on the Curry-Howard Isomorphism,CiteSeerX 10.1.1.17.7385, p. 239
  18. ^Smith, Peter (2007).An introduction to Gödel's Theorems(PDF). Cambridge, U.K.: Cambridge University Press.ISBN 978-0-521-67453-9.MR 2384958., p. 297
  19. ^Pradic, Cécilia; Brown, Chad E. (2019). "Cantor-Bernstein implies Excluded Middle".arXiv:1904.09193 [math.LO].
  20. ^Michael Rathjen,Choice principles in constructive and classical set theories, Cambridge University Press: 31 March 2017
  21. ^Gitman, Victoria (2011),What is the theory ZFC without power set,arXiv:1110.2430
  22. ^Shulman, Michael (2019), "Comparing material and structural set theories",Annals of Pure and Applied Logic,170 (4): 465-504,arXiv:1808.05204,doi:10.1016/j.apal.2018.11.002
  23. ^Errett Bishop,Foundations of Constructive Analysis, July 1967
  24. ^Robert S. Lubarsky,On the Cauchy Completeness of the Constructive Cauchy Reals, July 2015
  25. ^Matthew Ralph John Hendtlass,Constructing fixed points and economic equilibria, PhD Thesis, University of Leeds, April 2013
  26. ^Aczel, Peter: 1978. The type theoretic interpretation of constructive set theory. In: A. MacIntyre et al. (eds.), Logic Colloquium '77, Amsterdam: North-Holland, 55–66.
  27. ^Rathjen, M. (2004),"Predicativity, Circularity, and Anti-Foundation"(PDF), in Link, Godehard (ed.),One Hundred Years of Russell ́s Paradox: Mathematics, Logic, Philosophy, Walter de Gruyter,ISBN 978-3-11-019968-0
  28. ^Lindström, Ingrid: 1989.A construction of non-well-founded sets within Martin-Löf type theory. Journal of Symbolic Logic 54: 57–64.

Further reading

[edit]

External links

[edit]
Mathematical logic
Set theory
Type theory
Category theory
Intuitionistic
Fuzzy
Substructural
Paraconsistent
Description
Many-valued
Digital logic
Others
General
Theorems (list)
 and paradoxes
Logics
Traditional
Propositional
Predicate
Set theory
Types ofsets
Maps and cardinality
Set theories
Formal systems (list),
language and syntax
Example axiomatic
systems
 (list)
Proof theory
Model theory
Computability theory
Related
Overview
Venn diagram of set intersection
Axioms
Operations
  • Concepts
  • Methods
Set types
Theories
Set theorists
Retrieved from "https://en.wikipedia.org/w/index.php?title=Constructive_set_theory&oldid=1272892033"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp