System F (alsopolymorphic lambda calculus orsecond-order lambda calculus) is atyped lambda calculus that introduces, tosimply typed lambda calculus, a mechanism ofuniversal quantification over types. System F formalizesparametric polymorphism inprogramming languages, thus forming a theoretical basis for languages such asHaskell andML. It was discovered independently bylogicianJean-Yves Girard (1972) andcomputer scientistJohn C. Reynolds.
Whereassimply typed lambda calculus has variables ranging over terms, and binders for them, System F additionally has variables ranging overtypes, and binders for them. As an example, the fact that the identity function can have any type of the formA →A would be formalized in System F as the statement
where is atype variable. The upper-case is traditionally used to denote type-level functions, as opposed to the lower-case which is used for value-level functions. (The superscripted means that the bound variablex is of type; the expression after the colon is the type of the lambda expression preceding it.)
As aterm rewriting system, System F isstrongly normalizing. However,type inference in System F (without explicit type annotations) isundecidable. Under theCurry–Howard isomorphism, System F corresponds tosecond-order propositional intuitionistic logic. System F can be seen as part of thelambda cube, together with even more expressive typed lambda calculi, including those withdependent types.
According to Girard, the "F" inSystem F was picked by chance.[1]
The typing rules of System F are those of simply typed lambda calculus with the addition of the following:
| (1) | (2) |
where are types, is a type variable, and in the context indicates that is bound. The first rule is that of application, and the second is that of abstraction.[2][3]
The type is defined as:, where is atype variable. This means: is the type of all functions which take as input a type α and two expressions of type α, and produce as output an expression of type α (note that we consider to beright-associative.)
The following two definitions for the Boolean values and are used, extending the definition ofChurch Booleans:
(Note that the above two functions requirethree — not two — arguments. The latter two should be lambda expressions, but the first one should be a type. This fact is reflected in the fact that the type of these expressions is; the universal quantifier binding the α corresponds to the Λ binding the alpha in the lambda expression itself. Also, note that is a convenient shorthand for, but it is not a symbol of System F itself, but rather a "meta-symbol". Likewise, and are also "meta-symbols", convenient shorthands, of System F "assemblies" (in theBourbaki sense); otherwise, if such functions could be named (within System F), then there would be no need for the lambda-expressive apparatus capable of defining functions anonymously and for thefixed-point combinator, which works around that restriction.)
Then, with these two-terms, we can define some logic operators (which are of type):
Note that in the definitions above, is a type argument to, specifying that the other two parameters that are given to are of type. As in Church encodings, there is no need for anIFTHENELSE function as one can just use raw-typed terms as decision functions. However, if one is requested:
will do.Apredicate is a function which returns a-typed value. The most fundamental predicate isISZERO which returns if and only if its argument is theChurch numeral0:
Furthermore, theexistential quantifier (and therefore existential types) can be implemented in system F by the following:[4][5]
System F allows recursive constructions to be embedded in a natural manner, related to that inMartin-Löf's type theory. Abstract structures (S) are created usingconstructors. These are functions typed as:
Recursivity is manifested whenS itself appears within one of the types. If you havem of these constructors, you can define the type ofS as:
For instance, the natural numbers can be defined as an inductive datatypeN with constructors
The System F type corresponding to this structure is. The terms of this type comprise a typed version of theChurch numerals, the first few of which are:
If we reverse the order of the curried arguments (i.e.,), then the Church numeral forn is a function that takes a functionf as argument and returns thenth power off. That is to say, a Church numeral is ahigher-order function – it takes a single-argument functionf, and returns another single-argument function.
The version of System F used in this article is as an explicitly typed, or Church-style, calculus. The typing information contained in λ-terms makestype-checking straightforward.Joe Wells (1994) settled an "embarrassing open problem" by proving that type checking isundecidable for a Curry-style variant of System F, that is, one that lacks explicit typing annotations.[6][7]
Wells's result implies thattype inference for System F is impossible.A restriction of System F known as "Hindley–Milner", or simply "HM", does have an easy type inference algorithm and is used for manystatically typedfunctional programming languages such asHaskell 98 and theML family. Over time, as the restrictions of HM-style type systems have become apparent, languages have steadily moved to more expressive logics for their type systems.GHC, a Haskell compiler, goes beyond HM (as of 2008) and uses System F extended with non-syntactic type equality;[8] non-HM features inOCaml's type system includeGADT.[9][10]
In second-orderintuitionistic logic, the second-order polymorphic lambda calculus (F2) was discovered by Girard (1972) and independently by Reynolds (1974).[11] Girard proved therepresentation theorem: that in second-order intuitionistic predicate logic (P2), functions from the natural numbers to the natural numbers that can be proved total, form a projection from P2 into F2.[11] Reynolds proved theabstraction theorem: that every term in F2 satisfies a logical relation, which can be embedded into the logical relations P2.[11] Reynolds proved that a Girard projection followed by a Reynolds embedding form the identity, i.e., theGirard–Reynolds isomorphism.[11]
While System F corresponds to the first axis ofBarendregt'slambda cube,System Fω or thehigher-order polymorphic lambda calculus combines the first axis (polymorphism) with the second axis (type operators); it is a different, more complex system.
System Fω can be defined inductively on a family of systems, where induction is based on thekinds permitted in each system:
In the limit, we can define system to be
That is, Fω is the system which allows functions from types to types where the argument (and result) may be of any order.
Note that although Fω places no restrictions on theorder of the arguments in these mappings, it does restrict theuniverse of the arguments for these mappings: they must be types rather than values. System Fω does not permit mappings from values to types (dependent types), though it does permit mappings from values to values ( abstraction), mappings from types to values ( abstraction), and mappings from types to types ( abstraction at the level of types).
System F<:, pronounced "F-sub", is an extension of system F withsubtyping. System F<: has been of central importance toprogramming language theory since the 1980s because the core offunctional programming languages, like those in theML family, support bothparametric polymorphism andrecord subtyping, which can be expressed inSystem F<:.[12][13]
However, in [3] it was shown that the obvious rules of conversion for this system, called F by chance, were converging.