Movatterモバイル変換

Indicator function

From Wikipedia, the free encyclopedia

Mathematical function characterizing set membership

This article is about the 0–1 indicator function. For the 0–infinity indicator function, seecharacteristic function (convex analysis).

This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(December 2009) (Learn how and when to remove this message)

A three-dimensional plot of an indicator function, shown over a square two-dimensional domain (setX): the "raised" portion overlays those two-dimensional points which are members of the "indicated" subset (A).

Inmathematics, anindicator function or acharacteristic function of asubset of aset is afunction that maps elements of the subset to one, and all other elements to zero. That is, ifA is a subset of some setX, then the indicator function ofA is the function $\mathbf {1} _{A}$ defined by $\mathbf {1} _{A}\!(x)=1$ if $x\in A,$ and $\mathbf {1} _{A}\!(x)=0$ otherwise. Other common notations are𝟙_A and $\chi _{A}.$ ^[a]

The indicator function ofA is theIverson bracket of the property of belonging toA; that is,

$\mathbf {1} _{A}(x)=\left[\ x\in A\ \right].$

For example, theDirichlet function is the indicator function of therational numbers as a subset of thereal numbers.

Definition

[edit]

Given an arbitrary setX, the indicator function of a subsetA ofX is the function $\mathbf {1} _{A}\colon X\rightarrow \{0,1\}$ defined by $\operatorname {\mathbf {1} } _{A}\!(x)={\begin{cases}1&{\text{if }}x\in A\\0&{\text{if }}x\notin A\,.\end{cases}}$

TheIverson bracket provides the equivalent notation $\left[\ x\in A\ \right]$ or⟦ x ∈A ⟧, that can be used instead of $\mathbf {1} _{A}\!(x).$

The function $\mathbf {1} _{A}$ is sometimes denoted𝟙_A,I_A,χ_A^[a] or even justA.^[b]

Notation and terminology

[edit]

The notation $\chi _{A}$ is also used to denote thecharacteristic function inconvex analysis, which is defined as if using thereciprocal of the standard definition of the indicator function.

A related concept instatistics is that of adummy variable. (This must not be confused with "dummy variables" as that term is usually used in mathematics, also called abound variable.)

The term "characteristic function" has an unrelated meaning inclassic probability theory. For this reason,traditional probabilists use the termindicator function for the function defined here almost exclusively, while mathematicians in other fields are more likely to use the termcharacteristic function to describe the function that indicates membership in a set.

Infuzzy logic andmodern many-valued logic, predicates are thecharacteristic functions of aprobability distribution. That is, the strict true/false valuation of the predicate is replaced by a quantity interpreted as the degree of truth.

Basic properties

[edit]

Theindicator orcharacteristicfunction of a subsetA of some setXmaps elements ofX to thecodomain $\{0,\,1\}.$

This mapping issurjective only whenA is a non-emptyproper subset ofX. If $A=X,$ then $\mathbf {1} _{A}\equiv 1.$ By a similar argument, if $A=\emptyset$ then $\mathbf {1} _{A}\equiv 0.$

If $A {\displaystyle A}$ and $B {\displaystyle B}$ are two subsets of $X, {\displaystyle X,}$ then ${\begin{aligned}\mathbf {1} _{A\cap B}(x)~&=~\min {\bigl \{}\mathbf {1} _{A}(x),\ \mathbf {1} _{B}(x){\bigr \}}~~=~\mathbf {1} _{A}(x)\cdot \mathbf {1} _{B}(x),\\\mathbf {1} _{A\cup B}(x)~&=~\max {\bigl \{}\mathbf {1} _{A}(x),\ \mathbf {1} _{B}(x){\bigr \}}~=~\mathbf {1} _{A}(x)+\mathbf {1} _{B}(x)-\mathbf {1} _{A}(x)\cdot \mathbf {1} _{B}(x)\,,\end{aligned}}$

and the indicator function of thecomplement of $A {\displaystyle A}$ i.e. $A^{\complement }$ is: $\mathbf {1} _{A^{\complement }}=1-\mathbf {1} _{A}.$

More generally, suppose $A_{1},\dotsc ,A_{n}$ is a collection of subsets ofX. For any $x\in X:$

$\prod _{k\in I}\left(\ 1-\mathbf {1} _{A_{k}}\!\left(x\right)\ \right)$

is a product of0s and1s. This product has the value1 at precisely those $x\in X$ that belong to none of the sets $A_{k}$ and is 0 otherwise. That is

$\prod _{k\in I}(1-\mathbf {1} _{A_{k}})=\mathbf {1} _{X-\bigcup _{k}A_{k}}=1-\mathbf {1} _{\bigcup _{k}A_{k}}.$

Expanding the product on the left hand side,

$\mathbf {1} _{\bigcup _{k}A_{k}}=1-\sum _{F\subseteq \{1,2,\dotsc ,n\}}(-1)^{|F|}\mathbf {1} _{\bigcap _{F}A_{k}}=\sum _{\emptyset \neq F\subseteq \{1,2,\dotsc ,n\}}(-1)^{|F|+1}\mathbf {1} _{\bigcap _{F}A_{k}}$

where $|F|$ is thecardinality ofF. This is one form of the principle ofinclusion-exclusion.

As suggested by the previous example, the indicator function is a useful notational device incombinatorics. The notation is used in other places as well, for instance inprobability theory: ifX is aprobability space with probability measure $\mathbb {P}$ andA is ameasurable set, then $\mathbf {1} _{A}$ becomes arandom variable whoseexpected value is equal to the probability ofA:

$\operatorname {\mathbb {E} } _{X}\left\{\ \mathbf {1} _{A}(x)\ \right\}\ =\ \int _{X}\mathbf {1} _{A}(x)\ \operatorname {d\ \mathbb {P} } (x)=\int _{A}\operatorname {d\ \mathbb {P} } (x)=\operatorname {\mathbb {P} } (A).$

This identity is used in a simple proof ofMarkov's inequality.

In many cases, such asorder theory, the inverse of the indicator function may be defined. This is commonly called thegeneralized Möbius function, as a generalization of the inverse of the indicator function in elementarynumber theory, theMöbius function. (See paragraph below about the use of the inverse in classical recursion theory.)

Mean, variance and covariance

[edit]

Given aprobability space $\textstyle (\Omega ,{\mathcal {F}},\operatorname {P} )$ with $A\in {\mathcal {F}},$ the indicator random variable $\mathbf {1} _{A}\colon \Omega \rightarrow \mathbb {R}$ is defined by $\mathbf {1} _{A}(\omega )=1$ if $\omega \in A,$ otherwise $\mathbf {1} _{A}(\omega )=0.$

Mean: $\ \operatorname {\mathbb {E} } (\mathbf {1} _{A}(\omega ))=\operatorname {\mathbb {P} } (A)\$ (also called "Fundamental Bridge").

Variance: $\ \operatorname {Var} (\mathbf {1} _{A}(\omega ))=\operatorname {\mathbb {P} } (A)(1-\operatorname {\mathbb {P} } (A)).$

Covariance: $\ \operatorname {Cov} (\mathbf {1} _{A}(\omega ),\mathbf {1} _{B}(\omega ))=\operatorname {\mathbb {P} } (A\cap B)-\operatorname {\mathbb {P} } (A)\operatorname {\mathbb {P} } (B).$

Characteristic function in recursion theory, Gödel's and Kleene's representing function

[edit]

Kurt Gödel described therepresenting function in his 1934 paper "On undecidable propositions of formal mathematical systems" (the symbol "¬" indicates logical inversion, i.e. "NOT"):^[1]^: 42

There shall correspond to each class or relationR a representing function $\phi (x_{1},\ldots x_{n})=0$ if $R(x_{1},\ldots x_{n})$ and $\phi (x_{1},\ldots x_{n})=1$ if $\neg R(x_{1},\ldots x_{n}).$

Kleene offers up the same definition in the context of theprimitive recursive functions as a functionφ of a predicateP takes on values0 if the predicate is true and1 if the predicate is false.^[2]

For example, because the product of characteristic functions $\phi _{1}*\phi _{2}*\cdots *\phi _{n}=0$ whenever any one of the functions equals0, it plays the role of logical OR: IF $\phi _{1}=0\$ OR $\ \phi _{2}=0$ OR ... OR $\phi _{n}=0$ THEN their product is0. What appears to the modern reader as the representing function's logical inversion, i.e. the representing function is0 when the functionR is "true" or satisfied", plays a useful role in Kleene's definition of the logical functions OR, AND, and IMPLY,^[2]^: 228 the bounded-^[2]^: 228 and unbounded-^[2]^{: 279 ff}mu operators and the CASE function.^[2]^: 229

Characteristic function in fuzzy set theory

[edit]

In classical mathematics, characteristic functions of sets only take values1 (members) or0 (non-members). Infuzzy set theory, characteristic functions are generalized to take value in the real unit interval[0, 1], or more generally, in somealgebra orstructure (usually required to be at least aposet orlattice). Such generalized characteristic functions are more usually calledmembership functions, and the corresponding "sets" are calledfuzzy sets. Fuzzy sets model the gradual change in the membershipdegree seen in many real-worldpredicates like "tall", "warm", etc.

Smoothness

[edit]

Notes

[edit]

^^a ^bTheGreek letterχ appears because it is the initial letter of the Greek wordχαρακτήρ, which is the ultimate origin of the wordcharacteristic.
^The set of all indicator functions onX can be identified with the set operator ${\mathcal {P}}(X),$ thepower set ofX. Consequently, both sets are denoted by the conventionalabuse of notation as $2^{X},$ in analogy to the relation for the count of elements in the powerset and the original set. This is a special case $\left(Y=\{0,\,1\}\right)$ of the notation $Y^{X}$ for the set of all functions $f {\displaystyle f}$ such that $f:X\mapsto Y\,.$

References

[edit]

^Davis, Martin, ed. (1965).The Undecidable. New York, NY: Raven Press Books. pp. 41–74.
^^a ^b ^c ^d ^eKleene, Stephen (1971) [1952].Introduction to Metamathematics (Sixth reprint, with corrections ed.). Netherlands: Wolters-Noordhoff Publishing and North Holland Publishing Company. p. 227.
^Serre.Course in Arithmetic. p. 5.
^Lange, Rutger-Jan (2012). "Potential theory, path integrals and the Laplacian of the indicator".Journal of High Energy Physics.2012 (11):29–30.arXiv:1302.0864.Bibcode:2012JHEP...11..032L.doi:10.1007/JHEP11(2012)032.S2CID 56188533.

Sources

[edit]

Folland, G.B. (1999).Real Analysis: Modern Techniques and Their Applications (Second ed.). John Wiley & Sons, Inc.ISBN 978-0-471-31716-6.
Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford (2001). "Section 5.2: Indicator random variables".Introduction to Algorithms (Second ed.). MIT Press and McGraw-Hill. pp. 94–99.ISBN 978-0-262-03293-3.
Davis, Martin, ed. (1965).The Undecidable. New York, NY: Raven Press Books.
Kleene, Stephen (1971) [1952].Introduction to Metamathematics (Sixth reprint, with corrections ed.). Netherlands: Wolters-Noordhoff Publishing and North Holland Publishing Company.
Boolos, George;Burgess, John P.;Jeffrey, Richard C. (2002).Computability and Logic. Cambridge UK: Cambridge University Press.ISBN 978-0-521-00758-0.
Zadeh, L.A. (June 1965)."Fuzzy sets".Information and Control.8 (3). San Diego:338–353.doi:10.1016/S0019-9958(65)90241-X.ISSN 0019-9958.Zbl 0139.24606.Wikidata Q25938993.
Goguen, Joseph (1967). "L-fuzzy sets".Journal of Mathematical Analysis and Applications.18 (1):145–174.doi:10.1016/0022-247X(67)90189-8.hdl:10338.dmlcz/103980.