Movatterモバイル変換

Function composition

From Wikipedia, the free encyclopedia

Operation on mathematical functions

This article is about the mathematical concept. For the computer science concept, seeFunction composition (computer science).

"Ring operator" redirects here; not to be confused withoperator ring oroperator assistance.

"∘" redirects here. For the character, seeDegree symbol § Lookalikes.

Function
x ↦f (x)
History of the function concept
Types bydomain andcodomain
`X` →𝔹 𝔹 →`X` 𝔹ⁿ →`X` `X` →ℤ ℤ →`X` `X` →ℝ ℝ →`X` ℝⁿ →`X` `X` →ℂ ℂ →`X` ℂⁿ →`X`
Classes/properties
Constant Identity Linear Polynomial Rational Algebraic Analytic Smooth Continuous Measurable Injective Surjective Bijective
Constructions
Restriction Composition λ Inverse
Generalizations
Relation (Binary relation) Set-valued Multivalued Partial Implicit Space Higher-order Morphism Functor
List of specific functions
v t e

Inmathematics, thecomposition operator $\circ$ takes twofunctions, $f {\displaystyle f}$ and $g {\displaystyle g}$ , and returns a new function $h(x):=(g\circ f)(x)=g(f(x))$ . Thus, the functiong isapplied after applyingf tox. $(g\circ f)$ is pronounced "the composition ofg andf".^[1]

Reverse composition applies the operation in the opposite order, applying $f {\displaystyle f}$ first and $g {\displaystyle g}$ second. Intuitively, reverse composition is a chaining process in which the output of functionf feeds the input of functiong.

The composition of functions is a special case of thecomposition of relations, sometimes also denoted by $\circ$ . As a result, all properties of composition of relations are true of composition of functions,^[2] such asassociativity.

Examples

[edit]

Concrete example for the composition of two functions.

Composition of functions on a finiteset: Iff = {(1, 1), (2, 3), (3, 1), (4, 2)}, andg = {(1, 2), (2, 3), (3, 1), (4, 2)}, theng ∘f = {(1, 2), (2, 1), (3, 2), (4, 3)}, as shown in the figure.
Composition of functions on aninfinite set: Iff:R →R (whereR is the set of allreal numbers) is given byf(x) = 2x + 4 andg:R →R is given byg(x) =x³, then:
(f ∘g)(x) =f(g(x)) =f(x³) = 2x³ + 4, and
(g ∘f)(x) =g(f(x)) =g(2x + 4) = (2x + 4)³.
If an airplane's altitude at time t isa(t), and the air pressure at altitudex isp(x), then(p ∘a)(t) is the pressure around the plane at time t.
Function defined on finite sets which change the order of their elements such aspermutations can be composed on the same set, this being composition of permutations.

Properties

[edit]

The composition of functions is alwaysassociative—a property inherited from thecomposition of relations.^[2] That is, iff,g, andh are composable, thenf ∘ (g ∘ h) = (f ∘ g) ∘h.^[3] Since the parentheses do not change the result, they are generally omitted.

In a strict sense, the compositiong ∘ f is only meaningful if the codomain off equals the domain ofg; in a wider sense, it is sufficient that the former be an impropersubset of the latter.^{[nb 1]}Moreover, it is often convenient to tacitly restrict the domain off, such thatf produces only values in the domain ofg. For example, the compositiong ∘ f of the functionsf :R →(−∞,+9] defined byf(x) = 9 −x² andg :[0,+∞) →R defined by $g(x)={\sqrt {x}}$ can be defined on theinterval[−3,+3].

Compositions of tworeal functions, theabsolute value and acubic function, in different orders, show a non-commutativity of composition.

The functionsg andf are said tocommute with each other ifg ∘ f =f ∘ g. Commutativity is a special property, attained only by particular functions, and often in special circumstances. For example,|x| + 3 = |x + 3| only whenx ≥ 0. The picture shows another example.

The composition ofone-to-one (injective) functions is always one-to-one. Similarly, the composition ofonto (surjective) functions is always onto. It follows that the composition of twobijections is also a bijection. Theinverse function of a composition (assumed invertible) has the property that(f ∘ g)⁻¹ =g⁻¹∘f⁻¹.^[4]

Derivatives of compositions involving differentiable functions can be found using thechain rule.Higher derivatives of such functions are given byFaà di Bruno's formula.^[3]

Composition of functions is sometimes described as a kind ofmultiplication on a function space, but has very different properties frompointwise multiplication of functions (e.g. composition is notcommutative).^[5]

Composition monoids

[edit]

Main article:Transformation monoid

Suppose one has two (or more) functionsf:X →X,g:X →X having the same domain and codomain; these are often calledtransformations. Then one can form chains of transformations composed together, such asf ∘f ∘g ∘f. Such chains have thealgebraic structure of amonoid, called atransformation monoid or (much more seldom) acomposition monoid. In general, transformation monoids can have remarkably complicated structure. One particular notable example is thede Rham curve. The set ofall functionsf:X →X is called thefull transformation semigroup^[6] orsymmetric semigroup^[7] on X. (One can actually define two semigroups depending how one defines the semigroup operation as the left or right composition of functions.^[8])

Composition of ashear mapping(red) and a clockwise rotation by 45°(green). On the left is the original object. Above is shear, then rotate. Below is rotate, then shear.

If the given transformations arebijective (and thus invertible), then the set of all possible combinations of these functions forms atransformation group (also known as apermutation group); and one says that the group isgenerated by these functions.

The set of all bijective functionsf:X →X (calledpermutations) forms a group with respect to function composition. This is thesymmetric group, also sometimes called thecomposition group. A fundamental result in group theory,Cayley's theorem, essentially says that any group is in fact just a subgroup of a symmetric group (up to isomorphism).^[9]

In the symmetric semigroup (of all transformations) one also finds a weaker, non-unique notion of inverse (called a pseudoinverse) because the symmetric semigroup is aregular semigroup.^[10]

Functional powers

[edit]

Main article:Iterated function

IfY⊆X, then $f:X\to Y$ may compose with itself; this is sometimes denoted as $f^{2}$ . That is:

(f\circ f)(x)=f(f(x))=f^{2}(x)

(f\circ f\circ f)(x)=f(f(f(x)))=f^{3}(x)

(f\circ f\circ f\circ f)(x)=f(f(f(f(x))))=f^{4}(x)

More generally, for anynatural numbern ≥ 2, thenthfunctionalpower can be defined inductively byf ⁿ =f ∘f ⁿ⁻¹ =f ⁿ⁻¹ ∘f, a notation introduced byHans Heinrich Bürmann^{[citation needed]}^[11]^[12] andJohn Frederick William Herschel.^[13]^[11]^[14]^[12] Repeated composition of such a function with itself is calledfunction iteration.

By convention,f ⁰ is defined as the identity map onf 's domain,id_X.
IfY =X andf:X →X admits aninverse functionf ⁻¹, negative functional powersf ⁻ⁿ are defined forn > 0 as thenegated power of the inverse function:f ⁻ⁿ = (f ⁻¹)ⁿ.^[13]^[11]^[12]

Note: Iff takes its values in aring (in particular for real or complex-valuedf ), there is a risk of confusion, asf ⁿ could also stand for then-fold product of f, e.g.f ²(x) =f(x) ·f(x).^[12] For trigonometric functions, usually the latter is meant, at least for positive exponents.^[12] For example, intrigonometry, this superscript notation represents standardexponentiation when used withtrigonometric functions:

sin²(x) = sin(x) · sin(x).

However, for negative exponents (especially −1), it nevertheless usually refers to the inverse function, e.g.,tan⁻¹ = arctan ≠ 1/tan.

In some cases, when, for a given functionf, the equationg ∘g =f has a unique solutiong, that function can be defined as thefunctional square root off, then written asg =f ^1/2.

More generally, whengⁿ =f has a unique solution for some natural numbern > 0, thenf ^m/n can be defined asg^m.

Under additional restrictions, this idea can be generalized so that theiteration count becomes a continuous parameter; in this case, such a system is called aflow, specified through solutions ofSchröder's equation. Iterated functions and flows occur naturally in the study offractals anddynamical systems.

To avoid ambiguity, some mathematicians^{[citation needed]} choose to use∘ to denote the compositional meaning, writingf^∘n(x) for then-th iterate of the functionf(x), as in, for example,f^∘3(x) meaningf(f(f(x))). For the same purpose,f^[n](x) was used byBenjamin Peirce^[15]^[12] whereasAlfred Pringsheim andJules Molk suggestedⁿf(x) instead.^[16]^[12]^{[nb 2]}

Alternative notations

[edit]

Many mathematicians, particularly ingroup theory, omit the composition symbol, writinggf forg ∘f.^[17]

During the mid-20th century, some mathematicians adoptedpostfix notation, writingxf forf(x) and(xf)g forg(f(x)).^[18] This can be more natural thanprefix notation in many cases, such as inlinear algebra whenx is arow vector andf andg denotematrices and the composition is bymatrix multiplication. The order is important because function composition is not necessarily commutative. Having successive transformations applying and composing to the right agrees with the left-to-right reading sequence.

Mathematicians who use postfix notation may write "fg", meaning first applyf and then applyg, in keeping with the order the symbols occur in postfix notation, thus making the notation "fg" ambiguous. Computer scientists may write "f ;g" for this,^[19] thereby disambiguating the order of composition. To distinguish the left composition operator from a text semicolon, in theZ notation the ⨾ character is used for leftrelation composition.^[20] Since all functions arebinary relations, it is correct to use the [fat] semicolon for function composition as well (see the article oncomposition of relations for further details on this notation).

Composition operator

[edit]

Main article:Composition operator

Given a function g, thecomposition operatorC_g is defined as thatoperator which maps functions to functions as $C_{g}f=f\circ g.$ Composition operators are studied in the field ofoperator theory.

In programming languages

[edit]

Main article:Function composition (computer science)

Function composition appears in one form or another in numerousprogramming languages.

Multivariate functions

[edit]

Partial composition is possible formultivariate functions. The function resulting when some argumentx_i of the functionf is replaced by the functiong is called a composition off andg in some computer engineering contexts, and is denotedf |_{x_i =g} $f|_{x_{i}=g}=f(x_{1},\ldots ,x_{i-1},g(x_{1},x_{2},\ldots ,x_{n}),x_{i+1},\ldots ,x_{n}).$

Wheng is a simple constantb, composition degenerates into a (partial) valuation, whose result is also known asrestriction orco-factor.^[21]

$f|_{x_{i}=b}=f(x_{1},\ldots ,x_{i-1},b,x_{i+1},\ldots ,x_{n}).$

In general, the composition of multivariate functions may involve several other functions as arguments, as in the definition ofprimitive recursive function. Givenf, an-ary function, andnm-ary functionsg₁, ...,g_n, the composition off withg₁, ...,g_n, is them-ary function $h(x_{1},\ldots ,x_{m})=f(g_{1}(x_{1},\ldots ,x_{m}),\ldots ,g_{n}(x_{1},\ldots ,x_{m})).$

This is sometimes called thegeneralized composite orsuperposition off withg₁, ...,g_n.^[22] The partial composition in only one argument mentioned previously can be instantiated from this more general scheme by setting all argument functions except one to be suitably chosenprojection functions. Hereg₁, ...,g_n can be seen as a single vector/tuple-valued function in this generalized scheme, in which case this is precisely the standard definition of function composition.^[23]

A set of finitaryoperations on some base setX is called aclone if it contains all projections and is closed under generalized composition. A clone generally contains operations of variousarities.^[22] The notion of commutation also finds an interesting generalization in the multivariate case; a functionf of arityn is said to commute with a functiong of aritym iff is ahomomorphism preservingg, and vice versa, that is:^[22] $f(g(a_{11},\ldots ,a_{1m}),\ldots ,g(a_{n1},\ldots ,a_{nm}))=g(f(a_{11},\ldots ,a_{n1}),\ldots ,f(a_{1m},\ldots ,a_{nm})).$

A unary operation always commutes with itself, but this is not necessarily the case for a binary (or higher arity) operation. A binary (or higher arity) operation that commutes with itself is calledmedial or entropic.^[22]

Generalizations

[edit]

Composition can be generalized to arbitrarybinary relations.IfR ⊆X×Y andS ⊆Y ×Z are two binary relations, then their composition amounts to

$R\circ S=\{(x,z)\in X\times Z:(\exists y\in Y)((x,y)\in R\,\land \,(y,z)\in S)\}$ .

Considering a function as a special case of a binary relation (namelyfunctional relations), function composition satisfies the definition for relation composition. A small circleR∘S has been used for theinfix notation of composition of relations, as well as functions. When used to represent composition of functions $(g\circ f)(x)\ =\ g(f(x))$ however, the text sequence is reversed to illustrate the different operation sequences accordingly.

The composition is defined in the same way forpartial functions and Cayley's theorem has its analogue called theWagner–Preston theorem.^[24]

Thecategory of sets with functions asmorphisms is the prototypicalcategory. The axioms of a category are in fact inspired from the properties (and also the definition) of function composition.^[25] The structures given by composition are axiomatized and generalized incategory theory with the concept ofmorphism as the category-theoretical replacement of functions. The reversed order of composition in the formula(f ∘ g)⁻¹ = (g⁻¹ ∘f ⁻¹) applies forcomposition of relations usingconverse relations, and thus ingroup theory. These structures formdagger categories.

The standard "foundation" for mathematics starts withsets and their elements. It is possible to start differently, by axiomatising not elements of sets but functions between sets. This can be done by using the language of categories and universal constructions.

. . . the membership relation for sets can often be replaced by the composition operation for functions. This leads to an alternative foundation for Mathematics upon categories -- specifically, on the category of all functions. Now much of Mathematics is dynamic, in that it deals with morphisms of an object into another object of the same kind. Such morphisms (like functions)form categories, and so the approach via categories fits well with the objective of organizing and understanding Mathematics. That, in truth, should be the goal of a proper philosophy of Mathematics.
-Saunders Mac Lane,Mathematics: Form and Function^[26]

Typography

[edit]

The composition symbol∘ is encoded asU+2218 ∘RING OPERATOR (&compfn;, &SmallCircle;); see theDegree symbol article for similar-appearing Unicode characters. InTeX, it is written\circ.

Notes

[edit]

^The strict sense is used,e.g., incategory theory, where a subset relation is modelled explicitly by aninclusion function.
^Alfred Pringsheim's andJules Molk's (1907) notationⁿf(x) to denote function compositions must not be confused withRudolf von Bitter Rucker's (1982)notationⁿx, introduced by Hans Maurer (1901) andReuben Louis Goodstein (1947) fortetration, or withDavid Patterson Ellerman's (1995)ⁿx pre-superscript notation forroots.

References

[edit]

^"Composition of Functions".nool.ontariotechu.ca. Retrieved2025-02-07.
^^a ^bVelleman, Daniel J. (2006).How to Prove It: A Structured Approach.Cambridge University Press. p. 232.ISBN 978-1-139-45097-3.
^^a ^bWeisstein, Eric W."Composition".mathworld.wolfram.com. Retrieved2020-08-28.
^Rodgers, Nancy (2000).Learning to Reason: An Introduction to Logic, Sets, and Relations.John Wiley & Sons. pp. 359–362.ISBN 978-0-471-37122-9.
^"3.4: Composition of Functions".Mathematics LibreTexts. 2020-01-16. Retrieved2020-08-28.
^Hollings, Christopher (2014).Mathematics across the Iron Curtain: A History of the Algebraic Theory of Semigroups.American Mathematical Society. p. 334.ISBN 978-1-4704-1493-1.
^Grillet, Pierre A. (1995).Semigroups: An Introduction to the Structure Theory.CRC Press. p. 2.ISBN 978-0-8247-9662-4.
^Dömösi, Pál; Nehaniv, Chrystopher L. (2005).Algebraic Theory of Automata Networks: An introduction. SIAM. p. 8.ISBN 978-0-89871-569-9.
^Carter, Nathan (2009-04-09).Visual Group Theory. MAA. p. 95.ISBN 978-0-88385-757-1.
^Ganyushkin, Olexandr; Mazorchuk, Volodymyr (2008).Classical Finite Transformation Semigroups: An Introduction.Springer Science & Business Media. p. 24.ISBN 978-1-84800-281-4.
^^a ^b ^cHerschel, John Frederick William (1820)."Part III. Section I. Examples of the Direct Method of Differences".A Collection of Examples of the Applications of the Calculus of Finite Differences. Cambridge, UK: Printed by J. Smith, sold by J. Deighton & sons. pp. 1–13 [5–6].Archived from the original on 2020-08-04. Retrieved2020-08-04.[1] (NB. Inhere, Herschel refers to his1813 work and mentionsHans Heinrich Bürmann's older work.)
^^a ^b ^c ^d ^e ^f ^gCajori, Florian (1952) [March 1929]. "§472. The power of a logarithm / §473. Iterated logarithms / §533. John Herschel's notation for inverse functions / §535. Persistence of rival notations for inverse functions / §537. Powers of trigonometric functions".A History of Mathematical Notations. Vol. 2 (3rd corrected printing of 1929 issue, 2nd ed.). Chicago, USA:Open court publishing company. pp. 108,176–179, 336, 346.ISBN 978-1-60206-714-1. Retrieved2016-01-18.[…] §473.Iterated logarithms […] We note here the symbolism used byPringsheim andMolk in their jointEncyclopédie article: "²log_b a = log_b (log_b a), …,^k+1log_b a = log_b (^klog_b a)."^[a] […] §533.John Herschel's notation for inverse functions, sin⁻¹ x, tan⁻¹ x, etc., was published by him in thePhilosophical Transactions of London, for the year 1813. He says (p. 10): "This notation cos.⁻¹ e must not be understood to signify 1/cos. e, but what is usually written thus, arc (cos.=e)." He admits that some authors use cos.^m A for (cos. A)^m, but he justifies his own notation by pointing out that sinced² x, Δ³ x, Σ² x meandd x, ΔΔΔ x, ΣΣ x, we ought to write sin.² x for sin. sin. x, log.³ x for log. log. log. x. Just as we writed⁻ⁿ V=∫ⁿ V, we may write similarly sin.⁻¹ x=arc (sin.=x), log.⁻¹ x.=c^x. Some years later Herschel explained that in 1813 he usedfⁿ(x),f⁻ⁿ(x), sin.⁻¹ x, etc., "as he then supposed for the first time. The work of a German Analyst,Burmann, has, however, within these few months come to his knowledge, in which the same is explained at a considerably earlier date. He[Burmann], however, does not seem to have noticed the convenience of applying this idea to the inverse functions tan⁻¹, etc., nor does he appear at all aware of the inverse calculus of functions to which it gives rise." Herschel adds, "The symmetry of this notation and above all the new and most extensive views it opens of the nature of analytical operations seem to authorize its universal adoption."^[b] […] §535.Persistence of rival notations for inverse function.— […] The use of Herschel's notation underwent a slight change inBenjamin Peirce's books, to remove the chief objection to them; Peirce wrote: "cos^[−1] x," "log^[−1] x."^[c] […] §537.Powers of trigonometric functions.—Three principal notations have been used to denote, say, the square of sin x, namely, (sin x)², sin x², sin² x. The prevailing notation at present is sin² x, though the first is least likely to be misinterpreted. In the case of sin² x two interpretations suggest themselves; first, sin x ⋅ sin x; second,^[d] sin (sin x). As functions of the last type do not ordinarily present themselves, the danger of misinterpretation is very much less than in case of log² x, where log x ⋅ log x and log (log x) are of frequent occurrence in analysis. […] The notation sinⁿ x for (sin x)ⁿ has been widely used and is now the prevailing one. […]{{cite book}}:ISBN / Date incompatibility (help) (xviii+367+1 pages including 1 addenda page) (NB. ISBN and link for reprint of 2nd edition by Cosimo, Inc., New York, USA, 2013.)
^^a ^bHerschel, John Frederick William (1813) [1812-11-12]. "On a Remarkable Application of Cotes's Theorem".Philosophical Transactions of the Royal Society of London.103 (Part 1). London:Royal Society of London, printed by W. Bulmer and Co., Cleveland-Row, St. James's, sold by G. and W. Nicol, Pall-Mall: 8–26 [10].doi:10.1098/rstl.1813.0005.JSTOR 107384.S2CID 118124706.
^Peano, Giuseppe (1903).Formulaire mathématique (in French). Vol. IV. p. 229.
^Peirce, Benjamin (1852).Curves, Functions and Forces. Vol. I (new ed.). Boston, USA. p. 203.{{cite book}}: CS1 maint: location missing publisher (link)
^Pringsheim, Alfred;Molk, Jules (1907).Encyclopédie des sciences mathématiques pures et appliquées (in French). Vol. I. p. 195. Part I.
^Ivanov, Oleg A. (2009-01-01).Making Mathematics Come to Life: A Guide for Teachers and Students.American Mathematical Society. pp. 217–.ISBN 978-0-8218-4808-1.
^Gallier, Jean (2011).Discrete Mathematics. Springer. p. 118.ISBN 978-1-4419-8047-2.
^Barr, Michael; Wells, Charles (1998).Category Theory for Computing Science(PDF). p. 6. Archived fromthe original(PDF) on 2016-03-04. Retrieved2014-08-23. (NB. This is the updated and free version of book originally published byPrentice Hall in 1990 asISBN 978-0-13-120486-7.)
^ISO/IEC 13568:2002(E), p. 23
^Bryant, R. E. (August 1986)."Logic Minimization Algorithms for VLSI Synthesis"(PDF).IEEE Transactions on Computers.C-35 (8):677–691.doi:10.1109/tc.1986.1676819.S2CID 10385726.
^^a ^b ^c ^dBergman, Clifford (2011).Universal Algebra: Fundamentals and Selected Topics.CRC Press. pp. 79–80,90–91.ISBN 978-1-4398-5129-6.
^Tourlakis, George (2012).Theory of Computation.John Wiley & Sons. p. 100.ISBN 978-1-118-31533-0.
^Lipscomb, S. (1997).Symmetric Inverse Semigroups. AMS Mathematical Surveys and Monographs. p. xv.ISBN 0-8218-0627-0.
^Hilton, Peter; Wu, Yel-Chiang (1989).A Course in Modern Algebra.John Wiley & Sons. p. 65.ISBN 978-0-471-50405-4.
^"Saunders Mac Lane - Quotations".Maths History. Retrieved2024-02-13.