Movatterモバイル変換

Word problem (mathematics)

From Wikipedia, the free encyclopedia

Decision problem pertaining to equivalence of expressions

This article is about algorithmic word problems in mathematics and computer science. For the term 'word problem' as often used in education, seeWord problem (mathematics education). For other uses, seeWord problem.

Incomputational mathematics, aword problem is theproblem of deciding whether two given expressions are equivalent with respect to a set ofrewriting identities. A prototypical example is theword problem for groups, but there are many other instances as well. Somedeep results of computational theory concern theundecidability of this question in many important cases.^[1]

Background and motivation

[edit]

Incomputer algebra one often wishes to encode mathematical expressions using an expression tree. But there are often multiple equivalent expression trees. The question naturally arises of whether there is an algorithm which, given as input two expressions, decides whether they represent the same element. Such an algorithm is called asolution to the word problem. For example, imagine that $x, y, z {\displaystyle x,y,z}$ are symbols representingreal numbers - then a relevant solution to the word problem would, given the input $(x\cdot y)/z\mathrel {\overset {?}{=}} (x/z)\cdot y$ , produce the outputEQUAL, and similarly produceNOT_EQUAL from $(x\cdot y)/z\mathrel {\overset {?}{=}} (x/x)\cdot y$ .

The most direct solution to a word problem takes the form of a normal form theorem and algorithm that maps every element in anequivalence class of expressions to a single encoding known as thenormal form - the word problem is then solved by comparing these normal forms viasyntactic equality.^[1] For example one might decide that $x\cdot y\cdot z^{-1}$ is the normal form of $(x\cdot y)/z$ , $(x/z)\cdot y$ , and $(y/z)\cdot x$ , and devise a transformation system to rewrite those expressions to that form, in the process proving that all equivalent expressions will be rewritten to the same normal form.^[2] But not all solutions to the word problem use a normal form theorem - there are algebraic properties that indirectly imply the existence of an algorithm.^[1]

While the word problem asks whether two terms containingconstants are equal, a proper extension of the word problem known as theunification problem asks whether two terms $t_{1},t_{2}$ containingvariables haveinstances that are equal, or in other words whether the equation $t_{1}=t_{2}$ has any solutions. As a common example, $2+3\mathrel {\overset {?}{=}} 8+(-3)$ is a word problem in theinteger group $\mathbb {Z}$ ,while $2+x\mathrel {\overset {?}{=}} 8+(-x)$ is a unification problem in the same group; since the former terms happen to be equal in $\mathbb {Z}$ , the latter problem has thesubstitution $\{x\mapsto 3\}$ as a solution.

History

[edit]

One of the most deeply studied cases of the word problem is in the theory ofsemigroups andgroups. A timeline of papers relevant to theNovikov–Boone theorem is as follows:^[3]^[4]

1910 (1910):Axel Thue poses a general problem of term rewriting on tree-like structures. He states "A solution of this problem in the most general case may perhaps be connected with unsurmountable difficulties".^[5]^[6]
1911 (1911):Max Dehn poses the word problem forfinitely presented groups.^[7]
1912 (1912):Dehn presentsDehn's algorithm, and proves it solves the word problem for thefundamental groups of closed orientable two-dimensionalmanifolds of genus greater than or equal to 2.^[8] Subsequent authors have greatly extended it to a wide range ofgroup-theoretic decision problems.^[9]^[10]^[11]
1914 (1914):Axel Thue poses the word problem for finitely presented semigroups.^[12]
1930 (1930) – 1938 (1938):TheChurch–Turing thesis emerges, defining formal notions ofcomputability and undecidability.^[13]
1947 (1947):Emil Post andAndrey Markov Jr. independently construct finitely presented semigroups with unsolvable word problem.^[14]^[15] Post's construction is built onTuring machines while Markov's uses Post's normal systems.^[3]
1950 (1950):Alan Turing shows the word problem forcancellation semigroups is unsolvable,^[16] by furthering Post's construction. The proof is difficult to follow but marks a turning point in the word problem for groups.^[3]^: 342
1955 (1955):Pyotr Novikov gives the first published proof that the word problem for groups is unsolvable, using Turing's cancellation semigroup result.^[17]^[3]^: 354 The proof contains a "Principal Lemma" equivalent toBritton's Lemma.^[3]^: 355
1954 (1954) – 1957 (1957):William Boone independently shows the word problem for groups is unsolvable, using Post's semigroup construction.^[18]^[19]
1957 (1957) – 1958 (1958):John Britton gives another proof that the word problem for groups is unsolvable, based on Turing's cancellation semigroups result and some of Britton's earlier work.^[20] An early version of Britton's Lemma appears.^[3]^: 355
1958 (1958) – 1959 (1959):Boone publishes a simplified version of his construction.^[21]^[22]
1961 (1961):Graham Higman characterises thesubgroups of finitely presented groups withHigman's embedding theorem,^[23] connecting recursion theory with group theory in an unexpected way and giving a very different proof of the unsolvability of the word problem.^[3]
1961 (1961) – 1963 (1963):Britton presents a greatly simplified version of Boone's 1959 proof that the word problem for groups is unsolvable.^[24] It uses a group-theoretic approach, in particularBritton's Lemma. This proof has been used in a graduate course, although more modern and condensed proofs exist.^[25]
1977 (1977):Gennady Makanin proves that the existential theory of equations overfree monoids is solvable.^[26]

The word problem for semi-Thue systems

[edit]

The accessibility problem forstring rewriting systems (semi-Thue systems or semigroups) can be stated as follows: Given a semi-Thue system $T:=(\Sigma ,R)$ and two words (strings) $u,v\in \Sigma ^{*}$ , can $u {\displaystyle u}$ be transformed into $v {\displaystyle v}$ by applying rules from $R {\displaystyle R}$ ? Note that the rewriting here is one-way. The word problem is the accessibility problem for symmetric rewrite relations, i.e. Thue systems.^[27]

The accessibility and word problems areundecidable, i.e. there is no general algorithm for solving this problem.^[28] This even holds if we limit the systems to have finite presentations, i.e. a finite set of symbols and a finite set of relations on those symbols.^[27] Even the word problem restricted toground terms is not decidable for certain finitely presented semigroups.^[29]^[30]

The word problem for groups

[edit]

Main article:Word problem for groups

Given apresentation $\langle S\mid {\mathcal {R}}\rangle$ for a groupG, the word problem is the algorithmic problem of deciding, given as input two words inS, whether they represent the same element ofG. The word problem is one of three algorithmic problems for groups proposed byMax Dehn in 1911. It was shown byPyotr Novikov in 1955 that there exists a finitely presented groupG such that the word problem forG isundecidable.^[31]

The word problem in combinatorial calculus and lambda calculus

[edit]

Main article:Combinatory logic § Undecidability of combinatorial calculus

One of the earliest proofs that a word problem is undecidable was forcombinatory logic: when are two strings of combinators equivalent? Because combinators encode all possibleTuring machines, and the equivalence of two Turing machines is undecidable, it follows that the equivalence of two strings of combinators is undecidable.Alonzo Church observed this in 1936.^[32]

Likewise, one has essentially the same problem in (untyped)lambda calculus: given two distinct lambda expressions, there is no algorithm that can discern whether they are equivalent or not;equivalence is undecidable. For several typed variants of the lambda calculus, equivalence is decidable by comparison of normal forms.

The word problem for abstract rewriting systems

[edit]

Solving the word problem: deciding if $x{\stackrel {*}{\leftrightarrow }}y$ usually requires heuristic search (red,green), while deciding $x\downarrow =y\downarrow$ is straightforward (grey).

{\displaystyle x{\stackrel {*}{\leftrightarrow }}y} — Solving the word problem: deciding if $x{\stackrel {*}{\leftrightarrow }}y$ usually requires heuristic search (red,green), while deciding $x\downarrow =y\downarrow$ is straightforward (grey).

The word problem for anabstract rewriting system (ARS) is quite succinct: given objectsx andy are they equivalent under ${\stackrel {*}{\leftrightarrow }}$ ?^[29] The word problem for an ARS isundecidable in general. However, there is acomputable solution for the word problem in the specific case where every object reduces to a unique normal form in a finite number of steps (i.e. the system isconvergent): two objects are equivalent under ${\stackrel {*}{\leftrightarrow }}$ if and only if they reduce to the same normal form.^[33]TheKnuth-Bendix completion algorithm can be used to transform a set of equations into a convergentterm rewriting system.

The word problem in universal algebra

[edit]

Inuniversal algebra one studiesalgebraic structures consisting of agenerating setA, a collection ofoperations onA of finite arity, and a finite set of identities that these operations must satisfy. The word problem for an algebra is then to determine, given two expressions (words) involving the generators and operations, whether they represent the same element of the algebra modulo the identities. The word problems for groups and semigroups can be phrased as word problems for algebras.^[1]

The word problem on freeHeyting algebras is difficult.^[34] The only known results are that the free Heyting algebra on one generator is infinite, and that the freecomplete Heyting algebra on one generator exists (and has one more element than the free Heyting algebra).

The word problem for free lattices

[edit]

Example computation ofx∧z ~x∧z∧(x∨y)

		x∧z∧(x∨y)	≤_~	x∧z
by 5.	since	x∧z	≤_~	x∧z
by 1.	since	x∧z	=	x∧z

		x∧z	≤_~	x∧z∧(x∨y)
by 7.	since	x∧z	≤_~	x∧z	and			x∧z	≤_~	x∨y
by 1.	since	x∧z	=	x∧z		by 6.	since	x∧z	≤_~	x
						by 5.	since	x	≤_~	x
						by 1.	since	x	=	x

The word problem onfree lattices and more generally freebounded lattices has a decidable solution. Bounded lattices are algebraic structures with the twobinary operations ∨ and ∧ and the two constants (nullary operations) 0 and 1. The set of all well-formedexpressions that can be formulated using these operations on elements from a given set of generatorsX will be calledW(X). This set of words contains many expressions that turn out to denote equal values in every lattice. For example, ifa is some element ofX, thena ∨ 1 = 1 anda ∧ 1 =a. The word problem for free bounded lattices is the problem of determining which of these elements ofW(X) denote the same element in the free bounded latticeFX, and hence in every bounded lattice.

The word problem may be resolved as follows. A relation ≤_~ onW(X) may be definedinductively by settingw ≤_~vif and only if one of the following holds:

w =v (this can be restricted to the case wherew andv are elements ofX),
w = 0,
v = 1,
w =w₁ ∨w₂ and bothw₁ ≤_~v andw₂ ≤_~v hold,
w =w₁ ∧w₂ and eitherw₁ ≤_~v orw₂ ≤_~v holds,
v =v₁ ∨v₂ and eitherw ≤_~v₁ orw ≤_~v₂ holds,
v =v₁ ∧v₂ and bothw ≤_~v₁ andw ≤_~v₂ hold.

This defines apreorder ≤_~ onW(X), so anequivalence relation can be defined byw ~v whenw ≤_~v andv ≤_~w. One may then show that thepartially ordered quotient setW(X)/~ is the free bounded latticeFX.^[35]^[36] Theequivalence classes ofW(X)/~ are the sets of all wordsw andv withw ≤_~v andv ≤_~w. Two well-formed wordsv andw inW(X) denote the same value in every bounded lattice if and only ifw ≤_~v andv ≤_~w; the latter conditions can be effectively decided using the above inductive definition. The table shows an example computation to show that the wordsx∧z andx∧z∧(x∨y) denote the same value in every bounded lattice. The case of lattices that are not bounded is treated similarly, omitting rules 2 and 3 in the above construction of ≤_~.

Example: A term rewriting system to decide the word problem in the free group

[edit]

Bläsius and Bürckert^[37]demonstrate theKnuth–Bendix algorithm on an axiom set for groups. The algorithm yields aconfluent andnoetherian term rewrite system that transforms every term into a uniquenormal form.^[38] The rewrite rules are numbered incontiguous since some rules became redundant and were deleted during the algorithm run.The equality of two terms follows from the axioms if and only if both terms are transformed into literally the same normal form term. For example, the terms

((a^{-1}\cdot a)\cdot (b\cdot b^{-1}))^{-1}\mathrel {\overset {R2}{\rightsquigarrow }} (1\cdot (b\cdot b^{-1}))^{-1}\mathrel {\overset {R13}{\rightsquigarrow }} (1\cdot 1)^{-1}\mathrel {\overset {R1}{\rightsquigarrow }} 1^{-1}\mathrel {\overset {R8}{\rightsquigarrow }} 1

, and

b\cdot ((a\cdot b)^{-1}\cdot a)\mathrel {\overset {R17}{\rightsquigarrow }} b\cdot ((b^{-1}\cdot a^{-1})\cdot a)\mathrel {\overset {R3}{\rightsquigarrow }} b\cdot (b^{-1}\cdot (a^{-1}\cdot a))\mathrel {\overset {R2}{\rightsquigarrow }} b\cdot (b^{-1}\cdot 1)\mathrel {\overset {R11}{\rightsquigarrow }} b\cdot b^{-1}\mathrel {\overset {R13}{\rightsquigarrow }} 1

share the same normal form, viz. $1 {\displaystyle 1}$ ; therefore both terms are equal in every group.As another example, the term $1\cdot (a\cdot b)$ and $b\cdot (1\cdot a)$ has the normal form $a\cdot b$ and $b\cdot a$ , respectively. Since the normal forms are literally different, the original terms cannot be equal in every group. In fact, they are usually different innon-abelian groups.

Group axioms used in Knuth–Bendix completion
A1	$1\cdot x$	$=x$
A2	$x^{-1}\cdot x$	$=1$
A3	$(x\cdot y)\cdot z$	$=x\cdot (y\cdot z)$

Term rewrite system obtained from Knuth–Bendix completion
R1	$1\cdot x$	$\rightsquigarrow x$
R2	$x^{-1}\cdot x$	$\rightsquigarrow 1$
R3	$(x\cdot y)\cdot z$	$\rightsquigarrow x\cdot (y\cdot z)$
R4	$x^{-1}\cdot (x\cdot y)$	$\rightsquigarrow y$
R8	$1^{-1}$	$\rightsquigarrow 1$
R11	$x\cdot 1$	$\rightsquigarrow x$
R12	$(x^{-1})^{-1}$	$\rightsquigarrow x$
R13	$x\cdot x^{-1}$	$\rightsquigarrow 1$
R14	$x\cdot (x^{-1}\cdot y)$	$\rightsquigarrow y$
R17	$(x\cdot y)^{-1}$	$\rightsquigarrow y^{-1}\cdot x^{-1}$

References

[edit]

^^a ^b ^c ^dEvans, Trevor (1978)."Word problems".Bulletin of the American Mathematical Society.84 (5): 790.doi:10.1090/S0002-9904-1978-14516-9.
^Cohen, Joel S. (2002).Computer algebra and symbolic computation: elementary algorithms. Natick, Mass.: A K Peters. pp. 90–92.ISBN 1568811586.
^^a ^b ^c ^d ^e ^f ^gMiller, Charles F. (2014). Downey, Rod (ed.)."Turing machines to word problems"(PDF).Turing's Legacy: 330.doi:10.1017/CBO9781107338579.010.hdl:11343/51723.ISBN 9781107338579. Retrieved6 December 2021.
^Stillwell, John (1982)."The word problem and the isomorphism problem for groups".Bulletin of the American Mathematical Society.6 (1):33–56.doi:10.1090/S0273-0979-1982-14963-1.
^Müller-Stach, Stefan (12 September 2021). "Max Dehn, Axel Thue, and the Undecidable". p. 13.arXiv:1703.09750 [math.HO].
^Steinby, Magnus; Thomas, Wolfgang (2000). "Trees and term rewriting in 1910: on a paper by Axel Thue".Bulletin of the European Association for Theoretical Computer Science.72:256–269.CiteSeerX 10.1.1.32.8993.MR 1798015.
^Dehn, Max (1911)."Über unendliche diskontinuierliche Gruppen".Mathematische Annalen.71 (1):116–144.doi:10.1007/BF01456932.ISSN 0025-5831.MR 1511645.S2CID 123478582.
^Dehn, Max (1912)."Transformation der Kurven auf zweiseitigen Flächen".Mathematische Annalen.72 (3):413–421.doi:10.1007/BF01456725.ISSN 0025-5831.MR 1511705.S2CID 122988176.
^Greendlinger, Martin (June 1959). "Dehn's algorithm for the word problem".Communications on Pure and Applied Mathematics.13 (1):67–83.doi:10.1002/cpa.3160130108.
^Lyndon, Roger C. (September 1966)."On Dehn's algorithm".Mathematische Annalen.166 (3):208–228.doi:10.1007/BF01361168.hdl:2027.42/46211.S2CID 36469569.
^Schupp, Paul E. (June 1968)."On Dehn's algorithm and the conjugacy problem".Mathematische Annalen.178 (2):119–130.doi:10.1007/BF01350654.S2CID 120429853.
^Power, James F. (27 August 2013). "Thue's 1914 paper: a translation".arXiv:1308.5858 [cs.FL].
^SeeHistory of the Church–Turing thesis. The dates are based onOn Formally Undecidable Propositions of Principia Mathematica and Related Systems andSystems of Logic Based on Ordinals.
^Post, Emil L. (March 1947)."Recursive Unsolvability of a problem of Thue"(PDF).Journal of Symbolic Logic.12 (1):1–11.doi:10.2307/2267170.JSTOR 2267170.S2CID 30320278. Retrieved6 December 2021.
^Mostowski, Andrzej (September 1951). "A. Markov. Névožmoinost' nékotoryh algoritmov v téorii associativnyh sistém (Impossibility of certain algorithms in the theory of associative systems). Doklady Akadémii Nauk SSSR, vol. 77 (1951), pp. 19–20".Journal of Symbolic Logic.16 (3): 215.doi:10.2307/2266407.JSTOR 2266407.
^Turing, A. M. (September 1950). "The Word Problem in Semi-Groups With Cancellation".The Annals of Mathematics.52 (2):491–505.doi:10.2307/1969481.JSTOR 1969481.
^Novikov, P. S. (1955). "On the algorithmic unsolvability of the word problem in group theory".Proceedings of the Steklov Institute of Mathematics (in Russian).44:1–143.Zbl 0068.01301.
^Boone, William W. (1954). "Certain Simple, Unsolvable Problems of Group Theory. I".Indagationes Mathematicae (Proceedings).57:231–237.doi:10.1016/S1385-7258(54)50033-8.
^Boone, William W. (1957)."Certain Simple, Unsolvable Problems of Group Theory. VI".Indagationes Mathematicae (Proceedings).60:227–232.doi:10.1016/S1385-7258(57)50030-9.
^Britton, J. L. (October 1958). "The Word Problem for Groups".Proceedings of the London Mathematical Society.s3-8 (4):493–506.doi:10.1112/plms/s3-8.4.493.
^Boone, William W. (1958)."The word problem"(PDF).Proceedings of the National Academy of Sciences.44 (10):1061–1065.Bibcode:1958PNAS...44.1061B.doi:10.1073/pnas.44.10.1061.PMC 528693.PMID 16590307.Zbl 0086.24701.
^Boone, William W. (September 1959). "The Word Problem".The Annals of Mathematics.70 (2):207–265.doi:10.2307/1970103.JSTOR 1970103.
^Higman, G. (8 August 1961). "Subgroups of finitely presented groups".Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.262 (1311):455–475.Bibcode:1961RSPSA.262..455H.doi:10.1098/rspa.1961.0132.S2CID 120100270.
^Britton, John L. (January 1963). "The Word Problem".The Annals of Mathematics.77 (1):16–32.doi:10.2307/1970200.JSTOR 1970200.
^Simpson, Stephen G. (18 May 2005)."A Slick Proof of the Unsolvability of the Word Problem for Finitely Presented Groups"(PDF). Retrieved6 December 2021.
^"Subgroups of finitely presented groups".Mathematics of the USSR-Sbornik.103 (145):147–236. 13 February 1977.doi:10.1070/SM1977v032n02ABEH002376.
^^a ^bMatiyasevich, Yuri; Sénizergues, Géraud (January 2005)."Decision problems for semi-Thue systems with a few rules".Theoretical Computer Science.330 (1):145–169.doi:10.1016/j.tcs.2004.09.016.
^Davis, Martin (1978)."What is a Computation?"(PDF).Mathematics Today Twelve Informal Essays. pp. 257–259.doi:10.1007/978-1-4613-9435-8_10.ISBN 978-1-4613-9437-2. Retrieved5 December 2021.
^^a ^bBaader, Franz; Nipkow, Tobias (5 August 1999).Term Rewriting and All That. Cambridge University Press. pp. 59–60.ISBN 978-0-521-77920-3.
^
- Matiyasevich, Yu. V. (1967)."Простые примеры неразрешимых ассоциативных исчислений" [Simple examples of undecidable associative calculi].Doklady Akademii Nauk SSSR (in Russian).173 (6):1264–1266.ISSN 0869-5652.
- Matiyasevich, Yu. V. (1967). "Simple examples of undecidable associative calculi".Soviet Mathematics.8 (2):555–557.ISSN 0197-6788.
^Novikov, P. S. (1955). "On the algorithmic unsolvability of the word problem in group theory".Trudy Mat. Inst. Steklov (in Russian).44:1–143.
^Statman, Rick (2000). "On the Word Problem for Combinators".Rewriting Techniques and Applications. Lecture Notes in Computer Science. Vol. 1833. pp. 203–213.doi:10.1007/10721975_14.ISBN 978-3-540-67778-9.
^Beke, Tibor (May 2011)."Categorification, term rewriting and the Knuth–Bendix procedure".Journal of Pure and Applied Algebra.215 (5): 730.doi:10.1016/j.jpaa.2010.06.019.
^Peter T. Johnstone,Stone Spaces, (1982) Cambridge University Press, Cambridge,ISBN 0-521-23893-5.(See chapter 1, paragraph 4.11)
^Whitman, Philip M. (January 1941). "Free Lattices".The Annals of Mathematics.42 (1):325–329.doi:10.2307/1969001.JSTOR 1969001.
^Whitman, Philip M. (1942). "Free Lattices II".Annals of Mathematics.43 (1):104–115.doi:10.2307/1968883.JSTOR 1968883.
^K. H. Bläsius and H.-J. Bürckert, ed. (1992).Deduktionsssysteme. Oldenbourg. p. 291.; here: p. 126, 134
^Apply rules in any order to a term, as long as possible; the result doesn't depend on the order; it is the term's normal form.