Movatterモバイル変換

Tag system

From Wikipedia, the free encyclopedia

Deterministic model of computation

In thetheory of computation, atag system is a deterministicmodel of computation published byEmil Leon Post in 1943 as a simple form of aPost canonical system.^[1] A tag system may also be viewed as an abstract machine, called aPost tag machine (not to be confused withPost–Turing machines)—briefly, afinite-state machine whose only tape is aFIFO queue of unbounded length, such that in each transition the machine reads the symbol at the head of the queue, deletes a constant number of symbols from the head, and appends to the tail a symbol-string that depends solely on the first symbol read in this transition.

Because all of the indicated operations are performed in a single transition, a tag machine strictly has only one state.

Definitions

[edit]

Atag system is a triplet (m,A,P), where

m is a positive integer, called thedeletion number.
A is a finite alphabet of symbols, one of which can be a specialhalting symbol. All finite (possibly empty) strings onA are calledwords.
P is a set ofproduction rules, assigning a wordP(x) (called aproduction) to each symbolx inA. The production (sayP(H)) assigned to the halting symbol is seen below to play no role in computations, but for convenience is taken to beP(H) ='H'.

Ahalting word is a word that either begins with the halting symbol or whose length is less thanm.

A transformationt (called thetag operation) is defined on the set of non-halting words, such that ifx denotes the leftmost symbol of a wordS, thent(S) is the result of deleting the leftmostm symbols ofS and appending the wordP(x) on the right. Thus, the system processes the m-symbol head into a tail of variable length, but the generated tail depends solely on the first symbol of the head.

Acomputation by a tag system is a finite sequence of words produced by iterating the transformationt, starting with an initially given word and halting when a halting word is produced. (By this definition, a computation is not considered to exist unless a halting word is produced in finitely-many iterations. Alternative definitions allow nonhalting computations, for example by using a special subset of the alphabet to identify words that encode output.)

The termm-tag system is often used to emphasise the deletion number. Definitions vary somewhat in the literature (cf. References), the one presented here being that of Rogozhin.^[2]

The use of a halting symbol in the above definition allows the output of a computation to be encoded in the final word alone, whereas otherwise the output would be encoded in the entire sequence of words produced by iterating the tag operation.

A common alternative definition uses no halting symbol and treats all words of length less thanm as halting words. Another definition is the original one used byPost (1943) (described in the historical note below), in which the only halting word is the empty string.

Example: A simple 2-tag illustration

[edit]

This is merely to illustrate a simple 2-tag system that uses a halting symbol.

2-tag system     Alphabet: {a,b,c,H}     Production rules:         a  -->  ccbaH         b  -->  cca         c  -->  ccComputation    Initial word: baa                    acca                      caccbaH                        ccbaHcc                          baHcccc                            Hcccccca (halt).

Example: Computation of Collatz sequences

[edit]

This simple 2-tag system is adapted fromDe Mol (2008). It uses no halting symbol, but halts on any word of length less than 2, and computes a slightly modified version of theCollatz sequence.

In the original Collatz sequence, the successor ofn is either⁠n/2⁠ (for even n) or 3n + 1 (for odd n). The value 3n + 1 is clearly even for odd n, hence the next term after 3n + 1 is surely⁠3n + 1/2⁠. In the sequence computed by the tag system below we skip this intermediate step, hence the successor ofn is⁠3n + 1/2⁠ for odd n.

In this tag system, a positive integern is represented by the word aa...a withn a's.

2-tag system     Alphabet: {a,b,c}     Production rules:         a  -->  bc         b  -->  a         c  -->  aaaComputation    Initial word: aaa <--> n=3                    abc                        cbc                        caaa                          aaaaa  <--> 5                            aaabc                              abcbc                                cbcbc                                  cbcaaa                                    caaaaaa                                      aaaaaaaa  <--> 8                                        aaaaaabc                                          aaaabcbc                                            aabcbcbc                                              bcbcbcbc                                                bcbcbca                                                  bcbcaa                                                    bcaaa                                                      aaaa  <--> 4                                                        aabc                                                          bcbc                                                            bca                                                              aa  <--> 2                                                                bc                                                                  a  <--> 1                                                                   (halt)

Turing-completeness ofm-tag systems

[edit]

For eachm > 1, the set ofm-tag systems isTuring-complete; i.e., for eachm > 1, it is the case that for any givenTuring machineT, there is anm-tag system thatemulatesT. In particular, a 2-tag system can be constructed to emulate aUniversal Turing machine, as was done byWang (1963) and byCocke & Minsky (1964).

Conversely, a Turing machine can be shown to be a Universal Turing Machine by proving that it can emulate a Turing-complete class ofm-tag systems. For example,Rogozhin (1996) proved the universality of the class of 2-tag systems with alphabet {a₁, ...,a_n,H} and corresponding productions {a_na_nW₁, ...,a_na_nW_n-1,a_na_n,H}, where theW_k are nonempty words; he then proved the universality of a very small (4-state, 6-symbol) Turing machine by showing that it can simulate this class of tag systems.

The 2-tag system is an efficient simulator of universal Turing machines, in $O(t^{4}\ln ^{2}t)$ time. That is, if $M {\displaystyle M}$ is a deterministic single-tape Turing machine that runs in time $t {\displaystyle t}$ , then there is a 2-tag system that simulates it in $O(t^{4}\ln ^{2}t)$ time.^[3]

The 2-tag halting problem

[edit]

This version of thehalting problem is among the simplest, most-easily describedundecidable decision problems:

Given an arbitrary positive integern and a list ofn+1 arbitrary wordsP₁,P₂,...,P_n,Q on the alphabet {1,2,...,n}, does repeated application of the tag operationt:ijX →XP_i eventually convertQ into a word of length less than 2? That is, does the sequenceQ,t¹(Q),t²(Q),t³(Q), ... terminate?

Historical note on the definition of tag system

[edit]

The above definition differs from that ofPost (1943), whose tag systems use no halting symbol, but rather halt only on the empty word, with the tag operationt being defined as follows:

Ifx denotes the leftmost symbol of a nonempty wordS, thent(S) is the operation consisting offirst appending the wordP(x) to the right end ofS, andthen deleting the leftmostm symbols of the result —deleting all if there be less thanm symbols.

The above remark concerning the Turing-completeness of the set ofm-tag systems, for anym > 1, applies also to these tag systems as originally defined by Post.

Origin of the name "tag"

[edit]

According to a footnote inPost (1943), B. P. Gill suggested the name for an earlier variant of the problem in which the firstm symbols are left untouched, but rather a check mark indicating the current position moves to the right bym symbols every step. The name for the problem of determining whether or not the check mark ever touches the end of the sequence was then dubbed the "problem of tag", referring to the children'sgame of tag.

Cyclic tag systems

[edit]

A cyclic tag system is a modification of the original tag system. The alphabet consists of only two symbols,0 and1, and the production rules comprise a list of productions considered sequentially, cycling back to the beginning of the list after considering the "last" production on the list. For each production, the leftmost symbol of the word is examined—if the symbol is1, the current production is appended to the right end of the word; if the symbol is0, no characters are appended to the word; in either case, the leftmost symbol is then deleted. The system halts if and when the word becomes empty.^[4]

Example

[edit]

Cyclic Tag System Productions: (010, 000, 1111)Computation Initial Word: 11001    Production         Word    ----------         --------------       010             11001       000              1001010       1111              001010000       010                01010000       000                 1010000       1111                 010000000       010                   10000000        .                      .        .                      .

Cyclic tag systems were created byMatthew Cook and were used in Cook's demonstration that theRule 110 cellular automaton is universal.^[5] A key part of the demonstration was that cyclic tag systems can emulate aTuring-complete class of tag systems.

Emulation of tag systems by cyclic tag systems

[edit]

Anm-tag system with alphabet {a₁, ...,a_n} and corresponding productions {P₁, ...,P_n} is emulated by a cyclic tag system withm*n productions (Q₁, ...,Q_n, -, -, ..., -), where all but the firstn productions are the empty string (denoted by '-'). TheQ_k are encodings of the respectiveP_k, obtained by replacing each symbol of the tag system alphabet by a length-n binary string as follows (these are to be applied also to the initial word of a tag system computation):

a₁ = 100...00a₂ = 010...00...a_n = 000...01

That is,a_k is encoded as a binary string with a1 in the k^th position from the left, and0's elsewhere. Successive lines of a tag system computation will then occur encoded as every (m*n)^th line of its emulation by the cyclic tag system.

Example

[edit]

This is a very small example to illustrate the emulation technique.

2-tag system    Production rules: (a --> bb, b --> abH, H --> H)    Alphabet encoding: a = 100, b = 010, H = 001     Production encodings: (bb = 010 010, abH = 100 010 001, H = 001)Cyclic tag system     Productions: (010 010, 100 010 001, 001, -, -, -)Tag system computation    Initial word: ba                    abH                      Hbb (halt)Cyclic tag system computation    Initial word: 010 100 (=ba)    Production       Word    ----------       -------------------------------  * 010 010          010 100  (=ba)    100 010 001       10 100    001                0 100 100 010 001    -                    100 100 010 001    -                     00 100 010 001    -                      0 100 010 001  * 010 010                  100 010 001  (=abH)    100 010 001               00 010 001 010 010    001                        0 010 001 010 010    -                            010 001 010 010    -                             10 001 010 010    -                              0 001 010 010  * 010 010       emulated halt -->  001 010 010  (=Hbb)    100 010 001                       01 010 010    001                                1 010 010    -                                    010 010 001    ...                                    ...

Every sixth line (marked by '*') produced by the cyclic tag system is the encoding of a corresponding line of the tag system computation, until the emulated halt is reached.

Notes

[edit]

^Post 1943.
^Rogozhin 1996.
^Woods, Damien; Neary, Turlough (2009-02-17)."The complexity of small universal Turing machines: A survey"(PDF).Theoretical Computer Science. Computational Paradigms from Nature.410 (4):443–450.doi:10.1016/j.tcs.2008.09.051.ISSN 0304-3975.S2CID 10257004.
^In a chapter 14 titled "Very Simple Bases for Computability",Minsky (1967) presents a very readable (and exampled) subsection 14.6The Problem of "Tag" and Monogenic Canonical Systems (pp. 267–273) (this sub-section is indexed as "tag system"). Minsky relates his frustrating experiences with the general problem: "Post found this (00, 1101) problem "intractable," and so did I, even with the help of a computer." He comments that an "effective way to decide, for any string S, whether this process will ever repeat when started with S" is unknown although a few specific cases have been proven unsolvable. In particular he mentions Cocke's Theorem and Corollary 1964.
^Cook 2004.